< Explain other AI papers

OmniSVG: A Unified Scalable Vector Graphics Generation Model

Yiying Yang, Wei Cheng, Sijin Chen, Xianfang Zeng, Jiaxu Zhang, Liao Wang, Gang Yu, Xingjun Ma, Yu-Gang Jiang

2025-04-09

OmniSVG: A Unified Scalable Vector Graphics Generation Model

Summary

This paper talks about OmniSVG, a smart AI tool that creates high-quality vector images (like infinitely zoomable graphics) from text or pictures, making designs easier to edit and scale without losing quality.

What's the problem?

Current AI tools either make messy vector files that are hard to use or only create super simple black-and-white icons, limiting what designers can do.

What's the solution?

OmniSVG uses a pre-trained vision-language AI to turn drawing instructions into clear building blocks, trains on a huge dataset of 2 million examples, and creates detailed colorful graphics from text, images, or character references.

Why it matters?

This helps designers quickly create professional logos, illustrations, and icons that stay sharp at any size, saving time and boosting creativity in apps and websites.

Abstract

Scalable Vector Graphics (SVG) is an important image format widely adopted in graphic design because of their resolution independence and editability. The study of generating high-quality SVG has continuously drawn attention from both designers and researchers in the AIGC community. However, existing methods either produces unstructured outputs with huge computational cost or is limited to generating monochrome icons of over-simplified structures. To produce high-quality and complex SVG, we propose OmniSVG, a unified framework that leverages pre-trained Vision-Language Models (VLMs) for end-to-end multimodal SVG generation. By parameterizing SVG commands and coordinates into discrete tokens, OmniSVG decouples structural logic from low-level geometry for efficient training while maintaining the expressiveness of complex SVG structure. To further advance the development of SVG synthesis, we introduce MMSVG-2M, a multimodal dataset with two million richly annotated SVG assets, along with a standardized evaluation protocol for conditional SVG generation tasks. Extensive experiments show that OmniSVG outperforms existing methods and demonstrates its potential for integration into professional SVG design workflows.