VecFusion: Vector Font Generation with Diffusion

We present VecFusion, a new neural architecture that can generate vector fonts with varying topological structures and precise control point positions. Our approach is a cascaded diffusion model which consists of a raster diffusion model followed by a vector diffusion model. The raster model generates low-resolution, rasterized fonts with auxiliary control point information, capturing the global style and shape of the font, while the vector model synthesizes vector fonts conditioned on the low-resolution raster fonts from the first stage. To synthesize long and complex curves, our vector diffusion model uses a transformer architecture and a novel vector representation that enables the modeling of diverse vector geometry and the precise prediction of control points. Our experiments show that, in contrast to previous generative models for vector graphics, our new cascaded vector diffusion model generates higher quality vector fonts, with complex structures and diverse styles.

Learn More

Publications

VecFusion: Vector Font Generation with Diffusion

CVPR 2024

Publication date: June 19, 2024

Vikas Thamizharasan, Difan Liu, Shantanu Agarwal, Matt Fisher, Michaël Gharbi, Oliver Wang, Alec Jacobson, Evangelos Kalogerakis

CVPR Highlight

Research Areas: AI & Machine Learning Computer Vision, Imaging & Video Graphics (2D & 3D)