SVGen: Interpretable Vector Graphics Generation with Large Language Models

🎉 Accepted by ACM MM 2025

1. Introduction

SVGen is an end-to-end model that generates high-quality SVG code from text. We fine-tuned a Large Language Model on our custom SVG-1M dataset using curriculum learning, Chain-of-Thought (CoT), and reinforcement learning.

2. Dependencies

This repo is built upon LLaMA-Factory. Sincere thanks to their excellent work!

2.1 Clone the Repository

git clone https://github.com/gitcat-404/SVGen.git
cd SVGen

2.2 Create Conda Environment

conda create -n svgen python=3.10 -y
conda activate svgen

2.3 Dependencies for cairosvg

sudo apt update
sudo apt install libcairo2 libcairo2-dev

2.4 Python Dependencies

pip install torch==2.5.1+cu124 torchvision==0.20.0+cu124 --index-url https://download.pytorch.org/whl/cu124
pip install -r requirements.txt
cd LLaMA-Factory && pip install -e ".[torch,metrics]"

3. How to use

3.1 Download Model Weights

For this demonstration, we will be using our top-performing model, SVGen-Qwen2.5-Coder-7B-Instruct. Please download the model weights from Hugging Face and store them under the Models/ path.

pip install huggingface_hub
hf download gitcat-404/SVGen-Qwen2.5-Coder-7B-Instruct --local-dir Models/SVGen-Qwen2.5-Coder-7B-Instruct

3.2 Interactive demo

python app.py

3.3 Inference

Taking the test data for this task as an example, we will write the prompts that need to be inferred into a CSV file, such as the example provided in test/color_test.csv.

python inference.py \
    --model_path Models/SVGen-Qwen2.5-Coder-7B-Instruct \
    --csv_file_path test/color_test.csv \
    --prompt_type qwen \
    --output_folder "results/qwen_outputs"

4. Test

Our evaluation metrics in this article are: Fréchet Inception Distance (FID), CLIPScore-T2I, CLIPScore-I2I, Preference Scores (HPS), and Aesthetic Score. The specific implementation is available in test/ To test the model, first download sac+logos+ava1-l14-linearMSE.pth from Huggingface and hpc.pt from GitHub, placing them in the test/pretrain_weight/ folder. Next, modify test/test.py by filling in the folder containing the previously generated images, then run:"

python test/test.py

5.Train

All experiments are executed on 8×NVIDIA A800 GPUs using the AdamW optimizer (learning rate 4e-5), with a maximum sequence length of 8,000 tokens.In order to reproduce our experiments, please first download all data from 🤗SVG-1M-Json and place it in the json_data/ folder. We have provided the training scripts in the config/ folder. After modifying the configurations, you can execute them sequentially:

sh config/train_stage1.sh
sh config/train_stage2.sh
sh config/train_stage3.sh
sh config/train_stage_RL.sh

6.Get SVG metadata from website

This dataset was collected by web scraping public content from IconFont and is intended for non-commercial academic research and technical exchange purposes only.

python Spider/CrawlingSVG_Icon.py

7.Generated samples

💕 Acknowledgments:

We would like to extend our sincerest thanks to the projects and websites that inspired this work, specifically: IconFont LLaMA-Factory LLM4SVG OmniSVG Star-vector

Citation

@article{wang2025svgen,
  title={SVGen: Interpretable Vector Graphics Generation with Large Language Models},
  author={Wang, Feiyu and Zhao, Zhiyuan and Liu, Yuandong and Zhang, Da and Gao, Junyu and Sun, Hao and Li, Xuelong},
  journal={arXiv preprint arXiv:2508.09168},
  year={2025}
}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

SVGen: Interpretable Vector Graphics Generation with Large Language Models

🎉 Accepted by ACM MM 2025

1. Introduction

2. Dependencies

2.1 Clone the Repository

2.2 Create Conda Environment

2.3 Dependencies for cairosvg

2.4 Python Dependencies

3. How to use

3.1 Download Model Weights

3.2 Interactive demo

3.3 Inference

4. Test

5.Train

6.Get SVG metadata from website

7.Generated samples

💕 Acknowledgments:

Citation

About

Uh oh!

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
Data_preprocess		Data_preprocess
LLaMA-Factory		LLaMA-Factory
Label_inference		Label_inference
RL		RL
Spider		Spider
To_json		To_json
config		config
image		image
json_data		json_data
test		test
README.md		README.md
app.py		app.py
inference.py		inference.py
requirements.txt		requirements.txt

gitcat-404/SVGen

Folders and files

Latest commit

History

Repository files navigation

SVGen: Interpretable Vector Graphics Generation with Large Language Models

🎉 Accepted by ACM MM 2025

1. Introduction

2. Dependencies

2.1 Clone the Repository

2.2 Create Conda Environment

2.3 Dependencies for cairosvg

2.4 Python Dependencies

3. How to use

3.1 Download Model Weights

3.2 Interactive demo

3.3 Inference

4. Test

5.Train

6.Get SVG metadata from website

7.Generated samples

💕 Acknowledgments:

Citation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages