Episode 494

LayerTracer: Cognitive-Aligned Layered SVG Synthesis via Diffusion Transformer

February 7, 202525m 33s

Audio is streamed directly from the publisher (media.transistor.fm) as published in their RSS feed. Play Podcasts does not host this file. Rights-holders can request removal through the copyright & takedown page.

Original episode page

Show Notes

🤗 Upvotes: 7 | cs.CV

Authors:
Yiren Song, Danze Chen, Mike Zheng Shou

Title:
LayerTracer: Cognitive-Aligned Layered SVG Synthesis via Diffusion Transformer

Arxiv:
http://arxiv.org/abs/2502.01105v1

Abstract:
Generating cognitive-aligned layered SVGs remains challenging due to existing methods' tendencies toward either oversimplified single-layer outputs or optimization-induced shape redundancies. We propose LayerTracer, a diffusion transformer based framework that bridges this gap by learning designers' layered SVG creation processes from a novel dataset of sequential design operations. Our approach operates in two phases: First, a text-conditioned DiT generates multi-phase rasterized construction blueprints that simulate human design workflows. Second, layer-wise vectorization with path deduplication produces clean, editable SVGs. For image vectorization, we introduce a conditional diffusion mechanism that encodes reference images into latent tokens, guiding hierarchical reconstruction while preserving structural integrity. Extensive experiments demonstrate LayerTracer's superior performance against optimization-based and neural baselines in both generation quality and editability, effectively aligning AI-generated vectors with professional design cognition.

← All episodes of Daily Paper Cast