FonTS: Text Rendering With Typography and Style Controls

Shi, Wenda; Song, Yiren; Zhang, Dengming; Liu, Jiaming; Zou, Xingxing

Wenda Shi, Yiren Song, Dengming Zhang, Jiaming Liu, Xingxing Zou; Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2025, pp. 18463-18474

Abstract

Visual text rendering are widespread in various real-world applications, requiring careful font selection and typographic choices. Recent progress in diffusion transformer (DiT)-based text-to-image (T2I) models show promise in automating these processes. However, these methods still encounter challenges like inconsistent fonts, style variation, and limited fine-grained control, particularly at the word-level. This paper proposes a two-stage DiT-based pipeline to address these problems by enhancing controllability over typography and style in text rendering. We introduce typography control fine-tuning (TC-FT), an parameter-efficient fine-tuning method (on 5% key parameters) with enclosing typography control tokens (ETC-tokens), which enables precise word-level application of typographic features. To further address style inconsistency in text rendering, we propose a text-agnostic style control adapter (SCA) that prevents content leakage while enhancing style consistency. To implement TC-FT and SCA effectively, we incorporated HTML-render into the data synthesis pipeline and proposed the first word-level controllable dataset. Through comprehensive experiments, we demonstrate the effectiveness of our approach in achieving superior word-level typographic control, font consistency, and style consistency in text rendering tasks. Our project page is available at this site.

Related Material

[pdf] [supp] [arXiv]

[bibtex]

@InProceedings{Shi_2025_ICCV, author = {Shi, Wenda and Song, Yiren and Zhang, Dengming and Liu, Jiaming and Zou, Xingxing}, title = {FonTS: Text Rendering With Typography and Style Controls}, booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)}, month = {October}, year = {2025}, pages = {18463-18474} }