-
[pdf]
[supp]
[bibtex]@InProceedings{Deo_2025_WACV, author = {Deo, Anurag and Bhat, Savita and Karande, Shirish}, title = {VisualFusion: Enhancing Blog Content with Advanced Infographic Pipeline}, booktitle = {Proceedings of the Winter Conference on Applications of Computer Vision (WACV)}, month = {February}, year = {2025}, pages = {5591-5600} }
VisualFusion: Enhancing Blog Content with Advanced Infographic Pipeline
Abstract
Infographics represent a key component of any blog or article facilitating effective communication of ideas while fostering reader engagement. However many content creators possess limited expertise in crafting visually striking infographics. This gap is effectively addressed by our proposed pipeline designed to aid writers in generating compelling infographics tailored to their written content. Our pipeline uses textual content and tabular data from the blog to generate anchor plots. Leveraging LLM for prompt generation the pipeline integrates the generated prompts with these anchor plots through an Image to Image (I2I) generation model. We observe that the majority of the resulting images generated using this approach align with the article's narrative and effectively represent the underlying tabular data. Additionally we introduce our proposed AADaT (Aesthetical Adherence to Data and Text) Score adept at comprehensively assessing aesthetics textual alignment data fidelity and overall image quality concurrently. In comparative evaluations our pipeline has demonstrated around 15% superior performance relative to state-of-the-art models such as DALLE and Stable Diffusion Large by showcasing much better data adherence and aesthetics. While state-of-the-art models excel in some metrics but falter in others our pipeline demonstrates a balanced performance across all metrics.
Related Material