MM-ReCoder: Advancing Chart-to-Code Generation with Reinforcement Learning and Self-Correction

Tang, Zitian; Zhang, Xu; Yuan, Jianbo; Zou, Yang; Gunjal, Varad; Jiang, Songyao; Modolo, Davide

Zitian Tang, Xu Zhang, Jianbo Yuan, Yang Zou, Varad Gunjal, Songyao Jiang, Davide Modolo; Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2026, pp. 22164-22173

Abstract

Multimodal Large Language Models (MLLMs) have recently demonstrated promising capabilities in multimodal coding tasks such as chart-to-code generation. However, existing methods primarily rely on supervised fine-tuning (SFT), which requires the model to learn code patterns through chart-code pairs but does not expose the model to a code execution environment. Moreover, while self-correction through execution feedback offers a potential route to improve coding quality, even state-of-the-art MLLMs have been shown to struggle with effective self-correction. In this work, we introduce MM-ReCoder, a chart-to-code generation model trained with reinforcement learning (RL) and equipped with self-correction ability. We propose a two-stage multi-turn self-correction RL strategy based on Group Relative Policy Optimization (GRPO). The first stage enhances the model's self-correction ability via rolling out a shared first turn, while the second stage improves the coding capability with full-trajectory optimization. MM-ReCoder learns to produce more accurate and executable code through the interaction with the environment and by iteratively correcting its own outputs. Our results on three chart-to-code benchmarks demonstrate the state-of-the-art performance of MM-ReCoder.

Related Material

[pdf] [supp] [arXiv]

[bibtex]

@InProceedings{Tang_2026_CVPR, author = {Tang, Zitian and Zhang, Xu and Yuan, Jianbo and Zou, Yang and Gunjal, Varad and Jiang, Songyao and Modolo, Davide}, title = {MM-ReCoder: Advancing Chart-to-Code Generation with Reinforcement Learning and Self-Correction}, booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)}, month = {June}, year = {2026}, pages = {22164-22173} }