Improved Iterative Refinement for Chart-to-Code Generation via Structured Instruction
Chengzhi Xu, Yuyang Wang, Lai Wei, Lichao Sun, Weiran Huang
2025-06-20
Summary
This paper talks about ChartIR, a method that helps large multimodal language models (MLLMs) generate better computer code from charts by breaking down the process and improving the results step by step.
What's the problem?
The problem is that existing models often struggle to accurately understand complex visual charts and then translate that understanding into correct code, because these tasks are difficult to do all at once.
What's the solution?
The researchers developed ChartIR, which separates the task into two parts: first, the model interprets the chart visually, and then it converts that interpretation into code. They also use iterative refinement, meaning the model improves its output in several steps to get the code right.
Why it matters?
This matters because it helps make AI better at turning visual data into useful computer programs, which can save time and reduce errors in industries that rely on generating code from charts or graphs.
Abstract
ChartIR uses structured instruction and iterative refinement to improve MLLM performance in chart-to-code generation by separating visual understanding and code translation tasks.