Parsing Line Chart Images Using Linear Programming

Hajime Kato, Mitsuru Nakazawa, Hsuan-Kung Yang, Mark Chen, Björn Stenger; Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2022, pp. 2109-2118

Abstract


This paper proposes a method for automatically recovering data from chart images. In particular we focus on the task of estimating line charts, as the most common chart type, in a fully automatic way that handles line occlusions, as well as lines of different styles, e.g., dashed or dotted. For this, we first train a single semantic segmentation network to predict probability maps for each different line styles. We then construct a graph based on this output and formulate the line tracing task as a minimum-cost-flow problem, optimizing a cost function using linear programming. From the traced lines, the axes, and text labels, we recover the numerical values used to generate the chart. In experiments on six datasets, containing both synthesized and crawled images, we show significant improvements over prior work.

Related Material


[pdf] [supp]
[bibtex]
@InProceedings{Kato_2022_WACV, author = {Kato, Hajime and Nakazawa, Mitsuru and Yang, Hsuan-Kung and Chen, Mark and Stenger, Bj\"orn}, title = {Parsing Line Chart Images Using Linear Programming}, booktitle = {Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)}, month = {January}, year = {2022}, pages = {2109-2118} }