Paper ID: 2207.07243

LineCap: Line Charts for Data Visualization Captioning Models

Anita Mahinpei, Zona Kostic, Chris Tanner

Data visualization captions help readers understand the purpose of a visualization and are crucial for individuals with visual impairments. The prevalence of poor figure captions and the successful application of deep learning approaches to image captioning motivate the use of similar techniques for automated figure captioning. However, research in this field has been held back by the lack of suitable datasets. We introduce LineCap, a novel figure captioning dataset of 3,528 figures, and provide insights from curating this dataset and from using end-to-end deep learning models for automated figure captioning.

Submitted: Jul 15, 2022