OPENCQA：用图表回答的开放式问题

论文标题

OPENCQA：用图表回答的开放式问题

OpenCQA: Open-ended Question Answering with Charts

论文作者

Kantharaj, Shankar, Do, Xuan Long, Leong, Rixie Tiffany Ko, Tan, Jia Qing, Hoque, Enamul, Joty, Shafiq

论文摘要

图表非常受欢迎，可以分析数据并传达重要的见解。人们经常分析可视化以回答需要解释性答案的开放式问题。回答此类问题通常很困难且耗时，因为它需要大量的认知和感知努力。为了应对这一挑战，我们介绍了一个名为OpenCQA的新任务，目标是回答有关具有描述性文本的图表的开放式问题。我们介绍了注释过程和对数据集的深入分析。我们在三个实际设置下实施并评估一组基线。在第一个设置中，提供了图表和随附的文章作为模型的输入。第二个设置仅向图表而不是整个文章提供相关段落，而第三个设置要求模型仅基于图表生成答案。我们对结果的分析表明，最佳性能模型通常会产生流利而连贯的文本，而它们则在难以执行复杂的逻辑和算术推理。

Charts are very popular to analyze data and convey important insights. People often analyze visualizations to answer open-ended questions that require explanatory answers. Answering such questions are often difficult and time-consuming as it requires a lot of cognitive and perceptual efforts. To address this challenge, we introduce a new task called OpenCQA, where the goal is to answer an open-ended question about a chart with descriptive texts. We present the annotation process and an in-depth analysis of our dataset. We implement and evaluate a set of baselines under three practical settings. In the first setting, a chart and the accompanying article is provided as input to the model. The second setting provides only the relevant paragraph(s) to the chart instead of the entire article, whereas the third setting requires the model to generate an answer solely based on the chart. Our analysis of the results show that the top performing models generally produce fluent and coherent text while they struggle to perform complex logical and arithmetic reasoning.

下载PDF全文

下载文献需遵守相关版权规定

论文标题