Paper Title

EntSUM: A Data Set for Entity-Centric Summarization

Authors

Mounica Maddela, Mayank Kulkarni, Daniel Preotiuc-Pietro

Abstract

Controllable summarization aims to provide summaries that take into account user-specified aspects and preferences to better assist users with their information needs, as opposed to the standard summarization setup, which builds a single generic summary of a document. We introduce a human-annotated data set, EntSUM, for controllable summarization with a focus on named entities as the aspects to control. We conduct an extensive quantitative analysis to motivate the task of entity-centric summarization and show that existing methods for controllable summarization fail to generate entity-centric summaries. We propose extensions to state-of-the-art summarization approaches that achieve substantially better results on our data set. Our analysis and results show the challenging nature of this task and of the proposed data set.
