论文标题
恩特姆:以实体为中心的摘要数据集
EntSUM: A Data Set for Entity-Centric Summarization
论文作者
论文摘要
可控的摘要旨在提供摘要,以考虑用户指定的方面和偏好,以更好地帮助他们满足其信息需求,而不是标准摘要设置,该设置构建了文档的单个通用摘要。我们介绍了一个人类通知的数据集,以进行可控的摘要,重点是命名实体作为控制的方面。我们进行了广泛的定量分析,以激发以实体为中心的摘要的任务,并表明现有的可控摘要方法无法生成以实体为中心的摘要。我们建议对最新的摘要方法进行扩展,以在我们的数据集中获得更好的结果。我们的分析和结果表明,此任务和提议的数据集的挑战性质。
Controllable summarization aims to provide summaries that take into account user-specified aspects and preferences to better assist them with their information need, as opposed to the standard summarization setup which build a single generic summary of a document. We introduce a human-annotated data set EntSUM for controllable summarization with a focus on named entities as the aspects to control. We conduct an extensive quantitative analysis to motivate the task of entity-centric summarization and show that existing methods for controllable summarization fail to generate entity-centric summaries. We propose extensions to state-of-the-art summarization approaches that achieve substantially better results on our data set. Our analysis and results show the challenging nature of this task and of the proposed data set.