论文标题
自动文本摘要方法:全面评论
Automatic Text Summarization Methods: A Comprehensive Review
论文作者
论文摘要
由于互联网的快速增长而引起的最紧迫的问题之一就是信息超载。以摘要的形式简化相关信息将有助于许多人,因为关于任何主题的材料在互联网上都很丰富。对于人类而言,手动总结大量文本非常具有挑战性。因此,它增加了对更复杂和强大的摘要的需求。自1950年代以来,研究人员一直在尝试改善创建摘要的方法,以使机器生成的摘要与人类创建的摘要相匹配。这项研究提供了文本摘要概念的详细最新分析,例如摘要方法,所使用的技术,标准数据集,评估指标和未来研究范围。最常见的方法是提取和抽象的,在这项工作中详细研究了。评估摘要并增加可重复使用的资源和基础设施的开发有助于比较和复制发现,从而增加竞争以改善结果。本研究还讨论了生成摘要的不同评估方法。最后,在这项研究结束时,提到了与文本摘要研究有关的一些挑战和研究机会,这些挑战和研究机会对于在该领域的潜在研究人员可能很有用。
One of the most pressing issues that have arisen due to the rapid growth of the Internet is known as information overloading. Simplifying the relevant information in the form of a summary will assist many people because the material on any topic is plentiful on the Internet. Manually summarising massive amounts of text is quite challenging for humans. So, it has increased the need for more complex and powerful summarizers. Researchers have been trying to improve approaches for creating summaries since the 1950s, such that the machine-generated summary matches the human-created summary. This study provides a detailed state-of-the-art analysis of text summarization concepts such as summarization approaches, techniques used, standard datasets, evaluation metrics and future scopes for research. The most commonly accepted approaches are extractive and abstractive, studied in detail in this work. Evaluating the summary and increasing the development of reusable resources and infrastructure aids in comparing and replicating findings, adding competition to improve the outcomes. Different evaluation methods of generated summaries are also discussed in this study. Finally, at the end of this study, several challenges and research opportunities related to text summarization research are mentioned that may be useful for potential researchers working in this area.