Paper Title
AutoTemplate: A Simple Recipe for Lexically Constrained Text Generation
Authors
Abstract
Lexically constrained text generation is a constrained text generation task that aims to generate text covering all of the given constraint lexicons. Existing approaches tackle this problem with either a lexically constrained beam search algorithm or a dedicated model that uses non-autoregressive decoding, but they face a trade-off between the quality of the generated text and hard constraint satisfaction. We introduce AutoTemplate, a simple yet effective lexically constrained text generation framework that divides the problem into two tasks: template generation and lexicalization. Template generation produces text containing placeholders, and lexicalization replaces those placeholders with the constraint lexicons to perform lexically constrained text generation. We conducted experiments on two tasks: keywords-to-sentence generation and entity-guided summarization. Experimental results show that AutoTemplate outperforms competitive baselines on both tasks while satisfying the hard lexical constraints. The code is available at https://github.com/megagonlabs/autotemplate
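
The lexicalization step described in the abstract can be illustrated with a minimal sketch. It is not taken from the AutoTemplate repository: the numbered placeholder format (`<0>`, `<1>`, ...), the function names `lexicalize` and `satisfies_constraints`, and the example template are all hypothetical, and the template is written by hand here, whereas in the framework it would come from a trained generation model.

```python
import re


def lexicalize(template: str, constraints: list[str]) -> str:
    """Replace each placeholder <i> with the i-th constraint word.

    The <i> placeholder syntax is an assumption made for this sketch.
    """
    def fill(match: re.Match) -> str:
        index = int(match.group(1))
        return constraints[index]

    return re.sub(r"<(\d+)>", fill, template)


def satisfies_constraints(text: str, constraints: list[str]) -> bool:
    """Check the hard constraint: every constraint word appears in the text."""
    return all(word in text for word in constraints)


# Hand-written stand-in for the output of the template generation step.
template = "The <0> chased the <1> across the <2>."
constraints = ["dog", "cat", "garden"]

sentence = lexicalize(template, constraints)
print(sentence)  # The dog chased the cat across the garden.
print(satisfies_constraints(sentence, constraints))  # True
```

In this sketch, constraint satisfaction follows from substitution alone: if the template contains one placeholder per constraint, every constraint word ends up in the output without any constrained search at decoding time.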