论文标题

双语同步:通过编辑操作恢复翻译关系

Bilingual Synchronization: Restoring Translational Relationships with Editing Operations

论文作者

Xu, Jitao, Crego, Josep, Yvon, François

论文摘要

机器翻译(MT)通常被视为一个单发过程,该过程从头开始生成类似于某些源文本的目标语言。我们在这里考虑一个更通用的设置,该设置假定初始目标序列,必须将其转换为源的有效翻译,从而恢复源和目标之间的并行性。对于这项双语同步任务,我们考虑了几种体系结构(自回归和非自动回忆性)和培训制度,并尝试使用多种实际设置,例如模拟的交互式MT,通过翻译记忆(TM)和TM清洁进行翻译。我们的结果表明,一旦进行了微调,可以将一个基于通用的编辑系统进行微调,可以与专门针对这些任务进行专门培训的专用系统进行比较。

Machine Translation (MT) is usually viewed as a one-shot process that generates the target language equivalent of some source text from scratch. We consider here a more general setting which assumes an initial target sequence, that must be transformed into a valid translation of the source, thereby restoring parallelism between source and target. For this bilingual synchronization task, we consider several architectures (both autoregressive and non-autoregressive) and training regimes, and experiment with multiple practical settings such as simulated interactive MT, translating with Translation Memory (TM) and TM cleaning. Our results suggest that one single generic edit-based system, once fine-tuned, can compare with, or even outperform, dedicated systems specifically trained for these tasks.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源