论文标题
通过级联的文本中风检测和擦除删除场景文本
Scene text removal via cascaded text stroke detection and erasing
论文作者
论文摘要
最近基于学习的方法显示了场景文本删除任务的有希望的性能。但是,这些方法通常会留下一些文本残留物并获得视觉上令人不愉快的结果。在这项工作中,我们根据准确的文本中风检测提出了一个新颖的“端到端”框架。具体而言,我们将文本删除问题解散为文本中风检测和中风去除。我们设计一个文本中风检测网络和一个文本删除生成网络,以分别解决这两个子问题。然后,我们将这两个网络组合为处理单元,并级联本机以获取最终模型以进行文本删除。实验结果表明,所提出的方法显着胜过定位和擦除场景文本的最新方法。由于当前可公开可用的数据集都是合成的,无法正确测量不同方法的性能,因此我们构建了一个新的现实世界数据集,该数据集将发布以促进相关研究。
Recent learning-based approaches show promising performance improvement for scene text removal task. However, these methods usually leave some remnants of text and obtain visually unpleasant results. In this work, we propose a novel "end-to-end" framework based on accurate text stroke detection. Specifically, we decouple the text removal problem into text stroke detection and stroke removal. We design a text stroke detection network and a text removal generation network to solve these two sub-problems separately. Then, we combine these two networks as a processing unit, and cascade this unit to obtain the final model for text removal. Experimental results demonstrate that the proposed method significantly outperforms the state-of-the-art approaches for locating and erasing scene text. Since current publicly available datasets are all synthetic and cannot properly measure the performance of different methods, we therefore construct a new real-world dataset, which will be released to facilitate the relevant research.