论文标题
Streokegan:通过中风编码的中国字体生成中的中国字体崩溃
StrokeGAN: Reducing Mode Collapse in Chinese Font Generation via Stroke Encoding
论文作者
论文摘要
中国字体的一代是许多应用中涉及的重要问题。现有的大多数生成方法基于深层生成模型,尤其是基于生成的对抗网络(GAN)模型。但是,这些深层生成模型可能会遇到模式崩溃问题,从而大大降低了生成结果的多样性和质量。在本文中,我们引入了一个单位中风编码,以捕获汉字的关键模式信息,然后将其纳入Cyclegan,这是Cyclegan,这是一种流行的中国字体生成的深层生成模型。结果,我们提出了一种称为streokegan的有效方法,主要是出于观察到中风编码包含汉字的模式信息的观察。为了重建相关生成的字符的一位中风编码,我们引入了对歧视者施加的中风编码重建损失。配备了这种单位中风编码和中风编码的重建损失,Cycledan的模式崩溃问题可以大大减轻,并改善了中风和生成字符多样性的保存和多样性。在具有不同字体的九个数据集上,一系列一代任务证明了Strokegan的有效性。数值结果表明,Strokegan通常在内容和识别精度以及某些笔触误差方面胜过最先进的方法,并且还会产生更真实的字符。
The generation of stylish Chinese fonts is an important problem involved in many applications. Most of existing generation methods are based on the deep generative models, particularly, the generative adversarial networks (GAN) based models. However, these deep generative models may suffer from the mode collapse issue, which significantly degrades the diversity and quality of generated results. In this paper, we introduce a one-bit stroke encoding to capture the key mode information of Chinese characters and then incorporate it into CycleGAN, a popular deep generative model for Chinese font generation. As a result we propose an efficient method called StrokeGAN, mainly motivated by the observation that the stroke encoding contains amount of mode information of Chinese characters. In order to reconstruct the one-bit stroke encoding of the associated generated characters, we introduce a stroke-encoding reconstruction loss imposed on the discriminator. Equipped with such one-bit stroke encoding and stroke-encoding reconstruction loss, the mode collapse issue of CycleGAN can be significantly alleviated, with an improved preservation of strokes and diversity of generated characters. The effectiveness of StrokeGAN is demonstrated by a series of generation tasks over nine datasets with different fonts. The numerical results demonstrate that StrokeGAN generally outperforms the state-of-the-art methods in terms of content and recognition accuracies, as well as certain stroke error, and also generates more realistic characters.