论文标题

全额图:可配置的印刷字符合成器

OmniPrint: A Configurable Printed Character Synthesizer

论文作者

Sun, Haozhe, Tu, Wei-Wei, Guyon, Isabelle

论文摘要

我们介绍了Omniprint,这是一个孤立印刷字符的合成数据生成器,旨在用于机器学习研究。它从MNIST,SVHN和Omniglot等著名数据集中汲取了灵感,但具有从各种语言,字体和样式以及具有自定义扭曲的各种语言,字体和样式的各种印刷字符的能力。我们包括来自27个脚本的935个字体和许多类型的失真。作为概念证明,我们显示了各种用例,包括为即将到来的元数据神经2021竞赛设计的元学习数据集的示例。 OmniPrint可在https://github.com/sunhaozhe/omniprint上找到。

We introduce OmniPrint, a synthetic data generator of isolated printed characters, geared toward machine learning research. It draws inspiration from famous datasets such as MNIST, SVHN and Omniglot, but offers the capability of generating a wide variety of printed characters from various languages, fonts and styles, with customized distortions. We include 935 fonts from 27 scripts and many types of distortions. As a proof of concept, we show various use cases, including an example of meta-learning dataset designed for the upcoming MetaDL NeurIPS 2021 competition. OmniPrint is available at https://github.com/SunHaozhe/OmniPrint.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源