论文标题
基于变压器的乌尔都语手写文本光学字符读取器
Transformer based Urdu Handwritten Text Optical Character Reader
论文作者
论文摘要
提取手写文本是数字化信息的最重要组成部分之一,并使其可用于大规模设置。手写光学角色读取器(OCR)是计算机视觉和自然语言处理计算的研究问题,英语已经做了很多工作,但是不幸的是,对于乌尔都语等低资源的语言,几乎没有完成工作。乌尔都语语言脚本非常困难,因为它具有基于其相对位置的字符形状的草书性质和变化,因此,需要提出一个模型,该模型可以理解复杂的特征并将其推广到各种手写样式。在这项工作中,我们提出了一个基于变压器的乌尔都语手写文本提取模型。由于变形金刚在自然语言理解任务中非常成功,我们将进一步探索它们以了解复杂的乌尔都语手写。
Extracting Handwritten text is one of the most important components of digitizing information and making it available for large scale setting. Handwriting Optical Character Reader (OCR) is a research problem in computer vision and natural language processing computing, and a lot of work has been done for English, but unfortunately, very little work has been done for low resourced languages such as Urdu. Urdu language script is very difficult because of its cursive nature and change of shape of characters based on it's relative position, therefore, a need arises to propose a model which can understand complex features and generalize it for every kind of handwriting style. In this work, we propose a transformer based Urdu Handwritten text extraction model. As transformers have been very successful in Natural Language Understanding task, we explore them further to understand complex Urdu Handwriting.