论文标题

通过在线大写笔迹的性别分类:一种依赖文本的合格方法

Gender classification by means of online uppercase handwriting: A text-dependent allographic approach

论文作者

Sesa-Nogueras, Enric, Faundez-Zanuy, Marcos, Roure-Alcobé, Josep

论文摘要

本文基于在线手写提供了性别分类架构。使用用数字平板电脑获取的样品来捕获写作的动力,它将作者分类为雄性或女性。提出的方法是Allographic,关于笔触是笔迹的结构单位。在写作设备没有在写作表面上施加任何压力时,也要考虑到笔触(空中)中风。该方法还与文本有关,这意味着训练和测试是通过完全相同的文本进行的。文本依赖性允许使用非常少量的文本进行分类。通过来自生物保健数据库的样品进行的实验,产生的结果落在人类法官期望的分类范围内。只有四个单个大写字重复,分类良好的作家的平均率为68%。有16个单词,汇率平均上升到72.6%。统计分析表明,上述速率非常重要。为了探索笔触的分类潜力,也可以考虑这些。尽管在这种情况下的结果不是结论性的,但是当将笔触的信息与笔向下的信息相结合时,获得了74%的分类作家的74%。

This paper presents a gender classification schema based on online handwriting. Using samples acquired with a digital tablet that captures the dynamics of the writing, it classifies the writer as a male or a female. The method proposed is allographic, regarding strokes as the structural units of handwriting. Strokes performed while the writing device is not exerting any pressure on the writing surface, pen-up (in-air) strokes, are also taken into account. The method is also text-dependent meaning that training and testing is done with exactly the same text. Text-dependency allows classification be performed with very small amounts of text. Experimentation, performed with samples from the BiosecurID database, yields results that fall in the range of the classification averages expected from human judges. With only four repetitions of a single uppercase word, the average rate of well classified writers is 68%; with sixteen words, the rate rises to an average 72.6%. Statistical analysis reveals that the aforementioned rates are highly significant. In order to explore the classification potential of the pen-up strokes, these are also considered. Although in this case results are not conclusive, an outstanding average of 74% of well classified writers is obtained when information from pen-up strokes is combined with information from pen-down ones.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源