Paper title

Hearings and mishearings: decrypting the spoken word

Authors

Anita Mehta, Jean-Marc Luck

Abstract

We propose a model of the speech perception of individual words in the presence of mishearings. This phenomenological approach is based on concepts used in linguistics, and provides a formalism that is universal across languages. We put forward an efficient two-parameter form for the word length distribution, and introduce a simple representation of mishearings, which we use in our subsequent modelling of word recognition. In a context-free scenario, word recognition often occurs via anticipation when, part-way into a word, we can correctly guess its full form. We give a quantitative estimate of this anticipation threshold when no mishearings occur, in terms of model parameters. As might be expected, the whole anticipation effect disappears when there are sufficiently many mishearings. Our global approach to the problem of speech perception is in the spirit of an optimisation problem. We show for instance that speech perception is easy when the word length is less than a threshold, to be identified with a static transition, and hard otherwise. We extend this to the dynamics of word recognition, proposing an intuitive approach highlighting the distinction between individual, isolated mishearings and clusters of contiguous mishearings. At least in some parameter range, a dynamical transition is manifest well before the static transition is reached, as is the case for many other examples of complex systems.
