论文标题
词汇模式的序数分析
Ordinal analysis of lexical patterns
论文作者
论文摘要
单词是通过含义连接思想和事物的基本语言单元。但是,单词在文本序列中并未独立出现。句法规则的存在引起相邻单词之间的相关性。使用序数模式方法,我们对11种主要语言的词汇统计连接进行了分析。我们发现,语言用来表达单词关系的各种举止产生了独特的模式结构分布。此外,给定语言的这些模式分布的波动可以使我们能够确定撰写文本及其作者的历史时期。综上所述,我们的结果强调了序数时间序列分析在语言类型学,历史语言学和风格测定法中的相关性。
Words are fundamental linguistic units that connect thoughts and things through meaning. However, words do not appear independently in a text sequence. The existence of syntactic rules induces correlations among neighboring words. Using an ordinal pattern approach, we present an analysis of lexical statistical connections for 11 major languages. We find that the diverse manners that languages utilize to express word relations give rise to unique pattern structural distributions. Furthermore, fluctuations of these pattern distributions for a given language can allow us to determine both the historical period when the text was written and its author. Taken together, our results emphasize the relevance of ordinal time series analysis in linguistic typology, historical linguistics and stylometry.