论文标题

语料库NonAnignéset adt。 Essai de比较entre les lesprésidentsfrançaisetbrésiliensdel'ère当代

Corpus non alignés et ADT. Essai de comparaison entre les présidents français et brésiliens de l'ère contemporaine

论文作者

Maciel, Carlos, Mayaffre, Damon, Vanni, Laurent

论文摘要

是否有可以处理非对准双语语料库的ADT方法?文本类型是否对话语产生了足够的强大限制,如果它们的类型相同,则可以使文本以不同的语言编写?为了回答这两个问题,一个方法论,另一种语言学,这一贡献在一个法语和巴西总统演讲中得到了当代时代(1950-2020)的总统演讲,从de Gaulle到Macron,从库比斯切克到卢拉,即1500万个单词。从简单的频率词典到单词的辅助概况的阶乘处理,提出了一种方法论路径,以建立通用的跨国总统言论。

Is there an ADT method that can deal with non-aligned bilingual corpora? Does the textual genre exert a sufficiently strong constraint on the discourse that would make texts written in different languages comparable, provided they are of identical genre? To answer these two questions, one methodological, the other linguistic, this contribution gathers in a single corpus French and Brazilian presidential speeches of the contemporary era (1950-2020), from de Gaulle to Macron, from Kubitschek to Lula, i.e. 15 million words. A methodological path is proposed from the simple frequency dictionary to the factorial treatment of the cooccurrencial profiles of words, in order to establish a generic transnational presidential speech.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源