使用变压器调制融合，用于语言声学情绪识别

论文标题

使用变压器调制融合，用于语言声学情绪识别

Modulated Fusion using Transformer for Linguistic-Acoustic Emotion Recognition

论文作者

Delbrouck, Jean-Benoit, Tits, Noé, Dupont, Stéphane

论文摘要

本文旨在为情感识别和情感分析的任务带来一种新的轻巧而强大的解决方案。我们的动机是提出两个基于变压器和调制的架构，这些体系结构结合了从广泛的数据集中的语言和声学输入，以挑战，有时甚至超过该领域的最先进。为了证明模型的效率，我们仔细评估了它们在IEMOCAP，MOSI，MOSEI和MELD数据集上的性能。可以直接复制实验，并且代码完全开放，以供将来的研究。

This paper aims to bring a new lightweight yet powerful solution for the task of Emotion Recognition and Sentiment Analysis. Our motivation is to propose two architectures based on Transformers and modulation that combine the linguistic and acoustic inputs from a wide range of datasets to challenge, and sometimes surpass, the state-of-the-art in the field. To demonstrate the efficiency of our models, we carefully evaluate their performances on the IEMOCAP, MOSI, MOSEI and MELD dataset. The experiments can be directly replicated and the code is fully open for future researches.

下载PDF全文

下载文献需遵守相关版权规定

论文标题