通过观察者的生理信号识别情绪

论文标题

通过观察者的生理信号识别情绪

Emotion Recognition Through Observer's Physiological Signals

论文作者

Liu, Yang, Gedeon, Tom, Caldwell, Sabrina, Lin, Shouxu, Jin, Zi

论文摘要

基于生理信号的情绪识别是一个热门话题，并且具有广泛的应用，例如安全驾驶，医疗保健和创造安全的社会。本文介绍了一个生理数据集PAFEW，该数据集是使用野生（AFEW）数据集中的动作面部表情的电影剪辑获得的。为了建立基线，我们在此数据集中使用电肌活动（EDA）信号，并从与每个电影剪辑相对应的每个信号系列中提取6个功能，以识别7种情感，即愤怒，厌恶，恐惧，快乐，惊喜，悲伤，悲伤和中立。总体而言，有24位观察员参加了我们的培训集，其中包括19名观察员，他们只参加了一个会话，观看了7个班级的80个视频，还有5位观察员参加了多次参与并观看所有视频。所有视频均以平衡的顺序呈现。该分类任务采用了一击。我们在此初始培训设置上报告了基线（三层网络）的分类精度，同时培训所有参与者，只有一个参与者，只有多个参与者。我们还研究了通过唤醒或价将数据集分组的识别准确性，该数据集分别达到68.66％和72.72％。最后，我们提供了一个两步网络。第一步是通过网络将功能分类为高/低唤醒或正/负价。然后将第一步的唤醒/价中间输出与特征集相连，以此作为情感识别的第二步的输入。我们发现，添加唤醒或价信息可以帮助提高分类准确性。此外，正/负价的信息在该数据集上提高了分类准确性的较高程度。

Emotion recognition based on physiological signals is a hot topic and has a wide range of applications, like safe driving, health care and creating a secure society. This paper introduces a physiological dataset PAFEW, which is obtained using movie clips from the Acted Facial Expressions in the Wild (AFEW) dataset as stimuli. To establish a baseline, we use the electrodermal activity (EDA) signals in this dataset and extract 6 features from each signal series corresponding to each movie clip to recognize 7 emotions, i.e., Anger, Disgust, Fear, Happy, Surprise, Sad and Neutral. Overall, 24 observers participated in our collection of the training set, including 19 observers who participated in only one session watching 80 videos from 7 classes and 5 observers who participated multiple times and watched all the videos. All videos were presented in an order balanced fashion. Leave-one-observer-out was employed in this classification task. We report the classification accuracy of our baseline, a three-layer network, on this initial training set while training with signals from all participants, only single participants and only multiple participants. We also investigate the recognition accuracy of grouping the dataset by arousal or valence, which achieves 68.66% and 72.72% separately. Finally, we provide a two-step network. The first step is to classify the features into high/low arousal or positive/negative valence by a network. Then the arousal/valence middle output of the first step is concatenated with feature sets as input of the second step for emotion recognition. We found that adding arousal or valence information can help to improve the classification accuracy. In addition, the information of positive/negative valence boosts the classification accuracy to a higher degree on this dataset.

下载PDF全文

下载文献需遵守相关版权规定

论文标题