论文标题
通过学习和融合视觉线索的自动抑郁症检测
Automatic Depression Detection via Learning and Fusing Features from Visual Cues
论文作者
论文摘要
抑郁症是最普遍的精神障碍之一,严重影响了一个人的生活。传统的抑郁诊断通常取决于量表的评级,这可能是劳动密集型和主观的。在这种情况下,自动抑郁症检测(ADD)因其低成本和客观性而引起了更多关注。添加系统能够从某些病历(例如视频序列)自动检测到抑郁症。但是,从长序列中有效提取特定于抑郁的信息,从而阻碍了令人满意的精度仍然具有挑战性。在本文中,我们通过学习和融合视觉提示的功能提出了一种新颖的添加方法。具体而言,我们首先构建了时间扩张的卷积网络(TDCN),其中设计和堆叠了多个扩张的卷积块(DCB),以从序列中学习远程时间信息。然后,采用特征注意力(FWA)模块来融合从TDCN中提取的不同特征。该模块学会为特征通道分配权重,旨在更好地结合各种视觉特征并进一步提高检测精度。我们的方法与其他基于视觉功能的方法相比,在DAIC_WOZ数据集上实现了最先进的性能,显示出其有效性。
Depression is one of the most prevalent mental disorders, which seriously affects one's life. Traditional depression diagnostics commonly depends on rating with scales, which can be labor-intensive and subjective. In this context, Automatic Depression Detection (ADD) has been attracting more attention for its low cost and objectivity. ADD systems are able to detect depression automatically from some medical records, like video sequences. However, it remains challenging to effectively extract depression-specific information from long sequences, thereby hindering a satisfying accuracy. In this paper, we propose a novel ADD method via learning and fusing features from visual cues. Specifically, we firstly construct Temporal Dilated Convolutional Network (TDCN), in which multiple Dilated Convolution Blocks (DCB) are designed and stacked, to learn the long-range temporal information from sequences. Then, the Feature-Wise Attention (FWA) module is adopted to fuse different features extracted from TDCNs. The module learns to assign weights for the feature channels, aiming to better incorporate different kinds of visual features and further enhance the detection accuracy. Our method achieves the state-of-the-art performance on the DAIC_WOZ dataset compared to other visual-feature-based methods, showing its effectiveness.