论文标题
深层多任务多标签CNN用于有效的面部属性分类
Deep Multi-task Multi-label CNN for Effective Facial Attribute Classification
论文作者
论文摘要
面部属性分类(FAC)引起了计算机视觉和模式识别的越来越多的关注。但是,最新的FAC方法执行面部检测/对齐方式,并独立进行FAC。这些任务之间的固有依赖关系并未完全利用。此外,大多数方法都使用相同的CNN网络体系结构预测所有面部属性,该架构忽略了面部属性的不同学习复杂性。为了解决上述问题,我们提出了一种新型的深层多任务多标签CNN,称为DMM-CNN,以实现有效的FAC。具体而言,DMM-CNN共同优化了两个密切相关的任务(即面部地标检测和FAC),以利用多任务学习的优势来提高FAC的性能。为了处理面部属性的多样化学习复杂性,我们将属性分为两组:客观属性和主观属性。分别设计了两个不同的网络体系结构,以提取两组属性的特征,并提出了一种新型的动态加权方案,以在训练过程中自动为每个面部属性分配减少权重。此外,制定了一种自适应阈值策略,以有效缓解多标签学习的阶级失衡问题。与几种最先进的FAC方法相比,有关挑战性Celeba和LFWA数据集的实验结果表明,所提出的DMM-CNN方法的优越性。
Facial Attribute Classification (FAC) has attracted increasing attention in computer vision and pattern recognition. However, state-of-the-art FAC methods perform face detection/alignment and FAC independently. The inherent dependencies between these tasks are not fully exploited. In addition, most methods predict all facial attributes using the same CNN network architecture, which ignores the different learning complexities of facial attributes. To address the above problems, we propose a novel deep multi-task multi-label CNN, termed DMM-CNN, for effective FAC. Specifically, DMM-CNN jointly optimizes two closely-related tasks (i.e., facial landmark detection and FAC) to improve the performance of FAC by taking advantage of multi-task learning. To deal with the diverse learning complexities of facial attributes, we divide the attributes into two groups: objective attributes and subjective attributes. Two different network architectures are respectively designed to extract features for two groups of attributes, and a novel dynamic weighting scheme is proposed to automatically assign the loss weight to each facial attribute during training. Furthermore, an adaptive thresholding strategy is developed to effectively alleviate the problem of class imbalance for multi-label learning. Experimental results on the challenging CelebA and LFWA datasets show the superiority of the proposed DMM-CNN method compared with several state-of-the-art FAC methods.