SqueezeFacePosenet：移动平台跨不同姿势的轻巧脸部验证

论文标题

SqueezeFacePosenet：移动平台跨不同姿势的轻巧脸部验证

SqueezeFacePoseNet: Lightweight Face Verification Across Different Poses for Mobile Platforms

论文作者

Alonso-Fernandez, Fernando, Barrachina, Javier, Hernandez-Diaz, Kevin, Bigun, Josef

论文摘要

通过移动平台的虚拟应用程序是AI中最关键和不断增长的领域之一，在通过移动设备提供的所有服务突破之后，无处不在和实时的人员身份验证变得至关重要。在这种情况下，鉴于这些设备中相机的可用性以及它们在日常应用中的广泛使用，面部验证技术可以提供可靠且可靠的用户身份验证。深度卷积神经网络的快速发展导致许多准确的面部验证体系结构。但是，它们的典型尺寸（数百个兆字节）使它们无法将其纳入可下载的移动应用程序中，这些移动应用程序通常不超过100 MB。因此，我们应对仅几个兆字节开发轻巧的面部识别网络的挑战，与大型模型相比，它们可以以足够的精度运行。鉴于通常在通常使用移动设备的不受控制的环境中观察到的可变性，该网络还应该能够在不同的姿势下运行。在本文中，我们适应了仅4.4MB的轻质压缩模型，以有效地提供交叉姿势识别。在MS-CELEB-1M和VGGFACE2数据库进行了训练之后，我们的模型在困难的额叶和配置文件比较上获得了1.23％的EER，而在个人资料与配置文件图像上，EER为0.54％。在任何涉及任何注册/查询图像对的额叶图像的极端变化下，EER被推到<0.3％，FRR的远处= 0.1％，至小于1％。这使我们的光模型适用于面部识别，至少可以控制入学图像的获取。以略有性能的降级为代价，我们还测试了一个更轻的模型（仅2.5mb），在该模型中，常规卷积被以深度可分离的卷积代替。

Virtual applications through mobile platforms are one of the most critical and ever-growing fields in AI, where ubiquitous and real-time person authentication has become critical after the breakthrough of all services provided via mobile devices. In this context, face verification technologies can provide reliable and robust user authentication, given the availability of cameras in these devices, as well as their widespread use in everyday applications. The rapid development of deep Convolutional Neural Networks has resulted in many accurate face verification architectures. However, their typical size (hundreds of megabytes) makes them infeasible to be incorporated in downloadable mobile applications where the entire file typically may not exceed 100 Mb. Accordingly, we address the challenge of developing a lightweight face recognition network of just a few megabytes that can operate with sufficient accuracy in comparison to much larger models. The network also should be able to operate under different poses, given the variability naturally observed in uncontrolled environments where mobile devices are typically used. In this paper, we adapt the lightweight SqueezeNet model, of just 4.4MB, to effectively provide cross-pose face recognition. After trained on the MS-Celeb-1M and VGGFace2 databases, our model achieves an EER of 1.23% on the difficult frontal vs. profile comparison, and0.54% on profile vs. profile images. Under less extreme variations involving frontal images in any of the enrolment/query images pair, EER is pushed down to<0.3%, and the FRR at FAR=0.1%to less than 1%. This makes our light model suitable for face recognition where at least acquisition of the enrolment image can be controlled. At the cost of a slight degradation in performance, we also test an even lighter model (of just 2.5MB) where regular convolutions are replaced with depth-wise separable convolutions.

下载PDF全文

下载文献需遵守相关版权规定

论文标题