论文标题
基于集合的卷积神经网络的新型手势检测和识别系统
A Novel Hand Gesture Detection and Recognition system based on ensemble-based Convolutional Neural Network
论文作者
论文摘要
如今,手势识别已成为人机相互作用的替代方法。它涵盖了大量应用,例如3D游戏技术,手语解释,VR(虚拟现实)环境和机器人技术。但是,在计算机视觉和模式识别社区中,对手部分的发现已成为一项艰巨的任务。深度学习算法(如卷积神经网络(CNN)体系结构)已成为分类任务的非常流行的选择,但是CNN体系结构遭受了一些问题,例如预测期间的差异很高,过度拟合问题以及预测错误。为了克服这些问题,本文介绍了基于CNN的方法的合奏。首先,通过使用基于二进制阈值的背景分离方法检测手势部分。之后,提取轮廓部分,并分割了手部区域。然后,已经调整图像并将其馈入三种单独的CNN模型,以并行训练它们。在最后一部分中,将CNN模型的输出得分进行平均以构建最终预测的最佳集合模型。两个包含红外图像的公开可用数据集(标记为DataSet-1和DataSet-2)和一个自构造的数据集已用于验证所提出的系统。将实验结果与现有的最新方法进行了比较,并且观察到我们提出的集合模型优于其他现有的方法。
Nowadays, hand gesture recognition has become an alternative for human-machine interaction. It has covered a large area of applications like 3D game technology, sign language interpreting, VR (virtual reality) environment, and robotics. But detection of the hand portion has become a challenging task in computer vision and pattern recognition communities. Deep learning algorithm like convolutional neural network (CNN) architecture has become a very popular choice for classification tasks, but CNN architectures suffer from some problems like high variance during prediction, overfitting problem and also prediction errors. To overcome these problems, an ensemble of CNN-based approaches is presented in this paper. Firstly, the gesture portion is detected by using the background separation method based on binary thresholding. After that, the contour portion is extracted, and the hand region is segmented. Then, the images have been resized and fed into three individual CNN models to train them in parallel. In the last part, the output scores of CNN models are averaged to construct an optimal ensemble model for the final prediction. Two publicly available datasets (labeled as Dataset-1 and Dataset-2) containing infrared images and one self-constructed dataset have been used to validate the proposed system. Experimental results are compared with the existing state-of-the-art approaches, and it is observed that our proposed ensemble model outperforms other existing proposed methods.