基于可变形卷积神经网络的手势识别方法

doi:10.3969/j.issn.10062475.2018.04.012

计算机与现代化 ›› 2018, Vol. 0 ›› Issue (04): 62-.doi: 10.3969/j.issn.10062475.2018.04.012

基于可变形卷积神经网络的手势识别方法

(华南农业大学数学与信息学院，广东广州510642)

出版日期:2018-04-28 发布日期:2018-05-02
作者简介: 苏军雄(1995)，男，广东中山人，华南农业大学数学与信息学院本科生，研究方向：图像处理; 见雪婷(1995)，女，本科生，研究方向：图像处理; 刘玮(1995)，女，广东广州人，本科生，研究方向：图像处理; 华俊达(1999)，男，广东茂名人，本科生，研究方向：图像处理; 通信作者：张胜祥(1969)，男，副教授，博士，研究方向：非线性系统理论。
基金资助:
2016年省级大学生创新训练计划项目(201610564356); 广州市科技计划项目(201707010031)

Gesture Recognition Method Based on Deformable Convolution Neural Network

(College of Mathematics and Informatics, South China Agricultural University, Guangzhou 510642, China)

Online:2018-04-28 Published:2018-05-02

摘要/Abstract

摘要： 卷积神经网络本身具有丰富的特征表达能力和学习能力，但本质上，其模块中几何变换能力是固定的。因此，引入可变形卷积核来改进VGG16的网络结构，搭建名为DCVGG的卷积神经网络结构来进行手势识别的研究。在不同数据集下，基于可变形卷积神经网络的手势识别方法能够直接把RGB图像数据输入网络。最终输出的结果，对手势的平均识别率达到97%以上，有效提高网络的性能，提升卷积神经网络对样本对象的容忍度和多样性，丰富卷积神经网络的特征表达能力，与传统LeNet5、VGG16结构和传统人工特征提取算法相比效果更佳，比传统结构更深，鲁棒性更好，识别率更强，可以为复杂背景下有效识别手势提供参考，具有一定的延拓能力。

关键词: 手势识别, 可变形卷积, 卷积神经网络, 卷积核, 双线性插值

Abstract: Convolution neural network itself has a rich ability of expressing features and learning, but in essence, the module geometric transformation ability is fixed. Therefore, the VGG16 network structure is improved by introducing a deformable convolution kernel, and a convolution neural network structure named DCVGG is built to study the gesture recognition. In different data sets, the gesture recognition method based on deformable convolution neural network can input RGB image data directly into the network. The results show that the average recognition rate of gestures is over 97%, which can improve the performance of the network, enhance the tolerance and diversity of the convolution neural network to the sample object, and enrich the expression ability of the convolution neural network. Compared with the traditional LeNet5, VGG16 structure and traditional feature extraction by hand, DCVGG is deeper than the traditional structure, the robustness is better, the recognition rate is stronger, which can provide reference for the effective recognition of gestures in complex background, and has some extension ability.

Key words: gesture recognition, deformable convolution, convolution neural network (CNN), convolution kernel, bilinear interpolation

中图分类号:

TP391.9

苏军雄，见雪婷，刘玮，华俊达，张胜祥. 基于可变形卷积神经网络的手势识别方法[J]. 计算机与现代化, 2018, 0(04): 62-.

SU Junxiong, JIAN Xueting, LIU Wei, HUA Junda, ZHANG Shengxiang. Gesture Recognition Method Based on Deformable Convolution Neural Network[J]. Computer and Modernization, 2018, 0(04): 62-.

参考文献

［1］ Pavlovic V I, Sharma R, Huang T S. Visual interpretation of hand gestures for humancomputer interaction: A review［J］. IEEE Transactions on Pattern Analysis and Machine Intelligence, 1997,19(7):677695.
［2］ Wu Ying, Huang T S. Visionbased gesture recognition: A review［C］// Proceedings of the 1999 International Gesture Workshop on Gesturebased Communication in HumanComputer Interaction. 1999:103115.
［3］ Jaimes A, Sebe N. Multimodal humancomputer interaction: A survey［J］. Computer Vision and Image Understanding, 2007,108(12):116134.
［4］ Xie Renqiang, Sun Xia, Xia Xiang, et al. Similarity matchingbased extensible hand gesture recognition［J］. IEEE Sensors Journal, 2015,15(6):34753483.
［5］庞海波,李占波,丁友东. 基于时间序列手势轮廓模型的动态手势识别［J］. 华南理工大学学报(自然科学版), 2015,43(1):140146.
［6］ Bhuyan M K, Kumar D A, MacDorman K F, et al. A novel set of features for continuous hand gesture recognition［J］. Journal on Multimodal User Interfaces, 2014,8(4):333343.
［7］李翠,王小妮,刘园园. 基于SIFT算法的手势控制系统的设计与实现［J］. 现代经济信息, 2016(10):337.
［8］ Wallach H M. Topic modeling: Beyond bagofwords［C］// Proceedings of the 23rd International Conference on Machine Learning. 2006:977984.
［9］曹洁,赵修龙,王进花. 基于RGBD信息的动态手势识别方法［J/OL］. http://www.arocmag.com/article/02201806050.html, 20170614.
［10］刘斌,赵兴,胡春海,等. 面向颜色深度图像手脸近距遮挡的手势识别［J］. 激光与光电子学进展, 2016(6):134143.
［11］Krizhevsky A, Sutskever I, Hinton G E. ImageNet classification with deep convolutional neural networks［C］// Proceedings of the 25th International Conference on Neural Information Processing Systems. 2012,1:10971105.
［12］陈祖雪. 基于深度卷积神经网络的手势识别研究［D］. 西安:陕西师范大学, 2016.
［13］柯圣财,赵永威,李弼程,等. 基于卷积神经网络和监督核哈希的图像检索方法［J］. 电子学报, 2017,45(1):157163.
［14］Fan Yin, Lu Xiangju, Li Dian, et al. Videobased emotion recognition using CNNRNN and C3D hybrid networks［C］// Proceedings of the 18th ACM International Conference on Multimodal Interaction. 2016:445450.
［15］左艳丽,马志强,左宪禹. 基于改进卷积神经网络的人体检测研究［J］. 现代电子技术, 2017,40(4):1215.
［16］赵志宏,杨绍普,马增强. 基于卷积神经网络LeNet5的车牌字符识别研究［J］. 系统仿真学报, 2010,22(3):638641.
［17］Dai Jifeng, Qi Haozhi, Xiong Yuwen, et al. Deformable Convolutional Networks［DB/OL］. https://arxiv.org/abs/1703.06211, 20170605.
［18］Multimedia Computing Laboratory. Large RGBD Extensible Hand Gesture Dataset［DB/OL］. http://mclab.citi.sinica.edu.tw/dataset/lared/lared.html#download, 20140718.

[1]	何思达, 陈平华. 基于意图的轻量级自注意力序列推荐模型[J]. 计算机与现代化, 2024, 0(12): 1-9.
[2]	黄庭培1, 马禄彪1, 李世宝2, 刘建航1. 基于WiFi和原型网络的手势识别方法[J]. 计算机与现代化, 2024, 0(12): 34-39.
[3]	张晓东1, 白广芝1, 李敏1, 李昊洋2. 基于经验小波变换的油气井产量预测模型 [J]. 计算机与现代化, 2024, 0(12): 53-58.
[4]	刘宝宝, 杨菁菁, 陶露, 王贺应. 基于注意力的DSMSC的遥感图像场景分类[J]. 计算机与现代化, 2024, 0(12): 72-77.
[5]	陈雪松1, 李衡1, 王浩畅2. 结合注意力机制和Mengzi模型的短文本分类[J]. 计算机与现代化, 2024, 0(09): 101-106.
[6]	马永, 王俊, 张子健, 赵煜阳, 张靖, 周明. 面向智慧运维系统的改进YOLOv8行为检测算法[J]. 计算机与现代化, 2024, 0(08): 43-48.
[7]	魏嘉焜, 王家润. 手势识别与交互综述[J]. 计算机与现代化, 2024, 0(08): 67-76.
[8]	高帅鹏, 王怡凡. 基于图像的群体情绪识别综述[J]. 计算机与现代化, 2024, 0(08): 98-107.
[9]	周宪溪, 牟莉. 基于改进TF-IDF和AGLCNN的新闻长文本分类模型[J]. 计算机与现代化, 2024, 0(08): 120-126.
[10]	杨江1, 孙晓梅1, 许韬2. 基于业务内容构建股票关联关系的股价预测[J]. 计算机与现代化, 2024, 0(07): 21-25.
[11]	刘存莉1, 雷占占2, 郑澳2. 基于循环卷积神经网络的排水管网缺陷检测方法[J]. 计算机与现代化, 2024, 0(07): 26-35.
[12]	李珊, 王林娜, 高丁佳, 宣海波. 基于图神经网络的多层银企网络融合研究[J]. 计算机与现代化, 2024, 0(05): 27-32.
[13]	钟海龙1, 2, 何月顺1, 何璘琳1, 陈杰1, 田鸣3, 郑瑞银4. 基于代价敏感卷积神经网络的加密流量分类#br# #br#[J]. 计算机与现代化, 2024, 0(05): 55-60.
[14]	高埂1, 肖风丽2, 杨飞1. 基于改进MobileNetV3-Small的色素减退性皮肤病诊断[J]. 计算机与现代化, 2024, 0(05): 120-126.
[15]	游嘉靖1, 2, 何月顺1, 何璘琳1, 钟海龙1, 2. 基于AHP-CNN的加密流量分类方法[J]. 计算机与现代化, 2024, 0(04): 83-87.

基于可变形卷积神经网络的手势识别方法

Gesture Recognition Method Based on Deformable Convolution Neural Network

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

本文评价