计算机与现代化 ›› 2024, Vol. 0 ›› Issue (08): 67-76.doi: 10.3969/j.issn.1006-2475.2024.08.012

• 人工智能 • 上一篇    下一篇

手势识别与交互综述




  

  1. (华北计算技术研究所战场环境专业部,北京 100083)
  • 出版日期:2024-08-28 发布日期:2024-08-28
  • 基金资助:
    华北计算技术研究所创新基金资助项目(X202202450)

Survey on Gesture Recognition and Interaction

  1. (Department of Battlefield Environment, North China Institute of Computing Technology, Beijing 100083, China)
  • Online:2024-08-28 Published:2024-08-28

摘要: 手势识别与交互技术是人机交互技术与人工智能技术前沿研究的基石任务。该任务以计算机和设备协同工作识别、处理手势信息并给出与手势相对应的机器操作为主要目标,融合应用了动作捕捉、图像处理、图像分类、多端协同交互工作等多项技术,是支撑指挥控制系统、机器人交互、医疗操作等当下前沿智能交互工作与人机交互工作的有力保障。目前,手势识别与交互的相关研究已经日渐成熟,应用领域广泛、应用场景丰富。本文主要对手势识别与交互的相关技术和硬件发展做出综述。首先,全面梳理手势识别与交互技术的研究进展,同时对手势识别的关键步骤进行归类描述;其次,分类阐述用于三维手势交互的当前主流手势识别深度传感器的相关工作;随后,对三维手势识别的真实感识别技术进行剖析和讨论;最后,分析手势识别与交互技术中存在的不足与亟待改进的问题,提出融合深度学习、模式识别等前沿技术与有可行性的研究思路和方法,对该领域未来的研究方向、技术发展和应用领域做出预测和展望。

关键词: 手势识别, 手势交互, 人机交互, 多模态智能交互, 计算机视觉

Abstract: Gesture recognition and interaction technology is the cornerstone task of frontier research in human-computer interaction technology and artificial intelligence technology. This task takes the collaborative work of computers and devices to recognize and process gesture information and give machine operations corresponding to gestures as the main goal, and integrates a number of technologies such as motion capture, image processing, image classification, and multi-terminal collaborative interaction, which is a powerful guarantee to support the command and control system, robot interaction, medical operation and other cutting-edge intelligent interaction and human-computer interaction work nowadays. At present, the research on gesture recognition and interaction has become more and more mature with a wide range of application fields and rich application scenarios. This paper mainly provides a review of the gesture recognition development and interaction related technologies and hardware. Firstly, it sorts the research progress of gesture recognition and interaction technology out comprehensively, and categories the key steps of gesture recognition at the same time. Secondly, it classifies and elaborates the related work of the current mainstream gesture recognition depth sensors used for 3D gesture interaction. Subsequently, it analyses and discusses the real sense recognition technology for 3D gesture recognition. Finally, it analyses the deficiencies and urgent problems in gesture recognition and interaction technology, proposes the integration of such cutting-edge technologies as deep learning, pattern recognition and other feasible research ideas and methods, and makes predictions and prospects for the future research direction, technology development and application areas in this field.

Key words: gesture recognition, gesture interaction, human-computer interaction, multimodal intelligent interaction, computer vision

中图分类号: