计算机与现代化

• 图像处理 • 上一篇    下一篇

基于局部时空特征的人体行为软分类识别

  

  1. 南京航空航天大学自动化学院,江苏南京210016
  • 收稿日期:2013-11-08 出版日期:2014-03-24 发布日期:2014-03-31
  • 作者简介:吕温(1989-),男,浙江温州人,南京航空航天大学自动化学院硕士研究生,研究方向:图像处理; 徐贵力(1972-),男,黑龙江富锦人,教授,博士生导师,研究方向:光电检测,计算机视觉,数字图像处理与模式识别,计算机测控。
  • 基金资助:
    江苏省科技支撑项目(BE2012173)

Soft Classification in Action Recognition Based on Local Spatio-temporal Features

  1. College of Automation Engineering, Nanjing University of Aeronautics and Astronautics, Nanjing 210016, China
  • Received:2013-11-08 Online:2014-03-24 Published:2014-03-31

摘要: 针对全局运动特征难以准确提取的问题,本文采用局部时空特征对人体行为进行表征。针对传统词袋中硬分类的方法量化误差大的不足,本文借鉴模糊聚类的思想,提出软分类的方法。根据兴趣点检测算法从视频中提取出视觉词汇,用Kmeans算法对其进行聚类,建立码本。在计算分类特征时,首先计算待分类视觉词汇到码本中各个码字的距离,根据距离计算这个视觉词汇隶属于各个码字的概率,最后统计得到每个视频中各码字出现的频率。在Weizmann和KTH数据库对本文提出的人体行为识别算法进行验证,Weizmann库的识别率比传统的词袋算法提高8%,KTH库的识别率比传统的词袋算法提高9%,因此本文提出的算法能更有效地对人体行为进行识别。

关键词: 算法, 兴趣点, 行为识别, 模糊聚类

Abstract: Aiming at the problem of the global action feature that is difficult to accurately extract, this paper uses space-time interest points to represent motion. In view of the great error in the traditional quantizative word bag hard classification method, this paper refers to the idea of fuzzy clustering and proposes a soft classification approach. Firstly, interest points detect algorithm is applied to extract visual words from the video, and this paper builds the codebook via K-means clustering. This paper calculates the distance of the visual words to be classified and each codeword in the codebook and then gets the membership probability to each codeword. Finally, the codeword’s frequency in each video clip can be calculated. The performance is investigated in Weizmann and KTH datasets. The experiment result shows that the average recognition rates increase 8% in Weizmann datasets and 9% in KTH datasets. This proves that the approach can recognize human behavior more effectively.

Key words:  algorithm, interest point, action recognition, fuzzy clustering

中图分类号: