计算机与现代化

• 人工智能 • 上一篇    下一篇

基于MFCC-SVM和交叉验证方法的环境音分类

  

  1. (广东司法警官职业学院信息管理系,广东 广州 510520)
  • 收稿日期:2016-01-22 出版日期:2016-08-18 发布日期:2016-08-11
  • 作者简介:李玲俐(1977-),女,湖北洪湖人,广东司法警官职业学院信息管理系副教授,硕士,研究方向:数据挖掘与模式识别。

Environmental Sound Classification Based on MFCC-SVM and Cross Validation Method

  1. (Department of Information Management, Guangdong Justice Police Vocational College, Guangzhou 510520, China)
  • Received:2016-01-22 Online:2016-08-18 Published:2016-08-11

摘要: 用于音乐和语音的识别方法不适用于环境音的识别。提出一种基于MFCC(Mel频率倒谱系数)-SVM(支持向量机)的方法,使用特征表示和学习优化共同来实现办公室10种环境音的分类。环境音数据使用的是IEEE Audio and Acoustic Signal Processing (AASP) Challenge Dataset下载的标准数据集。在分析和优化SVM参数过程中,通过改变Mel系数参数的个数,充分考虑有效的MFCC特征表示。实验结果表明,使用MFCC特征和SVM分类器,采用5-折交叉验证的测试方法,得到的平均分类准确率可达88.05%,分类效果明显优于默认的MFCC-SVM算法。

关键词: Mel频率倒谱系数, 支持向量机, 交叉验证, 环境音分类, 特征提取

Abstract: In general, recognition methods applied for music and speech data are not appropriate for the environmental sounds. In this paper, we propose a MFCC (Mel frequency cepstrum coefficients)-SVM (support vector machine)-based approach that exploits feature representation and learner optimization to achieve the classification of 10 different environmental sounds signals in the office. Environmental sounds events are obtained by using the IEEE AASP (Audio and Acoustic Signal Processing) Challenge Dataset. The proposed approach considers efficient representation of MFCC features by changing the number of Mel coefficients in analyzing as well as optimizing the SVM parameters. Experiment shows that, when the results of the proposed methods are chosen for MFFC feature and SVM classifier, the tests conducted through using 5-fold cross validation, the average classification accuracy can be up to 88.05%. The classification effect is significantly better than the default MFCC-SVM algorithm.

Key words: Mel frequency cepstrum coefficients (MFCC), support vector machine (SVM), cross validation, environmental sounds classification, feature extraction

中图分类号: