Environmental Sound Classification Based on MFCC-SVM and Cross Validation Method

doi:10.3969/j.issn.1006-2475.2016.08.008

Computer and Modernization, 2016, Vol. 0, Issue (8): 36-39.

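(Department of Information Management, Guangdong Justice Police Vocational College, Guangzhou 510520, China)

Received: 2016-01-22  Online: 2016-08-18  Published: 2016-08-11
About the author: LI Ling-li (b. 1977), female, from Honghu, Hubei; associate professor in the Department of Information Management, Guangdong Justice Police Vocational College; master's degree; research interests: data mining and pattern recognition.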


Abstract

Abstract: Recognition methods designed for music and speech are generally not suitable for environmental sounds. This paper proposes an approach based on MFCC (Mel frequency cepstrum coefficients) and SVM (support vector machine) that combines feature representation with learner optimization to classify 10 types of environmental sounds in an office. The sound events come from the standard dataset of the IEEE AASP (Audio and Acoustic Signal Processing) Challenge. While analyzing and optimizing the SVM parameters, the approach varies the number of Mel coefficients to find an effective MFCC feature representation. Experiments show that, with MFCC features, an SVM classifier, and 5-fold cross validation as the test method, the average classification accuracy reaches 88.05%, which is significantly better than the default MFCC-SVM algorithm.

Key words: Mel frequency cepstrum coefficients (MFCC), support vector machine (SVM), cross validation, environmental sounds classification, feature extraction
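
The 5-fold cross validation protocol named in the abstract can be sketched as follows. This is only an illustration under stated assumptions, not the paper's implementation: the feature vectors are synthetic stand-ins for per-clip MFCC vectors, the labels `class_a` and `class_b` are hypothetical, and a nearest-centroid classifier replaces the SVM so that the sketch runs with the Python standard library alone. All function names are invented for this example.

```python
import random
from statistics import mean


def k_fold_indices(n_samples, k=5, seed=0):
    """Shuffle sample indices and split them into k nearly equal folds."""
    idx = list(range(n_samples))
    random.Random(seed).shuffle(idx)
    return [idx[i::k] for i in range(k)]


def nearest_centroid_fit(X, y):
    """Stand-in classifier (NOT an SVM): store one mean vector per class."""
    sums, counts = {}, {}
    for xi, yi in zip(X, y):
        acc = sums.setdefault(yi, [0.0] * len(xi))
        for j, v in enumerate(xi):
            acc[j] += v
        counts[yi] = counts.get(yi, 0) + 1
    return {c: [v / counts[c] for v in s] for c, s in sums.items()}


def nearest_centroid_predict(model, x):
    """Return the class whose centroid is closest to x (squared Euclidean)."""
    def dist2(c):
        return sum((a - b) ** 2 for a, b in zip(model[c], x))
    return min(model, key=dist2)


def cross_val_accuracy(X, y, k=5, seed=0):
    """Average test accuracy over k folds; each fold is the test set once."""
    scores = []
    for test_idx in k_fold_indices(len(X), k, seed):
        held_out = set(test_idx)
        train_idx = [i for i in range(len(X)) if i not in held_out]
        model = nearest_centroid_fit([X[i] for i in train_idx],
                                     [y[i] for i in train_idx])
        hits = sum(nearest_centroid_predict(model, X[i]) == y[i]
                   for i in test_idx)
        scores.append(hits / len(test_idx))
    return mean(scores)


def make_toy_features(n_per_class=20, n_coeffs=13, seed=1):
    """Synthetic stand-in for MFCC vectors: two well-separated classes."""
    rng = random.Random(seed)
    X, y = [], []
    for label, centre in (("class_a", 0.0), ("class_b", 5.0)):
        for _ in range(n_per_class):
            X.append([rng.gauss(centre, 1.0) for _ in range(n_coeffs)])
            y.append(label)
    return X, y


X, y = make_toy_features()
accuracy = cross_val_accuracy(X, y, k=5)
print(f"mean 5-fold accuracy on toy data: {accuracy:.3f}")
```

To mirror the parameter search described in the abstract, one could call `cross_val_accuracy` once per candidate setting (for example, per value of `n_coeffs`) and keep the setting with the highest mean accuracy.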

CLC number: TP391.42

LI Ling-li. Environmental Sound Classification Based on MFCC-SVM and Cross Validation Method[J]. Computer and Modernization, 2016, 0(8): 36-39.

