计算机与现代化 ›› 2013, Vol. 1 ›› Issue (5): 201-205.doi: 10.3969/j.issn.1006-2475.2013.05.047

• 应用与开发 • 上一篇    下一篇

一种基于贪婪策略的说话人语音特征优选方法

陈觉之1,张贵荣2,周宇欢3

  

  1. 1.海军指挥学院信息系,江苏南京211800;2.92601部队计量站,广东湛江524009;3.解放军理工大学指挥信息系统学院,江苏南京210007
  • 收稿日期:2013-03-12 修回日期:1900-01-01 出版日期:2013-05-28 发布日期:2013-05-28

A Method of Speaker Acoustic Features Optimal Selection Based on Greedy Strategy

CHEN Jue-zhi1, ZHANG Gui-rong2, ZHOU Yu-huan3   

  1. 1. Department of Information, Institute of Naval Command, Nanjing 211800, China;2. Metering Station, Unit 92601 Troops of PLA, Zhanjiang 524009, China;3. Institute of Command Information System, PLA University of Science and Technology, Nanjing 210007, China)
  • Received:2013-03-12 Revised:1900-01-01 Online:2013-05-28 Published:2013-05-28

摘要: 为了提高说话人识别的性能,提出一种语音特征优选方法,从目前使用效果较好的特征参数中,采用贪婪算法优选出若干维特征用于识别。在TIMIT语音数据库上实验显示,识别率相比传统方法提高了1.6%;对于加入了噪声的语音,识别率提高了6%,识别速度提高了5倍左右。实验结果表明,优选后的特征参数能够去除不良特征对识别系统的干扰,有效提高说话人识别系统的识别率和识别速度。

关键词: 说话人识别, 特征选择, 高斯混合模型, 贪婪算法, 美尔倒谱系数

Abstract: In order to improve the performance of speaker recognition, a method for optimal speaker acoustic features selection is proposed, using greedy algorithm to select some dimensional features from a large dimensional feature set in turn. The recognition rate is increased by 1.6% on pure voice than traditional method and by 6% on noisy voice, the recognition speed is increased about 5 times in experiments based on TIMIT. The experiment results show that the optimal selective features which eliminate disturbance of other redundant features can effectively improve both the recognition accuracy and the computational speed on the speaker recognition system.

Key words: speaker recognition, feature selection, Gaussian mixture model (GMM), greedy algorithm, Mel frequency cepstrum coefficient(MFCC)

中图分类号: