计算机与现代化

• 模式识别 •    下一篇

结合差分演化和逻辑回归的构音障碍自动识别方法

  

  1. (广西大学计算机与电子信息学院,广西南宁530004)
  • 收稿日期:2019-01-27 出版日期:2019-08-15 发布日期:2019-08-16
  • 作者简介:黎雨星(1993-),女,江西樟树人,硕士研究生,研究方向:模式识别,语音识别,E-mail: 1210511121@qq.com; 通信作者:梁正友(1968-),男,广西天等人,教授,博士,研究方向:无线传感器网络,并行分布式计算,人工智能,E-mail: zhyliang@gxu.edu.cn; 孙宇(1981-),女,广西南宁人,讲师,博士,研究方向:智能算法,图像识别。
  • 基金资助:
    国家自然科学基金资助项目(61763002)

Automatic Recognition of Dysarthria Based on Differential Evolution and Logistic Regression

  1. (School of Computer and Electronics Information, Guangxi University, Nanning 530004, China)
  • Received:2019-01-27 Online:2019-08-15 Published:2019-08-16

摘要: 针对传统的构音障碍诊断方法存在耗时高、成本高等问题,提出一种构音障碍语音的计算机自动识别方法。结合Gammatone频率倒谱系数(Gammatone Frequency Cepstrum Coefficients, GFCC)与常用声学特征形成组合声学特征,应用差分演化算法进行特征选择,并使用逻辑回归分类器对构音障碍语音进行识别。将Torgo构音障碍语音数据库分成3个语音子集,分别是非词、短词语、限制句子集,提取24维GFCC和37维常用的声学特征构成组合声学特征,最后使用差分演化算法和逻辑回归分类器进行分类识别。实验表明:使用差分演化算法可以有效选择出具有更佳识别能力的特征,从而显著提高构音障碍识别率。在非词子集上的实验准确率达到98.18%,召回率为98.3%,精确率为98.3%。

关键词: GFCC, 差分演化算法, 逻辑回归, 构音障碍识别

Abstract: Aiming at the problems of high time consuming and cost in traditional diagnosis of dysarthria speech, a computer automatic recognition method for dysarthria is proposed. Combining the Gammatone Frequency Cepstrum Coefficients (GFCC) with the common acoustic features to form a combined acoustic feature, a differential evolution algorithm is applied for feature selection, and a logistic regression classifier is used to identify the dysarthria speech. The Torgo database is divided into three subsets, which are non-words, short words, restricted sentence. 24-dimensional GFCC and 37-dimensional commonly used acoustic features are extracted to form combined acoustic features. Finally, differential evolution algorithm and logistic regression classifier are used for identificaiton of dysarthria. Experiments show that the differential evolution algorithm can effectively select feature subsets with better ability to distinguish dysarthria and healthy speech, which can significantly improve performance in the classification of dysarthria. The experiment on non-word subsets achieves 98.18% of accuracy, 98.3% of recall, and 98.3% of precision.

Key words: GFCC, differential evolution algorithm, logistic regression, dysarthria recognition

中图分类号: