Automatic Recognition of Dysarthria Based on Differential Evolution and Logistic Regression

doi:10.3969/j.issn.1006-2475.2019.08.001

Computer and Modernization, 2019, Vol. 0, Issue (08): 1-. doi: 10.3969/j.issn.1006-2475.2019.08.001

• Pattern Recognition •

(School of Computer and Electronics Information, Guangxi University, Nanning 530004, China)

Received: 2019-01-27 Online: 2019-08-15 Published: 2019-08-16
About the authors: LI Yu-xing (b. 1993), female, from Zhangshu, Jiangxi, master's student; research interests: pattern recognition, speech recognition; E-mail: 1210511121@qq.com. Corresponding author: LIANG Zheng-you (b. 1968), male, from Tiandeng, Guangxi, professor, Ph.D.; research interests: wireless sensor networks, parallel and distributed computing, artificial intelligence; E-mail: zhyliang@gxu.edu.cn. SUN Yu (b. 1981), female, from Nanning, Guangxi, lecturer, Ph.D.; research interests: intelligent algorithms, image recognition.
Funding:
National Natural Science Foundation of China (61763002)

Abstract

Traditional diagnosis of dysarthria is time-consuming and costly. To address this, a method for the automatic computer recognition of dysarthric speech is proposed. Gammatone Frequency Cepstrum Coefficients (GFCC) are combined with commonly used acoustic features to form a combined acoustic feature set, a differential evolution algorithm is applied for feature selection, and a logistic regression classifier is used to recognize dysarthric speech. The Torgo dysarthric speech database is divided into three speech subsets: non-words, short words, and restricted sentences. 24-dimensional GFCC and 37-dimensional commonly used acoustic features are extracted to form the combined acoustic features, and the differential evolution algorithm and logistic regression classifier are then used for classification. Experiments show that the differential evolution algorithm effectively selects feature subsets that better distinguish dysarthric from healthy speech, which significantly improves the dysarthria recognition rate. On the non-word subset, the method achieves an accuracy of 98.18%, a recall of 98.3%, and a precision of 98.3%.

Key words: GFCC, differential evolution algorithm, logistic regression, dysarthria recognition
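
The pipeline summarized in the abstract (a 61-dimensional combined feature vector, differential evolution for feature selection, logistic regression for classification) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not the authors' implementation: the data are synthetic rather than Torgo features, the search is a classic DE/rand/1/bin scheme on continuous genomes thresholded at 0.5 to obtain a feature mask, the fitness is plain validation accuracy, and the names `make_demo_data`, `fit_logreg`, `accuracy`, and `de_select` are hypothetical helpers.

```python
import numpy as np

N_GFCC, N_ACOUSTIC = 24, 37          # dimensions quoted in the abstract
N_FEATURES = N_GFCC + N_ACOUSTIC     # 61-dimensional combined feature vector

def make_demo_data(n=200, n_informative=5, seed=0):
    """Synthetic stand-in for real features: only the first few columns carry signal."""
    rng = np.random.default_rng(seed)
    y = rng.integers(0, 2, size=n)                # toy labels: 1 = dysarthric, 0 = healthy
    X = rng.normal(size=(n, N_FEATURES))
    X[:, :n_informative] += (2 * y - 1)[:, None]  # shift informative columns by class
    split = int(0.7 * n)
    return X[:split], y[:split], X[split:], y[split:]

def fit_logreg(X, y, lr=0.1, steps=200):
    """Logistic regression trained by batch gradient descent; returns (weights, bias)."""
    w, b = np.zeros(X.shape[1]), 0.0
    for _ in range(steps):
        p = 1.0 / (1.0 + np.exp(-(X @ w + b)))    # predicted probability of class 1
        g = p - y                                 # gradient of the log-loss w.r.t. the logit
        w -= lr * (X.T @ g) / len(y)
        b -= lr * g.mean()
    return w, b

def accuracy(X_tr, y_tr, X_va, y_va, mask):
    """Fitness (assumed): validation accuracy using only the features selected by mask."""
    if not mask.any():
        return 0.0                                # an empty feature subset is useless
    w, b = fit_logreg(X_tr[:, mask], y_tr)
    pred = (X_va[:, mask] @ w + b) > 0
    return float((pred == (y_va == 1)).mean())

def de_select(X_tr, y_tr, X_va, y_va, pop_size=12, gens=15, F=0.5, CR=0.9, seed=0):
    """DE/rand/1/bin over genomes in [0, 1]; a gene above 0.5 keeps that feature."""
    rng = np.random.default_rng(seed)
    d = X_tr.shape[1]
    score = lambda genome: accuracy(X_tr, y_tr, X_va, y_va, genome > 0.5)
    pop = rng.random((pop_size, d))
    fit = np.array([score(ind) for ind in pop])
    for _ in range(gens):
        for i in range(pop_size):
            others = [j for j in range(pop_size) if j != i]
            r1, r2, r3 = pop[rng.choice(others, 3, replace=False)]
            mutant = np.clip(r1 + F * (r2 - r3), 0.0, 1.0)   # differential mutation
            cross = rng.random(d) < CR                       # binomial crossover
            cross[rng.integers(d)] = True                    # take at least one mutant gene
            trial = np.where(cross, mutant, pop[i])
            f = score(trial)
            if f >= fit[i]:                                  # greedy one-to-one selection
                pop[i], fit[i] = trial, f
    best = int(fit.argmax())
    return pop[best] > 0.5, float(fit[best])

X_tr, y_tr, X_va, y_va = make_demo_data()
mask, best_acc = de_select(X_tr, y_tr, X_va, y_va)
print(f"kept {mask.sum()} of {N_FEATURES} features, validation accuracy {best_acc:.3f}")
```

In practice the fitness would more likely be a cross-validated score, and recall and precision would be reported alongside accuracy as in the abstract; the single hold-out split here only keeps the sketch short.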

CLC number: TN912.34

LI Yu-xing, LIANG Zheng-you, SUN Yu. Automatic Recognition of Dysarthria Based on Differential Evolution and Logistic Regression[J]. Computer and Modernization, 2019, 0(08): 1-.

