计算机与现代化

• 人工智能 • 上一篇    下一篇

基于聚类分析与说话人识别的语音跟踪

  

  1. (广东工业大学机电工程学院,广东广州510006)
  • 收稿日期:2019-10-22 出版日期:2020-04-22 发布日期:2020-04-24
  • 作者简介:郝敏(1993-),男,山西太原人,硕士研究生,研究方向:智能语音处理,E-mail: 18826220184@163.com; 刘航(1994-),男,江西萍乡人,硕士,研究方向:语音信号处理,E-mail: 15521331910@163.com; 李扬(1966-),男,广东徐闻人,教授,博士,研究方向:智能装备制造,自适应控制,E-mail: lyang@gdut.edu.cn; 简单(1995-),男,硕士研究生,研究方向:电动汽车电池管理,E-mail: easy_boy@163.com; 王俊影(1990-),女,硕士研究生,研究方向:图像识别,嵌入式技术,E-mail: wangjunying_666@163.com。
  • 基金资助:
    广东省省级科技计划项目(2013B011304008,2013B090600031); 佛山市产学研专项资金项目(2012HC100195)

Speech Tracking Based on Cluster Analysis and Speaker Recognition

  1. (School of Electro-Mechanical Engineering, Guangdong University of Technology, Guangzhou 510006, China)
  • Received:2019-10-22 Online:2020-04-22 Published:2020-04-24

摘要: 目前语音跟踪在说话人干扰的条件下,即一段语音中存在多个说话人的混合语音信号时,语音跟踪质量会严重下降。针对这种情况,提出一种基于聚类分析与说话人识别的语音跟踪算法。算法首先使用改进的聚类分析方法进行语音分离,具体包括在K-means聚类中对质心进行缓存并降低采样率,以及在embedding特征空间引入正则项。其次,算法采用GMM-UBM说话人模型进行语音跟踪。实验结果表明改进的聚类分析方法可以有效提高算法的实时性及其语音分离质量,GMM-UBM模型在3 s语音的测试中具有84%的识别率。

关键词: 单信道语音跟踪, 智能语音, 聚类分析, 高斯混合模型, 长短期记忆网络

Abstract: At present, the speech tracking quality will be seriously reduced under the condition of speaker interference, that is, mixed speech signals of multiple speakers in a speech segment. Aiming at this situation, a speech tracking algorithm based on cluster analysis and speaker recognition is proposed. Firstly, the improved clustering analysis method is used for speech separation. Specifically, it includes caching the center of mass and lowering the sampling rate in K-means clustering, and introducing regular terms into embedding feature space. Secondly, the GMM-UBM speaker model is used for speech tracking. The experimental results show that the improved cluster analysis method can effectively improve the real-time performance of the algorithm and the quality of speech separation, the GMM-UBM model has an 84% recognition rate in 3 s speech test.

Key words: single channel speech track, intelligent speech, clustering analysis, Gaussian mixture model, LSTM

中图分类号: