计算机与现代化

• 人工智能 • 上一篇    下一篇

基于T矩阵归一化PLDA的说话人确认

  

  1. 兰州理工大学电气工程与信息工程学院,甘肃兰州730050
  • 收稿日期:2017-03-07 出版日期:2017-10-30 发布日期:2017-10-31
  • 作者简介:缑新科(1966-),男,甘肃天水人,兰州理工大学电气工程与信息工程学院教授, 博士,研究方向:模式识别,信号处理; 王跃(1990-),男,山东滕州人,硕士研究生,研究方向:模式识别,语音信号处理。

Speaker Verification of Normalization PLDA Based on T Matrix

  1. College of Electrical and Information Engineering, Lanzhou University of Technology, Lanzhou 730050, China
  • Received:2017-03-07 Online:2017-10-30 Published:2017-10-31

摘要: 利用i-vector/PLDA模型进行说话人确认时,对于不定时间的语音,由于将长度归一化后的i-vector转化到PLDA模型时,伴随着不确定性的扭曲和缩放,影响识别率。本文通过对全变量空间矩阵T的列向量执行归一化,代替在PLDA模型上对i-vector进行长度归一化,避免因在i-vector上执行长度归一化,导致转移到PLDA模型上产生不良的扭曲。实验结果表明,该方法得到和长度归一化相似的效果,部分效果要优于长度归一化。

关键词: i-vector/PLDA, 长度归一化, T矩阵, 高斯通用背景模型

Abstract: Recently, speaker verification based on i-vector/PLDA has become the state-of-the-art technique in speaker recognition.For the indefinite time speech, uncertainty of distortion and scaling, when i-vector with length normalization is converted to PLDA model, it affects the recognition rate. In this paper, the normalization of the length of the i-vector on the PLDA model is replaced by the normalization of total variability matrix T, to avoid the poor distortions. Experiments show that the method is similar to the length normalization, some of the results are better than that of the length normalization.

Key words: i-vector/PLDA, normalization of length, matrix T, GMM-UBM