Computer and Modernization

    Next Articles

Application of Bidirectional Recurrent Neural Network in Speech Recognition

  

  1. (1. School of Computer Science, Qinghai Normal University, Xining 810008, China;
    2. Key Laboratory of Tibetan Information Processing, Ministry of Education, Xining 810008, China)
  • Received:2019-01-28 Online:2019-10-28 Published:2019-10-29

Abstract: In order to solve the problem that feed-forward neural network is difficult to process time series data, bidirectional recurrent neural network (BiRNN) is applied in acoustic modeling of automatic speech recognition. Firstly, the Mel frequency cepstrum coefficients are used for feature extraction. Secondly, bidirectional recurrent neural network is used as acoustic model. And finally, the effects of different parameters on system performance are tested. Experimental results on TIMIT dataset show that, compared with convolutional neural network and deep neural network, the recognition rate of the proposed system is improved by 1.3% and 4.0% respectively, which indicates that BiRNN is more suitable for automatic speech recognition.

Key words: bidirectional recurrent neural network, speech recognition, Mel frequency cepstrum coefficient, deep neural network

CLC Number: