Computer and Modernization ›› 2021, Vol. 0 ›› Issue (02): 73-77.

Previous Articles     Next Articles

Method of Nonparallel Speech Denoising Based on CycleGAN

  

  1. (College of Energy and Electrical Engineering, Hohai University, Nanjing 211100, China)
  • Online:2021-03-01 Published:2021-03-01

Abstract: To solve the problem of speech denoising, a method based on cyclic generation adversarial network (CycleGAN) is proposed. This method combines and optimizes the network model of CycleGAN with the voice conversion technology in different fields, extracts the spectrum envelope features of speech, and then encodes and decodes the speech, aiming to achieve the end-to-end denoising of speech with advanced generation technology. Thus, the proposed algorithm simplifies the high-order difference problem in the process of speech denoising, and generalizes its application scenarios. By training and testing the nonparallel data set and parallel data set, the denoising effect of this method is mainly compared with that of the traditional CycleGAN method. The experimental results show that PESQ, NR and SSNR are improved by 8.49%, 6.53% and 23.30% respectively, which effectively solves the problem of nonparallel speech denoising in the actual scene.

Key words: speech denoising, CycleGAN, voice conversion, nonparallel data set