Computer and Modernization ›› 2025, Vol. 0 ›› Issue (09): 61-66.doi: 10.3969/j.issn.1006-2475.2025.09.009

Previous Articles     Next Articles

Facial Sketch Image Conversion Based on CycleGAN and Attention Mechanism

  


  1. (School of Software and Internet of Things Engineering, Jiangxi University of Finance and Economics, Nanchang 330013, China)
  • Online:2025-09-24 Published:2025-09-24

Abstract:
Abstract: In recent years, because of its demand in law enforcement, criminal and entertainment fields, face sketch-photo synthesis has become a research hotspot. As deep learning model without paired image supervision, CycleGAN is good at cross-domain image conversion, providing a powerful tool for efficient conversion between sketches and photos. In view of the difficulty of collecting a large number of pairs of face images and sketch images, and the problems of fuzzy and low definition image details in face sketch image generation, an improved CycleGAN model is proposed. In this paper, the self-attention mechanism is introduced into the residual block of the ResNet architecture generator in the CycleGAN model, so that the CycleGAN generator model can learn the features of different channels and the importance of different regions in the face image more effectively, and automatically focus on the important regions of facial features, such as eyes, nose, mouth, etc., during image processing. At the same time, the edge clarity and integrity of the sketch are increased, so as to improve the quality of the generated face sketch image. The proposed model is implemented on the datasets CUHK and FS2K. The structural similarity, peak signal-to-noise ratio and multi-scale structural similarity are 0.7741, 11.7451 and 0.8504 respectively on CUHK and 0.7049, 13.2745 and 0.7970 respectively on FS2K. These results outperformed the comparison models of CycleGAN, Pix2Pix, MUNIT, and DCLGAN. According to the comparison experiment and subjective vision, the proposed model can effectively complete the process of face sketching and generate higher quality face sketching images.

Key words: Key words: CycleGAN; generative adversarial network; attention mechanism; residual network ,

CLC Number: