Computer and Modernization ›› 2025, Vol. 0 ›› Issue (12): 97-106.doi: 10.3969/j.issn.1006-2475.2025.12.014

Previous Articles     Next Articles

Face Sketch-photo Synthesis Network Based on Attention Mechanism

  


  1. (1. Office of Educational Administration, Jiangxi University of Finance and Economics, Nanchang 330032, China;
    2. Deep Image Vision Lab, Jiangxi University of Finance and Economics, Nanchang 330032, China;
    3. School of Software and Internet of Things Engineering, Jiangxi University of Finance and Economics, Nanchang 330032, China;
    4. Jiangxi Province Science and Technology Infrastructure Center, Nanchang 330003, China)
  • Online:2025-12-18 Published:2025-12-18

Abstract: Abstract: Face sketch-photo synthesis is an important branch of image transformation, with broad applications in digital entertainment, public security, and criminal investigation. Since face sketch images only contain grayscale information and lose most of the texture details of the face, existing methods often suffer from structural deficiencies, insufficient detail information, and color distortion in the synthesized face photos. To address these issues, this paper proposes a face sketch-photo synthesis network based on an attention mechanism. First, this paper designs a generator by combining Vision Transformer and U-Net to effectively extract global and local facial features, improving the structural integrity of the generated face photos. Additionally, an improved selective kernel attention module is constructed to enhance the model’s ability to capture fine details, enabling the generated images to retain more facial texture information. Finally, this paper designs a discriminator based on channel and pixel-wise attention to strengthen the adversarial learning capability of the generative adversarial network (GAN), reducing color distortion in the synthesized face photo images. Through subjective and objective experiments comparing with other state-of-the-art methods, the proposed approach demonstrates superior performance in both visual quality and objective metrics for face sketch-photo synthesis. On the CUHK, AR, and XM2VTS face sketch datasets, the proposed method achieves 11.6%, 6.2%, and 4.5% improvements in SSIM metric over the second-best results, respectively, proving the effectiveness of the proposed method.

Key words: Key words: face sketch-photo synthesis, attention mechanism, Transformer, generative adversarial network

CLC Number: