Computer and Modernization ›› 2024, Vol. 0 ›› Issue (12): 84-90.doi: 10.3969/j.issn.1006-2475.2024.12.013

Previous Articles     Next Articles

Detection and Recognition Algorithms for Chinese and English Scene Text Images

  

  1. (School of Interest of Things, Jiangnan University, Wuxi 214122, China)
  • Online:2024-12-31 Published:2024-12-31

Abstract:  The complex background of scene text images makes it challenging for detection algorithms to locate text regions accurately, leading to difficulties in recognition. To simultaneously detect and recognize scene text content in both Chinese and English languages, and improve the accuracy of detection and recognition, an improved algorithmic model TD-ABCNetv2 based on ABCNetv2 network is proposed. Addressing the issue of variations in text features such as shape, arrangement, and font, this model adopts SKNet as the backbone network and introduces the Selective Kernel module to help the network learn features of different scales, accommodating texts of various scales, shapes, and orientations. Considering the different character sizes and intervals of Chinese and English scene texts, the ECA attention module is added to the FPN structure to integrate the channel information more effectively, enhance the network’s sensitivity to different features, and make the feature fusion more targeted. Additionally, the CIoU loss function is introduced to more accurately measure the degree of overlap between bounding boxes, adapt to changes in the shape of the text, and enhance the generalization ability of the model. The experimental results show the proposed model is validated through experiments on several public datasets.

Key words: scene text, Chinese text detection, SKNet, attention mechanism, IoU

CLC Number: