Computer and Modernization ›› 2022, Vol. 0 ›› Issue (04): 38-44.

Previous Articles     Next Articles

Wheat Image Recognition Based on Global Self-attention

  

  1. (College of Electronics and Information Engineering, Sichuan University, Chengdu 610065, China) 
  • Online:2022-05-07 Published:2022-05-07

Abstract: In the actual application scenario, it is very challenging to identify wheat diseases and pests by image recognition. Compared with the previous methods based solely on convolutional neural network (CNN), the method of converting wheat images into a series of visual languages and recognizing wheat from a global perspective is more feasible and practical. The use of convolutional visual Transformers (CVT) to solve wheat recognition is divided into two links. First, two feature maps generated by two-branch CNN are used to realize attentional selective fusion (ASF). ASF obtains different information by fusing multiple features and global-local attention, and projects it into a series of visual languages. Secondly, inspired by the success of Transformers in natural language processing, global self-attention is used to model the relationship between these visual languages. Compared with classical classification networks LeNet-5, ResNet-18, VGG-16 and EfficientNet, CVT improves the recognition rate, and this method has good generalization ability.

Key words: wheat recognition, global-local attention, Transformer, global self-attention