Computer and Modernization ›› 2023, Vol. 0 ›› Issue (02): 40-49.

Previous Articles     Next Articles

A Review of Deep Neural Networks Combined with Attention Mechanism

  

  1. (College of Energy and Electric Engineering, Hohai University, Nanjing 211100, China)
  • Online:2023-04-10 Published:2023-04-10

Abstract: Attention mechanism has become one of the research hotspots in improving the learning ability of deep neural network. In view of the wide attention paid to the attention mechanism, this paper aims to give a comprehensive analysis and elaboration of attention mechanism in deep neural network from three aspects: the classification of attention mechanism, the way of combining with deep neural network, and the specific applications in natural language processing and computer vision. Specifically, attention mechanism has been divided into soft attention mechanism, hard attention mechanism and self-attention mechanism, and their advantages and disadvantages are compared. Then, the common ways of combining attention mechanism in recursive neural network and convolutional neural network are discussed respectively, and the representative model structures of each way are given. After that, the applications of attention mechanism in natural language processing and computer vision are illustrated. Finally, several future developments of attention mechanism are illustrated expecting to provide clues and directions for subsequent researches.

Key words: attention mechanisms, deep learning, neural networks, attention models