Computer and Modernization

Previous Articles     Next Articles

An Improved LSTM Model in the Application of Image Caption Generation

  

  1. (College of Science, Dalian Maritime University, Dalian 116000, China)
  • Received:2019-08-04 Online:2020-04-22 Published:2020-04-24

Abstract: In order to solve the problem of premature saturation of traditional Long Short-Term Memory(LSTM) neural network and generate a more accurate description for a given picture, this paper proposes a long short-term memory neural network model based on inverse tangent function(ITLSTM). Firstly, the classical convolutional neural network model is used to extract image features. Then, the ITLSTM neural network model is used to characterize the corresponding description of the image. Finally, the performance of the model is evaluated on the Flickr8K dataset and compared with several classic image caption generation models such as Google NIC. The experimental results show that the proposed model can effectively improve the accuracy of image caption generation.

Key words: image caption generation, inverse tangent function, Long Short-Term Memory (LSTM) neural network, convolutional neural network

CLC Number: