Computer and Modernization

Previous Articles     Next Articles

Action Recognition Technology Based on Improved C3D Neural Network

  

  1. (Eigth System Department, North China Institute of Computing Technology, Beijing 100083, China)
  • Received:2018-08-27 Online:2019-04-08 Published:2019-04-10

Abstract: Although the C3D convolutional neural network proposed by Facebook can achieve good video action recognition accuracy, there is still much room for improvement in terms of speed, and the model obtained by training is too large to be used by mobile devices. This paper uses small convolutional kernels to reduce the characteristics of parameters, optimizes the existing network structure, and proposes a new action recognition scheme, which decomposes the 3×3×3 convolutional kernel commonly used in the original C3D neural network into deep convolution and point convolution (1×1×1 convolution kernel), and training tests on the UCF101 dataset and ActivityNet dataset. The results show that compared with the original C3D network, the improved C3D network accuracy is 2.4% higher than C3D, 12.9% faster than C3D in speed, and the model size is compressed to 25.8%.

Key words: action recognition, convolution decomposition, recognition speed, model compression

CLC Number: