Computer and Modernization, 2021, Issue (09): 83-89.


Compression Method of CNN Model for Parameter Reduction


  (1. Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100094, China; 2. University of Chinese Academy of Sciences, Beijing 100049, China)
  • Online: 2021-09-14  Published: 2021-09-14

Abstract: To address the difficulty of deploying convolutional neural network (CNN) models on embedded devices with limited computing and storage resources, caused by their growing parameter scale, a CNN model compression method is proposed to reduce the number of parameters. Analysis shows that the parameter count of a convolutional layer depends on the number of input and output feature maps and on the convolution kernel size, while fully connected layers contain a large number of parameters that are difficult to reduce directly. The number of input and output feature maps is reduced by grouped convolution, and the convolution kernel size is reduced by convolution decomposition; at the same time, global average pooling layers replace the fully connected layers to eliminate their large parameter count. These methods are applied to LeNet5 and AlexNet. The experimental results show that the combined compression method reduces the parameters of the LeNet5 model by 97% with a drop in recognition accuracy of less than 2 percentage points, and reduces the parameters of the AlexNet model by 95% while improving recognition accuracy by 6.72 percentage points. Thus, the model parameters can be greatly reduced while maintaining the accuracy of the convolutional neural network.
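The following is a minimal sketch, not the paper's exact LeNet5 or AlexNet configurations, illustrating the three compression ideas in the abstract: grouped convolution, decomposing a 5x5 kernel into two stacked 3x3 kernels, and replacing the fully connected classifier with a 1x1 convolution followed by global average pooling. All channel sizes, group counts, and input shapes are illustrative assumptions.

```python
# Sketch under assumed layer sizes: compare parameter counts of a standard
# conv block + fully connected classifier against a compressed block that
# uses grouped convolution, kernel decomposition, and global average pooling.
import torch
import torch.nn as nn

def count_params(m: nn.Module) -> int:
    # Total number of learnable parameters in a module.
    return sum(p.numel() for p in m.parameters())

# Baseline: one 5x5 convolution followed by a fully connected classifier.
baseline = nn.Sequential(
    nn.Conv2d(64, 128, kernel_size=5, padding=2),  # 64*128*5*5 + 128 parameters
    nn.ReLU(inplace=True),
    nn.AdaptiveAvgPool2d(7),
    nn.Flatten(),
    nn.Linear(128 * 7 * 7, 10),                    # the dense layer dominates
)

# Compressed: grouped convolutions cut the input/output channel coupling,
# two stacked 3x3 convolutions replace the 5x5 kernel (same receptive field),
# and global average pooling removes the large fully connected layer.
compressed = nn.Sequential(
    nn.Conv2d(64, 128, kernel_size=3, padding=1, groups=4),   # weights / 4 via groups
    nn.ReLU(inplace=True),
    nn.Conv2d(128, 128, kernel_size=3, padding=1, groups=4),
    nn.ReLU(inplace=True),
    nn.Conv2d(128, 10, kernel_size=1),             # 1x1 conv maps channels to class scores
    nn.AdaptiveAvgPool2d(1),                       # global average pooling
    nn.Flatten(),
)

x = torch.randn(1, 64, 28, 28)
print("baseline params:  ", count_params(baseline))
print("compressed params:", count_params(compressed))
print("output shapes:", baseline(x).shape, compressed(x).shape)
```

With these assumed sizes the baseline holds roughly 268k parameters (most of them in the Linear layer), while the compressed block holds roughly 57k, which mirrors the abstract's point that the fully connected layers and large kernels account for most of the savings.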

Key words: convolutional neural networks, parameter scale, grouped convolution, convolution decomposition, global average pooling