Computer and Modernization, 2021, Vol. 0, Issue (10): 107-111.

• Networks and Communications •

A Model Compression Algorithm of Convolutional Neural Network

  BAO Zhiqiang, CHENG Ping, HUANG Qiongdan, LYU Shaoqing

  1. (School of Communications and Information Engineering, Xi’an University of Posts and Telecommunications, Xi’an 710121, China)
  • Online: 2021-10-14  Published: 2021-10-14
  • About the authors: BAO Zhiqiang (born 1978), male, from Xi’an, Shaanxi, associate professor, master’s supervisor, Ph. D., research interests: deep learning, pattern recognition, anti-jamming for navigation, E-mail: baozhiqiang@xupt.edu.cn; CHENG Ping (born 1997), female, from Fuyang, Anhui, master’s candidate, research interests: deep learning, data mining, E-mail: 2843874970@qq.com; HUANG Qiongdan (born 1979), female, from Liuzhou, Guangxi, associate professor, master’s supervisor, Ph. D., research interests: signal and information processing, E-mail: xuezhemail@163.com; LYU Shaoqing (born 1987), male, from Wuzhai, Shanxi, lecturer, Ph. D., research interests: social network analysis, network representation learning, E-mail: lvsq3601@xupt.edu.cn.
  • Funding:
    Key Research and Development Program of Shaanxi Province (2018GY-150); Xi’an Science and Technology Program (201805040YD18CG24-3, 2019218114GXRC017CG018-GXYD17.5)



Abstract: Convolutional neural networks have achieved a series of breakthrough research results, and their superior performance is supported by deep structures. To address the heavy redundancy in the parameters and computation of complex convolutional neural networks, a concise and effective network model compression algorithm is proposed. First, the correlation between convolution kernels is measured by their Pearson correlation coefficient, and redundant parameters are iteratively removed to compress the convolutional layers. Second, a local-global fine-tuning strategy is adopted to restore network performance. Finally, a parameter orthogonality regularizer is proposed to encourage orthogonality among convolution kernels, thereby reducing redundant features. Experimental results on the MNIST dataset show that the proposed algorithm compresses the parameters of the AlexNet convolutional layers by 53.2% and reduces floating-point operations by 42.8% without any loss of test accuracy; moreover, the network converges with a small error.
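To make the pruning criterion and the regularizer concrete, the following is a minimal sketch in Python, assuming PyTorch. The helper names, the 0.9 threshold, and the soft-orthogonality form ||W W^T - I||_F^2 are illustrative assumptions, not the paper's exact formulation.

```python
# Sketch (assuming PyTorch) of the two ingredients described in the abstract.
# The 0.9 threshold and the ||W W^T - I||_F^2 penalty are assumptions.
import torch
import torch.nn as nn

def redundant_kernel_pairs(conv: nn.Conv2d, threshold: float = 0.9):
    """Index pairs of kernels whose absolute Pearson correlation exceeds threshold."""
    w = conv.weight.detach().flatten(start_dim=1)  # (out_channels, in_channels*k*k)
    w = w - w.mean(dim=1, keepdim=True)            # center each flattened kernel
    w = w / (w.norm(dim=1, keepdim=True) + 1e-8)   # unit norm: dot product = Pearson r
    corr = w @ w.t()                               # (out_channels, out_channels) matrix
    n = corr.size(0)
    return [(i, j) for i in range(n) for j in range(i + 1, n)
            if corr[i, j].abs() > threshold]       # kernel j is a pruning candidate

def orthogonality_penalty(conv: nn.Conv2d) -> torch.Tensor:
    """Soft orthogonality regularizer ||W W^T - I||_F^2 over flattened kernels."""
    w = conv.weight.flatten(start_dim=1)
    w = w / (w.norm(dim=1, keepdim=True) + 1e-8)   # unit rows, so diagonal is exactly 1
    gram = w @ w.t()
    eye = torch.eye(gram.size(0), device=gram.device)
    return ((gram - eye) ** 2).sum()               # penalizes pairwise kernel correlation
```

During fine-tuning the penalty would be added to the task loss, e.g. loss = criterion(output, target) + lam * orthogonality_penalty(conv), where lam is a small hypothetical weight such as 1e-4; driving the Gram matrix toward the identity discourages the correlated, redundant kernels that the Pearson criterion prunes.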

Key words: convolutional neural network, convolution kernel, Pearson correlation coefficient, model compression, orthogonality