计算机与现代化 ›› 2021, Vol. 0 ›› Issue (04): 98-103.

• 网络与通信 • 上一篇    下一篇

一种新的高效轻量级卷积神经网络模型

  

  1. (杭州电子科技大学计算机学院,浙江杭州310018)
  • 出版日期:2021-04-22 发布日期:2021-04-25
  • 作者简介:张舰舰(1994—),男,浙江金华人,硕士研究生,研究方向:深度学习,E-mail: 274494231@qq.com。

A New Efficient and Lightweight Convolutional Neural Network Model

  1. (School of Computer Science, Hangzhou Dianzi University, Hangzhou 310018, China)
  • Online:2021-04-22 Published:2021-04-25

摘要: 针对目前食物识别系统中网络模型参数量多、模型较大的问题,提出一种23层结构、参数量只有204k的网络模型,使用基本构造块(7×7、5×5、3×3)生成特征图,用不同感受野的2个池化层来融合卷积层的特征图,再用1×1的卷积核进行非线性组合,然后连接到空间金字塔池化层,最后在softmax分类器中分类。在公开数据集上的实验表明,与ResNet50和GoogLeNet相比,本文网络模型在分类性能不降低的情况下,模型参数分别减少了99.14%和96.63%。

关键词: 卷积神经网络, 深度学习, 食物分类, 空间金字塔池化

Abstract: Aiming at the problem that the current food recognition system has a large number of network model parameters and a large model, this paper proposes a 23-layer network model with only 204k parameters. The basic building blocks (7×7, 5×5, 3×3) are used to generate feature maps, and two pooling layers of different receptive fields are used to fuse the feature map of the convolutional layer, and a 1×1 convolution kernel is used for nonlinear combination. Then it is connected to the spatial pyramid pooling layer, and finally is classified in the softmax classifier. Experiments on public data sets show that, compared with ResNet50 and GoogLeNet, the network model in this paper reduces model parameters by 99.14% and 96.63% respectively without reducing classification performance.

Key words: convolutional neural network, deep learning, food classification, spatial pyramid pooling