A Packing Counting Method of Medical Plastic Bottles Based on Improved RetinaNet

Abstract

To improve the efficiency, accuracy, and stability of packing counts on medical plastic bottle packaging lines, this paper proposes a deep-learning-based packing counting detection algorithm that counts bottles online in real time. First, an improved RetinaNet is built with ResNet as the backbone; a feature pyramid network generates multi-scale feature maps, and some convolution layers are removed. Second, a clustering algorithm optimizes the anchor sizes so that the detector adapts to abnormal cases such as tilted or overturned bottles, which lowers the missed-detection rate and improves localization accuracy. Finally, the algorithm is evaluated on a dataset of actual packing images. The results show that it is robust to interference and reliable, and that it counts packed bottles quickly and accurately under production conditions: counting accuracy exceeds 99.98%, and detection takes 33 ms per image, which meets the real-time requirement of the production line.

Key words: deep learning, packing counting, RetinaNet, feature pyramid, clustering
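
The abstract does not name the clustering algorithm or the distance measure used to tune the anchor sizes. The following is a minimal sketch of one common way to do it, assuming k-means over the widths and heights of labelled boxes with 1 - IoU as the distance. The function names `iou_wh` and `kmeans_anchors` and the sample box sizes are hypothetical and are not taken from the paper.

```python
import numpy as np


def iou_wh(boxes, anchors):
    """IoU of (w, h) pairs, computed as if every box shared the same top-left corner."""
    inter_w = np.minimum(boxes[:, None, 0], anchors[None, :, 0])
    inter_h = np.minimum(boxes[:, None, 1], anchors[None, :, 1])
    inter = inter_w * inter_h
    area_b = (boxes[:, 0] * boxes[:, 1])[:, None]
    area_a = (anchors[:, 0] * anchors[:, 1])[None, :]
    return inter / (area_b + area_a - inter)


def kmeans_anchors(boxes, k, iters=100, seed=0):
    """Cluster (w, h) pairs into k anchor sizes, using 1 - IoU as the distance."""
    boxes = np.asarray(boxes, dtype=float)
    rng = np.random.default_rng(seed)
    # Start from k distinct training boxes (assumed initialisation).
    anchors = boxes[rng.choice(len(boxes), size=k, replace=False)]
    for _ in range(iters):
        # The nearest anchor is the one with the highest IoU.
        assign = np.argmax(iou_wh(boxes, anchors), axis=1)
        new_anchors = anchors.copy()
        for j in range(k):
            members = boxes[assign == j]
            if len(members):  # an empty cluster keeps its previous anchor
                new_anchors[j] = members.mean(axis=0)
        if np.allclose(new_anchors, anchors):
            break
        anchors = new_anchors
    # Return anchors sorted by area, smallest first.
    return anchors[np.argsort(anchors[:, 0] * anchors[:, 1])]


if __name__ == "__main__":
    # Made-up box sizes in pixels: mostly compact boxes plus a few elongated
    # ones, standing in for normal and overturned bottles.
    sizes = [[60, 60]] * 8 + [[62, 58]] * 8 + [[60, 140]] * 3
    print(kmeans_anchors(sizes, k=2))
```

Using IoU rather than Euclidean distance keeps large boxes from dominating the fit, so a small group of unusually shaped boxes can still receive its own anchor. Whether the paper uses this distance is not stated in the abstract.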

QIU Lyu, REN De-jun, GAO Ming, FU Lei, WU Hua-yun, HU Yun-qi. A Packing Counting Method of Medical Plastic Bottles Based on Improved RetinaNet[J]. Computer and Modernization, 2020(12): 99-103.

