计算机与现代化 ›› 2020, Vol. 0 ›› Issue (12): 99-103.

• 图像处理 • 上一篇    下一篇

基于改进RetinaNet的医用塑瓶装箱计数算法

  

  1. (四川大学机械工程学院,四川成都610065)
  • 出版日期:2021-01-07 发布日期:2021-01-07
  • 作者简介:邱吕(1996—),女,四川广安人,硕士研究生,研究方向:机器视觉,图像处理,E-mail: 2391361235@qq.com; 通信作者:任德均(1971—),男,副教授,博士,研究方向:机器视觉,机器智能,E-mail: rendejun@scu.edu.cn; 郜明(1996—),男,硕士研究生,研究方向:机器视觉,图像处理,E-mail: 1204563110@qq.com; 付磊(1995—),男,硕士研究生,研究方向:机器视觉,深度学习,E-mail: 914508730@qq.com; 胡云起(1995—),男,硕士研究生,研究方向:机器视觉,机器人,E-mail: 1119063534@qq.com。

A Packing Counting Method of Medical Plastic Bottles Based on Improved RetinaNet

  1. (School of Mechanical Engineering, Sichuan University, Chengdu 610065, China)
  • Online:2021-01-07 Published:2021-01-07

摘要: 为提高医用塑瓶包装生产线上装箱计数的效率、准确率及稳定性,本文提出一种基于深度学习的装箱计数检测算法,实现在线实时计数。首先,构建以ResNet为骨架网络,使用特征金字塔网络产生多尺度特征图并适当删减卷积层的改进RetinaNet网络。然后,使用聚类算法优化Anchor尺寸,使算法能够自适应歪瓶、倒瓶等异常情况下的计数检测,从而降低漏检率并提高定位精度。最后,在实际装箱数据集上对算法进行实验评测,结果表明该算法抗干扰能力强、稳健可靠,在满足生产条件下能够快速、准确地对装箱塑瓶进行计数检测,计数精度可达99.98%以上,单张检测时间为33 ms,满足了生产线实时检测要求。

关键词: 深度学习, 装箱计数, RetinaNet, 特征金字塔, 聚类

Abstract: In order to improve the efficiency, accuracy and stability of packing counting in medical plastic packaging production line, this paper proposes a packing counting detection algorithm based on deep learning which can realize automatic online counting. Firstly, an improved RetinaNet network was constructed with ResNet as the framework, the feature pyramid network was used to generate multi-scale feature maps, and the convolution layers were cut appropriately. Then, clustering algorithm is used to optimize the Anchor size, so that the algorithm can adapt to count detection under abnormal conditions such as crooked bottle and inverted bottle, so as to reduce the missed detection rate and improve the positioning accuracy. Finally, the experimental evaluation of the algorithm on the actual packing data set shows that the algorithm is robust and reliable, and can quickly and accurately count and detect the packing plastic bottles under the production conditions. The counting accuracy can reach more than 99.98%, and the single detection time is 33 ms, which meets the real-time detection requirements of the production line.

Key words: deep learning, packing counting, RetinaNet, feature pyramid, clustering