计算机与现代化 ›› 2024, Vol. 0 ›› Issue (11): 113-120.doi: 10.3969/j.issn.1006-2475.2024.11.017

• 图像处理 • 上一篇    下一篇

基于改进YOLOv8的河道废弃瓶检测方法






  

  1. (1.贵州财经大学大数据统计学院,贵州 贵阳 550002;2.贵州大学现代制造技术教育部重点实验室,贵州 贵阳 550002) 
  • 出版日期:2024-11-29 发布日期:2024-12-10
  • 基金资助:
    贵州省教育厅青年科技人才成长项目(KY[2022]199); 贵州财经大学校级科研基金资助项目(2021KYYB08); 贵州省基础研究计划项目(黔科合基础-ZK[2023]一般029)

A River Discarded Bottles Detection Method Based on Improved YOLOv8 

  1. (1. School of Big Data Statistics, Guizhou University of Finance and Economics, Guiyang 550002, China;
    2. Key Laboratory of Modern Manufacturing Technology of Ministry of Education, Guizhou University, Guiyang 550002, China)
  • Online:2024-11-29 Published:2024-12-10

摘要: 目前,目标检测已经广泛应用在各个领域,并且日趋成熟。但在河道废弃瓶检测任务中,现有图像处理技术存在精度低、成本高且难部署等问题仍然是该项研究的难点。本文提出一种基于改进YOLOv8的深度学习检测模型。首先,针对无人船图像中小目标容易错检、漏检的共性问题,引入Bi-PAN-FPN思想改进YOLOv8-n中的Neck部分。通过充分考虑并复用多尺度特征,在尽量维持参数成本的情况下实现更高级、更完善的特征融合过程;其次,使用EIoU替换原网络中的CIoU来优化损失函数,考虑将预测框和真实框的纵横比的影响因子进行拆分,解决CIoU损失函数中难易样本不平衡的问题,提升模型的优化能力。本文以国际公开的FloW数据集进行实验,并设计消融实验、对比实验、损失函数性能分析实验以及特殊场景对比实验,从多个角度阐述所提方法的可行性和有效性。消融实验结果表明:改进后的YOLOv8-n模型相比基线网络,漏检情况得到明显改善,平均精度均值达到85.2%,比基线网络模型提升了2.7个百分点,检测效果提升明显。对比实验结果表明,改进模型的平均精度均值相比于Mobilnet-SSDv2、YOLOv4s、YOLOv5s、YOLOv7-tiny、YOLOv3-SPP、YOLOv5-MobileNetV3s这6个模型分别提升60.15%、18.99%、3.90%、7.30%、28.7%及55.47%,在FPS、参数数量、模型大小方面均体现了较优的综合性能,并在特殊场景中表现最优。因此,本文的改进模型的综合性能优于目前流行的河道垃圾检测模型,更加适用于河道废弃瓶的实时检测。

关键词: YOLOv8, 深度学习, 目标检测, 损失函数

Abstract:  Currently, object detection has been found extensive applications across various domains and is progressively reaching a state of maturity. However, in the task of detecting riverine discarded bottles, the ongoing challenges with existing image processing technologies include low accuracy, high costs, and deployment difficulties, which continue to be significant obstacles in this research field. Therefore, we present an enhanced deep learning detection model based on YOLOv8. Firstly, to tackle the common issue of false positives and false negatives for small objects in unmanned vessel images, we introduce the Bi-PAN-FPN concept to improve the Neck component of YOLOv8-n. By carefully considering and reusing multi-scale features, this approach aims to achieve a more sophisticated and comprehensive feature fusion process while minimizing parameter costs. Secondly, the CIoU loss function is optimized by introducing the EIoU and addressing the imbalance between hard and easy samples. This is achieved by separating the influence of aspect ratio factors between predicted and ground-truth boxes. These enhancements aim to improve the optimization capability of the model. We conduct experiments using the publicly available FloW dataset and includes multiple evaluations, such as ablation experiments, comparative experiments, performance analysis of loss functions, and contrast experiments in special scenarios. These evaluations provide comprehensive evidence of the feasibility and effectiveness of the proposed method from multiple perspectives. The results of the ablation experiments indicate that the enhanced YOLOv8-n model exhibits a significant improvement in reducing false negatives compared to the baseline network, achieving an average precision of 85.2%. This marks a 2.7 percentage points increase over the baseline network model, demonstrating a notable enhancement in detection performance. The results of the comparative experiments indicate that the average precision of the improved model surpasses six other models, namely Mobilenet-SSDv2, YOLOv4s, YOLOv5s, YOLOv7-tiny, YOLOv3-SPP, and YOLOv5-MobileNetV3s, by 60.15%, 18.99%, 3.90%, 7.30%, 28.7%, and 55.47%, respectively. These findings highlight the superior overall performance of the improved model in terms of FPS, parameter quantity, and model size. Additionally, the improved model demonstrates exceptional performance in special scenarios. Therefore, the comprehensive performance of the improved model in the paper outperforms the currently popular models for river garbage detection. As a result, it is more suitable for real-time detection of discarded bottles in rivers.

Key words: YOLOv8, deep learning, target detection, loss function

中图分类号: