计算机与现代化 ›› 2023, Vol. 0 ›› Issue (03): 16-22.

• 图像处理 • 上一篇    下一篇

水下声呐图像轻量级目标检测模型

  

  1. (河海大学物联网工程学院,江苏 常州 213022)
  • 出版日期:2023-04-17 发布日期:2023-04-17
  • 作者简介:范新南(1965—),男,江苏宜兴人,教授,博士生导师,博士,研究方向:信息获取与处理,机器视觉,E-mail: fanxn@hhuc.edu.cn; 通信作者:史朋飞(1985—),男,山东青岛人,副教授,硕士生导师,博士,研究方向:水下探测与成像,信息获取与处理,多源信息融合理论,E-mail: shipf@hhu.edu.cn。
  • 基金资助:
    中央高校基本科研业务费项目(B220202020, B220203032)

Lightweight Object Detection Model for Underwater Sonar Images

  1. (College of Internet of Things Engineering, Hohai University, Changzhou 213022, China)
  • Online:2023-04-17 Published:2023-04-17

摘要: 水下AUV搭载声呐进行探测成为水下目标检测的主流方式,水下环境的复杂及声呐成像方式导致声呐图像分辨率较低,使用形态学目标检测等传统方法时检测精度与实时性不高,深度学习如YOLO等算法直接用于水下声呐图像目标检测时仍然面临样本少、模型参数多等挑战,为此,本文提出一种声呐图像水下目标轻量化检测模型。针对低分辨率声呐图像数据特点以及水下AUV自动检测对实时性的要求,以YOLOv4模型为主要框架,进行模型裁剪、替换优化特征融合模块、目标预测框K均值聚类以及改进损失函数等,将构建的检测模型应用于声呐目标检测。所构建的声呐图像水下目标检测轻量化模型的mAP相对于SSD、YOLOv3、YOLOv3-DFPIN、YOLOv4-tiny分别提高了0.0659、0.0214、.0402和0.1701。在mAP相较于YOLOv4、CenterNet、EfficientdetD0分别低0.0186、0.0093、0.0074的情况下,FPS分别相对于YOLOv4提升一倍多、相对于EfficientdetD0提升近5倍、相对于CenterNet提升近一倍。同时,本文提出的模型兼具高精度和实时性的优点。实验结果表明,本文提出的特征提取网络能够减小网络参数冗余,提高模型效率和检测速度,结合自适应空间特征融合模块增强了不同尺度之间特征的相互融合和重用,提高了低分辨率声呐图像目标检测的精度。

关键词: 目标检测, 水下声呐图像, 深度学习, YOLOv4, Kmeans++

Abstract: With the development of unmanned underwater detection technology, AUV with sonar detection has become the main method of underwater object detection. However, due to the complexity of the underwater environment and the limitation of the sonar imaging mode, the sonar image resolution is low. Therefore, the traditional morphology based on object detection method has the problems of low detection accuracy and poor real-time performance. When deep learning algorithms such as YOLO are directly applied to underwater sonar image target detection, they still face challenges such as few underwater samples and many model parameters. This paper proposes a lightweight object detection model for sonar image datasets. In view of the characteristics of low-resolution sonar image data and the real-time requirements of underwater AUV automatic detection, the YOLOv4 model is used as the main framework to carry out model tailoring, replace the optimized feature fusion module, target prediction K-means clustering and improve the loss function, etc., and the constructed detection model is applied to sonar target detection. According to the experimental data, the mAP of the proposed model in this paper is 0.0659,0.0214,0.0402 and 0.1701 higher than that of SSD, YOLOv3, YOLOV3-DFPIN and YOLOV4-tiny respectively, Under the conditioms of the mAPs are only 0.0186 lower than that of YOLOv4, only 0.0093 lower than CenterNet, only 0.0074 lower than EfficientdetD0, however, FPS is more than twice as high as YOLOv4 and CenterNet, more than fifth as high as EfficientdetD0. At the same time, the proposed model in this paper has the advantages of both high precision and real time. The experimental results show that the proposed feature extraction network can greatly reduce the redundancy of network parameters and improve the model efficiency and detection speed. Combined with the adaptive spatial feature fusion module, the mutual fusion and reuse of features in different scales are enhanced, and the accuracy of low resolution sonar image target detection is improved.

Key words: object detection, underwater sonar image, deep learning, YOLOv4, Kmeans++