改进YOLOv5算法的遥感图像车辆检测

摘要/Abstract

摘要： 针对遥感图像中背景复杂目标、车辆小导致的成像模糊的目标漏检问题，提出一种基于YOLOv5s的改进模型。改进模型设计一种新的主干网络结构：改进模型的主干特征提取选用RepVGG网络，同时在主干网络中加入注意力机制CoordAttention来提高模型小目标的感知能力。增加多尺度特征融合，提高改进模型对于小目标的检测精度，边框回归的损失函数选择使用DIoU，帮助改进模型实现更加精准定位。实验结果表明，改进后的YOLOv5模型在遥感图像的目标检测，相较于原始模型在小目标车辆中检测精度提升5.3个百分点，与Faster R-CNN相比mAP提升16.88个百分点。改进后的模型与主流的检测算法相比能有较大的检测精度提升，相较于原始的YOLOv5s模型在遥感图像小车辆检测有更好的检测精度。

关键词: 遥感图像识别, 目标识别, YOLO, 注意力机制, 多尺度特征融合

Abstract: An improved model based on YOLOv5s is proposed for the problem of target miss detection in remote sensing images with complex targets in the background and blurred imaging due to small vehicles. A new backbone network structure is designed for the improved model: RepVGG network is selected for the backbone feature extraction of the improved model， while an attention mechanism， CoordAttention， is added to the backbone network to improve the perception capability of the model for small targets. Multi-scale feature fusion is added to improve the detection accuracy of the improved model for small targets， and the loss function of border regression is chosen to use DIoU to help the improved model achieve more accurate localization. After experiments， it is demonstrated that the improved YOLOv5 model improves the detection accuracy by 5.3 percentage points for target detection in remote sensing images compared to the original model in small target vehicles， and improves the mAP by 16.88 percentage points compared to Faster R-CNN. The improved model can have a larger detection accuracy improvement compared with the mainstream detection algorithms， and has a better detection accuracy than the original YOLOv5s model for small vehicle detection in remote sensing images.

Key words: remote sensing image recognition, target recognition, YOLO, attention mechanisms, multi-scale feature fusion

朱理清, 李祥, . 改进YOLOv5算法的遥感图像车辆检测[J]. 计算机与现代化, 2023, 0(05): 117-121.

ZHU Li-qing, LI Xiang, . Vehicle Detection of Remote Sensing Images Based on Improved YOLOv5 Algorithm[J]. Computer and Modernization, 2023, 0(05): 117-121.

参考文献

［1］ HU Y， LI X， ZHOU N， et al. A sample update-based convolutional neural network framework for object detection in large-area remote sensing images［J］. IEEE Geoscience and Remote Sensing Letters， 2019，16（6）:947-951.
［2］ WANG W H， WANG P Z， NIU Z D. A real-time detection algorithm for unmanned aerial vehicle target in infrared search system［C］// Proceedings of the 2018 IEEE International Conference on Signal Processing， Communications and Computing （ICSPCC）. 2018. DOI: 10.1109/ICSPCC.
2018.8567718.
［3］祝思君. 基于深度学习的无人机遥感图像目标识别方法研究［D］. 北京:北京建筑大学， 2018.
［4］薛雅丽，孙瑜，马瀚融. 航空遥感影像中的轻量级小目标检测［J］. 电光与控制， 2022，29（6）:11-15.
［5］ KRIZHEVSKY A， SUTSKEVER I， HINTON G E. ImageNet classification with deep convolutional networks［C］// Proceedings of the 2012 International Conference on Neural Information Processing Systems （NIPS）. 2012:1097-1105.
［6］ HE K M， ZHANG X Y， REN S Q， et al. Deep residual learning for image recognition［C］// Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition （CVPR）. 2016:770-778.
［7］林清平，张麒麟，肖蕾. 采用改进YOLOv5网络的遥感图像目标识别方法［J］. 空军预警学院学报， 2021，35（2）:117-120.
［8］李婕，周顺，朱鑫潮，等. 结合多通道注意力的遥感图像飞机目标检测［J］. 计算机工程与应用， 2022，58（1）:209-217.
［9］ LIU T， ZHOU B J， ZHAO Y S， et al. Ship detection algorithm based on improved YOLOv5［C］// Proceedings of the 2021 6th International Conference on Automation， Control and Robotics Engineering （CACRE）. 2021:483-487.
［10］ CHEN Y W， ZHANG C， QIAO T F， et al. Ship detection in optical sensing images based on YOLOv5［C］// Proceedings of the 2021 12th International Conference on Graphics and Image Processing （ICGIP 2020）. 2021. DOI: 10.1117/12.2589395.
［11］ REN S Q， HE K M， GIRSHICK R， et al. Faster R-CNN: Towards real-time object detection with region proposal networks［J］. IEEE Transactions on Pattern Analysis and Machine Intelligence， 2017，39（6）:1137-1149.
［12］ HE K M， GKIOXARI G， DOLLAR P， et al. Mask R-CNN［C］// Proceedings of the 2017 IEEE International Conference on Computer Vision （ICCV）. 2017:2961-2969.
［13］ REDMON J， DIVVALA S， GIRSHICK R， et al. You only look once: Unified， real-time object detection［C］// Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition （CVPR）. 2016:779-788.
［14］ LIN T Y， GOYAL P， GIRSHICK R， et al. Focal loss for dense object detection［C］// Proceedings of the 2017 IEEE International Conference on Computer Vision （ICCV）. 2017:2980-2988.
［15］ HU J， SHEN L， SUN G. Squeeze-and-excitation networks［C］// Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition （CVPR）. 2018:7132-7141.
［16］刘丹，吴亚娟，罗南超，等. 嵌入注意力和特征交织模块的Gaussian-YOLOv3目标检测［J］. 计算机应用， 2020，40（8）:2225-2230.
［17］吴止锾，李磊，高永明. 遥感图像舰船检测的旋转卷积集成 YOLOv3 模型［J］. 计算机工程与应用， 2019，55（22）:146-151.
［18］ LIN T Y， DOLLAR P， GIRSHICK R， et al. Feature pyramid networks for object detection［C］// Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition （CVPR）. 2017:936-944.
［19］ ZHU X K， LYU S C， WANG X， et al. TPH-YOLOv5: Improved YOLOv5 based on transformer prediction head for object detection on drone-captured scenarios［C］// Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision （ICCV） Workshops. 2021:2778-2788.
［20］ QING Y H， LIU W Y， FENG L Y， et al. Improved YOLO network for free-angle remote sensing target detection［J］. Remote Sensing， 2021，13（11）. DOI: 10.3390/rs13112171.
［21］杨其晟，李文宽，杨晓峰，等. 改进YOLOv5的苹果花生长状态检测方法［J］. 计算机工程与应用， 2022，58（4）:237-246.
［22］ WANG Y M， MORARIU V I， DAVIS L S. Learning a discriminative filter bank within a CNN for fine-grained recognition［C］// Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition （CVPR）. 2018:4148-4157.
［23］ HAN J M， DING J， XUE N， et al. ReDet: A rotation-equivariant detector for aerial object detection［C］// Proceedings of the 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition （CVPR）. 2021:2785-2794.
［24］ CHEN K， PANG J M， WANG J Q， et al. Hybrid task cascade for instance segmentation［C］// Proceedings of the 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition （CVPR）. 2019:4969-4978.
［25］赵文清，孔子旭，周震东，等. 增强小目标特征的航空遥感目标检测［J］. 中国图象图形学报， 2021，26（3）:644-653.
［26］王胜科，任鹏飞，吕昕，等. 基于中心点和双重注意力机制的无人机高分辨率图像小目标检测算法［J］. 应用科学学报， 2021，39（4）:650-659.
［27］赵玉蓉，郭会明，焦函，等. 融合混合域注意力的YOLOv4在船舶检测中的应用［J］. 计算机与现代化， 2021（9）:75-82.
［28］张朕通，单玉刚，袁杰. 联合多尺度和注意力机制的遥感影像检测［J］. 计算机工程与应用， 2021，57（9）:212-216.
［29］ DING X H， ZHANG X Y， MA N N， et al. RepVGG: Making VGG-style convNets great again［C］// Proceedings of the 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition （CVPR）. 2021:13728-13737.
［30］ HOU Q B， ZHOU D Q， FENG J S. Coordinate attention for efficient mobile network design［C］// Proceedings of the 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition （CVPR）. 2021:13708-13717.
［31］ KINGMA D P， BA J. Adam: A method for stochastic optimization［J］. arXiv preprint arXiv:1412.6980， 2014.

[1]	何思达, 陈平华. 基于意图的轻量级自注意力序列推荐模型[J]. 计算机与现代化, 2024, 0(12): 1-9.
[2]	赵晨阳, 薛涛, 刘俊华. 基于改进Stable Diffusion的时尚服饰图案生成[J]. 计算机与现代化, 2024, 0(12): 15-23.
[3]	黄庭培1, 马禄彪1, 李世宝2, 刘建航1. 基于WiFi和原型网络的手势识别方法[J]. 计算机与现代化, 2024, 0(12): 34-39.
[4]	张思敏, 刘新妹, 殷俊龄, 李宝玲. 基于YOLOv7改进的PCB缺陷检测方法[J]. 计算机与现代化, 2024, 0(12): 45-52.
[5]	张晓东1, 白广芝1, 李敏1, 李昊洋2. 基于经验小波变换的油气井产量预测模型 [J]. 计算机与现代化, 2024, 0(12): 53-58.
[6]	刘云海1, 冯广1, 吴晓婷2, 杨群2. 复杂施工场景下的安全帽佩戴检测算法[J]. 计算机与现代化, 2024, 0(12): 66-71.
[7]	谷岳, 邓松峰, 沈霁, 穆文涛, 赵恩棋. 基于改进YOLOv8的SAR舰船目标检测算法[J]. 计算机与现代化, 2024, 0(12): 78-83.
[8]	王艳媛, 茅正冲. 中英文场景文本图像的检测和识别算法[J]. 计算机与现代化, 2024, 0(12): 84-90.
[9]	李钧超1, 尤菲1, 张超2, 苏乐乐2, 龚龑2. 基于新型多目标浣熊优化算法的BiLSTM-Attention#br# 预测模型及误差分析[J]. 计算机与现代化, 2024, 0(11): 70-76.
[10]	张宇1, 2, 黎靖1, 2, 马铭1, 2, 王众祥1, 2, 孙妍1, 2. YOLOLW:一个新的轻量级目标检测模型[J]. 计算机与现代化, 2024, 0(11): 91-98.
[11]	董玉玟. 基于改进YOLOv7-tiny的多尺度运动目标检测算法[J]. 计算机与现代化, 2024, 0(11): 99-105.
[12]	祁贤, 刘大铭, 常佳鑫. 基于改进自注意力机制的多视图三维重建[J]. 计算机与现代化, 2024, 0(11): 106-112.
[13]	陈凯1, 李宜汀1, 2, 全华凤1 . 基于改进YOLOv8的河道废弃瓶检测方法[J]. 计算机与现代化, 2024, 0(11): 113-120.
[14]	杨骏1, 胡为1, 朱文福2. 基于改进MobileNetV3的视觉SLAM回环检测算法[J]. 计算机与现代化, 2024, 0(10): 21-26.
[15]	魏学诚1, 江凌云1, 李研2, 何非2. 改进YOLOv5的路侧单目视角小目标检测算法[J]. 计算机与现代化, 2024, 0(10): 27-34.