融合注意力机制和空洞卷积的滑坡图像检测

摘要/Abstract

摘要： 滑坡区域图像检测与识别在灾害范围识别、灾情数据分析和防灾减灾中具有丰富的应用和研究价值。本文针对滑坡图像滑坡体形状纹理的多样性，以及滑坡目标区域检测识别效果不够理想的问题，提出一种注意力机制CBAM与空洞卷积结合的目标检测方法。在传统的目标检测算法Faster R-CNN的基础上，将注意力机制模型添加到卷积神经网络层，通过空间注意力与通道注意力结合的CBAM模型来进行滑坡图像特征的提取，增加空洞卷积模块来加大感受野区域，提高神经网络对遥感图像区域中的滑坡目标识别、尺寸不规范等特点的学习能力，从而进一步提升滑坡目标区域的检测精度。实验结果表明，在传统的目标检测算法的基础上采用两者结合的方式进行检测，可提升滑坡遥感图像上目标检测的召回率和精确率，具有一定的有效性和鲁棒性。

关键词: 滑坡, 注意力机制, Faster R-CNN, 空洞卷积, 目标检测

Abstract: Landslide area image detection and recognition has rich application and research value in disaster scope recognition, disaster data analysis and disaster prevention and mitigation. In this paper, a target detection method combining attention mechanism CBAM and dilated convolution is proposed to solve the problems of the diversity of landslide body shape and texture in landslide image and the unsatisfactory detection and recognition effect of landslide target area. On the basis of the traditional target detection algorithm Faster R-CNN, the attention mechanism model is added to the convolutional neural network layer. The landslide image features are extracted through the CBAM model combining spatial attention and channel attention, and the dilated convolution module is added to enlarge the receptive field area, and to improve the learning ability of the landslide target recognition and non-standard size in the remote sensing image area of the neural network, so as to further improve the detection accuracy of the landslide target area. The experimental results show that, based on the traditional target detection algorithm, the combination of the two methods can improve the recall rate and precision rate of target detection on the remote sensing images of landslides, and it has a certain validity and robustness.

Key words: landslide, attention mechanism, Faster R-CNN, dilated convolution, target detection

刘学虎, 欧鸥, 张伟劲, 杜雪垒. 融合注意力机制和空洞卷积的滑坡图像检测[J]. 计算机与现代化, 2022, 0(04): 45-51.

LIU Xue-hu, OU Ou, ZHANG Wei-jing, DU Xue-lei. Landslide Image Detection Based on Dilated Convolution and Attention Mechanism[J]. Computer and Modernization, 2022, 0(04): 45-51.

参考文献

［1］刘坚,李树林,陈涛. 基于优化随机森林模型的滑坡易发性评价［J］. 武汉大学学报(信息科学版), 2018,43(7):1085-1091.
［2］黄汀,白仙富,庄齐枫,等. 高分一号汶川极震区滑坡提取研究［J］. 测绘通报, 2018(2):67-71.
［3］闫琦,李慧,荆林海,等. 灾后高分辨率遥感影像的地震型滑坡信息自动提取算法研究［J］. 激光与光电子学进展, 2017,54(11):410-420.
［4］郭加伟,李永树,李政,等. 迁移学习支持下的高分影像山地滑坡灾害解译模型［J］. 测绘科学技术学报, 2016,33(5):496-501.
［5］ BEHLING R, ROESSNER S, KAUFMANN H, et al. Automated spatiotemporal landslide mapping over large areas using rapidEye time series data［J］. Remote Sensing, 2014,6(9):8026-8055.
［6］方教勇. 基于GIS的清平地区滑坡分布分形特征与危险性评价研究［D］. 成都:成都理工大学, 2018.
［7］ OTHMAN A A, GLOAGUEN R. Automatic extraction and size distribution of landslides in Kurdistan Region, NE Iraq［J］. Remote Sensing, 2013,5(5):2389-2410.
［8］贺素歌. SAR图像用于震害信息变化检测中的处理方法研究［D］. 北京:中国地震局地震预测研究所, 2013.
［9］ JI S P, YU D W, SHEN C Y, et al. Landslide detection from an open satellite imagery and digital elevation model dataset using attention boosted convolutional neural networks［J］. Landslides, 2020,17(6):1337-1352.
［10］ PIRALILOU S T, SHAHABI H, JARIHANI B, et al. Landslide detection using multi-scale image segmentation and different machine learning models in the higher Himalayas［J］. Remote Sensing, 2019,11(21). DOI: 103390/rs11212575.
［11］ CAN R, KOCAMAN S, GOKCEOGLU C. A convolutional neural network architecture for auto-detection of landslide photographs to assess citizen science and volunteered geographic information data quality［J］. ISPRS International Journal of Geo-Information, 2019,8(7). DOI:10.3390/ijgi8070300.
［12］ GHORBANZADEH O, BLASCHKE T, GHOLAMNIA K, et al. Evaluation of different machine learning methods and deep-learning convolutional neural networks for landslide detection［J］. Remote Sensing, 2019,11(2). DOI: 10.3390/rs11020196.
［13］GIRSHICK R, DONAHUE J, DARRELL T, et al. Rich feature hierarchies for accurate object detection and semantic segmentation［C］// Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition. 2014:580-587.
［14］GIRSHICK R. Fast R-CNN［C］// Proceedings of the 2015 IEEE International Conference on Computer Vision (ICCV). 2015:1440-1448.
［15］REN S Q, HE K M, GIRSHICK R, et al. Faster R-CNN: Towards real-time object detection with region proposal networks［J］. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017,39(6):1137-1149.
［16］ LIN T Y, DOLLAR P, GIRSHICK R, et al. Feature pyramid networks for object detection［C］// Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition. 2017:936-944.
［17］HE K M, GKIOXARI G, DOLLAR P, et al. Mask R-CNN［J］. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2020,42(2):386-397.
［18］LIU W, ANGUELOV D, ERHAN D, et al. SSD: Single shot multiBox detector［C］// Proceedings of the 2016 European Conference on Computer Vision. 2016:21-37.
［19］REDMON J, DIVVALA S, GIRSHICK R, et al. You only look once: Unified, real-time object detection［C］// Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2016:779-788.
［20］LIN T Y, GOYAL P, GIRSHICK R, et al. Focal loss for dense object detection［C］// Proceedings of the 2017 IEEE International Conference on Computer Vision (ICCV). 2017:2999-3007.
［21］MA J Q, SHAO W Y, YE H, et al. Arbitrary-oriented scene text detection via rotation proposals［J］. IEEE Transactions on Multimedia, 2018,20(11):3111-3122.
［22］YU F, KOLTUN V. Multi-scale context aggregation by dilated convolutions［J］. arXiv preprint arXiv:1511.07122, 2015.
［23］HU J, SHEN L, SUN G. Squeeze-and-excitation networks［C］// Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2018:7132-7141.
［24］ZHANG Y L, LI K P, LI K, et al. Image super-resolution using very deep residual channel attention networks［C］// Proceedings of the 2018 European Conference on Computer Vision. 2018:294-310.
［25］WOO S Y, PARK J C, LEE J Y, et al. CBAM: Convolutional block attention module［C］// Proceedings of the 2018 European Conference on Computer Vision. 2018:3-19.
［26］HE K M, ZHANG X Y, REN S Q, et al. Deep residual learning for image recognition［C］// Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition. 2016:770-778.
［27］JIANG B R, LUO R X, MAO J Y, et al. Acquisition of localization confidence for accurate object detection［C］// Proceedings of the 2018 European Conference on Computer Vision. 2018:816-832.
［28］REZATOFIGHI H, TSOI N, GWAK J Y, et al. Generalized intersection over union: A metric and a loss for bounding box regression［C］// Proceedings of the 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 2019:658-666.

[1]	何思达, 陈平华. 基于意图的轻量级自注意力序列推荐模型[J]. 计算机与现代化, 2024, 0(12): 1-9.
[2]	赵晨阳, 薛涛, 刘俊华. 基于改进Stable Diffusion的时尚服饰图案生成[J]. 计算机与现代化, 2024, 0(12): 15-23.
[3]	黄庭培1, 马禄彪1, 李世宝2, 刘建航1. 基于WiFi和原型网络的手势识别方法[J]. 计算机与现代化, 2024, 0(12): 34-39.
[4]	张晓东1, 白广芝1, 李敏1, 李昊洋2. 基于经验小波变换的油气井产量预测模型 [J]. 计算机与现代化, 2024, 0(12): 53-58.
[5]	刘云海1, 冯广1, 吴晓婷2, 杨群2. 复杂施工场景下的安全帽佩戴检测算法[J]. 计算机与现代化, 2024, 0(12): 66-71.
[6]	谷岳, 邓松峰, 沈霁, 穆文涛, 赵恩棋. 基于改进YOLOv8的SAR舰船目标检测算法[J]. 计算机与现代化, 2024, 0(12): 78-83.
[7]	王艳媛, 茅正冲. 中英文场景文本图像的检测和识别算法[J]. 计算机与现代化, 2024, 0(12): 84-90.
[8]	陈亮, 李诚, 易伟, 熊伟, 汪晓帆, 唐海东. 基于毫米波雷达与视觉融合的电力现场安全帽佩戴检测[J]. 计算机与现代化, 2024, 0(12): 100-107.
[9]	李钧超1, 尤菲1, 张超2, 苏乐乐2, 龚龑2. 基于新型多目标浣熊优化算法的BiLSTM-Attention#br# 预测模型及误差分析[J]. 计算机与现代化, 2024, 0(11): 70-76.
[10]	张宇1, 2, 黎靖1, 2, 马铭1, 2, 王众祥1, 2, 孙妍1, 2. YOLOLW:一个新的轻量级目标检测模型[J]. 计算机与现代化, 2024, 0(11): 91-98.
[11]	董玉玟. 基于改进YOLOv7-tiny的多尺度运动目标检测算法[J]. 计算机与现代化, 2024, 0(11): 99-105.
[12]	祁贤, 刘大铭, 常佳鑫. 基于改进自注意力机制的多视图三维重建[J]. 计算机与现代化, 2024, 0(11): 106-112.
[13]	陈凯1, 李宜汀1, 2, 全华凤1 . 基于改进YOLOv8的河道废弃瓶检测方法[J]. 计算机与现代化, 2024, 0(11): 113-120.
[14]	杨骏1, 胡为1, 朱文福2. 基于改进MobileNetV3的视觉SLAM回环检测算法[J]. 计算机与现代化, 2024, 0(10): 21-26.
[15]	魏学诚1, 江凌云1, 李研2, 何非2. 改进YOLOv5的路侧单目视角小目标检测算法[J]. 计算机与现代化, 2024, 0(10): 27-34.