计算机与现代化 ›› 2022, Vol. 0 ›› Issue (04): 45-51.

• 图像处理 • 上一篇    下一篇

融合注意力机制和空洞卷积的滑坡图像检测

  

  1. (成都理工大学计算机与网络安全学院(牛津布鲁克斯学院),四川成都610059)
  • 出版日期:2022-05-07 发布日期:2022-05-07
  • 作者简介:刘学虎(1995—),男,云南昭通人,硕士研究生,研究方向:计算机视觉,人工智能与模式识别,E-mail: liuxuehu@stu.cdut.edu.cn; 欧鸥(1978—),男,四川南充人,教授,博士,研究方向:复杂网络,智能计算,空间信息技术,E-mail: ouou@cdut.edu.cn; 张伟劲(1993—),男,四川宜宾人,硕士研究生,研究方向:深度学习,时间序列,神经网络,E-mail: 897535518@qq.com; 杜雪垒(1997—),男,湖北十堰人,硕士研究生,研究方向:深度学习,E-mail: 1320946997@qq.com。
  • 基金资助:
    国家重点研发计划项目(2018YFF01013304); 四川气象灾害预测预警与应急管理研究中心项目(ZHYJ17-YB08)

Landslide Image Detection Based on Dilated Convolution and Attention Mechanism

  1. (College of Computer and Network Security (Oxford Brookes College), Chengdu University of Technology, Chengdu 610059, China)

  • Online:2022-05-07 Published:2022-05-07

摘要: 滑坡区域图像检测与识别在灾害范围识别、灾情数据分析和防灾减灾中具有丰富的应用和研究价值。本文针对滑坡图像滑坡体形状纹理的多样性,以及滑坡目标区域检测识别效果不够理想的问题,提出一种注意力机制CBAM与空洞卷积结合的目标检测方法。在传统的目标检测算法Faster R-CNN的基础上,将注意力机制模型添加到卷积神经网络层,通过空间注意力与通道注意力结合的CBAM模型来进行滑坡图像特征的提取,增加空洞卷积模块来加大感受野区域,提高神经网络对遥感图像区域中的滑坡目标识别、尺寸不规范等特点的学习能力,从而进一步提升滑坡目标区域的检测精度。实验结果表明,在传统的目标检测算法的基础上采用两者结合的方式进行检测,可提升滑坡遥感图像上目标检测的召回率和精确率,具有一定的有效性和鲁棒性。

关键词: 滑坡, 注意力机制, Faster R-CNN, 空洞卷积, 目标检测

Abstract: Landslide area image detection and recognition has rich application and research value in disaster scope recognition, disaster data analysis and disaster prevention and mitigation. In this paper, a target detection method combining attention mechanism CBAM and dilated convolution is proposed to solve the problems of the diversity of landslide body shape and texture in landslide image and the unsatisfactory detection and recognition effect of landslide target area. On the basis of the traditional target detection algorithm Faster R-CNN, the attention mechanism model is added to the convolutional neural network layer. The landslide image features are extracted through the CBAM model combining spatial attention and channel attention, and the dilated convolution module is added to enlarge the receptive field area, and to improve the learning ability of the landslide target recognition and non-standard size in the remote sensing image area of the neural network, so as to further improve the detection accuracy of the landslide target area. The experimental results show that, based on the traditional target detection algorithm, the combination of the two methods can improve the recall rate and precision rate of target detection on the remote sensing images of landslides, and it has a certain validity and robustness.


Key words: landslide, attention mechanism, Faster R-CNN, dilated convolution, target detection