Object Detection in Remote Sensing Images Based on Software and Hardware Co-acceleration Framework

Computer and Modernization, 2022, Vol. 0, Issue (06): 109-115.

(1. Xidian University, Xi’an 710071, China;
2. Shaanxi Aerospace Technology Application Research Institute Co., Ltd., Xi’an 710100, China)

Online: 2022-06-23 Published: 2022-06-23
About the authors: TAN Jin-lin (born 1982), male, from Xi’an, Shaanxi, senior engineer, master's degree; research interests: aerospace electronic information systems, remote sensing image processing; E-mail: tjl.king@163.com. FAN Wen-tong (born 1996), male, from Wuhu, Anhui, master's student; research interests: remote sensing image processing, object detection, FPGA design; E-mail: slgwanmiao@sina.com. LIU Ya-hu (born 1986), male, engineer, master's degree; research interests: remote sensing information processing; E-mail: 15029202330@139.com.
Funding:
Natural Science Foundation of Shaanxi Province, Youth Fund Project (2019JQ270)

Abstract

Abstract: The computational complexity and memory requirements of object detection models for remote sensing images have grown rapidly, which makes them difficult to deploy on small, low-power embedded platforms. To address this problem, this paper proposes a software and hardware co-acceleration framework based on a field-programmable gate array (FPGA) that accelerates inference for remote sensing object detection models. First, the parameters of a trained YOLOv3 network are compressed and compiled following the Vitis AI acceleration scheme. Second, the underlying hardware project, which includes a deep-learning processing unit (DPU) module, is built on the FPGA side, and a DPU task scheduling program is written for the ARM processor. Finally, FPGA-based inference acceleration is implemented on a Zynq SoC development platform. Experimental results show that the framework achieves an average throughput of 1.75 TOPS (26.8 fps) on the Xilinx Zynq MPSoC and a mean Average Precision (mAP) of 56.7% on the DIOR dataset.
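
As a rough plausibility check on the reported figures, the sketch below converts the stated throughput and frame rate into an implied workload per frame. It assumes, which the abstract does not state explicitly, that 1.75 TOPS and 26.8 fps describe the same inference run and that 1 TOPS means 10^12 operations per second. The function name `ops_per_frame` is illustrative and does not come from the paper.

```python
def ops_per_frame(throughput_tops: float, fps: float) -> float:
    """Return the implied workload per frame in GOP (10^9 operations).

    Assumption: throughput and frame rate were measured on the same run,
    so operations per frame = operations per second / frames per second.
    """
    if fps <= 0:
        raise ValueError("fps must be positive")
    ops_per_second = throughput_tops * 1e12  # 1 TOPS = 1e12 operations/s
    return ops_per_second / fps / 1e9


# Figures taken from the abstract.
gop = ops_per_frame(1.75, 26.8)
frame_period_ms = 1000.0 / 26.8  # average time budget per frame

print(f"{gop:.1f} GOP per frame, {frame_period_ms:.1f} ms per frame")
```

Under these assumptions the workload comes to roughly 65 GOP per frame, with an average frame period of about 37 ms. That is the same order as the approximately 65.9 billion operations listed for a 416×416 input in the YOLOv3 paper (arXiv:1804.02767). The abstract does not give the input resolution, so this is only a consistency check, not a statement of the authors' configuration.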

Key words: remote sensing images, object detection, convolutional neural network, field programmable gate array

TAN Jin-lin, FAN Wen-tong, LIU Ya-hu, LIANG Zhi-feng, WANG Liang, LIU Bin, HUANG Bin. Object Detection in Remote Sensing Images Based on Software and Hardware Co-acceleration Framework[J]. Computer and Modernization, 2022, 0(06): 109-115.
