计算机与现代化 ›› 2024, Vol. 0 ›› Issue (10): 7-13.doi: 10.3969/j.issn.1006-2475.2024.10.002

• 算法设计与分析 • 上一篇    下一篇

雨天道路场景语义分割算法及其移动端部署


  

  1. (南京航空航天大学自动化学院,江苏 南京 211106)
  • 出版日期:2024-10-29 发布日期:2024-10-30
  • 基金资助:
    南京航空航天大学科研与实践创新计划项目(XCXJH20220318)

Semantic Segmentation Algorithm for Rainy Road Scene and Its Mobile Deployment

  1. (College of Automation Engineering, Nanjing University of Aeronautics and Astronautics, Nanjing 211106, China)
  • Online:2024-10-29 Published:2024-10-30

摘要: 现有语义分割模型容易受到雨滴遮蔽干扰,在雨天道路场景数据集上表现不佳;且没有着重关注道路场景中较为重要的车辆和行人2个类别。针对上述2个问题,设计雨天道路场景语义分割算法并将其部署在移动端,以促进自动驾驶技术的发展。提出快速融合金字塔池化模块(Fast Fusion Pyramid Pooling Module, FFPPM),使特征图融合丰富的全局语义信息和局部细节信息,有效分割雨滴遮蔽的场景;提出多重注意力融合模块(Multiple Attention Fusion Module, MAFM),并提高损失函数中车辆和行人的类别权重,增加模型对车辆和行人的关注度;借助Android Studio平台将模型部署到移动端,使用ONNX Runtime进行前向推理,分割效果与电脑端一致。在Rainy WCity数据集上与较新的5种模型进行比较,本文模型在电脑端和移动端分割精度一致;PA和mIoU分别为95.25%和72.96%,车辆PA和IoU分别为84.04%和74.15%,行人PA和IoU分别为34.91%和26.37%,均高于其他5种模型;此外本文模型在电脑端和移动端的FPS分别为45.46和1.26,分割速度较快。本文模型能够在移动端有效分割雨水遮蔽下的道路场景图像,对车辆和行人分割更加精确。

关键词: 语义分割, 道路场景, 雨滴遮蔽, 类别权重, 移动端部署

Abstract: The existing semantic segmentation models are susceptible to the interference of raindrop occlusion, and the performance is poor on the rainy road scene dataset. Moreover, they did not focus on the two important categories of vehicles and pedestrians in the road scene. Aiming at the above two problems, this paper designs a semantic segmentation algorithm for rainy road scenes and deploys it on a mobile terminal to promote the development of autonomous driving technology. A fast fusion pyramid pooling module is proposed to make the feature map integrate rich global semantic information and local detail information, and effectively segment the raindrop obscured scene. A multiple attention fusion module is proposed, and the category weight of vehicles and pedestrians in the loss function is increased to enhance the model’s attention to vehicles and pedestrians. The model is deployed to the mobile terminal with the help of Android Studio platform, and the ONNX Runtime is used for forward inference, and the segmentation effect is consistent with that of the computer terminal. Compared with five recent models on the Rainy WCity dataset, the segmentation accuracy of this model is the same on the computer terminal and the mobile terminal. Specifically, PA and mIoU are 95.25% and 72.96%, vehicle PA and IoU are 84.04% and 74.15%, and pedestrian PA and IoU are 34.91% and 26.37%, respectively, which are higher than those of the other five models. In addition, the FPS of the model in the computer and mobile terminals are 45.46 and 1.26, respectively, and the segmentation speed is fast. The model proposed in this paper can effectively segment the road scene image under the shelter of rain on the mobile terminal, and it is more accurate to segment vehicles and pedestrians.

Key words:  , semantic segmentation; road scene; raindrop masking; class weight; mobile deployment

中图分类号: