计算机与现代化 ›› 2025, Vol. 0 ›› Issue (02): 121-126.doi: 10.3969/j.issn.1006-2475.2025.02.017

• 图像处理 • 上一篇    

基于门控融合的实时语义分割


  

  1. (重庆师范大学计算机与信息科学学院,重庆 401331)
  • 出版日期:2025-02-28 发布日期:2025-02-28
  • 基金资助:
    重庆市技术创新与应用发展重点项目(cstc2019jscx-mbdxX0061)

Real-time Semantic Segmentation Based on Gate-controlled Fusion

  1. (College of Computer and Information Science, Chongqing Normal University, Chongqing 401331, China)
  • Online:2025-02-28 Published:2025-02-28

摘要: 实时语义分割中特征融合需要同时关注浅层信息与深层信息,而目前特征融合方法所需的计算量和参数量庞大,难以在精度与速度上满足实时语义分割的要求。针对这一问题,从网络的实时性和性能2个方面综合考虑,本文提出基于门控融合的实时语义分割方法。该方法包含编码器、门控特征融合模块、像素级特征提取模块以及门控聚合分割头。首先,将待分割图片通过编码器进行特征提取,其次利用像素级特征提取模块对重要特征信息进行精确提取,然后通过门控特征融合模块将深层语义信息与浅层位置信息进行特征融合,最后通过门控聚合分割头完成语义分割。在数据集CamVid上,模型分割的平均交并比为87.31%,分割的帧率为75.3 fps。在数据集Cityscapes上,模型分割的平均交并比为79.19%,分割帧率为44.1 fps。实验结果表明,本文方法在准确性和实时性方面均表现出色,可有效应用于实时语义分割任务

关键词: 实时语义分割, 特征融合, 图像处理, 图像语义分割

Abstract: Feature fusion in real-time semantic segmentation needs to pay attention to both shallow and deep information, while the current feature fusion methods require a huge amount of computation and parameter count, which is difficult to meet the requirements of real-time semantic segmentation in terms of accuracy and speed. To address this problem, a real-time semantic segmentation method based on gated fusion is proposed from the comprehensive consideration of both real-time and performance of the network. The method contains an encoder, a gated feature fusion module, a pixel-level feature extraction module, and a gated aggregation segmentation head. Firstly, the image to be segmented is feature extracted by the encoder. Secondly, the important feature information is accurately extracted by the pixel-level feature extraction module, then the deep semantic information and the shallow location information are feature fused by the gated feature fusion module. Finally the semantic segmentation is completed by the gated aggregation segmentation head. On the dataset CamVid, the mean intersection over union of the model segmentation is 87.31%, and the frame rate of segmentation is 75.3 fps. On the dataset Cityscapes, the mean intersection over union of the model segmentation is 79.19%, and the frame rate of segmentation is 44.1 fps. Experimental results show that the proposed segmentation method performs well in both accuracy and real-time, and it can be effectively applied to real-time semantic segmentation tasks.

Key words: real-time semantic segmentation, feature fusion, image processing, image semantic segmentation

中图分类号: