计算机与现代化 ›› 2023, Vol. 0 ›› Issue (12): 7-13.doi: 10.3969/j.issn.1006-2475.2023.12.002

• 人工智能 • 上一篇    下一篇

基于可学习记忆特征金字塔网络的小样本目标检测

  

  1. (1.广东工业大学计算机学院,广东 广州 510006; 2.上海交通大学自动化学院,上海 200030;
    3.威斯康星康考迪亚大学,威斯康星 梅库恩 WI 53097)
  • 出版日期:2023-12-24 发布日期:2024-01-24
  • 作者简介:夏千涵(1998—),女,吉林四平人,硕士研究生,研究方向:深度学习,机器视觉,E-mail: 541949442@qq.com; 何胜煌(1992—),男,福建龙岩人,博士后,研究方向:深度学习,医工结合,多智能体系统,E-mail: shhesjtu@sjtu.edu.cn; 通信作者:吴元清(1985—),男,广东广州人,教授,研究方向:无人智能小车编队控制,机器视觉处理,E-mail: yqwuzju@163.com; 赵乐乐(1984—),女,吉林长春人,硕士研究生,研究方向:机械设计,E-mail: lelezhaochina@163.com。
  • 基金资助:
    国家自然科学基金资助项目(U22A2065, 62003100, 62276074); 国家重点发展计划项目(2022YFB4701300); 广东省基础和应用基础研究基金资助项目(2021B15120058)

Few-shot Object Detection via Learnable Memory Feature Pyramid Network

  1. (1. School of Computer Science and Technology, Guangdong University of Technology, Guangzhou 510006, China;
    2. School of Automation, Shanghai Jiaotong University, Shanghai 200030, China;
    3. Concordia University Wisconsin, Mequon WI 53097, USA)
  • Online:2023-12-24 Published:2024-01-24

摘要: 摘要:现阶段,部分行业应用场景数据难以获取,从而产生的小样本问题成为制约深度学习技术应用推广的重要因素。本文通过小样本方法来提升模型在数据缺乏情况下的表现,降低深度学习模型对数据的依赖性,提出一种基于可学习记忆特征金字塔网络来保留更干净的多尺度特征信息用于分类器预测。借助自适应特征融合模块,让网络自行选择不同层级特征间的侧重比,最大化保留不同尺度的判别性特征信息。同时还加入回溯特征对齐模块,用于缓解特征层堆叠时引入的特征混淆效应。实验结果表明,通过克服样本依赖性可以有效地提升模型性能,改进后的模型可以在COCO数据集和VOC数据集上超越其他现有同类型的模型。特别地,在VOC数据集中将先验参数k设置为5的情况下,nAP50提高了4.8达到44.7;在COCO数据集中将先验参数k设置为30的情况下,nAP50提高了4.0达到29.4。

关键词: 关键词:小样本, 自适应融合, 特征对齐, 特征金字塔网络

Abstract: Abstract: At present, it is difficult to obtain the data of some industry application scenarios, and the problem of few shot has become an important factor restricting the application and promotion of deep learning technology. In this paper, few shot method is adopted to improve the performance of the model in the absence of data and reduce the dependence of the deep learning model on data, and few-shot object detection via learnable memory feature pyramid network is proposed to retain cleaner multi-scale feature information for classifier prediction. With the help of the adaptive feature fusion module, the network can choose the emphasis ratio among the features of different levels to maximize the retention of discriminant feature information of different scales. At the same time, we also add a retrospective feature alignment module to alleviate the feature confusion effect introduced by stacking feature layers. The experimental results show that the model performance can be effectively improved by overcoming the dependence on data, and the improved model can surpass other existing models of the same type in the COCO dataset and VOC dataset. In particular, when the prior parameter k is set to 5 in VOC dataset, nAP50 increases by 4.8 to 44.7; when the prior parameter k is set to 30 in COCO dataset, nAP50 increases by 4.0 to 29.4.

Key words: Key words: few shot, adaptive fusion, feature alignment, feature pyramid network

中图分类号: