Computer and Modernization ›› 2023, Vol. 0 ›› Issue (12): 7-13.doi: 10.3969/j.issn.1006-2475.2023.12.002

Previous Articles     Next Articles

Few-shot Object Detection via Learnable Memory Feature Pyramid Network

  

  1. (1. School of Computer Science and Technology, Guangdong University of Technology, Guangzhou 510006, China;
    2. School of Automation, Shanghai Jiaotong University, Shanghai 200030, China;
    3. Concordia University Wisconsin, Mequon WI 53097, USA)
  • Online:2023-12-24 Published:2024-01-24

Abstract: Abstract: At present, it is difficult to obtain the data of some industry application scenarios, and the problem of few shot has become an important factor restricting the application and promotion of deep learning technology. In this paper, few shot method is adopted to improve the performance of the model in the absence of data and reduce the dependence of the deep learning model on data, and few-shot object detection via learnable memory feature pyramid network is proposed to retain cleaner multi-scale feature information for classifier prediction. With the help of the adaptive feature fusion module, the network can choose the emphasis ratio among the features of different levels to maximize the retention of discriminant feature information of different scales. At the same time, we also add a retrospective feature alignment module to alleviate the feature confusion effect introduced by stacking feature layers. The experimental results show that the model performance can be effectively improved by overcoming the dependence on data, and the improved model can surpass other existing models of the same type in the COCO dataset and VOC dataset. In particular, when the prior parameter k is set to 5 in VOC dataset, nAP50 increases by 4.8 to 44.7; when the prior parameter k is set to 30 in COCO dataset, nAP50 increases by 4.0 to 29.4.

Key words: Key words: few shot, adaptive fusion, feature alignment, feature pyramid network

CLC Number: