One-stage Semi-supervised Object Detection by Reusing Unreliable Pseudo-labels

doi:10.3969/j.issn.1006-2475.2025.03.008

Abstract

The key to semi-supervised object detection is assigning pseudo-labels to the objects in unlabeled data. To ensure pseudo-label quality, semi-supervised object detection methods usually filter out unreliable pseudo-labels with a confidence threshold, which discards a large number of pseudo-labels because of their low confidence. This paper proposes an improved semi-supervised learning method that uses contrastive learning to reuse these low-confidence, unreliable pseudo-labels and thereby improve detection performance. Specifically, the pseudo-labels of unlabeled data are divided into reliable and unreliable ones according to prediction confidence. In addition to the reliable pseudo-labels, the unreliable pseudo-labels are used as negative samples in contrastive learning to train the model. To balance the number of unreliable pseudo-labels across classes, a memory module is designed to store the unreliable pseudo-labels from different batches during training. Experimental results show that on the COCO dataset, with 1%, 5%, and 10% of the training data labeled, the improved method achieves a mean average precision (mAP) of 13.6%, 23.0%, and 27.5%, respectively, outperforming existing semi-supervised learning methods. On the COCO-additional dataset, it reaches an mAP of 44.7%, which is 4.5 percentage points higher than supervised learning.

Key words: semi-supervised learning, object detection, contrastive learning, reusing unreliable pseudo-labels, end-to-end training
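The three steps named in the abstract (splitting pseudo-labels by confidence, storing the unreliable ones across batches, and using them as contrastive negatives) can be illustrated with a minimal sketch. It is not the authors' implementation. The names `split_pseudo_labels`, `UnreliableMemory`, and `info_nce`, the threshold 0.7, the temperature 0.5, and the queue capacity are all hypothetical. Two further assumptions are made here: an unreliable pseudo-label predicted as class k serves as a negative for anchors of every other class, and the contrastive loss is the standard InfoNCE form. The abstract specifies neither.

```python
import math
from collections import defaultdict, deque


def split_pseudo_labels(predictions, threshold=0.7):
    """Split (class_id, confidence, feature) tuples by confidence.

    The threshold value is an illustrative assumption.
    """
    reliable, unreliable = [], []
    for pred in predictions:
        (reliable if pred[1] >= threshold else unreliable).append(pred)
    return reliable, unreliable


class UnreliableMemory:
    """Per-class FIFO queues holding features of unreliable pseudo-labels.

    A fixed capacity per class keeps frequent classes from dominating
    the pool of negatives (hypothetical balancing scheme).
    """

    def __init__(self, capacity_per_class=4):
        self.queues = defaultdict(lambda: deque(maxlen=capacity_per_class))

    def update(self, unreliable):
        # Oldest entries are dropped automatically once a queue is full.
        for class_id, _confidence, feature in unreliable:
            self.queues[class_id].append(feature)

    def negatives_for(self, class_id):
        # Assumed pairing rule: features stored under other classes
        # act as negatives for an anchor of `class_id`.
        return [f for c, q in self.queues.items() if c != class_id for f in q]


def info_nce(anchor, positive, negatives, temperature=0.5):
    """Standard InfoNCE loss for one anchor, using dot-product similarity."""
    def dot(u, v):
        return sum(a * b for a, b in zip(u, v))

    pos = math.exp(dot(anchor, positive) / temperature)
    neg = sum(math.exp(dot(anchor, n) / temperature) for n in negatives)
    return -math.log(pos / (pos + neg))


# Toy batch: (class_id, confidence, unit-length feature vector).
batch = [(0, 0.9, [1.0, 0.0]), (1, 0.3, [0.0, 1.0]), (0, 0.2, [0.6, 0.8])]
reliable, unreliable = split_pseudo_labels(batch)
memory = UnreliableMemory()
memory.update(unreliable)
class_id, _, anchor = reliable[0]
loss = info_nce(anchor, anchor, memory.negatives_for(class_id))
print(f"contrastive loss for class {class_id}: {loss:.4f}")
```

In this sketch the memory persists across calls to `update`, so a class that is rare in one batch can still contribute negatives collected from earlier batches.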

CLC number:

TP391

SHAO Yeqin, WANG Haiquan, ZHOU Kunyang, GUO Yudi, SHI Quan. One-stage Semi-supervised Object Detection by Reusing Unreliable Pseudo-labels[J]. Computer and Modernization, 2025(3): 52-59.

