Semi-supervised Learning Method Based on Convolution and Sparse Coding

Abstract

Convolutional neural networks (CNNs) have achieved good results in semi-supervised learning: during training they use both labelled and unlabelled samples, and the unlabelled samples help regularize the learned model. To further strengthen the feature learning ability of semi-supervised models and improve their image classification performance, this paper proposes an end-to-end semi-supervised learning method that combines a deep semi-supervised CNN with dictionary learning, called Semi-supervised Learning based on Sparse Coding and Convolution (SSSConv). The framework aims to learn more discriminative image feature representations. SSSConv first extracts features with a CNN and applies an orthogonal projection to them. It then learns a low-dimensional embedding of their sparse codes to obtain the image representation, which is finally used for classification. The whole framework can be trained end to end in a semi-supervised manner: the CNN feature extraction part and the sparse coding dictionary learning part share a unified loss function and therefore a consistent objective. The parameters of the objective function are optimized with a conjugate gradient descent algorithm, the chain rule, and backpropagation. The sparse coding parameters are constrained to a manifold, while the CNN parameters can be defined either in Euclidean space or, further, in an orthogonal space. Experimental results on semi-supervised classification tasks verify the effectiveness of the proposed SSSConv framework, which is competitive with existing methods.

Key words: sparse representation, dictionary learning, convolutional neural network, semi-supervised learning, manifold, geometric optimization
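The pipeline described in the abstract (CNN features, orthogonal projection, sparse coding, low-dimensional embedding, classification) can be illustrated with a minimal NumPy sketch. It is not the authors' implementation. The names `P`, `D`, `U` and `W_cls`, all dimensions, and the l1-regularized objective solved with ISTA are assumptions made for illustration, since the abstract does not state the exact sparse coding objective. A random vector stands in for the CNN feature, and the QR-based retraction only shows one common way to keep a parameter on the set of matrices with orthonormal columns. The paper itself reports a conjugate gradient method on the manifold.

```python
import numpy as np

def soft_threshold(x, t):
    """Element-wise soft-thresholding, the proximal operator of t * ||.||_1."""
    return np.sign(x) * np.maximum(np.abs(x) - t, 0.0)

def random_orthonormal(rows, cols, rng):
    """Return a rows x cols matrix with orthonormal columns (rows >= cols)."""
    q, _ = np.linalg.qr(rng.standard_normal((rows, cols)))
    return q

def sparse_code_ista(x, D, lam=0.1, n_iter=100):
    """Approximately minimize 0.5 * ||x - D a||^2 + lam * ||a||_1 with ISTA.
    The l1 objective is an assumption, not taken from the paper."""
    lipschitz = np.linalg.norm(D, 2) ** 2  # squared spectral norm of D
    a = np.zeros(D.shape[1])
    for _ in range(n_iter):
        grad = D.T @ (D @ a - x)
        a = soft_threshold(a - grad / lipschitz, lam / lipschitz)
    return a

def qr_retraction(W, step):
    """Map W + step back to a matrix with orthonormal columns via QR."""
    q, r = np.linalg.qr(W + step)
    signs = np.sign(np.diag(r))
    signs[signs == 0] = 1.0
    return q * signs  # flip column signs so that diag(r) would be positive

def forward(feature, P, D, U, W_cls):
    """Hypothetical forward pass: project, sparse-code, embed, classify."""
    z = P.T @ feature            # coordinates of the feature in the subspace spanned by P
    a = sparse_code_ista(z, D)   # sparse code of the projected feature
    y = U.T @ a                  # low-dimensional embedding of the sparse code
    logits = W_cls @ y           # linear classifier scores
    return a, y, logits

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    feature = rng.standard_normal(64)      # stand-in for a CNN feature vector
    P = random_orthonormal(64, 32, rng)    # orthogonal projection
    D = rng.standard_normal((32, 48))
    D /= np.linalg.norm(D, axis=0)         # dictionary with unit-norm atoms
    U = random_orthonormal(48, 10, rng)    # embedding of the sparse codes
    W_cls = rng.standard_normal((4, 10))   # classifier for 4 hypothetical classes
    a, y, logits = forward(feature, P, D, U, W_cls)
    print(a.shape, y.shape, logits.shape)
    # One illustrative update that keeps P orthonormal.
    P_new = qr_retraction(P, 0.01 * rng.standard_normal(P.shape))
    print(np.allclose(P_new.T @ P_new, np.eye(32)))
```

In an end-to-end setting, the gradient of the shared loss would flow back through all of these stages to the CNN. The sketch covers only the forward pass and a single constrained update.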

LIU Ying-jie, LAN Hai, WEI Xian. Semi-supervised Learning Method Based on Convolution and Sparse Coding[J]. Computer and Modernization, 2022, 0(11): 9-16.

