Computer and Modernization ›› 2021, Vol. 0 ›› Issue (04): 117-121.

• Information Security •

Black Box Adversarial Attack Algorithm Based on Deep Reinforcement Learning

  1. (College of Computer and Information, Hohai University, Nanjing 211100, Jiangsu, China)
  • Online: 2021-04-22  Published: 2021-04-25
  • About the authors: LI Meng (1995—), male, born in Chongqing, master's student; research interests: machine learning, adversarial attacks; e-mail: limeng1995hhu@163.com. Corresponding author: HAN Lixin (1967—), male, born in Nanjing, Jiangsu, professor, doctoral supervisor, Ph.D.; research interests: information retrieval, data mining, pattern recognition; e-mail: lixinhan2002@aliyun.com.


Abstract: To address the problem of black-box adversarial attacks in the field of image recognition, a black-box adversarial attack algorithm is proposed based on the DDQN framework and the Dueling network structure from reinforcement learning. The agent generates adversarial examples by imitating the way a human adjusts an image, interacts with the attacked model to obtain misclassification results, and receives a reward computed from the structural similarity between the clean sample and the adversarial sample. During the attack, only the label output of the attacked model is used. Experimental results show that the success rate of attacking four deep neural network models trained on the CIFAR10 and CIFAR100 datasets exceeds 90%, and that the quality of the generated adversarial examples is close to that of the white-box attack algorithm FGSM, with a higher success rate.
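The abstract names two reusable building blocks: the Dueling Q-value aggregation and a reward that couples misclassification with the structural similarity (SSIM) between the clean and adversarial samples. A minimal NumPy sketch of both is given below; the function names, the single-window global SSIM, and the zero reward on a failed attack are illustrative assumptions, not the paper's implementation:

```python
import numpy as np

def ssim_global(x, y, data_range=1.0):
    """Simplified whole-image SSIM (single window). Standard SSIM is
    computed over local windows and averaged; this global variant keeps
    the sketch self-contained."""
    c1 = (0.01 * data_range) ** 2
    c2 = (0.03 * data_range) ** 2
    mx, my = x.mean(), y.mean()
    vx, vy = x.var(), y.var()
    cov = ((x - mx) * (y - my)).mean()
    return ((2 * mx * my + c1) * (2 * cov + c2)) / \
           ((mx ** 2 + my ** 2 + c1) * (vx + vy + c2))

def dueling_q(state_value, advantages):
    """Dueling aggregation: Q(s, a) = V(s) + A(s, a) - mean over a' of A(s, a')."""
    return state_value + advantages - advantages.mean()

def attack_reward(clean, adv, true_label, predicted_label):
    """Reward the agent only when the attacked model mislabels the
    adversarial example; higher structural similarity to the clean
    sample earns a higher reward."""
    if predicted_label != true_label:
        return float(ssim_global(clean, adv))
    return 0.0  # no reward while the model still classifies correctly
```

In the paper's setting, `advantages` would come from the Dueling network's advantage head and `predicted_label` from querying the attacked model, whose label output is the only information the black-box attack uses.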

Key words: adversarial examples, black-box attacks, deep learning, reinforcement learning