Black Box Adversarial Attack Algorithm Based on Deep Reinforcement Learning

Abstract

Abstract: Aiming at the problem of black box adversarial attack in the field of image recognition, a black box adversarial attack algorithm is proposed based on the DDQN framework and Dueling network structure in reinforcement learning. The agent generates an adversarial sample by imitating human adjustment of the image, interacts with the attacked model to obtain misclassification results, and calculates the structural similarity of the clean sample and the adversarial sample to generate a reward. During the attack, only the label output information of the attacked model was obtained. The experimental results show that the success rate of attacking the four deep neural network models trained on the CIFAR10 and CIFAR100 datasets exceeds 90%. The quality of the generated adversarial samples is similar to the white box attack algorithm FGSM and the success rate is more advantageous.

Key words: adversarial samples, black box attacks, deep learning, reinforcement learning

LI Meng, HAN Li-xin. Black Box Adversarial Attack Algorithm Based on Deep Reinforcement Learning[J]. Computer and Modernization, 2021, 0(04): 117-121.

References

［1］ TAIGMAN Y, YANG M, RANZATO M, et al. DeepFace: Closing the gap to human-level performance in face verification［C］// Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition. 2014:1701-1708.
［2］ GIRSHICK R, DONAHUE J, DARRELL T, et al. Rich feature hierarchies for accurate object detection and semantic segmentation［C］// Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition. 2014:580-587.
［3］ DANELLJAN M, HAGER G, KHAN F S, et al. Convolutional features for correlation filter based visual tracking［C］// Proceedings of the 2015 IEEE International Conference on Computer Vision Workshop (ICCVW). 2015:621-629.
［4］ SZEGEDY C, ZAREMBA W, SUTSKEVER I, et al. Intriguing properties of neural networks［J］. arXiv preprint arXiv:1312.6199, 2013.
［5］ YUAN X Y, HE P, ZHU Q L, et al. Adversarial examples: Attacks and defenses for deep learning［J］. IEEE Transactions on Neural Networks and Learning Systems, 2019,30(9):2805-2824.
［6］〖KG-*4/5〗CHEN J B, JORDAN M I, WAINWRIGHT M J. HopSkipJumpAttack: A query-efficient decision-based attack［J］. arXiv preprint arXiv:1904.02144, 2019.
［7］ GOODFELLOW I J, SHLENS J, SZEGEDY C. Explaining and harnessing adversarial examples［J］. arXiv preprint arXiv:1412.6572, 2014.
［8］ PAPERNOT N, MCDANIEL P, JHA S, et al. The limitations of deep learning in adversarial settings［C］// Proceedings of the 2016 IEEE European Symposium on Security and Privacy (EuroS&P). 2016:372-387.
［9］ CHEN P Y, ZHANG H, SHARMA Y, et al. ZOO: Zeroth order optimization based black-box attacks to deep neural networks without training substitute models［C］// Proceedings of the 10th ACM Workshop on Artificial Intelligence and Security. 2017:15-26.
［10］SU J W, VARGAS D V, SAKURAI K. One pixel attack for fooling deep neural networks［J］. IEEE Transactions on Evolutionary Computation, 2019,23(5):828-841.
［11］RITTER S, BARRETT D G T, SANTORO A, et al. Cognitive psychology for deep neural networks: A shape bias case study［C］// Proceedings of the 34th International Conference on Machine Learning. 2017:2940-2949.
［12］LI Y X. Deep reinforcement learning: An overview［J］. arXiv preprint arXiv:1701.07274, 2017.
［13］MNIH V, KAVUKCUOGLU K, SILVER D, et al. Human-level control through deep reinforcement learning［J］. Nature, 2015,518(7540):529-533.
［14］SILVER D, HUANG A, MADDISON C J, et al. Mastering the game of Go with deep neural networks and tree search［J］. Nature, 2016,529(7587):484-489.
［15］VAN HASSELT H, GUEZ A, SILVER D. Deep reinforcement learning with double Q-Learning［C］// Proceedings of the 30th AAAI Conference on Artificial Intelligence. 2016:2094-2100.
［16］WANG Z Y, SCHAUL T, HESSEL M, et al. Dueling network architectures for deep reinforcement learning［C］// Proceedings of the 33rd International Conference on Machine Learning. 2016:1995-2003.
［17］WANG Z, BOVIK A C, SHEIKH H R, et al. Image quality assessment: From error visibility to structural similarity［J］. IEEE Transactions on Image Processing, 2004,13(4):600-612.
［18］YANG C Y, MA C, YANG M H. Single-image super-resolution: A benchmark［C］// Proceedings of the 2014 European Conference on Computer Vision. 2014:372-386
［19］LAI W S, HUANG J B, HU Z, et al. A comparative study for single image blind deblurring［C］// Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition. 2016:1701-1709.
［20］KRIZHEVSKY A, HINTON G. Learning Multiple Layers of Features from Tiny Images［R］. University of Toronto, 2009.
［21］HU J, SHEN L, SUN G. Squeeze-and-excitation networks［C］// Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2018:7132-7141.
［22］SIMONYAN K, ZISSERMAN A. Very deep convolutional networks for large-scale image recognition［J］. arXiv preprint arXiv:1409.1556, 2014.
［23］SZEGEDY C, LIU W, JIA Y Q, et al. Going deeper with convolutions［C］// Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition. 2015:1-9.
［24］HE K M, ZHANG X Y, REN S Q, et al. Deep residual learning for image recognition［C］// Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition. 2016:770-778.
［25］KINGMA D P, BA J. Adam: A method for stochastic optimization［J］. arXiv preprint arXiv:1412.6980, 2014.

[1]	QI Xian, LIU Daming, CHANG Jiaxin. Multi-view 3D Reconstruction Based on Improved Self-attention Mechanism [J]. Computer and Modernization, 2024, 0(11): 106-112.
[2]	CHEN Kai1, LI Yiting1, 2, QUAN Huafeng1. A River Discarded Bottles Detection Method Based on Improved YOLOv8 [J]. Computer and Modernization, 2024, 0(11): 113-120.
[3]	YANG Jun1, HU Wei1, ZHU Wenfu2. Visual SLAM Loop Closure Detection Algorithm Based on Improved MobileNetV3 [J]. Computer and Modernization, 2024, 0(10): 21-26.
[4]	WANG Yingying, HAO Xiao. Fine-grained Image Classification Based on Res2Net and Recursive Gated Convolution [J]. Computer and Modernization, 2024, 0(10): 74-79.
[5]	SHI Xingyu1, LI Qiang2, ZHUANG Li3, LIANG Yi3, WANG Qiulin3, CHEN Kai3, WU Chenzhou3, CHANG Sheng1. Object Detection Models Distillation Technique for Industrial Deployment [J]. Computer and Modernization, 2024, 0(10): 93-99.
[6]	ZHANG Ze1, ZHANG Jianquan2, 3, ZHOU Guopeng2, 3. Camera Module Defect Detection Based on Improved YOLOv8s [J]. Computer and Modernization, 2024, 0(09): 107-113.
[7]	CHENG Yazi1, LEI Liang1, 2, CHEN Han1, ZHAO Yiran1. Multi-scale Depth Fusion Monocular Depth Estimation Based on Transposed Attention [J]. Computer and Modernization, 2024, 0(09): 121-126.
[8]	ZHAO Huarui. STRL: Testing Algorithm Based on Reinforcement Learning#br# #br# [J]. Computer and Modernization, 2024, 0(08): 5-10.
[9]	CHENG Meng, LI Hao. Improved Deciduous Tree Nest Detection Method Based on YOLOv5s [J]. Computer and Modernization, 2024, 0(08): 24-29.
[10]	WANG Mengxi, LI Jun. Review of Fall Detection Technologies for Elderly [J]. Computer and Modernization, 2024, 0(08): 30-36.
[11]	SHI Xianwei1, FAN Xin2. Semantic Segmentation of Video Frame Scene Based on Lightweight [J]. Computer and Modernization, 2024, 0(08): 49-53.
[12]	XU Xin’ai, LI Gang. An Image Generation Method of Classroom Expression Images [J]. Computer and Modernization, 2024, 0(08): 88-91.
[13]	GAO Shuaipeng, WANG Yifan. Survey on Group-level Emotion Recognition in Images [J]. Computer and Modernization, 2024, 0(08): 98-107.
[14]	HUANG Wendong, WANG Yifan. Survey on Multimodal Information Processing and Fusion Based on Modal Categories [J]. Computer and Modernization, 2024, 0(07): 47-62.
[15]	WU Li1, ZHANG Zhenghao2, GE Caicheng2, YU Jun2. Lane Line Detection Algorithm Based on Improved SCNN Network [J]. Computer and Modernization, 2024, 0(07): 87-92.