A Spark Streaming Parameter Optimization Method Based on Deep Reinforcement Learning

Abstract

Abstract: Spark Streaming is the mainstream open source distributed stream analysis framework, and its performance optimization is one of the current research hotspots. In Spark Streaming performance optimization, configuration parameter optimization in business scenarios is an important factor in its performance improvement. In the Spark Streaming system, there are more than 200 configurable parameters, which requires high experience for parameter tuning personnel. Non optimized parameter configuration will affect the execution performance of streaming jobs. Therefore, in view of the parameter configuration optimization problem of Spark Streaming, a Spark Streaming parameter optimization method based on deep reinforcement learning (DQN-SSPO) is proposed, which converts the parameter optimization configuration problem of Spark Streaming into the problem of obtaining the maximum return in deep reinforcement learning model training, and a weighted state space transfer method is proposed to increase the probability of high feedback rewards for model training. Experiments on three typical streaming analysis tasks show that the performance of streaming jobs on Spark Streaming after parameter optimization is reduced by 27.93% in total scheduling time and 42% in total processing time.

Key words: Spark Streaming, performance optimization, deep reinforcement learning, parameter tuning

LIU Lu, SHEN Guo-wei, GUO Chun, CUI Yun-he, JIANG Chao-hui, WU Da-yong. A Spark Streaming Parameter Optimization Method Based on Deep Reinforcement Learning[J]. Computer and Modernization, 2021, 0(10): 49-56.

References

［1］ TOSHNIWAL A, TANEJA S, SHUKLA A, et al. Storm@Twitter［C］// Proceedings of the 2014 ACM SIGMOD International Conference on Management of Data. 2014:147-156.
［2］ CARBONE P, EWEN S, HARIDI S, et al. Apache FlinkTM: Stream and batch processing in a single engine［J］. Bulletin of the Technical Committee on Data Engineering, 2015,38(4):28-38.
［3］ ZAHARIA M, DAS T, LI H, et al. Discretized streams: Fault-tolerant streaming computation at scale［C］// Proceedings of the 24th ACM Symposium on Operating Systems Principles. 2013:423-438.
［4］ ZAHARIA M, CHOWDHURY M, FRANKLIN M J, et al. Spark: Cluster computing with working sets［C］// Proceedings of the 2nd USENIX Conference on Hot Topics in Cloud Computing. 2010, Article No.10.
［5］ KROSS J, KRCMAR H. Modeling and simulating Apache Spark streaming applications［J］. Softwaretechnik-Trends, 2016,36(4):1-3.
［6］ VENKATARAMAN S, PANDA A, OUSTERHOUT K, et al. Drizzle: Fast and adaptable stream processing at scale［C］// Proceedings of the 26th Symposium on Operating Systems Principles. 2017:374-389.
［7］ AJILA T, MAJUMDAR S. Data driven priority scheduling on a Spark streaming system［C］// Proceedings of the 19th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGRID). 2019:561-568.
［8］ BEI Z D, YU Z B, ZHANG H L, et al. RFHOC: A random-forest approach to auto-tuning Hadoop’s configuration［J］. IEEE Transactions on Parallel and Distributed Systems, 2016,27(5):1470-1483.
［9］ LIAO G D, DATTA K, WILLKE T L. Gunther: Search-based auto-tuning of MapReduce［C］// Proceedings of the 2013 European Conference on Parallel Processing. 2013:406-419.
［10］DING X A, LIU Y, QIAN D P. Jellyfish: Online performance tuning with adaptive configuration and elastic container in Hadoop yarn［C］// Proceedings of the 2015 IEEE 21st International Conference on Parallel and Distributed Systems (ICPADS). 2015:831-836.

［11］WANG K W, LIN X L, TANG W Z. Predator: An experience guided configuration optimizer for Hadoop MapReduce［C］// Proceedings of the 2012 IEEE 4th International Conference on Cloud Computing Technology and Science. 2012:419-426.

［12］WANG G L, XU J G, HE B. A novel method for tuning configuration parameters of spark based on machine learning［C］// Proceedings of the 2016 IEEE 18th International Conference on High Performance Computing and Communications, IEEE 14th International Conference on Smart City, IEEE 2nd International Conference on Data Science and Systems (HPCC/SmartCity/DSS). 2016:586-593.
［13］PRASAD B R, AGARWAL S. Performance analysis and optimization of Spark streaming applications through effective control parameters tuning［M］// Progress in Intelligent Computing Techniques: Theory, Practice, and Applications. Springer, Singapore, 2018:99-110.
［14］CHENG D Z, CHEN Y, ZHOU X B, et al. Adaptive scheduling of parallel jobs in Spark streaming［C］// Proceedings of the 2017 IEEE Conference on Computer Communications. 2017. DOI: 10.1109/INFOCOM.2017.8057206.
［15］SUTTON R S, MCALLESTER D A, SINGH S P, et al. Policy gradient methods for reinforcement learning with function approximation［C］// Proceedings of the 2000 Advances in Neural Information Processing Systems. 2000:1057-1063.
［16］MNIH V, KAVUKCUOGLU K, SILVER D, et al. Playing Atari with deep reinforcement learning［J］. arXiv preprint arXiv:1312.5602, 2013.
［17］崔晓龙,张敏,刘祥,等. Spark作业性能建模及参数优化［J］. 实验技术与管理, 2021,38(3):146-152.
［18］HIRAMAN B R, VIRESH M C, ABHIJEET C K. A study of Apache Kafka in big data stream processing［C］// Proceedings of the 2018 International Conference on Information, Communication, Engineering and Technology (ICICET). 2018. DOI: 10.1109/ICICET.2018.8533771.
［19］VAN HASSELT H, GUEZ A, SILVER D. Deep reinforcement learning with double q-learning［C］// Proceedings of the 30th AAAI Conference on Artificial Intelligence. 2016:2094-2100.
［20］SHARAFALDIN I, LASHKARI A H, GHORBANI A A. Toward generating a new intrusion detection dataset and intrusion traffic characterization［C］// Proceedings of the 4th International Conference on Information Systems Security and Privacy (ICISSP). 2018:108-116.
［21］詹剑锋,高婉铃,王磊,等. BigDataBench:开源的大数据系统评测基准［J］. 计算机学报, 2016,39(1):196-211.
［22］陈侨安,李峰,曹越,等. 基于运行数据分析的Spark任务参数优化［J］. 计算机工程与科学, 2016,38(1):11-19.
［23］阮树骅,潘梵梵,陈兴蜀,等. 一种Spark作业配置参数智能优化方法［J］. 工程科学与技术, 2020,52(1):191-197.
［24］GAO Z P, WANG T, WANG Q, et al. Execution time prediction for Apache Spark［C］// Proceedings of the 2018 International Conference on Computing and Big Data. 2018:47-51.

[1]	WANG Jian-ming1, WANG Xin1, LI Yang-hui2, WANG Dian-long1. Path Planning of Parking Robot Based on Improved D3QN Algorithm [J]. Computer and Modernization, 2024, 0(03): 7-14.
[2]	LI Peng, XU Luo. An Autonomous Navigation Method for Intelligent Vehicles in Urban Battlefield [J]. Computer and Modernization, 2024, 0(01): 92-98.
[3]	ZHANG Guo-you, SONG Shi-feng. Traffic Light Control Optimization Based On D3QN [J]. Computer and Modernization, 2023, 0(07): 30-35.
[4]	LAI Jian-bin, FENG Gang. An Experience Replay Strategy Based on Mixed Samples [J]. Computer and Modernization, 2023, 0(06): 33-38.
[5]	DING Zhong-lin, LI Yang, CAO Wei, TAN Yu-hao, XU Bo. Deep Q-learning Based Task Offloading in Power IoT [J]. Computer and Modernization, 2022, 0(11): 75-80.
[6]	XU Xian-hui, WANG Shu-ying, ZENG Wen-qu. ElasticSearch Index Optimization Strategy for Engineering Data Retrieval [J]. Computer and Modernization, 2022, 0(02): 79-84.
[7]	WU Shui-ming, JI Zhi-yuan, WANG Zhen-yu, JING Dong-sheng. Power Information Network Intrusion Detection Algorithm Based on Dueling-DDQN [J]. Computer and Modernization, 2021, 0(12): 43-47.
[8]	WANG Peng-yong, CHEN Gong-tao, ZHAO Jiang-shuo. Decision-making Method for Airport Taxi Drivers Based on Deep Reinforcement Learning [J]. Computer and Modernization, 2020, 0(08): 94-99.
[9]	YUAN Wen， LIU Hui-yi . Gait Optimization of Humanoid Robot Based on Deep Q Network [J]. Computer and Modernization, 2019, 0(04): 47-.
[10]	PENG Chen, HAN Li-xin. Deep Reinforcement Learning for Step Counting Approach [J]. Computer and Modernization, 2019, 0(01): 63-.
[11]	QI Yue1，2，3, HUANG Shuo-hua1. Portfolio Management Based on DDPG Algorithm of Deep Reinforcement Learning [J]. Computer and Modernization, 2018, 0(05): 93-.
[12]	DONG Xue， ZHANG De-ping. Global Sensitivity Analysis and Optimization of Submarine Combat Effectiveness #br# Based on Extreme Learning Machine [J]. Computer and Modernization, 2018, 0(05): 86-.
[13]	CHU Jia-qi1,2, LIU Cong-jun1,2. Application of WeChat Public Service Platform in E-government [J]. Computer and Modernization, 2018, 0(02): 33-.
[14]	ZHANG Yan-nian . Wireless Channel Back-off Algorithm Optimization Design #br# Based on RBF Neural Network [J]. Computer and Modernization, 2016, 0(12): 16-21.
[15]	QI Peng-nian, ZHU Jin, HAO Jun-hui, XU Feng-ping. Hadoop Speculation Execution Algorithm in Heterogeneous Environments [J]. Computer and Modernization, 2015, 0(8): 80-83,88.