一种基于深度强化学习的Spark Streaming参数优化方法

计算机与现代化 ›› 2021, Vol. 0 ›› Issue (10): 49-56.

一种基于深度强化学习的Spark Streaming参数优化方法

(1.贵州大学计算机科学与技术学院，贵州贵阳550025;2.贵州省软件工程与信息安全特色重点实验室，贵州贵阳550025;
3.科大讯飞股份有限公司，安徽合肥230011)

出版日期:2021-10-14 发布日期:2021-10-14
作者简介:刘露(1996—)，女，贵州贵阳人，硕士研究生，研究方向：大数据安全，E-mail: gzuliulu@163.com; 通信作者：申国伟(1986—)，男，湖南邵东人，副教授，硕士生导师，博士，研究方向：网络与信息安全，大数据，E-mail: gwshen@gzu.edu.cn; 郭春(1986—)，男，湖南邵阳人，副教授，硕士生导师，博士，研究方向：网络与信息安全，E-mail: E-mail: gc_gzedu@163.com; 崔允贺(1987—)，男，山东济宁人，副教授，硕士生导师，博士，研究方向：SDN，网络安全，E-mail: yhcui@gzu.edu.cn; 蒋朝惠(1965—)，男，四川广安人，教授，硕士生导师，硕士，研究方向：网络与信息安全，E-mail: jiangchaohui@126.com; 伍大勇(1977—)，男，黑龙江牡丹江人，高级工程师，博士，研究方向：自然语言处理，数据挖掘，E-mail: dywu2@iflytek.com。
基金资助:
国家自然科学基金资助项目(62062022); 贵州省科学技术基金资助项目(黔科合基础［2017］1051); 国家重点研发计划项目(2018YFC0807701)

A Spark Streaming Parameter Optimization Method Based on Deep Reinforcement Learning

(1. College of Computer Science and Technology, Guizhou University, Guiyang 550025, China;
2. Guizhou Provincial Key Laboratory of Software Engineering and Information Security, Guiyang 550025, China;
3. Iflytek Co., Ltd., Hefei 230011, China)

Online:2021-10-14 Published:2021-10-14

摘要/Abstract

摘要： Spark Streaming作为主流的开源分布式流分析框架，性能优化是目前的研究热点之一。在Spark Streaming性能优化中，业务场景下的配置参数优化是其性能提升的重要因素。在Spark Streaming系统中，可配置的参数有200多个，对参数调优人员的经验要求较高，未经优化的参数配置会影响流作业执行性能。因此，针对Spark Streaming的参数配置优化问题，提出一种基于深度强化学习的Spark Streaming参数优化方法（DQN-SSPO），将Spark Streaming参数优化配置问题转化为深度强化学习模型训练中的最大回报获得问题，并提出权重状态空间转移方法来增加模型训练获得高反馈奖励的概率。在3种典型的流分析任务上进行实验，结果表明经参数优化后Spark Streaming上的流作业性能在总调度时间上平均缩减27.93%，在总处理时间上平均缩减42%。

关键词: Spark Streaming, 性能优化, 深度强化学习, 参数调优

Abstract: Spark Streaming is the mainstream open source distributed stream analysis framework, and its performance optimization is one of the current research hotspots. In Spark Streaming performance optimization, configuration parameter optimization in business scenarios is an important factor in its performance improvement. In the Spark Streaming system, there are more than 200 configurable parameters, which requires high experience for parameter tuning personnel. Non optimized parameter configuration will affect the execution performance of streaming jobs. Therefore, in view of the parameter configuration optimization problem of Spark Streaming, a Spark Streaming parameter optimization method based on deep reinforcement learning (DQN-SSPO) is proposed, which converts the parameter optimization configuration problem of Spark Streaming into the problem of obtaining the maximum return in deep reinforcement learning model training, and a weighted state space transfer method is proposed to increase the probability of high feedback rewards for model training. Experiments on three typical streaming analysis tasks show that the performance of streaming jobs on Spark Streaming after parameter optimization is reduced by 27.93% in total scheduling time and 42% in total processing time.

Key words: Spark Streaming, performance optimization, deep reinforcement learning, parameter tuning

刘露, 申国伟, 郭春, 崔允贺, 蒋朝惠, 伍大勇. 一种基于深度强化学习的Spark Streaming参数优化方法[J]. 计算机与现代化, 2021, 0(10): 49-56.

LIU Lu, SHEN Guo-wei, GUO Chun, CUI Yun-he, JIANG Chao-hui, WU Da-yong. A Spark Streaming Parameter Optimization Method Based on Deep Reinforcement Learning[J]. Computer and Modernization, 2021, 0(10): 49-56.

参考文献

［1］ TOSHNIWAL A, TANEJA S, SHUKLA A, et al. Storm@Twitter［C］// Proceedings of the 2014 ACM SIGMOD International Conference on Management of Data. 2014:147-156.
［2］ CARBONE P, EWEN S, HARIDI S, et al. Apache FlinkTM: Stream and batch processing in a single engine［J］. Bulletin of the Technical Committee on Data Engineering, 2015,38(4):28-38.
［3］ ZAHARIA M, DAS T, LI H, et al. Discretized streams: Fault-tolerant streaming computation at scale［C］// Proceedings of the 24th ACM Symposium on Operating Systems Principles. 2013:423-438.
［4］ ZAHARIA M, CHOWDHURY M, FRANKLIN M J, et al. Spark: Cluster computing with working sets［C］// Proceedings of the 2nd USENIX Conference on Hot Topics in Cloud Computing. 2010, Article No.10.
［5］ KROSS J, KRCMAR H. Modeling and simulating Apache Spark streaming applications［J］. Softwaretechnik-Trends, 2016,36(4):1-3.
［6］ VENKATARAMAN S, PANDA A, OUSTERHOUT K, et al. Drizzle: Fast and adaptable stream processing at scale［C］// Proceedings of the 26th Symposium on Operating Systems Principles. 2017:374-389.
［7］ AJILA T, MAJUMDAR S. Data driven priority scheduling on a Spark streaming system［C］// Proceedings of the 19th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGRID). 2019:561-568.
［8］ BEI Z D, YU Z B, ZHANG H L, et al. RFHOC: A random-forest approach to auto-tuning Hadoop’s configuration［J］. IEEE Transactions on Parallel and Distributed Systems, 2016,27(5):1470-1483.
［9］ LIAO G D, DATTA K, WILLKE T L. Gunther: Search-based auto-tuning of MapReduce［C］// Proceedings of the 2013 European Conference on Parallel Processing. 2013:406-419.
［10］DING X A, LIU Y, QIAN D P. Jellyfish: Online performance tuning with adaptive configuration and elastic container in Hadoop yarn［C］// Proceedings of the 2015 IEEE 21st International Conference on Parallel and Distributed Systems (ICPADS). 2015:831-836.

［11］WANG K W, LIN X L, TANG W Z. Predator: An experience guided configuration optimizer for Hadoop MapReduce［C］// Proceedings of the 2012 IEEE 4th International Conference on Cloud Computing Technology and Science. 2012:419-426.

［12］WANG G L, XU J G, HE B. A novel method for tuning configuration parameters of spark based on machine learning［C］// Proceedings of the 2016 IEEE 18th International Conference on High Performance Computing and Communications, IEEE 14th International Conference on Smart City, IEEE 2nd International Conference on Data Science and Systems (HPCC/SmartCity/DSS). 2016:586-593.
［13］PRASAD B R, AGARWAL S. Performance analysis and optimization of Spark streaming applications through effective control parameters tuning［M］// Progress in Intelligent Computing Techniques: Theory, Practice, and Applications. Springer, Singapore, 2018:99-110.
［14］CHENG D Z, CHEN Y, ZHOU X B, et al. Adaptive scheduling of parallel jobs in Spark streaming［C］// Proceedings of the 2017 IEEE Conference on Computer Communications. 2017. DOI: 10.1109/INFOCOM.2017.8057206.
［15］SUTTON R S, MCALLESTER D A, SINGH S P, et al. Policy gradient methods for reinforcement learning with function approximation［C］// Proceedings of the 2000 Advances in Neural Information Processing Systems. 2000:1057-1063.
［16］MNIH V, KAVUKCUOGLU K, SILVER D, et al. Playing Atari with deep reinforcement learning［J］. arXiv preprint arXiv:1312.5602, 2013.
［17］崔晓龙,张敏,刘祥,等. Spark作业性能建模及参数优化［J］. 实验技术与管理, 2021,38(3):146-152.
［18］HIRAMAN B R, VIRESH M C, ABHIJEET C K. A study of Apache Kafka in big data stream processing［C］// Proceedings of the 2018 International Conference on Information, Communication, Engineering and Technology (ICICET). 2018. DOI: 10.1109/ICICET.2018.8533771.
［19］VAN HASSELT H, GUEZ A, SILVER D. Deep reinforcement learning with double q-learning［C］// Proceedings of the 30th AAAI Conference on Artificial Intelligence. 2016:2094-2100.
［20］SHARAFALDIN I, LASHKARI A H, GHORBANI A A. Toward generating a new intrusion detection dataset and intrusion traffic characterization［C］// Proceedings of the 4th International Conference on Information Systems Security and Privacy (ICISSP). 2018:108-116.
［21］詹剑锋,高婉铃,王磊,等. BigDataBench:开源的大数据系统评测基准［J］. 计算机学报, 2016,39(1):196-211.
［22］陈侨安,李峰,曹越,等. 基于运行数据分析的Spark任务参数优化［J］. 计算机工程与科学, 2016,38(1):11-19.
［23］阮树骅,潘梵梵,陈兴蜀,等. 一种Spark作业配置参数智能优化方法［J］. 工程科学与技术, 2020,52(1):191-197.
［24］GAO Z P, WANG T, WANG Q, et al. Execution time prediction for Apache Spark［C］// Proceedings of the 2018 International Conference on Computing and Big Data. 2018:47-51.

[1]	李爽1, 2, 叶宁1, 2, 徐康1, 2, 王甦1, 王汝传1, 2. 面向智慧养老的边缘计算卸载方法[J]. 计算机与现代化, 2024, 0(06): 95-102.
[2]	王健铭1, 王欣1, 李养辉2, 王殿龙1. 基于改进D3QN算法的泊车机器人路径规划[J]. 计算机与现代化, 2024, 0(03): 7-14.
[3]	李鹏, 徐珞. 一种面向城市战场的智能车自主导航方法[J]. 计算机与现代化, 2024, 0(01): 92-98.
[4]	张国有, 宋世峰. 基于D3QN的交通灯控制优化[J]. 计算机与现代化, 2023, 0(07): 30-35.
[5]	赖建彬, 冯刚. 一种基于混合样本的经验回放策略[J]. 计算机与现代化, 2023, 0(06): 33-38.
[6]	丁忠林, 李洋, 曹委, 谈宇浩, 徐波. 基于深度Q学习的电力物联网任务卸载研究[J]. 计算机与现代化, 2022, 0(11): 75-80.
[7]	许贤慧, 王淑营, 曾文驱. 面向工程数据检索的ElasticSearch索引优化策略[J]. 计算机与现代化, 2022, 0(02): 79-84.
[8]	吴水明, 吉志远, 王震宇, 景栋盛. 基于Dueling-DDQN的电力信息网络入侵检测算法[J]. 计算机与现代化, 2021, 0(12): 43-47.
[9]	王鹏勇, 陈龚涛, 赵江烁. 基于深度强化学习的机场出租车司机决策方法[J]. 计算机与现代化, 2020, 0(08): 94-99.
[10]	袁雯，刘惠义. 基于深度Q网络的仿人机器人步态优化[J]. 计算机与现代化, 2019, 0(04): 47-.
[11]	彭琛,韩立新. 基于深度强化学习的计步方法[J]. 计算机与现代化, 2019, 0(01): 63-.
[12]	齐岳1，2，3，黄硕华1. 基于深度强化学习DDPG算法的投资组合管理[J]. 计算机与现代化, 2018, 0(05): 93-.
[13]	楚嘉琦1,2,刘从军1,2. 微信公众服务平台在电子政务中的应用[J]. 计算机与现代化, 2018, 0(02): 33-.
[14]	张延年. 基于RBF神经网络的无线信道退避算法优化设计[J]. 计算机与现代化, 2016, 0(12): 16-21.
[15]	祁鹏年,朱晋,郝君慧,许丰平. 异构环境下Hadoop推测执行算法[J]. 计算机与现代化, 2015, 0(8): 80-83,88.