A Defense Policy Learning Algorithm for Power Information Networks Based on Optimal Initial Value Q-learning
JING Dong-sheng1, YANG Yu1, XUE Jing-song1, ZHU Fei2, WU Wen2
Computer and Modernization (计算机与现代化). 2018, (11): 18. DOI: 10.3969/j.issn.1006-2475.2018.11.004