A Defense Policy Learning Algorithm for Power Information Networks Based on Optimal Initial Value Q-learning
JING Dong-sheng1, YANG Yu1, XUE Jing-song1, ZHU Fei2, WU Wen2
Computer and Modernization . 2018, (11): 18 .  DOI: 10.3969/j.issn.1006-2475.2018.11.004