Computer and Modernization

Previous Articles     Next Articles

Gait Optimization of Humanoid Robot Based on Deep Q Network

  

  1. (College of Computer and Information, Hohai University, Nanjing 210098, China)
  • Received:2018-09-18 Online:2019-04-26 Published:2019-04-30

Abstract: In order to realize the fast and stable walking of humanoid robot, and under the condition that the effective parameter combination is satisfied, a walking parameter training algorithm based on deep reinforcement learning is proposed to optimize the gait of humanoid robot. First of all, we capture the robot gait model parameters from the environment as the input of DQN. And then, DQN is used to fit the robot state-action value function. At last, by action selection strategy, we choose the gait of a robot to perform current action, at the same time produce reward function to achieve the aim of updating DQN. By selecting NAO robot as the experimental object and conducting experiments on the RoboCup3D simulation platform, the results show that using this algorithm, NAO robot can achieve stable bipedal walking.

Key words: humanoid robot, deep reinforcement learning, DQN, gait optimization, RoboCup3D

CLC Number: