Breaking Down Richard Sutton’s Policy Gradient With PyTorch And Lunar Lander
Published in
8 min readOct 16, 2019
In the early 2000s, a few papers have been published about the policy gradient methods (in one form or another) in reinforcement learning. Most notable of all was “Policy Gradient Methods for Reinforcement Learning with Function Approximation” by Richard Sutton et al.