Natural Policy Gradient
-
The journey from REINFORCE to the go-to algorithm in continuous control
16 min read -
Traditional policy gradient methods are inherently flawed. Natural gradients converge quicker and better, forming the…
17 min read