Making Sense of Big Data

Meta-Policy Gradients: A Survey

Automated Hyperparameter Tuning & RL Objective Discovery

Robert Lange
Towards Data Science
18 min readJan 2, 2021

--

Most learning curves plateau. After an initial absorption of statistical regularities, the system saturates and we reach the limits of hand-crafted learning rules and inductive biases. In the worst case, we start to overfit. But what if the learning…

--

--