Making Sense of Big Data

Meta-Policy Gradients: A Survey

Automated Hyperparameter Tuning & RL Objective Discovery

Robert Lange

Published in

Towards Data Science

18 min readJan 2, 2021

Most learning curves plateau. After an initial absorption of statistical regularities, the system saturates and we reach the limits of hand-crafted learning rules and inductive biases. In the worst case, we start to overfit. But what if the learning…

Making Sense of Big Data

Meta-Policy Gradients: A Survey

Automated Hyperparameter Tuning & RL Objective Discovery

Written by Robert Lange