Proximal Policy Optimization Tutorial (Part 1/2: Actor-Critic Method)

Let’s code from scratch a Reinforcement Learning football agent!

Published in Towards Data Science

7 min read · Aug 12, 2019

Welcome to the first part of a math and code tutorial series. I’ll show how to implement a Reinforcement Learning algorithm known as Proximal Policy Optimization (PPO) to teach an AI agent to play football/soccer. By the end…

Written by Chintan Trivedi