Proximal Policy Optimization Tutorial (Part 1/2: Actor-Critic Method)

Let’s code from scratch a Reinforcement Learning football agent!

Chintan Trivedi
Towards Data Science
7 min readAug 12, 2019

--

Welcome to the first part of a math and code turorial series. I’ll be showing how to implement a Reinforcement Learning algorithm known as Proximal Policy Optimization (PPO) for teaching an AI agent how to play football/soccer. By the end…

--

--

AI, ML for Digital Games Researcher. Founder at DG AI Research Lab, India. Visit our publication homepage medium.com/deepgamingai for weekly AI & Games content!