Reinforcement Learning — The Value Function

A reinforcement learning algorithm for agents to learn tic-tac-toe using the value function

Jingles (Hong Jing)
Towards Data Science
7 min read · Jun 30, 2019

Intuition

After a long day at work, you are deciding between two options: head home and write a Medium article, or hang out with friends at a bar. If you choose to hang out with friends, your friends…

Alibaba PhD in machine learning | write about machine learning, neuroscience, healthcare & blockchain | reach me at linkedin.com/in/jingles