Striking a Balance between Exploring and Exploiting

The dilemma of exploration and exploitation in reinforcement learning

Jingles (Hong Jing)
Towards Data Science
5 min readAug 14, 2019

--

The exploration-exploitation dilemma is faced by our agents while learning to play the game tic-tac-toe [Medium article]. This dilemma is a fundamental problem in reinforcement learning as well as in real life which we frequently face when choosing between…

--

--

Alibaba PhD in machine learning | write about machine learning, neuroscience, healthcare & blockchain | reach me at linkedin.com/in/jingles