Light on Math Machine Learning

Intuitive Guide to Understanding KL Divergence

Concept Grounding

What is a Distribution

What is an event?

Back to KL divergence

Problem we’re trying to solve

Intuition: KL divergence is a way of measuring how closely two distributions match (e.g. threads)

Let’s change a few things in the example

First try: Model this with a uniform distribution

Second try: Model this with a binomial distribution

Breaking down the equation

Mean and variance of the binomial distribution

Back to modeling

Let’s summarize what we have

How do we quantitatively decide which one is the best?

Intuitive breakdown of the KL divergence

Computing KL divergence

KL Divergence with respect to Binomial Mean

Conclusion

Fun with KL divergence

Reference

Author (Manning/Packt) | DataCamp instructor | Senior Data Scientist @ QBE | PhD. YouTube: @DeepLearningHero, Twitter: @thush89, LinkedIn: thushan.ganegedara