Data Science

Probability Axioms in Pictures

The three Kolmogorov axioms of probability presented through pictures.

Jan 13, 2021

3 min read

The three Kolmogorov axioms underpin probability theory. Before we explore the axioms, some common probability language will be introduced.

An experiment is a process of observation where the output cannot be predicted with certainty due to random effects. Example: rolling a dice.

A trial is a single occurrence of an experiment. Multiple trials of an experiment can form a new experiment. Example: an experiment consists of rolling a dice twice and the trial is one instance of the twice-rolled dice experiment.

An outcome is an observed output of a trial. Examples: rolling a 3 or rolling a 1 and 5 in the twice-rolled dice experiment.

The sample space Ω is the set of all possible outcomes of an experiment. Examples: each side of the dice or ordered pairs of twice-rolled dice experiments.

Trials produce outcomes. Image by author.

An event A is a subset of outcomes in the sample space Ω. Examples: rolling a side less than 4, rolling an even number or rolling a 2 and then a 3.

An event is a set of outcomes. Image by author.

The operations of set theory apply to events. The union of events is the set of outcomes in A, B or both. The intersection of events is the set of outcomes in both A and B.

Union and intersection of two events. Image by author.

The event consisting of no outcomes is called the null event. If events A and B have no outcomes in common then A and B are disjoint events (mutually exclusive).

A Probability measure P is a function that assigns a real number to each measurable event. A probability measure must follow the axioms of probability.

A probability measure maps events to real numbers that must follow the axioms of probability. Image by author.

Now we will explore the three axioms of probability.

First axiom: non-negative, real number

The probability of an event is a non-negative real number.

A probability must be a non-negative real number. Image by author.

This axiom means that the smallest probability of an event is zero. It does not specify an upper bound, however a probability theorem does.

Second axiom: unitarity

The probability that at least one outcome in the sample space will occur is 1.

The probability of an outcome occuring is 1. Image by author.

This axiom means that it is certain that an outcome will occur from observing an experiment.

Third axiom: countable additivity

If there is an infinite set of disjoint events in a sample space Ω then the probability of the union of events is equal to the sum of probabilities of all events.

The probability of the union of disjoint events is equal to the sum of probabilities of all events. Image by author.

This axiom forms a relationship between a set of disjoint events in a sample space and the individual probabilities of each event. A probability theorem shows how a finite set of disjoint events can be represented as an infinite set too.

The axioms of probability can subsequently be used to derive the theorems of probability.

Written By

Ash Bellett

See all from Ash Bellett

Topics:

Data Science, Mathematics, Probability, Statistics

Share this article:

Related Articles

Implementing Convolutional Neural Networks in TensorFlow
Artificial Intelligence

Step-by-step code guide to building a Convolutional Neural Network

Shreya Rao

August 20, 2024

6 min read
How to Forecast Hierarchical Time Series
Artificial Intelligence

A beginner’s guide to forecast reconciliation

Dr. Robert Kübler

August 20, 2024

13 min read
Hands-on Time Series Anomaly Detection using Autoencoders, with Python
Data Science

Here’s how to use Autoencoders to detect signals with anomalies in a few lines of…

Piero Paialunga

August 21, 2024

12 min read
Solving a Constrained Project Scheduling Problem with Quantum Annealing
Data Science

Solving the resource constrained project scheduling problem (RCPSP) with D-Wave’s hybrid constrained quadratic model (CQM)

Luis Fernando PÉREZ ARMAS, Ph.D.

August 20, 2024

28 min read
Back To Basics, Part Uno: Linear Regression and Cost Function
Data Science

An illustrated guide on essential machine learning concepts

Shreya Rao

February 3, 2023

6 min read
Must-Know in Statistics: The Bivariate Normal Projection Explained
Data Science

Derivation and practical examples of this powerful concept

Luigi Battistoni

August 14, 2024

7 min read
How to Make the Most of Your Experience as a TDS Author
Data Science

A quick guide to our resources and FAQ

TDS Editors

September 13, 2022

4 min read
Our Columns
Data Science

Columns on TDS are carefully curated collections of posts on a particular idea or category…

TDS Editors

November 14, 2020

4 min read
Optimizing Marketing Campaigns with Budgeted Multi-Armed Bandits
Data Science

With demos, our new solution, and a video

Vadim Arzamasov

August 16, 2024

10 min read

Some areas of this page may shift around if you resize the browser window. Be sure to check heading and document order.