Unsupervised Machine Learning: Clustering Analysis

Published in

Towards Data Science

12 min readMar 6, 2019

Introduction to Unsupervised Learning

Up to know, we have only explored supervised Machine Learning algorithms and techniques to develop models where the data had labels previously known. In other words, our data had some target variables with specific values that we used to train our models.

However, when dealing with real-world problems, most of the time, data will not come with predefined labels, so we will want to develop machine learning models that can classify correctly this data, by finding by themselves some commonality in the features, that will be used to predict the classes on new data.

Unsupervised Learning Analysis Process

The overall process that we will follow when developing an unsupervised learning model can be summarized in the following chart:

Unsupervised learning main applications are:

Segmenting datasets by some shared atributes.
Detecting anomalies that do not fit to any group.
Simplify datasets by aggregating variables with similar atributes.

Create an account to read the full story.

The author made this story available to Medium members only.
If you’re new to Medium, create a new account to read this story on us.

Continue in app

Or, continue in mobile web

Sign up with Google

Sign up with Facebook

Sign up with email

Already have an account? Sign in

Published in Towards Data Science

Last published 1 hour ago

Your home for data science and AI. The world’s leading publication for data science, data analytics, data engineering, machine learning, and artificial intelligence professionals.

Written by Victor Roman

Industrial Engineer and passionate about 4.0 Industry. My goal is to encourage people to learn and explore its technologies and their infinite posibilites.

Responses (3)

What are your thoughts?

Also publish to my profile

Singhanimesh

about 5 years ago

Please suggest a few books to study more in-depth about unsupervised machine learning.
Thanks in advance!

--

nan wint yee myint

over 5 years ago

Divisive

top-don

--

nan wint yee myint

over 5 years ago

Divisive

top-down

--

More from Victor Roman and Towards Data Science

Algoritmos Naive Bayes: Fudamentos e Implementación

In

Ciencia y Datos

by

Victor Roman

Algoritmos Naive Bayes: Fudamentos e Implementación

¡Conviértete en un maestro de uno de los algoritmos mas usados en clasificación!

Apr 25, 2019

By implementing projects directly, you learn twice as much.

In

Towards Data Science

by

Sarah Lea

5 Simple Projects to Start Today: A Learning Roadmap for Data Engineering

Start with 5 practical projects to lay the foundation for your data engineering roadmap.

5d ago

Deep Learning for Outlier Detection on Tabular and Image Data

In

Towards Data Science

by

W Brett Kennedy

Deep Learning for Outlier Detection on Tabular and Image Data

The challenges and promises of deep learning for outlier detection, including self-supervised learning techniques

4d ago

Proyecto de Clasificación de Machine Learning: Encontrar Donantes

In

Ciencia y Datos

by

Victor Roman

Proyecto de Clasificación de Machine Learning: Encontrar Donantes

¡Predice quien será mas propenso a donar en este proyecto de clasificación!

May 20, 2019

See all from Victor Roman

See all from Towards Data Science

Recommended from Medium

Jeff Bezos Says the 1-Hour Rule Makes Him Smarter. New Neuroscience Says He’s Right

Jessica Stillman

Jeff Bezos Says the 1-Hour Rule Makes Him Smarter. New Neuroscience Says He’s Right

Jeff Bezos’s morning routine has long included the one-hour rule. New neuroscience says yours probably should too.

Oct 30, 2024

Advanced Techniques in K-Means Clustering

Amit Yadav

Advanced Techniques in K-Means Clustering

Hey, is this you?

Jul 18, 2024

Lists

Predictive Modeling w/ Python

20 stories1757 saves

Practical Guides to Machine Learning

10 stories2133 saves

Natural Language Processing

1884 stories1528 saves

data science and AI

40 stories312 saves

Photo by Kier in Sight on Unsplash

In

Towards Data Science

by

Kay Jan Wong

6 Types of Clustering Methods — An Overview

Types of clustering methods and algorithms and when to use them

Mar 24, 2023

K-Means vs. DBSCAN: Clustering Algorithms for Grouping Data

Hassaan Idrees

K-Means vs. DBSCAN: Clustering Algorithms for Grouping Data

A Comprehensive Comparison of Two Popular Clustering Techniques in Machine Learning

Oct 22, 2024

I used OpenAI’s o1 model to develop a trading strategy. It is DESTROYING the market

In

DataDrivenInvestor

by

Austin Starks

I used OpenAI’s o1 model to develop a trading strategy. It is DESTROYING the market

It literally took one try. I was shocked.

Sep 15, 2024

Real-time Sentiment Analysis

In

Top Python Libraries

by

Sai Krupa Reddy Surarapu

Real-time Sentiment Analysis

Real-time Sentiment Analysis processes Twitter data using Kafka, Spark, and MongoDB, and visualizes sentiment insights via a Django web…

Aug 7, 2024

See more recommendations

Help
Status
About
Careers
Press
Blog
Privacy
Terms
Text to speech
Teams