The Three Types of Machine Learning

If "machine learning" sounds like one big thing, that's because that's how it's marketed. In practice, almost every ML system in the world falls into one of three buckets: supervised, unsupervised, or reinforcement learning. Knowing which bucket a problem belongs to is half the battle — it tells you what data you need, what tools to use, and what answers you can expect.

This lesson walks you through all three with real examples you'll recognize from your own apps.

What You'll Learn

The three families of machine learning, in plain English
How to spot which type a problem needs
Real-world examples for each (Spotify, Netflix, ChatGPT, self-driving cars)
A simple test you can run with ChatGPT to classify any ML problem

Type 1: Supervised Learning (Learning from Examples with Answers)

Supervised learning is the most common kind. The "supervisor" is a labeled dataset — examples that come with the correct answer attached.

Examples:

Spam detection — emails labeled "spam" or "not spam"
House price prediction — historical sales data with the actual sale price
Medical diagnosis — X-rays labeled "tumor present / not present" by doctors
Customer churn prediction — past customers labeled "stayed" or "left"

The model studies thousands of labeled examples, finds patterns linking inputs to outputs, and uses those patterns to predict new cases. Two flavors:

Classification — predicting a category (spam / not spam, dog / cat, fraud / legitimate)
Regression — predicting a number (price, temperature, sales)

The tell: if you can imagine listing examples with a "right answer" column, it's supervised learning.

Type 2: Unsupervised Learning (Finding Hidden Patterns)

Sometimes you have data but no labels. You don't know the "right answer" — you just want to see what's in there. That's unsupervised learning.

Examples:

Customer segmentation — group shoppers into types without knowing the types in advance
Spotify "Daily Mixes" — group your liked songs into vibes
Anomaly detection — find unusual patterns in network traffic or transactions
Topic discovery — group news articles by theme

The two main jobs:

Clustering — group similar items together
Dimensionality reduction — squish complex data into something simpler to visualize

The tell: if you're saying "I have data, what's interesting in it?" — that's unsupervised.

Type 3: Reinforcement Learning (Learning by Trial and Error)

Reinforcement learning is the closest thing to how a child learns to ride a bike. The system tries actions, gets rewards or penalties, and gradually learns a strategy.

Examples:

AlphaGo — Google's Go-playing AI, learned by playing millions of games against itself
Self-driving cars — learn from rewards (smooth driving) and penalties (collisions in simulation)
Recommendation systems — Netflix's homepage shuffles based on what you actually click
Robot vacuum routing — learns the most efficient path through your apartment
Trading algorithms — buy / sell actions rewarded with profit

There's no labeled dataset. There's a goal (maximize a reward) and a feedback loop.

The tell: if the system needs to make a sequence of decisions and the right action depends on the situation, reinforcement learning fits.

What About ChatGPT and Claude?

Modern large language models like ChatGPT, Claude, and Gemini use all three types stitched together:

Unsupervised pre-training on massive text data (find patterns in language)
Supervised fine-tuning on examples of good answers
Reinforcement learning from human feedback (RLHF) to make answers more helpful

This is why they feel so capable — they're not one ML technique, they're a stack of them.

A Practical Sorting Test

Open ChatGPT, Claude, or Gemini. Try this prompt:

"I'll describe three real-world problems. For each one, tell me whether it's a supervised, unsupervised, or reinforcement learning problem, and explain why in one sentence:

Predicting whether a credit card transaction is fraudulent

Grouping our customer support tickets into common themes we haven't named yet

Training a drone to fly through an obstacle course"

You should get: 1) supervised classification, 2) unsupervised clustering, 3) reinforcement learning. Now invent three problems from your own life or studies and test yourself.

How This Course Will Use Each Type

Type	Where you'll see it	Tool we'll use
Supervised	Predicting house prices, classifying images	Google Sheets, Teachable Machine
Unsupervised	Grouping survey responses	ChatGPT / Claude
Reinforcement	Discussion only (most no-code tools don't expose RL)	Conceptual examples

You don't need to memorize this table. Just know that 90% of business ML problems are supervised — that's where you'll spend most of your career using ML.

Common Beginner Mistake

People often try to apply supervised learning when they don't have labels yet. If you only have raw data and no "right answer" column, you're either doing:

Unsupervised learning (find structure)
Or you need to create labels first (often manually)

Don't skip the labeling step — bad labels create worse-than-useless supervised models.

Key Takeaways

Supervised learning needs labeled examples and predicts known categories or numbers
Unsupervised learning finds patterns in unlabeled data, often by clustering similar items
Reinforcement learning trains agents through trial-and-error rewards
ChatGPT and Claude combine all three under the hood
Knowing the type tells you what data you need and what answers you can expect

Next up: data — the actual fuel that makes any of this possible. Garbage data in means garbage predictions out, no matter how fancy the model.

The Three Types of Machine Learning

This lesson walks you through all three with real examples you'll recognize from your own apps.

What You'll Learn

The three families of machine learning, in plain English
How to spot which type a problem needs
Real-world examples for each (Spotify, Netflix, ChatGPT, self-driving cars)
A simple test you can run with ChatGPT to classify any ML problem

Type 1: Supervised Learning (Learning from Examples with Answers)

Supervised learning is the most common kind. The "supervisor" is a labeled dataset — examples that come with the correct answer attached.

Examples:

Spam detection — emails labeled "spam" or "not spam"
House price prediction — historical sales data with the actual sale price
Medical diagnosis — X-rays labeled "tumor present / not present" by doctors
Customer churn prediction — past customers labeled "stayed" or "left"

The model studies thousands of labeled examples, finds patterns linking inputs to outputs, and uses those patterns to predict new cases. Two flavors:

Classification — predicting a category (spam / not spam, dog / cat, fraud / legitimate)
Regression — predicting a number (price, temperature, sales)

The tell: if you can imagine listing examples with a "right answer" column, it's supervised learning.

Type 2: Unsupervised Learning (Finding Hidden Patterns)

Sometimes you have data but no labels. You don't know the "right answer" — you just want to see what's in there. That's unsupervised learning.

Examples:

Customer segmentation — group shoppers into types without knowing the types in advance
Spotify "Daily Mixes" — group your liked songs into vibes
Anomaly detection — find unusual patterns in network traffic or transactions
Topic discovery — group news articles by theme

The two main jobs:

Clustering — group similar items together
Dimensionality reduction — squish complex data into something simpler to visualize

The tell: if you're saying "I have data, what's interesting in it?" — that's unsupervised.

Type 3: Reinforcement Learning (Learning by Trial and Error)

Reinforcement learning is the closest thing to how a child learns to ride a bike. The system tries actions, gets rewards or penalties, and gradually learns a strategy.

Examples:

AlphaGo — Google's Go-playing AI, learned by playing millions of games against itself
Self-driving cars — learn from rewards (smooth driving) and penalties (collisions in simulation)
Recommendation systems — Netflix's homepage shuffles based on what you actually click
Robot vacuum routing — learns the most efficient path through your apartment
Trading algorithms — buy / sell actions rewarded with profit

There's no labeled dataset. There's a goal (maximize a reward) and a feedback loop.

The tell: if the system needs to make a sequence of decisions and the right action depends on the situation, reinforcement learning fits.

What About ChatGPT and Claude?

Modern large language models like ChatGPT, Claude, and Gemini use all three types stitched together:

Unsupervised pre-training on massive text data (find patterns in language)
Supervised fine-tuning on examples of good answers
Reinforcement learning from human feedback (RLHF) to make answers more helpful

This is why they feel so capable — they're not one ML technique, they're a stack of them.

A Practical Sorting Test

Open ChatGPT, Claude, or Gemini. Try this prompt:

"I'll describe three real-world problems. For each one, tell me whether it's a supervised, unsupervised, or reinforcement learning problem, and explain why in one sentence:

Predicting whether a credit card transaction is fraudulent

Grouping our customer support tickets into common themes we haven't named yet

Training a drone to fly through an obstacle course"

You should get: 1) supervised classification, 2) unsupervised clustering, 3) reinforcement learning. Now invent three problems from your own life or studies and test yourself.

How This Course Will Use Each Type

Type	Where you'll see it	Tool we'll use
Supervised	Predicting house prices, classifying images	Google Sheets, Teachable Machine
Unsupervised	Grouping survey responses	ChatGPT / Claude
Reinforcement	Discussion only (most no-code tools don't expose RL)	Conceptual examples

You don't need to memorize this table. Just know that 90% of business ML problems are supervised — that's where you'll spend most of your career using ML.

Common Beginner Mistake

People often try to apply supervised learning when they don't have labels yet. If you only have raw data and no "right answer" column, you're either doing:

Unsupervised learning (find structure)
Or you need to create labels first (often manually)

Don't skip the labeling step — bad labels create worse-than-useless supervised models.

Key Takeaways

Supervised learning needs labeled examples and predicts known categories or numbers
Unsupervised learning finds patterns in unlabeled data, often by clustering similar items
Reinforcement learning trains agents through trial-and-error rewards
ChatGPT and Claude combine all three under the hood
Knowing the type tells you what data you need and what answers you can expect

Next up: data — the actual fuel that makes any of this possible. Garbage data in means garbage predictions out, no matter how fancy the model.

The Three Types of Machine Learning

What You'll Learn

Type 1: Supervised Learning (Learning from Examples with Answers)

Type 2: Unsupervised Learning (Finding Hidden Patterns)

Type 3: Reinforcement Learning (Learning by Trial and Error)

What About ChatGPT and Claude?

A Practical Sorting Test

How This Course Will Use Each Type

Common Beginner Mistake

Key Takeaways

Quiz

Questions & Answers

The Three Types of Machine Learning

What You'll Learn

Type 1: Supervised Learning (Learning from Examples with Answers)

Type 2: Unsupervised Learning (Finding Hidden Patterns)

Type 3: Reinforcement Learning (Learning by Trial and Error)

What About ChatGPT and Claude?

A Practical Sorting Test

How This Course Will Use Each Type

Common Beginner Mistake

Key Takeaways

Quiz

Questions & Answers