Taiju Sanagi: Experiments

Unsupervised Learning

Note
Updated: April 22, 2025

🧩 Introduction to Unsupervised Learning

Unsupervised learning is a type of machine learning where the model learns from data without any labels.

There are no answers provided — just raw input data.
The goal is to discover patterns, structures, or groupings hidden inside the data.

1. How Is It Different from Supervised Learning?

In supervised learning, the model is trained with input-output pairs.
In unsupervised learning, the model only sees the inputs and has to figure things out on its own.

There is no teacher.
No right or wrong labels.
The model tries to understand the data just by looking at how the examples are distributed.

2. What Can Unsupervised Learning Do?

Some common tasks include:

• Clustering

Group similar data points together based on their features.
You don’t tell it what the groups are — it just finds them.

Example: Grouping customers with similar shopping habits.

• Dimensionality Reduction

Compress high-dimensional data into fewer dimensions while keeping as much information as possible.

Example: Visualizing complex data like images or documents in 2D.

• Anomaly Detection

Find rare or unusual points that don’t fit the general pattern.

Example: Detecting fraudulent transactions or system errors.

• Topic Modeling

Automatically discover topics inside a collection of text documents.

Example: Summarizing thousands of news articles by common themes.

3. Why Use Unsupervised Learning?

You might use unsupervised learning when:

  • You don’t have labels or annotated data
  • You want to explore the structure of your dataset
  • You want to compress, clean, or organize your data
  • You want to find hidden patterns that humans might not notice

4. Common Unsupervised Algorithms

Some popular unsupervised learning methods include:

  • K-Means Clustering
  • Hierarchical Clustering
  • DBSCAN
  • Principal Component Analysis (PCA)
  • Non-Negative Matrix Factorization (NMF)
  • t-SNE and UMAP
  • Topic Modeling (e.g. LDA)

Each of these techniques focuses on a different type of pattern or structure.

Summary

Unsupervised learning helps you find structure in unlabeled data.
It’s often used for exploration, discovery, visualization, and pattern detection.