Taiju Sanagi: Experiments

Manifold Learning

Note
Updated: April 22, 2025

🌐 Understanding Manifold Learning

PCA is powerful — but it assumes that the important structure in the data lies along straight (linear) directions.

But what if the data lies on a curved surface inside a high-dimensional space?

This is where manifold learning comes in.

1. Why PCA Isn't Always Enough

Imagine you have data shaped like a twisted ribbon or a spiral in 3D.
PCA would try to flatten it with a straight line or plane — and lose the true shape.

These kinds of shapes are called nonlinear manifolds — curved surfaces that live in high-dimensional space.

A manifold is a space that locally looks flat (like a sheet), but globally may be curved.

2. What Is Manifold Learning?

Manifold learning algorithms try to:

  • Unroll or flatten these curved structures
  • Find a lower-dimensional space that preserves the shape and neighborhoods of the data

Unlike PCA, manifold methods don't assume the data lies along straight lines.

3. Example: Swiss Roll

A classic example is the Swiss Roll dataset — a spiral sheet curled in 3D.

  • PCA flattens it by slicing straight across
  • Manifold methods like Isomap or t-SNE can unroll it correctly, keeping nearby points together

4. Common Manifold Learning Techniques

Here are a few popular ones:

• Isomap

  • Preserves geodesic distances (distance along the manifold surface)
  • Builds a graph between neighbors and computes shortest paths

• t-SNE (t-distributed Stochastic Neighbor Embedding)

  • Focuses on preserving local neighborhoods
  • Often used for visualizing high-dimensional clusters in 2D

• UMAP (Uniform Manifold Approximation and Projection)

  • Like t-SNE, but faster and often better at keeping global structure
  • Good for large datasets

5. When to Use Manifold Learning

Use it when:

  • PCA doesn't reveal interesting patterns
  • You believe the data has a curved structure
  • You're mainly trying to visualize or understand the shape of the data, not necessarily for modeling