← All Projects

Clustering Countries by Socio-Economic Profile

Grouping 167 nations by health, trade, and income indicators to guide aid and development decisions

Overview

We wanted to sort countries into meaningful groups based on how developed and well-off they are.

Methodology

flowchart LR
  A[Raw Data] --> B[Scale / Standardize]
  B --> C["Reduce: PCA / t-SNE"]
  C --> D["Cluster: K-Means / Hierarchical"]
  D --> E["Evaluate: Elbow / Silhouette"]
  E --> F[Interpret Clusters]

The Data

Each country is described by nine numbers covering its economy, trade, and population health.

Exploratory Analysis

Richer countries clearly tend to be healthier, and poverty tracks closely with high child mortality.

Clusters Discovered

Several methods all pointed to roughly three groups: rich, poor, and a large middle.

Interpretation & Recommendations

The cleanest grouping separates struggling countries that need aid from prosperous trade hubs.

Key Takeaways

Comparing five clustering methods produced a robust, three-tier view of global development.

More Visualizations

Tech Stack

Attribution

This project was completed as part of the MIT Applied Data Science Program (MIT IDSS / Great Learning). The program provided the case-study scaffolding; the analysis, code, and results are my own. Published with permission, for portfolio use only.