← All Projects

FIFA World Cup Analysis

Mining 80+ years of World Cup history to guide a new football club's strategy

Overview

A brand-new football club asked us to dig through World Cup history to learn what drives winning nations, attendance, and goals.

Methodology

flowchart LR
  A[Raw Data] --> B[Clean & Validate]
  B --> C[Univariate EDA]
  C --> D[Bivariate / Correlation]
  D --> E[Insights & Recommendations]

The Data

We combined three official datasets covering every tournament, every match, and every player up to 2014.

Exploratory Analysis

We checked distributions, spotted outliers with box plots, and mapped how the key numbers relate to each other.

Key Findings & Drivers

More goals over the years come from playing more matches, not from teams actually scoring more per game.

Insights & Recommendations

A handful of nations dominate the trophy, and host cities and attendance show clear, actionable patterns.

Key Takeaways

The analysis turns decades of raw results into a clear, evidence-based picture of what wins and what fills stadiums.

More Visualizations

Tech Stack

Attribution

This project was completed as part of the MIT Applied Data Science Program (MIT IDSS / Great Learning). The program provided the case-study scaffolding; the analysis, code, and results are my own. Published with permission, for portfolio use only.