← All Projects

Employee Attrition Prediction

Why employees leave — and predicting who is at risk

Overview

Losing employees is expensive — this finds why people leave and flags who might.

Methodology

flowchart LR
  A[Raw Data] --> B[Clean & Encode]
  B --> C[EDA]
  C --> D[Train/Test Split]
  D --> E["Logistic Regression / KNN / LDA / QDA"]
  E --> F["Tune (Cross-Validation)"]
  F --> G["Evaluate: Recall / F1 / ROC"]
  G --> H[Interpret]

The Data

One row per employee, mixing personal details with work-life metrics.

Exploratory Analysis

What the data looks like before any modeling.

Key Drivers of Attrition

The factors that most separate leavers from stayers.

Modeling & Results

How the prediction model was built and how well it performed.

Key Takeaways

What HR should actually do with this.

More Visualizations

Tech Stack

Attribution

This project was completed as part of the MIT Applied Data Science Program (MIT IDSS / Great Learning). The program provided the case-study scaffolding; the analysis, code, and results are my own. Published with permission, for portfolio use only.