← All Projects

Predicting Employee Attrition at McCurr Health Consultancy

An end-to-end classification pipeline to flag at-risk employees before they leave

Overview

We built a tool that predicts which employees are likely to quit so the company can act early to keep them.

Methodology

flowchart LR
  A[Raw Data] --> B[Clean & Encode]
  B --> C[EDA]
  C --> D[Train/Test Split]
  D --> E["Random Forest / Decision Tree / XGBoost / Gradient Boosting"]
  E --> F["Tune (Cross-Validation)"]
  F --> G["Evaluate: Recall / F1 / ROC"]
  G --> H[Interpret]

The Data

We started with HR records for nearly 3,000 employees described by dozens of personal and job-related details.

Exploratory Analysis

We profiled the workforce to understand who works there and where early warning signs of leaving appear.

Key Drivers of Attrition

The models agree that overtime, low pay, younger age, and short tenure are the biggest red flags for quitting.

Modeling & Results

We tested several models and the tuned Random Forest best balanced catching real leavers without too many false alarms.

Key Takeaways

HR can now flag likely leavers in advance and focus retention efforts on overtime and pay.

More Visualizations

Tech Stack

Attribution

This project was completed as part of the MIT Applied Data Science Program (MIT IDSS / Great Learning). The program provided the case-study scaffolding; the analysis, code, and results are my own. Published with permission, for portfolio use only.