reemkhaleed/Customer-Churn-Prediction-with-Scikit-learn

Customer-Churn-Prediction-with-Scikit-learn

An end-to-end machine learning pipeline for predicting customer churn using a Kaggle telecom dataset. This project includes data loading, cleaning, preprocessing, feature encoding, and model training with hyperparameter tuning using RandomizedSearchCV. Evaluation is performed using classification metrics and AUC score.

Churn Prediction Pipeline | pandas + sklearn

The pipeline covers every step from data loading through model training and evaluation, implemented with pandas and scikit-learn.


📊 Dataset

  • Source: Customer Churn Analysis Dataset
  • The dataset contains customer information such as contract type, tenure, payment method, and service usage.
  • The target variable is Churn (Yes/No), indicating whether a customer left the company.
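As a rough illustration of the cleaning step, the sketch below converts a Yes/No `Churn` label to 1/0 and coerces a numeric column that arrived as text. Only the `Churn` column and its Yes/No values come from the description above. The helper name `prepare_churn_frame`, the `TotalCharges` column, and the sample rows are hypothetical stand-ins, not the actual dataset schema or the notebook's code.

```python
import pandas as pd


def prepare_churn_frame(df: pd.DataFrame, target: str = "Churn",
                        numeric_col: str = "TotalCharges") -> pd.DataFrame:
    """Return a cleaned copy of df with a binary 0/1 target column.

    `numeric_col` is a hypothetical column name used for illustration.
    """
    out = df.copy()
    # Numeric fields sometimes arrive as strings; unparsable values become NaN.
    out[numeric_col] = pd.to_numeric(out[numeric_col], errors="coerce")
    # Map the Yes/No label to 1/0 so classifiers and metrics can use it.
    out[target] = out[target].map({"Yes": 1, "No": 0})
    # Drop rows that are missing any value after conversion.
    return out.dropna().reset_index(drop=True)


# Tiny hand-made sample (not real data).
sample = pd.DataFrame({
    "tenure": [1, 24, 60],
    "Contract": ["Month-to-month", "One year", "Two year"],
    "TotalCharges": ["29.85", "unknown", "3500.10"],
    "Churn": ["Yes", "No", "No"],
})
clean = prepare_churn_frame(sample)
print(clean)
```

Dropping incomplete rows is only one option; imputing missing values may be preferable when many rows are affected.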

🚀 Features


✅ Load and explore real-world-like telecom data
✅ Clean missing values and convert data types
✅ Encode categorical variables using one-hot encoding
✅ Scale numeric features with StandardScaler
✅ Use Random Forest classifier
✅ Tune hyperparameters with RandomizedSearchCV
✅ Evaluate performance using classification report and ROC AUC
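The steps listed above could be wired together roughly as follows. This is a minimal sketch, not the project's actual notebook: it runs on synthetic data, and the column names (`tenure`, `MonthlyCharges`, `Contract`), the toy labeling rule, and the search space are all assumptions chosen for illustration.

```python
import numpy as np
import pandas as pd
from sklearn.compose import ColumnTransformer
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import classification_report, roc_auc_score
from sklearn.model_selection import RandomizedSearchCV, train_test_split
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import OneHotEncoder, StandardScaler

# Synthetic stand-in for the telecom data (hypothetical columns).
rng = np.random.default_rng(0)
n = 400
X = pd.DataFrame({
    "tenure": rng.integers(1, 72, size=n),
    "MonthlyCharges": rng.uniform(20, 120, size=n),
    "Contract": rng.choice(["Month-to-month", "One year", "Two year"], size=n),
})
# Toy rule so the label is learnable: short-tenure monthly customers churn.
y = ((X["Contract"] == "Month-to-month") & (X["tenure"] < 36)).astype(int)

numeric_cols = ["tenure", "MonthlyCharges"]
categorical_cols = ["Contract"]

# Scale numeric features and one-hot encode categorical ones.
preprocess = ColumnTransformer([
    ("num", StandardScaler(), numeric_cols),
    ("cat", OneHotEncoder(handle_unknown="ignore"), categorical_cols),
])

model = Pipeline([
    ("prep", preprocess),
    ("clf", RandomForestClassifier(random_state=42)),
])

# Illustrative search space; the "clf__" prefix targets the pipeline step.
param_distributions = {
    "clf__n_estimators": [50, 100, 200],
    "clf__max_depth": [None, 5, 10],
    "clf__min_samples_leaf": [1, 2, 4],
}

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=42
)

# Sample 5 random parameter combinations, scored by ROC AUC with 3-fold CV.
search = RandomizedSearchCV(
    model, param_distributions, n_iter=5, cv=3,
    scoring="roc_auc", random_state=42,
)
search.fit(X_train, y_train)

# Evaluate the refitted best model on the held-out split.
y_pred = search.predict(X_test)
y_proba = search.predict_proba(X_test)[:, 1]
auc = roc_auc_score(y_test, y_proba)

print(search.best_params_)
print(classification_report(y_test, y_pred))
print("ROC AUC:", auc)
```

Keeping the preprocessing inside the `Pipeline` means the scaler and encoder are refit on each cross-validation training fold, which avoids leaking statistics from the validation fold into the search.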


🧱 Tech Stack

  • Python 3
  • pandas
  • scikit-learn
  • numpy
  • Jupyter / Google Colab
