top of page
Image by Nik
Project Series

Predicting Churn in Telecom Services

  • Date: Apr 2023

  • Tools:

    • Power BI - Visualization

    • RapidMiner - Machine Learning Prediction

  • Data Volume: 7,043

  • Data Source: ​kaggle (Telco Customer Churn - IBM Sample Data Sets)

  • Project Outcome: Interactive Dashboard, Machine Learning model comparison

Project Summary

This project simulates a customer churn scenario to help businesses identify customers at risk of churning and the underlying factors contributing to their decision. The analysis is based on IBM's Telco Customer Churn dataset, sourced from Kaggle.

The study consists of two analytical components:

  

 1. Exploratory Visualization (Power BI):

Interactive dashboards provide insights into customer profiles, overall churn patterns, and the relationships between churn rate and various factors.

 

2. Predictive Modeling (RapidMiner):
Two machine learning algorithms—Logistic Regression and Decision Tree—were employed to predict customer churn and uncover significant risk variables. Model performance was compared to determine the more effective approach.

 

Insights from Exploratory Visualization (Power BI):

  • Churn rate is associated with tenure, contract type, service type, and monthly charges.

  • By analyzing the interaction between tenure and monthly charges, the highest and lowest churn risk segments were identified:

🔺 High monthly charge + Low tenure → Highest churn rate
🔻 Low monthly charge + High tenure → Lowest churn rate

 

Model Performance:

  • Logistic Regression achieved an accuracy of 78.8%, outperforming the Decision Tree model at 75.93%.

  • Contract type, tenure, and internet service emerged as significant predictors in both models.

  • Monthly charges were significant only in the Decision Tree model.

Visualizations

Demo - Interactive Dashboard

Dashboards: Customer Profile and Churn Analysis

Predictive Modeling

Churn Prediction with Decision Tree

decision tree.png

Churn Prediction with Logistic Regression

logistic regression.png

Conclusion

Churn is most prevalent among short-tenure customers with high monthly charges, and is also elevated among those with limited contract commitments. Logistic Regression achieved 78.8% accuracy, identifying tenure, contract type, and internet service as key churn predictors. Businesses can tailor retention strategies by targeting at-risk customer segments and optimizing service offerings based on these insights.

 © 2025 Yi-Chen Hsiao. All rights reserved.

bottom of page