Sahar Fawzy Abdel Razek|Publications:Machine-Learning Techniques for Customer Retention: A Comparative Study

You are in:Home/Publications/Machine-Learning Techniques for Customer Retention: A Comparative Study
Dr. Sahar Fawzy Abdel Razek :: Publications:

Title:	Machine-Learning Techniques for Customer Retention: A Comparative Study
Authors:	sahar f. Sabbeh
Year:	2018
Keywords:	Not Available
Journal:	International Journal of Advanced Computer Science and Applications(IJACSA),
Volume:	9
Issue:	2
Pages:	Not Available
Publisher:	Not Available
Local/International:	International
Paper Link:	Not Available
Full paper	Not Available
Supplementary materials	Not Available

Abstract:

Nowadays, customers have become more interested in the quality of service (QoS) that organizations can provide them. Services provided by different vendors are not highly distinguished which increases competition between organizations to maintain and increase their QoS. Customer Relationship Management systems are used to enable organizations to acquire new customers, establish a continuous relationship with them and increase customer retention for more profitability. CRM systems use machine-learning models to analyze customers’ personal and behavioral data to give organization a competitive advantage by increasing customer retention rate. Those models can predict customers who are expected to churn and reasons of churn. Predictions are used to design targeted marketing plans and service offers. This paper tries to compare and analyze the performance of different machine-learning techniques that are used for churn prediction problem. Ten analytical techniques that belong to different categories of learning are chosen for this study. The chosen techniques include Discriminant Analysis, Decision Trees (CART), instance-based learning (k-nearest neighbors), Support Vector Machines, Logistic Regression, ensemble–based learning techniques (Random Forest, Ada Boosting trees and Stochastic Gradient Boosting), Naïve Bayesian, and Multi-layer perceptron. Models were applied on a dataset of telecommunication that contains 3333 records. Results show that both random forest and ADA boost outperform all other techniques with almost the same accuracy 96%. Both Multi-layer perceptron and Support vector machine can be recommended as well with 94% accuracy. Decision tree achieved 90%, naïve Bayesian 88% and finally logistic regression and Linear Discriminant Analysis (LDA) with accuracy 86.7%.

Dr. Sahar Fawzy Abdel Razek :: Publications: