Analysis For Bank Customer Churn Prediction Using Artificial Intelligence Based on Logistic Regression and Decision Tree
Abstract
Customer churn is a significant problem in the banking sector because it can reduce profitability and increase the cost of attracting new customers. This study aims to evaluate the effectiveness of Logistic Regression and Decision Tree algorithms in predicting bank customer churn and identifying factors that influence customers' decisions to stop using bank services. The data used comes from Kaggle with a total of 10,000 data including demographic information, transaction activity, customer satisfaction, and other risk factors. The data was analyzed through a cleaning stage to eliminate duplicates and missing data, then continued with initial data exploration to understand patterns
and correlations between variables. The analysis was carried out using Logistic Regression and Decision Tree models. The performance of both models was evaluated using metrics such as accuracy, precision, recall, and F1-score, and measured using ROC and AUC to assess the model's ability to distinguish between churned and non-churned customers. The results showed that 79.63% of customers remained active, while 20.37% experienced churn, which is a risk indicator for the sustainability of the bank's business. Model evaluation revealed that decision tree outperformed Logistic Regression on all metrics with an accuracy of 0.798 and 0.777, precision of 0.778 and 0.767, recall of 0.819 and 0.779, and F1-score of 0.798 and 0.773. However, Logistic Regression showed a higher AUC of 0.85 and 0.80. Decision tree was superior in detecting customers who actually churned. To reduce churn, it is recommended that banks offer loyalty programs and customize products for high-risk customers, such as those with low balances or credit scores. The use of other models such as random forest and gradient boosting can be suggested to improve accuracy and provide deeper insights in reducing churn.
Keywords: bank customer churn, logistic regression, decision tree, artificial intelligence, churn prediction
Downloads
Published
Issue
Section
License
Open Access
HUMANIS (Humanities, Management and Science Proceedings) is a national peer reviewed and open access journal that publishes significant and important research from all area of agroindustry.
This journal provides immediate open access to its content that making research publish in this journal freely available to the public that supports a greater exchange of knowledge.
Copyright
Submission of a manuscript implies that the submitted work has not been published before (except as part of a thesis or report, or abstract); that it is not under consideration for publication elsewhere; that its publication has been approved by all co-authors. If and when the manuscript is accepted for publication, the author(s) still hold the copyright and retain publishing rights without restrictions. Authors or others are allowed to multiply article as long as not for commercial purposes. For the new invention, authors are suggested to manage its patent before published. The license type is CC-BY-SA 4.0.
Disclaimer
No responsibility is assumed by publisher and co-publishers, nor by the editors for any injury and/or damage to persons or property as a result of any actual or alleged libelous statements, infringement of intellectual property or privacy rights, or products liability, whether resulting from negligence or otherwise, or from any use or operation of any ideas, instructions, procedures, products or methods contained in the material therein.