Analysis For Bank Customer Churn Prediction Using Artificial Intelligence Based On Logistic Regression And Decision Tree

Authors

  • Tukiyat Tukiyat Universitas Pamulang, Tangerang Selatan, Banten, Indonesia
  • Suryatna Sacadibrata South Tangerang City Government, Ciputat, South Tangerang, Indonesia
  • Taufiqur Rahman Tech Mahindra Indonesia, South Jakarta, Jakarta, Indonesia

DOI:

https://doi.org/10.32493/iijoms.v1i1.53714

Keywords:

Bank Customer Churn, Logistic Regression, Decision Tree, Artificial Intelligence, Churn Prediction

Abstract

Customer churn is a significant problem in the banking sector because it can reduce profitability and increase the cost of attracting new customers. This study aims to evaluate the effectiveness of Logistic Regression and Decision Tree algorithms in predicting bank customer churn and identifying factors that influence customers' decisions to stop using bank services. The data used comes from Kaggle with a total of 10,000 data including demographic information, transaction activity, customer satisfaction, and other risk factors. The data was analyzed through a cleaning stage to eliminate duplicates and missing data, then continued with initial data exploration to understand patterns and correlations between variables. The analysis was carried out using Logistic Regression and Decision Tree models. The performance of both models was evaluated using metrics such as accuracy, precision, recall, and F1-score, and measured using ROC and AUC to assess the model's ability to distinguish between churned and non-churned customers. The results showed that 79.63% of customers remained active, while 20.37% experienced churn, which is a risk indicator for the sustainability of the bank's business. Model evaluation revealed that decision tree outperformed Logistic Regression on all metrics with an accuracy of 0.798 and 0.777, precision of 0.778 and 0.767, recall of 0.819 and 0.779, and F1-score of 0.798 and 0.773. However, Logistic Regression showed a higher AUC of 0.85 and 0.80. Decision tree was superior in detecting customers who actually churned. To reduce churn, it is recommended that banks offer loyalty programs and customize products for high-risk customers, such as those with low balances or credit scores. The use of other models such as random forest and gradient boosting can be suggested to improve accuracy and provide deeper insights in reducing churn.

References

Abdulsalam, Y., Ali, N., & Omer, I. (2022). Application of artificial intelligence for customer churn prediction in the banking sector . International Journal of Artificial Intelligence, 14(3), 45-67.

Anderson, R., Taylor, S., & Brown, J. (2024). Reducing customer churn through proactive communication: A banking case study . Journal of Financial Services Marketing, 29(2), 12-23.

Ashraf, A. (2024). Customer churn and its impact on profitability in the banking sector: A review . Journal of Business and Economics, 32(4), 134-145.

Chen, Y., Zhang, X., & Wu, H. (2023). Gender differences in customer churn prediction in the banking sector . Journal of Marketing Research, 60(1), 50-61.

He, H., Liu, X., & Zhang, T. (2008). Improved sampling techniques for imbalanced classification tasks: ADASYN and SMOTE comparison . IEEE Transactions on Knowledge and Data Engineering, 20(4), 452-463.

Hon, K., Lee, J., & Cheung, C. (2023). Demographic factors and transaction behavior in predicting churn in the banking sector . International Journal of Bank Marketing, 41(6), 334-349.

Hosmer, D., Lemeshow, S., & Sturdivant, R. (2013). Applied logistic regression (3rd ed.). John Wiley & Sons.

Johnson, D., & Lee, Y. (2023). The role of customer service in reducing churn in retail banking . Journal of Service Research, 28(2), 78-92.

Kumar, V. (2020). Customer retention strategies in banking: Using data analytics to predict churn . Journal of Financial Technology, 12(1), 56-69.

Kotios, S., Nikos, T., & Vasilis, K. (2022). Cost-benefit analysis of customer retention in banking . International Journal of Economic and Financial Issues, 8(3), 45-56.

Morgan, P., Harris, R., & Chen, Z. (2024). The impact of mobile banking applications on customer retention . Journal of Digital Banking, 15(2), 23-35.

Patel, N., Sharma, P., & Soni, R. (2022). The changing needs of customers and their impact on bank service offerings . Journal of Banking and Finance, 24(1), 101-112.

Quinlan, J. (1996). C4.5: Programs for machine learning . Morgan Kaufmann.

Smith, L., Rojas, F., & Walker, J. (2024). Predictive analytics in banking: Forecasting customer churn with machine learning . Journal of Business Analytics, 11(1), 22-36.

Taylor, S., & Brown, J. (2023). Proactive customer service as a churn prevention strategy in financial institutions . Journal of Consumer Satisfaction, 19(4), 101-114.

Wen, J. (2023). The application of logistic regression in customer churn prediction . Journal of Data Science, 11(3), 50-63.

Zhang, L., Liu, F., & Wang, M. (2022). A comparative study of decision trees and logistic regression in customer churn prediction . Journal of Computational Statistics, 34(4), 120-134.

Zeithaml, V., Parasuraman, A., & Berry, L. (2023). The behavioral consequences of service quality and customer satisfaction in the banking industry . Journal of Marketing Research, 28(2), 91-103.

Downloads

Published

2025-02-28