Classification of Customer Decisions in Opening Deposits Using the C4.5 Algorithm Implemented in Python
DOI:
https://doi.org/10.59934/jaiea.v5i1.1384Keywords:
Data Mining, C.45, Classification, Pyton, Deposits, BankingAbstract
The banking industry needs to understand customer behavior to improve marketing strategies, particularly for deposit products. This study aims to develop a classification model to predict customer decisions in opening deposits using the C4.5 decision tree algorithm. The dataset consists of secondary banking data, including attributes such as age, occupation, marital status, education, account balance, and deposit decisions. The research adopts the Knowledge Discovery in Databases (KDD) process, from data cleaning to model implementation using Python and the chefboost library. Based on 11,162 customer records, the model achieved 56.69% accuracy, 55.63% precision, and 42.50% recall. Evaluation through a confusion matrix resulted in 2,248 True Positives (TP), 4,080 True Negatives (TN), 1,793 False Positives (FP), and 3,041 False Negatives (FN). The findings suggest that the C4.5 algorithm serves as a baseline approach for predicting customer behavior in deposit decisions. However, further improvements are needed to enhance its performance. This study contributes to the development of data-driven decision support systems in the banking sector.
Downloads
References
D. H. Nuryuliani, “Analisis Prediksi Nasabah yang Berpotensi Membuka Deposito pada Bank Umum di Bekasi Menggunakan Algoritma C4.5 dan Naive Bayes,” J. Ilm. Komputasi, vol. 19, no. 3, pp. 317–326, Sep. 2020, doi: 10.32409/jikstik.19.3.66.
M. R. Takakobi and K. D. Hartomo, “Analisis Metode Klasifikasi Nasabah Potensial dalam Membuka Deposito Jangka Panjang Melalui Telemarketing Menggunakan Metode Gradient Boosting Classifier”.
N. Nia, “Klasifikasi Data Mining untuk Prediksi Potensi Nasabah dalam Membuat Deposito Berjangka,” J. Ilm. Intech Inf. Technol. J. UMUS, vol. 3, no. 1, pp. 65–75, May 2021, doi: 10.46772/intech.
W.-T. Wu et al., “Data mining in clinical big data: the frequently used databases, steps, and methodological models,” Mil. Med. Res., vol. 8, no. 1, Aug. 2021, doi: 10.1186/s40779-021-00338-z.
Y. F. S. Sugiarto, “Algoritma C4.5 sebagai Penerapan Decision Tree-Based Classification Model untuk Mengklasifikasikan Tingkat Omzet UMKM Berdasarkan Profil Bisnis,” J. Ekon. Manaj. Akunt. Dan Perbank. Syari’ah, vol. 13, no. 2, pp. 188–197, Sep. 2024.
Alfarizi et al., “Penggunaan Python sebagai Bahasa Pemrograman untuk Machine Learning dan Deep Learning,” J. Karimah Tauhid, vol. 2, no. 1, pp. 1–6, 2023.
A. D. Sidik and A. Ansawarman, “Prediksi Jumlah Kendaraan Bermotor Menggunakan Machine Learning,” Formosa J. Multidiscip. Res., vol. 1, no. 3, pp. 559–568, Jul. 2022, doi: 10.55927/fjmr.v1i3.745.
D. Normawati and S. A. Prayogi, “Implementasi Naïve Bayes Classifier Dan Confusion Matrix Pada Analisis Sentimen Berbasis Teks Pada Twitter,” J. Sains Komput. Inform. J-SAKTI, vol. 5, no. 2, pp. 697–711, 2021.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2025 Journal of Artificial Intelligence and Engineering Applications (JAIEA)

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.







