Random Forest Algorithm to Improve the Classification Model for Recipients of Direct Cash Assistance in Danalampah Village, Pancalang District, Kuningan Regency

Authors

  • Rio Harsadino STMIK IKMI Cirebon
  • Rudi Kurniawan STMIK IKMI Cirebon
  • Saeful Anwar STMIK IKMI Cirebon

DOI:

https://doi.org/10.59934/jaiea.v4i2.924

Keywords:

Data Classification, Machine Learning, Data Imbalance, Random Forest, Data Balancing.

Abstract

Data classification plays a crucial role in machine learning, particularly in supporting data-driven decision-making processes. Data imbalance between majority and minority classes often leads to model bias toward the majority class, reducing accuracy in detecting the minority class. In this study, recipients of direct cash assistance (BLT) represent the minority class, whose numbers are significantly fewer than non-recipients. Therefore, specific strategies are required to ensure more accurate and fair classification results. This study aims to analyze the impact of data imbalance on the performance of the BLT classification model, evaluate the model's performance using accuracy, precision, and recall metrics, and explore the effectiveness of data balancing techniques in improving the model's sensitivity to the minority class. The Random Forest algorithm serves as the primary method for building the classification model. This algorithm works by constructing numerous decision trees based on randomly selected data and features, then combining predictions through majority voting. Additionally, data balancing is implemented through manual reduction in the majority class, resulting in a dataset with a more proportional ratio (1:2), comprising 66 records: 22 BLT recipients and 44 non-recipients. The results demonstrate that the Random Forest algorithm applied to the original dataset achieves high accuracy (99.59%), but the recall for BLT recipients is 0%, indicating bias toward the majority class. After data balancing, accuracy slightly decreases to 98.33%, while recall significantly improves to 66.67%, and precision reaches 65.83%. These findings indicate that data balancing successfully enhances the model's sensitivity to the minority class without significantly compromising accuracy. This study concludes that the integration of data balancing techniques and the Random Forest algorithm improves the fairness and representation of classification outcomes. These findings offer practical contributions to the application of social data analysis, particularly in classifying social assistance recipients, to support more targeted decision-making processes

Downloads

Download data is not yet available.

References

Junaidi Satrio, R. Valicia Anggela, and D. Kariman, “Klasifikasi Metode Data Mining untuk Prediksi Kelulusan Tepat Waktu Mahasiswa dengan Algoritma Naïve Bayes, Random Forest, Support Vector Machine (SVM) dan Artificial Neural Nerwork (ANN),” J. Appl. Comput. Sci. Technol., vol. 5, no. 1, pp. 109–119, 2024, doi: 10.52158/jacost.v5i1.489.

K. G. Putra, “Penentuan Penerima Bantuan Program Keluarga Harapan menggunakan Algoritma Random Forest di Desa Kebonrejo,” Innov. J. Soc. Sci. Res., vol. 4, pp. 7242–7257, 2024.

D. Mualfah, W. Fadila, and R. Firdaus, “Teknik SMOTE untuk Mengatasi Imbalance Data pada Deteksi Penyakit Stroke Menggunakan Algoritma Random Forest,” J. CoSciTech (Computer Sci. Inf. Technol., vol. 3, no. 2, pp. 107–113, 2022, doi: 10.37859/coscitech.v3i2.3912.

R. Firdaus et al., “Implementasi Algoritma Random Forest Untuk Klasifikasi Pencemaran Udara di Wilayah Jakarta Berdasarkan Jakarta Open Data,” J. FASILKOM, vol. 14, no. 2, pp. 520–525, 2021.

D. Sudrajat, A. I. Purnamasari, A. R. Dikananda, D. A. Kurnia, and A. Bahtiar, “Klasifikasi Mutu Pembelajaran Hybrid berdasarkan Algoritma C.45, Random Forest dan Naïve Bayes dengan Optimasi Bootsrap Areggating (Bagging) pada masa COVID-19,” JURIKOM (Jurnal Ris. Komputer), vol. 9, no. 6, p. 2227, 2022, doi: 10.30865/jurikom.v9i6.5179.

G. Ashari Rakhmat and W. Mutohar, “Prakiraan Hujan menggunakan Metode Random Forest dan Cross Validation,” J. MIND J. | ISSN, vol. 8, no. 2, pp. 173–187, 2023, [Online]. Available: https://doi.org/10.26760/mindjournal.v8i2.173-187

D. Benaya and B. Prasetiyo, “Implementasi Random Forest dalam Klasifikasi Kanker Paru-Paru,” JOINTER J. Informatics Eng., vol. 5, no. 01, pp. 27–31, 2024, doi: 10.53682/jointer.v5i01.331.

Downloads

Published

2025-02-15

How to Cite

Harsadino, R., Rudi Kurniawan, & Saeful Anwar. (2025). Random Forest Algorithm to Improve the Classification Model for Recipients of Direct Cash Assistance in Danalampah Village, Pancalang District, Kuningan Regency. Journal of Artificial Intelligence and Engineering Applications (JAIEA), 4(2), 1450–1456. https://doi.org/10.59934/jaiea.v4i2.924