Classification of Student Discipline Levels Using the C4.5 Al-gorithm Based on Violation Points in High School

Authors

  • Hendra Parsaulian Universitas Pelita Bangsa
  • Zacky Rafian Fawwauzy Universitas Pelita Bangsa
  • Elkin Rilvani Universitas Pelita Bangsa

DOI:

https://doi.org/10.59934/jaiea.v5i1.1390

Keywords:

Classification, C4.5 Algorithm, CRISP DM, Decision Tree, Educational Data Mining

Abstract

This study aims to classify student discipline levels based on accumulated violation points by implementing the C4.5 decision tree algorithm within the CRISP‑DM framework. The research follows a structured and iterative process, beginning with problem understanding, dataset exploration, and preparation, followed by model training and evaluation. The dataset consists of student demographic information, violation types, and total violation points collected over one academic year. The C4.5 algorithm was selected for its ability to process both categorical and numerical data and to generate interpretable classification rules. The model was trained using a split of training and testing data and further validated using cross‑validation to ensure reliability. The results indicate that the model effectively classifies students into high, medium, and low discipline levels, achieving strong predictive performance. The generated decision tree provides clear and interpretable rules, enabling educators to identify patterns in student behavior and prioritize targeted interventions. These findings highlight the potential of data‑driven approaches to enhance discipline management practices in educational institutions.

Downloads

Download data is not yet available.

References

C. Romero and S. Ventura, “Educational data mining and learning analytics: An updated survey,” arXiv preprint arXiv:2402.07956, Feb. 2024.

S. Kotsiantis, “Decision trees: A recent overview,” Artificial Intelligence Review, vol. 39, no. 4, pp. 261–283, 2013.

J. R. Quinlan, C4.5: Programs for Machine Learning. San Mateo, CA: Morgan Kaufmann, 1993.

M. A. Hall, E. Frank, G. Holmes, B. Pfahringer, P. Reutemann, and I. H. Witten, “The WEKA data mining software: An update,” SIGKDD Explorations, vol. 11, no. 1, pp. 10–18, 2009.

D. L. S. Purnama and U. Apsiswanto, “Analysis of C4.5 Algorithm Performance for Predicting Student Achievement Using Educational Data,” J. Comput. Netw. Archit. High Perform. Comput., vol. 7, no. 1, pp. 190–199, Jan. 2025.

J. Han, M. Kamber, and J. Pei, Data Mining: Concepts and Techniques, 4th ed. Cambridge, MA: Morgan Kaufmann, 2022.

F. A. Orji and J. Vassileva, “Machine Learning Approach for Predicting Students’ Academic Performance and Behavior Patterns,” arXiv preprint arXiv:2210.08186, Oct. 2022.

J. R. M. Ledoh, I. D. K. Mahendra, and M. A. Benu, “C4.5 Algorithm Implementation to Predict Student Satisfaction Level on the Learning Process,” Komputasi, vol. 20, no. 2, pp. 126–134, 2023.

M. R. Rizal and R. S. Suryono, “A Comparative Study of Machine Learning Algorithms for Student Behavioral Prediction,” in Proc. 2023 Int. Conf. Data Sci. Educ., 2023, pp. 55–62.

M. Lintang, S. H. Pramono, and A. A. Nugroho, “Use of the C4.5 Algorithm to Analyze Student Interest and Its Impact on Learning Outcomes,” SHS Web of Conferences, vol. 149, p. 01048, 2022.

H. Witten, E. Frank, and M. Hall, Practical Machine Learning Tools and Techniques with Java Implementations. San Fran-cisco: Morgan Kaufmann, 2023

Downloads

Published

2025-10-15

How to Cite

Hendra Parsaulian, Zacky Rafian Fawwauzy, & Elkin Rilvani. (2025). Classification of Student Discipline Levels Using the C4.5 Al-gorithm Based on Violation Points in High School. Journal of Artificial Intelligence and Engineering Applications (JAIEA), 5(1), 662–666. https://doi.org/10.59934/jaiea.v5i1.1390