Spam Message Classification Using the Naïve Bayes Algorithm Based on RapidMiner
DOI:
https://doi.org/10.59934/jaiea.v5i2.1811Keywords:
Naïve Bayes, RapidMiner, Spam Classification, Text Mining, Machine Learning, Natural Language ProcessingAbstract
This study implements the Naïve Bayes algorithm for classifying spam and non-spam (ham) messages using the RapidMiner Studio platform. The dataset used was obtained from the SMS Spam Collection Dataset on the Kaggle platform, which consists of 5,759 messages with a distribution of 4,075 ham messages and 1,291 spam messages. The research stages included text pre-processing, model training, and performance evaluation using accuracy, precision, recall, and F1-score metrics. The experimental results showed that the Naïve Bayes model achieved an accuracy of 89.64% with a precision of 56.93%, a recall of 100%, and an F1-score of 72.56%. The research findings indicate that the Naïve Bayes algorithm is effective in detecting spam messages with adequate accuracy, and prove that RapidMiner is an efficient tool for implementing machine learning methods in text classification.
Downloads
References
H. A. Al-Kaabi, A. Darroudi, A. K. Jasim, H. Alaa, and A.-K. Hussain, “Survey of SMS Spam Detection Techniques: A Taxonomy,” Alkadhim Journal for Computer Science, vol. 4, no. 2, 2024, doi: 10.53523/ijoirVolxIxIDxx.
A. Sauddin, T. Azisah Nurman, N. Aeni, and S. Rahayu Sudarta, “Klasifikasi Spam SMS Menggunakan Naïve Bayes Classifier dan K-Nearest Neighbors.”
S. Charan Lanka, K. Pujita, K. Akhila, S. Mondal, P. Vidya Sagar, and S. Bulla, “International Journal of INTELLIGENT SYSTEMS AND APPLICATIONS IN ENGINEERING Optimization of Naïve Bayes Classifier for Spam E-Mail Detection.” [Online]. Available: www.ijisae.org
D. Irawan, E. B. Perkasa, Y. Yurindra, D. Wahyuningsih, and E. Helmud, “Perbandingan Klassifikasi SMS Berbasis Support Vector Machine, Naive Bayes Classifier, Random Forest dan Bagging Classifier,” Jurnal Sisfokom (Sistem Informasi dan Komputer), vol. 10, no. 3, pp. 432–437, Dec. 2021, doi: 10.32736/sisfokom.v10i3.1302.
D. A. Anggraini, M. Ikhsan, and S. Suhardi, “Implementation of the Naïve Bayes Algorithm in the SMS Spam Filtering System,” Journal of Computer Networks, Architecture and High Performance Computing, vol. 6, no. 2, pp. 838–849, May 2024, doi: 10.47709/cnahpc.v6i2.3875.
P. A. Raharja, M. F. Sidiq, and D. C. Fransisca, “Comparative Analysis of Multinomial Naïve Bayes and Logistic Regression Models for Prediction of SMS Spam,” JURNAL MEDIA INFORMATIKA BUDIDARMA, vol. 6, no. 3, p. 1290, Jul. 2022, doi: 10.30865/mib.v6i3.4019.
E. Triana, A. Irma Purnamasari, A. Bahtiar, and E. Tohidi, “Journal of Artificial Intelligence and Engineering Applications Improved Spam Email Detection Performance Based on Naïve Bayes Approach TF-IDF Vectorizer with Multi-Metric Optimization,” 2025. [Online]. Available: https://ioinformatic.org/
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2026 Journal of Artificial Intelligence and Engineering Applications (JAIEA)

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.








