Spam Message Classification Using the Naïve Bayes Algorithm Based on RapidMiner

Authors

  • Muhamad Yusup Universitas Bina Sarana Informatika
  • Mochamad Isham Fadillah Universitas Bina Sarana Informatika
  • Rifky Adinanta Fauzanie Universitas Bina Sarana Informatika
  • Risca Lusiana Pratiwi Universitas Bina Sarana Informatika
  • Rani Irma Handayani Universitas Bina Sarana Informatika
  • Euis Widanengsih Universitas Bina Sarana Informatika

DOI:

https://doi.org/10.59934/jaiea.v5i2.1811

Keywords:

Naïve Bayes, RapidMiner, Spam Classification, Text Mining, Machine Learning, Natural Language Processing

Abstract

This study implements the Naïve Bayes algorithm for classifying spam and non-spam (ham) messages using the RapidMiner Studio platform. The dataset used was obtained from the SMS Spam Collection Dataset on the Kaggle platform, which consists of 5,759 messages with a distribution of 4,075 ham messages and 1,291 spam messages. The research stages included text pre-processing, model training, and performance evaluation using accuracy, precision, recall, and F1-score metrics. The experimental results showed that the Naïve Bayes model achieved an accuracy of 89.64% with a precision of 56.93%, a recall of 100%, and an F1-score of 72.56%. The research findings indicate that the Naïve Bayes algorithm is effective in detecting spam messages with adequate accuracy, and prove that RapidMiner is an efficient tool for implementing machine learning methods in text classification.

Downloads

Download data is not yet available.

References

H. A. Al-Kaabi, A. Darroudi, A. K. Jasim, H. Alaa, and A.-K. Hussain, “Survey of SMS Spam Detection Techniques: A Taxonomy,” Alkadhim Journal for Computer Science, vol. 4, no. 2, 2024, doi: 10.53523/ijoirVolxIxIDxx.

A. Sauddin, T. Azisah Nurman, N. Aeni, and S. Rahayu Sudarta, “Klasifikasi Spam SMS Menggunakan Naïve Bayes Classifier dan K-Nearest Neighbors.”

S. Charan Lanka, K. Pujita, K. Akhila, S. Mondal, P. Vidya Sagar, and S. Bulla, “International Journal of INTELLIGENT SYSTEMS AND APPLICATIONS IN ENGINEERING Optimization of Naïve Bayes Classifier for Spam E-Mail Detection.” [Online]. Available: www.ijisae.org

D. Irawan, E. B. Perkasa, Y. Yurindra, D. Wahyuningsih, and E. Helmud, “Perbandingan Klassifikasi SMS Berbasis Support Vector Machine, Naive Bayes Classifier, Random Forest dan Bagging Classifier,” Jurnal Sisfokom (Sistem Informasi dan Komputer), vol. 10, no. 3, pp. 432–437, Dec. 2021, doi: 10.32736/sisfokom.v10i3.1302.

D. A. Anggraini, M. Ikhsan, and S. Suhardi, “Implementation of the Naïve Bayes Algorithm in the SMS Spam Filtering System,” Journal of Computer Networks, Architecture and High Performance Computing, vol. 6, no. 2, pp. 838–849, May 2024, doi: 10.47709/cnahpc.v6i2.3875.

P. A. Raharja, M. F. Sidiq, and D. C. Fransisca, “Comparative Analysis of Multinomial Naïve Bayes and Logistic Regression Models for Prediction of SMS Spam,” JURNAL MEDIA INFORMATIKA BUDIDARMA, vol. 6, no. 3, p. 1290, Jul. 2022, doi: 10.30865/mib.v6i3.4019.

E. Triana, A. Irma Purnamasari, A. Bahtiar, and E. Tohidi, “Journal of Artificial Intelligence and Engineering Applications Improved Spam Email Detection Performance Based on Naïve Bayes Approach TF-IDF Vectorizer with Multi-Metric Optimization,” 2025. [Online]. Available: https://ioinformatic.org/

Downloads

Published

2026-02-15

How to Cite

Muhamad Yusup, Mochamad Isham Fadillah, Rifky Adinanta Fauzanie, Risca Lusiana Pratiwi, Rani Irma Handayani, & Euis Widanengsih. (2026). Spam Message Classification Using the Naïve Bayes Algorithm Based on RapidMiner. Journal of Artificial Intelligence and Engineering Applications (JAIEA), 5(2), 2183–2186. https://doi.org/10.59934/jaiea.v5i2.1811