Accuracy in Sentiment Analysis of the by.U Application Using Naïve Bayes and SMOTE Techniques

Authors

  • Athhar Hafizha Luthfi Teknik Informatika, STMIK IKMI Cirebon
  • Ahmad Faqih Teknik Informatika, STMIK IKMI Cirebon
  • Gifthera Dwilestari Sistem Informasi, STMIK IKMI Cirebon

DOI:

https://doi.org/10.59934/jaiea.v4i2.737

Keywords:

User Sentiment, Naïve Bayes, SMOTE, Sentiment Analysis, by.U

Abstract

Imbalanced data is a significant challenge in sentiment analysis, as it often impacts the performance of machine learning models. This study applies the Naïve Bayes algorithm, enhanced with the Synthetic Minority Oversampling Technique (SMOTE), to address class imbalance in user reviews of the by.U application. Using the Knowledge Discovery in Databases (KDD) framework, the research involves data selection, preprocessing (text cleaning, normalization, stemming), transformation using TF-IDF, and train-test data splitting. SMOTE is applied to the training data to improve minority class representation, while Naïve Bayes performs sentiment classification. Model evaluation using cross-validation demonstrates that SMOTE increases accuracy from 84.42% to 85.83%. These results underscore the effectiveness of integrating SMOTE with Naïve Bayes in addressing imbalanced data, offering meaningful insights into user sentiment and aiding the development of improved features for the by.U application.

Downloads

Download data is not yet available.

References

F. Sidik, I. Suhada, A. H. Anwar, and F. N. Hasan, “Analisis Sentimen Terhadap Pembelajaran Daring Dengan Algoritma Naive Bayes Classifier,” J. Linguist. Komputasional, vol. 5, no. 1, pp. 34–43, 2022, doi: 10.26418/jlk.v5i1.79.

I. Wickramasinghe and H. Kalutarage, “Naive Bayes: applications, variations and vulnerabilities: a review of literature with code snippets for implementation,” Soft Comput., vol. 25, no. 3, pp. 2277–2293, 2021, doi: 10.1007/s00500-020-05297-6.

E. Sutoyo and M. A. Fadlurrahman, “Penerapan SMOTE untuk Mengatasi Imbalance Class dalam Klasifikasi Television Advertisement Performance Rating Menggunakan Artificial Neural Network,” J. Edukasi dan Penelit. Inform., vol. 6, no. 3, pp. 379–385, 2020, doi: 10.26418/jp.v6i3.42896.

H. Utami, “Analisis Sentimen dari Aplikasi Shopee Indonesia Menggunakan Metode Recurrent Neural Network,” Indones. J. Appl. Stat., vol. 5, no. 1, pp. 31–38, 2022, doi: 10.13057/ijas.v5i1.56825.

A. N. Kasanah, M. Muladi, and U. Pujianto, “Penerapan Teknik SMOTE untuk Mengatasi Imbalance Class dalam Klasifikasi Objektivitas Berita Online Menggunakan Algoritma KNN,” J. RESTI (Rekayasa Sist. dan Teknol. Informasi), vol. 3, no. 2, pp. 196–201, 2019, doi: 10.29207/resti.v3i2.945.

D. Darwis, N. Siskawati, and Z. Abidin, “Penerapan Algoritma Naive Bayes Untuk Analisis Sentimen Review Data Twitter Bmkg Nasional,” J. Tekno Kompak, vol. 15, no. 1, pp. 131–145, 2021, doi: 10.33365/jtk.v15i1.744.

J. W. Iskandar and Y. Nataliani, “Perbandingan Naïve Bayes, SVM, dan k-NN untuk Analisis Sentimen Gadget Berbasis Aspek,” J. RESTI (Rekayasa Sist. dan Teknol. Informasi), vol. 5, no. 6, pp. 1120–1126, 2021, doi: 10.29207/resti.v5i6.3588.

I. S. Hidayat, S. Defit, and G. W. Nurcahyo, “Simulasi dalam Optimalisasi Pengadaan Barang menggunakan Metode K-Mean Clustering,” J. Sistim Inf. dan Teknol., vol. 3, no. 4, pp. 281–286, 2021, doi: 10.37034/jsisfotek.v3i4.79.

F. Putra, H. F. Tahiyat, R. M. Ihsan, R. Rahmaddeni, and L. Efrizoni, “Penerapan Algoritma K-Nearest Neighbor Menggunakan Wrapper Sebagai Preprocessing untuk Penentuan Keterangan Berat Badan Manusia,” MALCOM Indones. J. Mach. Learn. Comput. Sci., vol. 4, no. 1, pp. 273–281, 2024, doi: 10.57152/malcom.v4i1.1085.

L. A. Andika, P. A. N. Azizah, and R. Respatiwulan, “Analisis Sentimen Masyarakat terhadap Hasil Quick Count Pemilihan Presiden Indonesia 2019 pada Media Sosial Twitter Menggunakan Metode Naive Bayes Classifier,” Indones. J. Appl. Stat., vol. 2, no. 1, p. 34, 2019, doi: 10.13057/ijas.v2i1.29998.

W. Yulita, E. D. Nugroho, and M. H. Algifari, “Analisis Sentimen Terhadap Opini Masyarakat Tentang Vaksin Covid-19 Menggunakan Algoritma Naïve Bayes Classifier,” J. Data Min. dan Sist. Inf., vol. 2, no. 2, pp. 1–9, 2021, doi: 10.33365/jdmsi.v2i2.1344.

A. Karimah, G. Dwilestari, and M. Mulyawan, “Analisis Sentimen Komentar Video Mobil Listrik Di Platform Youtube Dengan Metode Naive Bayes,” JATI (Jurnal Mhs. Tek. Inform., vol. 8, no. 1, pp. 767–737, 2024, doi: 10.36040/jati.v8i1.8373.

E. Fitri, “Analisis Sentimen Terhadap Aplikasi Ruangguru Menggunakan Algoritma Naive Bayes, Random Forest Dan Support Vector Machine,” J. Transform., vol. 18, no. 1, pp. 71–80, 2020, doi: 10.26623/transformatika.v18i1.2317.

A. I. Tanggraeni and M. N. N. Sitokdana, “Analisis Sentimen Aplikasi E-Government pada Google Play Menggunakan Algoritma Naïve Bayes,” JATISI (Jurnal Tek. Inform. dan Sist. Informasi), vol. 9, no. 2, pp. 785–795, 2022, doi: 10.35957/jatisi.v9i2.1835.

C. A. Ramezan, T. A. Warner, and A. E. Maxwell, “Evaluation of sampling and cross-validation tuning strategies for regional-scale machine learning classification,” Remote Sens., vol. 11, no. 2, pp. 1–21, 2019, doi: 10.3390/rs11020185.

Downloads

Published

2025-02-15

How to Cite

Athhar Hafizha Luthfi, Ahmad Faqih, & Gifthera Dwilestari. (2025). Accuracy in Sentiment Analysis of the by.U Application Using Naïve Bayes and SMOTE Techniques. Journal of Artificial Intelligence and Engineering Applications (JAIEA), 4(2), 708–719. https://doi.org/10.59934/jaiea.v4i2.737