Critical Sentiment Analysis of Tokopedia Electronic Products Using SVM-Logistic & TF-IDF Ensemble Methods
DOI:
https://doi.org/10.59934/jaiea.v4i3.1194Keywords:
Analisis Sentimen, Ensemble Learning, Produk Elektronik, SVM, TF-IDF, TokopediaAbstract
This research aims to analyze customer review sentiment for electronic products on Tokopedia using a Support Vector Machine (SVM) classification method with Term Frequency-Inverse Document Frequency (TF-IDF) based features, enhanced by an ensemble approach with logistic regression. Utilizing a Tokopedia review dataset from 2023, this study seeks to identify critical sentiments embedded in customer reviews, which can provide valuable insights for sellers and the platform. The methodology involves comprehensive textual data preprocessing, feature extraction using TF-IDF for vector representation, and the implementation of an SVM-Logistic ensemble model via a stacking strategy. The results indicate that the SVM-Logistic ensemble model can classify review sentiments with high accuracy and superior performance metrics, effectively distinguishing between positive, negative, and neutral sentiments. These findings highlight the significant potential of machine learning methods in automatically understanding customer feedback, which is crucial for continuous improvement in product and service quality on e-commerce platforms, and for supporting more strategic business decisions.
Downloads
References
S. P. Mohanty, U. Choppali, and E. Kougianos, “Everything you wanted to know about smart cities,” IEEE Consum. Electron. Mag., vol. 5, no. 3, pp. 60–70, 2016, doi: 10.1109/MCE.2016.2556879.
B. Pang and L. Lee, "Opinion Mining and Sentiment Analysis," Foundations and Trends in Information Retrieval, vol. 2, no. 1–2, pp. 1–135, 2008.
A. M. H. Pardede, M. Zarlis, and H. Mawengkang, “Optimization of Health Care Services with Limited Resources,” Int. J. Adv. Sci. Eng. Inf. Technol., vol. 9, no. 4, pp. 1444–1449, 2019, doi: 10.18517/ijaseit.9.4.8348.
J. Devlin, M. W. Chang, K. Lee, and K. Toutanova, "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding," arXiv preprint arXiv:1810.04805, 2018.
F.-Z. Younsi, A. Bounnekar, D. Hamdadou, and O. Boussaid, “SEIR-SW, Simulation Model of Influenza Spread Based on the Small World Network,” Tsinghua Sci. Technol., vol. 20, no. 5, pp. 460–473, 2015.
C. D. Manning, P. Raghavan, and H. Schütze, Introduction to Information Retrieval. Cambridge University Press, 2008.
P. D. Turney, "Thumbs up or thumbs down? Semantic orientation applied to unsupervised classification of reviews," in Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, 2002, pp. 417–424. doi: 10.3115/1073083.1073153.
R. Socher, A. Perelygin, J. Wu, J. Chuang, C. D. Manning, A. Ng, and C. Potts, "Recursive Deep Models for Semantic Compositionality Over a Sentiment Treebank," Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, pp. 1631-1642, 2013.
K. Sparck Jones, "A statistical interpretation of term specificity and its application in retrieval," Journal of Documentation, vol. 60, no. 5, pp. 493-502, 2004.
C. Cortes and V. Vapnik, "Support-Vector Networks," Machine Learning, vol. 20, no. 3, pp. 273-297, 1995.
S. B. Kotsiantis, "Supervised Machine Learning: A Review of Classification Techniques," Informatica, vol. 31, no. 3, pp. 249-268, 2007.
D. W. Hosmer Jr, S. Lemeshow, and R. X. Sturdivant, Applied Logistic Regression. John Wiley & Sons, 2013.
T. M. Mitchell, Machine Learning. McGraw-Hill, 1997.
T. G. Dietterich, "Ensemble methods in machine learning," in International Workshop on Multiple Classifier Systems, 2000, pp. 1-15. doi: 10.1007/3-540-45014-9_1.
L. Breiman, "Stacked regressions," Machine Learning, vol. 24, no. 1, pp. 49-64, 1996.
A. Karmakar, S. S. Roy, F. Vercauteren, and I. Verbauwhede, “Efficient finite field multiplication for isogeny based post quantum cryptography,” 2017, doi: 10.1007/978-3-319-55227-9_14.
A. M. H. Pardede, Y. Maulita, and R. Buaton, “Application modeling ipv6 (internet protocol version 6) on e-id card for identification number for effectiveness and efficiency of registration process identification of population,” in Journal of Physics: Conference Series, 2018, vol. 978, no. 1, doi: 10.1088/1742-6596/978/1/012017.
W. A. Jabbar, W. K. Saad, and M. Ismail, “MEQSA-OLSRv2: A multicriteria-based hybrid multipath protocol for energy-efficient and QoS-aware data routing in MANET-WSN convergence scenarios of IoT,” IEEE Access, 2018, doi: 10.1109/ACCESS.2018.2882853.
D. Niyigena, C. Habineza, and T. S. Ustun, “Computer-based smart energy management system for rural health centers,” 2016, doi: 10.1109/IRSEC.2015.7455005.
A. K. Singh, B. Singh, and M. Singh, "Sentiment Analysis using Machine Learning Techniques: A Comparative Study," International Journal of Advanced Research in Computer Science and Software Engineering, vol. 5, no. 6, pp. 83-88, 2015.
W. A. Jabbar, N. T. D. N. N. R. Saady, W. K. Saad, and M. Ismail, "A Hybrid Multi-criteria Based Multipath Protocol for Energy-efficient and QoS-aware Data Routing in MANET-WSN Convergence Scenarios of IoT," IEEE Access, vol. 6, pp. 80562-80579, 2018. doi: 10.1109/ACCESS.2018.2882853.
A. M. H. Pardede, A. S. Ginting, and M. Zarlis, "Management of data and information systems in health services based on information technology," Journal of Physics: Conference Series, vol. 1116, no. 1, p. 012002, 2018. doi: 10.1088/1742-6596/1116/1/012002.
Y. Bengio, A. Courville, and P. Vincent, "Representation Learning: A Review and New Perspectives," IEEE transactions on pattern analysis and machine intelligence, vol. 35, no. 8, pp. 1798-1828, 2013.
T. Mikolov, K. Chen, G. Corrado, and J. Dean, "Efficient Estimation of Word Representations in Vector Space," arXiv preprint arXiv:1301.3781, 2013.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2025 Journal of Artificial Intelligence and Engineering Applications (JAIEA)

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.