Application of K-Means Clustering: Bot Activity and Sybill Attack Detection on the Solana Blockchain
DOI:
https://doi.org/10.59934/jaiea.v5i3.2235Keywords:
Big Data; Solana Blockchain; Data Mining; Clustering; K-Means; Bot DetectionAbstract
With the development of Blockchain technology, for example, the Solana Blockchain has generated enormous amounts of data and possesses the 5Vs of Big Data: volume, velocity, value, veracity, and variety. This has brought challenges, for example, in distinguishing transactions carried out by humans from automated bots that often carry out market manipulation or Sybil attacks. Therefore, this research aims to detect bot activity on the Solana network by applying data mining techniques, namely the K-Means Clustering algorithm. From the large transaction data that will be extracted only a portion from the public Solana dataset in BigQuery, it will then be processed through a preprocessing stage to normalize the data and simplify complex data into simpler variables before being grouped. Because the extracted data is in the form of unlabeled data groups (unsupervised data), the Clustering Method is used because of its ability to recognize data groups based on behavioral or characteristic similarities without requiring initial data labels (unsupervised learning). The main variables used for the grouping process include transaction frequency, inter-arrival time (inter-transaction), and the number of unique program interactions. The results of this analysis are expected to map transaction accounts into several clusters based on their transaction patterns, allowing for the classification of bots and humans. This research is expected to demonstrate that Big Data infrastructure such as Google Cloud, using data mining techniques (Clustering), can be used to maintain the security and integrity of the blockchain ecosystem.
Downloads
References
Hartawan, M. S., dkk. (2022). Big Data (Informasi dan Kasus). Kun Fayakun.
Kurniawan, S. D., dkk. (2024). Big Data: Mengenal Big Data & Implementasinya di Berbagai Bidang. Sonpedia Publishing Indonesia.
Maulani, G., dkk. (2024). Penerapan Data Mining di Berbagai Bidang. HEI Publishing.
Wibowo, A. (2025). Pengantar AI, Big Data dan Ilmu Data. Yayasan Prima Agus Teknik.
Yulia, & Silalahi, M. (2021). Penerapan Data Mining Clustering Dalam Mengelompokan Buku Dengan Metode K-Means. Indonesian Journal of Computer Science.
Singh, S., & Chander, S. (2024). Prevention of Sybil Attack on Blockchain to Ensure Security of Wireless Sensor Network. International Journal of Intelligent Systems and Applications in Engineering, 12(8s), 14–24.
Hsiao, S.-J., et al. (2024). Enhancing the Security and Reliability of Wireless Sensor Networks Using Blockchain Technology. International Journal of Intelligent Systems and Applications in Engineering (IJISAE), 12(8s), 14–24.
Nakamoto, S. (2008). Bitcoin: A Peer-to-Peer Electronic Cash System. Retrieved from www.bitcoin.org
Drescher, D. (2017). Blockchain basics: A non-technical introduction in 25 steps. New York: Apress
Rousseeuw, P. J. (1987). Silhouettes: A Graphical Aid to the Interpretation and Validation of Cluster Analysis. Journal of Computational and Applied Mathematics, 20, 53–65.
Leskovec, J., Rajaraman, A., & Ullman J. D. (2020). Mining of Massive Datasets (3rded.). Cambridge University Press.
Pham, T., & Lee, S. (2016). Anomaly Detection in Bitcoin Network Using Unsupervised Learning Methods.
Amelia, V. R., Pratama, P., Ramadhan, M. A. N., Lampang, M. A., Pratama, M. H., Mentari, T., & Widyaningsih, D. S. (2024). A systematic literature review of blockchain-based triple-entry accounting in crypto assets. Jurnal Bisnis Mahasiswa, 4(4).
Pradana, I. G. M. T., Djatna, T., Hermadi, I., & Yuliashih, I. (2024). Model of integrated assessment layer for implementation readiness of blockchain-based traceability system. Jurnal Teknologi Industri Pertanian, 34(2), 127–139.
Raghav, N., & Bhola, A. K. (2023). Detecting Sybil attack in blockchain and preventing through universal unique identifier in health care sector for privacy preservation. International Journal on Recent and Innovation Trends in Computing and Communication, 11(8).
Supriyanto, D., Desembrianita, E., Prihadi, D. J., Suryaningrum, D. A., & Alhakim, B. A. (2025). The urgency of blockchain in developing the tourism industry in Indonesia. Jurnal Teknologi dan Manajemen Industri Terapan (JTMIT), 4(1), 75–80.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2026 Journal of Artificial Intelligence and Engineering Applications (JAIEA)

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.








