Knearst Algorithm Analysis – Neighbor Breast Cancer Prediction Coimbra

I Gusti Prahmana; Kristina Annatasia Br Sitepu

doi:10.59934/jaiea.v1i3.97

Authors

I Gusti Prahmana STMIK Kaputama
Kristina Annatasia Br Sitepu STMIK KAPUTAMA

DOI:

https://doi.org/10.59934/jaiea.v1i3.97

Keywords:

K – Nearest Neighbor, Breast Cancer Coimbra

Abstract

A process to explain the results of the KNN algorithm analysis with the prediction of Breast Cancer Coimbra disease (Breast Cancer). The prediction output of the KNN algorithm will be added with the Simple Linear Regression algorithm modeling to measure the predictive data through a straight line as an illustration of the correlation relationship between 2 or more variables. Linear regression prediction is used as a technique for the relationship between variables in the prediction process of the Breast Cancer Coimbra data set (Breast Cancer). for the value of K in analyzing the KNN algorithm, take the nearest neighbor with the ranking results with K = 5 nearest neighbors which are taken in the KNN calculation. Which is where the output of the KNN algorithm classification will be analyzed with the Simple Linear Regression algorithm with Dependent (Cause) and Independent (effect) variables. The test results determine that the patient has breast cancer and the number of predictions based on age with glucose means that the patient is predicted to have breast cancer. analyze the KNN algorithm with Simple Liner Regression modeling with Python programming language.

Downloads

Download data is not yet available.

References

M. Dunhan, “Data Mining: Introductory and Advanced Topics. Prentice Hall,” Engineering, 2003.

J. Han and M. Kamber, “Data Mining : Concepts and Techniques ( 2nd edition ) Bibliographic Notes for Chapter 11 Applications and Trends in Data Mining,” SIGKDD Explor., 2006.

Eko Prasetyo, Data Mining : Konsep Dan Aplikasi Menggunakan Matlab. 2013.

F. Yunita, “Penerapan Data Mining Menggunkan Algoritma K-Means Clustring Pada Penerimaan Mahasiswa Baru (STUDI KASUS : UNIVERSITAS ISLAM INDRAGIRI),” Sistemasi, vol. 7, no. 3, 2018.

A. M. H. Pardede et al., “Implementation of Data Mining to Classify the Consumer’s Complaints of Electricity Usage Based on Consumer’s Locations Using Clustering Method,” 2019, doi: 10.1088/1742-6596/1363/1/012079.

S. R. Kumaran, M. S. Othman, and L. M. Yusuf, “Data mining approaches in business intelligence: Postgraduate data analytic,” J. Teknol., vol. 78, no. 8–2, 2016, doi: 10.11113/jt.v78.9544.

C. Vercellis, Business Intelligence: Data Mining and Optimization for Decision Making. 2009.

A. Danades, D. Pratama, D. Anggraini, and D. Anggriani, “Comparison of accuracy level K-Nearest Neighbor algorithm and support vector machine algorithm in classification water quality status,” 2017, doi: 10.1109/FIT.2016.7857553.

Y. Wang, Z. Pan, and Y. Pan, “A Training Data Set Cleaning Method by Classification Ability Ranking for the k -Nearest Neighbor Classifier,” IEEE Trans. Neural Networks Learn. Syst., vol. 31, no. 5, 2020, doi: 10.1109/TNNLS.2019.2920864.

J. Gou, Y. Zhan, Y. Rao, X. Shen, X. Wang, and W. He, “Improved pseudo nearest neighbor classification,” Knowledge-Based Syst., vol. 70, 2014, doi: 10.1016/j.knosys.2014.07.020.