Efficient Parallel Processing of k-Nearest Neighbor Queries by Using a Centroid-based and Hierarchical Clustering Algorithm

Elaheh Gavagsaz

doi:10.30564/aia.v4i1.4668

Abstract

The k-Nearest Neighbor method is one of the most popular techniques for both classification and regression purposes. Because of its operation, the application of this classification may be limited to problems with a certain number of instances, particularly, when run time is a consideration. However, the classification of large amounts of data has become a fundamental task in many real-world applications. It is logical to scale the k-Nearest Neighbor method to large scale datasets. This paper proposes a new k-Nearest Neighbor classification method (KNN-CCL) which uses a parallel centroid-based and hierarchical clustering algorithm to separate the sample of training dataset into multiple parts. The introduced clustering algorithm uses four stages of successive refinements and generates high quality clusters. The k-Nearest Neighbor approach subsequently makes use of them to predict the test datasets. Finally, sets of experiments are conducted on the UCI datasets. The experimental results confirm that the proposed k-Nearest Neighbor classification method performs well with regard to classification accuracy and performance.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Artificial Intelligence Advances	Publication Date: May 26, 2022
Citations: 3	License type: CC BY-NC 4.0

R Discovery Prime

R Discovery Prime

Efficient Parallel Processing of k-Nearest Neighbor Queries by Using a Centroid-based and Hierarchical Clustering Algorithm

Abstract

Talk to us

Similar Papers

More From: Artificial Intelligence Advances

Lead the way for us

Similar Papers

Classification based on K-Nearest Neighbor and Logistic Regression method of coffee using Electronic Nose
D R Prehanto ... A D Indriyanti
IOP Conference Series: Materials Science and Engineering | VOL. 1098
D R Prehanto, et. al.D R Prehanto ... A D Indriyanti
01 Mar 2021
IOP Conference Series: Materials Science and Engineering | VOL. 1098

Feature Selection using Information Gain on the K-Nearest Neighbor (KNN) and Modified K-Nearest Neighbor (MKNN) Methods for Chronic Kidney Disease Classification
Aweldri Ramadhan ... Siti Ramadhani
Jurnal CoreIT: Jurnal Hasil Penelitian Ilmu Komputer dan Teknologi Informasi | VOL. 9
Aweldri Ramadhan, et. al.Aweldri Ramadhan ... Siti Ramadhani
14 Dec 2023
Jurnal CoreIT: Jurnal Hasil Penelitian Ilmu Komputer dan Teknologi Informasi | VOL. 9

Comparison of Support Vector Machine and K-Nearest Neighbors in Breast Cancer Classification
Adinda Ayu Lestari ... Yuli Andriani
Pattimura International Journal of Mathematics (PIJMath) | VOL. 1
Adinda Ayu Lestari, et. al.Adinda Ayu Lestari ... Yuli Andriani
01 May 2022
Pattimura International Journal of Mathematics (PIJMath) | VOL. 1

Analyze Important Features of PIMA Indian Database For Diabetes Prediction Using KNN
Aziz Perdana ... Donny Avianto
Jurnal Sisfokom (Sistem Informasi dan Komputer) | VOL. 12
Aziz Perdana, et. al.Aziz Perdana ... Donny Avianto
13 Mar 2023
Jurnal Sisfokom (Sistem Informasi dan Komputer) | VOL. 12

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Efficient Parallel Processing of k-Nearest Neighbor Queries by Using a Centroid-based and Hierarchical Clustering Algorithm

Abstract

Talk to us

Similar Papers

More From: Artificial Intelligence Advances