Analysis and Prediction Of Pima Indian Diabetes Dataset Using SDKNN Classifier Technique

Radhanath Patra,Bonomali Khuntia

doi:10.1088/1757-899x/1070/1/012059

Radhanath Patra, Bonomali Khuntia

Open Access

https://doi.org/10.1088/1757-899x/1070/1/012059

Copy DOI

Abstract

The newly proposed weighted k nearest neighbour is known as standard deviation K nearest neighbour(SDKNN) classifier technique. It is based on the principle of standard deviation. Standard deviation measures spreading of attribute about mean. Spreading of attribute plays a significant role to improve the classification accuracy of a dataset. Most of our distance calculation method between two points is determined by using euclidean distance process for finding nearest neighbour. Our proposed technique is based on a new distance calculation formula to find nearest neighbour in KNN. We apply here standard deviations of attributes as power for calculating distance between train dataset and test dataset. Distance calculation between two points in k nearest neighbour classifier is modified according to the standard deviation of attribute. In this paper, standard deviation of attributes are used. In first attempt, we have used standard deviation of attributes as power for calculating K Nearest Neighbour to improve classification accuracy and in second attempt, based on mean of standard deviation attributes, distance in K Nearest Neighbour is processed to further improve the classification accuracy. Our concept is implemented on Pima Indian Diabetes Dataset (PIDD). The analysis on Pima Indian Diabetes Dataset (PIDD) is carried out by splitting dataset in to 90% training data and 10% testing data. We have found that, in our proposed technique, average classification accuracy gives result 83.2%, a great improvement as compared to other conventional technique.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IOP Conference Series: Materials Science and Engineering	Publication Date: Feb 1, 2021
Citations: 8	License type: cc-by

R Discovery Prime

R Discovery Prime

Analysis and Prediction Of Pima Indian Diabetes Dataset Using SDKNN Classifier Technique

Abstract

Talk to us

Similar Papers

More From: IOP Conference Series: Materials Science and Engineering

Lead the way for us

Similar Papers

Trainable segmentation for transmission electron microscope images of inorganic nanoparticles.
Cameron G Bell ... Kevin P Treder
Journal of Microscopy | VOL. 288
Cameron G Bell, et. al.Cameron G Bell ... Kevin P Treder
11 May 2022
Journal of Microscopy | VOL. 288

Adaptive Learning-Based -Nearest Neighbor Classifiers With Resilience to Class Imbalance.
Sankha Subhra Mullick ... Shounak Datta
IEEE Transactions on Neural Networks and Learning Systems | VOL. 29
Sankha Subhra Mullick, et. al.Sankha Subhra Mullick ... Shounak Datta
27 Mar 2018
IEEE Transactions on Neural Networks and Learning Systems | VOL. 29

Improving nearest neighbor classification with cam weighted distance
Chang Yin Zhou ... Yan Qiu Chen
Pattern Recognition | VOL. 39
Chang Yin Zhou, et. al.Chang Yin Zhou ... Yan Qiu Chen
26 Oct 2005
Pattern Recognition | VOL. 39

A novel weighted nearest neighbor ensemble classifier
Sam Hamzeloo ... Homeira Shahparast
-
Sam Hamzeloo, et. al.Sam Hamzeloo ... Homeira Shahparast
01 May 2012
01 May 2012

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Analysis and Prediction Of Pima Indian Diabetes Dataset Using SDKNN Classifier Technique

Abstract

Talk to us

Similar Papers

More From: IOP Conference Series: Materials Science and Engineering