Abstract

When identifying an unknown sample, k nearest neighbors (KNN) algorithm requires calculating the distances between this sample and all samples in the training set. It cannot satisfy high real time situation, such power disturbance recognition. We propose a novel method to reduce the size of the training set for KNN algorithm. The proposed method is based on iterative clustering. Clustering the original training set, if the cluster only contains one class, the samples in this cluster is represented by the cluster's center; otherwise, we cluster again. Finally, the original training set is represented by the subset which consists of all cluster's centers. Experimental results show that the classifier become much simpler after sample reduction. For the problem of power disturbance, only 6.28 percent of the original training set requires to be reserved while the accuracy has no significant difference.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.