Abstract

The instance reduction is one of the data preprocessing methods and aims to remove noises and (or) redundant instances from the training set. In the instance reduction, one of the most representative techniques is the edition method which can remove harmful instances from the training set to improve the prediction accuracy of k nearest neighbor (KNN). Nevertheless, most of existing edition methods still have some drawbacks, such as the parameter dependency, high computational time and relatively low accuracy. To solve these problems, we present a new fast parameter-free edition method based on local sets with natural neighbors (ELS). In ELS, we define a new concept of local sets by introducing natural neighbors. ELS can use the local sets to keep more reasonable class boundaries and effectively filtering out noisy instances (including global outliers). The main advantages of ELS are that (a) it is parameter-free; (b) it can remove global outliers and noisy instances; (c) it is relatively fast. Experiments clearly verify that (a) ELS outperforms existing representative edition methods in improving the prediction accuracy of KNN; (b) ELS can improve the performance of the condensation method and hybrid method in terms of both accuracy and reduction; (c) ELS consumes relatively low running time.

Highlights

  • The k-nearest neighbor (KNN) [1]–[3] is one of the most representative instance-based classifiers and usually found as a benchmark for experimental and theoretical studies in data mining, image processing, pattern recognition and machine learning, etc

  • In Edition Method Based on Local Sets (ELS), we introduce natural neighbors [33] to define a new concept for local sets without any parameters

  • Many hybrid methods, such as the ATISA [18], the BNNT [19] and the IRB [21], use the ENN as a step to filter out noisy instances

Read more

Summary

INTRODUCTION

The k-nearest neighbor (KNN) [1]–[3] is one of the most representative instance-based classifiers and usually found as a benchmark for experimental and theoretical studies in data mining, image processing, pattern recognition and machine learning, etc. Edition methods aim to improve the prediction accuracy of the trained classifier by filtering out harmful instances (i.e., noises). Li: ELS: A Fast Parameter-Free Edition Algorithm With Natural Neighbors-Based Local Sets for KNN. Some parameter-free methods [27], [31] are developed, they still achieve relatively low accuracy or (and) consume relatively high running time To solve these drawbacks above, we introduce a new fast parameter-free edition method with natural neighbors-based local sets (ELS) in this paper. (a) We propose a fast parameter-free edition algorithm (ELS) to remove noisy instances and several global outliers effectively. ELS is parameter-free, has faster computational time and improves the accuracy of KNN better.

RELATED WORK
NATURAL NEIGHBORS
EXPERIMENTAL SETTINGS
EXPERIMENT ONE
EXPERIMENT TWO
EXPERIMENT THREE
Findings
CONCLUSIONS

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.