Abstract
Machine learning to diagnose breast cancer is an important, real-world medical problem. As one of the most popular machine learning algorithm, K-Nearest Neighbors (KNN) algorithm is widely used in breast cancer classification and can provide high classification accuracy and effective diagnostic capabilities. However, selection of parameter K is still an unsolved problem for K-Nearest Neighbors (KNN). To address the problem, we introduce a novel neighbor form, Natural Neighbor (NaN), which is obtained adaptively by its search algorithm. We firstly use a noise filter, which called edited natural neighbor algorithm (ENaN), to eliminate the noises and global outliers of Wisconsin’s breast cancer prognosis dataset. Then, cleaned data set and natural neighbor algorithm are used to construct a new non parameter diagnostic system. The main advantages of the proposed algorithm are that it does not need any parameters and can maintain high classification accuracy. Experiments show that the classification accuracy of proposed method is similar to the highest classification accuracy obtained by traditional K-Nearest Neighbors algorithm with different K values.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.