Abstract

Breast cancer is one of the most common and deadliest cancer types in women worldwide. Research on this disease has become very important because early diagnosis stages, clinical applications and the speed of response to treatment are facilitated in diseases such as cancer. In this study, an approach is proposed in which a Subspace kNN algorithm is used together with Stacked autoencoder (SAE) for diagnosis of disease on the breast cancer microarray dataset for the first time. Such hybrid approaches can provide better results when classifying data sets with high-dimensional and uncertainty. The data set used in the study was taken from Kent Ridge-2 database. It consists of 97 samples (51 benign, 46 malicious) and 24482 attributes. The performance of the proposed method was evaluated and the results were compared with other well-known methods of dimension reduction and machine learning. As a result of the comparison, the data set was reduced to 100 attributes by using SAE and Subspace kNN and 91.24% accuracy was achieved. The result obtained provides important classification accuracy, especially in high-dimensional data sets. The importance of this study is that the models that were created by using various classifiers to increase the success rate of the stacked autoencoder-softmax classifier model in the breast cancer microarray data set were applied for the first time. In this regard, it is considered that automation-based studies will provide diagnostic decision support system a solution using the proposed method in future works.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.