Abstract

With the fast pace in collating big data healthcare framework, a streamlining of machine learning algorithms together with apache spark has been designed for effective classification of images and stages of lung cancer to the greatest extent. We experiment on a combination of binary classification (SVM-non linear SVM with Radial Basis Function RBF) and Multi-class classification (WTA-SVM winner-takes-all with support vector machine) with threshold technique (T-BMSVM) to classify nodules into malignant or benign nodules and also their malignancy levels respectively. We have argued for handling and processing large sizes of data sets as sputum cell images in the field of classification using the map-reduce framework in Pyspark, which works better with Apache spark. The outsourced outcomes reveal that classification works better with Multi-class SVM, therefore, may be considered as a promising tool to diagnose the stages of nodules. Also, Scalability and convergence analysis embed to prove the improving results of multi-class classification than SVM.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.