Abstract

Extrapulmonary small cell carcinoma (EPSCC) is rare, and its knowledge is mainly extrapolated from small cell lung carcinoma. Reliable survival prediction tools are lacking. A total of 3,921 cases of EPSCC were collected from the Surveillance Epidemiology and End Results (SEER) database, which form the training and internal validation cohorts of the survival prediction model. The endpoint was an overall survival of 0.5-5 years. Internal validation performances of machine learning algorithms were compared, and the best model was selected. External validation (n = 68) was performed to evaluate the generalization ability of the selected model. Among machine learning algorithms, the random forest model performs best on internal validation, whose area under the curve (AUC) is 0.736-0.800. The net benefit is higher than the TNM classification in decision curve analysis. The AUC of this model on the external validation cohort is 0.739-0.811. This model was then deployed online as a free, publicly available prediction tool of EPSCC (http://42.192.80.13:4399/). This study provides an excellent online survival prediction tool for EPSCC with machine learning and large-scale data. Age, TNM stages, and surgery (including potential performance status information) are the most critical factors for the prediction model.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.