Abstract

e13554 Background: Approximately 3-5% of human cancers are cancer of unknown primary (CUP). Treatment of a cancer patient is largely dependent on the tumor origin. Therefore, identification of the tumor origin can improve the survival of patients with CUP. We developed a multi-class classification model using DNA methylation profile as biomarker to determine the primary site of CUP. Methods: We split 7,082 primary tumor samples of 19 cancers and 679 normal samples of 15 tissues from TCGA into a 75% training set and a 25% testing set to develop the classification model. We started with multiple support vector machine (SVM) models, and then combined them into an optimal multi-class ensemble model. Predictors included tumor-specific markers and tissue-specific markers, which were filtered by comparing between groups. Only the training samples were used for feature selection and model development. A validation dataset consisting of 150 primary tissues, 54 metastasis tissues, 105 plasma samples with known cancer site origins from 12 classes was generated in house by a self-designed panel. Performance was measured by area under the curve (AUC) using the one-vs-all approach. Results: 7,453 tumor-specific and 1,533 tissue-specific markers were selected for model construction. AUCs of all cancer types were high in TCGA training and testing set (AUC≥0.96 for all classes). In our validation tissues, esophageal cancer, pancreatic cancer, colorectal cancer, lung adenocarcinoma, breast cancer and liver cancer achieved high AUC in both primary (0.83, 0.83, 0.82, 0.82, 0.80 and 0.79 respectively) and metastasis (0.74, 0.92, 0.86, 0.61, 0.92 and 0.65 respectively). Lung adenocarcinoma, colorectal cancer, liver cancer, breast cancer and esophageal cancer even achieved high AUC in the plasmas. Conclusions: Performance of our model in tissue and plasma samples indicated the potential clinical application of DNA methylation profile in unknown cancer origin identification.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.