Abstract
Abstract Introduction Various classes of genomic alterations can now be robustly detected from targeted Next Generation Sequencing (NGS)-based tumor profiling assays. Despite recent advances in genome-directed cancer therapies and lineage-agnostic basket clinical trials, tissue of origin remains a critical determinant of tumor biology and therapeutic sensitivity. Therefore, efficiently harnessing the mutational information to predict tissue of origin from NGS data can further aid diagnosis and treatment. Methods We have developed a novel random forest machine learning classifier that infers tissue of origin from the mutational features of each tumor NGS profile. We trained and validated the model using prospective sequencing data from >30,000 patients profiled by MSK-IMPACT, a custom FDA-authorized clinical sequencing assay. An initial version of this classifier trained on a smaller cohort, presented previously, has been further optimized to incorporate additional tumor types, genomic features, mutational signatures, and accessibility features for clinicians. Altogether, the model evaluates a comprehensive range of mutational features to generate predictions. In addition, we developed a framework for the prospective clinical implementation of our method that allows for extension to the expanding MSK-IMPACT cohort and utilization at the point of care. Results Overall, we predicted the correct cancer type in 74% of cases, with nearly half of cases predicted with high confidence (>95%). In order to make this tool accessible to pathologists for real-time diagnostic and treatment decisions, we also have implemented APIs that transmit our classifier predictions to the cBioPortal for Cancer Genomics and the MPath console from MSK Molecular Diagnostics Service. With the additional samples and genomic features, our model predictions have been improved and are considered during clinical review and sign-out. This practice has brought about critical diagnostic changes and orthogonal validation in several cases. In one case, a patient referred to MSK with a misdiagnosis of metastatic breast cancer to the bladder underwent MSK-IMPACT sequencing for her primary breast tumor and bladder lesion. The model predictions, confirmed by orthogonal testing, classified two lesions as independent primaries of breast and bladder origins, leading to significant changes in treatment regimens. Conclusion Our work delineates the framework of utilizing and optimizing machine learning models on NGS-based sequencing data to aid diagnosis and treatment. The same framework can be applied to cell-free DNA sequencing, which will capture tumor spatial heterogeneity and improve classifier performance. These results indicate that leveraging machine learning to predict tissue of origin complements conventional histologic review to provide integrated diagnoses, often with critical therapeutic implications. Citation Format: Youyun Zheng, Alexander V. Penson, Niedzica Camacho, Evan Biederstedt, Ahmet Zehir, Cyriac Kandoth, Aijazuddin Syed, Anna M. Varghese, Hikmat A. Al-Ahmadie, Nikolaus Schultz, Marc Ladanyi, David B. Solit, David S. Klimstra, David Hyman, Barry Taylor, Michael F. Berger. Clinical validation of a genomics-based classifier to predict tissue of origin from targeted tumor sequencing [abstract]. In: Proceedings of the American Association for Cancer Research Annual Meeting 2019; 2019 Mar 29-Apr 3; Atlanta, GA. Philadelphia (PA): AACR; Cancer Res 2019;79(13 Suppl):Abstract nr 1672.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.