Abstract
Repurposing FDA-approved drugs with the aid of gene signatures of disease can accelerate the development of new therapeutics. A major challenge to developing reliable drug predictions is heterogeneity. Different gene signatures of the same disease or drug treatment often show poor overlap across studies, as a consequence of both biological and technical variability, and this can affect the quality and reproducibility of computational drug predictions. Existing algorithms for signature-based drug repurposing use only individual signatures as input. But for many diseases, there are dozens of signatures in the public domain. Methods that exploit all available transcriptional knowledge on a disease should produce improved drug predictions. Here, we adapt an established meta-analysis framework to address the problem of drug repurposing using an ensemble of disease signatures. Our computational pipeline takes as input a collection of disease signatures, and outputs a list of drugs predicted to consistently reverse pathological gene changes. We apply our method to conduct the largest and most systematic repurposing study on lung cancer transcriptomes, using 21 signatures. We show that scaling up transcriptional knowledge significantly increases the reproducibility of top drug hits, from 44% to 78%. We extensively characterize drug hits in silico, demonstrating that they slow growth significantly in nine lung cancer cell lines from the NCI-60 collection, and identify CALM1 and PLA2G4A as promising drug targets for lung cancer. Our meta-analysis pipeline is general, and applicable to any disease context; it can be applied to improve the results of signature-based drug repurposing by leveraging the large number of disease signatures in the public domain.
Highlights
Lung cancer accounts for the largest number of cancer-related deaths, and the 5-year survival rate is only 16% [1]; there is an urgent need for new therapeutics to help treat it
We found that CMapBatch consistently identifies the same drugs as combatting lung cancer, even when it is trained on completely different sets of lung cancer signatures
For the remainder of this paper, we focus on characterizing and prioritizing the full set of significant drugs identified by CMapBatch using all 21 gene signatures of lung cancer
Summary
Lung cancer accounts for the largest number of cancer-related deaths, and the 5-year survival rate (across all stages) is only 16% [1]; there is an urgent need for new therapeutics to help treat it. Integrative computational methods that mine these data are fast, cheap, and can complement traditional methods of drug screening; complementary information in these distinct resources can be leveraged to develop comprehensive in silico screens for novel cancer therapeutics [2]. One such resource, the Connectivity Map (CMap), which is the focus of our analyses, catalogues the transcriptional responses to drug treatment in human cell lines for over a thousand small molecules [3]. In vitro experiments confirmed the inhibitory activity of many of their top hits, and in vivo testing showed promising results for imipramine and promethazine
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.