Abstract

Summary: An increasing amount of evidence suggests that canonical pathways and standard molecular signature databases are incomplete and inadequate for modeling the complex behavior of cell physiology and pathology. Yet many Gene Set Analysis (GSA) studies still rely on these databases to identify disease biomarkers and molecular mechanisms within a specific cell context. While tremendous effort has been invested in developing GSA tools, only a limited number of studies focus on de novo assembly of context-specific gene sets, as opposed to simply applying GSA with a standard gene set database. In this paper, we propose a pipeline to derive the entire collection of Cell context-Specific Gene Sets (CSGS) from a molecular interaction network, based on the hypothesis that molecular events linked to a specific phenotypic response should cluster within a subnet of interacting genes. Gene sets are assigned using both the physical properties of the network and the functional annotations of neighboring nodes. The identified gene sets could provide a precise starting point, such that the downstream GSA covers all functional pathways in the particular cell context while avoiding the noise and excessive multiple-hypothesis testing caused by including irrelevant gene sets from the standard database. We applied the pipeline in the context of cardiomyopathy and demonstrated its superiority over the MSigDB gene set collection in terms of: (i) reproducibility and robustness in GSA, (ii) effectiveness in uncovering molecular mechanisms associated with cardiomyopathy, and (iii) performance in distinguishing diseased vs. normal states.
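The clustering hypothesis stated above can be illustrated with a toy sketch. This is not the pipeline proposed in the paper, whose details are not given in this abstract. It only shows the general idea of grouping context-relevant genes into connected subnets of an interaction network. The function name `seed_subnets`, the placeholder gene labels `G1` to `G8`, the toy edge list, and the minimum size of two genes per candidate set are all assumptions made for illustration.

```python
from collections import deque


def seed_subnets(edges, seeds):
    """Group seed genes into connected subnets of an interaction network.

    Hypothetical helper: only edges whose two endpoints are both seed genes
    are followed, so each returned set is a cluster of directly interacting
    seed genes.
    """
    seeds = set(seeds)
    adjacency = {gene: set() for gene in seeds}
    for a, b in edges:
        if a in seeds and b in seeds:
            adjacency[a].add(b)
            adjacency[b].add(a)

    visited = set()
    subnets = []
    # Iterate in sorted order so the result is deterministic.
    for start in sorted(seeds):
        if start in visited:
            continue
        component = set()
        queue = deque([start])
        visited.add(start)
        # Breadth-first search over the seed-only subgraph.
        while queue:
            gene = queue.popleft()
            component.add(gene)
            for neighbor in adjacency[gene]:
                if neighbor not in visited:
                    visited.add(neighbor)
                    queue.append(neighbor)
        subnets.append(component)
    return subnets


# Toy interaction network with placeholder gene labels (not real data).
toy_edges = [("G1", "G2"), ("G2", "G3"), ("G3", "G4"), ("G5", "G6"), ("G6", "G7")]
# Genes assumed to be linked to the phenotype of interest.
toy_seeds = ["G1", "G2", "G3", "G5", "G6", "G8"]

all_subnets = seed_subnets(toy_edges, toy_seeds)
# Assumed filter: keep only subnets with at least two interacting genes.
candidate_sets = [subnet for subnet in all_subnets if len(subnet) >= 2]
print(candidate_sets)
```

In this toy case the isolated seed `G8` forms its own single-gene subnet and is dropped by the size filter, leaving two candidate gene sets. A realistic pipeline would also need the functional annotations of neighboring nodes that the abstract mentions, which this sketch omits.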
