Abstract
Computational gene prioritization provides a basis for the utilization of high-throughput expression data; however, methods for prioritization considering relevance to both biological processes and selected keywords are lacking. In this study, a method was developed to rank differentially expressed genes (DEGs) by utilizing the Gene Ontology (GO) database for keyword searches and propagating the results through a protein–protein interaction network. A scoring system that effectively biases the scores of genes relevant to given keywords for avoidance and preference was established. This scoring method was combined with scoring based on expression characteristic groups (ECGs) with network propagation to obtain a final combined score (cScore). The performance of the new approach was evaluated using a rat middle cerebral artery occlusion (MCAO) dataset, revealing that the method more effectively filtered out DEGs compared with conventional methods based on both significance and fold change values, excluding 76% of genes in average, while retaining genes of interest. Further improvements, including addressing the inability of downward accumulation to balance terms with conflicting hits and the limited number of databases utilized for scoring, can further improve the performance of the method. Overall, the newly developed method can improve the interpretation of DEG data.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.