Abstract

Computational gene prioritization provides a basis for the utilization of high-throughput expression data; however, methods for prioritization considering relevance to both biological processes and selected keywords are lacking. In this study, a method was developed to rank differentially expressed genes (DEGs) by utilizing the Gene Ontology (GO) database for keyword searches and propagating the results through a protein–protein interaction network. A scoring system that effectively biases the scores of genes relevant to given keywords for avoidance and preference was established. This scoring method was combined with scoring based on expression characteristic groups (ECGs) with network propagation to obtain a final combined score (cScore). The performance of the new approach was evaluated using a rat middle cerebral artery occlusion (MCAO) dataset, revealing that the method more effectively filtered out DEGs compared with conventional methods based on both significance and fold change values, excluding 76% of genes in average, while retaining genes of interest. Further improvements, including addressing the inability of downward accumulation to balance terms with conflicting hits and the limited number of databases utilized for scoring, can further improve the performance of the method. Overall, the newly developed method can improve the interpretation of DEG data.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call