Abstract

Systematic, genome-wide RNA interference (RNAi) analysis is a powerful approach to identify gene functions that support or modulate selected biological processes. An emerging challenge shared with some other genome-wide approaches is that independent RNAi studies often show limited agreement in their lists of implicated genes. To better understand this, we analyzed four genome-wide RNAi studies that identified host genes involved in influenza virus replication. These studies collectively identified and validated the roles of 614 cell genes, but pair-wise overlap among the four gene lists was only 3% to 15% (average 6.7%). However, a number of functional categories were overrepresented in multiple studies. The pair-wise overlap of these enriched-category lists was high, ∼19%, implying more agreement among studies than apparent at the gene level. Probing this further, we found that the gene lists implicated by independent studies were highly connected in interacting networks by independent functional measures such as protein-protein interactions, at rates significantly higher than predicted by chance. We also developed a general, model-based approach to gauge the effects of false-positive and false-negative factors and to estimate, from a limited number of studies, the total number of genes involved in a process. For influenza virus replication, this novel statistical approach estimates the total number of cell genes involved to be ∼2,800. This and multiple other aspects of our experimental and computational results imply that, when following good quality control practices, the low overlap between studies is primarily due to false negatives rather than false-positive gene identifications. These results and methods have implications for and applications to multiple forms of genome-wide analysis.

Highlights

  • RNA interference (RNAi) is a gene-specific silencing process directed by short double stranded RNAs or small interfering RNAs that can ‘‘knock down’’ expression of a selected gene by inducing messenger RNA degradation in a sequence-specific manner [1]

  • Genome-wide RNA interference assays of gene functions offer the potential for systematic, global analysis of biological processes

  • We present new methods to examine the relations between individual genome-wide RNAi studies, using studies of host genes in influenza virus replication as a test case

Read more

Summary

Introduction

RNA interference (RNAi) is a gene-specific silencing process directed by short double stranded RNAs or small interfering RNAs (siRNAs) that can ‘‘knock down’’ expression of a selected gene by inducing messenger RNA (mRNA) degradation in a sequence-specific manner [1]. RNAi has been widely used as a molecular tool to selectively inhibit the expression of a chosen gene. By expanding this technique to use large-scale RNAi libraries, high-throughput RNAi analysis has become a powerful approach to screen essentially all genes of an organism, to identify gene functions that support or modulate any biological process of interest. One important application of genome-wide RNAi screening has been to identify host genes that are required for the replication of a particular virus [2,3,4]. There is only 3–6% overlap of gene lists identified from three genome wide RNAi studies for host factors of HIV [6,7,12,13]

Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call