Abstract

Functional characterization of open reading frames in nonmodel organisms, such as the common opportunistic fungal pathogen Candida albicans, can be labor-intensive. To meet this challenge, we built a comprehensive and unbiased coexpression network for C. albicans, which we call CalCEN, from data collected from 853 RNA sequencing runs from 18 large-scale studies deposited in the NCBI Sequence Read Archive. Retrospectively, CalCEN is highly predictive of known gene function annotations and can be synergistically combined with sequence similarity and interaction networks in Saccharomyces cerevisiae through orthology for additional accuracy in gene function prediction. To prospectively demonstrate the utility of the coexpression network in C. albicans, we predicted the function of underannotated open reading frames (ORFs) and identified CCJ1 as a novel cell cycle regulator in C. albicans This study provides a tool for future systems biology analyses of gene function in C. albicans We provide a computational pipeline for building and analyzing the coexpression network and CalCEN itself at http://github.com/momeara/CalCENIMPORTANCECandida albicans is a common and deadly fungal pathogen of humans, yet the genome of this organism contains many genes of unknown function. By determining gene function, we can help identify essential genes, new virulence factors, or new regulators of drug resistance, and thereby give new targets for antifungal development. Here, we use information from large-scale RNA sequencing (RNAseq) studies and generate a C. albicans coexpression network (CalCEN) that is robust and able to predict gene function. We demonstrate the utility of this network in both retrospective and prospective testing and use CalCEN to predict a role for C4_06590W/CCJ1 in cell cycle. This tool will allow for a better characterization of underannotated genes in pathogenic yeasts.

Highlights

  • Functional characterization of open reading frames in nonmodel organisms, such as the common opportunistic fungal pathogen Candida albicans, can be labor-intensive

  • There are many transcriptomic analyses of C. albicans available through NCBI Sequence Read Archive (SRA), suggesting that it is feasible to create a coexpression network for C. albicans as a complementary approach for predicting gene function

  • To build the Candida albicans coexpression network (CalCEN), we identified RNA sequencing (RNAseq) studies from the NCBI Sequence Read Archives (SRA), which we filtered for studies with at least 20 C. albicans samples based on the guidelines from Ballouz and colleagues [2], yielding 12 unpaired and 6 paired end studies, listed in Table S1 in the supplemental material

Read more

Summary

Introduction

Functional characterization of open reading frames in nonmodel organisms, such as the common opportunistic fungal pathogen Candida albicans, can be labor-intensive To meet this challenge, we built a comprehensive and unbiased coexpression network for C. albicans, which we call CalCEN, from data collected from 853 RNA sequencing runs from 18 large-scale studies deposited in the NCBI Sequence Read Archive. We demonstrate the utility of this network in both retrospective and prospective testing and use CalCEN to predict a role for C4_06590W/ CCJ1 in cell cycle This tool will allow for a better characterization of underannotated genes in pathogenic yeasts. O’Meara and O’Meara high predictive accuracy for gene function annotations, it captures evolutionaryscale changes in cell identity [7] For nonmodel organisms such as Candida albicans, there are two important questions. There are many transcriptomic analyses of C. albicans available through NCBI Sequence Read Archive (SRA), suggesting that it is feasible to create a coexpression network for C. albicans as a complementary approach for predicting gene function

Objectives
Methods
Results
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.