Orthology Prediction Research Articles

The analysis and comparison of genomes rely on diﬀerent tools for tasks such as annotation, orthology prediction, and phylogenetic inference. Most tools are specialized for a single task, and additional eﬀorts are necessary to integrate and visualize the results. To ﬁll this gap, we developed zDB, an application integrating a Nextﬂow analysis pipeline and a Python visualization platform built on the Django framework. The application is available on GitHub (https://github.com/metagenlab/zDB) and from the bioconda channel. Starting from annotated Genbank ﬁles, zDB identiﬁes orthologs and infers a phylogeny for each orthogroup. A species phylogeny is also constructed from shared single-copy orthologs. The results can be enriched with Pfam protein domain prediction, Cluster of Orthologs Genes and Kyoto Encyclopedia of Genes and Genomes annotations, and Swissprot homologs. The web application allows searching for speciﬁc genes or annotations, running Blast queries, and comparing genomic regions and whole genomes. The metabolic capacities of organisms can be compared at either the module or pathway levels. Finally, users can run queries to examine the conservation of speciﬁc genes or annotations across a chosen subset of genomes and display the results as a list of genes, Venn diagram, or heatmaps. Those features make zDB useful for both bioinformaticians and researchers more accustomed to laboratory research.IMPORTANCEGenome comparison and analysis rely on many independent tools, leaving to scientists the burden to integrate and visualize their results for interpretation. To alleviate this burden, we have built zDB, a comparative genomics tool that includes both an analysis pipeline and a visualization platform. The analysis pipeline automates gene annotation, orthology prediction, and phylogenetic inference, while the visualization platform allows scientists to easily explore the results in a web browser. Among other features, the interface allows users to visually compare whole genomes and targeted regions, assess the conservation of genes or metabolic pathways, perform Blast searches, or look for speciﬁc annotations. Altogether, this tool will be useful for a broad range of applications in comparative studies between two and hundred genomes. Furthermore, it is designed to allow sharing of data sets easily at a local or international scale, thereby supporting exploratory analyses for non-bioinformaticians on the genome of their favorite organisms.

Read full abstract

Penicillium echinulatum 2HH is an ascomycete well known for its production of cellulolytic enzymes. Understanding lignocellulolytic and sugar uptake systems is essential to obtain efficient fungi strains for the production of bioethanol. In this study we performed a genome-wide functional annotation of carbohydrate-active enzymes and sugar transporters involved in the lignocellulolytic system of P. echinulatum 2HH and S1M29 strains (wildtype and mutant, respectively) and eleven related fungi. Additionally, signal peptide and orthology prediction were carried out. We encountered a diverse assortment of cellulolytic enzymes in P. echinulatum, especially in terms of β-glucosidases and endoglucanases. Other enzymes required for the breakdown of cellulosic biomass were also found, including cellobiohydrolases, lytic cellulose monooxygenases and cellobiose dehydrogenases. The S1M29 mutant, which is known to produce an increased cellulase activity, and the 2HH wild type strain of P. echinulatum did not show significant differences between their enzymatic repertoire. Nevertheless, we unveiled an amino acid substitution for a predicted intracellular β-glucosidase of the mutant, which might contribute to hyperexpression of cellulases through a cellodextrin induction pathway. Most of the P. echinulatum enzymes presented orthologs in P. oxalicum 114–2, supporting the presence of highly similar cellulolytic mechanisms and a close phylogenetic relationship between these fungi. A phylogenetic analysis of intracellular β-glucosidases and sugar transporters allowed us to identify several proteins potentially involved in the accumulation of intracellular cellodextrins. These may prove valuable targets in the genetic engineering of P. echinulatum focused on industrial cellulases production. Our study marks an important step in characterizing and understanding the molecular mechanisms employed by P. echinulatum in the enzymatic hydrolysis of lignocellulosic biomass.

Read full abstract

Orthology Prediction Research Articles

Related Topics

Articles published on Orthology Prediction

NCBI RefSeq: reference sequence standards through 25years of curation and annotation.

Quest for Orthologs in the Era of Biodiversity Genomics.

ZDB: bacterial comparative genomics made easy.

Functional genomic regions associated with blast disease resistance in rice predicted syntenic orthologs and potential resistance gene candidates from diverse cereal genomes

KEGG orthology prediction of bacterial proteins using natural language processing

GTDrift: a resource for exploring the interplay between genetic drift, genomic and transcriptomic characteristics in eukaryotes.

First whole-genome sequence and assembly of the Ecuadorian brown-headed spider monkey (Ateles fusciceps fusciceps), a critically endangered species, using Oxford Nanopore Technologies.

Identification of Incomplete Annotations of Biosynthesis Pathways in Rhodophytes Using a Multi-Omics Approach.

VEuPathDB: the eukaryotic pathogen, vector and host bioinformatics resource center in 2023.

AlexandrusPS: A User-Friendly Pipeline for the Automated Detection of Orthologous Gene Clusters and Subsequent Positive Selection Analysis.

HiMAP2: Identifying phylogenetically informative genetic markers from diverse genomic resources.

InParanoiDB 9: Ortholog Groups for Protein Domains and Full-Length Proteins

Coexpression reveals conserved gene programs that co-vary with cell type across kingdoms.

InParanoid-DIAMOND: faster orthology analysis with the InParanoid algorithm.

Drosophila functional screening of de novo variants in autism uncovers damaging variants and facilitates discovery of rare neurodevelopmental diseases.

Analysis of carbohydrate-active enzymes and sugar transporters in Penicillium echinulatum: A genome-wide comparative study of the fungal lignocellulolytic system

Orthology Prediction and Phylogenetic Analysis Methods in Plants.

Paralog Explorer: A resource for mining information about paralogs in common research organisms

Prediction and enrichment analyses of the Homo sapiens-Drosophila melanogaster COPD-related orthologs: potential for modeling of human COPD genomic responses with the fruit fly.

Echinobase: leveraging an extant model organism database to build a knowledgebase supporting research on the genomics and biology of echinoderms.

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Orthology Prediction Research Articles

Related Topics

Articles published on Orthology Prediction

NCBI RefSeq: reference sequence standards through 25years of curation and annotation.

Quest for Orthologs in the Era of Biodiversity Genomics.

ZDB: bacterial comparative genomics made easy.

Functional genomic regions associated with blast disease resistance in rice predicted syntenic orthologs and potential resistance gene candidates from diverse cereal genomes

KEGG orthology prediction of bacterial proteins using natural language processing

GTDrift: a resource for exploring the interplay between genetic drift, genomic and transcriptomic characteristics in eukaryotes.

First whole-genome sequence and assembly of the Ecuadorian brown-headed spider monkey (Ateles fusciceps fusciceps), a critically endangered species, using Oxford Nanopore Technologies.

Identification of Incomplete Annotations of Biosynthesis Pathways in Rhodophytes Using a Multi-Omics Approach.

VEuPathDB: the eukaryotic pathogen, vector and host bioinformatics resource center in 2023.

AlexandrusPS: A User-Friendly Pipeline for the Automated Detection of Orthologous Gene Clusters and Subsequent Positive Selection Analysis.

HiMAP2: Identifying phylogenetically informative genetic markers from diverse genomic resources.

InParanoiDB 9: Ortholog Groups for Protein Domains and Full-Length Proteins

Coexpression reveals conserved gene programs that co-vary with cell type across kingdoms.

InParanoid-DIAMOND: faster orthology analysis with the InParanoid algorithm.

Drosophila functional screening of de novo variants in autism uncovers damaging variants and facilitates discovery of rare neurodevelopmental diseases.

Analysis of carbohydrate-active enzymes and sugar transporters in Penicillium echinulatum: A genome-wide comparative study of the fungal lignocellulolytic system

Orthology Prediction and Phylogenetic Analysis Methods in Plants.

Paralog Explorer: A resource for mining information about paralogs in common research organisms

Prediction and enrichment analyses of the Homo sapiens-Drosophila melanogaster COPD-related orthologs: potential for modeling of human COPD genomic responses with the fruit fly.

Echinobase: leveraging an extant model organism database to build a knowledgebase supporting research on the genomics and biology of echinoderms.