Abstract

The biological function and evolutionary history of protein-coding genes are not only written in their nucleotide sequences. The comparison of genomic contexts throughout different lineages may highlight genomic mechanisms in the generation of new protein families, while the conservation of gene clusters may unravel, for instance, metabolic pathways. Various tools and databases exist that allow for the analysis and comparison of genomic contexts, but each has its own limitations. Online databases allow for quick comparisons, but only for those genomes for which data were pre-calculated. More advanced tools may allow for the comparison of any genome, but are often limited to a given phylogenetic kingdom or provide only a snapshot of the genomic contexts without further information about the genes involved. Here, we introduce GCsnap, a flexible Python-based tool that allows for the interactive comparison of the genomic contexts of protein-coding genes from any genome at any taxonomic level, integrating them with functional and structural information. By connecting the output to different protein databases, users can navigate through the different genomic contexts from a simple interactive platform, facilitating the further analysis of the contexts found. GCsnap is not limited to a single input format, can perform batch jobs and accepts protein classification maps. Results are stored in detailed, human and machine-readable files, and customizable, publication-ready figures. GCsnap is freely available from https://github.com/JoanaMPereira/GCsnap and can be set up easily on any computer.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call