Abstract

Large double-stranded DNA viruses of the phylum Nucleocytoviricota (giant viruses; GVs) are the largest known viruses and infect diverse eukaryotic hosts, particularly protists and algae. These viruses influence biogeochemical cycles and host genome evolution, but their complex genomes make them challenging to identify and classify. We present GVClass, a tool that identifies giant viruses in sequence data, assigns taxonomy, and estimates genome completeness and contamination. GVClass combines optimized gene calling with a conservative consensus of single-protein phylogenies, built from highly conserved orthologous groups, to produce robust taxonomic classifications. In benchmarking, accuracy exceeded 90% at the genus level and 99% at higher taxonomic ranks. GVClass addresses these classification challenges and is available both as a standalone tool and as part of the Integrated Microbial Genomes/Virus database (IMG/VR).
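
To illustrate the idea of a conservative consensus across single-protein phylogenies, the following is a minimal sketch and not the GVClass implementation. It assumes each marker tree has already been reduced to one lineage (for example, that of the nearest reference neighbor) and keeps a rank only when the marker trees agree on it. The function name `consensus_lineage`, the `min_agreement` parameter, the rank list, and the placeholder taxon labels are all hypothetical.

```python
from collections import Counter

# Ranks ordered from coarse to fine (illustrative; not taken from GVClass).
RANKS = ["class", "order", "family", "genus"]


def consensus_lineage(per_marker_lineages, min_agreement=1.0):
    """Return {rank: taxon} for the ranks on which marker trees agree.

    per_marker_lineages: one dict per single-protein tree, mapping rank -> taxon.
    min_agreement: fraction of votes the top taxon needs (1.0 = unanimous).
    """
    consensus = {}
    for rank in RANKS:
        votes = Counter(
            lineage[rank] for lineage in per_marker_lineages if lineage.get(rank)
        )
        if not votes:
            break
        taxon, count = votes.most_common(1)[0]
        if count / sum(votes.values()) < min_agreement:
            # Conservative choice: stop here and leave finer ranks unassigned.
            break
        consensus[rank] = taxon
    return consensus


# Three hypothetical marker trees that agree down to family but not genus.
example = [
    {"class": "class_A", "order": "order_A", "family": "family_A", "genus": "genus_A"},
    {"class": "class_A", "order": "order_A", "family": "family_A", "genus": "genus_A"},
    {"class": "class_A", "order": "order_A", "family": "family_A", "genus": "genus_B"},
]

strict = consensus_lineage(example)
relaxed = consensus_lineage(example, min_agreement=0.6)
```

With the default unanimous threshold, `strict` stops at the family level because the markers disagree on genus. Lowering `min_agreement` to 0.6 lets the two-of-three majority assign a genus, trading some conservatism for resolution.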
