Abstract
The present article reports the complete draft genome annotation of earthworm Eisenia fetida, obtained from the manuscript entitled “Timing and Scope of Genomic Expansion within Annelida: Evidence from Homeoboxes in the Genome of the Earthworm E. fetida” (Zwarycz et al., 2015) and provides the data on the repetitive elements, protein coding genes and noncoding RNAs present in the genome dataset of the species. The E. fetida protein coding genes were predicted from AUGUSTUS gene prediction and subsequently annotated based on their sequence similarity, Gene Ontology (GO) functional terms, InterPro domains, Clusters of Orthologous Groups (COGs) and KEGG (Kyoto Encyclopedia of Genes and Genomes) pathways information. The genome wide comparison of orthologous clusters and phylogenomic analysis of the core genes were performed to understand the events of genome evolution and genomic diversity between E. fetida and its related metazoans. In addition, the genome dataset was screened to identify the crucial stem cell markers, regeneration specific genes and immune-related genes and their functionally enriched GO terms were predicted from Fisher׳s enrichment analysis. The E. fetida genome annotation data containing the GFF (general feature format) annotation file, predicted coding gene sequences and translated protein sequences were deposited to the figshare repository under the DOI: https://doi.org/10.6084/m9.figshare.6142322.v1.
Highlights
Data on genome annotation and analysis of earthworm Eisenia fetidaSayan Paul a,1, Arun Arumugaperumal a,1, Rashmi Rathy a, Vasanthakumar Ponesakki a, Palavesam Arunachalam b, Sudhakar Sivasubramaniam a,n a Department of Biotechnology, Manonmaniam Sundaranar University, Tirunelveli, Tamilnadu 627012, India b Department of Animal Science, Manonmaniam Sundaranar University, Tirunelveli, Tamilnadu 627012, India article info
Annotation data of the E. fetida protein coding genes against these three databases was demonstrated by three-way Venn diagram, plotted by using the Venny 2.1 tool [7]
The repetitive contents including the tandem repeats and transposable elements were detected by using the repeat identification tools namely TEclass [2] and RepeatMasker [3]
Summary
Sayan Paul a,1, Arun Arumugaperumal a,1, Rashmi Rathy a, Vasanthakumar Ponesakki a, Palavesam Arunachalam b, Sudhakar Sivasubramaniam a,n a Department of Biotechnology, Manonmaniam Sundaranar University, Tirunelveli, Tamilnadu 627012, India b Department of Animal Science, Manonmaniam Sundaranar University, Tirunelveli, Tamilnadu 627012, India article info. Article history: Received 19 April 2018 Received in revised form 12 August 2018 Accepted 21 August 2018 Available online 29 August 2018.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.