GenBank Data Research Articles

Dear Editor-in-Chief, In Annals of Parasitology 2021, 67(1), 55-65, a paper entitled "Genetic characterization and phylogenetic analysis of Fasciola species based on ITS2 gene sequence, with first molecular evidence of intermediate Fasciola from water buffaloes in Aswan, Egypt" was published with great interest [1]. After reading the article carefully and critically, we think some points should be noted. Fasciola species are meiotically functional diploid, can produce sperm and temporarily and store in the seminal vesicles. This type is named spermic fluke [2]. On the other hand, intermediate Fasciola with morphological characteristics intermediates between F. hepatica and F. gigantica with no sperm or aspermic and no sperm in seminal vesicles. However, this is also seen in older flukes [3-5]. It seems that morphological studies based on spermatogenesis ability were necessary for this study. Also, this parasite's anthelmintic resistance is due to aspects of biology, and population structure depends on genetic diversity [6]. We question whether there are any documents about and sequences of mitochondrial markers as COX (Cytochrome Oxidase) and NAD (Nicotinamide Adenine Dinucleotide) to analyze intraspecific phylogenetic relationship in addition to nuclear gene? In Table 3, the pairwise distances between three groups of Fasciola spp. from different livestock animals were low, ranging from 0.004 to 0.01 with an overall mean of 0.008. Genetic diversity is described as a tendency of genetic characteristics to vary and serves as a way for the population to adapt to changing hosts and environments [7]. The nature of the nuclear gene (ITS) is instability. It is better to use mitochondrial sequence data to compare diversity. Also, genetic discrimination grade from infra population to meta population is annotated by Fst value ranging; 0 to 1. Fst values between 0-0.05 indicated a low genetic differentiation population [8]. It seems that by calculating Fst and showing the gene migration based on mitochondrial sequences data of specimens, this study's species population will be obtained. Also, Tajima's D and Fu's F in all loci populations based on GenBank data may show the Fasciola haplotypes' population proximity. Here we recommend, that Omar et al. [1] studies that molecular phylogeny with mitochondrial DNA efectively used for appropriate diferentiation of haplotypes and spermatogenic ability by carmen allium staining helps them find the physiological aspects. Of course, more prominent populations are needed to find intermediate types. [1] Omar M.A, Elmajdoub L.O., Ali A.O., Ibrahim D.A., Sorour S.S., Al-Wabel M.A., Suresh M., Metwally A.M. 2021. Genetic characterization and phylogenetic analysis of Fasciola species based on ITS2 gene sequence, with first molecular evidence of intermediate Fasciola from water buffaloes in Aswan, Egypt. Annals of Parasitology 67: 55-65. doi:10.17420/ap6701.312 [2] Sanderson A. 1953. Maturation and probable gynogenesis in the liver fluke, Fasciola hepatica L. Nature 172: 110-112. doi:10.1038/172110a0 [3] Hayashi K., Ichikawa-Seki M., Mohanta U.K., Singh T.S., Shoriki T., Sugiyama H., Itagaki T. 2015. Molecular phylogenetic analysis of Fasciola flukes from eastern India. Parasitology International 64: 334-338. https://doi.org/10.1016/j.parint.2015.04.004 [4] Ichikawa-Seki M., Tokashiki M., Opara M.N., Iroh G., Hayashi K., Kumar U.M., Itagaki T. 2017. Molecular characterization and phylogenetic analysis of Fasciola gigantica from Nigeria. Parasitology International 66: 893-897. doi:10.1016/j.parint.2016.10.010 [5] Rouhani S., Raeghi S., Mirahmadi H., Fasihi Harandi M., Haghighi A., Spotin A. 2017. Identification of Fasciola spp. in the east of Iran, based on the spermatogenesis and nuclear ribosomal DNA (ITS1) and mitochondrial (ND1) genes. Archives of Clinical Infectious Diseases 12:e57283. doi:10.5812/archcid.57283 [6] Hodgkinson J., Cwiklinski K., Beesley N., Paterson S., Williams D., Devaney E. 2013. Identification of putative markers of triclabendazole resistance by a genome-wide analysis of genetically recombinant Fasciola hepatica. Parasitology 140: 1523. doi:10.1017/S0031182013000528 [7] Bozorgomid A., Rouhani S., Harandi M.F., Ichikawa- Seki M., Raeghi S. 2020. Genetic diversity and distribution of Fasciola hepatica haplotypes in Iran: molecular and phylogenetic studies. Veterinary Parasitology: Regional Studies and Reports 19: 00359. [8] Rouhani S., Raeghi S., Spotin A. 2017. Spermatogenic and phylo-molecular characterizations of isolated Fasciola spp. from cattle, North West Iran. Pakistan Journal of Biological Sciences 20: 204-209.

Read full abstract

DNA barcoding technology has become employed widely for biodiversity and molecular biology researchers to identify species and analyze their phylogeny. Recently, DNA metabarcoding and environmental DNA (eDNA) technology have developed by expanding the concept of DNA barcoding. These techniques analyze the diversity and quantity of organisms within an environment by detecting biogenic DNA in water and soil. It is particularly popular for monitoring fish species living in rivers and lakes (Takahara et al. 2012). BOLD Systems (Barcode of Life Database systems, Ratnasingham and Hebert 2007) is a database for DNA barcoding, archiving 8.5 million of barcodes (as of August 2020) along with the voucher specimen, from which the DNA barcode sequence is derived, including taxonomy, collected country, and museum vouchered as metadata (e.g. https://www.boldsystems.org/index.php/Public_RecordView?processid=TRIBS054-16). Also, many barcoding data are submitted to GenBank (Sayers et al. 2020), which is a database for DNA sequences managed by NCBI (National Center for Biotechnology Information, US). The number of the records of DNA barcodes, i.e. COI (cytochrome c oxidase I) gene for animal, has grown significantly (Porter and Hajibabaei 2018). BOLD imports DNA barcoding data from GenBank, and lots of DNA barcoding data in GenBank are also assigned BOLD IDs. However, we have to refer to both BOLD and GenBank data when performing DNA barcoding. I have previously investigated the registration of DNA barcoding data in GenBank, especially the association with BOLD, using insects and flowering plants as examples (Nakazato 2019). Here, I surveyed the number of species covered by BOLD and GenBank. I used fish data as an example because eDNA research is particularly focused on fish. I downloaded all GenBank files for vertebrates from NCBI FTP (File Transfer Protocol) sites (as of November 2019). Of the GenBank fish entries, 86,958 (7.3%) were assigned BOLD identifiers (IDs). The NCBI taxonomy database has registrations for 39,127 species of fish, and 20,987 scientific names at the species level (i.e., excluding names that included sp., cf. or aff.). GenBank entries with BOLD IDs covered 11,784 species (30.1%) and 8,665 species-level names (41.3%). I also obtained whole "specimens and sequences combined data" for fish from BOLD systems (as of November 2019). In the BOLD, there are 273,426 entries that are registered as fish. Of these entries, 211,589 BOLD entries were assigned GenBank IDs, i.e. with values in “genbank_accession” column, and 121,748 entries were imported from GenBank, i.e. with "Mined from GenBank, NCBI" description in "institution_storing" column. The BOLD data covered 18,952 fish species and 15,063 species-level names, but 35,500 entries were assigned no species-level names and 22,123 entries were not even filled with family-level names. At the species level, 8,067 names co-occurred in GenBank and BOLD, with 6,997 BOLD-specific names and 599 GenBank-specific names. GenBank has 425,732 fish entries with voucher IDs, of which 340,386 were not assigned a BOLD ID. Of these 340,386 entries, 43,872 entries are registrations for COI genes, which could be candidates for DNA barcodes. These candidates include 4,201 species that are not included in BOLD, thus adding these data will enable us to identify 19,863 fish to the species level. For researchers, it would be very useful if both BOLD and GenBank DNA barcoding data could be searched in one place. For this purpose, it is necessary to integrate data from the two databases. A lot of biodiversity data are recorded based on the Darwin Core standard while DNA sequencing data are sometimes integrated or cross-linked by RDF (Resource Description Framework). It may not be technically difficult to integrate these data, but the species data referenced differ from the EoL (The Encyclopedia of Life) for BOLD and the NCBI taxonomy for GenBank, and the differences in taxonomic systems make it difficult to match by scientific name description. GenBank has fields for the latitude and longitude of the specimens sampled, and Porter and Hajibabaei 2018 argue that this information should be enhanced. However, this information may be better described in the specimen and occurrence databases. The integration of barcoding data with the specimen and occurrence data will solve these problems. Most importantly, it will save the researcher from having to register the same information in multiple databases. In the field of biodiversity, only DNA barcode sequences may have been focused on and used as gene sequences. The museomics community regards museum-preserved specimens as rich resources for DNA studies because their biodiversity information can accompany the extraction and analysis of their DNA (Nakazato 2018). GenBank is useful for biodiversity studies due to its low rate of mislabelling (Leray et al. 2019). In the future, we will be working with a variety of DNA, including genomes from museum specimens as well as DNA barcoding. This will require more integrated use of biodiversity information and DNA sequence data. This integration is also of interest to molecular biologists and bioinformaticians.

Read full abstract

GenBank Data Research Articles

Articles published on GenBank Data

Novel Universal Primers to Identify the Expression of MAGE A1-A10 in the Core Biopsy of Lung Cancer

Comments on "Genetic characterization and phylogenetic analysis of Fasciola species based on ITS2 gene sequence, with first molecular evidence of intermediate Fasciola from water buffaloes in Aswan, Egypt".

Genetic analysis of the tree leaf disease microfungus Rhytisma acerinum

Helix straminea Briganti, 1825 in Italy (Gastropoda: Pulmonata): taxonomic history, morphology, biology, distribution and phylogeny

Indonesia Schistosoma japonicum: Origin, Genus Oncomelania and Elimination of the Parasite With Cluster Genes Innoculated Into Female Oncomelania hupensis lindoensis Via CRISPR/Cas9 System

Testing of antibiotic combinations in NDM-1-producing nosocomial carbapenem resistant Acinetobacter baumannii

Генотипирование черноморских трематод семейства Opecoelidae по митохондриальным маркерам

Comparative genomics of swine leukocyte antigen class I of Nigerian and Kenyan pigs

NCBI Genome Workbench: Desktop Software for Comparative Genomics, Visualization, and GenBank Data Submission.

Evaluation of Genetic Diversity and Identification of <i>Huperzia</i> Species Collected in Some Different Areas in Vietnam by Molecular Markers

Molecular characterization of Acanthamoeba isolated from contact lens paraphernalia: An evidence of potentially pathogenic T4 genotype in Malaysia

Evaluation of Genetic Diversity and Identification of <i>Huperzia</i> Species Collected in Some Different Areas in Vietnam by Molecular Markers

Genetic diversity of Indonesian protected eclectus parrot (Eclectus roratus) based on mitochondrial gene sequences

Characteristics of cytochrome C oxidase subunit I gene in giant clam from Wakatobi National Park Waters, Indonesia

Survey of Species Covered by DNA Barcoding Data in BOLD and GenBank for Integration of Data for Museomics

Life stages and phylogenetic position of the wool carder bee mite Sennertionyx manicati (Acari: Acaridae).

Taxonomic Study of the Family Ectobiidae (Order Blattodea) and Its Phylogenetic Relationships with Other Egyptian Blattodea

Comparative analysis of the constituents of two cultivars of Dioscorea dumetorum (Kunth) Pax. and their molecular barcoding

Neodactylariales, Neodactylariaceae (Dothideomycetes, Ascomycota): new order and family, with a new species from China.

Genetic Evidence for a Mixed Composition of the Genus Myoxocephalus (Cottoidei: Cottidae) Necessitates Generic Realignment.

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

GenBank Data Research Articles

Articles published on GenBank Data

Novel Universal Primers to Identify the Expression of MAGE A1-A10 in the Core Biopsy of Lung Cancer

Comments on "Genetic characterization and phylogenetic analysis of Fasciola species based on ITS2 gene sequence, with first molecular evidence of intermediate Fasciola from water buffaloes in Aswan, Egypt".

Genetic analysis of the tree leaf disease microfungus Rhytisma acerinum

Helix straminea Briganti, 1825 in Italy (Gastropoda: Pulmonata): taxonomic history, morphology, biology, distribution and phylogeny

Indonesia Schistosoma japonicum: Origin, Genus Oncomelania and Elimination of the Parasite With Cluster Genes Innoculated Into Female Oncomelania hupensis lindoensis Via CRISPR/Cas9 System

Testing of antibiotic combinations in NDM-1-producing nosocomial carbapenem resistant Acinetobacter baumannii

Генотипирование черноморских трематод семейства Opecoelidae по митохондриальным маркерам

Comparative genomics of swine leukocyte antigen class I of Nigerian and Kenyan pigs

NCBI Genome Workbench: Desktop Software for Comparative Genomics, Visualization, and GenBank Data Submission.

Evaluation of Genetic Diversity and Identification of &lt;i&gt;Huperzia&lt;/i&gt; Species Collected in Some Different Areas in Vietnam by Molecular Markers

Molecular characterization of Acanthamoeba isolated from contact lens paraphernalia: An evidence of potentially pathogenic T4 genotype in Malaysia

Evaluation of Genetic Diversity and Identification of &lt;i&gt;Huperzia&lt;/i&gt; Species Collected in Some Different Areas in Vietnam by Molecular Markers

Genetic diversity of Indonesian protected eclectus parrot (Eclectus roratus) based on mitochondrial gene sequences

Characteristics of cytochrome C oxidase subunit I gene in giant clam from Wakatobi National Park Waters, Indonesia

Survey of Species Covered by DNA Barcoding Data in BOLD and GenBank for Integration of Data for Museomics

Life stages and phylogenetic position of the wool carder bee mite Sennertionyx manicati (Acari: Acaridae).

Taxonomic Study of the Family Ectobiidae (Order Blattodea) and Its Phylogenetic Relationships with Other Egyptian Blattodea

Comparative analysis of the constituents of two cultivars of Dioscorea dumetorum (Kunth) Pax. and their molecular barcoding

Neodactylariales, Neodactylariaceae (Dothideomycetes, Ascomycota): new order and family, with a new species from China.

Genetic Evidence for a Mixed Composition of the Genus Myoxocephalus (Cottoidei: Cottidae) Necessitates Generic Realignment.

Evaluation of Genetic Diversity and Identification of <i>Huperzia</i> Species Collected in Some Different Areas in Vietnam by Molecular Markers

Evaluation of Genetic Diversity and Identification of <i>Huperzia</i> Species Collected in Some Different Areas in Vietnam by Molecular Markers