Query Sequence Research Articles

Environmental DNA (eDNA) metabarcoding has been commonly used in recent years (Jeunen et al. 2019) for the identification of the species composition of environmental samples. By making use of genetic markers anchored in conserved gene regions, universally present acrooss the species of large taxonomy groups, eDNA metabarcoding exploits both extra- and intra-cellular DNA fragments for biodiversity assessment. However, there is not a truly “universal” marker gene that is capable of amplifying all species across different taxa (Kress et al. 2015). The mitochondrial cytochrome C oxidase subunit I gene (COI) has many of the desirable properties of a “universal" marker and has been widely used for assessing species identity in Eukaryotes, especially metazoans (Andjar et al. 2018). However, a great number of COI Operational Taxonomic Units (OTUs) or/and Amplicon Sequence Variants (ASVs) retrieved from such studies do not match reference sequences and are often referred to as “dark matter” (Deagle et al. 2014). The aim of this study was to discover the origins and identities of these COI dark matter sequences. We built a reference phylogenetic tree that included as many COI-sequence-related information across the tree of life as possible. An overview of the steps followed is presented in Fig. 1a. Briefly, the Midori reference 2 database was used to retrieve eukaryotes sequences (183,330 species). In addition, the API of the BOLD database was used as source for the corresponding Bacteria (559 genera) and Archaea (41 genera) sequences. Consensus sequences at the family level were constructed from each of these three initial COI datasets. The COI-oriented reference phylogenetic tree of life was then built by using 1,240 consensus sequences with more than 80% of those coming from eukaryotic taxa. Phylogeny-based taxonomic assignment was then used to place query sequences. The a) total number of sequences, b) sequences assigned to Eukaryotes and c) unassigned subsets of OTUs, from marine and freshwater samples, retrieved during in-house metabarcoding experiments, were placed in the reference tree (Fig. 1b). It is clear that a large proportion of sequences targeting the COI region of Eukaryotes actually represents bacterial branches in the phylogenetic tree (Fig. 1b). We conclude that COI metabarcoding studies targeting Eukaryotes may come with a great bias derived from amplification and sequencing of bacterial taxa, depending on the primer pair used. However, for the time being, publicly available bacterial COI sequences are far too few to represent the bacterial variability; thus, a reliable taxonomic identification of them is not possible. We suggest that bacterial COI sequences should be included in the reference databases used for the taxonomy assignment of OTUs/ASVs in COI-based eukaryote metabarcoding studies to allow for bacterial sequences that were amplified to be excluded enabling researchers to exclude non-target sequences. Further, the approach presented here allows researchers to better understand the unknown unknowns and shed light on the dark matter of their metabarcoding sequence data.

BackgroundCocoonase is a proteolytic enzyme that helps in dissolving the silk cocoon shell and exit of silk moth. Chemicals like anhydrous Na2CO3, Marseille soap, soda, ethylene diamine and tartaric acid-based degumming of silk cocoon shell have been in practice. During this process, solubility of sericin protein increased resulting in the release of sericin from the fibroin protein of the silk. However, this process diminishes natural color and softness of the silk. Cocoonase enzyme digests the sericin protein of silk at the anterior portion of the cocoon without disturbing the silk fibroin. However, no thorough characterization of cocoonase and sericin protein as well as imaging analysis of chemical- and enzyme-treated silk sheets has been carried out so far. Therefore, present study aimed for detailed characterization of cocoonase and sericin proteins, phylogenetic analysis, secondary and tertiary structure prediction, and computational validation as well as their interaction with other proteins. Further, identification of tasar silkworm (Antheraea mylitta) pupa stage for cocoonase collection, its purification and effect on silk sheet degumming, scanning electron microscope (SEM)-based comparison of chemical- and enzyme-treated cocoon sheets, and its optical coherence tomography (OCT)-based imaging analysis have been investigated. Various computational tools like Molecular Evolutionary Genetics Analysis (MEGA) X and Figtree, Iterative Threading Assembly Refinement (I-TASSER), self-optimized predicted method with alignment (SOPMA), PROCHECK, University of California, San Francisco (UCSF) Chimera, and Search Tool for the Retrieval of Interacting Genes/Proteins (STRING) were used for characterization of cocoonase and sericin proteins. Sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE), protein purification using Sephadex G 25-column, degumming of cocoon sheet using cocoonase enzyme and chemical Na2CO3, and SEM and OCT analysis of degummed cocoon sheet were performed. ResultsPredicted normalized B-factors of cocoonase and sericin with respect to α and β regions showed that these regions are structurally more stable in cocoonase while less stable in sericin. Conserved domain analysis revealed that B. mori cocoonase contains a trypsin-like serine protease with active site range 45 to 180 query sequences while substrate binding site from 175 to 200 query sequences. SDS-PAGE analysis of cocoonase indicated its molecular weight of 25–26 kDa. Na2CO3 treatment showed more degumming effect (i.e., cocoon sheet weight loss) as compared to degumming with cocoonase. However, cocoonase-treated silk cocoon sheet holds the natural color of tasar silk, smoothness, and luster compared with the cocoon sheet treated with Na2CO3. SEM-based analysis showed the noticeable variation on the surface of silk fiber treated with cocoonase and Na2CO3. OCT analysis also exemplified the variations in the cross-sectional view of the cocoonase and Na2CO3-treated silk sheets. ConclusionsPresent study enlightens on the detailed characteristics of cocoonase and sericin proteins, comparative degumming activity, and image analysis of cocoonase enzyme and Na2CO3 chemical-treated silk sheets. Obtained findings illustrated about use of cocoonase enzyme in the degumming of silk cocoon at larger scale that will be a boon to the silk industry.

Query Sequence Research Articles

Related Topics

Articles published on Query Sequence

OMAmer: tree-driven and alignment-free protein assignment to subfamilies outperforms closest sequence approaches.

Optimal Online Algorithms for File-Bundle Caching and Generalization to Distributed Caching

EORNA, a barley gene and transcript abundance database

Demonstrating the utility of flexible sequence queries against indexed short reads with FlexTyper.

Evaluation of Machine Learning Algorithms in Predicting the Next SQL Query from the Future

MaCPepDB: A Database to Quickly Access All Tryptic Peptides of the UniProtKB.

Bacteria are everywhere, even in your COI data: Τhe art of getting to know the unknown unknowns and shine light on the dark matter!

Integrative analysis of liver-specific non-coding regulatory SNPs associated with the risk of coronary artery disease

Study on cocoonase, sericin, and degumming of silk cocoon: computational and experimental

PPIT: an R package for inferring microbial taxonomy from nifH sequences.

Detecting high-scoring local alignments in pangenome graphs.

English

Re-purposing software for functional characterization of the microbiome

Use of Bioinformatics Technologies and Databases to Teach Analysis of Genetic Sequences to Undergraduate Students in Physics, Biotechnology, and Biology: The Specific Case of the SARS-CoV-2 Spike Protein

ExBWS: extended bioinformatics web services for sequence analyses

Dual Encoding for Video Retrieval by Text.

In silico Structural Modelling of Ribokinase from Salmonella Typhi

Demonstrating the utility of flexible sequence queries against indexed short reads with FlexTyper

FireProtASR: A Web Server for Fully Automated Ancestral Sequence Reconstruction.

Molecular Characterization and in-vitro Regeneration of Wild Ganoderma lucidum from Abuja, Nigeria

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Query Sequence Research Articles

Related Topics

Articles published on Query Sequence

OMAmer: tree-driven and alignment-free protein assignment to subfamilies outperforms closest sequence approaches.

Optimal Online Algorithms for File-Bundle Caching and Generalization to Distributed Caching

EORNA, a barley gene and transcript abundance database

Demonstrating the utility of flexible sequence queries against indexed short reads with FlexTyper.

Evaluation of Machine Learning Algorithms in Predicting the Next SQL Query from the Future

MaCPepDB: A Database to Quickly Access All Tryptic Peptides of the UniProtKB.

Bacteria are everywhere, even in your COI data: Τhe art of getting to know the unknown unknowns and shine light on the dark matter!

Integrative analysis of liver-specific non-coding regulatory SNPs associated with the risk of coronary artery disease

Study on cocoonase, sericin, and degumming of silk cocoon: computational and experimental

PPIT: an R package for inferring microbial taxonomy from nifH sequences.

Detecting high-scoring local alignments in pangenome graphs.

English

Re-purposing software for functional characterization of the microbiome

Use of Bioinformatics Technologies and Databases to Teach Analysis of Genetic Sequences to Undergraduate Students in Physics, Biotechnology, and Biology: The Specific Case of the SARS-CoV-2 Spike Protein

ExBWS: extended bioinformatics web services for sequence analyses

Dual Encoding for Video Retrieval by Text.

In silico Structural Modelling of Ribokinase from Salmonella Typhi

Demonstrating the utility of flexible sequence queries against indexed short reads with FlexTyper

FireProtASR: A Web Server for Fully Automated Ancestral Sequence Reconstruction.

Molecular Characterization and in-vitro Regeneration of Wild Ganoderma lucidum from Abuja, Nigeria