Implementation Of Database Research Articles

BackgroundMicrobiome studies commonly use 16S rRNA gene amplicon sequencing to characterize microbial communities. Errors introduced at multiple steps in this process can affect the interpretation of the data. Here we evaluate the accuracy of operational taxonomic unit (OTU) generation, taxonomic classification, alpha- and beta-diversity measures for different settings in QIIME, MOTHUR and a pplacer-based classification pipeline, using a novel software package: DECARD.ResultsIn-silico we generated 100 synthetic bacterial communities approximating human stool microbiomes to be used as a gold-standard for evaluating the colligative performance of microbiome analysis software. Our synthetic data closely matched the composition and complexity of actual healthy human stool microbiomes. Genus-level taxonomic classification was correctly done for only 50.4–74.8% of the source organisms. Miscall rates varied from 11.9 to 23.5%. Species-level classification was less successful, (6.9–18.9% correct); miscall rates were comparable to those of genus-level targets (12.5–26.2%). The degree of miscall varied by clade of organism, pipeline and specific settings used. OTU generation accuracy varied by strategy (closed, de novo or subsampling), reference database, algorithm and software implementation. Shannon diversity estimation accuracy correlated generally with OTU-generation accuracy. Beta-diversity estimates with Double Principle Coordinate Analysis (DPCoA) were more robust against errors introduced in processing than Weighted UniFrac. The settings suggested in the tutorials were among the worst performing in all outcomes tested.ConclusionsEven when using the same classification pipeline, the specific OTU-generation strategy, reference database and downstream analysis methods selection can have a dramatic effect on the accuracy of taxonomic classification, and alpha- and beta-diversity estimation. Even minor changes in settings adversely affected the accuracy of the results, bringing them far from the best-observed result. Thus, specific details of how a pipeline is used (including OTU generation strategy, reference sets, clustering algorithm and specific software implementation) should be specified in the methods section of all microbiome studies. Researchers should evaluate their chosen pipeline and settings to confirm it can adequately answer the research question rather than assuming the tutorial or standard-operating-procedure settings will be adequate or optimal.

Read full abstract

A major part of the interface to a database is made up of the queries that can be addressed to this database and answered (processed) in an efficient way, contributing to the quality of the developed software. Efficiently processed spatial queries constitute a fundamental part of the interface to spatial databases due to the wide area of applications that may address such queries, like geographical information systems (GIS), location-based services, computer visualization, automated mapping, facilities management, etc. Another important capability of the interface to a spatial database is to offer the creation of efficient index structures to speed up spatial query processing. The xBR+-tree is a balanced disk-resident quadtree-based index structure for point data, which is very efficient for processing such queries. Bulk-loading refers to the process of creating an index from scratch, when the dataset to be indexed is available beforehand, instead of creating the index gradually (and more slowly), when the dataset elements are inserted one-by-one. In this paper, we present an algorithm for bulk-loading xBR+-trees for big datasets residing on disk, using a limited amount of main memory. The resulting tree is not only built fast, but exhibits high performance in processing a broad range of spatial queries, where one or two datasets are involved. To justify these characteristics, using real and artificial datasets of various cardinalities, first, we present an experimental comparison of this algorithm vs. a previous version of the same algorithm and STR, a popular algorithm of bulk-loading R-trees, regarding tree creation time and the characteristics of the trees created, and second, we experimentally compare the query efficiency of bulk-loaded xBR+-trees vs. bulk-loaded R-trees, regarding I/O and execution time. Thus, this paper contributes to the implementation of spatial database interfaces and the efficient storage organization for big spatial data management.

Read full abstract

Implementation Of Database Research Articles

Related Topics

Articles published on Implementation Of Database

Benchmarking top-[formula omitted] keyword and top-[formula omitted] document processing with T[formula omitted]K[formula omitted] and T[formula omitted]K[formula omitted]D[formula omitted

Implementation of Database Massively Parallel Processing System to Build Scalability on Process Data Warehouse

Creation and Implementation of an Electronic Database and Marking System for Identifying and Obtaining Information on Plant Species in Botanical Gardens, Dendrological Parks and Park Monuments of the Landscape Art

USING GEOMATICS FOR ASSESSING VULNERABILITY TO CUTANEOUS LEISHMANISAIS. APPLICATION TO THE WILAYA OF BATNA (ALGERIA)

La ciudad cerrada y su diversificación como reto del Área Metropolitana de Guadalajara, México

Database design and implementation of CSR functions: a case study of Saudi Arabian banking environment

The Effect of Prescription Drug Monitoring Programs on Opioid Prescriptions and Heroin Crime Rates

Querying clinical data in HL7 RIM based relational model with morph-RDB

Geographic information systems for mapping the National Exam Result of Junior High School in 2014 at West Java Province

Recording the LHCb data and software dependencies

Assessment of Optical Coherence Tomography Color Probability Codes in Myopic Glaucoma Eyes After Applying a Myopic Normative Database

Scalable Directory Service for IoT Applications

Evaluation of MALDI-TOF mass spectrometry and MALDI BioTyper in comparison to 16S rDNA sequencing for the identification of bacteria isolated from Arctic sea water.

Database Implementation and Testing of Dynamic Credit Card Fraud Detection System

The Research on On-Line Car Service Platform for Mobile Internet

Geospatial Analysis of Earthquake Damage Probability of Water Pipelines Due to Multi-Hazard Failure

Evaluating the accuracy of amplicon-based microbiome computational pipelines on simulated human gut microbial communities

An efficient algorithm for bulk-loading xBR[formula omitted]-trees

Implementation of a Web-Based Electronic Database for Healthcare-Associated Infection Case Reviews

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Implementation Of Database Research Articles

Related Topics

Articles published on Implementation Of Database

Benchmarking top-[formula omitted] keyword and top-[formula omitted] document processing with T[formula omitted]K[formula omitted] and T[formula omitted]K[formula omitted]D[formula omitted

Implementation of Database Massively Parallel Processing System to Build Scalability on Process Data Warehouse

Creation and Implementation of an Electronic Database and Marking System for Identifying and Obtaining Information on Plant Species in Botanical Gardens, Dendrological Parks and Park Monuments of the Landscape Art

USING GEOMATICS FOR ASSESSING VULNERABILITY TO CUTANEOUS LEISHMANISAIS. APPLICATION TO THE WILAYA OF BATNA (ALGERIA)

La ciudad cerrada y su diversificación como reto del Área Metropolitana de Guadalajara, México

Database design and implementation of CSR functions: a case study of Saudi Arabian banking environment

The Effect of Prescription Drug Monitoring Programs on Opioid Prescriptions and Heroin Crime Rates

Querying clinical data in HL7 RIM based relational model with morph-RDB

Geographic information systems for mapping the National Exam Result of Junior High School in 2014 at West Java Province

Recording the LHCb data and software dependencies

Assessment of Optical Coherence Tomography Color Probability Codes in Myopic Glaucoma Eyes After Applying a Myopic Normative Database

Scalable Directory Service for IoT Applications

Evaluation of MALDI-TOF mass spectrometry and MALDI BioTyper in comparison to 16S rDNA sequencing for the identification of bacteria isolated from Arctic sea water.

Database Implementation and Testing of Dynamic Credit Card Fraud Detection System

The Research on On-Line Car Service Platform for Mobile Internet

Geospatial Analysis of Earthquake Damage Probability of Water Pipelines Due to Multi-Hazard Failure

Evaluating the accuracy of amplicon-based microbiome computational pipelines on simulated human gut microbial communities

An efficient algorithm for bulk-loading xBR[formula omitted]-trees

Implementation of a Web-Based Electronic Database for Healthcare-Associated Infection Case Reviews