Original Source Material Research Articles

Metadata management for sequence data is essential for the accurate description of Earth’s biodiversity. Among metadata attributes, those that reference the biological sources of sequences and samples and allow linking to the specimen or sample of origin are fundamental for connecting molecular biology, taxonomy, systematic biology and biodiversity research, and they increase the discoverability and usability of data for researchers worldwide. Sequence data is publicly archived at the International Nucleotide Sequence Database Collaboration (INSDC), which comprises the National Center for Biotechnology Information (NCBI), the DNA Data Bank of Japan (DDBJ) and the European Nucleotide Archive (ENA). Sequences stored at INSDC carry a considerable range of associated metadata, including attributes related to their biological source, such as references to natural history collections or culture collections. However, these source attributes are not always submitted or may be incomplete, which limits the association of sequence records with the original source material and hampers further data connections (e.g., biological data associated with the voucher or species distribution data). We have therefore developed the ENA Source Attribute Helper API, a tool that aims to assist users in submitting accurate attributes referring to the biological source of samples and sequence data. This tool was developed within the scope of BiCIKL (Biodiversity Community Integrated Knowledge Library) (Penev et al. 2022), a Horizon 2020 project that aims to build a wide, biodiversity-related community for connecting data along the different axes of biodiversity research. The first version of the tool was designed to support correct annotation of the attributes that identify the source material from which the sample or sequence was obtained, namely /specimen_voucher, /culture_collection, and /biomaterial (INSDC 2021).
These attributes follow a Darwin Core Triplet format (Wieczorek et al. 2012), composed of an institution code, a collection code and the specimen, culture, or material identifier, respectively. Since biological source attributes may be submitted to the INSDC either when data is initially uploaded or in later updates, using a variety of tools, we developed the API as an open source tool that is publicly accessible and may be used as a free-standing service. The API follows a Representational State Transfer (REST) architecture and is designed to use the data available in NCBI BioCollections (Sharma et al. 2018). NCBI BioCollections is a curated database of metadata for natural history collections associated with records in INSDC, and it includes the institution and collection codes. The API's main functions are querying the metadata for institutions and collections based on user input (the API presents both exact and similar matches), validating the institution and collection codes in attribute strings provided by the user, and constructing the attribute string from user-provided information. The API does not search or validate voucher specimen codes. The API is designed so that it can be extended easily for future enhancements, and it is initially expected to promote and support the submission and subsequent curation of better structured and more richly described source data. We expect this tool to contribute to better connected biodiversity data and hence to strengthen the value of natural history collections, taxonomic expertise, and biodiversity knowledge.
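
To make the triplet format concrete, the following is a minimal local sketch of the three functions the abstract names (construction, parsing, and validation of institution and collection codes). It is not the ENA Source Attribute Helper itself and calls no real endpoint. The function names, the `KNOWN_CODES` lookup, and the placeholder codes `INST` and `COLL` are all hypothetical, and the sketch assumes the colon-separated form `[institution:[collection:]]identifier` with no colons inside the identifier.

```python
from typing import Dict, NamedTuple, Optional, Set


class SourceTriplet(NamedTuple):
    institution_code: Optional[str]
    collection_code: Optional[str]
    identifier: str


# Hypothetical stand-in for a curated registry of institution and
# collection codes; the entries are placeholders, not real records.
KNOWN_CODES: Dict[str, Set[str]] = {"INST": {"COLL"}}


def build_attribute(identifier: str,
                    institution_code: Optional[str] = None,
                    collection_code: Optional[str] = None) -> str:
    """Join the parts into '[institution:[collection:]]identifier'."""
    if not identifier:
        raise ValueError("an identifier is required")
    if collection_code and not institution_code:
        raise ValueError("a collection code requires an institution code")
    parts = [p for p in (institution_code, collection_code, identifier) if p]
    return ":".join(parts)


def parse_attribute(value: str) -> SourceTriplet:
    """Split an attribute string into its one, two, or three parts."""
    parts = [p.strip() for p in value.split(":")]
    if any(not p for p in parts) or len(parts) > 3:
        raise ValueError(f"not a valid triplet string: {value!r}")
    if len(parts) == 1:
        return SourceTriplet(None, None, parts[0])
    if len(parts) == 2:
        return SourceTriplet(parts[0], None, parts[1])
    return SourceTriplet(parts[0], parts[1], parts[2])


def codes_are_known(triplet: SourceTriplet,
                    known: Dict[str, Set[str]]) -> bool:
    """Check institution and collection codes only, never the identifier."""
    if triplet.institution_code is None:
        return True  # nothing to check against the registry
    collections = known.get(triplet.institution_code)
    if collections is None:
        return False
    return (triplet.collection_code is None
            or triplet.collection_code in collections)


if __name__ == "__main__":
    text = build_attribute("12345", "INST", "COLL")
    print(text, codes_are_known(parse_attribute(text), KNOWN_CODES))
```

As in the abstract, the identifier itself is passed through unchecked; only the institution and collection codes are compared against the lookup.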

Objective: Accurate vital statistics data are critical for monitoring population health and strategizing public health interventions. Previous analyses of statewide birth data have identified several factors that may reduce birth certificate accuracy including systematic errors and limited data review by clinicians. The aim of this initiative was to increase the proportion of hospitals in Alabama reporting accurate birth certificate data from 67% to 87% within 1 year. Methods: The Alabama Perinatal Quality Collaborative led this statewide collaborative effort. Process measures included monthly monitoring of 11 variables across 5–10 patient birth certificates per month per hospital. Accuracy determination, defined as ≥95% accuracy of the variables analyzed, was performed by health care specialists at each hospital by comparing birth certificate variables from vital statistics with data obtained from original hospital source materials. Three months of retrospective, baseline accuracy data were collected before project initiation from which actionable drivers and change ideas were identified at individual hospitals. Data were analyzed using statistical process control measures. Results: Thirty‐one hospitals entered data throughout the course of the initiative, accounting for 850 chart analyses and 9350 variable assessments. The least accurately reported variables included birth weight, maternal hypertension, and antenatal corticosteroid exposure. At baseline, 67% of hospitals reported birth certificate accuracy rates ≥ 95%, which increased to 90% of hospitals within 2 months and was sustained for the remainder of the initiative. Conclusion: Statewide, multidisciplinary quality improvement efforts increased birth certificate accuracy vital to public health surveillance.

Articles published on Original Source Material

Lost in Adaptation: Exploring the Phenomenon of Over-Translation in Arabic Translations of English Movies

Moving from a Textual to Visual Medium: Transposition of Ruskin Bond’s The Blue Umbrella to the Cinematic Canvas

“Large wen” or “swelling”? Exploring Myths and Misconceptions about Nicholas Sander’s Description of Anne Boleyn and Its Link to Witchcraft

Adaptation Odyssey: Tracing the Evolution of Postcolonial Narrative from Fiction, Film to Digital Gaming

Methods for Evaluating Database Coverage

Establishing the true identity of Passiflora nephrodes (Passifloraceae), resulting in Passiflora rosacea, a new species from Santa Cruz, Bolivia

Transforming Attitudes Towards the Turk in Edward Ravenscroft’s Mamamouchi, or The citizen turn’d gentleman (1672) and Molière’s Le bourgeois Gentilhomme (1668)

The Whitworth: a place for Industry and Art

Ghostbusting in the Late Anthropocene

Seven years with Orion

The ENA Source Attribute Helper: An API for improved biological source data

The Pedagogical Readings as a unique historical source for research on the pedagogical work with disabled pupils in the GDR educational system

Logistics Contracts and the Political Economy of State Failure: Evidence from Somalia

The Elwyn Archives and Museum

Embracing the Crowd: People-powered Research to Preserve the History of Astronomy

Improving birth certificate data accuracy in Alabama.

American Warsaw: The Rise, Fall, and Rebirth of Polish Chicago / Poles in Illinois

The distribution of fake British pounds in the biggest money counterfeiting scheme in history

Azusa Street and the Lost Doctrine of Humility

The Rand Transcript Revealed
