Exploring and retrieving sequence and metadata for species across the tree of life with NCBI Datasets

Nuala A O’Leary,Eric Cox,J Bradley Holmes,W Ray Anderson,Robert Falk,Vichet Hem,Mirian T N Tsuchiya,Gregory D Schuler,Xuan Zhang,John Torcivia,Anne Ketter,Laurie Breen,Jonathan Cothran,Hena Bajwa,Jovany Tinne,Peter A Meric,Wratko Hlavina,Valerie A Schneider

doi:10.1038/s41597-024-03571-y

Nuala A O’Leary, Eric Cox + Show 16 more

Open Access

https://doi.org/10.1038/s41597-024-03571-y

Copy DOI

Journal: Scientific Data	Publication Date: Jul 5, 2024
Citations: 3	License type: cc-by

Abstract

To explore complex biological questions, it is often necessary to access various data types from public data repositories. As the volume and complexity of biological sequence data grow, public repositories face significant challenges in ensuring that the data is easily discoverable and usable by the biological research community. To address these challenges, the National Center for Biotechnology Information (NCBI) has created NCBI Datasets. This resource provides straightforward, comprehensive, and scalable access to biological sequences, annotations, and metadata for a wide range of taxa. Following the FAIR (Findable, Accessible, Interoperable, and Reusable) data management principles, NCBI Datasets offers user-friendly web interfaces, command-line tools, and documented APIs, empowering researchers to access NCBI data seamlessly. The data is delivered as packages of sequences and metadata, thus facilitating improved data retrieval, sharing, and usability in research. Moreover, this data delivery method fosters effective data attribution and promotes its further reuse. This paper outlines the current scope of data accessible through NCBI Datasets and explains various options for exploring and downloading the data.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Exploring and retrieving sequence and metadata for species across the tree of life with NCBI Datasets

Abstract

Talk to us

Similar Papers

More From: Scientific Data

Lead the way for us

Similar Papers

Applying the FAIR principles to data in a hospital: challenges and opportunities in a pandemic
Núria Queralt-Rosinach ... Rajaram Kaliyaperumal
Journal of biomedical semantics | VOL. 13
Núria Queralt-Rosinach, et. al.Núria Queralt-Rosinach ... Rajaram Kaliyaperumal
25 Apr 2022
Journal of biomedical semantics | VOL. 13

SPARClink: an interactive tool to visualize the impact of the SPARC program.
Sanjay Soundarajan ... Jongchan Kim
F1000Research | VOL. 11
Sanjay Soundarajan, et. al.Sanjay Soundarajan ... Jongchan Kim
31 Jan 2022
F1000Research | VOL. 11

Submission of Microarray Data to Public Repositories
Catherine A Ball ... Ronald Taylor
PLoS Biology | VOL. 2
Catherine A Ball, et. al.Catherine A Ball ... Ronald Taylor
31 Aug 2004
PLoS Biology | VOL. 2

Initiatives, Concepts, and Implementation Practices of FAIR (Findable, Accessible, Interoperable, and Reusable) Data Principles in Health Data Stewardship Practice: Protocol for a Scoping Review.
Esther Thea Inau ... Atinkut Alamirrew Zeleke
JMIR Research Protocols | VOL. 10
Esther Thea Inau, et. al.Esther Thea Inau ... Atinkut Alamirrew Zeleke
02 Feb 2021
JMIR Research Protocols | VOL. 10

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Exploring and retrieving sequence and metadata for species across the tree of life with NCBI Datasets

Abstract

Talk to us

Similar Papers

More From: Scientific Data