A hybrid human and machine resource curation pipeline for the Neuroscience Information Framework

A E Bandrowski, P Ciccarese, L Marenco, J S Grethe, Y Li, V Astakhov, P W Sternberg, J Cachat, M E Martone, T Clark, R Wang, H M Muller

doi:10.1093/database/bas005

Abstract

The breadth of information resources available to researchers on the Internet continues to expand, particularly in light of recently implemented data-sharing policies required by funding agencies. However, the nature of dense, multifaceted neuroscience data and the design of contemporary search engine systems make efficient, reliable and relevant discovery of such information a significant challenge. This challenge is especially pertinent for online databases, whose dynamic content is ‘hidden’ from search engines. The Neuroscience Information Framework (NIF; http://www.neuinfo.org) was funded by the NIH Blueprint for Neuroscience Research to address the problem of finding and utilizing neuroscience-relevant resources such as software tools, data sets, experimental animals and antibodies across the Internet. From the outset, NIF sought to provide an accounting of available resources while developing technical solutions for finding, accessing and utilizing them. The curators, therefore, are tasked with identifying and registering resources, examining data, writing configuration files to index and display data, and keeping the contents current. In the initial phases of the project, all aspects of the registration and curation processes were manual. However, as the number of resources grew, manual curation became impractical. This report describes our experiences and successes with developing automated resource discovery and semiautomated type characterization with text-mining scripts that facilitate curation team efforts to discover, integrate and display new content. We also describe the DISCO framework, a suite of automated web services that significantly reduces the manual curation effort needed to periodically check for resource updates. Lastly, we discuss DOMEO, a semiautomated annotation tool that improves the discovery and curation of resources that are not necessarily website-based (i.e. reagents, software tools). Although the ultimate goal of automation was to reduce the workload of the curators, it has resulted in valuable analytic by-products that address accessibility, use and citation of resources and that can now be shared with resource owners and the larger scientific community.

Database URL: http://neuinfo.org

Highlights

The Neuroscience Information Framework (NIF) is a rich and diverse system for discovering biological information of broad relevance to neuroscience
We describe the development of automated resource discovery and curation techniques
The NIF project is maintained by two full-time curators and curatorial assistants who are responsible for identifying, representing, updating and integrating resources within the NIF Registry and Data Federation

Summary

Introduction

The Neuroscience Information Framework (NIF) is a rich and diverse system for discovering biological information of broad relevance to neuroscience. The Data Federation is an extension of the NIF Registry, providing access to the deep and continuously updated content (see description of DISCO tools below) of over 100 of the registered databases and data sets.
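As an illustration of what a periodic check for resource updates can involve, the following minimal Python sketch compares a fingerprint of a resource's current content against the fingerprint stored at the previous check. It is not the DISCO implementation, which is not described in this summary; the names ResourceSnapshot, fingerprint and check_for_update are hypothetical, and hashing the raw content is an assumed, simplified change-detection strategy.

```python
import hashlib
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass(frozen=True)
class ResourceSnapshot:
    """State recorded for a resource at one check (hypothetical structure)."""
    resource_id: str   # illustrative identifier, not a real NIF identifier
    fingerprint: str   # SHA-256 hex digest of the content seen at that check


def fingerprint(content: bytes) -> str:
    """Return a SHA-256 hex digest of the retrieved content."""
    return hashlib.sha256(content).hexdigest()


def check_for_update(
    previous: Optional[ResourceSnapshot],
    resource_id: str,
    content: bytes,
) -> Tuple[bool, ResourceSnapshot]:
    """Compare freshly retrieved content with the previous snapshot.

    Returns (changed, new_snapshot). A resource with no previous snapshot
    is reported as changed so that it is indexed at least once.
    """
    current = ResourceSnapshot(resource_id, fingerprint(content))
    changed = previous is None or previous.fingerprint != current.fingerprint
    return changed, current


if __name__ == "__main__":
    # Content would normally come from a scheduled retrieval; bytes are
    # passed in directly here so the sketch runs without network access.
    changed, snap = check_for_update(None, "example-resource", b"release 1")
    print(changed)  # first check of a resource
    changed, snap = check_for_update(snap, "example-resource", b"release 1")
    print(changed)  # same content as before
    changed, snap = check_for_update(snap, "example-resource", b"release 2")
    print(changed)  # content differs from the stored fingerprint
```

In a setup like this, a curator would only need to review resources flagged as changed, rather than revisiting every registered resource by hand.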

Journal: Database	Publication Date: Mar 20, 2012
Citations: 20	License type: cc-by

Similar Papers

Chapter Three - A Survey of the Neuroscience Resource Landscape: Perspectives from the Neuroscience Information Framework
Jonathan Cachat ... Stephen D Larson
International Review of Neurobiology | VOL. -
01 Jan 2012

The Neuroscience Information Framework (NIF): accessing diverse neuroscience resources through a single interface
Miller Perry
Frontiers in Neuroinformatics | VOL. 2
01 Jan 2008

DISCO: An Internet-based initiative to facilitate data integration for the Neuroscience Information Framework
Consortium Nif
Frontiers in Neuroinformatics | VOL. 5
01 Jan 2010

Orchestration of web services in the NIF project: using the Kepler workflow engine for data fusion
Astakhov Vadim ... Bandrowski Anita
Frontiers in Neuroinformatics | VOL. 8
01 Jan 2014
