Building Open-Source Digital Curation Services and Repositories at Scale

Michael Kurtz,Will Thomas,Gregory Jansen,Richard Marciano,Sohan Shah

doi:10.2218/ijdc.v13i1.621

Abstract

The focus of this article is to share several in-progress research and development open-source approaches that seek to design, build, and test digital curation services and repositories that have the potential to scale (the IMLS-funded Fedora DRAS-TIC and the NSF-funded Brown Dog). We also discuss the creation of a big records testbed of justice, human rights, and cultural heritage collections (100 TB and 100 million records), the emergence of Computational Archival Science (CAS), and the resulting efforts at integrating digital curation education and research. We ultimately seek to develop a sustainable community of users and developers, with solutions that serve the international library, archives, and scientific data management communities. We are also focused on digital curation training and education in these innovative environments.

Highlights

We present two approaches to the design and curation of digital repositories that exploit current technology to address the emerging issues of capacity scaling, heterogeneous content, and sustainability: The first approach exploits NoSQL distributed database technology to support repositories that can scale out horizontally to thousands of commodity servers
This was recently funded through a U.S Institute of Museum and Library Services (IMLS) grant, called DRAS-TIC Fedora1, as part of IMLS’s National Digital Platform (NDP) program
Brown Dog is a $10.5M National Science Foundation (NSF)/DIBBs-funded collaboration with the University of Illinois National Center for Supercomputing Applications (NCSA) Supercomputing Center and industry partners (NetApp and Archive Analytics Solutions). This project aims to help accelerate the development of digital curation processes and services and create a data observatory to provide access to Big Records training sets and teach students practical digital curation skills

Summary

Introduction

Brown Dog is a $10.5M NSF/DIBBs-funded collaboration with the University of Illinois NCSA Supercomputing Center and industry partners (NetApp and Archive Analytics Solutions) This project aims to help accelerate the development of digital curation processes and services and create a data observatory to provide access to Big Records training sets and teach students practical digital curation skills. Human rights, and cultural heritage themes (community displacement, racial zoning, refugee narrative, citizen narrative, movement of people, and revealing untold stories) and cyberinfrastructure for the curation and management of digital assets at scale themes (preservation services in the cloud, and scalable distributed repositories) These projects are supported by the development of the DRAS-TIC open-source software which currently manages 100 million files and 100TB of cultural heritage data. It explores eight topics: 1) Evolutionary prototyping and computational linguistics (Bill Underwood), 2) Graph analytics, digital humanities and archival representation (Richard Marciano), 3) Computational finding aids (Greg Jansen), 4) Digital curation (Michael Kurtz), 5) Public engagement with (archival) content (Mark Hedges), 6) Authenticity (Victoria Lemieux), 7) Confluences between archival theory and computational methods (Maria Esteva), and 8) Spatial and temporal analytics (Mark Conrad)

Computational Methods

Curation and Appraisal

Creation and Management of Current Records

Full Text

Published Version (Free)

View/Download pdf

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: International Journal of Digital Curation	Publication Date: Dec 27, 2018
Citations: 1	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Building Open-Source Digital Curation Services and Repositories at Scale

Abstract

Highlights

Summary

Published Version (Free)

Talk to us

Similar Papers

More From: International Journal of Digital Curation

Lead the way for us

Similar Papers

Digital curation in museums
Joyce Ray
Library Hi Tech | VOL. 35
Joyce RayJoyce Ray
20 Mar 2017
Library Hi Tech | VOL. 35

Digital Curation Education in Practice: Catching up with Two Former Fellows
Lisa Gregory ... Samantha Guss
International Journal of Digital Curation | VOL. 6
Lisa Gregory, et. al.Lisa Gregory ... Samantha Guss
25 Jul 2011
International Journal of Digital Curation | VOL. 6

Digital Curators’ Education: Professional Identity vs. Convergence of LAM (Libraries, Archives, Museums)
Anna Maria Tammaro ... Melody Madrid
-
Anna Maria Tammaro, et. al.Anna Maria Tammaro ... Melody Madrid
01 Jan 2013
01 Jan 2013

Competency-based Curriculum: An Effective Approach to Digital Curation Education
Jeonghyun Kim
Journal of Education for Library and Information Science Online | VOL. 56
Jeonghyun KimJeonghyun Kim
01 Jan 2015
Journal of Education for Library and Information Science Online | VOL. 56

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Building Open-Source Digital Curation Services and Repositories at Scale

Abstract

Highlights

Summary

Published Version (Free)

Talk to us

Similar Papers

More From: International Journal of Digital Curation