Abstract

This paper describes batch loading workflows developed for the Knowledge Bank, The Ohio State University’s institutional repository. In the five years since the inception of the repository, approximately 80 percent of the items added to the Knowledge Bank, a DSpace repository, have been batch loaded. Most of the batch loads used Perl scripts to automate the process of importing metadata and content files. Custom Perl scripts were used to migrate data from spreadsheets or comma-separated values files into the DSpace archive directory format, to build collections and tables of contents, and to provide data quality control. Two projects are described to illustrate the process and workflows.

Highlights

  • Batch ingesting is acknowledged in the literature as a means of populating institutional repositories

  • The Knowledge Bank uses an extended version of the default DSpace Qualified DC schema, which includes several additional element qualifiers

  • The Knowledge Bank contains the abstracts of the papers presented at the OSU International Symposium on Molecular Spectroscopy (MSS), which has met annually since 1946


Summary

Literature Review

Batch ingesting is acknowledged in the literature as a means of populating institutional repositories. The XML source metadata they used was generated by the National Library of New Zealand Metadata Extraction Tool.[7] Two subsequent projects for the HRC revisited the workflow described by Kim, Dong, and Durden.[8] Proudfoot and her colleagues discuss importing metadata-only records from departmental RefBase, Thomson Reuters EndNote, and Microsoft Access databases into ePrints. The Knowledge Bank workflows described in this paper use Perl scripts to generate DC XML and create the archive directory for batch loading metadata records and content files into DSpace using Excel spreadsheets or CSV files.

DSpace offers two user interfaces: the original interface based on JavaServer Pages (JSPUI) and the newer Manakin (XMLUI) interface based on the Apache Cocoon framework. At this writing, the Knowledge Bank continues to use the JSPUI interface.
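To make the spreadsheet-to-archive step concrete, the following is a minimal sketch in Python, not the Perl scripts the paper describes. It assumes a CSV file whose metadata columns are named `dc.element` or `dc.element.qualifier`, a hypothetical `filename` column naming each item's content file, and `||` as a separator for repeated values; all of these conventions are chosen for illustration only. For each row it writes one item directory containing a `dublin_core.xml` file and a `contents` file, which is the layout of the DSpace archive directory format.

```python
import csv
import io
import xml.etree.ElementTree as ET
from pathlib import Path


def build_dublin_core(row):
    """Turn one CSV row into a <dublin_core> element.

    Columns named "dc.element" or "dc.element.qualifier" are treated as
    metadata. Repeated values in a cell are split on "||", a separator
    chosen for this sketch.
    """
    root = ET.Element("dublin_core")
    for column, cell in row.items():
        if not column or not column.startswith("dc."):
            continue  # skip non-metadata columns such as "filename"
        if not (cell or "").strip():
            continue  # skip empty cells
        parts = column.split(".")
        element = parts[1]
        qualifier = parts[2] if len(parts) > 2 else "none"
        for value in cell.split("||"):
            dcvalue = ET.SubElement(
                root, "dcvalue", element=element, qualifier=qualifier
            )
            dcvalue.text = value.strip()
    return root


def write_archive(csv_text, out_dir):
    """Write one item directory per CSV row and return the directory paths."""
    out_dir = Path(out_dir)
    item_dirs = []
    for index, row in enumerate(csv.DictReader(io.StringIO(csv_text))):
        item_dir = out_dir / f"item_{index:03d}"
        item_dir.mkdir(parents=True, exist_ok=True)
        ET.ElementTree(build_dublin_core(row)).write(
            str(item_dir / "dublin_core.xml"),
            encoding="utf-8",
            xml_declaration=True,
        )
        # The contents file lists the item's content files, one per line.
        # Copying the files themselves into item_dir is left out of this sketch.
        (item_dir / "contents").write_text(
            row["filename"].strip() + "\n", encoding="utf-8"
        )
        item_dirs.append(item_dir)
    return item_dirs
```

Calling `write_archive(csv_text, "archive")` on a one-row CSV would produce `archive/item_000/` with the two files described above. A production script would also copy the content files into each item directory and validate the metadata before loading.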

The default metadata used by DSpace is a Qualified Dublin Core schema.
The Issues of the Ohio Journal of Science
Knowledge Bank Dublin Core
The Abstracts of the OSU International Symposium on Molecular Spectroscopy
Retrospective MSS Batch Loads
Annual MSS Batch Loads
Acknowledgments
