Abstract

Poor quality metadata can have negative impact not only on the way research datasets are retrieved, shared and used by scientists, but also on the way research data repositories are managed and audited. The aim of the research reported in this paper was to perform a descriptive analysis of the Dublin Core's Subject metadata element and identify its quality problems, if any, in the context of the Dryad research data repository following a novel data-preprocessing method using SQL queries. The findings showed quality problems related to the lack of controlled vocabulary and standardisation, like the inconsistent use of singular and plural forms, adjectives and synonyms. This study has both practical and methodological implications for the evaluation of metadata and the improvement of the quality of the research data annotation process in open research data repositories.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call