Research data are considered the primary result and output of scientific research, and sharing and reusing data are key aspects of the transition to open science on a European level. However, there are many unanswered questions regarding research data management, such as understanding which data might be shared and under what conditions, who might share them, or even what research data is (Borgman, 2012). In the humanities, research data are among the most diverse of all scientific disciplines because nearly any data on human activity can be considered research data, such as newspapers or photographs. Thus, the boundary between data and publication in this context is often quite vague (Borgman, 2008; Thoegersen, 2018).This paper aims to examine and answer several research questions related to datasets in the field of humanities. The first research question will analyse the types of research data represented in humanities (1). In literature, a consensus has yet to be reached on the definition, and often, research data in the humanities are used as an umbrella term that includes different types of sources for research. DARIAH-DE defines research data in humanities as all sources/materials and results collected, written, described, and/or evaluated in the context of a research and research questions in the field of human and cultural sciences, presented in machine-readable form for archiving, citation and further use. This definition aims to account for the particular characteristics of human-scientific research and the resulting heterogeneity of the underlying data (DARIAH- DE, 2021). For the purpose of this research, the authors have chosen to adopt the DARIAH- DE definition of research data which will be the base for determining types of data represented in humanities. The research was conducted for humanities fields, including philosophy, theology, philology, history, art history, art science, archaeology, ethnology and anthropology, religious studies, and interdisciplinary humanities science. An analysis of datasets published in institutional and thematic repositories in the field of humanities in Croatia and Europe was conducted using platforms such as Zenodo, Digital Academic Archives and Repositories (DABAR), CLARIN, CROSSDA, and DARIAH. The research aimed to identify the most prevalent data types in the humanities found in repositories. In a study on the storage of research data from the humanities in repositories, Buddenbohm et al. (2016) note that a culture of sharing and reusing research data has not yet been established; although research data are, to some extent, stored in repositories, they are difficult to find. In light of this, the second research question will examine the extent to which datasets in the field of humanities are represented in repositories in open access and under which licences (2). The third research question will examine to what extent research data align with FAIR principles (3). The evaluation of dataset FAIRness will be conducted using the principles outlined in the Wilkinson et al. (2016) FAIR guidelines and Routledge Open Research data guidelines, as mentioned in Grant (2022) for FAIR humanities data. In addition, based on FAIR guidelines, authors will provide a practical framework for managing research data in humanities, which can be used to assess the quality of research datasets. The paper will present the types of research data in humanities that are represented in repositories, assess their level of openness, licensing, and alignment with FAIR data principles.
Read full abstract