Task design and assignment of full-text generation on mass Chinese historical archives in digital humanities

Jihong Liang,Hao Wang,Xiaojing Li

doi:10.1108/ajim-09-2019-0245

Abstract

PurposeThe purpose of this paper is to explore the task design and assignment of full-text generation on mass Chinese historical archives (CHAs) by crowdsourcing, with special attention paid to how to best divide full-text generation tasks into smaller ones assigned to crowdsourced volunteers and to improve the digitization of mass CHAs and the data-oriented processing of the digital humanities.Design/methodology/approachThis paper starts from the complexities of character recognition of mass CHAs, takes Sheng Xuanhuai archives crowdsourcing project of Shanghai Library as a case study, and makes use of the theories of archival science, including diplomatics of Chinese archival documents, and the historical approach of Chinese archival traditions as the theoretical basis and analysis methods. The results are generated through the comprehensive research.FindingsThis paper points out that volunteer tasks of full-text generation include transcription, punctuation, proofreading, metadata description, segmentation, and attribute annotation in digital humanities and provides a metadata element set for volunteers to use in creating or revising metadata descriptions and also provides an attribute tag set. The two sets can be used across the humanities to construct overall observations about texts and the archives of which they are a part. Along these lines, this paper presents significant insights for application in outlining the principles, methods, activities, and procedures of crowdsourced full-text generation for mass CHAs.Originality/valueThis study is the first to explore and identify the effective design and allocation of tasks for crowdsourced volunteers completing full-text generation on CHAs in digital humanities.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Task design and assignment of full-text generation on mass Chinese historical archives in digital humanities

Abstract

Talk to us

Similar Papers

More From: Aslib Journal of Information Management

Lead the way for us

Journal: Aslib Journal of Information Management	Publication Date: Mar 25, 2020
Citations: 8

Similar Papers

Forum Introduction
Lauren Tilton ... Jesse P Karlsberg
American Quarterly | VOL. 70
Lauren Tilton, et. al.Lauren Tilton ... Jesse P Karlsberg
01 Jan 2018
American Quarterly | VOL. 70

Transforming Text: Four Valences of a Digital Humanities Informed Writing Analytics
Gregory J Palermo
The Journal of Writing Analytics | VOL. 1
Gregory J PalermoGregory J Palermo
01 Jan 2017
The Journal of Writing Analytics | VOL. 1

Using parsed and annotated corpora to analyze parliamentarians' talk in Finland
Mykola Andrushchenko ... Jussi Kurunmäki
Journal of the Association for Information Science and Technology | VOL. 73
Mykola Andrushchenko, et. al.Mykola Andrushchenko ... Jussi Kurunmäki
04 Jun 2021
Journal of the Association for Information Science and Technology | VOL. 73

Visual Analysis of Scientific Life of Scholars Based on Digital Humanities
Wei Liu ... Xiaoju Dong
-
Wei Liu, et. al.Wei Liu ... Xiaoju Dong
01 Jan 2018
01 Jan 2018

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Task design and assignment of full-text generation on mass Chinese historical archives in digital humanities

Abstract

Talk to us

Similar Papers

More From: Aslib Journal of Information Management