The implications of handwritten text recognition for accessing the past at scale

Joseph Nockels,Paul Gooding,Melissa Terras

doi:10.1108/jd-09-2023-0183

Joseph Nockels, Paul Gooding + Show 1 more

Open Access

https://doi.org/10.1108/jd-09-2023-0183

Copy DOI

Journal: Journal of Documentation	Publication Date: Apr 18, 2024
Citations: 1	License type: cc-by

Affiliation: University of Edinburgh

Abstract

PurposeThis paper focuses on image-to-text manuscript processing through Handwritten Text Recognition (HTR), a Machine Learning (ML) approach enabled by Artificial Intelligence (AI). With HTR now achieving high levels of accuracy, we consider its potential impact on our near-future information environment and knowledge of the past.Design/methodology/approachIn undertaking a more constructivist analysis, we identified gaps in the current literature through a Grounded Theory Method (GTM). This guided an iterative process of concept mapping through writing sprints in workshop settings. We identified, explored and confirmed themes through group discussion and a further interrogation of relevant literature, until reaching saturation.FindingsCatalogued as part of our GTM, 120 published texts underpin this paper. We found that HTR facilitates accurate transcription and dataset cleaning, while facilitating access to a variety of historical material. HTR contributes to a virtuous cycle of dataset production and can inform the development of online cataloguing. However, current limitations include dependency on digitisation pipelines, potential archival history omission and entrenchment of bias. We also cite near-future HTR considerations. These include encouraging open access, integrating advanced AI processes and metadata extraction; legal and moral issues surrounding copyright and data ethics; crediting individuals’ transcription contributions and HTR’s environmental costs.Originality/valueOur research produces a set of best practice recommendations for researchers, data providers and memory institutions, surrounding HTR use. This forms an initial, though not comprehensive, blueprint for directing future HTR research. In pursuing this, the narrative that HTR’s speed and efficiency will simply transform scholarship in archives is deconstructed.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

The implications of handwritten text recognition for accessing the past at scale

Abstract

Talk to us

Similar Papers

More From: Journal of Documentation

Lead the way for us

Similar Papers

ONLINE HANDWRITING RECOGNITION SYSTEM FOR ENGLISH LANGUAGE USING SUPPORT VECTOR MACHINE (SVM) APPROACH
...
International Journal of Advanced Research in Computer Science | VOL. 9
, et. al. ...
02 May 2018
International Journal of Advanced Research in Computer Science | VOL. 9

Online handwriting recognition systems for Indic and non-Indic scripts: a review
Harjeet Singh ... V P Singh
Artificial Intelligence Review | VOL. 54
Harjeet Singh, et. al.Harjeet Singh ... V P Singh
08 Aug 2020
Artificial Intelligence Review | VOL. 54

Machine Learning Techniques in Handwriting Recognition
Hong Lee ... Michael Li
-
Hong Lee, et. al.Hong Lee ... Michael Li
01 Jan 2012
01 Jan 2012

Understanding the application of handwritten text recognition technology in heritage contexts: a systematic review of Transkribus in published research
Joe Nockels ... Paul Gooding
Archival Science | VOL. 22
Joe Nockels, et. al.Joe Nockels ... Paul Gooding
17 Jun 2022
Archival Science | VOL. 22

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

The implications of handwritten text recognition for accessing the past at scale

Abstract

Talk to us

Similar Papers

More From: Journal of Documentation