Cross-disciplinary Collaborations to Enrich Access to Non-Western Language Material in the Cultural Heritage Sector

Tom Derrick,Nora Mcgregor

doi:10.1145/3322905.3322907

Abstract

The British Library is home to millions of items representing every age of written civilisation, including books, manuscripts and newspapers in all written languages. Large digitisation programmes currently underway are opening up access to this rich and unique historical content on an ever increasing scale. However, particularly for historical material written in non-Latin scripts, enabling enriched full-text discovery and analysis across the digitised output, something which would truly transform access and scholarship, is still out of reach. This is due in part to commercial text recognition solutions currently on the market today having largely been optimised for modern documents and Latin scripts. This paper will report on a series of initiatives undertaken by the British Library to investigate, evaluate and support new research into enhancing text recognition capabilities for two major digitised collections of non-Western language collections: printed Bangla and handwritten Arabic. It seeks to present lessons learned and opportunities gained from cross-disciplinary collaboration between the cultural heritage sector and researchers working at the cutting edge of text recognition, with a view towards informing and encouraging future such partnerships.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Cross-disciplinary Collaborations to Enrich Access to Non-Western Language Material in the Cultural Heritage Sector

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Generalization of Hindi OCR Using Adaptive Segmentation and Font Files
Mudit Agrawal ... David Doermann
-
Mudit Agrawal, et. al.Mudit Agrawal ... David Doermann
01 Jan 2009
01 Jan 2009

Digital Cultural Heritage Roadmap for Preservation
Antonella Fresa
International Journal of Humanities and Arts Computing | VOL. 8
Antonella FresaAntonella Fresa
01 Mar 2014
International Journal of Humanities and Arts Computing | VOL. 8

Linked Data and Cultural Heritage
Edie Davis ... Bahareh Heravi
Journal on Computing and Cultural Heritage | VOL. 14
Edie Davis, et. al.Edie Davis ... Bahareh Heravi
10 May 2021
Journal on Computing and Cultural Heritage | VOL. 14

Text Block Segmentation in Comic Speech Bubbles
Christophe Rigaud ... Nhu-Van Nguyen
-
Christophe Rigaud, et. al.Christophe Rigaud ... Nhu-Van Nguyen
01 Jan 2020
01 Jan 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Cross-disciplinary Collaborations to Enrich Access to Non-Western Language Material in the Cultural Heritage Sector

Abstract

Talk to us

Similar Papers