Abstract
With invaluable help from, and in close co-operation with, colleagues from around the world, the Electronic Text Corpus of Sumerian Literature project at the University of Oxford has compiled, lemmatised and made publicly available a large body of Sumerian literature. Building a corpus of literary compositions originally written on clay tablets in the cuneiform script, and dating back nearly four thousand years, poses special challenges, not least with regard to mark-up and automatic processing of data. This paper discusses some of these challenges, together with issues arising from the fact that Sumerian is a language isolate and lacks resources we take for granted when working with other languages, modern or extinct, such as a standardised sign list and a comprehensive dictionary.