The DiGreC Treebank

Morgan Macleod, Elena Anagnostopoulou, Christina Sevdali, Dionysios Mertyris

doi:10.1163/24523666-06010004

Open Access

Abstract

The DiGreC (DIachrony of GREek Case) treebank is a corpus of selected sentences from Greek texts, ranging from Homer to Modern Greek, which have been annotated morphosyntactically and semantically. The corpus comprises excerpts from 655 texts, for a total of 3385 sentences and 56,440 word tokens; automated tagging and lemmatisation have been supplemented with manual review to ensure accuracy. The data exist in XML and CSV formats, which can be manipulated and converted automatically to other schemata. A web site has also been created to allow users to interact with the data more easily, and to provide specialised functionality for searching and visualisation. This corpus was created to inform theoretical debates regarding the role of case in grammar, and may be of use to researchers searching for specific attestations of a range of different constructions in Greek.

Highlights

The goal of this project has been to use the Greek language, which furnishes a large quantity of linguistic data over an unusually long span of time, to investigate syntactic phenomena, and to provide a clearer picture of the Greek case system and its changes over time, which has the potential to inform theoretical discussions on the nature of linguistic case
From the classifications found in traditional Greek grammars (e.g., Goodwin, 1894; Smyth, 1920; Tzartzanos, 1940), and from the Greek equivalents of verbs listed in semantic classifications such as Levin (1993)
The DiGreC treebank represents an attempt to make the data from our project accessible to and reusable by other researchers. This freely accessible treebank provides syntactically and semantically annotated data from a more diverse range of texts, over a broader time span, than many existing resources. Although it does not exhaustively represent the full surviving body of Ancient Greek texts, it can be used by researchers seeking examples of specific constructions, for research not only on those aspects of grammar on which we have focused but also on the many other phenomena which our data embody.

Summary

Introduction

The goal of this project has been to use the Greek language, which furnishes a large quantity of linguistic data over an unusually long span of time, to investigate syntactic phenomena, and to provide a clearer picture of the Greek case system and its changes over time, which has the potential to inform theoretical discussions on the nature of linguistic case. We have chosen to make the data used in this project available to the public in the form of a morphosyntactically and semantically annotated treebank. This article describes the features of this treebank, as well as the data selection principles and methodology involved in its construction.


Journal: Research Data Journal for the Humanities and Social Sciences
Publication Date: Dec 6, 2021
Citations: 1
License type: CC BY 4.0
