Dependency Treebanks of Ancient Greek Prose

Vanessa B Gorman

doi:10.5334/johd.13

Abstract

This dataset is a collection of dependency syntax trees of representative texts from ancient Greek prose authors (Aeschines, Antiphon, Appian, Athenaeus, Demosthenes, Dionysius of Halicarnassus, Herodotus, Josephus, Lysias, Plutarch, Polybius, Thucydides, and Xenophon), totaling more than 550,000 tokens to date. It is hand-annotated by one person, using the Arethusa program on the Perseids website. Original texts were obtained from the Perseus Digital Library, and some (as indicated) were computer pre-parsed at the Pedalion Project. The database is stored in a stable form (2019-12-31) on Zenodo (DOI: 10.5281/zenodo.3596076) and in a continuously updated form on GitHub in .xml format (https://vgorman1.github.io/). The repository can be used for pedagogical purposes and for research in linguistic analysis and corpus linguistics, stylistics, natural language processing, classification, and literary and historical analysis.

Highlights

I made the trees using the Arethusa software on the Perseids website [13].
Original text files were obtained from the Perseus Project [14] (Tufts Univ.) and from the Pedalion Project (KU Leuven).
I followed the rules of dependency syntax, employing the standard AGDT 1.1 tagset [2] and refining it according to the discussion of dependency syntax offered by Pinkster [15].
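
As an illustration of the tagset mentioned above, the following is a minimal sketch of how a morphological tag might be decoded. It assumes the nine-position `postag` layout commonly described for AGDT-style treebanks (part of speech, person, number, tense, mood, voice, gender, case, degree, with `-` for "not applicable"). The function name `decode_postag` and the code tables are illustrative, are deliberately incomplete, and should be checked against the tagset documentation [2] before use.

```python
# Assumed field order of a nine-character AGDT-style postag.
FIELDS = ["pos", "person", "number", "tense", "mood",
          "voice", "gender", "case", "degree"]

# Partial code tables (assumption: values as commonly documented for
# AGDT-style tags; unknown codes are passed through unchanged).
CODES = {
    "pos": {"n": "noun", "v": "verb", "a": "adjective", "d": "adverb",
            "l": "article", "c": "conjunction", "r": "preposition",
            "p": "pronoun", "u": "punctuation"},
    "person": {"1": "first", "2": "second", "3": "third"},
    "number": {"s": "singular", "p": "plural", "d": "dual"},
    "tense": {"p": "present", "i": "imperfect", "a": "aorist",
              "f": "future", "r": "perfect", "l": "pluperfect"},
    "mood": {"i": "indicative", "s": "subjunctive", "o": "optative",
             "n": "infinitive", "m": "imperative", "p": "participle"},
    "voice": {"a": "active", "m": "middle", "p": "passive",
              "e": "medio-passive"},
    "gender": {"m": "masculine", "f": "feminine", "n": "neuter"},
    "case": {"n": "nominative", "g": "genitive", "d": "dative",
             "a": "accusative", "v": "vocative"},
    "degree": {"c": "comparative", "s": "superlative"},
}


def decode_postag(tag):
    """Turn a nine-character tag into a dict of the features that are set."""
    if len(tag) != len(FIELDS):
        raise ValueError(f"expected {len(FIELDS)} characters, got {len(tag)}")
    features = {}
    for field, code in zip(FIELDS, tag):
        if code == "-":  # position not applicable to this word
            continue
        # Fall back to the raw code when the table has no entry for it.
        features[field] = CODES[field].get(code, code)
    return features
```

For example, `decode_postag("n-s---mn-")` would describe a noun that is singular, masculine, and nominative, leaving out the positions that do not apply.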

Summary

Introduction

Methods

I made the trees using the Arethusa software on the Perseids website [13]. I have created more detailed instructions for annotating major linguistic phenomena not covered in Bamman and Crane [2] in the ‘Treebanking Tips’ file within this dataset, relying heavily on the parallel interpretation of dependency syntax offered for Latin by Pinkster [15].

Dataset Creators

Vanessa Gorman is the manual annotator of these trees.
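
Because the trees are distributed as .xml files, a reader may want to load them programmatically. The following is a minimal sketch, assuming the files follow the usual AGDT-style layout in which each `<sentence>` holds `<word>` elements with `id`, `form`, `lemma`, `postag`, `relation`, and `head` attributes, and in which `head="0"` marks the root. The three-word sample sentence is invented for illustration and is not taken from the dataset; the function names are hypothetical.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

# Invented toy sentence in the assumed AGDT-style layout (not from the dataset).
SAMPLE = """<treebank>
  <sentence id="1">
    <word id="1" form="ὁ" lemma="ὁ" postag="l-s---mn-" relation="ATR" head="2"/>
    <word id="2" form="ἄνθρωπος" lemma="ἄνθρωπος" postag="n-s---mn-" relation="SBJ" head="3"/>
    <word id="3" form="λέγει" lemma="λέγω" postag="v3spia---" relation="PRED" head="0"/>
  </sentence>
</treebank>"""


def read_sentences(xml_text):
    """Return one list of word dicts per <sentence> element."""
    root = ET.fromstring(xml_text)
    sentences = []
    for sentence in root.iter("sentence"):
        words = []
        for word in sentence.iter("word"):
            words.append({
                "id": int(word.get("id")),
                "form": word.get("form"),
                "relation": word.get("relation"),
                "head": int(word.get("head")),
            })
        sentences.append(words)
    return sentences


def children_by_head(words):
    """Map each head id to the ids of its direct dependents (0 = root)."""
    tree = defaultdict(list)
    for word in words:
        tree[word["head"]].append(word["id"])
    return dict(tree)


def print_tree(words, tree, head=0, depth=0):
    """Print the dependency tree as an indented outline, root first."""
    by_id = {word["id"]: word for word in words}
    for child in tree.get(head, []):
        word = by_id[child]
        print("  " * depth + f'{word["form"]} [{word["relation"]}]')
        print_tree(words, tree, head=child, depth=depth + 1)
```

To try it on a downloaded file, the file contents can be read as a string and passed to `read_sentences`; calling `print_tree(words, children_by_head(words))` on one sentence then shows each word indented under its head.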

Journal: Journal of Open Humanities Data
Publication Date: Mar 26, 2020
Citations: 6
License type: CC BY 4.0
