A Graph-based Lattice Dependency Parser for Joint Morphological Segmentation and Syntactic Analysis

Wolfgang Seeker,Özlem Çetinoğlu

doi:10.1162/tacl_a_00144

Wolfgang Seeker, Özlem Çetinoğlu

Open Access

https://doi.org/10.1162/tacl_a_00144

Copy DOI

Abstract

Space-delimited words in Turkish and Hebrew text can be further segmented into meaningful units, but syntactic and semantic context is necessary to predict segmentation. At the same time, predicting correct syntactic structures relies on correct segmentation. We present a graph-based lattice dependency parser that operates on morphological lattices to represent different segmentations and morphological analyses for a given input sentence. The lattice parser predicts a dependency tree over a path in the lattice and thus solves the joint task of segmentation, morphological analysis, and syntactic parsing. We conduct experiments on the Turkish and the Hebrew treebank and show that the joint model outperforms three state-of-the-art pipeline systems on both data sets. Our work corroborates findings from constituency lattice parsing for Hebrew and presents the first results for full lattice parsing on Turkish.

Highlights

Linguistic theory has provided examples from many different languages in which grammatical information is expressed via case marking, morphological agreement, or clitics
For Hebrew, the baseline is the disambiguated lattices provided by the SPMRL 2014 Shared Task
The IGeval metric is designed to evaluate the syntactic quality with less attention to morphological analysis and segmentation. Both PIPELINE and JOINT achieve very similar results and none of the differences is statistical significant. These results suggest that a good part of the improvements in the lattice parser occurs in the morphological analysis/segmentation, whereas the quality of syntactic annotation basically stays the same between the pipeline and the joint model

Summary

Introduction

Linguistic theory has provided examples from many different languages in which grammatical information is expressed via case marking, morphological agreement, or clitics. In these languages, configurational information is less important than in English since the words are overtly marked for their syntactic relations to each other. Configurational information is less important than in English since the words are overtly marked for their syntactic relations to each other Such morphologically rich languages pose many new challenges to today’s natural language processing technology, which has often been developed for English. One of the first challenges is the question on how to represent morphologically rich languages and what are the basic units of analysis (Tsarfaty et al, 2010). A space-delimited word in the treebank can consist of several morphemes that may belong to independent syntactic contexts

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Transactions of the Association for Computational Linguistics	Publication Date: Dec 1, 2015
Citations: 87	License type: cc-by

R Discovery Prime

R Discovery Prime

A Graph-based Lattice Dependency Parser for Joint Morphological Segmentation and Syntactic Analysis

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Transactions of the Association for Computational Linguistics

Lead the way for us

Similar Papers

Integration of Morphological and Syntactic Analysis based on LR Parsing Algorithm
Hozumi Tanaka ... Michio Aizawa
Journal of Natural Language Processing | VOL. 2
Hozumi Tanaka, et. al.Hozumi Tanaka ... Michio Aizawa
01 Jan 1995
Journal of Natural Language Processing | VOL. 2

Modern Bulgarian Literature and the Turkish Loan Words
Sadik Haci ... Zeynep Zafer
Balkanistic Forum | VOL. 30
Sadik Haci, et. al.Sadik Haci ... Zeynep Zafer
01 Jun 2021
Balkanistic Forum | VOL. 30

The Effects of Semantic and Syntactic Prediction on Reading Aloud
Elisa Gavard ... Johannes C Ziegler
Experimental Psychology | VOL. 69
Elisa Gavard, et. al.Elisa Gavard ... Johannes C Ziegler
01 Nov 2022
Experimental Psychology | VOL. 69

Revisiting the incremental effects of context on word processing: Evidence from single-word event-related brain potentials.
Brennan R Payne ... Chia‐Lin Lee
Psychophysiology | VOL. 52
Brennan R Payne, et. al.Brennan R Payne ... Chia‐Lin Lee
27 Aug 2015
Psychophysiology | VOL. 52

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Graph-based Lattice Dependency Parser for Joint Morphological Segmentation and Syntactic Analysis

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Transactions of the Association for Computational Linguistics