Russian Tagging and Dependency Parsing Models for Stanford CoreNLP Natural Language Toolkit

Liubov Kovriguina,Alexander Shipilo,Ivan Shilin,Alina Putintseva

doi:10.1007/978-3-319-69548-8_8

Liubov Kovriguina, Alexander Shipilo + Show 2 more

https://doi.org/10.1007/978-3-319-69548-8_8

Copy DOI

Export

Save

Cite

Publication Date: Jan 1, 2017

Citations: 2

Affiliation: ITMO University

Abstract
Full-Text
Similar Papers

Abstract

Listen

The paper concerns implementing maximum entropy tagging model and neural net dependency parser model for Russian language in Stanford CoreNLP toolkit, an extensible pipeline that provides core natural language analysis. Russian belongs to morphologically rich languages and demands full morphological analysis including annotating input texts with POS tags, features and lemmas (unlike the case of case-, person-, etc. insensitive languages when stemming and POS-tagging give enough information about grammatical behavior of a word form). Rich morphology is accompanied by free word order in Russian which adds indeterminacy to head finding rules in parsing procedures. In the paper we describe training data, linguistic features used to learn the classifiers, training and evaluation of tagging and parsing models.

Full Text