ADAPTING A CONSTITUENCY PARSER TO USER-GENERATED CONTENT IN POLISH OPINION MINING

Agnieszka Pluwak,Wojciech Korczynski,Marek Kisiel-Dorohinicki

doi:10.7494/csci.2016.17.1.23

ADAPTING A CONSTITUENCY PARSER TO USER-GENERATED CONTENT IN POLISH OPINION MINING

Agnieszka Pluwak, Wojciech Korczynski + Show 1 more

Open Access

https://doi.org/10.7494/csci.2016.17.1.23

Copy DOI

Journal: Computer Science	Publication Date: Jan 1, 2016
Citations: 22	License type: publisher-specific-oa

Affiliation: Polish Academy of Sciences, Fido Intelligence (Poland), AGH University of Krakow

#Parser Adaptation #Text Normalization + Show 4 more

Abstract
Full-Text PDF
Similar Papers

Abstract

The paper focuses on the adjustment of NLP tools for Polish; e.g., morphological analyzers and parsers, to user-generated content (UGC). The authors discuss two rule-based techniques applied to improve their efficiency: pre-processing (text normalization) and parser adaptation (modified segmentation and parsing rules). A new solution to handle OOVs based on inflectional translation is also offered.

Full Text