Abstract

RESUMO Este artigo descreve a metodologia para a integração de predicados nominais, do tipo construções com verbo-suporte (CVS), no analisador sintático automático XIP, que é utilizado pela cadeia de processamento do Português STRING. Trata-se, mais especificamente, de 580 CVS com o verbo dar e um nome predicativo, cujas propriedades sintático-semânticas foram descritas, formalizadas e, em seguida, integradas à gramática do XIP, por meio de regras, a fim de extrair a dependência SUPPORT entre o nome predicativo (Npred) e o verbo-suporte (Vsup). A necessidade de tratar automaticamente as CVS decorre do fato de que elas são diferentes de construções com verbo pleno, possuem estruturas sintáticas complexas, possuem propriedades sintático-semânticas específicas e admitem transformações sintáticas sistemáticas, ainda que lexicalmente determinadas. O conceito de CVS, bem como a abordagem léxico-sintática adotada, segue os princípios teóricos e metodológicos do Léxico-Gramática. Como resultado da integração desses dados ao parser XIP, o sistema atingiu precisão de 85%, abrangência de 87%, acurácia de 80% e medida-F de 86%.

Highlights

  • Support verb constructions (SVC) are nominal predicates formed by a support verb (Vsup) and a predicative noun (Npred)

  • The sub-sample of the phrases randomly selected from the set of manually annotated SVC was processed by the STRING system and its output was analyzed in comparison with the reference corpus

  • The usual metrics were used to evaluate the output of the system, namely Precision, which measures the fraction of correctly found instances over the total of instances found: (TP/(TP+False positives (FP))); Recall, measuring the fraction of relevant instances that were found: (TP/(TP+FN)); Accuracy, which computes both the correct found instances and the correctly missed cases: ((TP+that should not (TN))/ (TP+TN+FP+FN)); and F-measure, which is the harmonic mean between precision and recall: (2PR/(P+R))

Read more

Summary

Introduction

Support verb constructions (SVC) are nominal predicates formed by a support verb (Vsup) and a predicative noun (Npred). Though there are different types of nominal predicates, in this work we will deal with nominal constructions whose predicative nucleus is a noun (called predicative noun, Npred) and this noun is auxiliated by a verb (called a support verb, Vsup) In this sense, we developed a systematic linguistic analysis of SVC, we adopted a formalization of the data based on the proposal of the Lexicon-Grammar (GROSS, 1975, 1981), we integrated the data in an automatic processing chain of Portuguese, STRING (MAMEDE et al, 2012), and we evaluated the result of the system based on the manual annotation of a corpus.

Notation
Evaluation
Conclusions and future work
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call