Abstract

The identification of valid terms in any domain is fundamental to its computerization. For this reason, in this paper we present a method for obtaining automated morphosyntactic patterns, which will help researchers obtain valid terms from the proposed patterns, in order to build quality ontologies for the translation from one language to another, or to find relevant terms in short sentences, which can be used as parameters in question-answer systems. For this purpose, we use some statistical methods which show candidates in a pattern vector. Then, a heuristic process unfolds to refine the pattern vector obtained, basing on two main parameters: the statistical results previously obtained and the length of the pattern analyzed. As a result, we obtain the collection of the best patterns for the detection of real multiword terms. Key words: Morphosyntactic patterns, multiword terms, incremental learning.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.