Abstract
The problem of determining semantic similarity between words affects the understanding of synonymy and creates obstacles to the work oflexico graphers. The study was carried out as a part ofa larger research project on expert assessment of synonymic rows in RuWordNet thesaurus (a WordNet–like thesaurus for the Russian language). The aim of this study is to analyze RuWordNet thesaurus and compare it with classical dictionaries of Russian synonyms. For this purpose, the authors singled out entry words (adjectives N = 68 and verbs N = 117) and their analogues (adjectives N = 558 and verbs N = 1410) from the New Explanatory Dictionary of Russian Synonyms by Yu. Apresyan (NEDS). An analogue is viewed as aword whose meaning essentially intersects with the general meaning of a given synonymic row, although it lacks the needed semantic similarity that could indicate the presence of synonymy or near–synonymy (Apresyan). The quantitative analysis based on the breadth–first search (BFS) algorithm estimated the distance between each pair entry word! analogue. The quantitative method revealed that the analogues described in NEDS correlate with the hyponyms and hyperonyms in RuWordNet which contributes to the study of near–synonymy. The qualitative method (observation and linguistic interpretation) was used to analyze pairs entry word! analogue which showed the longest distance; such words were 52 adjectives and15 verbs. First, the meanings of entry words and analogues were checked against two Russian language thesauri, then, their representation in the tree graph of RuWordNet was traced. The analysis revealed inaccuracies concerning the similarity between certainwords. The recommendations for further improvement of RuWordNet were given.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.