Dictionary Of Words Research Articles

A dictionary is a helpful tool for most context-based Natural Language Processing research. The words in a language dictionary establish the context coverage for a specific application area. In this study, a novel model is proposed to generate a thematic dictionary from web resources. The model draws on different text similarity algorithms to broaden dictionary coverage and increase its internal similarity. For example, to create a financial dictionary, the algorithm was started with the general seed word “finance”. A web search was run with this word, and the top three web pages returned by the search engine were processed. The words in the contents of these pages were ranked by their meaning values using the term frequency-inverse document frequency (TF-IDF) metric. The selected words were then inserted into three different dictionaries, controlled separately by the WordNet, spaCy, and Simhash text similarity algorithms. All the words added to these dictionaries were then used together for further web searches. This process (search and dictionary update) was repeated for each dictionary separately until each reached the upper word limit (set to 250 words). Finally, the three dictionaries were merged to form the final financial dictionary. This dictionary was compared in quality with a manually created financial dictionary. The internal WordNet similarity rate of the words was 29.01% in the automatic financial dictionary and 23.41% in the manual one. To measure the similarity between the two dictionaries, the words of each were merged into a full text and the two texts were compared by Simhash similarity, which yielded 72.30%. Although the two dictionaries contain largely similar words, the automatic dictionary had the stronger internal semantic representation.
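The ranking step described in this abstract can be illustrated with a minimal sketch. The abstract does not state which TF-IDF variant, tokenizer, or selection cutoff the authors used, so the smoothed IDF formula, the choice to keep each word's best score across pages, and the names `tokenize` and `rank_by_tfidf` are all assumptions made for illustration. The web search and the WordNet, Spacy, and Simhash filters are not shown.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and keep alphabetic tokens (a simplifying assumption)."""
    return re.findall(r"[a-z]+", text.lower())


def rank_by_tfidf(pages, top_k=5):
    """Rank words from the given page texts by their best TF-IDF score.

    Assumed variant: tf = count / page length,
    idf = ln((1 + N) / (1 + df)) + 1 (smoothed).
    """
    docs = [Counter(tokenize(page)) for page in pages]
    n_docs = len(docs)
    # Document frequency: the number of pages that contain each word.
    df = Counter(word for doc in docs for word in doc)

    best = {}
    for doc in docs:
        total = sum(doc.values())
        for word, count in doc.items():
            tf = count / total
            idf = math.log((1 + n_docs) / (1 + df[word])) + 1
            best[word] = max(best.get(word, 0.0), tf * idf)

    # Highest score first; ties are broken alphabetically for determinism.
    return sorted(best, key=lambda w: (-best[w], w))[:top_k]


if __name__ == "__main__":
    # Toy stand-ins for the top three pages returned for a seed word.
    pages = ["stock stock stock bank", "bank loan", "bank credit"]
    print(rank_by_tfidf(pages, top_k=3))
```

In this toy input, "bank" occurs on every page and therefore ranks below words concentrated on a single page. This is the behaviour that makes TF-IDF useful for picking candidate dictionary words.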


This article is part of the broader theme "The particle eshe in stable combinations." The subject of the study is the conjunction combination "a to eshe" in Russian, which is not recorded in lexicographic sources but is quite common in modern speech, as shown by data from the National Corpus of the Russian Language (NKRJ). In terms of syntagmatic activity, the particle "eshe" significantly surpasses other particles. Many function-word formations include this particle, in particular combinations with conjunctions. Such combinations vary in stability, from free combinations to phraseologized formations that function as a single conjunction. The latter case is represented, for example, by the conjunction glossed as "and even" in an aggravating sense. The main research method is the traditional descriptive one, which includes observation, generalization, and systematization of linguistic phenomena. Contextual analysis was also used, along with a corpus method of collecting research material. The analysis of the linguistic material (more than 1,300 uses from the NKRJ) showed that the conjunction "a to" in the studied combination is used in two main meanings: alternative motivation and disjunction (mutual exclusion). The polysemous particle "eshe" has two meanings in these combinations: the undesirability of what is assumed or possible, which corresponds to alternative motivation, and addition, which corresponds to disjunction. In each of the two identified meanings of the combination "a to eshe", each component performs a specific function. In the first meaning, which can be represented schematically as "a to + eshe", the conjunction "a to" expresses alternative motivation, while the particle emphasizes this meaning and adds emotional coloring. In the second meaning, one can speak of a stable combination "a to eshe", in which "a to" has a disjunctive meaning and the particle "eshe" expresses addition. The results can be used in lexicographic practice when compiling a dictionary of function words.


Articles published on Dictionary Of Words

Automated thematic dictionary creation using the web based on WordNet, Spacy, and Simhash

Features of the functioning of the union combination "a to eshe" (more)

Comparison of text information from information sources based on the cosine similarity algorithm

Efficient incremental training using a novel NMT-SMT hybrid framework for translation of low-resource languages.

“My Mom Is a Fighter”: A Qualitative Analysis of the Use of Combat Metaphors in ICU Clinician Notes

The Case of the Cookie Jar: Differences in Typical Language Use in Dementia.

Applications of NLP to Human Resource Management: From Word Dictionaries to Large Language Models

The economics of L2 English. Evidence from 2.0 mln subjects suggests an economics of language framework to account for country differences in L2 English proficiency

Analysis of the presentation of compound words in Korean dictionaries - Focusing on ""X+2l"" compound words from Pyojun Korean Dictionary

Інноваційні процеси творення аксіологійних значень в українській лінгвокультурі: проєкція на антропоніми та їхні похід

Anatomy of sovereign yield behaviour using textual news

Pathos in Natural Language Argumentation: Emotional Appeals and Reactions

Narrative Emotions and Market Crises

Cultural values and the P-O fit: comparative NLP analysis of German online job advertisements

Definitions of Suffixed Loanwords in Dictionaries of Foreign Words in Slovak

Association Between Young Chinese Children’s Early Writing Skills and the Chinese Preschool Classroom Writing Environment

Types of Chinese loanwords and their ways of translation into Russian

A Study on the Semantics of the Sichuan Dialect Word Ba Based on the Cognitive Perspective

Some Missionary-Influenced Early Borrowed Words and Names in Xhosa

ON THE LEXICAL COMPOSITION OF KAZAKH WORDS FROM THE DICTIONARY OF V.V. RADLOV
