Semantic Expansion Research Articles

Multiple studies have investigated bibliometric factors predictive of the citation count a research article will receive. In this article, we go beyond bibliometric data by using a range of machine learning techniques to find patterns predictive of citation count using both article content and available metadata. As the input collection, we use the CORD-19 corpus, which contains research articles, mostly from biology and medicine, applicable to the COVID-19 crisis. Our study employs a combination of state-of-the-art machine learning techniques for text understanding, including the embeddings-based language model BERT and several systems for detection and semantic expansion of entities: ConceptNet, Pubtator and ScispaCy. To interpret the resulting models, we use several explanation algorithms: random forest feature importance, LIME, and Shapley values. We compare the performance and comprehensibility of models obtained by “black-box” machine learning algorithms (neural networks and random forests) with models built with rule learning (CORELS, CBA), which are intrinsically explainable. Multiple rules were discovered that referred to biomedical entities of potential interest. Of the rules with the highest lift measure, several pointed to dipeptidyl peptidase 4 (DPP4), a known MERS-CoV receptor and a critical determinant of camel-to-human transmission of the camel coronavirus (MERS-CoV). Some other interesting patterns related to the type of animal investigated were found. Articles referring to bats and camels tend to draw citations, while articles referring to most other animal species related to coronavirus are rarely cited. Bat coronavirus is the only other virus from a non-human species in the betaB clade along with the SARS-CoV and SARS-CoV-2 viruses. MERS-CoV is in a sister betaC clade, also close to human SARS coronaviruses. Thus both species linked to high citation counts harbor coronaviruses that are phylogenetically closer to human SARS viruses.
On the other hand, feline (FIPV, FCOV) and canine (CCOV) coronaviruses are in the alpha coronavirus clade and are more distant from the betaB clade with human SARS viruses. Other results include the detection of apparent citation bias favouring authors with Western-sounding names. TF-IDF weights and a binary word incidence matrix performed equally well, with the latter resulting in better interpretability. The best predictive performance was obtained with a “black-box” method, a neural network. The rule-based models yielded the most insights, especially when coupled with text representation using semantic entity detection methods. Follow-up work should focus on the analysis of citation patterns in the context of phylogenetic trees, as well as on patterns referring to DPP4, which is currently considered a SARS-CoV-2 therapeutic target.
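
The lift measure used above to rank rules can be illustrated with a minimal sketch. The function name `lift`, the toy `articles` list, and the `highly_cited` label are all hypothetical and are not taken from the study; each article is simply modelled as a set of detected entities plus an optional class label. Lift compares how often the antecedent and consequent occur together with how often they would co-occur if they were independent, so values above 1 suggest a positive association.

```python
def lift(transactions, antecedent, consequent):
    """Lift of the rule antecedent -> consequent over a list of item sets.

    lift = support(A and C) / (support(A) * support(C))
    Returns 0.0 when either side never occurs (an illustrative convention).
    """
    a, c = set(antecedent), set(consequent)
    n = len(transactions)
    n_a = sum(1 for t in transactions if a <= t)         # articles matching antecedent
    n_c = sum(1 for t in transactions if c <= t)         # articles matching consequent
    n_ac = sum(1 for t in transactions if (a | c) <= t)  # articles matching both
    if n == 0 or n_a == 0 or n_c == 0:
        return 0.0
    return (n_ac / n) / ((n_a / n) * (n_c / n))


# Toy, invented data: NOT results from the CORD-19 study.
articles = [
    {"DPP4", "camel", "highly_cited"},
    {"DPP4", "highly_cited"},
    {"feline"},
    {"canine"},
    {"bat", "highly_cited"},
    {"feline", "DPP4"},
]

dpp4_lift = lift(articles, {"DPP4"}, {"highly_cited"})
feline_lift = lift(articles, {"feline"}, {"highly_cited"})
print(f"DPP4 -> highly_cited lift: {dpp4_lift:.3f}")
print(f"feline -> highly_cited lift: {feline_lift:.3f}")
```

On this toy data the DPP4 rule has lift above 1 and the feline rule has lift 0, which mirrors only the direction of the patterns described in the abstract, not their magnitude.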

Etymology is a branch of comparative-historical linguistics that studies the origin of words; the term also covers the set of research methods aimed at clarifying a word's origin, as well as the result of that clarification. The etymology of a word is its linguistic and cultural-historical passport, its biography, which reflects the structural and semantic status of the word in the ancient period of language development and its place in the circle of related and unrelated languages. Etymology is designed to explain all the changes (or the invariance) in the form of a word and, at the same time, all the metamorphoses of its meaning in the light of the cultural and historical conditions in which a language developed; to rationally assess the potential inherent in the word for further development; and to trace the mechanisms of its semantic development. The purpose of the research is to clarify the role of etymology and axiology in semantic and etymological analysis. Semantic and etymological analysis of reference values, as information centers of the grid of linguistic meanings of the nominative system, reveals the original forms and original semantic dominants in the typological context and determines their potential for further semantic and nominative development in the languages studied. It is a kind of "bridge" to new semantic links, correlated with those already established. Typological comparison makes it possible to identify semasiological parallels (similar semantic dominants and similar semantic expansion) against an areal and genetic background. Semasiological parallels are research-motivated, because we are dealing with one concept that is expressed in languages with semasiologically common roots. Based on the analysis of the components of the onomasiological paradigm, we can distinguish two types of assessment: axiological evaluation on an objective basis and axiological evaluation on a subjective basis.
Axiological evaluation on an objective basis focuses on rational evaluation, which researchers traditionally associate with the notion of stereotype used in logical evaluation theories. In axiological evaluation on a subjective basis, the leading role belongs to the emotional component: the already evaluated phenomenon is overlaid with the actual subjective evaluation, so the already evaluated objective feature is evaluated again, subjectively. It is a blend of a feature and its evaluation, in which the additional assessment is subjective and emotional. The formation of the axiologically evaluative semantics of units of all types, in the totality of meaning and form, can be carried out on the basis of psychological associations of a figurative and non-figurative nature, both typological and specific to each language, because different peoples hold different ideas and concepts. Language is an expression not only of the linguistic thinking of an individual nation, but also of the linguistic and cultural experience inherent in humanity as a whole.

Articles published on Semantic Expansion

Identification and Classification of Depressed Mental State for End-User over Social Media.

Semantic Enrichment of Legal Terminology in the Context of Old Turkic Texts

Why was this cited? Explainable machine learning applied to COVID-19 research literature.

Should we look for Fibonacci's «golden ratio» in the processes of waste formation

Dynamic Processes in the Russian Emotive-Evaluative Vocabulary

Conceptual Boundaries of the Category ESPACE COSMIQUE

A Contrastive Study of the Semantic Expansion of Korean and Chinese Adjectives of Coldness

The Verb שָׁאַף in Biblical Hebrew

EANDC: An explainable attention network based deep adaptive clustering model for mental health treatment

Stylistic Analysis of the Signified and Expansion of Lexical Meaning in the Poem « Temps sans mémoire » by B. Zadi Zaourou

A cross‐lingual secure semantic searching scheme with semantic analysis on ciphertext

Etymology of the Word and Axiological-Evaluative Semantics

Learning Sentence-to-Hashtags Semantic Mapping for Hashtag Recommendation on Microblogs

I-Dataquest: A heterogeneous information retrieval tool using data graph for the manufacturing industry

Truth and Semantic Change in the Gospel of John

Semantic query expansion method based on pay-as-you-go fashion for graph model

The Aesthetics of the Linguistic-Figurative Sign-Symbol КРИНИЦЯ (Well) in the Poetic Continuum of Vasyl Holoborodko

A Contrast Study of the Semantic Expansion of Korean and Chinese Spatial Adjectives: Focusing on the Words “넓다/宽 (wide)” and “좁다/窄 (narrow)”

A comparative study on the formation of Sinoxenic person nouns in Korean, Chinese, Japanese, and Vietnamese

The Cognitive Map of Barbarity: Term, Notion, Innovative Essence
