Entity Resolution (ER) has been investigated for decades across many domains as a fundamental task in data integration and data quality. The growing volume of heterogeneously structured, and even unstructured, data challenges traditional ER methods. This research focuses on the Data Washing Machine (DWM), developed under the NSF DART Data Life Cycle and Curation research theme, which automatically detects and corrects certain types of data quality errors and performs unsupervised entity resolution to identify duplicate records. However, the DWM relies on traditional methods driven by algorithmic pattern rules, such as Levenshtein edit distance and matrix comparators. The goal of this research is to assess replacing these rule-based methods with machine learning and deep learning methods, evaluating their effectiveness on 18 sample datasets. Of the DWM's several data quality processes, we currently focus on the scoring and linking processes. To integrate a machine learning model into the DWM, several pre-trained models were tested to find one that produces accurate vector embeddings for computing the similarity between records. After this comparison, DistilRoBERTa was chosen to generate the embeddings, and cosine similarity was used to compute the similarity scores. This allowed us to assess the machine learning model within the DWM, where it produced results close to those of the existing scoring matrix. The model performed well overall, likely because the embeddings capture the important features of each record and thereby aid the entity matching process.
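As a minimal sketch of the embedding-and-scoring step described above: the abstract does not specify the exact checkpoint, library, or match threshold, so the sentence-transformers `all-distilroberta-v1` model and the 0.8 threshold below are illustrative assumptions, not the paper's configuration.

```python
# Hedged sketch: embed record strings with a DistilRoBERTa-based model and
# score candidate duplicate pairs by cosine similarity.
# Assumptions (not from the abstract): the 'all-distilroberta-v1' checkpoint
# and the 0.8 threshold are placeholders for illustration.
from sentence_transformers import SentenceTransformer, util

def score_record_pairs(records, threshold=0.8):
    """Embed each record string and return pairs whose cosine similarity
    meets the threshold, as (index_i, index_j, score) tuples."""
    model = SentenceTransformer("all-distilroberta-v1")
    embeddings = model.encode(records, convert_to_tensor=True)
    # Pairwise cosine similarity matrix over all record embeddings.
    sims = util.cos_sim(embeddings, embeddings)
    pairs = []
    for i in range(len(records)):
        for j in range(i + 1, len(records)):
            score = float(sims[i][j])
            if score >= threshold:
                pairs.append((i, j, score))  # candidate duplicate pair
    return pairs

# Example: two near-duplicate records and one distinct record.
records = [
    "John A. Smith, 123 Main St, Little Rock AR",
    "Jon Smith, 123 Main Street, Little Rock, AR",
    "Maria Garcia, 45 Oak Ave, Fayetteville AR",
]
for i, j, score in score_record_pairs(records):
    print(records[i], "<->", records[j], f"(cosine={score:.3f})")
```

In this framing, the cosine scores play the role that rule-based comparators such as Levenshtein edit distance play in the original DWM scoring process, with downstream linking logic left unchanged.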