Lexical Model Research Articles

Introduction: Clinical trials (CTs) often fail due to inadequate patient recruitment. Finding eligible patients involves comparing the patient’s information with the CT eligibility criteria. Automated patient matching promises to improve this process, yet the main difficulties of CT retrieval lie in the semantic complexity of matching unstructured patient descriptions with semi-structured, multi-field CT documents and in capturing the meaning of negation in the eligibility criteria. Objectives: This paper tackles the challenges of CT retrieval by presenting an approach to the patient-to-trials paradigm. Our approach involves two key components in a pipeline-based model: (i) a data enrichment technique that enhances both queries and documents during the first retrieval stage, and (ii) a novel re-ranking schema that uses a Transformer network in a setup adapted to this task by leveraging the structure of the CT documents. Methods: We use named entity recognition and negation detection in both the patient description and the eligibility section of CTs. We further classify patient descriptions and CT eligibility criteria into current, past, and family medical conditions. This extracted information is used to boost the importance of disease and drug mentions in both query and index for lexical retrieval. Furthermore, we propose a two-step training schema for the Transformer network used to re-rank the results from the lexical retrieval. The first step focuses on matching patient information with the descriptive sections of trials, while the second step aims to determine eligibility by matching patient information with the criteria section. Results: Our findings indicate that the inclusion criteria section of the CT has a great influence on the relevance score in lexical models, and that the enrichment techniques for queries and documents improve the retrieval of relevant trials. The re-ranking strategy, based on our training schema, consistently enhances CT retrieval and improves precision in retrieving eligible trials by 15%. Conclusion: The results of our experiments suggest the benefit of making use of extracted entities. Moreover, our proposed re-ranking schema shows promising effectiveness compared to larger neural models, even with limited training data. These findings offer valuable insights for improving methods for retrieval of clinical documents.
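
As a rough illustration of the enrichment idea described in the abstract above, the following sketch boosts query terms tagged as disease or drug entities when computing a lexical relevance score. BM25 is used here only as a stand-in lexical model. The function `bm25_scores`, the `boosts` mapping, the boost value 2.0, and the toy trial texts are assumptions made for illustration and are not taken from the paper, which also enriches the index side and accounts for negation.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase whitespace tokenizer (a deliberate simplification)."""
    return text.lower().split()


def bm25_scores(query_terms, docs, boosts=None, k1=1.2, b=0.75):
    """Score every document against the query with BM25.

    boosts maps a term to a multiplier (hypothetical: terms that an
    entity extractor tagged as a disease or drug mention).
    """
    boosts = boosts or {}
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)
    avg_len = sum(len(toks) for toks in tokenized) / n_docs
    # Document frequency: number of documents containing each term.
    df = Counter(term for toks in tokenized for term in set(toks))

    scores = []
    for toks in tokenized:
        tf = Counter(toks)
        score = 0.0
        for term in query_terms:
            if term not in tf:
                continue
            idf = math.log(1 + (n_docs - df[term] + 0.5) / (df[term] + 0.5))
            length_norm = 1 - b + b * len(toks) / avg_len
            term_score = tf[term] * (k1 + 1) / (tf[term] + k1 * length_norm)
            # Entity terms contribute more than ordinary terms.
            score += boosts.get(term, 1.0) * idf * term_score
        scores.append(score)
    return scores


# Toy data, invented for this example.
trials = [
    "adults with asthma receiving inhaled corticosteroids",
    "adults with diabetes receiving metformin",
    "adults receiving care in hospital",
]
query = tokenize("adults with asthma")
entity_boosts = {"asthma": 2.0}  # hypothetical output of entity extraction

plain = bm25_scores(query, trials)
boosted = bm25_scores(query, trials, boosts=entity_boosts)
```

In this sketch only the trial that mentions the boosted entity gains score, so the gap between it and the other trials widens while their scores stay unchanged.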

Background: Triage of textual telemedical queries is a safety-critical task for medical service providers with limited remote health resources. The prioritization of patient queries containing medically severe text is necessary to optimize resource usage and provide care to those with time-sensitive needs. Objective: We aim to evaluate the effectiveness of transfer learning solutions on the task of telemedical triage and provide a thorough error analysis, identifying telemedical queries that challenge state-of-the-art natural language processing (NLP) systems. Additionally, we aim to provide a publicly available telemedical query data set with labels for severity classification for telemedical triage of respiratory issues. Methods: We annotated 573 medical queries from 3 online health platforms: HealthTap, HealthcareMagic, and iCliniq. We then evaluated 6 transfer learning solutions utilizing various text-embedding strategies. Specifically, we first established a baseline using a lexical classification model with term frequency–inverse document frequency (TF-IDF) features. Next, we investigated the effectiveness of global vectors for word representation (GloVe), a pretrained word-embedding method. We evaluated the performance of GloVe embeddings in the context of support vector machines (SVMs), bidirectional long short-term memory (bi-LSTM) networks, and hierarchical attention networks (HANs). Finally, we evaluated the performance of contextual text embeddings using transformer-based architectures. Specifically, we evaluated bidirectional encoder representations from transformers (BERT), Bio+Clinical-BERT, and Sentence-BERT (SBERT) on the telemedical triage task. Results: We found that a simple lexical model achieved a mean F1 score of 0.865 (SD 0.048) on the telemedical triage task. GloVe-based models using SVMs, HANs, and bi-LSTMs achieved a 0.8-, 1.5-, and 2.1-point increase in the F1 score, respectively. Transformer-based models, such as BERT, Bio+Clinical-BERT, and SBERT, achieved a mean F1 score of 0.914 (SD 0.034), 0.904 (SD 0.041), and 0.917 (SD 0.037), respectively. The highest-performing model, SBERT, provided a statistically significant improvement compared to all GloVe-based and lexical baselines. However, no statistically significant difference was found among the transformer-based models. Furthermore, our error analysis revealed highly challenging query types, including those with complex negations, temporal relationships, and patient intents. Conclusions: We showed that state-of-the-art transfer learning techniques work well on the telemedical triage task, providing a significant performance increase over lexical models. Additionally, we released a public telemedical triage data set using labeled questions from online medical question-and-answer (Q&A) platforms. Our analysis highlights various avenues for future work that explicitly models such query challenges.
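
The results above are reported as a mean F1 score with a standard deviation. A minimal sketch of how such a summary can be computed over several evaluation folds is given below. The fold data are invented for illustration, and both the binary (single positive class) F1 and the sample standard deviation are assumptions: the abstract does not say which F1 averaging or which SD estimator the authors used.

```python
from statistics import mean, stdev


def f1_score(y_true, y_pred, positive=1):
    """Binary F1: harmonic mean of precision and recall for one class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    if tp == 0:
        # No true positives: precision or recall is zero or undefined.
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Hypothetical (true labels, predicted labels) for three folds;
# 1 stands for a severe query and 0 for a non-severe one.
folds = [
    ([1, 0, 1, 1, 0], [1, 0, 1, 0, 0]),
    ([0, 1, 1, 0, 1], [0, 1, 1, 1, 1]),
    ([1, 1, 0, 0, 1], [1, 1, 0, 0, 1]),
]

fold_f1 = [f1_score(y_true, y_pred) for y_true, y_pred in folds]
mean_f1 = mean(fold_f1)
sd_f1 = stdev(fold_f1)  # sample SD (n - 1 denominator), an assumption

print(f"mean F1 = {mean_f1:.3f} (SD {sd_f1:.3f})")
```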

Articles published on Lexical Model

Robust neural tracking of linguistic speech representations using a convolutional neural network

The lexical constructional model meets syntax: guidelines of the formalized lexical-constructional model (FL_CxG)

Simplification of Arabic text: A hybrid approach integrating machine translation and transformer-based lexical model

Effective matching of patients to clinical trials using entity extraction and neural re-ranking

Lexical modeling for the development of Amharic automatic speech recognition systems

Auditory pseudoword rhyming effects in bilingual children reflect second language proficiency: An ERP study

Synergetic Properties of Lexical Structures in Chinese and English

‘Pregnancy no bi disease’: Contextual beliefs in antenatal classes in selected Nigerian hospitals

Cross-script L1–L2 and L2–L1 masked translation priming and phonological priming: Evidence from unbalanced Korean–English bilinguals

Metaphors in Italian and Croatian compounds

Fostering Engineering Students’ Competences Development Through Lexical Aspect Acquisition Model

Ontology-based semantic retrieval of documents using Word2vec model

Automated MeSH term suggestion for effective query formulation in systematic reviews literature search

Effects of Native Translation Frequency and L2 Proficiency on L2 word Recognition: Evidence from Korean Speakers of English as a Foreign Language.

Split Lexical Insertion in Parasitic Gap Constructions*

Grammatical Constructions of Time and Date Nominations in the Russian and Chinese Languages

Hierarchy, Not Lexical Regularity, Modulates Low-Frequency Neural Synchrony During Language Comprehension.

Assessing receptive vocabulary using state‑of‑the‑art natural language processing techniques

CONTINUUM OF PARAPHRASING – SYNTAX SYNONYMY IN THE DICHOTOMY LANGUAGE IS SPEECH (on the material of modern French artistic prose)

Identifying the Perceived Severity of Patient-Generated Telemedical Queries Regarding COVID: Developing and Evaluating a Transfer Learning-Based Solution.
