Drawing inspiration from computer vision, contrastive learning has emerged as a leading paradigm for unsupervised sentence embedding, and most recent sentence embedding methods built on it achieve promising results. However, previous methods construct positive and negative pairs through traditional data augmentation, which cannot balance the loss of semantics against the additional features introduced by aggressive augmentation. In this study, we propose a prototype contrastive learning framework based on a distribution divergence minimization loss and introduce the concept of strong semantic prototypes. Specifically, with the aid of prompts, our method constructs three semantic prototypes for each instance: a basic semantic prototype, a strong semantic prototype, and a negative prototype. By combining the InfoNCE loss with the distribution divergence minimization loss, Contrastive Learning of Sentence Embedding with Strong Semantic Prototypes (CLSESSP) forms an additional optimization objective that integrates strong augmentation. Experiments on the semantic textual similarity (STS) datasets show that CLSESSP surpasses the strong baseline by an average of 2.9 points with the BERT-base model, and extensive results on transfer and clustering tasks further demonstrate its effectiveness against strong baselines. The code is available at https://github.com/KCshen1125/CLSESSP.
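To make the combined objective concrete, below is a minimal PyTorch sketch of how an InfoNCE term might be paired with a distribution divergence (here KL) term, as the abstract describes. All names (`anchor`, `basic_proto`, `strong_proto`, `candidates`, `lam`, `temperature`) and the specific form of the divergence are illustrative assumptions, not the paper's actual implementation; see the released code for the authors' definitions.

```python
# Hypothetical sketch: InfoNCE loss plus a distribution-divergence (KL) term.
# Tensor names and the lambda weighting are assumptions for illustration only.
import torch
import torch.nn.functional as F


def info_nce(anchor: torch.Tensor, positive: torch.Tensor, temperature: float = 0.05) -> torch.Tensor:
    """Standard InfoNCE with in-batch negatives and cosine similarity."""
    anchor = F.normalize(anchor, dim=-1)
    positive = F.normalize(positive, dim=-1)
    logits = anchor @ positive.t() / temperature              # (B, B) similarity matrix
    labels = torch.arange(anchor.size(0), device=anchor.device)
    return F.cross_entropy(logits, labels)


def distribution_divergence(anchor: torch.Tensor, strong_proto: torch.Tensor,
                            candidates: torch.Tensor, temperature: float = 0.05) -> torch.Tensor:
    """KL divergence between the anchor's and the strong prototype's similarity
    distributions over a shared candidate set (one plausible reading of
    'distribution divergence minimization')."""
    cand = F.normalize(candidates, dim=-1)
    log_p = F.log_softmax(F.normalize(anchor, dim=-1) @ cand.t() / temperature, dim=-1)
    q = F.softmax(F.normalize(strong_proto, dim=-1) @ cand.t() / temperature, dim=-1)
    return F.kl_div(log_p, q, reduction="batchmean")


def total_loss(anchor, basic_proto, strong_proto, candidates, lam: float = 0.1):
    # Weighted sum of the two objectives; `lam` is an assumed trade-off weight.
    return info_nce(anchor, basic_proto) + lam * distribution_divergence(anchor, strong_proto, candidates)
```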