General Language Research Articles

Automatic radiology report generation is a task that combines artificial intelligence and medical information processing, and it fully relies on computer vision and natural language processing techniques. Nowadays, automatic radiology report generation is still a very challenging task because it requires semantically adequate alignment of data from two modalities: radiology images and text. Existing approaches tend to focus on coarse-grained alignment at the global level and do not take into account the disease characteristics of radiology images at fine-grained semantics, which results in the generated reports potentially omitting key disease diagnostic descriptions. In this work, we propose a new approach, disease-knowledge-enhanced fine-grained image–text alignment for automatic radiology report generation (DKA-RG). The method combines global and disease-level alignment, thus facilitating the extraction of fine-grained disease features by the model. Our approach also introduces a knowledge graph to inject medical domain expertise into the model. Our proposed DKA-RG consists of two training steps: the image–report alignment stage and the image-to-report generation stage. In the alignment stage, we use global contrastive learning to align images and texts from a high level and also augment disease contrastive learning with medical knowledge to enhance the disease detection capability. In the report generation stage, the report text generated from the images is more accurate in describing the disease information thanks to sufficient alignment. Through extensive quantitative and qualitative experiments on two widely used datasets, we validate the effectiveness of our DKA-RG on the task of radiology report generation. Our DKA-RG achieves superior performance on multiple types of metrics (natural language generation and clinical efficacy metrics) compared to existing methods, demonstrating that the method can improve the reliability and accuracy of automatic radiology report generation systems.

Medication errors, which could often be detected in advance, are a significant cause of patient deaths each year, highlighting the critical importance of medication safety. The rapid advancement of data analysis technologies has made intelligent medication assistance applications possible, and these applications rely heavily on medical knowledge graphs. However, current knowledge graph construction techniques are predominantly focused on general domains, leaving a gap in specialized fields, particularly in the medical domain for medication assistance. The specialized nature of medical knowledge and the distinct distribution of vocabulary between general and biomedical texts pose challenges. Applying general natural language processing techniques directly to the medical domain often results in lower accuracy due to the inadequate utilization of contextual semantics and entity information. To address these issues and enhance knowledge graph production, this paper proposes an optimized model for named entity recognition and relationship extraction in the Chinese medical domain. Key innovations include utilizing Medical Bidirectional Encoder Representations from Transformers (MCBERT) for character-level embeddings pre-trained on Chinese biomedical corpora, employing Bi-directional Gated Recurrent Unit (BiGRU) networks for extracting enriched contextual features, integrating a Conditional Random Field (CRF) layer for optimal label sequence output, using the Piecewise Convolutional Neural Network (PCNN) to capture comprehensive semantic information and fusing it with entity features for better classification accuracy, and implementing a microservices architecture for the medication assistance review system. These enhancements significantly improve the accuracy of entity relationship classification in Chinese medical texts. The model achieved good performance in recognizing most entity types, with an accuracy of 88.3%, a recall rate of 85.8%, and an F1 score of 87.0%. In the relationship extraction stage, the accuracy reached 85.7%, the recall rate 82.5%, and the F1 score 84.0%.

General Language Research Articles

Related Topics

Articles published on General Language

DKA-RG: Disease-Knowledge-Enhanced Fine-Grained Image–Text Alignment for Automatic Radiology Report Generation

Enhancing AI Responses in Chemistry: Integrating Text Generation, Image Creation, and Image Interpretation through Different Levels of Prompts

Even laypeople use legalese

Large language models (LLMs): survey, technical frameworks, and future challenges

Navigating the Ethical Landscape of ChatGPT Integration in Scientific Research: Review of Challenges and Recommendations

Comprehensive Review on Natural Language Generation for Automated Report Writing in Finance

Comparative analysis of paraphrasing performance of ChatGPT, GPT‐3, and T5 language models using a new ChatGPT generated dataset: ParaGPT

Generative artificial intelligence: a systematic review and applications

Bias and Fairness in Large Language Models: A Survey

Clinical Applications of Generative Artificial Intelligence in Radiology: Image Translation, Synthesis and Text Generation

Utilizing language models for advanced electrocardiogram analysis

University policies on generative AI in Asia: promising practices, gaps, and future directions

A Novel Rational Medicine Use System Based on Domain Knowledge Graph

Utilizing large language models in infectious disease transmission modelling for public health preparedness

DialogueNeRF: towards realistic avatar face-to-face conversation video generation

Knowledge-aware audio-grounded generative slot filling for limited annotated data

Crystal Composition Transformer: Self-Learning Neural Language Model for Generative and Tinkering Design of Materials.

Questions Analysis of the General English language Exam for the Sixth Vocational Grade in Light of Bloom’s Taxonomy

Components of Mathematical Language and Mathematical Ability in 5-Year-Old Dual Language Learners

Easy-read and large language models: on the ethical dimensions of LLM-based text simplification

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

General Language Research Articles

Related Topics

Articles published on General Language

DKA-RG: Disease-Knowledge-Enhanced Fine-Grained Image–Text Alignment for Automatic Radiology Report Generation

Enhancing AI Responses in Chemistry: Integrating Text Generation, Image Creation, and Image Interpretation through Different Levels of Prompts

Even laypeople use legalese

Large language models (LLMs): survey, technical frameworks, and future challenges

Navigating the Ethical Landscape of ChatGPT Integration in Scientific Research: Review of Challenges and Recommendations

Comprehensive Review on Natural Language Generation for Automated Report Writing in Finance

Comparative analysis of paraphrasing performance of ChatGPT, GPT‐3, and T5 language models using a new ChatGPT generated dataset: ParaGPT

Generative artificial intelligence: a systematic review and applications

Bias and Fairness in Large Language Models: A Survey

Clinical Applications of Generative Artificial Intelligence in Radiology: Image Translation, Synthesis and Text Generation

Utilizing language models for advanced electrocardiogram analysis

University policies on generative AI in Asia: promising practices, gaps, and future directions

A Novel Rational Medicine Use System Based on Domain Knowledge Graph

Utilizing large language models in infectious disease transmission modelling for public health preparedness

DialogueNeRF: towards realistic avatar face-to-face conversation video generation

Knowledge-aware audio-grounded generative slot filling for limited annotated data

Crystal Composition Transformer: Self-Learning Neural Language Model for Generative and Tinkering Design of Materials.

Questions Analysis of the General English language Exam for the Sixth Vocational Grade in Light of Bloom’s Taxonomy

Components of Mathematical Language and Mathematical Ability in 5-Year-Old Dual Language Learners

Easy-read and large language models: on the ethical dimensions of LLM-based text simplification