Hybrid Deep Learning for Medication-Related Information Extraction From Clinical Texts in French: MedExt Algorithm Development Study.

Jordan Jouffroy,Jordan Jouffroy,Anita Burgun,Anita Burgun,Antoine Neuraz,Antoine Neuraz,Ivan Lerner,Ivan Lerner,Sarah F Feldman,Sarah F Feldman,Bastien Rance,Bastien Rance

doi:10.2196/17934

Abstract

BackgroundInformation related to patient medication is crucial for health care; however, up to 80% of the information resides solely in unstructured text. Manual extraction is difficult and time-consuming, and there is not a lot of research on natural language processing extracting medical information from unstructured text from French corpora.ObjectiveWe aimed to develop a system to extract medication-related information from clinical text written in French.MethodsWe developed a hybrid system combining an expert rule–based system, contextual word embedding (embedding for language model) trained on clinical notes, and a deep recurrent neural network (bidirectional long short term memory–conditional random field). The task consisted of extracting drug mentions and their related information (eg, dosage, frequency, duration, route, condition). We manually annotated 320 clinical notes from a French clinical data warehouse to train and evaluate the model. We compared the performance of our approach to those of standard approaches: rule-based or machine learning only and classic word embeddings. We evaluated the models using token-level recall, precision, and F-measure.ResultsThe overall F-measure was 89.9% (precision 90.8; recall: 89.2) when combining expert rules and contextualized embeddings, compared to 88.1% (precision 89.5; recall 87.2) without expert rules or contextualized embeddings. The F-measures for each category were 95.3% for medication name, 64.4% for drug class mentions, 95.3% for dosage, 92.2% for frequency, 78.8% for duration, and 62.2% for condition of the intake.ConclusionsAssociating expert rules, deep contextualized embedding, and deep neural networks improved medication information extraction. Our results revealed a synergy when associating expert knowledge and latent knowledge.

Highlights

In 2017, medication consumption in France represented €37.8 billion in spending and 16% of the French health budget [1]
Adverse drug reactions are an important public health problem, representing a major cause of mortality (0.15% in France); one-third of admissions caused by adverse drug reactions are preventable, associated with a poorly reported drug history or rare adverse events [2,3]
Up to 80% of relevant clinical information is present solely in the form of unstructured text, which represents a major barrier to the secondary use of this type of information [5,6]

Summary

Introduction

In 2017, medication consumption in France represented €37.8 billion (approximately US $45.5 billion) in spending and 16% of the French health budget [1]. Up to 80% of relevant clinical information is present solely in the form of unstructured text, which represents a major barrier to the secondary use of this type of information [5,6]. To overcome this issue, natural language processing techniques can be used to extract, normalize, and restructure drug-related information from clinical texts [6,7] and increase the information available for research and health care. Methods: We developed a hybrid system combining an expert rule–based system, contextual word embedding (embedding for language model) trained on clinical notes, and a deep recurrent neural network (bidirectional long short term memory–conditional random field). Our results revealed a synergy when associating expert knowledge and latent knowledge

Objectives

Methods

Results

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: JMIR medical informatics	Publication Date: Mar 16, 2021
Citations: 22	License type: cc-by

R Discovery Prime

R Discovery Prime

Hybrid Deep Learning for Medication-Related Information Extraction From Clinical Texts in French: MedExt Algorithm Development Study.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: JMIR medical informatics

Lead the way for us

Similar Papers

RadioBERT: A deep learning-based system for medical report generation from chest X-ray images using contextual embeddings
Navdeep Kaur ... Ajay Mittal
Journal of Biomedical Informatics | VOL. 135
Navdeep Kaur, et. al.Navdeep Kaur ... Ajay Mittal
10 Oct 2022
Journal of Biomedical Informatics | VOL. 135

Combining Contextualized Embeddings and Prior Knowledge for Clinical Named Entity Recognition: Evaluation Study.
Min Jiang ... Todd Sanger
JMIR medical informatics | VOL. 7
Min Jiang, et. al.Min Jiang ... Todd Sanger
13 Nov 2019
JMIR medical informatics | VOL. 7

Assessing the Impact of Contextual Embeddings for Portuguese Named Entity Recognition
Joaquim Santos ... Juliano Terra
-
Joaquim Santos, et. al.Joaquim Santos ... Juliano Terra
01 Oct 2019
01 Oct 2019

Personalized Query Expansion with Contextual Word Embeddings
Elias Bassani ... Gabriella Pasi
ACM transactions on information systems | VOL. 42
Elias Bassani, et. al.Elias Bassani ... Gabriella Pasi
11 Dec 2023
ACM transactions on information systems | VOL. 42

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Hybrid Deep Learning for Medication-Related Information Extraction From Clinical Texts in French: MedExt Algorithm Development Study.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: JMIR medical informatics