Abstract

BackgroundUse of medication can cause adverse drug reactions (ADRs), unwanted or unexpected events, which are a major safety concern. Drug labels, or prescribing information or package inserts, describe ADRs. Therefore, systematically identifying ADR information from drug labels is critical in multiple aspects; however, this task is challenging due to the nature of the natural language of drug labels.ResultsIn this paper, we present a machine learning- and rule-based system for the identification of ADR entity mentions in the text of drug labels and their normalization through the Medical Dictionary for Regulatory Activities (MedDRA) dictionary. The machine learning approach is based on a recently proposed deep learning architecture, which integrates bi-directional Long Short-Term Memory (Bi-LSTM), Convolutional Neural Network (CNN), and Conditional Random Fields (CRF) for entity recognition. The rule-based approach, used for normalizing the identified ADR mentions to MedDRA terms, is based on an extension of our in-house text-mining system, SciMiner. We evaluated our system on the Text Analysis Conference (TAC) Adverse Drug Reaction 2017 challenge test data set, consisting of 200 manually curated US FDA drug labels. Our ML-based system achieved 77.0% F1 score on the task of ADR mention recognition and 82.6% micro-averaged F1 score on the task of ADR normalization, while rule-based system achieved 67.4 and 77.6% F1 scores, respectively.ConclusionOur study demonstrates that a system composed of a deep learning architecture for entity recognition and a rule-based model for entity normalization is a promising approach for ADR extraction from drug labels.

Highlights

  • Use of medication can cause adverse drug reactions (ADRs), unwanted or unexpected events, which are a major safety concern

  • In addition to using medical reports for detecting ADRs [3], it has been proposed to use data from social media [4], since users tend to discuss their sicknesses, treatments and prescribed drugs and their effects in social media platforms. These discussions are confined to social networks dedicated to health-related issues, but they exist in generic platforms which could all be Tiftikci et al BMC Bioinformatics 2019, 20(Suppl 21):707 used for multi-corpus training to increase the accuracy of text mining systems for ADR recognition [5]

  • We investigated the integration of machine learning and dictionary/rule-based methods in identifying ADR terms from drug labels and normalizing them to Medical Dictionary for Regulatory Activities (MedDRA) preferred terms (PT)

Read more

Summary

Introduction

Use of medication can cause adverse drug reactions (ADRs), unwanted or unexpected events, which are a major safety concern. In addition to using medical reports for detecting ADRs [3], it has been proposed to use data from social media [4], since users tend to discuss their sicknesses, treatments and prescribed drugs and their effects in social media platforms. These discussions are confined to social networks dedicated to health-related issues, but they exist in generic platforms which could all be Tiftikci et al BMC Bioinformatics 2019, 20(Suppl 21):707 used for multi-corpus training to increase the accuracy of text mining systems for ADR recognition [5]. Detailed information on ADR extraction with different techniques on various data sources is available in [7, 8]

Methods
Results
Discussion
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call