Abstract
Social media recently became convenient platforms for different groups with common concerns to share their experiences, including Adverse Drug Reactions (ADRs). In this paper, we propose a two stage intelligent algorithm which we call “Simi_to_Prob”, that utilizes social media forums; for ranking ADRs, and evaluating the ADRs prevalence considering different age and gender groups as its first stage. In the second stage, ADRs are predicted utilizing a different data set from the Food and Drug Administration (FDA). In particular, Natural Language Processing (NLP) is used on social media to extract ranked lists of ADRs, which are then validated using novel intrinsic evaluation methods. In the second stage, feature engineering is used to extend the input feature space, then a two stage supervised machine learning method is used to predict future ADRs incidences. Our results show correct ranked list of ADRs for three antihypertensive drugs, where high Spearman’s rank correlation coefficients (rs) of of 0.7458, 0.6678 and 0.5929 were obtained between SIDER database for drug ADRs, and our obtained lists from social media. Furthermore, Relatedness between ADRs and age and gender groups achieved high area under the ROC curve (AUC) reaching 0.959. The second stage results showed high AUCs of 0.96 and 0.99 for the prediction of future ADRs probabilities. The proposed algorithm shows that mining social media can provide reliable source of information, and additional features that can be used to boost supervised machine learning methods’ performance in different domains including Pharmacovigilance research.
Highlights
Over the last decade, social media have proliferated to be a medium of interaction with family and friends, and as convenient platforms for patients who have common diseases, to voluntarily share knowledge, discuss their diseases’ related symptoms, and drugs adverse reactions [1]
The variability of the area under the ROC curve (AUC) values refers to the extent to which a specific Adverse Drug Reactions (ADRs) similarity to an age-group is discriminative from the other two age-groups
It can be seen that almost all ADRs obtained very high AUCs with value of 1 in some cases, in predicting gender-groups, which means that the ADRs affect both genders differently, as reflected by the similarity values
Summary
Social media have proliferated to be a medium of interaction with family and friends, and as convenient platforms for patients who have common diseases, to voluntarily share knowledge, discuss their diseases’ related symptoms, and drugs adverse reactions [1]. This has led to the generation of large amount of information that have been progressively utilized in medical, Pharmacovigilance and other domains [1,2,3,4]. Most of those approaches either only detect or extract ADRs from manually labelled text
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.