An ensemble method for extracting adverse drug events from social media

Jing Liu,Xiaodi Zhang,Songzheng Zhao

doi:10.1016/j.artmed.2016.05.004

Abstract

Because adverse drug events (ADEs) are a serious health problem and a leading cause of death, it is of vital importance to identify them correctly and in a timely manner. With the development of Web 2.0, social media has become a large data source for information on ADEs. The objective of this study is to develop a relation extraction system that uses natural language processing techniques to effectively distinguish between ADEs and non-ADEs in informal text on social media. We develop a feature-based approach that utilizes various lexical, syntactic, and semantic features. Information-gain-based feature selection is performed to address high-dimensional features. Then, we evaluate the effectiveness of four well-known kernel-based approaches (i.e., subset tree kernel, tree kernel, shortest dependency path kernel, and all-paths graph kernel) and several ensembles that are generated by adopting different combination methods (i.e., majority voting, weighted averaging, and stacked generalization). All of the approaches are tested using three data sets: two health-related discussion forums and one general social networking site (i.e., Twitter). When investigating the contribution of each feature subset, the feature-based approach attains the best area under the receiver operating characteristics curve (AUC) values, which are 78.6%, 72.2%, and 79.2% on the three data sets. When individual methods are used, we attain the best AUC values of 82.1%, 73.2%, and 77.0% using the subset tree kernel, shortest dependency path kernel, and feature-based approach on the three data sets, respectively. When using classifier ensembles, we achieve the best AUC values of 84.5%, 77.3%, and 84.5% on the three data sets, outperforming the baselines. Our experimental results indicate that ADE extraction from social media can benefit from feature selection. With respect to the effectiveness of different feature subsets, lexical features and semantic features can enhance the ADE extraction capability. Kernel-based approaches, which can stay away from the feature sparsity issue, are qualified to address the ADE extraction problem. Combining different individual classifiers using suitable combination methods can further enhance the ADE extraction effectiveness.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

An ensemble method for extracting adverse drug events from social media

Abstract

Talk to us

Similar Papers

More From: Artificial Intelligence in Medicine

Lead the way for us

Journal: Artificial Intelligence in Medicine	Publication Date: Jun 1, 2016
Citations: 47

Similar Papers

SSEL-ADE: A semi-supervised ensemble learning framework for extracting adverse drug events from social media.
Jing Liu ... Songzheng Zhao
Artificial Intelligence in Medicine | VOL. 84
Jing Liu, et. al.Jing Liu ... Songzheng Zhao
27 Oct 2017
Artificial Intelligence in Medicine | VOL. 84

Coastal upwelling, its sediment record. Part A: Response of the sedimentary regime to present coastal upwelling; Part B: Sedimentary records of Ancient Coastal Upwelling: Jo¨hn Thiede and Erwin Suess, 1983. Nato conference series, series VI: Marine sciences, Plenum Press, New York, N.Y., A: xv + 604 pp., U.S.$85.00; B: xv + 610 pp., U.S.$85.00 (hardcover)
H.-E Reineck
Earth Science Reviews | VOL. 22
H.-E ReineckH.-E Reineck
01 Sep 1985
Earth Science Reviews | VOL. 22

Concordance and predictive value of two adverse drug event data sets.
Aurel Cami ... Ben Y Reis
BMC Medical Informatics and Decision Making | VOL. 14
Aurel Cami, et. al.Aurel Cami ... Ben Y Reis
22 Aug 2014
BMC Medical Informatics and Decision Making | VOL. 14

A study of the effectiveness of machine learning methods for classification of clinical interview fragments into a large number of categories
Mehedi Hasan ... Kathryn Brogan Hartlieb
Journal of Biomedical Informatics | VOL. 62
Mehedi Hasan, et. al.Mehedi Hasan ... Kathryn Brogan Hartlieb
13 May 2016
Journal of Biomedical Informatics | VOL. 62

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

An ensemble method for extracting adverse drug events from social media

Abstract

Talk to us

Similar Papers

More From: Artificial Intelligence in Medicine