Reliably Filter Drug-Induced Liver Injury Literature With Natural Language Processing and Conformal Prediction.

Xianghao Zhan, Olivier Gevaert, Fanjin Wang

doi:10.1109/jbhi.2022.3193365


Open Access

https://doi.org/10.1109/jbhi.2022.3193365


Abstract

Drug-induced liver injury describes the adverse effects of drugs that damage the liver; life-threatening outcomes have been reported in severe cases. Liver toxicity is therefore an important assessment for new drug candidates. Reports of liver toxicity are documented in research papers that contain preliminary in vitro and in vivo experiments. Conventionally, data extraction from publications relies on resource-demanding manual labeling, which limits the efficiency of information extraction. Advances in natural language processing enable the automatic processing of biomedical texts. Herein, based on around 28,000 papers (titles and abstracts) provided by the Critical Assessment of Massive Data Analysis challenge, this study benchmarked model performance on filtering liver-damage-related literature. Among five text embedding techniques, the model using term frequency-inverse document frequency (TF-IDF) and logistic regression outperformed the others, with an accuracy of 0.957 on the validation set. Furthermore, an ensemble model with similar overall performance was developed by fitting a logistic regression model to the predicted probabilities given by separate models with different vectorization techniques. The ensemble model achieved an accuracy of 0.954 and an F1 score of 0.955 on the hold-out validation data in the challenge. Moreover, important words in positive/negative predictions were identified via model interpretation. Prediction reliability was quantified with conformal prediction, which gives users control over the prediction uncertainty. Overall, the ensemble model and the TF-IDF model reached satisfactory classification results and can be used by researchers to rapidly filter literature that describes events related to liver injury induced by medications.
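To illustrate how conformal prediction can give users control over prediction uncertainty, the following is a minimal sketch of split (inductive) conformal prediction for a binary classifier. It is not the authors' implementation and may differ from the variant used in the paper. It assumes a classifier (for example, TF-IDF plus logistic regression) has already produced class probabilities; the function names, the label encoding (1 = liver-injury-related, 0 = not related), and all probability values are hypothetical.

```python
def conformal_p_value(cal_scores, test_score):
    """Smoothed fraction of calibration nonconformity scores >= the test score."""
    n_at_least = sum(1 for s in cal_scores if s >= test_score)
    return (n_at_least + 1) / (len(cal_scores) + 1)


def prediction_set(cal_probs, cal_labels, test_probs, significance):
    """Return every label whose conformal p-value exceeds the significance level.

    cal_probs:   [p(class 0), p(class 1)] for each held-out calibration paper
    cal_labels:  true labels (0 or 1) for those calibration papers
    test_probs:  [p(class 0), p(class 1)] for one new paper
    """
    # Nonconformity score: 1 minus the probability assigned to the true class.
    cal_scores = [1.0 - p[y] for p, y in zip(cal_probs, cal_labels)]
    labels = []
    for label in (0, 1):
        p_val = conformal_p_value(cal_scores, 1.0 - test_probs[label])
        if p_val > significance:
            labels.append(label)
    return labels


# Hypothetical calibration probabilities and labels (illustration only).
cal_probs = [[0.9, 0.1], [0.8, 0.2], [0.3, 0.7], [0.2, 0.8], [0.6, 0.4],
             [0.1, 0.9], [0.95, 0.05], [0.4, 0.6], [0.85, 0.15]]
cal_labels = [0, 0, 1, 1, 1, 1, 0, 0, 0]

confident = [0.97, 0.03]   # classifier strongly favors class 0
ambiguous = [0.5, 0.5]     # classifier is undecided

print(prediction_set(cal_probs, cal_labels, confident, significance=0.2))   # [0]
print(prediction_set(cal_probs, cal_labels, confident, significance=0.05))  # [0, 1]
print(prediction_set(cal_probs, cal_labels, ambiguous, significance=0.2))   # [0, 1]
```

Lowering the significance level asks for a stronger guarantee, so the prediction set grows: a paper that receives a single label at 0.2 may receive both labels at 0.05. Sets containing both labels flag papers that a reader may want to check manually.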


Journal: IEEE Journal of Biomedical and Health Informatics	Publication Date: Oct 1, 2022
Citations: 12	License type: CC BY 4.0


