Investigating deep-learning NLP for automating the extraction of oncology efficacy endpoints from scientific literature

Aline Gendrin-Brokmann, Emilie Louvet, Francisco Costa, Sashka Dimitrievska, Paul Metcalfe, Ines Smit, Leonidas Souliotis, David Milward, Eden Harrison, Julianne Noveras, Harris Vince

doi:10.1016/j.ibmed.2024.100152

Abstract

Objective: Benchmarking drug efficacy is a critical step in clinical trial design and planning. The challenge is that much of the data on efficacy endpoints is stored as free text in scientific papers, so extracting it is currently a largely manual task. Our objective is to automate this task as much as possible.

Methods: In this study we developed and optimised a framework that uses a machine learning approach to extract efficacy endpoints from the text of scientific papers.

Results: Our machine learning model predicts 25 classes associated with efficacy endpoints and achieves high F1 scores (the harmonic mean of precision and recall): 96.4% on the test set, and 93.9% and 93.7% on two case studies.

Conclusion: These methods were evaluated against subject matter experts and showed strong agreement with them. They show significant promise for automating the extraction of clinical endpoints from free text.

Significance: Clinical information extraction from text data is currently a laborious manual task that scales poorly and is prone to human error. Demonstrating that efficacy endpoints can be extracted automatically shows great promise for accelerating clinical trial design.
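The F1 score reported in the results is defined in the abstract only as the harmonic mean of precision and recall. A minimal sketch of that definition for a single class follows. The function name `f1_score` and the counts are hypothetical and are not taken from the paper, and the abstract does not say how scores were aggregated across the 25 classes.

```python
def f1_score(true_positives: int, false_positives: int, false_negatives: int) -> float:
    """Return F1, the harmonic mean of precision and recall, from raw counts."""
    predicted = true_positives + false_positives  # everything the model labelled positive
    actual = true_positives + false_negatives     # everything that is truly positive
    if predicted == 0 or actual == 0:
        return 0.0  # convention assumed here: undefined precision or recall gives 0
    precision = true_positives / predicted
    recall = true_positives / actual
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Hypothetical counts for one class (illustrative only, not from the paper).
example = f1_score(true_positives=90, false_positives=10, false_negatives=5)
print(f"F1 = {example:.3f}")
```

Because it is a harmonic mean, F1 is pulled towards the lower of precision and recall, so a high score requires both to be high.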

Journal: Intelligence-Based Medicine
Publication Date: Jan 1, 2024
License type: cc-by-nc-nd
