Natural Language Processing for Automated Classification of Qualitative Data From Interviews of Patients With Cancer

Chao Fang,Natasha Markuzon,Nikunj Patel,Juan-David Rueda

doi:10.1016/j.jval.2022.06.004

Abstract

ObjectivesThis study sought to explore the use of novel natural language processing (NLP) methods for classifying unstructured, qualitative textual data from interviews of patients with cancer to identify patient-reported symptoms and impacts on quality of life. MethodsWe tested the ability of 4 NLP models to accurately classify text from interview transcripts as “symptom,” “quality of life impact,” and “other.” Interview data sets from patients with hepatocellular carcinoma (HCC) (n = 25), biliary tract cancer (BTC) (n = 23), and gastric cancer (n = 24) were used. Models were cross-validated with transcript subsets designated for training, validation, and testing. Multiclass classification performance of the 4 models was evaluated at paragraph and sentence level using the HCC testing data set and analyzed by the one-versus-rest technique quantified by the receiver operating characteristic area under the curve (ROC AUC) score. ResultsNLP models accurately classified multiclass text from patient interviews. The Bidirectional Encoder Representations from Transformers model generally outperformed all other models at paragraph and sentence level. The highest predictive performance of the Bidirectional Encoder Representations from Transformers model was observed using the HCC data set to train and BTC data set to test (mean ROC AUC, 0.940 [SD 0.028]), with similarly high predictive performance using balanced and imbalanced training data sets from BTC and gastric cancer populations. ConclusionsNLP models were accurate in predicting multiclass classification of text from interviews of patients with cancer, with most surpassing 0.9 ROC AUC at paragraph level. NLP may be a useful tool for scaling up processing of patient interviews in clinical studies and, thus, could serve to facilitate patient input into drug development and improving patient care.

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Value in health : the journal of the International Society for Pharmacoeconomics and Outcomes Research	Publication Date: Jul 12, 2022
Citations: 8	License type: cc-by-nc-nd

R Discovery Prime

R Discovery Prime

Natural Language Processing for Automated Classification of Qualitative Data From Interviews of Patients With Cancer

Abstract

Talk to us

Similar Papers

More From: Value in health : the journal of the International Society for Pharmacoeconomics and Outcomes Research

Lead the way for us

Similar Papers

P109 Early diagnosis of inflammatory arthritis (IA) using machine learning analysis of GP referral letters and blood tests to improve pre-hospital referral triage
Anthony Bradlow ... Eghosa Bazuaye
Rheumatology (Oxford, England) | VOL. 61
Anthony Bradlow, et. al.Anthony Bradlow ... Eghosa Bazuaye
23 Apr 2022
Rheumatology (Oxford, England) | VOL. 61

INTRAINDIVIDUAL DIFFERENCES IN LEVELS OF WRITTEN LANGUAGE
Virginia W Berninger ... Russell Bragg
Reading & Writing Quarterly | VOL. 10
Virginia W Berninger, et. al.Virginia W Berninger ... Russell Bragg
01 Jul 1994
Reading & Writing Quarterly | VOL. 10

Agile in-litero experiments

-

01 Jan 2015
01 Jan 2015

MmPose-NLP: A Natural Language Processing Approach to Precise Skeletal Pose Estimation Using mmWave Radars.
Arindam Sengupta ... Siyang Cao
IEEE transactions on neural networks | VOL. 34
Arindam Sengupta, et. al.Arindam Sengupta ... Siyang Cao
01 Nov 2023
IEEE transactions on neural networks | VOL. 34

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Natural Language Processing for Automated Classification of Qualitative Data From Interviews of Patients With Cancer

Abstract

Talk to us

Similar Papers

More From: Value in health : the journal of the International Society for Pharmacoeconomics and Outcomes Research