Free Version Research Articles

231 Background: Non-small cell lung cancer (NSCLC) targeted therapies are complex and often require more patient education than can be conveyed in a single provider visit. Because the average US reading level is 8th grade and many patients seek additional information online, it is vital to evaluate the readability of online resources. This study aims to determine the patient friendliness of information provided by AI chatbots such as ChatGPT in explaining NSCLC targeted therapies, compared with other commonly accessed sources.
Methods: This analysis included targeted therapies approved by the FDA for EGFR-mutated NSCLC: afatinib, erlotinib, osimertinib, dacomitinib, and gefitinib. Information sources included ChatGPT 4, ChatGPT 3.5, the Patient Information section of the FDA label, and the Google featured snippet. Each version of ChatGPT was queried 10 times for each therapy with the prompt “My doctor is suggesting starting therapy with (Drug Name). What can I expect with this medication?”. All responses, labels, and snippets were analyzed in Microsoft Word for word count (WC) and Flesch Reading Ease Score (FRES), which assesses readability from average sentence length and average number of syllables per word. Data analysis was performed using ANOVA in SAS.
Results: Across all therapies, mean FRES was 31.21 for ChatGPT 4, 26.69 for ChatGPT 3.5, 50.78 for FDA labels, and 33.1 for Google snippets. Mean WC was 312.54 for ChatGPT 4, 355.78 for ChatGPT 3.5, 1220 for FDA labels, and 42 for Google snippets. Significant differences in FRES were found among sources for afatinib (p-value <0.0001), erlotinib (p-value 0.0206), osimertinib (p-value <0.0001), dacomitinib (p-value 0.0125), and gefitinib (p-value <0.0001). Significant differences in WC were also found among sources for afatinib (p-value <0.0001), erlotinib (p-value <0.0001), osimertinib (p-value <0.0001), and gefitinib (p-value 0.0045). No significant difference in WC was found for dacomitinib (p-value 0.2807).
Conclusions: While the FDA label provided the most readable information, none of the sources aligned with the average 8th-grade reading level in the US, indicating a gap in accessible patient education for NSCLC targeted therapies. Notably, readability varies significantly between ChatGPT versions, which affects accessibility for users of the free version, ChatGPT 3.5. Although ChatGPT responses are more succinct than the verbose FDA labels, the higher education level needed to read them presents a barrier to comprehension. This study highlights the need for further research into optimizing AI-generated patient education materials to be both accurate and comprehensible at appropriate reading levels, thereby improving health communication strategies.
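
For readers unfamiliar with the metric, the Flesch Reading Ease Score is computed as 206.835 − 1.015 × (words per sentence) − 84.6 × (syllables per word), with higher scores meaning easier text. On the commonly cited Flesch scale, scores of roughly 60 to 70 correspond to 8th- to 9th-grade text. The following is a minimal Python sketch of that formula, not the study's method: the study used Microsoft Word, whose sentence and syllable counting is not described in the abstract. The function names `count_syllables` and `flesch_reading_ease` and the vowel-group syllable heuristic are assumptions made for illustration, so scores will differ somewhat from Word's.

```python
import re


def count_syllables(word: str) -> int:
    """Estimate syllables by counting vowel groups (a rough heuristic, an assumption)."""
    word = word.lower()
    count = len(re.findall(r"[aeiouy]+", word))
    # Treat a trailing 'e' as silent, except in '-le' endings such as 'table'.
    if word.endswith("e") and not word.endswith("le") and count > 1:
        count -= 1
    return max(count, 1)


def flesch_reading_ease(text: str) -> float:
    """Flesch Reading Ease: 206.835 - 1.015 * (words/sentence) - 84.6 * (syllables/word)."""
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z']+", text)
    if not sentences or not words:
        raise ValueError("text must contain at least one sentence and one word")
    syllables = sum(count_syllables(w) for w in words)
    words_per_sentence = len(words) / len(sentences)
    syllables_per_word = syllables / len(words)
    return 206.835 - 1.015 * words_per_sentence - 84.6 * syllables_per_word


if __name__ == "__main__":
    plain = "The cat sat on the mat."
    dense = ("Osimertinib irreversibly inhibits epidermal growth factor "
             "receptor tyrosine kinase activity.")
    print(f"plain: {flesch_reading_ease(plain):.1f}")
    print(f"dense: {flesch_reading_ease(dense):.1f}")
```

Short sentences made of short words score high, while long clinical terms pull the score down sharply, which is the effect the study measures across sources.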

Digital health care apps, including digital therapeutics, have the potential to increase accessibility and improve patient engagement by overcoming the limitations of traditional facility-based medical treatments. However, there are currently no established tools for quantitatively measuring long-term engagement. This study aimed to evaluate an existing engagement index (EI) in a commercial health management app for long-term use and compare it with a newly developed EI.
Participants were recruited from cancer survivors enrolled in a randomized controlled trial that evaluated the impact of mobile health apps on recovery. Of these patients, 240 were included in the study and randomly assigned to the Noom app (Noom Inc). The newly developed EI was compared with the existing EI, and a long-term use analysis was conducted. Furthermore, the new EI was evaluated based on adapted measurements from the Web Matrix Visitor Index, focusing on click depth, recency, and loyalty indices.
The newly developed EI model outperformed the existing EI model in predicting the EI of the 6- to 9-month period from the EI of the 3- to 6-month period. The existing model had a mean squared error of 0.096, a root mean squared error of 0.310, and an R² of 0.053. The newly developed EI models showed improved performance, with the best one achieving a mean squared error of 0.025, a root mean squared error of 0.157, and an R² of 0.610. The existing EI exhibited significant associations: the click depth index (hazard ratio [HR] 0.49, 95% CI 0.29-0.84; P<.001) and loyalty index (HR 0.17, 95% CI 0.09-0.31; P<.001) were significantly associated with improved survival, whereas the recency index exhibited no significant association (HR 1.30, 95% CI 0.70-2.42; P=.41). Among the new EI models, the EI built from the combination of menus available in the app's free version yielded the most promising result. Furthermore, it exhibited significant associations with the loyalty index (HR 0.32, 95% CI 0.16-0.62; P<.001) and the recency index (HR 0.47, 95% CI 0.30-0.75; P<.001).
The newly developed EI model outperformed the existing model in predicting long-term user engagement and compliance in a mobile health app context. We emphasized the importance of log data and suggested avenues for future research to address the subjectivity of the EI and incorporate a broader range of indices for comprehensive evaluation.
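
The three fit statistics quoted above are related in a simple way: root mean squared error is the square root of mean squared error, and R² is one minus the ratio of residual to total sum of squares. The sketch below shows those standard definitions in Python. The function name `regression_metrics` and the sample values are hypothetical and are not taken from the study, whose modeling pipeline is not described in the abstract.

```python
import math
from typing import Sequence, Tuple


def regression_metrics(y_true: Sequence[float],
                       y_pred: Sequence[float]) -> Tuple[float, float, float]:
    """Return (MSE, RMSE, R^2) for paired observed and predicted values."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    n = len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))  # residual sum of squares
    mse = ss_res / n
    rmse = math.sqrt(mse)
    mean_true = sum(y_true) / n
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)  # total sum of squares
    # R^2 is undefined when the observed values have no variance.
    r2 = 1.0 - ss_res / ss_tot if ss_tot > 0 else float("nan")
    return mse, rmse, r2


if __name__ == "__main__":
    # Hypothetical engagement-index values, for illustration only.
    observed = [0.2, 0.4, 0.6, 0.8]
    predicted = [0.3, 0.4, 0.5, 0.9]
    mse, rmse, r2 = regression_metrics(observed, predicted)
    print(f"MSE={mse:.4f} RMSE={rmse:.4f} R2={r2:.3f}")
```

As a consistency check on the reported figures, the square root of 0.096 rounds to 0.310, matching the existing model's root mean squared error.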

Articles published on Free Version

Can ChatGPT Support Clinical Coding Using the ICD-10-CM/PCS?

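Analisis Pengaruh Kelengkapan Produk dan Promosi Penjualan Terhadap Keputusan Pembelian Konsumen di Transmart KarawanG [Analysis of the Influence of Product Completeness and Sales Promotion on Consumer Purchase Decisions at Transmart Karawang]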

A novel smart irrigation framework with timing allocation using solenoid valves and Arduino microcontroller

Analysis of School Financial Management in the Context of Tax Accounting: A Case Study at SMP Negeri 3 Hilisalawa'ahe

Political-organisational aspects for the prevention of genetic diseases

Patient-friendliness of ChatGPT responses: NSCLC targeted therapy.

EFFECTIVENESS OF DUOLINGO IN FOREIGN LANGUAGE LEARNING

La confección de la utopía comunitaria [The Making of the Community Utopia]

Early insights into SLA with chatGPT: Navigating CS teachers and student perspectives in an opinion-based exploration

Predicting Long-Term Engagement in mHealth Apps: Comparative Study of Engagement Indices.

Pengaruh Tinggi Bukaan Katup terhadap Torsi dan Daya pada Motor Besin 4 Langkah Berkapasitas 200CC [The Effect of Valve Lift Height on Torque and Power in a 200 cc Four-Stroke Gasoline Engine]

Fighting free with free: Freemium vs. Piracy

L’utilité du facteur d’épissage SRSF1 dans le grading des tumeurs gliales: Une méta-analyse [The Usefulness of the Splicing Factor SRSF1 in the Grading of Glial Tumors: A Meta-Analysis]

The Potential Impact of Large Language Models on Doctor-Patient Communication: A Case Study in Prostate Cancer.

Can Chat-GPT read and understand guidelines? An example using the S2k guideline intrauterine growth restriction of the German Society for Gynecology and Obstetrics

A Minkowski type inequality with free boundary in space forms

Comparative Analysis of Artificial Intelligence Chatbot Performance

IMPROVED DECISION-MAKING ALGORITHM FOR THE REDEVELOPMENT OF INDUSTRIAL AREAS

RECONSTRUCTION OF THE SYSTEM OF GREENING TERRITORIES OF THE DNIPRO AREA IN THE MINDS OF FURTHER TRANSFORMATIONS OF EXOGENIC, ENDOGENIC AND ANTHROPOGENIC FACTORS OF THE OLD DEVELOPMENT

ChatGPT sitting for FRCS Urology examination: Will artificial intelligence get certified?

