Simple Measure Of Gobbledygook Research Articles

Abstract Background Chat-based artificial intelligence (AI) web interfaces that aim to mimic human conversation have increasing utilization in healthcare to help with simple tasks such as scheduling appointments, and even more complex tasks such as providing patient educational responses to COVID-19 questions as done by the World Health Organization.1 Chat-based AI has also been shown to provide accurate responses to cardiovascular disease prevention questions.2 Its ability to provide patient education for more complex treatments like atrial fibrillation (AF) ablation has not been explored. Purpose To evaluate the quality of a popular chat-based AI program’s answers to patient questions about AF ablation. Methods Twenty commonly asked questions ("prompts") regarding AF ablation were entered into ChatGPT (Chat Generative Pre-trained Transformer), a large language model-based AI program (Fig. 1). Prompts were written in plain language; technical terms were avoided except for "radiofrequency", "cryoablation" and "pulsed field ablation" (PFA). SMOG readability calculator was used to assess responses for difficulty and grade-level, as healthcare organizations recommend ≤ 8th-grade level complexity for patient information. Response content was graded by 3 experienced cardiac electrophysiologists as "reasonable", "missing important elements/some inaccuracies" or "misleading/inappropriate". Responses are presented in mean +/- standard deviation and percentages. Results Responses averaged 118±67 words (Fig. 1). Of 20 responses, 17 (85%) were deemed reasonable, 3 (15%) missing important elements/some inaccuracies and none inappropriate or misleading; 16 (80%) emphasized discussion of issues with the healthcare team (Fig. 2). Responses missing important elements/some inaccuracies were those about risks/complications of ablation [missing phrenic nerve palsy, atrioesophageal fistula (AEF), potential need for emergent cardiac surgery or pacemaker, death], concerning symptoms post-procedure (missing symptoms of hematoma, AEF, stroke), and that PFA is not yet approved for use in all regions. Average reading grade level of responses was 13.8 (college level or "professional"): 17 (85%) responses were 12th grade level, 11 (55%) were college-level or higher, and 6 (30%) were college-graduate level ("extremely difficult"). None were ≤ 8th grade level (Fig. 2). Conclusions A majority of ChatGPT responses to common patient questions about AF ablation had reasonable content quality that frequently emphasized the importance of discussion with the healthcare team. However, responses to more difficult questions regarding risks, symptoms of potential complications, or newer technology missed important details; more than half of responses required college-level reading skills. While use of Chat-AI for patient education on EP topics appears promising, patients should be advised to use caution. Further AI training to improve content and readability should be explored.

Read full abstract

Introduction Osteoarthritis (OA) is an age-related degenerative joint disease. There is a 25% risk of symptomatic hip OA in patients who live up to 85 years of age. It can impair a person's daily activities and increase their reliance on healthcare services. It is primarily managed with education, weight loss and exercise, supplemented with pharmacological interventions. Poor health literacy is associated with negative treatment outcomes and patient dissatisfaction. A literature search found there are no previously published studies examining the readability of online information about hip OA. Objectives To assess the readability of healthcare websites regarding hip OA. Methods The terms "hip pain", "hip osteoarthritis", "hip arthritis", and "hip OA" were searched on Google and Bing. Of 240 websites initially considered, 74 unique websites underwent evaluation using the WebFX online readability software (WebFX®, Harrisburg, USA). Readability was determined using the Flesch Reading Ease Score (FRES), Flesch-Kincaid Reading Grade Level (FKGL), Gunning Fog Index (GFI), Simple Measure of Gobbledygook (SMOG), Coleman-Liau Index (CLI), and Automated Readability Index (ARI). In line with recommended guidelines and previous studies, FRES >65 or a grade level score of sixth grade and under was considered acceptable. Results The average FRES was 56.74±8.18 (range 29.5-79.4). Only nine (12.16%) websites had a FRES score >65. The average FKGL score was 7.62±1.69 (range 4.2-12.9). Only seven (9.46%) websites were written at or below a sixth-grade level according to the FKGL score. The average GFI score was 9.20±2.09 (range 5.6-16.5). Only one (1.35%) website was written at or below a sixth-grade level according to the GFI score.The average SMOG score was 7.29±1.41 (range 5.4-12.0). Only eight (10.81%) websites were written at or below a sixth-grade level according to the SMOG score. The average CLI score was 13.86±1.75 (range 9.6-19.7). All 36 websites were written above a sixth-grade level according to the CLI score. The average ARI score was 6.91±2.06 (range 3.1-14.0). Twenty-eight(37.84%) websites were written at or below a sixth-grade level according to the ARI score. One-sample t-tests showed that FRES (p<0.001, CI -10.2 to -6.37), FKGL (p<0.001, CI 1.23 to 2.01), GFI (p<0.001, CI 2.72 to 3.69), SMOG (p<0.001, CI 0.97 to 1.62), CLI (p<0.001, CI 7.46 to 8.27), and ARI (p<0.001, CI 0.43 to 1.39) scores were significantly different from the accepted standard. One-way analysis of variance (ANOVA) testing of FRES scores (p=0.009) and CLI scores (p=0.009) showed a significant difference between categories. Post hoc testing showed a significant difference between academic and non-profit categories for FRES scores (p=0.010, CI -15.17 to -1.47) and CLI scores (p=0.008, CI 0.35 to 3.29). Conclusions Most websites regarding hip OA are written above recommended reading levels, hence exceeding the comprehension levels of the average patient. Readability of these resources must be improved to improve patient access to online healthcare information which can lead to improved patient understanding of their own condition and treatment outcomes.

Read full abstract

Simple Measure Of Gobbledygook Research Articles

Related Topics

Articles published on Simple Measure Of Gobbledygook

Assessing the readability, reliability, and quality of artificial intelligence chatbot responses to the 100 most searched queries about cardiopulmonary resuscitation: An observational study.

Quality, Reliability, Readability, and Accountability of Online Information on Leukocoria.

Performance of an online chat-based artificial intelligence interface for patient education on atrial fibrillation ablation

Assessment of the Arabic patient-centered online information about orthodontic pain: A quality and readability assessment.

Readability and quality assessment of online patient education materials for spinal and epidural anesthesia.

A Cross-Sectional Analysis of the Readability of Online Information Regarding Hip Osteoarthritis.

Accuracy, readability, and understandability of large language models for prostate cancer information to the public.

Pregnancy Loss: A Comparative Analysis of Information Delivery Between ACOG FAQs and ChatGPT [ID 2683594

Evaluation of online text-based information resources of gynaecological cancer symptoms.

An Analysis of the Readability of Online Sarcoidosis Resources.

Using Large Language Models to Generate Educational Materials on Childhood Glaucoma

Readability and Quality of Online Patient Education Materials Concerning Posterior Cruciate Ligament Reconstruction.

Readability of online patient education materials for shoulder instability surgery in English and Spanish

How good is ChatGPT at answering patients’ questions related to early detection of oral (mouth) cancer?

The quality and readability of patient information provided by ChatGPT: can AI reliably explain common ENT operations?

The Emerging Role of AI in Patient Education: A Comparative Analysis of the Accuracy of Large Language Models for Pelvic Organ Prolapse

Quality, Reliability, and Readability of Online Information on Idiopathic Intracranial Hypertension.

A health literacy analysis of online patient-directed educational materials about mycobacterium avium complex

Application of generative language models to orthopaedic practice

Evaluating the Use of ChatGPT to Accurately Simplify Patient-centered Information about Breast Cancer Prevention and Screening.

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Simple Measure Of Gobbledygook Research Articles

Related Topics

Articles published on Simple Measure Of Gobbledygook

Assessing the readability, reliability, and quality of artificial intelligence chatbot responses to the 100 most searched queries about cardiopulmonary resuscitation: An observational study.

Quality, Reliability, Readability, and Accountability of Online Information on Leukocoria.

Performance of an online chat-based artificial intelligence interface for patient education on atrial fibrillation ablation

Assessment of the Arabic patient-centered online information about orthodontic pain: A quality and readability assessment.

Readability and quality assessment of online patient education materials for spinal and epidural anesthesia.

A Cross-Sectional Analysis of the Readability of Online Information Regarding Hip Osteoarthritis.

Accuracy, readability, and understandability of large language models for prostate cancer information to the public.

Pregnancy Loss: A Comparative Analysis of Information Delivery Between ACOG FAQs and ChatGPT [ID 2683594

Evaluation of online text-based information resources of gynaecological cancer symptoms.

An Analysis of the Readability of Online Sarcoidosis Resources.

Using Large Language Models to Generate Educational Materials on Childhood Glaucoma

Readability and Quality of Online Patient Education Materials Concerning Posterior Cruciate Ligament Reconstruction.

Readability of online patient education materials for shoulder instability surgery in English and Spanish

How good is ChatGPT at answering patients’ questions related to early detection of oral (mouth) cancer?

The quality and readability of patient information provided by ChatGPT: can AI reliably explain common ENT operations?

The Emerging Role of AI in Patient Education: A Comparative Analysis of the Accuracy of Large Language Models for Pelvic Organ Prolapse

Quality, Reliability, and Readability of Online Information on Idiopathic Intracranial Hypertension.

A health literacy analysis of online patient-directed educational materials about mycobacterium avium complex

Application of generative language models to orthopaedic practice

Evaluating the Use of ChatGPT to Accurately Simplify Patient-centered Information about Breast Cancer Prevention and Screening.