Background: Several large language models (LLMs) can engage in human-level medical discussions, but their rhinoplasty knowledge has not been compared.

Objective: To compare leading LLMs in answering complex rhinoplasty consultation questions, as evaluated by plastic surgeons.

Methods: Ten open-ended rhinoplasty consultation questions were posed to four LLMs: ChatGPT-4o, Google Gemini, Claude, and Meta AI. The responses were randomized and ranked for quality by seven plastic surgeons specializing in rhinoplasty (1 = worst, 4 = best). Readability was analyzed via the Flesch Reading Ease (FRE) and Flesch-Kincaid Grade (FKG) metrics.

Results: Claude provided the top-ranked answers for seven questions, while ChatGPT provided the top-ranked answers for the remaining three. In cumulative scoring, Claude ranked highest with 224 points, followed by ChatGPT (200), Meta (138), and Gemini (138). Claude (mean score per question: 3.20 ± 1.00) significantly outperformed all other models (p < 0.05), while ChatGPT (mean score per question: 2.86 ± 0.94) outperformed Meta and Gemini; Meta and Gemini performed similarly to each other. Meta had a significantly lower FKG than Claude and ChatGPT and a significantly lower FRE than ChatGPT.

Conclusion: According to ratings by seven plastic surgeons specializing in rhinoplasty, Claude provided the best answers to a set of complex rhinoplasty consultation questions, followed by ChatGPT. Future studies are warranted to continue comparing these models as they evolve.
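For reference, the standard Flesch formulas (the abstract does not specify the exact implementation used, so these are the conventional definitions) are, with W = total words, S = total sentences, and Y = total syllables:

\[ \mathrm{FRE} = 206.835 - 1.015\,\frac{W}{S} - 84.6\,\frac{Y}{W}, \qquad \mathrm{FKG} = 0.39\,\frac{W}{S} + 11.8\,\frac{Y}{W} - 15.59. \]

Higher FRE indicates easier-to-read text, while FKG approximates the U.S. school grade level required to understand it.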