Medical Research Articles

Background The launch of ChatGPT (OpenAI) in November 2022 attracted public attention and academic interest to large language models (LLMs), facilitating the emergence of many other innovative LLMs. These LLMs have been applied in various fields, including health care. Numerous studies have since been conducted regarding how to use state-of-the-art LLMs in health-related scenarios. Objective This review aims to summarize applications of and concerns regarding conversational LLMs in health care and provide an agenda for future research in this field. Methods We used PubMed, ACM, and the IEEE digital libraries as primary sources for this review. We followed the guidance of PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) to screen and select peer-reviewed research articles that (1) were related to health care applications and conversational LLMs and (2) were published before September 1, 2023, the date when we started paper collection. We investigated these papers and classified them according to their applications and concerns. Results Our search initially identified 820 papers according to targeted keywords, out of which 65 (7.9%) papers met our criteria and were included in the review. The most popular conversational LLM was ChatGPT (60/65, 92% of papers), followed by Bard (Google LLC; 1/65, 2% of papers), LLaMA (Meta; 1/65, 2% of papers), and other LLMs (6/65, 9% papers). These papers were classified into four categories of applications: (1) summarization, (2) medical knowledge inquiry, (3) prediction (eg, diagnosis, treatment recommendation, and drug synergy), and (4) administration (eg, documentation and information collection), and four categories of concerns: (1) reliability (eg, training data quality, accuracy, interpretability, and consistency in responses), (2) bias, (3) privacy, and (4) public acceptability. There were 49 (75%) papers using LLMs for either summarization or medical knowledge inquiry, or both, and there are 58 (89%) papers expressing concerns about either reliability or bias, or both. We found that conversational LLMs exhibited promising results in summarization and providing general medical knowledge to patients with a relatively high accuracy. However, conversational LLMs such as ChatGPT are not always able to provide reliable answers to complex health-related tasks (eg, diagnosis) that require specialized domain expertise. While bias or privacy issues are often noted as concerns, no experiments in our reviewed papers thoughtfully examined how conversational LLMs lead to these issues in health care research. Conclusions Future studies should focus on improving the reliability of LLM applications in complex health-related tasks, as well as investigating the mechanisms of how LLM applications bring bias and privacy issues. Considering the vast accessibility of LLMs, legal, social, and technical efforts are all needed to address concerns about LLMs to promote, improve, and regularize the application of LLMs in health care.

Social media platforms have transformed the dissemination of health information, allowing for rapid and widespread sharing of content. However, alongside valuable medical knowledge, these platforms have also become channels for the spread of health misinformation, including false claims and misleading advice, which can lead to significant public health risks. Susceptibility to health misinformation varies and is influenced by individuals' cultural, social, and personal backgrounds, further complicating efforts to combat its spread. This study aimed to examine the extent to which individuals report encountering health-related misinformation on social media and to assess how racial, ethnic, and sociodemographic factors influence susceptibility to such misinformation. Data from the Health Information National Trends Survey (HINTS; Cycle 6), conducted by the National Cancer Institute with 5041 US adults between March and November 2022, was used to explore associations between racial and sociodemographic factors (age, gender, race/ethnicity, annual household income, marital status, and location) and susceptibility variables, including encounters with misleading health information on social media, difficulty in assessing information truthfulness, discussions with health providers, and making health decisions based on such information. Over 35.61% (1740/4959) of respondents reported encountering "a lot" of misleading health information on social media, with an additional 45% (2256/4959) reporting seeing "some" amount of health misinformation. Racial disparities were evident in comparison with Whites, with non-Hispanic Black (odds ratio [OR] 0.45, 95% CI 0.33-0.6, P<.01) and Hispanic (OR 0.54, 95% CI 0.41-0.71, P<.01) individuals reporting lower odds of finding deceptive information, while Hispanic (OR 1.68, 95% CI 1.48-1.98, P<.05) and non-Hispanic Asian (OR 1.96, 95% CI 1.21-3.18, P<.01) individuals exhibited higher odds in having difficulties in assessing the veracity of health information found on social media. Hispanic and Asian individuals were more likely to discuss with providers and make health decisions based on social media information. Older adults aged ≥75 years exhibited challenges in assessing health information on social media (OR 0.63, 95% CI 0.43-0.93, P<.01), while younger adults (18-34) showed increased vulnerability to health misinformation. In addition, income levels were linked to higher exposure to health misinformation on social media: individuals with annual household incomes between US $50,000 and US $75,000 (OR 1.74, 95% CI 1.14-2.68, P<.01), and greater than US $75,000 (OR 1.78, 95% CI 1.20-2.66, P<.01) exhibited greater odds, revealing complexities in decision-making and information access. This study highlights the pervasive presence of health misinformation on social media, revealing vulnerabilities across racial, age, and income groups, underscoring the need for tailored interventions.

Medical Research Articles

Related Topics

Articles published on Medical

Applications and Concerns of ChatGPT and Other Conversational Large Language Models in Health Care: Systematic Review

Addressing diagnostic delays in inflammatory bowel diseases in Germany

A Comparative, Individual Values-Based Scoring Approach to the Secure Flourish Index Among Clinical Health Professions Students

Conjugation mediates large-scale chromosomal transfer in Streptomyces driving diversification of antibiotic biosynthetic gene clusters.

Study protocol for a multicenter randomized controlled trial on simulation-based communication training for pediatric cardiology trainees (SIMUL-CHD).

Voice or text? The role of physician media choice on patient experience in online medical communities

Conflicting interpretations and FDA reputation: the case of post-market surveillance of breast implants

Chemical analysis and concentrations of cannabidiol substances used for refractory epilepsy in Chilean patients. An underestimated worldwide risk.

The impact of using bee pollen in poultry systems

Evaluations of State Medical Cannabis Programs in the US: A Narrative Review

Lifetime prevalence of questionable health behaviors and their psychological roots: A preregistered nationally representative survey.

Medical Lysenkoism.

Nutri-Score : what can we tell patients about this current scientific topic?

PFAS and 5G: what can we tell patients about these two current scientific topics?

Racial and Demographic Disparities in Susceptibility to Health Misinformation on Social Media: National Survey-Based Analysis.

Atrial fibrillation – a comparative review of one of the most common arrhythmias in dogs and humans

Herbal medicine use in patients seeking treatment in emergency departments

CRISPR Technology Acts as a Dual-Purpose Tool in Pig Breeding: Enhancing Both Agricultural Productivity and Biomedical Applications

A critical comparative study of the performance of three AI-assisted programs for bone age determination.

Economic analysis of implementing Systemic Coronary Risk Estimation (SCORE2) scale and long-term consequences

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Medical Research Articles

Related Topics

Articles published on Medical

Applications and Concerns of ChatGPT and Other Conversational Large Language Models in Health Care: Systematic Review

Addressing diagnostic delays in inflammatory bowel diseases in Germany

A Comparative, Individual Values-Based Scoring Approach to the Secure Flourish Index Among Clinical Health Professions Students

Conjugation mediates large-scale chromosomal transfer in Streptomyces driving diversification of antibiotic biosynthetic gene clusters.

Study protocol for a multicenter randomized controlled trial on simulation-based communication training for pediatric cardiology trainees (SIMUL-CHD).

Voice or text? The role of physician media choice on patient experience in online medical communities

Conflicting interpretations and FDA reputation: the case of post-market surveillance of breast implants

Chemical analysis and concentrations of cannabidiol substances used for refractory epilepsy in Chilean patients. An underestimated worldwide risk.

The impact of using bee pollen in poultry systems

Evaluations of State Medical Cannabis Programs in the US: A Narrative Review

Lifetime prevalence of questionable health behaviors and their psychological roots: A preregistered nationally representative survey.

Medical Lysenkoism.

Nutri-Score : what can we tell patients about this current scientific topic?

PFAS and 5G: what can we tell patients about these two current scientific topics?

Racial and Demographic Disparities in Susceptibility to Health Misinformation on Social Media: National Survey-Based Analysis.

Atrial fibrillation – a comparative review of one of the most common arrhythmias in dogs and humans

Herbal medicine use in patients seeking treatment in emergency departments

CRISPR Technology Acts as a Dual-Purpose Tool in Pig Breeding: Enhancing Both Agricultural Productivity and Biomedical Applications

A critical comparative study of the performance of three AI-assisted programs for bone age determination.

Economic analysis of implementing Systemic Coronary Risk Estimation (SCORE2) scale and long-term consequences