Critical Tool Research Articles

BackgroundChatGPT, a recently developed artificial intelligence (AI) chatbot, has demonstrated improved performance in examinations in the medical field. However, thus far, an overall evaluation of the potential of ChatGPT models (ChatGPT-3.5 and GPT-4) in a variety of national health licensing examinations is lacking. This study aimed to provide a comprehensive assessment of the ChatGPT models’ performance in national licensing examinations for medical, pharmacy, dentistry, and nursing research through a meta-analysis.MethodsFollowing the PRISMA protocol, full-text articles from MEDLINE/PubMed, EMBASE, ERIC, Cochrane Library, Web of Science, and key journals were reviewed from the time of ChatGPT’s introduction to February 27, 2024. Studies were eligible if they evaluated the performance of a ChatGPT model (ChatGPT-3.5 or GPT-4); related to national licensing examinations in the fields of medicine, pharmacy, dentistry, or nursing; involved multiple-choice questions; and provided data that enabled the calculation of effect size. Two reviewers independently completed data extraction, coding, and quality assessment. The JBI Critical Appraisal Tools were used to assess the quality of the selected articles. Overall effect size and 95% confidence intervals [CIs] were calculated using a random-effects model.ResultsA total of 23 studies were considered for this review, which evaluated the accuracy of four types of national licensing examinations. The selected articles were in the fields of medicine (n = 17), pharmacy (n = 3), nursing (n = 2), and dentistry (n = 1). They reported varying accuracy levels, ranging from 36 to 77% for ChatGPT-3.5 and 64.4–100% for GPT-4. The overall effect size for the percentage of accuracy was 70.1% (95% CI, 65–74.8%), which was statistically significant (p < 0.001). Subgroup analyses revealed that GPT-4 demonstrated significantly higher accuracy in providing correct responses than its earlier version, ChatGPT-3.5. Additionally, in the context of health licensing examinations, the ChatGPT models exhibited greater proficiency in the following order: pharmacy, medicine, dentistry, and nursing. However, the lack of a broader set of questions, including open-ended and scenario-based questions, and significant heterogeneity were limitations of this meta-analysis.ConclusionsThis study sheds light on the accuracy of ChatGPT models in four national health licensing examinations across various countries and provides a practical basis and theoretical support for future research. Further studies are needed to explore their utilization in medical and health education by including a broader and more diverse range of questions, along with more advanced versions of AI chatbots.

Read full abstract

Care robots have been proposed in response to nursing shortages in assisted living facilities (ALFs) and the growing population of older adults. While the use of care robots may improve the general health and well-being of older adults, their introduction changes the work of nursing staff fundamentally, and it has implications for the entire health care system. In developing such technology, it is important to include end users, but so far, the nursing staff's perspectives have largely been ignored. This study aims to examine the literature on nursing staff's attitudes, needs, and preferences related to the use of care robots in ALFs, in order to discover gaps in the literature and guide future research. This review follows the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) 2020 protocol. On May 12, 2023, we searched PubMed, CINAHL Plus with Full Text, PsycINFO, the IEEE Xplore Digital Library, and the ACM Digital Library using predetermined search terms. Included publications, written in English, focused on the predevelopment phase, in which information was gathered on nursing staff's attitudes, needs, and preferences regarding care robots for ALFs. Publications were excluded if they did not provide peer-reviewed empirical data. The studies' findings were summarized, coded, and analyzed into major themes using thematic analysis and narrative synthesis. Their quality was assessed using McGill University's Mixed Methods Appraisal Tool and the Joanna Briggs Institute's critical appraisal tools. The final sample included 15 studies. Most of the studies (n=11, 73%) were rated as good quality; however, there was a general lack of reporting on important methodological decisions and sample characteristics. Nursing staff desired care robots that could assist with physically demanding tasks and reduce their workload but had mixed feelings on whether robots could or should assist with social tasks. In addition, nursing staff are concerned about the ethics of care robots, as well as about their safety, accessibility, and operability. The nursing staff's culture, qualification, and role in the facility may influence their perspectives of care robots. The studies lacked theory-driven designs and large sample sizes. Eight (53%) studies mentioned using a participatory design approach, but a lack of established criteria for what constitutes participatory design leads to varying degrees of methodological quality. There was consensus among nursing staff that care robots should serve as nursing assistants to reduce workload. Whether robots could or should assist with social tasks remains a question. Further research is needed to mitigate nursing staff's concerns and understand the socioecological factors that influence their perspectives of care robots and their adoption in ALFs. In addition, theory-driven and large sample size study designs are necessary, as well as work to develop clear criteria for related participatory design research.

Read full abstract

Critical Tool Research Articles

Related Topics

Articles published on Critical Tool

The Economy of Critical Attention: A Computational Analysis of Roland Barthes’s S/Z

A Proposed Framework for Rigor and Transparency in Dysphagia Research: Prologue.

Patient-Reported Outcome Measures for Swallowing: A Proposed Checklist.

The role of cardiac acoustic biomarkers in monitoring patients with heart failure: A systematic literature review.

Two decades of stakeholder voices: Exploring engagement in Romania's FSC forest management certification

Performance of ChatGPT-3.5 and GPT-4 in national licensing examinations for medicine, pharmacy, dentistry, and nursing: a systematic review and meta-analysis

Nursing Staff's Perspectives of Care Robots for Assisted Living Facilities: Systematic Literature Review.

Dr. Dharmendra's Enduring Influence on Leprosy: From Laboratory to Disease Control.

The Role of Nurses in Delivering Bad News to the Patient and Family: An Integrated Review Study

Biopsy: The benchmark for diagnostic excellence amidst advanced technological advancements – A review

Structural Analysis for Assessing the Effectiveness of Anti-Corruption Strategies on Public Trust

Nursing Interventions in Primary Care for the Management of Maladaptive Grief: A Scoping Review.

FDA Warning Letter Trends: A 15-Year Analysis

Prevalence, trends, and distribution of hepatitis C virus among the general population in sub-Saharan Africa: A systematic review and meta-analysis.

Diabetes in axial spondyloarthritis: a systematic review and meta-analysis of observational studies

Exploring meta-heuristics for partitional clustering: methods, metrics, datasets, and challenges

Campylobacter spp. in men who have sex with men: A systematic review.

Evidence based management of medical devices: A follow-up experiment

Bayesian inference for Laplace distribution based on complete and censored samples with illustrations

Paradoxical Reaction to Tuberculosis Therapy among HIV-Negative Children: A Systematic Review and Meta-Analysis.

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Critical Tool Research Articles

Related Topics

Articles published on Critical Tool

The Economy of Critical Attention: A Computational Analysis of Roland Barthes’s S/Z

A Proposed Framework for Rigor and Transparency in Dysphagia Research: Prologue.

Patient-Reported Outcome Measures for Swallowing: A Proposed Checklist.

The role of cardiac acoustic biomarkers in monitoring patients with heart failure: A systematic literature review.

Two decades of stakeholder voices: Exploring engagement in Romania's FSC forest management certification

Performance of ChatGPT-3.5 and GPT-4 in national licensing examinations for medicine, pharmacy, dentistry, and nursing: a systematic review and meta-analysis

Nursing Staff's Perspectives of Care Robots for Assisted Living Facilities: Systematic Literature Review.

Dr. Dharmendra's Enduring Influence on Leprosy: From Laboratory to Disease Control.

The Role of Nurses in Delivering Bad News to the Patient and Family: An Integrated Review Study

Biopsy: The benchmark for diagnostic excellence amidst advanced technological advancements – A review

Structural Analysis for Assessing the Effectiveness of Anti-Corruption Strategies on Public Trust

Nursing Interventions in Primary Care for the Management of Maladaptive Grief: A Scoping Review.

FDA Warning Letter Trends: A 15-Year Analysis

Prevalence, trends, and distribution of hepatitis C virus among the general population in sub-Saharan Africa: A systematic review and meta-analysis.

Diabetes in axial spondyloarthritis: a systematic review and meta-analysis of observational studies

Exploring meta-heuristics for partitional clustering: methods, metrics, datasets, and challenges

Campylobacter spp. in men who have sex with men: A systematic review.

Evidence based management of medical devices: A follow-up experiment

Bayesian inference for Laplace distribution based on complete and censored samples with illustrations

Paradoxical Reaction to Tuberculosis Therapy among HIV-Negative Children: A Systematic Review and Meta-Analysis.