A new frontier in biostatistics: evaluating the accuracy of ChatGPT-4 vs. R in analysing liver resection data

Basel Jobeir, Dimitri Raptis, Abdulmajeed Alahdal, Fuat Saner, Sebastian Staubli, Dieter Broering

doi:10.52872/001c.123577

Abstract

Background: The rise of ChatGPT-4's Data Analyst tool presents a new frontier for biostatistical computation. This study evaluates the reliability of the ChatGPT-4 Data Analyst tool, and its improvement over time, by comparing it with R in biostatistical analyses of data from liver surgery patients.

Methods: Using data from LiverGroup.org, we conducted our comparative study between October 2023 and March 2024. The variables analyzed by R and the ChatGPT-4 Data Analyst were age, sex, hospital stay duration, income group, and mortality. Analyses in ChatGPT-4 were performed using two methods: a holistic prompt, in which all analyses were requested at once, and segmented prompts, in which each test was requested one by one. Figures were then requested from ChatGPT-4 and compared with those produced by R.

Results: Descriptive statistics, including N (%), standard deviation, and 25th–75th percentiles, were consistent between the March 2024 version of ChatGPT-4 and R, with a minor variation in the holistic approach in the analysis performed in October 2023. The inferential statistical results of ChatGPT-4 showed inconsistencies in October 2023, whereas in March 2024 the results were accurate for crosstabulations and for the p-values of the Kruskal-Wallis, Wilcoxon rank sum, t-test, Pearson's chi-squared, and Fisher's exact tests. The March 2024 version of ChatGPT-4 was able to inform the user of possible inaccuracies in certain results: the Mann-Whitney U test p-value for hospital stay vs. mortality, the Levene's test p-value for age vs. mortality, and the 95% CI of the Fisher's exact test odds ratio for gender vs. mortality. The survival curve and box-and-whisker plot generated by ChatGPT-4 in March 2024 matched those generated by R, except for the CI of the survival curve.

Conclusions: ChatGPT-4 now achieves high accuracy in certain biostatistical analyses, to the point that it may substitute for established statistical software such as R for some purposes. Artificial intelligence tools show significant promise, but they should still be used alongside traditional methods to ensure precision in complex analyses. The scientific community needs to reach a consensus on the use of these tools.
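One way to cross-check a p-value reported by either tool is to recompute it independently. The sketch below is not the authors' procedure: it is a minimal, standard-library Python illustration of two of the tests named in the abstract (Fisher's exact test and Pearson's chi-squared test on a 2x2 table), followed by a tolerance-based comparison of the kind a tool-versus-tool check might use. The function names, the tolerance, the counts, and the "reported" value are all hypothetical and do not come from the study data.

```python
from math import comb, erfc, isclose, sqrt


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact p-value for the 2x2 table [[a, b], [c, d]]."""
    row1, row2, col1 = a + b, c + d, a + c
    total_tables = comb(row1 + row2, col1)

    def prob(x):
        # Hypergeometric probability of seeing x in the top-left cell.
        return comb(row1, x) * comb(row2, col1 - x) / total_tables

    p_observed = prob(a)
    lowest, highest = max(0, col1 - row2), min(row1, col1)
    # Sum every table that is no more probable than the observed one.
    p_value = sum(
        prob(x)
        for x in range(lowest, highest + 1)
        if prob(x) <= p_observed * (1 + 1e-7)
    )
    return min(1.0, p_value)


def chi_squared_2x2(a, b, c, d):
    """Pearson chi-squared statistic and p-value (1 df, no continuity correction).

    Assumes every row and column total is non-zero.
    """
    n = a + b + c + d
    stat = n * (a * d - b * c) ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))
    # Upper tail of a chi-squared distribution with 1 df: P(|Z| > sqrt(stat)).
    return stat, erfc(sqrt(stat / 2))


def agree(x, y, rel_tol=1e-3):
    """Hypothetical agreement rule: two results match within a relative tolerance."""
    return isclose(x, y, rel_tol=rel_tol)


# Illustrative counts only (e.g. rows = sex, columns = died / survived).
table = (3, 1, 1, 3)
p_fisher = fisher_exact_two_sided(*table)  # equals 34/70 for this table
stat, p_chi2 = chi_squared_2x2(*table)
reported_by_tool = 0.4857  # hypothetical value copied from a tool's output
print(p_fisher, stat, p_chi2, agree(p_fisher, reported_by_tool))
```

When comparing against R, note that `chisq.test` applies Yates' continuity correction to 2x2 tables by default, so the uncorrected statistic above would only be expected to match a call made with `correct = FALSE`.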

Journal: Journal of Global Health Economics and Policy
Publication Date: Oct 6, 2024
License type: CC BY 4.0
