Evaluating the Efficacy of Large Language Models in CPT Coding for Craniofacial Surgery: A Comparative Analysis.

Emily L Isch, Advith Sarikonda, Abhijeet Sambangi, Angeleah Carreras, Adrija Sircar, D Mitchell Self, Theodore E Habarth-Morales, E J Caterson, Mario Aycart

doi:10.1097/scs.0000000000010575

Abstract

The advent of Large Language Models (LLMs) like ChatGPT has introduced significant advancements in various surgical disciplines. These developments have led to an increased interest in the utilization of LLMs for Current Procedural Terminology (CPT) coding in surgery. With CPT coding being a complex and time-consuming process, often exacerbated by the scarcity of professional coders, there is a pressing need for innovative solutions to enhance coding efficiency and accuracy. This observational study evaluated the effectiveness of 5 publicly available large language models (Perplexity.AI, Bard, BingAI, ChatGPT 3.5, and ChatGPT 4.0) in accurately identifying CPT codes for craniofacial procedures. A consistent query format was employed to test each model, ensuring the inclusion of detailed procedure components where necessary. The responses were classified as correct, partially correct, or incorrect based on their alignment with established CPT coding for the specified procedures. The results indicate that while there is no overall significant association between the type of AI model and the correctness of CPT code identification, there are notable differences in performance for simple and complex CPT codes among the models. Specifically, ChatGPT 4.0 showed higher accuracy for complex codes, whereas Perplexity.AI and Bard were more consistent with simple codes. The use of AI chatbots for CPT coding in craniofacial surgery presents a promising avenue for reducing the administrative burden and associated costs of manual coding. Despite the lower accuracy rates compared with specialized, trained algorithms, the accessibility and minimal training requirements of the AI chatbots make them attractive alternatives. The study also suggests that priming AI models with operative notes may enhance their accuracy, offering a resource-efficient strategy for improving CPT coding in clinical practice.
This study highlights the feasibility and potential benefits of integrating LLMs into the CPT coding process for craniofacial surgery. The findings advocate for further refinement and training of AI models to improve their accuracy and practicality, suggesting a future where AI-assisted coding could become a standard component of surgical workflows, aligning with the ongoing digital transformation in health care.
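
The three-way classification described in the abstract can be illustrated with a minimal sketch. The abstract does not spell out the exact grading rubric, so the rule used here (exact match of the code set is "correct", any overlap is "partially correct", no overlap is "incorrect") is an assumption, as are the function names, the model labels, and the placeholder codes, which are deliberately not real CPT codes.

```python
from collections import Counter


def grade_response(predicted, reference):
    """Grade one chatbot answer against the reference code set.

    Assumed rubric (not taken from the paper): identical sets are
    "correct", overlapping sets are "partially correct", and disjoint
    sets are "incorrect".
    """
    predicted, reference = set(predicted), set(reference)
    if predicted == reference:
        return "correct"
    if predicted & reference:
        return "partially correct"
    return "incorrect"


def tally_by_model(graded):
    """Build a model -> Counter(grade) table from (model, grade) pairs."""
    table = {}
    for model, grade in graded:
        table.setdefault(model, Counter())[grade] += 1
    return table


# Hypothetical data: placeholder strings, not real CPT codes or study results.
reference = {"A0001", "A0002"}
answers = [
    ("model_x", {"A0001", "A0002"}),  # every reference code returned
    ("model_x", {"A0001"}),           # only part of the reference set
    ("model_y", {"B9999"}),           # no overlap with the reference
]

graded = [(model, grade_response(codes, reference)) for model, codes in answers]
table = tally_by_model(graded)
```

A table of this shape (models by grade) is the kind of input a test of association between model and correctness would take; the abstract does not name the statistical test that was used.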

Published in: The Journal of Craniofacial Surgery

