Background and Objectives: ChatGPT is a natural language processing chatbot with increasing applicability to the medical workflow. Although ChatGPT has been shown to be capable of passing the American Board of Neurological Surgery board examination, there has never been an evaluation of the chatbot in triaging and diagnosing novel neurosurgical scenarios without defined answer choices. In this study, we assess ChatGPT's capability to determine the emergent nature of neurosurgical scenarios and make diagnoses based on information one would find in a neurosurgical consult.
Methods: Thirty clinical scenarios were given to 3 attendings, 4 residents, 2 physician assistants, and 2 subinterns. Participants were asked to determine whether each scenario constituted an urgent neurosurgical consultation and what the most likely diagnosis was. The attending responses provided a consensus to use as the answer key. Generative pre-trained transformer (GPT) 3.5 and GPT 4 were given the same questions, and their responses were compared with those of the other participants.
Results: GPT 4 was 100% accurate in both diagnosis and triage of the scenarios. In triaging each situation, GPT 3.5 had an accuracy of 92.59%, slightly below that of a PGY1 (96.3%), with 88.24% sensitivity, 100% specificity, 100% positive predictive value, and 83.3% negative predictive value. When making a diagnosis, GPT 3.5 again had an accuracy of 92.59%, which was higher than the subinterns and similar to resident responders.
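For reference, the triage statistics above follow from standard confusion-matrix arithmetic. The sketch below is a minimal Python illustration; the counts used (TP = 15, FP = 0, TN = 10, FN = 2 over 27 scenarios) are back-calculated from the reported percentages and are an inference, not figures stated in the abstract.

```python
# Minimal sketch: confusion-matrix arithmetic behind the reported triage metrics.
# Counts are back-calculated from the published percentages (an assumption,
# not data taken from the paper): TP=15, FP=0, TN=10, FN=2 over 27 scenarios.

def triage_metrics(tp: int, fp: int, tn: int, fn: int) -> dict[str, float]:
    """Standard binary-classification metrics from confusion-matrix counts."""
    total = tp + fp + tn + fn
    return {
        "accuracy": (tp + tn) / total,
        "sensitivity": tp / (tp + fn),  # true positive rate
        "specificity": tn / (tn + fp),  # true negative rate
        "ppv": tp / (tp + fp),          # positive predictive value
        "npv": tn / (tn + fn),          # negative predictive value
    }

print(triage_metrics(tp=15, fp=0, tn=10, fn=2))
# accuracy 0.9259, sensitivity 0.8824, specificity 1.0, ppv 1.0, npv 0.8333
```

These counts reproduce every reported figure, including the 92.59% accuracy (25/27), which suggests the triage analysis covered 27 of the 30 scenarios.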
Conclusion: GPT 4 is able to diagnose and triage neurosurgical scenarios at the level of a senior neurosurgical resident, a clear improvement over GPT 3.5. Recent updates adding internet access and allowing users to direct ChatGPT's functionality are likely to further improve its utility in neurosurgical triage.
DOI: http://dx.doi.org/10.1227/neu.0000000000002867