The role of ChatGPT-4o in differential diagnosis and management of vertigo-related disorders.

Sci Rep

ENT Institute, Department of Otorhinolaryngology, Eye & ENT Hospital, Fudan University, Shanghai, 200031, China.

Published: May 2025


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

To compare the diagnostic accuracy of an artificial intelligence chatbot and clinical experts in vertigo-related diseases and evaluate the ability of the AI chatbot to address vertigo-related issues. 20 clinical questions about vertigo were input to ChatGPT-4o, and three otologists evaluated the responses using a 5-point Likert scale for accuracy, comprehensiveness, clarity, practicality, and credibility. Readability was assessed using Flesch Reading Ease and Flesch-Kincaid Grade Level formulas. The model and two otologists diagnosed 15 outpatient vertigo cases, and the diagnostic accuracy was calculated. The Kruskal-Wallis test, Analysis of Variance (ANOVA), and paired t-test were employed for statistical analysis. ChatGPT-4o scored highest in credibility (4.78). Repeated Measures ANOVA showed that ChatGPT's responses to the 20 questions exhibited statistically significant differences across the five scoring dimensions (F = 2.682, p = 0.038). Readability analysis showed that diagnosis-related outputs were more challenging compared to other types of content. The model's diagnostic accuracy was comparable to a clinician with one year of experience but inferior to a clinician with five years of experience, and the differences in accuracy among the three methods are statistically significant (p = 0.04). ChatGPT-4o shows promise as a supplementary tool for managing vertigo but requires improvements in readability and diagnostic capabilities.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC12119837PMC
http://dx.doi.org/10.1038/s41598-025-96309-8DOI Listing

Publication Analysis

Top Keywords

diagnostic accuracy
12
accuracy
5
role chatgpt-4o
4
chatgpt-4o differential
4
differential diagnosis
4
diagnosis management
4
management vertigo-related
4
vertigo-related disorders
4
disorders compare
4
diagnostic
4

Similar Publications

Rationale: Physicians sometimes encounter various types of gut feelings (GFs) during clinical diagnosis. The type of GF addressed in this paper refers to the intuitive sense that the generated hypothesis might be incorrect. An appropriate diagnosis cannot be obtained unless these GFs are articulated and inventive solutions are devised.

View Article and Find Full Text PDF

Autoimmune nodopathies: emerging insights and clinical implications.

Curr Opin Neurol

October 2025

Neuromuscular Diseases Unit, Department of Neurology, IR SANT PAU, Hospital de la Santa Creu i Sant Pau, CIBERER, Barcelona, Spain.

Purpose Of Review: Autoimmune nodopathies (AN) are a recognized distinct group of immune-mediated peripheral neuropathies with unique immunopathological features and therapeutic implications. This review synthesizes recent advances in their pathogenesis, diagnosis, and management, which have refined their clinical classification and informed targeted treatment strategies.

Recent Findings: AN are characterized by autoantibodies targeting surface proteins in the nodal-paranodal area (anti-contactin-1, anti-contactin-associated protein 1, anti-neurofascin-155, anti-pan-neurofascin), predominantly of IgG4 subclass.

View Article and Find Full Text PDF

Background: Prostate cancer is one of the most common malignancies in males worldwide. Serum prostate-specific antigen is a frequently employed biomarker in the diagnosis and risk stratification of prostate cancer; however, it is known for its low predictive accuracy for disease progression. New prognostic biomarkers are needed to distinguish aggressive prostate cancer from low-risk disease.

View Article and Find Full Text PDF

Objectives: Non-small cell lung cancer (NSCLC) is associated with poor prognosis, with 30% of patients diagnosed at an advanced stage. Mutations in the and genes are important prognostic factors for NSCLC, and targeted therapies can significantly improve survival in these patients. Although tissue biopsy remains the gold standard for detecting gene mutations, it has limitations, including invasiveness, sampling errors due to tumor heterogeneity, and poor reproducibility.

View Article and Find Full Text PDF

Artificial Intelligence in Contact Dermatitis: Current and Future Perspectives.

Dermatitis

September 2025

From the Department of Dermatology, Venereology and Leprology, All India Institute of Medical Sciences (AIIMS), Bhopal, India.

Contact dermatitis (CD), which includes both allergic CD and irritant CD, is a common inflammatory condition that can pose significant diagnostic challenges. Although patch testing is the gold standard for identifying causative allergens for allergic contact dermatitis (ACD), it is time-consuming, subjective, and requires expert interpretation. Recent advancements in artificial intelligence (AI), particularly in machine learning (ML) and deep learning, have shown promise in improving the accuracy, efficiency, and accessibility of CD diagnosis and management.

View Article and Find Full Text PDF