Performance of vision language models for optic disc swelling identification on fundus photographs.

Kelvin Zhenghao Li , Tuyet Thao Nguyen , Heather E Moss

Front Digit Health

Department of Ophthalmology, Stanford University, Palo Alto, CA, United States.

Published: August 2025

Introduction: Vision language models (VLMs) combine image analysis capabilities with large language models (LLMs). Because they are multimodal, VLMs offer a clinical advantage over image classification models for the diagnosis of optic disc swelling: they allow clinical context to be considered alongside the image. In this study, we compare the performance of non-specialty-trained VLMs, used with different prompts, in classifying optic disc swelling on fundus photographs.

Methods: A diagnostic test accuracy study was conducted using an open-source dataset. Five different prompts (increasing in context) were used with each of five different VLMs (Llama 3.2-vision, LLaVA-Med, LLaVA, GPT-4o, and DeepSeek-4V), resulting in 25 prompt-model pairs. The performance of the VLMs in classifying photographs with and without optic disc swelling was measured using Youden's index (YI), F1 score, and accuracy rate.
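The three metrics named in the Methods can be illustrated with a minimal sketch. It uses the standard definitions (Youden's index = sensitivity + specificity - 1; F1 = 2TP / (2TP + FP + FN); accuracy = correct / total) and treats "swollen" as the positive class. The class name `Confusion`, the function names, and the example counts are hypothetical and are not taken from the study; how invalid model responses were handled in the scoring is not stated in the abstract, so the sketch assumes only valid responses are counted.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Confusion:
    """Confusion counts with 'swollen disc' as the positive class."""
    tp: int  # swollen discs classified as swollen
    fp: int  # normal discs classified as swollen
    tn: int  # normal discs classified as normal
    fn: int  # swollen discs classified as normal


def sensitivity(c: Confusion) -> float:
    positives = c.tp + c.fn
    return c.tp / positives if positives else 0.0


def specificity(c: Confusion) -> float:
    negatives = c.tn + c.fp
    return c.tn / negatives if negatives else 0.0


def youden_index(c: Confusion) -> float:
    # 0 means no better than chance; 1 means perfect separation.
    return sensitivity(c) + specificity(c) - 1.0


def f1_score(c: Confusion) -> float:
    denom = 2 * c.tp + c.fp + c.fn
    return 2 * c.tp / denom if denom else 0.0


def accuracy(c: Confusion) -> float:
    total = c.tp + c.fp + c.tn + c.fn
    return (c.tp + c.tn) / total if total else 0.0


# Hypothetical counts, for illustration only (not study data).
example = Confusion(tp=60, fp=50, tn=150, fn=40)
print(f"YI={youden_index(example):.3f}  "
      f"F1={f1_score(example):.3f}  "
      f"accuracy={accuracy(example):.3f}")
```

Because Youden's index combines sensitivity and specificity, it does not depend on how many normal versus swollen images the dataset contains, whereas accuracy does.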

Results: A total of 779 images of normal optic discs and 295 images of swollen discs were obtained from an open-source image database. Among the 25 prompt-model pairs, valid response rates ranged from 7.8% to 100% (median 93.6%). Diagnostic performance varied widely: YI ranged from 0.00 to 0.231 (median 0.042), F1 score from 0.00 to 0.716 (median 0.401), and accuracy rate from 27.5% to 70.5% (median 58.8%). The best-performing prompt-model pair was GPT-4o with role-playing combined with Chain-of-Thought and few-shot prompting. On average, Llama 3.2-vision performed the best (average YI across prompts 0.181). There was no consistent relationship between the amount of information given in the prompt and the model performance.
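For context on the accuracy figures, the class counts reported above (779 normal, 295 swollen) fix what two trivial classifiers would score. The sketch below is an illustration only: the "always normal" and "always swollen" classifiers are hypothetical reference points, not models evaluated in the study, and the helper name `metrics` is an assumption. An always-normal classifier would reach an accuracy of about 72.5% and an always-swollen classifier about 27.5%, yet both have a Youden's index of zero.

```python
N_NORMAL = 779   # normal optic disc images (from the Results)
N_SWOLLEN = 295  # swollen optic disc images (from the Results)


def metrics(tp: int, fp: int, tn: int, fn: int) -> dict:
    """Youden's index, F1, and accuracy with 'swollen' as the positive class."""
    sens = tp / (tp + fn) if (tp + fn) else 0.0
    spec = tn / (tn + fp) if (tn + fp) else 0.0
    f1_denom = 2 * tp + fp + fn
    return {
        "youden": sens + spec - 1.0,
        "f1": 2 * tp / f1_denom if f1_denom else 0.0,
        "accuracy": (tp + tn) / (tp + fp + tn + fn),
    }


# Hypothetical classifier that labels every image as normal.
always_normal = metrics(tp=0, fp=0, tn=N_NORMAL, fn=N_SWOLLEN)

# Hypothetical classifier that labels every image as swollen.
always_swollen = metrics(tp=N_SWOLLEN, fp=N_NORMAL, tn=0, fn=0)

for name, m in [("always normal", always_normal),
                ("always swollen", always_swollen)]:
    print(f"{name}: YI={m['youden']:.3f}  "
          f"F1={m['f1']:.3f}  accuracy={m['accuracy']:.3f}")
```

This is one reason a class-balance-independent measure such as Youden's index is informative alongside accuracy on an imbalanced dataset.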

Conclusions: Non-specialty-trained VLMs could classify photographs of swollen and normal optic discs better than chance, with performance varying by model. Increasing prompt complexity did not consistently improve performance. Specialty-specific VLMs may be necessary to improve ophthalmic image analysis performance.

PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC12415036
DOI: http://dx.doi.org/10.3389/fdgth.2025.1660887

Publication Analysis

Top Keywords

optic disc

disc swelling

language models

vision language

image analysis

non-specialty-trained vlms

llama 3.2-vision

prompt-model pairs

normal optic

optic discs

Similar Publications

Recurrent idiopathic neuroretinitis: the role of optical coherence tomography in a challenging diagnosis.

BMJ Case Rep

September 2025

Ophthalmology, Federal University of Parana, Curitiba, Brazil

Ana Bárbara Dias Lopes Urzedo , Bruna Depieri Michels , Lucas Leao Santoro , Kenzo Hokazono

Neuroretinitis (NR) is characterised by optic disc oedema associated with macular exudates in a star-shaped pattern. Several aetiologies of NR have been described, with cat-scratch disease being the most common. However, despite thorough investigations, one-quarter of cases are classified as idiopathic neuroretinitis (INR), in which visual prognosis is generally good.

Diabetes and optic atrophy in a young adult: consider Wolfram syndrome.

Pract Neurol

September 2025

Neurology Department, Croydon University Hospital, London, England, UK

Angela Yan , Frederick Schon , Patrick Yu Wai Man , Arani Nitkunan

A 22-year-old woman had an 8-year history of progressive bilateral vision loss and of diabetes mellitus. Her mother had diabetes and two first cousins had severe congenital deafness. On examination, her visual acuities were 6/36 bilaterally, with absent colour vision and gross optic disc pallor.

Comment on "OCT and OCTA Features of Optic Disc Melanocytoma: PHOMS, Perfusion Deficits, and Association with Vision Loss".

Am J Ophthalmol

September 2025

Department of Ophthalmology, Kyorin University Suginami Hospital, Tokyo, Japan.

Gábor Holló , Yoshiyuki Kita

Semaglutide and Non-arteritic Anterior Ischemic Optic Neuropathy: A Systematic Review.

Cureus

August 2025

Faculty of Medicine, University of Costa Rica, San Jose, CRI.

Roberto A Hidalgo Ramos , Marcelo Ortiz , Sebastián Dufner Krieger , Daniela Secades

This systematic review examines the potential association between semaglutide, a glucagon-like peptide-1 (GLP-1) receptor agonist, and the development of non-arteritic anterior ischemic optic neuropathy (NAION). Nine studies were included, consisting of retrospective cohort analyses, case series, and pharmacovigilance reports. Findings across the literature were inconsistent, with some studies reporting an increased risk while others found no significant association.
