Diagnosing rare diseases is challenging due to their low prevalence, diverse presentations, and limited recognition, often leading to diagnostic delays and errors. This study evaluates the effectiveness of multiple large language models (LLMs) in identifying rare diseases, comparing their performance with that of human physicians using real clinical cases. We analyzed 152 rare disease cases from the Chinese Medical Case Repository using four LLMs: ChatGPT-4o, Claude 3.5 Sonnet, Gemini Advanced, and Llama 3.1 405B. Overall, the LLMs performed better than human physicians, and Claude 3.5 Sonnet achieved the highest accuracy at 78.9%, significantly surpassing the accuracy of human physicians, which was 26.3%. These findings suggest that LLMs can improve rare disease diagnosis and serve as valuable tools in clinical settings, particularly in regions with limited resources. However, further validation and careful consideration of ethical and privacy issues are necessary for their effective integration into medical practice.
Source | Link
---|---
PMC | http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11959745
DOI | http://dx.doi.org/10.1186/s13023-025-03656-w