Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Background: The Bosniak classification system is widely used to assess malignancy risk in renal cystic lesions, yet inter-observer variability poses significant challenges. Large language models (LLMs) may offer a standardized approach to classification when provided with textual descriptions, such as those found in radiology reports.

Objective: This study evaluated the performance of five LLMs-GPT‑4 (ChatGPT), Gemini, Copilot, Perplexity, and NotebookLM-in classifying renal cysts based on synthetic textual descriptions mimicking CT report content.

Methods: A synthetic dataset of 100 diagnostic scenarios (20 cases per Bosniak category) was constructed using established radiological criteria. Each LLM was evaluated using zero-shot and few-shot prompting strategies, while NotebookLM employed retrieval-augmented generation (RAG). Performance metrics included accuracy, sensitivity, and specificity. Statistical significance was assessed using McNemar's and chi-squared tests.

Results: GPT‑4 achieved the highest accuracy (87% zero-shot, 99% few-shot), followed by Copilot (81-86%), Gemini (55-69%), and Perplexity (43-69%). NotebookLM, tested only under RAG conditions, reached 87% accuracy. Few-shot learning significantly improved performance (p < 0.05). Classification of Bosniak IIF lesions remained challenging across models.

Conclusion: When provided with well-structured textual descriptions, LLMs can accurately classify renal cysts. Few-shot prompting significantly enhances performance. However, persistent difficulties in classifying borderline lesions such as Bosniak IIF highlight the need for further refinement and real-world validation.

Download full-text PDF

Source
http://dx.doi.org/10.1007/s00117-025-01499-xDOI Listing

Publication Analysis

Top Keywords

bosniak classification
8
renal cysts
8
large language
8
language models
8
textual descriptions
8
classification renal
4
cysts large
4
models a comparative
4
a comparative study
4
study background
4

Similar Publications

The value of contrast-enhanced ultrasound in the follow-up of Bosniak IIF cystic renal lesions.

BMC Med Imaging

August 2025

Department of Medical Imaging, Medical School, University of Pécs, Ifjúság út 13, Pécs, H-7624, Hungary.

Background: Contrast-enhanced ultrasound (CEUS) is increasingly used in the characterization of cystic renal lesions. Bosniak IIF lesions warrant follow-up, and their reported progression rate remains variable.

Methods: In this single-center retrospective study we assessed renal CEUS exams (SonoVue) with a diagnosis of Bosniak IIF lesion, conducted between 2015 and 2020.

View Article and Find Full Text PDF

Background: The Bosniak classification system is widely used to assess malignancy risk in renal cystic lesions, yet inter-observer variability poses significant challenges. Large language models (LLMs) may offer a standardized approach to classification when provided with textual descriptions, such as those found in radiology reports.

Objective: This study evaluated the performance of five LLMs-GPT‑4 (ChatGPT), Gemini, Copilot, Perplexity, and NotebookLM-in classifying renal cysts based on synthetic textual descriptions mimicking CT report content.

View Article and Find Full Text PDF

Purpose: Local thermal ablation is considered a standard treatment for small kidney masses. However, few studies have investigated the efficacy and safety of thermal ablation for cystic kidney masses.

Methods: The MEDLINE library, Cochrane, and SCOPUS databases were screened for studies investigating the efficacy of thermal ablation for cystic renal masses, comprising studies between 1995 and February 2024.

View Article and Find Full Text PDF

Background When abdominal CT scans are performed for reasons other than problems with the urinary tract, renal cysts are frequently seen. The objective of this study was to determine how common renal cysts are during CT imaging for other medical reasons in adult patients. Methods This study was conducted as a prospective cross-sectional study with 120 adults who were having abdominal CT scans for conditions such as those of the intestines, liver, or blood vessels.

View Article and Find Full Text PDF

Objectives: We aimed to compare Bosniak III and IV cystic renal masses (CRM) using Bosniak classification version 2005 (BC-v2005) versus BC-v2019 and analyze radiological findings and patterns of benignity and malignancy.

Methods: We retrospectively reviewed all Bosniak III-IV CRMs using BC-v2005 at our center with four-phase CT confirmed pathology, between January 2014 and June 2019. Two radiologists independently re-evaluated each lesion using both BCs, including findings of benignity and malignancy.

View Article and Find Full Text PDF