98%
921
2 minutes
20
Background: The Bosniak classification system is widely used to assess malignancy risk in renal cystic lesions, yet inter-observer variability poses significant challenges. Large language models (LLMs) may offer a standardized approach to classification when provided with textual descriptions, such as those found in radiology reports.
Objective: This study evaluated the performance of five LLMs-GPT‑4 (ChatGPT), Gemini, Copilot, Perplexity, and NotebookLM-in classifying renal cysts based on synthetic textual descriptions mimicking CT report content.
Methods: A synthetic dataset of 100 diagnostic scenarios (20 cases per Bosniak category) was constructed using established radiological criteria. Each LLM was evaluated using zero-shot and few-shot prompting strategies, while NotebookLM employed retrieval-augmented generation (RAG). Performance metrics included accuracy, sensitivity, and specificity. Statistical significance was assessed using McNemar's and chi-squared tests.
Results: GPT‑4 achieved the highest accuracy (87% zero-shot, 99% few-shot), followed by Copilot (81-86%), Gemini (55-69%), and Perplexity (43-69%). NotebookLM, tested only under RAG conditions, reached 87% accuracy. Few-shot learning significantly improved performance (p < 0.05). Classification of Bosniak IIF lesions remained challenging across models.
Conclusion: When provided with well-structured textual descriptions, LLMs can accurately classify renal cysts. Few-shot prompting significantly enhances performance. However, persistent difficulties in classifying borderline lesions such as Bosniak IIF highlight the need for further refinement and real-world validation.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1007/s00117-025-01499-x | DOI Listing |
BMC Med Imaging
August 2025
Department of Medical Imaging, Medical School, University of Pécs, Ifjúság út 13, Pécs, H-7624, Hungary.
Background: Contrast-enhanced ultrasound (CEUS) is increasingly used in the characterization of cystic renal lesions. Bosniak IIF lesions warrant follow-up, and their reported progression rate remains variable.
Methods: In this single-center retrospective study we assessed renal CEUS exams (SonoVue) with a diagnosis of Bosniak IIF lesion, conducted between 2015 and 2020.
Radiologie (Heidelb)
August 2025
Department of Radiology, Recep Tayyip Erdogan University, Rize, Turkey.
Background: The Bosniak classification system is widely used to assess malignancy risk in renal cystic lesions, yet inter-observer variability poses significant challenges. Large language models (LLMs) may offer a standardized approach to classification when provided with textual descriptions, such as those found in radiology reports.
Objective: This study evaluated the performance of five LLMs-GPT‑4 (ChatGPT), Gemini, Copilot, Perplexity, and NotebookLM-in classifying renal cysts based on synthetic textual descriptions mimicking CT report content.
Diagn Interv Radiol
August 2025
University Hospital Leipzig, Institute of Laboratory Medicine, Clinical Chemistry and Molecular Diagnostics, Leipzig, Germany.
Purpose: Local thermal ablation is considered a standard treatment for small kidney masses. However, few studies have investigated the efficacy and safety of thermal ablation for cystic kidney masses.
Methods: The MEDLINE library, Cochrane, and SCOPUS databases were screened for studies investigating the efficacy of thermal ablation for cystic renal masses, comprising studies between 1995 and February 2024.
Cureus
June 2025
Medicine, Bolan Medical College, Bolan University of Medical and Health Sciences, Quetta, PAK.
Background When abdominal CT scans are performed for reasons other than problems with the urinary tract, renal cysts are frequently seen. The objective of this study was to determine how common renal cysts are during CT imaging for other medical reasons in adult patients. Methods This study was conducted as a prospective cross-sectional study with 120 adults who were having abdominal CT scans for conditions such as those of the intestines, liver, or blood vessels.
View Article and Find Full Text PDFAbdom Radiol (NY)
July 2025
Hospital Clínic de Barcelona, Barcelona, Spain.
Objectives: We aimed to compare Bosniak III and IV cystic renal masses (CRM) using Bosniak classification version 2005 (BC-v2005) versus BC-v2019 and analyze radiological findings and patterns of benignity and malignancy.
Methods: We retrospectively reviewed all Bosniak III-IV CRMs using BC-v2005 at our center with four-phase CT confirmed pathology, between January 2014 and June 2019. Two radiologists independently re-evaluated each lesion using both BCs, including findings of benignity and malignancy.