Publications by authors named "Felix Busch"

Objectives: To evaluate the potential of large language models (LLMs) to generate sequence-level brain MRI protocols.

Materials And Methods: This retrospective study employed a dataset of 150 brain MRI cases derived from local imaging request forms. Reference protocols were established by two neuroradiologists.
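
As a rough illustration of the task above (not the study's actual pipeline), the sketch below asks a general-purpose LLM to propose sequences for an indication taken from an imaging request form; the model name, prompt wording, and example indication are assumptions.

```python
# Hedged sketch: ask a general-purpose LLM to propose brain MRI sequences
# for a clinical indication from an imaging request form.
# Model name, prompt, and example indication are illustrative assumptions.
from openai import OpenAI

client = OpenAI()  # expects OPENAI_API_KEY in the environment

indication = "45-year-old with new-onset seizures, rule out structural lesion"

response = client.chat.completions.create(
    model="gpt-4o",  # placeholder model choice
    messages=[
        {
            "role": "system",
            "content": (
                "You are a neuroradiology assistant. Given a clinical "
                "indication, list the brain MRI sequences you would protocol, "
                "one per line, without further commentary."
            ),
        },
        {"role": "user", "content": indication},
    ],
)

print(response.choices[0].message.content)
```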


Purpose: To develop and validate MRSegmentator, a cross-modality deep learning model for multiorgan segmentation of MRI scans.

Materials And Methods: This retrospective study trained MRSegmentator on 1,200 manually annotated UK Biobank Dixon MRI sequences (50 participants), 221 in-house abdominal MRI sequences (177 patients), and 1,228 CT scans from the TotalSegmentator-CT dataset. A human-in-the-loop annotation workflow leveraged cross-modality transfer learning from an existing CT segmentation model to segment 40 anatomic structures.
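
As a loosely related downstream sketch (not MRSegmentator's own code or API), per-organ volumes can be derived from a multiorgan label map of the kind such a model outputs; the file name and the label-to-organ mapping below are hypothetical.

```python
# Hedged sketch: per-organ volumes from a multiorgan segmentation mask.
# Assumes a NIfTI label map where each anatomic structure has an integer label;
# the file name and the label->organ mapping are hypothetical examples.
import nibabel as nib
import numpy as np

seg = nib.load("segmentation.nii.gz")          # hypothetical output file
labels = np.asarray(seg.dataobj).astype(int)   # integer label map
voxel_volume_ml = np.prod(seg.header.get_zooms()[:3]) / 1000.0  # mm^3 -> mL

organ_names = {1: "liver", 2: "spleen", 3: "right kidney"}  # example subset

for label, name in organ_names.items():
    volume = np.count_nonzero(labels == label) * voxel_volume_ml
    print(f"{name}: {volume:.1f} mL")
```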


Recent advancements in large language models (LLMs) offer potential benefits in healthcare, particularly in processing extensive patient records. However, existing benchmarks do not fully assess LLMs' capability in handling real-world, lengthy clinical data. We present a benchmark comprising 20 detailed fictional patient cases across various diseases, with each case containing 5,090 to 6,754 words.
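
A benchmark of this kind can be scored with a simple loop over cases and questions; the sketch below assumes a made-up case layout and a stand-in `query_llm` function rather than the benchmark's actual data format or evaluation harness.

```python
# Hedged sketch: accuracy of an LLM on long fictional patient cases.
# The case/question structure and the query_llm stand-in are assumptions,
# not the benchmark's actual data format or evaluation code.
cases = [
    {
        "record": "…full multi-thousand-word fictional patient record…",
        "questions": [
            {"question": "Which chemotherapy regimen was started in 2021?",
             "options": ["A) R-CHOP", "B) FOLFOX", "C) ABVD"],
             "answer": "A"},
        ],
    },
]

def query_llm(prompt: str) -> str:
    """Stand-in for the model under test; returns the chosen option letter."""
    return "A"  # dummy answer so the sketch runs end to end

correct = total = 0
for case in cases:
    for q in case["questions"]:
        prompt = (f"{case['record']}\n\nQuestion: {q['question']}\n"
                  + "\n".join(q["options"]) + "\nAnswer with a single letter.")
        if query_llm(prompt).strip().upper().startswith(q["answer"]):
            correct += 1
        total += 1

print(f"Accuracy: {correct}/{total}")
```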


Efficient processing of radiology reports for monitoring disease progression is crucial in oncology. Although large language models (LLMs) show promise in extracting structured information from medical reports, privacy concerns limit their clinical implementation. This study evaluates the feasibility and accuracy of two of the most recent Llama models for generating structured lymphoma progression reports from cross-sectional imaging data in a privacy-preserving, real-world clinical setting.
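
In a privacy-preserving setting like the one described, a common pattern is to query a locally hosted Llama model and request structured output; the sketch below assumes a local Ollama server, and the model tag, prompt, and report text are illustrative rather than taken from the study.

```python
# Hedged sketch: structured lymphoma progression extraction with a locally
# hosted Llama model via Ollama's HTTP API (model tag, prompt, and report
# text are illustrative assumptions; no data leaves the local machine).
import requests

report = "CT chest/abdomen: previously enlarged axillary nodes have regressed..."

prompt = (
    "Extract a JSON object with the keys 'overall_response' "
    "(complete/partial/stable/progressive) and 'target_lesions' "
    "from this radiology report:\n" + report
)

resp = requests.post(
    "http://localhost:11434/api/generate",
    json={"model": "llama3", "prompt": prompt, "stream": False},
    timeout=120,
)
print(resp.json()["response"])  # in practice this output would be parsed and validated
```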


Rationale And Objectives: Large language models (LLMs) show promise for generating patient-friendly radiology reports, but the performance of open-source versus proprietary LLMs needs assessment. The aim of this study was to compare open-source and proprietary LLMs in generating patient-friendly radiology reports from chest CTs, using quantitative readability metrics and qualitative assessments by radiologists.

Materials And Methods: Fifty chest CT reports were processed by seven LLMs: three open-source models (Llama-3-70b, Mistral-7b, Mixtral-8x7b) and four proprietary models (GPT-4, GPT-3.
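
For the quantitative readability side of such a comparison, standard metrics can be computed per report; the sketch below uses the textstat package on two made-up text snippets, which is an assumption rather than the study's exact tooling.

```python
# Hedged sketch: readability metrics of the kind used to compare original
# and LLM-simplified report text (package choice and snippets are assumptions).
import textstat

original = ("Bilateral ground-glass opacities with subpleural consolidation, "
            "compatible with organizing pneumonia.")
simplified = ("Both lungs show hazy areas near their outer edges, which can "
              "be a sign of a type of lung inflammation.")

for name, text in [("original", original), ("simplified", simplified)]:
    print(name,
          "| Flesch Reading Ease:", textstat.flesch_reading_ease(text),
          "| Flesch-Kincaid Grade:", textstat.flesch_kincaid_grade(text))
```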


Background & Aims: The rapid advancement of large language models (LLMs) has generated interest in their potential integration in clinical workflows. However, their effectiveness in interpreting complex (imaging) reports remains underexplored and has at times yielded suboptimal results. This study aims to assess the capability of state-of-the-art LLMs to classify liver lesions based solely on textual descriptions from MRI reports, challenging the models to interpret nuanced medical language and diagnostic criteria.


Importance: The successful implementation of artificial intelligence (AI) in health care depends on its acceptance by key stakeholders, particularly patients, who are the primary beneficiaries of AI-driven outcomes.

Objectives: To survey hospital patients to investigate their trust, concerns, and preferences toward the use of AI in health care and diagnostics and to assess the sociodemographic factors associated with patient attitudes.

Design, Setting, And Participants: This cross-sectional study developed and implemented an anonymous quantitative survey between February 1 and November 1, 2023, using a nonprobability sample at 74 hospitals in 43 countries.


Background: Chronic back pain (CBP) affects over 80 million people in Europe, contributing to substantial healthcare costs and disability. Understanding modifiable risk factors, such as muscle composition, may aid in prevention and treatment. This study investigates the association between lean muscle mass (LMM) and intermuscular adipose tissue (InterMAT) with CBP using noninvasive whole-body magnetic resonance imaging (MRI).


The integration of large language models (LLMs) into health care offers tremendous opportunities to improve medical practice and patient care. Besides being susceptible to biases and threats common to all artificial intelligence (AI) systems, LLMs pose unique cybersecurity risks that must be carefully evaluated before these AI models are deployed in health care. LLMs can be exploited in several ways, including through malicious attacks, privacy breaches, and unauthorized manipulation of patient data.


Accurate medical decision-making is critical for both patients and clinicians. Patients often struggle to interpret their symptoms, determine their severity, and select the right specialist. Simultaneously, clinicians face challenges in integrating complex patient data to make timely, accurate diagnoses.


Objectives: Large language models (LLMs) have shown potential in biomedical applications, leading to efforts to fine-tune them on domain-specific data. However, the effectiveness of this approach remains unclear. This study aims to critically evaluate the performance of biomedically fine-tuned LLMs against their general-purpose counterparts across a range of clinical tasks.


This study aims to investigate the feasibility, usability, and effectiveness of a Retrieval-Augmented Generation (RAG)-powered Patient Information Assistant (PIA) chatbot for pre-CT information counseling compared to the standard physician consultation and informed consent process. This prospective comparative study included 86 patients scheduled for CT imaging between November and December 2024. Patients were randomly assigned either to the PIA group (n = 43), which received pre-CT information via the PIA chat app, or to the control group (n = 43), which received a standard physician-led consultation.
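
The retrieval step behind a RAG chatbot of this kind can be sketched in a few lines; the snippet below uses simple TF-IDF retrieval over a toy knowledge base to assemble a grounded prompt, and the snippets, question, and wiring are assumptions rather than the PIA's actual implementation.

```python
# Hedged sketch of RAG retrieval: find the most relevant pre-CT information
# snippets for a patient question and build a grounded prompt.
# Knowledge snippets, question, and prompt wording are illustrative assumptions.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

knowledge_base = [
    "Iodinated contrast agents can rarely cause allergic-like reactions.",
    "Patients should report impaired kidney function before contrast-enhanced CT.",
    "Metformin may need to be paused around contrast administration per local policy.",
    "A CT scan of the abdomen typically takes only a few minutes.",
]

question = "Do I need to stop my diabetes medication before the contrast CT?"

vectorizer = TfidfVectorizer()
doc_vectors = vectorizer.fit_transform(knowledge_base)
scores = cosine_similarity(vectorizer.transform([question]), doc_vectors)[0]
top_passages = [knowledge_base[i] for i in scores.argsort()[::-1][:2]]

prompt = ("Answer the patient's question using only the passages below.\n\n"
          + "\n".join(f"- {p}" for p in top_passages)
          + f"\n\nQuestion: {question}")
print(prompt)  # this prompt would then be sent to the generation model
```

In a full assistant, the TF-IDF step would typically be replaced by dense embeddings, and the assembled prompt would be passed to the generation model rather than printed.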


Background: The introduction of large language models (LLMs) into clinical practice promises to improve patient education and empowerment, thereby personalizing medical care and broadening access to medical knowledge. Despite the popularity of LLMs, there is a significant gap in systematized information on their use in patient care. Therefore, this systematic review aims to synthesize current applications and limitations of LLMs in patient care.


Rationale And Objectives: Training convolutional neural networks (CNNs) requires large labeled datasets, which can be very labor-intensive to prepare. Radiology reports contain a great deal of potentially useful information for such tasks. However, they are often unstructured and cannot be used directly for training.
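
One way to turn free-text reports into training labels is sketched below; a simple keyword rule stands in for the report-structuring step described above, and the reports and label logic are illustrative assumptions.

```python
# Hedged sketch: deriving weak training labels for a CNN from free-text
# radiology reports. A simple keyword rule stands in for the structuring step;
# reports and label logic are illustrative assumptions.
import re

reports = [
    "Large right-sided pneumothorax. Chest tube recommended.",
    "No pneumothorax. Lungs are clear.",
]

def label_pneumothorax(report: str) -> int:
    """Return 1 if pneumothorax is described, 0 if explicitly negated/absent."""
    negated = re.search(r"\bno (evidence of )?pneumothorax\b", report, re.I)
    present = re.search(r"\bpneumothorax\b", report, re.I)
    return int(bool(present) and not bool(negated))

labels = [label_pneumothorax(r) for r in reports]
print(labels)  # -> [1, 0]; these weak labels could then supervise a CNN
```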


Background: High-quality translations of radiology reports are essential for optimal patient care. Because of limited availability of human translators with medical expertise, large language models (LLMs) are a promising solution, but their ability to translate radiology reports remains largely unexplored.

Purpose: To evaluate the accuracy and quality of various LLMs in translating radiology reports across high-resource languages (English, Italian, French, German, and Chinese) and low-resource languages (Swedish, Turkish, Russian, Greek, and Thai).


Autonomous Medical Evaluation for Guideline Adherence (AMEGA) is a comprehensive benchmark designed to evaluate large language models' adherence to medical guidelines across 20 diagnostic scenarios spanning 13 specialties. It provides an evaluation framework and methodology for assessing models' capabilities in medical reasoning, differential diagnosis, treatment planning, and guideline adherence, using open-ended questions that mirror real-world clinical interactions. In total, the benchmark comprises 135 questions and 1,337 weighted scoring elements designed to assess comprehensive medical knowledge.
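
Scoring against weighted elements can be illustrated with a small sketch; the elements, weights, and matching rule below are invented for the example and are not AMEGA's actual content or grading code.

```python
# Hedged sketch: weighted scoring of a free-text answer against scoring
# elements, in the spirit of the weighted elements described above.
# Elements, weights, and the matching rule are illustrative assumptions.
scoring_elements = [
    {"element": "orders troponin", "weight": 2.0},
    {"element": "obtains ECG", "weight": 2.0},
    {"element": "gives aspirin", "weight": 1.0},
]

model_answer = "I would obtain an ECG and serial troponin measurements."

def element_met(element: str, answer: str) -> bool:
    """Toy matcher: an element counts if its key term appears in the answer."""
    keyword = element.split()[-1]          # e.g. "troponin", "ECG", "aspirin"
    return keyword.lower() in answer.lower()

achieved = sum(e["weight"] for e in scoring_elements
               if element_met(e["element"], model_answer))
total = sum(e["weight"] for e in scoring_elements)
print(f"score: {achieved}/{total}")        # -> score: 4.0/5.0
```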


Purpose: Large language models (LLMs) promise to streamline radiology reporting. With the release of OpenAI's GPT-4o (Generative Pre-trained Transformer 4 omni), which processes not only text but also speech, multimodal LLMs might now also be used as medical speech recognition software for radiology reporting in multiple languages. This proof-of-concept study investigates the feasibility of using GPT-4o for automated voice-to-text transcription of radiology reports in English and German.
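
A minimal transcription call illustrating the voice-to-text step might look like the sketch below; the audio file name is hypothetical and the model identifier is a stand-in assumption (the study itself used GPT-4o's speech capabilities).

```python
# Hedged sketch: automated voice-to-text transcription of a dictated report
# via OpenAI's transcription endpoint. The audio file name is hypothetical and
# the model identifier is a stand-in assumption, not the study's setup.
from openai import OpenAI

client = OpenAI()  # expects OPENAI_API_KEY in the environment

with open("dictated_report_de.mp3", "rb") as audio_file:
    transcript = client.audio.transcriptions.create(
        model="whisper-1",   # stand-in speech-to-text model for this sketch
        file=audio_file,
    )

print(transcript.text)
```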

Article Synopsis
  • Advances in large language models (LLMs) have led to numerous commercial and open-source models, but there has been no real-world comparison of OpenAI's GPT-4 against these models for extracting information from radiology reports.
  • The study aimed to compare GPT-4 with several leading open-source LLMs in extracting relevant findings from chest radiograph reports, using datasets from ImaGenome and Massachusetts General Hospital.
  • Results showed that GPT-4 slightly outperformed the best open-source model, Llama 2-70B, in terms of accuracy scores, with both showing strong performance in extracting findings from the reports.

Structured reporting (SR) has long been a goal in radiology to standardize and improve the quality of radiology reports. Despite evidence that SR reduces errors, enhances comprehensiveness, and increases adherence to guidelines, its widespread adoption has been limited. Recently, large language models (LLMs) have emerged as a promising solution to automate and facilitate SR.


Purpose: To quantitatively and qualitatively evaluate and compare the performance of leading large language models (LLMs), including proprietary models (GPT-4, GPT-3.5 Turbo, Claude-3-Opus, and Gemini Ultra) and open-source models (Mistral-7b and Mixtral-8x7b), in simplifying 109 interventional radiology reports.

Methods: Qualitative performance was assessed using a five-point Likert scale for accuracy, completeness, clarity, clinical relevance, and naturalness, as well as error rates, including trust-breaking and post-therapy misconduct errors.


Background: The successful integration of artificial intelligence (AI) in healthcare depends on the global perspectives of all stakeholders. This study aims to answer the research question: What are the attitudes of medical, dental, and veterinary students towards AI in education and practice, and what are the regional differences in these perceptions?

Methods: An anonymous online survey was developed based on a literature review and expert panel discussions. The survey assessed students' AI knowledge, attitudes towards AI in healthcare, current state of AI education, and preferences for AI teaching.

Article Synopsis
  • The EU's AI Act is the first detailed legal framework focused on artificial intelligence, with particular implications for healthcare.
  • Existing regulations like the Medical Device Regulation do not specifically address medical AI applications, making the AI Act crucial for this sector.
  • The commentary highlights key elements of the AI Act, providing clear references to specific chapters for better understanding.