Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

The evolving field of medical education is being shaped by technological advancements, including the integration of Large Language Models (LLMs) like ChatGPT. These models could be invaluable resources for medical students, by simplifying complex concepts and enhancing interactive learning by providing personalized support. LLMs have shown impressive performance in professional examinations, even without specific domain training, making them particularly relevant in the medical field. This study aims to assess the performance of LLMs in radiology examinations for medical students, thereby shedding light on their current capabilities and implications.This study was conducted using 151 multiple-choice questions, which were used for radiology exams for medical students. The questions were categorized by type and topic and were then processed using OpenAI's GPT-3.5 and GPT- 4 via their API, or manually put into Perplexity AI with GPT-3.5 and Bing. LLM performance was evaluated overall, by question type and by topic.GPT-3.5 achieved a 67.6% overall accuracy on all 151 questions, while GPT-4 outperformed it significantly with an 88.1% overall accuracy (p<0.001). GPT-4 demonstrated superior performance in both lower-order and higher-order questions compared to GPT-3.5, Perplexity AI, and medical students, with GPT-4 particularly excelling in higher-order questions. All GPT models would have successfully passed the radiology exam for medical students at our university.In conclusion, our study highlights the potential of LLMs as accessible knowledge resources for medical students. GPT-4 performed well on lower-order as well as higher-order questions, making ChatGPT-4 a potentially very useful tool for reviewing radiology exam questions. Radiologists should be aware of ChatGPT's limitations, including its tendency to confidently provide incorrect responses. · ChatGPT demonstrated remarkable performance, achieving a passing grade on a radiology examination for medical students that did not include image questions.. · GPT-4 exhibits significantly improved performance compared to its predecessors GPT-3.5 and Perplexity AI with 88% of questions answered correctly.. · Radiologists as well as medical students should be aware of ChatGPT's limitations, including its tendency to confidently provide incorrect responses.. · Gotta J, Le Hong QA, Koch V et al. Large language models (LLMs) in radiology exams for medical students: Performance and consequences. Rofo 2025; 197: 1057-1067.

Download full-text PDF

Source
http://dx.doi.org/10.1055/a-2437-2067DOI Listing

Publication Analysis

Top Keywords

medical students
16
large language
8
language models
8
models llms
8
llms radiology
8
radiology exams
8
exams medical
8
medical
6
llms
4
students
4

Similar Publications

Background: In Canada, the Indigenous population is the youngest and fastest growing, yet ongoing health disparities for Indigenous peoples are widely recognized. There is a concerning lack of research on childhood disabilities and health conditions in Indigenous populations in Canada. For children with disabilities and chronic health conditions, ongoing access to rehabilitation services, such as occupational therapy, physical therapy, speech-language pathology, and audiology, is critical in promoting positive health and developmental outcomes.

View Article and Find Full Text PDF

School activity participation and sense of belonging among U.S. college students.

J Am Coll Health

September 2025

Columbia-Bassett Program, Vagelos College of Physicians and Surgeons, Columbia University, New York, NY, USA.

To determine whether activity participation is associated with a greater sense of belonging among U.S. college students.

View Article and Find Full Text PDF

The aim of the paper is to reflect on the importance of the teacher of the medical profession in graduate and postgraduate education. The objective of the analysis was a narrative reflection on the profession of a teacher of medical professionals based on the principles of medical education and specialization programs applicable in Poland. The core curriculum for teaching in the field of medicine was analysed in detail, including also the insufficiently developed principles of selection and education of academic and vocational teachers.

View Article and Find Full Text PDF

The amphibian dissection for medical students was halted by the restrictions imposed by the National regulatory guidelines, prompting medical curricula to revise and innovate instructional methods. Hence there is a critical need for potential innovative solutions to enhance students' understanding of physiological concepts. Therefore, this study aimed (a) to evaluate the gain in knowledge and retention with computer assisted simulation (CAS) vs traditional (TT) teaching learning strategies in first year medical and paramedical students, and (b) to obtain students' and faculty feedback about strengths and limitations of both strategies.

View Article and Find Full Text PDF

This study aimed to evaluate the effectiveness of online synchronous and asynchronous teaching formats for undergraduate physiology education in a medical program in Ireland, with a specific focus on the use of LabTutor (Lt) LabStation online laboratory platform for remote access. To understand how the Lt platform was used by students and whether it enhanced their learning experience in physiology, we conducted a survey and questionnaire. We focused on students' access to Lt activities and examined any gender differences in the utilization of, and attitudes towards, these activities in a 'Fundamentals of Medicine' module for first-year medical students (n=65).

View Article and Find Full Text PDF