Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

The remarkable performance of ChatGPT, launched in November 2022, has significantly impacted the field of natural language processing, inspiring the application of large language models as supportive tools in clinical practice and research worldwide. Although GPT-3.5 recently scored high on the United States Medical Licensing Examination, its performance on medical licensing examinations of other nations, especially non-English speaking nations, has not been sufficiently evaluated. This study assessed GPT's performance on the National Medical Licensing Examination (NMLE) in Japan and compared it with the actual minimal passing rate for this exam. In particular, the performances of both the GPT-3.5 and GPT-4 models were considered for the comparative analysis. We initially used the GPT models and several prompts for 290 questions without image data from the 116th NMLE (held in February 2022 in Japan) to maximize the performance for delivering correct answers and explanations of the questions. Thereafter, we tested the performance of the best GPT model (GPT-4) with optimized prompts on a dataset of 262 questions without images from the latest 117th NMLE (held in February 2023). The best model with the optimized prompts scored 82.7% for the essential questions and 77.2% for the basic and clinical questions, both of which sufficed the minimum passing scoring rates of 80.0% and 74.6%, respectively. After an exploratory analysis of 56 incorrect answers from the model, we identified the three major factors contributing to the generation of the incorrect answers-insufficient medical knowledge, information on Japan-specific medical system and guidelines, and mathematical errors. In conclusion, GPT-4 with our optimized prompts achieved a minimum passing scoring rate in the latest 117th NMLE in Japan. Beyond its original design of answering examination questions for humans, these artificial intelligence (AI) models can serve as one of the best "sidekicks" for solving problems and addressing the unmet needs in the medical and healthcare fields.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10805303PMC
http://dx.doi.org/10.1371/journal.pdig.0000433DOI Listing

Publication Analysis

Top Keywords

medical licensing
16
licensing examination
12
optimized prompts
12
national medical
8
nmle japan
8
nmle held
8
held february
8
gpt-4 optimized
8
latest 117th
8
117th nmle
8

Similar Publications

Geographic and Policy Factors Influence Telehealth Availability for Substance Use Disorder Treatment.

J Behav Health Serv Res

September 2025

Department of Health Policy and Management, Fay W. Boozman College of Public Health, University of Arkansas for Medical Sciences, 4301 W. Markham St., Little Rock, AR, USA.

Telehealth is increasingly a standard and routine clinical option, indicating a changing outlook for SUD treatment from in-person to the more convenient option of telehealth. As populations across geographies increasingly prefer telehealth, more research is warranted that focuses on how where a person lives is associated with telehealth availability. The authors used the Mental Health and Addiction Treatment Tracking Repository (MATTR 2024) to identify telehealth availability among all known licensed SUD treatment facilities in the USA (N = 10,492 facilities).

View Article and Find Full Text PDF

Background: Individuals with kidney failure experience elevated cardiovascular risk, potentially worsened by the presence of sleep disordered breathing. Despite this association, prevalence of sleep apnoea, and evidence for effective treatments are poorly understood in people with kidney failure. This review examines sleep apnoea prevalence, types of sleep apnoea, and treatment interventions in people with kidney failure receiving dialysis.

View Article and Find Full Text PDF

Beyond their classical functions as redox cofactors, recent fundamental and clinical research has expanded our understanding of the diverse roles of nicotinamide adenine dinucleotide (NAD) and nicotinamide adenine dinucleotide phosphate (NADP) in signaling pathways, epigenetic regulation and energy homeostasis. Moreover, NAD and NADP influence numerous diseases as well as the processes of aging, and are emerging as targets for clinical intervention. Here, we summarize safety, bioavailability and efficacy data from NAD-related clinical trials, focusing on aging and neurodegenerative diseases.

View Article and Find Full Text PDF

AI-informed retinal biomarkers predict 10-year risk of onset of multiple hematological malignancies.

Eur J Cancer

August 2025

Emory University, Atlanta, USA; Wallace H. Coulter Department of Biomedical Engineering, Georgia Institute of Technology and Emory University, Atlanta, GA, USA; Atlanta Veterans Administration Medical Center, Atlanta, USA. Electronic address:

Background: Early detection of hematological malignancies improves long-term survival but remains a critical challenge due to heterogeneity in clinical presentation. Chronic inflammation is a key driver in hematologic cancers and is known to induce compensatory microvascular changes. High-resolution, non-invasive retinal imaging can allow the quantification of microvascular changes for the early detection of hematological malignancies.

View Article and Find Full Text PDF

Natural killer (NK) cell licensing is an educational process that enhances responsiveness to activating signals in maturing NK cells and is predominantly regulated by major histocompatibility complex (MHC) class I-specific inhibitory signals. However, the role of non-MHC signalling in this process remains unclear. Here, we investigated the role of FcRγ, an adaptor protein associated with activating receptors, in the regulation of NK cell responsiveness.

View Article and Find Full Text PDF