Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Recent advancements in large language models (LLMs) show significant potential in medical applications but are hindered by limited specialized medical knowledge. We present Me-LLaMA, a family of open-source medical LLMs integrating extensive domain-specific knowledge with robust instruction-following capabilities. Me-LLaMA is developed through continual pretraining and instruction tuning of LLaMA2 models using diverse biomedical and clinical data sources (e.g., biomedical literature and clinical notes). We evaluated Me-LLaMA on six text analysis tasks using 12 benchmarks (e.g., PubMedQA and MIMIC-CXR) and assessed its clinical utility in complex case diagnosis through automatic and human evaluations. Me-LLaMA outperforms existing open medical LLMs in zero-shot and supervised settings and surpasses ChatGPT and GPT-4 after task-specific instruction tuning for most text analysis tasks. Its performance is also comparable to ChatGPT and GPT-4 for diagnosing complex clinical cases. Our findings highlight the importance of combining domain-specific continual pretraining with instruction tuning to enhance performance in medical LLMs.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11882967PMC
http://dx.doi.org/10.1038/s41746-025-01533-1DOI Listing

Publication Analysis

Top Keywords

text analysis
12
medical llms
12
instruction tuning
12
large language
8
language models
8
continual pretraining
8
pretraining instruction
8
analysis tasks
8
chatgpt gpt-4
8
medical
6

Similar Publications

Background: Addictive disorders remain a global problem, affecting health, society and the economy. The etiopathogenesis of addictions, which have a multifactorial nature, is poorly understood, making it difficult to develop personalized treatment approaches. Of particular interest is the gene, which regulates serotonergic transmission.

View Article and Find Full Text PDF

Background: Diabetes mellitus is a major health challenge among older adults in Asia. Challenges include limited healthcare access and poor self-care adherence. Continuity of care has emerged as a key strategy to enhance diabetes self-management in this population.

View Article and Find Full Text PDF

Aims: Cardiogenic shock remains a significant cause of mortality despite multiple advancements in medical interventions. Veno-arterial extracorporeal membrane oxygenation (VA-ECMO) provides crucial circulatory support but also increases left ventricular (LV) after-load, potentially worsening outcomes. Effective LV unloading strategies can enhance patient survival during VA-ECMO treatment.

View Article and Find Full Text PDF

Predicting career trajectories is a complex yet impactful task, offering significant benefits for personalized career counseling, recruitment optimization, and workforce planning. However, effective career path prediction (CPP) modeling faces challenges including highly variable career trajectories, free-text resume data, and limited publicly available benchmark datasets. In this study, we present a comprehensive comparative evaluation of CPP models-linear projection, multilayer perceptron (MLP), LSTM, and large language models (LLMs)-across multiple input settings and two recently introduced public datasets.

View Article and Find Full Text PDF

Systematic review of prospective hazard analysis in radiation therapy.

Med Phys

September 2025

Image X Institute, Faculty of Medicine and Health, University of Sydney, Eveleigh, New South Wales, Australia.

Introduction: Prospective hazard analysis (PHA) was introduced to the wider medical physics community by the initiation of American association of physicists in medicine task group 100 in 2003. Since then, there has been increasing interest in the applicability of PHA to radiotherapy for the purpose of keeping patients safe and assessing the risks within the whole practice of radiotherapy. The purpose of this research was to review the PHA literature focusing on which techniques and technologies have been assessed, how they have been assessed, and what can be learnt.

View Article and Find Full Text PDF