Citations

20

Article Abstract

The aim of this work was to create a gold-standard curated cohort of 10,000+ cases from the Veterans Affairs (VA) Corporate Data Warehouse (CDW) for virtual emulation of a randomized clinical trial (CSP#592). Six of the trial's inclusion/exclusion criteria lacked adequate structured data, so we used a hybrid computer/human approach to extract the missing information from clinical notes. Rule-based NLP output was iteratively adjudicated by a panel of trained non-clinician content experts and non-experts, using an easy-to-use, spreadsheet-based rapid adjudication display. This group-adjudication process iteratively sharpened both the computer algorithm and the clinical decision criteria while simultaneously training the non-experts. The cohort was successfully created, with each inclusion/exclusion decision backed by a source document. Fewer than 0.5% of cases required referral to specialist clinicians. Curated datasets of this kind, which capture specialist reasoning through a process-supervised approach, are likely to become more important as training tools for future clinical AI applications.
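
The hybrid workflow described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the rule names, regular expressions, column names, and sample note below are all hypothetical, chosen only to show how rule-based matches might be turned into spreadsheet rows that a human panel can adjudicate.

```python
import csv
import io
import re

# Hypothetical rule set (criterion name -> regex); NOT the study's actual rules.
RULES = {
    "heart_failure": re.compile(r"\bheart failure\b|\bCHF\b", re.IGNORECASE),
    "dialysis": re.compile(r"\b(?:hemo)?dialysis\b", re.IGNORECASE),
}

# Hypothetical spreadsheet columns; "adjudication" is left blank for reviewers.
FIELDS = ["case_id", "note_id", "criterion", "matched_term", "snippet", "adjudication"]


def extract_snippets(case_id, note_id, text, window=40):
    """Return one row per rule match, with surrounding context for a reviewer."""
    rows = []
    for criterion, pattern in RULES.items():
        for match in pattern.finditer(text):
            start = max(0, match.start() - window)
            end = min(len(text), match.end() + window)
            rows.append({
                "case_id": case_id,
                "note_id": note_id,
                "criterion": criterion,
                "matched_term": match.group(0),
                "snippet": text[start:end].replace("\n", " "),
                "adjudication": "",  # filled in by the human panel
            })
    return rows


def to_csv(rows):
    """Serialize rows as CSV text that can be opened in a spreadsheet."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=FIELDS)
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


# Made-up note text, for illustration only.
rows = extract_snippets("case-001", "note-17", "Pt denies CHF. Started hemodialysis in 2019.")
sheet = to_csv(rows)
```

In this made-up note the rule fires on "CHF" even though the condition is denied, which is the kind of match a reviewer would reject and which could then prompt a refinement of the rule on the next iteration.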

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC12099393PMC

Similar Publications

Aims/hypothesis: Severe hypoglycaemia events (SHE) remain frequent in people with type 1 diabetes despite advanced diabetes technologies. We examined whether time below range (TBR) <3.9 mmol/l (70 mg/dl; TBR70) or 3.

Introduction: Elevated peripheral blood monocyte counts (PBMC) are associated with disease progression and mortality in patients with idiopathic pulmonary fibrosis (IPF). However, evidence for progression stems primarily from highly curated cohort studies or post-hoc analysis of clinical trials. We used real-world data to examine the association between PBMC and IPF mortality among a national cohort of Veterans with IPF.

Charting the equine miRNA landscape: An integrated pipeline and browser for annotating, quantifying, and visualizing expression.

PLoS Genet

September 2025

Department of Veterinary Population Medicine, College of Veterinary Medicine, University of Minnesota, St. Paul, Minnesota, United States of America.

MicroRNAs (miRNAs) are essential regulators of gene expression, yet few comprehensive databases exist for miRNA expression in non-model species, limiting our ability to characterize their roles in gene regulation, development, and disease. Similarly, isomiRs - length and sequence isoforms of canonical miRNAs with potentially altered regulatory targets and functions - have received even less attention in non-model species, including the horse, leaving a critical gap in our understanding of their biological significance. To address these challenges, we developed an open-source, containerized pipeline for identifying and quantifying miRNAs and isomiRs (FARmiR: Framework for Analysis and Refinement of miRNAs), and an associated interactive browser (AIMEE: Animal IsomiR and MiRNA Expression Explorer).

Background: Cancer morbidity disproportionately affects patients in low- and middle-income countries (LMICs), where timely and accurate tumor profiling is often nonexistent. Immunohistochemistry-based assessment of estrogen receptor (ER) status, a critical step to guide use of endocrine therapy (ET) in breast cancer, is often delayed or unavailable. As a result, ET is often prescribed empirically, leading to ineffective and toxic treatment for ER-negative patients.

LitAutoScreener: Development and Validation of an Automated Literature Screening Tool in Evidence-Based Medicine Driven by Large Language Models.

Health Data Sci

September 2025

Key Laboratory of Epidemiology of Major Diseases, Ministry of Education/Department of Epidemiology and Biostatistics, School of Public Health, Peking University, Beijing, China.

The traditional manual literature screening approach is limited by its time-consuming nature and high labor costs. A pressing issue is how to leverage large language models to enhance the efficiency and quality of evidence-based evaluations of drug efficacy and safety. This study utilized a manually curated reference literature database (comprising vaccine, hypoglycemic agent, and antidepressant evaluation studies) previously developed by our team through conventional systematic review methods.
