98%
921
2 minutes
20
Background: Clinical trials are vital for developing new therapies but can also delay drug development. Efficient trial data management, optimized trial protocol, and accurate patient identification are critical for reducing trial timelines. Natural language processing (NLP) has the potential to achieve these objectives.
Objective: This study aims to assess the feasibility of using data-driven approaches to optimize clinical trial protocol design and identify eligible patients. This involves creating a comprehensive eligibility criteria knowledge base integrated within electronic health records using deep learning-based NLP techniques.
Methods: We obtained data of 3281 industry-sponsored phase 2 or 3 interventional clinical trials recruiting patients with non-small cell lung cancer, prostate cancer, breast cancer, multiple myeloma, ulcerative colitis, and Crohn disease from ClinicalTrials.gov, spanning the period between 2013 and 2020. A customized bidirectional long short-term memory- and conditional random field-based NLP pipeline was used to extract all eligibility criteria attributes and convert hypernym concepts into computable hyponyms along with their corresponding values. To illustrate the simulation of clinical trial design for optimization purposes, we selected a subset of patients with non-small cell lung cancer (n=2775), curated from the Mount Sinai Health System, as a pilot study.
Results: We manually annotated the clinical trial eligibility corpus (485/3281, 14.78% trials) and constructed an eligibility criteria-specific ontology. Our customized NLP pipeline, developed based on the eligibility criteria-specific ontology that we created through manual annotation, achieved high precision (0.91, range 0.67-1.00) and recall (0.79, range 0.50-1) scores, as well as a high F-score (0.83, range 0.67-1), enabling the efficient extraction of granular criteria entities and relevant attributes from 3281 clinical trials. A standardized eligibility criteria knowledge base, compatible with electronic health records, was developed by transforming hypernym concepts into machine-interpretable hyponyms along with their corresponding values. In addition, an interface prototype demonstrated the practicality of leveraging real-world data for optimizing clinical trial protocols and identifying eligible patients.
Conclusions: Our customized NLP pipeline successfully generated a standardized eligibility criteria knowledge base by transforming hypernym criteria into machine-readable hyponyms along with their corresponding values. A prototype interface integrating real-world patient information allows us to assess the impact of each eligibility criterion on the number of patients eligible for the trial. Leveraging NLP and real-world data in a data-driven approach holds promise for streamlining the overall clinical trial process, optimizing processes, and improving efficiency in patient identification.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11319878 | PMC |
http://dx.doi.org/10.2196/50800 | DOI Listing |
JACC Heart Fail
September 2025
Université de Lorraine, Inserm, Centre d'Investigations Cliniques Plurithématique 1433, Centre Hospitalier Régional Universitaire de Nancy, Nancy, France.
JMIR Res Protoc
September 2025
National Institute of Public Health, University of Southern Denmark, Copenhagen K, Denmark.
Background: The high and increasing rate of poor mental health among young people is a matter of global concern. Experiencing poor mental health during this formative stage of life can adversely impact interpersonal relationships, academic and professional performance, and future health and well-being if not addressed early. However, only a few of those in need seek help.
View Article and Find Full Text PDFJMIR Res Protoc
September 2025
Institute for Collaboration on Health, Intervention, and Policy, University of Connecticut, Storrs, CT, United States.
Background: Children in the United States have poor diet quality, increasing their risk for chronic disease burden later in life. Caregivers' feeding behaviors are a critical factor in shaping lifelong dietary habits. The Strong Families Start at Home/Familias Fuertes Comienzan en Casa (SFSH) was a 6-month, home-based, pilot randomized-controlled feasibility trial that aimed to improve the diet quality of 2-5-year-old children and promote positive parental feeding practices among a predominantly Hispanic/Latine sample.
View Article and Find Full Text PDFAnn Am Thorac Soc
September 2025
Brigham and Women's Hospital, Division of Sleep and Circadian Disorders, Boston, Massachusetts, United States.
Rationale: There are insufficient data to inform the management of central sleep apnea (CSA) in patients with heart failure (HF) with reduced ejection fraction (HFrEF). Nocturnal oxygen therapy (NOT) has been postulated to benefit CSA patients with HFrEF, but has not been rigorously studied. This article is open access and distributed under the terms of the Creative Commons Attribution Non-Commercial No Derivatives License 4.
View Article and Find Full Text PDFN Engl J Med
September 2025
Rwanda Biomedical Center, Kigali.
Background: On September 27, 2024, Rwanda reported an outbreak of Marburg virus disease (MVD), after a cluster of cases of viral hemorrhagic fever was detected at two urban hospitals.
Methods: We report key aspects of the epidemiology, clinical manifestations, and treatment of MVD during this outbreak, as well as the overall response to the outbreak. We performed a retrospective epidemiologic and clinical analysis of data compiled across all pillars of the outbreak response and a case-series analysis to characterize clinical features, disease progression, and outcomes among patients who received supportive care and investigational therapeutic agents.