Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Background: Clinical trials are vital for developing new therapies but can also delay drug development. Efficient trial data management, optimized trial protocol, and accurate patient identification are critical for reducing trial timelines. Natural language processing (NLP) has the potential to achieve these objectives.

Objective: This study aims to assess the feasibility of using data-driven approaches to optimize clinical trial protocol design and identify eligible patients. This involves creating a comprehensive eligibility criteria knowledge base integrated within electronic health records using deep learning-based NLP techniques.

Methods: We obtained data of 3281 industry-sponsored phase 2 or 3 interventional clinical trials recruiting patients with non-small cell lung cancer, prostate cancer, breast cancer, multiple myeloma, ulcerative colitis, and Crohn disease from ClinicalTrials.gov, spanning the period between 2013 and 2020. A customized bidirectional long short-term memory- and conditional random field-based NLP pipeline was used to extract all eligibility criteria attributes and convert hypernym concepts into computable hyponyms along with their corresponding values. To illustrate the simulation of clinical trial design for optimization purposes, we selected a subset of patients with non-small cell lung cancer (n=2775), curated from the Mount Sinai Health System, as a pilot study.

Results: We manually annotated the clinical trial eligibility corpus (485/3281, 14.78% trials) and constructed an eligibility criteria-specific ontology. Our customized NLP pipeline, developed based on the eligibility criteria-specific ontology that we created through manual annotation, achieved high precision (0.91, range 0.67-1.00) and recall (0.79, range 0.50-1) scores, as well as a high F-score (0.83, range 0.67-1), enabling the efficient extraction of granular criteria entities and relevant attributes from 3281 clinical trials. A standardized eligibility criteria knowledge base, compatible with electronic health records, was developed by transforming hypernym concepts into machine-interpretable hyponyms along with their corresponding values. In addition, an interface prototype demonstrated the practicality of leveraging real-world data for optimizing clinical trial protocols and identifying eligible patients.

Conclusions: Our customized NLP pipeline successfully generated a standardized eligibility criteria knowledge base by transforming hypernym criteria into machine-readable hyponyms along with their corresponding values. A prototype interface integrating real-world patient information allows us to assess the impact of each eligibility criterion on the number of patients eligible for the trial. Leveraging NLP and real-world data in a data-driven approach holds promise for streamlining the overall clinical trial process, optimizing processes, and improving efficiency in patient identification.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11319878PMC
http://dx.doi.org/10.2196/50800DOI Listing

Publication Analysis

Top Keywords

clinical trial
24
eligibility criteria
16
real-world data
12
clinical trials
12
criteria knowledge
12
knowledge base
12
nlp pipeline
12
hyponyms corresponding
12
corresponding values
12
trial
10

Similar Publications

Background: The high and increasing rate of poor mental health among young people is a matter of global concern. Experiencing poor mental health during this formative stage of life can adversely impact interpersonal relationships, academic and professional performance, and future health and well-being if not addressed early. However, only a few of those in need seek help.

View Article and Find Full Text PDF

Background: Children in the United States have poor diet quality, increasing their risk for chronic disease burden later in life. Caregivers' feeding behaviors are a critical factor in shaping lifelong dietary habits. The Strong Families Start at Home/Familias Fuertes Comienzan en Casa (SFSH) was a 6-month, home-based, pilot randomized-controlled feasibility trial that aimed to improve the diet quality of 2-5-year-old children and promote positive parental feeding practices among a predominantly Hispanic/Latine sample.

View Article and Find Full Text PDF

Rationale: There are insufficient data to inform the management of central sleep apnea (CSA) in patients with heart failure (HF) with reduced ejection fraction (HFrEF). Nocturnal oxygen therapy (NOT) has been postulated to benefit CSA patients with HFrEF, but has not been rigorously studied. This article is open access and distributed under the terms of the Creative Commons Attribution Non-Commercial No Derivatives License 4.

View Article and Find Full Text PDF

Background: On September 27, 2024, Rwanda reported an outbreak of Marburg virus disease (MVD), after a cluster of cases of viral hemorrhagic fever was detected at two urban hospitals.

Methods: We report key aspects of the epidemiology, clinical manifestations, and treatment of MVD during this outbreak, as well as the overall response to the outbreak. We performed a retrospective epidemiologic and clinical analysis of data compiled across all pillars of the outbreak response and a case-series analysis to characterize clinical features, disease progression, and outcomes among patients who received supportive care and investigational therapeutic agents.

View Article and Find Full Text PDF