Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Background: Application of novel machine learning approaches to electronic health record (EHR) data could provide valuable insights into disease processes. We utilized this approach to build predictive models for progression to prediabetes and type 2 diabetes (T2D).

Methods: Using a novel analytical platform (Reverse Engineering and Forward Simulation [REFS]), we built prediction model ensembles for progression to prediabetes or T2D from an aggregated EHR data sample. REFS relies on a Bayesian scoring algorithm to explore a wide model space, and outputs a distribution of risk estimates from an ensemble of prediction models. We retrospectively followed 24 331 adults for transitions to prediabetes or T2D, 2007-2012. Accuracy of prediction models was assessed using an area under the curve (AUC) statistic, and validated in an independent data set.

Results: Our primary ensemble of models accurately predicted progression to T2D (AUC = 0.76), and was validated out of sample (AUC = 0.78). Models of progression to T2D consisted primarily of established risk factors (blood glucose, blood pressure, triglycerides, hypertension, lipid disorders, socioeconomic factors), whereas models of progression to prediabetes included novel factors (high-density lipoprotein, alanine aminotransferase, C-reactive protein, body temperature; AUC = 0.70).

Conclusions: We constructed accurate prediction models from EHR data using a hypothesis-free machine learning approach. Identification of established risk factors for T2D serves as proof of concept for this analytical approach, while novel factors selected by REFS represent emerging areas of T2D research. This methodology has potentially valuable downstream applications to personalized medicine and clinical research.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4738229PMC
http://dx.doi.org/10.1177/1932296815620200DOI Listing

Publication Analysis

Top Keywords

prediction models
16
models progression
16
machine learning
12
ehr data
12
progression prediabetes
12
reverse engineering
8
models
8
type diabetes
8
electronic health
8
prediabetes t2d
8

Similar Publications

Crop growth rate is a critical physiological trait for forage and bioenergy crops like sorghum [Sorghum bicolor (L.) Moench], influencing overall crop productivity, particularly in photoperiod-sensitive (PS) types. Crop growth rate studies focus on either a physiological approach utilizing a few genotypes to analyze biomass accumulation or a genetic approach characterizing easily scorable proxy traits in larger populations.

View Article and Find Full Text PDF

Background: Cardio-kidney-metabolic (CKM) disease represents a significant public health challenge. While proteomics-based risk scores (ProtRS) enhance cardiovascular risk prediction, their utility in improving risk prediction for a composite CKM outcome beyond traditional risk factors remains unknown.

Methods: We analyzed 23 815 UK Biobank participants without baseline CKM disease, defined by -Tenth Revision codes as cardiovascular disease (coronary artery disease, heart failure, stroke, peripheral arterial disease, atrial fibrillation/flutter), kidney disease (chronic kidney disease or end-stage renal disease), or metabolic disease (type 2 diabetes or obesity).

View Article and Find Full Text PDF

Preclinical stroke research faces a critical translational gap, with animal studies failing to reliably predict clinical efficacy. To address this, the field is moving toward rigorous, multicenter preclinical randomized controlled trials (mpRCTs) that mimic phase 3 clinical trials in several key components. This collective statement, derived from experts involved in mpRCTs, outlines considerations for designing and executing such trials.

View Article and Find Full Text PDF

Background: At the 2020 UN General Assembly, China pledged to peak carbon emissions before 2030 and achieve carbon neutrality by 2060. However, the traditional social development model has led to increasing carbon emissions annually, highlighting the need to resolve the contradiction between development and carbon reduction. This study examines the relationship between carbon emissions, economy, population, and energy consumption in a specific region to support carbon peak and neutrality goals.

View Article and Find Full Text PDF

Oral cancer is a major global health burden, ranking sixth in prevalence, with oral squamous cell carcinoma (OSCC) being the most common type. Importantly, OSCC is often diagnosed at late stages, underscoring the need for innovative methods for early detection. The oral microbiome, an active microbial community within the oral cavity, holds promise as a biomarker for the prediction and progression of cancer.

View Article and Find Full Text PDF