Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Objectives: This study aimed to develop an accurate prediction model for the risk of Non-alcoholic fatty liver disease (NAFLD) using the random survival forests (RSF), and to investigate the distribution of NAFLD risk with time.

Methods: This retrospective cohort study included subjects who had annual health checkups from 1 January 2021 to 31 December 2024. The hold-out strategy, that all the subjects were divided into a training set and a test set, was employed to develop and evaluate our models. Important predictors were then extracted from all the candidate variables using the LASSO regression on the training set. Two prediction models were constructed using the Cox model and the RSF model. Feature importance and their 95% CIs were calculated using the VIMP with bootstrap resampling. The integrated area under the curve (iAUC), the time-dependent area under the curve (tAUC), the integrated Brier score (iBS), and the time-dependent prediction error (PE) were used to evaluate the discrimination and calibration of our models.

Results: A total of 18,250 patients fulfilled the criteria, and 14 predictors were extracted through the LASSO regression for the next model development. The RSF model showed exceptional discrimination (iAUC of 0.856) and calibration (iBS of 0.116) compared to the Cox model (iAUC of 0.759 and iBS of 0.148). Based on the RSF model predictions, subjects were stratified into the high- and low-risk groups with significant differences, with a mean NAFLD-free time of 20.86 and 36.76 months (P <.0001), respectively.

Conclusions: In this study, the RSF prediction model for the risk of NAFLD was developed, which outperformed the traditional Cox model, achieved remarkable risk stratification for NAFLD, and provided novel insights into the distribution of NAFLD risk with time.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC12261573PMC
http://dx.doi.org/10.1186/s12876-025-04120-6DOI Listing

Publication Analysis

Top Keywords

rsf model
12
non-alcoholic fatty
8
fatty liver
8
liver disease
8
training set
8
predictors extracted
8
lasso regression
8
cox model
8
area curve
8
model
7

Similar Publications

Neuro-Immuno-Stromal Context in Colorectal Cancer: An Enteric Glial Cell-Driven Prognostic Model via Machine Learning Predicts Survival, Recurrence, and Therapy Response.

Exp Cell Res

September 2025

Key Laboratory of Carcinogenesis and Translational Research (Ministry of Education/Beijing), Gastrointestinal Cancer Center, Peking University Cancer Hospital and Institute, Beijing, China. Electronic address:

Background: Enteric glial cells (EGCs) have been implicated in colorectal cancer (CRC) progression. This study aimed to develop and validate a prognostic model integrating EGC- and CRC-associated gene expression to predict patient survival, recurrence, metastasis, and therapy response.

Methods: Bulk and single-cell RNA sequencing data were analyzed, and a machine learning-based model was constructed using the RSF random forest algorithm.

View Article and Find Full Text PDF

Introduction: Surgery remains the primary treatment for patients with esophageal cancer (EC), yet postoperative prognosis is often unsatisfactory. Accurate prediction of cancer-specific survival (CSS) can assist clinicians in personalized treatment planning. This study aimed to develop an interactive web-based tool to estimate CSS in patients with T1~3N0~2M0 EC after surgery, based on the log odds of negative lymph nodes/T stage ratio (LONT).

View Article and Find Full Text PDF

Introduction: Our study aimed to identify risk factors associated with the survival of gastric cancer patients with Type 2 diabetes mellitus (T2DM) and create a risk-scoring system for predicting their survival probabilities.

Methods: We gathered data from 1,912 individuals with both gastric cancer and T2DM from the Hong Kong Hospital Authority Data Collaboration Laboratory (HADCL), spanning from 2000 to 2020. We used conventional Cox proportional hazards regression and tree-based machine learning algorithms to construct models for prognosis risk prediction.

View Article and Find Full Text PDF

Statistical learning methods for improving predictive performance in time-dependent survival models.

Genomics Inform

September 2025

Department of Statistics and Actuarial Science, Soongsil University, Seoul, 06978, South Korea.

Background: The COVID-19 pandemic has highlighted the need for survival models to assess risk factors and time-dependent effects in infectious diseases. However, the Cox proportional hazards (PH) model, which assumes constant covariate effects, struggles to capture disease dynamics. This underscores the need for advanced models that incorporate time-dependent coefficients and covariates for improved accuracy.

View Article and Find Full Text PDF

Effects of aging on anticancer therapy in murine cancer models.

Cancer Metastasis Rev

August 2025

Department of Carcinogenesis and Oncogerontology, N.N. Petrov National Medical Research Center of Oncology, 68 Leningradskaya Ul, Pesochny, Saint Petersburg, 197758, Russia.

More than half of cancer patients are over 65 years old. This proportion will increase with further population aging. Cancer properties significantly depend on patients' age, and, as a rule, cancer responsiveness to therapy decreases with patients' aging.

View Article and Find Full Text PDF