Comparison Between Statistical Model and Machine Learning Methods for Predicting the Risk of Renal Function Decline Using Routine Clinical Data in Health Screening.

Risk Manag Healthc Policy

National Clinical Research Center for Mental Disorders, and Department of Psychiatry, The Second Xiangya Hospital, Central South University, Changsha, Hunan, People's Republic of China.

Published: April 2022


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Purpose: Using machine learning method to predict and judge unknown data offers opportunity to improve accuracy by exploring complex interactions between risk factors. Therefore, we evaluate the performance of machine learning (ML) algorithms and to compare them with logistic regression for predicting the risk of renal function decline (RFD) using routine clinical data.

Patients And Methods: This retrospective cohort study includes datasets from 2166 subjects, aged 35-74 years old, provided by an adult health screening follow-up program between 2010 and 2020. Seven different ML models were considered - random forest, gradient boosting, multilayer perceptron, support vector machine, K-nearest neighbors, adaptive boosting, and decision tree - and were compared with standard logistic regression. There were 24 independent variables, and the baseline estimate glomerular filtration rate (eGFR) was used as the predictive variable.

Results: A total of 2166 participants (mean age 49.2±11.2 years old, 63.3% males) were enrolled and randomly divided into a training set (n=1732) and a test set (n=434). The area under receiver operating characteristic curve (AUROC) for detecting RFD corresponding to the different models were above 0.85 during the training phase. The gradient boosting algorithms exhibited the best average prediction accuracy (AUROC: 0.914) among all algorithms validated in this study. Based on AUROC, the ML algorithms improved the RFD prediction performance, compared to logistic regression model (AUROC:0.882), except the K-nearest neighbors and decision tree algorithms (AUROC:0.854 and 0.824, respectively). However, the improvement differences with logistic regression were small (less than 4%) and nonsignificant.

Conclusion: Our results indicate that the proposed health screening dataset-based RFD prediction model using ML algorithms is readily applicable, produces validated results. But logistic regression yields as good performance as ML models to predict the risk of RFD with simple clinical predictors.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9056070PMC
http://dx.doi.org/10.2147/RMHP.S346856DOI Listing

Publication Analysis

Top Keywords

logistic regression
20
machine learning
12
health screening
12
predicting risk
8
risk renal
8
renal function
8
function decline
8
routine clinical
8
gradient boosting
8
k-nearest neighbors
8

Similar Publications

An Investigation of Hyperostosis Frontalis Interna in a Modern Anatomical Body Donor Population.

Clin Anat

September 2025

Department of Communication Disorders and Sciences, Rush University Medical Center, Chicago, Illinois, USA.

This research sought to examine the prevalence and severity of hyperostosis frontalis interna (HFI) in the Chicagoland anatomical body donor population. The study further aimed to elucidate potential demographic risk factors for HFI, including sex, age at death, and structural vulnerability index (SVI), as well as any common comorbidities, as gleaned from death certificates. HFI is an irregular bony overgrowth of the endocranial surface of the frontal bone.

View Article and Find Full Text PDF

Background And Objectives: Pollen-food allergy syndrome (PFAS) is a frequent comorbidity in individuals with hay fever. Identifying risk factors and allergen clusters can aid targeted interventions and management strategies. Objective: This study characterizes PFAS in patients with hay fever and identifies associated risk factors using the mobile health platform, AllerSearch.

View Article and Find Full Text PDF

Background: Poststroke cognitive impairment (PSCI) affects 30% to 50% of stroke survivors, severely impacting functional outcomes and quality of life. This study uses functional near-infrared spectroscopy (fNIRS) to assess task-evoked brain activation and its potential for stratifying the severity in patients with PSCI.

Method: A cross-sectional study was conducted at Nanchong Central Hospital between June 2023 and April 2024.

View Article and Find Full Text PDF

Background: Risk stratification in posterior circulation ischemic stroke (PCIS) is challenging. Although the Posterior Circulation Ischemic Stroke Outcome Score (PCISOS) was developed to address this, its utility in minor PCIS and in identifying homogeneous populations for clinical trials or treatment-responsive subgroups remains uncertain.

Methods: CHANCE-2 (Clopidogrel in High-Risk Patients With Acute Non-disabling Cerebrovascular Events-II) was a multicenter, randomized trial that enrolled patients with minor stroke or high-risk transient ischemic attack who carried CYP2C19 loss-of-function alleles.

View Article and Find Full Text PDF

Background And Aims: Dental caries in children remains a global health challenge. Fissure sealant therapy (FST) is an effective preventive measure, yet parental acceptance remains low. This study aimed to identify predictors of parental FST behavior for children aged 6-12 years in Bandar Abbas, Iran, using the health belief model (HBM).

View Article and Find Full Text PDF