Identifying individuals at risk of post-stroke depression: Development and validation of a predictive model.

Saudi Med J

From the Department of Basic Medical Sciences, Taibah University, Al-Madinah Al-Munawarah, Kingdom of Saudi Arabia.

Published: May 2025


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Objectives: To identify the factors associated with post-stroke depression (PSD) and develop a machine learning predictive model using a large dataset, considering sociodemographic, lifestyle, and clinical factors.

Methods: Our 2025 study used data from the 2023 Behavioral Risk Factor Surveillance System, released in September 2024. Data processing was carried out using Google Colab and Python. We carried out descriptive statistics, logistic regression, and feature importance analyses (mutual information and adjusted mutual information). A total of 4 machine-learning models were trained and evaluated: random forest, decision tree, gradient boosting, and logistic regression. Model performance was assessed using the accuracy, precision, recall, harmonic mean of precision and recall (F1-score), and area under the curve - receiver operating characteristic (AUC-ROC). The best-performing model was fine-tuned using GridSearchCV with 5-fold cross-validation.

Results: Increasing age, male gender, being married, higher income, and physical activity were associated with lower odds of PSD. Obesity, smoking, diabetes, and high cholesterol are associated with increased odds of PSD. Age and gender were the most informative features for predicting the PSD. Random forest demonstrated the best performance for predicting PSD (accuracy=0.73, precision=0.71, recall=0.77, F1-score=0.74, and AUC-ROC=0.81), which was further improved by hyperparameter optimization.

Conclusion: Post-stroke depression's complex etiology involves sociodemographic, lifestyle, and clinical factors, notably age and gender. A random forest model effectively predicts PSD, highlighting the need for comprehensive assessment, early intervention, and management of modifiable risks (obesity, smoking, and inactivity) to improve stroke survivors' outcomes.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC12074046PMC
http://dx.doi.org/10.15537/smj.2025.46.5.20250080DOI Listing

Publication Analysis

Top Keywords

random forest
12
post-stroke depression
8
predictive model
8
sociodemographic lifestyle
8
lifestyle clinical
8
logistic regression
8
precision recall
8
odds psd
8
obesity smoking
8
age gender
8

Similar Publications

Estimation of Brachial-Ankle Pulse Wave Velocity With Hierarchical Regression Model From Wrist Photoplethysmography and Electrocardiographic Signals: Method Design.

JMIR Biomed Eng

August 2025

Cardiovascular Center and Divisions of Cardiology and Hospital Medicine, Department of Internal Medicine, National Taiwan University Hospital, No.7, Chung Shan S Rd, Taipei, 100225, Taiwan, 886 2-2312-3456.

Background: Photoplethysmography (PPG) signals captured by wearable devices can provide vascular age information and support pervasive and long-term monitoring of personal health condition.

Objective: In this study, we aimed to estimate brachial-ankle pulse wave velocity (baPWV) from wrist PPG and electrocardiography (ECG) from smartwatch.

Methods: A total of 914 wrist PPG and ECG sequences and 278 baPWV measurements were collected via the smartwatch from 80 men and 82 women with average age of 63.

View Article and Find Full Text PDF

Optimization of Nitrogen Application and Root Biomass Modulates 2-Acetyl-1-Pyrroline Biosynthesis in Fragrant Rice.

Physiol Plant

September 2025

State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, College of Agriculture, South China Agricultural University, Guangzhou, China.

The rice root system mediates nutrient uptake while adapting to tillage, management, and environmental changes. While optimized nitrogen (N) supply is known to enhance 2-acetyl-1-pyrroline (2-AP) biosynthesis in fragrant rice, the underlying mechanisms linking nitrogen availability, root development, and their combined effects on physiological processes and aroma formation remain unclear. To address this knowledge gap, we conducted a pot experiment employing two fragrant rice cultivars (Huahangxiangyinzhen and Qingxiangyou19xiang) under three nitrogen regimes (0, 1.

View Article and Find Full Text PDF

Background: A clear understanding of minimal clinically important difference (MCID) and substantial clinical benefit (SCB) is essential for effectively implementing patient-reported outcome measurements (PROMs) as a performance measure for total knee arthroplasty (TKA). Since not achieving MCID and SCB may reflect suboptimal surgical benefit, the primary aim of this study was to use machine learning to predict patients who may not achieve the threshold-based outcomes (i.e.

View Article and Find Full Text PDF

Background: Variants of uncertain significance (VUS) represent a major diagnostic challenge in the interpretation of genetic testing results, particularly in the context of inborn errors of immunity such as severe combined immunodeficiency (SCID). The inconsistency among computational prediction tools often necessitates expensive and time-consuming wet-lab analyses.

Objective: This study aimed to develop disease-specific, multi-class machine learning models using in silico scores to classify SCID-associated genetic variants and improve the interpretation of VUS.

View Article and Find Full Text PDF

Purpose: Accurate prediction of human clearance (CL) is essential in early drug development. Single Species Scaling (SSS) using rat pharmacokinetic (PK) data, particularly with unbound plasma fraction (f), is widely used. However, its accuracy declines for compounds with extremely low f, and no systematic method has addressed this limitation.

View Article and Find Full Text PDF