Machine learning identification of key genes in cardioembolic stroke and atherosclerosis: their association with pan-cancer and immune cells.

Eur J Med Res

Henan Digital Image and Intelligent Processing of Big Data Engineering Research Center, Computer Science and Technology, Nanyang Normal University, Nanyang, 473000, China.

Published: July 2025


Article Abstract

Background: Cardioembolic stroke (CS) and atherosclerosis (AS) are closely related diseases. Ferroptosis, a novel form of programmed cell death, may play a key role in both. However, the pathophysiological mechanisms underlying their coexistence remain unclear. This study aims to identify the hub genes and pathways involved in the development of both diseases.

Methods: CS (GSE58294) and AS (GSE20129) datasets were obtained from the Gene Expression Omnibus database, and a ferroptosis (FR)-related gene dataset was downloaded from the FR database. Differentially expressed genes (DEGs) between healthy individuals and patients diagnosed with CS or AS were identified. Gene Ontology and Kyoto Encyclopedia of Genes and Genomes analyses were performed to explore the functions of common FR-related DEGs (FRDEGs). Two machine learning algorithms, Least Absolute Shrinkage and Selection Operator (LASSO) regression and Support Vector Machine Recursive Feature Elimination (SVM-RFE), were used to screen for overlapping FRDEGs in CS and AS. To validate the prediction results, blood samples were collected from healthy controls and patients with CS and AS for quantitative real-time PCR. The correlation between biomarkers and clinical features was also evaluated.
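The overlap steps described in the Methods can be sketched as plain set intersections. This is a minimal illustration, not the authors' pipeline: the function names are hypothetical, the gene identifiers are placeholders rather than study results, and the lists passed to `consensus_features` are assumed to come from separately fitted LASSO and SVM-RFE selectors.

```python
def common_frdegs(degs_cs, degs_as, fr_genes):
    """Intersect each disease's DEGs with the ferroptosis gene set,
    then intersect the two results to get the shared FRDEGs."""
    fr = set(fr_genes)
    frdegs_cs = set(degs_cs) & fr   # FR-related DEGs in the CS dataset
    frdegs_as = set(degs_as) & fr   # FR-related DEGs in the AS dataset
    return frdegs_cs, frdegs_as, frdegs_cs & frdegs_as


def consensus_features(lasso_selected, svm_rfe_selected):
    """Keep only the genes retained by both feature selectors."""
    return sorted(set(lasso_selected) & set(svm_rfe_selected))


# Placeholder identifiers only (assumption: not taken from the study).
degs_cs = ["GENE_A", "GENE_B", "GENE_C", "GENE_D"]
degs_as = ["GENE_B", "GENE_C", "GENE_E"]
fr_genes = ["GENE_B", "GENE_C", "GENE_D", "GENE_F"]

frdegs_cs, frdegs_as, shared = common_frdegs(degs_cs, degs_as, fr_genes)
hub_candidates = consensus_features(["GENE_B", "GENE_C"], ["GENE_C", "GENE_D"])
```

Taking the intersection of the two selectors is a conservative choice: a gene survives only if both methods retain it.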

Results: A total of 69 and 39 FRDEGs were identified in CS and AS, respectively. The hub genes, CIRBP, CREB5, MAPK14, PEBP1, and PTGS2, were identified using multiple methods. The area under the curve was > 0.7 for both models constructed using CS and AS datasets. A strong correlation was observed between neutrophil levels and expression of the hub genes. Additionally, several cancer types showed elevated expression of these hub genes compared to normal tissues.
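For readers unfamiliar with the area under the curve reported above, it can be read as the probability that a randomly chosen patient receives a higher model score than a randomly chosen control. The sketch below computes it by direct pairwise comparison. The function name and the scores are illustrative assumptions, not data from the study.

```python
def roc_auc(scores_pos, scores_neg):
    """AUC as the fraction of (case, control) pairs in which the case
    has the higher score; tied pairs count as half."""
    if not scores_pos or not scores_neg:
        raise ValueError("both groups need at least one score")
    wins = 0.0
    for p in scores_pos:
        for n in scores_neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(scores_pos) * len(scores_neg))


# Hypothetical model scores (assumption: not from the study).
case_scores = [0.9, 0.8, 0.4]
control_scores = [0.7, 0.3, 0.2]
auc = roc_auc(case_scores, control_scores)  # 8 of 9 pairs ranked correctly
```

A value of 0.5 corresponds to chance-level ranking and 1.0 to perfect separation, so a threshold such as 0.7 indicates moderate discriminative ability.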

Conclusions: The diagnostic model based on the FR-related gene PTGS2 demonstrated significant and specific diagnostic value for CS and AS, reflecting the status of blood lymphocytes, monocytes, and neutrophils. The pan-cancer analysis suggested that PTGS2 could serve as a new clinical prognostic marker and therapeutic target across various cancer types. This model may aid in the diagnosis of CS and AS, and the findings offer new insights into the pathogenesis of these diseases.

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC12288349
DOI: http://dx.doi.org/10.1186/s40001-025-02940-6

Similar Publications

Multi-Omics and Clinical Validation Identify Key Glycolysis- and Immune-Related Genes in Sepsis.

Int J Gen Med

September 2025

Department of Geriatrics, Sichuan Provincial People's Hospital, University of Electronic Science and Technology of China, Chengdu, 610072, People's Republic of China.

Background: Sepsis is characterized by profound immune and metabolic perturbations, with glycolysis serving as a pivotal modulator of immune responses. However, the molecular mechanisms linking glycolytic reprogramming to immune dysfunction remain poorly defined.

Methods: Transcriptomic profiles of sepsis were obtained from the Gene Expression Omnibus.

Introduction: While nucleus pulposus cell (NPC) degeneration is a primary driver of intervertebral disc degeneration (IVDD), the cellular heterogeneity and molecular interactions underlying NPC degeneration remain poorly characterized. Previous studies have shown that EGFR signaling plays a significant role in NPC differentiation and collagen matrix production. Consequently, this study aims to identify the critical downstream regulatory molecule of EGFR in the process of NPC degeneration.

Background: Synaptic dysfunction and synapse loss occur in Alzheimer's disease (AD). The current study aimed to identify synaptic-related genes with diagnostic potential for AD.

Methods: Differentially expressed genes (DEGs) were intersected with the genes of the phenotype-associated module selected through weighted gene co-expression network analysis (WGCNA) and with synaptic-related genes.

Climatic challenges increasingly threaten global food security, necessitating crops with enhanced multi-stress resilience. Through systematic transcriptomic analysis of 100 wheat genotypes under heat, drought, cold, and salt stress, we identified 3237 differentially expressed genes (DEGs) enriched in key stress-response pathways. Core transcription factors and two functional modules governing abiotic tolerance were characterized.

Background: Differentially expressed genes (DEGs) are known to provide important information on disease mechanisms and potential therapeutic targets. Traditional Chinese medicine (TCM) offers a large reservoir of bioactive compounds that could modulate these targets. This study investigates biomarkers in sepsis and COVID-19 using gene expression analysis and molecular modeling validation of TCM-derived candidate compounds targeting key DEGs associated with sepsis.
