Interpretable machine learning driven biomarker identification and validation for prostate cancer.

Transl Androl Urol

Department of Surgery, The Second Affiliated Hospital of Chongqing Medical University, Chongqing Medical University, Chongqing, China.

Published: June 2025

Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

Background: Prostate cancer (PCa), a common malignancy among men globally, requires the identification of biomarkers for early diagnosis and predicting progression. This study aimed to identify the key genes involved in the occurrence and development of PCa.

Methods: Leveraging data from the Gene Expression Omnibus (GEO) database, this study integrated multi-chip datasets, conducting differential expression analysis and enrichment analysis to pinpoint PCa-related genes. Subsequently, machine learning models were constructed using least absolute shrinkage and selection operator (LASSO) regression, support vector machine (SVM), and random forest (RF) methods. The optimal model was selected for further study and the contribution of related genes was explained using SHapley Additive exPlanations (SHAP) analysis. Furthermore, gene set enrichment analysis (GSEA) and immune cell infiltration analysis were utilized to uncover the underlying molecular mechanisms.

Results: In this study, 222 differentially expressed genes (DEGs) were identified and found to be enriched in functions and pathways potentially associated with PCa. Using multiple machine learning models, eight PCa-related core genes (, , , , , , , and ) were identified. The most accurate RF model was selected for further study with SHAP analysis, which also revealed the contribution of the above genes. GSEA and immune cell infiltration analysis uncovered distinctions between PCa and normal tissues.

Conclusions: This study offered potential biomarkers and a theoretical basis for the diagnosis and treatment for PCa.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC12271943	PMC
http://dx.doi.org/10.21037/tau-2025-242	DOI Listing

Publication Analysis

Top Keywords

machine learning

prostate cancer

enrichment analysis

learning models

model selected

selected study

contribution genes

shap analysis

gsea immune

immune cell

Similar Publications

Letter to editor about "Utilizing explainable machine learning for progression-free survival prediction in high-grade serous ovarian cancer: insights from a prospective cohort study".

Int J Surg

September 2025

Shenzhen Traditional Chinese Medicine Hospital, The Fourth Clinical Medical College of Guangzhou University of Chinese Medicine, Shenzhen, People's Republic of China.

Mengying Bai , Wenbo Wu , Yuehui Zheng

View Article and Find Full Text PDF

Similar Publications

Unveiling molecular signatures for precision drug design: machine learning insights from trypanothione reductase, PKC-θ, and CB1.

Mol Divers

September 2025

Department of Biotechnology, National Institute of Technology Raipur, Raipur, Chhattisgarh, 492001, India.

Sunil Sahu , Adarsh Anmol , Tushar Nishad , Satya Eswari Jujjavarapu

Traditional drug discovery methods like high-throughput screening and molecular docking are slow and costly. This study introduces a machine learning framework to predict bioactivity (pIC₅₀) and identify key molecular properties and structural features for targeting Trypanothione reductase (TR), Protein kinase C theta (PKC-θ), and Cannabinoid receptor 1 (CB1) using data from the ChEMBL database. Molecular fingerprints, generated via PaDEL-Descriptor and RDKit, encoded structural features as binary vectors.

View Article and Find Full Text PDF

Similar Publications

Oral bioavailability property prediction based on task similarity transfer learning.

Mol Divers

September 2025

Laboratory of Molecular Design and Drug Discovery, School of Science, China Pharmaceutical University, Nanjing, 211198, China.

Chen Zeng , Chengcheng Xu , Yingxu Liu , Yunya Jiang , Lidan Zheng

Drug absorption significantly influences pharmacokinetics. Accurately predicting human oral bioavailability (HOB) is essential for optimizing drug candidates and improving clinical success rates. The traditional method based on experiment is a common way to obtain HOB, but the experimental method is time-consuming and costly.

View Article and Find Full Text PDF

Similar Publications

Decoding binocular color differences via EEG signals: linking ERP dynamics to chromatic disparity in CIELAB space.

Exp Brain Res

September 2025

School of Information Science and Technology, Yunnan Normal University, Kunming, 650500, China.

Famiao Mou , Zhineng Lv , Xuesong Jin , Jijun Pan , Lijun Yun

This study explores how differences in colors presented separately to each eye (binocular color differences) can be identified through EEG signals, a method of recording electrical activity from the brain. Four distinct levels of green-red color differences, defined in the CIELAB color space with constant luminance and chroma, are investigated in this study. Analysis of Event-Related Potentials (ERPs) revealed a significant decrease in the amplitude of the P300 component as binocular color differences increased, suggesting a measurable brain response to these differences.

View Article and Find Full Text PDF

Similar Publications

Using Medication Dispensation Data to Identify Clusters with Similar Prescribing Patterns in Older Adults Living with Dementia.

Drugs Aging

September 2025

Dalla Lana School of Public Health, University of Toronto, V1 06, 2075 Bayview Avenue, Toronto, ON, M4N 3M5, Canada.

Abby Emdin , Therese A Stukel , Jennifer Bethell , Xuesong Wang , Andrea Iaboni

Background And Objectives: Older adults living with dementia are a heterogeneous group, which can make studying optimal medication management challenging. Unsupervised machine learning is a group of computing methods that rely on unlabeled data-that is, where the algorithm itself is discovering patterns without the need for researchers to label the data with a known outcome. These methods may help us to better understand complex prescribing patterns in this population.

View Article and Find Full Text PDF

Similar Publications