Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Background And Objective: Spread through air spaces (STAS) is an important factor in determining the aggressiveness and recurrence risk of lung cancer, especially in early-stage adenocarcinoma. Preoperative identification of STAS is crucial for optimizing surgical strategies. This study aimed to develop and validate machine learning models to predict the presence of STAS using preoperative clinical, radiological, and pathological data in lung cancer patients.

Patients And Methods: A retrospective analysis was conducted on 1,290 lung cancer patients from two hospitals: Qilu Hospital of Shandong University and Qianfoshan Hospital. Data from 1,174 patients from Qilu Hospital were used for model training and internal validation, while 116 patients from Qianfoshan Hospital were used for external validation. Thirteen key variables, identified using least absolute shrinkage and selection operator (LASSO) regression, were included in the construction of eight machine learning models: decision tree (DT), random forest (RF), regularized support vector machine (RSVM), logistic regression (LR), extreme gradient boosting (XGBoost), multilayer perceptron (MLP), light gradient boosting machine (LightGBM), and K-nearest neighbors (KNN). Model performance was evaluated using receiver operating characteristic (ROC) curves, area under the curve (AUC), calibration curves, decision curve analysis (DCA), and SHapley additive explanations (SHAP) plots.

Results: The XGBoost model achieved the best performance with an AUC of 0.931 (95% CI: 0.897-0.964) in the internal validation cohort and 0.904 (95% CI: 0.835-0.973) in the external validation cohort, outperforming other models. DCA demonstrated the clinical utility of XGBoost, LightGBM, and RF models, which provided superior net benefit across various threshold probabilities. SHAP analysis revealed that the most influential factors in predicting STAS were carcinoembryonic antigen (CEA), forced expiratory volume in one second (FEV1), consolidation-to-tumor ratio (CTR), maximal voluntary ventilation (MVV), and CT value.

Conclusion: The XGBoost model demonstrated robust predictive performance for preoperative identification of STAS in lung cancer patients, showing high generalizability in external validation. These findings suggest that machine learning-based predictions could guide clinical decision-making and improve surgical outcomes by identifying high-risk patients for more aggressive treatment strategies.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11887330PMC
http://dx.doi.org/10.1186/s12885-025-13783-zDOI Listing

Publication Analysis

Top Keywords

lung cancer
16
machine learning
12
external validation
12
stas lung
8
preoperative identification
8
identification stas
8
learning models
8
cancer patients
8
qilu hospital
8
qianfoshan hospital
8

Similar Publications

The MET receptor tyrosine kinase is a pivotal regulator of cellular survival, motility, and proliferation. Mutations leading to skipping of exon 14 (METΔex14) within the juxtamembrane domain of MET impair receptor degradation and prolong oncogenic signaling, contributing significantly to tumor progression across multiple cancer types. METΔex14 mutations are associated with aggressive clinical behavior, therapeutic resistance, and poor outcomes.

View Article and Find Full Text PDF

MicroRNAs (miRNAs) are critical regulators of gene expression in cancer biology, yet their spatial dynamics within tumor microenvironments (TMEs) remain underexplored due to technical limitations in current spatial transcriptomics (ST) technologies. To address this gap, we present STmiR, a novel XGBoost-based framework for spatially resolved miRNA activity prediction. STmiR integrates bulk RNA-seq data (TCGA and CCLE) with spatial transcriptomics profiles to model nonlinear miRNA-mRNA interactions, achieving high predictive accuracy (Spearman's ρ > 0.

View Article and Find Full Text PDF

Heart failure (HF) and lung cancer (LC) often coexist, yet their shared molecular mechanisms are unclear. We analyzed transcriptome data from the NCBI Gene Expression Omnibus (GEO) database (GSE141910, GSE57338) to identify 346 HF‑related differentially expressed genes (DEGs), then combined weighted gene co-expression network analysis (WGCNA) pinpointed 70 hub candidates. Further screening of these 70 hub candidates in TCGA lung cancer cohorts via LASSO, Random Forest, and multivariate Cox regression suggested CYP4B1 as the only independent prognostic marker.

View Article and Find Full Text PDF

Volumetric modulated arc therapy (VMAT) for lung cancer involves complex multileaf collimator (MLC) motion, which increases sensitivity to interplay effects with tumour motion. Current dynamic conformal arc methods address this issue but may limit the achievable dose distribution optimisation compared with standard VMAT. This study examined the clinical utility of a VMAT technique with monitor unit limits (VMATliMU) to mimic conformal arc delivery and reduce interplay effects while maintaining plan quality.

View Article and Find Full Text PDF

Importance: Patients with advanced cancer frequently receive broad-spectrum antibiotics, but changing use patterns across the end-of-life trajectory remain poorly understood.

Objective: To describe the patterns of broad-spectrum antibiotic use across defined end-of-life intervals in patients with advanced cancer.

Design, Setting, And Participants: This nationwide, population-based, retrospective cohort study used data from the South Korean National Health Insurance Service database to examine broad-spectrum antibiotic use among patients with advanced cancer who died between July 1, 2002, and December 31, 2021.

View Article and Find Full Text PDF