Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Background: Non-invasive determination of epidermal growth factor receptor (EGFR) mutation status is essential for selecting lung adenocarcinoma patients suitable for EGFR-tyrosine kinase inhibitors (EGFR-TKIs). This study aimed to develop and validate an online ensemble machine learning (EML) model that combines multiple machine learning (ML) models to predict the EGFR mutation status in lung adenocarcinoma.

Methods: A total of 823 lung adenocarcinoma patients with known EGFR mutation status from three medical centers were divided into a training cohort (n=556) and a validation cohort (n=267) (ChiCTR2400083082 in the WHO International Clinical Trials Registry). Five ML models incorporating clinical and radiological characteristics-random forest (RF), logistic regression (LR), support vector machine (SVM), light gradient boosting machine (LightGBM), and extreme gradient boosting (XGBoost)-along with a CT-based deep learning (DL) model were constructed to predict EGFR mutation status. Subsequently, an EML model was created by combining these models. Model performance was assessed using the area under the receiver operating characteristic curve (AUC), and the SHapley Additive exPlanation (SHAP) method was used to explain the EML model.

Results: In the training cohort, the AUCs for the RF, LR, SVM, LightGBM, XGBoost, DL, and EML were 0.851, 0.790, 0.810, 0.835, 0.853, 0.884, and 0.928, respectively. In the validation cohort, the AUCs for the RF, LR, SVM, LightGBM, XGBoost, DL, and EML were 0.753, 0.744, 0.732, 0.749, 0.751, 0.754, and 0.813, respectively. The Delong test indicated that the AUC of the EML model showed outstanding performance compared to the single models in both the training and validation cohorts. Decision curve analysis indicated that the EML model provided a clinically useful net benefit, and calibration curves showed good agreement. SHAP analysis identified predictive characteristics ranked by their contribution to the EML model: DL score, long-axis diameter, smoking history, pleural retraction, texture, vascular convergence, sex, air bronchogram, and bubblelike lucency. These characteristics were further used to develop an online web tool.

Conclusions: The EML model could serve as a non-invasive and accurate method for predicting EGFR mutation status in lung adenocarcinoma.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC12337052PMC
http://dx.doi.org/10.21037/tlcr-2025-237DOI Listing

Publication Analysis

Top Keywords

mutation status
24
eml model
24
egfr mutation
20
lung adenocarcinoma
16
machine learning
12
status lung
12
model
9
eml
9
ensemble machine
8
learning model
8

Similar Publications

Background: Trials of neoadjuvant chemoimmunotherapy (chemoIO) have changed the standard of care for resectable nonsmall cell lung cancer (NSCLC). This study characterizes the outcomes of off-trial patients who received treatment with neoadjuvant chemoIO.

Methods: The authors analyzed records of patients with stage IB-III NSCLC who received neoadjuvant chemoIO with an intent to proceed to surgical resection at three US academic institutions.

View Article and Find Full Text PDF

Background: This study aims to gain further insights into the characteristics of the rare subtype of acute myeloid leukemia (AML) with BCR∷ABL by analyzing laboratory detection results of various gene mutations, such as NPM1.

Methods: Laboratory detection results of multiple gene missense mutations, including NPM1, were analyzed in a case of primary AML with BCR∷ABL.

Results: The patient exhibited morphological features of acute leukemia in the bone marrow.

View Article and Find Full Text PDF

Background: Rhabdomyosarcoma (RMS) is the most common soft tissue sarcoma of childhood. For stratification purposes, rhabdomyosarcoma is classified into fusion-positive RMS (alveolar rhabdomyosarcoma) and fusion-negative RMS (embryonal or spindle cell/sclerosing, FN-RMS) subtypes according to its fusion status. This study aims to highlight the pathologic and molecular characteristics of a cohort of FN-RMS using a targeted NGS RNA-Seq assay.

View Article and Find Full Text PDF

Since its discovery, the novel severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), the causative agent of coronavirus disease 2019 (COVID-19), has become the epicenter of public health concern. This was mainly attributed to the complexity of COVID-19 that resulted in variable disease progression with some developing asymptomatic infections, some suffering mild to moderate infections that resolved without the need for hospitalizations, and a few infected persons developing severe infections that required intensive care unit (ICU) admission and mechanical ventilation. The COVID-19 pandemic spread globally, affecting billions of people and killing millions.

View Article and Find Full Text PDF

Artificial Intelligence Predicts GBA1 Mutated Status in Parkinson's Disease Patients.

Mov Disord Clin Pract

September 2025

Neurology Unit, Neuromotor and Rehabilitation Department, Azienda USL-IRCCS di Reggio Emilia, Reggio Emilia, Italy.

Background: GBA1 variants are the major genetic risk factor for Parkinson's Disease (PD) and account for 5-30% of PD cases depending on the population and age at onset of the disease.

Objectives: The aim of this study was to assess whether Artificial Intelligence (AI) could predict GBA1-mutated genotype in PD (GBA1-PD). Particularly, the main objective was to identify a Machine Learning (ML) model capable of accurately providing a pre-test estimate of GBA1-mutated status, relying on the clinical and demographic variables with the highest predictive value.

View Article and Find Full Text PDF