Interpretation of the Transcriptome-Based Signature of Tumor-Initiating Cells, the Core of Cancer Development, and the Construction of a Machine Learning-Based Classifier.

Cells

Laboratory of Bioimaging Probe Development, Singapore Bioimaging Consortium, Agency for Science, Technology and Research (A*STAR), Singapore 138667, Singapore.

Published: August 2025


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Tumor-initiating cells (TICs) constitute a subpopulation of cancer cells with stem-like properties contributing to tumorigenesis, progression, recurrence, and therapeutic resistance. Despite their biological importance, their molecular signatures that distinguish them from non-TICs remain incompletely characterized. This study aimed to comprehensively analyze transcriptomic differences between TICs and non-TICs, identify TIC-specific gene expression patterns, and construct a machine learning-based classifier that could accurately predict TIC status. RNA sequencing data were obtained from four human cell lines representing TIC (TS10 and TS32) and non-TIC (32A and Epi). Transcriptomic profiles were analyzed via principal component, hierarchical clustering, and differential expression analysis. Gene-Ontology and Kyoto-Encyclopedia of Genes and Genomes pathway enrichment analyses were conducted for functional interpretation. A logistic-regression model was trained on differentially expressed genes to predict TIC status. Model performance was validated using synthetic data and external projection. TICs exhibited distinct transcriptomic signatures, including enrichment of non-coding RNAs (e.g., MIR4737 and SNORD19) and selective upregulation of metabolic transporters (e.g., SLC25A1, SLC16A1, and FASN). Functional pathway analysis revealed TIC-specific activation of oxidative phosphorylation, PI3K-Akt signaling, and ribosome-related processes. The logistic-regression model achieved perfect classification (area under the curve of 1.00), and its key features indicated metabolic and translational reprogramming unique to TICs. Transcriptomic state-space embedding analysis suggested reversible transitions between TIC and non-TIC states driven by transcriptional and epigenetic regulators. This study reveals a unique transcriptomic landscape defining TICs and establishes a highly accurate machine learning-based TIC classifier. These findings enhance our understanding of TIC biology and show promising strategies for TIC-targeted diagnostics and therapeutic interventions.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC12384272PMC
http://dx.doi.org/10.3390/cells14161255DOI Listing

Publication Analysis

Top Keywords

machine learning-based
12
tumor-initiating cells
8
learning-based classifier
8
predict tic
8
tic status
8
logistic-regression model
8
tic
6
tics
5
transcriptomic
5
interpretation transcriptome-based
4

Similar Publications

Correction to: Comment on "Machine learning-based forest fire susceptibility mapping of Southern Mizoram, a part of Indo-Burma Biodiversity Hotspot".

Environ Sci Pollut Res Int

September 2025

Department of Research Analytics, Saveetha Dental College and Hospitals, Saveetha Institute of Medical and Technical Sciences, Saveetha University, Chennai, India.

View Article and Find Full Text PDF

PERC: a suite of software tools for the curation of cryoEM data with application to simulation, modeling and machine learning.

Acta Crystallogr F Struct Biol Commun

October 2025

Science and Technology Facilities Council, Research Complex at Harwell, Didcot OX11 0FA, United Kingdom.

Ease of access to data, tools and models expedites scientific research. In structural biology there are now numerous open repositories of experimental and simulated data sets. Being able to easily access and utilize these is crucial to allow researchers to make optimal use of their research effort.

View Article and Find Full Text PDF

Machine learning-based analysis of the impact of 5' untranslated region on protein expression.

Nucleic Acids Res

September 2025

School of Biomedical and Pharmaceutical Sciences, Guangdong University of Technology, No. 100 Waihuanxi Road, Guangzhou 510006, China.

The 5' untranslated region (5'UTR) plays a crucial regulatory role in messenger RNA (mRNA), with modified 5'UTRs extensively utilized in vaccine production, gene therapy, etc. Nevertheless, manually optimizing 5'UTRs may encounter difficulties in balancing the effects of various cis-elements. Consequently, multiple 5'UTR libraries have been created, and machine learning models have been employed to analyze and predict translation efficiency (TE) and protein expression, providing insights into critical regulatory features.

View Article and Find Full Text PDF

Purpose: To determine the accuracy of a new machine learning-based open-source IOL formula (PEARLS-DGS) in 100 patients who underwent uncomplicated cataract surgery and had a history of laser refractive surgery for myopic defects.

Methods: The setting for this retrospective study was HUMANITAS Research Hospital, Milan, Italy. Data from 100 patients with a history of photorefractive keratectomy or laser in situ keratomileusis were retrospectively analyzed to assess the accuracy of the formula.

View Article and Find Full Text PDF