Topic modeling is a popular technique in machine learning and natural language processing, where a corpus of text documents is classified into themes or topics using word frequency analysis. This approach has proven successful in various biological data analysis applications, such as predicting cancer subtypes with high accuracy and identifying genes, enhancers, and stable cell types simultaneously from sparse single-cell epigenomics data. The advantage of using a topic model is that it not only serves as a clustering algorithm, but it can also explain clustering results by providing word probability distributions over topics. Our study proposes a novel topic modeling approach for clustering single cells and detecting topics (gene signatures) in single-cell datasets that measure multiple omics simultaneously. We applied this approach to examine the transcriptional heterogeneity of luminal and triple-negative breast cancer cells using patient-derived xenograft models with acquired resistance to chemotherapy and targeted therapy. Through this approach, we identified protein-coding genes and long non-coding RNAs (lncRNAs) that group thousands of cells into biologically similar clusters, accurately distinguishing drug-sensitive and -resistant breast cancer types. In comparison to standard state-of-the-art clustering analyses, our approach offers an optimal partitioning of genes into topics and cells into clusters simultaneously, producing easily interpretable clustering outcomes. Additionally, we demonstrate that an integrative clustering approach, which combines the information from mRNAs and lncRNAs treated as disjoint omics layers, enhances the accuracy of cell classification.
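As a rough illustration of the idea described above, the sketch below treats cells as documents and genes as words, then fits a generic latent Dirichlet allocation (LDA) model by collapsed Gibbs sampling. This is an assumption made for illustration only: the abstract does not name the specific topic model the authors use, and this toy version handles a single omics layer rather than the integrative mRNA/lncRNA setting. The count matrix, the function name `fit_lda`, and the hyperparameters `alpha` and `beta` are all hypothetical.

```python
import random

def fit_lda(counts, n_topics, n_iter=200, alpha=0.1, beta=0.01, seed=0):
    """Collapsed Gibbs sampling for LDA on a cells-by-genes count matrix.

    counts[c][g] is the integer count of gene g in cell c.
    Returns (cell_topic, topic_gene), each a list of probability rows.
    """
    rng = random.Random(seed)
    n_cells, n_genes = len(counts), len(counts[0])
    # Expand the matrix into tokens: one (cell, gene) pair per observed count.
    tokens = [(c, g) for c in range(n_cells) for g in range(n_genes)
              for _ in range(counts[c][g])]
    z = [rng.randrange(n_topics) for _ in tokens]  # random initial topics
    n_ck = [[0] * n_topics for _ in range(n_cells)]  # cell-topic counts
    n_kg = [[0] * n_genes for _ in range(n_topics)]  # topic-gene counts
    n_k = [0] * n_topics                             # tokens per topic
    for (c, g), k in zip(tokens, z):
        n_ck[c][k] += 1
        n_kg[k][g] += 1
        n_k[k] += 1
    for _ in range(n_iter):
        for i, (c, g) in enumerate(tokens):
            k = z[i]
            # Remove the token, then resample its topic given all others.
            n_ck[c][k] -= 1
            n_kg[k][g] -= 1
            n_k[k] -= 1
            weights = [(n_ck[c][t] + alpha) * (n_kg[t][g] + beta)
                       / (n_k[t] + n_genes * beta) for t in range(n_topics)]
            k = rng.choices(range(n_topics), weights=weights)[0]
            z[i] = k
            n_ck[c][k] += 1
            n_kg[k][g] += 1
            n_k[k] += 1
    # Smoothed estimates: topic mixture per cell, gene distribution per topic.
    cell_topic = [[(n_ck[c][k] + alpha) / (sum(n_ck[c]) + n_topics * alpha)
                   for k in range(n_topics)] for c in range(n_cells)]
    topic_gene = [[(n_kg[k][g] + beta) / (n_k[k] + n_genes * beta)
                   for g in range(n_genes)] for k in range(n_topics)]
    return cell_topic, topic_gene

# Hypothetical toy data: 6 cells x 4 genes, not taken from the study.
counts = [
    [5, 4, 0, 0],
    [6, 3, 0, 1],
    [4, 5, 1, 0],
    [0, 0, 5, 6],
    [1, 0, 4, 5],
    [0, 1, 6, 4],
]
cell_topic, topic_gene = fit_lda(counts, n_topics=2)
# Cluster each cell by its most probable topic.
clusters = [max(range(2), key=lambda k: row[k]) for row in cell_topic]
print(clusters)
for k, row in enumerate(topic_gene):
    print(k, [round(p, 2) for p in row])
```

Each row of `topic_gene` plays the role of a gene signature: it is the word probability distribution that makes the resulting clusters interpretable, while the rows of `cell_topic` give the cluster assignment.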
Source | Link |
---|---|
PMC | http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11011054 |
DOI | http://dx.doi.org/10.3390/cancers16071350 |