98%
921
2 minutes
20
Background: Clustering analysis is fundamental in single-cell RNA sequencing (scRNA-seq) data analysis for elucidating cellular heterogeneity and diversity. Recent graph-based scRNA-seq clustering methods, particularly graph neural networks (GNNs), have significantly improved in tackling the challenges of high-dimension, high-sparsity, and frequent dropout events that lead to ambiguous cell population boundaries. However, one major challenge for GNN-based methods is their reliance on hard graph constructions derived from similarity matrices. These constructions introduce difficulties when applied to scRNA-seq data due to: (i) The simplification of intercellular relationships into binary edges (0 or 1) by applying thresholds, which restricts the capture of continuous similarity features among cells and leads to significant information loss. (ii) The presence of significant inter-cluster connections within hard graphs, which can confuse GNN methods that rely heavily on graph structures, potentially causing erroneous message propagation and biased clustering outcomes.
Results: To tackle these challenges, we introduce scSGC, a Soft Graph Clustering for single-cell RNA sequencing data, which aims to more accurately characterize continuous similarities among cells through non-binary edge weights, thereby mitigating the limitations of rigid data structures. The scSGC framework comprises three core components: (i) a zero-inflated negative binomial (ZINB)-based feature autoencoder designed to effectively handle the sparsity and dropout issues in scRNA-seq data; (ii) a dual-channel cut-informed soft graph embedding module, constructed through deep graph-cut information, capturing continuous similarities between cells while preserving the intrinsic data structures of scRNA-seq; and (iii) an optimal transport-based clustering optimization module, achieving optimal delineation of cell populations while maintaining high biological relevance.
Conclusion: By integrating dual-channel cut-informed soft graph representation learning, a ZINB-based feature autoencoder, and optimal transport-driven clustering optimization, scSGC effectively overcomes the challenges associated with traditional hard graph constructions in GNN methods. Extensive experiments across ten datasets demonstrate that scSGC outperforms 13 state-of-the-art clustering models in clustering accuracy, cell type annotation, and computational efficiency. These results highlight its substantial potential to advance scRNA-seq data analysis and deepen our understanding of cellular heterogeneity.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC12291377 | PMC |
http://dx.doi.org/10.1186/s12859-025-06231-z | DOI Listing |
Front Oncol
August 2025
Department of Imaging, Yantaishan Hospital, Yantai, Shangdong, China.
This systematic review evaluates the integration of radiomics, artificial intelligence (AI), and molecular signatures for diagnosing and prognosticating bone and soft tissue tumors (BSTTs). Following PRISMA 2020 guidelines, we analyzed 24 studies from 1,141 initial records across PubMed, Scopus, Web of Science, and Google Scholar. Our findings reveal that while radiomics-AI pipelines are well-developed for BSTT assessment - particularly using MRI (72% of studies) and CT (25%) with machine learning classifiers like random forests (42%) and CNNs (17%) - molecular data integration remains virtually absent.
View Article and Find Full Text PDFWater Res
August 2025
Centre for Water Systems, University of Exeter, Exeter EX4 4QF, United Kingdom; KWR Water Research Institute, Nieuwegein 3430 BB, The Netherlands. Electronic address:
Water distribution networks (WDNs) constitute essential urban infrastructure, yet their monitoring is hindered by limited monitoring conditions. Soft sensing methods have been applied to estimate pressure at unmonitored nodes using the latest deep learning models, however, they rely on large datasets from the same WDNs for training. There is a critical gap in pressure estimation of WDNs under realistic monitoring limitations.
View Article and Find Full Text PDFCancer Epidemiol
October 2025
School of Medicine and Dentistry, Griffith University, Gold Coast Campus, Gold Coast 4222, Australia. Electronic address:
Background: Cutaneous melanoma incidence is rising globally, and survival rates have improved significantly due to advances in early detection and treatment. As a result, the long-term health of melanoma survivors is gaining increasing clinical attention. One emerging concern is the development of second primary cancers (SPCs), which may result from shared risk factors, genetic susceptibility, or late effects of cancer treatment, including radiotherapy.
View Article and Find Full Text PDFJ Cheminform
August 2025
Bioinformatics Institute, Agency for Science, Technology and Research, 30 Biopilis Street, Singapore, 138671, Singapore.
Cyclic peptides are promising drug candidates due to their ability to modulate intracellular protein-protein interactions, a property often inaccessible to small molecules. However, their typically poor membrane permeability limits therapeutic applicability. Accurate computational prediction of permeability can accelerate the identification of cell-permeable candidates, reducing reliance on time-consuming and costly experimental screening.
View Article and Find Full Text PDFEntropy (Basel)
August 2025
School of Mathematics and Computer Application, Shangluo University, Shangluo 726000, China.
Unsupervised feature selection aims to characterize the cluster structure of original features and select the optimal subset without label guidance. However, existing methods overlook fuzzy information in the data, failing to model cluster structures between data effectively, and rely on squared error for data reconstruction, exacerbating noise impact. Therefore, a robust unsupervised feature selection algorithm based on fuzzy anchor graphs (FWFGFS) is proposed.
View Article and Find Full Text PDF