TransAnno-Net: A Deep Learning Framework for Accurate Cell Type Annotation of Mouse Lung Tissue Using Self-supervised Pretraining.

Comput Methods Programs Biomed

School of Computer Science and Engineering, Hubei Key Laboratory of Intelligent Robot, Wuhan Institute of Technology, Wuhan, 430205, Hubei, PR China. Electronic address:

Published: July 2025


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Background: Single-cell RNA sequencing (scRNA-seq) has become a significant tool for addressing complex issuess in the field of biology. In the context of scRNA-seq analysis, it is imperative to accurately determine the type of each cell. However, conventional supervised or semi-supervised methodologies are contingent on expert labels and incur substantial labeling costs, In contrast self-supervised pre-training strategies leverage unlabeled data during the pre-training phase and utilise a limited amount of labeled data in the fine-tuning phase, thereby greatly reducing labor costs. Furthermore, the fine-tuning does not need to learn the feature representations from scratch, enhancing the efficiency and transferability of the model.

Methods: The proposed methodology is outlined below. The deep learning framework, TransAnno-Net, is based on transfer learning and a Transformer architecture. It has been designed for efficient and accurate cell type annotations in large-scale scRNA-seq datasets of mouse lung organs. Specifically, TransAnno-Net is pre-trained on the scRNA-seq lung data of approximately 100,000 cells to acquire gene-gene similarities via self-supervised learning. It is then migrated to a relatively small number of datasets to fine-tune specific cell type annotation tasks. To address the issue of imbalance in cell types commonly observed in scRNA-seq data, we applied a random oversampling technique is applied to the fine-tuned dataset. This is done to mitigate the impact of distributional imbalance on the annotation outcomes.

Results: The experimental findings demonstrate that TransAnno-Net exhibits superior performance with an AUC of 0.979, 0.901, and 0.982, respectively, on three mouse lung datasets, outperforming eight state-of-the-art (SOTA) methods. In addition, TransAnno-Net demonstrates robust performance on cross-organ, cross-platform datasets, and is competitive with the fully supervised learning-based method.

Conclusion: The TransAnno-Net method is a highly effective cross-platform and cross-data set single-cell type annotation method for mouse lung tissues and supports cross-organ cell type annotation. This approach is expected to enhance the efficiency of research on the biological mechanisms of complex biological systems and diseases.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.cmpb.2025.108809DOI Listing

Publication Analysis

Top Keywords

cell type
16
type annotation
16
mouse lung
16
deep learning
8
learning framework
8
accurate cell
8
transanno-net
6
cell
6
type
6
annotation
5

Similar Publications

Background: Alzheimer's disease (AD) patients and animal models exhibit an altered gut microbiome that is associated with pathological changes in the brain. Intestinal miRNA enters bacteria and regulates bacterial metabolism and proliferation. This study aimed to investigate whether the manipulation of miRNA could alter the gut microbiome and AD pathologies.

View Article and Find Full Text PDF

Background: Gastric cancer is one of the most common cancers worldwide, with its prognosis influenced by factors such as tumor clinical stage, histological type, and the patient's overall health. Recent studies highlight the critical role of lymphatic endothelial cells (LECs) in the tumor microenvironment. Perturbations in LEC function in gastric cancer, marked by aberrant activation or damage, disrupt lymphatic fluid dynamics and impede immune cell infiltration, thereby modulating tumor progression and patient prognosis.

View Article and Find Full Text PDF

Background: Centromeres are crucial for precise chromosome segregation and maintaining genome stability during cell division. However, their evolutionary dynamics, particularly in polyploid organisms with complex genomic architectures, remain largely enigmatic. Allopolyploid wheat, with its well-defined hierarchical ploidy series and recent polyploidization history, serves as an excellent model to explore centromere evolution.

View Article and Find Full Text PDF

Background: Volatile anesthetics are gaining recognition for their benefits in long-term sedation of mechanically ventilated patients with bacterial pneumonia and acute respiratory distress syndrome. In addition to their sedative role, they also exhibit anti-bacterial and anti-inflammatory properties, though the mechanisms behind these effects remain only partially understood. In vitro studies examining the prolonged impact of volatile anesthetics on bacterial growth, inflammatory cytokine response, and surfactant proteins - key to maintaining lung homeostasis - are still lacking.

View Article and Find Full Text PDF