MkcDBGAS: a reference-free approach to identify comprehensive alternative splicing events in a transcriptome.

Brief Bioinform

MOE Key Laboratory for Biodiversity Science and Ecological Engineering and Beijing Key Laboratory of Gene Resource and Molecular Development, College of Life Sciences, Beijing Normal University, Beijing 100875, China.

Published: September 2023


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Alternative splicing (AS) is an essential post-transcriptional mechanism that regulates many biological processes. However, identifying comprehensive types of AS events without guidance from a reference genome is still a challenge. Here, we proposed a novel method, MkcDBGAS, to identify all seven types of AS events using transcriptome alone, without a reference genome. MkcDBGAS, modeled by full-length transcripts of human and Arabidopsis thaliana, consists of three modules. In the first module, MkcDBGAS, for the first time, uses a colored de Bruijn graph with dynamic- and mixed- kmers to identify bubbles generated by AS with precision higher than 98.17% and detect AS types overlooked by other tools. In the second module, to further classify types of AS, MkcDBGAS added the motifs of exons to construct the feature matrix followed by the XGBoost-based classifier with the accuracy of classification greater than 93.40%, which outperformed other widely used machine learning models and the state-of-the-art methods. Highly scalable, MkcDBGAS performed well when applied to Iso-Seq data of Amborella and transcriptome of mouse. In the third module, MkcDBGAS provides the analysis of differential splicing across multiple biological conditions when RNA-sequencing data is available. MkcDBGAS is the first accurate and scalable method for detecting all seven types of AS events using the transcriptome alone, which will greatly empower the studies of AS in a wider field.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10576019PMC
http://dx.doi.org/10.1093/bib/bbad367DOI Listing

Publication Analysis

Top Keywords

events transcriptome
12
types events
12
mkcdbgas
8
alternative splicing
8
reference genome
8
module mkcdbgas
8
types
5
mkcdbgas reference-free
4
reference-free approach
4
approach identify
4

Similar Publications

Objective: The key molecular events signifying the -induced gastric carcinogenesis process are largely unknown.

Methods: Bulk tissue-proteomics profiling were leveraged across multi-stage gastric lesions from Linqu ( = 166) and Beijing sets ( = 99) and single-cell transcriptomic profiling ( = 18) to decipher key molecular signatures of -related gastric lesion progression and gastric cancer (GC) development. The association of key proteins association with gastric lesion progression and GC development were prospectively studied building on follow-up of the Linqu set and UK Biobank ( = 48,529).

View Article and Find Full Text PDF

Objectives: Loeys-Dietz syndrome comprises genetically discrete subtypes of varying clinical severity. This study integrates longitudinal Loeys-Dietz syndrome clinical outcomes after aortic root replacement with transcriptomic analysis of aortic smooth muscle cell dysregulation to investigate mechanisms governing this subtype-specific aortic vulnerability.

Methods: Single institutional experience with aortic root replacement for nondissected aneurysm in patients with Loeys-Dietz syndrome was reviewed for midterm survival and distal aortic events (subsequent aortic intervention, aneurysm, or dissection).

View Article and Find Full Text PDF

An mTOR-Tfeb-Fabp7a Axis Ameliorates bag3 Cardiomyopathy via Decelerating Cardiac Aging.

Aging Cell

September 2025

Department of Biochemistry and Molecular Biology, Department of Cardiovascular Medicine, Mayo Clinic, Rochester, Minnesota, USA.

While BAG3 has been identified as a causative gene for dilated cardiomyopathy, the major pathological events in BAG3-related cardiomyopathy that could be targeted for therapeutic benefit remain to be discovered. Here, we aim to uncover novel pathological events through genetic studies in a zebrafish bag3 cardiomyopathy model. Given the known cardioprotective effects of mtor inhibition and the fact that transcription factor EB (tfeb) encodes a direct downstream phosphorylation target of mTOR signaling, we generated a cardiomyocyte-specific transgenic line overexpressing tfeb (Tg[cmlc2:tfeb]).

View Article and Find Full Text PDF

The purpose of this study was to investigate potential therapeutic targets for osteosarcoma (OS) and offer hints regarding genetic factors for OS treatment using a bioinformatics method. This study processed 3 OS datasets from the gene expression omnibus database using R software, screening for differentially expressed genes (DEGs). After enrichment analysis, based on expression quantitative trait loci data and the genome-wide association study data of OS, Mendelian randomization analysis was used to screen the genes closely related to OS disease, which intersect with DEGs to obtain co-expressed genes, validation datasets were employed to verify the results.

View Article and Find Full Text PDF

Skeletal muscle atrophy and weakness are major contributors to morbidity, prolonged recovery, and long-term disability across a wide range of diseases. Atrophy is caused by breakdown of sarcomeric proteins resulting in loss of muscle mass and strength. Molecular mechanism underlying the onset of muscle atrophy and its progression have been analysed in patients, mice, and cell culture but the complementarity of these model systems remains to be explored.

View Article and Find Full Text PDF