Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

exonization, or the recruitment of intronic elements into gene sequences, has contributed to functional diversification; however, its extent and the ways in which it influences gene regulation are not fully understood. We developed an unbiased approach to predict exonization events from genomic sequences implemented in a deep learning model, eXAlu, that overcomes the limitations of tissue or condition specificity and the computational burden of RNA-seq analysis. The model captures previously reported characteristics of exonized sequences and can predict sequence elements important for exonization. Using eXAlu, we estimate the number of elements in the human genome undergoing exonization to be between 55-110K, 11-21 fold more than represented in the GENCODE gene database. Using RT-PCR we were able to validate selected predicted exonization events, supporting the accuracy of our method. Lastly, we highlight a potential application of our method to identify polymorphic insertion exonizations in individuals and in the population from whole genome sequencing data.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10802380PMC
http://dx.doi.org/10.1101/2024.01.03.574099DOI Listing

Publication Analysis

Top Keywords

human genome
8
deep learning
8
learning model
8
exonization events
8
exonization
5
predicting exonization
4
exonization human
4
genome deep
4
model exonization
4
exonization recruitment
4

Similar Publications

The Essence of Nature Can be the Simplest (6)-Lifespan: Determined by Extracellular Fenton Chemistry.

Chem Biodivers

September 2025

State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan & Yunnan Key Laboratory of Basic Research and Innovative Application for Green Biological Production, Key Laboratory for Microbial Resources of the Ministry of Education, School of Life Sciences, Yunnan University, Kunm

Understanding the determinants of lifespan is a central objective in biology. Lifespan is shaped by dynamic, stage-specific changes in metabolism, energy allocation, and genome integrity. Heart rate serves as a physiological marker that reflects both life stage and metabolic state.

View Article and Find Full Text PDF

Clusters of deep intronic RbFox motifs embedded in large assembly of splicing regulators sequences regulate alternative splicing.

PLoS Genet

September 2025

Neural Development Section, Mouse Cancer Genetics Program, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Frederick, Maryland, United States of America.

The RbFox RNA binding proteins regulate alternative splicing of genes governing mammalian development and organ function. They bind to the RNA sequence (U)GCAUG with high affinity but also non-canonical secondary motifs in a concentration dependent manner. However, the hierarchical requirement of RbFox motifs, which are widespread in the genome, is still unclear.

View Article and Find Full Text PDF

Background: Salmonella enterica encompasses over 2,600 serovars, including several commonly associated with severe infection in humans. Salmonella is a major cause of sepsis in Africa; however, diagnosis requires clinical microbiology facilities. Environmental surveillance has the potential to play a role in Salmonella surveillance.

View Article and Find Full Text PDF

American black bear (Ursus americanus) as a potential host for Campylobacter jejuni.

PLoS One

September 2025

School of Animal and Comparative Biomedical Sciences, College of Agriculture and Life Sciences, University of Arizona, Tucson, Arizona, United States of America.

The Gram-negative bacterium Campylobacter jejuni is part of the commensal gut microbiota of numerous animal species and a leading cause of bacterial foodborne illness in humans. Most complete genomes of C. jejuni are from strains isolated from human clinical, poultry, and ruminant samples.

View Article and Find Full Text PDF

Recessive TMEM167A variants cause neonatal diabetes, microcephaly and epilepsy syndrome.

J Clin Invest

September 2025

Department of Clinical and Biomedical Sciences, Faculty of Health and Life Sciences, University of Exeter, Exeter, United Kingdom.

Understanding the genetic causes of diseases affecting pancreatic β cells and neurons can give insights into pathways essential for both cell types. Microcephaly, epilepsy and diabetes syndrome (MEDS) is a congenital disorder with two known aetiological genes, IER3IP1 and YIPF5. Both genes encode proteins involved in endoplasmic reticulum (ER) to Golgi trafficking.

View Article and Find Full Text PDF