98%
921
2 minutes
20
exonization, or the recruitment of intronic elements into gene sequences, has contributed to functional diversification; however, its extent and the ways in which it influences gene regulation are not fully understood. We developed an unbiased approach to predict exonization events from genomic sequences implemented in a deep learning model, eXAlu, that overcomes the limitations of tissue or condition specificity and the computational burden of RNA-seq analysis. The model captures previously reported characteristics of exonized sequences and can predict sequence elements important for exonization. Using eXAlu, we estimate the number of elements in the human genome undergoing exonization to be between 55-110K, 11-21 fold more than represented in the GENCODE gene database. Using RT-PCR we were able to validate selected predicted exonization events, supporting the accuracy of our method. Lastly, we highlight a potential application of our method to identify polymorphic insertion exonizations in individuals and in the population from whole genome sequencing data.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10802380 | PMC |
http://dx.doi.org/10.1101/2024.01.03.574099 | DOI Listing |
Chem Biodivers
September 2025
State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan & Yunnan Key Laboratory of Basic Research and Innovative Application for Green Biological Production, Key Laboratory for Microbial Resources of the Ministry of Education, School of Life Sciences, Yunnan University, Kunm
Understanding the determinants of lifespan is a central objective in biology. Lifespan is shaped by dynamic, stage-specific changes in metabolism, energy allocation, and genome integrity. Heart rate serves as a physiological marker that reflects both life stage and metabolic state.
View Article and Find Full Text PDFPLoS Genet
September 2025
Neural Development Section, Mouse Cancer Genetics Program, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Frederick, Maryland, United States of America.
The RbFox RNA binding proteins regulate alternative splicing of genes governing mammalian development and organ function. They bind to the RNA sequence (U)GCAUG with high affinity but also non-canonical secondary motifs in a concentration dependent manner. However, the hierarchical requirement of RbFox motifs, which are widespread in the genome, is still unclear.
View Article and Find Full Text PDFPLoS Negl Trop Dis
September 2025
Department of Clinical Science, Liverpool School of Tropical Medicine, Liverpool, United Kingdom.
Background: Salmonella enterica encompasses over 2,600 serovars, including several commonly associated with severe infection in humans. Salmonella is a major cause of sepsis in Africa; however, diagnosis requires clinical microbiology facilities. Environmental surveillance has the potential to play a role in Salmonella surveillance.
View Article and Find Full Text PDFPLoS One
September 2025
School of Animal and Comparative Biomedical Sciences, College of Agriculture and Life Sciences, University of Arizona, Tucson, Arizona, United States of America.
The Gram-negative bacterium Campylobacter jejuni is part of the commensal gut microbiota of numerous animal species and a leading cause of bacterial foodborne illness in humans. Most complete genomes of C. jejuni are from strains isolated from human clinical, poultry, and ruminant samples.
View Article and Find Full Text PDFJ Clin Invest
September 2025
Department of Clinical and Biomedical Sciences, Faculty of Health and Life Sciences, University of Exeter, Exeter, United Kingdom.
Understanding the genetic causes of diseases affecting pancreatic β cells and neurons can give insights into pathways essential for both cell types. Microcephaly, epilepsy and diabetes syndrome (MEDS) is a congenital disorder with two known aetiological genes, IER3IP1 and YIPF5. Both genes encode proteins involved in endoplasmic reticulum (ER) to Golgi trafficking.
View Article and Find Full Text PDF