98%
921
2 minutes
20
Background: relies on posttranscriptional control to regulate gene expression. Protein-coding genes are synthesised as polycistronic precursors that are processed into individual mRNAs by -splicing adding the spliced leader (SL) RNA to the 5’-end and 3’ cleavage-polyadenylation. Here, we employ Nanopore direct RNA sequencing (DRS) combined with Illumina RNA-Seq to comprehensively interrogate the transcriptomes of developmental stages at single-molecule resolution.
Results: Analysis of DRS full-length reads of poly(A)+-enriched RNA from developmental stages enabled us to precisely determine the primary SL and poly(A) sites for 52% of the protein-coding transcripts and to accurately define their 5’- and 3’-end and the length of UTRs. In addition, our analysis confirmed the motifs ‘[C/A/T] A|G’ being associated with 94.8% of the SL cleavage sites and better defined the genomic context for cleavage and polyadenylation. Overall, we observed more diversity for poly(A) than SL sites per transcript. The frequency of the primary SL and poly(A) sites was 64.2% and 24%, respectively, with most transcripts having additional poly(A) sites nearby. Alternative polyadenylation was detected in 11-13% of transcripts with ~ 20% of these having different primary poly(A) sites between promastigote and amastigote developmental stages. Furthermore, DRS uncovered multiple processing events occurring mostly within 3’UTRs, leading to the formation of long non-coding RNAs (lncRNAs). The transcriptome expresses a rich repertoire of 1,825 lncRNAs, of which 98% were not previously annotated in and only 21.5% were found in . These lncRNAs exhibit generally distinct expression patterns from the 3’UTRs they derived and several are developmentally regulated, representing ~ 27% of the stage-regulated transcriptome. Their expression was generally higher in amastigotes than in promastigotes, highlighting their importance in parasite intracellular development. Protein prediction tools combined to mass-spectrometry revealed that 7.6% of these lncRNAs have a limited protein-coding potential.
Conclusions: This is the first comprehensive transcriptomic analysis of developmental stages using single-molecule Nanopore DRS. Our findings advance knowledge on existing expression datasets and provide new insights into the transcriptome complexity and dynamics of both protein-coding and non-coding sequences throughout the parasite development.
Supplementary Information: The online version contains supplementary material available at 10.1186/s12864-025-11767-8.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC12211216 | PMC |
http://dx.doi.org/10.1186/s12864-025-11767-8 | DOI Listing |
Nat Commun
August 2025
Feil Family Brain and Mind Research Institute, Weill Cornell Medicine, New York, NY, USA.
Genome-wide spatial long-read approaches often lack single-cell resolution and yield limited read lengths. Here, we introduce spatial ISOform sequencing (Spl-ISO-Seq), which reveals exons and polyadenylation sites with near-single-cell resolution. Spl-ISO-Seq selects long cDNAs and doubles to triples read lengths compared to standard preparations.
View Article and Find Full Text PDFNat Commun
August 2025
National Cancer Institute, National Institutes of Health, Frederick, MD, 21702, USA.
Tristetraprolin family of proteins regulate mRNA stability by binding to specific AU-rich elements in transcripts. This binding promotes the shortening of the mRNA poly(A) tail, or deadenylation, initiating mRNA degradation. The CCR4-NOT complex plays a central role in deadenylation, while the cytoplasmic poly(A)-binding protein PABPC1 typically protects mRNAs from decay.
View Article and Find Full Text PDFCpG dinucleotides are under-represented in the genomes of most RNA viruses. Synonymously increasing CpG content of a range of RNA virus genomes reliably causes replication defects due to the recognition of CpG motifs in RNA by cellular Zinc-finger Antiviral Protein (ZAP). Prior to the discovery of ZAP as a CpG sensor, we described an engineered influenza A virus (IAV) enriched for CpGs in segment 5 that displays the expected replication defects.
View Article and Find Full Text PDFBiochemistry
August 2025
Department of Chemistry, University of North Carolina, Chapel Hill, North Carolina 27599-3290, United States.
Extensive RNA-protein interactions occur throughout the lifecycle of an mRNA that up- and down-regulate mRNA translation and degradation. Modulating interactions between regulatory proteins and mRNAs can have large effects on gene expression and might be useful for creating therapeutic manipulations, especially for difficult-to-drug proteins. Here, we directed three degradation-inducing proteins that normally bind the 5' cap, 3'-untranslated region (UTR), or 3' poly(A) tail to unconventional sites spanning coding and noncoding regions across a reporter mRNA.
View Article and Find Full Text PDFEwha Med J
July 2025
Department of Microbiology, Ewha Womans University College of Medicine, Seoul, Korea.
Purpose: Internal ribosome entry site (IRES) elements, present in both viral and cellular messenger RNAs (mRNAs), facilitate cap-independent translation by recruiting ribosomes to internal regions of mRNA. This study aimed to investigate the impact of inserting G-quadruplex and hairpin structures into the 5' untranslated region (UTR) and poly(A) sequences on the translation efficiency of the encephalomyocarditis virus (EMCV) IRES, using an IRES-based RNA platform encoding OX40L, 4-1BBL, and GFP.
Methods: G-quadruplex and hairpin structures, derived from HIV-1 (human immunodeficiency virus type 1) or custom-designed, were synthesized and inserted into the 5' UTR and poly(A) tail regions of EMCV IRES vectors.