98%
921
2 minutes
20
The majority of bacteriophage diversity remains uncharacterized, and new intriguing mechanisms of their biology are being continually described. Members of some phage lineages, such as the , repurpose stop codons to encode an amino acid by using alternate genetic codes. Here, we investigated the prevalence of stop codon reassignment in phage genomes and its subsequent impacts on functional annotation. We predicted 76 genomes within INPHARED and 712 vOTUs from the Unified Human Gut Virome Catalogue (UHGV) that repurpose a stop codon to encode an amino acid. We re-annotated these sequences with modified versions of Pharokka and Prokka, called Pharokka-gv and Prokka-gv, to automatically predict stop codon reassignment prior to annotation. Both tools significantly improved the quality of annotations, with Pharokka-gv performing best. For sequences predicted to repurpose TAG to glutamine (translation table 15), Pharokka-gv increased the median gene length (median of per genome median) from 287 to 481 bp for UHGV sequences (67.8% increase) and from 318 to 550 bp for INPHARED sequences (72.9% increase). The re-annotation increased median coding capacity from 66.8% to 90.0% and from 69.0% to 89.8% for UHGV and INPHARED sequences predicted to use translation table 15. Furthermore, the proportion of genes that could be assigned functional annotation increased, including an increase in the number of major capsid proteins that could be identified. We propose that automatic prediction of stop codon reassignment before annotation is beneficial to downstream viral genomic and metagenomic analyses.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11210395 | PMC |
http://dx.doi.org/10.1093/ismeco/ycae079 | DOI Listing |
Int J Mol Sci
July 2025
Department of Bioinformatics and Genomics, Faculty of Biotechnology, University of Wrocław, ul. Joliot-Curie 14a, 50-383 Wrocław, Poland.
The standard genetic code (SGC) plays a fundamental role in encoding biological information, but its evolutionary origins remain unresolved and widely debated. Thus, we used a methodology based on the evolutionary algorithm to investigate the emergence of stable coding systems. The simulation began with a population of varied primitive genetic codes that ambiguously encoded only a limited set of amino acids (labels).
View Article and Find Full Text PDFCold Spring Harb Perspect Biol
July 2025
Institute of Microbiology, Czech Academy of Sciences, 142 20 Prague, Czechia
The standard genetic code, which applies almost without exception, is the key to our understanding of molecular biological processes. Although it is close to impossible to imagine that sparse code changes occur naturally given proteomic constraints, specific cases of codon usage alterations have been documented, mostly in unicellular eukaryotes. Here, we summarize what we have learned about , a little-known parasitic flagellate with all three stop codons reassigned to sense codons, which uses UAA as the only universal stop codon.
View Article and Find Full Text PDFNucleic Acids Res
July 2025
Department of Pharmacology and Physiology, University of Rochester School of Medicine and Dentistry, Rochester, NY 14642, United States.
Nonsense mutations arise from single nucleotide substitutions that result in premature termination codons (PTCs). PTCs result in little to no full-length protein production and decreased mRNA stability due to the nonsense-mediated mRNA decay (NMD) pathway. We provide evidence that anticodon-edited (ACE-) tRNAs efficiently suppress the most prevalent cystic fibrosis (CF)-causing PTCs, promoting significant rescue of endogenous cystic fibrosis transmembrane conductance regulator (CFTR) transcript abundance and channel function in different model systems.
View Article and Find Full Text PDFBiochemistry
July 2025
Medical Research Council Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, England, U.K.
The genetic code defines the correspondence between codons in genes and amino acids in proteins. Reassignment of sense codons to different amino acids can create cells with refactored genetic codes that are distinct from the canonical genetic code. By encoding essential genes according to the refactored genetic code, this code becomes locked-in, making it essential to the host cell.
View Article and Find Full Text PDFNature
July 2025
Synthetic and Functional Biomolecules Center, Beijing National Laboratory for Molecular Sciences, College of Chemistry and Molecular Engineering, Peking University, Beijing, China.
The incorporation of non-canonical amino acids (ncAAs) enables customized chemistry to tailor protein functions. Genetic code expansion offers a general approach for ncAA encoding by reassigning stop codons as the 'blank' codon; however, it is not completely orthogonal to translation termination for cellular transcripts. Here, to generate more bona fide blank codons, we developed an RNA codon-expansion (RCE) strategy that introduces and decodes bioorthogonally assignable pseudouridine (Ψ) codons (ΨGA, ΨAA or ΨAG) on specified mRNA transcripts to incorporate ncAAs in mammalian cells.
View Article and Find Full Text PDF