98%
921
2 minutes
20
Identifying and characterizing mobile genetic elements in sequencing data is essential for understanding their diversity, ecology, biotechnological applications and impact on public health. Here we introduce geNomad, a classification and annotation framework that combines information from gene content and a deep neural network to identify sequences of plasmids and viruses. geNomad uses a dataset of more than 200,000 marker protein profiles to provide functional gene annotation and taxonomic assignment of viral genomes. Using a conditional random field model, geNomad also detects proviruses integrated into host genomes with high precision. In benchmarks, geNomad achieved high classification performance for diverse plasmids and viruses (Matthews correlation coefficient of 77.8% and 95.3%, respectively), substantially outperforming other tools. Leveraging geNomad's speed and scalability, we processed over 2.7 trillion base pairs of sequencing data, leading to the discovery of millions of viruses and plasmids that are available through the IMG/VR and IMG/PR databases. geNomad is available at https://portal.nersc.gov/genomad .
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11324519 | PMC |
http://dx.doi.org/10.1038/s41587-023-01953-y | DOI Listing |
Nat Rev Mol Cell Biol
September 2025
Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, CA, USA.
The defining property of eukaryotic cells is the storage of heritable genetic material in a nuclear compartment. For eukaryotic cells to carry out the myriad biochemical processes necessary for their function, macromolecules must be efficiently exchanged between the nucleus and cytoplasm. The nuclear pore complex (NPC) - which is a massive assembly of ~35 different proteins present in multiple copies totalling ~1,000 protein subunits and architecturally conserved across eukaryotes - establishes a size-selective channel for regulated bidirectional transport of folded macromolecules and macromolecular assemblies across the nuclear envelope.
View Article and Find Full Text PDFNPJ Antimicrob Resist
September 2025
Antimicrobial Resistance & Microbiome Research Group, Department of Biology, The Kathleen Lonsdale Institute for Human Health Research, Maynooth University, Maynooth, Co, Kildare, Ireland.
Plasmids facilitate antimicrobial resistance (AMR) gene spread via horizontal gene transfer, yet the mobility of genes in wastewater treatment plant (WWTP) resistomes remains unclear. We sequenced 173 circularised plasmids transferred from WWTP effluent into Escherichia coli and characterised their genetic content. Multiple multidrug-resistant plasmids were identified, with a significant number of mega-plasmids (>100 kb).
View Article and Find Full Text PDFPLoS Genet
September 2025
MIVEGEC, University of Montpellier, CNRS, IRD, Montpellier, France.
Cytoplasmic Incompatibility (CI) causes embryonic lethality in arthropods, resulting in a significant reduction in reproductive success. In most cases, this reproductive failure is driven by Wolbachia endosymbionts through their cifA/cifB gene pair, whose products disrupts arthropod DNA replication during embryogenesis. While a cif pair has been considered a hallmark of Wolbachia, its presence and functional significance in other bacterial lineages remains poorly investigated.
View Article and Find Full Text PDFMicrob Genom
September 2025
School of Animal and Veterinary Sciences, The University of Adelaide, Roseworthy, South Australia 5371, Australia.
causes otitis media and severe diseases including pneumonia, meningitis and bacteraemia. The rise of antimicrobial resistance (AMR) in , facilitated by mobile genetic elements (MGEs), complicates infection treatment. While pneumococcal conjugate vaccine (PCV) deployment has reduced disease burden, non-vaccine serotypes (NVTs) have increased and now cause invasive disease.
View Article and Find Full Text PDFNucleic Acids Res
September 2025
School of Microbiology, University College Cork, Cork, T12 Y337, Ireland.
The genomes of 43 distinct lactococcal strains were reconstructed by a combination of long- and short-read sequencing, resolving the plasmid complement and methylome of these strains. The genomes comprised 43 chromosomes of approximately 2.5 Mb each and 269 plasmids ranging from 2 to 211 kb (at an average occurrence of 6 per strain).
View Article and Find Full Text PDF