Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Identifying and characterizing mobile genetic elements in sequencing data is essential for understanding their diversity, ecology, biotechnological applications and impact on public health. Here we introduce geNomad, a classification and annotation framework that combines information from gene content and a deep neural network to identify sequences of plasmids and viruses. geNomad uses a dataset of more than 200,000 marker protein profiles to provide functional gene annotation and taxonomic assignment of viral genomes. Using a conditional random field model, geNomad also detects proviruses integrated into host genomes with high precision. In benchmarks, geNomad achieved high classification performance for diverse plasmids and viruses (Matthews correlation coefficient of 77.8% and 95.3%, respectively), substantially outperforming other tools. Leveraging geNomad's speed and scalability, we processed over 2.7 trillion base pairs of sequencing data, leading to the discovery of millions of viruses and plasmids that are available through the IMG/VR and IMG/PR databases. geNomad is available at https://portal.nersc.gov/genomad .

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11324519PMC
http://dx.doi.org/10.1038/s41587-023-01953-yDOI Listing

Publication Analysis

Top Keywords

mobile genetic
8
genetic elements
8
sequencing data
8
plasmids viruses
8
genomad
6
identification mobile
4
elements genomad
4
genomad identifying
4
identifying characterizing
4
characterizing mobile
4

Similar Publications

Structure, function and assembly of nuclear pore complexes.

Nat Rev Mol Cell Biol

September 2025

Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, CA, USA.

The defining property of eukaryotic cells is the storage of heritable genetic material in a nuclear compartment. For eukaryotic cells to carry out the myriad biochemical processes necessary for their function, macromolecules must be efficiently exchanged between the nucleus and cytoplasm. The nuclear pore complex (NPC) - which is a massive assembly of ~35 different proteins present in multiple copies totalling ~1,000 protein subunits and architecturally conserved across eukaryotes - establishes a size-selective channel for regulated bidirectional transport of folded macromolecules and macromolecular assemblies across the nuclear envelope.

View Article and Find Full Text PDF

Communities of plasmids as strategies for antimicrobial resistance gene survival in wastewater treatment plant effluent.

NPJ Antimicrob Resist

September 2025

Antimicrobial Resistance & Microbiome Research Group, Department of Biology, The Kathleen Lonsdale Institute for Human Health Research, Maynooth University, Maynooth, Co, Kildare, Ireland.

Plasmids facilitate antimicrobial resistance (AMR) gene spread via horizontal gene transfer, yet the mobility of genes in wastewater treatment plant (WWTP) resistomes remains unclear. We sequenced 173 circularised plasmids transferred from WWTP effluent into Escherichia coli and characterised their genetic content. Multiple multidrug-resistant plasmids were identified, with a significant number of mega-plasmids (>100 kb).

View Article and Find Full Text PDF

Cytoplasmic Incompatibility (CI) causes embryonic lethality in arthropods, resulting in a significant reduction in reproductive success. In most cases, this reproductive failure is driven by Wolbachia endosymbionts through their cifA/cifB gene pair, whose products disrupts arthropod DNA replication during embryogenesis. While a cif pair has been considered a hallmark of Wolbachia, its presence and functional significance in other bacterial lineages remains poorly investigated.

View Article and Find Full Text PDF

causes otitis media and severe diseases including pneumonia, meningitis and bacteraemia. The rise of antimicrobial resistance (AMR) in , facilitated by mobile genetic elements (MGEs), complicates infection treatment. While pneumococcal conjugate vaccine (PCV) deployment has reduced disease burden, non-vaccine serotypes (NVTs) have increased and now cause invasive disease.

View Article and Find Full Text PDF

The genomes of 43 distinct lactococcal strains were reconstructed by a combination of long- and short-read sequencing, resolving the plasmid complement and methylome of these strains. The genomes comprised 43 chromosomes of approximately 2.5 Mb each and 269 plasmids ranging from 2 to 211 kb (at an average occurrence of 6 per strain).

View Article and Find Full Text PDF