mettannotator: a comprehensive and scalable Nextflow annotation pipeline for prokaryotic assemblies.

Bioinformatics

European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Cambridge CB10 1SD, United Kingdom.

Published: February 2025


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Summary: In recent years, there has been a surge in prokaryotic genome assemblies, coming from both isolated organisms and environmental samples. These assemblies often include novel species that are poorly represented in reference databases creating a need for a tool that can annotate both well-described and novel taxa, and can run at scale. Here, we present mettannotator-a comprehensive, scalable Nextflow pipeline for prokaryotic genome annotation that identifies coding and noncoding regions, predicts protein functions, including antimicrobial resistance, and delineates gene clusters. The pipeline summarizes these results in a GFF (General Feature Format) file that can be easily utilized in downstream analysis or visualized using common genome browsers. Here, we show how it works on 200 genomes from 29 prokaryotic phyla, including isolate genomes and known and novel metagenome-assembled genomes, and present metrics on its performance in comparison to other tools.

Availability And Implementation: The pipeline is written in Nextflow and Python and published under an open source Apache 2.0 licence. Instructions and source code can be accessed at https://github.com/EBI-Metagenomics/mettannotator. The pipeline is also available on WorkflowHub: https://workflowhub.eu/workflows/1069.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11842068PMC
http://dx.doi.org/10.1093/bioinformatics/btaf037DOI Listing

Publication Analysis

Top Keywords

comprehensive scalable
8
scalable nextflow
8
pipeline prokaryotic
8
prokaryotic genome
8
pipeline
5
mettannotator comprehensive
4
nextflow annotation
4
annotation pipeline
4
prokaryotic
4
prokaryotic assemblies
4

Similar Publications

Bimetallic FeNi-ZSM-5-catalyzed pyrolysis of photovoltaic waste: Selective and high-yield aromatic valorization for circular resource recovery.

Environ Res

September 2025

Guangdong Education Department Key Laboratory of Resources Comprehensive Utilization and Cleaner Production, School of Environmental Science and Engineering, Guangdong University of Technology, Guangzhou, 510006, China.

Catalytic pyrolysis, an efficient thermochemical process, offers a promising pathway to valorize thermoset photovoltaic backsheets (TPV) into high-value chemicals. This study investigates the ex situ catalytic pyrolysis of TPV using two acidic catalysts, ZSM-5 and FeNi-ZSM-5, under varied operational conditions, with a focus on product distribution and process efficiency. The catalytic intervention significantly enhanced pyrolysis performance.

View Article and Find Full Text PDF

Scalable Acid-Aided Lysis of Skin Samples Improves Proteome Coverage.

J Invest Dermatol

September 2025

LEO Foundation Skin Immunology Research Center, Department of Immunology and Microbiology, University of Copenhagen, Copenhagen, DK. Electronic address:

Liquid chromatography-mass spectrometry (LC-MS) is an evolving tool for comprehensive proteomic analyses across tissues. Despite the widespread use of LC-MS in dermatology, full-thickness human skin remains challenging to analyse. The skin extracellular matrix (ECM) presents two major obstacles: the extensive crosslinking complicates protein extraction and the high abundance of ECM proteins can mask lower-abundance proteins, reducing identification numbers.

View Article and Find Full Text PDF

SuperGLUE facilitates an explainable training framework for multi-modal data analysis.

Cell Rep Methods

August 2025

Interdepartmental Program in Computational Biology & Bioinformatics, Yale University, New Haven, CT 06511, USA; Department of Biostatistics, Yale University, New Haven, CT 06511, USA. Electronic address:

Single-cell multi-modal data integration has been an area of active research in recent years. However, it is difficult to unify the integration process of different omics in a pipeline and evaluate the contributions of data integration. In this article, we revisit the definition and contributions of multi-modal data integration and propose a strong and scalable method based on probabilistic deep learning with an explainable framework powered by statistical modeling to extract meaningful information after data integration.

View Article and Find Full Text PDF

A multi-channel integrated auditory function test system.

Neuroscience

September 2025

Department of Biomedical Engineering, Southern University of Science and Technology, Shenzhen, Guangdong 518055, China. Electronic address:

The auditory brainstem response (ABR) remains the gold standard for evaluating hearing function in both animal models and humans. Features of ABR, including threshold, wave I amplitude and latency are critical for diagnosing and investigating the mechanisms of hearing loss. Critically, the rapid proliferation of genetically engineered mouse models in hearing research has created an imperative demand for high-throughput ABR testing capabilities.

View Article and Find Full Text PDF

The urgent need to reduce fossil fuel emissions demands advanced control technologies beyond conventional catalysts. This review uniquely offers a comprehensive analysis of composite catalysts tailored to capture the full spectrum of fossil fuel pollutants, unlike prior studies that address individual emissions separately. It covers fundamental principles, reaction mechanisms, and recent material innovations, emphasizing multi-metallic, nanostructured, and hybrid catalyst designs.

View Article and Find Full Text PDF