Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

MGnify (http://www.ebi.ac.uk/metagenomics) provides a free to use platform for the assembly, analysis and archiving of microbiome data derived from sequencing microbial populations that are present in particular environments. Over the past 2 years, MGnify (formerly EBI Metagenomics) has more than doubled the number of publicly available analysed datasets held within the resource. Recently, an updated approach to data analysis has been unveiled (version 5.0), replacing the previous single pipeline with multiple analysis pipelines that are tailored according to the input data, and that are formally described using the Common Workflow Language, enabling greater provenance, reusability, and reproducibility. MGnify's new analysis pipelines offer additional approaches for taxonomic assertions based on ribosomal internal transcribed spacer regions (ITS1/2) and expanded protein functional annotations. Biochemical pathways and systems predictions have also been added for assembled contigs. MGnify's growing focus on the assembly of metagenomic data has also seen the number of datasets it has assembled and analysed increase six-fold. The non-redundant protein database constructed from the proteins encoded by these assemblies now exceeds 1 billion sequences. Meanwhile, a newly developed contig viewer provides fine-grained visualisation of the assembled contigs and their enriched annotations.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7145632PMC
http://dx.doi.org/10.1093/nar/gkz1035DOI Listing

Publication Analysis

Top Keywords

analysis pipelines
8
assembled contigs
8
analysis
5
mgnify microbiome
4
microbiome analysis
4
analysis resource
4
resource 2020
4
2020 mgnify
4
mgnify http//wwwebiacuk/metagenomics
4
http//wwwebiacuk/metagenomics free
4

Similar Publications

Objective: This study aims to develop a robust, multi-task deep learning framework that integrates vessel segmentation and radiomic analysis for the automated classification of four retinal conditions- diabetic retinopathy (DR), hypertensive retinopathy (HR), papilledema, and normal fundus-using fundus images.

Materials: AND.

Methods: A total of 2,165 patients from eight medical centers were enrolled.

View Article and Find Full Text PDF

SuperGLUE facilitates an explainable training framework for multi-modal data analysis.

Cell Rep Methods

August 2025

Interdepartmental Program in Computational Biology & Bioinformatics, Yale University, New Haven, CT 06511, USA; Department of Biostatistics, Yale University, New Haven, CT 06511, USA. Electronic address:

Single-cell multi-modal data integration has been an area of active research in recent years. However, it is difficult to unify the integration process of different omics in a pipeline and evaluate the contributions of data integration. In this article, we revisit the definition and contributions of multi-modal data integration and propose a strong and scalable method based on probabilistic deep learning with an explainable framework powered by statistical modeling to extract meaningful information after data integration.

View Article and Find Full Text PDF

A machine learning-based analysis method for small molecule high content screening of three-dimensional cancer spheroid morphology.

Mol Pharmacol

August 2025

Division of Preclinical Innovation, National Center for Advancing Translational Sciences, National Institutes of Health, Rockville, Maryland. Electronic address:

Although multiparameter cellular morphological profiling methods and three-dimensional (3D) biological model systems can potentially provide complex insights for pharmaceutical discovery campaigns, there have been relatively few reports combining these experimental approaches. In this study, we used the U87 glioblastoma cell line grown in a 3D spheroid format to validate a multiparameter cellular morphological profiling screening method. The steps of this approach include 3D spheroid treatment, cell staining, fully automated digital image acquisition, image segmentation, numerical feature extraction, and multiple machine learning approaches for cellular profiling.

View Article and Find Full Text PDF

This paper presents a novel multiscale signal processing framework for power quality disturbance (PQD) and cyber intrusion detection in smart grids, combining Non-Subsampled Contourlet Transform (NSCT), Split Augmented Lagrangian Shrinkage Algorithm (SALSA), and Morphological Component Analysis (MCA). A key innovation lies in an adaptive weighting mechanism within NSCT's directional sub bands, enabling dynamic energy redistribution and enhanced representation of both low-frequency anomalies (e.g.

View Article and Find Full Text PDF

Comprehensive in silico analyses of keratin heterodimerisation.

Eur J Cell Biol

August 2025

Institute of Molecular Pharmacology, Medical Faculty, RWTH Aachen University, Wendlingweg 2, Aachen 52074, Germany. Electronic address:

Keratins are the largest and most diverse group of intermediate filament proteins, providing structural integrity and mechanical strength to epithelial cells. Although their assembly as heterodimers is well established, the specific pairing preferences and molecular basis of keratin dimerisation remain largely unknown. Here, we employ a high-throughput computational pipeline that integrates AlphaFold Multimer (AFM) modelling, VoroIF-GNN interaction interface quality assessment, interaction energy calculations and structural comparisons with experimentally solved structures to systematically investigate keratin heterodimerisation and to provide a guideline for further analysis of intermediate filament assembly.

View Article and Find Full Text PDF