Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Background: Genome assembly of viruses with high mutation rates, such as Norovirus and other RNA viruses, or from metagenome samples, poses a challenge for the scientific community due to the coexistence of several viral quasispecies and strains. Furthermore, there is no standard method for obtaining whole-genome sequences in non-related patients. After polyA RNA isolation and sequencing in eight patients with acute gastroenteritis, we evaluated two de Bruijn graph assemblers (SPAdes and MEGAHIT), combined with four different and common pre-assembly strategies, and compared those yielding whole genome Norovirus contigs.

Results: Reference-genome guided strategies with both host and target virus did not present any advantages compared to the assembly of non-filtered data in the case of SPAdes, and in the case of MEGAHIT, only host genome filtering presented improvements. MEGAHIT performed better than SPAdes in most samples, reaching complete genome sequences in most of them for all the strategies employed. Read binning with CD-HIT improved assembly when paired with different analysis strategies, and more notably in the case of SPAdes.

Conclusions: Not all metagenome assemblies are equal and the choice in the workflow depends on the species studied and the prior steps to analysis. We may need different approaches even for samples treated equally due to the presence of high intra host variability. We tested and compared different workflows for the accurate assembly of Norovirus genomes and established their assembly capacities for this purpose.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8611953PMC
http://dx.doi.org/10.1186/s12864-021-08067-2DOI Listing

Publication Analysis

Top Keywords

genome assembly
8
metagenome samples
8
assembly
6
genome
5
benchmarking approaches
4
norovirus
4
approaches norovirus
4
norovirus genome
4
assembly metagenome
4
samples
4

Similar Publications

Dysregulated spine morphology is a common feature in the pathology of many neurodevelopmental and neuropsychiatric disorders. Overabundant immature dendritic spines in the hippocampus are causally related to cognitive deficits of Fragile X syndrome (FXS), the most common form of heritable intellectual disability. Recent findings from us and others indicate autophagy plays important roles in synaptic stability and morphology, and autophagy is downregulated in FXS neurons.

View Article and Find Full Text PDF

The rapid decline in global biodiversity highlights the urgent need for conservation efforts, with botanical gardens playing a crucial role in ex situ plant preservation. Monumental plants, such as the 400-year-old Goethe's Palm (Chamaerops humilis L.) at the Padua Botanical Garden serve as vital flagship species with significant ecological and cultural value.

View Article and Find Full Text PDF

EASY-edit: a toolbox for high-throughput single-step custom genetic editing in bacteria.

Nucleic Acids Res

September 2025

Expression génétique microbienne, UMR8261 CNRS, Université Paris Cité, Institut de Biologie Physico-Chimique, Paris 75005, France.

Targeted gene editing can be achieved using CRISPR-Cas9-assisted recombineering. However, high-efficiency editing requires careful optimization for each locus to be modified, which can be tedious and time-consuming. In this work, we developed a simple, fast and cheap method: Engineered Assembly of SYnthetic operons for targeted editing (EASY-edit) in Escherichia coli.

View Article and Find Full Text PDF

Crystal structures of distinct parallel and antiparallel DNA G-quadruplexes reveal structural polymorphism in C9orf72 G4C2 repeats.

Nucleic Acids Res

September 2025

State Key Laboratory of Vaccines for Infectious Diseases, School of Public Health, Xiamen University, Xiamen 361102, Fujian, China.

The abnormal expansion of GGGGCC (G4C2) repeats in the noncoding region of the C9orf72 gene is a major genetic cause of two devastating neurodegenerative disorders, amyotrophic lateral sclerosis (ALS) and frontotemporal dementia (FTD). These G4C2 repeats are known to form G-quadruplex (G4) structures, which are hypothesized to contribute to disease pathogenesis. Here, we demonstrated that four DNA G4C2 repeats can fold into two structurally distinct G4 conformations: a parallel and an antiparallel topology.

View Article and Find Full Text PDF

Despite advancements in genome annotation tools, challenges persist for non-classical model organisms with limited genomic resources, such as Schmidtea mediterranea. To address these challenges, we developed a flexible and scalable genome annotation pipeline that integrates short-read (Illumina) and long-read (PacBio) sequencing technologies. The pipeline combines reference-based and de novo assembly methods, effectively handling genomic variability and alternative splicing events.

View Article and Find Full Text PDF