Puzzle Hi-C: An accurate scaffolding software.

PLoS One

State Key Laboratory for Conservation and Utilization of Bio-resource, School of Ecology and Environment, School of Life Sciences and School of Medicine, Yunnan University, Kunming, Yunnan, China.

Published: July 2024


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

High-quality, chromosome-scale genomes are essential for genomic analyses. Analyses, including 3D genomics, epigenetics, and comparative genomics rely on a high-quality genome assembly, which is often accomplished with the assistance of Hi-C data. Curation of genomes reveal that current Hi-C-assisted scaffolding algorithms either generate ordering and orientation errors or fail to assemble high-quality chromosome-level scaffolds. Here, we offer the software Puzzle Hi-C, which uses Hi-C reads to accurately assign contigs or scaffolds to chromosomes. Puzzle Hi-C uses the triangle region instead of the square region to count interactions in a Hi-C heatmap. This strategy dramatically diminishes scaffolding interference caused by long-range interactions. This software also introduces a dynamic, triangle window strategy during assembly. Initially small, the window expands with interactions to produce more effective clustering. Puzzle Hi-C outperforms available scaffolding tools.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11249255PMC
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0298564PLOS

Publication Analysis

Top Keywords

puzzle hi-c
16
hi-c
6
puzzle
4
hi-c accurate
4
scaffolding
4
accurate scaffolding
4
scaffolding software
4
software high-quality
4
high-quality chromosome-scale
4
chromosome-scale genomes
4

Similar Publications

In mammalian interphase cells, genomes are folded by cohesin loop extrusion limited by directional CTCF barriers. This process enriches cohesin at barriers, isolates neighboring topologically associating domains, and elevates contact frequency between convergent CTCF barriers across the genome. However, recent in vivo measurements present a puzzle: reported CTCF residence times on chromatin are in the range of a few minutes, whereas cohesin lifetimes are much longer.

View Article and Find Full Text PDF

In mammalian interphase cells, genomes are folded by cohesin loop extrusion limited by directional CTCF barriers. This interplay leads to the enrichment of cohesin at barriers, isolation between neighboring topologically associating domains, and elevated contact frequency between convergent CTCF barriers across the genome. However, recent measurements present a puzzle: reported residence times for CTCF on chromatin are in the range of a few minutes, while lifetimes for cohesin are much longer.

View Article and Find Full Text PDF

Puzzle Hi-C: An accurate scaffolding software.

PLoS One

July 2024

State Key Laboratory for Conservation and Utilization of Bio-resource, School of Ecology and Environment, School of Life Sciences and School of Medicine, Yunnan University, Kunming, Yunnan, China.

High-quality, chromosome-scale genomes are essential for genomic analyses. Analyses, including 3D genomics, epigenetics, and comparative genomics rely on a high-quality genome assembly, which is often accomplished with the assistance of Hi-C data. Curation of genomes reveal that current Hi-C-assisted scaffolding algorithms either generate ordering and orientation errors or fail to assemble high-quality chromosome-level scaffolds.

View Article and Find Full Text PDF

Bacterial chromosomes are large molecules that need to be highly compacted to fit inside the cells. Chromosome compaction must facilitate and maintain key biological processes such as gene expression and DNA transactions (replication, recombination, repair, and segregation). Chromosome and chromatin 3D-organization in bacteria has been a puzzle for decades.

View Article and Find Full Text PDF

Circuit topology analysis of cellular genome reveals signature motifs, conformational heterogeneity, and scaling.

iScience

March 2022

Medical Systems Biophysics and Bioengineering, Leiden Academic Centre for Drug Research, Faculty of Science, Leiden University, Einsteinweg 55, 2333CC Leiden, the Netherlands.

Reciprocal regulation of genome topology and function is a fundamental and enduring puzzle in biology. The wealth of data provided by Hi-C libraries offers the opportunity to unravel this relationship. However, there is a need for a comprehensive theoretical framework in order to extract topological information for genome characterization and comparison.

View Article and Find Full Text PDF