Assembly of allele-aware, chromosomal-scale autopolyploid genomes based on Hi-C data.

Nat Plants

Center for Genomics and Biotechnology, Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Key Laboratory of Genetics, Breeding and Multiple Utilization of Corps, Ministry of Education, Fujian Agriculture and Forestry University, Fuzhou, China.

Published: August 2019


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Construction of chromosome-level assembly is a vital step in achieving the goal of a 'Platinum' genome, but it remains a major challenge to assemble and anchor sequences to chromosomes in autopolyploid or highly heterozygous genomes. High-throughput chromosome conformation capture (Hi-C) technology serves as a robust tool to dramatically advance chromosome scaffolding; however, existing approaches are mostly designed for diploid genomes and often with the aim of reconstructing a haploid representation, thereby having limited power to reconstruct chromosomes for autopolyploid genomes. We developed a novel algorithm (ALLHiC) that is capable of building allele-aware, chromosomal-scale assembly for autopolyploid genomes using Hi-C paired-end reads with innovative 'prune' and 'optimize' steps. Application on simulated data showed that ALLHiC can phase allelic contigs and substantially improve ordering and orientation when compared to other mainstream Hi-C assemblers. We applied ALLHiC on an autotetraploid and an autooctoploid sugar-cane genome and successfully constructed the phased chromosomal-level assemblies, revealing allelic variations present in these two genomes. The ALLHiC pipeline enables de novo chromosome-level assembly of autopolyploid genomes, separating each allele. Haplotype chromosome-level assembly of allopolyploid and heterozygous diploid genomes can be achieved using ALLHiC, overcoming obstacles in assembling complex genomes.

Download full-text PDF

Source
http://dx.doi.org/10.1038/s41477-019-0487-8DOI Listing

Publication Analysis

Top Keywords

autopolyploid genomes
16
chromosome-level assembly
12
genomes
9
allele-aware chromosomal-scale
8
chromosomes autopolyploid
8
diploid genomes
8
assembly autopolyploid
8
assembly
5
autopolyploid
5
allhic
5

Similar Publications

Plants rely on tight coordination between nuclear, mitochondrial, and chloroplast genomes to form essential multi-enzyme cytonuclear complexes. Whole-genome duplication (WGD) doubles the nuclear genome, potentially disrupting cytonuclear stoichiometry unless organellar genomes respond accordingly. Targeted analyses of chloroplasts and mitochondria enabled us to dissect the extent and mechanisms of adjustments in both organelles immediately after WGD and across generations in Arabidopsis auto- and allopolyploids.

View Article and Find Full Text PDF

Repeated polyploidization shapes divergence in floral morphology in (Saxifragaceae).

Proc Natl Acad Sci U S A

August 2025

Department of Plant Ecology and Evolution, Evolutionary Biology Centre, Uppsala University, Norbyvägen, Uppsala 18D, 752 36, Sweden.

Polyploidization is an important driver of evolution and diversification in flowering plants. Here, we assess how repeated polyploidization may have shaped diversification of floral morphology in (Saxifragaceae). This species comprises multiple cytotypes and varies geographically in its interactions with specialized pollinating moths in the genus (Prodoxidae).

View Article and Find Full Text PDF

Varigraph: An accurate and widely applicable pangenome graph-based variant genotyper for diploid and polyploid genomes.

Mol Plant

September 2025

National Key Laboratory for Germplasm Innovation and Utilization of Horticultural Crops, Huazhong Agricultural University, Wuhan, China; Hubei Hongshan Laboratory, Wuhan, China; Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, Chin

Accurate variant genotyping is crucial for genomics-assisted breeding. Graph pangenome references can address single-reference bias, thereby enhancing the performance of variant genotyping and empowering downstream applications in population genetics and quantitative genetics. However, existing pangenome-based genotyping methods are ineffective in handling large or complex pangenome graphs, particularly in polyploid genomes.

View Article and Find Full Text PDF

Near-complete genome assembly of allotetraploid Erianthus rockii reveals unique chromosome evolution and lineage-divergence trajectories in the Saccharum complex.

Plant Commun

July 2025

State Key Lab for Conservation and Utilization of Subtropical AgroBiological Resources, College of Life Science and Technology, Guangxi University, Nanning, Guangxi 530004, China. Electronic address:

The Saccharum complex is known for having one of the most intricate genomes among plants, primarily originating from autopolyploidization. Erianthus rockii (E. rockii), an allotetraploid species within the Saccharum complex, serves as a key phylogenetic reference for studying polyploidization in Saccharum.

View Article and Find Full Text PDF

Abundance of established polyploid lineages varies across lineages, evolutionary time, and geography, suggesting both genetics and environment play a role in polyploid persistence. We show Arabidopsis lyrata is the most polyploid-rich species complex in the Arabidopsis genus, with multiple origins of autotetraploidy. This is revealed by genomic data from over 400 A.

View Article and Find Full Text PDF