Comparison of three assembly strategies for a heterozygous seedless grapevine genome assembly.

BMC Genomics

Agronomy, Horticulture and Plant Science Department and BioSNTR, 247 McFadden BioStress Laboratory, South Dakota State University, Brookings, SD, 57006, USA.

Published: January 2018


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Background: De novo heterozygous assembly is an ongoing challenge requiring improved assembly approaches. In this study, three strategies were used to develop de novo Vitis vinifera 'Sultanina' genome assemblies for comparison with the inbred V. vinifera (PN40024 12X.v2) reference genome and a published Sultanina ALLPATHS-LG assembly (AP). The strategies were: 1) a default PLATANUS assembly (PLAT_d) for direct comparison with AP assembly, 2) an iterative merging strategy using METASSEMBLER to combine PLAT_d and AP assemblies (MERGE) and 3) PLATANUS parameter modifications plus GapCloser (PLAT*_GC).

Results: The three new assemblies were greater in size than the AP assembly. PLAT*_GC had the greatest number of scaffolds aligning with a minimum of 95% identity and ≥1000 bp alignment length to V. vinifera (PN40024 12X.v2) reference genome. SNP analysis also identified additional high quality SNPs. A greater number of sequence reads mapped back with zero-mismatch to the PLAT_d, MERGE, and PLAT*_GC (>94%) than was found in the AP assembly (87%) indicating a greater fidelity to the original sequence data in the new assemblies than in AP assembly. A de novo gene prediction conducted using seedless RNA-seq data predicted > 30,000 coding sequences for the three new de novo assemblies, with the greatest number (30,544) in PLAT*_GC and only 26,515 for the AP assembly. Transcription factor analysis indicated good family coverage, but some genes found in the VCOST.v3 annotation were not identified in any of the de novo assemblies, particularly some from  the MYB and ERF families.

Conclusions: The PLAT_d and PLAT*_GC had a greater number of synteny blocks with the V. vinifera (PN40024 12X.v2) reference genome than AP or MERGE. PLAT*_GC provided the most contiguous assembly with only 1.2% scaffold N, in contrast to AP (10.7% N), PLAT_d (6.6% N) and Merge (6.4% N). A PLAT*_GC pseudo-chromosome assembly with chromosome alignment to the reference genome V. vinifera, (PN40024 12X.v2) provides new information for use in seedless grape genetic mapping studies. An annotated de novo gene prediction for the PLAT*_GC assembly, aligned with VitisNet pathways provides new seedless grapevine specific transcriptomic resource that has excellent fidelity with the seedless short read sequence data.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5773036PMC
http://dx.doi.org/10.1186/s12864-018-4434-2DOI Listing

Publication Analysis

Top Keywords

vinifera pn40024
16
pn40024 12xv2
16
reference genome
16
assembly
14
12xv2 reference
12
assembly strategies
8
seedless grapevine
8
greatest number
8
greater number
8
merge plat*_gc
8

Similar Publications

Mgaloblishvili, a grapevine variety from Georgia (Southern Caucasus), exhibits a unique resistance mechanism against downy mildew. Mgaloblishvili resistance mechanism, involving pathogen recognition, activation of ethylene signalling pathway, and structural and chemical defences, is mediated by the resistance loci , , and . Mgaloblishvili genome was sequenced using PacBio HiFi, resulting in a chromosome-scale diploid assembly of 986 Mbp, including 58 912 predicted protein-coding genes across two phased chromosome sets.

View Article and Find Full Text PDF
Article Synopsis
  • Pathogens like oomycetes cause significant yield loss in viticulture, leading to the need for extensive pesticide use; breeders are developing resistant grape varieties using wild germplasm.
  • Researchers created a genetic map from 244 F individuals by crossing a susceptible cultivar 'Tigvoasa' with a resistant breeding line, analyzing 627 molecular markers to find resistance traits.
  • The study identified a major quantitative trait locus (QTL) on linkage Group 9 that is strongly associated with downy mildew resistance and may differ from previously known resistant loci, indicating a novel genetic variant.
View Article and Find Full Text PDF

Thanks to several backcrosses with an initial L. × (previously ) interspecific cross, the MrRUN1/MrRPV1 locus (resistance to downy and powdery mildews) was introgressed in genotypes phenotypically close to varieties. To check the consequences of introgressing parts of the genome on gene expression during fruit development, we conducted a comparative RNA-seq study on single berries from different cultivars and × hybrids, including 'G5' and two derivative microvine lines, 'MV102' (resistant) and 'MV32' (susceptible) segregating for the MrRUN1/RPV1 locus.

View Article and Find Full Text PDF

Background: Grapevine (Vitis) is one of the world's most valuable fruit crops, but insect herbivory can decrease yields. Understanding insect herbivory resistance is critical to mitigating these losses. Vitis labrusca, a wild North American grapevine species, has been leveraged in breeding programs to generate hybrid grapevines with enhanced abiotic and biotic stress resistance, rendering it a valuable genetic resource for sustainable viticulture.

View Article and Find Full Text PDF

genome assembly reveals diversification between wild and cultivated grapevine genomes.

Front Plant Sci

August 2023

Department of Horticulture and Crop Science, The Ohio State University, Columbus, OH, United States.

Wild grapevines are important genetic resources in breeding programs to confer adaptive fitness traits and unique fruit characteristics, but the genetics underlying these traits, and their evolutionary origins, are largely unknown. To determine the factors that contributed to grapevine genome diversification, we performed comprehensive intragenomic and intergenomic analyses with three cultivated European (including the PN40024 reference genome) and two wild North American grapevine genomes, including our newly released genome. We found the heterozygosity of the cultivated grapevine genomes was twice as high as the wild grapevine genomes studied.

View Article and Find Full Text PDF