98%
921
2 minutes
20
Despite their suitability for studying evolution, many conifer species have large and repetitive giga-genomes (16-31 Gbp) that create hurdles to producing high coverage SNP data sets that capture diversity from across the entirety of the genome. Due in part to multiple ancient whole genome duplication events, gene family expansion and subsequent evolution within Pinaceae, false diversity from the misalignment of paralog copies creates further challenges in accurately and reproducibly inferring evolutionary history from sequence data. Here, we leverage the cost-saving benefits of pool-seq and exome-capture to discover SNPs in two conifer species, Douglas-fir (Pseudotsuga menziesii var. menziesii (Mirb.) Franco, Pinaceae) and jack pine (Pinus banksiana Lamb., Pinaceae). We show, using minimal baseline filtering, that allele frequencies estimated from pooled individuals show a strong, positive correlation with those estimated by sequencing the same population as individuals (r > .948), on par with such comparisons made in model organisms. Further, we highlight the utility of haploid megagametophyte tissue for identifying sites that are probably due to misaligned paralogs. Together with additional minor filtering, we show that it is possible to remove many of the loci with large frequency estimate discrepancies between individual and pooled sequencing approaches, improving the correlation further (r > .973). Our work addresses bioinformatic challenges in non-model organisms with large and complex genomes, highlights the use of megagametophyte tissue for the identification of paralogous artefacts, and suggests the combination of pool-seq and exome capture to be robust for further evolutionary hypothesis testing in these systems.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9292622 | PMC |
http://dx.doi.org/10.1111/1755-0998.13474 | DOI Listing |
NAR Genom Bioinform
September 2025
Department of Human Genetics and Genomic Medicine, University of Southampton, Southampton, SO16 6YD, UK.
Whole exome sequencing (WES) is a well-established tool for clinical diagnostics, is more cost-effective and faster to analyse than whole genome sequencing and has been implemented to uplift diagnostic rates in human disease. However, challenges remain to achieve comprehensive and uniform coverage of targets, and high sensitivity and specificity. Differences in genomic target regions and exome capture mechanism between kits may lead to differences in overall coverage uniformity and capture efficiency.
View Article and Find Full Text PDFBMC Genomics
September 2025
Element Biosciences, San Diego, CA, USA.
Background: Hybrid capture is a critical technology for selective enrichment of genomic regions of interest in genomic analysis. Despite its widespread adoption, the core methodology has remained largely unchanged for over 15 years, with traditional workflows involving time-consuming bead-based capture steps, multiple temperature-controlled washes, and post-hybridization PCR. These steps introduce workflow complexity, increase turnaround time, and can negatively impact library complexity and variant calling accuracy.
View Article and Find Full Text PDFNAR Genom Bioinform
September 2025
School of Mathematical and Statistical Sciences, University of Galway, University Road, Galway H91 TK33, Ireland.
Oxidative damage can introduce G>T mutations upon DNA replication. When this damage occurs , sequenced DNA exhibits strand asymmetry, whereby sequence alignment yields G>T mismatches without corresponding C>A mismatches on the complementary strand at a given locus. Strand asymmetry is used to identify potential sequencing artefacts in somatic variant calls in cancer sequencing projects.
View Article and Find Full Text PDFNPJ Genom Med
August 2025
Department of Human Genetics, Yokohama City University Graduate School of Medicine, Yokohama, Kanagawa, Japan.
Short-read genome sequencing (GS) is a powerful technique for investigating the genetic etiologies of rare diseases, capturing diverse genetic variations that are challenging to approach with exome sequencing (ES). We performed GS on 260 families with intellectual disability/developmental delay. GS detected potentially disease-related variants in 55 of the 260 families, with structural resolution by long-read sequencing or optical genome mapping, and functional assessment by RNA sequencing.
View Article and Find Full Text PDFBMC Cancer
August 2025
Department of Internal Medicine, Seoul National University Hospital, and Seoul National University Cancer Research Institute, Seoul, Korea.
Background: Tumor-informed circulating tumor DNA (ctDNA) analysis allows for the sensitive detection of minimal residual disease (MRD) and has the potential to enhance patient stratification for adjuvant chemotherapy. We hypothesize that intensifying adjuvant chemotherapy in colon cancer patients with postoperative MRD positivity may reduce recurrence and improve survival outcomes.
Methods: This multi-center platform trial (NCT05534087) consists of a prospective observational study (Part 1) and an interventional randomized trial (Part 2).