Prioritized SNP Selection from Whole-Genome Sequencing Improves Genomic Prediction Accuracy in Sturgeons Using Linear and Machine Learning Models.

Int J Mol Sci

Fisheries Science Institute, Beijing Academy of Agriculture and Forestry Sciences & Beijing Key Laboratory of Fisheries Biotechnology, Beijing 100068, China.

Published: July 2025


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Genomic prediction has emerged as a powerful tool in aquaculture breeding, but its effectiveness depends on the careful selection of informative single nucleotide polymorphisms (SNPs) and the application of appropriate prediction models. This study aimed to enhance genomic prediction accuracy in Russian sturgeon () by optimizing SNP selection strategies and exploring the performance of linear and machine learning models. Three economically important traits-caviar yield, caviar color, and body weight-were selected due to their direct relevance to breeding goals and market value. Whole-genome sequencing (WGS) data were obtained from 971 individuals with an average sequencing depth of 13.52×. To reduce marker density and eliminate redundancy, three SNP selection strategies were applied: (1) genome-wide association study (GWAS)-based prioritization to select trait-associated SNPs; (2) linkage disequilibrium (LD) pruning to retain independent markers; and (3) random sampling as a control. Genomic prediction was conducted using both linear (e.g., GBLUP) and machine learning models (e.g., random forest) across varying SNP densities (1 K to 50 K). Results showed that GWAS-based SNP selection consistently outperformed other strategies, especially at moderate densities (≥10 K), improving prediction accuracy by up to 3.4% compared to the full WGS dataset. LD-based selection at higher densities (30 K and 50 K) achieved comparable performance to full WGS. Notably, machine learning models, particularly random forest, exceeded the performance of linear models, yielding an additional 2.0% increase in accuracy when combined with GWAS-selected SNPs. In conclusion, integrating WGS data with GWAS-informed SNP selection and advanced machine learning models offers a promising framework for improving genomic prediction in sturgeon and holds promise for broader applications in aquaculture breeding programs.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC12295944PMC
http://dx.doi.org/10.3390/ijms26147007DOI Listing

Publication Analysis

Top Keywords

snp selection
20
genomic prediction
20
machine learning
20
learning models
20
prediction accuracy
12
whole-genome sequencing
8
linear machine
8
aquaculture breeding
8
selection strategies
8
performance linear
8

Similar Publications

Liver abscesses are a concern in feedlot cattle, and little is known about the role of genetics in their development. This study aimed to estimate genetic parameters and to identify single nucleotide polymorphisms (SNP) associated with liver abscesses. Crossbred cattle representing 18 breeds in the United States Meat Animal Research Center Germplasm Evaluation Program were phenotyped for liver abscesses at slaughter (n = 9,044).

View Article and Find Full Text PDF

Bacterial leaf streak (BLS), caused by pv. (), has recently emerged as a significant threat to wheat production in the Northern Great Plains region of the US. Deploying resistant cultivars is an economical and practical method of controlling BLS.

View Article and Find Full Text PDF

Genome-wide association study reveals candidate loci for resistance to anthracnose in blueberry.

G3 (Bethesda)

September 2025

Blueberry Breeding and Genomics Lab, Horticultural Sciences Department, University of Florida, Gainesville, FL 32611, USA.

Anthracnose, caused by Colletotrichum gloeosporioides, poses a significant threat to blueberries, necessitating a deeper understanding of the genetic mechanisms underlying resistance to develop efficient breeding strategies. Here, we conducted a genome-wide association study on 355 advanced selections of southern highbush blueberry from the University of Florida Blueberry Breeding and Genomics Program. Visual scores and image analyses were used for assessing disease severity.

View Article and Find Full Text PDF

De novo inherited Xq25 deletion: hints from preimplantation genetic testing in alobar holoprosencephaly.

Eur J Obstet Gynecol Reprod Biol

August 2025

Reproductive Medicine Center, Shenzhen Maternity and Child Healthcare Hospital, Southern Medical University, Shenzhen 518000 Guangdong, China; Shenzhen Clinical Research Center for Obstetrics & Gynecology and Reproductive System Diseases, Shenzhen 518000 Guangdong, China. Electronic address: szfyart

Objective: This study investigates the association between alobar holoprosencephaly (HPE) and de novo germline microdeletions in the Xq25 region. To develop a Preimplantation Genetic Testing for Monogenic Disorders (PGT-M) based workflow enabling high-resolution preimplantation detection of sub-Mb microdeletions, overcoming the >1 Mb resolution limit of conventional whole genome amplification(WGA) copy number variation(CNV) sequencing to identify causative Xq25 variants and prevent pathogenic microdeletion transmission.

Methods: This study presents a clinical case involving a couple with an adverse obstetric history accompanied by two occurrences of HPE.

View Article and Find Full Text PDF

Nitrogen (N), phosphorus (P), and sulfur (S) are essential nutrients for plant health. Deficiencies in N, P, or S in plants lead to lower seed production and seed quality in grain crops, including soybean seed. Soybean seed is a source of protein, oil, essential amino acids, and minerals.

View Article and Find Full Text PDF