Article Abstract

Background: The availability of large cohorts of whole-genome sequenced individuals, combined with functional annotation, is expected to provide opportunities to improve the accuracy of genomic selection (GS). However, such benefits have not often been observed in initial applications. The reference population for GS in Belgian Blue Cattle (BBC) continues to grow. Combined with the availability of reference panels of sequenced individuals, it provides an opportunity to evaluate GS models using whole genome sequence (WGS) data and functional annotation.

Results: Here, we used data from 16,508 cows with phenotypes for five muscular development traits and genotypes imputed to the WGS level, in combination with in silico functional annotation and catalogs of putative regulatory variants obtained from experimental data. We first evaluated GS models using the entire WGS data, with or without functional annotation. At this marker density, we were able to run two approaches, assuming either a highly polygenic architecture (GBLUP) or allowing some variants to have larger effects (BayesRR-RC, a Bayesian mixture model), and observed increased reliability compared to the official GBLUP model at medium marker density (average gains of 0.016 and 0.018 for GBLUP and BayesRR-RC, respectively). When functional annotation was used, we observed slightly higher reliabilities with an extension of GBLUP that included multiple polygenic terms (one per functional group), while reliabilities decreased with BayesRR-RC. We then used large subsets of variants selected based on functional information or with a linkage disequilibrium (LD) pruning approach, which allowed us to evaluate two additional approaches, BayesCπ and Bayesian Sparse Linear Mixed Model (BSLMM). Reliabilities were higher for these panels than for the WGS data, with the highest accuracies obtained when markers were selected based on functional information. In our setting, BSLMM systematically achieved higher reliabilities than other methods.
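For readers unfamiliar with GBLUP, the sketch below illustrates the general idea of the polygenic model mentioned above: build a genomic relationship matrix from marker genotypes, then predict genomic values from phenotyped animals. It is a minimal illustration and not the authors' pipeline. The function names (`vanraden_grm`, `gblup_predict`), the fixed heritability `h2`, and the use of the phenotype mean as the only fixed effect are assumptions made for the example; routine evaluations estimate variance components (for instance by REML) and fit additional fixed effects.

```python
import numpy as np


def vanraden_grm(genotypes):
    """Genomic relationship matrix from a (animals x markers) array of 0/1/2 allele counts.

    Uses the common VanRaden (method 1) scaling; assumes at least one marker is polymorphic.
    """
    p = genotypes.mean(axis=0) / 2.0        # observed allele frequency per marker
    centered = genotypes - 2.0 * p          # centre each marker column
    scale = 2.0 * np.sum(p * (1.0 - p))     # expected summed marker variance
    return centered @ centered.T / scale


def gblup_predict(grm, phenotypes, train_idx, h2):
    """Predict genomic values for all animals from the phenotyped (training) animals.

    h2 is an ASSUMED heritability supplied by the caller, not an estimate.
    The overall phenotype mean is the only fixed effect (a simplification).
    """
    train_idx = np.asarray(train_idx)
    lam = (1.0 - h2) / h2                   # residual-to-genetic variance ratio
    y = phenotypes[train_idx]
    mu = y.mean()
    g_tt = grm[np.ix_(train_idx, train_idx)]
    # Solve (G_tt + lambda * I) alpha = y - mu, then project onto all animals.
    alpha = np.linalg.solve(g_tt + lam * np.eye(len(train_idx)), y - mu)
    return mu + grm[:, train_idx] @ alpha
```

With an array `geno` of allele counts and a phenotype vector `y`, a call such as `gblup_predict(vanraden_grm(geno), y, train_idx, h2=0.4)` returns one predicted value per animal, including those without phenotypes. The multi-component extension described above would replace the single relationship matrix with one matrix per functional group, each with its own variance.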

Conclusions: GS with large panels of functional variants selected from WGS data allowed a significant increase in reliability compared to the official genomic evaluation approach. However, the benefits of using WGS and functional data remained modest, indicating that there is still room for improvement, for example by further refining the functional annotation in the BBC breed.

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11881496
DOI: http://dx.doi.org/10.1186/s12711-025-00955-5
