Allele coding in genomic evaluation.

Genet Sel Evol

Biotechnology and Food Research, MTT Agrifood Research Finland, FI-31600 Jokioinen, Finland.

Published: June 2011


Article Abstract

Background: Genomic data are used in animal breeding to assist genetic evaluation. Several models to estimate genomic breeding values have been studied. In general, two approaches have been used. The first approach estimates the marker effects and then obtains genomic breeding values by summing marker effects. In the second approach, genomic breeding values are estimated directly using an equivalent model with a genomic relationship matrix. Allele coding is the method chosen to assign values to the regression coefficients in the statistical model. A common allele coding is zero for the homozygous genotype of the first allele, one for the heterozygote, and two for the homozygous genotype of the other allele. Another common allele coding changes these regression coefficients by subtracting a value from each marker such that the mean of the regression coefficients is zero within each marker. We call this centered allele coding. This study considered the effects of different allele coding methods on inference. Both marker-based and equivalent models were considered, and restricted maximum likelihood and Bayesian methods were used in inference.
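The two codings described above can be illustrated with a small sketch. The genotype matrix and the helper name `center_genotypes` are hypothetical and chosen only for illustration; the value subtracted from each marker is assumed here to be the observed column mean, which is one way to make the within-marker mean zero.

```python
import numpy as np

def center_genotypes(M):
    """Subtract each marker's observed mean so every column averages to zero."""
    M = np.asarray(M, dtype=float)
    return M - M.mean(axis=0)

# Hypothetical genotypes: 4 animals (rows) x 3 markers (columns), coded 0/1/2.
M012 = np.array([[0, 1, 2],
                 [1, 1, 0],
                 [2, 0, 1],
                 [1, 2, 1]])

# Centered allele coding: same genotypes, shifted by a constant per marker.
Z = center_genotypes(M012)
print(Z.mean(axis=0))
```

Centering shifts every genotype of a marker by the same constant, so differences between animals within a marker are unchanged.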

Results: Theoretical derivations showed that parameter estimates and estimated marker effects in marker-based models are the same irrespective of the allele coding, provided that the model has a fixed general mean. For the equivalent models, the same results hold, even though different allele coding methods lead to different genomic relationship matrices. Calculated genomic breeding values are independent of allele coding when the estimate of the general mean is included in the values. Reliabilities of estimated genomic breeding values calculated using elements of the inverse of the coefficient matrix depend on the allele coding because different allele coding methods imply different models. Finally, allele coding affects the mixing of Markov chain Monte Carlo algorithms, with the centered coding being the best.
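The invariance of marker effects can be checked numerically with a minimal sketch. It is not the authors' implementation: it assumes a ridge-type marker model y = 1μ + Xa + e with a fixed general mean μ, simulated data, and a known variance ratio `lam` (residual variance over marker variance). The function name `snp_blup` and all data below are hypothetical.

```python
import numpy as np

def snp_blup(X, y, lam):
    """Solve mixed model equations for y = 1*mu + X a + e.

    mu is a fixed general mean (no shrinkage); marker effects a are
    shrunk with lam = var_e / var_a, assumed known for this sketch.
    """
    n, m = X.shape
    W = np.hstack([np.ones((n, 1)), X])
    penalty = np.diag(np.r_[0.0, np.full(m, lam)])
    sol = np.linalg.solve(W.T @ W + penalty, W.T @ y)
    return sol[0], sol[1:]

# Simulated (hypothetical) data: 50 animals, 10 markers coded 0/1/2.
rng = np.random.default_rng(1)
n, m = 50, 10
M012 = rng.integers(0, 3, size=(n, m)).astype(float)
y = 5.0 + M012 @ rng.normal(size=m) + rng.normal(size=n)
Z = M012 - M012.mean(axis=0)  # centered coding

mu_012, a_012 = snp_blup(M012, y, lam=2.0)
mu_cen, a_cen = snp_blup(Z, y, lam=2.0)

# Compare marker effects, then breeding values with the mean included.
same_effects = np.allclose(a_012, a_cen)
same_gebv = np.allclose(mu_012 + M012 @ a_012, mu_cen + Z @ a_cen)
# The implied relationship matrices (up to scaling) differ between codings.
same_G = np.allclose(M012 @ M012.T, Z @ Z.T)
print(same_effects, same_gebv, same_G)
```

In this sketch only the estimate of the general mean changes between codings; it absorbs the per-marker shifts, which is why the breeding values agree once the mean is included.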

Conclusions: Different allele coding methods lead to the same inference in the marker-based and equivalent models when a fixed general mean is included in the model. However, reliabilities of genomic breeding values are affected by the allele coding method used. The centered coding has some numerical advantages when Markov chain Monte Carlo methods are used.

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3154140
DOI: http://dx.doi.org/10.1186/1297-9686-43-25
