98%
921
2 minutes
20
Background: For marker effect models and genomic animal models, computational requirements increase with the number of loci and the number of genotyped individuals, respectively. In the latter case, the inverse genomic relationship matrix (GRM) is typically needed, which is computationally demanding to compute for large datasets. Thus, there is a great need for dimensionality-reduction methods that can analyze massive genomic data. For this purpose, we developed reduced-dimension singular value decomposition (SVD) based models for genomic prediction.
Methods: Fast SVD is performed by analyzing different chromosomes/genome segments in parallel and/or by restricting SVD to a limited core of genotyped individuals, producing chromosome- or segment-specific principal components (PC). Given a limited effective population size, nearly all the genetic variation can be effectively captured by a limited number of PC. Genomic prediction can then be performed either by PC ridge regression (PCRR) or by genomic animal models using an inverse GRM computed from the chosen PC (PCIG). In the latter case, computation of the inverse GRM will be feasible for any number of genotyped individuals and can be readily produced row- or element-wise.
Results: Using simulated data, we show that PCRR and PCIG models, using chromosome-wise SVD of a core sample of individuals, are appropriate for genomic prediction in a larger population, and results in virtually identical predicted breeding values as the original full-dimension genomic model (r = 1.000). Compared with other algorithms (e.g. algorithm for proven and young animals, APY), the (chromosome-wise SVD-based) PCRR and PCIG models were more robust to size of the core sample, giving nearly identical results even down to 500 core individuals. The method was also successfully tested on a large multi-breed dataset.
Conclusions: SVD can be used for dimensionality reduction of large genomic datasets. After SVD, genomic prediction using dense genomic data and many genotyped individuals can be done in a computationally efficient manner. Using this method, the resulting genomic estimated breeding values were virtually identical to those computed from a full-dimension genomic model.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5831701 | PMC |
http://dx.doi.org/10.1186/s12711-018-0373-2 | DOI Listing |
JMIR Res Protoc
September 2025
Department of Medical Oncology, Early Phase Unit, Georges-François Leclerc Centre, Dijon, France.
Background: Sarcomas are rare cancer with a heterogeneous group of tumors. They affect both genders across all age groups and present significant heterogeneity, with more than 70 histological subtypes. Despite tailored treatments, the high metastatic potential of sarcomas remains a major factor in poor patient survival, as metastasis is often the leading cause of death.
View Article and Find Full Text PDFMicrob Genom
September 2025
International Centre of Excellence for Aquatic Animal Health, The Centre for Environment, Fisheries and Aquaculture Science, Weymouth, DT4 8UB, UK.
High rates of mortality of the common cockle, , have occurred in the Wash Estuary, UK, since 2008. A previous study linked the mortalities to a novel genotype of , with a strong correlation between cockle moribundity and the presence of . Here, we characterize a novel iridovirus, identified by chance during metagenomic sequencing of a gradient purification of cells, with the presence also correlated to cockle moribundity.
View Article and Find Full Text PDFPLoS Negl Trop Dis
September 2025
Department of Clinical Science, Liverpool School of Tropical Medicine, Liverpool, United Kingdom.
Background: Salmonella enterica encompasses over 2,600 serovars, including several commonly associated with severe infection in humans. Salmonella is a major cause of sepsis in Africa; however, diagnosis requires clinical microbiology facilities. Environmental surveillance has the potential to play a role in Salmonella surveillance.
View Article and Find Full Text PDFPLoS One
September 2025
Department of Hepatobiliary Surgery, The First Affiliated Hospital of Kunming Medical University, Kunming, Yunnan, China.
Objective: This study employs integrated network toxicology and molecular docking to investigate the molecular basis underlying 4-nonylphenol (4-NP)-mediated enhancement of breast cancer susceptibility.
Methods: We integrated data from multiple databases, including ChEMBL, STITCH, Swiss Target Prediction, GeneCards, OMIM and TTD. Core compound-disease-associated target genes were identified through Protein-Protein Interaction (PPI) network analysis.
J Alzheimers Dis
September 2025
Institute for Public Health Genetics, University of Washington, Seattle, WA, USA.
Genetic risk prediction for Alzheimer's disease (AD) has high potential impact, yet few studies have assessed the reliability of various polygenic risk score (PRS) methods at the individual level. Here, we evaluated the reliability of AD PRS estimates among 6338 participants from the Multi-Ethnic Study of Atherosclerosis. We compared four PRS models that have been previously associated with dementia risk.
View Article and Find Full Text PDF