98%
921
2 minutes
20
Recent algorithmic advancements have enabled the inference of genome-wide ancestral recombination graphs (ARGs) from genomic data in large cohorts. These inferred ARGs provide a detailed representation of genealogical relatedness along the genome and have been shown to complement genotype imputation in complex trait analyses by capturing the effects of unobserved genomic variants. An inferred ARG can be used to construct a genetic relatedness matrix, which can be leveraged within a linear mixed model for the analysis of complex traits. However, these analyses are computationally infeasible for large datasets. We introduce a computationally efficient approach, called ARG-RHE, to estimate narrow-sense heritability and perform region-based association testing using an ARG. ARG-RHE leverages a method for computing genotype-matrix products from genealogical data in sublinear time, along with scalable randomized algorithms. This enables fast estimation of variance components and their statistical significance, supports parallel analysis of multiple quantitative traits, and facilitates other linear mixed-model analyses. We conduct extensive simulations to verify the computational efficiency, statistical power, and robustness of this approach. We then apply it to detect associations between 21,159 genes and 52 blood-related traits, using an ARG inferred from genotype data of 337,464 individuals from the UK Biobank. In these analyses, combining ARG-based and imputation-based testing yields 8% more gene-trait associations than using imputation alone, suggesting that inferred genome-wide genealogies may effectively complement genotype imputation in the analysis of complex traits.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC12330685 | PMC |
http://dx.doi.org/10.1101/2024.08.31.610262 | DOI Listing |
Nat Genet
September 2025
Department of Statistics, University of California, Berkeley, CA, USA.
The Ancestral Recombination Graph (ARG), which describes the genealogical history of a sample of genomes, is a vital tool in population genomics and biomedical research. Recent advancements have substantially increased ARG reconstruction scalability, but they rely on approximations that can reduce accuracy, especially under model misspecification. Moreover, they reconstruct only a single ARG topology and cannot quantify the considerable uncertainty associated with ARG inferences.
View Article and Find Full Text PDFGenetics
September 2025
Institute of Ecology and Evolution, School of Biological Sciences, The University of Edinburgh, Edinburgh, EH9 3FL, United Kingdom.
Recent advances in methods to infer and analyse ancestral recombination graphs (ARGs) are providing powerful new insights in evolutionary biology and beyond. Existing inference approaches tend to be designed for use with fully-phased datasets, and some rely on model assumptions about demography and recombination rate. Here I describe a simple model-free approach for genealogical inference along the genome from unphased genotype data called Sequential Tree Inference by Collecting Compatible Sites (sticcs).
View Article and Find Full Text PDFBioresour Technol
September 2025
State Key Laboratory of Food Science and Resources, Jiangnan University, 1800 Lihu Avenue, Wuxi 214122, China; School of Biotechnology and Key Laboratory of Industrial Biotechnology Ministry of Education, Jiangnan University, 1800 Lihu Avenue, Wuxi 214122, China; International Joint Laboratory on Fo
Recombinant proteins have been widely applied in the food, biomedical, and scientific fields. Prokaryotic expression systems are preferred platforms for recombinant protein production due to their rapid growth and high protein yields. Nevertheless, disparities between recombinant expression environment and native physiological conditions frequently result in protein misfolding, leading to aggregation into non-functional inclusion bodies or proteolytic degradation.
View Article and Find Full Text PDFGenetics
September 2025
Department of Statistics, University of Oxford, Oxford OX1 3LB, UK.
Phantom epistasis arises when, in the course of testing for gene-by-gene interactions, the omission of a causal variant with a purely additive effect on the phenotype causes the spurious inference of a significant interaction between two SNPs. This is more likely to arise when the two SNPs are in relatively close proximity, so while true epistasis between nearby variants could be commonplace, in practice there is no reliable way of telling apart true epistatic signals from false positives. By considering the causes of phantom epistasis from a genealogy-based perspective, we leverage the rich information contained within reconstructed genealogies (in the form of ancestral recombination graphs) to address this problem.
View Article and Find Full Text PDFDev Comp Immunol
September 2025
Department of Aquatic Life Medicine, Gangneung-Wonju National University, Gangneung, Korea. Electronic address:
TNFRSF6B, commonly referred to as decoy receptor 3, interacts with TNFSF6, TNFSF14, and TNFSF15, thereby imparting anti-apoptotic and anti-inflammatory properties. This study identifies two isoforms, TNFRSF6B.1 and TNFRSF6B.
View Article and Find Full Text PDF