Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Recent algorithmic advancements have enabled the inference of genome-wide ancestral recombination graphs (ARGs) from genomic data in large cohorts. These inferred ARGs provide a detailed representation of genealogical relatedness along the genome and have been shown to complement genotype imputation in complex trait analyses by capturing the effects of unobserved genomic variants. An inferred ARG can be used to construct a genetic relatedness matrix, which can be leveraged within a linear mixed model for the analysis of complex traits. However, these analyses are computationally infeasible for large datasets. We introduce a computationally efficient approach, called ARG-RHE, to estimate narrow-sense heritability and perform region-based association testing using an ARG. ARG-RHE leverages a method for computing genotype-matrix products from genealogical data in sublinear time, along with scalable randomized algorithms. This enables fast estimation of variance components and their statistical significance, supports parallel analysis of multiple quantitative traits, and facilitates other linear mixed-model analyses. We conduct extensive simulations to verify the computational efficiency, statistical power, and robustness of this approach. We then apply it to detect associations between 21,159 genes and 52 blood-related traits, using an ARG inferred from genotype data of 337,464 individuals from the UK Biobank. In these analyses, combining ARG-based and imputation-based testing yields 8% more gene-trait associations than using imputation alone, suggesting that inferred genome-wide genealogies may effectively complement genotype imputation in the analysis of complex traits.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC12330685PMC
http://dx.doi.org/10.1101/2024.08.31.610262DOI Listing

Publication Analysis

Top Keywords

ancestral recombination
8
recombination graphs
8
complement genotype
8
genotype imputation
8
analysis complex
8
complex traits
8
fast variance
4
variance component
4
analysis
4
component analysis
4

Similar Publications

The Ancestral Recombination Graph (ARG), which describes the genealogical history of a sample of genomes, is a vital tool in population genomics and biomedical research. Recent advancements have substantially increased ARG reconstruction scalability, but they rely on approximations that can reduce accuracy, especially under model misspecification. Moreover, they reconstruct only a single ARG topology and cannot quantify the considerable uncertainty associated with ARG inferences.

View Article and Find Full Text PDF

A model-free method for genealogical inference without phasing and its application for topology weighting.

Genetics

September 2025

Institute of Ecology and Evolution, School of Biological Sciences, The University of Edinburgh, Edinburgh, EH9 3FL, United Kingdom.

Recent advances in methods to infer and analyse ancestral recombination graphs (ARGs) are providing powerful new insights in evolutionary biology and beyond. Existing inference approaches tend to be designed for use with fully-phased datasets, and some rely on model assumptions about demography and recombination rate. Here I describe a simple model-free approach for genealogical inference along the genome from unphased genotype data called Sequential Tree Inference by Collecting Compatible Sites (sticcs).

View Article and Find Full Text PDF

Optimizing protein folding in prokaryotes: Strategies to enhance soluble expression of recombinant proteins.

Bioresour Technol

September 2025

State Key Laboratory of Food Science and Resources, Jiangnan University, 1800 Lihu Avenue, Wuxi 214122, China; School of Biotechnology and Key Laboratory of Industrial Biotechnology Ministry of Education, Jiangnan University, 1800 Lihu Avenue, Wuxi 214122, China; International Joint Laboratory on Fo

Recombinant proteins have been widely applied in the food, biomedical, and scientific fields. Prokaryotic expression systems are preferred platforms for recombinant protein production due to their rapid growth and high protein yields. Nevertheless, disparities between recombinant expression environment and native physiological conditions frequently result in protein misfolding, leading to aggregation into non-functional inclusion bodies or proteolytic degradation.

View Article and Find Full Text PDF

Phantom epistasis through the lens of genealogies.

Genetics

September 2025

Department of Statistics, University of Oxford, Oxford OX1 3LB, UK.

Phantom epistasis arises when, in the course of testing for gene-by-gene interactions, the omission of a causal variant with a purely additive effect on the phenotype causes the spurious inference of a significant interaction between two SNPs. This is more likely to arise when the two SNPs are in relatively close proximity, so while true epistasis between nearby variants could be commonplace, in practice there is no reliable way of telling apart true epistatic signals from false positives. By considering the causes of phantom epistasis from a genealogy-based perspective, we leverage the rich information contained within reconstructed genealogies (in the form of ancestral recombination graphs) to address this problem.

View Article and Find Full Text PDF

TNFRSF6B, commonly referred to as decoy receptor 3, interacts with TNFSF6, TNFSF14, and TNFSF15, thereby imparting anti-apoptotic and anti-inflammatory properties. This study identifies two isoforms, TNFRSF6B.1 and TNFRSF6B.

View Article and Find Full Text PDF