A powerful and versatile colocalization test.

PLoS Comput Biol

Division of Biostatistics, University of Minnesota, Minneapolis, Minnesota, United States of America.

Published: April 2020


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Transcriptome-wide association studies (TWAS and PrediXcan) have been increasingly applied to detect associations between genetically predicted gene expressions and GWAS traits, which may suggest, however do not completely determine, causal genes for GWAS traits, due to the likely violation of their imposed strong assumptions for causal inference. Testing colocalization moves it closer to establishing causal relationships: if a GWAS trait and a gene's expression share the same associated SNP, it may suggest a regulatory (and thus putative causal) role of the SNP mediated through the gene on the GWAS trait. Accordingly, it is of interest to develop and apply various colocalization testing approaches. The existing approaches may each have some severe limitations. For instance, some methods test the null hypothesis that there is colocalization, which is not ideal because often the null hypothesis cannot be rejected simply due to limited statistical power (with too small sample sizes). Some other methods arbitrarily restrict the maximum number of causal SNPs in a locus, which may lead to loss of power in the presence of wide-spread allelic heterogeneity. Importantly, most methods cannot be applied to either GWAS/eQTL summary statistics or cases with more than two possibly correlated traits. Here we present a simple and general approach based on conditional analysis of a locus on multiple traits, overcoming the above and other shortcomings of the existing methods. We demonstrate that, compared with other methods, our new method can be applied to a wider range of scenarios and often perform better. We showcase its applications to both simulated and real data, including a large-scale Alzheimer's disease GWAS summary dataset and a gene expression dataset, and a large-scale blood lipid GWAS summary association dataset. An R package "jointsum" implementing the proposed method is publicly available at github.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7176287PMC
http://dx.doi.org/10.1371/journal.pcbi.1007778DOI Listing

Publication Analysis

Top Keywords

gwas traits
8
gwas trait
8
null hypothesis
8
gwas summary
8
gwas
6
causal
5
methods
5
powerful versatile
4
colocalization
4
versatile colocalization
4

Similar Publications

Whole genome sequence analysis of low-density lipoprotein cholesterol across 246 K individuals.

Genome Biol

September 2025

Center for Genomic Medicine, Cardiovascular Research Center, , Massachusetts General Hospital Simches Research Center, 185 Cambridge Street, CPZN 5.238,, Boston, MA, 02114, USA.

Background: Rare genetic variation provided by whole genome sequence datasets has been relatively less explored for its contributions to human traits. Meta-analysis of sequencing data offers advantages by integrating larger sample sizes from diverse cohorts, thereby increasing the likelihood of discovering novel insights into complex traits. Furthermore, emerging methods in genome-wide rare variant association testing further improve power and interpretability.

View Article and Find Full Text PDF

Global wheat (Triticum aestivum L.) production faces significant challenges due to the destructive nature of leaf (Puccinia triticina; leaf rust [Lr]), stem (Puccinia graminis; stem rust [Sr]), and stripe (Puccinia striiformis; stripe rust [Yr]) rust diseases. Despite ongoing efforts to develop resistant varieties, these diseases remain a persistent challenge due to their highly evolving nature.

View Article and Find Full Text PDF

Systematic Exploration of Potential Druggable Genes for Ischemic Stroke Employing Genome-Wide Mendelian Randomization Analysis.

Brain Behav

September 2025

Department of Thoracic Surgery II, Department of Lung Transplantation, Organ Transplantation Center, the First Hospital of Jilin University, Changchun, China.

Background: Ischemic stroke (IS) treatment remains a significant challenge. This study aimed to identify potential druggable genes for IS using a systematic druggable genome-wide Mendelian Randomization (MR) analysis.

Methods: Two-sample MR analysis was conducted to identify the causal association between potential druggable genes and IS.

View Article and Find Full Text PDF

Breast cancer is a major health threat to women, with limited effective indicators for early screening and prognosis. The role of sphingosine 1-phosphate receptor 1 (S1PR1) in breast cancer remains controversial. This study aims to explore the potential causal relationship between S1PR1 and breast cancer risk, considering estrogen receptor (ER) status.

View Article and Find Full Text PDF

The purpose of this study was to investigate potential therapeutic targets for osteosarcoma (OS) and offer hints regarding genetic factors for OS treatment using a bioinformatics method. This study processed 3 OS datasets from the gene expression omnibus database using R software, screening for differentially expressed genes (DEGs). After enrichment analysis, based on expression quantitative trait loci data and the genome-wide association study data of OS, Mendelian randomization analysis was used to screen the genes closely related to OS disease, which intersect with DEGs to obtain co-expressed genes, validation datasets were employed to verify the results.

View Article and Find Full Text PDF