Knockoff procedure improves susceptibility gene identifications in conditional transcriptome-wide association studies.

Am J Hum Genet

Department of Biostatistics, School of Public Health, Yale University, New Haven, CT, USA. Electronic address:

Published: August 2025


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Transcriptome-wide association studies (TWASs) have been developed to identify candidate genes associated with complex traits by integrating genome-wide association studies (GWASs) with expression quantitative trait loci (eQTL) data. However, most existing TWAS methods assess the marginal association between a single gene and a trait of interest, ignoring the influence of other genes in the same genomic region. Furthermore, false-positive gene-trait associations may arise due to correlations between eQTLs and nearby causal genetic variants. We introduce TWASKnockoff, a knockoff-based framework for detecting susceptibility genes using GWAS summary statistics and eQTL data. Unlike traditional TWAS approaches that rely on marginal testing, TWASKnockoff evaluates the conditional independence of each gene-trait pair, accounting for both cis-predicted expression correlations across genes and correlations between gene expression levels and genetic variants. TWASKnockoff estimates the correlation matrix of all genetic elements (including cis-predicted gene expression levels and genetic variant genotypes) by averaging estimations from parametric bootstrap samples, then applies knockoff-based inference to identify susceptibility genes while controlling the false discovery rate (FDR). Through simulations and an application to type 2 diabetes mellitus (T2D) data, we demonstrate that TWASKnockoff achieves superior FDR control and enhances power in detecting relevant gene-trait pairs at a fixed FDR level.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC12412983PMC
http://dx.doi.org/10.1016/j.ajhg.2025.08.007DOI Listing

Publication Analysis

Top Keywords

association studies
12
transcriptome-wide association
8
eqtl data
8
genetic variants
8
susceptibility genes
8
gene expression
8
expression levels
8
levels genetic
8
genes
5
knockoff procedure
4

Similar Publications

Spinal muscular atrophy (SMA) is a neuromuscular disease caused by low levels of SMN protein. Several therapeutic approaches boosting SMN are approved for human patients, delivering remarkable improvements in lifespan and symptoms. However, emerging phenotypes, including neurodevelopmental comorbidities, are being reported in some treated SMA patients, indicative of alterations in brain development.

View Article and Find Full Text PDF

Cardiac hypertrophy is a common adaptation to cardiovascular stress and often a prelude to heart failure. We examined how S-palmitoylation of the small GTPase, Ras-related C3 botulinum toxin substrate 1 (Rac1), impacts cardiomyocyte stress signaling. Mutation of the cysteine-178 palmitoylation site impaired activation of Rac1 when overexpressed in cardiomyocytes.

View Article and Find Full Text PDF

Role of Systemic Glucocorticoids in Reducing IgA and Galactose-Deficient IgA1 Levels in IgA Nephropathy.

Clin J Am Soc Nephrol

September 2025

Kidney Division, Peking University First Hospital, Peking University Institute of Nephrology; Key Laboratory of Kidney Disease, Ministry of Health of China; Key Laboratory of Chronic Kidney Disease Prevention and Treatment (Peking University), Ministry of Education, China.

Background: The Therapeutic Effects of Steroids in IgA Nephropathy Global (TESTING) trial demonstrated that glucocorticoid therapy reduced proteinuria and improved kidney outcomes in patients with Immunoglobulin A Nephropathy (IgAN). Galactose-deficient IgA1 (Gd-IgA1) plays a central role in IgAN pathogenesis by promoting immune complex formation. However, the effects of glucocorticoid on pathogenic IgA levels remain unclear.

View Article and Find Full Text PDF

Background: Following SARS-CoV-2 infection, ~10-35% of COVID-19 patients experience long COVID (LC), in which debilitating symptoms persist for at least three months. Elucidating biologic underpinnings of LC could identify therapeutic opportunities.

Methods: We utilized machine learning methods on biologic analytes provided over 12-months after hospital discharge from >500 COVID-19 patients in the IMPACC cohort to identify a multi-omics "recovery factor", trained on patient-reported physical function survey scores.

View Article and Find Full Text PDF