Efficient count-based models improve power and robustness for large-scale single-cell eQTL mapping.

medRxiv

Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California.

Published: March 2025


Article Abstract

Population-scale single-cell transcriptomic technologies (scRNA-seq) enable characterizing variant effects on gene regulation at the cellular level (e.g., single-cell eQTLs; sc-eQTLs). However, existing sc-eQTL mapping approaches are either not designed for analyzing sparse counts in scRNA-seq data or can become intractable in extremely large datasets. Here, we propose jaxQTL, a flexible and efficient sc-eQTL mapping framework using highly efficient count-based models given pseudobulk data. Using extensive simulations, we demonstrated that jaxQTL with a negative binomial model outperformed other models in identifying sc-eQTLs, while maintaining a calibrated type I error. We applied jaxQTL across 14 cell types of OneK1K scRNA-seq data (N=982), and identified 11-16% more eGenes compared with existing approaches, primarily driven by jaxQTL's ability to identify lowly expressed eGenes. We observed that fine-mapped sc-eQTLs were further from the transcription start site (TSS) than fine-mapped eQTLs identified in all cells (bulk-eQTLs; =1x10) and more enriched in cell-type-specific enhancers (=3x10), suggesting that sc-eQTLs improve our ability to identify distal eQTLs that are missed in bulk tissues. Overall, the genetic effects of fine-mapped sc-eQTLs were largely shared across cell types, with cell-type-specificity increasing with distance to TSS. Lastly, we observed that sc-eQTLs explain more SNP-heritability than bulk-eQTLs (9.90 ± 0.88% vs. 6.10 ± 0.76% when meta-analyzed across 16 blood and immune-related traits), improving but not closing the missing link between GWAS and eQTLs. As an example, we highlight that sc-eQTLs in T cells (unlike bulk-eQTLs) can successfully nominate as a candidate gene for rheumatoid arthritis. Overall, jaxQTL provides an efficient and powerful approach using count-based models to identify missing disease-associated eQTLs.
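
To make the idea of a count-based model on pseudobulk data concrete, the following is a minimal sketch of a log-link negative binomial regression of per-donor counts on genotype, with log library size as an offset. It is not jaxQTL's API or the authors' implementation: the function names `simulate_pseudobulk` and `fit_nb_glm`, the simulated data, and the choice to treat the dispersion `alpha` as known are all assumptions made for illustration.

```python
import numpy as np


def simulate_pseudobulk(n_donors=982, beta_g=0.3, alpha=0.2, seed=0):
    """Simulate toy pseudobulk counts for one gene and one SNP (hypothetical data)."""
    rng = np.random.default_rng(seed)
    genotype = rng.binomial(2, 0.3, size=n_donors).astype(float)  # allele dosage 0/1/2
    lib_size = rng.uniform(2e4, 1e5, size=n_donors)               # total counts per donor
    mu = lib_size * np.exp(np.log(1e-4) + beta_g * genotype)      # expected gene count
    # A gamma-Poisson mixture yields NB counts with Var(y) = mu + alpha * mu^2.
    lam = rng.gamma(shape=1.0 / alpha, scale=alpha * mu)
    y = rng.poisson(lam).astype(float)
    X = np.column_stack([np.ones(n_donors), genotype])            # intercept, genotype
    return y, X, np.log(lib_size)


def fit_nb_glm(y, X, offset, alpha, n_iter=50, tol=1e-8):
    """Fit a log-link NB GLM by IRLS, assuming a known dispersion alpha.

    The first column of X is assumed to be the intercept.
    Returns the coefficient estimates and their Wald standard errors.
    """
    beta = np.zeros(X.shape[1])
    # Start the intercept at the log of the average count per unit of library size.
    beta[0] = np.log(max(y.mean(), 1e-8)) - np.log(np.exp(offset).mean())
    for _ in range(n_iter):
        eta = X @ beta + offset
        mu = np.exp(eta)
        w = mu / (1.0 + alpha * mu)          # IRLS weight: (dmu/deta)^2 / Var(y)
        z = (eta - offset) + (y - mu) / mu   # working response with the offset removed
        XtW = X.T * w
        beta_new = np.linalg.solve(XtW @ X, XtW @ z)
        converged = np.max(np.abs(beta_new - beta)) < tol
        beta = beta_new
        if converged:
            break
    mu = np.exp(X @ beta + offset)
    w = mu / (1.0 + alpha * mu)
    cov = np.linalg.inv((X.T * w) @ X)       # inverse Fisher information for fixed alpha
    return beta, np.sqrt(np.diag(cov))


if __name__ == "__main__":
    y, X, offset = simulate_pseudobulk()
    beta, se = fit_nb_glm(y, X, offset, alpha=0.2)
    print("genotype effect:", beta[1], "Wald z:", beta[1] / se[1])
```

In practice the dispersion would be estimated per gene rather than fixed, and covariates such as age, sex, and genotype principal components would be added as extra columns of `X`; both are omitted here to keep the sketch short.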

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11908335
DOI: http://dx.doi.org/10.1101/2025.01.18.25320755
