Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

The expansion of biobanks has significantly propelled genomic discoveries yet the sheer scale of data within these repositories poses formidable computational hurdles, particularly in handling extensive matrix operations required by prevailing statistical frameworks. In this work, we introduce computational optimizations to the SAIGE (Scalable and Accurate Implementation of Generalized Mixed Model) algorithm, notably employing a GPU-based distributed computing approach to tackle these challenges. We applied these optimizations to conduct a large-scale genome-wide association study (GWAS) across 2,068 phenotypes derived from electronic health records of 635,969 diverse participants from the Veterans Affairs (VA) Million Veteran Program (MVP). Our strategies enabled scaling up the analysis to over 6,000 nodes on the Department of Energy (DOE) Oak Ridge Leadership Computing Facility (OLCF) Summit High-Performance Computer (HPC), resulting in a 20-fold acceleration compared to the baseline model. We also provide a Docker container with our optimizations that was successfully used on multiple cloud infrastructures on UK Biobank and All of Us datasets where we showed significant time and cost benefits over the baseline SAIGE model.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11142062PMC
http://dx.doi.org/10.1101/2024.05.17.594583DOI Listing

Publication Analysis

Top Keywords

veteran program
8
accelerating genome-
4
genome- phenome-wide
4
phenome-wide association
4
association studies
4
studies gpus
4
gpus case
4
case study
4
study data
4
data veteran
4

Similar Publications

Introduction: Human papillomavirus (HPV) infection has been implicated in autoimmune processes, yet concerns remain about the potential autoimmune risks of HPV vaccination. Juvenile idiopathic arthritis (JIA) is a chronic autoimmune condition that typically manifests in childhood. The relationship between HPV vaccination and the development of JIA remains uncertain.

View Article and Find Full Text PDF

Background And Aim: The () gene plays a pivotal role in regulating growth, metabolism, and fat deposition in cattle. Genetic polymorphisms in this gene can influence phenotypic traits and may serve as molecular markers for selection in breeding programs. However, comprehensive characterization of gene variants in local Indonesian breeds, such as Madura cattle, remains limited.

View Article and Find Full Text PDF

Contingency Management for Stimulant Use Disorder and Association With Mortality: A Cohort Study.

Am J Psychiatry

September 2025

Michigan Innovations in Addiction Care Through Research and Education (MI-ACRE) Program, Department of Psychiatry, University of Michigan, Ann Arbor.

Objective: While opioid overdose has begun to decrease in recent years, stimulant overdose has continued to increase and has not been adequately addressed. Unlike opioid use disorder, there are no medications approved by the U.S.

View Article and Find Full Text PDF

De-implementation of established practices is a common challenge in infection prevention and antimicrobial stewardship and a necessary part of the life cycle of healthcare quality improvement programs. Promoting de-implementation of ineffective antimicrobial use and increasingly of low-value diagnostic testing are cornerstones of stewardship practice. Principles of de-implementation science and the interplay of implementation and de-implementation are discussed in part I of this Society for Healthcare Epidemiology of America White Paper Series.

View Article and Find Full Text PDF

Whole genome sequence analysis of low-density lipoprotein cholesterol across 246 K individuals.

Genome Biol

September 2025

Center for Genomic Medicine, Cardiovascular Research Center, , Massachusetts General Hospital Simches Research Center, 185 Cambridge Street, CPZN 5.238,, Boston, MA, 02114, USA.

Background: Rare genetic variation provided by whole genome sequence datasets has been relatively less explored for its contributions to human traits. Meta-analysis of sequencing data offers advantages by integrating larger sample sizes from diverse cohorts, thereby increasing the likelihood of discovering novel insights into complex traits. Furthermore, emerging methods in genome-wide rare variant association testing further improve power and interpretability.

View Article and Find Full Text PDF