Background: Polygenic risk scores (PRSs) improve type 2 diabetes (T2D) prediction beyond clinical risk factors but perform poorly in non-European populations, where T2D burden is often higher, undermining their global clinical utility.
Methods: We conducted the largest global effort to date to harmonize T2D genome-wide association study (GWAS) meta-analyses across five ancestries-European (EUR), African/African American (AFR), Admixed American (AMR), South Asian (SAS), and East Asian (EAS)-including 360,000 T2D cases and 1·8 million controls (41% non-EUR). We constructed ancestry-specific and multi-ancestry PRSs in training datasets including 11,000 T2D cases and 32,000 controls, and validated their performance in independent datasets including 39,000 T2D cases and 126,000 controls of diverse ancestries.
Sharing diverse genomic and other biomedical datasets is critical to advancing scientific discoveries and their equitable translation to improve human health. However, data sharing remains challenging in the context of legacy datasets, evolving policies, multi-institutional consortium science, and international stakeholders. The NIH-funded Polygenic Risk Methods in Diverse Populations (PRIMED) Consortium was established to improve the performance of polygenic risk estimates for a broad range of health and disease outcomes with global impacts.
View Article and Find Full Text PDFHere, we present a multi-omics study of type 2 diabetes and quantitative blood lipid and lipoprotein traits conducted to date in Hispanic/Latino populations (n = 63,184). We conduct a meta-analysis of 16 type 2 diabetes and 19 lipid trait GWAS, identifying 20 genome-wide significant loci for type 2 diabetes, including one novel locus and novel signals at two known loci, based on fine-mapping. We also identify sixty-one genome-wide significant loci across the lipid/lipoprotein traits, including nine novel loci, and novel signals at 19 known loci through fine-mapping.
View Article and Find Full Text PDFAm J Hum Genet
February 2025
Mosaic loss of Y (mLOY) is the most common somatic chromosomal alteration detected in human blood. The presence of mLOY is associated with altered blood cell counts and increased risk of Alzheimer disease, solid tumors, and other age-related diseases. We sought to gain a better understanding of genetic drivers and associated phenotypes of mLOY through analyses of whole-genome sequencing (WGS) of a large set of genetically diverse males from the Trans-Omics for Precision Medicine (TOPMed) program.
View Article and Find Full Text PDFPolygenic risk scores (PRSs) depend on genetic ancestry due to differences in allele frequencies between ancestral populations. This leads to implementation challenges in diverse populations. We propose a framework to calibrate PRS based on ancestral makeup.
View Article and Find Full Text PDFChronic () airway infection is common and a key contributor to diminished lung function and early mortality in persons with cystic fibrosis (PwCF). Risk factors for chronic among PwCF include (cystic fibrosis transmembrane conductance regulator) genotype, genetic modifiers, and environmental factors. Intensive antibiotic therapy and highly effective modulators do not eradicate in most adolescents and adults with cystic fibrosis.
View Article and Find Full Text PDFAm J Hum Genet
December 2024
Clonal hematopoiesis (CH) is characterized by the acquisition of a somatic mutation in a hematopoietic stem cell that results in a clonal expansion. These driver mutations can be single nucleotide variants in cancer driver genes or larger structural rearrangements called mosaic chromosomal alterations (mCAs). The factors that influence the variations in mCA fitness and ultimately result in different clonal expansion rates are not well understood.
View Article and Find Full Text PDFMosaic loss of Y (mLOY) is the most common somatic chromosomal alteration detected in human blood. The presence of mLOY is associated with altered blood cell counts and increased risk of Alzheimer's disease, solid tumors, and other age-related diseases. We sought to gain a better understanding of genetic drivers and associated phenotypes of mLOY through analyses of whole genome sequencing of a large set of genetically diverse males from the Trans-Omics for Precision Medicine (TOPMed) program.
View Article and Find Full Text PDFClonal hematopoiesis (CH) is characterized by the acquisition of a somatic mutation in a hematopoietic stem cell that results in a clonal expansion. These driver mutations can be single nucleotide variants in cancer driver genes or larger structural rearrangements called mosaic chromosomal alterations (mCAs). The factors that influence the variations in mCA fitness and ultimately result in different clonal expansion rates are not well-understood.
View Article and Find Full Text PDFJ Am Med Inform Assoc
June 2023
Type 2 diabetes (T2D) is a heterogeneous disease that develops through diverse pathophysiological processes. To characterise the genetic contribution to these processes across ancestry groups, we aggregate genome-wide association study (GWAS) data from 2,535,601 individuals (39.7% non-European ancestry), including 428,452 T2D cases.
View Article and Find Full Text PDFEver larger Structural Variant (SV) catalogs highlighting the diversity within and between populations help researchers better understand the links between SVs and disease. The identification of SVs from DNA sequence data is non-trivial and requires a balance between comprehensiveness and precision. Here we present a catalog of 355,667 SVs (59.
View Article and Find Full Text PDFHow race, ethnicity, and ancestry are used in genomic research has wide-ranging implications for how research is translated into clinical care and incorporated into public understanding. Correlation between race and genetic ancestry contributes to unresolved complexity for the scientific community, as illustrated by heterogeneous definitions and applications of these variables. Here, we offer commentary and recommendations on the use of race, ethnicity, and ancestry across the arc of genetic research, including data harmonization, analysis, and reporting.
View Article and Find Full Text PDFBackground: The availability of whole-genome sequencing data in large studies has enabled the assessment of coding and noncoding variants across the allele frequency spectrum for their associations with blood pressure.
Methods: We conducted a multiancestry whole-genome sequencing analysis of blood pressure among 51 456 Trans-Omics for Precision Medicine and Centers for Common Disease Genomics program participants (stage-1). Stage-2 analyses leveraged array data from UK Biobank (N=383 145), Million Veteran Program (N=318 891), and Reasons for Geographic and Racial Differences in Stroke (N=10 643) participants, along with whole-exome sequencing data from UK Biobank (N=199 631) participants.
F508del (c.1521_1523delCTT, p.Phe508delPhe) is the most common pathogenic allele underlying cystic fibrosis (CF), and its frequency varies in a geographic cline across Europe.
View Article and Find Full Text PDF