Polygenic scores (PGS) have promising clinical applications for risk stratification, disease screening, and personalized medicine. However, most PGS are trained on predominantly European ancestry cohorts and have limited portability to external populations. While cross-population PGS methods have demonstrated greater generalizability than single-ancestry PGS, they fail to properly account for individuals with recent admixture between continental ancestry groups.
View Article and Find Full Text PDFImportance: Traditional clinical predictors for disease risks have limitations in capturing underlying disease complexity. Multi-omics technologies, such as metabolomics and proteomics, offer deeper molecular perspectives that could enhance risk prediction, but large-scale studies integrating the two omics are scarce.
Objectives: The primary objective is to systematically evaluate whether adding metabolomics and/or proteomics data to traditional clinical predictors improves risk prediction for 17 common incident diseases.
Background: Polygenic risk scores (PRSs) improve type 2 diabetes (T2D) prediction beyond clinical risk factors but perform poorly in non-European populations, where T2D burden is often higher, undermining their global clinical utility.
Methods: We conducted the largest global effort to date to harmonize T2D genome-wide association study (GWAS) meta-analyses across five ancestries-European (EUR), African/African American (AFR), Admixed American (AMR), South Asian (SAS), and East Asian (EAS)-including 360,000 T2D cases and 1·8 million controls (41% non-EUR). We constructed ancestry-specific and multi-ancestry PRSs in training datasets including 11,000 T2D cases and 32,000 controls, and validated their performance in independent datasets including 39,000 T2D cases and 126,000 controls of diverse ancestries.
Mosaic loss of Y chromosome (mLOY) in blood cells is an age-related somatic mutation, but its relationship with pulmonary health remains undercharacterized. Leveraging mLOY assessment in over 12,000 men, including 5,097 from the COPDGene Study and 7,235 from six additional cohorts in Trans-Omics for Precision Medicine program, we investigated its association with respiratory outcomes and epigenetic aging. Cross-sectionally, mLOY was associated with airflow obstruction with prevalence increasing with age, particularly in men with a former smoking history.
View Article and Find Full Text PDFEpigenetics
December 2025
Sickle cell trait (SCT) is due to heterozygosity for the β-globin sickle cell mutation. SCT recently has been associated with increased risk of various adverse health outcomes. DNA methylation (DNAm) is one potential mechanism by which SCT may impact disease risk.
View Article and Find Full Text PDFStudy Objectives: Excessive daytime sleepiness (EDS), influenced by environmental and social-behavioral factors, is reported by a subset of patients with sleep apnea - a group that may be at elevated cardiovascular risk. However, it is unclear whether sleep apnea with and without EDS have distinct genetic underpinnings. In this study, we perform gene-by-EDS interaction analyses for apnea hypopnea index (AHI), a diagnostic marker of sleep apnea severity, to understand EDS's influence on its underlying genetic risk.
View Article and Find Full Text PDFPolygenic scores (PGSs) for body mass index (BMI) may guide early prevention and targeted treatment of obesity. Using genetic data from up to 5.1 million people (4.
View Article and Find Full Text PDFCirculation
August 2025
Background: The association of overall cardiovascular health (CVH) with changes in DNA methylation (DNAm) has not been well characterized.
Methods: We calculated the American Heart Association's Life's Essential 8 score to reflect CVH in 5 cohorts with diverse backgrounds (mean age 54 years, 55% women, and enrollment year ranging from 1989 to 2012). Epigenome-wide association studies (EWAS) for Life's Essential 8 score were conducted, followed by bioinformatic analyses.
Background: Substantial efforts have been dedicated to exploring the link between genetic regulation and the proteome, informing studies of complex trait mechanisms. Most of these efforts have been limited to populations of European ancestry.
Results: We conduct an Olink protein quantitative trait locus (pQTL) analysis on 1245 proteins involving 1033 self-identified African American (AA) and 1764 non-Hispanic White (NHW) participants from the Women's Health Initiative and Framingham Heart Study.
Vitamin D deficiency is prevalent in Africa, but its genetic determinants are understudied. We report a genome-wide analysis of 25-hydroxyvitamin D (25(OH)D) concentrations in 3670 children from five countries across Africa with replication in four diaspora African ancestry populations (n=21,610). We identify a previously unreported locus at genome-wide significance in West African populations: (Oxysterol Binding Protein Like 11, lead variant, rs2979356, p=8.
View Article and Find Full Text PDFBackground And Aims: Cardiovascular disease (CVD) is one of the leading causes of morbidity and mortality worldwide, yet the underlying molecular mechanisms remain less understood. Chronic low-grade inflammation is a complex immune response contributing to the pathophysiology of cardiovascular disease. This response is signaled in part by interleukin-6 (IL-6), a pleiotropic, pro-inflammatory cytokine.
View Article and Find Full Text PDFBackground: Obstructive sleep apnoea (OSA) is a common chronic condition, with obesity its strongest risk factor. Polygenic scores (PGSs) summarise the genetic liability to phenotype and can provide insights into relationships between phenotypes. Recently, large datasets that include genetic data and OSA status became available, providing an opportunity to utilise PGS approaches to study the genetic relationship between OSA and other phenotypes, while differentiating OSA-specific from obesity-specific genetic factors.
View Article and Find Full Text PDFWith age, hematopoietic stem cells can acquire somatic mutations in leukemogenic genes that confer a proliferative advantage in a phenomenon termed CHIP. How these mutations result in increased risk for numerous age-related diseases remains poorly understood. We conduct a multiracial meta-analysis of EWAS of CHIP in the Framingham Heart Study, Jackson Heart Study, Cardiovascular Health Study, and Atherosclerosis Risk in Communities cohorts (N = 8196) to elucidate the molecular mechanisms underlying CHIP and illuminate how these changes influence cardiovascular disease risk.
View Article and Find Full Text PDFChromatin accessibility quantitative trait locus (caQTL) studies have identified regulatory elements that underlie genetic effects on gene expression and metabolic traits. However, caQTL discovery has been limited by small sample sizes. Here, we map caQTLs in liver tissue from 138 human donors and identify caQTLs for 35,361 regulatory elements, including population-specific caQTLs driven by differences in allele frequency across populations.
View Article and Find Full Text PDFHeavy menstrual bleeding (HMB) is a widespread occurrence among women of reproductive age and inflicts a substantial impact on their well-being and on health care expenses. To better characterize the genetic architecture of HMB, we conducted a meta-analysis of the summary statistics of genome-wide association studies (GWAS) from 5 biobanks that included up to 84 633 HMB cases and 598 195 controls from several ancestries. Of the 21 signals significantly associated with HMB in a discovery GWAS meta-analysis that combined 4 biobanks, 20 had a concordant direction of effect in the remaining cohort, including 10 that were significantly replicated.
View Article and Find Full Text PDFObesity is a major public health crisis associated with high mortality rates. Previous genome-wide association studies (GWAS) investigating body mass index (BMI) have largely relied on imputed data from European individuals. This study leveraged whole-genome sequencing (WGS) data from 88,873 participants from the Trans-Omics for Precision Medicine (TOPMed) Program, of which 51% were of non-European population groups.
View Article and Find Full Text PDFPolygenic risk score (PRS) prediction of complex diseases can be improved by leveraging related phenotypes. This has motivated the development of several multi-trait PRS methods that jointly model genetically correlated traits. However, these methods do not account for vertical pleiotropy, where one trait acts as a mediator for another.
View Article and Find Full Text PDFKey Points: We aimed to elucidate potential methylation, proteomic, and metabolomic mechanisms by which variants may be linked to kidney disease. We report distinct methylation profiling between risk allele carriers and noncarriers, many near gene family. We report higher APOL1 protein and lower C18:1 cholesteryl ester in two risk allele carriers.
View Article and Find Full Text PDFAm J Hum Genet
April 2025
Recently, admixed populations make up an increasing percentage of the US and global populations, and the admixture is not uniform over space or time or across genomes. Therefore, it becomes indispensable to evaluate local ancestry in addition to global ancestry to improve genetic epidemiological studies. Recent advances in representing human genome diversity, coupled with large-scale whole-genome sequencing initiatives and improved tools for local ancestry inference, have enabled studies to demonstrate that incorporating local ancestry information enhances both genetic association analyses and polygenic risk predictions.
View Article and Find Full Text PDFAlthough both short and long sleep duration are associated with elevated hypertension risk, our understanding of their interplay with biological pathways governing blood pressure remains limited. To address this, we carried out genome-wide cross-population gene-by-short-sleep and long-sleep duration interaction analyses for three blood pressure traits (systolic, diastolic, and pulse pressure) in 811,405 individuals from diverse population groups. We discovered 22 novel gene-sleep duration interaction loci for blood pressure, mapped to 23 genes.
View Article and Find Full Text PDFCirculating lipid concentrations are clinically associated with cardiometabolic diseases. The phenotypic variance explained by identified genetic variants remains limited, highlighting the importance of searching for additional factors beyond genetic sequence variants. DNA methylation has been linked to lipid concentrations in previous studies, although most of the studies harbored moderate sample sizes and exhibited underrepresentation of non-European ancestry populations.
View Article and Find Full Text PDFDespite the availability of effective vaccines and a recent decrease in annual deaths, COVID-19 remains a leading cause of death. Serological studies provide insights into host immunobiology of adaptive immune response to infection, which holds promise for identifying high-risk individuals for adverse COVID-19 outcomes. We investigated correlates of anti-nucleocapsid antibody responses following SARS-CoV-2 infection in a US population-based meta-cohort of adults participating in longstanding National Institutes of Health-funded cohort studies.
View Article and Find Full Text PDFBackground: Previous work has shown a role of CCL2, a key chemokine governing monocyte trafficking, in atherosclerosis. However, it remains unknown whether targeting CCR2, the cognate receptor of CCL2, provides protection against human atherosclerotic cardiovascular disease.
Methods: Computationally predicted damaging or loss-of-function (REVEL > 0.
Atrial fibrillation (AF) is a prevalent and morbid abnormality of the heart rhythm with a strong genetic component. Here, we meta-analyzed genome and exome sequencing data from 36 studies that included 52,416 AF cases and 277,762 controls. In burden tests of rare coding variation, we identified novel associations between AF and the genes MYBPC3, LMNA, PKP2, FAM189A2 and KDM5B.
View Article and Find Full Text PDFMost genetic variants associated with complex traits and diseases occur in non-coding genomic regions and are hypothesized to regulate gene expression. To understand the genetics underlying gene expression variability, we characterize 14,324 ancestrally diverse RNA-sequencing samples from the NHLBI Trans-Omics for Precision Medicine (TOPMed) program and integrate whole genome sequencing data to perform and expression and splicing quantitative trait locus (-/trans-e/sQTL) analyses in six tissues and cell types, most notably whole blood (N=6,454) and lung (N=1,291). We show this dataset enables greater detection of secondary cis-e/sQTL signals than was achieved in previous studies, and that secondary cis-eQTL and primary trans-eQTL signal discovery is not saturated even though eGene discovery is.
View Article and Find Full Text PDF