Polygenic scores (PGS) have promising clinical applications for risk stratification, disease screening, and personalized medicine. However, most PGS are trained on predominantly European ancestry cohorts and have limited portability to external populations. While cross-population PGS methods have demonstrated greater generalizability than single-ancestry PGS, they fail to properly account for individuals with recent admixture between continental ancestry groups.
View Article and Find Full Text PDFIntroduction: Prior work in predominantly European ancestry populations has explained how the risk associated with demographic, lifestyle, and health factors differs with underlying genetic susceptibility to type 2 diabetes (T2D), but less is known about these relationships in Black Americans.
Methods: We used covariate-adjusted logistic regression models of T2D to examine interactions between a published trans-ancestry derived T2D polygenic risk score (PRS) and various demographic, lifestyle, and health-related factors among 28,251 self-identified Black Americans from six cohort studies.
Results: The results are generally consistent with prior work in White populations.
Autism spectrum disorder is a heritable neurodevelopmental condition that displays heterogeneity in both presentation and etiology and it often presents with concomitant communication difficulties. The hypothesis behind the New Jersey Language and Autism Genetic Study is that genetic heterogeneity for component phenotypes of autism spectrum disorder may be reduced relative to the disorder as a whole. We previously published an initial phase of this study with family recruitment that used very restricted inclusion/exclusion criteria for both autism and language deficits in other family members.
View Article and Find Full Text PDFKey Points: We aimed to elucidate potential methylation, proteomic, and metabolomic mechanisms by which variants may be linked to kidney disease. We report distinct methylation profiling between risk allele carriers and noncarriers, many near gene family. We report higher APOL1 protein and lower C18:1 cholesteryl ester in two risk allele carriers.
View Article and Find Full Text PDFPolygenic risk scores (PRS) hold prognostic value for identifying individuals at higher risk of type 2 diabetes (T2D). However, further characterization is needed to understand the generalizability of T2D PRS in diverse populations across various contexts. We characterized a multi-ancestry T2D PRS among 244,637 cases and 637,891 controls across eight populations from the Population Architecture Genomics and Epidemiology (PAGE) Study and 13 additional biobanks and cohorts.
View Article and Find Full Text PDFPolygenic risk scores (PRSs) are now showing promising predictive performance on a wide variety of complex traits and diseases, but there exists a substantial performance gap across populations. We propose MUSSEL, a method for ancestry-specific polygenic prediction that borrows information in summary statistics from genome-wide association studies (GWASs) across multiple ancestry groups via Bayesian hierarchical modeling and ensemble learning. In our simulation studies and data analyses across four distinct studies, totaling 5.
View Article and Find Full Text PDFAnn Clin Transl Neurol
April 2024
Objective: Mutations in the glucocerebrosidase (GBA1) gene and subthalamic nucleus deep brain stimulation (STN-DBS) are independently associated with cognitive dysfunction in persons with Parkinson's disease (PwP). We hypothesized that PwP with both GBA1 mutations and STN-DBS are at greater risk of cognitive dysfunction than PwP with only GBA1 mutations or STN-DBS, or neither. In this study, we determined the pattern of cognitive dysfunction in PwP based on GBA1 mutation status and STN-DBS treatment.
View Article and Find Full Text PDFGenetics researchers increasingly combine data across many sources to increase power and to conduct analyses that cross multiple individual studies. However, there is often a lack of alignment on outcome measures when the same constructs are examined across studies. This inhibits comparison across individual studies and may impact the findings from meta-analysis.
View Article and Find Full Text PDFAutism spectrum disorder (ASD) is a neurodevelopmental disorder characterized by restrictive interests and/or repetitive behaviors and deficits in social interaction and communication. ASD is a multifactorial disease with a complex polygenic genetic architecture. Its genetic contributing factors are not yet fully understood, especially large structural variations (SVs).
View Article and Find Full Text PDFPolygenic risk scores (PRS) are now showing promising predictive performance on a wide variety of complex traits and diseases, but there exists a substantial performance gap across different populations. We propose MUSSEL, a method for ancestry-specific polygenic prediction that borrows information in the summary statistics from genome-wide association studies (GWAS) across multiple ancestry groups. MUSSEL conducts Bayesian hierarchical modeling under a MUltivariate Spike-and-Slab model for effect-size distribution and incorporates an Ensemble Learning step using super learner to combine information across different tuning parameter settings and ancestry groups.
View Article and Find Full Text PDFNucleic Acids Res
January 2023
Large biobank-scale whole genome sequencing (WGS) studies are rapidly identifying a multitude of coding and non-coding variants. They provide an unprecedented resource for illuminating the genetic basis of human diseases. Variant functional annotations play a critical role in WGS analysis, result interpretation, and prioritization of disease- or trait-associated causal variants.
View Article and Find Full Text PDFInflammatory bowel disease (IBD) is an immune-mediated chronic intestinal disorder with major phenotypes: ulcerative colitis (UC) and Crohn's disease (CD). Multiple studies have identified over 240 IBD susceptibility loci. However, most studies have centered on European (EUR) and East Asian (EAS) populations.
View Article and Find Full Text PDFHum Genet
February 2023
Autism spectrum disorder (ASD) and attention-deficit/hyperactivity disorder (ADHD) are two major neurodevelopmental disorders that frequently co-occur. However, the genetic mechanism of the co-occurrence remains unclear. The New Jersey Language and Autism Genetics Study (NJLAGS) collected more than 100 families with at least one member affected by ASD.
View Article and Find Full Text PDFBackground: Concurrent variation in adiposity and inflammation suggests potential shared functional pathways and pleiotropic disease underpinning. Yet, exploration of pleiotropy in the context of adiposity-inflammation has been scarce, and none has included self-identified Hispanic/Latino populations. Given the high level of ancestral diversity in Hispanic American population, genetic studies may reveal variants that are infrequent/monomorphic in more homogeneous populations.
View Article and Find Full Text PDFWe report a genome-wide association study (GWAS) of coronary artery disease (CAD) incorporating nearly a quarter of a million cases, in which existing studies are integrated with data from cohorts of white, Black and Hispanic individuals from the Million Veteran Program. We document near equivalent heritability of CAD across multiple ancestral groups, identify 95 novel loci, including nine on the X chromosome, detect eight loci of genome-wide significance in Black and Hispanic individuals, and demonstrate that two common haplotypes at the 9p21 locus are responsible for risk stratification in all populations except those of African origin, in which these haplotypes are virtually absent. Moreover, in the largest GWAS for angiographically derived coronary atherosclerosis performed to date, we find 15 loci of genome-wide significance that robustly overlap with established loci for clinical CAD.
View Article and Find Full Text PDFAutism spectrum disorder (ASD) is a childhood neurodevelopmental disorder with a complex and heterogeneous genetic etiology. MicroRNA (miRNA), a class of small non-coding RNAs, could regulate ASD risk genes post-transcriptionally and affect broad molecular pathways related to ASD and associated disorders. Using whole-genome sequencing, we analyzed 272 samples in 73 families in the New Jersey Language and Autism Genetics Study (NJLAGS) cohort.
View Article and Find Full Text PDFGenome-wide association studies using large-scale genome and exome sequencing data have become increasingly valuable in identifying associations between genetic variants and disease, transforming basic research and translational medicine. However, this progress has not been equally shared across all people and conditions, in part due to limited resources. Leveraging publicly available sequencing data as external common controls, rather than sequencing new controls for every study, can better allocate resources by augmenting control sample sizes or providing controls where none existed.
View Article and Find Full Text PDFBackground: The Centers for Disease Control and Prevention contracted with laboratories to sequence the SARS-CoV-2 genome from positive samples across the United States to enable public health officials to investigate the impact of variants on disease severity as well as the effectiveness of vaccines and treatment. Herein we present the initial results correlating RT-PCR quality control metrics with sample collection and sequencing methods from full SARS-CoV-2 viral genomic sequencing of 24,441 positive patient samples between April and June 2021.
Methods: RT-PCR confirmed (N Gene Ct value < 30) positive patient samples, with nucleic acid extracted from saliva, nasopharyngeal and oropharyngeal swabs were selected for viral whole genome SARS-CoV-2 sequencing.
One mechanism by which genetic factors influence complex traits and diseases is altering gene expression. Direct measurement of gene expression in relevant tissues is rarely tenable; however, genetically regulated gene expression (GReX) can be estimated using prediction models derived from large multi-omic datasets. These approaches have led to the discovery of many gene-trait associations, but whether models derived from predominantly European ancestry (EA) reference panels can map novel associations in ancestrally diverse populations remains unclear.
View Article and Find Full Text PDFPurpose: Mendelian disease genomic research has undergone a massive transformation over the past decade. With increasing availability of exome and genome sequencing, the role of Mendelian research has expanded beyond data collection, sequencing, and analysis to worldwide data sharing and collaboration.
Methods: Over the past 10 years, the National Institutes of Health-supported Centers for Mendelian Genomics (CMGs) have played a major role in this research and clinical evolution.
Aims/hypothesis: Type 2 diabetes is a growing global public health challenge. Investigating quantitative traits, including fasting glucose, fasting insulin and HbA, that serve as early markers of type 2 diabetes progression may lead to a deeper understanding of the genetic aetiology of type 2 diabetes development. Previous genome-wide association studies (GWAS) have identified over 500 loci associated with type 2 diabetes, glycaemic traits and insulin-related traits.
View Article and Find Full Text PDFGenomic discovery and characterization of risk loci for type 2 diabetes (T2D) have been conducted primarily in individuals of European ancestry. We conducted a multiethnic genome-wide association study of T2D among 53,102 cases and 193,679 control subjects from African, Hispanic, Asian, Native Hawaiian, and European population groups in the Population Architecture Genomics and Epidemiology (PAGE) and Diabetes Genetics Replication and Meta-analysis (DIAGRAM) Consortia. In individuals of African ancestry, we discovered a risk variant in the gene (rs11466334, risk allele frequency (RAF) = 6.
View Article and Find Full Text PDFA key goal of whole-genome sequencing for studies of human genetics is to interrogate all forms of variation, including single-nucleotide variants, small insertion or deletion (indel) variants and structural variants. However, tools and resources for the study of structural variants have lagged behind those for smaller variants. Here we used a scalable pipeline to map and characterize structural variants in 17,795 deeply sequenced human genomes.
View Article and Find Full Text PDFNeurodevelopment requires precise regulation of gene expression, including post-transcriptional regulatory events such as alternative splicing and mRNA translation. However, translational regulation of specific isoforms during neurodevelopment and the mechanisms behind it remain unknown. Using RNA-seq analysis of mouse neocortical polysomes, here we report translationally repressed and derepressed mRNA isoforms during neocortical neurogenesis whose orthologs include risk genes for neurodevelopmental disorders.
View Article and Find Full Text PDF