Publications by authors named "Benjamin B Sun"

Artificial intelligence (AI) is transforming many fields, including healthcare and medicine. In biomarker discovery, AI algorithms have had a profound impact, thanks to their ability to derive insights from complex high-dimensional datasets and integrate multi-modal datatypes (such as omics, electronic health records, imaging or sensor and wearable data). However, despite the proliferation of AI-powered biomarkers, significant hurdles still remain in translating them to the clinic and driving adoption, including lack of population diversity, difficulties accessing harmonised data, costly and time-consuming clinical studies, evolving AI regulatory frameworks and absence of scalable diagnostic infrastructure.

View Article and Find Full Text PDF

Plasma proteomic signatures accurately predict disease risk, but our understanding of the mechanisms contributing to the predictive value of the proteome remains limited. Here, we characterized proteomic biomarkers of 19 age-related diseases, based on observational associations between 2,923 protein levels and incidence of these outcomes in the UK Biobank (N = 45,438). To identify the subset of these biomarkers that may represent causal drivers of disease, we first employed Mendelian Randomization (MR) and found that only 8% of the protein-disease associations with genetic instruments showed suggestive evidence of causal relationships, and were more likely to pertain to only a single disease.

View Article and Find Full Text PDF

The biological mechanisms through which most nonprotein-coding genetic variants affect disease risk are unknown. To investigate gene-regulatory mechanisms, we mapped blood gene expression and splicing quantitative trait loci (QTLs) through bulk RNA sequencing in 4,732 participants and integrated protein, metabolite and lipid data from the same individuals. We identified cis-QTLs for the expression of 17,233 genes and 29,514 splicing events (in 6,853 genes).

View Article and Find Full Text PDF

Genomics can provide insight into the etiology of type 2 diabetes and its comorbidities, but assigning functionality to non-coding variants remains challenging. Polygenic scores, which aggregate variant effects, can uncover mechanisms when paired with molecular data. Here, we test polygenic scores for type 2 diabetes and cardiometabolic comorbidities for associations with 2,922 circulating proteins in the UK Biobank.

View Article and Find Full Text PDF

Previous studies have suggested that systemic viral infections may increase risks of dementia. Whether this holds true for severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) virus infections is unknown. Determining this is important for anticipating the potential future incidence of dementia.

View Article and Find Full Text PDF
Article Synopsis
  • This study investigates how gene-environment interactions (GEIs) affect protein levels in the blood, which could help in discovering and validating new biomarkers.
  • Researchers analyzed blood protein data from over 52,000 UK Biobank participants and identified 677 unique variance quantitative trait loci (vQTLs), which are genetic variations linked to protein levels.
  • The findings reveal over 1,100 GEIs between identified proteins and environmental factors, providing insights into why some genetic variants affect protein levels without obvious main effects, and emphasizing the roles of age, sex, and other factors in genetic variability.
View Article and Find Full Text PDF

The circulating proteome offers insights into the biological pathways that underlie disease. Here, we test relationships between 1,468 Olink protein levels and the incidence of 23 age-related diseases and mortality in the UK Biobank (n = 47,600). We report 3,209 associations between 963 protein levels and 21 incident outcomes.

View Article and Find Full Text PDF

The molecular pathology of stress-related disorders remains elusive. Our brain multiregion, multiomic study of posttraumatic stress disorder (PTSD) and major depressive disorder (MDD) included the central nucleus of the amygdala, hippocampal dentate gyrus, and medial prefrontal cortex (mPFC). Genes and exons within the mPFC carried most disease signals replicated across two independent cohorts.

View Article and Find Full Text PDF

Advances in proteomic assay technologies have significantly increased coverage and throughput, enabling recent increases in the number of large-scale population-based proteomic studies of human plasma and serum. Improvements in multiplexed protein assays have facilitated the quantification of thousands of proteins over a large dynamic range, a key requirement for detecting the lowest-ranging, and potentially the most disease-relevant, blood-circulating proteins. In this perspective, we examine how populational proteomic datasets in conjunction with other concurrent omic measures can be leveraged to better understand the genomic and non-genomic correlates of the soluble proteome, constructing biomarker panels for disease prediction, among others.

View Article and Find Full Text PDF
Article Synopsis
  • The Pharma Proteomics Project is a large-scale research initiative analyzing blood protein profiles from over 54,000 UK Biobank participants to understand links between genetics and health.
  • The project identifies significant genetic associations with proteins, revealing many novel interactions and highlighting ancestry-specific variations, along with insights into disease mechanisms and potential drug targets.
  • By making their findings publicly accessible, the consortium aims to advance research in biomarker development and therapeutic strategies, enhancing our understanding of how genetic factors influence health outcomes.
View Article and Find Full Text PDF
Article Synopsis
  • Integrating human genomics and proteomics can uncover disease mechanisms and identify potential biomarkers and drug targets.
  • Our study highlights the importance of rare protein-coding variants, finding 5,433 associations with plasma protein levels that were largely missed in previous studies.
  • We also demonstrate practical applications of our findings, including potential biomarkers for diseases and insights into the effects of clonal hematopoiesis on protein levels.
View Article and Find Full Text PDF

Aims/hypothesis: This study aimed to elucidate the aetiological role of plasma proteins in glucose metabolism and type 2 diabetes development.

Methods: We measured 233 proteins at baseline in 1653 participants from the Cooperative Health Research in the Region of Augsburg (KORA) S4 cohort study (median follow-up time: 13.5 years).

View Article and Find Full Text PDF

Rapid increase in resistance of Helicobacter pylori (H. pylori) has hindered antibiotics-based eradication efforts worldwide and raises the need for additional approaches. Here, we investigate the role of zinc-based compounds in inhibiting H.

View Article and Find Full Text PDF
Article Synopsis
  • The study investigates the genetic links to the structure of the brain's folds by analyzing data from 40,169 individuals using MRI scans from the UK Biobank.
  • Researchers found 388 associations between regional brain folding and 77 genetic loci, with genes in these areas tied to brain development and functioning.
  • The findings also include an interactive 3D visualization to help explore these associations, emphasizing their potential for studying brain health and neuropsychiatric conditions in the future.
View Article and Find Full Text PDF

Genome-wide association studies (GWAS) have identified thousands of genetic variants linked to the risk of human disease. However, GWAS have so far remained largely underpowered in relation to identifying associations in the rare and low-frequency allelic spectrum and have lacked the resolution to trace causal mechanisms to underlying genes. Here we combined whole-exome sequencing in 392,814 UK Biobank participants with imputed genotypes from 260,405 FinnGen participants (653,219 total individuals) to conduct association meta-analyses for 744 disease endpoints across the protein-coding allelic frequency spectrum, bridging the gap between common and rare variant studies.

View Article and Find Full Text PDF

Efficacy of eradication therapy has declined due to rapid rises in antibiotic resistance. We investigated how increased temperature affected (NCTC 11637) growth and its sensitivity to metronidazole . We performed transcriptomic profiling using RNA-sequencing to identify differentially expressed genes (DEGs) associated with increased temperature.

View Article and Find Full Text PDF

Genome-wide association studies (GWAS) have identified thousands of genomic regions affecting complex diseases. The next challenge is to elucidate the causal genes and mechanisms involved. One approach is to use statistical colocalization to assess shared genetic aetiology across multiple related traits (e.

View Article and Find Full Text PDF

The human proteome is a major source of therapeutic targets. Recent genetic association analyses of the plasma proteome enable systematic evaluation of the causal consequences of variation in plasma protein levels. Here we estimated the effects of 1,002 proteins on 225 phenotypes using two-sample Mendelian randomization (MR) and colocalization.

View Article and Find Full Text PDF
Article Synopsis
  • Increased urinary albumin-to-creatinine ratio (UACR) is linked to higher risks of kidney disease and cardiovascular issues, yet the underlying causes are not fully understood.
  • A large meta-analysis identified 68 genetic loci associated with UACR, highlighting connections to conditions like proteinuria, hyperlipidemia, and hypertension.
  • Specific genes (such as TGFB1 and PRKCI) were implicated in kidney function, and experiments showed that disrupting these genes in fruit flies affects albumin processing, suggesting new avenues for research to lower albumin levels.
View Article and Find Full Text PDF

Reduced lung function predicts mortality and is key to the diagnosis of chronic obstructive pulmonary disease (COPD). In a genome-wide association study in 400,102 individuals of European ancestry, we define 279 lung function signals, 139 of which are new. In combination, these variants strongly predict COPD in independent populations.

View Article and Find Full Text PDF

Background: The Asp358Ala variant (rs2228145; A>C) in the IL (interleukin)-6 receptor ( IL6R) gene has been implicated in the development of abdominal aortic aneurysms (AAAs), but its effect on AAA growth over time is not known. We aimed to investigate the clinical association between the IL6R-Asp358Ala variant and AAA growth and to assess the effect of blocking the IL-6 signaling pathway in mouse models of aortic aneurysm rupture or dissection.

Methods: Using data from 2863 participants with AAA from 9 prospective cohorts, age- and sex-adjusted mixed-effects linear regression models were used to estimate the association between the IL6R-Asp358Ala variant and annual change in AAA diameter (mm/y).

View Article and Find Full Text PDF

Quantitative trait locus (QTL) mapping of molecular phenotypes such as metabolites, lipids and proteins through genome-wide association studies represents a powerful means of highlighting molecular mechanisms relevant to human diseases. However, a major challenge of this approach is to identify the causal gene(s) at the observed QTLs. Here, we present a framework for the 'Prioritization of candidate causal Genes at Molecular QTLs' (ProGeM), which incorporates biological domain-specific annotation data alongside genome annotation data from multiple repositories.

View Article and Find Full Text PDF

In the originally published version of this Article, financial support was not fully acknowledged. The sentence "KS was supported by the 'Biomedical Research Program' funds at Weill Cornell Medicine in Qatar, a program funded by the Qatar Foundation" has been added to the acknowledgement section in both the PDF and HTML versions of the Article.

View Article and Find Full Text PDF