Emerg Top Life Sci
September 2025
Artificial intelligence (AI) is transforming many fields, including healthcare and medicine. In biomarker discovery, AI algorithms have had a profound impact, thanks to their ability to derive insights from complex high-dimensional datasets and integrate multi-modal datatypes (such as omics, electronic health records, imaging or sensor and wearable data). However, despite the proliferation of AI-powered biomarkers, significant hurdles still remain in translating them to the clinic and driving adoption, including lack of population diversity, difficulties accessing harmonised data, costly and time-consuming clinical studies, evolving AI regulatory frameworks and absence of scalable diagnostic infrastructure.
View Article and Find Full Text PDFPlasma proteomic signatures accurately predict disease risk, but our understanding of the mechanisms contributing to the predictive value of the proteome remains limited. Here, we characterized proteomic biomarkers of 19 age-related diseases, based on observational associations between 2,923 protein levels and incidence of these outcomes in the UK Biobank (N = 45,438). To identify the subset of these biomarkers that may represent causal drivers of disease, we first employed Mendelian Randomization (MR) and found that only 8% of the protein-disease associations with genetic instruments showed suggestive evidence of causal relationships, and were more likely to pertain to only a single disease.
View Article and Find Full Text PDFThe biological mechanisms through which most nonprotein-coding genetic variants affect disease risk are unknown. To investigate gene-regulatory mechanisms, we mapped blood gene expression and splicing quantitative trait loci (QTLs) through bulk RNA sequencing in 4,732 participants and integrated protein, metabolite and lipid data from the same individuals. We identified cis-QTLs for the expression of 17,233 genes and 29,514 splicing events (in 6,853 genes).
View Article and Find Full Text PDFGenomics can provide insight into the etiology of type 2 diabetes and its comorbidities, but assigning functionality to non-coding variants remains challenging. Polygenic scores, which aggregate variant effects, can uncover mechanisms when paired with molecular data. Here, we test polygenic scores for type 2 diabetes and cardiometabolic comorbidities for associations with 2,922 circulating proteins in the UK Biobank.
View Article and Find Full Text PDFPrevious studies have suggested that systemic viral infections may increase risks of dementia. Whether this holds true for severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) virus infections is unknown. Determining this is important for anticipating the potential future incidence of dementia.
View Article and Find Full Text PDFThe circulating proteome offers insights into the biological pathways that underlie disease. Here, we test relationships between 1,468 Olink protein levels and the incidence of 23 age-related diseases and mortality in the UK Biobank (n = 47,600). We report 3,209 associations between 963 protein levels and 21 incident outcomes.
View Article and Find Full Text PDFThe molecular pathology of stress-related disorders remains elusive. Our brain multiregion, multiomic study of posttraumatic stress disorder (PTSD) and major depressive disorder (MDD) included the central nucleus of the amygdala, hippocampal dentate gyrus, and medial prefrontal cortex (mPFC). Genes and exons within the mPFC carried most disease signals replicated across two independent cohorts.
View Article and Find Full Text PDFMol Cell Proteomics
July 2024
Advances in proteomic assay technologies have significantly increased coverage and throughput, enabling recent increases in the number of large-scale population-based proteomic studies of human plasma and serum. Improvements in multiplexed protein assays have facilitated the quantification of thousands of proteins over a large dynamic range, a key requirement for detecting the lowest-ranging, and potentially the most disease-relevant, blood-circulating proteins. In this perspective, we examine how populational proteomic datasets in conjunction with other concurrent omic measures can be leveraged to better understand the genomic and non-genomic correlates of the soluble proteome, constructing biomarker panels for disease prediction, among others.
View Article and Find Full Text PDFAims/hypothesis: This study aimed to elucidate the aetiological role of plasma proteins in glucose metabolism and type 2 diabetes development.
Methods: We measured 233 proteins at baseline in 1653 participants from the Cooperative Health Research in the Region of Augsburg (KORA) S4 cohort study (median follow-up time: 13.5 years).
Rapid increase in resistance of Helicobacter pylori (H. pylori) has hindered antibiotics-based eradication efforts worldwide and raises the need for additional approaches. Here, we investigate the role of zinc-based compounds in inhibiting H.
View Article and Find Full Text PDFGenome-wide association studies (GWAS) have identified thousands of genetic variants linked to the risk of human disease. However, GWAS have so far remained largely underpowered in relation to identifying associations in the rare and low-frequency allelic spectrum and have lacked the resolution to trace causal mechanisms to underlying genes. Here we combined whole-exome sequencing in 392,814 UK Biobank participants with imputed genotypes from 260,405 FinnGen participants (653,219 total individuals) to conduct association meta-analyses for 744 disease endpoints across the protein-coding allelic frequency spectrum, bridging the gap between common and rare variant studies.
View Article and Find Full Text PDFEfficacy of eradication therapy has declined due to rapid rises in antibiotic resistance. We investigated how increased temperature affected (NCTC 11637) growth and its sensitivity to metronidazole . We performed transcriptomic profiling using RNA-sequencing to identify differentially expressed genes (DEGs) associated with increased temperature.
View Article and Find Full Text PDFGenome-wide association studies (GWAS) have identified thousands of genomic regions affecting complex diseases. The next challenge is to elucidate the causal genes and mechanisms involved. One approach is to use statistical colocalization to assess shared genetic aetiology across multiple related traits (e.
View Article and Find Full Text PDFThe human proteome is a major source of therapeutic targets. Recent genetic association analyses of the plasma proteome enable systematic evaluation of the causal consequences of variation in plasma protein levels. Here we estimated the effects of 1,002 proteins on 225 phenotypes using two-sample Mendelian randomization (MR) and colocalization.
View Article and Find Full Text PDFAn amendment to this paper has been published and can be accessed via a link at the top of the paper.
View Article and Find Full Text PDFReduced lung function predicts mortality and is key to the diagnosis of chronic obstructive pulmonary disease (COPD). In a genome-wide association study in 400,102 individuals of European ancestry, we define 279 lung function signals, 139 of which are new. In combination, these variants strongly predict COPD in independent populations.
View Article and Find Full Text PDFCirc Genom Precis Med
February 2019
Background: The Asp358Ala variant (rs2228145; A>C) in the IL (interleukin)-6 receptor ( IL6R) gene has been implicated in the development of abdominal aortic aneurysms (AAAs), but its effect on AAA growth over time is not known. We aimed to investigate the clinical association between the IL6R-Asp358Ala variant and AAA growth and to assess the effect of blocking the IL-6 signaling pathway in mouse models of aortic aneurysm rupture or dissection.
Methods: Using data from 2863 participants with AAA from 9 prospective cohorts, age- and sex-adjusted mixed-effects linear regression models were used to estimate the association between the IL6R-Asp358Ala variant and annual change in AAA diameter (mm/y).
Nucleic Acids Res
January 2019
Quantitative trait locus (QTL) mapping of molecular phenotypes such as metabolites, lipids and proteins through genome-wide association studies represents a powerful means of highlighting molecular mechanisms relevant to human diseases. However, a major challenge of this approach is to identify the causal gene(s) at the observed QTLs. Here, we present a framework for the 'Prioritization of candidate causal Genes at Molecular QTLs' (ProGeM), which incorporates biological domain-specific annotation data alongside genome annotation data from multiple repositories.
View Article and Find Full Text PDFIn the originally published version of this Article, financial support was not fully acknowledged. The sentence "KS was supported by the 'Biomedical Research Program' funds at Weill Cornell Medicine in Qatar, a program funded by the Qatar Foundation" has been added to the acknowledgement section in both the PDF and HTML versions of the Article.
View Article and Find Full Text PDF