Publications by authors named "Jeremy P Gygi"

Background: Following SARS-CoV-2 infection, ~10-35% of COVID-19 patients experience long COVID (LC), in which debilitating symptoms persist for at least three months. Elucidating biologic underpinnings of LC could identify therapeutic opportunities.

Methods: We utilized machine learning methods on biologic analytes provided over 12-months after hospital discharge from >500 COVID-19 patients in the IMPACC cohort to identify a multi-omics "recovery factor", trained on patient-reported physical function survey scores.

View Article and Find Full Text PDF

Systems vaccinology studies have been used to build computational models that predict individual vaccine responses and identify the factors contributing to differences in outcome. Comparing such models is challenging due to variability in study designs. To address this, we established a community resource to compare models predicting B.

View Article and Find Full Text PDF

The post-acute sequelae of SARS-CoV-2 (PASC), also known as long COVID, remain a significant health issue that is incompletely understood. Predicting which acutely infected individuals will go on to develop long COVID is challenging due to the lack of established biomarkers, clear disease mechanisms, or well-defined sub-phenotypes. Machine learning (ML) models offer the potential to address this by leveraging clinical data to enhance diagnostic precision.

View Article and Find Full Text PDF

Following SARS-CoV-2 infection, ~10-35% of COVID-19 patients experience long COVID (LC), in which often debilitating symptoms persist for at least three months. Elucidating the biologic underpinnings of LC could identify therapeutic opportunities. We utilized machine learning methods on biologic analytes and patient reported outcome surveys provided over 12 months after hospital discharge from >500 hospitalized COVID-19 patients in the IMPACC cohort to identify a multi-omics "recovery factor".

View Article and Find Full Text PDF

Hospitalized COVID-19 patients exhibit diverse immune responses during acute infection, which are associated with a wide range of clinical outcomes. However, understanding these immune heterogeneities and their links to various clinical complications, especially long COVID, remains a challenge. In this study, we performed unsupervised subtyping of longitudinal multi-omics immunophenotyping in over 1,000 hospitalized patients, identifying two critical subtypes linked to mortality or mechanical ventilation with prolonged hospital stay and three severe subtypes associated with timely acute recovery.

View Article and Find Full Text PDF

Systems vaccinology studies have been used to build computational models that predict individual vaccine responses and identify the factors contributing to differences in outcome. Comparing such models is challenging due to variability in study designs. To address this, we established a community resource to compare models predicting booster responses and generate experimental data for the explicit purpose of model evaluation.

View Article and Find Full Text PDF

BACKGROUNDPatients hospitalized for COVID-19 exhibit diverse clinical outcomes, with outcomes for some individuals diverging over time even though their initial disease severity appears similar to that of other patients. A systematic evaluation of molecular and cellular profiles over the full disease course can link immune programs and their coordination with progression heterogeneity.METHODSWe performed deep immunophenotyping and conducted longitudinal multiomics modeling, integrating 10 assays for 1,152 Immunophenotyping Assessment in a COVID-19 Cohort (IMPACC) study participants and identifying several immune cascades that were significant drivers of differential clinical outcomes.

View Article and Find Full Text PDF

Motivation: Predictive biological signatures provide utility as biomarkers for disease diagnosis and prognosis, as well as prediction of responses to vaccination or therapy. These signatures are identified from high-throughput profiling assays through a combination of dimensionality reduction and machine learning techniques. The genes, proteins, metabolites, and other biological analytes that compose signatures also generate hypotheses on the underlying mechanisms driving biological responses, thus improving biological understanding.

View Article and Find Full Text PDF

Systems vaccinology studies have identified factors affecting individual vaccine responses, but comparing these findings is challenging due to varying study designs. To address this lack of reproducibility, we established a community resource for comparing Bordetella pertussis booster responses and to host annual contests for predicting patients' vaccination outcomes. We report here on our experiences with the "dry-run" prediction contest.

View Article and Find Full Text PDF

Hospitalized COVID-19 patients exhibit diverse clinical outcomes, with some individuals diverging over time even though their initial disease severity appears similar. A systematic evaluation of molecular and cellular profiles over the full disease course can link immune programs and their coordination with progression heterogeneity. In this study, we carried out deep immunophenotyping and conducted longitudinal multi-omics modeling integrating ten distinct assays on a total of 1,152 IMPACC participants and identified several immune cascades that were significant drivers of differential clinical outcomes.

View Article and Find Full Text PDF

Overfitting describes the phenomenon where a highly predictive model on the training data generalizes poorly to future observations. It is a common concern when applying machine learning techniques to contemporary medical applications, such as predicting vaccination response and disease status in infectious disease or cancer studies. This review examines the causes of overfitting and offers strategies to counteract it, focusing on model complexity reduction, reliable model evaluation, and harnessing data diversity.

View Article and Find Full Text PDF

Computational models that predict an individual's response to a vaccine offer the potential for mechanistic insights and personalized vaccination strategies. These models are increasingly derived from systems vaccinology studies that generate immune profiles from human cohorts pre- and post-vaccination. Most of these studies involve relatively small cohorts and profile the response to a single vaccine.

View Article and Find Full Text PDF

The IMPACC cohort, composed of >1,000 hospitalized COVID-19 participants, contains five illness trajectory groups (TGs) during acute infection (first 28 days), ranging from milder (TG1-3) to more severe disease course (TG4) and death (TG5). Here, we report deep immunophenotyping, profiling of >15,000 longitudinal blood and nasal samples from 540 participants of the IMPACC cohort, using 14 distinct assays. These unbiased analyses identify cellular and molecular signatures present within 72 h of hospital admission that distinguish moderate from severe and fatal COVID-19 disease.

View Article and Find Full Text PDF

Motivation: Predictive biological signatures provide utility as biomarkers for disease diagnosis and prognosis, as well as prediction of responses to vaccination or therapy. These signatures are iden-tified from high-throughput profiling assays through a combination of dimensionality reduction and machine learning techniques. The genes, proteins, metabolites, and other biological analytes that compose signatures also generate hypotheses on the underlying mechanisms driving biological responses, thus improving biological understanding.

View Article and Find Full Text PDF

Reporter ion interference remains a limitation of isobaric tag-based sample multiplexing. Advances in instrumentation and data acquisition modes, such as the recently developed real-time database search (RTS), can reduce interference. However, interference persists as does the need to benchmark upstream sample preparation and data acquisition strategies.

View Article and Find Full Text PDF

Sample multiplexing using isobaric tagging is a powerful strategy for proteome-wide protein quantification. One major caveat of isobaric tagging is ratio compression that results from the isolation, fragmentation, and quantification of coeluting, near-isobaric peptides, a phenomenon typically referred to as "ion interference". A robust platform to ensure quality control, optimize parameters, and enable comparisons across samples is essential as new instrumentation and analytical methods evolve.

View Article and Find Full Text PDF

Multiplexing strategies are at the forefront of mass-spectrometry-based proteomics, with SPS-MS3 methods becoming increasingly commonplace. A known caveat of isobaric multiplexing is interference resulting from coisolated and cofragmented ions that do not originate from the selected precursor of interest. The triple knockout (TKO) standard was designed to benchmark data collection strategies to minimize interference.

View Article and Find Full Text PDF

The electron transport chain (ETC) is an important participant in cellular energy conversion, but its biogenesis presents the cell with numerous challenges. To address these complexities, the cell utilizes ETC assembly factors, which include the LYR protein family. Each member of this family interacts with the mitochondrial acyl carrier protein (ACP), the scaffold protein upon which the mitochondrial fatty acid synthesis (mtFAS) pathway builds fatty acyl chains from acetyl-CoA.

View Article and Find Full Text PDF