Publications by authors named "Karl R Clauser"

Lung adenocarcinomas (LUAD) are a pressing global health problem with enduring lethality and rapidly shifting epidemiology. Proteogenomic studies integrating proteomics and post-translational modifications with genomics can identify clinical strata and oncogenic mechanisms, but have been underpowered to examine effects of ethnicity, smoking and environmental exposures, or sex on this heterogeneous disease. This comprehensive proteogenomic analysis of LUAD tumors and matched normal adjacent tissues from 406 patients across diverse geographic and demographic backgrounds explores the impact of understudied driver mutations, prognostic role of chromosomal instability, patterns of immune signaling, differential and sex-specific effects of endogenous mutagens and environmental carcinogens, and pathobiology of early-stage tumors with "late-like" characteristics.

View Article and Find Full Text PDF

Defining viral proteomes is crucial to understanding viral life cycles and immune recognition but the landscape of translated regions remains unknown for most viruses. We have developed massively parallel ribosome profiling (MPRP) to determine open reading frames (ORFs) across tens of thousands of designed oligonucleotides. MPRP identified 4208 unannotated ORFs in 679 human-associated viral genomes.

View Article and Find Full Text PDF

Translation of the noncoding genome in cancer can generate cryptic (noncanonical) peptides capable of presentation by human leukocyte antigen class I (HLA-I); however, the cancer specificity and immunogenicity of noncanonical HLA-I-bound peptides (ncHLAp) are incompletely understood. Using high-resolution immunopeptidomics, we discovered that cryptic peptides are abundant in the pancreatic cancer immunopeptidome. Approximately 30% of ncHLAp exhibited cancer-restricted translation, and a substantial subset were shared among patients.

View Article and Find Full Text PDF

We investigate the impact of germline variants on cancer patients' proteomes, encompassing 1,064 individuals across 10 cancer types. We introduced an approach, "precision peptidomics," mapping 337,469 coding germline variants onto peptides from patients' mass spectrometry data, revealing their potential impact on post-translational modifications, protein stability, allele-specific expression, and protein structure by leveraging the relevant protein databases. We identified rare pathogenic and common germline variants in cancer genes potentially affecting proteomic features, including variants altering protein abundance and structure and variants in kinases (ERBB2 and MAP2K2) impacting phosphorylation.

View Article and Find Full Text PDF

Clear cell renal cell carcinoma (ccRCC), despite having a low mutational burden, is considered immunogenic because it occasionally undergoes spontaneous regressions and often responds to immunotherapies. The signature lesion in ccRCC is inactivation of the VHL tumor suppressor gene and consequent upregulation of the HIF transcription factor. An earlier case report described a ccRCC patient who was cured by an allogeneic stem cell transplant and later found to have donor-derived T cells that recognized a ccRCC-specific peptide encoded by a HIF-responsive endogenous retrovirus (ERV), ERVE-4.

View Article and Find Full Text PDF
Article Synopsis
  • The study focuses on T cell alloreactivity against minor histocompatibility antigens (mHAgs), which play a crucial role in the success of allogeneic hematopoietic cell transplantation (allo-HCT) by affecting graft-versus-leukemia (GvL) and graft-versus-host disease (GvHD) reactions.
  • A new analytic framework was developed to identify mHAgs by integrating data from whole-exome sequencing, organ-specific expression, and proteome analysis from donor-recipient pairs.
  • The research found that the overall and organ-specific mHAg load in 220 matched D-R pairs could predict the risk of acute and chronic Gv
View Article and Find Full Text PDF

A hallmark of high-risk childhood medulloblastoma is the dysregulation of RNA translation. Currently, it is unknown whether medulloblastoma dysregulates the translation of putatively oncogenic non-canonical open reading frames (ORFs). To address this question, we performed ribosome profiling of 32 medulloblastoma tissues and cell lines and observed widespread non-canonical ORF translation.

View Article and Find Full Text PDF

Targeted synthetic vaccines have the potential to transform our response to viral outbreaks, yet the design of these vaccines requires a comprehensive knowledge of viral immunogens. Here, we report severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) peptides that are naturally processed and loaded onto human leukocyte antigen-II (HLA-II) complexes in infected cells. We identify over 500 unique viral peptides from canonical proteins as well as from overlapping internal open reading frames.

View Article and Find Full Text PDF

Unveiling the complete proteome of viruses is crucial to our understanding of the viral life cycle and interaction with the host. We developed Massively Parallel Ribosome Profiling (MPRP) to experimentally determine open reading frames (ORFs) in 20,170 designed oligonucleotides across 679 human-associated viral genomes. We identified 5,381 ORFs, including 4,208 non-canonical ORFs, and show successful detection of both annotated coding sequences (CDSs) and reported non-canonical ORFs.

View Article and Find Full Text PDF
Article Synopsis
  • Post-translational modifications (PTMs) significantly influence cell signaling and physiology in both healthy and cancerous cells, with recent advancements in mass spectrometry allowing for precise analysis of these modifications.* -
  • This study utilizes the largest dataset of proteogenomics from 1,110 cancer patients to uncover widespread patterns of protein changes, particularly focusing on acetylation and phosphorylation across 11 cancer types.* -
  • Findings show that specific cancer types exhibit unique PTM-related alterations linked to processes like DNA repair, immune response, kinase activity, and histone regulation, suggesting new potential therapeutic targets.*
View Article and Find Full Text PDF

Cancer driver events refer to key genetic aberrations that drive oncogenesis; however, their exact molecular mechanisms remain insufficiently understood. Here, our multi-omics pan-cancer analysis uncovers insights into the impacts of cancer drivers by identifying their significant cis-effects and distal trans-effects quantified at the RNA, protein, and phosphoprotein levels. Salient observations include the association of point mutations and copy-number alterations with the rewiring of protein interaction networks, and notably, most cancer genes converge toward similar molecular states denoted by sequence-based kinase activity profiles.

View Article and Find Full Text PDF
Article Synopsis
  • Ribosome profiling (Ribo-Seq) has revealed thousands of noncanonical open reading frames (ORFs) that might expand the number of human protein-coding sequences (CDSs) by up to 30%, increasing the count from approximately 19,500 to over 26,000.
  • * However, there are significant uncertainties about how many of these noncanonical ORFs actually produce functional proteins, with estimates varying widely from a few thousand to several hundred thousand.
  • * This research gap has left the genomics and proteomics communities excited but also in need of guidance on how to evaluate the coding potential of these noncanonical ORFs.*
View Article and Find Full Text PDF

Targeted synthetic vaccines have the potential to transform our response to viral outbreaks; yet the design of these vaccines requires a comprehensive knowledge of viral immunogens, including T-cell epitopes. Having previously mapped the SARS-CoV-2 HLA-I landscape, here we report viral peptides that are naturally processed and loaded onto HLA-II complexes in infected cells. We identified over 500 unique viral peptides from canonical proteins, as well as from overlapping internal open reading frames (ORFs), revealing, for the first time, the contribution of internal ORFs to the HLA-II peptide repertoire.

View Article and Find Full Text PDF

CD4+ T cell responses are exquisitely antigen specific and directed toward peptide epitopes displayed by human leukocyte antigen class II (HLA-II) on antigen-presenting cells. Underrepresentation of diverse alleles in ligand databases and an incomplete understanding of factors affecting antigen presentation in vivo have limited progress in defining principles of peptide immunogenicity. Here, we employed monoallelic immunopeptidomics to identify 358,024 HLA-II binders, with a particular focus on HLA-DQ and HLA-DP.

View Article and Find Full Text PDF
Article Synopsis
  • - Ribosome profiling (Ribo-seq) has revealed that there may be at least 7,000 non-canonical open reading frames (ORFs) in the human genome that could expand the number of recognized protein-coding sequences by 30% from around 19,500 to over 26,000.
  • - Despite the exciting possibilities for new coding regions, the scientific community faces challenges in verifying how many of these ORFs actually produce proteins, as estimates of their quantity range widely from a few thousand to several hundred thousand.
  • - The article discusses ongoing research on non-canonical ORFs, the use of ribosome profiling and immunopeptidomics to study them, and the need to understand the evidence required to classify
View Article and Find Full Text PDF

A hallmark of high-risk childhood medulloblastoma is the dysregulation of RNA translation. Currently, it is unknown whether medulloblastoma dysregulates the translation of putatively oncogenic non-canonical open reading frames. To address this question, we performed ribosome profiling of 32 medulloblastoma tissues and cell lines and observed widespread non-canonical ORF translation.

View Article and Find Full Text PDF

Comprehensive and in-depth identification of the human leukocyte antigen class I (HLA-I) and class II (HLA-II) tumor immunopeptidome can inform the development of cancer immunotherapies. Mass spectrometry (MS) is a powerful technology for direct identification of HLA peptides from patient-derived tumor samples or cell lines. However, achieving sufficient coverage to detect rare and clinically relevant antigens requires highly sensitive MS-based acquisition methods and large amounts of sample.

View Article and Find Full Text PDF

Unlabelled: Metastases are hard to detect and treat, and they cause most cancer-related deaths. The relative lack of therapies targeting metastases represents a major unmet clinical need. The extracellular matrix (ECM) forms a major component of the tumor microenvironment in both primary and metastatic tumors, and certain ECM proteins can be selectively and abundantly expressed in tumors.

View Article and Find Full Text PDF

Serial multi-omic analysis of proteome, phosphoproteome, and acetylome provides insights into changes in protein expression, cell signaling, cross-talk and epigenetic pathways involved in disease pathology and treatment. However, ubiquitylome and HLA peptidome data collection used to understand protein degradation and antigen presentation have not together been serialized, and instead require separate samples for parallel processing using distinct protocols. Here we present MONTE, a highly sensitive multi-omic native tissue enrichment workflow, that enables serial, deep-scale analysis of HLA-I and HLA-II immunopeptidome, ubiquitylome, proteome, phosphoproteome, and acetylome from the same tissue sample.

View Article and Find Full Text PDF
Article Synopsis
  • Cancers like Merkel cell carcinoma (MCC) can evade the immune system by disrupting HLA class I antigen presentation, which is crucial for immune detection.
  • Researchers generated cell lines from MCC patients and found that certain genes responsible for HLA-I presentation were suppressed.
  • They identified two main regulators of HLA-I loss: MYCL and the PRC1.1 complex, with a focus on USP7 as a potential target for drug development to enhance HLA-I expression in MCC.
View Article and Find Full Text PDF

Loss-of-function mutations in the secreted enzyme ADAMTS7 (a disintegrin and metalloproteinase with thrombospondin motifs 7) are associated with protection for coronary artery disease. ADAMTS7 catalytic inhibition has been proposed as a therapeutic strategy for treating coronary artery disease; however, the lack of an endogenous substrate has hindered the development of activity-based biomarkers. To identify ADAMTS7 extracellular substrates and their cleavage sites relevant to vascular disease, we used TAILS (terminal amine isotopic labeling of substrates), a method for identifying protease-generated neo-N termini.

View Article and Find Full Text PDF

Genomic analyses in cancer have been enormously impactful, leading to the identification of driver mutations and development of targeted therapies. But the functions of the vast majority of somatic mutations and copy number variants in tumours remain unknown, and the causes of resistance to targeted therapies and methods to overcome them are poorly defined. Recent improvements in mass spectrometry-based proteomics now enable direct examination of the consequences of genomic aberrations, providing deep and quantitative characterization of tumour tissues.

View Article and Find Full Text PDF

Within the tumor immunology community, the topic of proteasomal spliced peptides (PSP) has generated a great deal of controversy. In the earliest reports, careful biological validation led to the conclusion that proteasome-catalyzed peptide splicing was a rare event. To date, six PSPs have been validated biologically.

View Article and Find Full Text PDF

Tumor-associated epitopes presented on MHC-I that can activate the immune system against cancer cells are typically identified from annotated protein-coding regions of the genome, but whether peptides originating from novel or unannotated open reading frames (nuORFs) can contribute to antitumor immune responses remains unclear. Here we show that peptides originating from nuORFs detected by ribosome profiling of malignant and healthy samples can be displayed on MHC-I of cancer cells, acting as additional sources of cancer antigens. We constructed a high-confidence database of translated nuORFs across tissues (nuORFdb) and used it to detect 3,555 translated nuORFs from MHC-I immunopeptidome mass spectrometry analysis, including peptides that result from somatic mutations in nuORFs of cancer samples as well as tumor-specific nuORFs translated in melanoma, chronic lymphocytic leukemia and glioblastoma.

View Article and Find Full Text PDF

MS is the most effective method to directly identify peptides presented on human leukocyte antigen (HLA) molecules. However, current standard approaches often use 500 million or more cells as input to achieve high coverage of the immunopeptidome, and therefore, these methods are not compatible with the often limited amounts of tissue available from clinical tumor samples. Here, we evaluated microscaled basic reversed-phase fractionation to separate HLA peptide samples offline followed by ion mobility coupled to LC-MS/MS for analysis.

View Article and Find Full Text PDF