Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Multibatch isobaric labeling experiments are frequently applied for clinical and pharmaceutical studies of large sample cohorts. To tackle the critical issue of missing values in such studies, we introduce the ProSIMSIt pipeline. It combines the advantages of tandem mass spectrum clustering via SIMSI-Transfer and data-driven rescoring via Prosit and Oktoberfest. We demonstrate that these two tools are complementary and mutually beneficial. On large-scale cancer cohort data, ProSIMSIt increased the number of peptide spectrum matches (PSMs) by 40% on both global and phosphoproteome data sets. Furthermore, on data from proteome-wide drug-response profiling of post-translational modifications (decryptM), our pipeline substantially increased drug-PTM relations and revealed previously unseen downstream effects of drug target inhibition. ProSIMSIt is available as an open-source Python package with a simple command line interface that allows easy application to MaxQuant result files.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11976853PMC
http://dx.doi.org/10.1021/acs.jproteome.4c00967DOI Listing

Publication Analysis

Top Keywords

data-driven rescoring
8
prosimsit
4
prosimsit best
4
best worlds
4
worlds data-driven
4
rescoring identification
4
identification transfer
4
transfer multibatch
4
multibatch isobaric
4
isobaric labeling
4

Similar Publications

It has been shown that integrating peptide property predictions such as fragment intensity into the scoring process of peptide spectrum match can greatly increase the number of confidently identified peptides compared to using traditional scoring methods. Here, we introduce Prosit-XL, a robust and accurate fragment intensity predictor covering the cleavable (DSSO/DSBU) and non-cleavable cross-linkers (DSS/BS3), achieving high accuracy on various holdout sets with consistent performance on external datasets without fine-tuning. Due to the complex nature of false positives in XL-MS, an approach to data-driven rescoring was developed that benefits from Prosit-XL's predictions while limiting the overestimation of the false discovery rate (FDR).

View Article and Find Full Text PDF

This review explores state of the art machine learning and deep learning models for peptide property prediction in mass spectrometry-based proteomics, including, but not limited to, models for predicting digestibility, retention time, charge state distribution, collisional cross section, fragmentation ion intensities, and detectability. The combination of these models enables not only the in silico generation of spectral libraries but also finds many additional use cases in the design of targeted assays or data-driven rescoring. This review serves as both an introduction for newcomers and an update for experienced researchers aiming to develop accessible and reproducible models for peptide property predictions.

View Article and Find Full Text PDF

Multibatch isobaric labeling experiments are frequently applied for clinical and pharmaceutical studies of large sample cohorts. To tackle the critical issue of missing values in such studies, we introduce the ProSIMSIt pipeline. It combines the advantages of tandem mass spectrum clustering via SIMSI-Transfer and data-driven rescoring via Prosit and Oktoberfest.

View Article and Find Full Text PDF

Mass spectrometry-based discovery of bacterial immunopeptides presented by infected cells allows untargeted discovery of bacterial antigens that can serve as vaccine candidates. However, reliable identification of bacterial epitopes is challenged by their extremely low abundance. Here, we describe an optimized bioinformatic framework to enhance the confident identification of bacterial immunopeptides.

View Article and Find Full Text PDF

The high throughput analysis of proteins with mass spectrometry (MS) is highly valuable for understanding human biology, discovering disease biomarkers, identifying therapeutic targets, and exploring pathogen interactions. To achieve these goals, specialized proteomics subfields, including plasma proteomics, immunopeptidomics, and metaproteomics, must tackle specific analytical challenges, such as an increased identification ambiguity compared to routine proteomics experiments. Technical advancements in MS instrumentation can mitigate these issues by acquiring more discerning information at higher sensitivity levels.

View Article and Find Full Text PDF