ProSIMSIt: The Best of Both Worlds in Data-Driven Rescoring and Identification Transfer.

Firas Hamood, Wassim Gabriel, Pia Pfeiffer, Bernhard Kuster, Mathias Wilhelm, Matthew The

J Proteome Res

Chair of Proteomics and Bioanalytics, School of Life Sciences, Technical University of Munich, 85354 Freising, Germany.

Published: April 2025

Multibatch isobaric labeling experiments are frequently applied for clinical and pharmaceutical studies of large sample cohorts. To tackle the critical issue of missing values in such studies, we introduce the ProSIMSIt pipeline. It combines the advantages of tandem mass spectrum clustering via SIMSI-Transfer and data-driven rescoring via Prosit and Oktoberfest. We demonstrate that these two tools are complementary and mutually beneficial. On large-scale cancer cohort data, ProSIMSIt increased the number of peptide spectrum matches (PSMs) by 40% on both global and phosphoproteome data sets. Furthermore, on data from proteome-wide drug-response profiling of post-translational modifications (decryptM), our pipeline substantially increased drug-PTM relations and revealed previously unseen downstream effects of drug target inhibition. ProSIMSIt is available as an open-source Python package with a simple command line interface that allows easy application to MaxQuant result files.
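The identification-transfer idea mentioned above can be illustrated with a toy sketch. This is not the ProSIMSIt or SIMSI-Transfer implementation; the function name `transfer_identifications`, the dictionary keys, and the rule "transfer only when a cluster contains exactly one distinct peptide" are assumptions made for illustration, and the rescoring step (Prosit and Oktoberfest) is not modeled here.

```python
from collections import defaultdict


def transfer_identifications(spectra):
    """Toy identification transfer within spectrum clusters (illustrative only).

    spectra: list of dicts with hypothetical keys
        'scan'    - spectrum identifier
        'cluster' - cluster label assigned by some clustering step
        'peptide' - identified peptide sequence, or None if unidentified
    Returns a new list where unidentified spectra inherit the peptide of
    their cluster, but only if that cluster holds exactly one distinct peptide.
    """
    # Collect the distinct peptide identifications seen in each cluster.
    peptides_by_cluster = defaultdict(set)
    for s in spectra:
        if s["peptide"] is not None:
            peptides_by_cluster[s["cluster"]].add(s["peptide"])

    result = []
    for s in spectra:
        peptide, transferred = s["peptide"], False
        candidates = peptides_by_cluster.get(s["cluster"], set())
        # Assumed rule: skip ambiguous clusters and clusters without any ID.
        if peptide is None and len(candidates) == 1:
            peptide, transferred = next(iter(candidates)), True
        result.append({**s, "peptide": peptide, "transferred": transferred})
    return result


if __name__ == "__main__":
    example = [
        {"scan": 1, "cluster": "A", "peptide": "PEPTIDEK"},
        {"scan": 2, "cluster": "A", "peptide": None},   # gains an ID from scan 1
        {"scan": 3, "cluster": "B", "peptide": "AAAK"},
        {"scan": 4, "cluster": "B", "peptide": "CCCK"},
        {"scan": 5, "cluster": "B", "peptide": None},   # ambiguous cluster, stays unidentified
    ]
    for row in transfer_identifications(example):
        print(row)
```

In a multibatch setting, a spectrum that was fragmented but not identified in one batch can gain an identification this way from a clustered spectrum in another batch, which is how transfer reduces missing values. A real pipeline would additionally need to control the error rate of the transferred identifications.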

PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11976853
DOI: http://dx.doi.org/10.1021/acs.jproteome.4c00967

Similar Publications

Prosit-XL: enhanced cross-linked peptide identification by fragment intensity prediction to study protein interactions and structures.

Nat Commun

July 2025

Computational Mass Spectrometry, TUM School of Life Sciences, Technical University of Munich, Freising, Germany.

Mostafa Kalhor, Cemil Can Saylan, Mario Picciani, Lutz Fischer, Falk Boudewijn Schimweg

It has been shown that integrating peptide property predictions such as fragment intensity into the scoring process of peptide spectrum match can greatly increase the number of confidently identified peptides compared to using traditional scoring methods. Here, we introduce Prosit-XL, a robust and accurate fragment intensity predictor covering the cleavable (DSSO/DSBU) and non-cleavable cross-linkers (DSS/BS3), achieving high accuracy on various holdout sets with consistent performance on external datasets without fine-tuning. Due to the complex nature of false positives in XL-MS, an approach to data-driven rescoring was developed that benefits from Prosit-XL's predictions while limiting the overestimation of the false discovery rate (FDR).

Peptide Property Prediction for Mass Spectrometry Using AI: An Introduction to State of the Art Models.

Proteomics

May 2025

Computational Mass Spectrometry, Technical University of Munich, Freising, Germany.

Jesse Angelis, Eva Ayla Schröder, Zixuan Xiao, Wassim Gabriel, Mathias Wilhelm

This review explores state of the art machine learning and deep learning models for peptide property prediction in mass spectrometry-based proteomics, including, but not limited to, models for predicting digestibility, retention time, charge state distribution, collisional cross section, fragmentation ion intensities, and detectability. The combination of these models enables not only the in silico generation of spectral libraries but also finds many additional use cases in the design of targeted assays or data-driven rescoring. This review serves as both an introduction for newcomers and an update for experienced researchers aiming to develop accessible and reproducible models for peptide property predictions.

Maximizing Immunopeptidomics-Based Bacterial Epitope Discovery by Multiple Search Engines and Rescoring.

J Proteome Res

April 2025

VIB-UGent Center for Medical Biotechnology, VIB, 9052 Ghent, Belgium.

Patrick Willems, Fabien Thery, Laura Van Moortel, Margaux De Meyer, An Staes

Mass spectrometry-based discovery of bacterial immunopeptides presented by infected cells allows untargeted discovery of bacterial antigens that can serve as vaccine candidates. However, reliable identification of bacterial epitopes is challenged by their extremely low abundance. Here, we describe an optimized bioinformatic framework to enhance the confident identification of bacterial immunopeptides.

TIMSRescore: A Data Dependent Acquisition-Parallel Accumulation and Serial Fragmentation-Optimized Data-Driven Rescoring Pipeline Based on MSRescore.

J Proteome Res

March 2025

VIB-UGent Center for Medical Biotechnology, VIB, Ghent 9052, Belgium.

Arthur Declercq, Robbe Devreese, Jonas Scheid, Caroline Jachmann, Tim Van Den Bossche

The high throughput analysis of proteins with mass spectrometry (MS) is highly valuable for understanding human biology, discovering disease biomarkers, identifying therapeutic targets, and exploring pathogen interactions. To achieve these goals, specialized proteomics subfields, including plasma proteomics, immunopeptidomics, and metaproteomics, must tackle specific analytical challenges, such as an increased identification ambiguity compared to routine proteomics experiments. Technical advancements in MS instrumentation can mitigate these issues by acquiring more discerning information at higher sensitivity levels.
