A deep attention model for wide-genome protein-peptide binding affinity prediction at a sequence level.

Int J Biol Macromol

College of Chemistry and Life Science, Beijing University of Technology, Beijing 100124, China. Electronic address:

Published: September 2024


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Peptides are pivotal in numerous biological activities by engaging in up to 40 % of protein-protein interactions in many cellular processes. Due to their exceptional specificity and effectiveness, peptides have emerged as promising candidates for drug design. However, accurately predicting protein-peptide binding affinity remains a challenging. Aiming at the problem, we develop a prediction model PepPAP based on convolutional neural network and multi-head attention, which relies solely on sequence features. These features include physicochemical properties, intrinsic disorder, sequence encoding, and especially interface propensity which is extracted from 16,689 non-redundant protein-peptide complexes. Notably, the adopted regression stratification cross-validation scheme proposed in our previous work is beneficial to improve the prediction for the cases with extreme binding affinity values. On three benchmark test datasets: T100, a series of peptides targeting to PDZ domain and CXCR4, PepPAP shows excellent performance, outperforming the existing methods and demonstrating its good generalization ability. Furthermore, PepPAP has good results in binary interaction prediction, and the analysis of the feature space distribution visualization highlights PepPAP's effectiveness. To the best of our knowledge, PepPAP is the first sequence-based deep attention model for wide-genome protein-peptide binding affinity prediction, and holds the potential to offer valuable insights for the peptide-based drug design.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.ijbiomac.2024.133811DOI Listing

Publication Analysis

Top Keywords

binding affinity
16
protein-peptide binding
12
deep attention
8
attention model
8
model wide-genome
8
wide-genome protein-peptide
8
affinity prediction
8
drug design
8
prediction
5
protein-peptide
4

Similar Publications

The interactions of three berberine mid-chain fatty acid salts ([BBR][C], n = 6, 7, 8) with lysozyme (Lyz) are investigated in detail using multi-spectroscopic and molecular docking techniques. Steady-state fluorescence and UV-visible absorption experiments suggest that the binding mechanism of [BBR][C] on Lyz is a static quenching with a binding ratio of 1:1. The compound [BBR][C] exhibits a moderate binding affinity toward Lyz.

View Article and Find Full Text PDF

The global rise in antibiotic resistance demands the urgent development of new antibacterial agents. This study investigated the antibacterial potential of four synthesized methoxy and thiophene chalcone derivatives (designated 3a, 4a, 3b, and 4b) against clinically relevant bacterial pathogens. These compounds were prepared through Claisen-Schmidt condensation, while their chemical structures were verified through applying Fourier-transform infrared, mass spectrometry, H nuclear magnetic resonance (NMR), and C NMR.

View Article and Find Full Text PDF

The Influence of Single-Stranded or Double-Stranded DNA Tags on Ligand Binding Affinity in DNA-Encoded Libraries.

Anal Chem

September 2025

Laboratory of Organic Chemistry, Department of Chemistry and Applied Biosciences, ETH Zurich, 8093 Zurich, Switzerland.

DNA-encoded libraries have become widely used in drug discovery, and several different setups to link chemical compounds to DNA have been employed in the field, including single-stranded and double-stranded DNA tags as well as a variety of linker chemistries. In our previous study, we observed distinct differences in binding affinities between ligands coupled either to single-stranded or double-stranded DNA; however, the molecular basis for these differences remained unclear. Here, we present a native ion mobility mass spectrometry approach that incorporates gas- and solution-phase activation techniques to systematically investigate these differences, specifically the impact of DNA tags on binding performance in protein-ligand interactions.

View Article and Find Full Text PDF

Clusters of deep intronic RbFox motifs embedded in large assembly of splicing regulators sequences regulate alternative splicing.

PLoS Genet

September 2025

Neural Development Section, Mouse Cancer Genetics Program, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Frederick, Maryland, United States of America.

The RbFox RNA binding proteins regulate alternative splicing of genes governing mammalian development and organ function. They bind to the RNA sequence (U)GCAUG with high affinity but also non-canonical secondary motifs in a concentration dependent manner. However, the hierarchical requirement of RbFox motifs, which are widespread in the genome, is still unclear.

View Article and Find Full Text PDF

Objective: This study employs integrated network toxicology and molecular docking to investigate the molecular basis underlying 4-nonylphenol (4-NP)-mediated enhancement of breast cancer susceptibility.

Methods: We integrated data from multiple databases, including ChEMBL, STITCH, Swiss Target Prediction, GeneCards, OMIM and TTD. Core compound-disease-associated target genes were identified through Protein-Protein Interaction (PPI) network analysis.

View Article and Find Full Text PDF