Spatio-temporal learning from molecular dynamics simulations for protein-ligand binding affinity prediction.

Bioinformatics

Institute of Organic and Analytical Chemistry (ICOA), UMR7311, Université d'Orléans, CNRS, Pôle de chimie rue de Chartres, 45067 Orléans Cedex 2, France.

Published: August 2025


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Motivation: The field of protein-ligand binding affinity prediction continues to face significant challenges. While deep learning (DL) models can leverage 3D structural information of protein-ligand complexes, they perform well only on heavily biased test sets containing information leaked from training sets. This lack of generalization arises from the limited availability of training data and the models' inability to effectively learn from protein-ligand interactions. Since these interactions are inherently time-dependent, molecular dynamics (MD) simulations offer a potential solution by incorporating conformational sampling and providing interaction rich information.

Results: We have developed MDbind, a dataset comprising 63 000 simulations of protein-ligand interactions, along with novel neural networks capable of learning from these simulations to predict binding affinity. By utilizing MD as data augmentation, our models achieved state-of-the-art performance on the PDBbind v.2016 core set and an external test set, the free energy perturbation (FEP) dataset. Additionally, when trained on the full MD simulations, the models demonstrated less biased predictions.

Availability And Implementation: The code for neural networks is available at https://github.com/ICOA-SBC/MD_DL_BA. The models, the results and the training/validation/test sets are available for download at https://zenodo.org/records/10390550. The MDbind trajectories are being transferred to the MDDB: https://mmb-dev.mddbr.eu/#/browse? option=mdbind.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC12371333PMC
http://dx.doi.org/10.1093/bioinformatics/btaf429DOI Listing

Publication Analysis

Top Keywords

binding affinity
12
molecular dynamics
8
dynamics simulations
8
simulations protein-ligand
8
protein-ligand binding
8
affinity prediction
8
protein-ligand interactions
8
neural networks
8
simulations
5
protein-ligand
5

Similar Publications

The terminal deoxynucleotidyl transferase is a unique polymerase that incorporates nucleotides at the 3'-terminus of single-stranded DNA primers in a template-independent manner. This biological function propels the development of numerous biomedical and bioengineering applications. However, the extensive use of TdT is constrained by its low expression levels in E.

View Article and Find Full Text PDF

Nucleic acid aptamers are artificial recognition elements with great potential in biotechnology. For their effective integration into nanodevices, rational strategies for optimizing aptamer affinity and regulating activity are essential. Artificial nucleotide analogs offer versatile tools for both fundamental and applied research in the aptamer field.

View Article and Find Full Text PDF

Developing artificial hosts with temperature-driven conformational switching behaviors facilitates our understanding of the temperature-dependent allostery and adaptation mechanisms in natural recognition systems. Herein, we report the design and synthesis of three pairs of water-soluble, enantiomeric binaphthalene-based tetraimidazolium macrocycles (SS/RR-1•4Cl- - SS/RR-3•4Cl-) as artificial hosts for exploring sequence-selective recognition of dinucleotides in aqueous media. Owing to the reversible rotational conformation of axially chiral binaphthyl units, SS-1•4Cl- demonstrates the conformational switching, converting from cis-conformation (SS-1) to trans-conformation (SS-1) by increasing temperature, thereby causing the recognition cavity to transition from a closed to an open state.

View Article and Find Full Text PDF

Docking is a structure-based cheminformatics tool broadly employed in early drug discovery. Based on the tridimensional structure of the protein target, docking is used to predict the binding interactions between the protein and a ligand, estimate the corresponding binding affinity, or perform virtual screenings (VSs) to identify new active compounds. This study introduces the ligand B-factor index (LBI), a novel computational metric for prioritizing protein-ligand complexes for docking.

View Article and Find Full Text PDF

Non-small cell lung cancer (NSCLC) is an aggressive malignancy with a poor prognosis. Abnormal expression of focal adhesion kinase (FAK) is closely linked to NSCLC progression, highlighting the need for effective FAK inhibitors in NSCLC treatment. In this study we conducted high-throughput virtual screening combined with cellular assays to identify potential FAK inhibitors for NSCLC treatment.

View Article and Find Full Text PDF