Fine-tuning molecular mechanics force fields to experimental free energy measurements.

bioRxiv

Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY 10065.

Published: January 2025


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Alchemical free energy methods using molecular mechanics (MM) force fields are essential tools for predicting thermodynamic properties of small molecules, especially via free energy calculations that can estimate quantities relevant for drug discovery such as affinities, selectivities, the impact of target mutations, and ADMET properties. While traditional MM forcefields rely on hand-crafted, discrete atom types and parameters, modern approaches based on graph neural networks (GNNs) learn continuous embedding vectors that represent chemical environments from which MM parameters can be generated. Excitingly, GNN parameterization approaches provide a fully end-to-end differentiable model that offers the possibility of systematically improving these models using experimental data. In this study, we treat a pretrained GNN force field-here, espaloma-0.3.2-as a model and its charge model using limited quantities of experimental hydration free energy data, with the goal of assessing the degree to which this can systematically improve the prediction of other related free energies. We demonstrate that a highly efficient "one-shot fine-tuning" method using an exponential (Zwanzig) reweighting free energy estimator can improve prediction accuracy without the need to resimulate molecular configurations. To achieve this "one-shot" improvement, we demonstrate the importance of using effective sample size (ESS) regularization strategies to retain good overlap between initial and fine-tuned force fields. Moreover, we show that leveraging low-rank projections of embedding vectors can achieve comparable accuracy improvements as higher-dimensional approaches in a variety of data-size regimes. Our results demonstrate that linearly-perturbative fine-tuning of foundation model electrostatic parameters to limited experimental data offers a cost-effective strategy that achieves state-of-the-art performance in predicting hydration free energies on the FreeSolv dataset.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11741335PMC
http://dx.doi.org/10.1101/2025.01.06.631610DOI Listing

Publication Analysis

Top Keywords

free energy
20
force fields
12
molecular mechanics
8
mechanics force
8
embedding vectors
8
experimental data
8
hydration free
8
improve prediction
8
free energies
8
free
7

Similar Publications

To address the increasingly limited water availability, using metal-organic frameworks (MOFs) to capture atmospheric water vapor as usable resources has emerged as a promising strategy. The adsorption characteristics of MOFs as well as their step pressure (i.e.

View Article and Find Full Text PDF

A series of Cu-based single-atom catalysts (SACs) with asymmetric coordination were designed to accelerate lithium-sulfur (Li-S) chemistry. The electronegativity contrast from the dopant induces a localized electronic asymmetry that amplifies Jahn-Teller distortion at the Cu center. This distortion profoundly modulates the Cu 3d electronic structure and its interaction with Li-S intermediates.

View Article and Find Full Text PDF

Cyclin-dependent kinase 20 (CDK20), also known as cell cycle-related kinase (CCRK), plays a pivotal role in hepatocellular carcinoma (HCC) progression by regulating β-catenin signaling and promoting uncontrolled proliferation. Despite its emerging significance, selective small-molecule inhibitors of CDK20 remain unexplored. In this study, a known CDK20 inhibitor, ISM042-2-048, was employed as a reference to retrieve structurally similar compounds from the PubChem database using an 85% similarity threshold.

View Article and Find Full Text PDF

Design and synthesis of novel indolinone Aurora B kinase inhibitors based on fragment-based drug discovery (FBDD).

Mol Divers

September 2025

State Key Laboratory Basis of Xinjiang Indigenous Medicinal Plants Resource Utilization, Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Sciences, Urumqi, 830011, Xinjiang, China.

Aurora kinases are a group of serine/threonine kinases essential for cell mitosis, comprising Aurora A, B, and C. However, the Aurora B is overexpressed in multiple tumors and the aurone has been proved to exhibit potent inhibitory activity against Aurora B kinase by our group. The indolinone was considered as an aurone scaffold hopping analog, and the indolinone-based Aurora B inhibitor library (3577 molecules) was constructed by FBDD strategy.

View Article and Find Full Text PDF

To overcome the potential issue of active site blockage by surfactants in colloidal synthesis, alternative synthetic approaches must be explored. In this study, we investigated both solvent-free and colloidal thermolysis routes to synthesize nickel sulfides (NiS and NiS) using sulfur-based Ni complexes, [Ni(SCO(CH))] (Ni-Xan) and [Ni(SCN(CH))] (Ni-DTC) as precursors. The solvent-free decomposition of these complexes produced ligand-free NiS and NiS in the absence or presence of triphenylphosphine (TPP), respectively.

View Article and Find Full Text PDF