FlowDock: Geometric flow matching for generative protein-ligand docking and affinity prediction.

Bioinformatics

Department of Electrical Engineering & Computer Science, NextGen Precision Health, University of Missouri-Columbia, Columbia, MO 65211, United States.

Published: July 2025


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Motivation: Powerful generative AI models of protein-ligand structure have recently been proposed, but few of these methods support both flexible protein-ligand docking and affinity estimation. Of those that do, none can directly model multiple binding ligands concurrently or have been rigorously benchmarked on pharmacologically relevant drug targets, hindering their widespread adoption in drug discovery efforts.

Results: In this work, we propose FlowDock, the first deep geometric generative model based on conditional flow matching (CFM) that learns to directly map unbound (apo) structures to their bound (holo) counterparts for an arbitrary number of binding ligands. Furthermore, FlowDock provides predicted structural confidence scores and binding affinity values with each of its generated protein-ligand complex structures, enabling fast virtual screening of new (multi-ligand) drug targets. For the well-known PoseBusters Benchmark dataset, FlowDock outperforms single-sequence AlphaFold 3 (AF3) with a 51% blind docking success rate using unbound (apo) protein input structures and without any information derived from multiple sequence alignments, and for the challenging new DockGen-E dataset, FlowDock outperforms single-sequence AF3 and matches single-sequence Chai-1 for binding pocket generalization. Additionally, in the ligand category of the 16th community-wide Critical Assessment of Techniques for Structure Prediction, FlowDock ranked among the top-5 methods for pharmacological binding affinity estimation across 140 protein-ligand complexes, demonstrating the efficacy of its learned representations in virtual screening.

Availability And Implementation: Source code, data, and pre-trained models are available at https://github.com/BioinfoMachineLearning/FlowDock.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC12261468PMC
http://dx.doi.org/10.1093/bioinformatics/btaf187DOI Listing

Publication Analysis

Top Keywords

flow matching
8
protein-ligand docking
8
docking affinity
8
affinity estimation
8
binding ligands
8
drug targets
8
unbound apo
8
binding affinity
8
dataset flowdock
8
flowdock outperforms
8

Similar Publications

Background: Anal squamous cell cancer incidence has risen 2.2% each year over the past decade. Current screening includes anal cytology and high-resolution anoscopy but is burdened with sampling error and patient discomfort.

View Article and Find Full Text PDF

Coarse-grained (CG) molecular dynamics simulations extend the length and time scales of atomistic simulations by replacing groups of correlated atoms with CG beads. Machine-learned coarse-graining (MLCG) has recently emerged as a promising approach to construct highly accurate force fields for CG molecular dynamics. However, the calibration of MLCG force fields typically hinges on force matching, which demands extensive reference atomistic trajectories with corresponding force labels.

View Article and Find Full Text PDF

Introduction:  Endothelial dysfunction has been reported in rheumatoid arthritis (RA) patients without classical cardiovascular risk factors, but findings remain inconsistent.

Objectives:  To assess whether endothelial function is impaired in RA with moderate inflammatory burden in the absence of established cardiovascular risk factors.

Patients And Methods:  This cross-sectional study was conducted in 64 patients with RA without classical CV risk factors and 60 healthy age- and sex-matched controls.

View Article and Find Full Text PDF

Background: Clonotyping of immunoglobulin heavy chain (IGH) gene rearrangements is critical for diagnosis, prognostication, and measurable residual disease monitoring in chronic lymphocytic leukemia (CLL). Although short-read next-generation sequencing (NGS) platforms, such as Illumina MiSeq, are widely used, they face challenges in spanning full VDJ rearrangements. Long-read sequencing via Oxford Nanopore Technologies (ONT) offers a potential alternative using the compact and cost-effective flow cells.

View Article and Find Full Text PDF

This study evaluated immune cell subset variations in immune thrombocytopenia (ITP) by comparing frequencies at diagnosis with controls and assessing changes post-therapy. A single-center prospective observational study enrolled 25 untreated acute and chronic ITP patients and 20 matched controls from January 2018 to January 2019. Immune cell subsets, including CD4+, CD8+, NK cells, NK-T cells, and T regulatory cells (Tregs), were analyzed using flow cytometric immunophenotyping.

View Article and Find Full Text PDF