Enhancing cardiac hemodynamic and pulsatility in heart failure via deep reinforcement learning: An in-silico and in-vitro validation study of percutaneous ventricular assist devices.

Yuyang Shi , Zhike Xu , Chenghan Chen , Feng He , Pengfei Hao , Xiwen Zhang

Comput Methods Programs Biomed

Department of Engineering Mechanics, School of Aerospace Engineering, Tsinghua University, Beijing 100084, China. Electronic address:

Published: October 2025

Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

Background And Objective: Percutaneous ventricular assist devices (pVADs) are critical for bridging heart failure (HF) patients to recovery or transplantation, yet existing control strategies-constant speed control and preprogrammed pulsatile control-lack adaptability to dynamic physiological variations, leading to reduced pulsatility and hemodynamic mismatch. This study proposes a deep reinforcement learning (DRL)-based adaptive control framework to optimize pVAD performance. The goal is to restore physiological pulsatile hemodynamics while autonomously adjusting to different HF conditions, heart rate fluctuations, and intra-cycle ejection phase variability.

Methods: Following a dual-validation pathway designed to bridge simulation with physical testing, a cardiovascular-pVAD in-silico model was developed and its fidelity confirmed against an in-vitro pulsatile mock circulatory loop. This validated platform was then used to design and test the DRL controller. A modified Deep Deterministic Policy Gradient (DDPG) algorithm with embedded LSTM layers was designed to capture temporal characteristics in aortic pressure (AOP) and aortic flow(AF) waveforms. The reward function integrated hemodynamic recoverability, pulsatile waveform similarity, and control stability and safety penalty.

Results: Comparative simulations and experiments demonstrated the DRL controller's superiority over conventional strategies. Under the moderate HF condition, DRL controller achieved near-physiological AOP (DTW-AOP: 1.17 vs. 16.42 for constant speed control; 2.72 for preprogrammed pulsatile control) and AF (DTW-AF: 21.23 vs. 71.74/48.96), with pulsatility indices (PI: 1.69 vs. 1.05/1.54) and pulse pressures (PP: 34.42 mmHg vs. 3.20/24.90 mmHg) closely matching healthy reference. The framework exhibited robust adaptability to heart rate shifts (75→120 bpm) and ejection phase delays (0.1 s), maintaining stability despite sensor noise and physiological perturbations.

Conclusions: This DRL controller enables real-time synchronization with native cardiac cycles and generalization across pathologies, paving the way for precision pVAD support and future clinical translation.

Download full-text PDF	Source
http://dx.doi.org/10.1016/j.cmpb.2025.108975	DOI Listing

Publication Analysis

Top Keywords

drl controller

heart failure

deep reinforcement

reinforcement learning

percutaneous ventricular

ventricular assist

assist devices

speed control

preprogrammed pulsatile

heart rate

Similar Publications

Sensor-Driven Surrogate Modeling and Control of Nonlinear Dynamical Systems Using FAE-CAE-LSTM and Deep Reinforcement Learning.

Sensors (Basel)

August 2025

School of Innovation, Design and Engineering, Mälardalen University, 722 20 Västerås, Sweden.

Mahdi Kherad , Mohammad Kazem Moayyedi , Faranak Fotouhi-Ghazvini , Maryam Vahabi , Hossein Fotouhi

In cyber-physical systems governed by nonlinear partial differential equations (PDEs), real-time control is often limited by sparse sensor data and high-dimensional system dynamics. Deep reinforcement learning (DRL) has shown promise for controlling such systems, but training DRL agents directly on full-order simulations is computationally intensive. This paper presents a sensor-driven, non-intrusive reduced-order modeling (NIROM) framework called FAE-CAE-LSTM, which combines convolutional and fully connected autoencoders with a long short-term memory (LSTM) network.

View Article and Find Full Text PDF

Similar Publications

NaviNeRF++: Towards Interpretable 3D Reconstruction via Unsupervised Disentangled Representation Learning.

IEEE Trans Pattern Anal Mach Intell

August 2025

Baao Xie , Zequn Zhang , Huanting Guo , Qiuyu Chen , Hu Zhu

3D reconstruction is a pivotal technology that recreates three-dimensional structures from two-dimensional representations, facilitating AI's understanding and interaction with the real world. However, existing methods pose challenges from two perspectives, i.e.

View Article and Find Full Text PDF

Similar Publications

A model-based method for reporting mammographic diagnostic reference levels for any compressed breast thickness: A refined reporting approach.

J Appl Clin Med Phys

September 2025

Cancer Control Program, Ministry of Health, Riyadh, Saudi Arabia.

Ahmad A Alhulail , Salman M Albeshan , Maha M Almuqbil

Background: Mammography is a critical tool for early breast cancer detection, but its use of ionizing radiation necessitates careful monitoring and optimization of patient exposure to ensure safety. Conventional methods for reporting diagnostic reference levels (DRLs) rely on wide compressed breast thickness (CBT) ranges, which lack the precision to account for individual variations, limiting their effectiveness in optimizing mammographic radiation doses.

Purpose: To develop an equation-based approach that provides a DRL for any given CBT.

View Article and Find Full Text PDF

Similar Publications

Patient-Specific Deep Reinforcement Learning for Automatic Replanning in Head-and-Neck Cancer Proton Therapy.

ArXiv

August 2025

Department of Radiation and Cellular Oncology, University of Chicago, Chicago, IL, USA.

Malvern Madondo , Yuan Shao , Yingzi Liu , Jun Zhou , Xiaofeng Yang

Anatomical changes in head-and-neck cancer (HNC) patients during intensity-modulated proton therapy (IMPT) can shift the Bragg Peak of proton beams, risking tumor underdosing and organ-at-risk (OAR) overdosing. As a result, treatment replanning is often required to maintain clinically acceptable treatment quality. However, current manual replanning processes are often resource intensive and time consuming.

View Article and Find Full Text PDF

Similar Publications

Knowledge-GPT Guided Generalizable Reinforcement Learning for Intelligent Emergency Generator Tripping in Power System.

IEEE Trans Neural Netw Learn Syst

August 2025

Bing Wang , Tianjing Wang , Yong Tang , Yanhao Huang

Emergency control is essential for ensuring transient stability in power systems after faults. This study addresses the limitations in existing methods by proposing a knowledge-generative pretrained transformer (GPT)-guided generalizable reinforcement learning (RL) approach for intelligent emergency generator tripping. This approach incorporates general electrical principles and knowledge-GPT to assist deep reinforcement learning (DRL).

View Article and Find Full Text PDF

Similar Publications