Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Background And Objective: Percutaneous ventricular assist devices (pVADs) are critical for bridging heart failure (HF) patients to recovery or transplantation, yet existing control strategies-constant speed control and preprogrammed pulsatile control-lack adaptability to dynamic physiological variations, leading to reduced pulsatility and hemodynamic mismatch. This study proposes a deep reinforcement learning (DRL)-based adaptive control framework to optimize pVAD performance. The goal is to restore physiological pulsatile hemodynamics while autonomously adjusting to different HF conditions, heart rate fluctuations, and intra-cycle ejection phase variability.

Methods: Following a dual-validation pathway designed to bridge simulation with physical testing, a cardiovascular-pVAD in-silico model was developed and its fidelity confirmed against an in-vitro pulsatile mock circulatory loop. This validated platform was then used to design and test the DRL controller. A modified Deep Deterministic Policy Gradient (DDPG) algorithm with embedded LSTM layers was designed to capture temporal characteristics in aortic pressure (AOP) and aortic flow(AF) waveforms. The reward function integrated hemodynamic recoverability, pulsatile waveform similarity, and control stability and safety penalty.

Results: Comparative simulations and experiments demonstrated the DRL controller's superiority over conventional strategies. Under the moderate HF condition, DRL controller achieved near-physiological AOP (DTW-AOP: 1.17 vs. 16.42 for constant speed control; 2.72 for preprogrammed pulsatile control) and AF (DTW-AF: 21.23 vs. 71.74/48.96), with pulsatility indices (PI: 1.69 vs. 1.05/1.54) and pulse pressures (PP: 34.42 mmHg vs. 3.20/24.90 mmHg) closely matching healthy reference. The framework exhibited robust adaptability to heart rate shifts (75→120 bpm) and ejection phase delays (0.1 s), maintaining stability despite sensor noise and physiological perturbations.

Conclusions: This DRL controller enables real-time synchronization with native cardiac cycles and generalization across pathologies, paving the way for precision pVAD support and future clinical translation.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.cmpb.2025.108975DOI Listing

Publication Analysis

Top Keywords

drl controller
12
heart failure
8
deep reinforcement
8
reinforcement learning
8
percutaneous ventricular
8
ventricular assist
8
assist devices
8
speed control
8
preprogrammed pulsatile
8
heart rate
8

Similar Publications

In cyber-physical systems governed by nonlinear partial differential equations (PDEs), real-time control is often limited by sparse sensor data and high-dimensional system dynamics. Deep reinforcement learning (DRL) has shown promise for controlling such systems, but training DRL agents directly on full-order simulations is computationally intensive. This paper presents a sensor-driven, non-intrusive reduced-order modeling (NIROM) framework called FAE-CAE-LSTM, which combines convolutional and fully connected autoencoders with a long short-term memory (LSTM) network.

View Article and Find Full Text PDF

3D reconstruction is a pivotal technology that recreates three-dimensional structures from two-dimensional representations, facilitating AI's understanding and interaction with the real world. However, existing methods pose challenges from two perspectives, i.e.

View Article and Find Full Text PDF

Background: Mammography is a critical tool for early breast cancer detection, but its use of ionizing radiation necessitates careful monitoring and optimization of patient exposure to ensure safety. Conventional methods for reporting diagnostic reference levels (DRLs) rely on wide compressed breast thickness (CBT) ranges, which lack the precision to account for individual variations, limiting their effectiveness in optimizing mammographic radiation doses.

Purpose: To develop an equation-based approach that provides a DRL for any given CBT.

View Article and Find Full Text PDF

Anatomical changes in head-and-neck cancer (HNC) patients during intensity-modulated proton therapy (IMPT) can shift the Bragg Peak of proton beams, risking tumor underdosing and organ-at-risk (OAR) overdosing. As a result, treatment replanning is often required to maintain clinically acceptable treatment quality. However, current manual replanning processes are often resource intensive and time consuming.

View Article and Find Full Text PDF

Emergency control is essential for ensuring transient stability in power systems after faults. This study addresses the limitations in existing methods by proposing a knowledge-generative pretrained transformer (GPT)-guided generalizable reinforcement learning (RL) approach for intelligent emergency generator tripping. This approach incorporates general electrical principles and knowledge-GPT to assist deep reinforcement learning (DRL).

View Article and Find Full Text PDF