Model enhanced reinforcement learning to enable precision dosing: A theoretical case study with dosing of propofol.

CPT Pharmacometrics Syst Pharmacol

Roche Pharma Research and Early Development (pRED), F. Hoffmann La Roche Ltd., Basel, Switzerland.

Published: November 2022


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Extending the potential of precision dosing requires evaluating methodologies offering more flexibility and higher degree of personalization. Reinforcement learning (RL) holds promise in its ability to integrate multidimensional data in an adaptive process built toward efficient decision making centered on sustainable value creation. For general anesthesia in intensive care units, RL is applied and automatically adjusts dosing through monitoring of patient's consciousness. We further explore the problem of optimal control of anesthesia with propofol by combining RL with state-of-the-art tools used to inform dosing in drug development. In particular, we used pharmacokinetic-pharmacodynamic (PK-PD) modeling as a simulation engine to generate experience from dosing scenarios, which cannot be tested experimentally. Through simulations, we show that, when learning from retrospective trial data, more than 100 patients are needed to reach an accuracy within the range of what is achieved with a standard dosing solution. However, embedding a model of drug effect within the RL algorithm improves accuracy by reducing errors to target by 90% through learning to take dosing actions maximizing long-term benefit. Data residual variability impacts accuracy while the algorithm efficiently coped with up to 50% interindividual variability in the PK and 25% in the PD model's parameters. We illustrate how extending the state definition of the RL agent with meaningful variables is key to achieve high accuracy of optimal dosing policy. These results suggest that RL constitutes an attractive approach for precision dosing when rich data are available or when complemented with synthetic data from model-based tools used in model-informed drug development.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9662205PMC
http://dx.doi.org/10.1002/psp4.12858DOI Listing

Publication Analysis

Top Keywords

precision dosing
12
dosing
10
reinforcement learning
8
drug development
8
data
5
model enhanced
4
enhanced reinforcement
4
learning
4
learning enable
4
enable precision
4

Similar Publications

Reconstructing bone defects remains a significant challenge in clinical practice, driving the urgent need for advanced artificial grafts that simultaneously promote vascularization and osteogenesis. Addressing the critical trade-off between achieving high porosity/strength and effective bioactivity at safe ion doses, we incorporated strontium (Sr) into β-tricalcium phosphate (β-TCP) scaffolds with a triply periodic minimal surface (TPMS) structure using digital light processing (DLP)-based three-dimensional (3D) printing. Systematically screening Sr concentrations (0-10 mol%), we identified 10 mol% as optimal, leveraging the synergy between the biomimetic TPMS architecture, providing exceptional mechanical strength (up to 1.

View Article and Find Full Text PDF

Efficacy and Safety of Aldosterone Synthase Inhibitors for Resistant Hypertension: A Systematic Review and Meta-Analysis.

Rev Cardiovasc Med

August 2025

Department of Emergency Medicine, General Practice Medical Center, National Clinical Research Center for Geriatrics, West China Hospital, 610041 Chengdu, Sichuan, China.

Background: Compared to patients with controllable hypertension, those with resistant hypertension (RH) have a higher incidence of cardiovascular complications, including stroke, left ventricular hypertrophy, and congestive heart failure. Therefore, an urgent need exists for improved management and control, along with more effective medications. Aldosterone synthase inhibitors (ASIs) are newly emerging drugs that have gradually attracted an increasing amount of attention.

View Article and Find Full Text PDF

Background: Radiotherapy workflows conventionally deliver one treatment plan multiple times throughout the treatment course. Non-coplanar techniques with beam angle optimization or dosimetrically optimized pathfinding (DOP) exploit additional degrees of freedom to improve spatial conformality of the dose distribution compared to widely used techniques like volumetric-modulated arc therapy (VMAT). The temporal dimension of dose delivery can be exploited using multiple plans (sub-plans) within one treatment course.

View Article and Find Full Text PDF

A machine learning based dual-energy CT elemental decomposition method and its physical-biological impacts on carbon ion therapy.

Med Phys

September 2025

Key Laboratory of Carcinogenesis and Translational Research (Ministry of Education/Beijing), Department of Radiation Oncology, Peking University Cancer Hospital & Institute, Beijing, China.

Background: Dual-energy computed tomography (DECT) enhances material differentiation by leveraging energy-dependent attenuation properties particularly for carbon ion therapy. Accurate estimation of tissue elemental composition via DECT can improve quantification of physical and biological doses.

Objective: This study proposed a novel machine-learning-based DECT (ML-DECT) method to predict the physical density and mass ratios of H, C, N, O, P, and Ca.

View Article and Find Full Text PDF

Background: Janus kinase inhibitors (JAKIs) are small molecules used orally to treat inflammatory and hematological disorders. They have demonstrated impressive efficacy across multiple indications. However, concerns have emerged regarding their safety profile.

View Article and Find Full Text PDF