A platform-agnostic deep reinforcement learning framework for effective Sim2Real transfer towards autonomous driving.

Commun Eng

Chair of Econometrics and Statistics, esp. in the Transport Sector, Technische Universität Dresden, Dresden, Germany.

Published: October 2024


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Autonomous driving presents unique challenges, particularly in transferring agents trained in simulation to real-world environments due to the discrepancies between the two. To address this issue, here we propose a robust Deep Reinforcement Learning (DRL) framework that incorporates platform-dependent perception modules to extract task-relevant information, enabling the training of a lane-following and overtaking agent in simulation. This framework facilitates the efficient transfer of the DRL agent to new simulated environments and the real world with minimal adjustments. We assess the performance of the agent across various driving scenarios in both simulation and the real world, comparing it to human drivers and a proportional-integral-derivative (PID) baseline in simulation. Additionally, we contrast it with other DRL baselines to clarify the rationale behind choosing this framework. Our proposed approach helps bridge the gaps between different platforms and the Simulation to Reality (Sim2Real) gap, allowing the trained agent to perform consistently in both simulation and real-world scenarios, effectively driving the vehicle.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11487131PMC
http://dx.doi.org/10.1038/s44172-024-00292-3DOI Listing

Publication Analysis

Top Keywords

deep reinforcement
8
reinforcement learning
8
autonomous driving
8
simulation real-world
8
simulation
6
platform-agnostic deep
4
framework
4
learning framework
4
framework effective
4
effective sim2real
4

Similar Publications

Background: In 2021, Dr Kalra embraced an opportunity for a leadership role at a start-up healthcare organisation in India. This gave him an opportunity to adapt his National Health Service (NHS) leadership experience to the evolving Indian private healthcare landscape. This paper shares his lived experience as a National Medical Director and delves into the experiences and leadership insights he acquired during this.

View Article and Find Full Text PDF

'Still struggling, but there is a small glimmer of light at the end of the tunnel' - Individual in-depth interviews six-months after stillbirth: A follow-up study in Sweden.

Women Birth

September 2025

Department of Women's and Children's Health, Uppsala University, Uppsala, Sweden; Department of Nursing, Mid Sweden University, Sundsvall, Sweden.

Background: Few parents experience stillbirth in Sweden, and their perspectives on the grieving process remain largely unknown.

Objective: To explore parents' perspectives, memories, reflections and insights in the grieving and recovery process six months after stillbirth.

Methods: A mixed-method study involving nine in-depth interviews and responses to eleven quantitative statements.

View Article and Find Full Text PDF

Subthalamic deep brain stimulation (STN-DBS) provides unprecedented spatiotemporal precision for the treatment of Parkinson's disease (PD), allowing for direct real-time state-specific adjustments. Inspired by findings from optogenetic stimulation in mice, we hypothesized that STN-DBS can mimic dopaminergic reinforcement of ongoing movement kinematics during stimulation. To investigate this hypothesis, we delivered DBS bursts during particularly fast and slow movements in 24 patients with PD.

View Article and Find Full Text PDF

This study aims to optimize the dynamic administration regimen of prophylactic enoxaparin in critically ill patients to reduce the risk of VTE, major bleeding, and 30-day all-cause mortality. We developed and internally and externally validated an artificial intelligence (AI) policy utilizing Double dueling deep Q network, using data from the Medical Information Mart for Intensive Care IV (MIMIC-IV) database (training and internal test set) and the eICU Collaborative Research Database (eICU-CRD, external test set). We compared the performance among the AI policy, the clinician's policy, the weight-tiered policy, and the fixed 40- mg-once-daily (QD) policy.

View Article and Find Full Text PDF

The increasing dependence on cloud computing as a cornerstone of modern technological infrastructures has introduced significant challenges in resource management. Traditional load-balancing techniques often prove inadequate in addressing cloud environments' dynamic and complex nature, resulting in suboptimal resource utilization and heightened operational costs. This paper presents a novel smart load-balancing strategy incorporating advanced techniques to mitigate these limitations.

View Article and Find Full Text PDF