Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Lifelong deep reinforcement learning (DRL) approaches are commonly employed to adapt continuously to new tasks without forgetting previously acquired knowledge. While current lifelong DRL methods have shown promising advancements in retaining acquired knowledge, they suffer from significant adaptation efforts (i.e., longer training duration) and suboptimal policy when transferring to a new task that significantly deviates from previously learned tasks, a phenomenon known as the few-shot generalization challenge. In this work, we propose a generic approach that equips existing lifelong DRL methods with the capability of few-shot generalization. First, we employ selective experience reuse by leveraging the experience of encountered states, improving adaptation training for new tasks. Then, a relaxed softmax function is applied to the target Q values to improve the accuracy of evaluated Q values, leading to more optimal policies. Finally, we measure and reduce the discrepancy in data distribution between the policy and off-policy samples, resulting in improved adaptation efficiency. Extensive experiments have been conducted on three typical benchmarks to compare our approach with six representative lifelong DRL methods and two state-of-the-art (SOTA) few-shot DRL methods regarding their training speed, episode return, and average return of all episodes. Experimental results substantiate that our method improves the return of six lifelong DRL methods by at least 25%.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TNNLS.2024.3385570DOI Listing

Publication Analysis

Top Keywords

drl methods
20
lifelong drl
16
lifelong deep
8
deep reinforcement
8
reinforcement learning
8
acquired knowledge
8
few-shot generalization
8
lifelong
6
drl
6
methods
5

Similar Publications

Diagnostic reference levels (DRLs) are essential for optimizing radiologic practices and ensuring patient safety. This study aimed to establish typical DRLs for nuclear medicine (NM) procedures performed at a Brazilian public university hospital. A retrospective analysis of 2,609 patient records from 13 routine NM procedures was conducted.

View Article and Find Full Text PDF

The increasing dependence on cloud computing as a cornerstone of modern technological infrastructures has introduced significant challenges in resource management. Traditional load-balancing techniques often prove inadequate in addressing cloud environments' dynamic and complex nature, resulting in suboptimal resource utilization and heightened operational costs. This paper presents a novel smart load-balancing strategy incorporating advanced techniques to mitigate these limitations.

View Article and Find Full Text PDF

Turbulent convection governs heat transport in both natural and industrial settings, yet optimizing it under extreme conditions remains a significant challenge. Traditional control strategies, such as predefined temperature modulation, struggle to achieve substantial enhancement. Here, we introduce a deep reinforcement learning (DRL) framework that autonomously discovers optimal control policies to maximize heat transfer in turbulent Rayleigh-Bénard convection.

View Article and Find Full Text PDF

This review covers recent advances (2023-2024) in neuroimaging research into the pathophysiology, progression, and treatment of Alzheimer's disease (AD) and related dementias (ADRD). Despite the rapid emergence of blood-based biomarkers, neuroimaging continues to be a vital area of research in ADRD. Here, we discuss neuroimaging as a powerful tool to topographically visualize and quantify amyloid, tau, neurodegeneration, inflammation, and vascular disease in the brain.

View Article and Find Full Text PDF

Introduction: Diagnostic reference levels (DRLs) are essential for optimising ionising radiation use in medical imaging and minimising patient exposure. Radiographers play a key role in implementing DRLs to ensure dose optimisation and high-quality imaging. However, gaps in awareness and understanding can hinder effective application.

View Article and Find Full Text PDF