Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

This article dedicates to investigating a methodology for enhancing adaptability to environmental changes of reinforcement learning (RL) techniques with data efficiency, by which a joint control protocol is learned using only data for multiagent systems (MASs). Thus, all followers are able to synchronize themselves with the leader and minimize their individual performance. To this end, an optimal synchronization problem of heterogeneous MASs is first formulated, and then an arbitration RL mechanism is developed for well addressing key challenges faced by the current RL techniques, that is, insufficient data and environmental changes. In the developed mechanism, an improved Q-function with an arbitration factor is designed for accommodating the fact that control protocols tend to be made by historic experiences and instinctive decision-making, such that the degree of control over agents' behaviors can be adaptively allocated by on-policy and off-policy RL techniques for the optimal multiagent synchronization problem. Finally, an arbitration RL algorithm with critic-only neural networks is proposed, and theoretical analysis and proofs of synchronization and performance optimality are provided. Simulation results verify the effectiveness of the proposed method.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TCYB.2024.3440333DOI Listing

Publication Analysis

Top Keywords

reinforcement learning
8
multiagent systems
8
environmental changes
8
synchronization problem
8
synchronization
4
learning synchronization
4
synchronization heterogeneous
4
heterogeneous multiagent
4
systems improved
4
improved q-functions
4

Similar Publications

A machine learning-designed "supramolecular armor" imparts exceptional stability to perovskite quantum dots. A guanidinium crosslinker reinforces a β-cyclodextrin layer, creating a robust yet permeable interface that enables direct contact sensing in challenging aqueous environments.

View Article and Find Full Text PDF

Every day we encounter situations in which decisions require trade-offs between the delay to one reward and the likelihood of receiving another reward. The current study was designed to extend a general discounting framework to gain insights into this fundamental trade-off process. Forty-three undergraduates adjusted the probability of receiving an immediate hypothetical monetary reward (either $200 or $10,000) until that probabilistic reward was judged subjectively equal in value to the same reward received with certainty after a delay (ranging from 1 month to 25 years).

View Article and Find Full Text PDF

Eye Contact: To Teach or Not to Teach? That is Not the Question.

Perspect Behav Sci

September 2025

ABA Clinic, United Kingdom of Great Britain and Northern Ireland, 40A Burgess Road, Southampton, SO16 7AH UK.

In recent years, the question has been raised as to whether teaching eye contact to autistic children is an ethically defensible educational objective. In the present article, I suggest that this question may be best answered by first defining contact with the eyes not as behavior, but as a consequence for the behavior of looking. Looking at people's faces, and in particular the eyes, provides information regarding the discriminative functions and reinforcing value of social stimuli, of people, of what they do, what they say, and what they feel, and is a critical part of all social behavior.

View Article and Find Full Text PDF

Medication adherence is critical for the recovery of adolescents and young adults (AYAs) who have undergone hematopoietic cell transplantation. However, maintaining adherence is challenging for AYAs after hospital discharge, who experience both individual (e.g.

View Article and Find Full Text PDF

Artificial General Intelligence and Its Threat to Public Health.

J Eval Clin Pract

September 2025

Academic Unit of Population and Lifespan Sciences, School of Medicine, Nottingham City Hospital Campus, University of Nottingham, Clinical Sciences Building, Nottingham, UK.

Background: Artificial intelligence (AI) is increasingly applied across healthcare and public health, with evidence of benefits including enhanced diagnostics, predictive modelling, operational efficiency, medical education, and disease surveillance.However, potential harms - such as algorithmic bias, unsafe recommendations, misinformation, privacy risks, and sycophantic reinforcement - pose challenges to safe implementation.Far less attention has been directed to the public health threats posed by artificial general intelligence (AGI), a hypothetical form of AI with human-level or greater cognitive capacities.

View Article and Find Full Text PDF