Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

In multiagent systems, learning optimal behavior policies for individual agents remains a challenging yet crucial task. While recent research has made strides in this area, the issue of when agents should maintain consistent behaviors with one another is still not adequately addressed. This article proposes a novel approach to enable agents to autonomously decide whether their behaviors should align with those of their peers by leveraging intrinsic rewards to optimize their policies. We define behavior consistency as the divergence between the actions taken by two agents given the same observations. To encourage agents to be aware of each other's behaviors, we propose dynamic consistency-based intrinsic reward (DCIR), which guides agents in determining when to synchronize their behaviors. In addition, we introduce a dynamic scaling network (DSN) that provides learnable scaling factors at each time step, enabling agents to dynamically decide the extent of rewarding consistent behavior. Our method is evaluated on environments including Multiagent Particle, Google Research Football, and StarCraft II Micromanagement. Experimental results demonstrate its effectiveness in learning optimal policies.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TNNLS.2025.3598301DOI Listing

Publication Analysis

Top Keywords

behavior consistency
8
multiagent systems
8
intrinsic rewards
8
learning optimal
8
agents
7
align dynamic
4
behavior
4
dynamic behavior
4
consistency multiagent
4
systems intrinsic
4

Similar Publications

Community Connectedness as a Source of Adherence to HIV Prevention Behaviors and Resilience Among Transgender Women of Color in New York City, 2020-2022.

Am J Public Health

October 2025

Alexander Furuya, Asa Radix, Adam Whalen, Jessica Contreras, Jenesis Merriman, Krish J. Bhatt, Roberta Scheinmann, and Dustin T. Duncan are with the Department of Epidemiology, Columbia University Mailman School of Public Health, New York, NY. Yusuf Ransome is with the Department of Social and Behav

To examine how one's community connectedness may act as a source of resilience and promote HIV prevention and care behaviors among transgender women of color. We analyzed survey data from 313 transgender women of color living in New York City collected from August 2020 to November 2022. The Community Connectedness Scale asks participants about their baseline feelings of connection, feelings of inclusion, feelings of belonging, feelings of isolation, and feelings of being unlike in relation to the transgender community.

View Article and Find Full Text PDF

Human parainfluenza virus 2 (HPIV-2) and human parainfluenza virus 4 (HPIV-4) are significant but underappreciated respiratory pathogens, particularly among high-risk populations including children, the elderly, and immunocompromised individuals. In this study, we sequenced 101 HPIV-2 and HPIV-4 genomes from respiratory samples collected in western Washington State and performed comprehensive evolutionary analyses using both new and publicly available sequences. Phylogenetic and phylodynamic analyses revealed that both HPIV-2 and HPIV-4 evolve at significantly faster rates compared to mumps virus, a reference human orthorubulavirus.

View Article and Find Full Text PDF

Investigating audience preferences within the hybrid competitive-comedic format of taskmaster UK.

PLoS One

September 2025

Remiza AI, Poughkeepsie, New York, United States of America.

Background: Hybrid entertainment formats combining competitive and comedic elements present opportunities to investigate factors driving audience engagement. I analyzed Taskmaster UK (2015-2023), a BAFTA-winning comedy panel show where comedians compete in creative tasks judged by a host, to quantify relationships between scoring mechanics, performer characteristics, and viewer ratings.

Methods: I analyzed 154 episodes encompassing 917 tasks performed by 90 contestants, with audience reception measured through 32,607 IMDb votes.

View Article and Find Full Text PDF

Background: Reinfections with SARS-CoV-2 have gained increasing relevance in the context of emerging immune-evasive variants and waning population immunity. Understanding their frequency and distribution is essential to guide public health strategies, particularly in middle-income countries. This study investigates the epidemiological patterns of SARS-CoV-2 reinfections in Espírito Santo, Brazil, using integrated notification and vaccination databases.

View Article and Find Full Text PDF

Transcriptome analysis of shade-induced growth and photosynthetic responses in soybean cultivars.

PLoS One

September 2025

Institute of Crop Sciences, Chinese Academy of Agricultural Sciences/National Key Facility for Crop Gene Resources and Genetic Improvement (NFCRl). Ministry of Agriculture and Rural Affairs/Key Laboratory of Crop Gene Resource and Germplasm Enhancement, Ministry of Agriculture and Rural Affairs, Bei

Shade stress alters soybean growth through transcriptomic changes and adaptive responses that optimize light capture and utilization, regulated by a phytohormonal network. This study examined the physiological, morphological, and molecular responses of Guru (shade-tolerant) and Heinong 53 (shade-sensitive) soybean cultivars under 0% (control), 30%, and 70% shade. Results revealed morphological responses where Heinong 53 exhibited greater plant height (52.

View Article and Find Full Text PDF