HyperComm: Hypergraph-based communication in multi-agent reinforcement learning.

Neural Netw

School of Mathematics, Southeast University, Nanjing 210096, China; Purple Mountain Laboratories, Nanjing 211111, China; Ahlia University, Manama 10878, Bahrain.

Published: October 2024


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

In the realm of fully cooperative multi-agent reinforcement learning (MARL), effective communication can induce implicit cooperation among agents and improve overall performance. In current communication strategies, agents are allowed to exchange local observations or latent embeddings, which can augment individual local policy inputs and mitigate uncertainty in local decision-making processes. Unfortunately, in previous communication schemes, agents may potentially receive irrelevant information, which increases training difficulty and leads to poor performance in complex settings. Furthermore, most existing works lack the consideration of the impact of small coalitions formed by agents in the multi-agent system. To address these challenges, we propose HyperComm, a novel framework that uses the hypergraph to model the multi-agent system, improving the accuracy and specificity of communication among agents. Our approach brings the concept of hypergraph for the first time in multi-agent communication for MARL. Within this framework, each agent can communicate more effectively with other agents within the same hyperedge, leading to better cooperation in environments with multiple agents. Compared to those state-of-the-art communication-based approaches, HyperComm demonstrates remarkable performance in scenarios involving a large number of agents.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.neunet.2024.106432DOI Listing

Publication Analysis

Top Keywords

multi-agent reinforcement
8
reinforcement learning
8
agents
8
multi-agent system
8
communication
6
multi-agent
5
hypercomm hypergraph-based
4
hypergraph-based communication
4
communication multi-agent
4
learning realm
4

Similar Publications

Medication adherence is critical for the recovery of adolescents and young adults (AYAs) who have undergone hematopoietic cell transplantation. However, maintaining adherence is challenging for AYAs after hospital discharge, who experience both individual (e.g.

View Article and Find Full Text PDF

Cooperation is a hallmark of social species, enabling individuals to achieve goals that are unattainable alone. Across species, cooperative behaviors are often organized by distinct social roles such as leaders and followers, yet the neural mechanisms supporting such role-based coordination remain elusive. Here we introduce a new paradigm for studying cooperation in mice, where pairs of animals engage in a joint spatial foraging task that naturally gives rise to stable leader-follower roles predictive of learning speed.

View Article and Find Full Text PDF

A robot scheduling method based on rMAPPO for H-beam riveting and welding work cell.

PLoS One

September 2025

Hubei Key Laboratory of Broadband Wireless Communication and Sensor Networks, School of Information Engineering, Wuhan University of Technology, Wuhan, Hubei, China.

The H-beam riveting and welding work cell is an automated unit used for processing H-beams. By coordinating the gripping and welding robots, the work cell achieves processes such as riveting and welding stiffener plates, transforming the H-beam into a stiffened H-beam. In the context of intelligent manufacturing, there is still significant potential for improving the productivity of riveting and welding tasks in existing H-beam riveting and welding work cells.

View Article and Find Full Text PDF

This paper addresses the critical issue of monitoring high-density crowds in public spaces like transportation hubs to prevent accidents from overcrowding. It highlights the limitations of prevailing simulation tools in dealing with real-world challenges such as diverse pedestrian destinations, multi-directional flows, and the medley space designs in communal areas. The paper aims to introduce a data-driven, multi-agent framework that assesses crowd dynamics and early warning conditions in different spatial layouts.

View Article and Find Full Text PDF

The coevolution of signalling is a complex problem within animal behaviour, and is also central to communication between artificial agents. The Sir Philip Sidney game was designed to model this dyadic interaction from an evolutionary biology perspective, and was formulated to demonstrate the emergence of honest signalling. We use Multi-Agent Reinforcement Learning (MARL) to show that in the majority of cases, the resulting behaviour adopted by agents is not that shown in the original derivation of the model.

View Article and Find Full Text PDF