Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

This paper develops a novel Proportional-Integral-Derivative (PID) tuning method for multi-agent systems with a reinforced self-learning capability for achieving the optimal consensus of all agents. Unlike the traditional model-based and data-driven PID tuning methods, the developed PID self-learning method updates the controller parameters by actively interacting with unknown environment, with the outcomes of guaranteed consensus and performance optimization of agents. Firstly, the PID control-based consensus problem of multi-agent systems is formulated. Then, finding the PID gains is converted into solving a nonzero-sum game problem, thus an off-policy Q-learning algorithm with the critic-only structure is proposed to update the PID gains using only data, without the knowledge of dynamics of agents. Finally, simulations are given to verify the effectiveness of the proposed method.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.isatra.2022.06.026DOI Listing

Publication Analysis

Top Keywords

multi-agent systems
12
pid tuning
8
pid gains
8
pid
6
reinforcement learning
4
learning based
4
based proportional-integral-derivative
4
proportional-integral-derivative controllers
4
controllers design
4
consensus
4

Similar Publications

Background: AI agents built on large language models (LLMs) can plan tasks, use external tools, and coordinate with other agents. Unlike standard LLMs, agents can execute multi-step processes, access real-time clinical information, and integrate multiple data sources. There has been interest in using such agents for clinical and administrative tasks, however, there is limited knowledge on their performance and whether multi-agent systems function better than a single agent for healthcare tasks.

View Article and Find Full Text PDF

A robot scheduling method based on rMAPPO for H-beam riveting and welding work cell.

PLoS One

September 2025

Hubei Key Laboratory of Broadband Wireless Communication and Sensor Networks, School of Information Engineering, Wuhan University of Technology, Wuhan, Hubei, China.

The H-beam riveting and welding work cell is an automated unit used for processing H-beams. By coordinating the gripping and welding robots, the work cell achieves processes such as riveting and welding stiffener plates, transforming the H-beam into a stiffened H-beam. In the context of intelligent manufacturing, there is still significant potential for improving the productivity of riveting and welding tasks in existing H-beam riveting and welding work cells.

View Article and Find Full Text PDF

Agentic AI refers to autonomous systems that can perceive their environment, make decisions, and take actions to achieve goals with minimal or no human intervention. Recent advances in Large Language Models (LLMs) have opened new pathways to imbue robots with such "agentic" behaviors by leveraging the LLMs' vast knowledge and reasoning capabilities for planning and control. This survey provides the first comprehensive exploration of LLM-based robotic systems integration into agentic behaviors that have been validated in real-world applications.

View Article and Find Full Text PDF

Bits of confidence: Metacognition as uncertainty reduction.

Psychon Bull Rev

September 2025

Department of Psychology, Ariel University, Ariel, Israel.

How do people know when they are right? Confidence judgments - the ability to assess the correctness of one's own decisions - are a key aspect of human metacognition. This self-evaluative act plays a central role in learning, memory, consciousness, and group decision-making. In this paper, I reframe metacognition as a structured exchange of information between stimulus, decision-maker (the actor), and confidence judge (the rater), akin to a multi-agent communication system.

View Article and Find Full Text PDF

We tested state-of-the-art large language models (LLMs) in two configurations for clinical-scale workloads: a single agent handling heterogeneous tasks versus an orchestrated multi-agent system assigning each task to a dedicated worker. Across retrieval, extraction, and dosing calculations, we varied batch sizes from 5 to 80 to simulate clinical traffic. Multi-agent runs maintained high accuracy under load (pooled accuracy 90.

View Article and Find Full Text PDF