Reinforcement learning based proportional-integral-derivative controllers design for consensus of multi-agent systems.

Jinna Li, Jiaqi Wang

ISA Trans

School of Information and Control Engineering, Liaoning Petrochemical University, Fushun, 113001, PR China.

Published: January 2023

This paper develops a novel proportional-integral-derivative (PID) tuning method for multi-agent systems, with a reinforced self-learning capability for achieving optimal consensus among all agents. Unlike traditional model-based and data-driven PID tuning methods, the developed PID self-learning method updates the controller parameters by actively interacting with an unknown environment, guaranteeing consensus and optimizing the performance of the agents. First, the PID control-based consensus problem of multi-agent systems is formulated. Then, finding the PID gains is converted into solving a nonzero-sum game problem, and an off-policy Q-learning algorithm with a critic-only structure is proposed to update the PID gains using only data, without knowledge of the agents' dynamics. Finally, simulations are given to verify the effectiveness of the proposed method.
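To make the PID control-based consensus formulation concrete, the following is a minimal sketch and not the paper's algorithm. It assumes single-integrator agents, an undirected path graph of four agents, a forward-Euler discretization, and hand-picked fixed gains; the names `simulate_pid_consensus`, `ADJACENCY`, and `X0` are hypothetical. The abstract's off-policy Q-learning step, which would learn the gains from data, is not shown.

```python
import numpy as np


def simulate_pid_consensus(adjacency, x0, kp, ki, kd, dt=0.05, steps=1200):
    """Simulate single-integrator agents under a distributed PID consensus law.

    Each agent i uses only its local neighborhood error
    e_i = sum_j a_ij * (x_j - x_i), which equals -(L x)_i for the graph
    Laplacian L. Agent model and gains are illustrative assumptions.
    """
    A = np.asarray(adjacency, dtype=float)
    L = np.diag(A.sum(axis=1)) - A           # graph Laplacian
    x = np.asarray(x0, dtype=float).copy()
    integral = np.zeros_like(x)              # running integral of the error
    prev_error = -L @ x                      # makes the first derivative zero
    history = [x.copy()]
    for _ in range(steps):
        error = -L @ x                       # local neighborhood errors
        integral += error * dt               # I term (rectangular rule)
        derivative = (error - prev_error) / dt   # D term (backward difference)
        u = kp * error + ki * integral + kd * derivative
        x = x + dt * u                       # forward-Euler step of x' = u
        prev_error = error
        history.append(x.copy())
    return np.array(history)


# Hypothetical example: undirected path graph 0 - 1 - 2 - 3.
ADJACENCY = [
    [0, 1, 0, 0],
    [1, 0, 1, 0],
    [0, 1, 0, 1],
    [0, 0, 1, 0],
]
X0 = [4.0, -1.0, 2.5, 0.5]

trajectory = simulate_pid_consensus(ADJACENCY, X0, kp=2.0, ki=0.5, kd=0.05)
spread = trajectory[-1].max() - trajectory[-1].min()
print("final states:", trajectory[-1])
print("final spread:", spread)
```

Because the graph is undirected, the local errors sum to zero, so the agents' average is preserved and the states should settle near the initial average. In the setting the abstract describes, the three gains would not be fixed by hand as they are here but updated from measured data.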

DOI: http://dx.doi.org/10.1016/j.isatra.2022.06.026

Publication Analysis

Top Keywords

multi-agent systems

pid tuning

pid gains

pid

reinforcement learning

learning based

based proportional-integral-derivative

proportional-integral-derivative controllers

controllers design

consensus

Similar Publications

AI Agents in Clinical Medicine: A Systematic Review.

medRxiv

August 2025

Alon Gorenshtein, Mahmud Omar, Benjamin S Glicksberg, Girish N Nadkarni, Eyal Klang

Background: AI agents built on large language models (LLMs) can plan tasks, use external tools, and coordinate with other agents. Unlike standard LLMs, agents can execute multi-step processes, access real-time clinical information, and integrate multiple data sources. There has been interest in using such agents for clinical and administrative tasks; however, little is known about their performance and whether multi-agent systems function better than a single agent for healthcare tasks.

A robot scheduling method based on rMAPPO for H-beam riveting and welding work cell.

PLoS One

September 2025

Hubei Key Laboratory of Broadband Wireless Communication and Sensor Networks, School of Information Engineering, Wuhan University of Technology, Wuhan, Hubei, China.

Jianbin Zheng, Chuyi Zhou, Yang Gao, Ziyao Chen, Yifan Gao

The H-beam riveting and welding work cell is an automated unit used for processing H-beams. By coordinating the gripping and welding robots, the work cell achieves processes such as riveting and welding stiffener plates, transforming the H-beam into a stiffened H-beam. In the context of intelligent manufacturing, there is still significant potential for improving the productivity of riveting and welding tasks in existing H-beam riveting and welding work cells.

Agentic LLM-based robotic systems for real-world applications: a review on their agenticness and ethics.

Front Robot AI

August 2025

Information Technologies Institute, The Centre for Research and Technology Hellas, Thessaloniki, Greece.

Emmanuel K Raptis, Athanasios Ch Kapoutsis, Elias B Kosmatopoulos

Agentic AI refers to autonomous systems that can perceive their environment, make decisions, and take actions to achieve goals with minimal or no human intervention. Recent advances in Large Language Models (LLMs) have opened new pathways to imbue robots with such "agentic" behaviors by leveraging the LLMs' vast knowledge and reasoning capabilities for planning and control. This survey provides the first comprehensive exploration of LLM-based robotic systems integration into agentic behaviors that have been validated in real-world applications.

Bits of confidence: Metacognition as uncertainty reduction.

Psychon Bull Rev

September 2025

Department of Psychology, Ariel University, Ariel, Israel.

Daniel Fitousi

How do people know when they are right? Confidence judgments - the ability to assess the correctness of one's own decisions - are a key aspect of human metacognition. This self-evaluative act plays a central role in learning, memory, consciousness, and group decision-making. In this paper, I reframe metacognition as a structured exchange of information between stimulus, decision-maker (the actor), and confidence judge (the rater), akin to a multi-agent communication system.

Orchestrated multi agents sustain accuracy under clinical-scale workloads compared to a single agent.

medRxiv

August 2025

The Windreich Department of Artificial Intelligence and Human Health, Mount Sinai Medical Center, NY, USA.

Eyal Klang, Mahmud Omar, Ganesh Raut, Reem Agbareia, Prem Timsina

We tested state-of-the-art large language models (LLMs) in two configurations for clinical-scale workloads: a single agent handling heterogeneous tasks versus an orchestrated multi-agent system assigning each task to a dedicated worker. Across retrieval, extraction, and dosing calculations, we varied batch sizes from 5 to 80 to simulate clinical traffic. Multi-agent runs maintained high accuracy under load (pooled accuracy 90.
