This paper develops a novel Proportional-Integral-Derivative (PID) tuning method for multi-agent systems, with a reinforced self-learning capability for achieving the optimal consensus of all agents. Unlike traditional model-based and data-driven PID tuning methods, the developed PID self-learning method updates the controller parameters by actively interacting with an unknown environment, which guarantees consensus and optimizes the performance of the agents. First, the PID control-based consensus problem of multi-agent systems is formulated. Then, finding the PID gains is converted into solving a nonzero-sum game problem, and an off-policy Q-learning algorithm with a critic-only structure is proposed to update the PID gains using only data, without knowledge of the agents' dynamics. Finally, simulations are given to verify the effectiveness of the proposed method.
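To make the PID control-based consensus formulation concrete, here is a minimal sketch under strong assumptions that are not taken from the paper: single-integrator agents, an undirected ring communication graph, and hand-picked fixed gains. It does not implement the off-policy Q-learning tuning described in the abstract. It only shows the kind of consensus loop whose gains such a method would adjust. The function name `simulate_pid_consensus` and all numerical values are hypothetical.

```python
def simulate_pid_consensus(adjacency, x0, kp, ki, kd, dt=0.05, steps=800):
    """Simulate PID-based consensus for single-integrator agents.

    Assumed model (illustrative, not from the paper):
        x_i(k+1) = x_i(k) + dt * u_i(k)
        e_i      = sum_j a_ij * (x_j - x_i)   # local neighborhood error
        u_i      = kp*e_i + ki*integral(e_i) + kd*d(e_i)/dt
    """
    n = len(x0)
    x = list(x0)

    def local_errors(state):
        # Each agent only uses the states of its graph neighbors.
        return [
            sum(adjacency[i][j] * (state[j] - state[i]) for j in range(n))
            for i in range(n)
        ]

    prev_e = local_errors(x)   # makes the derivative term zero at the first step
    integral = [0.0] * n

    for _ in range(steps):
        e = local_errors(x)
        for i in range(n):
            integral[i] += e[i] * dt
        u = [
            kp * e[i] + ki * integral[i] + kd * (e[i] - prev_e[i]) / dt
            for i in range(n)
        ]
        x = [x[i] + dt * u[i] for i in range(n)]
        prev_e = e
    return x


# Hypothetical example: four agents on an undirected ring graph.
ring = [
    [0, 1, 0, 1],
    [1, 0, 1, 0],
    [0, 1, 0, 1],
    [1, 0, 1, 0],
]
initial_states = [1.0, -2.0, 4.0, 0.5]

# Hand-picked gains (assumption); a learning method would tune these from data.
final_states = simulate_pid_consensus(ring, initial_states, kp=1.0, ki=0.5, kd=0.05)
print(final_states)
```

Because the assumed graph is undirected, the local errors sum to zero across agents, so the average of the states is preserved and the agents settle near the mean of their initial values. With a directed graph or different agent dynamics the consensus value and the stable range of gains would differ.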
DOI: http://dx.doi.org/10.1016/j.isatra.2022.06.026
Background: AI agents built on large language models (LLMs) can plan tasks, use external tools, and coordinate with other agents. Unlike standard LLMs, agents can execute multi-step processes, access real-time clinical information, and integrate multiple data sources. There has been interest in using such agents for clinical and administrative tasks; however, little is known about their performance or about whether multi-agent systems function better than a single agent for healthcare tasks.
PLoS One
September 2025
Hubei Key Laboratory of Broadband Wireless Communication and Sensor Networks, School of Information Engineering, Wuhan University of Technology, Wuhan, Hubei, China.
The H-beam riveting and welding work cell is an automated unit used for processing H-beams. By coordinating the gripping and welding robots, the work cell achieves processes such as riveting and welding stiffener plates, transforming the H-beam into a stiffened H-beam. In the context of intelligent manufacturing, there is still significant potential for improving the productivity of riveting and welding tasks in existing H-beam riveting and welding work cells.
Front Robot AI
August 2025
Information Technologies Institute, The Centre for Research and Technology Hellas, Thessaloniki, Greece.
Agentic AI refers to autonomous systems that can perceive their environment, make decisions, and take actions to achieve goals with minimal or no human intervention. Recent advances in Large Language Models (LLMs) have opened new pathways to imbue robots with such "agentic" behaviors by leveraging the LLMs' vast knowledge and reasoning capabilities for planning and control. This survey provides the first comprehensive exploration of LLM-based robotic systems integration into agentic behaviors that have been validated in real-world applications.
Psychon Bull Rev
September 2025
Department of Psychology, Ariel University, Ariel, Israel.
How do people know when they are right? Confidence judgments - the ability to assess the correctness of one's own decisions - are a key aspect of human metacognition. This self-evaluative act plays a central role in learning, memory, consciousness, and group decision-making. In this paper, I reframe metacognition as a structured exchange of information between stimulus, decision-maker (the actor), and confidence judge (the rater), akin to a multi-agent communication system.
medRxiv
August 2025
The Windreich Department of Artificial Intelligence and Human Health, Mount Sinai Medical Center, NY, USA.
We tested state-of-the-art large language models (LLMs) in two configurations for clinical-scale workloads: a single agent handling heterogeneous tasks versus an orchestrated multi-agent system assigning each task to a dedicated worker. Across retrieval, extraction, and dosing calculations, we varied batch sizes from 5 to 80 to simulate clinical traffic. Multi-agent runs maintained high accuracy under load (pooled accuracy 90.