Article Synopsis

  • The paper examines how to design better control systems for agile mobile robots, specifically in the context of autonomous drone racing.
  • Researchers found that a neural network controller using reinforcement learning (RL) outperformed traditional optimal control (OC) methods due to RL's ability to optimize more relevant objectives.
  • The study highlights the limitations of OC's planning-control decomposition, which constrains behavior, while RL adapts better to uncertainties, achieving impressive drone performance with rapid training.

Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

A central question in robotics is how to design a control system for an agile mobile robot. This paper studies this question systematically, focusing on a challenging setting: autonomous drone racing. We show that a neural network controller trained with reinforcement learning (RL) outperformed optimal control (OC) methods in this setting. We then investigated which fundamental factors have contributed to the success of RL or have limited OC. Our study indicates that the fundamental advantage of RL over OC is not that it optimizes its objective better but that it optimizes a better objective. OC decomposes the problem into planning and control with an explicit intermediate representation, such as a trajectory, that serves as an interface. This decomposition limits the range of behaviors that can be expressed by the controller, leading to inferior control performance when facing unmodeled effects. In contrast, RL can directly optimize a task-level objective and can leverage domain randomization to cope with model uncertainty, allowing the discovery of more robust control responses. Our findings allowed us to push an agile drone to its maximum performance, achieving a peak acceleration greater than 12 times the gravitational acceleration and a peak velocity of 108 kilometers per hour. Our policy achieved superhuman control within minutes of training on a standard workstation. This work presents a milestone in agile robotics and sheds light on the role of RL and OC in robot control.

Download full-text PDF

Source
http://dx.doi.org/10.1126/scirobotics.adg1462DOI Listing

Publication Analysis

Top Keywords

control
8
optimal control
8
reinforcement learning
8
reaching limit
4
limit autonomous
4
autonomous racing
4
racing optimal
4
control versus
4
versus reinforcement
4
learning central
4

Similar Publications

Background: The aim of this study was to establish a rat model of premature ovarian failure (POF) with cyclophosphamide (CTX), and explore the molecular basis of POF and the mechanism of Guishen-Erxian Decoction (GSEXD) to improve POF from the perspective of oxidative stress regulation of ovarian granulosa cell (OGC) DNA fragmentation.

Method: The study utilized SD rats to establish a POF model via CTX. Rats were divided into Control, POF group, three GSEXD dosage groups (low, medium, high), and a GSEXD+PI3K agonist group to assess GSEXD's therapeutic effects on oxidative stress, DNA fragmentation and ovarian damage.

View Article and Find Full Text PDF

Beyond Fixed-Size Skyrmions in Nanodots: Switchable Multistability with Ferromagnetic Rings.

Nano Lett

September 2025

Depto. Polimeros y Materiales Avanzados: Fisica, Quimica y Tecnologia, Universidad del País Vasco, UPV/EHU, 20018 San Sebastian, Spain.

We demonstrate a novel approach to controlling and stabilizing magnetic skyrmions in ultrathin multilayer nanostructures through spatially engineered magnetostatic fields generated by ferromagnetic nanorings. Using analytical modeling and micromagnetic simulations, we show that the stray fields from a Co/Pd ferromagnetic ring with out-of-plane magnetic anisotropy significantly enhance the Néel-type skyrmion stability in an Ir/Co/Pt nanodot, even stabilizing the skyrmion in the absence of Dzyaloshinskii-Moriya interactions. We demonstrate precise control over the skyrmion size and stability.

View Article and Find Full Text PDF

Crop growth rate is a critical physiological trait for forage and bioenergy crops like sorghum [Sorghum bicolor (L.) Moench], influencing overall crop productivity, particularly in photoperiod-sensitive (PS) types. Crop growth rate studies focus on either a physiological approach utilizing a few genotypes to analyze biomass accumulation or a genetic approach characterizing easily scorable proxy traits in larger populations.

View Article and Find Full Text PDF

Tuning the Electronic Structure in the MoS/SrTiO Heterojunction via Phase Evolution of the SrTiO Substrate.

ACS Nano

September 2025

Department of Chemical Physics, Key Laboratory of Precision and Intelligent Chemistry, University of Science and Technology of China, Hefei, Anhui 230026, China.

The coupling between transition metal dichalcogenides (TMDCs) and SrTiO has recently emerged as a fertile platform for discovering interfacial phenomena, where particle interactions, lattice coupling, and dielectric screening give rise to interesting physical effects. These hybrid systems hold significant promise for two-dimensional (2D) electronics, ferroelectric state control, and metastable phase engineering. However, effective modulation of the interfacial electronic structure remains a critical challenge.

View Article and Find Full Text PDF

Background: Atherosclerosis is a chronic inflammatory disease characterized by the accumulation of lipid-laden foam cells and plaques within the arterial wall. Dysfunctional vascular smooth muscle cells (VSMCs), fibroblasts, endothelial cells, and macrophages contribute to disease progression. Here, we report that macrophage-specific expression of epsins, highly conserved endocytic adaptor proteins involved in clathrin-mediated endocytosis, accelerates atherosclerosis in Western diet-fed mice.

View Article and Find Full Text PDF