GeneWorker: An end-to-end robotic reinforcement learning approach with collaborative generator and worker networks.

Hao Wang , Hengyu Man , Wenxue Cui , Riyu Lu , Chenxin Cai , Xiaopeng Fan

Neural Netw

Faculty of Computing, Harbin Institute of Technology, Harbin, 150001, Heilongjiang, China; Pengcheng Laboratory, Shenzhen, 518055, Guangdong, China. Electronic address:

Published: October 2024

Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

Reinforcement learning aided by the skill conception exhibits potent capabilities in guiding autonomous agents toward acquiring meaningful behaviors. However, in the current landscape of reinforcement learning, a skill is often merely a rudimentary abstraction of a sequence of primitive actions, serving as a component of the input to policy networks with fixed network parameters. This rigid methodology presents obstacles when attempting to integrate with burgeoning techniques such as meta-learning and large language models. To address this issue, we introduce a unique neural skill representation that abstracts the activation of neurons in each neural layer. Based on this, a novel end-to-end robotic reinforcement learning algorithm is proposed, in which two sub-networks, i.e., generator and worker networks, implement collaborative inferences via neural skills. Specifically, the generator produces a series of multi-spatial neural skills, providing efficient guidance for subsequent decision-making; by integrating these skills, the worker can determine its own network weights and biases to cope with various environmental conditions. Therefore, actions can be sampled with flexibly changeable network parameters through the collaboration between generator and worker networks. The experiments demonstrate that GeneWorker can achieve a mean success rate of over 90.67% on continuous robotic tasks and outperforms previous state-of-the-art methods by a minimum of 54% on the pick-and-place task.

Download full-text PDF	Source
http://dx.doi.org/10.1016/j.neunet.2024.106472	DOI Listing

Publication Analysis

Top Keywords

reinforcement learning

generator worker

worker networks

end-to-end robotic

robotic reinforcement

network parameters

neural skills

geneworker end-to-end

reinforcement

learning

Similar Publications

A Silent Invader: Asymptomatic Rhodococcus Infection Unmasked in A Patient with Ectopic ACTH-Dependent Cushing's Syndrome.

Eur J Case Rep Intern Med

August 2025

Charleston Area Medical Center, Charleston, USA.

Shahzeb Saeed , George Fawzy , Ayesha Shah , Molly John

Introduction: species, particularly , are rare opportunistic pathogens that typically affect immunocompromised individuals. These infections usually present with respiratory or systemic symptoms and are often linked to environmental exposure. Asymptomatic infections are exceedingly rare and pose unique diagnostic and therapeutic challenges.

View Article and Find Full Text PDF

Similar Publications

Box of Lessons: An Open Educational Resource for Exploring Biomolecular Structure and Function.

J Coll Sci Teach

March 2025

RCSB Protein Data Bank, Institute for Quantitative Biomedicine, Rutgers University, Piscataway, New Jersey, United States.

Alexandra S Pettit , Keith A Johnson , Brian Gadd , Shuchismita Dutta

Structure-function relationships are a core concept in many STEM disciplines. Most biology curricula introduce students to macromolecules, their building blocks, and other small molecules that play key roles in biological processes. However, the shapes, interactions, and functions of these molecules are often discussed using schematic diagrams, ignoring the vast amounts of three-dimensional structural and bioinformatics data freely available from public data resources.

View Article and Find Full Text PDF

Similar Publications

Enhancing Knowledge Retention by Simulation-Based Learning Among First-Year Medical Students.

Cureus

August 2025

Physiology, SGT University, Gurugram, IND.

Nimarpreet Kaur , Bhupendra Yadav , Deepti Dwivedi , Harminder Kaur , Pragyashaa Chaudhary

Introduction Simulation-based training has been a vital part of medical education since Competency-Based Medical Education (CBME) was introduced, and new guidelines since 2023 have expanded to include simulation as a mandatory methodology of teaching. This method enables learners to build and develop both technical and non-technical abilities in a safe and controlled setting, enhancing their preparedness for real-life medical scenarios. Simulation-based training improves skill acquisition and retention and enhances learners' confidence, reduces anxiety, reinforces learning, corrects errors, and promotes reflective practice, in contrast with the traditional method of teaching.

View Article and Find Full Text PDF

Similar Publications

High spatiotemporal-resolution abdominal 4D-MRI through respiratory-synchronized frame collaborative reconstruction.

Med Phys

September 2025

Department of Health Technology and Informatics, The Hong Kong Polytechnic University, Hong Kong SAR, China.

Yinghui Wang , Lu Wang , Yidan Feng , Zhi Chen , Jing Qin

Background: Four-dimensional magnetic resonance imaging (4D-MRI) holds great promise for precise abdominal radiotherapy guidance. However, current 4D-MRI methods are limited by an inherent trade-off between spatial and temporal resolutions, resulting in compromised image quality characterized by low spatial resolution and significant motion artifacts, hindering clinical implementation. Despite recent advancements, existing methods inadequately exploit redundant frame information and struggle to restore structural details from highly undersampled acquisitions.

View Article and Find Full Text PDF

Similar Publications

Perceived reward certainty in the assessment of delay discounting.

J Exp Anal Behav

September 2025

Fralin Biomedical Research Institute at VTC, Roanoke, VA, United States of America.

Haylee Downey , Alicia Alvarez , Wenyan Ji , Alicia Lozano , Alexandra Hanlon

Reward delays are often associated with reduced probability of reward, although standard assessments of delay discounting do not specify degree of reward certainty. Thus, the extent to which estimates of delay discounting are influenced by uncontrolled variance in perceived reward certainty remains unclear. Here we examine 370 participants who were randomly assigned to complete a delay discounting task when reward certainty was either unspecified (n=184) or specified as 100% (n = 186) in the task trials and task instructions.

View Article and Find Full Text PDF

Similar Publications