Pixel level deep reinforcement learning for accurate and robust medical image segmentation.

Yunxin Liu , Di Yuan , Zhenghua Xu , Yuefu Zhan , Hongwei Zhang , Jun Lu , Thomas Lukasiewicz

Sci Rep

Institute of Logic and Computation, Vienna University of Technology, Vienna, Austria.

Published: March 2025

Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

Existing deep learning methods have achieved significant success in medical image segmentation. However, this success largely relies on stacking advanced modules and architectures, which has created a path dependency. This path dependency is unsustainable, as it leads to increasingly larger model parameters and higher deployment costs. To break this path dependency, we introduce deep reinforcement learning to enhance segmentation performance. However, current deep reinforcement learning methods face challenges such as high training cost, independent iterative processes, and high uncertainty of segmentation masks. Consequently, we propose a Pixel-level Deep Reinforcement Learning model with pixel-by-pixel Mask Generation (PixelDRL-MG) for more accurate and robust medical image segmentation. PixelDRL-MG adopts a dynamic iterative update policy, directly segmenting the regions of interest without requiring user interaction or coarse segmentation masks. We propose a Pixel-level Asynchronous Advantage Actor-Critic (PA3C) strategy to treat each pixel as an agent whose state (foreground or background) is iteratively updated through direct actions. Our experiments on two commonly used medical image segmentation datasets demonstrate that PixelDRL-MG achieves more superior segmentation performances than the state-of-the-art segmentation baselines (especially in boundaries) using significantly fewer model parameters. We also conducted detailed ablation studies to enhance understanding and facilitate practical application. Additionally, PixelDRL-MG performs well in low-resource settings (i.e., 50-shot or 100-shot), making it an ideal choice for real-world scenarios.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11894052	PMC
http://dx.doi.org/10.1038/s41598-025-92117-2	DOI Listing

Publication Analysis

Top Keywords

deep reinforcement

reinforcement learning

medical image

image segmentation

path dependency

segmentation

accurate robust

robust medical

learning methods

model parameters

Similar Publications

Smart load balancing in cloud computing: Integrating feature selection with advanced deep learning models.

PLoS One

September 2025

College of Business Administration, Northern Border University (NBU), Arar, Kingdom of Saudi Arabia.

Yousef Sanjalawe , Salam Fraihat , Salam Al-E'mari , Mosleh Abualhaj , Sharif Makhadmeh

The increasing dependence on cloud computing as a cornerstone of modern technological infrastructures has introduced significant challenges in resource management. Traditional load-balancing techniques often prove inadequate in addressing cloud environments' dynamic and complex nature, resulting in suboptimal resource utilization and heightened operational costs. This paper presents a novel smart load-balancing strategy incorporating advanced techniques to mitigate these limitations.

View Article and Find Full Text PDF

Similar Publications

Deep reinforcement learning control unlocks enhanced heat transfer in turbulent convection.

Proc Natl Acad Sci U S A

September 2025

Max Planck Institute for Solar System Research, Göttingen 37077, Germany.

Zisong Zhou , Xiaojue Zhu

Turbulent convection governs heat transport in both natural and industrial settings, yet optimizing it under extreme conditions remains a significant challenge. Traditional control strategies, such as predefined temperature modulation, struggle to achieve substantial enhancement. Here, we introduce a deep reinforcement learning (DRL) framework that autonomously discovers optimal control policies to maximize heat transfer in turbulent Rayleigh-Bénard convection.

View Article and Find Full Text PDF

Similar Publications

Comparative analysis of cervical cancer classification of DPAGCHE-enhanced Pap smear images using convolutional neural network models.

PLoS One

September 2025

Department of Pathology, Hospital Tuanku Fauziah, Jalan Tun Abdul Razak, Kangar, Perlis, Malaysia.

Khalis Khiruddin , Wan Azani Mustafa , Md Ashequl Islam , Khairur Rijal Jamaludin , Hiam Alquran

Cervical cancer remains a significant cause of female mortality worldwide, primarily due to abnormal cell growth in the cervix. This study proposes an automated classification method to enhance detection accuracy and efficiency, addressing contrast and noise issues in traditional diagnostic approaches. The impact of image enhancement on classification performance is evaluated by comparing transfer learning-based Convolutional Neural Network (CNN) models trained on both original and enhanced images.

View Article and Find Full Text PDF

Similar Publications

Multimodal deep learning methods for speech and language rehabilitation: a cross-sectional observational study.

Disabil Rehabil Assist Technol

September 2025

School of Foreign Languages, Ningbo University of Technology, Ningbo, China.

Xinqiao Cen

The speech and language rehabilitation are essential to people who have disorders of communication that may occur due to the condition of neurological disorder, developmental delays, or bodily disabilities. With the advent of deep learning, we introduce an improved multimodal rehabilitation pipeline that incorporates audio, video, and text information in order to provide patient-tailored therapy that adapts to the patient. The technique uses a cross-attention fusion multimodal hierarchical transformer architectural model that allows it to jointly design speech acoustics as well as the facial dynamics, lip articulation, and linguistic context.

View Article and Find Full Text PDF

Similar Publications

Automated Configuration of Evolutionary Algorithms via Deep Reinforcement Learning for Constrained Multiobjective Optimization.

IEEE Trans Cybern

September 2025

Fei Ming , Wenyin Gong , Bing Xue , Mengjie Zhang , Yaochu Jin

Learning to optimize and automated algorithm design are attracting increasing attention, but it is still in its infancy in constrained multiobjective optimization evolutionary algorithms (CMOEAs). Current learning-assisted CMOEAs are typically crafted by human experts using manually designed techniques, which tend to be overly tuned, ad hoc, and lacking versatility. To alleviate these limitations, this work proposes transforming the online configuration of CMOEA into determinations of discrete and continuous parameters, which are then solved by deep reinforcement learning (DRL) techniques.

View Article and Find Full Text PDF

Similar Publications