Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Radiotherapy is a primary method for cancer treatment, wherein radiation doses are divided into multiple sessions or fractions to effectively target tumors and minimize damage to surrounding tissues. In this study, we leverage reinforcement learning (RL) to enhance treatment planning with the aim of improving the adaptability and robustness of RL agents given the inherent inaccuracies in tumor growth models. A 2D simulation model of tumor growth is employed, where tabular RL techniques are used to determine the optimal treatment strategies. We emphasize the significance of tissue damage predictions and incorporate the Lyman NTCP model to assess treatment outcomes, analyzing complications across three simulated body sites: the rectum, head and neck and lung. For all the tumor sites, the RL approach significantly reduces healthy tissue damage by 10.7%, 49.1% and 37.5%, respectively, for rectal, head and neck and lung cancers compared with the baseline treatment. The RL-based approach in radiotherapy not only achieves tumor eradication but also significantly reduces healthy tissue damage compared with traditional treatment methods. This study demonstrates the potential of reinforcement learning to optimize treatment planning in radiotherapy, offering a promising path towards more personalized and effective cancer treatments.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC12190542PMC
http://dx.doi.org/10.3390/biomedicines13061367DOI Listing

Publication Analysis

Top Keywords

reinforcement learning
12
tumor growth
12
tissue damage
12
treatment planning
8
head neck
8
neck lung
8
reduces healthy
8
healthy tissue
8
treatment
7
tumor
5

Similar Publications

Background: Primary liver cancer, particularly hepatocellular carcinoma (HCC), poses significant clinical challenges due to late-stage diagnosis, tumor heterogeneity, and rapidly evolving therapeutic strategies. While systematic reviews and meta-analyses are essential for updating clinical guidelines, their labor-intensive nature limits timely evidence synthesis.

Objective: This study proposes an automated literature screening workflow powered by large language models (LLMs) to accelerate evidence synthesis for HCC treatment guidelines.

View Article and Find Full Text PDF

Distinct prelimbic cortex ensembles encode response execution and inhibition.

Proc Natl Acad Sci U S A

September 2025

Behavioral Neuroscience Research Branch, Intramural Research Program, National Institute on Drug Abuse, Baltimore, MD 21224.

Learning when to initiate or withhold actions is essential for survival, requiring the integration of past experiences with new information to adapt to changing environments. The prelimbic cortex (PL) plays a central role in this process, with a stable PL neuronal population (ensemble) recruited during operant reward learning to encode response execution. However, it is unknown how this established reward-learning ensemble adapts to changing reward contingencies, such as reward omission during extinction.

View Article and Find Full Text PDF

Cervical cancer remains a significant cause of female mortality worldwide, primarily due to abnormal cell growth in the cervix. This study proposes an automated classification method to enhance detection accuracy and efficiency, addressing contrast and noise issues in traditional diagnostic approaches. The impact of image enhancement on classification performance is evaluated by comparing transfer learning-based Convolutional Neural Network (CNN) models trained on both original and enhanced images.

View Article and Find Full Text PDF

Previous studies suggested that acute stress can impair flexible goal-directed action control in favor of habitual action control. In addition, there is evidence that acute stress differentially affects the processing of rewards and punishments. Therefore, we aimed at investigating whether acute stress affects the balance between goal-directed and habitual behavior not only for behavior aiming at reward but also for behavior motivated by avoiding punishments.

View Article and Find Full Text PDF

Artificial intelligence (AI) based anticancer drug recommendation systems have emerged as powerful tools for precision dosing. Although existing methods have advanced in terms of predictive accuracy, they encounter three significant obstacles, including the "black-box" problem resulting in unexplainable reasoning, the computational difficulty for graphbased structures, and the combinatorial explosion during multistep reasoning. To tackle these issues, we introduce a novel Macro-Micro agent Drug sensitivity inference (MarMirDrug).

View Article and Find Full Text PDF