98%
921
2 minutes
20
During the learning process, a child develops a mental representation of the task he or she is learning. A Machine Learning algorithm develops also a latent representation of the task it learns. We investigate the development of the knowledge construction of an artificial agent through the analysis of its behavior, i.e., its sequences of moves while learning to perform the Tower of Hanoï (TOH) task. The TOH is a well-known task in experimental contexts to study the problem-solving processes and one of the fundamental processes of children's knowledge construction about their world. We position ourselves in the field of explainable reinforcement learning for developmental robotics, at the crossroads of cognitive modeling and explainable AI. Our main contribution proposes a 3-step methodology named Implicit Knowledge Extraction with eXplainable Artificial Intelligence (IKE-XAI) to extract the implicit knowledge, in form of an automaton, encoded by an artificial agent during its learning. We showcase this technique to solve and explain the TOH task when researchers have only access to moves that represent observational behavior as in human-machine interaction. Therefore, to extract the agent acquired knowledge at different stages of its training, our approach combines: first, a Q-learning agent that learns to perform the TOH task; second, a trained recurrent neural network that encodes an implicit representation of the TOH task; and third, an XAI process using a post-hoc implicit rule extraction algorithm to extract finite state automata. We propose using graph representations as visual and explicit explanations of the behavior of the Q-learning agent. Our experiments show that the IKE-XAI approach helps understanding the development of the Q-learning agent behavior by providing a global explanation of its knowledge evolution during learning. IKE-XAI also allows researchers to identify the agent's Aha! moment by determining from what moment the knowledge representation stabilizes and the agent no longer learns.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1016/j.neunet.2022.08.002 | DOI Listing |
West J Emerg Med
July 2025
University of Ottawa, Department of Emergency Medicine, Ottawa, Ontario, Canada.
Introduction: The Predictors of Workload in the Emergency Room (POWER) study, published in 2009 using data from 2003, examined the workload of emergency physicians using the Canadian Triage and Acuity Scale (CTAS) as a surrogate marker. Many hospitals use a case-mix formula incorporating annual census and POWER's study data to determine staffing levels. However, significant changes in emergency medicine have occurred since its publication, including the implementation of electronic health record systems, increased patient complexity, real-time dictation software, and human health resource challenges due to the COVID-19 pandemic.
View Article and Find Full Text PDFCan J Surg
August 2025
From the University of Ottawa, Department of Surgery, The Ottawa Hospital - General Campus, Ottawa, Ont. (Skulsky); the University Health Network, Department of Surgery, Toronto, Ont. (Al-Sukhni); the University of Toronto Faculty of Medicine, Toronto, Ont. (Al-Sukhni); the Centre hospitalier régio
The incidence of early-onset colorectal cancer (CRC), defined as CRC occurring in individuals younger than 50 years, is increasing globally. Emerging evidence suggests that the incidence and prevalence of CRC in individuals aged 45-49 years approach those in individuals aged 50-59 years. To address this concerning trend, many health care systems and clinical specialist societies are advocating for lowering the age of initiation for CRC screening in individuals at average risk to 45 years.
View Article and Find Full Text PDFWearable Technol
July 2025
Division of Neurology, Department of Medicine, The University of Hong Kong, Hong Kong SAR.
This is a proof-of-concept study to compare the effects of a 2-week program of "Remind-to-move" (RTM) treatment using closed-loop and open-loop wearables for hemiparetic upper extremity in patients with chronic stroke in the community. The RTM open-loop wearable device has been proven in our previous studies to be useful to address the learned nonuse phenomenon of the hemiparetic upper extremity. A closed-loop RTM wearable device, which emits reminding cues according to actual arm use, was developed in this study.
View Article and Find Full Text PDFJ Voice
July 2025
Department of Communication Sciences and Disorders and Department of Linguistics and Cognitive Science, University of Delaware, Newark, DE.
Purpose: Prior research has emphasized the superiority of an external focus of attention for motor learning. According to the Constrained Action Hypothesis, an external focus encourages automaticity by directing attention to movement outcomes, while an internal focus is thought to impair learning by promoting controlled processing directed at biomechanics. However, prior research has largely conflated two attentional constructs: the locus of attention (external vs internal) and the target of attention (movement outcomes vs biomechanics).
View Article and Find Full Text PDF