98%
921
2 minutes
20
Recent years have witnessed the rapid development of general human action understanding. However, when applied to real-world applications such as sports analysis, most existing datasets are still unsatisfactory, because of the limitations in rich labels on multiple tasks, language instructions, high-quality 3D data, and diverse environments. In this paper, we present FLAG3D++, a large-scale benchmark for 3D fitness activity comprehension, which contains 180 K sequences of 60 activity categories with language instruction. FLAG3D++ features the following four aspects: 1) fine-grained annotations of the temporal intervals of actions in the untrimmed long sequences and how well these actions are performed, 2) detailed and professional language instruction to describe how to perform a specific activity, 3) accurate and dense 3D human pose captured from advanced MoCap system to handle the complex activity and large movement, 4) versatile video resources from a high-tech MoCap system, rendering software, and cost-effective smartphones in natural environments. In light of the specified features, we present two new practical applications as language-guided repetition action counting (L-RAC) and language-guided action quality assessment (L-AQA), which aim to take the language descriptions as references to count the repetitive times of an action and assess the quality of action respectively. Furthermore, we propose a Hierarchical Language-Guided Graph Convolutional Network (HL-GCN) model to better fuse the language information and skeleton sequences for L-RAC and L-AQA. To be specific, the HL-GCN performs cross-modal alignments by the early fusion of the linguistic feature and the hierarchical node features of the skeleton-based sequences encoded by the multiple intermediate graph convolutional layers. Extensive experiments show the superiority of our HL-GCN on both L-RAC and L-AQA, as well as the great research value of FLAG3D++ for various challenges, such as dynamic human mesh recovery and cross-domain human action recognition. Our dataset, source code, and trained models are made publicly available at FLAG3D++.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1109/TPAMI.2025.3590012 | DOI Listing |
MedEdPublish (2016)
May 2025
Newcastle University Faculty of Medical Sciences, Newcastle upon Tyne, England, UK.
Background: Whilst debriefing literature offers valuable tools for healthcare education, there remains a gap in resources specifically designed for debriefing communication skills. Effective communication is fundamental to patient care, particularly during sensitive interactions. This article provides a specialised toolkit for educators to enhance communication skills debriefing, developed through synthesis of existing literature and the authors' extensive experience teaching communication skills through simulation.
View Article and Find Full Text PDFCommun Biol
September 2025
Department of Physiology Anatomy and Genetics, University of Oxford, Oxford, UK.
Primate lateral intraparietal area (LIP) has been directly linked to perceptual categorization and decision-making. However, the intrinsic LIP circuitry that gives rise to the flexible generation of motor responses to sensory instruction remains unclear. Using retrograde tracers, we delineate two distinct operational compartments based on different intrinsic connectivity patterns of dorsal and ventral LIP.
View Article and Find Full Text PDFJ Am Soc Cytopathol
August 2025
Department of Pathology, Ruffolo, Hooper & Associates, University of South Florida, Tampa, Florida.
In recent years, social media (SoMe) has revolutionized medical education within the field of pathology; however, its performance in cytopathology has not been explored in detail. This systematic review aims to analyze SoMe trends, hashtag metrics, and online resources within cytopathology over the period of 7 years. A systematic review of 4 databases (PubMed, Medline, Embase, and Scopus) was conducted between January 1st, 2017, and December 22nd, 2022, in order to identify relevant English-language articles about SoMe and cytopathology.
View Article and Find Full Text PDFJ Voice
September 2025
Department of Speech-Language-Hearing Sciences, Medical School, Federal University of Minas Gerais - UFMG, Belo Horizonte, Minas Gerais, Brazil. Electronic address:
Objective: To analyze the association between the risk of voice disorders and sociodemographic, work, and general health factors in urban and rural school teachers.
Methods: This is an observational, cross-sectional, analytical study with 1705 teachers from urban schools and 202 from rural schools teaching elementary and high school in Minas Gerais, Brazil. The exclusion criteria were being retired or no longer teaching and/or not accepting to participate in the study.
JCO Glob Oncol
May 2025
Department of Obstetrics and Gynaecology, Stanford University School of Medicine, Stanford, CA.
Purpose: Expanding high-risk human papillomavirus (HPV) vaccine coverage in resource-constrained settings is critical to bridging the cervical cancer gap and achieving the global action plan for elimination. Mobile health (mHealth) technology via short message services (SMS) has the potential to improve HPV vaccination uptake. The mHealth-HPVac study evaluated the effectiveness of mHealth interventions in increasing HPV vaccine uptake among mothers of unvaccinated girls aged 9-14 years in Lagos, Nigeria.
View Article and Find Full Text PDF