Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

With the rapid development of smart furniture, gesture recognition has gained increasing attention as a natural and intuitive interaction method. However, in practical applications, issues such as limited data resources and insufficient semantic understanding have significantly constrained the effectiveness of gesture recognition technology. To address these challenges, this study proposes HAVIT, a hybrid deep learning model based on Vision Transformer and ALBEF, aimed at enhancing the performance of gesture recognition systems under data-scarce conditions. The model achieves efficient feature extraction and accurate recognition of gesture characteristics through the organic integration of Vision Transformer's feature extraction capabilities and ALBEF's semantic understanding mechanism. Experimental results demonstrate that on a fully labeled dataset, the HAVIT model achieved a classification accuracy of 91.83% and an AUC value of 0.92; under 20% label deficiency conditions, the model maintained an accuracy of 86.89% and an AUC value of 0.88, exhibiting strong robustness. The research findings provide new solutions for the development of smart furniture interaction technology and hold significant implications for advancing practical applications in this field.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC12304170PMC
http://dx.doi.org/10.1038/s41598-025-10758-9DOI Listing

Publication Analysis

Top Keywords

smart furniture
12
gesture recognition
12
development smart
8
practical applications
8
semantic understanding
8
conditions model
8
feature extraction
8
gesture
5
havit vision-language
4
vision-language gesture
4

Similar Publications

Intelligent Bionic Design of Robust Superamphiphobic Fire Sensing EP/F-POS@FeO Coating on Wood Substrate.

ACS Appl Mater Interfaces

September 2025

Yunnan Provincial Key Laboratory of Wood Adhesives and Glued Products, Southwest Forestry University, Kunming 650224, People's Republic of China.

Wood is a widely used carbon-storing material, but its applications are constrained by vulnerabilities to water, oil and fire. Existing coatings have limited functionalities, failing to meet the intelligent requirements of modern wood products and constructions. Inspired by bionics, a robust superamphiphobic fire sensing EP/F-POS@FeO coating was designed on wood substrate, fabricated from functional ferroferric oxide (FeO) particles, tetraethyl orthosilicate (TEOS, hydrolyzed into polysiloxane), 1H,1H,2H,2H-perfluorodecyltrimethoxysilane (PFDTMS), and epoxy resin (EP) adhesive.

View Article and Find Full Text PDF

Background: Physical inactivity is widespread at universities. To promote physical activity among students, it is important to understand their needs. Behavioral and cultural insights (BCIs) help to identify barriers to physical activity and to develop appropriate interventions.

View Article and Find Full Text PDF

To enhance user interaction experience in furniture customization, this study optimizes an Internet of Things (IoT)-driven Artificial Intelligence (AI)-assisted design system. First, the study analyzes human-computer interaction theories in IoT environments. Second, a personalized furniture design model based on a Generative Adversarial Network (GAN) is constructed.

View Article and Find Full Text PDF

Wireless communication systems can enhance their capabilities by exploring new opportunities and addressing emerging challenges through the integration of the Internet of Things (IoT) in 6G networks. Visible Light Communication (VLC) stands out as a promising wireless access technology for IoT devices. This paper presents a novel Teaching-Learning-Based Optimization (TLBO) optimized Intelligent Reflecting Surface (IRS)-assisted VLC system aimed at maximizing Signal-to-Noise Ratio (SNR) and enhancing illuminance uniformity.

View Article and Find Full Text PDF

With the rapid development of smart furniture, gesture recognition has gained increasing attention as a natural and intuitive interaction method. However, in practical applications, issues such as limited data resources and insufficient semantic understanding have significantly constrained the effectiveness of gesture recognition technology. To address these challenges, this study proposes HAVIT, a hybrid deep learning model based on Vision Transformer and ALBEF, aimed at enhancing the performance of gesture recognition systems under data-scarce conditions.

View Article and Find Full Text PDF