Three-dimensional atrous inception module for crowd behavior classification.

Sci Rep

Department of Artificial Intelligence, Inha University, Incheon, 22212, South Korea.

Published: June 2024



Article Abstract

Recent advances in deep learning have led to a surge in computer vision research, including the recognition and classification of human behavior in video data. However, most studies have focused on recognizing individual behaviors, whereas recognizing crowd behavior remains a complex problem because of the large number of interactions and similar behaviors among individuals or crowds in video surveillance systems. To solve this problem, we propose a three-dimensional atrous inception module (3D-AIM) network, which is a crowd behavior classification model that uses atrous convolution to explore interactions between individuals or crowds. The 3D-AIM network is a 3D convolutional neural network that can use receptive fields of various sizes to effectively identify specific features that determine crowd behavior. To further improve the accuracy of the 3D-AIM network, we introduce a new loss function called the separation loss function. This loss function focuses the 3D-AIM network more on the features that distinguish one type of crowd behavior from another, thereby enabling more precise classification. Finally, we demonstrate that the proposed model outperforms existing human behavior classification models in accurately classifying crowd behaviors. These results suggest that the 3D-AIM network with a separation loss function can be valuable for understanding complex crowd behavior in video surveillance systems.
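The abstract's point that atrous convolution gives "receptive fields of various sizes" can be illustrated with a minimal sketch. It is not the 3D-AIM implementation: it works along a single axis in pure Python, and the kernel size (3), the dilation rates (1, 2, 3), and the function names `effective_kernel_size` and `dilated_conv1d` are all assumptions chosen for illustration. Along one axis, a kernel of size k with dilation rate d spans k + (k - 1)(d - 1) input positions while keeping only k weights.

```python
def effective_kernel_size(kernel_size: int, dilation: int) -> int:
    """Number of input positions spanned by one atrous kernel along one axis."""
    return kernel_size + (kernel_size - 1) * (dilation - 1)


def dilated_conv1d(signal, kernel, dilation=1):
    """Valid-mode 1D atrous convolution (cross-correlation form, no padding)."""
    span = effective_kernel_size(len(kernel), dilation)
    return [
        # Kernel taps are spaced `dilation` positions apart in the input.
        sum(w * signal[start + i * dilation] for i, w in enumerate(kernel))
        for start in range(len(signal) - span + 1)
    ]


# Hypothetical parallel branches: the same 3-tap kernel at different
# dilation rates (illustrative values, not taken from the paper).
branches = {d: effective_kernel_size(3, d) for d in (1, 2, 3)}
print(branches)  # {1: 3, 2: 5, 3: 7}

# With dilation 2, the taps [1, 0, -1] compare samples four positions apart.
signal = list(range(10))
print(dilated_conv1d(signal, [1, 0, -1], dilation=2))  # [-4, -4, -4, -4, -4, -4]
```

An inception-style module would run such branches in parallel and combine their outputs, so one layer sees both short-range and long-range context. In a 3D network the same spacing would apply along the temporal and both spatial axes.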


Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11193784
DOI: http://dx.doi.org/10.1038/s41598-024-65003-6
