This study presents a novel privacy-preserving deep learning framework for accurately classifying fine-grained hygiene and water-usage events in restroom environments. Leveraging a comprehensive, curated dataset comprising approximately 460 min of stereo audio recordings from five acoustically diverse bathrooms, our method robustly identifies 11 distinct events, including nuanced variations in faucet counts and flow rates, toilet flushing, and handwashing activities. Stereo audio inputs were transformed into triple-channel Mel spectrograms using an adaptive one-dimensional convolutional neural network (1D-CNN), dynamically synthesizing spatial cues to enhance discriminative power. Extensive experimentation identified the RegNetY-008 architecture as the most effective backbone, further improved by employing a semi-supervised learning strategy via pseudo-labeling and targeted data augmentation techniques such as XY masking and horizontal CutMix. The proposed ensemble model, combining RegNetY-008 networks with complementary third-channel generation strategies, achieved outstanding generalization performance, yielding an accuracy of 97.8% and macro-averaged F1-score of 0.966 across acoustically distinct test environments. Our publicly available dataset addresses critical gaps in existing resources, promoting future research in intelligent, privacy-conscious restroom monitoring.
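Below is a minimal sketch of the kind of front end the abstract describes, assuming a PyTorch/torchaudio/timm stack. The channel-mixing 1D-CNN, the Mel-spectrogram parameters, and the use of the `regnety_008` model from timm are illustrative assumptions, not the paper's exact configuration.

```python
# Minimal sketch (assumptions): stereo waveform -> a small 1D-CNN synthesizes a third
# channel from the two stereo channels, log-Mel spectrograms are computed per channel,
# and the resulting 3-channel "image" is fed to a RegNetY-008 backbone from timm.
# Sample rate, n_mels, and kernel sizes are illustrative, not the paper's values.
import torch
import torch.nn as nn
import torchaudio
import timm


class StereoMelRegNet(nn.Module):
    def __init__(self, sample_rate=16000, n_mels=128, n_classes=11):
        super().__init__()
        self.melspec = torchaudio.transforms.MelSpectrogram(
            sample_rate=sample_rate, n_fft=1024, hop_length=256, n_mels=n_mels
        )
        self.to_db = torchaudio.transforms.AmplitudeToDB()
        # 1D-CNN across the two stereo channels: synthesizes a third channel
        # intended to capture inter-channel (spatial) cues.
        self.channel_mixer = nn.Sequential(
            nn.Conv1d(2, 8, kernel_size=9, padding=4),
            nn.ReLU(),
            nn.Conv1d(8, 1, kernel_size=9, padding=4),
        )
        self.backbone = timm.create_model(
            "regnety_008", pretrained=False, in_chans=3, num_classes=n_classes
        )

    def forward(self, wav):                       # wav: (batch, 2, samples)
        third = self.channel_mixer(wav)           # (batch, 1, samples)
        three = torch.cat([wav, third], dim=1)    # (batch, 3, samples)
        spec = self.to_db(self.melspec(three))    # (batch, 3, n_mels, frames)
        return self.backbone(spec)


if __name__ == "__main__":
    model = StereoMelRegNet()
    logits = model(torch.randn(4, 2, 16000 * 5))  # 4 five-second stereo clips
    print(logits.shape)                           # torch.Size([4, 11])
```

Mixing the stereo channels in the waveform domain is only one plausible reading of "dynamically synthesizing spatial cues"; the published model may instead derive the third channel in the spectrogram domain, and the ensemble described in the abstract combines networks with complementary third-channel strategies.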
Download full-text PDF | Source
---|---
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC12402066 | PMC
http://dx.doi.org/10.1038/s41598-025-18154-z | DOI Listing
Sci Rep
September 2025
Sanko School, Gaziantep, Turkey.
Sci Rep
August 2025
School of Electrical and Electronic Engineering, Universidad del Valle, Cali, Colombia.
IEEE Trans Pattern Anal Mach Intell
August 2025
This paper introduces Stereo-Talker, a novel one-shot audio-driven human video synthesis system that generates 3D talking videos with precise lip synchronization, expressive body gestures, temporally consistent photo-realistic quality, and continuous viewpoint control. The process follows a two-stage approach. In the first stage, the system maps audio input to high-fidelity motion sequences, encompassing upper-body gestures and facial expressions.
J Vis Exp
May 2025
Department of Life Sciences, School of Agriculture, Meiji University.
In Arabidopsis seeds, the endosperm, a single layer of living cells located between the embryo and the testa, plays a critical role in regulating seed maturation, dormancy, and germination. Microscopic analysis of intact endosperm cells is essential for understanding the physiological functions of the endosperm at cellular and molecular levels. However, sample preparation has been challenging due to the small size of Arabidopsis seeds and the location of the endosperm cell layer beneath the testa.
Sci Rep
May 2025
School of Electrical and Electronic Engineering, Universidad del Valle, Cali, Colombia.
This work presents an embedded solution for detecting and classifying head-level objects using stereo vision to assist blind individuals. A custom dataset was created, featuring five classes of head-level objects, selected based on a survey of visually impaired users. Object detection and classification were achieved using deep neural networks such as YOLOv5.
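As a rough illustration of the kind of detector mentioned above (not the authors' embedded implementation), the snippet below loads a stock COCO-pretrained YOLOv5 model through torch.hub and runs inference on a single image; the paper's custom head-level classes and stereo-depth processing are not reproduced, and the image path is a placeholder.

```python
# Minimal sketch: off-the-shelf YOLOv5 inference via torch.hub (Ultralytics).
# Uses stock COCO-pretrained 'yolov5s' weights rather than the paper's custom
# head-level-object classes or its stereo post-processing.
import torch

# Downloads the ultralytics/yolov5 repo and pretrained weights on first use.
model = torch.hub.load("ultralytics/yolov5", "yolov5s", pretrained=True)

results = model("frame.jpg")            # placeholder path for one camera frame

results.print()                          # class names, confidences, box counts
detections = results.pandas().xyxy[0]    # bounding boxes as a pandas DataFrame
print(detections[["name", "confidence", "xmin", "ymin", "xmax", "ymax"]])
```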