A Lightweight Multi-Stage Visual Detection Approach for Complex Traffic Scenes.

Sensors (Basel)

School of Electronic Information and Electrical Engineering, Yangtze University, Jingzhou 434023, China.

Published: August 2025


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

In complex traffic environments, image degradation due to adverse factors such as haze, low illumination, and occlusion significantly compromises the performance of object detection systems in recognizing vehicles and pedestrians. To address these challenges, this paper proposes a robust visual detection framework that integrates multi-stage image enhancement with a lightweight detection architecture. Specifically, an image preprocessing module incorporating ConvIR and CIDNet is designed to perform defogging and illumination enhancement, thereby substantially improving the perceptual quality of degraded inputs. Furthermore, a novel enhancement strategy based on the Horizontal/Vertical-Intensity color space is introduced to decouple brightness and chromaticity modeling, effectively enhancing structural details and visual consistency in low-light regions. In the detection phase, a lightweight state-space modeling network, Mamba-Driven Lightweight Detection Network with RT-DETR Decoding, is proposed for object detection in complex traffic scenes. This architecture integrates VSSBlock and XSSBlock modules to enhance detection performance, particularly for multi-scale and occluded targets. Additionally, a VisionClueMerge module is incorporated to strengthen the perception of edge structures by effectively fusing multi-scale spatial features. Experimental evaluations on traffic surveillance datasets demonstrate that the proposed method surpasses the mainstream YOLOv12s model in terms of mAP@50-90, achieving a performance gain of approximately 1.0 percentage point (from 0.759 to 0.769). While ensuring competitive detection accuracy, the model exhibits reduced parameter complexity and computational overhead, thereby demonstrating superior deployment adaptability and robustness. This framework offers a practical and effective solution for object detection in intelligent transportation systems operating under visually challenging conditions.

Download full-text PDF

Source
http://dx.doi.org/10.3390/s25165014DOI Listing

Publication Analysis

Top Keywords

complex traffic
12
object detection
12
detection
10
visual detection
8
traffic scenes
8
lightweight detection
8
lightweight
4
lightweight multi-stage
4
multi-stage visual
4
detection approach
4

Similar Publications

International research projects, such as Horizon 2020 (H2020) and ERASMUS+, generate numerous scientific and educational outcomes. However, these are often disseminated in fragmented formats, limiting long-term access and impact. Language barriers further complicate the dissemination in professional communities that do not speak English.

View Article and Find Full Text PDF

Unrelated pathogens, including viruses and bacteria, use a common short linear motif (SLiM) to interact with cellular kinases of the RSK (p90 S6 ribosomal kinase) family. Such a "DDVF" (D/E-D/E-V-F) SLiM occurs in the leader (L) protein encoded by picornaviruses of the genus , including Theiler's murine encephalomyelitis virus (TMEV), Boone cardiovirus (BCV), and Encephalomyocarditis virus (EMCV). The L-RSK complex is targeted to the nuclear pore, where RSK triggers FG-nucleoporins hyperphosphorylation, thereby causing nucleocytoplasmic trafficking disruption.

View Article and Find Full Text PDF

Introduction And Clinical Importance: To present a case of traumatic third cranial nerve palsy and discuss the management challenges associated with this condition.

Case Presentation: A 27-year-old male patient was referred to our hospital following a road traffic accident that resulted in multiple injuries, including traumatic brain injury, orbital injury. The patient presented with left complete upper lid ptosis, a fixed dilated pupil, and restricted extraocular muscle movements in the left eye except abduction with large exotropia >90 PD and hypotropia 25 PD diagnosed as left oculomotor nerve palsy.

View Article and Find Full Text PDF

Introduction: To address the challenges of current 4D trajectory prediction-specifically, limited multi-factor feature extraction and excessive computational cost-this study develops a lightweight prediction framework tailored for real-time air-traffic management.

Methods: We propose a hybrid RCBAM-TCN-LSTM architecture enhanced with a teacher-student knowledge distillation mechanism. The Residual Convolutional Block Attention Module (RCBAM) serves as the teacher network to extract high-dimensional spatial features via residual structures and channel-spatial attention.

View Article and Find Full Text PDF

Potato virus Y (PVY) is one of the most economically detrimental phytoviruses affecting global Solanaceae, possessing challenges in agrochemical control. The structural elucidation of PVY coat protein (CP) offers opportunities for the rational design of CP-targeted antivirals; however, the feasibility of identifying lead compounds via virtual screening remains largely unexplored. Herein, we report the successful case of structure-based virtual screening leveraging PVY CP, enabling the identification of a structurally novel lead with a unique mechanism of action.

View Article and Find Full Text PDF