A Deep-Learning Model with Task-Specific Bounding Box Regressors and Conditional Back-Propagation for Moving Object Detection in ADAS Applications.

Guan-Ting Lin , Vinay Malligere Shivanna , Jiun-In Guo

Sensors (Basel)

Department of Electronics Engineering and Institute of Electronics, National Chiao Tung University, Hsinchu 30010, Taiwan.

Published: September 2020

Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

This paper proposes a deep-learning model with task-specific bounding box regressors (TSBBRs) and conditional back-propagation mechanisms for detection of objects in motion for advanced driver assistance system (ADAS) applications. The proposed model separates the object detection networks for objects of different sizes and applies the proposed algorithm to achieve better detection results for both larger and tinier objects. For larger objects, a neural network with a larger visual receptive field is used to acquire information from larger areas. For the detection of tinier objects, the network of a smaller receptive field utilizes fine grain features. A conditional back-propagation mechanism yields different types of TSBBRs to perform data-driven learning for the set criterion and learn the representation of different object sizes without degrading each other. The design of dual-path object bounding box regressors can simultaneously detect objects in various kinds of dissimilar scales and aspect ratios. Only a single inference of neural network is needed for each frame to support the detection of multiple types of object, such as bicycles, motorbikes, cars, buses, trucks, and pedestrians, and to locate their exact positions. The proposed model was developed and implemented on different NVIDIA devices such as 1080 Ti, DRIVE-PX2 and Jetson TX-2 with the respective processing performance of 67 frames per second (fps), 19.4 fps, and 8.9 fps for the video input of 448 × 448 resolution, respectively. The proposed model can detect objects as small as 13 × 13 pixels and achieves 86.54% accuracy on a publicly available Pascal Visual Object Class (VOC) car database and 82.4% mean average precision (mAP) on a large collection of common road real scenes database (iVS database).

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7571112	PMC
http://dx.doi.org/10.3390/s20185269	DOI Listing

Publication Analysis

Top Keywords

bounding box

box regressors

conditional back-propagation

proposed model

deep-learning model

model task-specific

task-specific bounding

object detection

adas applications

tinier objects

Similar Publications

Improving Generalized Visual Grounding with Instance-aware Joint Learning.

IEEE Trans Pattern Anal Mach Intell

September 2025

Ming Dai , Wenxuan Cheng , Jiang-Jiang Liu , Lingfeng Yang , Zhenhua Feng

Generalized visual grounding tasks, including Generalized Referring Expression Comprehension (GREC) and Segmentation (GRES), extend the classical visual grounding paradigm by accommodating multi-target and non-target scenarios. Specifically, GREC focuses on accurately identifying all referential objects at the coarse bounding box level, while GRES aims for achieve fine-grained pixel-level perception. However, existing approaches typically treat these tasks independently, overlooking the benefits of jointly training GREC and GRES to ensure consistent multi-granularity predictions and streamline the overall process.

View Article and Find Full Text PDF

Similar Publications

Leveraging Multi-Text Joint Prompts in SAM for Robust Medical Image Segmentation.

IEEE J Biomed Health Inform

September 2025

Xu Zhang , Huangxuan Zhao , Lefei Zhang , Yuan Xiong

The Segment Anything Model (SAM) has attracted considerable attention due to its impressive performance and demonstrates potential in medical image segmentation. Compared to SAM's native point and bounding box prompts, text prompts offer a simpler and more efficient alternative in the medical field, yet this approach remains relatively underexplored. In this paper, we propose a SAM-based framework that integrates a pre-trained vision-language model to generate referring prompts, with SAM handling the segmentation task.

View Article and Find Full Text PDF

Similar Publications

Approximate incidence geometry in the plane.

Res Math Sci

September 2025

Department of Mathematics and Statistics, University of Jyväskylä, P.O. Box 35 (MaD), 40014 Jyväskylä, Finland.

Tuomas Orponen

These are lecture notes for a mini-course given in Banff in June 2024. They discuss the problem of bounding the number of under various hypotheses on and . The main focus will be on hypotheses relevant for the .

View Article and Find Full Text PDF

Similar Publications

DVF-YOLO-Seg: A two-stage breast mass segmentation model with enhanced feature extraction and small lesion detection.

Digit Health

September 2025

Department of Respiratory and Critical Care Medicine, The Sixth Affiliated Hospital of Xinjiang Medical University, Urumqi, China.

Halidanmu Abudukelimu , Yuxin Gao , Abudukelimu Abulizi , Mayilamu Musideke , Shuqin Wu

Objective: Accurate segmentation of breast lesions, especially small ones, remains challenging in digital mammography due to complex anatomical structures and low-contrast boundaries. This study proposes DVF-YOLO-Seg, a two-stage segmentation framework designed to improve feature extraction and enhance small-lesion detection performance in mammographic images.

Methods: The proposed method integrates an enhanced YOLOv10-based detection module with a segmentation stage based on the Visual Reference Prompt Segment Anything Model (VRP-SAM).

View Article and Find Full Text PDF

Similar Publications

DDX55 safeguards naïve T cell homeostasis by suppressing activation-promoting transposable elements.

Sci Immunol

September 2025

Laboratory of Epigenetics and Immunology, West China Institute of Women and Children's Health, NHC Key Laboratory of Chronobiology, State Key Laboratory of Biotherapy, West China Second University Hospital, Sichuan University, Chengdu, China.

Mengyue Wu , Kepan Linghu , Qimin Yin , Ping He , Xinyang Yu

Naïve T cells are maintained in a homeostatic state to preserve a stable T cell pool with diverse T cell receptor (TCR) repertoires, ensuring preparedness for priming. However, the underlying mechanisms controlling naïve T cell homeostasis and priming remain unclear. Leveraging a machine learning-based functional genetic screen, we identified () as the top factor responsible for naïve T cell homeostasis.

View Article and Find Full Text PDF

Similar Publications