Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

This paper proposes a deep-learning model with task-specific bounding box regressors (TSBBRs) and conditional back-propagation mechanisms for detection of objects in motion for advanced driver assistance system (ADAS) applications. The proposed model separates the object detection networks for objects of different sizes and applies the proposed algorithm to achieve better detection results for both larger and tinier objects. For larger objects, a neural network with a larger visual receptive field is used to acquire information from larger areas. For the detection of tinier objects, the network of a smaller receptive field utilizes fine grain features. A conditional back-propagation mechanism yields different types of TSBBRs to perform data-driven learning for the set criterion and learn the representation of different object sizes without degrading each other. The design of dual-path object bounding box regressors can simultaneously detect objects in various kinds of dissimilar scales and aspect ratios. Only a single inference of neural network is needed for each frame to support the detection of multiple types of object, such as bicycles, motorbikes, cars, buses, trucks, and pedestrians, and to locate their exact positions. The proposed model was developed and implemented on different NVIDIA devices such as 1080 Ti, DRIVE-PX2 and Jetson TX-2 with the respective processing performance of 67 frames per second (fps), 19.4 fps, and 8.9 fps for the video input of 448 × 448 resolution, respectively. The proposed model can detect objects as small as 13 × 13 pixels and achieves 86.54% accuracy on a publicly available Pascal Visual Object Class (VOC) car database and 82.4% mean average precision (mAP) on a large collection of common road real scenes database (iVS database).

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7571112PMC
http://dx.doi.org/10.3390/s20185269DOI Listing

Publication Analysis

Top Keywords

bounding box
12
box regressors
12
conditional back-propagation
12
proposed model
12
deep-learning model
8
model task-specific
8
task-specific bounding
8
object detection
8
adas applications
8
tinier objects
8

Similar Publications

Generalized visual grounding tasks, including Generalized Referring Expression Comprehension (GREC) and Segmentation (GRES), extend the classical visual grounding paradigm by accommodating multi-target and non-target scenarios. Specifically, GREC focuses on accurately identifying all referential objects at the coarse bounding box level, while GRES aims for achieve fine-grained pixel-level perception. However, existing approaches typically treat these tasks independently, overlooking the benefits of jointly training GREC and GRES to ensure consistent multi-granularity predictions and streamline the overall process.

View Article and Find Full Text PDF

The Segment Anything Model (SAM) has attracted considerable attention due to its impressive performance and demonstrates potential in medical image segmentation. Compared to SAM's native point and bounding box prompts, text prompts offer a simpler and more efficient alternative in the medical field, yet this approach remains relatively underexplored. In this paper, we propose a SAM-based framework that integrates a pre-trained vision-language model to generate referring prompts, with SAM handling the segmentation task.

View Article and Find Full Text PDF

Approximate incidence geometry in the plane.

Res Math Sci

September 2025

Department of Mathematics and Statistics, University of Jyväskylä, P.O. Box 35 (MaD), 40014 Jyväskylä, Finland.

These are lecture notes for a mini-course given in Banff in June 2024. They discuss the problem of bounding the number of under various hypotheses on and . The main focus will be on hypotheses relevant for the .

View Article and Find Full Text PDF

Objective: Accurate segmentation of breast lesions, especially small ones, remains challenging in digital mammography due to complex anatomical structures and low-contrast boundaries. This study proposes DVF-YOLO-Seg, a two-stage segmentation framework designed to improve feature extraction and enhance small-lesion detection performance in mammographic images.

Methods: The proposed method integrates an enhanced YOLOv10-based detection module with a segmentation stage based on the Visual Reference Prompt Segment Anything Model (VRP-SAM).

View Article and Find Full Text PDF

DDX55 safeguards naïve T cell homeostasis by suppressing activation-promoting transposable elements.

Sci Immunol

September 2025

Laboratory of Epigenetics and Immunology, West China Institute of Women and Children's Health, NHC Key Laboratory of Chronobiology, State Key Laboratory of Biotherapy, West China Second University Hospital, Sichuan University, Chengdu, China.

Naïve T cells are maintained in a homeostatic state to preserve a stable T cell pool with diverse T cell receptor (TCR) repertoires, ensuring preparedness for priming. However, the underlying mechanisms controlling naïve T cell homeostasis and priming remain unclear. Leveraging a machine learning-based functional genetic screen, we identified () as the top factor responsible for naïve T cell homeostasis.

View Article and Find Full Text PDF