98%
921
2 minutes
20
This paper proposes a deep-learning model with task-specific bounding box regressors (TSBBRs) and conditional back-propagation mechanisms for detection of objects in motion for advanced driver assistance system (ADAS) applications. The proposed model separates the object detection networks for objects of different sizes and applies the proposed algorithm to achieve better detection results for both larger and tinier objects. For larger objects, a neural network with a larger visual receptive field is used to acquire information from larger areas. For the detection of tinier objects, the network of a smaller receptive field utilizes fine grain features. A conditional back-propagation mechanism yields different types of TSBBRs to perform data-driven learning for the set criterion and learn the representation of different object sizes without degrading each other. The design of dual-path object bounding box regressors can simultaneously detect objects in various kinds of dissimilar scales and aspect ratios. Only a single inference of neural network is needed for each frame to support the detection of multiple types of object, such as bicycles, motorbikes, cars, buses, trucks, and pedestrians, and to locate their exact positions. The proposed model was developed and implemented on different NVIDIA devices such as 1080 Ti, DRIVE-PX2 and Jetson TX-2 with the respective processing performance of 67 frames per second (fps), 19.4 fps, and 8.9 fps for the video input of 448 × 448 resolution, respectively. The proposed model can detect objects as small as 13 × 13 pixels and achieves 86.54% accuracy on a publicly available Pascal Visual Object Class (VOC) car database and 82.4% mean average precision (mAP) on a large collection of common road real scenes database (iVS database).
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7571112 | PMC |
http://dx.doi.org/10.3390/s20185269 | DOI Listing |
IEEE Trans Pattern Anal Mach Intell
September 2025
Generalized visual grounding tasks, including Generalized Referring Expression Comprehension (GREC) and Segmentation (GRES), extend the classical visual grounding paradigm by accommodating multi-target and non-target scenarios. Specifically, GREC focuses on accurately identifying all referential objects at the coarse bounding box level, while GRES aims for achieve fine-grained pixel-level perception. However, existing approaches typically treat these tasks independently, overlooking the benefits of jointly training GREC and GRES to ensure consistent multi-granularity predictions and streamline the overall process.
View Article and Find Full Text PDFIEEE J Biomed Health Inform
September 2025
The Segment Anything Model (SAM) has attracted considerable attention due to its impressive performance and demonstrates potential in medical image segmentation. Compared to SAM's native point and bounding box prompts, text prompts offer a simpler and more efficient alternative in the medical field, yet this approach remains relatively underexplored. In this paper, we propose a SAM-based framework that integrates a pre-trained vision-language model to generate referring prompts, with SAM handling the segmentation task.
View Article and Find Full Text PDFRes Math Sci
September 2025
Department of Mathematics and Statistics, University of Jyväskylä, P.O. Box 35 (MaD), 40014 Jyväskylä, Finland.
These are lecture notes for a mini-course given in Banff in June 2024. They discuss the problem of bounding the number of under various hypotheses on and . The main focus will be on hypotheses relevant for the .
View Article and Find Full Text PDFDigit Health
September 2025
Department of Respiratory and Critical Care Medicine, The Sixth Affiliated Hospital of Xinjiang Medical University, Urumqi, China.
Objective: Accurate segmentation of breast lesions, especially small ones, remains challenging in digital mammography due to complex anatomical structures and low-contrast boundaries. This study proposes DVF-YOLO-Seg, a two-stage segmentation framework designed to improve feature extraction and enhance small-lesion detection performance in mammographic images.
Methods: The proposed method integrates an enhanced YOLOv10-based detection module with a segmentation stage based on the Visual Reference Prompt Segment Anything Model (VRP-SAM).
Sci Immunol
September 2025
Laboratory of Epigenetics and Immunology, West China Institute of Women and Children's Health, NHC Key Laboratory of Chronobiology, State Key Laboratory of Biotherapy, West China Second University Hospital, Sichuan University, Chengdu, China.
Naïve T cells are maintained in a homeostatic state to preserve a stable T cell pool with diverse T cell receptor (TCR) repertoires, ensuring preparedness for priming. However, the underlying mechanisms controlling naïve T cell homeostasis and priming remain unclear. Leveraging a machine learning-based functional genetic screen, we identified () as the top factor responsible for naïve T cell homeostasis.
View Article and Find Full Text PDF