A siamese network-based approach for vehicle pose estimation.

Front Bioeng Biotechnol

Hubei Key Laboratory of Hydroelectric Machinery Design & Maintenance, China Three Gorges University, Yichang, China.

Published: September 2022


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

We propose a deep learning-based vehicle pose estimation method based on a monocular camera called FPN PoseEstimateNet. The FPN PoseEstimateNet consists of a feature extractor and a pose calculate network. The feature extractor is based on Siamese network and a feature pyramid network (FPN) is adopted to deal with feature scales. Through the feature extractor, a correlation matrix between the input images is obtained for feature matching. With the time interval as the label, the feature extractor can be trained independently of the pose calculate network. On the basis of the correlation matrix and the standard matrix, the vehicle pose changes can be predicted by the pose calculate network. Results show that the network runs at a speed of 6 FPS, and the parameter size is 101.6 M. In different sequences, the angle error is within 8.26° and the maximum translation error is within 31.55 m.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9478513PMC
http://dx.doi.org/10.3389/fbioe.2022.948726DOI Listing

Publication Analysis

Top Keywords

feature extractor
16
vehicle pose
12
pose calculate
12
calculate network
12
pose estimation
8
fpn poseestimatenet
8
network feature
8
correlation matrix
8
feature
7
pose
6

Similar Publications

The coffee roasting process is a critical factor in determining the final quality of the beverage, influencing its flavour, aroma, and acidity. Traditionally, roast-level classification has relied on manual inspection, which is time-consuming, subjective, and prone to inconsistencies. However, advancements in machine learning (ML) and computer vision, particularly convolutional neural networks (CNNs), have shown great promise in automating and improving the accuracy of this process.

View Article and Find Full Text PDF

Inter-modality feature prediction through multimodal fusion for 3D shape defect detection.

Neural Netw

September 2025

School of Automation and Intelligent Sensing, Shanghai Jiao Tong University, Shanghai, 200240, China; Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University, Shanghai, 200240, China; Institute of Medical Robotics, Shanghai Jiao Tong University, Shanghai, 200240, China.

3D shape defect detection plays an important role in autonomous industrial inspection. However, accurate detection of anomalies remains challenging due to the complexity of multimodal sensor data, especially when both color and structural information are required. In this work, we propose a lightweight inter-modality feature prediction framework that effectively utilizes multimodal fused features from the inputs of RGB, depth and point clouds for efficient 3D shape defect detection.

View Article and Find Full Text PDF

Parkinson's disease (PD) is a challenging neurodegenerative condition often prone to diagnostic errors, where early and accurate diagnosis is critical for effective clinical management. However, existing diagnostic methods often fail to fully exploit multimodal data or systematically incorporate expert domain knowledge. To address these limitations, we propose MKD-Net, a multimodal and knowledge-driven diagnostic framework that integrates imaging and non-imaging clinical data with structured expert insights to enhance diagnostic performance.

View Article and Find Full Text PDF

The Greek island of Corfu (Kérkyra) is considered the type locality of two species described in 1834 by Rossmässler, namely and . In this work, Corfu populations of these species were investigated by an integrative approach including analysis of morphological features of shell and distal genitalia as well as molecular features of selected mitochondrial and nuclear gene fragments to establish the relationships between Corfu and as well as between Corfu and Italian . Shell features did not differentiate the pairs analysed, i.

View Article and Find Full Text PDF

Accurate early prediction of epileptic seizures is crucial for improving patients' quality of life. However, existing seizure prediction methods often rely on large-scale labeled datasets and face challenges in generalization and real-time performance. To address these issues, this study proposes an efficient seizure prediction framework that achieves high performance even with limited labeled data, significantly reducing dependence on extensive annotations.

View Article and Find Full Text PDF