Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

In many robotic applications, creating a map is crucial, and 3D maps provide a method for estimating the positions of other objects or obstacles. Most of the previous research processes 3D point clouds through projection-based or voxel-based models, but both approaches have certain limitations. This paper proposes a hybrid localization and mapping method using stereo vision and LiDAR. Unlike the traditional single-sensor systems, we construct a pose optimization model by matching ground information between LiDAR maps and visual images. We use stereo vision to extract ground information and fuse it with LiDAR tensor voting data to establish coplanarity constraints. Pose optimization is achieved through a graph-based optimization algorithm and a local window optimization method. The proposed method is evaluated using the KITTI dataset and compared against the ORB-SLAM3, F-LOAM, LOAM, and LeGO-LOAM methods. Additionally, we generate 3D point cloud maps for the corresponding sequences and high-definition point cloud maps of the streets in sequence 00. The experimental results demonstrate significant improvements in trajectory accuracy and robustness, enabling the construction of clear, dense 3D maps.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11548508PMC
http://dx.doi.org/10.3390/s24216828DOI Listing

Publication Analysis

Top Keywords

stereo vision
8
pose optimization
8
point cloud
8
cloud maps
8
maps
5
stereo lidar
4
lidar loosely
4
loosely coupled
4
coupled slam
4
slam constrained
4

Similar Publications

Perception plays a crucial role in autonomous driving and computer vision, particularly in interpreting traffic scenes from monocular cameras. In this article, we present a comprehensive collection of traffic scene datasets organized into four distinct groups: (1) Traffic Scene Datasets, (2) Top-View Datasets - both introduced in the authors' earlier research, (3) MultiHeightView Datasets and (4) Depth Datasets. The Traffic Scene Datasets include RealStreet, which captures authentic traffic scenarios, and SynthStreet, its synthetic counterpart.

View Article and Find Full Text PDF

Current finger-vein or palm-vein recognition systems usually require direct contact of the subject with the apparatus. This can be problematic in environments where hygiene is of primary importance. In this work we present a contactless vascular biometrics sensor platform named which can be used for hand vascular biometrics studies (wrist, palm, and finger-vein) and surface features such as palmprint.

View Article and Find Full Text PDF

Traditional X-Ray Computed Tomography (CT) generally requires thousands of projections for reconstruction, resulting in long acquisition times that limit its applicability in scenarios demanding rapid acquisition of 3D information. In this work, computer stereo vision technology was applied to X-ray imaging, and a single-shot 3D stereo imaging method based on dual polycapillary parallel X-ray lenses (PPXRLs) was proposed using a laboratory X-ray source. Two PPXRLs were used to generate two quasi-parallel beams, which were then Bragg reflected by two SiC crystals.

View Article and Find Full Text PDF

X-ray computed tomography (XCT) is a powerful tool for volumetric imaging, where three-dimensional (3D) images are generated from a large number of individual X-ray projection images. However, collecting the required number of low-noise projection images is time-consuming, limiting its applicability to scenarios requiring high temporal resolution, such as the study of dynamic processes. Inspired by stereo vision, we previously developed stereo X-ray imaging methods that operate with only two X-ray projections, enabling the 3D reconstruction of point and line fiducial markers at significantly faster temporal resolutions.

View Article and Find Full Text PDF

SSIFNet: Spatial-temporal stereo information fusion network for self-supervised surgical video inpainting.

Comput Med Imaging Graph

August 2025

Institute of Medical Robotics, School of Biomedical Engineering, Shanghai Jiao Tong University, Shanghai 200240, China. Electronic address:

During minimally invasive robot-assisted surgical procedures, surgeons rely on stereo endoscopes to provide image guidance. Nevertheless, the field-of-view is typically restricted owing to the limited size of the endoscope and constrained workspace. Such a visualization challenge becomes even more severe when surgical instruments are inserted into the already restricted field-of-view, where important anatomical landmarks and relevant clinical contents may become occluded by the inserted instruments.

View Article and Find Full Text PDF