98%
921
2 minutes
20
In many robotic applications, creating a map is crucial, and 3D maps provide a method for estimating the positions of other objects or obstacles. Most of the previous research processes 3D point clouds through projection-based or voxel-based models, but both approaches have certain limitations. This paper proposes a hybrid localization and mapping method using stereo vision and LiDAR. Unlike the traditional single-sensor systems, we construct a pose optimization model by matching ground information between LiDAR maps and visual images. We use stereo vision to extract ground information and fuse it with LiDAR tensor voting data to establish coplanarity constraints. Pose optimization is achieved through a graph-based optimization algorithm and a local window optimization method. The proposed method is evaluated using the KITTI dataset and compared against the ORB-SLAM3, F-LOAM, LOAM, and LeGO-LOAM methods. Additionally, we generate 3D point cloud maps for the corresponding sequences and high-definition point cloud maps of the streets in sequence 00. The experimental results demonstrate significant improvements in trajectory accuracy and robustness, enabling the construction of clear, dense 3D maps.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11548508 | PMC |
http://dx.doi.org/10.3390/s24216828 | DOI Listing |
Data Brief
October 2025
School of Information and Communication Technology, Mongolian University of Science and Technology, Ulaanbaatar 13341, Mongolia.
Perception plays a crucial role in autonomous driving and computer vision, particularly in interpreting traffic scenes from monocular cameras. In this article, we present a comprehensive collection of traffic scene datasets organized into four distinct groups: (1) Traffic Scene Datasets, (2) Top-View Datasets - both introduced in the authors' earlier research, (3) MultiHeightView Datasets and (4) Depth Datasets. The Traffic Scene Datasets include RealStreet, which captures authentic traffic scenarios, and SynthStreet, its synthetic counterpart.
View Article and Find Full Text PDFSensors (Basel)
August 2025
Idiap Research Institute, 1920 Martigny, Switzerland.
Current finger-vein or palm-vein recognition systems usually require direct contact of the subject with the apparatus. This can be problematic in environments where hygiene is of primary importance. In this work we present a contactless vascular biometrics sensor platform named which can be used for hand vascular biometrics studies (wrist, palm, and finger-vein) and surface features such as palmprint.
View Article and Find Full Text PDFOpt Lett
September 2025
Traditional X-Ray Computed Tomography (CT) generally requires thousands of projections for reconstruction, resulting in long acquisition times that limit its applicability in scenarios demanding rapid acquisition of 3D information. In this work, computer stereo vision technology was applied to X-ray imaging, and a single-shot 3D stereo imaging method based on dual polycapillary parallel X-ray lenses (PPXRLs) was proposed using a laboratory X-ray source. Two PPXRLs were used to generate two quasi-parallel beams, which were then Bragg reflected by two SiC crystals.
View Article and Find Full Text PDFTomography
July 2025
Institute of Sound and Vibration Research, University of Southampton, Southampton SO17 1BJ, UK.
X-ray computed tomography (XCT) is a powerful tool for volumetric imaging, where three-dimensional (3D) images are generated from a large number of individual X-ray projection images. However, collecting the required number of low-noise projection images is time-consuming, limiting its applicability to scenarios requiring high temporal resolution, such as the study of dynamic processes. Inspired by stereo vision, we previously developed stereo X-ray imaging methods that operate with only two X-ray projections, enabling the 3D reconstruction of point and line fiducial markers at significantly faster temporal resolutions.
View Article and Find Full Text PDFComput Med Imaging Graph
August 2025
Institute of Medical Robotics, School of Biomedical Engineering, Shanghai Jiao Tong University, Shanghai 200240, China. Electronic address:
During minimally invasive robot-assisted surgical procedures, surgeons rely on stereo endoscopes to provide image guidance. Nevertheless, the field-of-view is typically restricted owing to the limited size of the endoscope and constrained workspace. Such a visualization challenge becomes even more severe when surgical instruments are inserted into the already restricted field-of-view, where important anatomical landmarks and relevant clinical contents may become occluded by the inserted instruments.
View Article and Find Full Text PDF