98%
921
2 minutes
20
With advancements in robust stereo matching and optical flow estimation networks, models pre-trained on synthetic data demonstrate strong robustness to unseen domains. However, their robustness can be seriously degraded when fine-tuning them in real-world scenarios. This paper investigates fine-tuning stereo matching and optical flow estimation networks without compromising their robustness to unseen domains. Specifically, we divide the pixels into consistent and inconsistent regions by comparing Ground Truth (GT) with Pseudo Label (PL) and demonstrate that the imbalance learning of consistent and inconsistent regions in GT causes robustness degradation. Based on our analysis, we propose the DKT framework, which utilizes PL to balance the learning of different regions in GT. The core idea is to utilize an exponential moving average (EMA) teacher to measure what the student network has learned and dynamically adjust the learning regions. We further propose the DKT++ framework, which improves target-domain performances and network robustness by applying slow-fast update teachers to generate more accurate PL, introducing the unlabeled data and synthetic data. We integrate our frameworks with state-of-the-art networks and evaluate their effectiveness on several real-world datasets. Extensive experiments show that our method effectively preserves the robustness of stereo matching and optical flow networks during fine-tuning.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1109/TPAMI.2025.3584847 | DOI Listing |
IEEE Trans Image Process
September 2025
3D imaging based on phase-shifting structured light is widely used in industrial measurement due to its non-contact nature. However, it typically requires a large number of additional images (multi-frequency heterodyne (M-FH) method) or introduces intensity features that compromise accuracy (space domain modulation phase-shifting (SDM-PS) method) for phase unwrapping, and it remains sensitive to motion. To overcome these issues, this article proposes a nonlinear phase coding-based stereo phase unwrapping (NPC-SPU) method that requires no additional patterns while maintaining measurement accuracy.
View Article and Find Full Text PDFComput Med Imaging Graph
August 2025
Institute of Medical Robotics, School of Biomedical Engineering, Shanghai Jiao Tong University, Shanghai 200240, China. Electronic address:
During minimally invasive robot-assisted surgical procedures, surgeons rely on stereo endoscopes to provide image guidance. Nevertheless, the field-of-view is typically restricted owing to the limited size of the endoscope and constrained workspace. Such a visualization challenge becomes even more severe when surgical instruments are inserted into the already restricted field-of-view, where important anatomical landmarks and relevant clinical contents may become occluded by the inserted instruments.
View Article and Find Full Text PDFFringe projection profilometry, particularly phase-shifting profilometry, has been extensively studied and widely adopted due to its non-contact operation, high accuracy, and efficiency. However, in dynamic applications, it faces two major challenges: the increased number of encoded patterns required for absolute phase retrieval, and motion-induced phase errors. To address these limitations while preserving measurement accuracy, this study proposes a motion-induced phase-shifting method that integrates Fourier fringe analysis with speckle correlation-assisted phase matching.
View Article and Find Full Text PDF3D contour measurement is critical in ensuring proper assembly and optimal performance for sheet metal parts. The contour is a primary feature of sheet metal parts. However, during the stamping process, severe deformation and the formation of reflective roll-over zones often lead to interference factors such as surface discontinuities, viewing angle obstructions, and uneven reflectivity at the contour.
View Article and Find Full Text PDFHigh-precision stereo calibration is essential for achieving accurate 3D vision measurements. Marker-based calibration methods, such as the checkerboard and circular markers, are widely adopted for their robustness and high accuracy. However, the checkerboard is relatively sensitive to noise, while circular markers suffer from eccentricity errors.
View Article and Find Full Text PDF