
doi: 10.1002/rcs.70078
pmid: 40413787
ABSTRACTBackgroundAccurate endoscopic camera pose estimation is crucial for real‐time AR navigation systems. While current methods primarily use depth and optical flow, they often ignore structural inconsistencies between images.MethodsLeveraging the RAFT framework, we process sequential stereo RGB pairs to extract optical flow and depth features for pose estimation. To address structural inconsistencies, we refine the weights for both 2D and 3D residuals by computing SSIM indices for the left and right views, as well as pre‐ and post‐optical flow transformations. The SSIM metric is also used in the loss function.ResultsExperiments on the StereoMIS dataset demonstrate our method's improved pose estimation accuracy compared to rigid SLAM methods, showing a lower accumulated trajectory error (ATE‐RMSE: 18.5 mm). Additionally, ablation experiments achieved an 11.49% reduction in average error.ConclusionThe pose estimation accuracy has been improved by incorporating SSIM. The code is available at: https://github.com/lianrq/pose‐estimation‐by‐SSIM‐Integration.
Imaging, Three-Dimensional, Surgery, Computer-Assisted, Robotic Surgical Procedures, Image Processing, Computer-Assisted, Humans, Reproducibility of Results, Endoscopy, Algorithms
Imaging, Three-Dimensional, Surgery, Computer-Assisted, Robotic Surgical Procedures, Image Processing, Computer-Assisted, Humans, Reproducibility of Results, Endoscopy, Algorithms
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 0 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Average | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
