Self-supervised 6D object pose tracking using Rao-Blackwell particle filter and vision transformer

Ullah, Faheem, Patel, Preeti, Das, Sonjoy Ranjon, Farooqi, Ashfaq Hussain and Hassan, Bilal (2026) Self-supervised 6D object pose tracking using Rao-Blackwell particle filter and vision transformer. In: International Conference on Frontiers of Intelligent Computing: Theory and Applications, FICTA-2026, London. (In Press)

Abstract

In the realm of computer vision, tracking the 6D pose of objects is crucial for ro-botic manipulation and navigation tasks. This paper introduces a self-supervised method for accurate 6D object pose tracking by integrating EfficientDet and a vi-sion transformer-based masked auto-encoder within the Rao-Blackwellized parti-cle filter framework. Addressing challenges in capturing pose uncertainties and encoding depth information, our approach discretizes 3D translations and rota-tions to estimate full posteriors over 6D poses. Validation on the YCB-Video dataset using the ADD(-S) metric demonstrates significant improvements, achiev-ing average accuracies of 72.80% with 50 particles and 75.51% with 200 particles, surpassing state-of-the-art methods. This research provides a robust framework for enhancing 6D object pose tracking in robotic applications.

Documents
11328:56762
[thumbnail of Submitted Version 18 Feb 2026.pdf]
Submitted Version 18 Feb 2026.pdf - Accepted Version
Restricted to Repository staff only until 8 March 2027.

Download (756kB) | Request a copy
Details
Record
View Item View Item