6D Object Pose Estimation
Instance-level 6D object pose estimation estimate 6D poses of seen objects, mainly targeting to report improved results overcoming instances' challenges, such as viewpoint variability, occlusion, clutter, and similar-looking distractors. However, instance-based methods cannot easily be generalized for category-level 6D object pose estimation, which involves the challenges such as distribution shift among source and target domains, high intra-class variations, and shape discrepencies between objects.
The authors formulated the instance-level 6D pose estimation as follows: given an RGB-D image $I$ where an instance $S$ of the interested object $O$ exists, we estimate the 3D translation $\mathbf{x}=(x, y, z)$ and the 3D rotation $\theta = (r, p, y)$ as
\[ (\mathbf{x}, \theta)^* = \arg\max_{\mathbf{x}, \theta} p(\mathbf{x}, \theta \mid I, S) \]
Extensions includes other input formats, such as multiple instances $\mathcal{S}$, an instance $C$ from a category $c$, and multiple instances $\mathcal{C}$ from a category $c$.
|