- AutorIn
- Eric Brachmann
- Titel
- Learning to Predict Dense Correspondences for 6D Pose Estimation
- Zitierfähige Url:
- https://nbn-resolving.org/urn:nbn:de:bsz:14-qucosa-236564
- Datum der Einreichung
- 05.09.2017
- Datum der Verteidigung
- 17.01.2018
- Abstract (EN)
- Object pose estimation is an important problem in computer vision with applications in robotics, augmented reality and many other areas. An established strategy for object pose estimation consists of, firstly, finding correspondences between the image and the object’s reference frame, and, secondly, estimating the pose from outlier-free correspondences using Random Sample Consensus (RANSAC). The first step, namely finding correspondences, is difficult because object appearance varies depending on perspective, lighting and many other factors. Traditionally, correspondences have been established using handcrafted methods like sparse feature pipelines. In this thesis, we introduce a dense correspondence representation for objects, called object coordinates, which can be learned. By learning object coordinates, our pose estimation pipeline adapts to various aspects of the task at hand. It works well for diverse object types, from small objects to entire rooms, varying object attributes, like textured or texture-less objects, and different input modalities, like RGB-D or RGB images. The concept of object coordinates allows us to easily model and exploit uncertainty as part of the pipeline such that even repeating structures or areas with little texture can contribute to a good solution. Although we can train object coordinate predictors independent of the full pipeline and achieve good results, training the pipeline in an end-to-end fashion is desirable. It enables the object coordinate predictor to adapt its output to the specificities of following steps in the pose estimation pipeline. Unfortunately, the RANSAC component of the pipeline is non-differentiable which prohibits end-to-end training. Adopting techniques from reinforcement learning, we introduce Differentiable Sample Consensus (DSAC), a formulation of RANSAC which allows us to train the pose estimation pipeline in an end-to-end fashion by minimizing the expectation of the final pose error.
- Freie Schlagwörter (DE)
- Posenschätzung, Maschinelles Lernen, Mustererkennung
- Freie Schlagwörter (EN)
- Pose Estimation, Machine Learning, Computer Vision
- Klassifikation (DDC)
- 004
- Klassifikation (RVK)
- ST 320, ST 330
- GutachterIn
- Prof. Dr. Stefan Gumhold
- Prof. Ing. PhD. Jiri Matas
- BetreuerIn
- Prof. Dr. Stefan Gumhold
- Prof. PhD. Carsten Rother
- Den akademischen Grad verleihende / prüfende Institution
- Technische Universität Dresden, Dresden
- URN Qucosa
- urn:nbn:de:bsz:14-qucosa-236564
- Veröffentlichungsdatum Qucosa
- 06.06.2018
- Dokumenttyp
- Dissertation
- Sprache des Dokumentes
- Englisch
- Lizenz / Rechtehinweis