Joint Learning of Object Detection and Pose Estimation using Augmented Autoencoder

説明

This paper proposes a method for estimating the pose of a rigid object. While an appearance-based pose estimator requires a bounding box of each target object, an object detector is in general trained independently of the pose estimator. Recent pose estimators are robust to occlusion and image deviation, if the object region is correctly located by the detector. In reality, however, it is difficult to detect correct bounding-boxes, and such erroneous bounding-boxes make pose estimation inaccurate. Our proposed method integrates the object detector and the pose estimator so that they share feature maps and support to each other for improving the pose estimation accuracy. Experimental results demonstrate that the performance of our method is 7.54 times better than the SoTA pose estimation method.

収録刊行物

詳細情報 詳細情報について

問題の指摘

ページトップへ