Few-Shot Viewpoint Estimation
Hung-Yu Tseng (University of California, Merced), Shalini De Mello (NVIDIA Research), Jonathan Tremblay (NVIDIA), Sifei Liu (NVIDIA), Stan Birchfield (Clemson University), Ming-Hsuan Yang (University of California at Merced), Jan Kautz (NVIDIA) AbstractViewpoint estimation for known categories of objects has been improved significantly thanks to deep networks and large datasets, but generalization to unknown categories is still very challenging. With an aim towards improving performance on unknown categories, we introduce the problem of category-level few-shot viewpoint estimation. We design a novel framework to successfully train viewpoint networks for new categories with few examples (10 or less). We formulate the problem as one of learning to estimate category-specific 3D canonical shapes, their associated depth estimates, and semantic 2D keypoints. We apply meta-learning to learn weights for our network that are amenable to category-specific few-shot fine-tuning. Furthermore, we design a flexible meta-Siamese network that maximizes information sharing during meta-learning. Through extensive experimentation on the ObjectNet3D and Pascal3D+ benchmark datasets, we demonstrate that our framework, which we call MetaView, significantly outperforms fine-tuning the state-of-the-art models with few examples, and that the specific architectural innovations of our method are crucial to achieving good performance.
DOI
10.5244/C.33.18
https://dx.doi.org/10.5244/C.33.18
Files
BibTeX
@inproceedings{BMVC2019,
title={Few-Shot Viewpoint Estimation},
author={Hung-Yu Tseng and Shalini De Mello and Jonathan Tremblay and Sifei Liu and Stan Birchfield and Ming-Hsuan Yang and Jan Kautz},
year={2019},
month={September},
pages={18.1--18.13},
articleno={18},
numpages={13},
booktitle={Proceedings of the British Machine Vision Conference (BMVC)},
publisher={BMVA Press},
editor={Kirill Sidorov and Yulia Hicks},
doi={10.5244/C.33.18},
url={https://dx.doi.org/10.5244/C.33.18}
}
title={Few-Shot Viewpoint Estimation},
author={Hung-Yu Tseng and Shalini De Mello and Jonathan Tremblay and Sifei Liu and Stan Birchfield and Ming-Hsuan Yang and Jan Kautz},
year={2019},
month={September},
pages={18.1--18.13},
articleno={18},
numpages={13},
booktitle={Proceedings of the British Machine Vision Conference (BMVC)},
publisher={BMVA Press},
editor={Kirill Sidorov and Yulia Hicks},
doi={10.5244/C.33.18},
url={https://dx.doi.org/10.5244/C.33.18}
}