Bilinear Siamese Networks with Background Suppression for Visual Object Tracking

Hankyeol Lee (Korea Advanced Institute of Science and Technology), Seokeon Choi (Korea Advanced Institute of Science and Technology), Youngeun Kim (Korea Advanced Institute of Science and Technology), Changick Kim (Korea Advanced Institute of Science and Technology)

Abstract
In recent years, Siamese networks have been shown to be useful for visual tracking, offering high accuracy at real-time speed. However, because these networks use only the output of the last convolutional layer, they ignore low-level feature maps, which provide spatial details important for visual tracking. In this paper, we propose bilinear Siamese networks for visual object tracking that take both high- and low-level feature maps into account. To effectively combine feature maps extracted from multiple layers, we adopt factorized bilinear pooling in our network. We also introduce a novel background suppression module to reduce background interference. This module collects negative feature maps of the background in the first frame and suppresses background information during tracking, making the tracker more robust to background interference. Experimental results on the OTB-50 and OTB-100 benchmarks demonstrate that the proposed tracker achieves performance comparable to that of state-of-the-art trackers while running in real time.
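To make the idea of factorized bilinear pooling more concrete, the following is a minimal NumPy sketch of the generic low-rank form, in which each output channel of a full bilinear map is approximated by a sum of k rank-one terms. It is an illustration under stated assumptions, not the authors' implementation: the function name, the projection matrices `U` and `V`, the factor count `k`, and the assumption that the low- and high-level feature maps have already been brought to the same spatial size are all hypothetical choices made for this example.

```python
import numpy as np

def factorized_bilinear_pool(low, high, U, V, k):
    """Fuse two feature maps with a low-rank (factorized) bilinear form.

    low:  (..., d_low)  low-level features (hypothetical layout, channels last)
    high: (..., d_high) high-level features at the same spatial positions
    U:    (d_low,  o * k) projection for the low-level features
    V:    (d_high, o * k) projection for the high-level features
    Returns an array of shape (..., o).
    """
    # Project both inputs into a shared (o * k)-dimensional space and
    # multiply element-wise; this replaces an explicit outer product.
    joint = (low @ U) * (high @ V)
    # Sum each group of k consecutive factors to obtain o output channels.
    return joint.reshape(*joint.shape[:-1], -1, k).sum(axis=-1)

# Illustrative shapes only: a 6x6 map with 4 low-level and 8 high-level channels.
rng = np.random.default_rng(0)
low = rng.standard_normal((6, 6, 4))
high = rng.standard_normal((6, 6, 8))
U = rng.standard_normal((4, 3 * 2))   # o = 3 output channels, k = 2 factors
V = rng.standard_normal((8, 3 * 2))
fused = factorized_bilinear_pool(low, high, U, V, k=2)
```

Each output channel i equals the full bilinear form x^T W_i y with W_i restricted to rank at most k, so the parameter count grows with (d_low + d_high) * o * k rather than d_low * d_high * o.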

DOI
10.5244/C.33.112
https://dx.doi.org/10.5244/C.33.112

BibTeX
@inproceedings{BMVC2019,
title={Bilinear Siamese Networks with Background Suppression for Visual Object Tracking},
author={Hankyeol Lee and Seokeon Choi and Youngeun Kim and Changick Kim},
year={2019},
month={September},
pages={112.1--112.12},
articleno={112},
numpages={12},
booktitle={Proceedings of the British Machine Vision Conference (BMVC)},
publisher={BMVA Press},
editor={Kirill Sidorov and Yulia Hicks},
doi={10.5244/C.33.112},
url={https://dx.doi.org/10.5244/C.33.112}
}