A Top-Down Unified Framework for Instance-level Human Parsing

Haifang Qin (Peking University), Weixiang Hong (National University of Singapore), Wei-Chih Hung (University of California, Merced), Yi-Hsuan Tsai (NEC Labs America), Ming-Hsuan Yang (University of California, Merced)

Abstract
Instance-level human parsing is one of the essential tasks for human-centric analysis which aims to segment various body parts and associate each part with the corresponding human instance simultaneously. Most state-of-the-art methods group instances upon multi-human parsing results, but they tend to miss instances and fail in grouping under the crowded scene. To address this problem, we propose a top-down unified framework to simultaneously detect human instance and parse every part within that instance. To better parse the single human, we also design an attention module, which is aggregated to our parsing network. As a result, our approach is capable of obtaining fine-grained parsing results and the corresponding human mask in a single forward pass. Experiments show that the proposed algorithm performs favorably against state-of-the-art methods on the CIHP and PASCAL-Person-Part datasets.

DOI
10.5244/C.33.6
https://dx.doi.org/10.5244/C.33.6

Files
Paper (PDF)
Supplementary material (ZIP)

BibTeX
@inproceedings{BMVC2019,
title={A Top-Down Unified Framework for Instance-level Human Parsing},
author={Haifang Qin and Weixiang Hong and Wei-Chih Hung and Yi-Hsuan Tsai and Ming-Hsuan Yang},
year={2019},
month={September},
pages={6.1--6.14},
articleno={6},
numpages={14},
booktitle={Proceedings of the British Machine Vision Conference (BMVC)},
publisher={BMVA Press},
editor={Kirill Sidorov and Yulia Hicks},
doi={10.5244/C.33.6},
url={https://dx.doi.org/10.5244/C.33.6}
}