Bootstrapping the performance of webly supervised semantic segmentation

Shen, T.; Lin, G.; Shen, C.; Reid, I.

Please use this identifier to cite or link to this item: https://hdl.handle.net/2440/124478

Scopus	Web of Science®	Altmetric
Citations
?	?

Type:	Conference paper
Title:	Bootstrapping the performance of webly supervised semantic segmentation
Author:	Shen, T. Lin, G. Shen, C. Reid, I.
Citation:	Proceedings / CVPR, IEEE Computer Society Conference on Computer Vision and Pattern Recognition. IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2018, pp.1363-1371
Publisher:	IEEE
Publisher Place:	Piscataway, NJ.
Issue Date:	2018
Series/Report no.:	IEEE Conference on Computer Vision and Pattern Recognition
ISBN:	1538664208 9781538664209
ISSN:	1063-6919 2575-7075
Conference Name:	IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (18 Jun 2018 - 23 Jun 2018 : Salt Lake City, USA)
Statement of Responsibility:	Tong Shen, Guosheng Lin, Chunhua Shen, Ian Reid
Abstract:	Fully supervised methods for semantic segmentation require pixel-level class masks to train, the creation of which is expensive in terms of manual labour and time. In this work, we focus on weak supervision, developing a method for training a high-quality pixel-level classifier for semantic segmentation, using only image-level class labels as the provided ground-truth. Our method is formulated as a twostage approach in which we first aim to create accurate pixel-level masks for the training images via a bootstrapping process, and then use these now-accurately segmented images as a proxy ground-truth in a more standard supervised setting. The key driver for our work is that in the target dataset we typically have reliable ground-truth image-level labels, while data crawled from the web may have unreliable labels, but can be filtered to comprise only easy images to segment, therefore having reliable boundaries. These two forms of information are complementary and we use this observation to build a novel bi-directional transfer learning framework. This framework transfers knowledge between two domains, target domain and web domain, bootstrapping the performance of weakly supervised semantic segmentation. Conducting experiments on the popular benchmark dataset PASCAL VOC 2012 based on both a VGG16 network and on ResNet50, we reach state-of-the-art performance with scores of 60.2% IoU and 63.9% IoU respectively¹.
Rights:	© 2018 IEEE
DOI:	10.1109/CVPR.2018.00148
Grant ID:	http://purl.org/au-research/grants/arc/CE140100016 http://purl.org/au-research/grants/arc/FT120100969
Published version:	https://ieeexplore.ieee.org/xpl/conhome/8576498/proceeding
Appears in Collections:	Aurora harvest 4 Computer Science publications

Files in This Item:

There are no files associated with this item.

Show full item record

Adelaide Research & Scholarship