Skip to main navigation Skip to search Skip to main content

SPGNet: Semantic prediction guidance for scene parsing

  • Bowen Cheng
  • , Uiuc Uiuc
  • , Liang Chieh Chen
  • , Yunchao Wei
  • , Yukun Zhu
  • , Zilong Huang
  • , Jinjun Xiong
  • , Thomas Huang
  • , Wen Mei Hwu
  • , Honghui Shi
  • UIUC
  • ReLER
  • IBM
  • University of Oregon

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

107 Scopus citations

Abstract

Multi-scale context module and single-stage encoder-decoder structure are commonly employed for semantic segmentation. The multi-scale context module refers to the operations to aggregate feature responses from a large spatial extent, while the single-stage encoder-decoder structure encodes the high-level semantic information in the encoder path and recovers the boundary information in the decoder path. In contrast, multi-stage encoder-decoder networks have been widely used in human pose estimation and show superior performance than their single-stage counterpart. However, few efforts have been attempted to bring this effective design to semantic segmentation. In this work, we propose a Semantic Prediction Guidance (SPG) module which learns to re-weight the local features through the guidance from pixel-wise semantic prediction. We find that by carefully re-weighting features across stages, a two-stage encoder-decoder network coupled with our proposed SPG module can significantly outperform its one-stage counterpart with similar parameters and computations. Finally, we report experimental results on the semantic segmentation benchmark Cityscapes, in which our SPGNet attains 81.1% on the test set using only 'fine' annotations.

Original languageEnglish
Title of host publicationProceedings - 2019 International Conference on Computer Vision, ICCV 2019
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages5217-5227
Number of pages11
ISBN (Electronic)9781728148038
DOIs
StatePublished - Oct 2019
Event17th IEEE/CVF International Conference on Computer Vision, ICCV 2019 - Seoul, Korea, Republic of
Duration: Oct 27 2019Nov 2 2019

Publication series

NameProceedings of the IEEE International Conference on Computer Vision
ISSN (Print)1550-5499

Conference

Conference17th IEEE/CVF International Conference on Computer Vision, ICCV 2019
Country/TerritoryKorea, Republic of
CitySeoul
Period10/27/1911/2/19

Fingerprint

Dive into the research topics of 'SPGNet: Semantic prediction guidance for scene parsing'. Together they form a unique fingerprint.

Cite this