Skip to main navigation Skip to search Skip to main content

NAS-Count: Counting-by-Density with Neural Architecture Search

  • Yutao Hu
  • , Xiaolong Jiang
  • , Xuhui Liu
  • , Baochang Zhang
  • , Jungong Han
  • , Xianbin Cao
  • , David Doermann
  • Beihang University
  • Alibaba Group Holding Ltd.
  • Aberystwyth University
  • Ministry of Industry and Information Technology
  • Beijing Advanced Innovation Center for Big Data-Based Precision Medicine

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

73 Scopus citations

Abstract

Most of the recent advances in crowd counting have evolved from hand-designed density estimation networks, where multi-scale features are leveraged to address the scale variation problem, but at the expense of demanding design efforts. In this work, we automate the design of counting models with Neural Architecture Search (NAS) and introduce an end-to-end searched encoder-decoder architecture, Automatic Multi-Scale Network (AMSNet). Specifically, we utilize a counting-specific two-level search space. The encoder and decoder in AMSNet are composed of different cells discovered from micro-level search, while the multi-path architecture is explored through macro-level search. To solve the pixel-level isolation issue in MSE loss, AMSNet is optimized with an auto-searched Scale Pyramid Pooling Loss (SPPLoss) that supervises the multi-scale structural information. Extensive experiments on four datasets show AMSNet produces state-of-the-art results that outperform hand-designed models, fully demonstrating the efficacy of NAS-Count.

Original languageEnglish
Title of host publicationComputer Vision – ECCV 2020 - 16th European Conference, 2020, Proceedings
EditorsAndrea Vedaldi, Horst Bischof, Thomas Brox, Jan-Michael Frahm
PublisherSpringer Science and Business Media Deutschland GmbH
Pages747-766
Number of pages20
ISBN (Print)9783030585419
DOIs
StatePublished - 2020
Event16th European Conference on Computer Vision, ECCV 2020 - Glasgow, United Kingdom
Duration: Aug 23 2020Aug 28 2020

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume12367 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference16th European Conference on Computer Vision, ECCV 2020
Country/TerritoryUnited Kingdom
CityGlasgow
Period08/23/2008/28/20

Keywords

  • Crowd counting
  • Multi-scale
  • Neural Architecture Search

Fingerprint

Dive into the research topics of 'NAS-Count: Counting-by-Density with Neural Architecture Search'. Together they form a unique fingerprint.

Cite this