Skip to main navigation Skip to search Skip to main content

Single-Shot Scale-Aware Network for Real-Time Face Detection

  • Shifeng Zhang
  • , Longyin Wen
  • , Hailin Shi
  • , Zhen Lei
  • , Siwei Lyu
  • , Stan Z. Li
  • CAS - Institute of Automation
  • University of Chinese Academy of Sciences
  • JD Digits
  • JD AI Research

Research output: Contribution to journalArticlepeer-review

54 Scopus citations

Abstract

In this work, we describe a single-shot scale-aware convolutional neural network based face detector (SFDet). In comparison with the state-of-the-art anchor-based face detection methods, the main advantages of our method are summarized in four aspects. (1) We propose a scale-aware detection network using a wide scale range of layers associated with appropriate scales of anchors to handle faces with various scales, and describe a new equal density principle to ensure anchors with different scales to be evenly distributed on the image. (2) To improve the recall rates of faces with certain scales (e.g., the scales of the faces are quite different from the scales of designed anchors), we design a new anchor matching strategy with scale compensation. (3) We introduce an IoU-aware weighting scheme for each training sample in classification loss calculation to encode samples accurately in training process. (4) Considering the class imbalance issue, a max-out background strategy is used to reduce false positives. Several experiments are conducted on public challenging face detection datasets, i.e., WIDER FACE, AFW, PASCAL Face, FDDB, and MAFA, to demonstrate that the proposed method achieves the state-of-the-art results and runs at 82.1 FPS for the VGA-resolution images.

Original languageEnglish
Pages (from-to)537-559
Number of pages23
JournalInternational Journal of Computer Vision
Volume127
Issue number6-7
DOIs
StatePublished - Jun 1 2019

Keywords

  • Class imbalance
  • Face detection
  • Scale-aware
  • Single-shot

Fingerprint

Dive into the research topics of 'Single-Shot Scale-Aware Network for Real-Time Face Detection'. Together they form a unique fingerprint.

Cite this