
Early Action Recognition with Category Exclusion Using Policy-Based Reinforcement Learning

  • Nanyang Technological University
  • Harvard University

Research output: Contribution to journal › Article › peer-review

44 Scopus citations

Abstract

The goal of early action recognition is to predict the action label when the sequence is only partially observed. Existing methods treat early action recognition as a sequential classification problem over different observation ratios of an action sequence. Since these models are trained by differentiating the positive category from all negative classes, the diverse information carried by different negative categories is ignored, which we believe can be exploited to improve recognition performance. In this paper, we step toward a new direction by introducing category exclusion to early action recognition. We model the exclusion as a mask operation on the classification probability output of a pre-trained early action recognition classifier. Specifically, we use policy-based reinforcement learning to train an agent. The agent generates a series of binary masks that exclude interfering negative categories during action execution and hence help improve recognition accuracy. The proposed method is evaluated on three benchmark recognition datasets: NTU-RGBD, First-Person Hand Action, and UCF-101. It improves recognition accuracy consistently across all observation ratios on the three datasets, with especially significant improvements at the early stages.
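
The mask operation described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `apply_exclusion_mask` and `predict`, the renormalization step, the fallback when every category is excluded, and the example numbers are all assumptions made for illustration. The policy that would generate the mask is not shown; the mask is simply given by hand.

```python
def apply_exclusion_mask(probs, mask):
    """Zero out excluded categories and renormalize what remains.

    probs: class probabilities from a (hypothetical) pre-trained classifier.
    mask:  binary values, 1 keeps a category and 0 excludes it.
    """
    masked = [p * m for p, m in zip(probs, mask)]
    total = sum(masked)
    if total == 0:
        # Assumption: if every category is excluded, fall back to the
        # unmasked classifier output rather than dividing by zero.
        return list(probs)
    return [v / total for v in masked]


def predict(probs):
    """Return the index of the highest-probability category."""
    return max(range(len(probs)), key=lambda i: probs[i])


# Illustrative numbers only: early in the sequence the classifier slightly
# prefers category 0, but the mask excludes it as an interfering negative.
probs = [0.40, 0.35, 0.25]
mask = [0, 1, 1]

masked_probs = apply_exclusion_mask(probs, mask)
unmasked_prediction = predict(probs)
masked_prediction = predict(masked_probs)
```

In this toy case the unmasked prediction is category 0, while the masked prediction moves to category 1 once category 0 has been excluded.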

Original language: English
Article number: 9016114
Pages (from-to): 4626-4638
Number of pages: 13
Journal: IEEE Transactions on Circuits and Systems for Video Technology
Volume: 30
Issue number: 12
State: Published - Dec 2020

Keywords

  • Category exclusion
  • early action recognition
  • policy-based reinforcement learning
