Skip to main navigation Skip to search Skip to main content

Sparse Representation with Spatio-Temporal Online Dictionary Learning for Promising Video Coding

  • Wenrui Dai
  • , Yangmei Shen
  • , Xin Tang
  • , Junni Zou
  • , Hongkai Xiong
  • , Chang Wen Chen
  • University of California at San Diego
  • Shanghai Jiao Tong University
  • Shanghai University

Research output: Contribution to journalArticlepeer-review

10 Scopus citations

Abstract

Classical dictionary learning methods for video coding suffer from high computational complexity and interfered coding efficiency by disregarding its underlying distribution. This paper proposes a spatio-temporal online dictionary learning (STOL) algorithm to speed up the convergence rate of dictionary learning with a guarantee of approximation error. The proposed algorithm incorporates stochastic gradient descents to form a dictionary of pairs of 3D low-frequency and high-frequency spatio-temporal volumes. In each iteration of the learning process, it randomly selects one sample volume and updates the atoms of dictionary by minimizing the expected cost, rather than optimizes empirical cost over the complete training data, such as batch learning methods, e.g., K-SVD. Since the selected volumes are supposed to be independent identically distributed samples from the underlying distribution, decomposition coefficients attained from the trained dictionary are desirable for sparse representation. Theoretically, it is proved that the proposed STOL could achieve better approximation for sparse representation than K-SVD and maintain both structured sparsity and hierarchical sparsity. It is shown to outperform batch gradient descent methods (K-SVD) in the sense of convergence speed and computational complexity, and its upper bound for prediction error is asymptotically equal to the training error. With lower computational complexity, extensive experiments validate that the STOL-based coding scheme achieves performance improvements than H.264/AVC or High Efficiency Video Coding as well as existing super-resolution-based methods in rate-distortion performance and visual quality.

Original languageEnglish
Article number7523418
Pages (from-to)4580-4595
Number of pages16
JournalIEEE Transactions on Image Processing
Volume25
Issue number10
DOIs
StatePublished - Oct 2016

Keywords

  • K-SVD
  • Online dictionary learning
  • sparse representation
  • stochastic gradient descent
  • video coding

Fingerprint

Dive into the research topics of 'Sparse Representation with Spatio-Temporal Online Dictionary Learning for Promising Video Coding'. Together they form a unique fingerprint.

Cite this