
Conditional random field based side-information fusion for distributed multi-view video coding

  • Shanghai Jiao Tong University

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

2 Scopus citations

Abstract

This paper presents a new temporal and inter-view side-information fusion algorithm for distributed multi-view video coding (DMVC). Unlike existing fusion algorithms in DMVC schemes, which produce the fusion mask by finding motion vector outliers, it introduces conditional random fields (CRFs) to exploit the intrinsic geometric regularity and temporal consistency constraints in multi-view video sequences. Specifically, Wyner-Ziv (WZ) frames are modeled by a CRF with the temporal and the inter-view side-information as two observations. The observation distribution models the local accuracy of the temporal and the inter-view side-information. The transition distribution of the CRF model represents the local geometric regularity, e.g., the edge directions and the local smoothness of the WZ frame. The model parameters are trained on previously decoded WZ frames, and inference with the trained weights generates the fused side-information. Experiments show a significant performance gain over existing fusion algorithms.
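As a rough illustration of the fusion idea described in the abstract, the sketch below labels each pixel as "take the temporal estimate" or "take the inter-view estimate" on a 4-connected grid. It is not the paper's method. The inputs `cost_t` and `cost_v` are hypothetical per-pixel error estimates standing in for the observation model, a single hand-set `smooth` weight stands in for the trained transition parameters, and iterated conditional modes (ICM) is an assumed, simple inference scheme chosen only to keep the example short.

```python
def fuse_side_information(temporal, interview, cost_t, cost_v, smooth=0.5, iters=5):
    """Fuse two side-information frames with a binary grid labelling.

    Label 0 selects the temporal pixel, label 1 the inter-view pixel.
    All frames and cost maps are equally sized lists of lists.
    """
    h, w = len(temporal), len(temporal[0])
    unary = (cost_t, cost_v)  # hypothetical local-accuracy costs per label

    # Start from the per-pixel cheapest label (no smoothness yet).
    labels = [[0 if cost_t[y][x] <= cost_v[y][x] else 1 for x in range(w)]
              for y in range(h)]

    for _ in range(iters):
        changed = False
        for y in range(h):
            for x in range(w):
                best_label, best_energy = labels[y][x], float("inf")
                for cand in (0, 1):
                    energy = unary[cand][y][x]
                    # Potts penalty: pay `smooth` for each disagreeing neighbour.
                    for dy, dx in ((-1, 0), (1, 0), (0, -1), (0, 1)):
                        ny, nx = y + dy, x + dx
                        if 0 <= ny < h and 0 <= nx < w and labels[ny][nx] != cand:
                            energy += smooth
                    if energy < best_energy:
                        best_label, best_energy = cand, energy
                if best_label != labels[y][x]:
                    labels[y][x] = best_label
                    changed = True
        if not changed:  # ICM has reached a local minimum
            break

    fused = [[temporal[y][x] if labels[y][x] == 0 else interview[y][x]
              for x in range(w)] for y in range(h)]
    return fused, labels
```

With `smooth=0` the result reduces to a per-pixel choice of the cheaper source; raising `smooth` removes isolated label switches, which is the role the abstract assigns to the local smoothness part of the transition model.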

Original language: English
Title of host publication: 2011 IEEE Visual Communications and Image Processing, VCIP 2011
State: Published - 2011
Event: 2011 IEEE Visual Communications and Image Processing, VCIP 2011 - Tainan, Taiwan, Province of China
Duration: Nov 6 2011 → Nov 9 2011

Publication series

Name: 2011 IEEE Visual Communications and Image Processing, VCIP 2011

Conference

Conference: 2011 IEEE Visual Communications and Image Processing, VCIP 2011
Country/Territory: Taiwan, Province of China
City: Tainan
Period: 11/6/11 → 11/9/11
