TY - GEN
T1 - Conditional random field based side-information fusion for distributed multi-view video coding
AU - Zhang, Yongsheng
AU - Xiong, Hongkai
AU - Wang, Hao
AU - Chen, Chang Wen
PY - 2011
Y1 - 2011
N2 - This paper presents a new temporal and inter-view side-information fusion algorithm for distributed multi-view video coding (DMVC). Unlike existing fusion algorithms in DMVC schemes that produce the fusion mask by finding the motion vector outliers, it introduces conditional random fields (CRF) to exploit the intrinsic geometric regularity and temporal consistency constraint in multi-view video sequences. Specifically, Wyner-Ziv (WZ) frames are modeled by CRF with the temporal and the inter-view side-information as two observations. The observation distribution models the local accuracy of the temporal and the inter-view side-information. The transition distribution of the CRF model represents the local geometric regularity, e.g., the edge directions and the local smoothness of the WZ frame. Its parameters are trained from previously decoded WZ frames, and the inference is made on trained weights to generate fused side-information. The accurate modeling is validated to show a significant performance gain over the existing fusion algorithms by experiments.
AB - This paper presents a new temporal and inter-view side-information fusion algorithm for distributed multi-view video coding (DMVC). Unlike existing fusion algorithms in DMVC schemes that produce the fusion mask by finding the motion vector outliers, it introduces conditional random fields (CRF) to exploit the intrinsic geometric regularity and temporal consistency constraint in multi-view video sequences. Specifically, Wyner-Ziv (WZ) frames are modeled by CRF with the temporal and the inter-view side-information as two observations. The observation distribution models the local accuracy of the temporal and the inter-view side-information. The transition distribution of the CRF model represents the local geometric regularity, e.g., the edge directions and the local smoothness of the WZ frame. Its parameters are trained from previously decoded WZ frames, and the inference is made on trained weights to generate fused side-information. The accurate modeling is validated to show a significant performance gain over the existing fusion algorithms by experiments.
UR - https://www.scopus.com/pages/publications/84862972552
U2 - 10.1109/VCIP.2011.6116053
DO - 10.1109/VCIP.2011.6116053
M3 - Conference contribution
AN - SCOPUS:84862972552
SN - 9781457713200
T3 - 2011 IEEE Visual Communications and Image Processing, VCIP 2011
BT - 2011 IEEE Visual Communications and Image Processing, VCIP 2011
T2 - 2011 IEEE Visual Communications and Image Processing, VCIP 2011
Y2 - 6 November 2011 through 9 November 2011
ER -