TY - GEN
T1 - Joint facial landmark detection and action estimation based on deep probabilistic random forest
AU - Yu, Jun
AU - Chen, Chang Wen
N1 - Publisher Copyright:
© 2017 IEEE.
PY - 2017/7/2
Y1 - 2017/7/2
N2 - Random forest is effective and efficient for detecting facial landmark from visual images, and has achieved the state-of-the-art performance, both in accuracy and speed, by regressing local binary features (LBF). This paper aims to increase the detection accuracy of random forest for facial landmarks and extends it to facial action estimation. First, probabilistic features are designed to overcome the weaknesses of LBF, e.g., feature sparseness and tracking jitter. Second, a deep architecture is introduced to random forest for enhancing the capacity of representation learning. Third, the initial detected facial landmarks are refined and 3D facial actions are estimated jointly by registering a deformable facial model to images based on an optimized iterative closest point framework. Experiments show that the proposed methods significantly outperform the state-of-the-art ones in terms of accuracy, as well as achieve the excellent tracking stability and real-time ability at about 60 fps on an ordinary PC.
AB - Random forest is effective and efficient for detecting facial landmark from visual images, and has achieved the state-of-the-art performance, both in accuracy and speed, by regressing local binary features (LBF). This paper aims to increase the detection accuracy of random forest for facial landmarks and extends it to facial action estimation. First, probabilistic features are designed to overcome the weaknesses of LBF, e.g., feature sparseness and tracking jitter. Second, a deep architecture is introduced to random forest for enhancing the capacity of representation learning. Third, the initial detected facial landmarks are refined and 3D facial actions are estimated jointly by registering a deformable facial model to images based on an optimized iterative closest point framework. Experiments show that the proposed methods significantly outperform the state-of-the-art ones in terms of accuracy, as well as achieve the excellent tracking stability and real-time ability at about 60 fps on an ordinary PC.
KW - facial action estimation
KW - Facial alignment
KW - facial landmarks
KW - iterative closest fitting
KW - random forest
UR - https://www.scopus.com/pages/publications/85050588213
U2 - 10.1109/VCIP.2017.8305115
DO - 10.1109/VCIP.2017.8305115
M3 - Conference contribution
AN - SCOPUS:85050588213
T3 - 2017 IEEE Visual Communications and Image Processing, VCIP 2017
SP - 1
EP - 4
BT - 2017 IEEE Visual Communications and Image Processing, VCIP 2017
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2017 IEEE Visual Communications and Image Processing, VCIP 2017
Y2 - 10 December 2017 through 13 December 2017
ER -