2012
DOI: 10.1007/s11263-012-0549-0
|View full text |Cite
|
Sign up to set email alerts
|

Random Forests for Real Time 3D Face Analysis

Abstract: We present a random forest-based framework for real time head pose estimation from depth images and extend it to localize a set of facial features in 3D. Our algorithm takes a voting approach, where each patch extracted from the depth image can directly cast a vote for the head pose or each of the facial features. Our system proves capable of handling large rotations, partial occlusions, and the noisy depth data acquired using commercial sensors. Moreover, the algorithm works on each frame independently and ac… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
357
1
1

Year Published

2015
2015
2024
2024

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 498 publications
(359 citation statements)
references
References 72 publications
0
357
1
1
Order By: Relevance
“…Biwi Kinect Head Pose dataset [25] was generated by computer vision laboratory of ETH Zurich in 2013 for estimating the location and orientation of a person's head from the depth data. This dataset is recorded when some people are facing to a Kinect (about one meter away) and turning their heads around randomly.…”
Section: Biwi Kinect Head Pose Datasetmentioning
confidence: 99%
“…Biwi Kinect Head Pose dataset [25] was generated by computer vision laboratory of ETH Zurich in 2013 for estimating the location and orientation of a person's head from the depth data. This dataset is recorded when some people are facing to a Kinect (about one meter away) and turning their heads around randomly.…”
Section: Biwi Kinect Head Pose Datasetmentioning
confidence: 99%
“…Fanelli et al [22][23][24] adopted a voting method to directly determine head pose. However, their feature selection method for depth images degenerates into using 2D features, i.e., the RGB information used in 2D images was replaced by xyz-coordinate values in depth images.…”
Section: Head Pose Estimationmentioning
confidence: 99%
“…Papazov et al [25] also used a random forest-based framework, in a similar way to the methods in Refs. [22][23][24]. They replaced depth features by more elaborate triangular surface patch (TSP) features to ensure view-invariance.…”
Section: Head Pose Estimationmentioning
confidence: 99%
See 1 more Smart Citation
“…It can be described as four steps: (1) extract visual words from images by local feature descriptors like SURF [6], SIFT [7][8], PCA-SIFT [9], (2) construct the visual vocabulary by the clustering algorithms like k-means [10][11] and the random clustering forests [12], (3) quantify the images by the histogram of the extracted words in the vocabulary, and (4) use the sample images to train and test the classifier. The direct using of BOW model has been proved to be well applied to the classification of the repeated/near-duplicate images.…”
Section: Bag Of Words Modelmentioning
confidence: 99%