Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1 2005
DOI: 10.1109/iccv.2005.59
|View full text |Cite
|
Sign up to set email alerts
|

Conditional models for contextual human motion recognition

Abstract: We present algorithms for recognizing human motion in monocular video sequences, based on discriminative Conditional Random Field (CRF) and Maximum Entropy Markov Models (MEMM). Existing approaches to this problem typically use generative (joint) structures like the Hidden Markov Model (HMM). Therefore they have to make simplifying, often unrealistic assumptions on the conditional independence of observations given the motion class labels and cannot accommodate overlapping features or long term contextual depe… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
118
0

Year Published

2009
2009
2021
2021

Publication Types

Select...
4
3
2

Relationship

0
9

Authors

Journals

citations
Cited by 236 publications
(118 citation statements)
references
References 19 publications
0
118
0
Order By: Relevance
“…A different approach is taken by Mün-dermann et al (2007) and Cheng and Trivedi (2007), who align a 3D visual hull body model to an observed visual hull constructed from silhouettes seen in multiple viewpoints in order to achieve viewpoint independence. Other researchers (Taycher et al 2006;Sminichisescu et al 2006) utilise statistical time-series models such as conditional random fields (CRFs) and maximum entropy Markov models (MEMMs) to perform tracking. However, failures in detecting the true limb or the full silhouette and the need to train limb detectors specific to the person being tracked means that discriminative approaches have difficulty with observation 'errors' (e.g.…”
Section: Background and Related Workmentioning
confidence: 99%
“…A different approach is taken by Mün-dermann et al (2007) and Cheng and Trivedi (2007), who align a 3D visual hull body model to an observed visual hull constructed from silhouettes seen in multiple viewpoints in order to achieve viewpoint independence. Other researchers (Taycher et al 2006;Sminichisescu et al 2006) utilise statistical time-series models such as conditional random fields (CRFs) and maximum entropy Markov models (MEMMs) to perform tracking. However, failures in detecting the true limb or the full silhouette and the need to train limb detectors specific to the person being tracked means that discriminative approaches have difficulty with observation 'errors' (e.g.…”
Section: Background and Related Workmentioning
confidence: 99%
“…Many efforts in video event detection have been done to overcome these challenges [1][2][3][4][5][6][7][8][9][10]. According to the proposed methods, a typical video event detection system can be divided into two components which are information abstraction component and event modeling component.…”
Section: Introductionmentioning
confidence: 99%
“…In contrast to the generative approaches discussed above, there exist discriminative approaches for modeling human actions. An in-depth discussion of discriminative models is beyond the scope of this paper, and we refer the reader to [17,10] and references therein.…”
Section: Introductionmentioning
confidence: 99%