2014
DOI: 10.1007/978-3-319-07064-3_8
|View full text |Cite
|
Sign up to set email alerts
|

Classifying Behavioral Attributes Using Conditional Random Fields

Abstract: A human behavior recognition method with an application to political speech videos is presented. We focus on modeling the behavior of a subject with a conditional random field (CRF). The unary terms of the CRF employ spatiotemporal features (i.e., HOG3D, STIP and LBP). The pairwise terms are based on kinematic features such as the velocity and the acceleration of the subject. As an exact solution to the maximization of the posterior probability of the labels is generally intractable, loopy belief propagation w… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
12
0

Year Published

2015
2015
2021
2021

Publication Types

Select...
4
2
1

Relationship

2
5

Authors

Journals

citations
Cited by 11 publications
(12 citation statements)
references
References 11 publications
0
12
0
Order By: Relevance
“…Social networking CCV (Jiang et al, 2011) 2001 20 9.317 FPSI (Fathi et al, 2012) 2012 6 8 ≈42 h 1,280 × 720 Broadcast field hockey (Lan et al, 2012b) 2012 11 58 USAA (Fu et al, 2012) 2012 8 ≈200 Sports-1M (Karpathy et al, 2014) 2014 487 1 M ActivityNet (Heilbron et al, 2015) 2015 203 27,801 1,280 × 720 WWW Crowd (Shao et al, 2015) 2015 94 10,000 640 × 360 Behavior BEHAVE (Fisher, 2007a) 2007 8 321 640 × 480 Canal9 (Vinciarelli et al, 2009) 2009 2 190 ≈42 h 720 × 576 USC Creative IT (Metallinou et al, 2010) 2010 50 16 100 Parliament (Vrigkas et al, 2014b) 2014 3 20 228 320 × 240 a frontal view camera. The non-complex backgrounds and the non-intraclass variations in human movements make these datasets non-applicable for real world applications.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…Social networking CCV (Jiang et al, 2011) 2001 20 9.317 FPSI (Fathi et al, 2012) 2012 6 8 ≈42 h 1,280 × 720 Broadcast field hockey (Lan et al, 2012b) 2012 11 58 USAA (Fu et al, 2012) 2012 8 ≈200 Sports-1M (Karpathy et al, 2014) 2014 487 1 M ActivityNet (Heilbron et al, 2015) 2015 203 27,801 1,280 × 720 WWW Crowd (Shao et al, 2015) 2015 94 10,000 640 × 360 Behavior BEHAVE (Fisher, 2007a) 2007 8 321 640 × 480 Canal9 (Vinciarelli et al, 2009) 2009 2 190 ≈42 h 720 × 576 USC Creative IT (Metallinou et al, 2010) 2010 50 16 100 Parliament (Vrigkas et al, 2014b) 2014 3 20 228 320 × 240 a frontal view camera. The non-complex backgrounds and the non-intraclass variations in human movements make these datasets non-applicable for real world applications.…”
Section: Discussionmentioning
confidence: 99%
“…(Liu et al, 2011b;Martinez et al, 2014). Behavioral methods aim to recognize behavioral attributes, non-verbal multimodal cues, such as gestures, facial expressions, and auditory cues (Song et al, 2012a;Vrigkas et al, 2014b). Finally, social networking methods model the characteristics and the behavior of humans in several layers of human-to-human interactions in social events from gestures, body motion, and speech (Patron-Perez et al, 2012;Marín-Jiménez et al, 2014).…”
Section: Previous Surveys and Taxonomiesmentioning
confidence: 99%
“…Indeed, it is evident that the difference in appearance between various activities is more noticeable than among people performing the same activity. So, techniques based on the appearance of body parts are proposed as in [203]. (d) 3D models: They attempt to build matches between characteristics of the model based on various features of the images.…”
Section: I) First Stage Methods (Detection)mentioning
confidence: 99%
“…Parliament [55]: This dataset is a collection of 228 video sequences, depicting political speeches in the Greek parliament, at a resolution of 320 × 240 pixels at 25 fps. The video sequences were manually labeled with one of three behavioral labels: friendly, aggressive, or neutral.…”
Section: Datasetsmentioning
confidence: 99%