Proceedings of the British Machine Vision Conference 2014 2014
DOI: 10.5244/c.28.100
|View full text |Cite
|
Sign up to set email alerts
|

Real-time Activity Recognition by Discerning Qualitative Relationships Between Randomly Chosen Visual Features

Abstract: Motivation. Automatic recognition of human activities (or events) from video is important to many potential applications of computer vision. One of the most common approach is the bag-of-visual-features, which aggregate space-time features globally, from the entire video clip containing complete execution of a single activity. The bag-of-visual-features does not encode the spatio-temporal structure in the video. For this reason, there is a growing interest in modeling spatio-temporal structure between visual f… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2015
2015
2021
2021

Publication Types

Select...
5

Relationship

1
4

Authors

Journals

citations
Cited by 5 publications
(3 citation statements)
references
References 39 publications
0
3
0
Order By: Relevance
“…To address this, we use coarse bandwidth (t c = T /3). It allows f g to focus on different temporal parts of a video, motivated by [4] that uses before, during and after to capture the temporal relationships in a video. Moreover, driver secondary activities often involve human-object interactions (e.g.…”
Section: Glimpse Sensormentioning
confidence: 99%
“…To address this, we use coarse bandwidth (t c = T /3). It allows f g to focus on different temporal parts of a video, motivated by [4] that uses before, during and after to capture the temporal relationships in a video. Moreover, driver secondary activities often involve human-object interactions (e.g.…”
Section: Glimpse Sensormentioning
confidence: 99%
“…A large number of HAR methods are based on SIFT features [22] and its extensions [23]. For example, Behera et al proposed a random forest that unifies randomization, discriminative relationships mining and a Markov temporal structure for real-time activity recognition with SIFT features [24]. However, SIFT features only encode appearance information and are not able to represent temporal information.…”
Section: Human Activity Recognitionmentioning
confidence: 99%
“…In 2014, Arbab-Zavar et al (Arbab-Zavar et al, 2014) exploited shape and motion features extracted from an overhead video in order to identify highly structured tasks and activities within a car manufacturing plant. A Markov temporal structure based decision system has been proposed in (Behera et al, 2014) to model spatio-temporal relationships during object manipulations tasks and has been tested for continuous activity recognition in assembling a pump system. Yet, in ORs, dozens of tasks are carried out by many different people and cannot be defined as easily as in strictly designed industrial environments.…”
Section: Related Workmentioning
confidence: 99%