2014
DOI: 10.1016/j.patrec.2014.07.014
|View full text |Cite
|
Sign up to set email alerts
|

Bag-of-words with aggregated temporal pair-wise word co-occurrence for human action recognition

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
5
0

Year Published

2017
2017
2023
2023

Publication Types

Select...
6
1
1

Relationship

1
7

Authors

Journals

citations
Cited by 10 publications
(5 citation statements)
references
References 40 publications
0
5
0
Order By: Relevance
“…Bag of words (BoW) [40] is most likely used as a feature representation scheme for the still images and the videos in HAR. The bag of visual words, also known as the BoW, is a symbolic scheme used to symbolize the documents for retrieval purposes.…”
Section: Bowmentioning
confidence: 99%
See 1 more Smart Citation
“…Bag of words (BoW) [40] is most likely used as a feature representation scheme for the still images and the videos in HAR. The bag of visual words, also known as the BoW, is a symbolic scheme used to symbolize the documents for retrieval purposes.…”
Section: Bowmentioning
confidence: 99%
“…The Bi-GRU [40] uses exceptional gates (ut), known as reset and update gates for the declining gradient dispersion with smaller loss. The ut substitute input and forget gate of the LSTM, which depict the preservation degree of the preceding data.…”
Section: Bi-gru Variant Of Rnn Based Recognitionmentioning
confidence: 99%
“…In addition, Equation (6) shows an equation for calculating the Fmeasure using accuracy and recall. The performance evaluation uses the proposed miningbased mutual information (MbMI), existing mining-based word frequency (MbWF) [37,38], word concurrence frequency (WCoF) [39,40] in the document to find the relationship between words. It performs performance evaluation while repeatedly changing minimum support.…”
Section: B Performance Evaluationmentioning
confidence: 99%
“…Since the effectiveness of the hand-crafted features directly influences the overall recognition performance, a lot of well-designed features were proposed, whose instances include HoG [11], histogram of optical flows (HoF) [13], 3D Harris corners [14], 3D scale invariant feature transform (3D SIFT) [15], and so on. These features are generally extracted from spatiotemporal local regions in an input video and integrated into a video-wise feature vector with bags of visual word (BoVW) strategy [16].…”
Section: A Supervised Activity Video Analysismentioning
confidence: 99%