Motion of Oriented Magnitudes Patterns for Human Action Recognition

Phan, Hai-Hong; Vu, Ngoc-Son; Nguyen, Vu-Lam; Quoy, Mathias

doi:10.1007/978-3-319-50832-0_17

Cited by 7 publications

(15 citation statements)

References 27 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…By several hundred test cases where the recognition rates are calculated on 300 KTH action videos with different parameters, we found that the optimal parameters, as follows: Feature extraction – MOMP descriptor: we follow the same experimental settings as in [23]: o Step 1: Number of orientations

d = 5

is used to compute gradient and quantise orientation. o Step 2: We accumulate the magnitude of the pixels from their neighbours by Gaussian filter with the kernel size

5 \times 5

and the standard deviation

σ = 1

. o Step 3: We encode the features based on LTP‐based self‐similarity and the SSD of gradient magnitudes among successive frames. To calculate SSD, we choose a cell size

r \times r

r = 3

; the number of neighbouring cells in each block

n = 8

and a threshold

T = r \times r \times τ^{2}

(where

r \times r

is cell size, the threshold

τ

is varying from 5 to 7).…”

Section: Resultsmentioning

confidence: 99%

“…In order to evaluate the contribution of the feature post‐processing – PCA and feature selection – some filter methods, we compare the results on different systems separately in ours 1, ours 2, and ours 3. Ours 1 [23], which integrates our descriptor MOMP to VLAD and rbf‐SVM, obtains 94.4%. In comparison to ours 1, ours 2 has a better result (95.4%).…”

Section: Resultsmentioning

confidence: 99%

“…The features encode the action across different orientations of consecutive frames by the local gradient distributions. MOMP features are calculated as follows (see the detail of MOMP in [23]): Step 1: We calculate and quantise the gradient of each frame based on Haar features. We use soft assignment technique to decrease the loss of quantisation (Fig.…”

Section: Proposed Methodsmentioning

confidence: 99%

“…98.4 [12] 91.8 Klaser et al [19] 91.4 Yeffet and Wolf (LTP) [44] 90.1 Kovashka and Grauman [45] 94.5 Taylor et al (CNN) [46] 90.0 Le et al (CNN) [2] 93.9 Wang et al [47] 93.8 Action Bank [48] 98.2 Jiang et al [49] 95.8 Kliper-Gross et al (MIP) [50] 93.0 Zhou and Wang [51] 93.3 Wang et al [13] 94.2 Shabani et al [52] 95.3 Ji et al (CNN) [1] 90.2 Iosifidis et al [53] 92.1 Liu et al [54] 93.5 ours 1 (MOMP + VLAD + rbf-SVM) [23] 94. 4 We use a leave-one-out scheme to test the efficiency of our approach in recognition, i.e.…”

Section: Ours 3 (Momp + Pca + Vlad + Fs + Linear Svm)mentioning

confidence: 99%

See 3 more Smart Citations

Action recognition based on motion of oriented magnitude patterns and feature selection

Phan

Nguyen

et al. 2018

IET Computer Vision

Self Cite

View full text Add to dashboard Cite

Here, the authors introduce a novel system which incorporates the discriminative motion of oriented magnitude patterns (MOMP) descriptor into simple yet efficient techniques. The authors' descriptor both investigates the relations of the local gradient distributions in neighbours among consecutive image sequences and characterises information changing across different orientations. The proposed system has two main contributions: (i) the authors adopt feature post-processing principal component analysis followed by vector of locally aggregated descriptors encoding to de-correlate MOMP descriptor and reduce the dimension in order to speed up the algorithm; (ii) then the authors include the feature selection (i.e. statistical dependency, mutual information, and minimal redundancy maximal relevance) to find out the best feature subset to improve the performance and decrease the computational expense in classification through support vector machine techniques. Experiment results on four data sets, Weizmann (98.4%), KTH (96.3%), UCF Sport (82.0%), and HMDB51 (31.5%), prove the efficiency of the authors' algorithm.

show abstract

d = 5

is used to compute gradient and quantise orientation. o Step 2: We accumulate the magnitude of the pixels from their neighbours by Gaussian filter with the kernel size

5 \times 5

and the standard deviation

σ = 1

. o Step 3: We encode the features based on LTP‐based self‐similarity and the SSD of gradient magnitudes among successive frames. To calculate SSD, we choose a cell size

r \times r

r = 3

; the number of neighbouring cells in each block

n = 8

and a threshold

T = r \times r \times τ^{2}

(where

r \times r

is cell size, the threshold

τ

is varying from 5 to 7).…”

Section: Resultsmentioning

confidence: 99%

Section: Resultsmentioning

confidence: 99%

Section: Proposed Methodsmentioning

confidence: 99%

Section: Ours 3 (Momp + Pca + Vlad + Fs + Linear Svm)mentioning

confidence: 99%

See 2 more Smart Citations

Action recognition based on motion of oriented magnitude patterns and feature selection

Phan

Nguyen

et al. 2018

IET Computer Vision

Self Cite

View full text Add to dashboard Cite

show abstract

“…Based on the taxonomy proposed by Goodfellow et al [26], the approaches to this problem can be categorized into two groups: (i) traditional methods [6,56,77,92], where the action representation is explicitly chosen and the action recognition is defined under conventional machine learning algorithms, and (ii) representation-learning strategies that explore machine learning techniques for both tasks. The latter includes shallow approaches, such as dictionary-based methods [44,55,57,83], and deep learning strategies [33,34,35,50,59,67].…”

Section: Chapter 1 Introductionmentioning

confidence: 99%

Multi-strange hadrons in Pb-Pb collisions at the LHC with ALICE /

Concha¹

View full text Add to dashboard Cite

Avanços na tecnologia digital aumentaram as capacidades de reconhecimento de eventos por meio do desenvolvimento de dispositivos com alta resolução, pequenas dimensões físicas e altas taxas de amostragem. O reconhecimento de eventos complexos em vídeos possui várias aplicações relevantes, particularmente devido à grande disponibilidade de câmeras digitais em ambientes como aeroportos, bancos, estradas, entre outros. A grande quantidade de dados produzidos é o cenário ideal para o desenvolvimento de métodos automáticos baseados em aprendizado de máquina profundo. Apesar do progresso significativo alcançado com as redes neurais profundas aplicadas a imagens, a compreensão do conteúdo de vídeos ainda enfrenta desafios na modelagem de relações espaço-temporais. Nesta dissertação, o problema do reconhecimento de ações humanas em vídeos foi investigada. Uma rede de múltiplos canais é a arquitetura de escolha para incorporar informações temporais, uma vez que se pode beneficiar de redes profundas pré-treinadas para imagens e de características tradicionais para inicialização. Além disso, seu custo de treinamento é geralmente menor do que o das redes neurais para vídeos. Imagens de ritmo visual são exploradas, pois codificam informações de longo prazo quando comparadas a quadros estáticos e fluxo ótico. Um novo método baseado em rastreamento de pontos é deesnvolvido para decidir a melhor direção do ritmo visual para cada vídeo. Além disso, redes neurais recorrentes foram treinadas a partir das características extraídas dos canais da arquitetura proposta. Experimentos conduzidos nas desafiadoras bases de dados públicas UCF101 e HMDB51 mostraram que a abordagem é capaz de melhorar o desempenho da rede, alcançando taxas de acurácia comparáveis aos métodos da literatura. Embora os ritmos visuais sejam originalmente criados a partir de imagens RGB, outros tipos de fontes e estratégias para sua criação são explorados e discutidos, tais como fluxo ótico, gradientes de imagem e histogramas de cores.

show abstract

Information theory based pruning for CNN compression and its application to image classification and action recognition

Phan

2019

2019 16th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS)

View full text Add to dashboard Cite

Motion of Oriented Magnitudes Patterns for Human Action Recognition

Cited by 7 publications

References 27 publications

Action recognition based on motion of oriented magnitude patterns and feature selection

Action recognition based on motion of oriented magnitude patterns and feature selection

Multi-strange hadrons in Pb-Pb collisions at the LHC with ALICE /

Information theory based pruning for CNN compression and its application to image classification and action recognition

Contact Info

Product

Resources

About