2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2016
DOI: 10.1109/cvpr.2016.216
|View full text |Cite
|
Sign up to set email alerts
|

A Multi-stream Bi-directional Recurrent Neural Network for Fine-Grained Action Detection

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
322
0
1

Year Published

2017
2017
2021
2021

Publication Types

Select...
3
2
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 420 publications
(323 citation statements)
references
References 21 publications
0
322
0
1
Order By: Relevance
“…Fully-supervised Learning Approaches: In the third category, the action segmentation task has been explored by numbers of works by developing various types of network architectures. For example, multi-stream bi-directional recurrent neural network (MSB-RNN) [62], temporal deformable residual network 1. The language signal should not be treated as supervision since the steps are not directly given, but need to be further explored in an unsupervised manner.…”
Section: Methods For Instructional Video Analysismentioning
confidence: 99%
“…Fully-supervised Learning Approaches: In the third category, the action segmentation task has been explored by numbers of works by developing various types of network architectures. For example, multi-stream bi-directional recurrent neural network (MSB-RNN) [62], temporal deformable residual network 1. The language signal should not be treated as supervision since the steps are not directly given, but need to be further explored in an unsupervised manner.…”
Section: Methods For Instructional Video Analysismentioning
confidence: 99%
“…Aside from using handcrafted features, approaches have been introduced using deep networks. Singh et al [45] introduced a multi-stream bi-directional recurrent neural network utilising both spatial and temporal information; while Lea et al [24] incorporates a spatio-temporal CNN with a constrained segmental model. In [23], the authors have introduced temporal convolutional networks (TCN) for fine grained action detection and segmentation.…”
Section: Related Workmentioning
confidence: 99%
“…Here the second GAN is coupled as an auxiliary network, which takes supplementary information. This supplementary information may vary across datasets; for instance we use depth information for the 50 salads dataset [47] and optical flow for MERL shopping [45] and Georgia Tech Egocentric activity [9] datasets. Both GANs aim to generate realistic action codes to fool their respective discriminators using their differing inputs, and the coupled adversarial loss can be defined as,…”
Section: Coupling Multi-model Informationmentioning
confidence: 99%
“…RNNs, and in particular Long Short-Term Memorys (LSTMs, which are explained in detail in Sec. 3.2.1) have demonstrated potential in computer vision for analysis of dynamic systems [15,16,17,18,19]. In this study we utilize LSTMs to carefully model the growth patterns of plants.…”
Section: Introductionmentioning
confidence: 99%
“…RNNs (and LSTMs in particular) are able to grasp and learn long-range and complex dynamics and have recently become very popular for the task of activity recognition. More specifically, [15,16,17,18,19] used LSTM in conjunction with CNN for action and activity recognition were shown to provide a significant improvement in performance over previous studies of video data. In this paper, we treat the growth and development of plants as an action recognition problem, and use CNN for extracting discriminative features, and LSTM for encoding the growth behavior of the plants.…”
Section: Introductionmentioning
confidence: 99%