Joe Yue-Hei Ng scite author profile

Convolutional neural networks (CNNs) have been extensively applied for image recognition problems giving stateof-the-art results on recognition, detection, segmentation and retrieval. In this work we propose and evaluate several deep neural network architectures to combine image information across a video over longer time periods than previously attempted. We propose two methods capable of handling full length videos. The first method explores various convolutional temporal feature pooling architectures, examining the various design choices which need to be made when adapting a CNN for this task. The second proposed method explicitly models the video as an ordered sequence of frames. For this purpose we employ a recurrent neural network that uses Long Short-Term Memory (LSTM) cells which are connected to the output of the underlying CNN. Our best networks exhibit significant performance improvements over previously published results on the Sports 1 million dataset (73.1% vs. 60.9%) and the UCF-101 datasets with (88.6% vs. 88.0%) and without additional optical flow information (82.6% vs. 73.0%).

show abstract

ActionFlowNet: Learning Motion Representation for Action Recognition

Choi

Neumann

et al. 2018

View full text Add to dashboard Cite

FASON: First and Second Order Information Fusion Network for Texture Recognition

Dai

Davis

2017

View full text Add to dashboard Cite

Beyond Short Snippets: Deep Networks for Video Classification

Ng¹,

Hausknecht²,

Vijayanarasimhan³

et al. 2015

Preprint

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Joe Yue-Hei Ng

Exploiting local features from deep networks for image retrieval

Beyond short snippets: Deep networks for video classification

ActionFlowNet: Learning Motion Representation for Action Recognition

FASON: First and Second Order Information Fusion Network for Texture Recognition

Beyond Short Snippets: Deep Networks for Video Classification

Contact Info

Product

Resources

About