2018 19th IEEE/ACIS International Conference on Software Engineering, Artificial Intelligence, Networking and Parallel/Distribu 2018
DOI: 10.1109/snpd.2018.8441034
|View full text |Cite
|
Sign up to set email alerts
|

Multi-label Class-imbalanced Action Recognition in Hockey Videos via 3D Convolutional Neural Networks

Abstract: Automatic analysis of the video is one of most complex problems in the fields of computer vision and machine learning. A significant part of this research deals with (human) activity recognition (HAR) since humans, and the activities that they perform, generate most of the video semantics. Video-based HAR has applications in various domains, but one of the most important and challenging is HAR in sports videos. Some of the major issues include high inter-and intra-class variations, large class imbalance, the p… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
31
0
1

Year Published

2018
2018
2021
2021

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 47 publications
(32 citation statements)
references
References 20 publications
0
31
0
1
Order By: Relevance
“…Video summarized using the proposed system received a good score of 4 out of 5 from the participants. Sozykin K et al [17] proposed a 3D CNN-based multi-label deep Human Action Recognition (HAR) system for sports video summarization for the sport of Hockey and presented more than ten classes. Data pre-processing techniques like resizing, normalization, windowing, and sequence labeling were used.…”
Section: Scene Classification Via Deep-learning Approachmentioning
confidence: 99%
“…Video summarized using the proposed system received a good score of 4 out of 5 from the participants. Sozykin K et al [17] proposed a 3D CNN-based multi-label deep Human Action Recognition (HAR) system for sports video summarization for the sport of Hockey and presented more than ten classes. Data pre-processing techniques like resizing, normalization, windowing, and sequence labeling were used.…”
Section: Scene Classification Via Deep-learning Approachmentioning
confidence: 99%
“…They used pre-trained CNN to first extract the features then use LSTM for classification of the five types of events which are dump in, dump out, loose puck recovery, pass and shot. Sozykin et al [35] presented a 3D CNN based action recognition system for multi-class imbalanced in ice hockey. They first extract features from both single image and a slice of frames using CNN.…”
Section: Deep Learning Architecture In Sport Video Analysismentioning
confidence: 99%
“…The use of multiple labels to represent multiple actions has grown in popularity due to the interest in detecting and recognizing simultaneous activity in videos. For example, concurrent action recognition in hockey videos [1] could indicate that a 'Play', 'Face Off' and 'Fight' took place at the same time. Similarly, the ability to tag multiple facial expressions in videos can be accomplished using multilabels to detect emotions, a crucial component of HCI [2].…”
Section: Introductionmentioning
confidence: 99%
“…Similarly, the ability to tag multiple facial expressions in videos can be accomplished using multilabels to detect emotions, a crucial component of HCI [2]. However, in all of the aforementioned references, an action or a combination of actions is assigned to a label, which in turn is given a binary assignment representing its absence (0) or its presence (1).…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation