2014
DOI: 10.1016/j.cviu.2014.06.014
|View full text |Cite
|
Sign up to set email alerts
|

Evaluation of video activity localizations integrating quality and quantity measurements

Abstract: Evaluating the performance of computer vision algorithms is classically done by reporting classification error or accuracy, if the problem at hand is the classification of an object in an image, the recognition of an activity in a video or the categorization and labeling of the image or video. If in addition the detection of an item in an image or a video, and/or its localization are required, frequently used metrics are Recall and Precision, as well as ROC curves. These metrics give quantitative performance v… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
41
0

Year Published

2015
2015
2020
2020

Publication Types

Select...
5
1
1
1

Relationship

0
8

Authors

Journals

citations
Cited by 70 publications
(41 citation statements)
references
References 28 publications
0
41
0
Order By: Relevance
“…An initial research was conducted to analyze several datasets from different sources like LIRIS (Laboratoire d'InfoRmatique en Image et Systèmes d'information) dataset [19], CMU (Carnegie Mellon University) MoCap dataset 5 , MSR-Action3D and MSRDailyActivity3D dataset [13] and verify it's suitability to our problem. All these datasets contain only isolated actions, and for our task we require sequences of actions.…”
Section: Temporal Segmentationmentioning
confidence: 99%
“…An initial research was conducted to analyze several datasets from different sources like LIRIS (Laboratoire d'InfoRmatique en Image et Systèmes d'information) dataset [19], CMU (Carnegie Mellon University) MoCap dataset 5 , MSR-Action3D and MSRDailyActivity3D dataset [13] and verify it's suitability to our problem. All these datasets contain only isolated actions, and for our task we require sequences of actions.…”
Section: Temporal Segmentationmentioning
confidence: 99%
“…The results also suggest that it outperforms the original MIML model [33], the state-ofthe-art weakly supervised approaches [25,24,18], as well as fully supervised methods [4,29,22,32] in the literature across the three datasets. The paper is organised as follows: Section 2 provides a review of related work for MIML techniques and weakly supervised action detection; Section 3 details the feature representation of video under the weakly supervised setting; Section 4 formulates the proposed framework and introduces the generation of instances and bags in the setting of weakly supervised action detection; Section 5 describes the experiments, such as data and implementation details; Section 6 demonstrates results and analysis of the experiments; 3 finally Section 7 concludes this work and points out possible future work.…”
Section: Introductionmentioning
confidence: 71%
“…With the release of low-cost range cameras such as Kinect (Microsoft), new datasets in the form of 3D depth images were generated for the purpose of human action analysis and recognition. Some examples of public available ones are MSR action 3D [1] and MSR daily activity [2], LIRIS human activity [3], and UT Kinect action [4], etc. These depth images were acquired in much closer ranges (< 4m) than the typical operational range of low-grade commercial LIDARs (80~100m).…”
Section: Discussionmentioning
confidence: 99%