2020 25th International Conference on Pattern Recognition (ICPR) 2021
DOI: 10.1109/icpr48806.2021.9413336
|View full text |Cite
|
Sign up to set email alerts
|

SL-DML: Signal Level Deep Metric Learning for Multimodal One-Shot Action Recognition

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
20
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 32 publications
(20 citation statements)
references
References 23 publications
0
20
0
Order By: Relevance
“…Our PROFORMER model yields state-of-the-art on three datasets [15], [16], [26], surpassing the best published approach on the challenging NTU-120 benchmark by 1.84% for one-shot action recognition. 4) As a side-observation, we discover that PROFORMER optimization strategy is much more resistant to noise corruptions, outperforming the same backbone trained with conventional deep metric learning strategy.…”
Section: Introductionmentioning
confidence: 91%
See 3 more Smart Citations
“…Our PROFORMER model yields state-of-the-art on three datasets [15], [16], [26], surpassing the best published approach on the challenging NTU-120 benchmark by 1.84% for one-shot action recognition. 4) As a side-observation, we discover that PROFORMER optimization strategy is much more resistant to noise corruptions, outperforming the same backbone trained with conventional deep metric learning strategy.…”
Section: Introductionmentioning
confidence: 91%
“…This is impractical in real robotics applications, where data-efficient learning of new concepts on-the-fly remains a key challenge [9]. Problems of learning activity representations which adapt well to new data-scarce categories are often posed in the form of few-shot recognition, where the methods usually fall into one of two categories: (1) meta-learning-based methods [18], [31], [32], [33], which reinitialize a new set of tasks every epoch following the "learning to learn" paradigm and (2) metric-learning-based methods [15], [16], [17], which aim to project the input to a lower-dimensional space, where same-category samples are close to each other and the intercategory ones are far apart. Our approach falls in the latter category.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…With an exception of very recent models [53,54,59,60], FSAR approaches that learn from skeleton-based 3D body joints are scarce. The above situation prevails despite AR from articulated sets of connected body joints, expressed as 3D coordinates, does offer a number of advantages over videos such as (i) the lack of the background clutter, (ii) the volume of data being several orders of magnitude smaller, and (iii) the 3D geometric manipulations of sequences being relatively friendly.…”
Section: Introductionmentioning
confidence: 99%