2020
DOI: 10.1038/s41597-020-00776-9

The MoCA dataset, kinematic and multi-view visual streams of fine-grained cooking actions

Abstract: MoCA is a bi-modal dataset in which we collect Motion Capture data and video sequences acquired from multiple views, including an ego-like viewpoint, of upper body actions in a cooking scenario. It has been collected with the specific purpose of investigating view-invariant action properties in both biological and artificial systems. Besides that, it represents an ideal test bed for research in a number of fields – including cognitive science and artificial vision – and application domains – as motor control a…

Cited by 18 publications (17 citation statements)
References 34 publications
“…Atomic actions 12 are movements of a person describing a certain motion that can be part of more complex activities, whereas gestures are considered as primitive movements of the body. Activity is hierarchically defined as a sequence of actions 13 . The first part of the experimentation consisted of the training phase, in which each participant had to simulate 10 different actions, sequentially.…”
Section: Methods (mentioning)
confidence: 99%
“…olor video cameras are also often used in human activity recognition to monitor several human people activities. As an example, the Multiview Cooking Actions dataset (MoCA) 5 is a bi-modal dataset in which six VICON infrared cameras are used to collect Motion Capture data and video sequences from multiple views of upper body actions in a cooking scenario. Even if this approach is very interesting, it is based on a very complex and cumbersome system such as the VICON, which cannot be transported and easily installed in a domestic environment.…”
Section: Background and Summary (mentioning)
confidence: 99%
“…Indeed, this is a novel dataset: systematically searching Scopus and Web of Science reveals very few datasets including food flipping movements 16 18 . They include varied tasks, but each one recorded in a single condition (e.g., always with the same food).…”
Section: Background and Summary (mentioning)
confidence: 99%
“…Like ours, their dataset contains position/orientation and force/torque signals. The third dataset 18 includes an action of “turning pancakes” but, once more, without a utensil: throwing the pancake with the pan. The three datasets also differ from ours on the context in which the task was performed.…”
Section: Background and Summary (mentioning)
confidence: 99%