The MoCA dataset, kinematic and multi-view visual streams of fine-grained cooking actions

Nicora, Elena; Goyal, Gaurvi; Vignolo, Alessia; Sciutti, Alessandra; Odone, Francesca

doi:10.1038/s41597-020-00776-9

Cited by 18 publications

(17 citation statements)

References 34 publications


“…Atomic actions 12 are movements of a person describing a certain motion that can be part of more complex activities, whereas gestures are considered as primitive movements of the body. Activity is hierarchically defined as a sequence of actions 13 . The first part of the experimentation consisted of the training phase, in which each participant had to simulate 10 different actions, sequentially.…”

Section: Methods (mentioning)

confidence: 99%

“…[C]olor video cameras are also often used in human activity recognition to monitor several human people activities. As an example, the Multiview Cooking Actions dataset (MoCA) 5 is a bi-modal dataset in which six VICON infrared cameras are used to collect Motion Capture data and video sequences from multiple views of upper body actions in a cooking scenario. Even if this approach is very interesting, it is based on a very complex and cumbersome system such as the VICON, which cannot be transported and easily installed in a domestic environment.…”

Section: Background and Summary (mentioning)

confidence: 99%


The VISTA datasets, a combination of inertial sensors and depth cameras data for activity recognition

et al. 2022


This paper makes the VISTA database, composed of inertial and visual data, publicly available for gesture and activity recognition. The inertial data were acquired with the SensHand, which can capture the movement of the wrist, thumb, index and middle fingers, while the RGB-D visual data were acquired simultaneously from two different points of view, front and side. The VISTA database was acquired in two experimental phases: in the former, the participants were asked to perform 10 different actions; in the latter, they had to execute five scenes of daily living, each corresponding to a combination of the selected actions. In both phases, Pepper interacted with the participants. The two camera points of view mimic the different points of view of Pepper. Overall, the dataset includes 7682 action instances for the training phase and 3361 action instances for the testing phase. It can be seen as a framework for future studies on artificial intelligence techniques for activity recognition, including inertial-only data, visual-only data, or a sensor fusion approach.


“…Indeed, this is a novel dataset: systematically searching Scopus and Web of Science reveals very few datasets including food flipping movements 16–18. They include varied tasks, but each one recorded in a single condition (e.g., always with the same food).…”

Section: Background and Summary (mentioning)

confidence: 99%

“…Like ours, their dataset contains position/orientation and force/torque signals. The third dataset 18 includes an action of “turning pancakes” but, once more, without a utensil: throwing the pancake with the pan. The three datasets also differ from ours on the context in which the task was performed.…”

Section: Background and Summary (mentioning)

confidence: 99%

“…Moreover, most published experiments use inorganic food models for the study of cooking tasks, e.g. cardboard/plastic pancakes 18, 19, polystyrene cakes or even empty eggshells 16. However, these are incomplete representations of real cooked food, because they lack the important ability of releasing fluids (water, fat) and hardly reproduce the deformability of food and the stickiness of its compounds (e.g.…”

Section: Background and Summary (mentioning)

confidence: 99%


Flipping food during grilling tasks, a dataset of utensils kinematics and dynamics, food pose and subject gaze

et al. 2022


This paper presents a multivariate dataset of 2866 food flipping movements, performed by 4 chefs and 5 home cooks, with different grilled foods and two utensils (spatula and tweezers). The 3D trajectories of strategic points on the utensils were tracked using optoelectronic motion capture. The pinching force of the tweezers and the bending force and torsion torque of the spatula were also recorded, as well as videos and the subject's gaze. These data were collected using a custom experimental setup that allowed the execution of flipping movements with freshly cooked food, without having the sensors near the dangerous cooking area. In addition, the 2D position of the food was computed from the videos. The action of flipping food is gaining the attention of both researchers and manufacturers of foodservice technology. The reported dataset contains valuable measurements (1) to characterize and model flipping movements as performed by humans, (2) to develop bio-inspired methods to control a cooking robot, or (3) to study new algorithms for human action recognition.


GCK-Maps: A Scene Unbiased Representation for Efficient Human Action Recognition

Nicora, Pastore, Noceti 2023

Lecture Notes in Computer Science
