Development of a Basic Educational Kit for Robotic System with Deep Neural Networks

Kanamura, Momomi; Suzuki, Kanata; Suga, Yuki; Ogata, Tetsuya

doi:10.3390/s21113804

Cited by 5 publications

(3 citation statements)

References 30 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The proposed framework comprises lightweight models, and the computational time and cost required for motion generation are low. Moreover, implementing each of the previous functions as components makes it possible to easily reuse the implemented system when tasks or robot hardware changes, or devices are added [15].…”

Section: Motion Generationmentioning

confidence: 99%

Composition of Robot Motions based on the Concept of Deep Predictive Learning

Suzuki

Ito

Yamada

et al. 2022

JRSJ

Self Cite

View full text Add to dashboard Cite

A deep learning-based approach can generalize model performance while reducing feature design costs by learning end-to-end environment recognition and motion generation. However, the process incurs huge training data collection costs and time and human resources for trial-and-error when involving physical contact with robots. Therefore, we propose "deep predictive learning," a motion learning concept that assumes imperfections in the predictive model and minimizes the prediction error with the real-world situation. Deep predictive learning is inspired by the "free energy principle and predictive coding theory," which explains how living organisms behave to minimize the prediction error between the real world and the brain. Robots predict near-future situations based on sensorimotor information and generate motions that minimize the gap with reality. The robot can flexibly perform tasks in unlearned situations by adjusting its motion in realtime while considering the gap between learning and reality. This paper describes the concept of deep predictive learning, its implementation, and examples of its application to real robots. The code and document are available at https: //ogata-lab.github.io/eipl-docs

show abstract

Section: Motion Generationmentioning

confidence: 99%

Composition of Robot Motions based on the Concept of Deep Predictive Learning

Suzuki

Ito

Yamada

et al. 2022

JRSJ

Self Cite

View full text Add to dashboard Cite

show abstract

“…Next, the steps from 3 to 8 are performed in exactly the same manner except for the opening and closing of the gripper. The robot reaches its hand out to the exact same place as in 5 , makes the pregrasp posture at 10 , grasps the object at 11 , and returns to the initial posture. By repeating this procedure many times, it is possible for the robot to autonomously collect data.…”

Section: B Data Collection and Training Experimentsmentioning

confidence: 99%

“…This is based on the fact that while it is difficult for the robot to directly grasp an object by visual recognition, it can grasp an object by reaching out to the exact same place if the object had been placed by itself. There is a similar data collection method [11], but it focuses only on automatic data collection for a rigid robot, and its goal is different from this study, which utilizes the motion reproducibility of a low-rigidity robot. Although our data collection method is limited to pick-and-place tasks, it is a basic motion common to various tasks, and we believe that it would be useful for cost effective robot that cannot move accurately due to low rigidity to learn such tasks autonomously.…”

Section: Introductionmentioning

confidence: 99%

Self-Supervised Learning of Visual Servoing for Low-Rigidity Robots Considering Temporal Body Changes

Kawaharazuka

Kanazawa

Okada

et al. 2022

IEEE Robot. Autom. Lett.

View full text Add to dashboard Cite

Recognition of the current state is indispensable for the operation of a robot. There are various states to be recognized, such as whether an elevator door is open or closed, whether an object has been grasped correctly, and whether the TV is turned on or off. Until now, these states have been recognized by programmatically describing the state of a point cloud or raw image, by annotating and learning images, by using special sensors, etc. In contrast to these methods, we apply Visual Question Answering (VQA) from a Pre-Trained Vision-Language Model (PTVLM) trained on a large-scale dataset, to such binary state recognition. This idea allows us to intuitively describe state recognition in language without any re-training, thereby improving the recognition ability of robots in a simple and general way. We summarize various techniques in questioning methods and image processing, and clarify their properties through experiments.

show abstract