2021 IEEE International Conference on Robotics and Automation (ICRA) 2021
DOI: 10.1109/icra48506.2021.9561446
|View full text |Cite
|
Sign up to set email alerts
|

droidlet: modular, heterogenous, multi-modal agents

Abstract: In recent years, there have been significant advances in building end-to-end Machine Learning (ML) systems that learn at scale. But most of these systems are: (a) isolated (perception, speech, or language only); (b) trained on static datasets. On the other hand, in the field of robotics, large-scale learning has always been difficult. Supervision is hard to gather and real world physical interactions are expensive.In this work we introduce and open-source droidlet, a modular, heterogeneous agent architecture a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(3 citation statements)
references
References 30 publications
0
3
0
Order By: Relevance
“…With some probability, the agent executes a command (to either: build an object, destroy an object, move to a location, dig a hole, follow an NPC, etc.) The execution of a command is scripted; the task executor is from the Minecraft agent in (Pratik et al 2021). Whether or not the agent executes a task, the world steps a fixed number of times (so, e.g., NPCs may move or act).…”
Section: Environmentmentioning
confidence: 99%
See 2 more Smart Citations
“…With some probability, the agent executes a command (to either: build an object, destroy an object, move to a location, dig a hole, follow an NPC, etc.) The execution of a command is scripted; the task executor is from the Minecraft agent in (Pratik et al 2021). Whether or not the agent executes a task, the world steps a fixed number of times (so, e.g., NPCs may move or act).…”
Section: Environmentmentioning
confidence: 99%
“…Following (Pratik et al 2021), the environment is presented to the agent as an object-centered key-value store. Each object, NPC, and the agent's self have a "memid" keying a data structure that depends on the object type, and may contain string data (for example a name) or float or integer data (e.g.…”
Section: Environmentmentioning
confidence: 99%
See 1 more Smart Citation