Automated aerial suspended cargo delivery through reinforcement learning

Faust, Aleksandra; Palunko, Ivana; Cruz, Patricio J.; Fierro, Rafael; Tapia, Lydia

doi:10.1016/j.artint.2014.11.009

Cited by 139 publications

(82 citation statements)

References 24 publications

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…Since manually accounting for all possibilities is often infeasible, sampling-based, learning-based, and other intelligent methods are the norm [18]. Reinforcement learning (RL), in particular, has been successful for robotic task learning [15] in several problems such as table tennis [21], swing-free UAV delivery [7], and a self-driving car [12]. However, traditional RL methods do not handle continuous and high-dimensional state spaces well [11].…”

Section: Introductionmentioning

confidence: 99%

“…Because the method learns and performs the task in the feature space, such transfer is possible. Previously, we empirically showed that, using hand-crafted features, batch RL learns in small spaces and acts on larger problems [8], [7], but did not address when learning transfer is possible and how to do it for an arbitrary problem. This paper formalizes the feature selection and the conditions under which transfer is possible so that it can be applied for classes of PBTs.…”

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Avoiding moving obstacles with stochastic hybrid dynamics using PEARL: PrEference Appraisal Reinforcement Learning

Faust

Chiang

Rackley

et al. 2016

2016 IEEE International Conference on Robotics and Automation (ICRA)

Self Cite

View full text Add to dashboard Cite

Manual derivation of optimal robot motions for task completion is difficult, especially when a robot is required to balance its actions between opposing preferences. One solution has been proposed to automatically learn near optimal motions with Reinforcement Learning (RL). This has been successful for several tasks including swing-free UAV flight, table tennis, and autonomous driving. However, highdimensional problems remain a challenge. We address this dimensionality constraint with PrEference Appraisal Reinforcement Learning (PEARL), which solves tasks with opposing preferences for acceleration controlled robots. PEARL projects the high-dimensional continuous robot state space to a low dimensional preference feature space resulting in efficient and adaptable planning. We demonstrate that on a dynamic obstacle avoidance robotic task, a single learning on a much simpler problem performs real-time decision-making for significantly larger, high-dimensional problems working in unbounded continuous states and actions. We trained the agent with 4 static obstacles, while the trained agent avoids up to 900 moving obstacles with complex hybrid stochastic obstacle dynamics in a highly constrained space using only limited information about the environment. We compare these tasks to traditional, often manually tuned solutions for these high-dimensional problems.

show abstract

Section: Introductionmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

Avoiding moving obstacles with stochastic hybrid dynamics using PEARL: PrEference Appraisal Reinforcement Learning

Faust

Chiang

Rackley

et al. 2016

2016 IEEE International Conference on Robotics and Automation (ICRA)

Self Cite

View full text Add to dashboard Cite

show abstract

“…Papers [13] and [16] also deal with learning, but put a strong emphasis on managing the complexity of doing so, thus addressing RC3. [13] introduces a scalable methodology to learn and transfer knowledge of the transition (and reward) models for model-based reinforcement learning in a complex world.…”

Section: Papers Focusing On Ca1: Robots That Knowmentioning

confidence: 99%

“…The authors use a formulation of Markov decision processes that support efficient online-learning of relevant problem features in order to approximate world dynamics. [16] presents a reinforcement learning approach for aerial cargo delivery tasks in environments with static obstacles. The authors plan and create swing-free trajectories with bounded load displacements.…”

Section: Papers Focusing On Ca1: Robots That Knowmentioning

confidence: 99%

Towards a science of integrated AI and Robotics

Rajan

Saffiotti

2017

Artificial Intelligence

126

View full text Add to dashboard Cite

The early promise of the impact of machine intelligence did not involve the partitioning of the nascent field of Artificial Intelligence. The founders of AI envisioned the notion of embedded intelligence as being conjoined between perception, reasoning and actuation. Yet over the years the fields of AI and Robotics drifted apart. Practitioners of AI focused on problems and algorithms abstracted from the real world. Roboticists, generally with a background in mechanical and electrical engineering, concentrated on sensorimotor functions. That divergence is slowly being bridged with the maturity of both fields and with the growing interest in autonomous systems. This special issue brings together the state of the art and practice of the emergent field of integrated AI and Robotics, and highlights the key areas along which this current evolution of machine intelligence is heading.

show abstract

“…Reinforcement learning has already been applied to quadrotors with a slung load. 3,[12][13][14] All these studies use a model of the system dynamics. In this paper, a two-dimensional version of this slung load problem is used with a model-free reinforcement learning controller.…”

Section: Introductionmentioning

confidence: 99%

Human Demonstrations for Fast and Safe Exploration in Reinforcement Learning

Schonebaum

Junell

Kampen

2017

AIAA Information Systems-Aiaa Infotech @ Aerospace

View full text Add to dashboard Cite

Reinforcement learning is a promising framework for controlling complex vehicles with a high level of autonomy, since it does not need a dynamic model of the vehicle, and it is able to adapt to changing conditions. When learning from scratch, the performance of a reinforcement learning controller may initially be poor and -for real life applicationsunsafe. In this paper the effects of using human demonstrations on the performance of reinforcement learning is investigated, using a combination of offline and online least squares policy iteration. It is found that using the human as an efficient explorer improves learning time and performance for a benchmark reinforcement learning problem. The benefit of the human demonstration is larger for problems where the human can make use of its understanding of the problem to efficiently explore the state space. Applied to a simplified quadrotor slung load drop off problem, the use of human demonstrations reduces the number of crashes during learning. As such, this paper contributes to safer and faster learning for model-free, adaptive control problems.

show abstract

Automated aerial suspended cargo delivery through reinforcement learning

Cited by 139 publications

References 24 publications

Avoiding moving obstacles with stochastic hybrid dynamics using PEARL: PrEference Appraisal Reinforcement Learning

Avoiding moving obstacles with stochastic hybrid dynamics using PEARL: PrEference Appraisal Reinforcement Learning

Towards a science of integrated AI and Robotics

Human Demonstrations for Fast and Safe Exploration in Reinforcement Learning

Contact Info

Product

Resources

About