Akifumi Wachi scite author profile

Akifumi Wachi

4Publications

46Citation Statements Received

11Citation Statements Given

How they've been cited

How they cite others

Affiliations

IBM (United States), The University of Tokyo, Vaughn College of Aeronautics and Technology

Publications

Order By: Most citations

Failure-Scenario Maker for Rule-Based Agent using Multi-agent Adversarial Reinforcement Learning and its Application to Autonomous Driving

Wachi¹

2019

View full text Add to dashboard Cite

We examine the problem of adversarial reinforcement learning for multi-agent domains including a rule-based agent. Rule-based algorithms are required in safety-critical applications for them to work properly in a wide range of situations. Hence, every effort is made to find failure scenarios during the development phase. However, as the software becomes complicated, finding failure cases becomes difficult. Especially in multi-agent domains, such as autonomous driving environments, it is much harder to find useful failure scenarios that help us improve the algorithm. We propose a method for efficiently finding failure scenarios; this method trains the adversarial agents using multiagent reinforcement learning such that the tested rule-based agent fails. We demonstrate the effectiveness of our proposed method using a simple environment and autonomous driving simulator. 2 We have an option to define a virtual reward for the player. However, it is often difficult to precisely define the (virtual) reward.

show abstract

Safe Exploration and Optimization of Constrained MDPs Using Gaussian Processes

Wachi

Sui

Yue

et al. 2018

AAAI

View full text Add to dashboard Cite

We present a reinforcement learning approach to explore and optimize a safety-constrained Markov Decision Process(MDP). In this setting, the agent must maximize discounted cumulative reward while constraining the probability of entering unsafe states, defined using a safety function being within some tolerance. The safety values of all states are not known a priori, and we probabilistically model them via aGaussian Process (GP) prior. As such, properly behaving in such an environment requires balancing a three-way trade-off of exploring the safety function, exploring the reward function, and exploiting acquired knowledge to maximize reward. We propose a novel approach to balance this trade-off. Specifically, our approach explores unvisited states selectively; that is, it prioritizes the exploration of a state if visiting that state significantly improves the knowledge on the achievable cumulative reward. Our approach relies on a novel information gain criterion based on Gaussian Process representations of the reward and safety functions. We demonstrate the effectiveness of our approach on a range of experiments, including a simulation using the real Martian terrain data.

show abstract

The conceptual design of a novel, small and simple Mars lander

Takahashi

Sakagami

Wachi

et al. 2018

View full text Add to dashboard Cite

Integral design method for simple and small Mars lander system using membrane aeroshell

et al. 2018

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Akifumi Wachi

Failure-Scenario Maker for Rule-Based Agent using Multi-agent Adversarial Reinforcement Learning and its Application to Autonomous Driving

Safe Exploration and Optimization of Constrained MDPs Using Gaussian Processes

The conceptual design of a novel, small and simple Mars lander

Integral design method for simple and small Mars lander system using membrane aeroshell

Contact Info

Product

Resources

About