We describe how to convert the heuristic search algorithm A* into an anytime algorithm that finds a sequence of improved solutions and eventually converges to an optimal solution. The approach we adopt uses weighted heuristic search to find an approximate solution quickly, and then continues the weighted search to find improved solutions and to tighten a bound on the suboptimality of the current solution. When the time available to solve a search problem is limited or uncertain, the result is an anytime heuristic search algorithm that allows a flexible tradeoff between search time and solution quality. We analyze the properties of the resulting Anytime A* algorithm, and consider its performance in three domains: sliding-tile puzzles, STRIPS planning, and multiple sequence alignment. To illustrate the generality of this approach, we also describe how to transform the memory-efficient search algorithm Recursive Best-First Search (RBFS) into an anytime algorithm.
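The sketch below, in Python, illustrates the idea the abstract describes: order expansions by a weighted evaluation f'(n) = g(n) + w·h(n) to reach a first solution quickly, then keep searching, pruning against the incumbent with the unweighted f = g + h and reporting a suboptimality bound from the smallest unweighted f-value still on the open list. The interfaces (`successors`, `goal_test`, `h`) are assumed for illustration; this is a minimal sketch of the technique, not the paper's implementation.

```python
import heapq
import itertools


def anytime_weighted_astar(start, goal_test, successors, h, weight=2.0):
    """A minimal sketch of anytime weighted A* search.

    Assumptions: successors(s) yields (child, step_cost) pairs, h(s) is
    an admissible heuristic, and weight > 1 inflates h to find a first
    solution quickly.  Yields (incumbent_cost, error_bound) each time an
    improved solution is found; an error bound of zero proves optimality.
    """
    tie = itertools.count()                      # heap tie-breaker
    g = {start: 0.0}
    # Expansion order uses the weighted evaluation f'(n) = g(n) + w*h(n).
    open_list = [(weight * h(start), next(tie), start)]
    incumbent = float("inf")
    while open_list:
        _, _, s = heapq.heappop(open_list)
        # Prune with the *unweighted* f = g + h, a true lower bound on
        # any solution through s because h is admissible.
        if g[s] + h(s) >= incumbent:
            continue
        if goal_test(s):
            incumbent = g[s]
            # The smallest unweighted f on OPEN bounds the optimal cost,
            # so (incumbent - lower) bounds the current suboptimality.
            lower = min((g[n] + h(n) for _, _, n in open_list),
                        default=incumbent)
            yield incumbent, max(0.0, incumbent - min(lower, incumbent))
            continue
        for child, cost in successors(s):
            new_g = g[s] + cost
            if new_g < g.get(child, float("inf")):
                g[child] = new_g
                heapq.heappush(
                    open_list,
                    (new_g + weight * h(child), next(tie), child))
    # OPEN exhausted: the last incumbent (if any) is optimal.
```

As the search continues, the unweighted f-values of the remaining open nodes rise, so the reported bound tightens with each improved solution until the incumbent is provably optimal, which is exactly the anytime behavior described above.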
Coordination of distributed agents is required for problems arising in many areas, including multi-robot systems, networking and e-commerce. As a formal framework for such problems, we use the decentralized partially observable Markov decision process (DEC-POMDP). Though much work has been done on optimal dynamic programming algorithms for the single-agent version of the problem, optimal algorithms for the multiagent case have been elusive. The main contribution of this paper is an optimal policy iteration algorithm for solving DEC-POMDPs. The algorithm uses stochastic finite-state controllers to represent policies. The solution can include a correlation device, which allows agents to correlate their actions without communicating. This approach alternates between expanding the controller and performing value-preserving transformations, which modify the controller without sacrificing value. We present two efficient value-preserving transformations: one can reduce the size of the controller and the other can improve its value while keeping the size fixed. Empirical results demonstrate the usefulness of value-preserving transformations in increasing value while keeping controller size to a minimum. To broaden the applicability of the approach, we also present a heuristic version of the policy iteration algorithm, which sacrifices convergence to optimality. This algorithm further reduces the size of the controllers at each step by assuming that probability distributions over the other agents' actions are known. While this assumption may not hold in general, it helps produce higher quality solutions in our test problems.
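To make the abstract's moving parts concrete, here is a small Python sketch of the standard stochastic finite-state controller representation (an action distribution ψ per node and a node-transition function η) together with the expand/transform alternation the abstract describes. The `backup`, `transform`, and `converged` callbacks are hypothetical placeholders for the paper's exhaustive backup, value-preserving transformations, and convergence test; this is an illustrative skeleton, not the paper's implementation.

```python
import random
from dataclasses import dataclass


def sample(dist):
    """Draw a key from an {outcome: probability} dictionary."""
    r, acc = random.random(), 0.0
    for outcome, p in dist.items():
        acc += p
        if r <= acc:
            return outcome
    return outcome  # guard against floating-point shortfall


@dataclass
class StochasticController:
    """One agent's stochastic finite-state controller: psi[q] is a
    distribution over actions at node q, and eta[(q, a, o)] is a
    distribution over successor nodes given the action taken and the
    observation received (standard DEC-POMDP controller notation)."""
    nodes: list
    psi: dict   # psi[q] -> {action: probability}
    eta: dict   # eta[(q, a, o)] -> {next_node: probability}

    def act(self, q):
        """Sample an action at controller node q."""
        return sample(self.psi[q])

    def next_node(self, q, a, o):
        """Sample the next controller node after taking a, observing o."""
        return sample(self.eta[(q, a, o)])


def policy_iteration(controllers, backup, transform, converged):
    """Skeleton of the alternation described above: exhaustively back up
    (expand) each agent's controller, then apply value-preserving
    transformations that shrink the controllers or improve their value
    without sacrificing it.  All three callbacks are hypothetical."""
    while not converged(controllers):
        controllers = [backup(c) for c in controllers]   # expansion step
        controllers = transform(controllers)             # value-preserving step
    return controllers
```

A shared correlation device, as mentioned in the abstract, can be layered on this representation by conditioning each agent's ψ and η on an additional correlated random signal.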