2019
DOI: 10.1609/aaai.v33i01.33011691

Tackling Sparse Rewards in Real-Time Games with Statistical Forward Planning Methods

Abstract: One of the issues general AI game players must deal with is the variety of reward systems across the games they are expected to play at a high level. Some games present plentiful rewards that agents can use to guide their search for the best solution, whereas others feature sparse reward landscapes that provide little information. The work presented in this paper focuses on the latter case, with which most agents struggle. Thus, modifications are proposed for …

Cited by 14 publications (11 citation statements). References 16 publications.
“…A different type of information was used by Gaina et al. [39] to dynamically adjust the length of the individuals in RHEA: the flatness of the fitness landscape is used to shorten or lengthen the individuals so that the algorithm better deals with sparse reward environments (using longer rollouts to identify rewards that are further away) while not harming performance in dense reward games (using shorter rollouts to focus on immediate rewards). However, this had a detrimental effect in RHEA, while boosting MCTS results.…”
Section: Evolutionary Methods
confidence: 99%
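The adaptive-length mechanism summarised in the statement above lends itself to a small illustration. The Python sketch below shows one plausible way rollout/individual length could be driven by the flatness of recent fitness evaluations; the class name, window size, threshold and step sizes are assumptions made for illustration, not the implementation evaluated in the cited paper.

```python
# Hypothetical sketch (not the authors' code): adapting RHEA individual length
# from the flatness of the recent fitness landscape.
from collections import deque

class AdaptiveLengthController:
    """Lengthens rollouts when fitness values are flat (sparse rewards),
    shortens them when fitness varies (dense rewards)."""

    def __init__(self, min_len=5, max_len=25, step=2, window=30, flat_eps=1e-6):
        self.min_len = min_len            # shortest individual allowed
        self.max_len = max_len            # longest individual allowed
        self.step = step                  # how much to change the length at a time
        self.flat_eps = flat_eps          # variance below this counts as "flat"
        self.recent_fitness = deque(maxlen=window)
        self.length = min_len

    def record(self, fitness):
        """Store the fitness of an evaluated individual."""
        self.recent_fitness.append(fitness)

    def update_length(self):
        """Adjust the individual length based on recent fitness variance."""
        if len(self.recent_fitness) < self.recent_fitness.maxlen:
            return self.length
        mean = sum(self.recent_fitness) / len(self.recent_fitness)
        var = sum((f - mean) ** 2 for f in self.recent_fitness) / len(self.recent_fitness)
        if var < self.flat_eps:
            # Flat landscape: look further ahead to find distant rewards.
            self.length = min(self.max_len, self.length + self.step)
        else:
            # Informative landscape: shorter plans keep focus on nearby rewards.
            self.length = max(self.min_len, self.length - self.step)
        return self.length
```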
“…MADRL with delayed rewards faces the challenge of high dimension (C.3) due to the large state-action spaces. Training performance can be enhanced by: (a) enabling agents to receive rewards at each training step, including a dense reward function that produces reward values for the majority of transitions so that agents receive rewards in almost every time step, particularly at the early stage of learning [95], for achieving optimal accumulated reward; (b) tailor-made reward functions designed by experts to assign rewards to behaviors that lead to the optimal goal with faster learning speed (O.2); and (c) credit assignment (or reward shaping [94]), which assigns credit to the particular action that triggers a reward [92]. Overall, properly designed reward functions ensure a higher convergence speed (O.2) and accumulated reward (P.2).…”
Section: Enhancing Training Performance in MADRL Using Delayed Rewards
confidence: 99%
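As a concrete illustration of point (c) in the statement above, the sketch below shows potential-based reward shaping, one standard way to turn a sparse or delayed reward into a per-step learning signal; the potential function and the dictionary-based state representation are placeholders assumed for illustration, not taken from the cited survey.

```python
# Minimal sketch of potential-based reward shaping: densifies a sparse/delayed
# reward without changing the optimal policy.

def potential(state):
    # Placeholder potential: negative distance to the goal. In practice this
    # encodes domain knowledge about progress toward the goal.
    return -abs(state.get("distance_to_goal", 0.0))

def shaped_reward(reward, state, next_state, gamma=0.99):
    """Return r + gamma * phi(s') - phi(s), providing a signal at every step."""
    return reward + gamma * potential(next_state) - potential(state)

# Usage: replace the environment reward with the shaped one during training.
r = 0.0  # sparse environment reward for a non-terminal step
s, s_next = {"distance_to_goal": 10.0}, {"distance_to_goal": 9.0}
print(shaped_reward(r, s, s_next))  # positive, since the agent moved closer
```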
“…individual length, mutation rate), but also the very structure of the algorithm (keeping the population evolved from one game tick to the next with a shift buffer, including or excluding evolutionary operators, adding Monte Carlo rollouts at the end of the individual when evaluating, etc.). These options are all collected from past literature [23], [41], [44], [45], for a resulting EA with a parameter search space of size 1.741E12.…”
Section: B. Planning Module
confidence: 99%
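To make the idea of a parameterised EA configuration space concrete, the following Python sketch enumerates a toy space over options of the kind listed above (individual length, mutation rate, shift buffer, Monte Carlo rollouts); the parameter names and value ranges are illustrative assumptions, and this toy space is far smaller than the 1.741E12 configurations of the cited agent.

```python
# Illustrative sketch of a parameterised RHEA-style configuration space; the
# options and values are assumptions, not the cited agent's actual search space.
from itertools import product

CONFIG_SPACE = {
    "individual_length": [5, 10, 15, 20],
    "mutation_rate":     [0.1, 0.3, 0.5],
    "population_size":   [1, 5, 10, 20],
    "use_shift_buffer":  [False, True],   # keep the population across game ticks
    "use_crossover":     [False, True],   # include or exclude evolutionary operators
    "mc_rollout_length": [0, 5, 10],      # Monte Carlo rollout appended when evaluating
}

def all_configs(space):
    """Enumerate every configuration (Cartesian product of all options)."""
    keys = list(space)
    for values in product(*(space[k] for k in keys)):
        yield dict(zip(keys, values))

total = sum(1 for _ in all_configs(CONFIG_SPACE))
print(f"{total} configurations in this toy space")  # the real space is ~1.741e12
```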