2019
DOI: 10.48550/arxiv.1907.00664
Preprint

Learning World Graphs to Accelerate Hierarchical Reinforcement Learning

Wenling Shang,
Alex Trott,
Stephan Zheng
et al.

Abstract: In many real-world scenarios, an autonomous agent often encounters various tasks within a single complex environment. We propose to build a graph abstraction over the environment structure to accelerate the learning of these tasks. Here, nodes are important points of interest (pivotal states) and edges represent feasible traversals between them. Our approach has two stages. First, we jointly train a latent pivotal state model and a curiosity-driven goal-conditioned policy in a task-agnostic manner. Second, pro…
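To make the abstract's idea concrete, here is a minimal sketch of a world-graph abstraction used for high-level planning: nodes stand for pivotal states, weighted edges for traversals a goal-conditioned policy can execute, and planning reduces to shortest paths over the graph. All class and method names here are hypothetical illustrations, not the authors' API.

```python
import heapq
from collections import defaultdict

class WorldGraph:
    """Sketch: nodes are pivotal states; an edge (s, t, cost) records that
    a low-level goal-conditioned policy can traverse from s to t."""

    def __init__(self):
        self.edges = defaultdict(list)  # pivotal state -> [(neighbor, cost)]

    def add_traversal(self, s, t, cost=1.0):
        # Record a feasible traversal discovered during exploration.
        self.edges[s].append((t, cost))

    def plan(self, start, goal):
        # Dijkstra over pivotal states: the resulting path is a sequence of
        # subgoals handed down to the low-level policy.
        dist, prev = {start: 0.0}, {}
        pq = [(0.0, start)]
        while pq:
            d, u = heapq.heappop(pq)
            if u == goal:
                break
            if d > dist.get(u, float("inf")):
                continue
            for v, c in self.edges[u]:
                nd = d + c
                if nd < dist.get(v, float("inf")):
                    dist[v], prev[v] = nd, u
                    heapq.heappush(pq, (nd, v))
        if goal not in dist:
            return None  # no feasible traversal chain known
        path, node = [goal], goal
        while node != start:
            node = prev[node]
            path.append(node)
        return path[::-1]

g = WorldGraph()
g.add_traversal("door", "hall")
g.add_traversal("hall", "key")
print(g.plan("door", "key"))  # ['door', 'hall', 'key']
```

The point of the abstraction is that once the graph exists, a multi-step navigation task becomes a cheap graph search plus short policy rollouts between adjacent pivotal states.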

Cited by 5 publications (7 citation statements)
References 52 publications
“…Other examples of hybrid symbolic and sub-symbolic methods where a knowledge-base tool or graph-perspective enhances the neural (e.g., language [308]) model are in [309,310]. In reinforcement learning, very few examples of symbolic (graphical [311] or relational [75,312]) hybrid models exist, while in recommendation systems, for instance, explainable autoencoders are proposed [313].…”
Section: Prediction Explanation
Mentioning confidence: 99%
“…Attention Networks [267,278,330,331,332,333,334]; Representation Disentanglement [113,279,335,336,337,338,339,340,341,342]; Explanation Generation [276,343,344,345]; Hybrid Transparent and Black-box Methods: Neural-symbolic Systems [297,298,299,300], KB-enhanced Systems [24,169,301,308,309,310], Deep Formulation [264,302,303,304,305], Relational Reasoning [75,312,313,314], Case-base Reasoning [316,317,318]. Figure 11: (a) Alternative Deep Learning specific taxonomy extended from the categorization from [13]; and (b) its connection to the taxonomy in Figure 6.…”
Section: Explanation Of Deep Network Representation
Mentioning confidence: 99%
“…Keramati et al. (2018) propose a model-based framework to solve sparse-reward domains, and incorporate macro-actions in the form of fixed action sequences that can be selected as a single decision. Shang et al. (2019) use variational inference to construct a world graph similar to our region space. However, unlike our model-free method, the option policies are trained using dynamic programming, which requires knowledge of the environment dynamics.…”
Section: Related Work
Mentioning confidence: 99%
“…planning in model-based reinforcement learning) as the same future point in time can be reached with shorter and more accurate state rollouts (Pertsch et al., 2020; Zakharov et al., 2021). These benefits have recently prompted a number of proposed models that either perform transitions with temporal jumps of arbitrary length (Koutnik et al., 2014; Saxena et al., 2021), or aim to identify significant events (or key-frames) in sequential data and model the transitions between these events (Chung et al., 2017; Neitz et al., 2018; Jayaraman et al., 2018; Shang et al., 2019; Kipf et al., 2019; Kim et al., 2019; Pertsch et al., 2020; Zakharov et al., 2021).…”
Section: Introduction
Mentioning confidence: 99%
“…The large variety of different event criteria defined in these studies demonstrates the lack of a widely established definition of this concept in this literature. For instance, important events are either selected as the points in time that contain maximum information about a full video sequence (Pertsch et al., 2020), about the agent's actions (Shang et al., 2019), as the most predictable (Neitz et al., 2018; Jayaraman et al., 2018), or as the most surprising (Zakharov et al., 2021) points in time. A clearer picture can be drawn when viewing events from the perspective of cognitive psychology, where events are defined as segments of time "conceived by an observer to have a beginning and an end" (Zacks & Tversky, 2001).…”
Section: Introduction
Mentioning confidence: 99%