Initialize the Q-function hypothesis Q̂_0 to 0
e ← 0
repeat
    Examples ← ∅
    generate a starting schedule state s_0
    i ← 0
    repeat
        choose a repair operator a_i at s_i using a policy (e.g., ε-greedy)
            based on the current hypothesis Q̂_e
        implement operator a_i, observe r_i and the resulting schedule s_{i+1}
        i ← i + 1
    until schedule state s_i is a goal state
    for j = i − 1 to 0 do
        generate example …

Several incremental relational regression techniques have been developed that meet the above requirements for RRL implementation: an incremental relational tree learner, TG, an instance-based learner, a kernel-based method (Gärtner et al., 2003; Driessens et al., 2006), and a combination of a decision tree learner with an instance-based learner (Driessens and Džeroski, 2004). Of these algorithms, TG is the most popular, mainly because it is relatively easy to specify background knowledge in the form of a language bias.
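In the canonical RRL algorithm, the truncated step generates, for each visited pair, an example (s_j, a_j, q̂_j) with q̂_j ← r_j + γ max_{a'} Q̂_e(s_{j+1}, a'), and the set Examples is then handed to a relational regression learner to produce the next hypothesis Q̂_{e+1}. The following is a minimal Python sketch of the episode loop above under toy assumptions: the "schedule" is just a permutation of job ids, the repair operators are adjacent swaps, and a crude table-averaging regressor stands in for a relational learner such as TG. GAMMA, EPSILON, and all identifiers are illustrative choices, not taken from the paper.

import random

GAMMA = 0.9      # discount factor (assumed; not given in the excerpt)
EPSILON = 0.2    # exploration rate for the ε-greedy policy

# Toy stand-ins: a "schedule" is a tuple of job ids, the goal is one fixed
# order, and a repair operator swaps two adjacent jobs.  The paper's states
# and operators are relational schedule descriptions, not permutations.
GOAL = (0, 1, 2, 3)

def repair_operators(state):
    """Enumerate applicable repair operators as adjacent-swap positions."""
    return list(range(len(state) - 1))

def apply_operator(state, i):
    s = list(state)
    s[i], s[i + 1] = s[i + 1], s[i]
    return tuple(s)

def is_goal(state):
    return state == GOAL

def reward(state):
    return 1.0 if is_goal(state) else 0.0

class TableQ:
    """Crude incremental regressor over (state, action) pairs: it averages
    observed targets.  In RRL proper this is the component replaced by a
    relational learner such as TG."""
    def __init__(self):
        self.sums, self.counts = {}, {}
    def predict(self, state, action):
        key = (state, action)
        return self.sums.get(key, 0.0) / self.counts.get(key, 1)
    def update(self, examples):
        for state, action, q in examples:
            key = (state, action)
            self.sums[key] = self.sums.get(key, 0.0) + q
            self.counts[key] = self.counts.get(key, 0) + 1

def epsilon_greedy(q_hat, state):
    ops = repair_operators(state)
    if random.random() < EPSILON:
        return random.choice(ops)
    return max(ops, key=lambda a: q_hat.predict(state, a))

def run_episode(q_hat):
    """One RRL episode: apply repair operators until the schedule is a goal
    state, then generate regression examples backwards from the goal."""
    state = tuple(random.sample(range(4), 4))   # random starting schedule
    trace = []
    while not is_goal(state):
        a = epsilon_greedy(q_hat, state)
        nxt = apply_operator(state, a)
        trace.append((state, a, reward(nxt), nxt))
        state = nxt
    examples = []
    for s, a, r, nxt in reversed(trace):        # for j = i-1 to 0
        best_next = 0.0 if is_goal(nxt) else max(
            q_hat.predict(nxt, b) for b in repair_operators(nxt))
        examples.append((s, a, r + GAMMA * best_next))
    return examples

q_hat = TableQ()
for episode in range(200):
    q_hat.update(run_episode(q_hat))

# One swap away from the goal, so the learned value should approach 1.0.
print(q_hat.predict((1, 0, 2, 3), 0))

The point of the abstraction is that TableQ.update is the only piece the episode loop depends on: substituting TG, an instance-based learner, or a kernel-based regressor changes how Q̂ generalizes across structurally similar schedules without altering the loop itself.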