Goal-directed graph construction using reinforcement learning

Darvariu, Victor-Alexandru; Hailes, Stephen; Musolesi, Mirco

doi:10.1098/rspa.2021.0168

Cited by 5 publications

(11 citation statements)

References 58 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Namely, Darvariu et al . [12] reports a wall clock time that is equivalent to 56 h of a single core of a comparable CPU to train a model on graphs of size N=20 and, due to the complexity of the problem, does not train models directly beyond graphs with N=50. By contrast, on similar computational infrastructure, our proposed SG-UCT requires 11 h on average to optimize a much larger graph with N=200 nodes.…”

Section: Methodsmentioning

confidence: 99%

“…In particular, we use the definition in [15], which considers the size of the LCC as nodes are removed from the network. We consider only the targeted attack case as previous work has found it is more challenging [12,37]. We define the robustness measure as scriptFRfalse(Gfalse)=double-struckEξfalse[false(1/Nfalse)∑i=1Ns(G,ξ,i)false], where sfalse(G,ξ,ifalse) denotes the fraction of nodes in the LCC of G after the removal of the first i nodes in the permutation ξ (in which nodes appear in descending order of their degrees).…”

Section: Preliminaries and Backgroundmentioning

confidence: 99%

“…In the machine learning community, model-free RL techniques have been applied for deriving adversarial examples for graph-based classifiers by changing the network structure [13] and the goal-directed generation of molecular graphs [10]. [12] formulated the goal-directed construction of a graph as an MDP and proposed a method based on RL and graph neural networks, showing some advantages over prior methods in the network science literature in terms of its ability to optimize the objective and the fast runtime of the policy once it has been trained.…”

Section: Preliminaries and Backgroundmentioning

confidence: 99%

Section: (Ii) Robustnessmentioning

confidence: 99%

See 3 more Smart Citations

Planning spatial networks with Monte Carlo tree search

2023

Self Cite

View full text Add to dashboard Cite

We tackle the problem of goal-directed graph construction: given a starting graph, finding a set of edges whose addition maximally improves a global objective function. This problem emerges in many transportation and infrastructure networks that are of critical importance to society. We identify two significant shortcomings of present reinforcement learning methods: their exclusive focus on topology to the detriment of spatial characteristics (which are known to influence the growth and density of links), as well as the rapid growth in the action spaces and costs of model training. Our formulation as a deterministic Markov decision process allows us to adopt the Monte Carlo tree search framework, an artificial intelligence decision-time planning method. We propose improvements over the standard upper confidence bounds for trees (UCT) algorithm for this family of problems that addresses their single-agent nature, the trade-off between the cost of edges and their contribution to the objective, and an action space linear in the number of nodes. Our approach yields substantial improvements over UCT for increasing the efficiency and attack resilience of synthetic networks and real-world Internet backbone and metro systems, while using a wall clock time budget similar to other search-based algorithms. We also demonstrate that our approach scales to significantly larger networks than previous reinforcement learning methods, since it does not require training a model.

show abstract

Section: Methodsmentioning

confidence: 99%

Section: Preliminaries and Backgroundmentioning

confidence: 99%

Section: Preliminaries and Backgroundmentioning

confidence: 99%

Section: (Ii) Robustnessmentioning

confidence: 99%

See 2 more Smart Citations

Planning spatial networks with Monte Carlo tree search

2023

Self Cite

View full text Add to dashboard Cite

show abstract

“…In addition to the methods mentioned above that are more related to our proposed approach, there are other categories of modern graph generation approaches, the most noteworthy of which are autoencoder-based methods [ 18 , 38 – 42 ], RL-based approaches [ 43 – 45 ], GAN-based generating strategies [ 15 , 19 , 46 ], and flow-based models [ 47 , 48 ].…”

Section: Related Workmentioning

confidence: 99%

SCGG: A deep structure-conditioned graph generative model

et al. 2022

View full text Add to dashboard Cite

Deep learning-based graph generation approaches have remarkable capacities for graph data modeling, allowing them to solve a wide range of real-world problems. Making these methods able to consider different conditions during the generation procedure even increases their effectiveness by empowering them to generate new graph samples that meet the desired criteria. This paper presents a conditional deep graph generation method called SCGG that considers a particular type of structural conditions. Specifically, our proposed SCGG model takes an initial subgraph and autoregressively generates new nodes and their corresponding edges on top of the given conditioning substructure. The architecture of SCGG consists of a graph representation learning network and an autoregressive generative model, which is trained end-to-end. More precisely, the graph representation learning network is designed to compute continuous representations for each node in a graph, which are not only affected by the features of adjacent nodes, but also by the ones of farther nodes. This network is primarily responsible for providing the generation procedure with the structural condition, while the autoregressive generative model mainly maintains the generation history. Using this model, we can address graph completion, a rampant and inherently difficult problem of recovering missing nodes and their associated edges of partially observed graphs. The computational complexity of the SCGG method is shown to be linear in the number of graph nodes. Experimental results on both synthetic and real-world datasets demonstrate the superiority of our method compared with state-of-the-art baselines.

show abstract