2017
DOI: 10.1093/bioinformatics/btx602
|View full text |Cite
|
Sign up to set email alerts
|

graphkernels: R and Python packages for graph comparison

Abstract: SummaryMeasuring the similarity of graphs is a fundamental step in the analysis of graph-structured data, which is omnipresent in computational biology. Graph kernels have been proposed as a powerful and efficient approach to this problem of graph comparison. Here we provide graphkernels, the first R and Python graph kernel libraries including baseline kernels such as label histogram based kernels, classic graph kernels such as random walk based kernels, and the state-of-the-art Weisfeiler-Lehman graph kernel.… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
23
0

Year Published

2018
2018
2025
2025

Publication Types

Select...
6
3

Relationship

0
9

Authors

Journals

citations
Cited by 37 publications
(23 citation statements)
references
References 10 publications
0
23
0
Order By: Relevance
“…To assess the non-randomness of the constructed network, 1000 random networks with the same number of vertices and interactions as the OS-specific network were generated using the Erdos-Renyi model (19) in the igraph R package (Version 0.7.1; ) (20). The arithmetic average values of the shortest path distance and clustering coefficient were calculated.…”
Section: Methodsmentioning
confidence: 99%
“…To assess the non-randomness of the constructed network, 1000 random networks with the same number of vertices and interactions as the OS-specific network were generated using the Erdos-Renyi model (19) in the igraph R package (Version 0.7.1; ) (20). The arithmetic average values of the shortest path distance and clustering coefficient were calculated.…”
Section: Methodsmentioning
confidence: 99%
“…The approach is very general, as feature vectors can be defined in very different forms. A recent paper 48 , introducing an R/Python implementation, summarizes 14 different kernel types among the most popular ones: the majority of them are based, in different forms, on statistics on node/edge labels (thus they fall out of the scope of our work, as we do not assume labels on nodes/edges). Two of them are based on graphlet count, and the remaining on the comparison of random walks on the two graphs.…”
Section: Measuring the Distance Between Networkmentioning
confidence: 99%
“…We compared networks using the connected graphlet algorithm described by Shervashidze et al 44 that is provided in the graphkernels 1.4 R package 45 and obtained a similarity matrix of the networks. The connected graphlet algorithm measures similarity between 2 graphs (networks) by comparing the distribution of graphlets (subnetworks) within 2 networks rather than node and edge labels, and has been shown to give competitive performance on unlabeled networks 44 .…”
Section: Methodsmentioning
confidence: 99%