2002
DOI: 10.1093/comjnl/45.6.631
|View full text |Cite
|
Sign up to set email alerts
|

RASCAL: Calculation of Graph Similarity using Maximum Common Edge Subgraphs

Abstract: A new graph similarity calculation procedure is introduced for comparing labeled graphs. Given a minimum similarity threshold, the procedure consists of an initial screening process to determine whether it is possible for the measure of similarity between the two graphs to exceed the minimum threshold, followed by a rigorous maximum common edge subgraph (MCES) detection algorithm to compute the exact degree and composition of similarity. The proposed MCES algorithm is based on a maximum clique formulation of t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
260
0
2

Year Published

2003
2003
2017
2017

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 272 publications
(262 citation statements)
references
References 29 publications
0
260
0
2
Order By: Relevance
“…This is a computationally expensive process, and thus there are a number of strategies to simplify the computations. For instance Raymond et al (2002) propose to first compute an upper bound of the similarity measure, and only compute the actual MCS for those molecules for which the upper bound is over a given threshold. Computing the MCS is a very related problem to that of finding the anti-unification in refinement graphs, and thus S λ is related to graph-based similarities for molecules.…”
Section: Related Workmentioning
confidence: 99%
“…This is a computationally expensive process, and thus there are a number of strategies to simplify the computations. For instance Raymond et al (2002) propose to first compute an upper bound of the similarity measure, and only compute the actual MCS for those molecules for which the upper bound is over a given threshold. Computing the MCS is a very related problem to that of finding the anti-unification in refinement graphs, and thus S λ is related to graph-based similarities for molecules.…”
Section: Related Workmentioning
confidence: 99%
“…Most of these algorithms avoid the computational complexity by computing approximate values. Raymond et al [14] also propose an exact multi-step algorithm which defines a similarity based on computing the maximum common subgraph. This algorithm is theoretically still NP-complete, but makes use of advanced heuristics to reduce the number of matchings required.…”
Section: Related Workmentioning
confidence: 99%
“…Our method is based on this idea of graph similarity in function of the maximum common subgraph, and thus shares the intuitiveness of [14], although the latter requires an advanced graph-theoretical problem transformation and is difficult to implement. Another difference is that, by using the BBP subgraph isomorphism of [8], we can obtain a polynomial algorithm which only takes into account a subset of matchings and in this way imposes a bias on the features that will be used for computing the similarity.…”
Section: Related Workmentioning
confidence: 99%
“…This provides a natural way of calculating the degree of similarity between a pair of molecules but the NP-complete nature of the maximum common subgraph isomorphism problem has ruled out the large-scale use of MCS-based similarities. We have recently described a new MCS algorithm, called RASCAL, that is sufficiently rapid in execution to permit graph-based similarity searching of large chemical databases 16,17 and that seems to provide a viable complement, or even an alternative, to existing, fingerprint-based approaches to virtual screening 18 .…”
Section: Introductionmentioning
confidence: 99%