2008
DOI: 10.1002/sam.10004
|View full text |Cite
|
Sign up to set email alerts
|

ORIGAMI: A Novel and Effective Approach for Mining Representative Orthogonal Graph Patterns

Abstract: Abstract:In this paper, we introduce the concept of α-orthogonal patterns to mine a representative set of graph patterns. Intuitively, two graph patterns are α-orthogonal if their similarity is bounded above by α. Each α-orthogonal pattern is also a representative for those patterns that are at least β similar to it. Given user defined α, β ∈ [0, 1], the goal is to mine an α-orthogonal, β-representative set that minimizes the set of unrepresented patterns.We present ORIGAMI, an effective algorithm for mining t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
48
0

Year Published

2009
2009
2020
2020

Publication Types

Select...
4
3

Relationship

2
5

Authors

Journals

citations
Cited by 45 publications
(48 citation statements)
references
References 17 publications
0
48
0
Order By: Relevance
“…In the next experiment, we show the effectiveness of the sampling approach over traditional graph mining algorithms. We use two large graph datasets: (1) protein-interaction (PI) graphs taken from [7] and (2) cell-graphs [1]. The cell-graphs have class labels based on whether the corresponding graph belongs to a benign (0) or invasive (1) tissue samples; in this dataset there are an equal number of graphs of either class.…”
Section: Sampling Results On Large Graphsmentioning
confidence: 99%
See 3 more Smart Citations
“…In the next experiment, we show the effectiveness of the sampling approach over traditional graph mining algorithms. We use two large graph datasets: (1) protein-interaction (PI) graphs taken from [7] and (2) cell-graphs [1]. The cell-graphs have class labels based on whether the corresponding graph belongs to a benign (0) or invasive (1) tissue samples; in this dataset there are an equal number of graphs of either class.…”
Section: Sampling Results On Large Graphsmentioning
confidence: 99%
“…For this graph dataset, none of the existing algorithms could finish in 2 full days for 50% support on a dual-core 2.2 GHz machine with 2GB of memory. [7] also reported similar problem with protein interaction network graphs. This motivates the need to find algorithms that can find a small set of interesting and useful patterns instead of attempting to enumerate the entire set of patterns.…”
Section: Introductionmentioning
confidence: 88%
See 2 more Smart Citations
“…Recently, the randomization idea has also been suggested in the area of pattern mining. Chaoji et al (2008) have introduced a feature construction method that obtains good patterns by sampling under diversity constraints. However, the suggested method requires the user to tune and specify two parameters that control the diversity (orthogonality) and representativeness respectively.…”
Section: Related Workmentioning
confidence: 99%