2015 49th Asilomar Conference on Signals, Systems and Computers 2015
DOI: 10.1109/acssc.2015.7421397
|View full text |Cite
|
Sign up to set email alerts
|

Counting triangles in real-world graph streams: Dealing with repeated edges and time windows

Abstract: Real-world graphs often manifest as a massive temporal "stream" of edges. The need for real-time analysis of such large graph streams has led to progress on low memory, one-pass streaming graph algorithms. These algorithms were designed for simple graphs, assuming an edge is not repeated in the stream. Real graph streams however, are almost always multigraphs i.e., they contain many duplicate edges. The assumption of no repeated edges requires an extra pass storing all the edges just for deduplication, which d… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0

Year Published

2017
2017
2022
2022

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 9 publications
(7 citation statements)
references
References 29 publications
0
7
0
Order By: Relevance
“…1 http://nskeylab.xjtu.edu.cn/dataset/phwang/code/PartitionCT.zip In summary, edges in the graph streaming model studied in this paper have: 1) duplicates, i.e., an edge in Π may appear more than once, which is similar to the data streaming model in [19,22]; 2) time-variant edge direction labels, i.e., each edge label l (t) (t) may change with t, which results in existing streaming algorithms failing to approximately count directed triangle patterns over time.…”
Section: Problem Formulationmentioning
confidence: 80%
See 2 more Smart Citations
“…1 http://nskeylab.xjtu.edu.cn/dataset/phwang/code/PartitionCT.zip In summary, edges in the graph streaming model studied in this paper have: 1) duplicates, i.e., an edge in Π may appear more than once, which is similar to the data streaming model in [19,22]; 2) time-variant edge direction labels, i.e., each edge label l (t) (t) may change with t, which results in existing streaming algorithms failing to approximately count directed triangle patterns over time.…”
Section: Problem Formulationmentioning
confidence: 80%
“…The algorithms are implemented in C++, and run on a computer with a Quad-Core Intel(R) Xeon(R) CPU E3-1226 v3 CPU 3.30GHz processor. We compare our method PartitionCT with four state-of-the-art methods: TRIÉST [44], MASCOT [30], FURL [22], and MG-Triangle [19]. For TRIÉST, both of its basic TRIÉST-BASE and improved TRIÉST-IMPR variants are considered.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…However, GPS requires more memory usage to store the sampling weights of sampled edges, and more runtime for sampling weights calculation and update. [43], [47], [48] develop one-pass streaming algorithms to deal with large graph streams including edge duplications. In detail, Wang et al [43] develop PartitionCT for triangle count approximation with a fixed memory usage, which uses a family of hash functions to uniformly sample distinct edges at a high speed, and this can reduce the sampling cost per edge to O(1) without additional memory usage.…”
Section: A Counting Triangles On Just a Machinementioning
confidence: 99%
“…In detail, Wang et al [43] develop PartitionCT for triangle count approximation with a fixed memory usage, which uses a family of hash functions to uniformly sample distinct edges at a high speed, and this can reduce the sampling cost per edge to O(1) without additional memory usage. Jha et al [47] present MG-TRIANGLE algorithm to estimate the triangle counts in multigraph streams. Jung et al [48] develop FURL to approximate local triangles for all nodes in multigraph streams.…”
Section: A Counting Triangles On Just a Machinementioning
confidence: 99%