Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining 2008
DOI: 10.1145/1401890.1401898
|View full text |Cite
|
Sign up to set email alerts
|

Efficient semi-streaming algorithms for local triangle counting in massive graphs

Abstract: In this paper we study the problem of local triangle counting in large graphs. Namely, given a large graph G = (V, E) we want to estimate as accurately as possible the number of triangles incident to every node v ∈ V in the graph. The problem of computing the global number of triangles in a graph has been considered before, but to our knowledge this is the first paper that addresses the problem of local triangle counting with a focus on the efficiency issues arising in massive graphs. The distribution of the l… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
208
0

Year Published

2009
2009
2017
2017

Publication Types

Select...
3
3
2

Relationship

0
8

Authors

Journals

citations
Cited by 282 publications
(209 citation statements)
references
References 31 publications
1
208
0
Order By: Relevance
“…For many applications, especially in the context of large social networks, an exact count is not crucial but rather a fast, high quality estimate. Most of the work on approximate triangle counting is sampling-based and has considered a (semi-)streaming setting [5,6,7,14,23]. A different line of research is based on a linear algebraic approach [4,21].…”
Section: Introductionmentioning
confidence: 99%
“…For many applications, especially in the context of large social networks, an exact count is not crucial but rather a fast, high quality estimate. Most of the work on approximate triangle counting is sampling-based and has considered a (semi-)streaming setting [5,6,7,14,23]. A different line of research is based on a linear algebraic approach [4,21].…”
Section: Introductionmentioning
confidence: 99%
“…Semi-streaming model Recently, Becchetti, Boldi, Castillo and Gionis introduced the semi-streaming model in [5] to solve the local triangle counting problem. Their method relies on the locality sensitivity hashing concept.…”
Section: Exact Counting Methodsmentioning
confidence: 99%
“…Besides the significance of triangles in network analysis statistics, they also play an important role in graph mining applications: Eckmann and Moses showed how one can use triangles in order to uncover the hidden thematic structure of the web [11] and Beccheti et al in [5] used the local distribution of triangles and the clustering coefficient to detect spamming activity. Furthermore, triangle-related power laws [21] can be used to define outliers in a graph with respect to triangles.…”
Section: Introductionmentioning
confidence: 99%
“…This algorithm can be viewed as a special case of a streaming algorithm, since there exist algorithms, e.g., [29], that perform a constant number of passes over the non-zero elements of the matrix to produce a good low rank matrix approximation. In [5] the semi-streaming model for counting triangles is introduced, which allows log n passes over the edges. The key observation is that since counting triangles reduces to computing the intersection of two sets, namely the induced neighborhoods of two adjacent nodes, ideas from locality sensitivity hashing [6] are applicable to the problem.…”
Section: Existing Workmentioning
confidence: 99%
“…and is one of the main reasons which gave rise to the definitions of the transitivity ratio and the clustering coefficients of a graph in complex network analysis [27]. Triangles are used in several applications such as uncovering the hidden thematic structure of the web [13], as a feature to assist the classification of web activity [5] and for link recommendation in online social networks [36]. Furthermore, triangles are used as a network statistic in the exponential random graph model [14].…”
Section: Introductionmentioning
confidence: 99%