Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining 2014
DOI: 10.1145/2623330.2623640
|View full text |Cite
|
Sign up to set email alerts
|

Scalable histograms on large probabilistic data

Abstract: Histogram construction is a fundamental problem in data management, and a good histogram supports numerous mining operations. Recent work has extended histograms to probabilistic data [5][6][7]. However, constructing histograms for probabilistic data can be extremely expensive, and existing studies suffer from limited scalability [5][6][7]. This work designs novel approximation methods to construct scalable histograms on probabilistic data. We show that our methods provide constant approximations compared to t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
1
0

Year Published

2014
2014
2020
2020

Publication Types

Select...
3
1

Relationship

1
3

Authors

Journals

citations
Cited by 4 publications
(1 citation statement)
references
References 30 publications
0
1
0
Order By: Relevance
“…The focus of this work was to adjust the balance between trees ("Tributaries") and multipath ("Deltas"), in response to varying network conditions, expressed in terms of the packets drop rate, for the purpose of robust and efficient in-network computation of aggregates. In particular, algorithms are presented in [8], [9], [10], [11], [12], [13], [14], [15] for computing frequent item sets, quantiles, histograms, and spatio-temporal data along with the criteria for changing the role of a particular node. In this paper, the problem of constructing the number of data aggregation trees with minimum energy cost will be studied.…”
mentioning
confidence: 99%
“…The focus of this work was to adjust the balance between trees ("Tributaries") and multipath ("Deltas"), in response to varying network conditions, expressed in terms of the packets drop rate, for the purpose of robust and efficient in-network computation of aggregates. In particular, algorithms are presented in [8], [9], [10], [11], [12], [13], [14], [15] for computing frequent item sets, quantiles, histograms, and spatio-temporal data along with the criteria for changing the role of a particular node. In this paper, the problem of constructing the number of data aggregation trees with minimum energy cost will be studied.…”
mentioning
confidence: 99%