2005
DOI: 10.1109/tkde.2005.187
|View full text |Cite
|
Sign up to set email alerts
|

Using datacube aggregates for approximate querying and deviation detection

Abstract: Abstract-Much research has been devoted to the efficient computation of relational aggregations and, specifically, the efficient execution of the datacube operation. In this paper, we consider the inverse problem, that of deriving (approximately) the original data from the aggregates. We motivate this problem in the context of two specific application areas, approximate query answering and data analysis. We propose a framework based on the notion of information entropy that enables us to estimate the original … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
29
0
1

Year Published

2009
2009
2018
2018

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 35 publications
(31 citation statements)
references
References 31 publications
1
29
0
1
Order By: Relevance
“…The Approximate Query Answering (AQUA) (Gibbons & Matias 1998) system provides approximate answers using small precomputed synopses of the underlying base data. In (Palpanas et al 2005), the authors consider the problem of deriving approximately the original data from the aggregates. They propose a framework for estimating the original values based on the notion of information entropy.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…The Approximate Query Answering (AQUA) (Gibbons & Matias 1998) system provides approximate answers using small precomputed synopses of the underlying base data. In (Palpanas et al 2005), the authors consider the problem of deriving approximately the original data from the aggregates. They propose a framework for estimating the original values based on the notion of information entropy.…”
Section: Related Workmentioning
confidence: 99%
“…In such cases, it is quite useful to generate an estimate or approximate answers using approximate query processing techniques. A key issue is the accuracy of the estimates for aggregate queries (e.g., queries computing SUM or COUNT expressions), and was the focus of recent research activity (e.g., (Palpanas, Koudas & Mendelson 2005), (Pourabbas & Shoshani 2007)). …”
Section: Introductionmentioning
confidence: 99%
“…The principle maintains that since we have no plausible reason to bias the distribution towards a certain form, the reconstructed distribution should be as uniform and "uninformative" as possible, subject to the observed constraints. The technique has been successfully applied for purposes similar to our own before [14,16,12,13].…”
Section: Related Workmentioning
confidence: 99%
“…, wr). The estimation approach that we use is based on the widely accepted Principle of Maximum Entropy [8] and has been successfully employed before [14,16,12,13].…”
Section: Estimation Of Word Co-occurrencesmentioning
confidence: 99%
“…Nevertheless, the proposed solutions are specific to the particular tasks for which they were developed. Previous work has also studied the problem of applying data mining techniques in the context of data warehouses [14,16,17,13], which are used by business analysts during the decision-making process.…”
Section: Related Workmentioning
confidence: 99%