2017
DOI: 10.1007/s11280-017-0434-4
|View full text |Cite
|
Sign up to set email alerts
|

Efficient schemes for similarity-aware refinement of aggregation queries

Abstract: Interactive data exploration platforms in Web, business and scientific domains are becoming increasingly popular. Typically, users without prior knowledge of data interact with these platforms in an exploratory manner hoping they might retrieve the results they are looking for. One way to explore large-volume data is by posing aggregate queries which group values of multiple rows by an aggregate operator to form a single value: an aggregated value. Though, when a query fails, i.e., returns undesired aggregated… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
2
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
4
2

Relationship

1
5

Authors

Journals

citations
Cited by 6 publications
(2 citation statements)
references
References 27 publications
0
2
0
Order By: Relevance
“…For example, since HC[14] chooses refinement steps based on evaluating the relative error locally, it is vulnerable to getting stuck at a local minima when query similarity is included in assessing the relative error of each step. While this might not be true for SW framework[58], it still suffers from high I/O and CPU costs from exhaustively evaluating all cells in the partitioned space when there are no shape-based conditions.This thesis positions itself with[78,17,10,2,123,125,11,12] since it shares with all of these works a similar assumption. This assumption is a common problem that users often face when performing DE tasks.…”
mentioning
confidence: 66%
See 1 more Smart Citation
“…For example, since HC[14] chooses refinement steps based on evaluating the relative error locally, it is vulnerable to getting stuck at a local minima when query similarity is included in assessing the relative error of each step. While this might not be true for SW framework[58], it still suffers from high I/O and CPU costs from exhaustively evaluating all cells in the partitioned space when there are no shape-based conditions.This thesis positions itself with[78,17,10,2,123,125,11,12] since it shares with all of these works a similar assumption. This assumption is a common problem that users often face when performing DE tasks.…”
mentioning
confidence: 66%
“…Accordingly, researchers have proposed highly specialized and optimized DE techniques to support users with their diverse exploration tasks. For example, some of these tasks are to recommend relevant data [30,29], to identify interesting subspaces of data that are highly deviated from the rest of data or a reference [124], to explain why outliers show up in the results [104,129], to summarize and present representative sets of the potentially huge result sets [28,65], to formulate or refine queries based on user-defined constraints [33,119,58,125,2].…”
Section: Data Explorationmentioning
confidence: 99%