2020
DOI: 10.1145/3410448
|View full text |Cite
|
Sign up to set email alerts
|

Large-scale Data Exploration Using Explanatory Regression Functions

Abstract: Analysts wishing to explore multivariate data spaces, typically issue queries involving selection operators, i.e., range or equality predicates, which define data subspaces of potential interest. Then, they use aggregation functions, the results of which determine a subspace's interestingness for further exploration and deeper analysis. However, Aggregate Query (AQ) results are scalars and convey limited information and explainability about the queried subspaces for enhanced exploratory analysis. Analysts have… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
2
0

Year Published

2021
2021
2025
2025

Publication Types

Select...
3
2
1

Relationship

2
4

Authors

Journals

citations
Cited by 6 publications
(3 citation statements)
references
References 49 publications
0
2
0
Order By: Relevance
“…Hence, a node selection mechanism is required to determine which node could improve the model performance and which can lead to a model that can forget what it has learned. Unfortunately, upon any incoming analytics query [13], we do not have full access to data of all participants; thus, we cannot extract the data space and patterns. In order to extract this knowledge per incoming query, it is deemed appropriate to define a pre-test mechanism to check if participants have a similar or different data patterns and relevant to the current query or not.…”
Section: Related Work and Rationalementioning
confidence: 99%
“…Hence, a node selection mechanism is required to determine which node could improve the model performance and which can lead to a model that can forget what it has learned. Unfortunately, upon any incoming analytics query [13], we do not have full access to data of all participants; thus, we cannot extract the data space and patterns. In order to extract this knowledge per incoming query, it is deemed appropriate to define a pre-test mechanism to check if participants have a similar or different data patterns and relevant to the current query or not.…”
Section: Related Work and Rationalementioning
confidence: 99%
“…For a particular data mining application, the importance of the prediction and description may differ some times. The goals of description and prediction are accomplished by using variety of mining methods of data [7] [8].…”
Section: Introductionmentioning
confidence: 99%
“…Typical use case scenarios include: (i) data scientists analyzing a large volume of data to obtain in-depth knowledge about the data for follow-up tasks like, data trend explanation (Savva et al, 2020), report summarization (Marcel and Negre, 2011) and (ii) users gathering information by discovering scholarly articles to conduct literature review using online services (Krause and Guestrin, 2011). These tasks can be conceptualized as the system recommending queries and the user either accepting or rejecting these recommendations, thus forming a closed loop interactive environment.…”
Section: Introductionmentioning
confidence: 99%