2017
DOI: 10.1609/aaai.v31i1.11157
|View full text |Cite
|
Sign up to set email alerts
|

Towards Continuous Scientific Data Analysis and Hypothesis Evolution

Abstract: Scientific data is continuously generated throughout the world. However, analyses of these data are typically performed exactly once and on a small fragment of recently generated data. Ideally, data analysis would be a continuous process that uses all the data available at the time, and would be automatically re-run and updated when new data appears. We present a framework for automated discovery from data repositories that tests user-provided hypotheses using expert-grade data analysis strategies, and reasse… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0
2

Year Published

2018
2018
2024
2024

Publication Types

Select...
3
2
1

Relationship

2
4

Authors

Journals

citations
Cited by 9 publications
(7 citation statements)
references
References 23 publications
0
5
0
2
Order By: Relevance
“…But it may also be useful to apply the same analysis on different (or updated) data. For instance, data analysis workflows can be exploited for continuous reevaluation of hypotheses by updating data analyses when new data is available [10].…”
Section: Results Checking and Data Reusementioning
confidence: 99%
“…But it may also be useful to apply the same analysis on different (or updated) data. For instance, data analysis workflows can be exploited for continuous reevaluation of hypotheses by updating data analyses when new data is available [10].…”
Section: Results Checking and Data Reusementioning
confidence: 99%
“…We are developing a framework, called DISK, to make hypothesis testing and data analysis more systematic (Gil et al 2017a). We look at the discovery cycle, starting with formulating new hypotheses, determining what type of data and method can be used to test it (we call this a line of inquiry), retrieving the data from a shared repository, analyzing the data, and then revising the hypothesis.…”
Section: Ai For Systematic Scientific Data Analysismentioning
confidence: 99%
“…For example, a citation for a bundle containing workflows and execution details for Gil et al (2017) is:…”
Section: Workflow Citationmentioning
confidence: 99%
“…A citation for a dataset consists of a descriptive name (or title) for the dataset, its creators, the name of the repository where it can be accessed, and the permanent URL. For example, a citation for a dataset in Gil et al (2017) is: Adusumilli, Ravali. (2016).…”
Section: Data Citationmentioning
confidence: 99%
See 1 more Smart Citation