2003
DOI: 10.1017/s0956796802004562
|View full text |Cite
|
Sign up to set email alerts
|

AutoBayes: a system for generating data analysis programs from statistical models

Abstract: Data analysis is an important scientific task which is required whenever information needs to be extracted from raw data. Statistical approaches to data analysis, which use methods from probability theory and numerical analysis, are well-founded but difficult to implement: the development of a statistical data analysis program for any given application is time-consuming and requires substantial knowledge and experience in several areas.In this paper, we describe AutoBayes, a program synthesis system for the ge… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
52
0

Year Published

2005
2005
2010
2010

Publication Types

Select...
5
2

Relationship

2
5

Authors

Journals

citations
Cited by 81 publications
(52 citation statements)
references
References 25 publications
0
52
0
Order By: Relevance
“…To start with, the vast majority of standalone probabilistic languages are implemented as interpreters rather than compilers (with the notable exceptions of AutoBayes [13] and HBC [5]). In contrast, an embedded probabilistic language can piggyback on its host language's compilers to remove some interpretive overhead.…”
Section: Rain Sprinkler Grass_is_wetmentioning
confidence: 99%
“…To start with, the vast majority of standalone probabilistic languages are implemented as interpreters rather than compilers (with the notable exceptions of AutoBayes [13] and HBC [5]). In contrast, an embedded probabilistic language can piggyback on its host language's compilers to remove some interpretive overhead.…”
Section: Rain Sprinkler Grass_is_wetmentioning
confidence: 99%
“…1) is a code certification extension to the AutoBayes synthesis system, which is used in the statistical data analysis domain [2]. Its input specification is a statistical model, i.e., it describes how the statistical variables are distributed and depend on each other and which parameters have to be estimated for the given task.…”
Section: The Autobayes/cc Systemmentioning
confidence: 99%
“…The synthesized program uses an iterative EM (expectation maximization) algorithm and consists of roughly 380 lines of code, 90 of which are auto-generated comments to explain the code. For details see http://ase.arc.nasa.gov/schumann/AutoBayesCC and [2]. The code is annotated to prove division-by-zero and array-bounds safety.…”
Section: A Certification Examplementioning
confidence: 99%
See 1 more Smart Citation
“…The range of possible behaviors for the entire system is not currently tractable to explicit techniques. At the system level, we use a combination of unsupervised 19 and supervised 20,21 machine learning techniques in a directed Monte Carlo global sensitivity analysis 22 to model the behavioral structure of the component and to predict the component-level input test vectors. Simulation-based validation depends on heuristics and cannot guarantee that the system is safe; however, validation testing is likely to uncover unsafe behaviors not discovered by using formal methods on simplified systems [23][24][25] .…”
Section: Introductionmentioning
confidence: 99%