2019
DOI: 10.1145/3290350
|View full text |Cite
|
Sign up to set email alerts
|

Bayesian synthesis of probabilistic programs for automatic data modeling

Abstract: We present new techniques for automatically constructing probabilistic programs for data analysis, interpretation, and prediction. These techniques work with probabilistic domain-specific data modeling languages that capture key properties of a broad class of data generating processes, using Bayesian inference to synthesize probabilistic programs in these modeling languages given observed data. We provide a precise formulation of Bayesian synthesis for automatic data modeling that identifies sufficient conditi… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
23
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
5
1
1

Relationship

1
6

Authors

Journals

citations
Cited by 27 publications
(23 citation statements)
references
References 48 publications
0
23
0
Order By: Relevance
“…DPMMs can approximate a broad class of multivariate distributions without requiring a priori specification of the number of components in the mixture model. The mixture models generated via a DPMM prior can be converted to probabilistic programs for inference to generate the kinds of conditional simulations used in Figure 6 (Saad et al, 2019). In this representation, each pre-bout to post-bout prey transformation made by a zebrafish can be thought of as arising from a program that first chooses a prototypical transform (corresponding to a component in the mixture), and then generates a random transform from a distribution over transforms associated with the prototype.…”
Section: Methodsmentioning
confidence: 99%
“…DPMMs can approximate a broad class of multivariate distributions without requiring a priori specification of the number of components in the mixture model. The mixture models generated via a DPMM prior can be converted to probabilistic programs for inference to generate the kinds of conditional simulations used in Figure 6 (Saad et al, 2019). In this representation, each pre-bout to post-bout prey transformation made by a zebrafish can be thought of as arising from a program that first chooses a prototypical transform (corresponding to a component in the mixture), and then generates a random transform from a distribution over transforms associated with the prototype.…”
Section: Methodsmentioning
confidence: 99%
“…This motivated (1) abstractionrefinement over the MDP representation [10], and (2) counterexample-guided inductive synthesis (CEGIS) for MCs [9], mentioned earlier. The alternative problem of sketching for probabilistic programs that fit given data is studied, e.g., in [32,38].…”
Section: Related Workmentioning
confidence: 99%
“…For the first question, we obtained 23 benchmarks from prior work [40] and collected 6 new benchmarks. The 29 benchmark programs consist of (i) example models from Anglican [62], Turing [20], and Pyro [8], as well as (ii) PCFG models, including a Gaussian-process domain-specific language (DSL) [50] and synthetic models (such as examples shown in this paper). Compared to prior work [40], a larger subset of benchmark models are expressible and type-checked in our PPL.…”
Section: Experimental Evaluationmentioning
confidence: 99%