Fabular: regression formulas as probabilistic programming

Borgström, Johannes; Gordon, Andrew D.; Ouyang, Long; Russo, Claudio; Ścibior, Adam; Szymczak, Marcin

doi:10.1145/2914770.2837653

Cited by 2 publications

(3 citation statements)

References 11 publications


“…The dissertation also described a new, arguably more rigorous and elegant, semantics of Core Tabular models. Fabular, presented by Borgström et al (2016), extends Tabular with hierarchical linear regression formulas, extending the formula notation used by R packages such as lmer. Such formulas allow for a concise representation of a wide class of models and can be used in Tabular like any other model expressions.…”

Section: Retrospective and Related Projects (mentioning)

confidence: 99%
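To make the citation statement above concrete, here is a minimal sketch of how an lmer-style formula such as `y ~ 1 + x + (1 | g)` can be read as a generative program: a shared intercept, a shared slope, one latent intercept per group, and Gaussian noise. The function name, the priors, and the standard deviations are illustrative assumptions; this is not Fabular's syntax or its semantics.

```python
import random


def sample_random_intercept_model(xs, groups, seed=0):
    """Draw one dataset from y ~ 1 + x + (1 | g), read as a generative program.

    All priors and scales below are assumptions made for illustration only.
    """
    rng = random.Random(seed)
    intercept = rng.gauss(0.0, 1.0)  # coefficient for the "1" term
    slope = rng.gauss(0.0, 1.0)      # coefficient for the "x" term
    group_sd = 0.5                   # assumed spread of the group intercepts
    noise_sd = 0.1                   # assumed observation noise

    # The "(1 | g)" term: one latent intercept per distinct group level.
    offsets = {g: rng.gauss(0.0, group_sd) for g in sorted(set(groups))}

    # Each response is the sum of the formula's terms plus Gaussian noise.
    ys = [
        intercept + slope * x + offsets[g] + rng.gauss(0.0, noise_sd)
        for x, g in zip(xs, groups)
    ]
    return {"intercept": intercept, "slope": slope, "offsets": offsets, "ys": ys}


if __name__ == "__main__":
    draw = sample_random_intercept_model([0.0, 1.0, 2.0, 3.0], ["a", "a", "b", "b"])
    print(draw["ys"])
```

Each term of the formula maps to one random choice in the program, which is why such formulas can stand in for a whole family of hierarchical regression models.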

Tabular: Probabilistic Inference from the Spreadsheet

Gordon, Russo, Szymczak et al. 2020

Foundations of Probabilistic Programming

Self Cite


Tabular is a domain-specific language for expressing probabilistic models of relational data. Tabular has several features that set it apart from other probabilistic programming languages including: (1) programs and data are stored as spreadsheet tables; (2) programs consist of probabilistic annotations on the relational schema of the data; and (3) inference returns estimations of missing values and latent columns, as well as parameters. Our primary implementation is for Microsoft Excel and relies on Infer.NET for inference. Still, the language can be called independently of Excel and can target alternative inference engines.

Overview

Probabilistic programming languages promise to make machine learning more accessible by allowing users to write their generative models as computer programs and providing generic inference engines capable of performing inference on all valid programs expressible in the given language. However, as most of the currently existing languages are essentially probabilistic extensions of conventional programming languages, they are arguably not ideally suited for the job. For one thing, they are still difficult to use for people who are not professional programmers. Meanwhile, many people who may want to use probabilistic mod-



“…-Elimination is widely applied to discrete and continuous variables [de Salvo Braz et al 2007; Dechter 1998; Poole and Zhang 2003; Sanner and Abbasnejad 2012; Poole 1994, 1996] and is known in various contexts as Rao-Blackwellization [Blackwell 1947; Casella and Robert 1996; Gelfand and Smith 1990; Kolmogorov 1950; Murray et al 2018; Rao 1945], collapse [Koller and Friedman 2009; Venugopal and Gogate 2013], marginalization [Meng and van Dyk 1999; Obermeyer et al 2018], and integrating out [Griffiths and Steyvers 2004; Resnik and Hardisty 2010]. -Conjugacy is a preferred starting point and basic building block of Bayesian data modeling [Gelman et al 2014, page 36] and underlies such popular applications as Naive Bayes classification [Bayes 1763] and Bayesian linear regression [Borgström et al 2016]. • Loop optimization includes reordering sums to achieve superlinear speedups, and fusing and specializing loops to obtain one more order of magnitude in performance.…”

Section: Simplifying and Optimizing Probabilistic Programming (mentioning)

confidence: 99%


From high-level inference algorithms to efficient code

Walia, Narayanan, Carette et al. 2019

Proc. ACM Program. Lang.


Probabilistic programming languages are valuable because they allow domain experts to express probabilistic models and inference algorithms without worrying about irrelevant details. However, for decades there remained an important and popular class of probabilistic inference algorithms whose efficient implementation required manual low-level coding that is tedious and error-prone. They are algorithms whose idiomatic expression requires random array variables that are latent or whose likelihood is conjugate. Although that is how practitioners communicate and compose these algorithms on paper, executing such expressions requires eliminating the latent variables and recognizing the conjugacy by symbolic mathematics. Moreover, matching the performance of handwritten code requires speeding up loops by more than a constant factor.

We show how probabilistic programs that directly and concisely express these desired inference algorithms can be compiled while maintaining efficiency. We introduce new transformations that turn high-level probabilistic programs with arrays into pure loop code. We then make great use of domain-specific invariants and norms to optimize the code, and to specialize and JIT-compile the code per execution. The resulting performance is competitive with manual implementations.
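The abstract above refers to recognizing conjugacy and eliminating latent variables. As a small, hedged illustration of what those two ideas mean, the sketch below uses the textbook Beta-Bernoulli pair: the posterior over the latent coin bias has a closed form, and the bias can be integrated out to give the marginal likelihood of the data. The function names are hypothetical, and this hand-written example is not the compilation technique of the cited paper.

```python
import math


def beta_bernoulli_posterior(alpha, beta, observations):
    """Conjugate update: a Beta(alpha, beta) prior with 0/1 observations
    yields a Beta posterior whose parameters are just updated counts."""
    successes = sum(observations)
    failures = len(observations) - successes
    return alpha + successes, beta + failures


def log_beta(a, b):
    """Logarithm of the Beta function B(a, b)."""
    return math.lgamma(a) + math.lgamma(b) - math.lgamma(a + b)


def log_marginal_likelihood(alpha, beta, observations):
    """Log probability of the observed sequence with the latent bias
    integrated out (collapsed): B(alpha', beta') / B(alpha, beta)."""
    post_alpha, post_beta = beta_bernoulli_posterior(alpha, beta, observations)
    return log_beta(post_alpha, post_beta) - log_beta(alpha, beta)


if __name__ == "__main__":
    data = [1, 1, 0]
    print(beta_bernoulli_posterior(1.0, 1.0, data))
    print(math.exp(log_marginal_likelihood(1.0, 1.0, data)))
```

Because the update is closed-form, no sampling of the latent bias is needed; doing this kind of simplification automatically, and for array-valued variables, is the harder problem the abstract describes.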

