2006
DOI: 10.1214/009053605000000822
|View full text |Cite
|
Sign up to set email alerts
|

Sequential importance sampling for multiway tables

Abstract: We describe an algorithm for the sequential sampling of entries in multiway contingency tables with given constraints. The algorithm can be used for computations in exact conditional inference. To justify the algorithm, a theory relates sampling values at each step to properties of the associated toric ideal using computational commutative algebra. In particular, the property of interval cell counts at each step is related to exponents on lead indeterminates of a lexicographic Gr\"{o}bner basis. Also, the appr… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
81
0

Year Published

2006
2006
2019
2019

Publication Types

Select...
5
2
1

Relationship

1
7

Authors

Journals

citations
Cited by 71 publications
(81 citation statements)
references
References 28 publications
0
81
0
Order By: Relevance
“…. , u 4 Proof. We proceed by induction on k. For the base case k = 1, note that ∂∆ 1 is two isolated points.…”
Section: β-Avoiding Simplicial Complexesmentioning
confidence: 98%
See 1 more Smart Citation
“…. , u 4 Proof. We proceed by induction on k. For the base case k = 1, note that ∂∆ 1 is two isolated points.…”
Section: β-Avoiding Simplicial Complexesmentioning
confidence: 98%
“…In previous work of Rauh and the second author [11], normality was identified as a key property of hierarchical models to be able to apply the toric fiber product construction to calculate a Markov basis. If A C,d is unimodular, it is also easy to solve the integer programs that arise in sequential importance sampling [4]. It is a major open problem to classify the normal hierarchical models.…”
Section: Definition 12mentioning
confidence: 99%
“…In particular, a set of minimal Markov bases allows us to build a connected Markov chain and perform a random walk over all the points in the fiber that have the same fixed marginals and/or conditionals. Thus we can either enumerate or sample from the space of tables via Sequential Importance Sampling (SIS) or Markov Chain Monte Carlo (MCMC) sampling, e.g., see Chen et al [5]. Some disadvantages of the algebraic approach are that (1) calculation of Markov bases are computationally infeasible even for tables of small dimension, and (2) for conditionals, Markov bases are extremely sensitive to rounding of cell probabilities.…”
Section: Link Between Maximum Likelihood Estimates and Cellmentioning
confidence: 99%
“…In theory we could enumerate the number of possible tables utilizing algebraic techniques and software such as LattE [8] or sampling techniques such as MCMC and SIS [5]. Due to the large dimension of the solution polytope for this example, however, LattE is currently unable the execute the computation because the space of possible tables is extremely large.…”
Section: Examplementioning
confidence: 99%
“…Related algorithms using elegant random scan Gibbs samplers were given by ; McDonald, Smith, and Forster (1999); . Furthermore, relevant recent developments in sequential importance sampling (Chen, Diaconis, Holmes, and Liu 2005;Chen, Dinwoodie, and Sullivant 2006) are applicable to this setting. We refer the reader to Caffo and Booth (2003) for an overview of Monte Carlo algorithms in this area.…”
Section: The Softwarementioning
confidence: 99%