2018
DOI: 10.1080/01621459.2018.1448827
|View full text |Cite
|
Sign up to set email alerts
|

Speeding Up MCMC by Efficient Data Subsampling

Abstract: We propose Subsampling MCMC, a Markov Chain Monte Carlo (MCMC) framework where the likelihood function for n observations is estimated from a random subset of m observations. We introduce a highly efficient unbiased estimator of the loglikelihood based on control variates, such that the computing cost is much smaller than that of the full log-likelihood in standard MCMC. The likelihood estimate is bias-corrected and used in two dependent pseudo-marginal algorithms to sample from a perturbed posterior, for w… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
167
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
7
1

Relationship

2
6

Authors

Journals

citations
Cited by 122 publications
(167 citation statements)
references
References 39 publications
0
167
0
Order By: Relevance
“…Typically a small fraction of observations are included, hence speeding up the execution time significantly compared to MH. However, the augmentation scheme severely affects the mixing of the Firefly algorithm and it has been demonstrated to perform poorly compared to MH and other subsampling approaches (Bardenet et al, 2015;Quiroz et al, 2016Quiroz et al, , 2017. We conclude that delayed acceptance, out of the discussed methods, seems to be the only feasible route to obtain exact simulation via data subsampling.…”
Section: Introductionmentioning
confidence: 95%
See 2 more Smart Citations
“…Typically a small fraction of observations are included, hence speeding up the execution time significantly compared to MH. However, the augmentation scheme severely affects the mixing of the Firefly algorithm and it has been demonstrated to perform poorly compared to MH and other subsampling approaches (Bardenet et al, 2015;Quiroz et al, 2016Quiroz et al, , 2017. We conclude that delayed acceptance, out of the discussed methods, seems to be the only feasible route to obtain exact simulation via data subsampling.…”
Section: Introductionmentioning
confidence: 95%
“…Other authors use a subsample of the data in each MCMC iteration to speed up the algorithm, see e.g. Korattikara et al (2014), Bardenet et al (2014), Maclaurin and Adams (2014), Maire et al (2015), Bardenet et al (2015) and Quiroz et al (2016Quiroz et al ( , 2017. Finally, delayed acceptance MCMC has been used to speed up computations (Banterle et al, 2014;Payne and Mallick, 2015).…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…The log-likelihood is estimated by an unbiased subsampled version, and an unbiased estimator of the likelihood is obtained by correcting the bias of the exponentiation of this estimator. Quiroz, Villani and Kohn (2014) proposed subsampling the data using probability proportional-to-size (PPS) sampling to obtain an approximately unbiased estimate of the likelihood which is used in the MH acceptance step. The subsampling approach was further improved in Quiroz, Villani and Kohn (2015) using the efficient and robust difference estimator from the survey sampling literature.…”
Section: Methodsmentioning
confidence: 99%
“…Recently, there have been several articles working with a random subset of the full dataset at a time as a way to speed up computations (Maclaurin and Adams, 2014;Quiroz et al, 2014). These methods require access to the full dataset, that needs to remain on-hold, and therefore require fre-3…”
Section: Accepted Manuscriptmentioning
confidence: 99%