Abstract:This paper studies clustering for possibly high dimensional data (e.g. images, time series, gene expression data, and many other settings), and rephrase it as low rank matrix estimation in the PAC-Bayesian framework. Our approach leverages the well known Burer-Monteiro factorisation strategy from large scale optimisation, in the context of low rank estimation. Moreover, our Burer-Monteiro factors are shown to lie on a Stiefel manifold. We propose a new generalized Bayesian estimator for this problem and prove … Show more
“…Minimax rates for matrix completion and more general matrix estimation problems were derived in [32,8,30,31,41]. Bayesian estimators and aggregation procedures were studied in [1,47,38,2,37,4,17,36,16,15].…”
Matrix factorization is a powerful data analysis tool. It has been used in multivariate time series analysis, leading to the decomposition of the series in a small set of latent factors. However, little is known on the statistical performances of matrix factorization for time series. In this paper, we extend the results known for matrix estimation in the i.i.d setting to time series. Moreover, we prove that when the series exhibit some additional structure like periodicity or smoothness, it is possible to improve on the classical rates of convergence.Acknowledgements. The authors gratefully acknowledge Maxime Ossonce for discussions which helped to improve the paper. The first author was working at CREST, EN-SAE Paris when this paper was written; he gratefully acknowledges financial support from Labex ECODEC (ANR-11-LABEX-0047).
“…Minimax rates for matrix completion and more general matrix estimation problems were derived in [32,8,30,31,41]. Bayesian estimators and aggregation procedures were studied in [1,47,38,2,37,4,17,36,16,15].…”
Matrix factorization is a powerful data analysis tool. It has been used in multivariate time series analysis, leading to the decomposition of the series in a small set of latent factors. However, little is known on the statistical performances of matrix factorization for time series. In this paper, we extend the results known for matrix estimation in the i.i.d setting to time series. Moreover, we prove that when the series exhibit some additional structure like periodicity or smoothness, it is possible to improve on the classical rates of convergence.Acknowledgements. The authors gratefully acknowledge Maxime Ossonce for discussions which helped to improve the paper. The first author was working at CREST, EN-SAE Paris when this paper was written; he gratefully acknowledges financial support from Labex ECODEC (ANR-11-LABEX-0047).
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.