This paper investigates a problem of broad practical interest, namely, the reconstruction of a large-dimensional low-rank tensor from highly incomplete and randomly corrupted observations of its entries. Although a number of papers have been dedicated to this tensor completion problem, prior algorithms either are computationally too expensive for large-scale applications or come with suboptimal statistical performance. Motivated by this, we propose a fast two-stage nonconvex algorithm—a gradient method following a rough initialization—that achieves the best of both worlds: optimal statistical accuracy and computational efficiency. Specifically, the proposed algorithm provably completes the tensor and retrieves all low-rank factors within nearly linear time, while at the same time enjoying near-optimal statistical guarantees (i.e., minimal sample complexity and optimal estimation accuracy). The insights conveyed through our analysis of nonconvex optimization might have implications for a broader family of tensor reconstruction problems beyond tensor completion.
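To make the two-stage recipe concrete, the following is a minimal numpy sketch of this style of algorithm, not the paper's exact procedure: the known sampling rate p, the step size eta, the iteration budget, and the crude rank-1 splitting used to seed V and W are all illustrative assumptions.

```python
import numpy as np

def complete_tensor(T_obs, mask, r, p, eta=0.2, iters=500):
    """Two-stage nonconvex tensor completion sketch.
    T_obs: d1 x d2 x d3 tensor with zeros at unobserved entries;
    mask: {0,1} tensor marking observed entries; r: CP rank;
    p: sampling rate (assumed known here). Returns CP factors U, V, W."""
    d1, d2, d3 = T_obs.shape

    # --- Stage 1: rough spectral initialization via the mode-1 unfolding ---
    M = (T_obs / p).reshape(d1, d2 * d3)     # inverse-probability rescaling
    U0, s, Vt = np.linalg.svd(M, full_matrices=False)
    U = U0[:, :r] * np.sqrt(s[:r])           # balanced scaling across factors
    V = np.zeros((d2, r))
    W = np.zeros((d3, r))
    for l in range(r):
        # split the l-th right singular vector into a rank-1 d2 x d3 piece
        Bl = (np.sqrt(s[l]) * Vt[l]).reshape(d2, d3)
        u_l, s_l, v_lt = np.linalg.svd(Bl, full_matrices=False)
        V[:, l] = u_l[:, 0] * np.sqrt(s_l[0])
        W[:, l] = v_lt[0] * np.sqrt(s_l[0])

    # --- Stage 2: gradient descent on the observed squared loss ---
    for _ in range(iters):
        X = np.einsum('il,jl,kl->ijk', U, V, W)   # current CP reconstruction
        R = mask * (X - T_obs) / p                # residual on observed entries
        gU = np.einsum('ijk,jl,kl->il', R, V, W)
        gV = np.einsum('ijk,il,kl->jl', R, U, W)
        gW = np.einsum('ijk,il,jl->kl', R, U, V)
        U -= eta * gU
        V -= eta * gV
        W -= eta * gW
    return U, V, W
```

The fixed step size is for illustration only; in practice it would be tuned to the tensor's scale, and the dense einsum updates would be replaced by sparse operations over the observed entries to attain the near-linear running time the paper advertises.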
This paper is concerned with estimating the column space of an unknown low-rank matrix $\mathbf{A} \in \mathbb{R}^{d_1 \times d_2}$, given noisy and partial observations of its entries. There is no shortage of scenarios where the observations, while being too noisy to support faithful recovery of the entire matrix, still convey sufficient information to enable reliable estimation of the column space of interest. This is particularly evident and crucial for the highly unbalanced case where the column dimension $d_2$ far exceeds the row dimension $d_1$, which is the focal point of the current paper. We investigate an efficient spectral method, which operates upon the sample Gram matrix with diagonal deletion. While this algorithmic idea has been studied before, we establish new statistical guarantees for this method in terms of both $\ell_2$ and $\ell_{2,\infty}$ estimation accuracy, which improve upon prior results if $d_2$ is substantially larger than $d_1$. To illustrate the effectiveness of our findings, we derive matching minimax lower bounds with respect to the noise levels, and develop consequences of our general theory for three applications of practical importance: (1) tensor completion from noisy data, (2) covariance estimation/principal component analysis with missing data and (3) community recovery in bipartite graphs. Our theory leads to improved performance guarantees for all three cases.
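As a concrete illustration of the algorithmic idea, here is a minimal numpy sketch under a Bernoulli-sampling assumption (each entry observed independently with known rate p, unobserved entries stored as zeros); the 1/p² rescaling is an illustrative choice, not necessarily the paper's normalization.

```python
import numpy as np

def column_space_estimate(A_obs, p, r):
    """Spectral method with diagonal deletion.
    A_obs: d1 x d2 matrix of observed entries (zeros where missing);
    p: sampling rate; r: target rank.
    Returns a d1 x r orthonormal basis estimating the column space."""
    # Off-diagonal Gram entries sum products of two independently observed
    # entries, hence the 1/p^2 rescaling under Bernoulli sampling.
    G = (A_obs @ A_obs.T) / (p ** 2)
    # Diagonal deletion: the diagonal carries a systematic bias from squared
    # noise and squared observed entries, so it is zeroed out.
    np.fill_diagonal(G, 0.0)
    evals, evecs = np.linalg.eigh(G)         # eigenvalues in ascending order
    return evecs[:, -r:]                     # top-r eigenvectors span the estimate
```

The appeal of the method is its simplicity: one Gram product, one diagonal deletion, and one symmetric eigendecomposition of a d1 x d1 matrix, which is cheap precisely in the unbalanced regime d2 >> d1 that the paper targets.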
Rényi-type generalizations of entropy, relative entropy and mutual information have found numerous applications throughout information theory and beyond. While there is consensus that the ways A. Rényi generalized entropy and relative entropy in 1961 are the “right” ones, several candidates have been put forth as possible mutual informations of order α. In this paper we lend further evidence to the notion that a Bayesian measure of statistical distinctness introduced by R. Sibson in 1969 (closely related to Gallager’s $E_0$ function) is the most natural generalization, lending itself to explicit computation and maximization, as well as closed-form formulas. This paper considers general (not necessarily discrete) alphabets and extends the major analytical results on the saddle-point and saddle-level of the conditional relative entropy to the conditional Rényi divergence. Several examples illustrate the main application of these results, namely, the maximization of α-mutual information with and without constraints.
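For concreteness, in the discrete case Sibson's proposal admits a closed form and ties directly to Gallager's $E_0$; the following display is a standard identity written out for reference, whereas the paper itself works over general alphabets:

$$
I_\alpha(X;Y)
\;=\; \min_{Q_Y} D_\alpha\big(P_{Y|X}\,\big\|\,Q_Y\,\big|\,P_X\big)
\;=\; \frac{\alpha}{\alpha-1}\,\log \sum_{y}\Bigg(\sum_{x} P_X(x)\,P_{Y|X}(y\,|\,x)^{\alpha}\Bigg)^{1/\alpha},
$$

and, setting $\alpha = \frac{1}{1+\rho}$, Gallager's function is recovered as $E_0(\rho, P_X) = \rho\, I_{1/(1+\rho)}(X;Y)$. The outer minimization over $Q_Y$ is exactly the saddle-point structure that the paper extends from the conditional relative entropy to the conditional Rényi divergence.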