We design an (ε, δ)-differentially private algorithm to estimate the mean of a d-variate distribution, with unknown covariance Σ, that is adaptive to Σ. To within polylogarithmic factors, the estimator achieves optimal rates of convergence with respect to the induced Mahalanobis norm ‖·‖_Σ, takes time O(nd²) to compute, has near-linear sample complexity for sub-Gaussian distributions, allows Σ to be degenerate or low rank, and adaptively extends beyond sub-Gaussianity. Prior to this work, other methods required exponential computation time or the superlinear scaling n = Ω(d^{3/2}) to achieve non-trivial error with respect to the norm ‖·‖_Σ.
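As a point of reference only (the paper's covariance-adaptive estimator is not reproduced here), the sketch below shows the Mahalanobis error ‖μ̂ − μ‖_Σ used to measure accuracy, alongside a naive (ε, δ)-DP mean estimate via the standard Gaussian mechanism; the clipping radius B and the toy data are assumptions made purely for illustration.

```python
# Hedged illustration: NOT the paper's adaptive estimator. It shows (a) the
# Mahalanobis error metric and (b) a naive Gaussian-mechanism DP mean that
# assumes data are clipped to a ball of radius B.
import numpy as np

def mahalanobis_error(mu_hat, mu, Sigma):
    """||mu_hat - mu||_Sigma = sqrt((mu_hat - mu)^T Sigma^+ (mu_hat - mu));
    the pseudoinverse lets Sigma be degenerate or low rank."""
    diff = mu_hat - mu
    return float(np.sqrt(diff @ np.linalg.pinv(Sigma) @ diff))

def naive_dp_mean(X, eps, delta, B):
    """Clip rows to L2 norm B, average, and add Gaussian noise calibrated to
    the L2 sensitivity 2B/n of the clipped mean (standard Gaussian mechanism)."""
    n, d = X.shape
    norms = np.linalg.norm(X, axis=1, keepdims=True)
    X_clipped = X * np.minimum(1.0, B / np.maximum(norms, 1e-12))
    sensitivity = 2.0 * B / n
    sigma = sensitivity * np.sqrt(2.0 * np.log(1.25 / delta)) / eps
    return X_clipped.mean(axis=0) + np.random.normal(0.0, sigma, size=d)

# Toy usage: an anisotropic Gaussian, where Euclidean and Mahalanobis errors differ.
rng = np.random.default_rng(0)
mu, Sigma = np.zeros(3), np.diag([1.0, 0.1, 0.01])
X = rng.multivariate_normal(mu, Sigma, size=5000)
mu_hat = naive_dp_mean(X, eps=1.0, delta=1e-5, B=5.0)
print(mahalanobis_error(mu_hat, mu, Sigma))
```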
While a broad range of techniques have been proposed to tackle distribution shift, the simple baseline of training on an undersampled dataset often achieves close to state-of-the-art accuracy across several popular benchmarks. This is rather surprising, since undersampling algorithms discard excess majority-group data. To understand this phenomenon, we ask whether learning is fundamentally constrained by a lack of minority-group samples. We prove that this is indeed the case in the setting of nonparametric binary classification. Our results show that, in the worst case, an algorithm cannot outperform undersampling unless there is a high degree of overlap between the train and test distributions (which is unlikely to be the case in real-world datasets), or unless the algorithm leverages additional structure about the distribution shift. In particular, in the case of label shift we show that there is always an undersampling algorithm that is minimax optimal. In the case of group-covariate shift, we show that there is an undersampling algorithm that is minimax optimal when the overlap between the group distributions is small. We also perform an experimental case study on a label shift dataset and find that, in line with our theory, the test accuracy of robust neural network classifiers is constrained by the number of minority samples.
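For concreteness, here is a minimal sketch of the undersampling baseline discussed above: discard excess majority-group samples so that every group appears equally often in the training set. The group labels and toy data are placeholders; the downstream classifier is whatever a given benchmark uses.

```python
# Minimal sketch of group-balanced undersampling (illustrative, not the paper's code).
import numpy as np

def undersample(X, y, groups, seed=0):
    """Return a training subset in which every group appears equally often,
    by discarding excess samples from the larger groups."""
    rng = np.random.default_rng(seed)
    group_ids, counts = np.unique(groups, return_counts=True)
    n_min = counts.min()
    keep = []
    for g in group_ids:
        idx = np.flatnonzero(groups == g)
        keep.append(rng.choice(idx, size=n_min, replace=False))
    keep = np.concatenate(keep)
    return X[keep], y[keep]

# Toy usage: 900 majority-group points vs. 100 minority-group points.
X = np.random.randn(1000, 5)
y = np.random.randint(0, 2, size=1000)
groups = np.array([0] * 900 + [1] * 100)
X_bal, y_bal = undersample(X, y, groups)
print(X_bal.shape)  # (200, 5): 100 samples per group
```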
The literature contains two different classifications of solvable Lie algebras of dimensions up to and including 4. This paper is devoted to comparing the two classifications and translating each into the other. In particular, we exhibit an isomorphism between each solvable Lie algebra of one classification and the corresponding algebra of the second. The first classification is provided by de Graaf, and the second classification is from a recent book by Šnobl and Winternitz.
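As a toy illustration of what exhibiting such an isomorphism amounts to (this example is not drawn from either classification), consider two presentations of the unique non-abelian two-dimensional solvable Lie algebra and an explicit change of basis identifying them:

```latex
% Toy example only: two presentations of the same two-dimensional solvable
% Lie algebra and an explicit isomorphism between them.
\[
  \mathfrak{g}_1:\ [e_1, e_2] = e_2,
  \qquad
  \mathfrak{g}_2:\ [f_1, f_2] = f_1 .
\]
\[
  \varphi(e_1) = -f_2,\quad \varphi(e_2) = f_1
  \ \Longrightarrow\
  [\varphi(e_1), \varphi(e_2)] = [-f_2, f_1] = [f_1, f_2] = f_1
  = \varphi(e_2) = \varphi([e_1, e_2]),
\]
so the linear bijection \(\varphi\) preserves the bracket and is a Lie algebra isomorphism.
```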
Importance weighting is a classic technique to handle distribution shifts. However, prior work has presented strong empirical and theoretical evidence demonstrating that importance weights can have little to no effect on overparameterized neural networks. Is importance weighting truly incompatible with the training of overparameterized neural networks? Our paper answers this in the negative. We show that importance weighting fails not because of the overparameterization, but instead, as a result of using exponentially-tailed losses like the logistic or cross-entropy loss. As a remedy, we show that polynomially-tailed losses restore the effects of importance reweighting in correcting distribution shift in overparameterized models. We characterize the behavior of gradient descent on importance weighted polynomially-tailed losses with overparameterized linear models, and theoretically demonstrate the advantage of using polynomially-tailed losses in a label shift setting. Surprisingly, our theory shows that using weights that are obtained by exponentiating the classical unbiased importance weights can improve performance. Finally, we demonstrate the practical value of our analysis with neural network experiments on a subpopulation shift and a label shift dataset. When reweighted, our loss function can outperform reweighted cross-entropy by as much as 9% in test accuracy. Our loss function also gives test accuracies comparable to, or even exceeding, well-tuned state-of-the-art methods for correcting distribution shifts.
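As a hedged sketch (the paper's exact loss and exponent may differ), the snippet below contrasts an importance-weighted logistic loss, whose exponential tail lets per-example weights wash out at large margins, with an importance-weighted polynomially-tailed loss whose tail decays like 1/margin^α; the specific loss shape, the exponent α, and the toy label-shift weights are assumptions for illustration only.

```python
# Illustrative comparison of exponentially- vs. polynomially-tailed weighted losses
# for a linear model with binary labels y in {-1, +1}. Not the paper's exact loss.
import numpy as np

def weighted_logistic_loss(w, X, y, weights):
    """Exponentially-tailed: log(1 + exp(-margin)), reweighted per example."""
    margins = y * (X @ w)
    return np.mean(weights * np.log1p(np.exp(-margins)))

def weighted_poly_loss(w, X, y, weights, alpha=2.0):
    """Polynomially-tailed: decays like 1/(1 + margin)**alpha on positive
    margins, so per-example weights keep influencing the implicit bias."""
    margins = y * (X @ w)
    return np.mean(weights * np.where(margins > 0,
                                      1.0 / (1.0 + margins) ** alpha,
                                      1.0 - alpha * margins))

# Toy label-shift-style weights: upweight the rare class by its inverse frequency.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 10))
y = np.where(rng.random(200) < 0.9, 1, -1)   # 90/10 class imbalance
weights = np.where(y == 1, 1.0, 9.0)         # classical unbiased importance weights
w = np.zeros(10)
print(weighted_logistic_loss(w, X, y, weights),
      weighted_poly_loss(w, X, y, weights))
```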