Rudrajit Das scite author profile

In this paper, we propose a new method to perform Sparse Kernel Principal Component Analysis (SKPCA) and also mathematically analyze the validity of SKPCA. We formulate SKPCA as a constrained optimization problem with elastic net regularization (Hastie et al.) in kernel feature space and solve it. We consider outlier detection (where KPCA is employed) as an application for SKPCA, using the RBF kernel. We test it on 5 real world datasets and show that by using just 4% (or even less) of the principal components (PCs), where each PC has on average less than 12% non-zero elements in the worst case among all 5 datasets, we are able to nearly match and in 3 datasets even outperform KPCA. We also compare the performance of our method with a recently proposed method for SKPCA by Wang et al., and show that our method performs better in terms of both accuracy and sparsity. We also provide a novel probabilistic proof to justify the existence of sparse solutions for KPCA using the RBF kernel. To the best of our knowledge, this is the first attempt at theoretically analyzing the validity of SKPCA.

show abstract

Sparse Kernel PCA for Outlier Detection

Das¹,

Golatkar²,

Awate³

2018

Preprint

View full text Add to dashboard Cite

On the Benefits of Multiple Gossip Steps in Communication-Constrained Decentralized Optimization

Hashemi¹,

Acharya²,

Das³

et al. 2020

Preprint

View full text Add to dashboard Cite

In decentralized optimization, it is common algorithmic practice to have nodes interleave (local) gradient descent iterations with gossip (i.e. averaging over the network) steps. Motivated by the training of large-scale machine learning models, it is also increasingly common to require that messages be lossy compressed versions of the local parameters. In this paper we show that, in such compressed decentralized optimization settings, there are benefits to having multiple gossip steps between subsequent gradient iterations, even when the cost of doing so is appropriately accounted for e.g. by means of reducing the precision of compressed information. In particular, we show that having O(log 1 ) gradient iterations with constant step size -and O(log 1 ) gossip steps between every pair of these iterations -enables convergence to within of the optimal value for smooth non-convex objectives satisfying Polyak-Łojasiewicz condition. This result also holds for smooth strongly convex objectives. To our knowledge, this is the first work that derives convergence results for nonconvex optimization under arbitrary communication compression. 1 * Equal Contribution 1 Throughout the paper we have used gossip/consensus, graph/network and node/client/agent interchangeably.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Rudrajit Das

Faster Non-Convex Federated Learning via Global and Local Momentum

On the Benefits of Multiple Gossip Steps in Communication-Constrained Decentralized Federated Learning

Sparse Kernel PCA for Outlier Detection

Sparse Kernel PCA for Outlier Detection

On the Benefits of Multiple Gossip Steps in Communication-Constrained Decentralized Optimization

Contact Info

Product

Resources

About