Ronak Mehta scite author profile

Ronak Mehta

5Publications

12Citation Statements Received

54Citation Statements Given

How they've been cited

How they cite others

Affiliations

Indian Institute of Technology Gandhinagar, Johns Hopkins University, University of Cape Town

Publications

Order By: Most citations

Towards a theory of out-of-distribution learning

Geisa¹,

Mehta²,

Helm³

et al. 2021

Preprint

View full text Add to dashboard Cite

What is learning? 20 th century formalizations of learning theory-which precipitated revolutions in artificial intelligence-focus primarily on in-distribution learning, that is, learning under the assumption that the training data are sampled from the same distribution as the evaluation distribution. This assumption renders these theories inadequate for characterizing 21 st century real world data problems, which are typically characterized by evaluation distributions that differ from the training data distributions (referred to as out-of-distribution learning). We therefore make a small change to existing formal definitions of learnability by relaxing that assumption. We then introduce learning efficiency (LE) to quantify the amount a learner is able to leverage data for a given problem, regardless of whether it is an in-or out-of-distribution problem. We then define and prove the relationship between generalized notions of learnability, and show how this framework is sufficiently general to characterize transfer, multitask, meta, continual, and lifelong learning. We hope this unification helps bridge the gap between empirical practice and theoretical guidance in real world problems. Finally, because biological learning continues to outperform machine learning algorithms on certain OOD challenges, we discuss the limitations of this framework vis-á-vis its ability to formalize biological learning, suggesting multiple avenues for future research.

show abstract

Representation Ensembling for Synergistic Lifelong Learning with Quasilinear Complexity

Vogelstein¹,

Dey²,

Helm³

et al. 2020

Preprint

View full text Add to dashboard Cite

Manifold Oblique Random Forests: Towards Closing the Gap on Convolutional Deep Networks

A¹,

Perry²,

Huynh³

et al. 2019

Preprint

View full text Add to dashboard Cite

Decision forests (DF), in particular random forests and gradient boosting trees, have demonstrated state-of-the-art accuracy compared to other methods in many supervised learning scenarios. In particular, DFs dominate other methods in tabular data, that is, when the feature space is unstructured, so that the signal is invariant to permuting feature indices. However, in structured data lying on a manifold-such as images, text, and speech-neural nets (NN) tend to outperform DFs. We conjecture that at least part of the reason for this is that the input to NN is not simply the feature magnitudes, but also their indices (for example, the convolution operation uses "feature locality"). In contrast, naïve DF implementations fail to explicitly consider feature indices. A recently proposed DF approach demonstrates that DFs, for each node, implicitly sample a random matrix from some specific distribution. Here, we build on that to show that one can choose distributions in a manifold aware fashion. For example, for image classification, rather than randomly selecting pixels, one can randomly select contiguous patches. We demonstrate the empirical performance of data living on three different manifolds: images, time-series, and a torus. In all three cases, our Manifold Forest (Morf) algorithm empirically dominates other state-of-the-art approaches that ignore feature space structure, achieving a lower classification error on all sample sizes. This dominance extends to the MNIST data set as well. Moreover, both training and test time is significantly faster for manifold forests as compared to deep nets. This approach, therefore, has promise to enable DFs and other machine learning methods to close the gap with deep nets on manifold-valued data.

show abstract

Independence Testing for Multivariate Time Series

Mehta¹,

Chung²,

Shen³

et al. 2019

Preprint

View full text Add to dashboard Cite

Deep Unlearning via Randomized Conditionally Independent Hessians

Mehta¹,

Pal²,

Singh³

et al. 2022

Preprint

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Ronak Mehta

Towards a theory of out-of-distribution learning

Representation Ensembling for Synergistic Lifelong Learning with Quasilinear Complexity

Manifold Oblique Random Forests: Towards Closing the Gap on Convolutional Deep Networks

Independence Testing for Multivariate Time Series

Deep Unlearning via Randomized Conditionally Independent Hessians

Contact Info

Product

Resources

About