Shreyas Chaudhari scite author profile

Shreyas Chaudhari

4Publications

57Citation Statements Received

93Citation Statements Given

How they've been cited

How they cite others

109

Affiliations

Carnegie Mellon University, Uber AI (United States), Indian Institute of Technology Madras

Publications

Order By: Most citations

Off-Dynamics Reinforcement Learning: Training for Transfer with Domain Classifiers

Eysenbach¹,

Asawa²,

Chaudhari³

et al. 2020

Preprint

View full text Add to dashboard Cite

We propose a simple, practical, and intuitive approach for domain adaptation in reinforcement learning. Our approach stems from the idea that the agent's experience in the source domain should look similar to its experience in the target domain. Building off of a probabilistic view of RL, we formally show that we can achieve this goal by compensating for the difference in dynamics by modifying the reward function. This modified reward function is simple to estimate by learning auxiliary classifiers that distinguish source-domain transitions from target-domain transitions. Intuitively, the modified reward function penalizes the agent for visiting states and taking actions in the source domain which are not possible in the target domain. Said another way, the agent is penalized for transitions that would indicate that the agent is interacting with the source domain, rather than the target domain. Our approach is applicable to domains with continuous states and actions and does not require learning an explicit model of the dynamics. On discrete and continuous control tasks, we illustrate the mechanics of our approach and demonstrate its scalability to high-dimensional tasks. * Equal contribution.Preprint. Under review.

show abstract

Unsupervised Clustering of Time Series Signals Using Neuromorphic Energy-Efficient Temporal Neural Networks

Chaudhari

Nair

Moura

et al. 2021

View full text Add to dashboard Cite

Unsupervised time series clustering is a challenging problem with diverse industrial applications such as anomaly detection, bio-wearables, etc. These applications typically involve small, low-power devices on the edge that collect and process real-time sensory signals. State-of-the-art time-series clustering methods perform some form of loss minimization that is extremely computationally intensive from the perspective of edge devices. In this work, we propose a neuromorphic approach to unsupervised time series clustering based on Temporal Neural Networks that is capable of ultra lowpower, continuous online learning. We demonstrate its clustering performance on a subset of UCR Time Series Archive datasets. Our results show that the proposed approach either outperforms or performs similarly to most of the existing algorithms while being far more amenable for efficient hardware implementation. Our hardware assessment analysis shows that in 7 nm CMOS the proposed architecture, on average, consumes only about 0.005 mm 2 die area and 22 µW power and can process each signal with about 5 ns latency.

show abstract

Multi-Armed Bandits With Correlated Arms

Gupta

Chaudhari

Joshi

et al. 2021

IEEE Trans. Inform. Theory

View full text Add to dashboard Cite

A Unified Approach to Translate Classical Bandit Algorithms to the Structured Bandit Setting

Gupta

Chaudhari

Mukherjee

et al. 2020

IEEE J. Sel. Areas Inf. Theory

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.