Govardana Sachithanandam Ramachandran scite author profile

Govardana Sachithanandam Ramachandran

5 Publications

5 Citation Statements Received

40 Citation Statements Given

Affiliations

Salesforce (United States)

Publications

[CASPI] Causal-aware Safe Policy Improvement for Task-oriented Dialogue

Ramachandran, Hashimoto, Xiong

2022

The recent success of reinforcement learning (RL) in solving complex tasks is often attributed to its capacity to explore and exploit an environment. Sample efficiency is usually not an issue for tasks with cheap simulators from which data can be sampled online. Task-oriented Dialogues (ToD), on the other hand, are usually learnt from offline data collected through human demonstrations, and collecting diverse demonstrations and annotating them is expensive. Unfortunately, RL policies trained on off-policy data are prone to issues of bias and generalization, which are further exacerbated by the stochasticity of human responses and the non-Markovian nature of the annotated belief state of a dialogue management system. To this end, we propose a batch-RL framework for ToD policy learning: Causal-aware Safe Policy Improvement (CASPI). CASPI includes a mechanism to learn a fine-grained reward that captures the intention behind human responses, and it also offers a guarantee on the dialogue policy's performance against a baseline. We demonstrate the effectiveness of this framework on the end-to-end dialogue task of the MultiWOZ 2.0 dataset. The proposed method outperforms the current state of the art. Furthermore, we demonstrate sample efficiency: our method, trained on only 20% of the data, is comparable to the current state-of-the-art method trained on 100% of the data on two out of three evaluation metrics.

GAEA: Graph Augmentation for Equitable Access via Reinforcement Learning

Ramachandran, Brugere, Varshney, et al. 2021

GAEA: Graph Augmentation for Equitable Access via Reinforcement Learning

Ramachandran, Brugere, Varshney, et al. 2020

Preprint

Proceedings of the 2021 AAAI/ACM Conference on AI, Ethics, and Society

Ramachandran, Brugere, Varshney, et al. 2021

Causal-aware Safe Policy Improvement for Task-oriented dialogue

Ramachandran, Hashimoto, Xiong

2021

Preprint
