Shang‐Yu Su scite author profile

Shang‐Yu Su

5 Publications

203 Citation Statements Received

70 Citation Statements Given


Affiliations

National Taiwan University, AU Optronics (Taiwan)

Publications


Discriminative Deep Dyna-Q: Robust Planning for Dialogue Policy Learning

Su, Li, Gao et al. 2018


This paper presents a Discriminative Deep Dyna-Q (D3Q) approach to improving the effectiveness and robustness of Deep Dyna-Q (DDQ), a recently proposed framework that extends the Dyna-Q algorithm to integrate planning for task-completion dialogue policy learning. To obviate DDQ's high dependency on the quality of simulated experiences, we incorporate an RNN-based discriminator in D3Q to differentiate simulated experience from real user experience in order to control the quality of training data. Experiments show that D3Q significantly outperforms DDQ by controlling the quality of simulated experience used for planning. The effectiveness and robustness of D3Q are further demonstrated in a domain extension setting, where the agent's capability of adapting to a changing environment is tested.
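The quality-control idea in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function `filter_simulated`, the `realism` score, the toy discriminator, and the 0.5 threshold are all hypothetical stand-ins for the RNN-based discriminator the abstract describes.

```python
from typing import Callable, List, Sequence


def filter_simulated(
    experiences: Sequence[dict],
    discriminator: Callable[[dict], float],
    threshold: float = 0.5,  # assumed cutoff, not taken from the paper
) -> List[dict]:
    """Keep only simulated experiences that the discriminator scores as real-looking."""
    return [exp for exp in experiences if discriminator(exp) >= threshold]


def toy_discriminator(exp: dict) -> float:
    """Hypothetical stand-in for a trained discriminator: reads a precomputed score."""
    return exp["realism"]


simulated = [
    {"id": 1, "realism": 0.9},
    {"id": 2, "realism": 0.2},
    {"id": 3, "realism": 0.6},
]

# Only experiences scored at or above the threshold are kept for planning.
kept = filter_simulated(simulated, toy_discriminator)
print([exp["id"] for exp in kept])
```

In this sketch, the low-scoring experience is discarded before it can reach the planning step, which is the general effect the abstract attributes to the discriminator.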


Lifelong Language Knowledge Distillation

Chuang, Su, Chen 2020


It is challenging to perform lifelong language learning (LLL) on a stream of different tasks without any performance degradation compared to the multi-task counterparts. To address this issue, we present Lifelong Language Knowledge Distillation (L2KD), a simple but efficient method that can be easily applied to existing LLL architectures in order to mitigate the degradation. Specifically, when the LLL model is trained on a new task, we assign a teacher model to first learn the new task, and pass the knowledge to the LLL model via knowledge distillation. Therefore, the LLL model can better adapt to the new task while keeping the previously learned knowledge. Experiments show that the proposed L2KD consistently improves previous state-of-the-art models, and the degradation compared to multi-task models in LLL tasks is well mitigated for both sequence generation and text classification tasks.
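As a rough illustration of the knowledge distillation step mentioned above, the sketch below computes a generic soft-target loss: the cross-entropy between a teacher's and a student's temperature-softened output distributions. This is the standard textbook form, not necessarily the exact loss used in L2KD; the function names and the temperature value are assumptions.

```python
import math
from typing import List, Sequence


def softmax(logits: Sequence[float], temperature: float = 1.0) -> List[float]:
    """Numerically stable softmax over temperature-scaled logits."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(
    student_logits: Sequence[float],
    teacher_logits: Sequence[float],
    temperature: float = 2.0,  # assumed value for illustration
) -> float:
    """Cross-entropy of the student's soft predictions against the teacher's."""
    teacher = softmax(teacher_logits, temperature)
    student = softmax(student_logits, temperature)
    return -sum(t * math.log(s) for t, s in zip(teacher, student))


teacher_logits = [2.0, 0.0, -1.0]
# A student that matches the teacher incurs a lower loss than one that disagrees.
matched = distillation_loss([2.0, 0.0, -1.0], teacher_logits)
mismatched = distillation_loss([-1.0, 0.0, 2.0], teacher_logits)
print(matched < mismatched)
```

Minimizing a loss of this kind pulls the student's output distribution toward the teacher's, which is the sense in which the teacher "passes the knowledge" to the LLL model.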


How Time Matters: Learning Time-Decay Attention for Contextual Spoken Language Understanding in Dialogues

Su, Yuan, Chen 2018


Spoken language understanding (SLU) is an essential component in conversational systems. Most SLU components treat each utterance independently, and then the following components aggregate the multi-turn information in separate phases. In order to avoid error propagation and effectively utilize contexts, prior work leveraged history for contextual SLU. However, most previous models only paid attention to the related content in history utterances, ignoring their temporal information. In dialogues, it is intuitive that the most recent utterances are more important than the least recent ones; in other words, time-aware attention should be in a decaying manner. Therefore, this paper designs and investigates various types of time-decay attention on the sentence-level and speaker-level, and further proposes a flexible universal time-decay attention mechanism. The experiments on the benchmark Dialogue State Tracking Challenge (DSTC4) dataset show that the proposed time-decay attention mechanisms significantly improve the state-of-the-art model for contextual understanding performance.
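The intuition that attention should decay with time can be sketched as follows. The abstract does not specify the decay functions, so the reciprocal form `1 / d**power` used here is only one plausible choice, and the names `time_decay_weights`, `attend`, and `power` are hypothetical.

```python
from typing import List, Sequence


def time_decay_weights(distances: Sequence[int], power: float = 1.0) -> List[float]:
    """Normalized weights that shrink as an utterance gets further from the current turn.

    distances[i] is how many turns ago utterance i occurred (1 = most recent).
    """
    if any(d < 1 for d in distances):
        raise ValueError("distances must be >= 1")
    raw = [1.0 / (d ** power) for d in distances]
    total = sum(raw)
    return [r / total for r in raw]


def attend(
    history_vectors: Sequence[Sequence[float]],
    distances: Sequence[int],
    power: float = 1.0,
) -> List[float]:
    """Weighted sum of history utterance vectors using time-decay weights."""
    weights = time_decay_weights(distances, power)
    dim = len(history_vectors[0])
    return [
        sum(w * vec[i] for w, vec in zip(weights, history_vectors))
        for i in range(dim)
    ]


# Three history utterances, listed oldest first (3 turns ago, 2 turns ago, 1 turn ago).
weights = time_decay_weights([3, 2, 1])
context = attend([[1.0, 0.0], [0.0, 1.0]], [2, 1])
print(weights, context)
```

With this choice, the most recent utterance always receives the largest weight, and raising `power` makes the decay steeper.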


Deep Dyna-Q: Integrating Planning for Task-Completion Dialogue Policy Learning

Peng, Li, Gao et al. 2018 (Preprint)


Natural Language Generation by Hierarchical Decoding with Linguistic Patterns

Yeh et al. 2018


Natural language generation (NLG) is a critical component in spoken dialogue systems. Classic NLG can be divided into two phases: (1) sentence planning: deciding on the overall sentence structure, (2) surface realization: determining specific word forms and flattening the sentence structure into a string. Many simple NLG models are based on recurrent neural networks (RNN) and sequence-to-sequence (seq2seq) models, which basically contain an encoder-decoder structure; these NLG models generate sentences from scratch by jointly optimizing sentence planning and surface realization using a simple cross entropy loss training criterion. However, the simple encoder-decoder architecture usually struggles to generate complex and long sentences, because the decoder has to learn all grammar and diction knowledge. This paper introduces a hierarchical decoding NLG model based on linguistic patterns in different levels, and shows that the proposed method outperforms the traditional one with a smaller model size. Furthermore, the design of the hierarchical decoding is flexible and easily extensible in various NLG systems.

