Sharing ideas through communication with peers is the primary mode of human interaction. Consequently, extensive research has been conducted in the area of conversational AI, leading to an increase in the availability and diversity of conversational tasks, datasets, and methods. However, with numerous tasks being explored simultaneously, the current landscape of conversational AI has become fragmented, and designing a well-grounded dialogue agent from scratch can pose significant challenges for a practitioner. To highlight the critical ingredients a practitioner needs to design a dialogue agent from scratch, this study provides a comprehensive overview of the primary characteristics of a dialogue agent, the supporting tasks, their corresponding open-domain datasets, and the methods used to benchmark these datasets. We observe that distinct dialogue tasks have been tackled with different methods; however, building a separate model for each task is costly and fails to exploit the correlation among the several tasks a dialogue agent must perform. Recent trends therefore suggest a shift toward building unified foundation models. To this end, we propose Unit, a Unified dialogue dataset constructed from conversations drawn from diverse datasets for different dialogue tasks, capturing the nuances of each. We then train a Unified dialogue foundation model, GPT-2$^{\textrm{U}}$, and present a concise comparative analysis of GPT-2$^{\textrm{U}}$'s performance against existing large language models. We also examine the evaluation strategies used to measure the performance of dialogue agents and highlight the scope for future research in conversational AI, with a thorough discussion of popular models such as ChatGPT.