Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conferen 2019
DOI: 10.18653/v1/d19-1676
|View full text |Cite
|
Sign up to set email alerts
|

Cross-lingual intent classification in a low resource industrial setting

Abstract: This paper explores different approaches to multilingual intent classification in a low resource setting. Recent advances in multilingual text representations promise crosslingual transfer for classifiers. We investigate the potential for this transfer in an applied industrial setting and compare to multilingual classification using machine translated text. Our results show that while the recently developed methods show promise, practical application calls for a combination of techniques for useful results.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
9
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 13 publications
(14 citation statements)
references
References 10 publications
0
9
0
Order By: Relevance
“…This work has also empirically validated that there is still ample room for improvement in the intent detection task especially in low-data regimes. Therefore, similar to recent work (Upadhyay et al, 2018;Khalil et al, 2019;Liu et al, 2019c), we will also investigate how to transfer intent detectors to low-resource target languages in few-shot and zero-shot scenarios. We also plan to extend the models to handle out-of-scope prediction (Larson et al, 2019).…”
Section: Discussionmentioning
confidence: 93%
“…This work has also empirically validated that there is still ample room for improvement in the intent detection task especially in low-data regimes. Therefore, similar to recent work (Upadhyay et al, 2018;Khalil et al, 2019;Liu et al, 2019c), we will also investigate how to transfer intent detectors to low-resource target languages in few-shot and zero-shot scenarios. We also plan to extend the models to handle out-of-scope prediction (Larson et al, 2019).…”
Section: Discussionmentioning
confidence: 93%
“…Sources for parallel text can be the OPUS project (Tiedemann, 2012), Bible corpora (Mayer and Cysouw, 2014;Christodoulopoulos and Steedman, 2015) or the recent JW300 corpus (Agić and Vulić, 2019). Instead of using parallel corpora, existing high-resource labeled datasets can also be machine-translated into the low-resource language (Khalil et al, 2019;Zhang et al, 2019a;Fei et al, 2020;Amjad et al, 2020). Cross-lingual projections have even been used with English as a target language for detecting linguistic phenomena like modal sense and telicity that are easier to identify in a different language (Zhou et al, 2015;Marasović et al, 2016;Friedrich and Gateva, 2017).…”
Section: Cross-lingual Annotation Projectionsmentioning
confidence: 99%
“…Compared with English, other languages rarely have datasets with semantic slot values and generally only contain intent category labels. Khalil et al [31] explored the intention classification based on the multilingual transfer ability of English and French. Xie et al [32] used the multiple semantic features to study Chinese user intention classification based on ECDT [33] dataset.…”
Section: Complexitymentioning
confidence: 99%