Proceedings - Natural Language Processing in a Deep Learning World 2019
DOI: 10.26615/978-954-452-056-4_025

Self-Adaptation for Unsupervised Domain Adaptation

Abstract: Lack of labelled data in the target domain for training is a common problem in domain adaptation. To overcome this problem, we propose a novel unsupervised domain adaptation method that combines projection and self-training based approaches. Using the labelled data from the source domain, we first learn a projection that maximises the distance among the nearest neighbours with opposite labels in the source domain. Next, we project the source domain labelled data using the learnt projection and train a classifi…
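The abstract describes a two-step idea: learn a projection that pushes apart nearest neighbours with opposite labels in the source domain, then train a classifier on the projected source data. The sketch below is only a rough illustration of that idea under stated assumptions, not the authors' implementation: the eigenvector formulation of the projection step, the use of NumPy/scikit-learn, and names such as learn_projection and n_components are assumptions made for illustration, and the self-training component of the paper is not shown here.

```python
# A minimal sketch (not the authors' implementation) of the projection idea in the
# abstract: push each source point away from its nearest opposite-label neighbour,
# then train a classifier on the projected source data. The eigen-decomposition
# objective and all variable names are illustrative assumptions.
import numpy as np
from sklearn.linear_model import LogisticRegression

def learn_projection(X, y, n_components=2):
    """Return a linear projection maximising the summed squared distance
    between each point and its nearest opposite-label neighbour."""
    M = np.zeros((X.shape[1], X.shape[1]))
    for i in range(X.shape[0]):
        mask = y != y[i]                              # candidates with the opposite label
        dists = np.linalg.norm(X[mask] - X[i], axis=1)
        j = np.flatnonzero(mask)[np.argmin(dists)]    # nearest opposite-label neighbour
        diff = (X[i] - X[j])[:, None]
        M += diff @ diff.T                            # accumulate outer products
    # Under an orthonormality constraint, the objective is maximised by the
    # top eigenvectors of M (assumed formulation, not taken from the paper).
    _, eigvecs = np.linalg.eigh(M)
    return eigvecs[:, -n_components:].T               # rows are projection directions

# Toy source-domain data: two labelled Gaussian blobs.
rng = np.random.default_rng(0)
X_src = np.vstack([rng.normal(0, 1, (50, 5)), rng.normal(2, 1, (50, 5))])
y_src = np.array([0] * 50 + [1] * 50)

W = learn_projection(X_src, y_src)
clf = LogisticRegression().fit(X_src @ W.T, y_src)    # train on projected source data
print(clf.score(X_src @ W.T, y_src))
```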

Cited by 19 publications (13 citation statements). References 30 publications.
“…The two approaches differ in the specifics of the method to construct the shared space. SCL uses auxiliary functions inspired by (Ando and Zhang, 2005). [Table excerpt: AdaptaBERT; background embeddings (Li et al., 2019); multi-phase pre-training (Gururangan et al., 2020); Hybrid: Mt-Tri (Ruder and Plank, 2018), cross-domain LM (Jia et al., 2019), deep self-training (Rotman and Reichart, 2019), SelfAdapt (Cui and Bollegala, 2019), MTL-DA (Peng and Dredze, 2017), DistanceNet-Bandit (Guo et al., 2020).] … domain-specific and domain-general features is the key idea of EasyAdapt (Daumé III, 2007), a seminal supervised DA method. A recent line of work (Ziser and Reichart, 2017; Ziser and Reichart, 2018a; Ziser and Reichart, 2018b; Ziser and Reichart, 2019) brings SCL back to neural networks.…”
Section: Feature-centric Methods (mentioning)
confidence: 99%
“…Data-centric methods focus on the data aspect and either involve pseudo-labeling (or bootstrapping) to bridge the domain gap, e.g. (Abney, 2007; Zhu and Goldberg, 2009; Ruder and Plank, 2018; Cui and Bollegala, 2019), data selection, e.g. (Axelrod et al., 2011; Plank and van Noord, 2011; Ruder and Plank, 2017), and pre-training methods, e.g.…”
Section: A Categorization of Domain Adaptation in NLP (mentioning)
confidence: 99%
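The citation statement above groups (Cui and Bollegala, 2019) with pseudo-labeling (bootstrapping) approaches that bridge the domain gap. For readers unfamiliar with that idea, the following is a generic self-training loop; it is a minimal sketch rather than the procedure of any cited paper, and the confidence threshold, number of rounds, and scikit-learn classifier are illustrative assumptions.

```python
# A generic pseudo-labeling (self-training) loop: train on labelled source data,
# label the unlabelled target data, keep only high-confidence predictions, and
# retrain. Illustrative sketch only; threshold and round count are arbitrary.
import numpy as np
from sklearn.linear_model import LogisticRegression

def self_train(X_src, y_src, X_tgt, rounds=3, threshold=0.9):
    clf = LogisticRegression().fit(X_src, y_src)
    for _ in range(rounds):
        pred = clf.predict(X_tgt)
        conf = clf.predict_proba(X_tgt).max(axis=1)
        keep = conf >= threshold                          # only trust confident predictions
        if not keep.any():
            break
        X_train = np.vstack([X_src, X_tgt[keep]])
        y_train = np.concatenate([y_src, pred[keep]])
        clf = LogisticRegression().fit(X_train, y_train)  # retrain on source + pseudo-labelled target
    return clf

# Toy example: the target data is a shifted copy of the source distribution.
rng = np.random.default_rng(1)
X_src = np.vstack([rng.normal(0, 1, (50, 3)), rng.normal(3, 1, (50, 3))])
y_src = np.array([0] * 50 + [1] * 50)
X_tgt = X_src + 0.5                                       # simulated domain shift
clf = self_train(X_src, y_src, X_tgt)
print(clf.predict(X_tgt[:5]))
```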
“…Work on the intersection of data-centric and model-centric methods can be plentiful. It currently includes combining semi-supervised objectives with an adversarial loss (Lim et al., 2020; Alam et al., 2018b), combining pivot-based approaches with pseudo-labeling (Cui and Bollegala, 2019) and, very recently, with contextualized word embeddings (Ben-David et al., 2020), and combining multi-task approaches with domain shift (Jia et al., 2019), multi-task learning with pseudo-labeling (multi-task tri-training) (Ruder and Plank, 2018), and adaptive ensembling (Desai et al., 2019), which uses a student-teacher network with a consistency-based self-ensembling loss and a temporal curriculum. They apply adaptive ensembling to study temporal and topic drift in political data classification (Desai et al., 2019).…”
Section: Hybrid Approaches (mentioning)
confidence: 99%
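The passage above mentions adaptive ensembling, a student-teacher network trained with a consistency-based self-ensembling loss. As a rough illustration of the general student-teacher consistency idea (not the method of Desai et al., 2019), the sketch below keeps the teacher's weights as an exponential moving average of the student's and measures disagreement between their predictions on unlabelled target data; the linear model, EMA decay, and squared-error consistency loss are all assumptions.

```python
# Minimal sketch of student-teacher self-ensembling: the teacher is an exponential
# moving average (EMA) of the student, and a consistency loss penalises disagreement
# between their predictions on unlabelled target data. Generic illustration only.
import numpy as np

def softmax(z):
    z = z - z.max(axis=1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def consistency_loss(student_W, teacher_W, X_unlabelled):
    """Mean squared difference between student and teacher class probabilities."""
    p_student = softmax(X_unlabelled @ student_W)
    p_teacher = softmax(X_unlabelled @ teacher_W)
    return np.mean((p_student - p_teacher) ** 2)

def ema_update(teacher_W, student_W, decay=0.99):
    """Teacher weights track the student as an exponential moving average."""
    return decay * teacher_W + (1.0 - decay) * student_W

# Toy usage: random linear classifiers over 3 classes and unlabelled target data.
rng = np.random.default_rng(2)
student_W = rng.normal(size=(5, 3))
teacher_W = student_W.copy()
X_tgt = rng.normal(size=(20, 5))

student_W += rng.normal(scale=0.1, size=student_W.shape)  # stand-in for a gradient step
teacher_W = ema_update(teacher_W, student_W)
print(consistency_loss(student_W, teacher_W, X_tgt))
```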
“…Model-centric methods target approaches to augment the feature space, alter the loss function, the architecture, or model parameters (Blitzer et al., 2006; Pan et al., 2010; Ganin et al., 2016). Data-centric methods focus on the data aspect and either involve pseudo-labeling (or bootstrapping) to bridge the domain gap (Abney, 2007; Zhu and Goldberg, 2009; Ruder and Plank, 2018; Cui and Bollegala, 2019), data selection (Axelrod et al., 2011; Plank and van Noord, 2011; Ruder and Plank, 2017), and pre-training methods (Han and Eisenstein, 2019; Guo et al., 2020). As some approaches take elements of both, we include a hybrid category.…”
Section: Introduction (mentioning)
confidence: 99%