Cross Lingual Transfer Learning for Zero-Resource Domain Adaptation

Abad, Alberto; Bell, P. J.; Carmantini, Andrea; Renals, Steve

doi:10.1109/icassp40776.2020.9054468

“…In vision study, although the images from training distribution and test distribution can be sufficiently different, the train and test distributions mostly share the same support (the pixels are always sampled from a 0-255 integer space), although the density of these distributions can be very different (e.g., photos vs. sketches). On the other hand, domain adaptation of NLP sometimes studies the regime where the supports of the data differ, e.g., the vocabularies can be significantly different in cross-lingual studies (Abad et al, 2020;Zhang et al, 2020a).…”
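The contrast the quoted passage draws between a shared support and a differing support can be illustrated with a small sketch. The pixel values and toy sentences below are invented for illustration, and `support_overlap` is a hypothetical helper, not something taken from the cited survey.

```python
def support_overlap(train_values, test_values):
    """Fraction of distinct test values that also occur in the training data."""
    train_support = set(train_values)
    test_support = set(test_values)
    if not test_support:
        return 1.0
    return len(test_support & train_support) / len(test_support)


# Vision: photos and sketches both draw pixel values from 0..255,
# so the test support is contained in the training support even
# though the densities differ (toy values, assumed for illustration).
photo_pixels = range(256)
sketch_pixels = [0, 255, 0, 255, 128]
pixel_overlap = support_overlap(photo_pixels, sketch_pixels)

# Cross-lingual NLP: the vocabularies themselves can be disjoint
# (toy sentences, assumed for illustration).
english_tokens = "the cat sat on the mat".split()
portuguese_tokens = "o gato sentou no tapete".split()
vocab_overlap = support_overlap(english_tokens, portuguese_tokens)

print(pixel_overlap, vocab_overlap)
```

In this toy setting every sketch pixel value already appears among the photo pixel values, while none of the Portuguese tokens appear in the English vocabulary, which is the regime the passage describes as differing supports.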

Section: Continuous Vs Discrete In Search Space

mentioning

confidence: 99%

Measure and Improve Robustness in NLP Models: A Survey

Wang, Wang, Yang

2022

Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Langua


As NLP models achieved state-of-the-art performances over benchmarks and gained wide applications, it has been increasingly important to ensure the safe deployment of these models in the real world, e.g., making sure the models are robust against unseen or challenging scenarios. Despite robustness being an increasingly studied topic, it has been separately explored in applications like vision and NLP, with various definitions, evaluation and mitigation strategies in multiple lines of research. In this paper, we aim to provide a unifying survey of how to define, measure and improve robustness in NLP. We first connect multiple definitions of robustness, then unify various lines of work on identifying robustness failures and evaluating models' robustness. Correspondingly, we present mitigation strategies that are data-driven, model-driven, and inductive-prior-based, with a more systematic view of how to effectively improve robustness in NLP models. Finally, we conclude by outlining open challenges and future directions to motivate further research in this area.


“…To improve this state of affairs, the natural approach is to adapt a model trained on a much larger amount of adult speech, using these 13 hours as adaptation data. This is the transfer learning (TL) method, widely used in deep learning in general (Abad et al, 2020;Duan et al, 2020). We follow the recommendations of (Shivakumar & Georgiou, 2020), where the authors suggest, for very young children (5-8 years old), applying TL to all layers of the source model.…”

Section: Transfer Learning (Apprentissage Par Transfert)

unclassified

Améliorations d’un système Transformer de reconnaissance de phonèmes appliqué à la parole d'enfants apprenants lecteurs

Gelin, Pellegrini, Pinquier et al.

2022

XXXIVe Journées d'Études Sur La Parole -- JEP 2022


The performance of automatic speech recognition systems for children's speech does not match that of systems for adults: children's speech is hard to recognize and little data is available in French. Here we additionally face reading errors made by young children. We adapt an end-to-end Transformer system to the speech of children learning to read. Multi-objective training with a Connectionist Temporal Classification (CTC) function and joint CTC/attention decoding reduces the phoneme error rate (PER) from 22.9% to 19.6%. We also combine augmentation by adding classroom noise with a novel augmentation that simulates reading errors to improve the system's robustness, and reach a PER of 15.1%. Detailed analyses show that the system is more robust to noise, and that adding the CTC function and the synthetic-error augmentation help to better recognize children's errors.
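The PER figures reported in the abstract above can be turned into relative reductions with a few lines of arithmetic. This is only a sketch: `relative_reduction` is a hypothetical helper, and the code assumes the 15.1% result was obtained on top of the CTC system that reached 19.6%, which the abstract suggests but does not state outright.

```python
def relative_reduction(before, after):
    """Relative reduction of an error rate, in percent of the starting value."""
    if before <= 0:
        raise ValueError("starting error rate must be positive")
    return 100.0 * (before - after) / before


# PER values (in percent) as reported in the abstract.
baseline_per = 22.9   # before CTC multi-objective training
ctc_per = 19.6        # with CTC training and joint CTC/attention decoding
augmented_per = 15.1  # with noise and reading-error augmentation added

# Assumption: the stages are cumulative, so each gain is measured
# against the system from the previous stage.
ctc_gain = relative_reduction(baseline_per, ctc_per)
augmentation_gain = relative_reduction(ctc_per, augmented_per)
total_gain = relative_reduction(baseline_per, augmented_per)

print(f"CTC stage:          {ctc_gain:.1f}% relative")
print(f"Augmentation stage: {augmentation_gain:.1f}% relative")
print(f"Overall:            {total_gain:.1f}% relative")
```

Under that assumption the two stages correspond to relative PER reductions of roughly 14% and 23%, or about 34% overall.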


“…In vision study, although the images from training distribution and test distribution can be sufficiently different, the train and test distributions mostly share the same support (the pixels are always sampled from a 0-255 integer space), although the density of these distributions can be very different (photos vs. sketches). On the other hand, domain adaptation of NLP sometimes studies the regime where the supports of the data differ (e.g., the vocabularies can be significantly different in cross-lingual study (Abad et al, 2020;Zhang et al, 2020a)).…”

Section: Continuous Vs Discrete In Search Space

mentioning

confidence: 99%

Measure and Improve Robustness in NLP Models: A Survey

Wang, Wang, Yang

2021

Preprint


Cross Lingual Transfer Learning for Zero-Resource Domain Adaptation

Cited by 15 publications

References 22 publications

