Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics 2020
DOI: 10.18653/v1/2020.acl-main.645

CamemBERT: a Tasty French Language Model

Abstract: Pretrained language models are now ubiquitous in Natural Language Processing. Despite their success, most available models have either been trained on English data or on the concatenation of data in multiple languages. This makes practical use of such models (in all languages except English) very limited. In this paper, we investigate the feasibility of training monolingual Transformer-based language models for other languages, taking French as an example and evaluating our language models on part-of-speech tagging…
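The model described above is publicly available; below is a minimal sketch of probing it with a fill-mask pipeline through the Hugging Face transformers library. The "camembert-base" identifier is the released base checkpoint; the example sentence is illustrative, not from the paper.

```python
from transformers import pipeline

# CamemBERT is pretrained with masked language modeling, so the released
# "camembert-base" checkpoint can be probed directly by filling in a mask
# (CamemBERT's mask token is "<mask>", unlike BERT's "[MASK]").
fill_mask = pipeline("fill-mask", model="camembert-base")

for pred in fill_mask("Le camembert est <mask> !"):
    print(f'{pred["token_str"]!r}  p={pred["score"]:.3f}')
```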

Cited by 491 publications (242 citation statements).
References 36 publications.
“…In this study, linguistic representation will be used to process French data. Very recently, the CamemBERT [26], FlauBERT [27] and GermanBERT models were released for French and German, while Ernie models are only available in Chinese and English. As far as the authors know, this is the first time that such models are used for feature extraction in order to perform SER tasks.…”
Section: Linguistic Representation (citation type: mentioning)
confidence: 99%
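The study quoted above uses a pretrained language model as a frozen feature extractor. Below is a minimal sketch of one common recipe, mean-pooling CamemBERT's last hidden states into a fixed-size sentence vector; the pooling strategy and the "camembert-base" checkpoint are assumptions here, not details taken from the citing paper.

```python
import torch
from transformers import CamembertModel, CamembertTokenizer

tokenizer = CamembertTokenizer.from_pretrained("camembert-base")
model = CamembertModel.from_pretrained("camembert-base")
model.eval()  # frozen feature extractor: no gradient updates

def extract_features(text: str) -> torch.Tensor:
    """Return one 768-dim vector for `text` by mean-pooling token states."""
    inputs = tokenizer(text, return_tensors="pt", truncation=True)
    with torch.no_grad():
        hidden = model(**inputs).last_hidden_state  # (1, seq_len, 768)
    return hidden.mean(dim=1).squeeze(0)            # (768,)

print(extract_features("Je suis très content de ce résultat.").shape)
```

The resulting vectors can then feed any downstream classifier, which is the usual way such models are used for feature extraction.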
“…A benchmark of NER models on French commercial legal cases has been developed (Benesty 2019). The results encourage the use of the NER bi-directional long short-term memory (Bi-LSTM) model by using the Flair library (Akbik, Blythe, and Vollgraf 2018) and the NER model of CamemBERT (Martin et al. 2020). CamemBERT is a French version of BERT (Bidirectional Encoder Representations from Transformers; Devlin et al. 2018), which is itself based on the encoder part of the Transformer architecture (Vaswani et al. 2017).…”
Section: Introduction (citation type: mentioning)
confidence: 89%
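The benchmark quoted above compares a Flair Bi-LSTM tagger with CamemBERT for NER. Below is a minimal sketch of the Flair side, assuming Flair's general-purpose pretrained "fr-ner" model (trained on WikiNER, not on the legal-domain data used in the benchmark):

```python
from flair.data import Sentence
from flair.models import SequenceTagger

# Flair's pretrained French NER tagger: a Bi-LSTM-CRF over contextual
# string embeddings (Akbik, Blythe, and Vollgraf 2018).
tagger = SequenceTagger.load("fr-ner")

sentence = Sentence("La cour d'appel de Paris a entendu la société Martin.")
tagger.predict(sentence)

for entity in sentence.get_spans("ner"):
    print(entity)  # prints the span text together with its predicted label
```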
“…In order to tackle language-specific problems, different monolingual versions of BERT were trained for different languages. For example, BERTje [36] is a Dutch version, AlBERTo [37] is an Italian version, and CamemBERT [38] and FlauBERT [39] are two different models for French. These models outperform vanilla BERT on different NLP tasks specific to these languages.…”
Section: Specific Language Models (citation type: mentioning)
confidence: 99%
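Since the statement above contrasts monolingual models with vanilla (multilingual) BERT, here is a minimal sketch of loading either kind of checkpoint through the transformers Auto classes; the specific identifiers are common public checkpoints, not ones named by the citing paper.

```python
from transformers import AutoModel, AutoTokenizer

# Swapping a monolingual French model for the multilingual baseline
# requires no code changes beyond the checkpoint name, which is what
# makes monolingual-vs-multilingual comparisons cheap to set up.
for name in ("camembert-base", "bert-base-multilingual-cased"):
    tokenizer = AutoTokenizer.from_pretrained(name)
    model = AutoModel.from_pretrained(name)
    print(name, "hidden size:", model.config.hidden_size)
```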