2021
DOI: 10.48550/arxiv.2110.06273
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Småprat: DialoGPT for Natural Language Generation of Swedish Dialogue by Transfer Learning

Abstract: Building open-domain conversational systems (or chatbots) that produce convincing responses is a recognized challenge.Recent state-of-theart (SoTA) transformer-based models for the generation of natural language dialogue have demonstrated impressive performance in simulating human-like, single-turn conversations in English. This work investigates, by an empirical study, the potential for transfer learning of such models to Swedish language. DialoGPT, an English language pre-trained model, is adapted by trainin… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
8
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(8 citation statements)
references
References 21 publications
0
8
0
Order By: Relevance
“…GPT models are specifically designed to generate natural language content (Black et al, 2022;Adewumi et al, 2021), including sentences, paragraphs, and even entire papers, while upholding grammatical consistency and human language rules . GPT models' main feature is their capacity to be pre-trained on substantial volumes of textual data (Markel et al, 2023) and then adjusted for certain tasks that come after, including text categorization or question answering (Floridi & Chiriatti, 2020).…”
Section: Gptmentioning
confidence: 99%
See 1 more Smart Citation
“…GPT models are specifically designed to generate natural language content (Black et al, 2022;Adewumi et al, 2021), including sentences, paragraphs, and even entire papers, while upholding grammatical consistency and human language rules . GPT models' main feature is their capacity to be pre-trained on substantial volumes of textual data (Markel et al, 2023) and then adjusted for certain tasks that come after, including text categorization or question answering (Floridi & Chiriatti, 2020).…”
Section: Gptmentioning
confidence: 99%
“…ChatGPT has the capacity to impact a substantial segment of the world's populace (Haleem et l., 2022). The field of Natural Language Processing (NLP) (Peng et al, 2023;Adewumi et al, 2021), a subfield of Artificial Intelligence (AI), is where ChatGPT first emerged (Remountakis et al, 2023). NLP is devoted to helping machines understand and produce human language (Remountakis et al, 2023).…”
Section: Introductionmentioning
confidence: 99%
“…The second technique involves the use of the dialogue (conversation) model checkpoint by Adewumi et al [8], which was finetuned on the Multi-Domain Wizard-of-Oz (MultiWOZ) dataset by Eric et al [46]. It is an autoregressive model based on the pretrained DialoGPT-medium model by Zhang et al [7].…”
Section: Data Augmentationmentioning
confidence: 99%
“…Additional hyperparameters include maximum decoding length, set to 200 tokens; temperature, set to 0.8; and maximum ngram repeat limit, set to 3. These hyperparameters are based on previous work, as they have been shown to perform well [8].…”
Section: Data Augmentationmentioning
confidence: 99%
See 1 more Smart Citation