2018
DOI: 10.1098/rsos.171920

Evaluating prose style transfer with the Bible

Abstract: In the prose style transfer task, a system provided with text input and a target prose style produces output which preserves the meaning of the input text but alters the style. These systems require parallel data for evaluation of results and usually make use of parallel data for training. Currently, there are few publicly available corpora for this task. In this work, we identify a high-quality source of aligned, stylistically distinct text in different versions of the Bible. We provide a standardized split,…

Cited by 29 publications (37 citation statements)
References 35 publications
“…We treat these corpora as domain-unspecific and stylistically neutral. As sources, we used the paraphrases corpus presented in Wieting and Gimpel (2018), MultiNLI corpus (Williams et al, 2018), Quora Kaggle dataset (Quora, 2017) and Bible corpus (Carlson et al, 2018). In the M2A task we removed the Bible subcorpus from the dataset of paraphrases to perform the multi-task training with a small amount of style data (like in the N2C task).…”
Section: Paraphrases Data
Type: mentioning
confidence: 99%
“…The idea of using the Bible as a parallel corpus suitable for ST tasks was presented in Carlson et al (2018). The authors claim that traditionally used sentence and verse demarcation makes for easy sentence alignment.…”
Section: Modern-to-antique Bible Data
Type: mentioning
confidence: 99%
“…Moreover, the system that does not rely on an idea of disentangled latent representations at all shows weaker results than systems with high information disentanglement. It is important to note that there is a variety of methods to assess the quality of style transfer such as PINC (Paraphrase In N-gram Changes) score (Carlson et al, 2018), POS distance (Tian et al, 2018), language fluency, etc. The methodology of style transfer quality assessment is addressed in detail in (Tikhonov et al, 2019), but BLEU between output and input is a very natural all-purpose metric for the task of such type that is common in the style transfer literature.…”
Section: Empirical Measure Of Information Decomposition Quality
Type: mentioning
confidence: 99%
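
For readers unfamiliar with the PINC score named in the statement above, the following is a minimal sketch of its commonly used formulation: one minus the fraction of candidate n-grams that also occur in the source, averaged over n-gram orders. The function names, whitespace tokenization, lowercasing, the default of max_n = 4, and the choice to skip orders for which the candidate has no n-grams are all assumptions made for illustration; they are not taken from the cited papers.

```python
from typing import List, Set, Tuple


def ngrams(tokens: List[str], n: int) -> Set[Tuple[str, ...]]:
    """Return the set of distinct n-grams in a token list."""
    return {tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)}


def pinc(source: str, candidate: str, max_n: int = 4) -> float:
    """Sketch of PINC: higher values mean the candidate shares fewer
    n-grams with the source (more surface change).

    Assumptions: whitespace tokenization, lowercasing, and skipping any
    n-gram order for which the candidate is too short.
    """
    src = source.lower().split()
    cand = candidate.lower().split()
    scores = []
    for n in range(1, max_n + 1):
        cand_ngrams = ngrams(cand, n)
        if not cand_ngrams:
            continue  # candidate has no n-grams of this order
        overlap = len(ngrams(src, n) & cand_ngrams)
        scores.append(1.0 - overlap / len(cand_ngrams))
    return sum(scores) / len(scores) if scores else 0.0


if __name__ == "__main__":
    # Hypothetical sentence pair, used only to exercise the function.
    print(pinc("the cat sat", "the dog sat", max_n=2))
```

PINC only measures how much the wording changed, so it is usually read alongside a meaning-preservation metric such as BLEU rather than on its own.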