2017
DOI: 10.1007/s10579-017-9407-6

The corpus of Basque simplified texts (CBST)

Abstract: In this paper we present the corpus of Basque simplified texts. This corpus compiles 227 original sentences from the science popularisation domain and two simplified versions of each sentence. The simplified versions have been created following different approaches: the structural, by a court translator who considers easy-to-read guidelines, and the intuitive, by a teacher based on her experience. The aim of this corpus is to make a comparative analysis of simplified text. To that end, we also present the annotation …

Cited by 9 publications (7 citation statements)
References 22 publications
“…Since 2016 such corpora have been created for a few other languages, e.g. for French (Grabar and Cardon, 2018) or Basque (Gonzalez-Dios et al., 2018) - Hungarian, however, is not among these languages.…”
Section: Corpora (mentioning)
confidence: 99%
“…In this case, texts were simplified by a school teacher who aimed at making them easier-to-read for students, especially L2 learners. Similarly, for the Basque language, Gonzalez-Dios et al. (2017) gathered a collection of documents belonging to the scientific popularization domain manually simplified by a court translator according to easy-to-read guidelines (as representative of the “structural” strategy) and by a teacher based on her/his experience (as representative of the “intuitive” strategy).…”
Section: Manual Vs (Semi-)automatic Approach To Corpora Construction (mentioning)
confidence: 99%
“…The first type of typologies aims at identifying the linguistic operations at work during the simplification process. They have been studied for a variety of languages: Spanish (Bott and Saggion, 2014), Italian (Brunato et al., 2014, 2022), French (Koptient et al., 2019; Gala et al., 2020), Brazilian Portuguese (Caseli et al., 2009), Basque (Gonzalez-Dios et al., 2018) and English (Amancio and Specia, 2014). While they share some operations, for instance the transition from the verbal passive voice to the active voice, those typologies also have specific categories such as proximization (Bott and Saggion, 2014) - changing a sentence in order to address the reader directly - or specification (Koptient et al., 2019) - keeping a difficult term but adding an explanation next to it.…”
Section: Linguistically Based Operation Descriptions (mentioning)
confidence: 99%