2013
DOI: 10.5087/dad.2013.208
|View full text |Cite
|
Sign up to set email alerts
|

Turkish Discourse Bank: Porting a discourse annotation style to a morphologically rich language

Abstract: This paper briefly describes the Turkish Discourse Bank, the first publicly available annotated discourse resource for Turkish. It focuses on the challenges posed by annotating Turkish, a free word order language with rich inflectional and derivational morphology. It shows the usefulness of the PDTB style annotation but points out the need to expand this annotation style with the needs of the target language.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
18
0

Year Published

2014
2014
2020
2020

Publication Types

Select...
6
2
1

Relationship

3
6

Authors

Journals

citations
Cited by 48 publications
(19 citation statements)
references
References 13 publications
1
18
0
Order By: Relevance
“…The Penn Discourse Treebank (PDTB V3, Prasad et al 2019) is the largest discourse annotated corpus of English, and the largest resource annotated explicitly for discourse relation signals such as connectives, with similar corpora having been developed for a variety of languages (e.g. Zeyrek et al 2013for Turkish, Zhou et al 2014. However the annotation scheme used by PDTB is ahierarchical, annotating only pairs of textual argument spans connected by a discourse relation, and disregarding relations at higher levels, such as relations between paragraphs or other groups of discourse units.…”
Section: Discourse Relation Signal Annotationsmentioning
confidence: 99%
“…The Penn Discourse Treebank (PDTB V3, Prasad et al 2019) is the largest discourse annotated corpus of English, and the largest resource annotated explicitly for discourse relation signals such as connectives, with similar corpora having been developed for a variety of languages (e.g. Zeyrek et al 2013for Turkish, Zhou et al 2014. However the annotation scheme used by PDTB is ahierarchical, annotating only pairs of textual argument spans connected by a discourse relation, and disregarding relations at higher levels, such as relations between paragraphs or other groups of discourse units.…”
Section: Discourse Relation Signal Annotationsmentioning
confidence: 99%
“…In building the TCL, we use three PDTBinspired annotated corpora to compile explicit DCs, namely, Turkish Discourse Bank or TDB 1.0 (Zeyrek et al, 2013), TDB 1.1 (Zeyrek and Kurfalı, 2017), and the Turkish section of TED-MDB.…”
Section: Data Sourcesmentioning
confidence: 99%
“…There are several discourse-annotated corpora in different theoretical frameworks. The PDTB [18] style of annotation has been applied to other languages besides English, such as Turkish [33], Chinese [35], Czech [26], and applied to English and French speech data [6]. For Brazilian Portuguese, several corpora have been annotated in the RST and CST frameworks (CSTNews, CorpusTCC, Rhetalho, Summ-it) [1,14].…”
Section: Related Workmentioning
confidence: 99%