Companion Proceedings of the Web Conference 2020 2020
DOI: 10.1145/3366424.3382716
|View full text |Cite
|
Sign up to set email alerts
|

Keeping Their Words: Direct and Indirect Chinese Quote Attribution from Newspapers

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(3 citation statements)
references
References 2 publications
0
3
0
Order By: Relevance
“…The existing corpus widely used in quotation extraction and attribution is mostly available in English (Pareti, 2016;Newell et al, 2018;Zhang and Liu, 2022) and Portuguese (Fernandes et al, 2011). Other studies have built language corpora in Chinese (Lee et al, 2020), Norwegian (Salway et al, 2017), Italian (Pareti and Prodanof, 2010), German (Li et al, 2012), Dutch (Atteveldt, 2013) and Arabic (Al-Saif et al, 2018).…”
Section: Recent Workmentioning
confidence: 99%
“…The existing corpus widely used in quotation extraction and attribution is mostly available in English (Pareti, 2016;Newell et al, 2018;Zhang and Liu, 2022) and Portuguese (Fernandes et al, 2011). Other studies have built language corpora in Chinese (Lee et al, 2020), Norwegian (Salway et al, 2017), Italian (Pareti and Prodanof, 2010), German (Li et al, 2012), Dutch (Atteveldt, 2013) and Arabic (Al-Saif et al, 2018).…”
Section: Recent Workmentioning
confidence: 99%
“…Machine learning is often used to solve the task of linking text span to speakers. Sequence labelling using machine learning has been used to assign each quote to the appropriate speaker (O’Keefe et al , 2012; Pareti et al , 2013; Yeung and Lee, 2017; Lee et al , 2020). Both the studies by O’Keefe and Pareti above also attempted to apply the no sequence labelling method.…”
Section: Quotation Extraction and Attribution Taskmentioning
confidence: 99%
“…The corpus that exists today and widely used in research is mostly available in English (Pareti, 2016; Newell et al , 2018) and Portuguese (Fernandes et al , 2011). Other studies have tried to build corpus in Chinese (Lee et al , 2020), Norwegian (Salway et al , 2017), Italian (Pareti and Prodanof, 2010), German (Li et al , 2012), Dutch (Atteveldt, 2013) and Arabic (Alsaif et al , 2018). There is an opportunity to create a new corpus with different languages that uses a standard scheme for named entity recognition (NER) task and add more attributes to enrich the knowledge.…”
Section: Introductionmentioning
confidence: 99%