Proceedings of the Fourth International Workshop on Computatinal Linguistics of Uralic Languages 2018
DOI: 10.18653/v1/w18-0201
|View full text |Cite
|
Sign up to set email alerts
|

Dependency Parsing of Code-Switching Data with Cross-Lingual Feature Representations

Abstract: This paper describes the test of a dependency parsing method which is based on bidirectional LSTM feature representations and multilingual word embedding, and evaluates the results on mono-and multilingual data. The results are similar in all cases, with a slightly better results achieved using multilingual data. The languages under investigation are Komi-Zyrian and Russian. Examination of the results by relation type shows that some language specific constructions are correctly recognized even when they appea… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
9
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
7
1

Relationship

1
7

Authors

Journals

citations
Cited by 11 publications
(9 citation statements)
references
References 16 publications
0
9
0
Order By: Relevance
“…This means that NLP experiments often use annotations that are too coarse to be linguistically informative with regard to C-S. Constraint-free theories (Mahootian and Santorini, 1996;MacSwan, 2000) hold that nothing restricts switching apart from the grammatical requirements of the contributing languages. Testing such theories in NLP experiments would require syntactically parsed corpora that are rare for mixed language data (Partanen et al, 2018). In sum, working together, theoretical and computational linguists could create better tools for processing C-S than those currently available.…”
Section: Competing Models Of C-smentioning
confidence: 99%
“…This means that NLP experiments often use annotations that are too coarse to be linguistically informative with regard to C-S. Constraint-free theories (Mahootian and Santorini, 1996;MacSwan, 2000) hold that nothing restricts switching apart from the grammatical requirements of the contributing languages. Testing such theories in NLP experiments would require syntactically parsed corpora that are rare for mixed language data (Partanen et al, 2018). In sum, working together, theoretical and computational linguists could create better tools for processing C-S than those currently available.…”
Section: Competing Models Of C-smentioning
confidence: 99%
“…CS benchmarks: From one of the recent surveys (Sitaram et al, 2019), linguistic CS has been studied in the context of many NLP tasks including language identification (Solorio et al, 2014) (Bali et al, 2014), POS tagging (Das, 2016), NER (Aguilar et al, 2019), parsing (Partanen et al, 2018), sentiment analysis (Vilares et al, 2015), and question answering ) (Raghavi et al, 2015. Many CS datasets have been made available through the shared-task series FIRE (Roy et al, 2013) and CALCS (Aguilar et al, 2018), which have focused mostly on core NLP tasks.…”
Section: Related Workmentioning
confidence: 99%
“…CS benchmarks: From one of the recent surveys (Sitaram et al, 2019), linguistic CS has been studied in the context of many NLP tasks including language identification (Solorio et al, 2014) (Bali et al, 2014), POS tagging (Das, 2016), NER (Aguilar et al, 2019), parsing (Partanen et al, 2018), sentiment analysis (Vilares et al, 2015), and question answering ) (Raghavi et al, 2015. Many CS datasets have been made available through the shared-task series FIRE (Roy et al, 2013) and CALCS (Aguilar et al, 2018), which have focused mostly on core NLP tasks.…”
Section: Related Workmentioning
confidence: 99%