Proceedings of the 55th Annual Meeting of the Association For Computational Linguistics (Volume 1: Long Papers) 2017
DOI: 10.18653/v1/p17-1050
|View full text |Cite
|
Sign up to set email alerts
|

Predicting Native Language from Gaze

Abstract: A fundamental question in language learning concerns the role of a speaker's first language in second language acquisition. We present a novel methodology for studying this question: analysis of eye-movement patterns in second language reading of free-form text. Using this methodology, we demonstrate for the first time that the native language of English learners can be predicted from their gaze fixations when reading English. We provide analysis of classifier uncertainty and learned features, which indicates … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
18
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
3
3
1

Relationship

1
6

Authors

Journals

citations
Cited by 14 publications
(18 citation statements)
references
References 25 publications
0
18
0
Order By: Relevance
“…We make use of the lang2vec tool (Lit-tell et al, 2017) using the URIEL Typological Database (Littell et al, 2016) to compute the similarity between pairs of languages. Similar to the approach of Berzak et al (2017), all the 103 available morphosyntactic features in URIEL are obtained; these are derived from the World Atlas of Language Structures (WALS) (Dryer and Haspelmath, 2013), Syntactic Structures of the Worlds Languages (SSWL) (Collins and Kayne, 2009) and Ethnologue (Lewis et al, 2009). Missing feature values are filled with a prediction from a k-nearest neighbors classifier.…”
Section: Effect Of Translationese On Different Language Pairsmentioning
confidence: 99%
See 1 more Smart Citation
“…We make use of the lang2vec tool (Lit-tell et al, 2017) using the URIEL Typological Database (Littell et al, 2016) to compute the similarity between pairs of languages. Similar to the approach of Berzak et al (2017), all the 103 available morphosyntactic features in URIEL are obtained; these are derived from the World Atlas of Language Structures (WALS) (Dryer and Haspelmath, 2013), Syntactic Structures of the Worlds Languages (SSWL) (Collins and Kayne, 2009) and Ethnologue (Lewis et al, 2009). Missing feature values are filled with a prediction from a k-nearest neighbors classifier.…”
Section: Effect Of Translationese On Different Language Pairsmentioning
confidence: 99%
“…Truncating features with the same value for all the languages present in our study, 87 features remain, consisting of 60 syntactic features and 27 family tree features. We then measure the level of relatedness between two languages using the linguistic similarity (LS) by Berzak et al (2017) (Equation 1), i.e. the cosine similarity between the URIEL feature vectors for two languages v y and v y .…”
Section: Effect Of Translationese On Different Language Pairsmentioning
confidence: 99%
“…This feature set was previously used in Berzak et al (2017) and consists of reading times for words within fixed contexts. We extract FP and TF durations for the 900 words in the Fixed Text sentences, resulting in a total of 1,800 WFC features.…”
Section: Words In Fixed Context (Wfc)mentioning
confidence: 99%
“…Reading Speed Normalization To reduce bias towards fast readers, the feature representations used for Eyescore are normalized to be invariant to the reading speed of the participant. Specifically, for the S-Clusters and WFC feature sets we follow the normalization procedure of Berzak et al (2017), where for a given participant, the reading time of a word w i according to a fixation metric M is normalized by S M,C , the metric's fixation time per word in the linguistic context C:…”
Section: English Proficiency Scoring Based On Eye Movements In Readingmentioning
confidence: 99%
See 1 more Smart Citation