Proceedings of the First Workshop on Evaluation and Comparison of NLP Systems 2020
DOI: 10.18653/v1/2020.eval4nlp-1.12
|View full text |Cite
|
Sign up to set email alerts
|

One of these words is not like the other: a reproduction of outlier identification using non-contextual word representations

Abstract: Word embeddings are an active topic in the NLP research community. State-of-the-art neural models achieve high performance on downstream tasks, albeit at the cost of computationally expensive training. Cost aware solutions require cheaper models that still achieve good performance. We present several reproduction studies of intrinsic evaluation tasks that evaluate non-contextual word representations in multiple languages. Furthermore, we present 50-8-8, a new data set for the outlier identification task, which… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
0
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(2 citation statements)
references
References 28 publications
0
0
0
Order By: Relevance
“…In each evaluation round, one outlier is added to the in-group items, and the algorithm is tasked with finding the outlier. Existing outlier detection datasets either did not explicitly target sense-ambiguous words (8-8-8 (Camacho-Collados and Navigli, 2016), WikiSem500 (Blair et al, 2016)) or explicitly removed ambiguous words altogether (25-8-8-sem (Brink Andersen et al, 2020)).…”
Section: Evaluation Via Outlier Detectionmentioning
confidence: 99%
See 1 more Smart Citation
“…In each evaluation round, one outlier is added to the in-group items, and the algorithm is tasked with finding the outlier. Existing outlier detection datasets either did not explicitly target sense-ambiguous words (8-8-8 (Camacho-Collados and Navigli, 2016), WikiSem500 (Blair et al, 2016)) or explicitly removed ambiguous words altogether (25-8-8-sem (Brink Andersen et al, 2020)).…”
Section: Evaluation Via Outlier Detectionmentioning
confidence: 99%
“…The starting point for our dataset is 25-8-8-Sem (Brink Andersen et al, 2020). This dataset contains 25 test groups, each with 8 in-group elements and 8 outliers, resulting in 200 unique test cases.…”
Section: Evaluation Via Outlier Detectionmentioning
confidence: 99%