Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume 2021
DOI: 10.18653/v1/2021.eacl-main.297

Unsupervised Word Polysemy Quantification with Multiresolution Grids of Contextual Embeddings

Abstract: The number of senses of a given word, or polysemy, is a very subjective notion, which varies widely across annotators and resources. We propose a novel method to estimate polysemy based on simple geometry in the contextual embedding space. Our approach is fully unsupervised and purely data-driven. Through rigorous experiments, we show that our rankings are well correlated, with strong statistical significance, with 6 different rankings derived from famous human-constructed resources such as WordNet, OntoNotes,…

Cited by 3 publications (3 citation statements)
References: 28 publications
“…Goodwin et al, 2020; Pimentel et al, 2020b,a; Aghazadeh et al, 2022; Tucker et al, 2022; Arps et al, 2022). In contrast, descriptive probing looks into the intrinsic structure of representations (Zhou and Srikumar, 2021; Xypolopoulos et al, 2021; Chang et al, 2022), and focuses on discovering properties of the representation using cluster analysis (Aharoni and Goldberg, 2020) or visualization techniques (Vig, 2019).…”
Section: Two Characteristics Of Representations (mentioning)
confidence: 99%
“…Conneau et al, 2018; Kassner and Schütze, 2020; Goodwin et al, 2020; Pimentel et al, 2020b,a; Aghazadeh et al, 2022; Tucker et al, 2022; Gonen et al, 2022; Arps et al, 2022). In contrast, descriptive probing looks into the intrinsic structure of representations (Ethayarajh, 2019; Zhou and Srikumar, 2021; Xypolopoulos et al, 2021; Chang et al, 2022), and focuses on discovering properties of the representation using cluster analysis (Aharoni and Goldberg, 2020) or visualization techniques (Reimers et al, 2019; Vig, 2019).…”
Section: Two Characteristics Of Representations (mentioning)
confidence: 99%
“…Understanding the geometry of the BERT-space is not easy. Some attempts in this direction have been made (Coenen et al, 2019; Ethayarajh, 2019; Michael et al, 2020; Mickus et al, 2020; Xypolopoulos et al, 2021; Garí Soler and Apidianaki, 2020), but a more thorough investigation is lacking. As opposed to predictive methods such as probing, descriptive methods that rely on geometric features of the space analyze the information in CRs directly.…”
Section: Analyzing Contextual Representations (mentioning)
confidence: 99%