2020
DOI: 10.1609/aaai.v34i05.6324
|View full text |Cite
|
Sign up to set email alerts
|

CSI: A Coarse Sense Inventory for 85% Word Sense Disambiguation

Abstract: Word Sense Disambiguation (WSD) is the task of associating a word in context with one of its meanings. While many works in the past have focused on raising the state of the art, none has even come close to achieving an F-score in the 80% ballpark when using WordNet as its sense inventory. We contend that one of the main reasons for this failure is the excessively fine granularity of this inventory, resulting in senses that are hard to differentiate between, even for an experienced human annotator. In this pape… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
17
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
3
2

Relationship

1
8

Authors

Journals

citations
Cited by 19 publications
(17 citation statements)
references
References 27 publications
0
17
0
Order By: Relevance
“…Indeed, while the sense clustering provided by CSI (Lacerra et al 2020) covers all PoS categories, it extends BabelDomains (Camacho-Collados and Navigli 2017), a domain clustering resource that covers mainly nouns. Although out of scope for this article, in the future it would be interesting to investigate verb-specific clustering methods (e.g., Peterson and Palmer 2018).…”
Section: Resultsmentioning
confidence: 99%
“…Indeed, while the sense clustering provided by CSI (Lacerra et al 2020) covers all PoS categories, it extends BabelDomains (Camacho-Collados and Navigli 2017), a domain clustering resource that covers mainly nouns. Although out of scope for this article, in the future it would be interesting to investigate verb-specific clustering methods (e.g., Peterson and Palmer 2018).…”
Section: Resultsmentioning
confidence: 99%
“…Nevertheless, they suffer from the knowledge acquisition bottleneck, which hampers the creation of large manually-curated corpora (Gale, Church, and Yarowsky 1992), and in turn hinders the ability of these approaches to scale over unseen words and new languages. To overcome the aforementioned shortcomings, coarser sense inventories (Lacerra et al 2020) and automatic data augmentation approaches (Pasini and Navigli 2017;Pasini, Elia, and Navigli 2018;Scarlini, Pasini, and Navigli 2019) have been developed to cover more words, senses and languages. At the same time, dedicated architectures have been built to exploit the definitional information of a knowledge base (Luo et al 2018;Kumar et al 2019).…”
Section: Related Workmentioning
confidence: 99%
“…WSD relies on annotated sense labels, which in turn requires determining whether any given pair of word uses belong to the same or distinct senses-i.e., whether to "lump" or "split". There is considerable debate about how granular word sense inventories should be (Hanks, 2000;Brown, 2008a); 3 resources range in granularity from Word-Net (Fellbaum, 1998) to the Coarse Sense Inventory, or CSI (Lacerra et al, 2020). Recent work using coarse-grained sense inventories has achieved success rates of 85% and beyond (Lacerra et al, 2020;Loureiro et al, 2020).…”
Section: Word Sense Disambiguationmentioning
confidence: 99%
“…There is considerable debate about how granular word sense inventories should be (Hanks, 2000;Brown, 2008a); 3 resources range in granularity from Word-Net (Fellbaum, 1998) to the Coarse Sense Inventory, or CSI (Lacerra et al, 2020). Recent work using coarse-grained sense inventories has achieved success rates of 85% and beyond (Lacerra et al, 2020;Loureiro et al, 2020).…”
Section: Word Sense Disambiguationmentioning
confidence: 99%