2016
DOI: 10.1613/jair.4917
|View full text |Cite
|
Sign up to set email alerts
|

Semi-supervised Learning with Induced Word Senses for State of the Art Word Sense Disambiguation

Abstract: Word Sense Disambiguation (WSD) aims to determine the meaning of a word in context, and successful approaches are known to benefit many applications in Natural Language Processing. Although supervised learning has been shown to provide superior WSD performance, current sense-annotated corpora do not contain a sufficient number of instances per word type to train supervised systems for all words. While unsupervised techniques have been proposed to overcome this data sparsity problem, such techniques have not … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
8
0

Year Published

2017
2017
2024
2024

Publication Types

Select...
6
3
1

Relationship

0
10

Authors

Journals

citations
Cited by 15 publications
(8 citation statements)
references
References 50 publications
0
8
0
Order By: Relevance
“…In this work we compare supervised systems and study the role of their underlying senseannotated training corpus. Since semi-supervised models have been shown to outperform fully supervised systems in some settings (Taghipour and Ng, 2015b;Başkaya and Jurgens, 2016;Iacobacci et al, 2016;Yuan et al, 2016), we evaluate and compare models using both manually-curated and automatically-constructed sense-annotated corpora for training.…”
Section: Supervised Wsdmentioning
confidence: 99%
“…In this work we compare supervised systems and study the role of their underlying senseannotated training corpus. Since semi-supervised models have been shown to outperform fully supervised systems in some settings (Taghipour and Ng, 2015b;Başkaya and Jurgens, 2016;Iacobacci et al, 2016;Yuan et al, 2016), we evaluate and compare models using both manually-curated and automatically-constructed sense-annotated corpora for training.…”
Section: Supervised Wsdmentioning
confidence: 99%
“…Supervised models have been shown to consistently outperform knowledge-based ones in all standard benchmarks (Raganato et al, 2017), at the expense, however, of harder training and limited flexibility. First of all, obtaining reliable sense-annotated corpora is highly expensive and especially difficult when non-expert annotators are involved (de Lacalle and Agirre, 2015), and as a consequence approaches based on unlabeled data and semisupervised learning are emerging (Taghipour and Ng, 2015b;Başkaya and Jurgens, 2016;Yuan et al, 2016;Pasini and Navigli, 2017).…”
Section: Introductionmentioning
confidence: 99%
“…Mahmoodv and Hourali [29] used the machine learning algorithm with minimal supervision to disambiguate word senses based on features of target word and collaborative learning method. Başkaya and Jurgens [30] gave a semisupervised WSD method that combines a small amount of annotated data with information from word sense induction. Navigli and Velardi [31] created structural specifications of possible senses for each word in context and selected the best hypothesis with G grammar.…”
Section: Related Workmentioning
confidence: 99%