Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - ACL '03 2003
DOI: 10.3115/1075096.1075105

Clustering polysemic subcategorization frame distributions semantically

Abstract: Previous research has demonstrated the utility of clustering in inducing semantic verb classes from undisambiguated corpus data. We describe a new approach which involves clustering subcategorization frame (SCF) distributions using the Information Bottleneck and nearest neighbour methods. In contrast to previous work, we particularly focus on clustering polysemic verbs. A novel evaluation scheme is proposed which accounts for the effect of polysemy on the clusters, offering us a good insight into the potential…

Cited by 44 publications (77 citation statements)
References 20 publications
“…In this paper we investigate the influence of syntax, which represents one of the possible feature sources. Syntactic subcategorization frames tend to be good predictors for the semantics of verbs in general: verbs that are similar in meaning also tend to have similar subcategorization frames and selectional preferences (Schulte im Walde, 2000; Merlo and Stevenson, 2001; Korhonen et al, 2003; Schulte im Walde, 2006a; Joanis et al, 2008). But, as we will show below, PV-BV pairs tend to have a special behavior with respect to their subcategorization, even if their meanings are closely related.…”
Section: Introduction (mentioning)
confidence: 99%
“…First, we make multiple data points for each verb to deal with verb polysemy (cf. polysemy-aware previous studies still represented a verb as one data point (Korhonen et al, 2003;Miyao and Tsujii, 2009)). To do that, we induce verb-specific semantic frames by clustering verb uses.…”
Section: Overview (mentioning)
confidence: 99%
“…The most closely related work to our polysemy-aware task of unsupervised verb class induction is the work of Korhonen et al (2003), who used distributions of subcategorization frames to cluster verbs. They adopted the Nearest Neighbor (NN) and Information Bottleneck (IB) methods for clustering.…”
Section: Related Work (mentioning)
confidence: 99%
“…As for the acquisition of verb classes, automatic methods such as those suggested by Korhonen et al (2003), Schulte im Walde (2006), or Joanis et al (2008) have the potential of reducing the prohibitive cost of manual methods. However, they require decisions about both the experiment setup (with regard to feature selection) and the choice of a manually constructed gold standard for evaluation.…”
Section: Introduction (mentioning)
confidence: 99%