A non-negative tensor factorization model for selectional preference induction

Cruys, Tim Van de

doi:10.1017/s1351324910000148

Cited by 35 publications

(17 citation statements)

References 35 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…A quantitative evaluation on a pseudo-disambiguation task shows that our models achieve state of the art performance. The results for our two-way neural network are on a par with Erk et al's (2010) similaritybased approach, while our three-way neural network slightly outperforms the tensor-based factorization model (Van de Cruys, 2009) for multi-way selectional preference induction.…”

Section: Discussionsupporting

confidence: 56%

“…Our model computes selectional preference scores for the test set in a matter of seconds, whereas for Erk et al's model, we ended up sampling from the test set, as computing preference values for the complete test set proved prohibitively expensive. Table 4 compares the results of our neural network architecture for three-way selectional preference acquisition to the results of the tensor-based factorization method (Van de Cruys, 2009 The results indicate that the neural network approach slightly outperforms the tensor-based factorization method. Again the model difference is sta-tistically significant (paired t-test, p < 0.01).…”

Section: Two-way Modelmentioning

confidence: 99%

See 1 more Smart Citation

A Neural Network Approach to Selectional Preference Acquisition

Cruys¹

2014

Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP)

Self Cite

View full text Add to dashboard Cite

This paper investigates the use of neural networks for the acquisition of selectional preferences. Inspired by recent advances of neural network models for NLP applications, we propose a neural network model that learns to discriminate between felicitous and infelicitous arguments for a particular predicate. The model is entirely unsupervised -preferences are learned from unannotated corpus data. We propose two neural network architectures: one that handles standard two-way selectional preferences and one that is able to deal with multi-way selectional preferences. The model's performance is evaluated on a pseudo-disambiguation task, on which it is shown to achieve state of the art performance.

show abstract

Section: Discussionsupporting

confidence: 56%

Section: Two-way Modelmentioning

confidence: 99%

A Neural Network Approach to Selectional Preference Acquisition

Cruys¹

2014

Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP)

Self Cite

View full text Add to dashboard Cite

show abstract

“…For instance, ongoing experiments indicate that the same parameters apply when Lin's similarity is replaced by cosine. Finally, we would like to compare the proposed heuristics with more sophisticated filtering strategies like singular value decomposition (Landauer and Dumais, 1997) and non-negative matrix factorization (Van de Cruys, 2009). …”

Section: Discussionmentioning

confidence: 99%

“…We would like to thank the support of projects CAPES/COFECUB 707/11, PNPD 2484/2009, FAPERGS-INRIA 1706-2551/13-7, CNPq 312184/2012-3, 551964/2011-1, 482520/2012-4 and 312077/2012 …”

Section: Acknowledgmentsmentioning

confidence: 99%

Nothing like Good Old Frequency: Studying Context Filters for Distributional Thesauri

Padró¹,

Idiart²,

Villavicencio³

et al. 2014

Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP)

View full text Add to dashboard Cite

Much attention has been given to the impact of informativeness and similarity measures on distributional thesauri. We investigate the effects of context filters on thesaurus quality and propose the use of cooccurrence frequency as a simple and inexpensive criterion. For evaluation, we measure thesaurus agreement with WordNet and performance in answering TOEFL-like questions. Results illustrate the sensitivity of distributional thesauri to filters.

show abstract

“…Non-negative tensor factorization models have also been applied to other language processing applications, including subject-verb-object selectional preference induction [9] and learning semantic word similarity [10]. Without drawing the connection to low rank tensors, Lowd and Domingos [11] propose Naive Bayes models for estimating arbitrary probability distributions that can be seen as a generalization of (6).…”

Section: A Modelmentioning

confidence: 99%

Low Rank Language Models for Small Training Sets

Hutchinson

Ostendorf

Fazel

2011

IEEE Signal Process. Lett.

View full text Add to dashboard Cite

Abstract-Several language model smoothing techniques are available that are effective for a variety of tasks; however, training with small data sets is still difficult. This letter introduces the low rank language model, which uses a low rank tensor representation of joint probability distributions for parameter-tying and optimizes likelihood under a rank constraint. It obtains lower perplexity than standard smoothing techniques when the training set is small and also leads to perplexity reduction when used in domain adaptation via interpolation with a general, out-of-domain model.Index Terms-Language model, low rank tensor.

show abstract

A non-negative tensor factorization model for selectional preference induction

Cited by 35 publications

References 35 publications

A Neural Network Approach to Selectional Preference Acquisition

A Neural Network Approach to Selectional Preference Acquisition

Nothing like Good Old Frequency: Studying Context Filters for Distributional Thesauri

Low Rank Language Models for Small Training Sets

Contact Info

Product

Resources

About