Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers) 2016
DOI: 10.18653/v1/p16-2081
|View full text |Cite
|
Sign up to set email alerts
|

Semantics-Driven Recognition of Collocations Using Word Embeddings

Abstract: L2 learners often produce "ungrammatical" word combinations such as, e.g., *give a suggestion or *make a walk. This is because of the "collocationality" of one of their items (the base) that limits the acceptance of collocates to express a specific meaning ('perform' above). We propose an algorithm that delivers, for a given base and the intended meaning of a collocate, the actual collocate lexeme(s) (make / take above). The algorithm exploits the linear mapping between bases and collocates from examples and g… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
12
0
1

Year Published

2017
2017
2024
2024

Publication Types

Select...
4
2
2

Relationship

3
5

Authors

Journals

citations
Cited by 14 publications
(13 citation statements)
references
References 17 publications
0
12
0
1
Order By: Relevance
“…During the annotation process, 447 combinations were marked as doubt, and out of these, 260 were finally considered collocations by the language experts. Even if we do not make explicit use of lexical functions in this paper, it is worth mentioning that the collocations in these final corpora are labeled using 60 LFs, which may be useful to evaluate extraction and classification strategies Rodríguez-Fernández et al, 2016;Kolesnikova and Gelbukh, 2018).…”
Section: Final Resources and Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…During the annotation process, 447 combinations were marked as doubt, and out of these, 260 were finally considered collocations by the language experts. Even if we do not make explicit use of lexical functions in this paper, it is worth mentioning that the collocations in these final corpora are labeled using 60 LFs, which may be useful to evaluate extraction and classification strategies Rodríguez-Fernández et al, 2016;Kolesnikova and Gelbukh, 2018).…”
Section: Final Resources and Resultsmentioning
confidence: 99%
“…A related task consists of automatically classifying the semantic properties of collocations by means of lexical functions or glosses. In this regard, some studies apply machine learning methods to train classifiers (Wanner et al, 2006Gelbukh and Kolesnikova, 2012), while others use distributional semantics to identify a collocate given a base and a lexical function (Rodríguez-Fernández et al, 2016).…”
Section: Related Workmentioning
confidence: 99%
“…In the future, we would like to experiment with more data, so that enough training data can be obtained for less frequent LFs. To this end, we could benefit from the supervised approach proposed in (Rodríguez-Fernández et al, 2016), and then filter by pairwise correlation strength metrics such as PMI. Another exciting avenue would involve exploring cross-lingual transfer of LFs, taking advantage of recent development in unsupervised cross-lingual embedding learning (Artetxe et al, 2017;Conneau et al, 2017).…”
Section: Discussionmentioning
confidence: 99%
“…While its main field of application is Information Retrieval, it has also been used in NLP tasks such as collocation recognition (Wu et al, 2010;Rodríguez-Fernández et al, 2016).…”
Section: Mean Reciprocal Rank (Mrr)mentioning
confidence: 99%