2013
DOI: 10.1017/s1351324913000144
|View full text |Cite
|
Sign up to set email alerts
|

Similarity computation using semantic networks created from web-harvested data

Abstract: We investigate language-agnostic algorithms for the construction of unsupervised distributional semantic models using web-harvested corpora. Specifically, a corpus is created from web document snippets, and the relevant semantic similarity statistics are encoded in a semantic network. We propose the notion of semantic neighborhoods that are defined using co-occurrence or context similarity features. Three neighborhood-based similarity metrics are proposed, motivated by the hypotheses of attributional and maxim… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
27
0

Year Published

2014
2014
2024
2024

Publication Types

Select...
4
4
1

Relationship

1
8

Authors

Journals

citations
Cited by 27 publications
(28 citation statements)
references
References 44 publications
1
27
0
Order By: Relevance
“…Models built on this assumption are called Distributional Similarity Models (DSMs) and take into account the co-occurrence distributions of the words in order to cluster them together. Several implementations of DSMs have been proposed in the last decades [3,5,8,10,15] and have being used in tasks such as query expansion [1], building bilingual comparable corpora [16], clustering [2], discovering of meaning of noun compounds [14] etc.…”
Section: Introductionmentioning
confidence: 99%
“…Models built on this assumption are called Distributional Similarity Models (DSMs) and take into account the co-occurrence distributions of the words in order to cluster them together. Several implementations of DSMs have been proposed in the last decades [3,5,8,10,15] and have being used in tasks such as query expansion [1], building bilingual comparable corpora [16], clustering [2], discovering of meaning of noun compounds [14] etc.…”
Section: Introductionmentioning
confidence: 99%
“…Semantic similarity is a measurement method that defines each document or term which has a distance based on semantic meaning [14,15]. There are two types of similarity calculation, which are based on the existing resources, such as thesaurus and based on the spread of words in a corpus [22].…”
Section: Semantic Similaritymentioning
confidence: 99%
“…The semantic-affective mapping was expanded in [31] for estimating scores for other dimensions, such as word familiarity and age acquisition. An example of the exploitation of affective spaces for semantic tasks can be found in [32] dealing with the detection of semantic opposition.…”
Section: Related Workmentioning
confidence: 99%