Proceedings of the 15th International Conference on World Wide Web 2006
DOI: 10.1145/1135777.1135834
|View full text |Cite
|
Sign up to set email alerts
|

A web-based kernel function for measuring the similarity of short text snippets

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
395
0
2

Year Published

2010
2010
2018
2018

Publication Types

Select...
7
1
1

Relationship

0
9

Authors

Journals

citations
Cited by 591 publications
(397 citation statements)
references
References 14 publications
0
395
0
2
Order By: Relevance
“…Many works have focused on this aspect by proposing enriched text representations and proximity metrics that attempt to get more realistic semantic comparisons. These approaches have included the use of additional information obtained from the Web [39,52], external resources like Wikipedia [2] and Wordnet [21], combinations of internal and external semantics [23] and learning term-weighting functions for similarity measures [51]. Although these proposals are still far from getting the semantic level previously explained in the cognitive science works, they present interesting research lines for future work.…”
Section: Related Workmentioning
confidence: 99%
“…Many works have focused on this aspect by proposing enriched text representations and proximity metrics that attempt to get more realistic semantic comparisons. These approaches have included the use of additional information obtained from the Web [39,52], external resources like Wikipedia [2] and Wordnet [21], combinations of internal and external semantics [23] and learning term-weighting functions for similarity measures [51]. Although these proposals are still far from getting the semantic level previously explained in the cognitive science works, they present interesting research lines for future work.…”
Section: Related Workmentioning
confidence: 99%
“…One general strategy for solving this problem is to expand text representation by exploiting related text documents, which is related to smoothing of a document language model in information retrieval [105]. A specific technique, which leverages a search engine to expand text representation, was proposed in [79]. A comparison of several simple measures for computing similarity of short text segments can be found in [66].…”
Section: Distance-based Clustering Algorithmsmentioning
confidence: 99%
“…Sahami and Heilman employed a similarity kernel function to estimate the short text similarity by making use of the search engine to extend features of short text [15]. Wang et al [16] and Yuan [17] proposed a mining algorithm based on association rule to extract association relationship between features included in training and testing sets, which further obtains the extended features corresponding to the words.…”
Section: Related Workmentioning
confidence: 99%
“…The indexed terms in are unique terms of prefixes in , where each indexed term refers to a list composed of all videos whose prefixes contain the corresponding indexed term. In the following stage, may be possible similar to ∈ by the inverted index (line [8][9][10][11][12][13][14][15][16][17][18][19]. For each item ∈ , we firstly figure out the prefix Pre( ).…”
Section: Prefix Filtering Based On Derived Jaccardmentioning
confidence: 99%