2010
DOI: 10.1613/jair.2880
|View full text |Cite
|
Sign up to set email alerts
|

Text Relatedness Based on a Word Thesaurus

Abstract: The computation of relatedness between two fragments of text in an automated manner requires taking into account a wide range of factors pertaining to the meaning the two fragments convey, and the pairwise relations between their words. Without doubt, a measure of relatedness between text segments must take into account both the lexical and the semantic relatedness between words. Such a measure that captures well both aspects of text relatedness may help in many tasks, such as text retrieval, classification an… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
99
0
3

Year Published

2010
2010
2017
2017

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 111 publications
(102 citation statements)
references
References 72 publications
0
99
0
3
Order By: Relevance
“…1(a) and 1(b) respectively). The latter is the core of the proposed method and utilizes WordNet, and Wikipedia as knowledge bases, as well as two respective measures of semantic relatedness: a dictionarybased measure, namely Omiotis [4], which is based on WordNet, and a Wikipedia-based [3]. Both measures have been shown to achieve state-of-the-art performance in measuring word-to-word semantic relatedness [4].…”
Section: Automated Annotation Of Text With Domain Ontology Conceptsmentioning
confidence: 99%
See 1 more Smart Citation
“…1(a) and 1(b) respectively). The latter is the core of the proposed method and utilizes WordNet, and Wikipedia as knowledge bases, as well as two respective measures of semantic relatedness: a dictionarybased measure, namely Omiotis [4], which is based on WordNet, and a Wikipedia-based [3]. Both measures have been shown to achieve state-of-the-art performance in measuring word-to-word semantic relatedness [4].…”
Section: Automated Annotation Of Text With Domain Ontology Conceptsmentioning
confidence: 99%
“…A Web application that implements KDTA is publicly available online 4 . Firstly, the user may upload an ontology in OWL format or may select an already existing ontology by leaving the corresponding browsing path empty.…”
Section: System Demonstrationmentioning
confidence: 99%
“…The setting of the problem is that users always ask for those most semantically related to their queries from a huge text collection. A common solution is applying the state-of-the-art short texts similarity measurement techniques (Islam and Inkpen, 2008;Li et al, 2006;Mihalcea et al, 2006;Sahami and Heilman, 2006;Tsatsaronis et al, 2010;Mohler et al, 2011;, and then return the top-k ones * Corresponding author.…”
Section: Introductionmentioning
confidence: 99%
“…Moreover, we focus on the top-k issue because users commonly do not care about the individual similarity score but only the sorted results. Furthermore, most of the previous studies (Islam and Inkpen, 2008;Li et al, 2006;Tsatsaronis et al, 2010; need to set predefined threshold to filter out those dissimilar texts which is rather difficult to determine by users.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation