2015
DOI: 10.1109/tetc.2015.2418716
|View full text |Cite
|
Sign up to set email alerts
|

Wikipedia-Based Semantic Similarity Measurements for Noisy Short Texts Using Extended Naive Bayes

Abstract: This paper proposes a Wikipedia-based semantic similarity measurement method that is intended for real-world noisy short texts. Our method is a kind of Explicit Semantic Analysis (ESA) which adds a bag of Wikipedia entities (Wikipedia pages) to a text as its semantic representation and uses the vector of entities for computing the semantic similarity. Adding related entities to a text, not a single word or phrase, is a challenging practical problem because it usually consists of several subproblems, e.g. key t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
20
0

Year Published

2016
2016
2023
2023

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 44 publications
(20 citation statements)
references
References 26 publications
0
20
0
Order By: Relevance
“…The process of semantic classification implements the method of CTS using RTHM. The CTS method derives the mechanism of "Naïve Bayesian Classifier (NBC)" [9] to recognize the probable semantic relation. The CTS method initially as assumes that the extracted features terms are independent from a class due to it variance in values, which is defined as condition independency in NBC to make simplification in individual relation computation.…”
Section: Semantic Classification Using Rthmmentioning
confidence: 99%
See 2 more Smart Citations
“…The process of semantic classification implements the method of CTS using RTHM. The CTS method derives the mechanism of "Naïve Bayesian Classifier (NBC)" [9] to recognize the probable semantic relation. The CTS method initially as assumes that the extracted features terms are independent from a class due to it variance in values, which is defined as condition independency in NBC to make simplification in individual relation computation.…”
Section: Semantic Classification Using Rthmmentioning
confidence: 99%
“…It is logical to involve them in the classification process to support semantic applications by providing a domain-oriented visualization [6], [7]. But, with the growth of the semantic web and the existence of many tools to create and manage ontology, the ontology of different domains of knowledge is already available [8], [9]. Their participation in the discovery of knowledge based on automated learning has increased and promising results have been obtained.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…They converted the task of linking tweets with concepts in Wikipedia into a ranking problem of concepts that are related to the post [4]. Wikipedia was used to solve entity disambiguation for noisy short texts by Shirakawa as well [12].…”
Section: Introductionmentioning
confidence: 99%
“…The ambiguities related to search query can be removed by prompting user to select from the different contexts of the queries. For example an apple may resemble to a fruit family or a Company or some other ambiguity [3].…”
Section: Introductionmentioning
confidence: 99%