2018
DOI: 10.11591/ijeecs.v12.i3.pp1239-1246
|View full text |Cite
|
Sign up to set email alerts
|

Developing Corpora using Wikipedia and Word2vec for Word Sense Disambiguation

Abstract: Word Sense Disambiguation (WSD) is one of the most difficult problems in the artificial intelligence field or well known as AI-hard or AI-complete. A lot of problems can be solved using word sense disambiguation approaches like sentiment analysis, machine translation, search engine relevance, coherence, anaphora resolution, and inference. In this paper, we do research to solve WSD problem with two small corpora. We propose the use of Word2vec and Wikipedia to develop the corpora. After developing the corpora, … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
6
1

Relationship

2
5

Authors

Journals

citations
Cited by 8 publications
(8 citation statements)
references
References 7 publications
0
8
0
Order By: Relevance
“…Paraphrase lexicon is also needed to detect the phrase in the document. There are several previous studies that applied word sense disambiguation and were proven to improve results and more understanding context [41,42].…”
Section: Resultsmentioning
confidence: 99%
“…Paraphrase lexicon is also needed to detect the phrase in the document. There are several previous studies that applied word sense disambiguation and were proven to improve results and more understanding context [41,42].…”
Section: Resultsmentioning
confidence: 99%
“…The above technique is also used for the similarity of a document [8]. This technique is also used to extract Twitter data features for crisis event classification cases [9].…”
Section: Of 13mentioning
confidence: 99%
“…We compared our opinion words polarity from HEOLS with the opinion word polarity from: 1) Opinion Lexicon; 2) the first sense of adjective word SentiWordNet [30] (positive if the SentiWordNet score > 0 and vice versa), we use SentiWordNet because it was used in previous research [31,32]; and 3) same as in point 2 but we add Word Sense Disambiguation (WSD) using Adapted Lesk [33] to improve the performance [34][35][36].…”
Section: Comparisonmentioning
confidence: 99%