2019
DOI: 10.1109/access.2019.2901933
|View full text |Cite
|
Sign up to set email alerts
|

Supervised Paragraph Vector: Distributed Representations of Words, Documents and Class Labels

Abstract: While the traditional method of deriving representations for documents was bag-of-words, they suffered from high dimensionality and sparsity. Recently, many methods to obtain lower dimensional and densely distributed representations were proposed. Paragraph Vector is one of such algorithms, which extends the word2vec algorithm by considering the paragraph as an additional word. However, it generates a single representation for all tasks, while different tasks may require different representations. In this pape… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 15 publications
(3 citation statements)
references
References 26 publications
0
3
0
Order By: Relevance
“…B) Feature extraction: NLP model cannot work on the raw text data directly. So, feature extraction methods are required to convert text into a numerical representation of features or a matrix (or vector) of features [82], [98], [102]. It maps words into numerical vector space, which is considered a richer representation of text input in NLP.…”
Section: ) Named Entity Recognition (Ner)mentioning
confidence: 99%
“…B) Feature extraction: NLP model cannot work on the raw text data directly. So, feature extraction methods are required to convert text into a numerical representation of features or a matrix (or vector) of features [82], [98], [102]. It maps words into numerical vector space, which is considered a richer representation of text input in NLP.…”
Section: ) Named Entity Recognition (Ner)mentioning
confidence: 99%
“…To overcome these flaws, future research and work is necessary for this method. Park et al [26] proposed an algorithm that results the top search results for IR queries. For their approach Boolean interface is used without ranking functions.…”
Section: Related Workmentioning
confidence: 99%
“…The doc2vec model was proposed by Le et al (Le and Mikolov, 2014) (paragraph vectors) as a document-embedding method in 2014. This model is applicable in text classification and document similarity calculation (Park et al , 2019). Text classification is known as the task of classifying a given text into a set of predefined classes (Dalal and Zaveri, 2011).…”
Section: Introductionmentioning
confidence: 99%