2020
DOI: 10.3390/app10114009
|View full text |Cite
|
Sign up to set email alerts
|

A Rule-Based Approach to Embedding Techniques for Text Document Classification

Abstract: With the growth of online information and sudden expansion in the number of electronic documents provided on websites and in electronic libraries, there is difficulty in categorizing text documents. Therefore, a rule-based approach is a solution to this problem; the purpose of this study is to classify documents by using a rule-based. This paper deals with the rule-based approach with the embedding technique for a document to vector (doc2vec) files. An experiment was performed on two data sets Reuters-21578 an… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
10
0
1

Year Published

2021
2021
2024
2024

Publication Types

Select...
7
3

Relationship

1
9

Authors

Journals

citations
Cited by 34 publications
(12 citation statements)
references
References 31 publications
0
10
0
1
Order By: Relevance
“…6 and Fig. 15 defines a detailed comparative results analysis of the CIDD-ADODNN model on the test Chess dataset [26]. The resultant values addressed that ZeroR and SVM models have depicted poor performance by accomplishing minimal accuracy values of 0.390 and From the detailed experimental analysis, it is evident that the CIDD-ADODNN model has accomplished an effective outcome on all the applied dataset.…”
Section: Performance Validationmentioning
confidence: 90%
“…6 and Fig. 15 defines a detailed comparative results analysis of the CIDD-ADODNN model on the test Chess dataset [26]. The resultant values addressed that ZeroR and SVM models have depicted poor performance by accomplishing minimal accuracy values of 0.390 and From the detailed experimental analysis, it is evident that the CIDD-ADODNN model has accomplished an effective outcome on all the applied dataset.…”
Section: Performance Validationmentioning
confidence: 90%
“…Word embeddings are unsupervised learning applications that also talk about transfer learning as it is incorporated in the given user corpus. Embeddings can be character level or word level [14]. The word level embeddings use word2vec method where the basic construct of embeddings is converting words into vectors and then mathematically apply relations on them based on the corpus being used.…”
Section: A Word Embeddingsmentioning
confidence: 99%
“…In [30], Ronran et al evaluated the combination of different types of embedding features in a bidirectional long short-term memory (Bi-LSTM) conditional random field (CRF) model for named entity recognition (NER). The authors in [31] [32] proposed a rule-based approach for text document classification. The study in [33] proposed an RE model based on a dual pointer network with a multi-head attention mechanism to address the association of multiple entities in a sentence according to various relations.…”
Section: Document Analysis Information Extraction and Text Miningmentioning
confidence: 99%