2021
DOI: 10.1016/j.eswa.2021.115394
|View full text |Cite
|
Sign up to set email alerts
|

Bengali text document categorization based on very deep convolution neural network

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
12
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 35 publications
(12 citation statements)
references
References 76 publications
0
12
0
Order By: Relevance
“…e model of neural network CNN [12,13] or BiLSTM [9,10] combined with CRF is based on the recognition model after word segmentation, and then, word2vec is used for word vector representation. To improve the effect of entity recognition, some scholars [24,25] use semantic information to improve the word vector as a result.…”
Section: Character-based Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…e model of neural network CNN [12,13] or BiLSTM [9,10] combined with CRF is based on the recognition model after word segmentation, and then, word2vec is used for word vector representation. To improve the effect of entity recognition, some scholars [24,25] use semantic information to improve the word vector as a result.…”
Section: Character-based Methodsmentioning
confidence: 99%
“…It shows that the BiLSTM method is superior to the CRF baseline model in different granularity legal domain corpora. Yin et al [11] proposed a method of judicial named entity recognition combining CNN and [12,13] LSTM. First, word embedding and CNN are used to obtain character-level embedding representation, and then, BiLSTM is used as an encoder, and one-way LSTM and character-level CNN are used as a decoder.…”
Section: Introductionmentioning
confidence: 99%
“…Despite the limited amount of the data, Hossain et al [24] discovered that text-graph convolutional neural (GCN) accomplished better than GRU-LSTM, BiLSTM, Char-CNN, LSTM, and bidirectional encoder representations from transformers (BERT) in classifying online Bangla news. As so far, there is a scarcity of standard data set in the Bangla language, so a data set have been prepared by scraping the news articles from various electronic news sites such as 'https://www.prothomalo.com/'.…”
Section: Data Collectionmentioning
confidence: 99%
“…Sarcasm detection on different low resource languages like Indonesian [6] , Hindi [7] , Czech [8] , Japanese [9] are available. Nonetheless, despite being the world's seventh most spoken language, with 240 million native speakers [10] , research on sarcasm detection in Bengali is underdeveloped and underutilized. Identifying sarcasm from Bengali text is currently a difficult task for NLP researchers due to limited resources and a paucity of large-scale sarcasm data.…”
Section: Introductionmentioning
confidence: 99%