2021
DOI: 10.1016/j.csl.2020.101182
|View full text |Cite
|
Sign up to set email alerts
|

Attention-based BiLSTM fused CNN with gating mechanism model for Chinese long text classification

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
64
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 113 publications
(64 citation statements)
references
References 7 publications
0
64
0
Order By: Relevance
“…is redundant detail may contain some sort of knowledge too. us, the classification of long text requires an efficient model [6].…”
Section: Introductionmentioning
confidence: 99%
“…is redundant detail may contain some sort of knowledge too. us, the classification of long text requires an efficient model [6].…”
Section: Introductionmentioning
confidence: 99%
“…The probability of the switch can be worked out based on the encoder/decoder context vector and the current hiddenlayer state, as in [29]: denotes the bias term. The weight matrix and bias term in (7) are parameters that can be updated during the model training iteration. The linear weighted sum of these parameters is nonlinearly activated by the sigmoid function and mapped between 0 and 1 as a soft switch to control the source of the input layer, based on the information of the two partsthe source text and the vocabulary.…”
Section: B 'Da-pn' Modelmentioning
confidence: 99%
“…With a RNN, each word is entered into the model according to its order of appearance in the original text, which is good for recording the sequence information of the text but limits the speed of network training and text summary generation. If both the encoder and decoder adopt a Convolutional Neural Network (CNN) [7,8], this will allow the model to use parallel computing in the phase of training, which will not only maintain accuracy but will also improve efficiency. At the same time, if multi-step attention is used [9], every layer of the semantic vector, generated in the decoder, would look forward, which could further improve accuracy.…”
Section: Introductionmentioning
confidence: 99%
“…As a well-known deep learning method, CNN (Convolutional Neural Network) has been widely utilized in various pattern recognition areas such as medical image analysis [33], crop mapping [34], text classification [35], fraud identification [36], voice recognition [37], and so on. CNN has already been used for the sensor faults diagnosis of UAV [38], rotating machinery [19] and the multi-operation forging process [39].…”
Section: Diagnosis Model Based On Modified Cnnmentioning
confidence: 99%