2019 IEEE International Conference on Big Data (Big Data) 2019
DOI: 10.1109/bigdata47090.2019.9006248
|View full text |Cite
|
Sign up to set email alerts
|

Empirical Comparisons of CNN with Other Learning Algorithms for Text Classification in Legal Document Review

Abstract: Research has shown that Convolutional Neural Networks (CNN) can be effectively applied to text classification as part of a predictive coding protocol. That said, most research to date has been conducted on data sets with short documents that do not reflect the variety of documents in real world document reviews. Using data from four actual reviews with documents of varying lengths, we compared CNN with other popular machine learning algorithms for text classification, including Logistic Regression, Support Vec… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
11
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
4
1

Relationship

1
9

Authors

Journals

citations
Cited by 23 publications
(11 citation statements)
references
References 13 publications
0
11
0
Order By: Relevance
“…TextCNN produces good results in short text classification. Other experiments [48] show that when the text length exceeds 2000, the performance of TextCNN is not significantly improved compared with the traditional methods. TextRNN [49] is a well-known text classification model; however, it is not effective at handling long documents because a long text leads to a vanishing gradient.…”
Section: Deep Learning-based Methodsmentioning
confidence: 97%
“…TextCNN produces good results in short text classification. Other experiments [48] show that when the text length exceeds 2000, the performance of TextCNN is not significantly improved compared with the traditional methods. TextRNN [49] is a well-known text classification model; however, it is not effective at handling long documents because a long text leads to a vanishing gradient.…”
Section: Deep Learning-based Methodsmentioning
confidence: 97%
“…However, it has recently been applied to natural language processing problems and has given very good results. In addition to the two layers input and output, CNN network architecture is composed of several types of layers including [13,14]. Recurrent Neural Network (RNN) is an algorithm that has received a lot of attention recently because of its good results obtained in the field of natural language processing [15]: − using the software reuse method: we reuse some available software in Vietnamese document processing such as data cleaning, word separation, vectorization in the process of building a tool to detect fake speech freeforms.…”
Section: Literature Review and Problem Statementmentioning
confidence: 99%
“…With the rapid development of artificial intelligence technology, machine learning, especially deep learning, have become a new trend in the development of intelligent judicial technology in the judicial field. Robert et al use convolutional neural networks (CNNs) to extract semantic features from case text [13], thus supporting the deep mining of case data. Although CNNs can perform well in many tasks, CNN models cannot model case summary information accurately, thus failing to achieve the case grouping encoding required in this paper.…”
Section: Related Workmentioning
confidence: 99%