Fusheng Wei scite author profile

Fusheng Wei

5Publications

39Citation Statements Received

38Citation Statements Given

How they've been cited

110

How they cite others

Affiliations

Fujikura (United States), EP Analytics (United States)

Publications

Order By: Most citations

Empirical Study of Deep Learning for Text Classification in Legal Document Review

Wei

Qin

et al. 2018

View full text Add to dashboard Cite

Predictive coding has been widely used in legal matters to find relevant or privileged documents in large sets of electronically stored information. It saves the time and cost significantly. Logistic Regression (LR) and Support Vector Machines (SVM) are two popular machine learning algorithms used in predictive coding. Recently, deep learning received a lot of attentions in many industries. This paper reports our preliminary studies in using deep learning in legal document review. Specifically, we conducted experiments to compare deep learning results with results obtained using a SVM algorithm on the four datasets of real legal matters. Our results showed that CNN performed better with larger volume of training dataset and should be a fit method in the text classification in legal industry.

show abstract

Empirical Comparisons of CNN with Other Learning Algorithms for Text Classification in Legal Document Review

Keeling

Chhatwal²,

Huber-Fliflet³

et al. 2019

View full text Add to dashboard Cite

Research has shown that Convolutional Neural Networks (CNN) can be effectively applied to text classification as part of a predictive coding protocol. That said, most research to date has been conducted on data sets with short documents that do not reflect the variety of documents in real world document reviews. Using data from four actual reviews with documents of varying lengths, we compared CNN with other popular machine learning algorithms for text classification, including Logistic Regression, Support Vector Machine, and Random Forest. For each data set, classification models were trained with different training sample sizes using different learning algorithms. These models were then evaluated using a large randomly sampled test set of documents, and the results were compared using precision and recall curves. Our study demonstrates that CNN performed well, but that there was no single algorithm that performed the best across the combination of data sets and training sample sizes. These results will help advance research into the legal profession's use of machine learning algorithms that maximize performance.

show abstract

Image Analytics for Legal Document Review : A Transfer Learning Approach

Huber-Fliflet¹,

Wei²,

Zhao³

et al. 2019

View full text Add to dashboard Cite

Though technology assisted review in electronic discovery has been focusing on text data, the need of advanced analytics to facilitate reviewing multimedia content is on the rise. In this paper, we present several applications of deep learning in computer vision to Technology Assisted Review of image data in legal industry. These applications include image classification, image clustering, and object detection. We use transfer learning techniques to leverage established pretrained models for feature extraction and fine tuning. These applications are first of their kind in the legal industry for image document review. We demonstrate effectiveness of these applications with solving real world business challenges.

show abstract

An Empirical Comparison of DistilBERT, Longformer and Logistic Regression for Predictive Coding

Wei¹,

Yang²,

Mao³

et al. 2022

View full text Add to dashboard Cite

Empirical Comparisons of CNN with Other Learning Algorithms for Text Classification in Legal Document Review

Keeling¹,

Chhatwal

Huber-Fliflet³

et al. 2019

Preprint

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Fusheng Wei

Empirical Study of Deep Learning for Text Classification in Legal Document Review

Empirical Comparisons of CNN with Other Learning Algorithms for Text Classification in Legal Document Review

Image Analytics for Legal Document Review : A Transfer Learning Approach

An Empirical Comparison of DistilBERT, Longformer and Logistic Regression for Predictive Coding

Empirical Comparisons of CNN with Other Learning Algorithms for Text Classification in Legal Document Review

Contact Info

Product

Resources

About