Multi-task Legal Judgement Prediction Combining a Subtask of the Seriousness of Charges

Xu, Zhuopeng; Li, Xia; Li, Yinlin; Wang, Zihan; Fanxu, Yujie; Lai, Xiaoyan

doi:10.1007/978-3-030-63031-7_30

Cited by 13 publications

(5 citation statements)

References 14 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Work [ 14 ] builds the first Legal Judgment Prediction (LJP) model for UK court cases by creating a labeled dataset of UK court decisions and subsequently applying the machine learning model with high performance and experimentally demonstrating the high performance capabilities of the proposed LJP model. Work [ 15 ] presents a multitask Legal Judgment Prediction model that combines the subtask of allegation severity with the defendant's position, enabling it to focus on contextual information about the defendant. Experiments show that the model achieves better performance on the public CAIL2018 dataset.…”

Section: Related Workmentioning

confidence: 99%

Study of Deep Learning-Based Legal Judgment Prediction in Internet of Things Era

Zheng

Liu

Sun

2022

Computational Intelligence and Neuroscience

View full text Add to dashboard Cite

Legal judgment prediction is the most typical application of artificial intelligence technology, especially natural language processing methods, in the judicial field. In a practical environment, the performance of algorithms is often restricted by the computing resource conditions due to the uneven computing performance of the devices. Reducing the computational resource consumption of the model and improving the inference speed can effectively reduce the deployment difficulty of the legal judgment prediction model. To improve the prediction accuracy, enhance the model inference speed, and reduce the model memory consumption, we propose a BERT knowledge distillation-based legal decision prediction model, called KD-BERT. To reduce the resource consumption in the model inference process, we use the BERT pretraining model with lower memory requirements to be the encoder. Then, the knowledge distillation strategy transfers the knowledge to the student model of the shallow transformer structure. Experiment results show that the proposed KD-BERT has the highest F1-score compared with traditional BERT models. Its inference speed is also much faster than the other BERT models.

show abstract

Section: Related Workmentioning

confidence: 99%

Study of Deep Learning-Based Legal Judgment Prediction in Internet of Things Era

Zheng

Liu

Sun

2022

Computational Intelligence and Neuroscience

View full text Add to dashboard Cite

show abstract

“…Li et al [21] proposed a multi-channel attention neural network to jointly predict charges, relevant law articles, and the term of penalty. Xu et al [3] proposed LADAN, which also addressed the above three tasks jointly by multi-task learning. LADAN can effectively distinguish subtle differences between confusing law articles with a community-based graph neural network.…”

Section: Related Workmentioning

confidence: 99%

“…LJP has been studied for decades, and most existing works regard this task as a text classification problem. Researchers have proposed various methods based on machine learning and deep learning models and made significant progress in LJP tasks like like predicting charges [1], relevant law articles [2], and the term of penalty [3]. Some of the existing works have already achieved an accuracy of over 90% on predicting charges and relevant law articles, which is very close to human judges.…”

Section: Introductionmentioning

confidence: 99%

Sequential Multi-task Learning with Task Dependency for Appeal Judgment Prediction

Song¹,

Han²,

Liu³

et al. 2022

Preprint

View full text Add to dashboard Cite

Legal Judgment Prediction (LJP) aims to automatically predict judgment results, such as charges, relevant law articles, and the term of penalty. It plays a vital role in legal assistant systems and has become a popular research topic in recent years. This paper concerns a worthwhile but not well-studied LJP task, Appeal judgment Prediction (AJP), which predicts the judgment of an appellate court on an appeal case based on the textual description of case facts and grounds of appeal. There are two significant challenges in practice to solve the AJP task. One is how to model the appeal judgment procedure appropriately. The other is how to improve the interpretability of the prediction results. We propose a Sequential Multi-task Learning Framework with Task Dependency for Appeal Judgement Prediction (SMAJudge) to address these challenges. SMAJudge utilizes two sequential components to model the complete proceeding from the lower court to the appellate court and employs an attention mechanism to make the prediction more explainable, which handles the challenges of AJP effectively. Experimental results obtained with a dataset consisting of more than 30K appeal judgment documents have revealed the effectiveness and superiority of SMAJudge.

show abstract

“…Li et al [18] designed a multichannel neural network model framework with attention mechanism to complete entire LJP tasks. Xu et al [19] proposed a new unified LJP model which capture the attention weights of different terms of penalty and the position of defendant. Yang et.al [20] presented a multi-perspective bi-feedback network with the word collocation attention mechanism for LJP task.…”

Section: Related Workmentioning

confidence: 99%

ADAN: An Intelligent Approach Based on Attentive Neural Network and Relevant Law Articles for Charge Prediction

Zhao

Chen

et al. 2021

IEEE Access

View full text Add to dashboard Cite

The charge prediction task aims to predict appropriate charges for a given legal case automatically, which still confronts some challenging problems such as performance improvement and confusing charges issue. In this paper, inspired by the impressive success of deep neural networks in legal intelligence field, we present an end-to-end framework named law article deduplication attention neural network, ADAN, to address these problems. The incorporation of hierarchical sequence encoder and attention mechanism is employed to learn better semantic representations of fact description texts. To distinguish confusing charges, we use the relevant law articles of a given case as auxiliary information, and propose a novel difference aggregation mechanism among similar law articles for extracting effective distinguishable features. The experimental results on real-world datasets show that the performance of our proposed model is significantly better than existing methods on all evaluation metrics. INDEX TERMS charge prediction; hierarchical attention mechanism; bidirectional gated recurrent unit; text classification.

show abstract

Multi-task Legal Judgement Prediction Combining a Subtask of the Seriousness of Charges

Cited by 13 publications

References 14 publications

Study of Deep Learning-Based Legal Judgment Prediction in Internet of Things Era

Study of Deep Learning-Based Legal Judgment Prediction in Internet of Things Era

Sequential Multi-task Learning with Task Dependency for Appeal Judgment Prediction

ADAN: An Intelligent Approach Based on Attentive Neural Network and Relevant Law Articles for Charge Prediction

Contact Info

Product

Resources

About