2020
DOI: 10.1186/s13640-020-00539-x
|View full text |Cite
|
Sign up to set email alerts
|

Named entity recognition for Chinese judgment documents based on BiLSTM and CRF

Abstract: Chinese named entity recognition (CNER) in the judicial domain is an important and fundamental task in the analysis of judgment documents. However, only a few researches have been devoted to this task so far. For Chinese named entity recognition in judgment documents, we propose the use a bidirectional long-short-term memory (BiLSTM) model, which uses character vectors and sentence vectors trained by distributed memory model of paragraph vectors (PV-DM). The output of BiLSTM is used by conditional random field… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
9
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 13 publications
(9 citation statements)
references
References 23 publications
0
9
0
Order By: Relevance
“…BiLSTM is a bidirectional LSTM model, that is, a neural network that combines forward LSTM and backward LSTM. Through two-way propagation, BiLSTM can obtain the coding information from back to front and capture the context relationship through two-way coding 48 . The structure of the BiLSTM model is shown in Fig.…”
Section: Methodsmentioning
confidence: 99%
“…BiLSTM is a bidirectional LSTM model, that is, a neural network that combines forward LSTM and backward LSTM. Through two-way propagation, BiLSTM can obtain the coding information from back to front and capture the context relationship through two-way coding 48 . The structure of the BiLSTM model is shown in Fig.…”
Section: Methodsmentioning
confidence: 99%
“…Although IE systems have a long history in the scientific literature, there are few studies that analyze their use in commercial NLP pipelines. There are legal NER datasets similar to the ones used in this work [3,2,19,17], but they rarely reflect the complexities of production pipelines, such as processing scanned documents with OCR errors and extracting nested and discontiguous entities.…”
Section: Related Workmentioning
confidence: 99%
“…As such, one multi-task learning architecture has been designed jointly with two aims, recognizing the entities and their types simultaneously. e work in [13] proposed the combination of utilizing character and sentence vectors trained by distributed memory model of paragraph vectors (PV-DM). Additionally, the BiLSTM model was implemented with an additional conditional random field (CRF) layer, while this CRF layer is used to tag the input sentence.…”
Section: Entity Identificationmentioning
confidence: 99%