Proceedings of the 2018 Conference of the North American Chapter Of the Association for Computational Linguistics: De 2018
DOI: 10.18653/v1/n18-5012
|View full text |Cite
|
Sign up to set email alerts
|

VnCoreNLP: A Vietnamese Natural Language Processing Toolkit

Abstract: We present an easy-to-use and fast toolkit, namely VnCoreNLP-a Java NLP annotation pipeline for Vietnamese. Our VnCoreNLP supports key natural language processing (NLP) tasks including word segmentation, part-of-speech (POS) tagging, named entity recognition (NER) and dependency parsing, and obtains state-of-the-art (SOTA) results for these tasks. We release VnCoreNLP to provide rich linguistic annotations to facilitate research work on Vietnamese NLP. Our VnCoreNLP is open-source and available at: https:// gi… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
54
0
1

Year Published

2019
2019
2022
2022

Publication Types

Select...
5
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 113 publications
(55 citation statements)
references
References 16 publications
0
54
0
1
Order By: Relevance
“…We then make use of the information of word, those generated POS, and chunking tags to train VNER model. Moreover, we also evaluate VNER model on VLSP-2016 dataset with another setup as in the experiment of VnCoreNLP ( [11]) in which the contiguous syllable constituting a PER tag is merged to form a word. In comparison to other neural network for Vietnamese NER task, we compare the performance of VNER model with two neural models: NNVLP model ( [8]) that makes use of the combination of bidirectional LSTM, CNN, and CRF models; and vie-ner-lstm model ( [14]) that incorporates automatic syntactic features with word embeddings as input for bidirectional LSTM network.…”
Section: Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…We then make use of the information of word, those generated POS, and chunking tags to train VNER model. Moreover, we also evaluate VNER model on VLSP-2016 dataset with another setup as in the experiment of VnCoreNLP ( [11]) in which the contiguous syllable constituting a PER tag is merged to form a word. In comparison to other neural network for Vietnamese NER task, we compare the performance of VNER model with two neural models: NNVLP model ( [8]) that makes use of the combination of bidirectional LSTM, CNN, and CRF models; and vie-ner-lstm model ( [14]) that incorporates automatic syntactic features with word embeddings as input for bidirectional LSTM network.…”
Section: Resultsmentioning
confidence: 99%
“…There have been considerable work proposed by Vietnamese researchers in solving the NER problem such as dynamic feature induction model ( [11]), CRF model ( [2]), or LSTM ( [8], [9]). CRF-based model achieves state-of-the-art results on the VLSP 2016 and VLSP 2018 competitions; however, it still suffers from the linear statistical model drawbacks as mentioned above.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Finally, we create a relationship for the triple (e i , v i , e i+1 ). Besides that, we also experienced another library named VnCoreNLP [23]. VnCoreNLP is also an open source that can be downloaded from [23].…”
Section: ��mentioning
confidence: 99%
“…Besides that, we also experienced another library named VnCoreNLP [23]. VnCoreNLP is also an open source that can be downloaded from [23]. The difference with underthesea is that VnCoreNLP has dependency parsing feature that can analyze relationships between parts of speech in a Vietnamese sentence.…”
Section: ��mentioning
confidence: 99%