VnCoreNLP: A Vietnamese Natural Language Processing Toolkit

Vu, Thanh; Nguyen, Dat Quoc; Nguyen, Dai Quoc; Dras, Mark; Johnson, Mark

doi:10.18653/v1/n18-5012

Cited by 113 publications

(55 citation statements)

References 16 publications

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…We then make use of the information of word, those generated POS, and chunking tags to train VNER model. Moreover, we also evaluate VNER model on VLSP-2016 dataset with another setup as in the experiment of VnCoreNLP ( [11]) in which the contiguous syllable constituting a PER tag is merged to form a word. In comparison to other neural network for Vietnamese NER task, we compare the performance of VNER model with two neural models: NNVLP model ( [8]) that makes use of the combination of bidirectional LSTM, CNN, and CRF models; and vie-ner-lstm model ( [14]) that incorporates automatic syntactic features with word embeddings as input for bidirectional LSTM network.…”

Section: Resultsmentioning

confidence: 99%

“…There have been considerable work proposed by Vietnamese researchers in solving the NER problem such as dynamic feature induction model ( [11]), CRF model ( [2]), or LSTM ( [8], [9]). CRF-based model achieves state-of-the-art results on the VLSP 2016 and VLSP 2018 competitions; however, it still suffers from the linear statistical model drawbacks as mentioned above.…”

Section: Related Workmentioning

confidence: 99%

“…Due to VLSP-2016 dataset that does not have development set, hence we create a development set by sampling randomly 2000 samples of train set as in [11]; and the rest of train set is used for training VNER model. We then train VNER model on both VLSP-2016 and VLSP-2018 datasets with the train set and further tune up the model with the development set.…”

Section: B Experimental Settingsmentioning

confidence: 99%

See 2 more Smart Citations

Attentive Neural Network for Named Entity Recognition in Vietnamese

Nguyen

Nanping

Nguyen

2019

2019 IEEE-RIVF International Conference on Computing and Communication Technologies (RIVF)

View full text Add to dashboard Cite

We propose an attentive neural network for the task of named entity recognition in Vietnamese. The proposed attentive neural model makes use of character-based language models and word embeddings to encode words as vector representations. A neural network architecture of encoder, attention, and decoder layers is then utilized to encode knowledge of input sentences and to label entity tags. The experimental results show that the proposed attentive neural network achieves the state-of-the-art results on the benchmark named entity recognition datasets in Vietnamese in comparison to both hand-crafted features based models and neural models.Index Terms-named entity recognition, neural network, conditional random fields

show abstract

Section: Resultsmentioning

confidence: 99%

Section: Related Workmentioning

confidence: 99%

Section: B Experimental Settingsmentioning

confidence: 99%

See 1 more Smart Citation

Attentive Neural Network for Named Entity Recognition in Vietnamese

Nguyen

Nanping

Nguyen

2019

2019 IEEE-RIVF International Conference on Computing and Communication Technologies (RIVF)

View full text Add to dashboard Cite

show abstract

“…Finally, we create a relationship for the triple (e i , v i , e i+1 ). Besides that, we also experienced another library named VnCoreNLP [23]. VnCoreNLP is also an open source that can be downloaded from [23].…”

Section: ��mentioning

confidence: 99%

“…Besides that, we also experienced another library named VnCoreNLP [23]. VnCoreNLP is also an open source that can be downloaded from [23]. The difference with underthesea is that VnCoreNLP has dependency parsing feature that can analyze relationships between parts of speech in a Vietnamese sentence.…”

Section: ��mentioning

confidence: 99%

BERT+vnKG: Using Deep Learning and Knowledge Graph to Improve Vietnamese Question Answering System

Phan¹,

Do²

2020

IJACSA

View full text Add to dashboard Cite

A question answering (QA) system based on natural language processing and deep learning is a prominent area and is being researched widely. The Long Short-Term Memory (LSTM) model that is a variety of Recurrent Neural Network (RNN) used to be popular in machine translation, and question answering system. However, that model still has certainly limited capabilities, so a new model named Bidirectional Encoder Representation from Transformer (BERT) emerged to solve these restrictions. BERT has more advanced features than LSTM and shows state-of-the-art results in many tasks, especially in multilingual question answering system over the past few years. Nevertheless, we tried applying multilingual BERT model for a Vietnamese QA system and found that BERT model still has certainly limitation in term of time and precision to return a Vietnamese answer. The purpose of this study is to propose a method that solved above restriction of multilingual BERT and applied for question answering system about tourism in Vietnam. Our method combined BERT and knowledge graph to enhance accurately and find quickly for an answer. We experimented our crafted QA data about Vietnam tourism on three models such as LSTM, BERT fine-tuned multilingual for QA (BERT for QA), and BERT+vnKG. As a result, our model outperformed two previous models in terms of accuracy and time. This research can also be applied to other fields such as finance, e-commerce, and so on.

show abstract

Image Captioning in Vietnamese Language Based on Deep Learning Network

Tien

Nguyễn

2020

Communications in Computer and Information Science

View full text Add to dashboard Cite

VnCoreNLP: A Vietnamese Natural Language Processing Toolkit

Cited by 113 publications

References 16 publications

Attentive Neural Network for Named Entity Recognition in Vietnamese

Attentive Neural Network for Named Entity Recognition in Vietnamese

BERT+vnKG: Using Deep Learning and Knowledge Graph to Improve Vietnamese Question Answering System

Image Captioning in Vietnamese Language Based on Deep Learning Network

Contact Info

Product

Resources

About