Vietnamese Text Classification Algorithm using Long Short Term Memory and Word2Vec

Phat, Huu Nguyen; Anh, Nguyen Thi Minh

doi:10.15622/ia.2020.19.6.5

Cited by 7 publications

(2 citation statements)

References 22 publications

“…LSTM with the Word2Vec model achieves an F1-score of 98.03% for word segmentation in the Arabic language (Almuhareb et al. 2019). Neural network-based word embedding efficiently models a word and its context and has become one of the most widely used methods of word distribution representation (N.H. Phat and Anh 2020; Alharthi et al. 2021).…”

Section: Review On Text Analytics Word Embedding Application and Deep... (mentioning)

confidence: 99%

“… Malla and Alphonse (2021) | Twitter tweet analysis for the disease information collection | COVID-19 labeled English dataset from Twitter | Majority voting based ensemble deep learning model | RoBERT, BERTweet, CT-BERT | RoBERT achieves an accuracy of 90.30%
38. Phat and Anh (2020) | Vietnamese text classification | Vietnamese news articles | LSTM, CNN, SVM, NB | Word2Vec | LSTM + Word2Vec achieves an F1-score of 95.74%
39. Grzeça et al. (2020) | Social networking site tweets analysis for identification of alcohol-related tweets | Datasets DS1-Q1, Q2, Q3 | SVM, XGBoost, CNN, BiLSTM | DSWE (Drink2Vec), BERT | CNN + Drink2Vec achieves an F1-score of 94.45%

Abbreviations: SANAD: Single-label Arabic news articles datasets; NADiA: News articles datasets in Arabic with multi-labels; HAN: Hierarchical attention network; HDBSCAN: Hierarchical Density-Based Spatial Clustering of Applications with Noise; LDA: Logistic regression, linear discriminant analysis; QDA: Quadratic discriminant analysis; NB: Naïve Bayes; SVM: Support vector machine; KNN: k-nearest neighbor; DT: Decision tree; RF: Random forest; XGBoost; MLP: Multilayer perceptron; LIWC: Linguistic Inquiry and Word Count features; NER: Named entity recognition; PMMC: Process model matching contest dataset; DLMF: Digital Library of Mathematical Functions; GB: Gradient Boosting; SGC: Stochastic Gradient Descent; DFFNN: Deep feed-forward neural network.…”

Section: Appendix A (mentioning)

confidence: 99%

Impact of word embedding models on text analytics in deep learning environment: a review

2023

The selection of word embedding and deep learning models for better outcomes is vital. Word embeddings are an n-dimensional distributed representation of a text that attempts to capture the meanings of the words. Deep learning models utilize multiple computing layers to learn hierarchical representations of data. The word embedding technique represented by deep learning has received much attention. It is used in various natural language processing (NLP) applications, such as text classification, sentiment analysis, named entity recognition, topic modeling, etc. This paper reviews the representative methods of the most prominent word embedding and deep learning models. It presents an overview of recent research trends in NLP and a detailed understanding of how to use these models to achieve efficient results on text analytics tasks. The review summarizes, contrasts, and compares numerous word embedding and deep learning models and includes a list of prominent datasets, tools, APIs, and popular publications. A reference for selecting a suitable word embedding and deep learning approach is presented based on a comparative analysis of different techniques to perform text analytics tasks. This paper can serve as a quick reference for learning the basics, benefits, and challenges of various word representation approaches and deep learning models, with their application to text analytics and a future outlook on research. It can be concluded from the findings of this study that domain-specific word embedding and the long short term memory model can be employed to improve overall text analytics task performance.

Proposing Recommendation System Using Bag of Word and Multi-label Support Vector Machine Classification

Huu, Anh, Trong, et al. 2021

Communications in Computer and Information Science

User-Item Correlation in Hybrid Neighborhood-Based Recommendation System with Synthetic User Data

Duong, Giang, Cao, et al. 2022

2022 IEEE Ninth International Conference on Communications and Electronics (ICCE)
