2020
DOI: 10.3390/app10103386
|View full text |Cite
|
Sign up to set email alerts
|

Comparing the Quality and Speed of Sentence Classification with Modern Language Models

Abstract: After the advent of Glove and Word2vec, the dynamic development of language models (LMs) used to generate word embeddings has enabled the creation of better text classifier frameworks. With the vector representations of words generated by newer LMs, embeddings are no longer static but are context-aware. However, the quality of results provided by state-of-the-art LMs comes at the price of speed. Our goal was to present a benchmark to provide insight into the speed–quality trade-off of a sentence classifier fra… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
8
0
2

Year Published

2021
2021
2022
2022

Publication Types

Select...
8
1
1

Relationship

3
7

Authors

Journals

citations
Cited by 15 publications
(10 citation statements)
references
References 19 publications
0
8
0
2
Order By: Relevance
“…This method benefits from the recent advantages in the field of ML, DL and NLP and operates on unstructured tweet text only. Based on performance of recent methods in this field [ 31 , 32 ], we decided to train a robustly optimized BERT pretraining approach model (RoBERTa “large” version) [ 33 ] and gradient boosting classifier (XGBoost) [ 34 ] on a sample of our Twitter data and further automatically infer the classes of all remaining tweets.…”
Section: Methodsmentioning
confidence: 99%
“…This method benefits from the recent advantages in the field of ML, DL and NLP and operates on unstructured tweet text only. Based on performance of recent methods in this field [ 31 , 32 ], we decided to train a robustly optimized BERT pretraining approach model (RoBERTa “large” version) [ 33 ] and gradient boosting classifier (XGBoost) [ 34 ] on a sample of our Twitter data and further automatically infer the classes of all remaining tweets.…”
Section: Methodsmentioning
confidence: 99%
“…Deep learning methods are effective for prediction because they automatically extract appropriate features from datasets [ 34 ]. RNN, a deep learning method, can store extensive historical information and use it to accurately predict the next steps in time-series problems [ 35 ].…”
Section: Mathematical Modelsmentioning
confidence: 99%
“…Proper configuration of ML and DL models requires experiments and studies. In order to configure model parameters in this study, we based them on past research [63] and our experience in the field.…”
Section: Configuration Of Models Which Used Lstmsmentioning
confidence: 99%