Improving BERT-Based Text Classification With Auxiliary Sentence and Domain Knowledge

Yu, Shujuan; Su, Jindian; Luo, Da

doi:10.1109/access.2019.2953990

Cited by 96 publications

(38 citation statements)

References 4 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…They have experienced that transfer learning models can perform better than other state-of-the-art methods in NLP. BERT is trained on BookCorpus, text corpus, and Wikipedia which can give overwhelming results in some areas of natural language processing but it still needs to be improved [ 32 ]. It somewhere lacks domain-related and task-related knowledge.…”

Section: Literature Reviewmentioning

confidence: 99%

A Fine-Tuned BERT-Based Transfer Learning Approach for Text Classification

Qasim

Bangyal

Alqarni

et al. 2022

Journal of Healthcare Engineering

103

View full text Add to dashboard Cite

Text Classification problem has been thoroughly studied in information retrieval problems and data mining tasks. It is beneficial in multiple tasks including medical diagnose health and care department, targeted marketing, entertainment industry, and group filtering processes. A recent innovation in both data mining and natural language processing gained the attention of researchers from all over the world to develop automated systems for text classification. NLP allows categorizing documents containing different texts. A huge amount of data is generated on social media sites through social media users. Three datasets have been used for experimental purposes including the COVID-19 fake news dataset, COVID-19 English tweet dataset, and extremist-non-extremist dataset which contain news blogs, posts, and tweets related to coronavirus and hate speech. Transfer learning approaches do not experiment on COVID-19 fake news and extremist-non-extremist datasets. Therefore, the proposed work applied transfer learning classification models on both these datasets to check the performance of transfer learning models. Models are trained and evaluated on the accuracy, precision, recall, and F1-score. Heat maps are also generated for every model. In the end, future directions are proposed.

show abstract

Section: Literature Reviewmentioning

confidence: 99%

A Fine-Tuned BERT-Based Transfer Learning Approach for Text Classification

Qasim

Bangyal

Alqarni

et al. 2022

Journal of Healthcare Engineering

103

View full text Add to dashboard Cite

show abstract

“…Sun et al [41] make detailed experiments on BERT and suggest several techniques to improve the results on text classification task. Yu et al [50] propose a BERT-based model for text classification to utilize more task-specific knowledge and achieve better results on multi-classification task. Yeung [49] inserts legal domain vocabulary to BERT, reports no improvement and explains their findings by the high overlap between vocabularies.…”

Section: Evaluation and Limitations Of Plmsmentioning

confidence: 99%

A Comparison of Pre-Trained Language Models for Multi-Class Text Classification in the Financial Domain

Arslan

Allix

Veiber

et al. 2021

Companion Proceedings of the Web Conference 2021

View full text Add to dashboard Cite

Neural networks for language modeling have been proven effective on several sub-tasks of natural language processing. Training deep language models, however, is time-consuming and computationally intensive. Pre-trained language models such as BERT are thus appealing since (1) they yielded state-of-the-art performance, and (2) they offload practitioners from the burden of preparing the adequate resources (time, hardware, and data) to train models. Nevertheless, because pre-trained models are generic, they may underperform on specific domains. In this study, we investigate the case of multi-class text classification, a task that is relatively less studied in the literature evaluating pre-trained language models. Our work is further placed under the industrial settings of the financial domain. We thus leverage generic benchmark datasets from the literature and two proprietary datasets from our partners in the financial technological industry. After highlighting a challenge for generic pre-trained models (BERT, DistilBERT, RoBERTa, XLNet, XLM) to classify a portion of the financial document dataset, we investigate the intuition that a specialized pre-trained model for financial documents, such as FinBERT, should be leveraged. Nevertheless, our experiments show that the FinBERT model, even with an adapted vocabulary, does not lead to improvements compared to the generic BERT models. CCS CONCEPTS• Applied computing → Text processing.

show abstract

“…Likewise, the precision represents the ratio of correctly predicted positive labels through the proposed approach to the total predicted positive labels. Consequently, the F-measure is calculated as the harmonic mean of precision and recall to show the cumulative effect of both measures as shown in equation (12).…”

Section: Accuracy =mentioning

confidence: 99%

“…The BERT model is designed to pre-train deep bidirectional representations of unlabeled text by co-conditioning both left and right context in all layers. A different contextual embeddings is produced by BERT according to the input sentence [12]. Nevertheless, BERT corrupts the input with masks, suffers a discrepancy between pre-training and fine-tuning, and ignores the interdependency between masked positions, thus leading to the loss of important information [13]- [15].…”

Section: Introductionmentioning

confidence: 99%

Sentence-Level Aspect-Based Sentiment Analysis for Classifying Adverse Drug Reactions (ADRs) Using Hybrid Ontology-XLNet Transfer Learning

2021

View full text Add to dashboard Cite

This paper presents a hybrid ontology-XLNet sentiment analysis classification approach for sentence-level aspects. The main objective of the proposed approach allows discovering user social data considering the extracted in-depth inference about sentiment depending on the context. Thus, in this paper, we investigate the contribution of utilizing the lexicalized ontology to improve the aspect-based sentiment analysis performance through extracting the indirect relationships in user social data. The XLNet model is utilized for extracting the neighboring contextual meaning and concatenating it with each embeddings word to produce a more comprehensive context and enhance feature extraction. In the proposed approach, Bidirectional Long Short Term Memory (Bi-LSTM) networks are used for classifying the aspects in online user reviews. Various experiments considering Adverse Drug Reactions (ADRs) discovery are conducted on six drug-related social data real-world datasets to evaluate the performance of the proposed approach using several measures. Obtained experimental results show that the proposed approach outperformed other tested state-of-the-art related approaches through improving feature extraction of unstructured social media text and accordingly improving the overall accuracy of sentiment classification. A significant accuracy of 98% and F-measure of 96.4% are achieved by the proposed ADRs aspect-based sentiment analysis approach.

show abstract

Improving BERT-Based Text Classification With Auxiliary Sentence and Domain Knowledge

Cited by 96 publications

References 4 publications

A Fine-Tuned BERT-Based Transfer Learning Approach for Text Classification

A Fine-Tuned BERT-Based Transfer Learning Approach for Text Classification

A Comparison of Pre-Trained Language Models for Multi-Class Text Classification in the Financial Domain

Sentence-Level Aspect-Based Sentiment Analysis for Classifying Adverse Drug Reactions (ADRs) Using Hybrid Ontology-XLNet Transfer Learning

Contact Info

Product

Resources

About