2023
DOI: 10.3390/ijgi12060240
|View full text |Cite
|
Sign up to set email alerts
|

Applicability Analysis and Ensemble Application of BERT with TF-IDF, TextRank, MMR, and LDA for Topic Classification Based on Flood-Related VGI

Abstract: Volunteered geographic information (VGI) plays an increasingly crucial role in flash floods. However, topic classification and spatiotemporal analysis are complicated by the various expressions and lengths of social media textual data. This paper conducted applicability analysis on bidirectional encoder representation from transformers (BERT) and four traditional methods, TextRank, term frequency–inverse document frequency (TF-IDF), maximal marginal relevance (MMR), and linear discriminant analysis (LDA), and … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2024
2024
2025
2025

Publication Types

Select...
6

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(3 citation statements)
references
References 48 publications
0
3
0
Order By: Relevance
“…TF-IDF is a commonly applied approach that assigns a weight to each term in a document based on its frequency within the document and inverse frequency across all documents. These resulting weights indicate the significance of each term in the document and serve as features for the machine learning algorithms ( Du et al, 2023 ).…”
Section: Methodsmentioning
confidence: 99%
“…TF-IDF is a commonly applied approach that assigns a weight to each term in a document based on its frequency within the document and inverse frequency across all documents. These resulting weights indicate the significance of each term in the document and serve as features for the machine learning algorithms ( Du et al, 2023 ).…”
Section: Methodsmentioning
confidence: 99%
“…This classification includes [28,33,34,[46][47][48], which are research works aimed at summarizing and retrieving crisis-relevant information from social media data to enhance situational awareness and decision making. Table 6 presents this detailed categorization scheme (Detailed in Appendix A, Table A3).…”
Section: Information Summarization and Retrievalmentioning
confidence: 99%
“…Other (e.g., SVM, TF-IDF, TextRank, LDA) [33,48] Effective summarization and topic classification in specific disaster-related contexts.…”
Section: Information Summarization and Retrievalmentioning
confidence: 99%