2022
DOI: 10.1007/s42001-022-00191-7
|View full text |Cite
|
Sign up to set email alerts
|

A comparison of approaches for imbalanced classification problems in the context of retrieving relevant documents for an analysis

Abstract: One of the first steps in many text-based social science studies is to retrieve documents that are relevant for an analysis from large corpora of otherwise irrelevant documents. The conventional approach in social science to address this retrieval task is to apply a set of keywords and to consider those documents to be relevant that contain at least one of the keywords. But the application of incomplete keyword lists has a high risk of drawing biased inferences. More complex and costly methods such as query ex… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
references
References 91 publications
(115 reference statements)
0
0
0
Order By: Relevance