2022
DOI: 10.31235/osf.io/ft84u
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

From Documents to Data: A Framework for Total Corpus Quality

Abstract: As large corpora of digitized text and novel methodologies become increasingly available, researchers are rediscovering textual data’s potential fruitfulness for inquiries into social and cultural phenomena. While textual corpora show great promise to enrich our knowledge of the social, avoiding problems related to data quality remains a challenge to related empirical research. Hence, evaluating the quality of a corpus will be pivotal for future social science inquiries. We propose a conceptual framework for t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
0
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(1 citation statement)
references
References 64 publications
0
0
0
Order By: Relevance
“…The relative error is therefore only able to identify groups of seemingly errorprone factors in the research process, rather than definite sources of error. However, although an absolute measure of error (Meyer & Mittag, 2021;Hurtado Bodell et al, 2022) would be better suited to assess DBD quality, such an approach must first identify a true effect from which different error dimensions deviate. This is impossible in most practical cases.…”
Section: Discussionmentioning
confidence: 99%
“…The relative error is therefore only able to identify groups of seemingly errorprone factors in the research process, rather than definite sources of error. However, although an absolute measure of error (Meyer & Mittag, 2021;Hurtado Bodell et al, 2022) would be better suited to assess DBD quality, such an approach must first identify a true effect from which different error dimensions deviate. This is impossible in most practical cases.…”
Section: Discussionmentioning
confidence: 99%