2018
DOI: 10.1111/rssa.12315

Statistical Challenges of Administrative and Transaction Data

Abstract: Administrative data are becoming increasingly important. They are typically the side effect of some operational exercise and are often seen as having significant advantages over alternative sources of data. Although it is true that such data have merits, statisticians should approach the analysis of such data with the same cautious and critical eye as they approach the analysis of data from any other source. The paper identifies some statistical challenges, with the aim of stimulating debate about and …

Cited by 101 publications (99 citation statements)
References 123 publications (134 reference statements)
“…(). A number of authors warn that these issues have the potential to provide misleading research outcomes and decisions; see, for example, Boyd and Crawford (), Wigan and Clarke (), Harford (), Hand (, ) and Fiebig ()…”
Section: A Statistical Toolbox (mentioning)
confidence: 99%
“…In particular, Harford () and Hand () make a point that gathering more and more data in an effort to achieve ‘T = ALL’ is impossible, with potentially undesirable consequences. This also violates the notion of random sampling underlying the statistical theory of inference, as Hand (; p. 567) points out. Fiebig () argues that big data is not synonymous with better or more informative data, pointing out that it is often subject to missing observations and measurement errors.…”
Section: A Statistical Toolbox (mentioning)
confidence: 99%
“…Much, and perhaps all, that is at issue in [15] is very important for all that is under discussion here. This reference, [15], describes the problems of data quality, in the Big Data context, relating to administrative data. Hence, data curation is very relevant for reproducibility of analytics.…”
Section: Integration Of Data and Analytics: Context Of Applications (mentioning)
confidence: 99%
“…Much, and perhaps all, that is at issue in [17] is very important for all that is under discussion here. This reference, [17], describes the problems of data quality, in the Big Data context, relating to administrative data. Hence data curation is very relevant for reproducibility of analytics.…”
Section: Integration Of Data and Analytics: Context Of Applications (mentioning)
confidence: 99%