2020
DOI: 10.1007/s10940-020-09461-x
|View full text |Cite
|
Sign up to set email alerts
|

Dude, Where’s My Treatment Effect? Errors in Administrative Data Linking and the Destruction of Statistical Power in Randomized Experiments

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
15
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
1
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 18 publications
(15 citation statements)
references
References 131 publications
0
15
0
Order By: Relevance
“…Due to the absence of global identifiers across records systems, we used a combination of the first three letters of a defendant's first name, the first three letters of a defendant's last name, and year of birth to link assessments to jail and court records within each county. Prior studies have shown that exact name matching can result in data loss and lower statistical power (Tahamont et al, 2020). We have found that local records often contain slight name misspellings, and this shortened identifier can detect the same individuals with good accuracy in a given jurisdiction.…”
mentioning
confidence: 82%
“…Due to the absence of global identifiers across records systems, we used a combination of the first three letters of a defendant's first name, the first three letters of a defendant's last name, and year of birth to link assessments to jail and court records within each county. Prior studies have shown that exact name matching can result in data loss and lower statistical power (Tahamont et al, 2020). We have found that local records often contain slight name misspellings, and this shortened identifier can detect the same individuals with good accuracy in a given jurisdiction.…”
mentioning
confidence: 82%
“…When referring to links within a dataset, this is more commonly known as deduplication. Record linkage has received much attention in recent years with advances in computational techniques for linking data by matching on identifying information (Tahamont et al 2021).…”
Section: Quantifying Consolidationmentioning
confidence: 99%
“…Although a single entity might own many properties, its name or contact information might not be entered the same way in every record -creating errors that needed to be corrected. Fortunately, advances in text matching methods have significantly increased our ability to deal with the issues of messy data (Tahamont et al 2021).…”
Section: Introductionmentioning
confidence: 99%
“…Recent work has also shown that even small amounts of linkage error can result in large effects on false negative (Type II) error rates in research studies. This is especially the case with small sample sizes that can occur with the rare effects that are often sought to be identified via record linkage from large population databases [78]. Such errors can even lead to research studies that are based on a small linked subpopulation to become useless [8].…”
Section: Misconceptions Due To Linking Datamentioning
confidence: 99%
“…One example is the increasing use of medical databases for public health research, compiled from the electronic health records of patients in hospitals or doctor's clinics, and potentially linked with databases that contain other personal information such as education or employment details of these patients. While analysing such linked data can provide exciting new insights into the effects of people's social status upon their health, not considering any potential bias or other limitations of such data can lead to wrong conclusions [10,26,43,78,83].…”
Section: Introductionmentioning
confidence: 99%