The Impacts of Reading Recovery at Scale: Results From the 4-Year i3 External Evaluation

Sirinides, Philip; Gray, Abigail; May, Henry

doi:10.3102/0162373718764828

Cited by 29 publications

(33 citation statements)

References 27 publications

“…A recent evaluation of the Reading Recovery intervention provides an exemplar two-level MSRT with a latent outcome at Level 1 (e.g., Sirinides, Gray, & May, 2018). Schools are randomly selected to ensure representativeness and then students are randomized to the treatment condition within these schools.…”

Section: Results (mentioning)

confidence: 99%

“…The primary outcome of interest is reading achievement, a latent construct estimated by the Total Reading standard score from the Iowa Tests of Basic Skills. It is likely that the reading achievement score outcome includes some degree of measurement error with reliability of these tests typically around .8–.9 (Sirinides et al, 2018).…”

Section: Results (mentioning)

confidence: 99%

“…Based on Sirinides, Gray, and May (2018), let us assume a large sample of students and schools to investigate the scalability of the Reading Recovery program and a specific subsample of 3,400 students from 1,100 rural schools (i.e., ≈ 3 per school). We predict a Reading Recovery treatment effect of $\delta_j = .1$, student-level variance on the standardized reading test of $\sigma_Y^2 = .9$, and variance of the Reading Recovery treatment effect across schools of $\tau_{\delta|}^2 = .3$.…”

Section: Results (mentioning)

confidence: 99%

“…For an MSRT with a large sample size, the more important design consideration is often MDE. Using the parameter values described above but with the full sample from Sirinides et al (2018; $n_1 = 6{,}800$ and $n_2 = 1{,}200$), the MDE is .081 and .084 when considering unreliability in the reading test scores. Again, increases to the MDE are minimized by the high reliability of the standardized tests scores and to a lesser degree the larger individual sample size.…”

Section: Results (mentioning)

confidence: 99%
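
The MDE calculation quoted above can be approximated with the conventional formula for a two-level multisite-randomized trial. The sketch below is not the citing authors' code: it assumes balanced assignment within schools, a normal approximation in place of the t distribution, no covariates, and no measurement-error adjustment, and the function name `msrt_mde` is hypothetical. Because of those assumptions it should land near the quoted figures without necessarily reproducing them.

```python
from math import sqrt
from statistics import NormalDist


def msrt_mde(n_students, n_schools, sigma2, tau2,
             p_treat=0.5, alpha=0.05, power=0.80):
    """Conventional MDE for a two-level multisite-randomized trial.

    Assumptions (not from the source): equal school sizes, a normal
    approximation for the critical values, no covariate adjustment,
    and a perfectly reliable outcome.
    """
    n_per_school = n_students / n_schools
    # Variance of the average treatment effect estimate:
    # between-school effect variance plus within-school sampling variance.
    var_effect = (tau2 / n_schools
                  + sigma2 / (p_treat * (1 - p_treat) * n_schools * n_per_school))
    z = NormalDist()
    multiplier = z.inv_cdf(1 - alpha / 2) + z.inv_cdf(power)
    return multiplier * sqrt(var_effect)


# Parameter values taken from the quoted statements:
# 6,800 students, 1,200 schools, student-level variance .9,
# treatment-effect variance across schools .3.
mde = msrt_mde(6800, 1200, sigma2=0.9, tau2=0.3)
print(f"approximate MDE: {mde:.3f}")
```

Rerunning the function with the rural subsample quoted earlier (3,400 students, 1,100 schools) gives a larger MDE, which is the expected direction when fewer students are sampled.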

“…Last, we can consider planning this type of evaluation under an optimal sampling framework. Sirinides et al (2018) note the high cost of the intensive Reading Recovery program that entails specialized staff training and serves relatively few students at a time. As far as research costs, let us assume sampling a school costs around US$3,000, while the cost to sample a student costs around US$1,000 ($c_2 / c_1 = 3/1$) with a budget of US$4 million (Sirinides et al, 2018).…”

Section: Results (mentioning)

confidence: 99%
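
The budget scenario in the quoted statement can be worked through with the conventional optimal-allocation result for a multisite trial, sketched below. This is an illustration under stated assumptions, not the citing authors' adjusted formula: it assumes balanced assignment within schools and a perfectly reliable outcome, and the function name `optimal_msrt_allocation` is hypothetical. Minimizing the effect variance (treatment-effect variance across schools divided by the number of schools, plus four times the student-level variance divided by the total number of students) subject to a fixed budget gives a per-school sample of 2 × sqrt((school cost / student cost) × (student-level variance / treatment-effect variance)).

```python
from math import sqrt


def optimal_msrt_allocation(budget, cost_school, cost_student, sigma2, tau2):
    """Conventional optimal allocation for a balanced multisite trial.

    Assumptions (not from the source): half of each school's students
    are treated, and the outcome is measured without error.
    Returns (students per school, number of schools), both unrounded.
    """
    # Students per school that minimizes the effect variance per dollar.
    n_per_school = 2 * sqrt((cost_school / cost_student) * sigma2 / tau2)
    # Spend the whole budget on schools of that size.
    n_schools = budget / (cost_school + cost_student * n_per_school)
    return n_per_school, n_schools


# Values from the quoted statements: US$4 million budget, US$3,000 per
# school, US$1,000 per student, student-level variance .9, and
# treatment-effect variance across schools .3.
n_opt, j_opt = optimal_msrt_allocation(4_000_000, 3_000, 1_000, 0.9, 0.3)
print(f"students per school: {n_opt:.1f}, schools: {j_opt:.0f}")
```

Under these assumptions the allocation is about 6 students in each of roughly 444 schools. The citing paper's abstract indicates that accounting for outcome measurement error shifts the efficient design toward more individuals per cluster, so this conventional figure is best read as a lower bound on the per-school sample.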

Optimal Design of Cluster- and Multisite-Randomized Studies Using Fallible Outcome Measures

Cox; Kelcey (2019). Eval Rev

Background: Evaluation studies frequently draw on fallible outcomes that contain significant measurement error. Ignoring outcome measurement error in the planning stages can undermine the sufficiency and efficiency of an otherwise well-designed study and can further constrain the evidence studies bring to bear on the effectiveness of programs. Objectives: We develop simple formulas to adjust statistical power, minimum detectable effect (MDE), and optimal sample allocation formulas for two-level cluster- and multisite-randomized designs when the outcome is subject to measurement error. Results: The resulting adjusted formulas suggest that outcome measurement error typically amplifies treatment effect uncertainty, reduces power, increases the MDE, and undermines the efficiency of conventional optimal sampling schemes. Therefore, achieving adequate power for a given effect size will typically demand increased sample sizes when considering fallible outcomes, while maintaining design efficiency will require increasing portions of a budget be applied toward sampling a larger number of individuals within clusters. We illustrate evaluation planning with the new formulas while comparing them to conventional formulas using hypothetical examples based on recent empirical studies. To encourage adoption of the new formulas, we implement them in the R package PowerUpR and in the PowerUp software.
