Matt Goldman scite author profile

When comparing two distributions, it is often helpful to learn at which quantiles or values there is a statistically significant difference. This provides more information than the binary "reject" or "do not reject" decision of a global goodness-of-fit test. Framing our question as multiple testing across the continuum of quantiles τ ∈ (0, 1) or values r ∈ R, we show that the Kolmogorov-Smirnov test (interpreted as a multiple testing procedure) achieves strong control of the familywise error rate. However, its well-known flaw of low sensitivity in the tails remains. We provide an alternative method that retains such strong control of familywise error rate while also having even sensitivity, i.e., equal pointwise type I error rates at each of n → ∞ order statistics across the distribution. Our one-sample method computes instantly, using our new formula that also instantly computes goodness-of-fit p-values and uniform confidence bands. To improve power, we also propose stepdown and pre-test procedures that maintain control of the asymptotic familywise error rate. One-sample and two-sample cases are considered, as well as extensions to regression discontinuity designs and conditional distributions. Simulations, empirical examples, and code are provided.JEL classification: C12, C14, C21

show abstract

Fractional order statistic approximation for nonparametric conditional quantile inference

Goldman

Kaplan

2017

Journal of Econometrics

View full text Add to dashboard Cite

Using and extending fractional order statistic theory, we characterize the O(n −1 ) coverage probability error of the previously proposed confidence intervals for population quantiles using L-statistics as endpoints in Hutson (1999). We derive an analytic expression for the n −1 term, which may be used to calibrate the nominal coverage level to get O n −3/2 [log(n)] 3 coverage error. Asymptotic power is shown to be optimal. Using kernel smoothing, we propose a related method for nonparametric inference on conditional quantiles. This new method compares favorably with asymptotic normality and bootstrap methods in theory and in simulations. Code is provided for both unconditional and conditional inference.JEL classification: C21

show abstract

Modeling Consumer Preferences and Price Sensitivities from Large-Scale Grocery Shopping Transaction Logs

Wan

Wang

Goldman

et al. 2017

View full text Add to dashboard Cite

Non‐parametric inference on (conditional) quantile differences and interquantile ranges, using L‐statistics

Goldman

Kaplan

2018

View full text Add to dashboard Cite

We provide novel, high-order accurate methods for non-parametric inference on quantile differences between two populations in both unconditional and conditional settings. These quantile differences correspond to (conditional) quantile treatment effects under (conditional) independence of a binary treatment and potential outcomes. Our methods use the probability integral transform and a Dirichlet (rather than Gaussian) reference distribution to pick appropriate L-statistics as confidence interval endpoints, achieving highorder accuracy. Using a similar approach, we also propose confidence intervals/sets for vectors of quantiles, interquantile ranges and differences of linear combinations of quantiles. In the conditional setting, when smoothing over continuous covariates, optimal bandwidth and coverage probability rates are derived for all methods. Simulations show that the new confidence intervals have a favourable combination of robust accuracy and short length compared with existing approaches. Detailed steps for confidence interval construction are provided in online Appendix E as supporting information, and code for all methods, simulations and empirical examples is provided.

show abstract

Network Experimentation at Scale

Karrer

Shi

Bhole

et al. 2021

View full text Add to dashboard Cite

We describe our framework, deployed at Facebook, that accounts for interference between experimental units through cluster-randomized experiments. We document this system, including the design and estimation procedures, and detail insights we have gained from the many experiments that have used this system at scale. We introduce a cluster-based regression adjustment that substantially improves precision for estimating global treatment effects as well as testing for interference as part of our estimation procedure. With this regression adjustment, we find that imbalanced clusters can better account for interference than balanced clusters without sacrificing accuracy. In addition, we show how logging exposure to a treatment can be used for additional variance reduction. Interference is a widely acknowledged issue with online field experiments, yet there is less evidence from real-world experiments demonstrating interference in online settings. We fill this gap by describing two case studies that capture significant network effects and highlight the value of this experimentation framework.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Matt Goldman

Comparing distributions by multiple testing across quantiles or CDF values

Fractional order statistic approximation for nonparametric conditional quantile inference

Modeling Consumer Preferences and Price Sensitivities from Large-Scale Grocery Shopping Transaction Logs

Non‐parametric inference on (conditional) quantile differences and interquantile ranges, using L‐statistics

Network Experimentation at Scale

Contact Info

Product

Resources

About