1 option
Factorial Designs, Model Selection, and (Incorrect) Inference in Randomized Experiments / Karthik Muralidharan, Mauricio Romero, Kaspar Wüthrich.
- Format:
- Book
- Author/Creator:
- Muralidharan, Karthik.
- Series:
- Working Paper Series (National Bureau of Economic Research) no. w26562.
- NBER working paper series no. w26562
- Language:
- English
- Physical Description:
- 1 online resource: illustrations (black and white);
- Other Title:
- Factorial Designs, Model Selection, and
- Place of Publication:
- Cambridge, Mass. National Bureau of Economic Research 2019.
- Summary:
- Factorial designs are widely used for studying multiple treatments in one experiment. While t-tests based on the "long" model (including main and interaction effects) provide valid inferences against "business-as-usual" counterfactuals, "short" model t-tests (that ignore interactions) yield higher power if the interactions are zero, but incorrect inferences otherwise. Out of 27 factorial experiments published in top-5 journals in 2007-2017, 19 use the short model. We reanalyze these experiments, and show that over half of their published results lose significance when interactions are included. We show that testing the interactions using the long model and presenting the short model if the interactions are not significantly different from zero leads to incorrect inference due to the implied data-dependent model selection. Based on recent econometric advances, we show that local power improvements over the long model are possible. However, if the main effects are of primary interest, leaving the interaction cells empty yields valid inferences and global power improvements. In addition, the sample size needed to detect interactions is substantially larger than that required to detect main effects, resulting in most experiments being under-powered to detect interactions. Thus, using factorial designs to explore whether interactions are meaningful can be problematic because interaction estimates are likely to considerably overestimate the magnitude of the true effect conditional on being significant.
- Notes:
- Print version record
- December 2019.
The Penn Libraries is committed to describing library materials using current, accurate, and responsible language. If you discover outdated or inaccurate language, please fill out this feedback form to report it and suggest alternative language.