Summary: | There are multiple tests of homogeneity of binomial proportions in the statistics literature. However, when working with sparse data, most test procedures may fail to perform well. In this article we review nine classical and recent testing procedures, including the standard Pearson and likelihood ratio tests; exact conditional and unconditional tests; tests based on moment matching chi-squared approximations; a recently proposed test based on a normal approximation in an asymptotic framework for sparse data; and a recent test based on higher order moment corrections using an Edgeworth approximation. For each test we review its theoretical underpinning, and show how to calculate the P-value. Most of the P-values can be readily calculated in a statistical computing software package such as R. We compare type I error probability and power via simulation. As expected, none of the procedures uniformly outperforms the others in terms of type I error probability and power, but we can make some recommendations based on our empirical results. In particular, we indicate scenarios in which certain otherwise reasonable test procedures can perform inadequately.
|