Bootstrapping phylogenies: Large deviations and dispersion effects

Michael A. Newton

1996 Biometrika, 83, 315--328

Formerly, Technical Report 923, Department of Statistics, University of Wisconsin, Madison.

Abstract:

A large deviation result is established for the bootstrap empirical distribution in a finite sample space, thereby validating both nonparametric and parametric bootstrapping in certain phylogenetic inference problems. The bias previously observed in the bootstrap distribution of the estimated tree topology is shown to stem from dispersion effects in the joint distribution of sample and bootstrap empirical distributions. Both results are examined for maximum likelihood estimation in a three-taxon model having particularly simple geometry. They are also applicable to discrete parameter problems outside of phylogenetic inference.
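
The objects named in the abstract can be made concrete with a small sketch. The code below is an illustration only, not the paper's method: it draws a sample from a finite sample space, forms the sample empirical distribution, resamples with replacement to get a bootstrap empirical distribution, and measures the relative entropy between the two. The site-pattern labels, the generating weights, the sample size, and the direction of the divergence are all assumptions chosen for the example.

```python
import math
import random
from collections import Counter


def empirical_distribution(sample, categories):
    """Return the empirical probability of each category in the sample."""
    counts = Counter(sample)
    n = len(sample)
    return [counts[c] / n for c in categories]


def bootstrap_empirical(sample, categories, rng):
    """Resample len(sample) items with replacement; return the bootstrap empirical distribution."""
    resample = rng.choices(sample, k=len(sample))
    return empirical_distribution(resample, categories)


def relative_entropy(p, q):
    """Relative entropy D(p || q); infinite if p puts mass where q has none."""
    total = 0.0
    for pi, qi in zip(p, q):
        if pi > 0:
            if qi == 0:
                return math.inf
            total += pi * math.log(pi / qi)
    return total


# Hypothetical finite sample space: site patterns for three taxa (assumed labels).
categories = ["xxx", "xxy", "xyx", "yxx", "xyz"]
rng = random.Random(0)

# Hypothetical data: 200 sites drawn with assumed pattern probabilities.
sample = rng.choices(categories, weights=[0.6, 0.15, 0.1, 0.1, 0.05], k=200)

p_hat = empirical_distribution(sample, categories)      # sample empirical distribution
p_star = bootstrap_empirical(sample, categories, rng)   # bootstrap empirical distribution
divergence = relative_entropy(p_star, p_hat)            # finite: p_star only has mass where p_hat does
```

Repeating the resampling step many times and recording a tree estimate computed from each `p_star` would give a nonparametric bootstrap distribution over topologies; that estimation step is omitted here because it depends on the model.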

Keywords: bias, bootstrap efficiency, cladistics, DNA, discrete parameter space, entropy, molecular evolution, relative entropy, systematics, tree topology

