Sample design for analysis using high-influence probability sampling

Publication Name

Journal of the Royal Statistical Society. Series A: Statistics in Society

Abstract

Sample designs are typically developed to estimate summary statistics such as means, proportions and prevalences. Analytical outputs may also be a priority but there are fewer methods and results on how to efficiently design samples for the fitting and estimation of statistical models. This paper develops a general approach for determining efficient sampling designs for probability-weighted maximum likelihood estimators and considers application to generalized linear models. We allow for non-ignorable sampling, including outcome-dependent sampling. The new designs have probabilities of selection closely related to influence statistics such as dfbeta and Cook's distance. The new approach is shown to perform well in a simulation based on data from the New Zealand Health Survey.

Open Access Status

This publication is not available as open access

Volume

185

Issue

4

First Page

1733

Last Page

1756

Funding Sponsor

University of Southampton

Share

COinS
 

Link to publisher version (DOI)

http://dx.doi.org/10.1111/rssa.12916