Two-stage sampling in the estimation of growth parameters and percentile norms: sample weights versus auxiliary variable estimation

George Vamvakas*, Courtenay Norbury, Andrew Pickles

*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

1 Citation (Scopus)


Background: The use of auxiliary variables with maximum likelihood parameter estimation for surveys that miss data by design is not a widespread approach, despite its documented improved efficiency over traditional approaches that deploy sampling weights. Although efficiency gains from the use of Normally distributed auxiliary variables in a model have been recorded in the literature, little is known about the effects of non-Normal auxiliary variables in the parameter estimation.

Methods: We simulate growth data to mimic SCALES, a two-stage survey of language development with a screening phase (stage one) for which data are observed for the whole sample and an intensive assessments phase (stage two), for which data are observed for a sub-sample, selected using stratified random sampling. In the simulation, we allow a fully observed Poisson distributed stratification criterion to be correlated with the partially observed model responses and develop five generalised structural equation growth models that host the auxiliary information from this criterion. We compare these models with each other and with a weighted growth model in terms of bias, efficiency, and coverage. We finally apply our best performing model to SCALES data and show how to obtain growth parameters and population norms.

Results: Parameter estimation from a model that incorporates a non-Normal auxiliary variable is unbiased and more efficient than its weighted counterpart. The auxiliary variable method is capable of producing efficient population percentile norms and velocities.

Conclusions: The deployment of a fully observed variable that dominates the selection of the sample and correlates strongly with the incomplete variable of interest appears beneficial for the estimation process.
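The contrast between sampling weights and auxiliary-variable estimation can be illustrated with a toy simulation. The sketch below is not the authors' method: it replaces the generalised structural equation growth models with a single cross-sectional outcome and a simple regression estimator. Every numerical value in it (population size, Poisson rate, stratum cutoff, selection probabilities, outcome model) is a hypothetical assumption chosen only for illustration. It draws a fully observed Poisson screening score, selects a stratified stage-two sub-sample, and compares three estimates of the population mean of the partially observed outcome: an unweighted sub-sample mean, an inverse-probability-weighted mean, and a regression estimate that uses the screening score as auxiliary information.

```python
import math
import random


def poisson(rng, lam):
    """Draw one Poisson(lam) variate with Knuth's multiplication method."""
    limit = math.exp(-lam)
    k, p = 0, 1.0
    while True:
        p *= rng.random()
        if p <= limit:
            return k
        k += 1


def simulate(n=20000, lam=3.0, cutoff=5, p_high=0.8, p_low=0.1, seed=1):
    """Toy two-stage design; all default values are hypothetical."""
    rng = random.Random(seed)

    # Stage one: screening score observed for the whole sample.
    x = [poisson(rng, lam) for _ in range(n)]
    # Outcome correlated with the score (assumed linear relation plus noise).
    y = [100.0 - 5.0 * xi + rng.gauss(0.0, 5.0) for xi in x]

    # Stage two: stratified random selection that over-samples high scores.
    prob = [p_high if xi >= cutoff else p_low for xi in x]
    chosen = [i for i in range(n) if rng.random() < prob[i]]
    xs = [x[i] for i in chosen]
    ys = [y[i] for i in chosen]

    # Benchmark: mean of the outcome over the whole simulated sample.
    true_mean = sum(y) / n

    # (1) Unweighted sub-sample mean, which ignores the design.
    naive = sum(ys) / len(ys)

    # (2) Inverse-probability-weighted mean.
    w = [1.0 / prob[i] for i in chosen]
    weighted = sum(wi * yi for wi, yi in zip(w, ys)) / sum(w)

    # (3) Auxiliary-variable estimate: fit y on x by least squares in the
    # sub-sample, then average the predictions over the whole sample.
    mx = sum(xs) / len(xs)
    my = sum(ys) / len(ys)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(xs, ys))
    sxx = sum((xi - mx) ** 2 for xi in xs)
    slope = sxy / sxx
    intercept = my - slope * mx
    regression = intercept + slope * (sum(x) / n)

    return {
        "true_mean": true_mean,
        "naive": naive,
        "weighted": weighted,
        "regression": regression,
        "n_stage_two": len(chosen),
    }


if __name__ == "__main__":
    for name, value in simulate().items():
        print(name, round(value, 2))
```

Under these assumptions the unweighted mean is pulled towards the over-sampled high-score stratum, while the weighted and regression estimates both track the full-sample mean. Repeating `simulate` over many seeds and comparing the spread of the two design-aware estimates gives a rough view of their relative efficiency in this simplified setting.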

Original language: English
Article number: 173
Journal: BMC Medical Research Methodology
Issue number: 1
Publication status: Published - Dec 2021


  • Auxiliary variable
  • Missing data
  • Percentile norms
  • Population norms
  • Two-stage design
  • Weights

