Statistics and Its Interface

Volume 4 (2011)

Number 3

How to estimate the measurement error variance associated with ancestry proportion estimates

Pages: 327 – 337

DOI: https://dx.doi.org/10.4310/SII.2011.v4.n3.a7

Authors

David B. Allison

Raymond J. Carroll

Jasmin Divers (Center for Public Health Genomics, Wake Forest University Health Sciences, Winston-Salem, North Carolina, U.S.A.)

David T. Redden

Abstract

To show how the variance of the measurement error (ME) associated with individual ancestry proportion estimates can be estimated, especially when the number of ancestral populations ($k$) is greater than 2.

We extend existing internal consistency measures to estimate the ME variance, and we compare these estimates with the ME variance estimated by use of the repeated measurement (RM) approach. Both approaches work by dividing the genotyped markers into subsets. We examine the effect of the number of subsets and of the allocation of markers to each subset on the performance of each approach. We used simulated data for all comparisons.

Independently of the value of $k$, the measures of internal reliability provided less biased and more precise estimates of the ME variance than did those obtained with the RM approach. Both methods tend to perform better when a large number of subsets of markers with similar sizes are considered.

Our results will facilitate the use of ME correction methods to address the ME problem in individual ancestry proportion estimates. Our method will improve the ability to control for type I error inflation and loss of power in association tests and other genomic research involving ancestry estimates.

Keywords

population stratification, admixture, Type I error inflation, reliability, Cronbach’s alpha, measurement errors, measurement error variance

Published 29 August 2011