For Manuscript Submission, Check or Review Login please go to Submission Websites List.
For the academic login, please select your country in the dropdown list. You will be redirected to verify your credentials.
Improved Eigenanalysis of Discrete Subpopulations and Admixture Using the Minimum Average Partial TestShriner D.
Center for Research on Genomics and Global Health, National Human Genome Research Institute, Bethesda, Md., USA Corresponding Author
Center for Research on Genomics and Global Health
National Human Genome Research Institute
Building 12A, Room 4047, 12 South Dr., MSC 5635, Bethesda, MD 20892-5635 (USA)
Tel. +1 301 435 0068, E-Mail email@example.com
Principal components analysis of genetic data has benefited from advances in random matrix theory. The Tracy-Widom distribution has been identified as the limiting distribution of the lead eigenvalue, enabling formal hypothesis testing of population structure. Additionally, a phase change exists between small and large eigenvalues, such that population divergence below a threshold of FST is impossible to detect and above which it is always detectable. I show that the plug-in estimate of the effective number of markers in the EIGENSOFT software often exceeds the rank of the sample covariance matrix, leading to a systematic overestimation of the number of significant principal components. I describe an alternative plug-in estimate that eliminates the problem. This improvement is not just an asymptotic result but is directly applicable to finite samples. The minimum average partial test, based on minimizing the average squared partial correlation between individuals, can detect population structure at smaller FST values than the corrected test. The minimum average partial test is applicable to both unadmixed and admixed samples, with arbitrary numbers of discrete subpopulations or parental populations, respectively. Application of the minimum average partial test to the 11 HapMap Phase III samples, comprising 8 unadmixed samples and 3 admixed samples, revealed 13 significant principal components.
© 2012 S. Karger AG, Basel