
Vol. 60, No. 1, 2005
Free Abstract
Article (References)
Article (PDF 1112 KB)
Original Paper
Power and Sample Size Calculations for Genetic Case/Control Studies Using Gene-Centric SNP Maps: Application to Human Chromosomes 6, 21, and 22 in Three Populations
Francisco M. De La Vegaa,1, Derek Gordonb,1, Xiaoping Sua,2, Charles Scafea, Hadar Isaaca, Dennis A. Gilberta, Eugene G. Spiera
aApplied Biosystems, Foster City, Calif., and bLaboratory of Statistical Genetics, Rockefeller University, New York, N.Y., USA
Address of Corresponding Author
Hum Hered 2005;60:43-60 (DOI: 10.1159/000087918)
Key Words
- SNP
- Linkage disequilibrium
- Association studies
- Statistical power
- Sample size
- Case/control
- Study design
Abstract
Power and sample size calculations are critical parts of any research design for genetic association. We present a method that utilizes haplotype frequency information and average marker-marker linkage disequilibrium on SNPs typed in and around all genes on a chromosome. The test statistic used is the classic likelihood ratio test applied to haplotypes in case/control populations. Haplotype frequencies are computed through specification of genetic model parameters. Power is determined by computation of the test's non-centrality parameter. Power per gene is computed as a weighted average of the power assuming each haplotype is associated with the trait. We apply our method to genotype data from dense SNP maps across three entire chromosomes (6, 21, and 22) for three different human populations (African-American, Caucasian, Chinese), three different models of disease (additive, dominant, and multiplicative) and two trait allele frequencies (rare, common). We perform a regression analysis using these factors, average marker-marker disequilibrium, and the haplotype diversity across the gene region to determine which factors most significantly affect average power for a gene in our data. Also, as a 'proof of principle' calculation, we perform power and sample size calculations for all genes within 100 kb of the PSORS1 locus (chromosome 6) for a previously published association study of psoriasis. Results of our regression analysis indicate that four highly significant factors that determine average power to detect association are: disease model, average marker-marker disequilibrium, haplotype diversity, and the trait allele frequency. These findings may have important implications for the design of well-powered candidate gene association studies. Our power and sample size calculations for the PSORS1 gene appear consistent with published findings, namely that there is substantial power (>0.99) for most genes within 100 kb of the PSORS1 locus at the 0.01 significance level. Copyright © 2005 S. Karger AG, Basel
Author Contacts
Francisco M. De La Vega, PhD Applied Biosystems 850 Lincoln Centre Dr. Foster City, CA 94404 (USA) Tel. +1 650 638 6989, Fax +1 650 554 2577, E-Mail delavefm@appliedbiosystems.com
Article Information
Received: November 29, 2004
Accepted after revision: July 12, 2005
Published online: September 2, 2005
Number of Print Pages : 18
Number of Figures : 6, Number of Tables : 4, Number of References : 76 |
|

|

For non-native English speakers and international authors who would like assistance with their writing before submission, we suggest American Journal Experts for their scientific editing service. |
|
|