Human Molecular Genetics Advance Access published online on July 31, 2007
Human Molecular Genetics, doi:10.1093/hmg/ddm205
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Evaluation of genome-wide power of genetic association studies based on empirical data from the HapMap Project
1 Department of Hematology/Oncology, University of Tokyo, Tokyo 113-8655, Japan 2 Department of Regeneration Medicine for Hematopoiesis, University of Tokyo, Tokyo 113-8655, Japan 3 Graduate School of Medicine, and Department of Information and Communication Engineering, Graduate School of Information Science, University of Tokyo, Tokyo 113-8655, Japan 4 Core Research for Evolutional Science and Technology, Japan Science and Technology Agency, Saitama 332-0012, Japan
* Correspondence should be addressed to: Seishi Ogawa, MD, PhD, Department of Hematology and Oncology, Department of Regeneration Medicine for Hematopoiesis, The 21st Century COE program, Graduate School of Medicine, University of Tokyo, 7-3-1, Hongo, Bunkyo-ku, Tokyo 113-8655, Japan TEL: +81-3-5800-8741, FAX: +81-3-5804-6261, E-mail: sogawa-tky{at}umin.ac.jp
Received April 7, 2007; Revised July 22, 2007; Accepted July 22, 2007
With recent advances in high-throughput SNP typing technologies, genome-wide association studies have become a realistic approach to identify the causative genes that are responsible for common diseases of complex genetic traits. In this strategy, a trade-off between the increased genome coverage and a chance of finding SNPs incidentally showing a large statistics becomes serious due to extreme multiple-hypothesis testing. We investigated the extent to which this trade-off limits the genome-wide power with this approach by simulating a large number of case-control panels based on the empirical data from the HapMap Project. In our simulations, statistical costs of multiple hypothesis testing were evaluated by empirically calculating distributions of the maximum value of the
2 statistics for a series of marker sets having increasing numbers of SNPs, which were used to determine a genome-wide threshold in the following power simulations. With a practical study size, the cost of multiple testing largely offsets the potential benefits from increased genome coverage given modest genetic effects and/or low frequencies of causal alleles. In most realistic scenarios, increasing genome coverage becomes less influential on the power, while sample size is the predominant determinant of the feasibility of genome-wide association tests. Increasing genome coverage without corresponding increase in sample size will only consume resources without little gain in power. For common causal alleles with relatively large effect sizes, (genotype relative risk (GRR) 1.7), we can expect satisfactory power with currently available large-scale genotyping platforms using realistic sample size (
1,000 per arm).
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
M. Kamei, Y. Nannya, H. Torikai, T. Kawase, K. Taura, Y. Inamoto, T. Takahashi, M. Yazaki, S. Morishima, K. Tsujimura, et al. HapMap scanning of novel human minor histocompatibility antigens Blood, May 21, 2009; 113(21): 5041 - 5048. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. N. Gong Gene association studies in acute lung injury: replication and future direction Am J Physiol Lung Cell Mol Physiol, May 1, 2009; 296(5): L711 - L712. [Full Text] [PDF] |
||||
![]() |
D. R. Nyholt, K. S. LaForge, M. Kallela, K. Alakurtti, V. Anttila, M. Farkkila, E. Hamalainen, J. Kaprio, M. A. Kaunisto, A. C. Heath, et al. A high-density association screen of 155 ion transport genes for involvement with common migraine Hum. Mol. Genet., November 1, 2008; 17(21): 3318 - 3331. [Abstract] [Full Text] [PDF] |
||||


