Human Molecular Genetics Advance Access originally published online on December 17, 2003
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Human Molecular Genetics, 2004, Vol. 13, No. 3 335-342
DOI: 10.1093/hmg/ddh035
Defining haplotype blocks and tag single-nucleotide polymorphisms in the human genome
1Division of Genetic Epidemiology in Psychiatry, Central Institute of Mental Health (ZI), 68159 Mannheim, Germany, 2Genetics Unit, Mood and Anxiety Disorders Program, National Institute of Mental Health, National Institutes of Health, US Dept of Health and Human Services, Bethesda, MD 20892, USA, 3Department of Psychiatry, The University of Chicago, Chicago, IL 60637, USA, 4Molecular and Computational Biology Program, Department of Biological Sciences, University of Southern California, Los Angeles, CA 90089, USA and 5Section on Statistical Genetics, Department of Biostatistics, University of Alabama, Birmingham, AL 35294, USA
Received September 16, 2003; Accepted December 3, 2003
Recent studies suggest that the genome is organized into blocks of haplotypes, and efforts to create a genome-wide haplotype map of single-nucleotide polymorphisms (SNPs) are already underway. Haplotype blocks are defined algorithmically and to date several algorithms have been proposed. However, little is known about their relative performance in real data or about the impact of allele frequencies and parameter choices on the detection of haplotype blocks and the markers that tag them. Here we present a formal comparison of two major algorithms, a linkage disequilibrium (LD)-based method and a dynamic programming algorithm (DPA), in three chromosomal regions differing in gene content and recombination rate. The two methods produced strikingly different results. DPA identified fewer and larger haplotype blocks as well as a smaller set of tag SNPs than the LD method. For both methods, the results were strongly dependent on the allele frequency. Decreasing the minor allele frequency led to an up to 3.7-fold increase in the number of haplotype blocks and tag SNPs. Definition of haploytpe blocks and tag SNPs was also sensitive to parameter changes, but the results could not be reconciled simply by parameter adjustment. These results show that two major methods for detecting haplotype blocks and tag SNPs can produce different results in the same data and that these results are sensitive to marker allele frequencies and parameter choices. More information is needed to guide the choice of method, marker allele frequencies, and parameters in the development of a haplotype map.
* To whom correspondence should be addressed. Tel: +49 6211703724; Fax: +49 6211703741; Email: schulze{at}zi-mannheim.de
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
R. W Lawrence, D. M Evans, and L. R Cardon Prospects and pitfalls in whole genome association studies Phil Trans R Soc B, August 29, 2005; 360(1460): 1589 - 1595. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. A. Rana, N. D. Ebenezer, A. R. Webster, A. R. Linares, D. B. Whitehouse, S. Povey, and A. J. Hardcastle Recombination hotspots and block structure of linkage disequilibrium in the human genome exemplified by detailed analysis of PGM1 on 1p31 Hum. Mol. Genet., December 15, 2004; 13(24): 3089 - 3102. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Ke, C. Durrant, A. P. Morris, S. Hunt, D. R. Bentley, P. Deloukas, and L. R. Cardon Efficiency and consistency of haplotype tagging of dense SNP maps in multiple samples Hum. Mol. Genet., November 1, 2004; 13(21): 2557 - 2565. [Abstract] [Full Text] [PDF] |
||||

