Human Molecular Genetics, 2001, Vol. 10, No. 26 2961-2972
© 2001 Oxford University Press
Dissecting a population genome for targeted screening of disease mutations
1Department of Molecular Medicine, National Public Health Institute, Biomedicum, 00250 Helsinki, Finland, 2Department of Human Genetics, University of California, Los Angeles, CA 90095-7088, USA, 3Department of Clinical Neurophysiology, Helsinki University Hospital (Jorvi Hospital), 02740 Espoo, Finland, 4Department of Medical Genetics, University of Helsinki, 00250 Helsinki, Finland and 5Molecular Medicine, Department of Medical Sciences, S-751 85 Uppsala, Sweden
Received August 6, 2001; Revised and Accepted October 10, 2001.
| ABSTRACT |
|---|
|
|
|---|
Compared to mixed populations, population isolates such as Finland show distinct differences in the prevalence of disease mutations. However, little information exists of the differences on the prevalence of different disease alleles in regional populations with different history of multiple bottlenecks. We constructed a DNA-array and monitored the prevalence of 31 rare and common disease mutations underlying 27 clinical phenotypes in a large population-based study sample. Over 64 000 genotypes were assigned in 2151 samples from four geographical areas representing early and late settlement regions of Finland. Each sample was analyzed in duplicate and a total of 142 000 array-derived genotyping calls were made. On average one in three individuals was found to be a carrier of one of the 31 monitored mutations. This should remove fears of the stigmatizing effect of a carrier-screening program monitoring multiple diseases. Regional differences were found in the prevalence of mutations, providing molecular evidence for the deviating population histories of regional subisolates. The mutations introduced early into the population revealed relatively even distribution in different subregions. More recently introduced rare mutations showed local clustering of disease alleles, indicating the persistence of population subisolates and the effect of multiple bottlenecks in molding the population gene pool. Regional differences were observed also for common disease alleles. Such precise information of the carrier frequencies could form the basis for targeted genetic screens in this population. Our approach describes a general paradigm for large-scale carrier-screening programs also in other populations.
| INTRODUCTION |
|---|
|
|
|---|
DNA variants in over 1000, mostly monogenic, traits have been identified (1). Identification of mutations has not resulted in immediate DNA diagnosis due to the high heterogeneity of disease alleles. Isolated populations provide special advantages for DNA diagnostics and carrier-screening programs due to a limited spectrum of disease mutations; one test having high diagnostic specificity and sensitivity (2,3). Targeted screening of ethnically restricted disease mutations in the appropriate population subgroups has demonstrated its efficiency in disease prevention (4). However, for most populations, rational design of genetic-screening programs requires large population-based pilot studies to monitor for the diversity and prevalence of specific disease mutations.
DNA-array technology is a promising approach to monitor large numbers of sequence variants in one assay (58). If a moderate number of variants in a large number of samples are to be analyzed, a custom-made, spotted oligonucleotide arrays used for enzymatic allele discrimination provides a high-throughput system (8). Such arrays are flexible and can be tailored for population-specific carrier screening of several disease variants. Information on the frequency and regional distribution of individual disease mutations as well as the validation of array-based assays must exist before such tests can be implemented population-wide as a standard component of the health care system.
We constructed a DNA-array for the detection of 31 mutations in a study sample of 2400 individuals from different geographical regions of Finland to monitor for the regional differences in the prevalence of disease alleles. The majority of the mutations included on the array, the Finnish Chip, are recessive disease mutations enriched in the Finnish population, belonging to the Finnish disease heritage. In addition, we incorporated to the array some disease mutations common in most Caucasian populations, like mutations of
-1-antitrypsin, factor V and the hereditary hemochromatosis gene.
We did not include on the array dominant disease mutations like those of hereditary breast and ovarian cancer (BRCA1 and 2), familial non-polypotic colon cancer or familial hypercholesterolemia due to their rarity. Recent studies in cancer patients (9,10) or population-based surveys (11) demonstrate the low prevalence of these mutations in the Finnish population (<1:400).
The microarray-based primer extension assay proved to be highly robust and provided reliable genotyping results at a relatively low cost. Interestingly, we found distinct regional differences in the carrier frequency of both rare and common disease mutations within this genetically homogeneous, isolated population, the findings providing support for multiple historical population bottle necks. The population-wide carrier frequency of disease alleles detected by this panel of 31 mutations was 1:3, illustrating the prediction that if a wider panel of disease mutations were to be included on the array, every Finn would be a carrier of at least one of the tested disease mutations. This should alleviate fears of the stigmatizing effect of carrier-screening programs monitoring numerous disease mutations.
| RESULTS |
|---|
|
|
|---|
Mutations screened with the DNA array
The mutation-screening panel on the DNA-array comprised the known major mutations of Finnish diseases (12), a total of 19 mutations in 16 different genes. Most mutations show a strong founder effect and the coverage of the mutation detection varied between 74 and 99% for different diseases. The common Caucasian mutations consisted of 10 mutations in nine genes including two common polymorphisms of factor V (13) and the prothrombin gene (14). A list of the mutations included on the array as well as the corresponding diseases and OMIM symbols are provided in Table 1. A pair of allele-specific detection primers for each of the mutations was spotted on derivatized microscopic glass-slides (8). Eighty replicate arrays were spotted onto each slide and custom-made reaction chambers were used to analyze up to 80 separate samples per slide. This facilitated the monitoring of 2480 mutations (31 genotypes for 80 individuals) on a single microscopic glass slide (Fig. 1). The allele-specific primer extension reactions provided reliable discrimination between the obtained genotypes (8).
|
|
We determined the carrier frequencies of disease mutations by genotyping 2151 anonymous DNA samples from four geographical regions (Fig. 2). One rural sample was collected from southern Botnia, representing the early settlement region, which was inhabited some 2000 years ago. Another rural sample was collected from North Karelia, representing the late settlement northeastern region, permanently inhabited after the 16th century. The two urban community samples were from Helsinki and from the city of Oulu on the northwestern coastline. Both of these cities have been targets for internal population movement during the past eight to nine decades. However, according to Y chromosomal haplotype analysis, the influence from early immigration from Western Europe is much less evident in Oulu region than in southern Finland (15). Further, immigration to Helsinki, the capital, has been much more excessive than to Oulu. As positive controls, 212 samples from carriers of the disease mutations were allocated blindly among the study samples.
|
Quality of the data from the arrays
Each sample was amplified by PCR and assayed in duplicate on separate microarrays. The genotyping results for each sample were read and the genotypes were independently assigned from the two arrays. Moreover, the genotypes were called on coded samples, without knowledge of the sample status to serve as quality control of the array-based screening procedure. The reference methods were systematically used to verify the genotyping result if a sample indicated a carrier status, if the results from the duplicate assays were discrepant or if the signal intensities on a microarray were too low to assign the genotype. For samples with low signal intensity on the array, the reference method was performed only once, and the sample was omitted from further analysis if this reference assay failed. In the cases of conflicting duplicate genotype calls and for every carrier genotype sample the reference method was used to confirm the genotype.
Since each sample was analyzed in duplicate, a total of 142 000 array derived genotyping calls were made. None of the heterozygous controls was assigned as normal homozygotes. Of the 71 000 genotypes generated, 95% were called successfully. In 5% of the samples the signal intensities were too low to make a reliable genotype call, and the reference methods were used to assign the genotypes. Less than 0.1% of the samples gave conflicting results in the duplicate array-based assay, due to inconsistencies resulting from the spotting of the primer-arrays. The final assignment of genotype in the study samples was made in 99.15% of cases (64 076 out of 64 625 genotypes were assigned). The genotyping results are summarized in Tables 2 and 3.
|
|
Carrier frequencies of the Finnish disease heritage mutations
A particularly informative approach to the analysis of the regional prevalence of the Finnish mutations is to divide them based on the time of their assumed introduction into the population. Recent introduction of a disease allele is characterized by the geographically restricted occurrence of patients, a genealogical history revealing ancestors in the same communities and a demonstration of linkage disequilibrium over a wide genetic interval flanking the disease mutation (12,16). Representative examples of such mutations in our screen include vLINCL (17), EPMR (18), Salla disease (19), CCD (20) and two X-linked retinoschisis (RS) (21) mutations (Table 1). Figure 3 demonstrates the carrier frequencies of five young mutations in the four regional samples. Notably, all these mutations show evidence of local clustering of disease alleles, indicating the persistence of the subisolates. A striking example of such clustering is the high frequency (1:44) of the vLINCL mutation in the southern Botnia sample whereas the carrier frequency in the mixed Helsinki population is only 1:1000. Clustering is also evident for CCD and Salla disease in which the prevalence of disease mutation was highest in North Karelia and lowest in the Helsinki sample. However, the distribution for the Salla mutation was relatively even across all geographical regions when compared with vLINCL, which would provide evidence for a relatively early introduction of Salla mutation to the Finnish population. Only one of the two major mutations causing X-linked RS could be detected as one RSOulu carrier was identified in the Oulu sample. No carriers for the other mutation, RSPori, were seen in any of our samples, probably reflecting the fact that the mutation is highly concentrated in a single community, Noormarkku (21), located in southwestern Finland and outside the regions sampled for this study. Similar findings were obtained for EPMR, a rare recessive disease known to occur in a very restricted region in Kainuu in northeastern Finland (18) which was also not covered in the sampling for the present study.
|
Somewhat differently from young mutations, the old Finnish mutations show relatively even distribution of patients and their carrier frequencies (Fig. 4). The most common of the Finnish diseases, congenital nephrosis (22), is caused by two major mutations, CNFmajor and CNFminor, accounting for 78 and 16% of disease alleles, respectively. We found a relatively even distribution of the CNFmajor allele across all the geographical regions whereas we saw a significant clustering of the CNF minor allele in the sample from the early settlement southern Botnia region (Fig. 4). In this region, the combined carrier frequency of the two mutations was as high as 1:27 (suggesting birth incidence of 1:2500), whereas in the other study samples the combined carrier frequency varied from 1:48 to 1:61. The second most common Finnish disease is aspartylglucosaminuria, in which one major mutation (AGUFin) accounts for 98% of disease alleles (23). Again the carrier frequency varied, the prevalence being lowest in the old settlement, southern Botnia region (1:132) and highest in the late settlement North Karelia (1:63). Diastrophic dysplasia (24), the third most common representative of Finnish diseases, has a major mutation identified in 90% of disease alleles. Interestingly we found a high frequency of the DD major mutation in the regional samples from North Karelia and Oulu, a finding not expected based on the results of earlier epidemiological and genealogical studies (25).
|
The combined carrier frequency of all tested Finnish mutations varied between
1:11 and
1:6 in regional study populations, being highest in the rural eastern and western study samples.
Mutations common in Caucasian populations
The carrier frequencies for common Caucasian mutations are shown in Figure 5. The Z-allele of the
-1-antitrypsin gene revealed a higher prevalence in Helsinki and early settlement southern Botnia, when compared to the late settlement northeastern sample (1:26 versus 1:46). The frequency of the common mutation of hereditary hemochromatosis, HFEC282Y, varied across Finland, showing high prevalence in the early settlement western sample (1:10), with decreasing frequency in the late settlement northeastern Finland (1:33). An opposite regional distribution was evident for FVLeiden (13) and prothrombinG20210A (14), SNPs predisposing to venous thromboembolism. Both were more commonly encountered in the late settlement sample. Another common disease mutation which was monitored was a common frameshift mutation in the connexin 26 gene referred to as GJB2
35G, resulting in congenital non-syndromic deafness in several populations (26,27). The GJB2
35G mutation had a relatively uniform geographical distribution across Finland with a carrier frequency of 1:431:63. The combined carrier frequency of the common Caucasian mutations in Finland varied between
1:7 and
1:5, being highest in the rural, early settlement region.
|
Mutations with exceptionally low prevalence in Finland
We monitored two mutations,
F508 and
TT394 in the CFTR gene, reported to account for 45 and 30% of Finnish CF chromosomes, respectively (28). The carrier rate for the CF mutations was low in all study samples, the combined carrier frequency varying from 1:99 in the western to 1:193 in the eastern study sample. The common PKU mutation, PAH R408W, accounts for 50% of Finnish PKU alleles (29). The calculated carrier frequency of PAH mutations was <1:200, consistent with estimated PKU incidence of less than 1:100 000 in Finland (30). The common deletion in the CLN3 gene, causing Batten disease (31) and accounting for 90% of Finnish Batten chromosomes, was surprisingly rare. The carrier frequency for this mutation was 1:1651:365 in all regional samples, except the eastern, late settlement study sample, which had a frequency of 1:77, being comparable to that reported from other Caucasian populations (32). The prevalence of the medium-chain acyl CoA dehydrogenase (MCAD) mutation was very low, the carrier frequency varying between 1:503 and 1:198. This would suggest that the MCAD deficiency represents still another example of a negative founder effect in the Finnish population.
Linkage disequilibrium in common disease alleles
SNPs flanking the common mutations in the HFE (33,34) and FV (35) genes were selected (Materials and Methods; Table 4) to monitor for the linkage disequilibrium of common disease alleles and the haplotypes of disease alleles in regional samples. After correction for multiple testing, the only significant departure from linkage equilibrium between the SNP variants of the HFE allele was observed in the late settlement North Karelia sample (Table 4). In the case of the factor V gene SNPs, no regional differences in linkage disequilibrium could be observed.
|
The predicted haplotypes of HFE and FV alleles suggested that the mutation occurred in the same haplotype background in all geographic regions. This provided evidence that many of these are old founder mutations. The old age of mutations is supported by the fact that for both loci these haplotypes were in agreement with the previously described haplotypes observed in other Caucasian populations (33,34,36,37). Allelic information of these common variants provided no evidence of genetic distance between regional subpopulations based on the Neis identity estimation (38).
| DISCUSSION |
|---|
|
|
|---|
Our study demonstrates the power of array-based mutation detection techniques in parallel screening for numerous disease mutations. Miniaturization and multiplexing resulted in the low cost for genotyping, the price per genotype being $0.30USD/mutation/sample (8). Several approaches for parallel genotyping by DNA-microarrays have been presented [reviewed by Hacia and Collins (39)], but there is a notable lack of their application beyond the proof-of-principle level. ASO-hybridization-based approaches suffer from complicated array designs and imperfect allelic discrimination (40,41) rendering them inflexible and costly for custom genotyping applications. Primer extension-based approaches (7,8,42) with low complexity array designs provide a better accessibility of the microarray technology for high-throughput genotyping. However, dissemination of high-throughput primer extension-based microarray systems to clinical diagnostics and carrier-screening programs are currently hindered by the considerable set-up optimization required for multiplex PCR (8,43). Generic arrays of tag (44) or zip code (45) sequences can be used to capture and visualize primer extension or ligation products carrying complementary tag- or sequence-sequences at their 5' ends in a highly parallel manner. These generic arrays will obviously be more accessible to non-specialized laboratories.
Despite the relatively limited sample size and statistical power of our pilot study, the obtained carrier frequencies of Finnish diseases were generally in very good agreement with the reported carrier frequencies of the individual diseases, evidencing for the accuracy of the data (12,25). Furthermore, for most of the diseases studied, the observed geographic distribution of the carrier frequencies agreed well with earlier epidemiological and genealogical data (16,46). It should be noted that our study samples reflect the effect of at least two features molding the gene pools of the analyzed regions. First, the western sample represents the typical early settlement region with the population history of some 100 generations, whereas the eastern sample represents the late settlement communities, permanently inhabited only some 15 generations ago. Secondly, both western and eastern samples were collected from rural, isolated communities, whereas Helsinki and Oulu samples represent more mixed, urban populations. However, some distinct differences in the population history as well as in the distinction of Y chromosomal haplotypes between these urban populations have been shown to exist (15) and this genetic diversity could explain some of the variation seen in the carrier frequencies between these two urban areas. Very low carrier frequencies were seen for many Finnish mutations in rural population samples. An enrichment of these disease alleles is not obvious at the whole population level but only in highly restricted communities created by a small number of founders and by a long-standing isolation.
The regional distribution varied considerably for the rare mutations that had been recently introduced into the population. These disease alleles are known to reveal linkage disequilibrium over a wide genetic interval (12). With regard to the older, common Finnish mutations, such as CNF, DD and LPI (16), no dramatic differences could be observed in carrier frequency between the regional study samples. Interestingly, the INCLFin mutation (47) as well as the most common APECED mutation (48) suspected to represent old mutations based on the short LD interval in disease chromosomes showed enrichment in both east and west rural populations as compared to the more mixed Helsinki and Oulu study samples (Tables 2 and 3). Further, the AGU mutation, otherwise revealing an even prevalence across the regional samples showed a lower frequency of carriers in the early settlement region. These findings would be in agreement with multiple regional bottlenecks in the population history and suggest that local events mold the gene pool continuously and can have a significant effect on the frequency of disease alleles.
It should be noted that although the major mutations of old Finnish diseases showed a relatively even distribution across the country, minor alleles of these diseases revealed dramatic differences in their regional distribution supporting their more recent introduction into the population, as exemplified by the prevalence of the CNF minor mutation. This mutation was found to be highly enriched in the western coastline communities while being very rare in other parts of Finland. This is in agreement with the recent study of 2000 Finns from the eastern area (49). Most probably, the CNF minor mutation has been introduced to this early settlement region quite recently, possibly after the major internal migration movement in the 16th century (16). This illustrates the fact that even in isolated populations with relatively little immigration several different mutations for a rare disease may at times be found simultaneously. Further, not all of these mutations are original founder mutations nor are they of similar age. Finally, CNF also provides a demonstrative example of how the introduction of a second mutation into the population can result in patients who are compound heterozygotes, a feature that may markedly increase the incidence of the disease.
The data on the common Caucasian mutations revealed several interesting features. A geographic gradient has been reported in the prevalence of the deletion mutation of connexin 26 (50). In Europe the carrier frequency is highest (1:31) in some Mediterranean populations and gradually decreases towards central and northern Europe (1:79). Despite this, the Finnish frequency (1:451:60) represents an average European population frequency. There were no distinct differences in the regional distribution of this mutation providing evidence for its early introduction and even spreading during the inhabitation of the country.
-1-Antitrypsin deficient individuals are most commonly homozygous for a recurrent point mutation referred to as the Z-allele (51). Nearly 30 years ago a study was conducted among Finnish students to determine the prevalence of the Z-allele by protein electrophoresis in different parts of Finland (52). The carrier frequency of the Z-allele in Helsinki was reported to be 2.7% (1:38) and a higher frequency was seen in the western early settlement region when compared to northeastern Finland. Our data are in good agreement with these findings exposing a similar decreasing gradient (1:261:46) of the Z allele from west to east.
The prevalence of another common disease mutation, HFEC282Y, causing hereditary hemochromatosis had been previously studied in two Finnish population-based samples. The first study reported a carrier frequency of 9.9% (1:10) in 141 Finns (53), whereas a larger study in eastern Finland reported a carrier frequency of 6.7% (1:15) (54). Our data exposes differences in tested geographical regions, the prevalence being highest in the early settlement area of Finland. The high global prevalence of both of these mutations is reflected by the high prevalence among founders of the Finnish population. Further, the mutation has become relatively evenly distributed, even during the internal migration. However, later bottlenecks, especially those affecting the eastern, late settlement region have molded the population prevalence of the mutations. An opposite regional distribution was observed in the prevalence of two other common Caucasian mutations: FVLeiden and prothrombinG20210A, both predisposing to venous thrombosis. These were most prevalent in the eastern samples. It remains to be seen if this opposite direction of increasing gradients in the frequency of common mutations simply reflects a random drift in regional subisolates or rather mirrors the dual theory concerning the settlement history of Finland (15,55). The combined carrier frequency of these common Caucasian mutations in Finland is slightly higher (1:5) than or similar to the pooled frequency of recessive Finnish mutations (1:6). As expected, population bottlenecks that have had a distinct effect on rare alleles, have less dramatically influenced the common Caucasian alleles. Even the recent bottlenecks have resulted in only minor regional differences in the prevalence of common mutations. However, when analyzing the SNPs flanking a common mutation (HFE), a more significant linkage disequilibrium between the SNPs was detected in the eastern rural sample than in the other populations. This reflects a younger age and a less mixed population in the late settlement region when compared to the rest of Finland. Our finding also emphasizes the importance of population selection in linkage disequilibrium-based disease gene mapping studies. The fact that even closely spaced SNPs were in linkage equilibrium for a clinically important factor V Leiden mutation, underlines the potential difficulties in performing and interpretations of possible SNP-based studies relying on linkage disequilibrium.
The negative founder effect reflecting genetic drift in small founder populations of Finland has resulted in almost complete exclusion of some relatively common Caucasian disease alleles. We found a very low prevalence of CF and PKU alleles, approximately one-third of the prevalence reported for other Caucasian populations (56,57). Also, in agreement with an earlier study the carrier frequency of the common mutation for MCAD was very low adding another mutation to the list of disease alleles showing exceptionally low prevalence in Finland (58). It appears that by chance these mutations had not been introduced into the Finnish population reflecting the small number of founders.
Our mutation screen of Finns represents the largest direct determination of the carrier frequencies of numerous disease mutations in a representative population sample. Furthermore, it is thus far the only study considering regional differences in the mutational spectrum in a population isolate, extensively studied both genealogically and genetically. Most rare mutations of Finnish disease heritage showed relatively constant population frequency indicating the old age of these mutations; however, some mutations had remarkable high local incidences, reflecting the distinct effect of recent population bottlenecks. Although being less affected than rare recessive mutations, the prevalence of common Caucasian mutations also reflected the stochastic effect of population bottlenecks, founder effects and genetic drift: some common mutations showing distinct regional enrichment such as HFEC282Y in Botnia or FVLeiden in North Karelia whereas others became significantly under-represented in Finland (i.e. CF
F508).
| MATERIALS AND METHODS |
|---|
|
|
|---|
Samples
Four sets of population samples representing different geographical areas in Finland were analyzed (Fig. 5). A sample from the city of Helsinki, the capital of Finland and considered to represent the most genetically heterogeneous subpopulation of the country consisted of 510 samples of the large epidemiological population-based FINRISKI study (59) and 495 samples from the Finnish Twin Cohort Study (http://kate.pc.helsinki.fi/twin/twinhome.html) (60). The sample from the county of southern Botnia consisted of 396 random blood donors from the rural counties of Kurikka, Alajärvi and Lapua. The North Karelia (eastern Finland) sample consisted of 385 random blood donors from the rural counties of Eno, Ilomantsi, Juuka, Lieksa and Polvijärvi. The Oulu sample was composed of 365 random inhabitants of the city of Oulu in northern Finland born in the year of 1966. All samples were analyzed anonymously. In addition to the study subjects, 212 samples from verified carriers of the disease mutations studied were included to serve as blinded positive controls. Following the initial screening, the remaining positive samples (some DNA-aliquots were exhausted after the array-based assay and confirmatory genotyping) for either the FVLeiden and HFEC282Y mutations from each geographic region were further analyzed for SNPs in the FV and HFE genes. Along with these samples approximately 20 negative samples for both mutations were picked from each geographic region and similarly analyzed for the SNPs. A total of 218 samples were included for this part of the study.
Genotyping
A system based on allele-specific primer extension in a microarray format was used to screen for carriers of the 31 disease mutations as well as the FV and HFE SNPs as described in detail previously for these panels (8). Standard solid-phase minisequencing in a microtiter plate format served as the reference method (61). For the NKH and LPI mutations PCR-RFLP digestion was used as the reference method (62,63), and for the Salla disease mutation an allele-specific PCR reaction (18) with an internal control amplicon was used. The genotypes of the carriers identified by the array-based screening were confirmed using one of the above mentioned reference methods. Samples yielding signal ratios not falling within distinct clusters or having low signal intensities were reanalyzed using a reference method.
Statistical analysis
Statistically significant differences between carrier frequencies were assessed using Fishers exact test. In cases where, based on published literature, carriers were expected to cluster in a particular region one-tailed tests were used, otherwise two-tailed tests were employed. HardyWeinberg proportion and linkage disequilibrium were assessed using the Genepop web version 3.1c program (http://wbiomed.curtin.edu.au/genepop/) (64). Neis identity estimation (38) was carried out to compare genetic distances between different study samples using the program PopDist (65). In the case of the Factor V gene SNPs (WIAF-10964 A/G, WIAF-11529 T/G, WIAF-11065 G/A, WIAF-11064 A/G, WIAF-11063 A/G, WIAF-11091 A/C, WIAF-11062 C/T) no regional differences in linkage disequilibrium values could be observed. For the HFE gene, tests of disequilibrium based on multilocus genotypes were conducted within each population to avoid spurious results due to admixture. Fisher exact tests could not reject the hypothesis of equilibrium between SNPs HHC282Y and HFE5569 in any of the populations, despite the close positions of the two markers. Equilibrium could not be rejected between any marker for the Helsinki and Oulu populations. To correct for multiple comparisons across populations (4) and marker pairs (three for each population) using the Bonferroni method, one should inflate the P-values of each single test by a number between 4 and 12 (corresponding, respectively, to the extreme of total dependence and independence between marker pairs). This leaves only one significant departure from equilibrium in the HFE gene in North Karelian population (the pair 282Y and HLA-IVS2 has an initial P-value of 0.01 and the pair HLA-H5569 HLA-IVS2 has an initial P-value of 0.004these P-values are before the Bonferroni correction).
| ACKNOWLEDGEMENTS |
|---|
We owe our deepest gratitude to several researchers and clinicians for making the control samples available to us. Vlad Kustanovitsh is appreciated for his advice in grammar and style. The following grants made this study possible: the Technology Development Centre of Finland, EC Biomed2 Contract no. BMH4-972013, The Hjelt Fond of the Pediatric Research Foundation, The Finnish Cultural Foundation, The Academy of Finland and The Maud Kuistila Foundation.
| FOOTNOTES |
|---|
+ To whom correspondence should be addressed at: UCLA Department of Human Genetics, Gonda Neuroscience and Genetics Research Center, Room 6506, 695 Charles E. Young Drive South, Box 708822, Los Angeles, CA 90095-7088, USA. Tel: +1 310 794 5631; Fax: +1 310 794 5446; Email: lpeltonen@mednet.ucla.edu Present address: Tomi Pastinen, Montreal Genome Centre, Montreal, H3G 1A4 Quebec, Canada
| REFERENCES |
|---|
|
|
|---|
1 Antonarakis, S.E. and McKusick, V.A. (2000) OMIM passes the 1000-disease-gene mark. Nat. Genet., 25, 1111.[Web of Science][Medline]
2 Hietala, M., Aula, P., Syvänen, A.-C., Isoniemi, A., Peltonen, L. and Palotie, A. (1996) DNA-based carrier screening in primary healthcare: screening for aspartylglucosaminuria mutations in maternity health offices. Clin. Chem., 42, 13981404.
3 Eng, C.M., Schechter, C., Robinowitz, J., Fulop, G., Burgert, T., Levy, B., Zinberg, R. and Desnick, R.J. (1997) Prenatal genetic carrier testing using triple disease screening. J. Am. Med. Assoc., 278, 12681272.
4 Mitchell, J.J., Capua, A., Clow, C. and Scriver, C.R. (1996) Twenty-year outcome analysis of genetic screening programs for Tay-Sachs and ß-thalassemia disease carriers in high schools. Am. J. Hum. Genet., 59, 793798.[Web of Science][Medline]
5 Hacia, J.G., Brody, L.C., Chee, M.S., Fodor, S.P. and Collins, F.S. (1996) Detection of heterozygous mutations in BRCA1 using high density oligonucleotide arrays and two-colour fluorescence analysis. Nat. Genet., 14, 441447.[Web of Science][Medline]
6 Cronin, M.T., Fucini, R.V., Kim, S.M., Masino, R.S., Wespi, R.M. and Miyada, C.G. (1996) Cystic fibrosis mutation detection by hybridization to light-generated DNA probe arrays. Hum. Mutat., 7, 244255.[Web of Science][Medline]
7 Pastinen, T., Perola, M., Niini, P., Terwilliger, J., Salomaa, V., Vartiainen, E., Peltonen, L. and Syvänen, A. (1998) Array-based multiplex analysis of candidate genes reveals two independent and additive genetic risk factors for myocardial infarction in the Finnish population. Hum. Mol. Genet., 7, 14531462.
8 Pastinen, T., Raitio, M., Lindroos, K., Tainola, P., Peltonen, L. and Syvänen, A.C. (2000) A system for specific, high-throughput genotyping by allele-specific primer extension on microarrays. Genome Res., 10, 10311042.
9 Syrjäkoski, K., Vahteristo, P., Eerola, H., Tamminen, A., Kivinummi, K., Sarantaus, L., Holli, K., Blomqvist, C., Kallioniemi, O.P., Kainu, T. et al. (2000) Population-based study of BRCA1 and BRCA2 mutations in 1035 unselected Finnish breast cancer patients. J. Natl Cancer. Inst., 92, 15291531.
10 Aaltonen, L.A., Salovaara, R., Kristo, P., Canzian, F., Hemminki, A., Peltomäki, P., Chadwick, R.B., Kaariainen, H., Eskelinen, M., Jarvinen, H. et al. (1998) Incidence of hereditary nonpolyposis colorectal cancer and the feasibility of molecular screening for the disease. New Engl. J. Med., 338, 14811487.
11 Vuorio, A.F., Turtola, H., Piilahti, K.M., Repo, P., Kanninen, T. and Kontula, K. (1997) Familial hypercholesterolemia in the Finnish north Karelia. A molecular, clinical, and genealogical study. Arterioscler. Thromb. Vasc. Biol., 17, 31273138.
12 Peltonen, L., Jalanko, A. and Varilo, T. (1999) Molecular genetics of the Finnish disease heritage. Hum. Mol. Genet., 8, 19131923.
13 Bertina, R.M., Koeleman, B.P., Koster, T., Rosendaal, F.R., Dirven, R.J., de Ronde, H., van der Velden, P.A. and Reitsma, P.H. (1994) Mutation in blood coagulation factor V associated with resistance to activated protein C. Nature, 369, 6467.[Medline]
14 Poort, S.R., Rosendaal, F.R., Reitsma, P.H. and Bertina, R.M. (1996) A common genetic variation in the 3'-untranslated region of the prothrombin gene is associated with elevated plasma prothrombin levels and an increase in venous thrombosis. Blood, 88, 36983703.
15 Kittles, R.A., Perola, M., Peltonen, L., Bergen, A.W., Aragon, R.A., Virkkunen, M., Linnoila, M., Goldman, D. and Long, J.C. (1998) Dual origins of Finns revealed by Y chromosome haplotype variation. Am. J. Hum. Genet., 62, 11711179.[Web of Science][Medline]
16 Varilo, T. (1999) The age of the mutations in the Finnish heritage: a genealogical and linkage disequilibrium study. Academic Dissertation., University of Helsinki, Finland.
17 Savukoski, M., Klockars, T., Holmberg, V., Santavuori, P., Lander, E.S. and Peltonen, L. (1998) CLN5, a novel gene encoding a putative transmembrane protein mutated in Finnish variant late infantile neuronal ceroid lipofuscinosis. Nat. Genet., 19, 286288.[Web of Science][Medline]
18 Ranta, S., Zhang, Y., Ross, B., Lonka, L., Takkunen, E., Messer, A., Sharp, J., Wheeler, R., Kusumi, K., Mole, S. et al. (1999) The neuronal ceroid lipofuscinoses in human EPMR and mnd mutant mice are associated with mutations in CLN8. Nat. Genet., 23, 233236.[Web of Science][Medline]
19 Verheijen, F.W., Verbeek, E., Aula, N., Beerens, C.E., Havelaar, A.C., Joosse, M., Peltonen, L., Aula, P., Galjaard, H., van der Spek, P.J. et al. (1999) A new gene., encoding an anion transporter, is mutated in sialic acid storage diseases. Nat. Genet., 23, 462465.[Web of Science][Medline]
20 Hoglund, P., Haila, S., Socha, J., Tomaszewski, L., Saarialho-Kere, U., Karjalainen-Lindsberg, M.L., Airola, K., Holmberg, C., de la Chapelle, A. and Kere, J. (1996) Mutations of the down-regulated in adenoma (DRA) gene cause congenital chloride diarrhoea. Nat. Genet., 14, 316319.[Web of Science][Medline]
21 Huopaniemi, L., Rantala, A., Forsius, H., Somer, M., de la Chapelle, A. and Alitalo, T. (1999) Three widespread founder mutations contribute to high incidence of X-linked juvenile retinoschisis in Finland. Eur. J. Hum. Genet., 7, 368376.[Web of Science][Medline]
22 Kestilä, M., Lenkkeri, U., Mannikko, M., Lamerdin, J., McCready, P., Putaala, H., Ruotsalainen, V., Morita, T., Nissinen, M., Herva, R. et al. (1998) Positionally cloned gene for a novel glomerular proteinnephrinis mutated in congenital nephrotic syndrome. Mol. Cell, 1, 575582.[Web of Science][Medline]
23 Ikonen, E., Baumann, M., Gron, K., Syvänen, A.C., Enomaa, N., Halila, R., Aula, P. and Peltonen, L. (1991) Aspartylglucosaminuria: cDNA encoding human aspartylglucosaminidase and the missense mutation causing the disease. EMBO J., 10, 5158.[Web of Science][Medline]
24 Hastbacka, J., de la Chapelle, A., Mahtani, M.M., Clines, G., Reeve-Daly, M.P., Daly, M., Hamilton, B.A., Kusumi, K., Trivedi, B., Weaver, A. et al. (1994) The diastrophic dysplasia gene encodes a novel sulfate transporter: positional cloning by fine-structure linkage disequilibrium mapping. Cell, 78, 10731087.[Web of Science][Medline]
25 de la Chapelle, A. and Wright, F. (1998) Linkage disequilibrium mapping in isolated populations: the example of Finland revisited. Proc. Natl Acad. Sci. USA, 95, 1241612423.
26 Kelsell, D.P., Dunlop, J., Stevens, H.P., Lench, N.J., Liang, J.N., Parry, G., Mueller, R.F. and Leigh, I.M. (1997) Connexin 26 mutations in hereditary non-syndromic sensorineural deafness. Nature, 387, 8083.[Medline]
27 Zelante, L., Gasparini, P., Estivill, X., Melchionda, S., DAgruma, L., Govea, N., Mila, M., Monica, M.D., Lutfi, J., Shohat, M. et al. (1997) Connexin 26 mutations associated with the most common form of non-syndromic neurosensory autosomal recessive deafness (DFNB1) in Mediterraneans. Hum. Mol. Genet., 6, 16051609.
28 Kere, J., Estivill, X., Chillon, M., Morral, N., Nunes, V., Norio, R., Savilahti, E. and de la Chapelle, A. (1994) Cystic fibrosis in a low-incidence population: two major mutations in Finland. Hum. Genet., 93, 162166.[Web of Science][Medline]
29 Guldberg, P., Henriksen, K.F., Sipila, I., Guttler, F. and de la Chapelle, A. (1995) Phenylketonuria in a low incidence population: molecular characterisation of mutations in Finland. J. Med. Genet., 32, 976978.
30 Visakorpi, J.K., Palo, J. and Renkonen, O.V. (1971) The incidence of Pku in Finland. Acta Paediatr. Scand., 60, 666668.[Web of Science][Medline]
31 TIBD Consortium (1995) Isolation of a novel gene underlying Batten disease, CLN3. Cell, 82, 949957.[Web of Science][Medline]
32 Järvelä, I., Mitchison, H.M., Munroe, P.B., ORawe, A.M., Mole, S.E. and Syvänen, A.C. (1996) Rapid diagnostic test for the major mutation underlying Batten disease. J. Med. Genet., 33, 10411042.
33 Beutler, E., West, C. and Gelbart, T. (1997) HLA-H and associated proteins in patients with hemochromatosis. Mol. Med., 3, 397402.[Web of Science][Medline]
34 Jeffrey, G.P., Chakrabarti, S., Hegele, R.A. and Adams, P.C. (1999) Polymorphism in intron 4 of HFE may cause overestimation of C282Y homozygote prevalence in haemochromatosis. Nat. Genet., 22, 325326.[Web of Science][Medline]
35 Cargill, M., Altshuler, D., Ireland, J., Sklar, P., Ardlie, K., Patil, N., Shaw, N., Lane, C.R., Lim, E.P., Kalyanaraman, N. et al. (1999) Characterization of single-nucleotide polymorphisms in coding regions of human genes. Nat. Genet., 22, 231238.[Web of Science][Medline]
36 Bernardi, F., Faioni, E.M., Castoldi, E., Lunghi, B., Castaman, G., Sacchi, E. and Mannucci, P.M. (1997) A factor V genetic component differing from factor V R506Q contributes to the activated protein C resistance phenotype. Blood, 90, 15521557.
37 Faioni, E.M., Franchi, F., Bucciarelli, P., Margaglione, M., De Stefano, V., Castaman, G., Finazzi, G. and Mannucci, P.M. (1999) Coinheritance of the HR2 haplotype in the factor V gene confers an increased risk of venous thromboembolism to carriers of factor V R506Q (factor V Leiden). Blood, 94, 30623066.
38 Nei, M. (1972) Genetic distance between populations. Am. Nat., 106, 283292.[Web of Science]
39 Hacia, J.G. and Collins, F.S. (1999) Mutational analysis using oligonucleotide microarrays. J. Med. Genet., 36, 730736.
40 Wang, D.G., Fan, J.B., Siao, C.J., Berno, A., Young, P., Sapolsky, R., Ghandour, G., Perkins, N., Winchester, E., Spencer, J. et al. (1998) Large-scale identification, mapping, and genotyping of single-nucleotide polymorphisms in the human genome. Science, 280, 10771082.
41 Cho, R.J., Mindrinos, M., Richards, D.R., Sapolsky, R.J., Anderson, M., Drenkard, E., Dewdney, J., Reuber, T.L., Stammers, M., Federspiel, N. et al. (1999) Genome-wide mapping with biallelic markers in Arabidopsis thaliana. Nat. Genet., 23, 203207.[Web of Science][Medline]
42 Pastinen, T., Kurg, A., Metspalu, A., Peltonen, L. and Syvänen, A.C. (1997) Minisequencing: a specific tool for DNA analysis and diagnostics on oligonucleotide arrays. PCR Methods Appl., 7, 606614.
43 Hacia, J.G., Sun, B., Hunt, N., Edgemon, K., Mosbrook, D., Robbins, C., Fodor, S.P., Tagle, D.A. and Collins, F.S. (1998) Strategies for mutational analysis of the large multiexon ATM gene using high-density oligonucleotide arrays. Genome Res., 8, 12451258.
44 Fan, J.B., Chen, X., Halushka, M.K., Berno, A., Huang, X., Ryder, T., Lipshutz, R.J., Lockhart, D.J. and Chakravarti, A. (2000) Parallel genotyping of human SNPs using generic high-density oligonucleotide tag arrays. Genome Res., 10, 853860.
45 Gerry, N.P., Witowski, N.E., Day, J., Hammer, R.P., Barany, G. and Barany, F. (1999) Universal DNA microarray method for multiplex detection of low abundance point mutations. J. Mol. Biol., 292, 251262.[Web of Science][Medline]
46 Norio, R., Nevanlinna, H.R. and Perheentupa, J. (1973) Hereditary diseases in Finland; rare flora in rare soul. Ann. Clin. Res., 5, 109141.[Web of Science][Medline]
47 Vesa, J., Hellsten, E., Verkruyse, L.A., Camp, L.A., Rapola, J., Santavuori, P., Hofmann, S.L. and Peltonen, L. (1995) Mutations in the palmitoyl protein thioesterase gene causing infantile neuronal ceroid lipofuscinosis. Nature, 376, 584587.[Medline]
48 The FinnishGerman APECED Consortium (1997) An autoimmune disease, APECED, caused by mutations in a novel gene featuring two PHD-type zinc-finger domains. Nat. Genet., 17, 399403.[Web of Science][Medline]
49 Romppanen, E.L. and Mononen, I. (2000) Detection of the Finnish-type congenital nephrotic syndrome by restriction fragment length polymorphism and dual-color oligonucleotide ligation assays. Clin. Chem., 46, 811816.
50 Gasparini, P., Rabionet, R., Barbujani, G., Melchionda, S., Petersen, M., Brondum-Nielsen, K., Metspalu, A., Oitmaa, E., Pisano, M., Fortina, P. et al. (2000) High carrier frequency of the 35delG deafness mutation in European populations. Genetic Analysis Consortium of GJB2 35delG. Eur. J. Hum. Genet., 8, 1923.[Web of Science][Medline]
51 Crystal, R.G. (1989) The
1-antitrypsin gene and its deficiency states. Trends Genet., 5, 411417.[Web of Science][Medline]
52 Varpela, E., Saris, N.E. and Lokki, O. (1973) The prevalence of serum
1-antitrypsin deficiency in Finland. Duodecim, 89, 818824.[Medline]
53 Heliö, T., Färkkilä, M., Halme, L., Karlsson, M., Palotie, M. and Kontula, K. (1998) Genetic background of hereditary hemochromatosis and DNA-diagnostics [Finnish]. Duodecim, 114, 14041409.[Medline]
54 Tuomainen, T.P., Kontula, K., Nyyssonen, K., Lakka, T.A., Helio, T. and Salonen, J.T. (1999) Increased risk of acute myocardial infarction in carriers of the hemochromatosis gene Cys282Tyr mutation: a prospective cohort study in men in eastern Finland. Circulation, 100, 12741279.
55 Lahermo, P., Savontaus, M.L., Sistonen, P., Beres, J., de Knijff, P., Aula, P. and Sajantila, A. (1999) Y chromosomal polymorphisms reveal founding lineages in the Finns and the Saami. Eur. J. Hum. Genet., 7, 447458.[Web of Science][Medline]
56 Beaudet, A.L. (1990) Carrier screening for cystic fibrosis. Am. J. Hum. Genet., 47, 603605.[Medline]
57 Guthrie, A. and Susi, A. (1963) A simple phenylalanine method for detecting phenylketonuria in large populations of newborn infants. Pediatrics, 32, 338343.
58 Romppanen, E.L., Mononen, T. and Mononen, I. (1998) Molecular diagnosis of medium-chain acyl-CoA dehydrogenase deficiency by oligonucleotide ligation assay. Clin. Chem., 44, 6871.
59 Vartiainen, E., Puska, P., Jousilahti, P., Korhonen, H.J., Tuomilehto, J. and Nissinen, A. (1994) Twenty-year trends in coronary risk factors in north Karelia and in other areas of Finland. Int. J. Epidemiol., 23, 495504.
60 Kaprio, J. (1994) Lessons from twin studies in Finland. Ann. Med., 26, 135139.[Web of Science][Medline]
61 Syvänen, A.C., Ikonen, E., Manninen, T., Bengtstrom, M., Soderlund, H., Aula, P. and Peltonen, L. (1992) Convenient and quantitative determination of the frequency of a mutant allele using solid-phase minisequencing: application to aspartylglucosaminuria in Finland. Genomics, 12, 590595.[Web of Science][Medline]
62 Kure, S., Takayanagi, M., Narisawa, K., Tada, K. and Leisti, J. (1992) Identification of a common mutation in Finnish patients with nonketotic hyperglycinemia. J. Clin. Invest., 90, 160164.
63 Torrents, D., Mykkanen, J., Pineda, M., Feliubadalo, L., Estevez, R., de Cid, R., Sanjurjo, P., Zorzano, A., Nunes, V., Huoponen, K. et al. (1999) Identification of SLC7A7, encoding y+LAT-1, as the lysinuric protein intolerance gene. Nat. Genet., 21, 293296.[Web of Science][Medline]
64 Raymond, M., and Rousset, F. (1995) GENEPOP (version 1.2): Population genetics software for exact tests and ecumenicism. J. Hered., 86, 248249.
65 Guldbrandtsen, B., Tomiuk, J. and Loeschcke, V. (2000) POPDIST, version 1.1.1: a program to calculate population genetic distance and identity measures. J. Hered., 91, 178179.
66 Conover, W. (1971) Practical Nonparametric Statistics. John Wiley & Sons, New York.
67 Hollander, M. and Wolfe, D. (1973) Nonparametric Statistical Inference. John Wiley & Sons, New York, pp. 1522.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
W. Hennah, T. Varilo, M. Kestila, T. Paunio, R. Arajarvi, J. Haukka, A. Parker, R. Martin, S. Levitzky, T. Partonen, et al. Haplotype transmission analysis provides evidence of association for DISC1 to schizophrenia and suggests sex-dependent effects Hum. Mol. Genet., December 1, 2003; 12(23): 3151 - 3159. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Varilo, T. Paunio, A. Parker, M. Perola, J. Meyer, J. D. Terwilliger, and L. Peltonen The interval of linkage disequilibrium (LD) detected with microsatellite and SNP markers in chromosomes of Finnish populations with different histories Hum. Mol. Genet., January 1, 2003; 12(1): 51 - 59. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. Stangl, G. Baumann, and K. Stangl Coronary atherogenic risk factors in women Eur. Heart J., November 2, 2002; 23(22): 1738 - 1752. [Full Text] [PDF] |
||||
![]() |
M. Faucher, J.-L. Anctil, M.-A. Rodrigue, A. Duchesne, D. Bergeron, P. Blondeau, G. Cote, S. Dubois, J. Bergeron, R. Arseneault, et al. Founder TIGR/myocilin mutations for glaucoma in the Quebec population Hum. Mol. Genet., September 1, 2002; 11(18): 2077 - 2090. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Lindroos, S. Sigurdsson, K. Johansson, L. Ronnblom, and A.-C. Syvanen Multiplex SNP genotyping in pooled DNA samples by a four-colour microarray system Nucleic Acids Res., July 15, 2002; 30(14): e70 - e70. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||







