Skip Navigation


Human Molecular Genetics Advance Access originally published online on September 12, 2007
Human Molecular Genetics 2007 16(23):2960-2971; doi:10.1093/hmg/ddm256
This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow All Versions of this Article:
16/23/2960    most recent
ddm256v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (1)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Song, Y.
Right arrow Articles by Liu, S.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Song, Y.
Right arrow Articles by Liu, S.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author 2007. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oxfordjournals.org

Common genetic variation in calpain-10 gene (CAPN10) and diabetes risk in a multi-ethnic cohort of American postmenopausal women

Yiqing Song1, Nai-chieh You2, Yi-Hsiang Hsu3, James Sul2, Lin Wang4, Lesley Tinker6, Charles B. Eaton7 and Simin Liu1,2,5,*

1 Division of Preventive Medicine, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA 02215, USA, 2 Department of Epidemiology, University of California at Los Angeles (UCLA), School of Public Health, Los Angeles, CA 90095, USA, 3 Molecular and Integrative Physiological Science Program, Department of Environmental Health, Harvard School of Public Health, Boston, MA 02115, USA, 4 Department of Human Genetics and 5 Department of Medicine, David Geffen School of Medicine UCLA, Los Angeles, CA 90095, USA, 6 Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA and 7 Center for Primary Care and Prevention, Brown Medical School, Memorial Hospital of Rhode Island, Pawtucket, RI 02860, USA

* To whom correspondence should be addressed at: Department of Epidemiology, UCLA School of Public Health, PO Box 951772, 650 Charles E. Young Drive South, Los Angeles, CA 90095, USA. Tel: +1 3102065862; Fax: +1 3102066039; Email: siminliu{at}ucla.edu

Received July 30, 2007; Accepted September 3, 2007


    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 RESULTS
 DISCUSSION
 MATERIALS AND METHODS
 FUNDING
 REFERENCES
 
Calpain-10 (CAPN10) protein may play a role in glucose metabolisms, pancreatic ß-cell insulin secretion and thermogenesis. Emerging evidence has implicated a role of CAPN10 genetic variants in the risk of type 2 diabetes mellitus (T2DM). Previous association studies, however, have focussed only on several variants initially reported and provided inconsistent results. We conducted a large nested case–control study to comprehensively investigate the associations between common variations in CAPN10 gene and T2DM risk among postmenopausal women aged 50–79 years from the Women's Health Initiative Observational Study. After comprehensive screening in 244 randomly chosen control samples (n = 61 for each of four ethnic groups), we selected a total of 12 tagging single nucleotide polymorphisms (tSNPs) spanning 91 kb in CAPN10 and genotyped them in 1543 diabetes cases and 2132 matched controls (including 968 cases and 968 controls for whites, 366 and 732 for blacks, 152 and 303 for Hispanics and 98 and 195 for Asian/Pacific Islanders). There were no significant associations between any individual tSNP and T2DM, within either the full study sample or any specific ethnic group. Nor was there any evidence of association between common CAPN10 haplotypes and diabetes risk (global tests for differences in risk were P = 0.31 for overall common haplotypes, P = 0.44 for haplotypes in block 1 and P = 0.37 for haplotypes in block 2). In conclusion, we did not observe any significant associations of the common SNPs or haplotypes across the CAPN10 gene with diabetes risk in our large and ethnically diverse cohort of postmenopausal women.


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 RESULTS
 DISCUSSION
 MATERIALS AND METHODS
 FUNDING
 REFERENCES
 
Type 2 diabetes mellitus (T2DM) is a genetically heterogeneous and complex metabolic disease. Twin and family studies have shown that T2DM has a strong genetic component (1). Single-gene mutations causing rare monogenic forms of early-onset T2DM (<25 years), such as maturity-onset diabetes of the young, have been identified but they are estimated to account for only a small fraction of all diabetes cases (1–5%) (2). The more common, late-onset form of T2DM is thought to be modulated by multiple loci, each with modest genetic effects. However, specific genetic variants with low penetrance contributing to this polygenic disease remain largely undefined.

In a genome-wide linkage analysis of 330 Mexican-American affected sib pairs (3), calpain-10 gene (CAPN10) was first positionally cloned as a putative diabetes-predisposing gene within a T2DM susceptibility region localized to chromosome 2q37.3 (termed NIDDM1, non-insulin-dependent diabetes mellitus 1). CAPN10 encodes a 672 amino acid intracellular protease, which is a member of the calpain family of calcium-dependent intracellular cysteine proteases (4). It has been reported that CAPN10 is ubiquitously expressed, especially in heart, skeletal muscle, liver and pancreatic islet cells (5). CAPN10 protein may play a role in insulin-mediated glucose metabolisms (6,7), pancreatic ß-cell insulin secretion (6,810) and thermogenesis (11). There is also some evidence that CAPN10 genetic variants may affect either CAPN10 gene expression or mRNA stability (3,6,12). The initial report documented the significant associations of one polymorphism [single nucleotide polymorphism (SNP)-43, rs3792267] and a ‘high risk’ haplotype combination ‘1-1-2/1-2-1’ inferred from SNP-43 (rs3792267), -19 (3842570) and -63 (rs5030952) (1, major allele and 2, minor allele) with increased risk of T2DM in three independent populations (3). Since then, many replication association studies and hypothesis-driven experiments have generated inconsistent results. As indicated in our systematic assessment of the available data, the lack of association replication may reflect the fact that different populations with genetic and environmental diversity, case ascertainment schemes, control selection, sample sizes, analytic strategies and potential gene–gene and/or gene–environment interactions could produce either false-positive or false-negative results. Of note, almost all previous studies have focussed only on these three or four SNPs and their derived haplotypes highlighted in the initial report (13). As there is substantial heterogeneity of allelic and haplotypic frequencies among different populations with various ethnic or geographic backgrounds (14), these CAPN10 variants and their derived haplotypes originally defined in a small sample of Mexican-Americans may not be sufficient to capture the genetic variability of CAPN10 in multiple populations of various ethnic origins. The observed haplotype diversity could also decrease the power of individual studies and was thus likely to generate false-negative results. Most recently, a region of extensive linkage disequilibrium (LD) patterns was observed to be within the CAPN10 locus (15), which makes it unlikely that the putative genetic variant responsible for the CAPN10 genotype–T2DM association lies outside the CAPN10 genomic region. However, few studies have systematically explored the underlying common genetic variation in CAPN10 across diverse populations, especially when a body of information from the publicly accessible SNP databases, such as the HapMap database of the International HapMap Project (16), has provided the foundation to conduct a comprehensive association study of common genetic variation.

We therefore employed a comprehensive haplotype approach to assess the association between common variation in CAPN10 and T2DM in a large case–control study nested in the Women's Health Initiative Observational Study (WHI-OS), an ethnically diverse cohort of postmenopausal women aged over 50 years including whites, blacks, Hispanics and Asian/Pacific Islanders. In particular, we also set out to replicate the associations of those specific SNPs and haplotypes in the CAPN10 with T2DM highlighted in previous reports.


    RESULTS
 TOP
 ABSTRACT
 INTRODUCTION
 RESULTS
 DISCUSSION
 MATERIALS AND METHODS
 FUNDING
 REFERENCES
 
Estimation of MAFs and LD structures of 12 CAPN10 gene tSNPs among controls
None of the SNPs showed statistically significant deviation from Hardy–Weinberg equilibrium (HWE) among controls (Table 1 and Fig. 1). With the exception of rs1138847 [5'-untranslated region (UTR)] and rs4676399 (3'-UTR), 10 of 12 tSNPs (one every ~3.1 kb) were within a 31.3 kb of CAPN10 genomic region. The allelic frequencies of all tSNPs varied significantly by ethnic groups (all P < 0.0001 except for rs4676399 at P < 0.003). In particular, allele frequencies in blacks and Asian/Pacific Islanders were significantly different from those of other ethnic groups for most SNPs.


Figure 1
View larger version (10K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Figure 1. Schematic presentation of the position of 12 tSNPs spanning the CAPN10 region. These markers within CAPN10 genomic region included one putative promoter SNP (rs2953173), one SNP in the 3'-UTR (rs1509), one missense SNP (rs7607759, Thr504Ala), one synonymous SNP (rs3749166, Ala620Ala) and six intronic SNPs. On the basis of physical positions, two SNPs (rs1138847 and rs4676399) are outside of the CAPN10 genomic region without high LD with other SNPs; rs1138847 is located 24 kb upstream and rs4676399 is located 32 kb downstream of the CAPN10 coding region.

 


View this table:
[in this window]
[in a new window]

 
Table 1. Characterization of 12 haplotype tagging SNPs (tSNPs) chosen in CAPN10 genomic region

 
Figure 2 showed the LD structure and haplotype blocks based on these 12 tSNPs stratified by ethnic groups (for all controls). With the exception of rs1138847 and rs4676399, strong, pairwise LD was observed between all other 10 SNPs within CAPN10. As expected, the lengths and locations of defined haplotype blocks were slightly different among ethnic groups, partly reflecting the differences in allele frequencies between groups. Overall, two blocks with slightly different boundaries across four ethnic groups could be clearly defined. The LD pattern within block 2 was similar among whites, Hispanics and Asian/Pacific Islanders. For blacks, the sizes of both blocks were modestly reduced. There was no evidence for high LD of these two blocks with either of two SNPs in the 5' and 3' outside the CAPN10 genomic region. This finding was consistent with a recent study reporting a high LD region within the CAPN10 locus (17). Notably, the SNP44 and SNP43 loci are 12 bp apart and yet not in complete LD (D' = 0.94–1.00; r2 = 0.007–0.058). SNP-44 is in almost perfect LD with rs7607759 (Thr504Ala, a common non-synonymous SNP in exon 10) in all groups (D' = 0.988–1.00; r2 = 0.934–1.00).


Figure 2
View larger version (54K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Figure 2. Haploview plot defining LD structures between the 12 tSNPs within or near CAPN10 among all controls from four ethnic groups (American whites, blacks, Hispanics and Asian/Pacific Islanders). The relative physical position of each SNP is given in the upper diagram. Each diamond for each SNP combination indicates the pairwise LD between all tSNPs, with red indicating strong LD (D' > 0.8) and a logarithm of odds score of >2.0. LD strength between the chosen SNPs is determined by the 90% confidence limits of D' statistic.

 
Single-marker association analysis
As showed in Table 2, no significant associations between all tSNPs and diabetes risk were found in American blacks, Hispanics and Asians/Pacific Islanders. A marginally significant association was found for rs5030952 (SNP-63) in an additive genetic model among American whites (P = 0.035). However, the false discovery rate (FDR) q-value was larger than 0.05 after adjusted for multiple testing. Tests of heterogeneity of the associations across ethnic groups were not statistically significant. We observed no evidence of significant associations between any of tSNPs and diabetes risk in all groups combined (Table 2).


View this table:
[in this window]
[in a new window]

 
Table 2. Single-SNP associations between each tSNP spanning the CAPN10 genomic region and T2DM risk

 
Haplotype association analysis
We used 12 tSNPs to infer common haplotype within each block with a frequency of ≥5% in the combined data of all controls, regardless of ethnicity groups. We observed four common haplotypes in block 1 and five common haplotypes in block 2 (Table 3). We first performed global tests for differences in the overall haplotype frequency between cases and controls and observed no significant haplotype effects in blocks 1 (P = 0.44) and 2 (P = 0.37). Within block 1 (rs2953173/rs2975760/rs3792267), we observed marginal haplotype effects; compared with non-carriers for 1-1-2, we observed increased risk for carriers of the 1-1-2 [odds ratio (OR) = 1.82, 95% confidence interval (CI): 1.00–3.29; P = 0.048] and lower risk of T2DM for carriers of the 2-1-1 (OR = 0.67, 95% CI: 0.48–0.93; P = 0.016). In block 2 (rs3749166/rs3749168/rs5030952/rs4676348), the only SNP63 (rs5030952) rare allele T-bearing haplotype 2-1-2-2 showed significant positive association with T2DM risk in whites (OR = 1.50, 95% CI: 1.02–2.20; P = 0.039) and all ethnic groups combined (OR = 1.33, 95% CI: 1.05–1.69; P = 0.0195). However, these associations were weak and did not reach nominal significance after we did an FDR correction for multiple testing.


View this table:
[in this window]
[in a new window]

 
Table 3. Haplotype-based associations between CAPN10 common haplotypes and T2DM risk

 
We also reconstructed haplotypes on the basis of the haplotype block structure in each ethnic group (Table 4). The global likelihood ratio tests of haplotype associations in each block were not statistically significant. Significant ORs were only observed in whites. Besides the haplotype 2-1-1 in block 1 (same as that in Table 3), we also observed increased T2DM risk for all carriers of the SNP-63 T-bearing haplotype 2-1-2-2-1 in block 2 (OR = 1.54, 95% CI: 1.02–2.26; P = 0.030). Their statistical significance could not retain after taking into account multiple testing.


View this table:
[in this window]
[in a new window]

 
Table 4. Associations between ethnic-specific haplotypes of CAPN10 and T2DM risk

 
Replication of previously studied haplotype and diplotype associations
On the basis of SNP-43, -19 and -63, the haplotype 1-2-1 was most common in whites (36%), Hispanics (31%) and Asians/Pacific Islanders (54%), but only at a minor frequency in blacks (18%). The most common haplotype in blacks was 1-1-2 (41%). We found no evidence of any significant associations of haplotypes defined by SNP-43, -19 and -63 with T2DM (Table 5). When we inferred haplotype from SNP-44, -43, -19 and -63, the major haplotypes were 1-1-1-2 in blacks (40%) and 1-1-2-1 in other groups at 30–54%. There was evidence to suggest an association between the rare allele C in the SNP-44 locus alone or in combination with the 111 haplotype with risk of T2DM (18), whereas our study failed to replicate this finding with SNP-44. There was a suggestion that the haplotype 2-2-1 at SNP-43, -19 and -63 or with the common allele of SNP-44 (1-2-2-1) appeared to be associated with T2DM risk only in American Hispanics, but their P-values were approximately marginal (0.042 for carriers of 2-2-1 and 1-2-2-1 versus non-carriers, respectively) (Table 5).


View this table:
[in this window]
[in a new window]

 
Table 5. Association of CAPN10 haplotype previously reported and T2DM risk

 
To replicate the associations of diplotypes with T2DM previously reported, we inferred the common diplotypes based on SNP-43, -19 and -63 with and without SNP-44 (Table 6). Of the diplotypes based on three SNPs, 10 genotypes with frequency >1% were observed in whites, blacks and Hispanics and 2-2-1/2-2-1 was not shown in Asians/Pacific Islanders. When SNP-44 was included, we observed 15 diplotypes with frequency >1% in whites, blacks and Hispanics. Similarly, 1-2-2-1/1-2-2-1 was not found in Asians/Pacific Islanders. Of 108 tests for the crude comparisons of diplotype frequencies between cases and controls, only seven tests showed notably different diplotype frequencies between cases and controls (P < 0.05). These results could be false positives because they failed to remain statistically significant after adjusting for multiple comparisons. Overall, there were not statistically significant associations for these diplotypes and T2DM.


View this table:
[in this window]
[in a new window]

 
Table 6. Association of CAPN10 diplotypes previously reported and T2DM risk

 

    DISCUSSION
 TOP
 ABSTRACT
 INTRODUCTION
 RESULTS
 DISCUSSION
 MATERIALS AND METHODS
 FUNDING
 REFERENCES
 
On the basis of 1543 T2DM cases and 2132 matched controls from a multi-ethnic cohort of American postmenopausal women, our study showed little evidence for the role of common genetic variants in CAPN10 gene in T2DM risk. None of the SNPs originally featured as potential T2DM loci showed any significant association with T2DM. Moreover, our haplotype-based analyses did not reveal any significant findings. Such null results were consistent across four ethnic groups including whites, blacks, Hispanics and Asian/Pacific Islanders. The lack of replication in our large multi-ethnic case–control study strongly suggests that common genetic variants in CAPN10, including those highlighted in the original reports, may not confer a susceptibility to T2DM.

Given the large number of associations tested, any observed significant associations with any single marker or haplotype should be interpreted with caution. Although we did observe some significant associations, such findings may simply be explained by multiple testing (all corrected P >0.05) and are thus more likely to represent false-positive associations. First, the P-values for significant SNP and haplotype associations were above 5% after performing either an FDR or a Bonferroni correction for multiple testing, although the methods are quite conservative. Secondly, the significant results did not replicate any previously reported associations. Thirdly, the observed significant results were not robust to statistical model selection and adjustment for confounding. Fourth, the global tests did not support a significant association of the CAPN10 haplotypes with T2DM.

Despite substantial heterogeneity in frequencies of the alleles and haplotypes in CAPN10 gene among various ethnic groups, almost all previous studies did not determine the haplotype structure of CAPN10 in their own study samples, but rather genotyping the SNPs (SNP-44, -43, -19, -63) originally identified in Mexican-Americans (3). As noted in our previous meta-analysis (13), the lack of association in previous association studies could be related to genetic heterogeneity because these four intronic CAPN10 variants may not be sufficient to capture the vast majority of genetic variability of CAPN10 in diverse populations. Emerging evidence has indicated that an optimal subset of most informative polymorphisms, defined as tSNPs based on the LD information, is more likely to capture most of common genetic variation. The haplotype-block information is believed to greatly increase power for detection of untyped common variants and improve localization of untyped variants associated with the trait of interest. Empirical data have also demonstrated that tSNPs selected from the HapMap population samples can effectively capture common variation and provide good power to detect an association under a disease model of modest risk in many other independent samples (19). On the basis of the HapMap database, our comprehensive association analysis of informative SNPs [minor allele frequency (MAF)≥5%] based on LD patterns in each ethnic group provides powerful null evidence against a main effect association between the overall risk of T2DM and variants in CAPN10 that are common among four American ethnic groups. In addition, our exploration of LD patterns and extent across CAPN10 locus provided evidence to support the recent findings that an extensive LD region existed within the CAPN10 gene (15) and made it unlikely that the putative genetic variants outside the CAPN10 gene may explain the putative CAPN10 genotype–T2DM association.

Our study focussed primarily on testing the common variant-common disease hypothesis. Indeed, the vast majority of common variations (85–95%) was observed to be shared between populations living in different continents (20). In the present study, we observed most of the common CAPN10 haplotypes to be shared among whites, Hispanics and Asian/Pacific Islanders. In line with recent report from a population-based study, neither all the intronic variants previously reported in the initial study for haplotype definition nor functional variants (regulatory or coding polymorphisms) were significantly associated with T2DM in our four ethnic groups. Our strategies for common variant selection based on the available genetic information, although comprehensive and cost-effective, may be unable to identify any rare but potentially functional variants with relatively low frequencies (<5%) in any specific ethnic group that were also not in high LD with any SNPs chosen in our study. Nevertheless, a rare polymorphism may not be a major contributor to T2DM with a high prevalence in the general population, given its low frequency and most likely modest genetic effects. Further large-scale studies with resequencing efforts are warranted to help identify any rare but causal variants underlying T2DM.

The lack of association for CAPN10 variants and T2DM may also be most likely because of insufficient power to detect changes in phenotypes. It is clear that the size of effects of each CAPN10 variant or haplotype was modest (1.15–1.25), requiring very large sample sizes to attain sufficient power for detection. We may not have enough power (<80%) to detect significant signal, if estimated OR less than 1.25 for any single SNP or haplotype with a frequency of <10%. Specifically, our study was well powered (≥80%) to detect a relative risk of ≥1.15 for the combined samples and a relative risk of ≥1.25 for whites and blacks for a risk allele with frequencies of 10 and 70%. However, we did not have adequate statistical power (<80%) if the relative risk is less than 1.25 in Hispanics and Asians/Pacific Islanders. To increase statistical power to detect modest effect sizes, meta-analysis has been increasingly applied in genetic association studies to synthesize the totality of evidence across available studies using either study-level data or individual data. However, we do not believe that sole reliance on the increased power in any meta-analysis could provide a true genetic effect size, because many inherent features contributing to between-study heterogeneity, such as the quality of study design and analysis, various ethnic backgrounds and publication bias, might influence the pooled estimates and complicate the interpretations. In this regard, it is essential to replicate the genetic association in carefully conducted, large-scale association studies.

As a complex metabolic disease, the pathogenesis of T2DM is multi-factorial involving both environmental and genetic factors. It seems plausible to speculate that the main effects of genetic factors may be modest and can be masked in the presence of underlying gene–gene or gene–environment interaction. There was also evidence for a statistical interaction between the NIDDM1 region harboring CAPN10 of chromosome 2 and CYP19 region of chromosome 15 (21); a recent study showed that gene expression of CAPN3, another gene on chromosome 15, was negatively associated with body fat and insulin resistance in 27 non-diabetic individuals (22). Without more data from independent association and function studies, it is, however, difficult to draw any conclusions about whether or not they are true disease-predisposing genes, and other candidate genes in the region may also contribute to disease predisposition. Although there is no a priori reason to suspect that these other genes are associated with T2DM, they cannot be definitely excluded as possible candidates until further characterization of this region is complete. To adequately explore potential gene–gene and gene–environmental interactions, further investigation in large-scale and well-designed association studies is clearly warranted.

In addition to T2DM, the variants in CAPN10 have been associated with polycystic ovary syndrome (23), obesity (24,25) and the metabolic syndrome (26), which share some pathophysiological traits that also underlie T2DM (e.g. insulin resistance and/or compensatory hyperinsulinemia). Intriguingly, biological data, mostly from in vitro studies, point to a role of CAPN10 in several key pathways for the pathophysiology of T2DM including insulin-mediated glucose metabolism (6,7), insulin production and release in pancreatic ß cell (6,810) and thermogenesis (11). However, such results should be interpreted cautiously because different cell types, quantitative methods and experimental conditions as well as non-specific CAPN antagonists were used. Furthermore, CAPN10 encodes at least eight different protein isoforms through the use of alternative translation or splicing processes. Each specific CAPN10 protein isoform may function as a tissue-specific metabolic modulator, leading to suggest that different tissues may have different protein profiles in terms of CAPN10 protein type. To clarify the molecular consequences of genetic variation at CAPN10 locus, additional study will be required to investigate the regulation mechanisms underlying CAPN10 mRNA expression, splicing and degradation and the consequences on protein translation, stability, activity and tissue-specific expression patterns.

Finally, it is worth mentioning that population stratification may be a concern when all individuals in this study self-identified their ethnicity, and data from women with diverse ethnic backgrounds are included. However, the consistency of the null associations across ethnic groups suggests that population stratification is not a major concern in our study. Our prospective study design and matching strategy used in such a well-characterized study population could eliminate the bias because of different selection procedures between cases and controls.

In conclusion, our large multi-ethnic case–control study of postmenopausal women did not replicate previous reports implicating CAPN10 as a T2DM susceptibility locus, although we cannot exclude the possibility of a modest genetic effect. Our results provide no evidence to support the notion that common genetic variants in CAPN10 may contribute significantly to the pathogenesis of T2DM.


    MATERIALS AND METHODS
 TOP
 ABSTRACT
 INTRODUCTION
 RESULTS
 DISCUSSION
 MATERIALS AND METHODS
 FUNDING
 REFERENCES
 
Study population
The WHI-OS is a longitudinal study designed to examine the association between clinical, socio-economic, behavioral and dietary risk factors with subsequent incidence of health outcomes, including cardiovascular disease (CVD) and diabetes. Details of the scientific rationale, eligibility and other design aspects have been described elsewhere (27,28). Between September 1994 and December 1998, the WHI-OS enrolled a total of 93 676 postmenopausal women aged 50–79 years at 40 clinical centers throughout the USA. Women of all races and ethnicities were included with a priority to enroll racial/ethnic minority groups in their representation in the US population of women aged 50–79 years. At baseline, women completed screening and enrollment questionnaires and underwent a physical examination and fasting blood specimen collection. The study has been reviewed and approved by human subjects review committees at each participating institution, and signed informed consent was obtained from all women enrolled.

WHI-OS participants were followed by annual mailed self-administered questionnaires and completed an annual medical history. Incident cases of diabetes were identified on the basis of post-baseline self-report of first-time use of hypoglycemic medication (oral hypoglycemic agents or insulin) during a median follow-up period of 5.9 years (mean value = 5.5 years). Of the 93 676 postmenopausal women enrolled into the WHI-OS cohort, approximately 82 069 have no prior history of CVD and/or diabetes at baseline. Following the principle of risk-set sampling (27), for each new case, controls were selected randomly from women who remained free of CVD and/or diabetes at the time the case was identified during follow-up. Controls were matched to the cases by age (±2.5 years), racial/ethnic group (white/Caucasian, black/African, Hispanic/Latino and Asian/Pacific Islander), clinical center (geographic location), time of blood drawn (±0.10 h) and length of follow-up. In our current nested case–control study, 968 cases in whites were randomly chosen and matched with one control each. Of 616 incident cases among ethnic minority women, 366 cases were black, 152 were Hispanics and 98 cases were Asian/Pacific Islanders. The 1:2 matching ratio was used for minorities to strengthen the power in these smaller sample sizes of cases. We estimated that we would have at least >80% statistical power to detect a relative risk ≥1.5 among whites or black/African Americans and a relative risk ≥2.0 among other ethnic groups. Our study did not include American-Indian or Alaskan-native women because of their limited numbers.

Characterizing LD and haplotype-tagging SNP selection
As described in detail previously (29), we implemented a two-stage approach to choose haplotype tagging SNP (tSNP) for genotyping in our large case–control samples. The first stage consists of comprehensive common SNP discovery followed by genotyping them in 244 samples randomly selected from the WHI-OS source population. The second stage is selecting the tSNPs based on LD patterns.

In the first stage, we initially surveyed common genetic variation from the National Center for Biotechnology Information database SNP (NCBI dbSNP) supplemented by HapMap database. Our goal was to capture the common haplotype patterns across 91.8 kb of the CAPN10 gene, covering its 30 kb 5' upstream and 30 kb 3' downstream regions. In total, an initial set of 35 SNPs was selected on the basis of the following criteria: (i) functionality priority: non-synonymous coding SNPs (cSNPs) and splicing-site SNPs (ssSNPs) were kept following the order: cSNPs>ssSNPs>synonymous SNPs>5'-UTR SNPs>3'-UTR SNPs>intronic SNPs and (ii) MAF≥5% in at least one ethnic group (30). The initial set of 35 SNPs (one every ~2.6 kb) was genotyped in a multi-ethnic panel of 244 women randomly chosen from the entire case–control sample (n = 61 for each group of whites, blacks, Hispanics and Asian/Pacific Islanders). This sample size guaranteed that any common haplotype with a frequency of >5% were sufficiently captured.

In the second stage, we identified tSNPs on the basis of the LD patterns of those 35 SNPs among 61 individuals from each ethnic population. Pairwise LD between SNPs was assessed using Lewontin's D' statistic and the squared correlation statistic r2. The Haploview program was used to calculate the LD coefficient and to define haplotype blocks (31,32). We chose all common tSNPs with special focus on African and Caucasian-American samples. First, we selected tSNPs in the African-American sample using the r2-based Tagger program (33). tSNPs in the African-American were chosen by finding the minimum set of SNPs with r2 ≥0.80 and MAF ≥5%. We then used backward trimming to reach a minimum set of tSNPs for other ethnic groups to ensure a sufficient and yet non-redundant parsimonious set of tSNPs. From the initial dense set of 35 SNPs, a total of 12 tSNPs (including one insertion/deletion polymorphism, SNP-19, rs3842570) were eventually selected and genotyped in all case–control samples (Table 1).

As described elsewhere (29,30), common tSNPs chosen using our two-stage strategy captured majority of genetic variations (29) and provided sufficient statistical power to detect an association under a disease model of modest risk (19).

SNP genotyping method
DNA was extracted from the buffy coat fraction of centrifuged blood using the QIAmp Blood Kit (Qiagen, Chatsworth, CA, USA). For the SNP set at first stage, the SNPs were genotyped using the high-throughput Illumina BeadArrayTM platform for 264 DNA samples (San Diego, CA, USA). The large-scale tSNP genotyping in the entire case–control sample consisting of 1543 cases and 2132 matched controls was performed by the TaqMan allelic discrimination method using an ABI 7900 HT Sequence Detection System (Applied Biosystems, Foster City, CA, USA) in the UCLA Molecular Epidemiology Laboratory (Dr Simin Liu). The primers and probes were custom-designed by the ABI Taqman system (PE Biosystems, Foster City, CA, USA). Following polymerase chain reaction (PCR) amplification, endpoint fluorescence was read with the Applied Biosystems Primer 7900HT instrument and genotypes were assigned using SDS2.2.2 Allelic Discrimination Software (Applied Biosystems) by two independent technicians blinded to sample identification numbers. A DNA fragment containing SNP-19 (rs3842573), which is a tandem-repeat insertion/deletion variant with either two or three copies of a 32 bp fragment, was amplified by a PCR using a forward primer of 5'-GTTTGGTTCTCTTCAGCGTGGAG-3' and a reverse primer of 5'-CATGAACCCTGGCAGGGTCTAAG-3'. PCR products were separated by electrophoresis on a 3.5% agarose gel stained with ethidium bromide. Allele 1 (two repeats) comprised 154 bp and allele 2 (three repeats) comprised 185 bp. A total of 138 replicate samples were randomly selected and duplicated across all plates (5%). Concordance rate was >99% and the average genotyping drop-out rate was 1.5% (from 0.4 to 5.9%) for each of the 12 SNPs in the case–control study.

Statistical analyses
We first assessed each tSNP for the HWE test using the {chi}2 test. We also tested for heterogeneity of genotype distributions across ethnic groups by the {chi}2 test. In both single-SNP and halpotype-based analyses, we employed conditional logistic regression to calculate ORs and 95% CIs for each genetic variant with diabetes risk (using the PROC PHREG procedure in SAS v9.1, SAS Institute, Cary, NC, USA). We made adjustments for potential confounding variables including matching factors (age, clinical center, time of blood drawn and ethnicity), body mass index (BMI), cigarette smoking (never, past and current), alcohol intake (never, past and current), hormone replacement therapy (HRT) usage (never, past, current), family history of diabetes and physical activity per week at baseline (expressed as total METs; total expenditure of energy from recreational physical activity). For each tSNP, we tested for allelic association with diabetes risk under dominant, recessive and additive models. Likelihood ratio test was used to test the interaction effect between the genotypes and ethnicity on diabetes risk.

We estimated haplotype frequency in each ethnic group using the expectation–maximization (EM) algorithm applied to phase-unknown case–control genotype data (as implemented in SAS PROC HAPLOTYPE) (34). Haplotype frequency was validated via HAPLOTYPER version 2, a Bayesian haplotype inference algorithm (35). For each individual and each haplotype, h, the haplotype dosage estimate (i.e. an estimate of the number of copies of haplotype h) was computed using the individual's genotype data and haplotype frequency estimates obtained from the combined (cases and controls) data set (34,36). Only haplotypes with estimated frequencies ≥1% in the combined cases and controls were included for analyses. We first performed global likelihood ratio tests to examine whether the frequency distributions of the common haplotypes differed between cases and controls by comparing a model with additive effects on the log odds scale for each common haplotype (using the most common halpotype as the referent) with the intercept-only model. We then formally tested for ethnic differences in haplotype-associated risks by performing a likelihood ratio test following the inclusion of an interaction term between the risk haplotypes and ethnicity in the multi-variable model. To examine the association between the resulting haplotype/haplotype combinations and diabetes risk, the estimate of haplotype dosage was treated as a surrogate variable for the true haplotype. All the analyses were performed using SAS statistical package (version 9.0 for window; SAS Institute), unless specified.

To account for the multiple comparisons made in this study, we calculated the q-value statistic to estimate the FDR by incorporating all P-values from multiple tests performed for SNPs, haplotypes and diplotypes in the Cox models (37). The q-values were obtained for each and all of the P-values, and the q-values for P ≤ 0.05 were presented.


    FUNDING
 TOP
 ABSTRACT
 INTRODUCTION
 RESULTS
 DISCUSSION
 MATERIALS AND METHODS
 FUNDING
 REFERENCES
 
This study was supported by the National Institute of Diabetes and Digestive and Kidney Diseases (NIDDK) R01 grant DK062290 from the National Institutes of Health. The WHI program is funded by the National Heart, Lung and Blood Institute and by the US Department of Health and Human Services.


    ACKNOWLEDGEMENTS
 
We would like to acknowledge all WHI centers and their principal investigators for their participation in this research. We are indebted to all dedicated and committed participants of the WHI-OS. We thank Alyssa Smith and Joseph Larson from the WHI Clinical Coordinating Center for their assistance. A short list of WHI investigators are shown in an on-line appendix file.

Conflict of Interest statement. None of the authors had any conflicts of interest.


    REFERENCES
 TOP
 ABSTRACT
 INTRODUCTION
 RESULTS
 DISCUSSION
 MATERIALS AND METHODS
 FUNDING
 REFERENCES
 

  1. Barroso I. Genetics of type 2 diabetes. Diabet. Med. (2005) 22:517–535.[CrossRef][Web of Science][Medline]

  2. Hansen L., Pedersen O. Genetics of type 2 diabetes mellitus: status and perspectives. Diabetes Obes. Metab. (2005) 7:122–135.[CrossRef][Web of Science][Medline]

  3. Horikawa Y., Oda N., Cox N.J., Li X., Orho-Melander M., Hara M., Hinokio Y., Lindner T.H., Mashima H., Schwarz P.E., et al. Genetic variation in the gene encoding calpain-10 is associated with type 2 diabetes mellitus. Nat. Genet. (2000) 26:163–175.[CrossRef][Web of Science][Medline]

  4. Goll D.E., Thompson V.F., Li H., Wei W., Cong J. The Calpain system. Physiol. Rev. (2003) 83:731–801.[Abstract/Free Full Text]

  5. Ma H., Fukiage C., Kim Y.H., Duncan M.K., Reed N.A., Shih M., Azuma M., Shearer T.R. Characterization and expression of calpain 10. A novel ubiquitous calpain with nuclear localization. J. Biol. Chem. (2001) 276:28525–28531.[Abstract/Free Full Text]

  6. Baier L.J., Permana P.A., Yang X., Pratley R.E., Hanson R.L., Shen G.Q., Mott D., Knowler W.C., Cox N.J., Horikawa Y., et al. A calpain-10 gene polymorphism is associated with reduced muscle mRNA levels and insulin resistance. J. Clin. Invest. (2000) 106:R69–R73.[Medline]

  7. Hoffstedt J., Ryden M., Lofgren P., Orho-Melander M., Groop L., Arner P. Polymorphism in the calpain 10 gene influences glucose metabolism in human fat cells. Diabetologia (2002) 45:276–282.[CrossRef][Web of Science][Medline]

  8. Sreenan S.K., Zhou Y.P., Otani K., Hansen P.A., Currie K.P., Pan C.Y., Lee J.P., Ostrega D.M., Pugh W., Horikawa Y., et al. Calpains play a role in insulin secretion and action. Diabetes (2001) 50:2013–2020.[Abstract/Free Full Text]

  9. Marshall C., Hitman G., Turner M. Calpain-10 regulates insulin secretion. Diabetes (2003) 52:A373.

  10. Marshall C., Hitman G.A., Partridge C.J., Clark A., Ma H., Shearer T.R., Turner M.D. Evidence that an isoform of calpain-10 is a regulator of exocytosis in pancreatic beta-cells. Mol. Endocrinol. (2005) 19:213–224.[Abstract/Free Full Text]

  11. Hoffstedt J., Naslund E., Arner P. Calpain-10 gene polymorphism is associated with reduced beta(3)-adrenoceptor function in human fat cells. J. Clin. Endocrinol. Metab. (2002) 87:3362–3367.[Abstract/Free Full Text]

  12. Tsuchiya T., Yan H., Vogelstein B., Kinzler K., Bell G.I. Identification of a haplotype associated with the decreased expression of calpain-10 in lymophoblastoid cell lines. Diabetes (2003) 52:A256.

  13. Song Y., Niu T., Manson J.E., Kwiatkowski D.J., Liu S. Are variants in the CAPN10 gene related to risk of type 2 diabetes? A quantitative assessment of population and family-based association studies. Am. J. Hum. Genet. (2004) 74:208–222.[CrossRef][Web of Science][Medline]

  14. Fullerton S.M., Bartoszewicz A., Ybazeta G., Horikawa Y., Bell G.I., Kidd K.K., Cox N.J., Hudson R.R., Di Rienzo A. Geographic and haplotype structure of candidate type 2 diabetes susceptibility variants at the calpain-10 locus. Am. J. Hum. Genet. (2002) 70:1096–1106.[CrossRef][Web of Science][Medline]

  15. Vander Molen J., Frisse L.M., Fullerton S.M., Qian Y., Del Bosque-Plata L., Hudson R.R., Di Rienzo A. Population genetics of CAPN10 and GPR35: implications for the evolution of type 2 diabetes variants. Am. J. Hum. Genet. (2005) 76:548–560.[CrossRef][Web of Science][Medline]

  16. The International HapMap Consortium. A haplotype map of the human genome. Nature (2005) 437:1299–1320.[CrossRef][Medline]

  17. Tsuchiya T., Schwarz P.E., Bosque-Plata L.D., Geoffrey Hayes M., Dina C., Froguel P., Wayne Towers G., Fischer S., Temelkova-Kurktschiev T., Rietzsch H., et al. Association of the calpain-10 gene with type 2 diabetes in Europeans: results of pooled and meta-analyses. Mol. Genet. Metab. (2006) 89:174–184.[CrossRef][Web of Science][Medline]

  18. Elbein S.C., Chu W., Ren Q., Hemphill C., Schay J., Cox N.J., Hanis C.L., Hasstedt S.J. Role of calpain-10 gene variants in familial type 2 diabetes in Caucasians. J. Clin. Endocrinol. Metab. (2002) 87:650–654.[Abstract/Free Full Text]

  19. de Bakker P.I., Burtt N.P., Graham R.R., Guiducci C., Yelensky R., Drake J.A., Bersaglieri T., Penney K.L., Butler J., Young S., et al. Transferability of tag SNPs in genetic association studies in multiple populations. Nat. Genet. (2006) 38:1298–1303.[CrossRef][Web of Science][Medline]

  20. Hinds D.A., Stuve L.L., Nilsen G.B., Halperin E., Eskin E., Ballinger D.G., Frazer K.A., Cox D.R. Whole-genome patterns of common DNA variation in three human populations. Science (2005) 307:1072–1079.[Abstract/Free Full Text]

  21. Cox N.J., Frigge M., Nicolae D.L., Concannon P., Hanis C.L., Bell G.I., Kong A. Loci on chromosomes 2 (NIDDM1) and 15 interact to increase susceptibility to diabetes in Mexican Americans. Nat. Genet. (1999) 21:213–215.[CrossRef][Web of Science][Medline]

  22. Walder K., McMillan J., Lapsys N., Kriketos A., Trevaskis J., Civitarese A., Southon A., Zimmet P., Collier G. Calpain 3 gene expression in skeletal muscle is associated with body fat content and measures of insulin resistance. Int. J. Obes. Relat. Metab. Disord. (2002) 26:442–449.[CrossRef][Web of Science][Medline]

  23. Vollmert C., Hahn S., Lamina C., Huth C., Kolz M., Schoepfer-Wendels A., Mann K., Bongardt F., Mueller J.C., Kronenberg F., et al. Calpain-10 variants and haplotypes are associated with polycystic ovary syndrome in Caucasians. Am. J. Physiol. Endocrinol. Metab. (2007) 292:836–844.[CrossRef]

  24. Carlsson E., Fredriksson J., Groop L., Ridderstrale M. Variation in the calpain-10 gene is associated with elevated triglyceride levels and reduced adipose tissue messenger ribonucleic acid expression in obese Swedish subjects. J. Clin. Endocrinol. Metab. (2004) 89:3601–3605.[Abstract/Free Full Text]

  25. Shima Y., Nakanishi K., Odawara M., Kobayashi T., Ohta H. Association of the SNP-19 genotype 22 in the calpain-10 gene with elevated body mass index and hemoglobin A1c levels in Japanese. Clin. Chim. Acta (2003) 336:89–96.[CrossRef][Web of Science][Medline]

  26. Hitman G.A., Sudagani J. Searching for genes in diabetes and the metabolic syndrome. Int. J. Clin. Pract. (2004) 143:3–8.

  27. Anderson G.L., Manson J., Wallace R., Lund B., Hall D., Davis S., Shumaker S., Wang C.Y., Stein E., Prentice R.L. Implementation of the Women's Health Initiative study design. Ann. Epidemiol. (2003) 13:S5–S17.[CrossRef][Web of Science][Medline]

  28. The Women's Health Initiative Study Group. Design of the Women's Health Initiative clinical trial and observational study. Control Clin. Trials (1998) 19:61–109.[CrossRef][Web of Science][Medline]

  29. Hao K., Liu S., Niu T. A sparse marker extension tree algorithm for selecting the best set of haplotype tagging single nucleotide polymorphisms. Genet. Epidemiol. (2005) 29:336–352.[CrossRef][Web of Science][Medline]

  30. Wang L., Liu S., Niu T., Xu X. SNPHunter: a bioinformatic software for single nucleotide polymorphism data acquisition and management. BMC Bioinformatics (2005) 6:60.[CrossRef][Medline]

  31. Gabriel S.B., Schaffner S.F., Nguyen H., Moore J.M., Roy J., Blumenstiel B., Higgins J., DeFelice M., Lochner A., Faggart M., et al. The structure of haplotype blocks in the human genome. Science (2002) 296:2225–2229.[Abstract/Free Full Text]

  32. Barrett J.C., Fry B., Maller J., Daly M.J. Haploview: analysis and visualization of LD and haplotype maps. Bioinformatics (2005) 21:263–265.[Abstract/Free Full Text]

  33. de Bakker P.I., Yelensky R., Pe'er I., Gabriel S.B., Daly M.J., Altshuler D. Efficiency and power in genetic association studies. Nat. Genet. (2005) 37:1217–1223.[CrossRef][Web of Science][Medline]

  34. Kraft P., Cox D.G., Paynter R.A., Hunter D., De Vivo I. Accounting for haplotype uncertainty in matched association studies: a comparison of simple and flexible techniques. Genet. Epidemiol. (2005) 28:261–272.[CrossRef][Web of Science][Medline]

  35. Niu T., Qin Z.S., Xu X., Liu J.S. Bayesian haplotype inference for multiple linked single-nucleotide polymorphisms. Am. J. Hum. Genet. (2002) 70:157–169.[CrossRef][Web of Science][Medline]

  36. Stram D.O., Leigh Pearce C., Bretsky P., Freedman M., Hirschhorn J.N., Altshuler D., Kolonel L.N., Henderson B.E., Thomas D.C. Modeling and E–M estimation of haplotype-specific relative risks from genotype data for a case–control study of unrelated individuals. Hum. Hered. (2003) 55:179–190.[CrossRef][Web of Science][Medline]

  37. Benjamini Y., Drai D., Elmer G., Kafkafi N., Golani I. Controlling the false discovery rate in behavior genetics research. Behav. Brain Res. (2001) 125:279–284.[CrossRef][Web of Science][Medline]


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
DiabetesHome page
Y.-H. Hsu, T. Niu, Y. Song, L. Tinker, L. H. Kuller, and S. Liu
Genetic Variants in the UCP2-UCP3 Gene Cluster and Risk of Diabetes in the Women's Health Initiative Observational Study
Diabetes, April 1, 2008; 57(4): 1101 - 1107.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow All Versions of this Article:
16/23/2960    most recent
ddm256v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (1)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Song, Y.
Right arrow Articles by Liu, S.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Song, Y.
Right arrow Articles by Liu, S.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?