Human Molecular Genetics Advance Access originally published online on April 17, 2008
Human Molecular Genetics 2008 17(14):2228-2237; doi:10.1093/hmg/ddn123
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Interaction between a novel TGFB1 haplotype and CFTR genotype is associated with improved lung function in cystic fibrosis
1 McKusick-Nathans Institute of Genetic Medicine 2 Division of Pediatric Endocrinology, Johns Hopkins University School of Medicine, Baltimore, MD, USA 3 Department of Pediatrics, National Naval Medical Center, Bethesda, MD, USA 4 Department of Human Genetics, Emory University School of Medicine, Atlanta, GA, USA
* To whom correspondence should be addressed at: McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University School of Medicine, 733 N. Broadway, Broadway Research Building 559, Baltimore, MD 21205, USA. Tel: +1 4109551773; Fax: +1 4106140213; Email: gcutting{at}jhmi.edu
Received December 28, 2007; Accepted April 11, 2008
| ABSTRACT |
|---|
|
|
|---|
Cystic fibrosis (CF), the most common lethal single gene disorder in Caucasians, is due to mutations in the CFTR gene. Twin and sibling analysis indicates that modifier genes, rather than allelic variation in CFTR, are responsible for most of the variability in severity of lung disease, the major cause of mortality in CF patients. We used a family-based approach to test for association between lung function and two functional SNPs (rs1800469, –509 and rs1982073, codon 10) in the 5' region of transforming growth factor-beta1 (TGFB1), a putative CF modifier gene. Quantitative transmission disequilibrium testing of 472 CF patient–parent–parent trios revealed that both TGFB1 SNPs showed significant transmission distortion when patients were stratified by CFTR genotype. Although lung function and nutritional status are correlated in CF patients, there was no evidence of association between the TGFB1 SNPs and variation in nutritional status. Additional tagging SNPs (rs8179181, rs2278422, rs8110090, rs4803455 and rs1982072) that capture most of the diversity in TGFB1 were also typed but none showed association with variation in lung function. However, a haplotype composed of the –509 C and codon 10 T alleles along with the C allele of the 3' SNP rs8179181 was highly associated with increased lung function in patients grouped by CFTR genotype. These results demonstrate that TGFB1 is a modifier of CF lung disease and reveal a previously unrecognized beneficial effect of TGFB1 variants upon the pulmonary phenotype.
| INTRODUCTION |
|---|
|
|
|---|
Cystic fibrosis (CF [MIM 602421 [OMIM] ]) is a common autosomal recessive genetic disorder caused by mutations in the CF transmembrane conductance regulator (CFTR) gene. There is a high degree of phenotypic variability in CF, especially in lung disease, the major cause of morbidity and mortality. Differences in lung disease severity are not entirely explained by CFTR genotype, as illustrated by the great extent of variability in patients homozygous for the most common mutation,
F508 (1,2). Environmental factors such as infection with bacterial pathogens contribute to lung disease variation (3). However, affected twins and siblings demonstrate that variation in lung disease severity also has a strong genetic component, with heritability estimated at 0.6–0.8 (4). These observations indicate that modifier genes make a substantial contribution to CF lung disease, independent of CFTR genotype. The moderate degree of similarity of CF lung disease severity between siblings suggests that a number of modifier genes are operating. These genes may individually be of small effect, requiring multiple replications to substantiate their relevance to CF lung disease. Indeed, numerous modifier genes have been examined, but few have withstood the test of replication (5). The most compelling biological candidate is transforming growth factor-beta1 (TGFB1). The TGFB1 gene product is a secreted protein with numerous functions including involvement in cellular growth and differentiation, inflammation, and tissue fibrosis (6–8). Although several case–control studies have investigated TGFB1 as a modifier of CF lung disease, the results have been conflicting (9–12). Discrepancies among these case–control studies may be due to population stratification, confounding of the studied trait by other phenotypes, interaction among genes, undetected variants in the candidate gene that affect lung function or differences in power. The US CF Twin and Sibling Study has recruited twins and siblings affected with CF and their parents to evaluate association between candidate modifier genes and variation in lung function. Use of these families permits the employment of transmission-based methods that avoid spurious associations due to population stratification. Patients were recruited only on the condition that they had a living sibling with CF, thereby removing potential bias inherent in recruiting based on extreme phenotypes and facilitating the analysis of the confounding effect of one trait upon another. Additionally, patients bearing all CFTR genotypes were recruited, allowing the investigation of gene–gene interaction between modifier genes and CFTR. Whenever possible, the CF Twin and Sibling Study enrolled both parents of each patient to facilitate the construction of haplotypes and the search for occult variation in candidate genes that modify CF lung disease. Finally, the unfortunate relative commonness of CF and the presence of a highly organized nationwide CF care system enabled recruitment of hundreds of families, thus providing reasonable power for transmission-based association studies.
| RESULTS |
|---|
|
|
|---|
Association of TGFB1 alleles with variation in lung function is dependent upon CFTR genotype
Patients in this study were similar for age, CFTR genotype distribution, body mass index (BMI) and sex ratio compared with CF patients in the US CF Patient Registry (Table 1) (13). Since lung disease is progressive in CF patients, pulmonary function measures of the forced expiratory volume in 1 s (FEV1) were converted into disease-specific percentiles (FEV1CF%) to facilitate comparisons among patients of different ages and sex (14). The mean percentiles of the CF-specific lung function measures were higher in our cohort than in unrelated patients (i.e. >0.5); however, the entire spectrum of disease severity was represented. Quantitative transmission disequilibrium tests (QTDT) were performed on patient–parent–parent trios to evaluate association between polymorphisms in TGFB1 and lung function in CF patients. QTDT tests for transmission distortion, or a skewing of the expected 50/50 ratio of alleles transmitted from heterozygous parents to offspring, across the spectrum of a quantitative trait. Though Z-scores of the cross-sectional (MaxFEV1CF%) and longitudinal (AvgFEV1CF%) lung function measures were employed, the results were essentially identical when non-transformed traits were used (data not shown). Two SNPs in TGFB1 (rs1800469, –509 and rs1982073, codon 10) that have shown variable association with lung function in case–control studies were typed in 472 trios (genotype frequency data provided in Supplementary Material, Table S1). The –509 SNP showed transmission distortion in patients stratified by the cross-sectional measure (P = 0.011, achieving study-wide statistical significance) while transmission distortion of the codon 10 SNP approached significance (P = 0.059; Table 2). No difference in the parental origin of transmitted codon 10 alleles was observed, in contrast to the report by Becker et al. (15) (data not shown). These studies confirm that TGFB1 is a modifier of lung function in patients recruited by the CF Twin and Sibling Study.
|
|
To determine if CFTR genotype influenced TGFB1 effect upon lung function, patients were divided into two groups of nearly equal size: those who were homozygous for the common
F508 mutation (
F508 homozygotes) and all other genotypes (non-
F508 homozygotes). Eight patients with unknown CFTR genotype were excluded from CFTR-genotype-specific analyses. The
F508 homozygote and non-
F508 homozygote groups had very similar means and standard deviations for cross-sectional (0.68 ± 0.28, 0.69 ± 0.26, respectively) and longitudinal lung function (0.59 ± 0.24, both groups). When stratified by CFTR genotype it became apparent that non-
F508 homozygotes were predominantly responsible for the observed transmission distortion of both SNPs (Table 2). The –509 SNP demonstrated study-wide significant association with the cross-sectional and the longitudinal measure of lung function in non-
F508 homozygotes. Association between the codon 10 SNP and the cross-sectional measure of lung function also achieved study-wide significance, and almost attained study-wide significance for the longitudinal measure of lung function in non-
F508 homozygotes (Table 2).
Nutritional status, as measured by BMI, is known to be correlated with lung function measures in CF patients (16,17). As expected, the average of Z-scores for BMI between ages 2–20 (AvgBMIZ) was correlated with both cross-sectional and longitudinal lung function measures (Pearson's R = 0.36 and 0.44, respectively; P < 0.0001). QTDT performed in 457 trios in which both pulmonary and BMI data were available revealed that neither TGFB1 SNP showed association with AvgBMIZ (data not shown). To further examine whether TGFB1 alleles were associated with variation in lung function or a composite of both lung function and BMI, we adjusted for the confounding effect of BMI upon lung function using linear regression. Association between the TGFB1 SNPs and the cross-sectional and longitudinal pulmonary traits adjusted for BMI (see Materials and Methods) was analyzed by QTDT. The results were nearly identical to those for the unadjusted lung function measures, demonstrating that variation in nutritional status did not account for the association between TGFB1 alleles and lung function (Table 2, BMI-adjusted columns). As noted previously, transmission distortion of both the –509 and codon 10 SNPs was present only in non-
F508 homozygotes.
Transmission analysis of additional SNPs identifies TGFB1 haplotypes that are associated with variation in lung function in CF patients
To determine if certain combinations of variants in or near the TGFB1 gene (i.e. haplotypes) were associated with lung function measures, an additional eight SNPs (rs8179181, rs28730295, rs2278422, rs8110090, rs11466334, rs1800472, rs4803455, rs1982072) were typed in 445 trios with pulmonary data. The allele and genotype frequencies observed in our population are similar to the frequencies reported in other Caucasian subjects (data provided in Supplementary Material, Table S1). A monomorphic SNP (rs28730295) and two SNPs with minor allele frequency (MAF) below 0.05 (rs11466334, rs1800472) were dropped from further analysis. None of these additional SNPs were individually associated with lung function measures in CF patients (data not shown), except for rs1982072, which was in almost complete linkage disequilibrium (LD) with the –509 SNP (r2 = 1.0, D' = 1.0).
Screening tools within PBAT (pedigree-based association testing) software (18) were used to select haplotypes of varying lengths and combinations of seven SNPs (–509, codon 10, rs8179181, rs2278422, rs8110090, rs4803455 and rs1982072) that were most likely to show association with the cross-sectional measure of lung function. Haplotypes comprising –509 and codon 10 showed the highest power, especially in the presence of an additional SNP in intron 5 (rs8179181). Although the –509 and codon 10 SNPs were in strong LD with each other (r2 = 0.65, D' = 0.99), both SNPs were in low LD with the intron 5 SNP (r2 < 0.01; D' = 0.10, 0.22; respectively). The latter finding suggested that the improved power upon inclusion of the intron 5 SNP is not due to a high degree of ancestral correlation with the –509 and codon 10 SNPs, but that specific haplotypes composed of all three SNPs have greater association with variation in lung function than any SNP alone.
To test the above hypothesis, haplotypes composed of two SNPs (–509 and codon 10) and three SNPs (–509, codon 10 and intron 5; haplotype frequencies provided in Supplementary Material, Table S2) were tested for association with the cross-sectional lung function measure using QTDT and a second method, Family-Based Association Testing (FBAT). In the group of All CFTR genotypes, the –509 T–codon 10 C (T–C) haplotype was significantly under-transmitted to patients with increasing lung function in both analyses (i.e. associated with more severe lung disease as indicated by the negative Z-statistic in FBAT; Table 3). The TGFB1 haplotypes also demonstrated interaction with CFTR genotype, as was observed for individual SNPs. In non-
F508 homozygotes, the T–C haplotype was under-transmitted with increasing lung function, as noted for the entire group. Intriguingly, when the T–C haplotype was divided into two 3-SNP haplotypes based on the allele present at intron 5, the –509 T–codon 10 C–intron 5 C (T–C–C) haplotype was associated with decreased lung function, while the other haplotype, –509 T–codon 10 C–intron 5 T (T–C–T), showed no transmission distortion, even though the latter haplotype also contains a T at –509 and a C at codon 10. On the other hand, the –509 C–codon 10 T (C–T) haplotype was over-transmitted (i.e. associated with milder lung disease) in non-
F508 homozygotes, as shown by the positive Z-statistic. When the intron 5 SNP was included, the three-SNP haplotype –509 C–codon 10 T–intron 5 C (C–T–C) was over-transmitted to non-
F508 homozygous patients with improved lung function in both analyses while the –509 C–codon 10 T–intron 5 T (C–T–T) haplotype showed no transmission distortion (Table 3). Thus, haplotype transmission analysis provides further evidence that the association of genetic variants in TGFB1 with CF lung function is better defined by haplotypes rather than individual alleles.
|
One TGFB1 haplotype is correlated with improved lung function in non-
F508 homozygotesTo assess the relative magnitude of the effect of individual TGFB1 haplotypes on the cross-sectional measure of lung function, we performed linear regression analysis under the assumption of additive, dominant and recessive modes of inheritance (MOIs). A measure of goodness-of-fit [Akaike's information criterion (AIC)] was calculated to compare models (19). For both univariate and multivariate regression, models which coded haplotypes according to additive or dominant MOIs had better fits than a model employing a recessive MOI. Since the resulting association estimates were qualitatively similar in both additive and dominant scenarios, only the latter results are shown. Results derived from the use of additive or recessive MOIs are available upon request. In the univariate analysis, only the C–T–C haplotype had a significant relationship with lung function in the complete group of patients (All CFTR genotypes; Table 4). The positive value of the regression coefficient (β) associated with the C–T–C haplotype indicates that this haplotype is associated with better lung function in CF patients. In contrast, the C–T–T haplotype demonstrated no association with variation in lung function, despite sharing with the C–T–C haplotype a C at –509 and a T at codon 10 (Table 4). Furthermore, the regression coefficient associated with the C–T–T haplotype has a negative value, as opposed to the positive value for the C–T–C haplotype. In non-
F508 homozygous patients, the C–T–C haplotype was associated with an increase in lung function of 15.9 percentile points, while the T–C–C haplotype was associated with a decrease in lung function of 11.9 percentile points. Neither the C–T–T haplotype nor the T–C–T haplotype demonstrated association with variation in lung function, despite having the same alleles at –509 and codon 10 as the C–T–C and T–C–C haplotypes, respectively (Table 4).
|
To verify that the magnitude of the effect of the C–T–C haplotype (15.9 percentile points) was greater than the effect of individual SNPs, we performed linear regression on single alleles in 232 non-
F508 homozygotes. As expected, the regression coefficients were lower, at 12.0 (P = 0.045) and 13.2 percentile points (P = 0.009) for the –509 C and codon 10 T alleles, respectively. Additionally, the –509 C–codon 10 T (C–T) haplotype increased lung function by only 12.0 percentile points (P = 0.019, n = 218). These results demonstrate that the C allele of the intron 5 SNP parses the ancestral haplotype block of –509 and codon 10 SNPs into 3-SNP haplotypes that show enhanced effect upon lung function.
The use of univariate regression described earlier allowed us to evaluate the contribution of individual haplotypes to variation in lung disease; however, this modeling could not account for the confounding effect of one haplotype upon another. For example, association of the C–T–C haplotype with increased lung function might simply be reflecting an absence of the T–C–C haplotype. Indeed, as these are the two most common haplotypes in our population, patients who do not carry the former haplotype are likely to carry the latter haplotype. To address this issue, we performed multivariate linear regression to determine the effect of TGFB1 C–T–C and T–C–C haplotypes (the two haplotypes that showed association by univariate analysis) upon lung function while simultaneously accounting for the effects of other haplotypes. When the relationship between lung function and both the C–T–C and the T–C–C haplotypes was modeled in non-
F508 homozygotes, the C–T–C haplotype continued to have a positive influence on CF lung disease while the T–C–C haplotype no longer showed a relationship with lung function (Table 4). Patients who carried the former haplotype had an estimated increase in lung function of 12.6 percentile points compared to patients who did not carry this haplotype. This analysis implies that non-
F508 homozygote CF patients with at least one C–T–C haplotype will have better lung function than patients carrying any other combination of the three SNPs that comprise this haplotype (i.e. T–C–C, T–C–T, C–T–T and C–C–C).
To confirm this conclusion, we plotted and compared the cross-sectional lung function of patients bearing different combinations of TGFB1 haplotypes (Fig. 1).
|
Non-
F508 homozygous patients carrying at least one C–T–C haplotype had significantly higher median lung function (MaxFEV1CF% = 0.84, n = 106) than patients carrying the T–C–C haplotype (MaxFEV1CF% = 0.58, n = 51; P < 0.0001, Mann–Whitney U test). Of particular note, patients bearing both the C–T–C and T–C–C haplotypes (i.e. C–T–C/T–C–C heterozygotes) had higher lung function (median MaxFEV1CF% = 0.79, n = 42) compared with those carrying the T–C–C haplotype in the absence of the C–T–C haplotype (P = 0.006, Mann–Whitney U test). This demonstrates the positive influence of the C–T–C haplotype on lung function and argues against a negative influence of the T–C–C haplotype. There was no difference in lung function between C–T–C/T–C–C heterozygotes and patients bearing a C–T–C haplotype in trans with a non-T–C–C haplotype. These results were unchanged when patients carrying no C–T–C or T–C–C haplotypes (i.e. C–T–T, T–C–T, C–C–C; n = 14) were also included in the group of T–C–C-bearing patients (data not shown). Thus, the C–T–C haplotype manifests a dominant positive effect on lung function in non-
F508 homozygotes. | DISCUSSION |
|---|
|
|
|---|
Using family-based transmission analysis, we have shown that variation in the TGFB1 gene is associated with variability in lung function in CF patients. More importantly, our study design facilitated the discovery of three new attributes of the modifier effect of TGFB1. First, our analyses revealed interaction between TGFB1 and CFTR, as the effect of variation in TGFB1 on lung function was primarily observed in patients who were not homozygous for
F508. Secondly, variants in TGFB1 were shown to modify lung function and not nutritional status, despite strong correlation between these two traits. Thirdly, transmission and regression analyses revealed a TGFB1 haplotype that is associated with better lung function, and hence mild lung disease, an observation of substantial therapeutic potential.
Several studies have demonstrated association between SNPs in TGFB1 and variation in CF lung disease, while others have not. There are a number of key differences among these studies including design, power, definition of the pulmonary phenotype and assignment of affection status. The best-powered case–control study to date defined severe and mild lung disease as having forced expiratory volume in 1 s (FEV1) measurements in the lowest or highest quartiles for age, respectively. Of 808
F508 homozygous patients, those with severe lung disease were two times more likely to have the –509 TT and codon 10 CC genotypes than those with mildly impaired lung function. The authors replicated this finding in a second population of 498 CF patients, of whom 70% were
F508 homozygotes and 30% had other severe CFTR genotypes associated with poorer clinical outcomes. Interestingly, this association was not observed in the sub-group made up of
F508 homozygotes only, suggesting that TGFB1 variants had greater modifying capacity in non-
F508 homozygotes as seen in our population. In contrast, Arkwright et al. (9) showed in 171
F508 homozygotes that the codon 10 TT genotype was associated with an earlier decline in lung function. Brazova et al. (12) did not see association of TGFB1 SNPs with variation in lung function in a study of 118 Czech CF patients, of whom about half were
F508 homozygotes and half had other severe CFTR genotypes, and 268 control subjects. More recently, a study examined the rate of decline in lung function in 511 Canadian CF patients stratified by codon 10 genotype. Though a significant difference in the rate of decline was observed between the three genotype groups (CC, CT, TT), the pattern was not entirely consistent with previous reports as this study found that codon 10 heterozygotes (CT) had the smallest annual decline in lung function (20). In the only other family-based study, transmission distortion of –509 and codon 10 alleles at TGFB1 was not observed in 34 pairs of extreme concordant and discordant
F508 homozygote siblings using a composite measure of lung function and BMI. The current study of 472 trios demonstrated that the –509 C and codon 10 T alleles had a beneficial modifier effect on cross-sectional and longitudinal measures of lung function based upon FEV1. From our results we infer that the absence of the –509 C and codon 10 T alleles should be associated with reduced lung function in CF patients. Indeed, the case–control study of more than 1300 unrelated patients showed that the presence of the alternate alleles (–509 T and codon 10 C) was associated with lower FEV1, fundamentally the same measure of lung function used in this study. Thus, the results of our family-based study are consistent with the association observed in the the case–control study conducted by Drumm et al. (10).
The observations of the current study differ from all previous association studies in that the modifier effect of TGFB1 was dependent upon CFTR genotype. To reduce genetic heterogeneity at the CFTR locus, prior studies primarily tested for association of TGFB1 with lung function in patients who were homozygous for the most common CF mutation,
F508. We detected association in patients who were not homozygous for
F508, but association was not observed in
F508 homozygotes. This disparity does not appear to be a function of power since the two groups had nearly identical numbers of patients and had similar means and variances in lung function. We favor the concept that the magnitude of the modifier effect conferred by TGFB1 is smaller in
F508 homozygotes than in CF patients carrying other CFTR mutations. By analyzing
F508 homozygous patients with extreme phenotypes, the study by Drumm et al. (10) had 80% power to detect modifiers that altered lung function by 0.7%. The 232 trios homozygous for
F508 in our study had only 11% power to detect the same change in lung function. However, this number of trios did have reasonable power (80%) to detect modifiers that account for
11% of the total variance in lung function, an effect size comparable with what was observed in the non-
F508 homozygous trios. These findings suggest that having two copies of
F508 creates a unique lung disease environment that is less responsive to variation in TGFB1. On the other hand, other CFTR mutations may lead to lung pathology that is more amenable to alteration by modifiers such as TGFB1. The latter concept is supported by the observation that the modifier effect of the glutamate–cysteine ligase catalytic subunit (GCLC) gene upon severity of lung disease could be observed only in CF patients with mild CFTR alleles (21).
Patients were enrolled in the CF Twin and Sibling Study based on having a living sibling with CF. Since patients were drawn from the entire spectrum of CF phenotypes, we were able to test for association of TGFB1 alleles with traits that are correlated with lung function, such as nutritional status. Due to the ascertainment of only patients with extreme phenotypes, prior studies could not distinguish with certainty whether TGFB1 variants were associated with lung disease severity, nutritional status or both. The absence of association with a longitudinal measure of nutritional status and the presence of association with lung function after adjustment for differences in nutritional status indicates that variation in TGFB1 primarily modifies CF lung disease. This observation suggests that the lung should be the focus of studies examining the mechanism of the TGFB1 modifier effect in CF.
Haplotype analysis is a powerful tool for discovering causal alleles that are highly associated with typed markers and for determining whether combinations of alleles lead to a greater effect upon phenotype than is caused by individual alleles. While single alleles have been studied as modifiers in CF, TGFB1 haplotypes have not previously been explored. The TGFB1 intron 5 SNP (rs8179181) by itself was not associated with variation in lung function in this study. However, when the C allele of the intron 5 SNP occurred on the same haplotype as –509 C and codon 10 T, this 3-SNP haplotype (C–T–C) was shown to correlate with improved lung function in non-
F508 homozygous patients. The intron 5 SNP has no functional role of which we are aware; thus we propose that the –509 C–codon 10 T–intron 5 C haplotype contains a variant (or variants) that modulates TGFB1 expression or function such that a protective outcome is conferred on CF lungs. Another possible mechanism is that the codon 10 T and/or –509 C alleles in combination with an additional variant on this haplotype are necessary to ameliorate CF lung disease. The requirement of patients in this study to have a surviving affected sibling probably accounts for the increased mean of cross-sectional and longitudinal lung function measures compared with the CF population mean (Table 1). The bias of the patients in this study toward better lung function may have aided in the detection of the mild TGFB1 C–T–C haplotype.
Variation in TGFB1 has been linked to several chronic pulmonary disorders in addition to CF. For example, the –509 TT genotype was shown to associate with the severity and diagnosis of asthma (22,23), similar to the observations in this study of CF. In contrast, the –509 T and codon 10 C alleles were found to be protective against chronic obstructive pulmonary disease (24–26). These observations suggest that the mechanism of TGFB1 action upon lung function is context-specific. The TGFB1 –509 T and codon 10 C alleles have been linked to higher gene and protein expression than the alternate alleles at these loci (27–32). TGFB1 is known to promote fibrogenesis by stimulating extracellular matrix production (33) and by inhibiting matrix degradation (34). Alleles that decrease levels or activity of TGFB1 may be predicted to stimulate an appropriate balance between tissue repair and fibrosis that improves lung function in CF patients. Our discovery that TGFB1 variants associate with mild lung disease in a subset of CF patients (non-
F508 homozygotes) provides new opportunity to test the aforementioned prediction and to possibly develop therapeutics that retard progression of this life-limiting feature of CF.
| MATERIALS AND METHODS |
|---|
|
|
|---|
Study subjects
CF twins and siblings (n = 617) and their parents (n = 606) from 303 families were recruited by the CF Twin and Sibling Study as previously described. Of these families, 88.4% had two children old enough to perform pulmonary function testing, 7.6% had three children and 4.0% had one child. Sixteen dizygous and thirteen monozygous (MZ) twin pairs were included. Blood samples were obtained from patients and parents for standard DNA phenol/chloroform extraction. Raw pulmonary function test data, CFTR genotypes and height and weight measurements were obtained from medical records. In some cases in which genotypes were unavailable, CFTR exons were sequenced to identify mutations. Written informed consent or assent was obtained from all subjects. Only families in which both parents of the patients were available (i.e. complete trios) were included in the present study. Analyses were conducted using data from all patients, as well as using data from subgroups of patients categorized by CFTR genotype: those homozygous for the
F508 mutation (
F508 homozygotes) and those bearing all other genotypes (non-
F508 homozygotes).
Phenotype definition
The forced expiratory volume in 1 s (FEV1), a lung function measure that is highly correlated with survival in CF patients (35,36), was used to derive cross-sectional (MaxFEV1CF%) and longitudinal (AvgFEV1CF%) measures, as previously described. All patients in this study with a longitudinal lung function measure also had a cross-sectional measure. However, 142 patients were too young to have sufficient data for the longitudinal measure. Of the MZ twin pairs in which both twins had pulmonary data, ten pairs had MaxFEV1CF% and four pairs had AvgFEV1CF%. To include as many subjects as possible and to avoid randomly excluding one member of each pair, lung function measures were averaged for MZ twin pairs and included in analyses only if the twins' values were within 10 percentiles of each other, as not to double-count genetically identical individuals. For MZ twin pairs in which only one of the twins had pulmonary data, that twin's data was included. The average of Z-scores for BMI between ages 2 and 20 (AvgBMIZ) was used as a longitudinal marker of nutritional status. Pulmonary phenotypes were adjusted for nutritional status by regressing lung function measures on AvgBMIZ, and then for each individual the product of the regression coefficient and AvgBMIZ was subtracted from the lung function measure.
SNP selection and genotyping
Two SNPs in TGFB1, c.–1347C>T (rs1800469, –509) and c.29T>C (rs1982073, codon 10), were chosen based on the findings of a recent study on modifiers of CF lung disease. These SNPs were genotyped in all individuals using TaqMan Assays-on-Demand and Assays-by-Design, respectively (Applied Biosystems, Foster City, CA). Reactions were performed in 384-well plates in a total reaction volume of 10 µl with 10 ng of template DNA in a Bio-Rad iCycler thermal cycler. Quality control samples were included on each plate. Endpoint fluorescence readings were obtained using an ABI PRISM 7900HT sequence detection system (SDS; Applied Biosystems). Genotype-calling was conducted using SDS v.2.1 software and inheritance checking was performed by SIB-PAIR v.0.99.9 (http://www2.qimr.edu.au/davidD) (37). In addition, –509 genotypes were verified using allele-specific oligonucleotide linear arrays (Roche Molecular Systems, Alameda, CA). The discrepancy rate between these two methods was 0.91%. Unclear or erroneous genotypes were either repeated by TaqMan or sequenced using the BigDye Terminator v.3.1 Cycle Sequencing Kit on an ABI 3100 sequencer (Applied Biosystems).
Tagger (http://www.broad.mit.edu/mpg/tagger) (38), the tag SNP selection algorithm implemented in Haploview (http://www.broad.mit.edu/mpg/haploview) (39), was used to select a minimal set of tag SNPs that adequately represented the genetic diversity within the TGFB1 gene region, based on patterns of LD found in the HapMap CEPH data (40) and Perlegen Caucasian data. The data source at the time of SNP selection was HapMap data release 20/phaseII Jan06 on the NCBI build 35 assembly, dbSNP build 125. Criteria for selection of tag SNPs were an r2 >0.8 with the untyped SNP, an Illumina design score >0.6 and an inter-SNP spacing of no less than 60 bp. Eight tag SNPs (rs8179181, rs28730295, rs2278422, rs8110090, rs11466334, rs1800472, rs4803455, rs1982072) within and 5 kb upstream of the TGFB1 gene were genotyped in 445 trios with pulmonary data using Illumina BeadArray technology (Illumina, San Diego, USA). Allele and genotype frequencies are provided in Supplementary Material, Table S1.
Statistical analyses
Genotype distributions were tested for Hardy–Weinberg equilibrium using the –unrelatedsOnly option in PEDSTATS v.0.6.6 (http://www.sph.umich.edu/csg/abecasis/Pedstats) (41), which performs an exact test in a subset of unrelated individuals, as to avoid bias from correlated genotypes within families. General statistics, linear regression, Mann–Whitney U tests and t-tests were performed in Intercooled Stata 8 (StataCorp, College Station, TX). Because correlation among sibling marker genotypes may invalidate the results of family-based tests of association in the presence of linkage, we tested for linkage of MaxFEV1CF% to the TGFB1 gene region on chromosome 19. Single- and multipoint parametric linkage analysis was performed using all SNPs with MAF > 0.05 and two previously typed short-tandem repeat markers downstream of TGFB1 (D19S400, D19S718) using Sequential Oligogenic Linkage Analysis Routines (SOLAR v.4.0.7; http://www.sfbr.org/solar). No linkage of the cross-sectional measure of lung function to this region was found in CF patients.
Quantitative transmission disequilibrium testing (QTDT v.2.5.0; http://www.sph.umich.edu/csg/abecasis/QTDT) (42) was used to perform family-based tests of LD. The orthogonal model implemented in QTDT was adopted to test for association. To account for multiple testing in the presence of linked polymorphisms, empirical P-values were calculated from 1000 Monte-Carlo permutations using the -m option. Though single-test P-values are reported, those meeting the threshold for a global empirical significance level of 0.05 are denoted by bold font.
Data analysis tools for continuous traits implemented within PBAT software (http://www.biostat.harvard.edu/~clange/default.htm) were used to estimate the power of the data set to detect an association between TGFB1 haplotypes and MaxFEV1CF% using the conditional mean model (43) while simultaneously minimizing the number of tests for which to correct. Haplotypes with >80% power (at
= 0.05) were analyzed by QTDT and by a second method, the FBAT program (http://www.biostat.harvard.edu/~fbat/fbat.htm) (44). Single-test P-values meeting the threshold for a global empirical significance level of 0.05 are denoted by bold font and P-values achieving significance after Bonferroni correction are marked with an asterisk. Because QTDT does not internally generate haplotypes, PHASE v.2.1 (http://stephenslab.uchicago.edu/software.html) (45) was used to construct haplotypes from the trio data via the -P1 option. Patients with recombinant or ambiguous haplotypes were excluded. The constructed haplotypes were treated as individual alleles in QTDT. The hbat command implemented within FBAT was employed to construct and test haplotypes comprising specified SNPs. The -p option in FBAT was employed to compute empirical P-values from Monte-Carlo permutations and also to perform the minimal p test, which calculates the significance of the smallest P-value. Since the family-based association tests employed (QTDT, PBAT and FBAT) have optimal power when traits are normally distributed, all quantitative phenotypes were also ranked and converted to Z-scores using an inverse normal transformation.
For univariate and multivariate linear regression of the cross-sectional lung function measure (MaxFEV1CF%) on TGFB1 variants, alleles or haplotypes were coded in an additive, dominant or recessive fashion to determine the most likely MOI. For the various models, alleles or haplotypes were coded as follows: additive, 0 (zero copies of the allele/haplotype present), 1 (one copy, i.e. heterozygous) or 2 (i.e. homozygous); dominant, 0 (zero copies) or 1 (at least one copy); recessive, 0 (zero or one copies) or 1 (two copies). AIC, a measure of the goodness-of-fit of an estimated statistical model, was used to compare models. Power analysis was performed using the online Genetic Power Calculator (http://pngu.mgh.harvard.edu/~purcell/gpc/) (46).
| SUPPLEMENTARY MATERIAL |
|---|
|
|
|---|
Supplementary Material is available at HMG Online.
| FUNDING |
|---|
|
|
|---|
This work was supported by the National Institutes of Health (HL68927) and the Cystic Fibrosis Foundation (CUTTIN06P0).
| ACKNOWLEDGEMENTS |
|---|
The authors sincerely thank the many CF patients and their families, research coordinators, nurses and physicians involved with the CF Twin and Sibling Study, as this work would not have been possible without their participation. We also thank Nulang Wang for DNA extraction and CFTR genotyping, Michal Kulich, Ph.D. for providing conversion programs for CF-specific percentiles for FEV1 and John McGready, Ph.D. for helpful statistical discussions.
Conflict of Interest statement. None declared.
| REFERENCES |
|---|
|
|
|---|
- Kerem E., Corey M., Kerem B.-S., Rommens J., Markiewicz D., Levison H., Tsui L.C., Durie P. The relation between genotype and phenotype in cystic fibrosis–analysis of the most common mutation (deltaF508). N. Engl. J. Med. (1990) 323:1517–1522.[Abstract]
- Burke W., Aitken M.L., Chen S.H., Scott C.R. Variable severity of pulmonary disease in adults with identical cystic fibrosis mutations. Chest (1992) 102:506–509.[CrossRef][Web of Science][Medline]
- Kerem E., Corey M., Gold R., Levison H. Pulmonary function and clinical course in patients with CF after pulmonary colonization with Pseudomonas aeruginosa. J. Pediatr (1990) 116:714–719.[CrossRef][Web of Science][Medline]
-
Vanscoy L.L., Blackman S.M., Collaco J.M., Bowers A., Lai T., Naughton K., Algire M., McWilliams R., Beck S., Hoover-Fong J., et al. Heritability of lung disease severity in cystic fibrosis. Am. J. Respir. Crit. Care Med (2007) 175:1036–1043.
[Abstract/Free Full Text] - Cutting G.R. Modifier genetics: cystic fibrosis. Annu. Rev. Genomics Hum. Genet (2005) 6:237–260.[CrossRef][Web of Science][Medline]
-
Blobe G.C., Schiemann W.P., Lodish H.F. Role of transforming growth factor beta in human disease. N. Engl. J. Med (2000) 342:1350–1358.
[Free Full Text] - Bartram U., Speer C.P. The role of transforming growth factor beta in lung development and disease. Chest (2004) 125:754–765.[CrossRef][Web of Science][Medline]
-
Sheppard D. Transforming growth factor beta: a central modulator of pulmonary and airway inflammation and fibrosis. Proc. Am. Thorac. Soc (2006) 3:413–417.
[Abstract/Free Full Text] -
Arkwright P.D., Laurie S., Super M., Pravica V., Schwarz M.J., Webb A.K., Hutchinson I.V. TGF-beta(1) genotype and accelerated decline in lung function of patients with cystic fibrosis. Thorax (2000) 55:459–462.
[Abstract/Free Full Text] -
Drumm M.L., Konstan M.W., Schluchter M.D., Handler A., Pace R., Zou F., Zariwala M., Fargo D., Xu A., Dunn J.M., et al. Genetic modifiers of lung disease in cystic fibrosis. N. Engl. J. Med (2005) 353:1443–1453.
[Abstract/Free Full Text] -
Stanke F., Tummler B., Becker T. Genetic modifiers in cystic fibrosis. N. Engl. J. Med (2006) 354:88–90.
[Free Full Text] - Brazova J., Sismova K., Vavrova V., Bartosova J., Macek M. Jr, Lauschman H., Sediva A. Polymorphisms of TGF-beta1 in cystic fibrosis patients. Clin. Immunol (2006) 121:350–357.[CrossRef][Web of Science][Medline]
- Cystic Fibrosis Foundation. Cystic Fibrosis Foundation Patient Registry Annual Data Report 2005. (2005).
-
Kulich M., Rosenfeld M., Campbell J., Kronmal R., Gibson R.L., Goss C.H., Ramsey B. Disease-specific reference equations for lung function in patients with cystic fibrosis. Am. J. Respir. Crit. Care Med (2005) 172:885–891.
[Abstract/Free Full Text] - Becker T., Jansen S., Tamm S., Wienker T.F., Tummler B., Stanke F. Transmission ratio distortion and maternal effects confound the analysis of modulators of cystic fibrosis disease severity on 19q13. Eur. J. Hum. Genet (2007) 15:774–778.[CrossRef][Web of Science][Medline]
- Konstan M.W., Butler S.M., Wohl M.E., Stoddard M., Matousek R., Wagener J.S., Johnson C.A., Morgan W.J. Growth and nutritional indexes in early life predict pulmonary function in cystic fibrosis. J. Pediatr (2003) 142:624–630.[CrossRef][Web of Science][Medline]
- Milla C.E. Association of nutritional status and pulmonary function in children with cystic fibrosis. Curr. Opin. Pulm. Med (2004) 10:505–509.[CrossRef][Web of Science][Medline]
- Lange C., DeMeo D., Silverman E.K., Weiss S.T., Laird N.M. PBAT: tools for family-based association studies. Am. J. Hum. Genet (2004) 74:367–369.[CrossRef][Web of Science][Medline]
- Akaike H. A new look at the statistical model identification. IEEE Trans. Biomed. Eng (1974) 19:716–719.
- Dorfman R., Sandford A., Taylor C., Huang B., Frangolias D., Wang Y., Sang R., Pereira L., Sun L., Berthiaume Y., et al. Complex two-gene modulation of lung disease severity in children with cystic fibrosis. J. Clin. Invest (2008) 118:1040–1049.[Web of Science][Medline]
-
McKone E.F., Shao J., Frangolias D.D., Keener C.L., Shephard C.A., Farin F.M., Tonelli M.R., Pare P.D., Sandford A.J., Aitken M.L., et al. Variants in the glutamate-cysteine-ligase gene are associated with cystic fibrosis lung disease. Am. J. Respir. Crit. Care Med (2006) 174:415–419.
[Abstract/Free Full Text] - Pulleyn L.J., Newton R., Adcock I.M., Barnes P.J. TGFbeta1 allele association with asthma severity. Hum. Genet (2001) 109:623–627.[CrossRef][Web of Science][Medline]
-
Silverman E.S., Palmer L.J., Subramaniam V., Hallock A., Mathew S., Vallone J., Faffe D.S., Shikanai T., Raby B.A., Weiss S.T., et al. Transforming growth factor-beta1 promoter polymorphism C-509T is associated with asthma. Am. J. Respir. Crit. Care Med (2004) 169:214–219.
[Abstract/Free Full Text] -
Wu L., Chau J., Young R.P., Pokorny V., Mills G.D., Hopkins R., McLean L., Black P.N. Transforming growth factor-beta1 genotype and susceptibility to chronic obstructive pulmonary disease. Thorax (2004) 59:126–129.
[Abstract/Free Full Text] -
Celedon J.C., Lange C., Raby B.A., Litonjua A.A., Palmer L.J., Demeo D.L., Reilly J.J., Kwiatkowski D.J., Chapman H.A., Laird N., et al. The transforming growth factor-beta1 (TGFB1) gene is associated with chronic obstructive pulmonary disease (COPD). Hum. Mol. Genet (2004) 13:1649–1656.
[Abstract/Free Full Text] - van Diemen C.C., Postma D.S., Vonk J.M., Bruinenberg M., Nolte I.M., Boezen H.M. Decorin and TGF-beta1 polymorphisms and development of COPD in a general population. Respir. Res (2006) 7:89.[CrossRef][Medline]
- Yamada Y., Miyauchi A., Goto J., Takagi Y., Okuizumi H., Kanematsu M., Hase M., Takai H., Harada A., Ikeda K. Association of a polymorphism of the transforming growth factor-beta1 gene with genetic susceptibility to osteoporosis in postmenopausal Japanese women. J. Bone Miner. Res (1998) 13:1569–1576.[CrossRef][Web of Science][Medline]
-
Grainger D.J., Heathcote K., Chiano M., Snieder H., Kemp P.R., Metcalfe J.C., Carter N.D., Spector T.D. Genetic control of the circulating concentration of transforming growth factor type beta1. Hum. Mol. Genet (1999) 8:93–97.
[Abstract/Free Full Text] -
Suthanthiran M., Li B., Song J.O., Ding R., Sharma V.K., Schwartz J.E., August P. Transforming growth factor-beta 1 hyperexpression in African-American hypertensives: a novel mediator of hypertension and/or target organ damage. Proc. Natl Acad. Sci. USA (2000) 97:3479–3484.
[Abstract/Free Full Text] - Luedecking E.K., DeKosky S.T., Mehdi H., Ganguli M., Kamboh M.I. Analysis of genetic polymorphisms in the transforming growth factor-beta1 gene and the risk of Alzheimer's disease. Hum. Genet (2000) 106:565–569.[CrossRef][Web of Science][Medline]
-
Dunning A.M., Ellis P.D., McBride S., Kirschenlohr H.L., Healey C.S., Kemp P.R., Luben R.N., Chang-Claude J., Mannermaa A., Kataja V., et al. A transforming growth factorbeta1 signal peptide variant increases secretion in vitro and is associated with increased incidence of invasive breast cancer. Cancer Res (2003) 63:2610–2615.
[Abstract/Free Full Text] - Shah R., Hurley C.K., Posch P.E. A molecular mechanism for the differential regulation of TGF-beta1 expression due to the common SNP -509C-T (c. -1347C>T). Hum. Genet (2006) 120:461–469.[CrossRef][Web of Science][Medline]
-
Ignotz R.A., Massague J. Transforming growth factor-beta stimulates the expression of fibronectin and collagen and their incorporation into the extracellular matrix. J. Biol. Chem (1986) 261:4337–4345.
[Abstract/Free Full Text] -
Overall C.M., Wrana J.L., Sodek J. Independent regulation of collagenase, 72-kDa progelatinase, and metalloendoproteinase inhibitor expression in human fibroblasts by transforming growth factor-beta. J. Biol. Chem (1989) 264:1860–1869.
[Abstract/Free Full Text] -
Beaty T.H., Cohen B.H., Newill C.A., Menkes H.A., Diamond E.L., Chen C.J. Impaired pulmonary function as a risk factor for mortality. Am. J. Epidemiol (1982) 116:102–113.
[Abstract/Free Full Text] - Kerem E., Reisman J., Corey M., Canny G.J., Levison H. Prediction of mortality in patients with cystic fibrosis. N. Engl. J. Med (1992) 326:1187–1191.[Abstract]
- Duffy D. Sib-pair: a program for non-parametric linkage/association analysis. Am. J. Hum. Genet (1997) 61(suppl.):A197.
- de Bakker P.I., Yelensky R., Pe'er I., Gabriel S.B., Daly M.J., Altshuler D. Efficiency and power in genetic association studies. Nat. Genet (2005) 37:1217–1223.[CrossRef][Web of Science][Medline]
-
Barrett J.C., Fry B., Maller J., Daly M.J. Haploview: analysis and visualization of LD and haplotype maps. Bioinformatics (2005) 21:263–265.
[Abstract/Free Full Text] - The International HapMap Consortium. The International HapMap Project. Nature (2003) 426:789–796.[CrossRef][Medline]
-
Wigginton J.E., Abecasis G.R. PEDSTATS: descriptive statistics, graphics and quality assessment for gene mapping data. Bioinformatics (2005) 21:3445–3447.
[Abstract/Free Full Text] - Abecasis G.R., Cardon L.R., Cookson W.O. A general test of association for quantitative traits in nuclear families. Am. J. Hum. Genet (2000) 66:279–292.[CrossRef][Web of Science][Medline]
- Lange C., DeMeo D., Silverman E.K., Weiss S.T., Laird N.M. Using the noninformative families in family-based association tests: a powerful new testing strategy. Am. J. Hum. Genet (2003) 73:801–811.[CrossRef][Web of Science][Medline]
- Laird N.M., Horvath S., Xu X. Implementing a unified approach to family-based tests of association. Genet. Epidemiol (2000) 19(Suppl. 1):S36–S42.[CrossRef][Web of Science][Medline]
- Stephens M., Smith N.J., Donnelly P. A new statistical method for haplotype reconstruction from population data. Am. J. Hum. Genet (2001) 68:978–989.[CrossRef][Web of Science][Medline]
-
Purcell S., Cherny S.S., Sham P.C. Genetic power calculator: design of linkage and association genetic mapping studies of complex traits. Bioinformatics (2003) 19:149–150.
[Abstract/Free Full Text]
This article has been cited by other articles:
![]() |
J. R. Bartlett, K. J. Friedman, S. C. Ling, R. G. Pace, S. C. Bell, B. Bourke, G. Castaldo, C. Castellani, M. Cipolli, C. Colombo, et al. Genetic Modifiers of Liver Disease in Cystic Fibrosis JAMA, September 9, 2009; 302(10): 1076 - 1083. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. M. Hantash, A. Rebuyon, M. Peng, J. B. Redman, W. Sun, and C. M. Strom Apparent Homozygosity of a Novel Frame Shift Mutation in the CFTR Gene Because of a Large Deletion J. Mol. Diagn., May 1, 2009; 11(3): 253 - 256. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||


