Mosaic methylation of the repeat unit of the human ribosomal RNA genes
Mosaic methylation of the repeat unit of the human ribosomal RNA genesGraham J. R. Brock and Adrian Bird*
Institute of Cell and Molecular Biology, University of Edinburgh, Kings Buildings, Edinburgh EH9 3JR, UK
Received October 22, 1996;Revised and Accepted December 11, 1996
The pattern of methylation in human genes for 18S and 28S ribosomal RNA has been investigated using methylation-sensitive restriction enzymes. We find that the transcribed region of the repeat unit is predominantly unmethylated, in agreement with previous studies. In contrast the non-transcribed spacer, which makes up the majority of the 43 kb repeat unit, is highly methylated in blood cell DNA. The boundaries between methylated and non-methylated domains appear to be relativelysharp, and occur ~1.5 kb upstream of the 5' edge of the proximal promoter and ~1.0 kb downstream of the 3' end of the transcribed region. A small proportion of all repeat units are methylated throughout the transcribed region, and may represent silent genes. The coincidence between the methylation pattern, the transcription pattern and other features of the repeat unit has implications for our understanding of the mechanism by which patterns of DNA methylation are generated.
The human 18S and 28S ribosomal RNA (rRNA) genes are present at ~400 copies per human haploid genome, clustered on the short arms of the five acrocentric chromosomes (1 ). Each gene is part of a 43 kb repeat unit that can be divided into two regions: a 13.3 kb transcribed region which contains the highly conserved genes for 18S, 5.8S and 28S rRNA subunits of the ribosome, and a 30 kb non-transcribed spacer (NTS) (2 ). Repeat unit clusters consist of head-to-tail arrays of ~80 repeats (3 ).
Vertebrate DNA is frequently modified at the dinucleotide CpG by addition of a methyl group to give m5CpG (4 ). The modification can alter the ability of various restriction enzymes to cleave DNA, and this facilitates investigation of methylation at these sites (5 ,6 ). A previous study of methylation levels in the transcribed region of the rDNA repeat of various species demonstrated that the conserved region of the repeat unit exists in a predominantly unmethylated form in DNA from a variety of mammals, including man (7 ). In the mouse, a small fraction of all rDNA repeats are methylated throughout the transcribed region, and indirect evidence has suggested that these correspond to transcriptionally inactive repeats (8 ). The findings in mammals contrast with those in amphibia and fish, where chromosomal rDNA is heavily methylated in both transcribed and non-transcribed regions of the repeat unit (9 ,10 ).
This study began with isolation of fragments of the rDNA spacer from a library of densely methylated DNA fragments. The library was constructed by fractionating human blood DNA over a column that specifically binds to densely methylated DNA (11 ). The frequent occurrence of rDNA fragments in the library was unexpected given previous evidence that the transcribed region of mammalian rDNA is predominantly unmethylated. Further experiments showed that, in fact, human rDNA is mosaic with respect to CpG methylation. The transcribed region is indeed largely methylation-free at testable sites, but the spacer is densely methylated.
A library of densely methylated sequences was constructed after fractionating human blood DNA on a methyl-CpG binding column (G.J.R.Brock, unpublished). Fragments that repeatedly bound to the column were cloned in plasmids. The library was tested by examination of the DNA sequence and methylation status of random inserts. Clones containing p1A7 and p1B12 showed DNA sequence identity with the NTS of human rDNA. Each clone was represented many times in the library. Methylation was analysed by using inserts from the library as probes against Southern blots of genomic DNA that had been digested with the methylation-sensitive enzyme SmaI (CCCGGG) or its methylation-insensitive isoschizomer XmaI. Probes p1A7 and p1B12 both hybridised to very high molecular weight SmaI fragments, around 28 kb, whereas XmaI generated smaller fragments as predicted by the sequence of the spacer (Fig. 1 ). Weaker SmaI bands <27 kb indicated that sites in this part of the repeat unit are occasionally non-methylated. The large size of the major band in the SmaI lanes showed that p1A7 and p1B12 are part of long tracts of DNA in which multiple (at least 14) SmaI/XmaI sites are usually in a methylated state. The equivalent experiment with sperm DNA again showed that the sequence environment of p1A7 and p1B12 is methylated, but the level of methylation is significantly lower than in blood DNA. Most of the rDNA repeats in sperm DNA were digested by SmaI to give fragments <20 kb, and some fragments were of the same size as in the XmaI digested lane.
We have demonstrated a mosaic pattern of CpG methylation in the majority of human rDNA repeat units. The 13.3 kb transcribed region is apparently free of methylation, whereas the NTS which lies between consecutive transcribed regions is highly methylated at ~300 tested CpG sites (Fig. 4 A). The transition between methylated and non-methylated domains appears to be sharp and occurs near the boundaries of the transcribed region. Previous studies have only determined the methylation status of the transcribed region of the repeat unit. In this respect there is a great contrast within the vertebrates. Fish and amphibia have heavily methylated transcribed regions, whereas the mammals (and perhaps birds and reptiles) have transcribed regions that are predominantly non-methylated (7 ). The present results reduce the contrast somewhat, as it now seems likely that a major part of the chromosomal rDNA repeat unit may be methylated in most or all vertebrates.
The sequence of the human rDNA repeat unit shows that the dramatic variation in methylation is matched by less dramatic, but easily detectable, differences in sequence composition. Firstly, base composition fluctuates through the repeat unit from a high average GC content over the transcribed region, to a lower GC content over the NTS. This is apparent from the uneven frequency of GC-rich restriction enzyme sites (Fig. 4 A), and also from a plot of average GC-content across the repeat unit (Fig. 4 B). In some respects the structure of the repeat resembles the large scale isochore structure of the genome in which regions of differing average base composition are juxtaposed (20 ). One possible reason for the GC-richness of the transcribed region is selective pressure to conserve the rRNA precursor sequence. This seems an inadequate explanation, however, as the 5' transcribed spacer evolves quite rapidly (there is little overall sequence similarity between human and rat or mouse in this region) and is therefore not apparently under strong sequence constraint (21 -25 ). In spite of the sequence variation, the GC-rich character of the external transcribed spacer is maintained in mammals (23 ,24 ). Indeed, the human 18S sequence is less GC-rich than the transcribed spacers that flank it (see Fig. 4 B). It is also apparent that the GC-rich domain, like the non-methylated domain, extends outside the region that is transcribed, to include the proximal promoter sequences (see also ref. 26 ) and regions downstream of the transcription termination site. This would not be expected if selection only for GC-rich RNA was operating. It seems more likely that the phenomena that relate lack of methylation to transcription also lead to GC-richness in this domain.
Figure 4. (A) Schematic representation of two consecutive rDNA repeat units showing the frequency of GC-rich sites for three restriction enzymes Bsp1201, EagI and BstUI. The transcription unit is represented by a box with an arrow indicating the start and direction of transcription. The 18S, 5.8S and 28S genes are shown in black. Approximate methylation levels across the repeat are plotted below. (B) A plot of %GC content across the repeat unit based on a window size of 1000 bp and a step of 30 bp. (C) A plot of the frequency of CpG across the repeat based on the observed frequency in a 1000 bp window divided by the expected frequency. The step is 30 bp.
Another prominent feature of the repeat unit sequence is the fluctuation in CpG frequency (observed/expected). In the transcribed region, CpG occurs at near the expected level, but in the NTS there is a significant CpG deficiency (Fig. 4 C). This finding fits well with the observation that methylation occurs in the NTS, as the primary cause of CpG deficiency is the mutability of m5C (26 ). It is possible to see analogies between the non-methylated GC-rich domain of rDNA and CpG islands, which have similar properties (26 ). Both are also associated with transcription, although the details differ. CpG islands usually extend for ~1 kb downstream from the promoter of a gene and rarely encompass the entire transcription unit. Only in cases where the gene is unusually small does it lie entirely within the island (for example, [alpha] globin; see ref. 27 ). In rDNA the non-methylated domain is much larger than a typical CpG island (13 kb) and includes all of the transcription unit. Although the mosaic character of the rDNA repeat unit cannot yet be satisfactorily explained, it is notable that the region that is methylation-free and GC-rich corresponds with parts of the repeat unit that are probably associated with proteins. These are bound to the promoter and termination sites, and pass repeatedly through the transcribed region during transcription. It is conceivable that protein-DNA complexes block access to the DNA methyltransferase. A more radical possibility is that the act of transcription actively demethylates this part of the repeat unit. Preliminary evidence for RNA-mediated demethylation has been reported (28 ). Whatever the mechanism, it is important to keep in mind that the transcription unit of amphibian rDNA is maintained in a heavily methylated state in spite of protein associations and transcription (10 ).
Analysis of amplified ribosomal RNA genes in a human lymphoblastoid cell line has suggested that the methylated genes are transcribed at low levels (29 ). Active and relatively inactive populations of rDNA were also detected in HeLa cells (30 ). In the mouse, it has been shown that a minority of rDNA repeats are methylated in the transcribed region and are probably transcriptionally inactive (8 ). The present work shows a parallel phenomenon in normal human cells, as a small proportion of rDNA from blood is resistant to methylation-sensitive restriction enzymes in the transcribed region of the repeat unit. By analogy with the mouse, the methylated repeats may represent inactive human rRNA genes. There is evidence that lack of methylation at CpG islands depends on the binding of factors that are also required for transcription to occur (31 ,32 ). A similar requirement may apply to rDNA. Inactivity of a gene may invite de novo methylation. Once methylated, the silent repeat unit may be unable to return to transcriptional activity due to the repressive effect of methylation (33 ). In this way CpG methylation may stabilise the repressed state.
DNA derived from blood was extracted using standard protocols (12 ), a gift from L. Strain (MRC Human Genetics Unit, Edinburgh). Restriction enzyme digests were performed according to the manufacturer's instructions (New England Biolabs). As a control for complete digestion, an aliquot was removed and incubated with plasmid DNA (200 ng) that contained sites for the relevant enzyme. The plasmid and an undigested control were then visualised on an agarose gel stained with ethidium bromide to check for complete digestion (data not shown). DNA derived from sperm was extracted by addition of 50 mM EDTA pH 8.0, 1% SDS and 500 [mu]g/ml Proteinase K with overnight incubation at room temperature. This was followed by addition of DTT to 50 mM and proteinase K to 100 [mu]g/ml and further incubation overnight at 50oC (13 ). After phenol/chloroform extraction, the solution was ethanol-precipitated and resuspended in 10 mM Tris pH 7.5, 1 mM EDTA.
In some cases DNA from human blood and sperm was first digested with EcoRV, which has no recognition site within the rDNA repeat, in order to reduce the molecular weight of bulk DNA. The DNA was then redigested with XmaI or its methylation-sensitive isoschizomer SmaI (14 ). Where DNA was first digested with EcoRI or HindIII followed by digestion with methylation-sensitive restriction enzymes, EcoRV was not used. Digested DNA (5 [mu]g/lane) was separated by electrophoresis in 0.8% or 0.5% agarose gels and blotted onto Hybond N+ filters (Amersham) following the manufacturer's protocol. Probes were labelled using the random priming method to incorporate [[alpha]-32P]dCTP (15 ). Hybridisation was performed overnight at 68oC in 0.5 M Na2HPO4 (pH 7.2), 7% SDS with 5% powdered milk added to reduce background signal. After probe p1A7 had been denatured, 30 [mu]g Cot-1 DNA (Gibco BRL) was added, competition was necessary due to the presence of a 100 bp Alu consensus sequence within p1A7 (16 ). Filters were washed three times for 20 min in 0.1* SSC, 0.2% SDS at 68oC (12 ).
Base pair co-ordinates used in this study correspond to those published in the GenBank database for the human ribosomal DNA complete repeating unit (accession number U13369). The coordinates of restriction sites shown in the figures were determined using either the GeneJockey III (Biosoft Cambridge) or DNAStar (DNAStar Incoporated) programs. Locations of restriction sites agree with previously published studies (17 ,18 ). Probes pHsrDNA5.9 and pHsrDNA7.1 comprise EcoRI fragments from coordinates 1 to 5900 bp and from 5900 to 13 000 bp, respectively, cloned into pUC9 (a gift from Rakesh Anand). Probes p1A7 and p1B12 contain MseI fragments from coordinates 17 062 to 17 369 bp and from 35 695 to 36 231 bp cloned into pGem T-vectors (Promega). Probes p1B12 and p1A7 were both obtained during the construction of a library of densely methylated sequences from human blood DNA (G.J.R.Brock, unpublished). Both sequences were used as queries in BLAST (19 ) searches of the databases maintained by the NCBI Bethesda.
We thank Vicky Clark for DNA sequencing, Martin Simmen for computing assistance, Joan Davidson and Aileen Greig for technical help and Christine Struthers for secretarial support. We also thank Donald MacLeod, Susan Tweedie, Xinsheng Nan and Sally Cross for comments on the manuscript. GJRB was supported by a studentship from the Wellcome Trust. The work was supported by the Wellcome Trust and the Howard Hughes Medical Institute.
1 Worton, R.G., Sutherland, J., Sylvester, J.E., Willard, F.H., Bodrug, S., Dube, I., Duff, C., Kean, V., Ray, N.P. and Schmickel, R.D. (1988) Human ribosomal RNA genes: orientation of the tandem array and conservation of the 5'end. Science, 239, 64-68.MEDLINE Abstract
2 Gonzalez, L.I., Wu, S., Li, W., Kuo, A.B. and Svlvester, E.J. (1992) Human ribosomal RNA intergenic spacer sequence. Nucleic Acids Res., 20, 5846-5847.
3 Sakai, K., Ohta, T., Minoshima, S., Kudoh, J., Wang, Y., De Jong, J.P. and Shimizu, N. (1995) Human ribosomal RNA gene cluster: identification of the proximal end containing a novel tandem repeat sequence. Genomics, 26, 521-526.MEDLINE Abstract
4 Gruenbaum, Y., Stein, R., Cedar, H. and Razin, A. (1981) Methylation of CpG sequences in eukaryotic DNA. FEBS Lett., 124, 67-71.MEDLINE Abstract
5 Bird, A.P. (1978) Use of restriction enzymes to study eukaryotic DNA methylation. II: The symmetry of methylated sites supports semi-conservative copying of the methylation pattern. J. Mol. Biol., 118, 48-60.
6 Bickmore, W. and Bird, A.P. (1992) The use of restriction endonculeases to detect and isolate genes from mammalian cells. Methods Enzymol., 216, 224-244.MEDLINE Abstract
7 Bird, A.P. and Taggart, M.H. (1980) Variable patterns of total DNA and rDNA methylation in animals. Nucleic Acids Res., 8, 1485-1497.MEDLINE Abstract
8 Bird, A.P., Taggart, M.H. and Gehring, C.A. (1981) Methylated and unmethylated ribosomal RNA genes in the mouse. J. Mol. Biol., 152, 1-17.MEDLINE Abstract
9 Dawid, I.B., Brown, D.D. and Reader, R.H. (1970) Composition and structure of chromosomal and amplified ribosomal DNAs of Xenopus laevis. Mol. Biol., 51, 341-360.
10 Bird, A.P. and Southern, E.M. (1978) Use of restriction enzymes to study eukaryotic DNA methylation. I: The methylation pattern in ribosomal DNA from Xenopus laevis. J. Mol. Biol., 118, 27-47.MEDLINE Abstract
11 Cross, S.H., Charlton, J.A., Nan, X. and Bird, A.P. (1994) Purification of CpG islands using a methylated DNA binding column. Nature Genet., 6, 236-244.MEDLINE Abstract
12 Maniatis, T., Fritsch, E.F. and Sambrook, J. (1982) Molecular Cloning: A Laboratory Manual. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY. pp 196-198.
13 Cross, S.H. (1989) Isolation and Characterisation of Human Telomeres PhD. Thesis, University of Edinburgh.
14 McClelland, M., Nelson, M. and Raschke, E. (1994) Effect of site-specific modification on restriction endonucleases and DNA modification methyltransferases. Nucleic Acids Res., 22, 3640-3659.MEDLINE Abstract
15 Feinberg, A.P. and Vogelstein, B. (1983) A technique for radiolabelling DNA restriction endonuclease fragments to high specific activity. Anal. Biochem., 132, 6-13.MEDLINE Abstract
16 Sealey, P.G., Whittaker, P.A. and Southern, E.M. (1985) Removal of repeated sequences from hybridisation probes. Nucleic Acids Res., 13, 1905-1921.MEDLINE Abstract
17 Maden, E.B., Dent, L.C., Farrell, E.T., Garde, J., McCallum, F.S. and Wakeman, A.J. (1987) Clones of human ribosomal DNA containing the complete 18S-rRNA and 28S-rRNA genes. Characterization, a detailed map of the human ribosomal transcription unit and diversity among clones. Biochem. J., 246, 519-527.
18 Gonzalez, I.L. and Sylvester, J.E. (1995) Complete sequence of the 43-kb human ribosomal DNA repeat: analysis of the intergenic spacer. Genomics, 27, 320-328.MEDLINE Abstract
19 Altschul, S.F., Gish, W., Miller, W., Myers, E.W. and Lipman, D.J. (1990) Basic local alignment search tool. J. Mol. Biol., 215, 403-410.MEDLINE Abstract
20 Bernardi, G. (1989) The isochore organization of the human genome. Annu. Rev. Genet., 23, 637-661.MEDLINE Abstract
21 Urano, Y., Kominami, R., Mishima, Y. and Muramatsu, M. (1980) The nucleotide sequence of the putative transcription initiation site of a cloned ribosomal RNA gene of the mouse. Nucleic Acids Res., 8, 6043-6058.MEDLINE Abstract
22 Rothblum, I.L., Reddy, R. and Cassidy, B. (1982) Transcription initiation site of rat ribosomal DNA. Nucleic Acids Res., 10, 7345-7362.
23 Financsek, I., Mizumoto, K., Y., M. and Muramatsu, M. (1982) Human ribosomal RNA gene: Nucleotide sequence of the transcription initiation region and comparison of three mammalian genes. Proc. Natl. Acad. Sci. USA, 79, 3092-3096.MEDLINE Abstract
24 Renalier, M.-H., Mazan, S., Joseph, N., Michot, B. and Bachellerie, J.-P. (1989) Structure of the 5'-external transcribed spacer of the human ribosomal RNA gene. FEBS Lett., 249, 279-284.MEDLINE Abstract
25 Gonzalez, I.L., Chambers, C., Gorski, J.L., Stambolian, D., Schmickel, D.R. and Sylvester, E.J. (1990) Sequence and structure correlation of human ribosomal transcribed spacers. J. Mol. Biol., 212, 27-35.MEDLINE Abstract
26 Bird, A.P. (1986) CpG-rich islands and the function of DNA methylation. Nature, 321, 209-213.MEDLINE Abstract
27 Bird, A.P., Taggart, M.H., Nicholls, R.D. and Higgs, D.R. (1987) Non-methylated CpG-rich islands at the human alpha-globin locus: implications for evolution of the alpha-globin pseudogene. EMBO J., 6, 999-1004.
28 Weiss, A., Keshet, I., Razin, A. and Cedar, H. (1996) DNA demethylation in vitro: involvement of RNA. Cell, 86, 709-718.MEDLINE Abstract
29 Dante, R., Percy, M., Baldini, A., Markovic, V., Miller, D., Rocchi, M., Niveleau, A. and Miller, O.J. (1992) Methylation of the 5' flanking sequences of the ribosomal DNA in human cell lines and a human-hamster cell line. J. Cell Biochem., 50, 357-362.MEDLINE Abstract
30 Qu. H.L, Nicoloso, M. and Bachellerie, J.-P. (1991) A sequence dimorphism in a conserved domain of human 28S rRNA. Uneven distribution of variant genes among individuals. Differential expression of HeLa cells. Nucleic Acids Res., 19, 1015-1019.MEDLINE Abstract
31 MacLeod, D., Charlton, J., Mullins, J. and Bird, A.P. (1994) Sp1 sites in the mouse aprt gene promoter are required to prevent methylation of the CpG island. Genes Dev., 8, 2282-2292.MEDLINE Abstract
32 Brandeis, M., Frank, D., Keshet, I., Siegried, Z., Mendelsohn, M., Nemes, A., Temper, V., Razin, A. and Cedar, H. (1994) Sp1 elements protect a CpG island from de novo methylation. Nature, 371, 435-438.MEDLINE Abstract
33 Labhart, P. (1994) Negative and positive effects of CpG-methylation on Xenopus ribosomal gene transcription in vitro. FEBS Lett., 356, 302-306.MEDLINE Abstract
*To whom correspondence should be addressed
-->
This page is maintained by OUP admin. Last updated Fri Feb 7 12:40:48 GMT 1997. Part of the OUP Journals World Wide Web service.Copyright Oxford University Press, 1996