Skip Navigation

This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (19)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Bouzekri, N.
Right arrow Articles by Jobling, M. A.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Bouzekri, N.
Right arrow Articles by Jobling, M. A.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Human Molecular Genetics Pages 655-659


Novel mutation processes in the evolution of a haploid minisatellite, MSY1: array homogenization without homogenization
Introduction
Results
   Identification of new variant repeat types
   MVR-PCR system specific to new repeat types
   Diversity of alleles containing the novel repeat types
Discussion
Materials And Methods
Acknowledgements
References


Novel mutation processes in the evolution of a haploid minisatellite, MSY1: array homogenization without homogenization

Novel mutation processes in the evolution of a haploid minisatellite, MSY1: array homogenization without homogenization Nourdine Bouzekri1,2,+, Paul G. Taylor2, Michael F. Hammer3 and Mark A. Jobling2,*

1CNRS, UPR 8291, Centre d'Immunopathologie et de Génétique Humaine (CIGH), CHU Purpan, 31300 Toulouse, France, 2Department of Genetics, University of Leicester, University Road, Leicester LE1 7RH, UK and 3Laboratory of Molecular Systematics and Evolution, University of Arizona, Tucson, AZ 85721, USA

Received November 18, 1997; Revised and Accepted January 27, 1998

The Y-specific locus MSY1 is the only known haploid minisatellite, and displays an extremely high degree of structural diversity which can be assayed by minisatellite variant repeat PCR (MVR-PCR). One group of alleles, in an African-specific class of Y chromosomes (haplogroup 8), behaves unusually in the conventional MVR-PCR assay, and sequencing demonstrates that this is because repeat units in these alleles contain an additional base substitution. We have designed a new MVR-PCR system to detect these novel variants, and show firstly that they are confined to the haplogroup 8 chromosomes, and secondly that the base substitution has spread through these arrays without the elimination of existing repeat variants. The sharing of a particular base substitution between otherwise distinct repeat types in these alleles represents evidence of a remarkable mutation process in their evolutionary history, in which the variant base must have been spread by a biased repair mechanism operating in very small patches within heteroduplexes.

INTRODUCTION

Minisatellites are loci composed of tandem arrays of 10-50 bp repeat units. They have been widely used and investigated because of their high degree of length polymorphism, which results from a high mutation rate to new length alleles. As well as polymorphism in repeat number, there is also polymorphism in the sequence of repeat units. This can be accessed by minisatellite variant repeat PCR (MVR-PCR), in which PCR products are generated between a fixed flanking primer and discriminator primers designed to anneal specifically to one kind of repeat variant, thus mapping the positions of variant repeats along arrays, and allowing the fine structures of alleles to be analysed (1).

As well as in diversity studies, MVR-PCR has been useful in understanding the dynamics of mutation at minisatellites. Comparisons of progenitor and mutant allele structures have shown the importance of inter-allelic, gene conversion-like processes at autosomal loci (2).

We previously have isolated and characterized the only known constitutively haploid minisatellite, the Y-specific locus MSY1 (DYF155S1; ref. 3). This minisatellite is composed of 48-114 copies of a 25 bp repeat unit which is AT rich and predicted to form stable hairpin structures. Five sequence variant repeat types, differing by base substitutions, were identified originally, and four of these (types 1-4) mapped in an MVR-PCR system allowing complete codes for all alleles to be obtained.

We are interested in using MSY1 as a marker for Y chromosome diversity, in combination with other, more slowly mutating polymorphisms. However, it also provides a unique opportunity to study mutation at a haploid minisatellite, exempt from the activities of inter-allelic exchange processes which have been shown to be so important at diploid loci. Here we identify further novel repeat types at MSY1 which provide evidence for a remarkable mutational process in the history of this locus with a strand bias capable of homogenizing particular base substitutions within minisatellite arrays, without eliminating other repeat variants.

RESULTS

Identification of new variant repeat types

In some alleles, three-state MVR-PCR analysis (coding type 1, 3 and 4 repeats) gave reproducible bands of very much lower intensity than usual for some repeats, and for others coded strongly in one direction (reverse mapping) and not at all in the other direction (forward mapping); we interpreted this unusual behaviour in MVR analysis as evidence for the existence of further novel repeat types (3). The last (3'-most) one or two repeats in this class of alleles do not show this reduced intensity (Fig. 1a).

Figure 1. (a) Three-state MVR-PCR mapping of MSY1 alleles. MVR-PCR analysis of alleles from the Kenyan m66 and the Nigerian m118 compared with a control allele (English m19), using the original three-state system (3). Under these conditions, m66 and m118 code differently to m19: the central core of repeats codes as type 3 in reverse, but not in forward, and the last one (m118) or two (m66) repeats code very strongly in type 4 reverse mapping. (b) Sequences of novel repeat types containing the T -> C transition, compared with repeat types 1, 3 and 4. The sequence is also shown of the 5'-most repeat in m118, which contains an additional base substitution, and codes as a null in the new MVR-PCR system (see c). (c) MSY1 alleles analysed with the new MVR-PCR system. The males are those shown in (a). The new system specifically detects repeat types which contain the transition mutation. Very little signal is seen in m19, while all repeats in m66 and m118 code strongly, except the true type 4 repeats at the 3' ends, and the 5'-most repeat in each case, which here is a null (see sequences in b). (d) MVR-PCR codes derived from data shown in (a) and (c). `h/g' = haplogroup. Note that the novel repeat types in the haplogroup 8 chromosomes are classified as nulls in the usual three-state MVR-PCR system (3).

To investigate the molecular basis of this intensity difference, we analysed one of these alleles, in the Nigerian male m118, by direct sequencing. Sequence data were obtained for 12 repeats from each end of this 57 repeat allele. At the 3' end, there is a single type 4 repeat, which corresponds to an intense band in the MVR-PCR coding. All other sequenced repeats share a base substitution, a T -> C transition at position 6 (Fig. 1b); in sequencing autoradiographs, this transition could be seen extending into the allele well beyond the fully readable sequence. In all other respects, these repeats are equivalent to the previously described types 1, 3 and 4, with the exception of the first (5'-most) repeat, which contains an additional base substitution (T -> A transversion) at position 21 (Fig. 1b). To reflect their similarity to the original repeat types, the new types are designated 1a, 3a and 4a.

MVR-PCR system specific to new repeat types

In order to identify and map these novel repeat types more efficiently and unambiguously, a new MVR-PCR system was designed to detect them. This system discriminates well between repeats which contain the substitution and those which do not (Fig. 1c), including the detection, as a null, of the 5'-most variant in m118, and allows unambiguous MVR codes to be constructed (Fig. 1d).

Diversity of alleles containing the novel repeat types

We used the new MVR-PCR system to determine the codes of a larger sample of alleles carrying the novel repeat types. The males m66 and m118 belong to a Y-chromosomal haplogroup which we refer to as haplogroup 8 (previously `Af'; ref. 4). This is a subgroup of the set of chromosomes which bear the YAP Alu element insertion (DYS287; ref. 5) and the PN2 base substitutional polymorphism (6), and is defined by the presence of the sY81 (DYS271; ref. 7) and PN1 (6) base substitutions. The phylogenetic relationships of these different groups of chromosomes are illustrated in Figure 2. It is because of the derived nature of the haplogroup 8 chromosomes that the MSY1 repeat transition is defined as T -> C, rather than vice versa. This conclusion is also supported by the fact that the monomer repeat unit in the MSY1 homologue, DYF155S2, which may represent an unamplified progenitor of MSY1, has a T rather than a C at position 6 (ref. 3).


Figure 2. Phylogenetic relationship of Y chromosomes carrying the YAP polymorphic Alu element insertion. Names of haplogroup 4 and 8 are from ref. 4, and haplogroup 25 is a new name.

We surveyed a total of 28 chromosomes from haplogroup 8 and 10 from the ancestral haplogroup 25 (Fig. 3). All 38 alleles have different structures. All of the haplogroup 8 chromosomes, and none of the haplogroup 25 chromosomes, showed the MSY1 T -> C transition repeats. The number of true type 4 repeats at the 3' ends of these alleles is always either one (seven chromosomes) or two (21 chromosomes). Nineteen of the alleles have a `null' repeat at their 5' end, but we cannot say whether this is equivalent to the 5'-most sequenced variant repeat in m118. The diversity of the haplogroup 8 alleles is low; apart from the repeat content and structural similarity, the lengths of these alleles range only between 54 and 67 repeats. The ancestral haplogroup, in contrast, shows a high degree of structural and length diversity.


Figure 3. MVR-PCR codes of 26 haplogroup 8 chromosomes and 10 ancestral haplogroup 25 chromosomes. The MSY1 T -> C transition is exclusive to haplogroup 8 chromosomes.

DISCUSSION

An increasing collection of neurodegenerative disorders and fragile sites has been shown to be due to dynamic mutations of trinucleotide repeat loci, and minisatellites are now joining the list of tandemly repeated sequences which can be associated with disease phenotypes. Certain alleles of the H-ras minisatellite can influence transcription of the H-ras genes and lead to increased susceptibility to some cancers (8); a similar phenomenon appears to affect the insulin gene, and thus predisposition to insulin-dependent diabetes (9), and an expanded minisatellite upstream of the cystatin B gene is associated with progressive myoclonus epilepsy type 1 (10). The FRA16B fragile site has been shown to be due to the massive expansion of a minisatellite, which, like MSY1, is AT rich and has strongly predicted secondary structure (11). As more tandemly repeated sequences become implicated in disorders, it is increasingly important to understand the dynamics of the mutation processes which underlie the variability of these kinds of loci.

As well as this, an understanding of the mutational properties of MSY1 is relevant to the specific issue of human Y chromosome diversity and the use of the Y as a tool to investigate population histories (4). MSY1 is the most variable single marker on the chromosome, and it is the only one where the characteristics of large numbers of mutations can be studied in detail; it thus has potential as a dating tool for haplogroups of Y chromosomes defined by slowly mutating polymorphisms such as base substitutions.

Here we have presented evidence of a novel mutation homogenization process. In the haplogroup 8 chromosomes, a T -> C transition mutation is homogenized throughout MSY1 arrays, with the exception of one or two repeats at the 3' ends of alleles, but without eliminating the pre-existing type 1, 3 and 4 repeats. Chromosomes ancestral to this haplogroup completely lack this transition mutation. We can speculate on the chain of events which led to this situation. It seems reasonable to assume that the initial mutation event was the genesis, by point mutation, of a single repeat with the T -> C transition, although we cannot say where within the array this occurred; nor can we say anything about the polarity of spread of the new variant. Subsequent events must have had two remarkable cardinal features: firstly, as the transition spread throughout the allele, presumably through the haploid mutation processes of slippage or unequal sister chromatid exchange (USCE), mispaired intermediates within heteroduplexes were repaired in a biased manner; the direction of change at position 6 was consistently T -> C, rather than C -> T. Secondly, the repair was somehow restricted to position 6: positions 3 and 13, which define repeat types 1, 3 and 4, were not affected. Wholesale repair of mispaired repeat units, either within a helix or between helices, would homogenize a single repeat type (e.g. type 1a) throughout arrays, rather than a single base within each repeat. The different positions of variation within repeat units thus behave as if they are evolving independently. Figure 4 illustrates biased mismatch repair between helices (USCE), and within a helix (slippage), where the formation of the hairpin secondary structures which are predicted on the basis of sequence may be important. If these hairpins do form, either in this homogenization process or in the normal mutation process of MSY1 alleles, then the T -> C transition is expected to destabilize the hairpin and this may have an effect on mutation rate. Rates in haplogroup 8 alleles could, in principle, be compared with those in alleles lacking the transition by mutation studies in sperm DNA.


Figure 4. Biased repair within intrahelical (slippage) and interhelical (USCE) heteroduplexes during the homogenization of the T -> C transition at position 6. Repair (small vertical arrows) is illustrated within a type 1 and type 3a heteroduplex (left) and a type 3 and type 4a heteroduplex (right); repairs which do not occur at mispaired bases are indicated by small vertical arrows with crosses on them. An example of a predicted hairpin structure, possibly important in slippage, is shown for a type 1 repeat; other repeat types form similar hairpins, although the T -> C transition at position 6 will be destabilizing. Note that for type 1/type 1a, type 3/type 3a or type 4/type 4a heteroduplexes there are no mismatches at positions 3 or 13, but that there is still a requirement for biased repair at position 6.

What can be said about the rate of this process? Other than those with either one or two type 4 repeats, we do not observe any alleles in which the transition is only partly homogenized. These intermediates must have existed at some time, and we may find them if we survey more chromosomes, or they may now be extinct. Coalescent analysis indicates an age of 19 000 years (~760 generations) for the PN1 mutation (previously 30 000 years; ref. 6) which, together with sY81, defines haplogroup 8. Length distributions of MSY1 alleles (3) suggest that the predominant mutation process is gain or loss of a single repeat unit. If the homogenization, too, were a single-repeat process, and always biased, then the homogenization of a 60 repeat allele would require 60 mutation events. Thus, 760 generations implies a rate for this process of at least 8% per meiosis; however, 95% confidence limits on the age of PN1 are wide (3500-50 000 years), giving a range of rates of 3-43%. Homogenization may certainly be very rapid if mispairing involving misalignment by more than one repeat unit occurs frequently. The finding of intermediate alleles would allow direct mutation analysis to be done and a rate for homogenization to be determined directly.

Patch gene conversion processes within tandem repeats are known to occur on a larger scale, for example in the Bombyx mori late chorion locus (12), but our observations here are unprecedented. There may be other instances of this kind of homogenization, a striking example of molecular drive (13), in other classes of MSY1 alleles, and our continuing diversity survey may detect them.

MATERIALS AND METHODS

Taq cycle sequencing from flanking primer sites and three-state MVR-PCR using the original system were carried out as described (3). Discriminator primer sequences for the novel MVR-PCR system are the same as those used previously (3), but contain the additional substitution: Y1TAG1a, 5'-(Tag)-TGTGTATAATATACATCATGTATGTTG-3'; Y1TAG3a, 5'-(Tag)-TGTGTATAATATACATGATGTATGTTG-3'; Y1TAG3aR, 5'-(Tag)-CATCATGT-ATATTATACACAACATACATC-3'; and Y1TAG4aR, 5'-(Tag)-C-ATCATGTATATTATACATAACATACATC-3'. (Tag) is the 20mer 5' extension used by (1): 5'-Tcatgcgtccatggtccgga-3'. First phase annealing temperature is raised from 64 to 66°C for the new primers.

Some DNA samples have been described previously (14,15); others were gifts from Alec Jeffreys, Arpita Pandya and Chris Tyler-Smith.

Typing of Y-specific polymorphisms by PCR was as described: YAP (16), sY81 (7), PN1 and PN2 (6).

ACKNOWLEDGEMENTS

We thank many people who gave us DNA samples, Robert Griffiths for coalescent analysis of the PN1 mutation, and John Armour, Gabby Dover, Matt Hurles, Alec Jeffreys and Chris Tyler-Smith for comments on the manuscript. This work was supported by the EC as part of the Network Project: `The Biological History of European Populations' (EC Contract 92-0032). P.G.T. and M.A.J. are supported by the Wellcome Trust; M.A.J. is a Wellcome Career Development Fellow (grant no. 044910).

REFERENCES

1. Jeffreys, A.J., MacLeod, A., Tamaki, K., Neil, D.L. and Monckton, D.G. (1991) Minisatellite repeat coding as a digital approach to DNA typing. Nature, 354, 204-209. MEDLINE Abstract

2. Jeffreys, A.J., Tamaki, K., MacLeod, A., Monckton, D.G., Neil, D.L. and Armour, J.A.L. (1994) Complex gene conversion events in germline mutation at human minisatellites. Nature Genet., 6, 136-145. MEDLINE Abstract

3. Jobling, M.A., Bouzekri, N. and Taylor, P.G. (1998) Hypervariable digital DNA codes for human paternal lineages: MVR-PCR at the Y-specific minisatellite, MSY1 (DYF155S1). Hum. Mol. Genet., 7, 643-653.

4. Jobling, M.A. and Tyler-Smith, C. (1995) Fathers and sons: the Y chromosome and human evolution. Trends Genet., 11, 449-456. MEDLINE Abstract

5. Hammer, M.F. (1994) A recent insertion of an Alu element on the Y chromosome is a useful marker for human population studies. Mol. Biol. Evol., 11, 749-761. MEDLINE Abstract

6. Hammer, M.F., Spurdle, A.B., Karafet, T., Bonner, M.R., Wood, E.T., Novelletto, A., Malaspina, P., Mitchell, R.J., Horai, S., Jenkins, T. and Zegura, S.L. (1997) The geographic distribution of human Y chromosome variation. Genetics, 145, 787-805. MEDLINE Abstract

7. Seielstad, M.T., Hebert, J.M., Lin, A.A., Underhill, P.A., Ibrahim, M., Vollrath, D. and Cavalli-Sforza, L.L. (1994) Construction of human Y-chromosomal haplotypes using a new polymorphic A to G transition. Hum. Mol. Genet., 3, 2159-2161. MEDLINE Abstract

8. Krontiris, T.G. (1995) Minisatellites and human disease. Science, 269, 1682-1683. MEDLINE Abstract

9. Kennedy, G.C., German, M.S. and Rutter, W.J. (1995) The minisatellite in the diabetes susceptibility locus IDDM2 regulates insulin transcription. Nature Genet., 9, 293-298. MEDLINE Abstract

10. Virtaneva, K., D'Amato, E., Miao, J., Koskiniemi, M., Norio, R., Avanzini, G., Franceschetti, S., Michelucci, R., Tassinari, C.A., Omer, S., Pennacchio, L.A., Myers, R.M., Dieguez-Lucena, J.L., Krahe, R., de la Chapelle, A. and Lehesjoki, A.-E. (1997) Unstable minisatellite expansion causing recessively inherited myoclonus epilepsy, EPM1. Nature Genet., 15, 393-396. MEDLINE Abstract

11. Yu, S., Mangelsdorf, M., Hewett, D., Hobson, L., Baker, E., Eyre, H.J., Lapsys, N., Le Paslier, D., Doggett, N.A., Sutherland, G.R. and Richards, R.I. (1997) Human chromosomal fragile site FRA16B is an amplified AT-rich minisatellite repeat. Cell, 88, 367-374. MEDLINE Abstract

12. Eickbush, T.H. and Burke, W.D. (1986) The silkmoth late chorion locus. II. Gradients of gene conversion in two paired multigene families. J. Mol. Biol., 190, 357-366. MEDLINE Abstract

13. Dover, G. (1982) Molecular drive: a cohesive mode of species evolution. Nature, 299, 111-116. MEDLINE Abstract

14. Jobling, M.A. (1994) A survey of long-range DNA polymorphisms on the human Y chromosome. Hum. Mol. Genet., 3, 107-114. MEDLINE Abstract

15. Mathias, N., Bays, M. and Tyler-Smith, C. (1994) Highly informative compound haplotypes for the human Y chromosome. Hum. Mol. Genet., 3, 115-123. MEDLINE Abstract

16. Hammer, M.F. and Horai, S. (1995) Y-chromosomal DNA variation and the peopling of Japan. Am. J. Hum. Genet., 56, 951-962. MEDLINE Abstract


*To whom correspondence should be addressed. Tel.: +44 116 252 3377; Fax: +44 116 252 3378; Email: maj4@leicester.ac.uk
+Present address: Institute of Biological Anthropology, University of Oxford, 58 Banbury Road, Oxford OX2 6QS, UK


This page is run by Oxford University Press, Great Clarendon Street, Oxford OX2 6DP, as part of the OUP Journals
Comments and feedback: www-admin{at}oup.co.uk
Last modification: 14 Mar 1998
Copyright© Oxford University Press, 1998.

Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
GeneticsHome page
J. Cocquet, E. De Baere, S. Caburet, and R. A. Veitia
Compositional Biases and Polyalanine Runs in Humans
Genetics, November 1, 2003; 165(3): 1613 - 1617.
[Abstract] [Full Text] [PDF]


Home page
J. Med. Genet.Home page
P. Blanco, M. Shlumukova, C. A Sargent, M. A Jobling, N. Affara, and M. E Hurles
Divergent outcomes of intrachromosomal recombination on the human Y chromosome: male infertility and recurrent polymorphism
J. Med. Genet., October 1, 2000; 37(10): 752 - 758.
[Abstract] [Full Text]


This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (19)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Bouzekri, N.
Right arrow Articles by Jobling, M. A.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Bouzekri, N.
Right arrow Articles by Jobling, M. A.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?