| Human Molecular Genetics | Pages |
Repetitive conundrums of centromere structure and function
Introduction
Centromere function: Where are the repeats?
The function of repetetive structure
Pericentromeric duplications: Complex DNA becomes more complicated
Resolution of the centromere structure-function paradox
Acknowledgement
References
Repetitive conundrums of centromere structure and function
INTRODUCTION
The association between repetitive DNA and centromere structure is long-standing. With the exception of point centromeres among species of budding yeast and the enigmatic holocentric chromosomes of Caenorhabditis elegans (1), almost every other species of plant, animal and fungus studied to date harbors a spectacular array of both highly repetitive and middle repetitive elements at the site of each chromosomes primary constriction (2). Although the specific details of the DNA structure vary among the diverse species, the molecular architecture is generally the same: blocks of AT-rich tandemly repeated DNA bracketed by clusters of various classes of retroposons (3-6). The recurrent evolutionary theme of repetitive DNA has been taken as evidence (albeit not as a proof) of an implied functional relationship (7). The presence of satellite DNA and other repetitive DNA in these regions of a chromosome are thought to favor the assembly of the kinetochore, thereby ensuring the efficient and timely meiotic and mitotic segregation of chromosomes (8-10). Several recent experiments in both humans and Drosophila provide continued support for this model (5,11,12). The repetitive structure of naturally occurring centromeres within the context of mini/microchromosomes, for example, appears to be sufficient to recapitulate, if not all, at least some aspects of centromere function including kinetochore formation and chromosome segregation. Nevertheless, theobservation of functional inactivation of one centromere among some stable human dicentric chromosomes (13-15) and the capacity of portions of chromosomes which do not normally carry repetitive centromeric DNA to function as sites for kinetochore assembly (16-18) provide ample evidence that repetitive DNA, by itself, does not explain adequately the molecular basis of a functionally competent centromere (19).
CENTROMERE FUNCTION: WHERE ARE THEREPEATS?
Two articles in this issue further complicate the relationship between centromere structure and function (20,21). Both articles examine in detail the molecular structure of two functionally competent centromeres on human chromosome 10. The structural dichotomy between these two structures could not have been more striking (Fig. Figure 1.. Structure of functionally competent chromosome 10 centromeres. (a) Sequence analysis of 80 kb of neocentromere sequence (21). Mardel(10) is formed by breakage and fusion within 10p11.2 and 10q23.2, resulting in the generation of a neocentromere at 10q25.2. A total of 80 kb of sequence (GenBank accession no. AF04284) corresponding to the site of kinetochore formation was analyzed for the presence of repeats using RepeatMasker. The distribution and orientation (+, forward with respect to sequence) of various repeats (SINE, LINE, MER and LTR elements) are depicted. Only repeat segments with length >200 bp were considered. (b) Analysis of an ~8 Mb region spanning the normal chromosome 10p11/10q11 centromere (20). Note the presence of various subtelomeric and pericentromeric homologies located distal to the satellite sequence as well as the large duplications of the ZNF (zinc finger gene) and D10S14 region. Sizes of various segments are approximated as the extent of each homology is not precisely known. Approximately 1 Mb of 10p11.2 currently is being sequenced by the Sanger Center and is available within the HTGS division of GenBank (accession nos AL022344 and AL022345). Analysis of this sequence indicates the presence of many interchromosomally duplicated genes and gene segments. In stark contrast to the mundane neocentromere sequence, the overview of the organization of the normal chromosome 10 centromere and its flanking pericentromeric sequence is astonishingly complex (20). Jackson presents arguably the best developed and well-supported physical maps of any human centromere. Building on previously developed pulsed-field gel electrophoresis (PFGE) and yeast artificial chromosome (YAC) maps in this region (23-25), the authors develop a collection of pericentromeric markers which are used to confirm the orientation of various duplications flanking the alpha satellite centromeric DNA as well as to assess the degree of evolutionary conservation. This analysis indicates an unusually high degree of variability in the organization and content of sequences flanking the alpha satellite and satellite III blocks at or near the chromosome 10 centromere. In addition, the data suggest the presence of uncharacterized recently duplicated segments from a variety of different pericentromeric locations (Fig.
THE FUNCTION OF REPETITIVE STRUCTURE
If these data are generally applicable to other centromeric regions, it would suggest that neocentromeres are less structurally complex than anticipated, while normal centromeres show an unsuspected degree of repetitive complexity which may extend into the megabases of DNA sequence flanking alpha satellite DNA. How can these two very different structures be reconciled with centromere function? One solution to this puzzling conundrum (often used when simple molecular paradigms prove less than failsafe) has been to invoke an epigenetic process to explain the marking of functional centromere DNA (19,38,39). One candidate for this marker, recently proposed by Csink and Henikoff, may be late DNA replication (40). In this resurrected version of Dupraws model (41), functional competence of a centromere is simply an attribute of the latest replicating segment of DNA. What, then, is the functional relevance of repetitive structure? According to Henikoff, the accumulation of repetitive DNA further retards replication, effectively fixing such sites in the genome to function more competently as a centromere. Therefore, the universality of repetitive DNA at centromeres is an evolutionarily derived character state and not a precursor of centromere function. In other words, centromere function is not a consequence of repetitive structure, but repetitive structure is a consequence of enhancing centromere function. Although this model could adequately explain phenomena of lateral inhibition among some dicentric chromosomes and the formation of neocentromeres, there are, at this point, very few experimental data in its support. The availability of neocentromere sequence, however, provides a direct opportunity to test the validity of this model. It should be possible, for example, to develop assays to test replication timing of this sequence in both normal as well as neocentromere chromosomes. The finding that this 80 kb of sequence from 10q25.2 is the latest replicating piece of DNA would clearly support the model that a neocentromere is the evolutionary equivalent of a newborn centromere awaiting the maturity conferred by repetitive DNA.
PERICENTROMERIC DUPLICATIONS: COMPLEX DNA BECOMES MORE COMPLICATED
Whatever model is invoked to explain the relationship between centromere structure and function, it is clear that in humans, at least, the model might need to take into account the phenomenon of recent pericentromeric duplication. For many chromosomes, there is now compelling evidence that regions in close proximity to the centromere are hotspots for recent gene duplication events (26-37). If the centromere is a recruitment station for repeats, perhaps it can also serve as a reservoir for the accumulation of transposed genic segments. Interestingly, this peculiar structural property (the wholesale duplication of entire genes or portions of intronic/exonic sequence) has only been reported among primate chromosomes, where it can result in considerable variation in chromosome structure as well as a presumptive proclivity to recurrent cytogenetic rearrangements associated with genetic disease (42). In the absence of detailed maps and sequence analysis from other organisms, it would be premature to conclude that this property is restricted to primate chromosomes. It may be noteworthy that a very similar process of recent gene duplication has now been reported to occur near the subtelomeric boundaries of a variety of human chromosomes (43-46). Is the accumulation of these genic segments a property of heterochromatic sequence? Does their presence in these locations serve a particular function? Disregarding potential evolutionary implications of recent gene duplication and exon shuffling, perhaps the presence of such trans-posed DNA, harboring multiple protein-binding sites associated with gene function and processing, may function as a gate-keeper, effectively quenching the effects of heterochromatization associated with telomeric and centromeric DNA. Such an effect might serve effectively to insulate bona fide unique genes from hetero-chromatic position effects due to their close proximity (i.e. RET proto-oncogene on 10q11.2) to repetitive sequences. Alternatively, duplicated segments may contribute to the epigenetic basis for centromere formation by playing a role in the formation of the proposed higher-order repeat structures (5,38). In this regard, it is interesting to note that the vast majority of neocentromeres [of which mardel(10) is an exception] are structurally mirror-image chromosomes (39). Perhaps large inverted duplicated sequences somehow aid in specifying the position of a functionally competent centromere. Such inverted structures are not uncommon in pericentromeric DNA (47,48), as evidenced by the large (250 kb) inverted blocks of sequence in 10q11 and 10p11 (20) (Fig.
What mutational mechanisms are responsible for the accumu-lation of duplicated segments within the pericentromeric region? One hypothesis put forward is that these apparent transpositions are a direct consequence of one of the many mutational forces operating on alpha satellite DNA (32). Although the data thus far are limited, there is also the suggestion that there may be specific GC-rich repetitive signals which may be involved in directing duplications to the pericentromeric regions of human chromosomes (27,30,31). Recent molecular and cytogenetic analysis of marsupial interspecific hybrid chromosomes indicates that massive pericentromeric-directed invasions of transposons can occur within a single generation (49). The significance of these and other observations with respect to centromeric function awaits further clarification.
RESOLUTION OF THE CENTROMERE STRUCTURE-FUNCTION PARADOX
How will the repetitive conundrum(s) of centromere structure and function ever be resolved? Perhaps the first step is, for the time being, to dissociate these two different aspects of the centromere and to focus on the development of more comprehensive functional and structural analysis of such regions of the genome. Detailed structural analysis will entail continued sequencing of DNA associated with both neocentromere and normal centromeric regions of the genome. Of particular interest will be the elucidation of the structure of the sequence which exists at the boundaries of alpha and non-alpha satellite DNA. Such information could possibly shed light on how centromeres have evolved, as well as the types of mutational processes which have been important in their formation (22).
Unfortunately, the outlook for understanding such regions as a byproduct of the Human Genome Project (HGP) may be bleak. In the recently announced new 5 year plan for the US HGP, an ambitious schedule was outlined to develop a working draft of the complete sequence by the end of 2001, with its final completion by the end of 2003 (50). Acknowledged was the fact that certain regions of the genome would probably be under-represented, in particular areas within the centromeres and other constitutive heterochromatic regions. Other areas due to their repetitive nature will pose serious challenges to the sequence-and-assemble machine of the HGP although such regions are expected to be rare. Our notion of their frequency, however, may be more than slightly biased by our inability to detect and analyze such regions. Our current understanding of the structure of the human genome is based on what we have already mapped and sequenced. Traditional sequence-tagged site (STS)-based or YAC physical mapping strategies often disintegrate and crumble as mappers approach the centromere. Efforts to develop detailed physical maps of various chromosomes indicate that the gaps often lie within regions, substantial in size, composed of repetitive DNA (51,52). The fact that some of the best-mapped, most proximal markers are often >1 Mb away from classic satellite DNA gives further testament to this fact (53). Recent estimates suggest that pericentromeric duplications may account for as much as 30-50 Mb of human DNA, above and beyond the 300 Mb already occupied by various classes of satellite DNA. Similar amounts of recently duplicated material may exist near the sub-telomeric regions as well as other regions of the human genome (54). In total, as much as 15-20% of the human genome could conceivably consist of highly repetitive DNA or large fragments of recently duplicated DNA. There is a distinct possibility that such difficult regions may be swept under the carpet in the race to sequence the human genome.
Understanding the function of centromeres ultimately will require detailed analysis of both the nature of the repetitive structure and the apparent lack of repetition associated with traditional centromeres and neocentromeres, respectively. From the structural perspective of normal centromeres, there is probably little scientific merit in targeting for sequencing the many megabases of tandemly repeated alpha satellite present in the human genome. There is, however, a need for a few such regions to be studied and understood in detail. We should at least be confident of the homogeneity of such sequence and the potential patterns of repetition which may be encountered both within and flanking the primary points of constriction (22). Such patterns and their variation are inevitably important in our understanding of the mutational processes that create such regions and the association of these structures with the trans-acting components important for kinetochore formation. Perhaps the secrets underlying the proposed epigenetic function-structure of centromeres are to be found within the pericentromeric graveyard of genes and transposons which flank the centromere. Alternatively, pericentromeric duplications may simply be byproducts of an active centromere with no immediate consequence to its competence. Nevertheless, these two papers (20,21) clearly underscore our current deficit in the understanding of centromere structure and function. Further analyses are warranted to resolve the molecular role of repetitive DNA in one of the most important biological functions of every living eukaryotic cell.
ACKNOWLEDGEMENT
I would like to thank Dr Huntington Willard for useful suggestions in the preparation of this manuscript.
REFERENCES
This article has been cited by other articles:
This page is run by Oxford University Press, Great Clarendon Street, Oxford OX2 6DP, as part of the OUP Journals
Comments and feedback: www-admin{at}oup.co.uk
Last modification: 4 Feb 1999
Copyright©Oxford University Press, 1999.
![]()
CiteULike
Connotea
Del.icio.us What's this?
![]()
![]()

![]()
![]()
![]()
G. C. Ferreri, D. M. Liscinsky, J. A. Mack, M. D. B. Eldridge, and R. J. O'Neill
Retention of Latent Centromeres in the Mammalian Genome
J. Hered.,
May 1, 2005;
96(3):
217 - 224.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
R. J. O'Neill, M. D. B. Eldridge, and C. J. Metcalfe
Centromere Dynamics and Chromosome Evolution in Marsupials
J. Hered.,
September 1, 2004;
95(5):
375 - 381.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
J. A. Bailey, D. M. Church, M. Ventura, M. Rocchi, and E. E. Eichler
Analysis of Segmental Duplications and Genome Assembly in the Mouse
Genome Res.,
May 1, 2004;
14(5):
789 - 801.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
M. T. Tammi, E. Arner, E. Kindlund, and B. Andersson
Correcting errors in shotgun sequences
Nucleic Acids Res.,
August 1, 2003;
31(15):
4663 - 4672.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
T. Newman and B. J. Trask
Complex Evolution of 7E Olfactory Receptor Genes in Segmental Duplications
Genome Res.,
May 1, 2003;
13(5):
781 - 793.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
D. Schindelhauer and T. Schwarz
Evidence for a Fast, Intrachromosomal Conversion Mechanism From Mapping of Nucleotide Variants Within a Homogeneous alpha -Satellite DNA Array
Genome Res.,
December 1, 2002;
12(12):
1815 - 1826.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
J. E. Horvath, J. A. Bailey, D. P. Locke, and E. E. Eichler
Lessons from the human genome: transitions between euchromatin and heterochromatin
Hum. Mol. Genet.,
October 1, 2001;
10(20):
2215 - 2223.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
M. Ventura, N. Archidiacono, and M. Rocchi
Centromere Emergence in Evolution
Genome Res.,
April 1, 2001;
11(4):
595 - 599.
[Abstract]
[Full Text]
![]()
![]()
![]()

![]()
![]()
![]()
J. Enkerli, H. Reed, A. Briley, G. Bhatt, and S. F. Covert
Physical Map of a Conditionally Dispensable Chromosome in Nectria haematococca Mating Population VI and Location of Chromosome Breakpoints
Genetics,
July 1, 2000;
155(3):
1083 - 1094.
[Abstract]
[Full Text]
![]()
![]()
![]()

![]()
![]()
![]()
Y. Ji, E. E. Eichler, S. Schwartz, and R. D. Nicholls
Structure of Chromosomal Duplicons and their Role in Mediating Human Genomic Disorders
Genome Res.,
May 1, 2000;
10(5):
597 - 610.
[Abstract]
[Full Text]
![]()
![]()
![]()

![]()
![]()
![]()
H. Hutter, B. E. Vogel, J. D. Plenefisch, C. R. Norris, R. B. Proenca, J. Spieth, C. Guo, S. Mastwal, X. Zhu, I. S. A. J. Scheel, et al.
Conservation and Novelty in the Evolution of Cell Adhesion and Extracellular Matrix Genes
Science,
February 11, 2000;
287(5455):
989 - 994.
[Abstract]
[Full Text]
![]()
![]()
![]()

![]()
![]()
![]()
G. Montefalcone, S. Tempesta, M. Rocchi, and N. Archidiacono
Centromere Repositioning
Genome Res.,
December 1, 1999;
9(12):
1184 - 1188.
[Abstract]
[Full Text]
![]()
![]()
![]()

![]()
![]()
![]()
E. T. Peterson, R. Sutherland, D. L. Robinson, L. Chasteen, M. Gersh, J. Overhauser, L. L. Deaven, R. K. Moyzis, and D. L. Grady
An Integrated Physical Map for the Short Arm of Human Chromosome 5
Genome Res.,
December 1, 1999;
9(12):
1250 - 1267.
[Abstract]
[Full Text]
![]()
![]()
![]()

![]()
![]()
![]()
J. A. Bailey, A. M. Yavor, H. F. Massa, B. J. Trask, and E. E. Eichler
Segmental Duplications: Organization and Impact Within the Current Human Genome Project Assembly
Genome Res.,
June 1, 2001;
11(6):
1005 - 1017.
[Abstract]
[Full Text]
[PDF]
![]()
This Article ![]()
![]()
Abstract
![]()
FREE Full Text (PDF)
![]()
Alert me when this article is cited
![]()
Alert me if a correction is posted
![]()
Services ![]()
![]()
Email this article to a friend
![]()
Similar articles in this journal
![]()
Similar articles in ISI Web of Science
![]()
Similar articles in PubMed
![]()
Alert me to new issues of the journal
![]()
Add to My Personal Archive
![]()
Download to citation manager
![]()
Search for citing articles in:
ISI Web of Science (33)
![]()
Request Permissions ![]()
Google Scholar ![]()
![]()
Articles by Eichler, E. E.
![]()
Search for Related Content
![]()
PubMed ![]()
![]()
PubMed Citation
![]()
Articles by Eichler, E. E.
![]()
Social Bookmarking ![]()
![]()
What's this?