Skip Navigation

This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (40)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Guy, J.
Right arrow Articles by Jackson, M.S.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Guy, J.
Right arrow Articles by Jackson, M.S.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Human Molecular Genetics, 2000, Vol. 9, No. 13 2029-2042
© 2000 Oxford University Press

Genomic sequence and transcriptional profile of the boundary between pericentromeric satellites and genes on human chromosome arm 10q

J. Guy1, C. Spalluto1,2, A. McMurray3, T. Hearn1, M. Crosier1, L. Viggiano2, V. Miolla2, N. Archidiacono2, M. Rocchi2, C. Scott3, P.A. Lee4, J. Sulston3, J. Rogers3, D. Bentley3 and M.S. Jackson1,+

1Human Genetics Unit, School of Biochemistry and Genetics, University of Newcastle upon Tyne, Newcastle upon Tyne NE1 7RU, UK, 2DAPEG, Sezione di Genetica, Università di Bari, Via Amendola 165/A, 70126 Bari, Italy, 3Sanger Sequencing Centre, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK and 4Department of Computer Science, University of Newcastle upon Tyne, Newcastle upon Tyne NE1 7RU, UK

Received 24 April 2000; Revised and Accepted 22 June 2000.

DDBJ/EMBL/GenBank accession nos AL022344, AL022345, AL031601 and AJ290103AJ290167.


    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 RESULTS
 DISCUSSION
 MATERIALS AND METHODS
 REFERENCES
 
The organization of centromeric heterochromatin has been established in a number of eucaryotes but remains poorly defined in human. Here we present 1025 kb of contiguous human genomic sequence which links pericentromeric satellites to the RET proto-oncogene in 10q11.2 and is presumed to span the transition from centric heterochromatin to euchromatin on this chromosome arm. Two distinct domains can be defined within the sequence. The proximal ~240 kb consists of arrays of satellites and other tandem repeats separated by tracts of complex sequence which have evolved by pericentromeric-directed duplication. Analysis of 32 human paralogues of these sequences indicates that most terminate at or within repeat arrays, implicating these repeats in the interchromosomal duplication process. Corroborative PCR-based analyses establish a genome-wide correlation between the distribution of these paralogues and the distribution of satellite families present in 10q11. In contrast, the distal ~780 kb contains few tandem repeats and is largely chromosome specific. However, a minimum of three independent intrachromosomal duplication events have resulted in >370 kb of this sequence sharing >90% identity with sequences on 10p. Using computer-based analyses and RT–PCR we confirm the presence of three genes within the sequence, ZNF11/33B, KIAA0187 and RET, in addition to five transcripts of unknown structure. All of these transcribed sequences map distal to the satellite arrays. The boundary between satellite-rich interchromosomally duplicated DNA and chromosome-specific DNA therefore appears to define a transition from pericentromeric heterochromatin to euchromatin on the long arm of this chromosome.


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 RESULTS
 DISCUSSION
 MATERIALS AND METHODS
 REFERENCES
 
Heterochromatin is defined cytologically within the eucaryotic nucleus as densely packed and dark staining chromatin, distinct from the more diffuse and lightly staining euchromatin (1,2). This distinction reflects significant differences in biological activity and sequence organization. Heterochromatin can both repress the expression of genes via position effects (3–5) and suppress recombination in a wide variety of eucaryotes (6). Gene density is also known to be low in the heterochromatin of organisms where detailed mapping (7) or sequence data (8–10) are available. For instance, a combination of pulsed field gel electrophoresis (PFGE) and sequencing indicates that Drosophila heterochromatin is composed of satellite arrays interspersed with retrotransposable elements (9,10). A similar organization has been established in Arabidopsis where centromeric heterochromatin largely consists of a 180 bp satellite repeat and retrotransposons, mostly from the Athilia family (8,11). A heterochromatic knob from the short arm of chromosome 4 in this species has also been completely sequenced, revealing 22.5 tandemly arranged copies of a 1950 bp repeat surrounded by DNA rich in retrotransposons (12). Heterochromatic knobs have also been partially characterized in maize and found to have comparable organizational features (13). It is presumed that sequence repetition is responsible for conferring heterochromatic properties, a hypothesis supported by the fact that repetitive sequences are capable of inducing gene silencing in a number of eucaryotes including mammals (14,15).

In human chromosomes the most prominent sites of heterochromatin are the centromeric regions, the large C bands present on chromosomes 1, 2, 9 and 16, and the long arm of the Y chromosome. Consistent with data from other eucaryotes, all of these regions of the genome are rich in tandem repeats (16). To date, a detailed understanding of pericentromeric sequence organization has been hindered by the lack of contiguous sequence data, largely due to repetitive sequences causing clone instability and contig branching during mapping (17). Only the alpha satellite which is present at the primary constriction of all human chromosomes has been analysed in detail, due to its possible role in centromere function (18). Despite this, the basic organization of centromeric repeats has been established on several chromosomes by PFGE and fluorescence in situ hybridization (FISH); in each case a central array of alpha satellite is flanked by arrays of other satellites such as the classical satellites (I–IV) and the beta satellite (16).

However, it is clear that pericentromeric regions of human chromosomes do not consist solely of repetitive sequences. Tracts of gene-related sequences, ranging in size from 60 to 200 kb, have been identified proximal to arrays of satellite II and the 48 bp satellite (D22Z3) on chromosome 22 (19) and at the terminus of the centromeric alpha satellite array on chromosome 16 (20). Some of these sequences have been formed by recent pericentromeric-directed duplication events, examples of which involve the duplication of NF1- and ALD-related sequences (17,21–23). Phylogenetic analysis of human and primate ALD sequences indicates that they have been amplified by a two-step process where the duplication of ~10 kb of DNA from Xq28 into a pericentromeric location has been followed by rapid distribution of this sequence between chromosomes (20). There is some evidence that repetitive sequences are involved in these processes: sequence identities between the ALD gene and its autosomal paralogues terminate within a GCTTTTTGC repeat (21), the genomic distribution of human NF1-related sequences correlates with the position of alpha satellite suprachromosomal family 2 (22), and several pericentromeric duplication events have been shown to terminate close to or within the GC-rich CAGGG and HSREP522 repeats (24). In addition to these interchromosomally duplicated sequences, large intrachromosomal duplications flank the centromeric repeats of several human chromosomes (25–29) which suggests that pericentromeric instability may involve a number of mechanistically distinct phenomena.

We have previously developed a high resolution physical map of the 10p11–q11 region which includes a PFGE map across the centromeric repeats (30) and two yeast artificial chromosome (YAC) contigs which link pericentromeric satellites to genes on both chromosome arms (31). In addition to interchromosomally duplicated sequences, these contigs contain two large (90–250 kb) duplications which flank the centromere (32). To further investigate the organization of these sequences, and to attempt to define human pericentromeric heterochromatin at the sequence level, we have developed 1.025 Mb of contiguous genomic sequence which extends from satellite arrays in 10q11.1 to the RET proto-oncogene in 10q11.2. We have used this sequence to analyse transcriptional activity and to establish the distribution of paralogous sequences within the human genome. The data provide the first integrated picture of mammalian sequence organization, evolution and gene expression at a boundary between pericentromeric repeats and a chromosome arm. The 1025 kb region is gene poor and contains what appears to be a simple discontinuity between repeat-rich, transcriptionally repressed centromeric heterochromatin and gene-containing sequences on the chromosome arm. In this respect, the results are consistent with data from other eucaryotes. However, the identification of distinct domains of interchromosomally and intrachromosomally duplicated sequences indicates that human pericentromeric sequences are more complex than those of other eucaryotes analysed to date.


    RESULTS
 TOP
 ABSTRACT
 INTRODUCTION
 RESULTS
 DISCUSSION
 MATERIALS AND METHODS
 REFERENCES
 
Sequence acquisition and fidelity
A total of ~1.025 Mb of contiguous sequence was generated using three overlapping YACs as templates for shotgun cloning (33), a technique which has proved invaluable for sequencing repetitive regions within the Caenorhabditis elegans genome (34). The YACs form part of a 10 Mb map of 10p11–q11, have been FISH mapped, and possess internally consistent restriction maps which suggests that they have not rearranged during cloning (31). The position of these YACs relative to the final sequence, and to the centromeric alphoid array (D10Z1), is shown in Figure 1A. The 1–2 Mb region between D10Z1 and the sequence presented here remains poorly characterized but is known to include an ~1 Mb satellite 2-rich domain (31). Because YACs are known to rearrange at high frequency (35), several steps were taken to ensure the fidelity of the data (see Materials and Methods). Once the sequence was completed, the large overlap between Y738F9 and Y214H10 was compared as a further check on sequence quality. The overlap between these two sequences share 99.92% similarity with a total of 150 nucleotide differences and 25 insertions/deletions (indels) in 231 540 bp of aligned sequence. This figure must be considered to be a conservative estimate of sequence quality as it will include errors in both sequences. Furthermore, a significant proportion of the sequence variation is likely to represent allelic polymorphism. For instance, the two largest indels consist of an 11 bp difference within a GAA trinucleotide repeat (position ~648 630 bp) and a 17 bp difference within a poly(A) tract at the terminus of an Alu element (position ~661 330 bp). This conclusion is supported when the sequence is aligned with cDNAs from two genes from the region: RET (GenBank accession nos X15262 and X12949) and KIAA0187 (GenBank accession no. D80009). There are a total of nine differences between the cDNAs and the genomic sequence. However, all are within untranslated regions or result in silent substitutions (data not shown) and one is a known polymorphism (C->A at position 3959 of GenBank accession no. D80009).



View larger version (25K):
[in this window]
[in a new window]
 
Figure 1. Overview of 10q11 sequence data. (A) YACs used to generate 10q11 sequence. The positions of the first and last nucleotide generated from each YAC (GenBank accession nos AL022344, AL022344 and AL031601) are shown relative to the final ~1025 kb of sequence. All nucleotide positions in the text refer to the contiguous sequence derived from the three YACs. Y738F9 was not sequenced in its entirety. The position of the YACs relative to D10Z1 based on published PFGE data (31) is also indicated. A previously identified inconsistency exists between the size of the satellite 3 array in the sequence and in the PFGE map (31). We have confirmed that this is not due to a reduction in array size during cloning by using PFGE to establish linkage between probes on opposite sides of the satellite 3 array on an ~26 kb BstEII fragment (position 178 801–205 199 bp) within the cell line used to construct the YAC library (data not shown). (B) GC content and percentage of interspersed repeats in the sequence. The data were generated from output of Repeatmasker using a 20 kb sliding window with a 4 kb interval between data points (see Materials and Methods). (C) Distribution of interspersed repeats by element classification. The data were generated from output of Repeatmasker using a 20 kb sliding window with a 4 kb interval between data points (see Materials and Methods). The peak of unclassified elements at ~240 000 bp is due to composite BUR1 elements. (D) Principle features in the sequence. The position of known genes, gene and pseudogene fragments (Table 2), CpG islands, spliced and unspliced ESTs with >96% identity to the sequence (Table 1), intrachromosomal duplications and tandem repeat arrays longer than 300 bp (Table 3) are shown. The orientation of transcription is indicated by white arrows. Duplications 1 and 4 are the previously described ZNF and D10S141 duplications (31,32). As genomic sequence is not available for the IFB12 and RSU1 genes, the size of duplicated segments 2 and 3 are minimum estimates based on the fact that nucleotides 608 143–619 443 of the 10q11 sequence contain four presumptive exons with 93.1% identity to nucleotides 990–1426 of the RSU1 cDNA and nucleotides 576 780–596 862 contain 10 presumptive exons with 93.3% identity to nucleotides 6144–7936 of the IFB12 cDNA. A summary map showing genes, repeats and duplications is also shown for reference with Figure 3. For details of sequence analysis see Materials and Methods.

 
GC content and the distribution of interspersed repeats
The sequence extends from pericentromeric satellites in the G-dark band 10q11.1 to the RET proto-oncogene in the G-light band 10q11.2 (32). The GC content of the entire sequence is 43.18%. However, it rises steadily from 39.48% in the proximal 200 kb (38.34% after removal of all tandem repeats) to 54.04% in the most distal 100 kb (Fig. 1B), consistent with the general observation that G-dark bands are GC poor relative to G-light bands (36). In total, 48.46% of the entire sequence is accounted for by interspersed repeats, with SINEs, LINES and long terminal repeats (LTRs) accounting for 10.60, 24.28 and 10.87% of the sequence, respectively (Fig. 1B and C). These figures are typical of human autosomal DNA (37). There is a slight excess of LINEs in the proximal region (28.04% of the first 720 kb is LINE related, compared with the genome-wide average of 20.6% for autosomes with an equivalent GC content) and a more distinct excess of LTR elements [17.3% of the sequence from 200 to 560 kb is LTR related compared with the genome average of 7.8% for autosomes of the same GC content (37)]. Again, these observations appear consistent with the known characteristics of G-dark bands (38). However, the frequency of SINEs is highest within the proximal region which is inconsistent with generalizations based on cytogenetic classifications (38). It is also clear from Figure 1C that LINE and LTR elements are non-randomly distributed, with the existence of 15–45 kb sequence tracts where a single element type predominates.

Proximal 10q11 is gene poor
The principal gene-related features identified within the sequence are shown in Figure 1D and Tables 1 and 2. Because gene-related sequences are known to transpose into and between pericentromeric locations (18,21–24), the 10q11 sequence is likely to contain paralogous copies of human genes/sequence tracts present at other genomic locations (a paralogue being defined here as one of a set of homologous genes/sequence tracts within a single species). As a result, we have been conservative in our definition of what constitutes a gene, requiring the presence of both spliced expressed sequence tags (ESTs) with >96% identity to the sequence and either similarity to a known protein or association with an ab initio gene prediction. Using these criteria there are only three genes in the sequence (ZNF11/33B, KIAA0187 and RET) all of which have been mapped previously to 10p11–q11.2 (Table 1). There are also a further 15 ESTs or EST clusters. However, only one of these (GenBank accession no. AI524643) is spliced relative to the genomic sequence and this is derived from the pseudogene ZNF37B. None of the remaining 14 ESTs/EST clusters contain open reading frames (ORFs), show significant similiarity to known proteins, or bear a simple relationship to ab initio gene predictions (data not shown). Furthermore, eight of these contain one or more interspersed repeat sequence. Despite this, primer pairs were designed from 12 of these and used to analyse cDNAs derived from eight adult human tissues by RT–PCR (Table 1). Five primer pairs (corresponding to sequences at ~289, ~434, ~455, 473 and ~479 kb) produced weak amplification products from cDNA derived from pancreas, or kidney and pancreas, indicating a restricted pattern of low level expression. However, because we were unable to identify any gene structures associated with these ESTs using a wide variety of gene prediction programs (see Materials and Methods), we have classified these as transcripts rather than genes. This limited expression contrasts sharply with KIAA0187 and ZNF11/33B which are expressed in all tissues analysed. Examples of RT–PCR experiments are presented in Figure 2. In addition to these gene features, there are a further 13 pseudogenes and gene fragments within the sequence that are not associated with ESTs and have not been analysed by RT–PCR. These are presented in Table 2 and include HOX, ZNF and immunoglobulin lambda-related sequences.


View this table:
[in this window]
[in a new window]
 
Table 1. Genes, ESTs and confirmed transcripts within 10q11
 

View this table:
[in this window]
[in a new window]
 
Table 2. Pseudogenes and gene fragments with no associated ESTs
 


View larger version (85K):
[in this window]
[in a new window]
 
Figure 2. RT–PCR analysis of 10q11 ESTs. Examples of RT–PCR analyses (Table 1) are shown together with appropriate controls. The tissue source of each cDNA used is shown above the panels, and the EST/gene name (Table 1), position in the sequence (kb) and expected PCR product size (bp) are shown to the left of each panel. The sequences amplified using the EST and KIAA0187 primers are contiguous with genomic DNA and give an identical product from both cDNA and genomic DNA controls (DNA). The G3PDH control primers (D) span an intron and do not amplify from genomic DNA. The positive control lane present in all experimental panels consist of a G3PDH cDNA template amplified using the G3PDH primers (see Materials and Methods) and was left blank in (D). The marker is a 100 bp ladder (Promega, Madison, WI). Each experiment was performed in duplicate (data not shown).

 
The proximal 200 kb is rich in satellite repeats
The position and periodicities of tandem arrays >300 bp in total length are presented in Table 3 and Figure 1D. There are four 5–10 kb repeat arrays spaced ~55–60 kb apart within the proximal 200 kb of sequence: two satellite II arrays, a GC-rich array with a 6 bp periodicity [the CAGGG repeat (24)] and a satellite III array. Interspersed between these repeats are nine shorter arrays including two TTCC tetranucleotides, two CT-rich repeats with 13 and 18 bp periodicities based on the pentamer CATTT and four complex GC-rich satellites with periodicities of 22–23 bp which are related to the GCTTTTTGC repeat implicated in the transposition of ALD paralogues (21). There are also seven alpha satellite monomers abutting directly onto the satellite III array which exhibit ~10–15% divergence from chromosome 10-specific alphoid sequences (39), and one short array of the HSREP522 repeat (40). However, the most striking aspect of these repeats is their distribution: no array longer than 500 bp is present in the distal 825 kb of sequence.


View this table:
[in this window]
[in a new window]
 
Table 3. Tandem repeats >300 bp in 10q11
 
Tracts of intrachromosomal duplicated sequence lie distal to the satellite repeats
The pericentromeric region of chromosome 10 contains two large genomic duplications with one copy of each duplication on each arm (25,32). Comparisons with unfinished sequence from 10p11 (http://webace.sanger.ac.uk:80/cgi-bin/display ) indicates that the proximal duplication is ~250 kb long and extends from ~278 to 528 kb, whereas the more distal duplication is ~90 kb long and extends from ~762 to ~853 kb (Fig. 1D, duplications 1 and 4). These figures are consistent with previous PFGE estimates (32). However, there is also an ~11.3 kb non-processed gene fragment related to RSU1 and an ~20 kb non-processed pseudogene fragment related to IFB12 (Fig. 1D, duplications 2 and 3). As RSU1 and IFB12 have been mapped to 10p13 and 10p12, respectively, it is likely that the identity between these cDNAs and the 10q11 genomic sequence is a result of one or two further intrachromosomal duplication events. This is supported by the fact that a FISH probe from this region hybridizes both to 10q11 and to 10p13 on human metaphases [y738F9-R (32)]. Thus, a minimum of three independent events appear to have resulted in the intrachromosomal duplication of >370 kb of sequence within a 650 kb region immediately distal of the satellite repeats (Fig. 1D).

Satellite arrays are separated by pericentromerically duplicated sequences
To identify all known paralogous sequences within the human genome we queried the non-redundant EMBL (EMBL-NR) database with a masked version of the 10q11 sequence (see Materials and Methods). We discarded all hits <2 kb long, together with three hits which consisted almost entirely of L1 elements and may have been formed by transposition rather than pericentromeric duplication (41). A total of 57 hits within 36 clones or contigs were identified which varied in length from ~2.3 to ~56 kb. The distribution of these paralogues relative to the 10q11 sequence is shown graphically in Figure 3A and details of all paralogous sequences are given in Table 4. A number of clones have complex patterns of sequence identity to 10q11 with up to eight distinct paralogous domains in a single bacterial artificial chromosome (BAC) clone (Table 4, clone 4). We have used PCR to confirm that this is not due to rearrangement of the 10q11 sequence during cloning or propagation (see legend to Fig. 3).



View larger version (28K):
[in this window]
[in a new window]
 
Figure 3. Distribution of paralogous sequences related to 10q11. The position of each BLAST hit >2 kb in length within the EMBL-NR database is represented by a horizontal line. Both the extent of the BLAST hit within the 10q11 sequence (x-axis) and the percentage identity with the 10q11 sequence (y-axis) are indicated. Deletions and insertions within individual sequence matches are not shown. The summary map (Fig. 1D) is shown for reference. (A) Distribution of paralogous sequences within the first 1000 kb of the 10q11 sequence. (B) Expansion of the proximal 250 kb of the 10q11 sequence illustrated in (A), showing the positions of 10q11 tandem repeats (Table 3) and repeats at the termini of paralogous sequences. Numbers refer to individual alignments (Table 4). In some cases there are multiple domains within the same clone, often in different orientations (Table 4). To confirm that these domain termini do not represent cloning artefacts, PCR primers were designed to amplify across 18 termini of paralogues within the 10q11 sequence (Fig. 3B and Table 4, alignments 1, 2 and 3). All but one primer pair generated products of the expected size both in YACs derived from 10q11 and in total human DNA with the remaining pair failing to amplify from any template (see Materials and Methods for primer details).

 

View this table:
[in this window]
[in a new window]
 
Table 4. Paralogues of 10q11 sequences within the EMBL-NR database
 
The majority of clones containing sequences paralogous to 10q11 map to the pericentromeric regions of other human chromosomes including 2p11, 16p11.2, 19p12–q12, 21q11.1 and 22q11.1–11.2 (Table 4), suggesting that the sequence identity is a result of pericentromeric-directed duplication events. This is confirmed by comparative FISH analyses which demonstrate that these sequences map to different pericentromeric locations in different primate species (32, and http://bioserver.uniba.it/fish/Cytogenetics/10q11-examples/10q11.html ). However, hits to the EMBL-NR database are not randomly distributed within the 10q11 sequence. A total of 32 of the 57 paralogous domains match sequences within the proximal 240 kb of sequence (Fig. 3A). Furthermore, these are longer (mean = 16.89 kb), and show higher sequence identity to 10q11 (mean = 94.3%), than paralogues of more distal sequences (means = 7.98 kb and 82.43%, respectively). This clustering of paralogues with high sequence identity to 10q11 within the proximal 250 kb suggests that the distribution of these sequences may be related to the satellite repeats in this part of the sequence. There is also a second, less distinct, cluster of paralogues between the ZNFB duplication and the IFB12 pseudogene fragment (530–580 kb) where six genomic sequences show >85% identity to the 10q11 sequence.

Paralogous sequences often terminate at or within tandem repeats
To investigate the association of paralogous sequence with satellite repeats in more detail we looked for sequence features at the termini of all 57 paralogous domains identified within the EMBL-NR database (see Materials and Methods). A total of 20 domains terminate within interspersed repeats at one or both ends (data not shown). However, as ~48% of the 10q11 sequence is accounted for by interspersed repeats (Fig. 1B), the significance of this is unclear. Only four domains have one or both ends precisely defined by a repetitive element, implying a possible mechanistic involvement of the repeat in their formation. What is more striking is that 24 paralogous domains terminate in a tandem repeat at one or both ends. Of these, 21 share sequence identity with the proximal 250 kb and are shown in detail in Figure 3B. In 10 cases, one terminus is defined by a tandem repeat on chromosome 10 only (Fig. 3B, vertical bars). In a further eight cases, one terminus falls within a tandem repeat which is present on both chromosomes, whereas in three cases both termini fall within two different tandem repeat families (alignments 1C, 5B and 9). No one repeat is exclusively found at the termini, with paralogues terminating within the 22 bp GC-rich repeat, the 4 bp TTCC repeat, and the 5 bp classical satellites II and III. Furthermore, in alignment 6, one terminus falls within a GC-rich repeat on chromosome 10 and a CT-rich repeat on chromosome 21 (Fig. 3B). In several cases a tandem repeat is present within a domain of paralogy (1E, 2, 4D, 3B and 11).

Identification of a boundary between multi-copy pericentromeric sequences and chromosome-specific DNA
Because of the limited pericentromeric sequence data within the EMBL-NR database we needed to establish whether the skewed distribution of paralogous sequences (Fig. 3) is due to ascertainment bias. To do this we developed a panel of 74 sequence tagged sites (STSs) and established the distribution of 10q11-related sequences within the human genome using a monochromosomal somatic cell hybrid panel (see Materials and Methods). A number of paralogous STSs identified in this way were sequenced to assess levels of divergence from the 10q11 sequence. The results of these analyses are presented in Figure 4B.



View larger version (54K):
[in this window]
[in a new window]
 
Figure 4. Chromosomal distribution of 10q11-related STSs and satellite families. (A) Chromosomal distribution of satellite 3, satellite 2 and CAGGG repeats, as assayed by FISH (24,42). The presence of the satellite on each chromosome is represented by a filled box. (B) STSs used to screen the monochromosomal somatic cell hybrid panel are shown in linear order with the centromeric end of the sequence at the top. The position of each within the 10q11 sequence is given on the y-axis to the nearest kilobase ranging from 6 (S6) to 980 (RET) kb. The human chromosome present within each monochromosomal somatic cell hybrid is indicated on the x-axis (1 to Y). Amplification of each STS is indicated with blue boxes: dark blue boxes indicate that the PCR product was sequenced (GenBank accession nos AJ290103AJ290167) and used to calculate the percentage divergence of paralogues from the 10q11 sequence (% DIV) (see Materials and Methods). The positions of the four largest satellite arrays (Sat 2, CAGGG and Sat 3) are also indicated by coloured horizontal bars. For details of primers, hybrids and STS sequencing, see Materials and Methods.

 
All but one of the 28 STSs from the proximal 220 kb amplify PCR products of the expected size from between 3 and 14 monochromosomal hybrids (mean = 6.78). Alignment of DNA sequence generated from the PCR products amplified from these hybrids indicates that all of the paralogous sequences across the entire 220 kb are closely related to the 10q11 sequence, with paralogous STSs diverging from the 10q11 sequence by between 1.2 and 5.7%. This result is consistent with the high identities observed between paralogues of this region identified within the EMBL-NR database (Fig. 3).

In sharp contrast to the results from the proximal 220 kb, 35 of the 46 STSs distal to S214 are specific to chromosome 10 with the remaining 11 STSs amplifying a product from between three and six monochromosomal hybrids. Sequence analysis of PCR products from these STSs suggest that paralogues in this distal region may be more distantly related to the 10q11 sequence, with eight of nine paralogous STSs diverging from the 10q11 sequence by 5.0–9.6%. The small sample size makes it unclear whether this increased divergence relative to the proximal 220 kb is significant. However, it is noteworthy that three of these STSs (S528, WME31-R and S557) map within the distal cluster of paralogues identified within the EMBL-NR database which are also more diverged relative to the 10q11 sequence (Fig. 3A). A further five of the STSs from the distal region which amplify products from other chromosomes (S679, S680, S702, S705 and S713) lie within the KIAA0187 gene (666 675–713 708 bp). Paralogous fragments of this gene have already been identified within the immunoglobulin lambda and cat-eye syndrome regions on chromosome 22 (19) (Table 4, alignments 35B and 37). Thus, the results of the STS analysis are totally consistent with the BLAST results (Fig. 3) and indicate that an abrupt discontinuity exists between satellite-rich interchromosomally duplicated sequences and chromosome-specific sequences.

The genomic distribution of paralogous STSs is associated with the distribution of specific satellite families
To investigate the link between pericentromeric duplications and satellite sequences on other chromosomes we have analysed the chromosomal distribution of paralogues of the 10q11 STSs in relation to the chromosomal distribution of linked satellites. The basic distribution of the satellite 2, satellite 3 and the CAGGG repeat families, as assayed by FISH (24,42), is shown in Figure 4A. The distributions of these repeat families overlap extensively, with chromosomes 1, 2, 9, 10, 15, 16, 17 and 22 being positive for two of the three families and nine chromosomes being negative for all three families. However, what is more striking is the association between the chromosomal distribution of paralogues of STSs within the proximal 250 kb and the distribution of these satellites. Chromosomes which lack these satellites (chromosomes 3, 4, 6, 8, 11, 12, 18, 19 and X) are positive for only zero to five 10q11 STSs (mean = 1.9) compared with an average of ~14 for chromosomes which contain two or more of the satellite families analysed (chromosomes 1, 2, 9, 14–17 and 22). There are some exceptions to this association; for example, chromosome 5 contains satellite 3 sequences but is negative for all but one of the 10q11 STSs. Despite this, the association between the chromosomal distribution of each of these repeats with the distribution of paralogous STSs is highly significant using Fisher’s exact test (P < 2 x 10–4 for each satellite when chromosome 10 is excluded from the analysis; see Materials and Methods). No association could be detected with alpha satellite suprafamily organization (43).


    DISCUSSION
 TOP
 ABSTRACT
 INTRODUCTION
 RESULTS
 DISCUSSION
 MATERIALS AND METHODS
 REFERENCES
 
We have generated ~1025 kb of contiguous genomic sequence obtained from three YAC clones which link pericentromeric satellite repeats in 10q11.1 to the RET proto-oncogene in 10q11.2. We have confirmed the fidelity of the data using PFGE (see Materials and Methods), by analysing >230 kb of overlapping sequence, and by PCR analyses of the termini of domains of paralogy in 10q11 (Fig. 3B). The global features of the sequence conform to our expectations for the transition from a centromeric G-dark band (10q11.1) to a G-light band (10q11.2). First, the GC content rises steadily from ~39.5% at the centromeric satellites to >55% at RET. A similar rise in GC content occurs within the proximal 2 Mb of the chromosome 22q sequence (19). Second, our data indicate an excess of both LINEs and retroviral LTRs within the sequence (Fig. 1B) which supports the conclusion drawn from low resolution FISH-based analyses (38) and limited sequence data (44) that human pericentromeric regions are rich in interspersed repeats. However, this excess is modest compared with the over-representation of transposable elements close to heterochromatin in Drosophila (9) and to the centromeres of chromosomes 2 and 4 in Arabidopsis (4,5), suggesting that regional variations in transposable element distribution may not be as marked within the human genome as they are in other eucaryotes.

Identification of two pericentromeric sequence domains
The most striking aspect of the sequence is the non-random distribution of satellite repeats, pericentromeric-directed duplications and intrachromosomal duplications. Satellites and other repeats are located almost exclusively within the proximal 200 kb and are separated by complex sequences which have numerous paralogues within the genome. The predominantly pericentromeric location of these paralogous sequences (Table 4), and comparative FISH analyses (32, and http://bioserver.uniba.it/fish/Cytogenetics/10q11-examples/10q11.html ), indicate that they have expanded in copy number due to recent pericentromeric-directed duplication events. The fact that most of these paralogues terminate at or within tandem repeats (Fig. 3B), and that few are found outside the satellite-rich region (Fig. 3A), suggests that specific repeats are directly responsible for the rapid and centromere-specific evolution of these sequences, a hypothesis supported by the correlation between the genomic distribution of paralogues of 10q11 sequences and the genomic distribution of satellites 2 and 3 and the CAGGG repeat (Fig. 4).

This sequence organization is very similar in structure and evolutionary dynamics to the sequences derived from Xq28 and 4q24 which abut onto the alpha satellite array on chromosome 16p11 (20). This, together with the identification of pericentromerically duplicated ALD- and NF1-related sequences close to satellite arrays on chromosome 22 (19) and the large number of pericentromerically duplicated sequences that have been analysed at the cytogenetic level (18), strongly suggest that tracts of rapidly evolving complex sequences may be interspersed between satellite arrays on most, if not all, human chromosomes. At the sequence level, the identification of repeats defining the termini of paralogous sequence domains (Fig. 3B) is also consistent with the presence of repeats at the termini of ALD paralogues (21) and the identification of the GC-rich CAGGG and HSREP522 repeats at the termini of pericentromeric duplications on chromosomes 16, 21 and 22 (24). The similarity of the CAGGG repeat to immunoglobulin switch regions (45) has recently led to the suggestion that a specific subset of human repeats may be responsible for interchromosomal duplication (24). However, the wide spectrum of repeats found at the termini of paralogous domains in 10q11, including CT-rich sequences and AT-rich classical satellites (Fig. 3B), indicates that the ability to promote interchromosomal duplication may be a function of repetition per se.

In sharp contrast to sequences within the proximal 240 kb, most of the distal 780 kb of sequence is chromosome specific. However, the existence of a second, distal, pericentromeric sequence domain is implied by the clustering of >370 kb of intrachromosomally duplicated DNA within an ~600 kb region distal to the satellites. The inverted orientation of the ZNF and D10S141 duplications (32) and the fact that the RSU1 and IFB12 genes map to 10p12–10p13 suggest that these sequences have been subjected to a minimum of three independent intrachromosomal duplication events. Furthermore, the fact that these duplications do not contain repeats and are relatively stable components of primate genomes (32) indicates that they are behaviourally distinct from the more proximal interchromosomal duplications. It is possible that these have been produced by rearrangements initiated within the proximal satellites but resolved within more distal, single copy sequences. If so, the second cluster of diverged paralogues which lie between the ZNF and IFB12 duplications (Fig. 3A, 530–580 kb) could represent pericentromerically duplicated sequences that have become isolated from satellites during the formation of the ZNFB or IFB12 duplicated sequences. Data from similar duplications on chromosomes 1, 2, 9 and 21 (26–29) will establish whether distinct pericentromeric domains of intrachromosomally duplicated sequences are common features of human chromosomes.

Identification of a boundary between heterochromatin and euchromatin
Because the sequence presented here includes both pericentromeric satellite arrays and highly expressed genes, it is presumed to contain both heterochromatic and euchromatic sequences (31,32). Although the distinction between heterochromatin and euchromatin is cytological, data from other eucaryotes indicate that it can reflect basic differences in underlying sequence organization (1–5). The well established association of satellite repeats with heterochromatin (1–8) and the fact that repeats per se can induce gene silencing (14,15) strongly suggest that the proximal satellite sequences in 10q11, and the intervening duplicated sequences, would be heterochromatic in nature. We would therefore predict that they are gene poor. This appears to be the case as we can identify only a single EST cluster proximal to the satellite 3 array in addition to four pseudogenes and a single gene fragment (Tables 1 and 2).

If this interpretation of our data is correct, it also predicts that equivalent regions of other human chromosomes will have similar structural and transcription profiles. The only extensive comparable data currently available are from chromosome 22 where two satellite arrays are present within the most centromeric 1 Mb of sequence (19). This sequence contains pericentromerically duplicated ALD and NF1 paralogues and only 4 of 37 STSs from this region were found to be chromosome specific during map construction, suggesting that this sequence is comparable in structure and content to the proximal 250 kb of the 10q11 sequence. A total of 37 gene features have been annotated within this 1 Mb of sequence, 30 of which are proximal to a satellite II array (19, http://www.sanger.ac.uk/cgi-bin/cwa/22cwa.pl ). As in 10q11, most of these are based on identity to known proteins with no associated ESTs showing >96% identity to the sequence, although in contrast to the 10q11 data there are also 11 gene features based on EST matches. However, seven of these ESTs/EST clusters are contiguous with genomic DNA and have no associated gene structure (ap000523.3, ap000525.2, ap000525.7, ap000527.2, ap000528.1, ap000529.2 and ap000534.2), a further three have short predicted ORFs (115–130 amino acids) within unspliced ESTs (ap000525.5, ap000525.6 and ap000527.1) whereas the last is identified by a single spliced EST (AA725634) which contains a paralogous fragment of the KIAA0187 gene from 10q11 (ap000526.1). Whether these are bona fide genes will, therefore, require experimental confirmation. This may be a laborious task as the multicopy nature of pericentromeric sequence means that hybridization and PCR-based techniques alone cannot provide definitive proof of transcription, a problem already noted during transcriptional analysis of the subtelomeric region of 4q (46). The chromosome 22 data are, therefore, currently consistent with the interpretation that sequences proximal to pericentromeric satellite arrays consist of pericentromerically duplicated DNA which is gene poor and heterochromatic.

The transcriptional analysis also indicates that sequences which lie distal to the satellite arrays are expressed. The identification of genes within complex eucaryotic genomes is prone to errors, making it is difficult to establish definitively the transcriptional profile of large genomic regions (47,48). Using conservative criteria for gene identification we have confirmed the presence of three genes all of which have been identified previously: ZNF11/33B (49), KIAA0187 (50) and RET (51). All other gene-related features consist of transcripts with no clear gene structure, non-spliced ESTs, or gene fragments identified through protein database searches with no associated ESTs. Whereas we cannot formally rule out the possibility that some, or all, of these features correspond to bona fide genes, it is also possible that they represent a mixture of cDNA library artefacts, short aberrant or repeat-associated transcripts, and recently formed pseudogene fragments. Thus, although sequences distal of the satellites are expressed, the region appears to be gene poor.

The lack of transcripts within the satellite-rich proximal 200 kb, and the position of the most proximal confirmed transcript (~289 kb) and gene (ZNF11/33B, ~475 kb), suggests that there may be a single boundary between heterochromatin and euchromatin within the 10q11 sequence presented here. The most likely location for this is between the satellite III array at ~200 kb and the transcript at ~289 kb. Interestingly, this approximately defines the transition from sequences prone to interchromosomal duplication and sequences which have been involved in intrachromosomal duplication. However, either FISH-based techniques (52) or chromatin immunoprecipitation (53) will be required to confirm that this sequence discontinuity corresponds to a boundary between chromatin types. This will be particularly important in light of the discovery that heterochromatin can extend several hundred kilobases beyond the repeats that are presumed to be a pre-requisite for its formation (11).

Does pericentromeric duplication have a function?
The gene-related nature of many pericentromeric duplications (17,20–23), together with the established role of telomere plasticity in generating antigen variation in some primitive eucaryotes (54), has led to the proposal that rapidly evolving pericentromeric sequences may be an important source of novel genes (17,21). The data presented here suggest that these sequences may be heterochromatic, making it unlikely that pericentromeric rearrangement will lead directly to the formation of new genes. However, if gene formation is not the primary function of pericentromerically duplicated sequences, then what is?

The most straightforward answer to this question is that these sequences are of no adaptive significance whatsoever and exist due to the combined action of two well characterized phenomena. First, centromeric satellites are known to evolve rapidly by concerted evolution with frequent sequence exchange between repeat arrays on non-homologous chromosomes (55). Second, duplications of euchromatic material into centric heterochromatin have been reported as cytogenetically visible variants within the human population (56, and references therein) and may represent the first step in pericentromeric-directed duplication events (20). When such a chance event occurs, the euchromatin-derived sequence may be prone to interchromosomal transfer through exchange events which initiate within flanking satellite sequences. Pericentromeric duplication may, therefore, simply represent the transient intermediates of satellite homogenization. More functionally based proposals include a role in chromosome pairing [the only established function of pericentromeric satellites outside centromere activity is in maintaining alignment between homologous chromosomes during achiasmate meioses in Drosophila (57)], and a role as spacers between satellites and genes to guard against centromeric position effects (20).

Although plausible, none of these hypotheses can account for the lack of equivalent sequences within the heterochromatin of Arabidopsis (4,5) or Drosophila (9) indicating that more elaborate possibilities warrant consideration. The identification of a growing number of proteins that bind to satellite sequences (58,59) and the fact that the relocation of sequences towards centromeric nuclear domains can be involved in gene silencing (60,61) suggest that satellite sequences are involved in complex patterns of nuclear compartmentalization (62,63) and gene expression (64,65). It is possible that ‘pseudo-euchromatic’ sequences between satellites help to define the position of functionally distinct silencing domains within the nucleus, and could even provide a means to dynamically alter their position through changes in chromatin conformation. Alternatively, the facts that heterochromatic sequences in Arabidopsis and Drosophila are rich in transposons (9,10) relative to the equivalent regions in humans (Fig. 1B), and that high frequencies of element movement can significantly reduce fitness (67), raise the possibility that pericentromeric duplication represents a novel mechanism which has evolved in repeat-rich genomes to modulate transposition frequency (by providing pseudo-euchromatic pericentromeric sites where insertions will not be deleterious) and/or element density (through a continual process of deletion and renewal of recombinationally deficient pericentromeric sequences). The centromere-directed nature of the burst of retroviral transposition reported within an interspecific marsupial hybrid (68) could be accounted for by such a mechanism.

The data presented here strongly suggest that the pericentromeric sequences in 10q11 are organized into two domains based both on the presence or absence of tandem repeat arrays and on the type of rearrangement to which they are prone. Furthermore, transcriptional activity suggests that the boundary between these domains approximately defines a transition between pericentromeric heterochromatin and euchromatin. This raises the possibility that most, if not all, human chromosomes will exhibit similar organizational features. Realistic short-term aims will be to extend the sequence data presented here to the centromeric alphoid array (D10Z1), to develop equivalent data from other centromeres both in man and other mammals for comparative analyses, and to experimentally confirm the existence of the abrupt changes in chromatin state implied here.


    MATERIALS AND METHODS
 TOP
 ABSTRACT
 INTRODUCTION
 RESULTS
 DISCUSSION
 MATERIALS AND METHODS
 REFERENCES
 
YAC subcloning, sequencing and analysis
The restriction map of each YAC DNA preparation used for cloning was analysed by PFGE to ensure that no rearrangements occurred during culture. YAC DNA was isolated, shotgun cloned and sequenced as described previously (33,34). The restriction map of each finished sequence was compared with the existing PFGE map and found to be in agreement with the exception that four BssHII sites and two MluI sites within the PFGE map correspond to two or more closely spaced sites in CpG islands within the finished sequence (data not shown). This is presumed to be due to the low resolution of the PFGE map. The finished sequence was subjected to the standard Sanger Centre automated analyses (http://www.sanger.ac.uk/HGP/Humana/human_analysis.shtml ) and imported into an AceDB database (http://www.sanger.ac.uk/HGP/Humana/ACE.shtml ) to allow interactive interpretation of results. In addition, the sequence was split up into overlapping 50 and 100 kb sections and analysed using NIX (69) which provides an interactive graphical interface to simultaneously query the results of multiple gene prediction and database search programs. Ab initio gene predictions were obtained using Grail2, Fgene, Genefinder, Fgenes, Genscan and HMMGene (70–72). Interspersed and tandem repeats were identified using Repeatmasker (http://repeatmasker.genome.washington.edu/cgi-bin/RM2_req.pl ) and Tandem Repeat Finder (73). Similarities to existing genomic, EST and protein sequences were identified by using repeat-masked 10q11 sequence to query Swissprot, TREMBL and EMBL databases with the BLAST family of programs (74,75). The overlap between y738F9 and y214H10 was aligned using Blast 2 sequences (76) whereas cDNA sequences were aligned to genomic DNA using est_genome available through the UK HGMP Resource Centre. The GC content and distribution of interspersed repeats was established by using RepeatMasker to analyse overlapping 20 kb sequence files with a 16 kb overlap which were generated using in-house software. The tabulated summary from each analysis was then transferred to a single text file using in-house software and analysed using Microsoft Excel.

RT–PCR analysis
A panel of eight cDNAs derived from adult tissues (Clontech, Palo Alto, CA) were analysed according to the manufacturer’s recommendations. Primers were not generated from two ESTs due to the presence of interspersed repeats (N89814 and HS1251874). All primers used in the analysis can be viewed at http://www.ncl.ac.uk/human.genetics/rtpcr.html or obtained from the corresponding author.

Analysis of paralogous sequences
Each genomic clone containing paralogous sequences was compared with the 10q11 sequence using DOTTER (77). Regions of high identity were then aligned using BESTFIT (78) to estimate the percentage identity between the sequences. Where complex patterns of alignment were identified within a single clone each alignment was considered independently except when the discontinuity in alignment was clearly due to the insertion/deletion of a transposable element. Sequences longer than 32 kb were split prior to alignment with an average percentage identity being calculated from the individual alignments after correction for alignment length. The percentage identity between two sequences is defined as the number of matching nucleotides divided by the total alignment length (excluding insertions and deletions). The percentage similarity is the number of matching nucleotides divided by the sum of the number of matching nucleotides and the number of insertions and deletions. Features at the termini of each alignment (defined by BESTFIT) were identified by analysing the output of RepeatMasker and Tandem Repeat Finder. All primers used to verify the integrity of domain termini can be viewed at http://www.ncl.ac.uk/human.genetics/paralogues.html or obtained from the corresponding author.

Somatic cell hybrid analysis
The Human Monochromosomal Hybrid Panel available from the UK HGMP Resource Center (79) was used to establish the chromosomal distribution of 10q11 STSs. The chromosome 8 hybrid in this panel also contains the pericentromeric region of chromosome 22. Paralogous STSs amplified from this hybrid were mapped further using two additional hybrids which contain chromosomes 8 (NA10156B) and 22 (NA10888) in isolation (80). A number of STSs amplified from the chromosome 20 hybrid (GM10478) were found to share 100% identity to the 10q11 sequence. None of these STSs amplified a product from a second chromosome 20 hybrid, NA13140 (80), suggesting that the GM10478 hybrid retains a cytogenetically cryptic chromosomal fragment from 10q11. As a result, chromosome 20 was recorded as being negative for the markers involved. The association of STS distribution with satellite repeats was analysed by summing the number of paralogous STSs amplified from chromosomes which are positive for each satellite and the number amplified from chromosomes negative for each satellite and comparing these two totals with the expected totals if a random distribution is assumed (chromosome 10 was excluded from the analysis as all STSs are derived from this chromosome). Independent analysis of each satellite families gave a P value of <2 x 10–4 in all three cases (Fisher’s exact test). All primers used in the analysis can be viewed at http://www.ncl.ac.uk/human.genetics/sts.html or obtained from the corresponding author.

STS sequencing
PCR products were amplified from somatic cell hybrids (50 ng) and purified using Qiaquick PCR purification kits (Qiagen, Hilden, Germany) according to the manufacturer’s instructions except that DNAs were eluted in water. Approximately 100 ng of template was used for each sequencing reaction and all sequencing reactions were performed using an ABI PRISM BigDye cycle sequencing kit according to the manufacturer’s instructions (PE Applied Biosystems, Foster City, CA) and analysed using an ABI377 (PE Applied Biosystems). Each template was sequenced in both orientations. Despite this, a significant number of templates failed to give unambiguous sequence data, suggesting that multiple paralogues of some STSs are present on some human chromosomes. High quality sequence obtained in both orientations was aligned to the 10q11 sequence and divergences calculated using the Megalign software package (DNAstar, Madison, WI). Where more than one paralogous STS was available, each sequence was aligned to the 10q11 sequence separately and the mean divergence was taken.


    ACKNOWLEDGEMENTS
 
We thank T. Strachan for critical reading of the manuscript. The financial support of the AIRC, Telethon (grant E.672), Wellcome Trust (grant 049859) and EC (contract BMH4-CT97-2433) is gratefully acknowledged, as are a short-term fellowship from EMBO (L.V.) and studentships from the MRC (T.H.). M.S.J. is a Royal Society University Research Fellow.


    FOOTNOTES
 
+ To whom correspondence should be addressed. Tel: +44 191 222 8005; Fax: +44 191 222 6662; Email: mjackson@hgmp.mrc.ac.uk Back


    REFERENCES
 TOP
 ABSTRACT
 INTRODUCTION
 RESULTS
 DISCUSSION
 MATERIALS AND METHODS
 REFERENCES
 
1 Holmquist, G.P., Kapitonov, V.V. and Jurka, J. (1998) Mobile genetic elements, chiasmata, and the unique organization of beta-heterochromatin. Cytogenet. Cell Genet., 80, 113–116.[Web of Science][Medline]

2 Csink, A.K., Sass, G.L. and Henikoff, S. (1997) Drosophila heterochromatin: retreats for repeats. In van Driel, R. and Otte, A.P. (eds), Nuclear Organization, Chromatin Structure, and Gene Expression. Oxford Universiy Press, Oxford, UK, pp. 223–235.

3 Henikoff, S. (1990) Position effect variegation after 60 years. Trends Genet., 6, 422–426.[Web of Science][Medline]

4 Milot, E., Strouboulis, J., Trimborn, T., Wijgerde, M., deBoer, E., Langerveld, A., Tan-Un, K., Vergeer, W., Yannoutsos, N., Grosveld, F. and Fraser, P. (1996) Heterochromatin effects on the frequency and duration of LCR-mediated gene transcription. Cell, 87, 105–114.[Web of Science][Medline]

5 Kleinjan, D.-J. and van Heyningan, V. (1998) Position effect in human genetic disease. Hum. Mol. Genet., 7, 1611–1618.[Abstract/Free Full Text]

6 Choo, K.H.A. (1998) Why is the centromere so cold? Genome Res., 8, 81–82.[Free Full Text]

7 Gatti, M. and Pimpinelli, S. (1992) Functional elements in Drosophila melanogaster heterochromatin. Annu. Rev. Genet., 26, 239–275.[Web of Science][Medline]

8 Lin, X.Y., Kaul, S.S., Rounsley, S., Shea, T.P., Benito, M.I., Town, C.D., Fujii, C.Y., Mason, T., Bowman, C.L., Barnstead, M. et al. (1999) Sequence and analysis of chromosome 2 of the plant Arabidopsis thaliana. Nature, 402, 761–772.[Medline]

9 Adams, M.D., Celniker, S.E., Holt, R.A., Evans, C.A., Gocayne, J.D., Amanatides, P.G., Scherer, S.E., Li, P.W, Hoskins, R.A. et al. (2000) The genome sequence of Drosophila melanogaster. Science, 287, 2185–2195.[Abstract/Free Full Text]

10 Sun, X., Wahlstrom, J. and Karpen, G. (1997) Molecular structure of a functional Drosophila centromere. Cell, 91, 1007–1019.[Web of Science][Medline]

11 Fransz, P., Armstrong, S., de Jong, J.H., Parnell, L.D., vanDrunen, C., Dean, C., Zabel, P., Bisseling, T. and Jones, G. (2000) Integrated cytogenetic map of chromosome arm 4S of A. thaliana: structural organization of heterochromatic knob and centromere region. Cell, 100, 367–376.[Web of Science][Medline]

12 The Cold Spring Harbor Laboratory, Washington University Genome Sequencing Center, and PE Biosystems Arabidopsis Sequencing Consortium (2000) The complete sequence of a heterochromatic island from a higher eukaryote. Cell, 100, 377–386.[Web of Science][Medline]

13 Ananiev, E.V., Philips, R.L. and Rines, H.W. (1998) Complex structure of knob DNA on maize chromosome 9. Retrotransposon invasion into heterochromatin. Genetics, 149, 2025–2037.[Abstract/Free Full Text]

14 Garrick, D., Fiering, S., Martin, D.I.K. and Whitelaw, E. (1998) Repeat-induced gene silencing in mammals. Nature Genet., 18, 56–59.[Web of Science][Medline]

15 Selker, E.U. (1999) Gene silencing: repeats that count. Cell, 97, 157–160.[Web of Science][Medline]

16 Lee, C., Wevrick, R., Fisher, R.B., Ferguson-Smith, M.A. and Lin, C.C. (1997) Human centromeric DNAs. Hum. Genet., 100, 291–304.[Web of Science][Medline]

17 Eichler, E.E. (1998) Masquerading repeats: paralogous pitfalls of the human genome. Genome Res., 8, 758–762.[Free Full Text]

18 Murphy, T.D. and Karpen, G.H. (1998) Centromeres take flight: alpha satellite and the quest for the human centromere. Cell, 93, 317–320.[Web of Science][Medline]

19 Dunham, I., Shimizu, N., Roe, B.A., Chissoe, S., Dunham, I., Hunt, A.R., Collins, J.E., Bruskiewich, R., Beare, D.M., Clamp, M. et al. (1999) The DNA sequence of human chromosome 22. Nature, 402, 489–495.[Medline]

20 Horvath, J.E., Viggiano, L., Loftus, B.J., Adams, M.D., Archidiacono, N., Rocchi, M. and Eichler, E.E. (2000) Molecular structure and evolution of an alpha satellite non-alpha satellite junction at 16p11. Hum. Mol. Genet., 9, 113–123.[Abstract/Free Full Text]

21 Eichler, E.E., Budarf, M.L., Rocchi, M., Deaven, L.L., Doggett, N.A., Baldini, A., Nelson, D.L. and Mohrenweiser, H.W. (1997) Interchromosomal duplications of the adrenoleukodystrophy locus: a phenomenon of pericentromeric plasticity. Hum. Mol. Genet., 6, 991–1002.[Abstract/Free Full Text]

22 Regnier, V. Meddeb, M., Lecointre, G., Richard, F., Duverger, A., VanCong, N., Dutrillaux, B., Berheim, A. and Danglot, G. (1997) Emergence and scattering of multiple neurofibromatosis (NF1)-related sequences during hominoid evolution suggests a process of pericentromeric interchromosomal transposition. Hum. Mol. Genet., 6, 9–16.[Abstract/Free Full Text]

23 Zimonzjic, D.B., Kelley, M.J., Rubin, J.S., Aaronson, S.A. and Popescu, N.C. (1997) Fluorescence in situ hybridization analysis of keratinocyte growth factor gene amplification and dispersion in evolution of great apes and humans. Proc. Natl Acad. Sci. USA, 94, 11461–11465.[Abstract/Free Full Text]

24 Eichler, E.E., Archidiacono, N. and Rocchi, M. (1999) CAGGG repeats and the pericentromeric duplication of the hominoid genome. Genome Res., 9, 1048–1058.[Abstract/Free Full Text]

25 Tunnacliffe, A., Liu, L., Moore, J.K., Leversha, M.A., Jackson, M.S., Papi, L., Ferguson-Smith, M.A., Thiesen, H.-J. and Ponder, B.A.J. (1993). Duplicated KOX zinc finger gene clusters flank the centromere of chromosome 10: evidence for a pericentric inversion during primate evolution. Nucleic Acids Res., 21, 1409–1417.[Abstract/Free Full Text]

26 Hardas, B.D., Zhang, J., Trent, J.M and Elder, J.T. (1994) Direct evidence for homologous sequences on the paracentric regions of human chromosome 1. Genomics, 21, 359–363.[Web of Science][Medline]

27 Edwards, Y.H., Putt, W., Fox, M. and Ives, J.H. (1995) A novel human phosphoglucomutase (PGM5) maps to the centromeric region of chromosome 9. Genomics, 30, 350–353.[Web of Science][Medline]

28 Maresco, D.L., Chang, E., Theil, K.S., Francke, U. and Anderson, C.L. (1996) The three genes of the human FCGR1 gene family encoding Fc{gamma}R1 flank the centromere of chromosome 1 at 1p12 and 1q21. Cytogenet. Cell Genet., 73, 157–163.[Web of Science][Medline]

29 Nothwang, H.D., Rensing, C., Kubler, M., Denich, D., Brandl, B., Stubanus, M., Haaf, T., Kurnit, D. and Hildebrandt, F. (1998) Identification of a novel ran binding protein 2 related gene (RANBP2L1) and detection of a gene cluster on human chromosome 2q11-q12. Genomics, 47, 383–392.[Web of Science][Medline]

30 Jackson, M.S., Slijepcevic, P. and Ponder, B.A.J. (1993). The organisation of repetitive sequences in the pericentromeric region of human chromosome 10. Nucleic Acids Res., 21, 5865–5874.[Abstract/Free Full Text]

31 Jackson, M.S., See, C.G., Mulligan, L.M. and Lauffart, B.F. (1996) A 9.75-Mb map across the centromere of human chromosome 10. Genomics, 33, 258–270.[Web of Science][Medline]

32 Jackson, M.S., Rocchi, M, Thompson, G., Hearn, T., Crosier, M., Guy, J., Kirk, D., Mulligan, L., Ricco, A., Piccininni, S. et al. (1999) Sequences flanking the centromere of human chromosome 10 are a complex patchwork of arm-specific sequences, stable duplications, and unstable sequences with homologies to telomeric and other centromeric locations. Hum. Mol. Genet., 8, 205–215.[Abstract/Free Full Text]

33 Vaudin, M., Roopra, A., Hillier, L., Brinkman, R., Sulston, J., Wilson, R.K. and Waterston, R.H. (1995) The construction and analysis of M13 libraries prepared from YAC DNA. Nucleic Acids Res., 23, 670–674.[Abstract/Free Full Text]

34 C. elegans Sequencing Consortium (1998) Genome sequencing of the nematode C. elegans: a platform for investigating biology. Science, 282, 2012–2018.[Abstract/Free Full Text]

35 Neil, D.L. Villasante, A., Fisher, R.B., Vetrie, D., Cox, B. and Tyler-Smith, C. (1990) Structural instability of human tandemly repeated DNA sequences clones in yeast artificial chromosome vectors. Nucleic Acids Res., 18, 1421–1428.[Abstract/Free Full Text]

36 Korenberg, J. and Engels, W.R. (1978) Base ratio, DNA content and quinicrine-brightness of human chromosomes. Proc. Natl Acad. Sci. USA, 75, 3382–3386.[Abstract/Free Full Text]

37 Smit, A.F.A. (1999) Interspersed repeats and other mementos of transposable elements in mammalian genomes. Curr. Opin. Genet. Dev., 9, 657–663.[Web of Science][Medline]

38 Korenberg, J.R. and Rykowski, M.C. (1988) Human genome organisation: alu, lines, and the molecular structure of metaphase chromosome bands. Cell, 53, 391–400.[Web of Science][Medline]

39 Devilee, P., Kievits, T., Waye, J.S., Pearson, P.L. and Willard, H.F. (1988) Chromosome-specific alpha satellite DNA: isolation and mapping of a polymorphic alphoid repeat from human chromosome 10. Genomics, 3, 1–7.[Medline]

40 Borden, P., Jaenichen, R. and Zachau, H. (1990) Structural features of transposed human Vk genes and implications for the mechanism of their transposition. Nucleic Acids Res., 18, 2101–2107.[Abstract/Free Full Text]

41 Moran, J.V., DeBerardinis, R.J. and Kazazian, H.H. (1999) Exon shuffling by L1 retrotransposition. Science, 283, 1530–1534.[Abstract/Free Full Text]

42 Taggaro, I., Fernandez-Peralta, A.M. and Gonzalez-Aguilera, J.J. (1994) Chromosomal localization of human satellites 2 and 3 by a FISH method using oligonucleotides as a probe. Hum. Genet., 93, 383–388.[Web of Science][Medline]

43 Alexandrov, I.A., Mitkevich, S.P. and Yurov, Y.B. (1988) The phylogeny of human chromosome specific alpha satellites. Chromosoma, 96, 443–453.[Web of Science][Medline]

44 Wevrick, R., Willard, V.P. and Willard, H.F. (1992) Structure of DNA near long tandem arrays of alpha satellite DNA at the centromere of human chromosome 7. Genomics, 14, 912–923.[Web of Science][Medline]

45 Dunnick, W., Hertz, G.Z., Scappino, L. and Gritzmacher, C. (1993) DNA sequences at immunoglobulin switch region recombination sites. Nucleic Acids Res., 21, 365–372.[Abstract/Free Full Text]

46 VanGeel, M., Heather, L.J., Lyle, R., Hewitt, J.E., Frants, R.R. and deJong, P.J. (1999) The FSHD region on human chromosome 4q35 contains potential coding regions among pseudogenes and a high density of repeat elements. Genomics, 61, 55–65.[Web of Science][Medline]

47 Xu, Y., Mural, R.J. and Uberbacher, E.C. (1997) Inferring gene structures in genomic sequences using pattern recognition and expressed sequence tags. Ismb, 5, 344–353.

48 Loftus, B.J., Kim, U.J., Sneddon, V.P., Kalush, F., Brandon, R., Fuhrmann, J., Mason, T., Crosby, M.L., Barnstead, M., Cronin, L. et al. (1999) Genome duplications and other features in 12 Mb of DNA sequence from human chromosome 16p and 16q. Genomics, 60, 295–308.[Web of Science][Medline]

49 Roussea-Merck, M.F., Tunnacliffe, A., Berger, R., Ponder, B.A.J. and Thiesen, H.-J. (1992) A cluster of expressed zinc finger protein genes in the pericentromeric region of human chromosome 10. Genomics, 13, 845–848.[Web of Science][Medline]

50 Nagase, T., Seki, N., Ishikawa, K., Tanaka, A. and Nomura, N. (1996) Prediction of the coding sequences of unidentified human genes. V. The coding sequences of 40 new genes (KIAA0161–KIAA0200) deduced by analysis of cDNA clones from human cell line KG-1. DNA Res., 3, 17–24.[Abstract]

51 Takahashi, M., Buma, Y., Iwamoto, T., Inaguma, Y., Ikeda, H. and Hiai, H. (1998) Cloning and expression of the ret proto-oncogene cDNA with an amino-terminal signal. Oncogene, 3, 571–578.

52 Haaf, T. and Ward, D.C. (1994) Structural analysis of alpha-satellite DNA and centromere proteins using extended chromatin and chromosomes. Hum. Mol. Genet., 3, 697–709.[Abstract/Free Full Text]

53 Johnson, C.A., O’Neill, L.P., Mitchell, A. and Turner, B.M. (1998) Distinctive patterns of histone H4 acetylation are associated with defines sequence elements within both heterochromatic and euchromatic regions of the human genome. Nucleic Acids Res., 26, 994–1001.[Abstract/Free Full Text]

54 van der Ploeg, L.H., Gottesdiener, K. and Lee, M.G. (1992) Antigenic variation in African trypanosomes. Trends Genet., 8, 452–457.[Web of Science][Medline]

55 Warburton, P.E. and Willard, H.F. (1997) Evolution of centromeric alpha satellite DNA: molecular organisation within and between human and primate chromosomes. In Jackson, M., Strachan, T. and Dover, G. (eds), Human Genome Evolution. BIOS Scientific, Oxford, UK, pp. 121–145.

56 Verma, R.S., Luke, S., Brennan, J.P., Mathews, T., Conte, R.A. and Macera, M.J. (1993) Molecular topography of the secondary constriction region (qh) of human chromosome 9 with an unusual euchromatic band. Am. J. Hum. Genet., 52, 981–986.[Web of Science][Medline]

57 Dernberg, A.F., Sedat, J.W. and Hawley, R.S. (1996) Direct evidence of a role for heterochromatin in meiotic chromosome segregation. Cell, 86, 135–146.[Web of Science][Medline]

58 Csink, A.K. and Henikoff, S. (1998) Something from nothing: the evolution and utility of satellite repeats. Trends Genet., 14, 200–204.[Web of Science][Medline]

59 Tang, Q.-Q. and Lane, M.D. (1999) Activation and centromeric localization of CCAAT/enhancer binding proteins during the mitotic clonal expansion of adipocyte differentiation. Genes Dev., 13, 2231–2241.[Abstract/Free Full Text]

60 Francastel, C., Walters, M.C., Groudine, M. and Martin, D.I.K. (1999) A functional enhancer suppresses silencing of a transgene and prevents its localization close to centromeric heterochromatin. Cell, 99, 259–269.[Web of Science][Medline]

61 Andrulis, E.D., Neiman, A.M., Zappulla, D.C. and Sternglanz, R. (1998) Perinuclear localization of chromatin facilitates transcriptional silencing. Nature, 394, 592–595.[Medline]

62 Kadonaga, J.T. (1998) Eukaryotic transcription: an interlaced network of transcription factors and chromatin-modifying machines. Cell, 92, 307–313.[Web of Science][Medline]

63 Saurin, A.J., Shiels, C., Williamson, J., Satijn, D.P.E., Otte, A.P., Sheer, D. and Freemont, P.S. (1998) The human polycomb group complex associates with pericentromeric heterochromatin to form a novel nuclear domain. J. Cell Biol., 142, 887–898.[Abstract/Free Full Text]

64 Cockell, M. and Gasser, S.M. (1999) Nuclear compartments and gene regulation. Curr. Opin. Genet. Dev., 9, 199–205.[Web of Science][Medline]

65 Brown, K.E., Guest, S.S., Smale, S.T., Hahm, K., Merkenschlager, M. and Fisher, A.G. (1997) Association of transcriptionally silent genes with Ikaros complexes at centromeric heterochromatin. Cell, 91, 845–854.[Web of Science][Medline]

66 Carmena, M. and Gonzales, C. (1995) Transposable elements map in a conserved pattern of distribution extending from beta-heterochromatin to centromeres in Drosophila melanogaster. Chromosoma, 103, 676–684.[Web of Science][Medline]

67 Charlesworth, B., Sniegowski, P. and Stephan, W. (1994) The evolutionary dynamics of repetitive DNA in eukaryotes. Nature, 371, 215–220.[Medline]

68 O’Neill, R.J.W., O’Neill, M.J. and Graves, J.A.M. (1998) Undermethylation associated with retroelement activation and chromosome remodeling in an interspecific mammalian hybrid. Nature, 393, 68–72.[Medline]

69 Williams, G.W., Woollard, P.M. and Hingamp, P. (1998) NIX: a nucleotide identification system at the HGMP-RC (http://www.hgmp. mrc.ac.uk/NIX/ ).

70 Solovyev, V., Salamov, A. and Lawrence, C.B. (1995) Identification of human gene structure using linear discriminant functions and dynamic programming. Ismb, 3, 367–375.

71 Solovyev, V. and Salamov, A. (1997). The Gene-Finder computer tools for analysis of human and model organisms genome sequence. Ismb, 5, 294–302.

72 Murakami, K. and Takagi, T. (1998) Gene recognition by combination of several gene-finding programs. Bioinformatics, 14, 665–675.[Abstract/Free Full Text]

73 Benson, G. (1999) Tandem repeats finder: a program to analyze DNA sequences. Nucleic Acids Res., 27, 573–580.[Abstract/Free Full Text]

74 Altschul, S.F., Gish, W., Miller, W., Myers, E.W. and Lipman, D.J. (1990) Basic local alignment search tool. J. Mol. Biol., 215, 403–410.[Web of Science][Medline]

75 Altschul, S.F., Madden, T.L., Schaffer, A.A., Zhang, J., Zhang, Z., Miller, W. and Lipman, D.J. (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res., 25, 3389–3402.[Abstract/Free Full Text]

76 Tatusova, T.A. and Madden, T.L. (1999) Blast 2 sequences—a new tool for comparing protein and nucleotide sequences. FEMS Microbiol. Lett., 174, 247–250.[Web of Science][Medline]

77 Sonnhammer, E.L.L. and Durbin, R. (1996) A dot-matrix program with dynamic threshold control suited for genomic DNA and protein sequence analysis. Gene, 167, 1–10.

78 Genetics Computer Group (1991) Program Manual for the GCG Package, Version 7. GCG, Madison, WI.

79 Kelsell, D.P., Rooke, L., Warne, D., Bouzyk, M., Cullin, L., Cox, S., West, L., Povey, S. and Spurr, N.K. (1995) Development of a panel of monochromosomal somatic cell hybrids for rapid gene mapping. Ann. Hum. Genet., 59, 233–241.[Web of Science][Medline]

80 Dubois, B.L. and Naylor, S.L. (1993) Characterisation of NIGMS human/rodent somatic cell hybrid mapping panel 2 by PCR. Genomics, 16, 315–319.[Web of Science][Medline]

81 Prosser, J., Frommer, M., Paul, C. and Vincent, P.C. (1986) Sequence relationships of three human satellite DNAs. J. Mol. Biol., 187, 145–155.[Web of Science][Medline]


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Genome ResHome page
C. Grunau, J. Buard, M.-E. Brun, and A. De Sario
Mapping of the juxtacentromeric heterochromatin-euchromatin frontier of human chromosome 21
Genome Res., October 1, 2006; 16(10): 1198 - 1207.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
M. K. Rudd, G. A. Wray, and H. F. Willard
The evolutionary dynamics of {alpha}-satellite
Genome Res., January 1, 2006; 16(1): 88 - 96.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
J. E. Horvath, C. L. Gulden, R. U. Vallente, M. Y. Eichler, M. Ventura, J. D. McPherson, T. A. Graves, R. K. Wilson, S. Schwartz, M. Rocchi, et al.
Punctuated duplication seeding events during the evolution of human chromosome 2p11
Genome Res., July 1, 2005; 15(7): 914 - 927.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
N. Kouprina, M. Mullokandov, I. B. Rogozin, N. K. Collins, G. Solomon, J. Otstot, J. I. Risinger, E. V. Koonin, J. C. Barrett, and V. Larionov
The SPANX gene family of cancer/testis-specific antigens: Rapid evolution and amplification in African great apes and hominids
PNAS, March 2, 2004; 101(9): 3077 - 3082.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
J. E. Horvath, C. L. Gulden, J. A. Bailey, C. Yohn, J. D. Mcpherson, A. Prescott, B. A. Roe, P. J de Jong, M. Ventura, D. Misceo, et al.
Using a Pericentromeric Interspersed Repeat to Recapitulate the Phylogeny and Expansion of Human Centromeric Segmental Duplications
Mol. Biol. Evol., September 1, 2003; 20(9): 1463 - 1479.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
D. P. Locke, R. Segraves, L. Carbone, N. Archidiacono, D. G. Albertson, D. Pinkel, and E. E. Eichler
Large-Scale Variation Among Human and Great Ape Genomes Determined by Array Comparative Genomic Hybridization
Genome Res., March 1, 2003; 13(3): 347 - 357.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
A. Courseaux, F. Richard, J. Grosgeorge, C. Ortola, A. Viale, C. Turc-Carel, B. Dutrillaux, P. Gaudray, and J.-L. Nahon
Segmental Duplications in Euchromatic Regions of Human Chromosome 5: A Source of Evolutionary Instability and Transcriptional Innovation
Genome Res., March 1, 2003; 13(3): 369 - 381.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
J. Guy, T. Hearn, M. Crosier, J. Mudge, L. Viggiano, D. Koczan, H.-J. Thiesen, J. A. Bailey, J. E. Horvath, E. E. Eichler, et al.
Genomic Sequence and Transcriptional Profile of the Boundary Between Pericentromeric Satellites and Genes on Human Chromosome Arm 10p
Genome Res., February 1, 2003; 13(2): 159 - 172.
[Abstract] [Full Text] [PDF]


Home page
Cold Spring Harb Symp Quant BiolHome page
J.A. BAILEY and E.E. EICHLER
Genome-wide Detection and Analysis of Recent Segmental Duplications within Mammalian Organisms
Cold Spring Harb Symp Quant Biol, January 1, 2003; 68(0): 115 - 124.
[Abstract] [PDF]


Home page
Genome ResHome page
J. W. Thomas, M. G. Schueler, T. J. Summers, R. W. Blakesley, J. C. McDowell, P. J. Thomas, J. R. Idol, V. V.B. Maduro, S.-Q. Lee-Lin, J. W. Touchman, et al.
Pericentromeric Duplications in the Laboratory Mouse
Genome Res., January 1, 2003; 13(1): 55 - 63.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
E. E. Eichler and P. J. DeJong
Biomedical Applications and Studies of Molecular Evolution: A Proposal for a Primate Genomic Library Resource
Genome Res., May 1, 2002; 12(5): 673 - 678.
[Abstract] [Full Text] [PDF]


Home page
QJMHome page
C. Berry
Variety
QJM, April 1, 2002; 95(4): 259 - 260.
[Full Text] [PDF]


Home page
Genome ResHome page
M. Crosier, L. Viggiano, J. Guy, D. Misceo, R. Stones, W. Wei, T. Hearn, M. Ventura, N. Archidiacono, M. Rocchi, et al.
Human Paralogs of KIAA0187 Were Created through Independent Pericentromeric-Directed and Chromosome-Specific Duplication Mechanisms
Genome Res., January 1, 2002; 12(1): 67 - 80.
[Abstract] [Full Text] [PDF]


Home page
J HeredHome page
E. E. Eichler, M. E. Johnson, C. Alkan, E. Tuzun, C. Sahinalp, D. Misceo, N. Archidiacono, and M. Rocchi
Divergent Origins and Concerted Expansion of Two Segmental Duplications on Chromosome 16
J. Hered., November 1, 2001; 92(6): 462 - 468.
[Abstract] [Full Text] [PDF]


Home page
Hum Mol GenetHome page
J. E. Horvath, J. A. Bailey, D. P. Locke, and E. E. Eichler
Lessons from the human genome: transitions between euchromatin and heterochromatin
Hum. Mol. Genet., October 1, 2001; 10(20): 2215 - 2223.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
L. H. Wong and K. H. A. Choo
Centromere on the Move
Genome Res., April 1, 2001; 11(4): 513 - 516.
[Full Text]


This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (40)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Guy, J.
Right arrow Articles by Jackson, M.S.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Guy, J.
Right arrow Articles by Jackson, M.S.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?