Skip Navigation

This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (52)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Sutherland, H. G.E.
Right arrow Articles by Bickmore, W. A.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Sutherland, H. G.E.
Right arrow Articles by Bickmore, W. A.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Human Molecular Genetics, 2001, Vol. 10, No. 18 1995-2011
© 2001 Oxford University Press

Large-scale identification of mammalian proteins localized to nuclear sub-compartments

Heidi G.E. Sutherland, Gail K. Mumford, Kathryn Newton, Lisa V. Ford, Rachel Farrall, Graham Dellaire, Javier F. Cáceres and Wendy A. Bickmore+

MRC Human Genetics Unit, Crewe Road, Edinburgh EH4 2XU, UK

Received May 15, 2001; Revised and Accepted June 26, 2001.


    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 RESULTS
 DISCUSSION
 MATERIALS AND METHODS
 REFERENCES
 
Many nuclear components participating in related pathways appear concentrated in specific areas of the mammalian nucleus. The importance of this organization is attested to by the dysfunction that correlates with mis-localization of nuclear proteins in human disease and cancer. Determining the sub-nuclear localization of proteins is therefore important for understanding genome regulation and function, and it also provides clues to function for novel proteins. However, the complexity of proteins in the mammalian nucleus is too large to tackle this on a protein by protein basis. Large-scale approaches to determining protein function and sub-cellular localization are required. We have used a visual gene trap screen to identify more than 100 proteins, many of which are normal, located within compartments of the mouse nucleus. The most common discrete localizations detected are at the nucleolus and the splicing speckles and on chromosomes. Proteins at the nuclear periphery, or in other nuclear foci, have also been identified. Several of the proteins have been implicated in human disease or cancer, e.g. ATRX, HMGI-C, NBS1 and EWS, and the gene-trapped proteins provide a route into further understanding their function. We find that sequence motifs are often shared amongst proteins co-localized within the same sub-nuclear compartment. Conversely, some generally abundant motifs are lacking from the proteins concentrated in specific areas of the nucleus. This suggests that we may be able to predict sub-nuclear localization for proteins in databases based on their sequence.


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 RESULTS
 DISCUSSION
 MATERIALS AND METHODS
 REFERENCES
 
The mammalian nucleus is not homogeneous; rather it is organized into domains associated with different facets of nuclear function, and so proteins in common pathways often colocalize into specific areas of the nucleus (13). The importance of this organization is revealed by the mis-localization of nuclear proteins in human genetic disease (4,5), in cancers (6) and in virally infected cells (7).

The full complexity of the mammalian nucleus is yet to be revealed. Many abundant nuclear proteins have been detected and identified biochemically. Developments in protein microcharacterization and mass spectrometry have allowed the identification of components of large nuclear complexes/structures such as the spliceosome (8) or interchromatin granules (ICGs), which are components of the nuclear splicing speckles (9). Mutant genetic screens have identified proteins necessary for genome and nuclear functions but rely on recognition of appropriate mutant phenotypes and are best suited to the most genetically tractable organisms. Visual screens that identify mammalian proteins based only on the criterion of their sub-cellular location provide an alternative approach, and the large size of the mammalian nucleus makes it particularly amenable to a visual dissection. Such screens take one of two general forms. In the first, cDNA or genomic libraries under the control of an exogenous promoter are fused to green fluorescent protein or ß-galactosidase (ßgal) reporters and transfected into cells (1014). However, these fusion proteins are expressed at non-physiological levels, which may alter or mask normal protein localization, and in some cases may be cell lethal. The relevant genes must also be represented in the cDNA libraries used to make the fusion constructs and so these screens may be biased toward abundantly expressed genes. In the second approach, reporter constructs are integrated into endogenous genes so that the resulting fusion proteins are expressed from the normal gene promoter. The tagging of the endogenous gene with the reporter is also mutagenic, thus enabling mutant phenotypes to be investigated (15).

The sequencing of the human and mouse genomes now makes genome-based approaches to understanding nuclear proteins more feasible. In turn, determining sub-nuclear localization can give important clues to function for novel proteins present in databases and currently of unknown function. Previously we have shown that a gene trap screen is capable of identifying chromosomal and nuclear proteins in mouse cells. A ß-galactosidase-neomycin phosphotransferasegeo) reporter gene (lacking its own promoter and ATG) is integrated into an intron of an expressed gene so that it can be spliced in frame into the gene transcript (15). The resulting fusion proteins can recapitulate the sub-nuclear localization of wild-type proteins. However, we are not able to trap genes that contain no introns or proteins in which the nuclear localization signal (NLS) is at the N-terminus. The sequence of the trapped gene is identified by 5' rapid amplification of cDNA ends (5' RACE) from the fusion transcript (Fig. 1) and database searching.



View larger version (19K):
[in this window]
[in a new window]
 
Figure 1. Gene trap screening and sequencing strategy. (A) pGT1-3 contain a ßgeo cassette, preceded by a splice acceptor (SA) and followed by a polyadenylation signal (pA). Productive integrations into introns of expressed genes (fusion gene) downstream of coding exons (stippled boxes) result in splicing of the reporter in frame into the gene transcript. X-Gal staining and immunofluorescence assess the sub-cellular distribution of the resulting fusion proteins. Changes in fusion protein distribution that accompany the cessation of transcription are investigated using ActD. Changes in protein distribution or expression levels that accompany differentiation are investigated by the withdrawal of LIF from ES cell clones and by the addition of RA to both ES and F9 clones. (B) The sequence of trapped genes is established by 5' RACE. Products of the second PCR reaction are sequenced directly. Primers used are those described in (15) and (55). RACE products can also be cloned as BamHI (B)/SacI (S) fragments.

 
Our pilot screen established the efficacy of the approach (15). Refinements to 5' RACE sequencing and the enormous increase in human and mouse gene sequences present in databases suggested that the screen could be scaled up to identify large numbers of nuclear proteins. Here we report the identification and characterization of more than 100 different mouse genes whose protein products locate to nuclear compartments, including the nucleolus, splicing speckles, chromosomes (including heterochromatin), other nuclear foci and the nuclear periphery. More than half of the genes isolated in this screen encode novel proteins, or proteins whose sub-cellular localization was previously unknown. The analysis of conserved domains and motifs reveals common signatures amongst proteins concentrated in the same nuclear compartment, whilst the unexpected occurrence of some functional protein motifs points to new biochemical activities in some compartments.


    RESULTS
 TOP
 ABSTRACT
 INTRODUCTION
 RESULTS
 DISCUSSION
 MATERIALS AND METHODS
 REFERENCES
 
At least one in 10 proteins expressed in embryonic cells target to the nucleus
We conducted large-scale gene trap screens in both F9 cells and E14 mouse embryonic stem (ES) cells and used 5-bromo-4-chloro-3-indoyl ß-D-galactoside (X-Gal) staining to identify those with nuclear fusion proteins. Approximately two-thirds of the 703 neomycin resistant (neoR) F9 clones and 646 ES cell clones had visible X-Gal staining. Colonies with no visible staining are due to the greater sensitivity of G418 selection compared with X-Gal staining and to inactivation of ßgal activity in some fusion proteins (15). Approximately 40% of clones showed staining restricted to the cytoplasm and so were discarded. In addition ~10% of clones had an intense spot of staining close to, but outside of, the periphery of the nucleus. These are most likely clones in which the fusion protein has misfolded and mislocalized (15). Ten percent of neoR clones (17% of clones with visible staining) were found to have either a nuclear distribution of X-Gal stain, or had staining in association with chromosomes or the chromosome segregation machinery in mitotic cells and these were selected for further study. Finally ~4% of clones had staining in the cytoplasm as well as the nucleus. Only those with the most prominent nuclear staining were studied.

The identity of trapped genes and establishing a database of nuclear proteins
Sequences of trapped genes were obtained by 5' RACE and compared with those in databases using the BLAST algorithm. These results together with sub-nuclear distribution determined from immunostaining using a ßgal antibody (see below) are set out in Table 1. Approximately 35% of the genes encode previously identified nuclear proteins of known cellular location. One quarter of the genes encode proteins of known sequence but previously unknown sub-cellular localization and 37% of sequences identified only mouse or human expressed sequence tags (ESTs) or cDNA sequences. The remaining clones match genomic sequences only.


View this table:
[in this window]
[in a new window]
 
Table 1. Nuclear distribution and sequence analysis of proteins from gene traps
 
Predicted protein products were analysed for the presence of NLSs using Psort II (http://psort.nibb.ac.jp/form2.html). The incidence of other conserved protein domains and motifs was assessed using InterProScan (http://www.ebi.ac.uk/interpro/scan.html) and SMART (http://smart.embl-heidelberg.de). The site of gene trap integration relative to the full-length sequence of wild-type proteins is indicated to illustrate how many amino acids/motifs are missing from the trapped protein (Table 1). For ES cell lines this may indicate whether the gene trap is likely to represent a knockout of gene function. To gather together the information from our screens we have created a nuclear protein relational database (NPD) that will be made publicly available.

Fusion proteins locate to different sub-nuclear compartments
To determine the distribution of fusion proteins within the nucleus at high resolution, cells were immunostained using an antibody recognizing ßgal. In about half of the clones the immunofluorescence signal was diffusely spread across the whole nucleus, but excluded from the nucleolus (nucleoplasmic). Figure 2 shows clone ES510, in which a gene encoding a zinc finger protein has been trapped, as an example of this staining pattern.



View larger version (51K):
[in this window]
[in a new window]
 
Figure 2. Fusion proteins can concentrate in distinct nuclear compartments. Examples of trapped protein localized to discrete nuclear sub-compartments detected by immunofluorescence with antibody recognizing ßgal. Domains of heterochromatin are readily apparent as bright foci of DAPI-staining (DNA) in mouse cells. Top row: the KRAB box containing zinc finger protein Znf97 trapped in ES510 cells has a diffuse nucleoplasmic distribution but is excluded from the nucleolus. Second row: a novel protein localizes in the nucleolus of F9/21D6 cells as marked by fibrillarin. Third row: a putative splicing factor hcc1 co-localizes with snRNPs detected by Y12 antibody in WF9/5A7 cells. Fourth row: in contrast to the diffuse localization of the fusion protein in ES510 cells (top row), the KRAB box containing protein trapped in ES113 cells co-localizes with chromatin. Fifth row: a known nuclear pore complex component co-localizes with lamin B at the nuclear periphery of F9/17A4 cells, even though only 111 of the 1448 amino acids of the wild-type protein are retained by the gene trap (Table 1, section V). Bottom row: the Bard1 fusion protein trapped in F9/16A3 cells accumulates in foci that may correspond with BRCA1 foci. Scale bar, 5 µm.

 
In other clones the fusion proteins concentrated in known nuclear compartments. Amongst the most common of these (10% of clones) was the nucleolus—revealed by staining with an antibody that recognizes fibrillarin. F9/21D6 in Figure 2 is a novel protein that concentrates in the nucleolus around the dense fibrillar centres where fibrillarin is found.

Concentration of trapped proteins in multiple nuclear speckles was also common. The coincidence of some of these (10% of clones) with sites of splicing factor accumulation (splicing speckles) was confirmed by co-immunofluorescence with Y12 antibodies recognizing snRNPs (e.g. clone F9/5A7, in which a putative splicing factor hcc1 has been trapped in Fig. 2; see Fig. 3 for further examples).



View larger version (50K):
[in this window]
[in a new window]
 
Figure 3. Changes in fusion protein localization upon the cessation of transcription. Gene trap proteins that change sub-nuclear distribution when transcription is inhibited with ActD (+ActD) were analysed by immunofluorescence with anti-ßgal antibody in cells counterstained with DAPI. The identity of nuclear compartments was confirmed by co-immunofluorescence with either Y12 antibody recognizing snRNPs or anti-fibrillarin to detect the nucleolus. Top row: the 100 kDa U5 snRNP protein (21) trapped in F9/2B4 cells moves from the splicing speckles (–ActD) into fewer but large foci that are coincident with Y12 upon treatment of cells with ActD (+ActD). Second row: in F9/21B2 cells the TRAP150 protein, previously thought to be associated with transcriptional regulation via nuclear hormone receptors (50), is localized in streaks (arrowed) that only partially overlap the Y12 snRNP speckles in untreated cells (–ActD), but re-locates into foci that are coincident with some of those occupied by snRNPs in ActD treated cells. Third row: the novel protein trapped in F9/12C2 is located in foci that, although coincident with Y12 staining, are more rounded in shape and punctate. In ActD treated F9/12C2 cells the fusion protein locates adjacent to, or only partially overlapping, Sm foci (arrows in +ActD). Fourth row: foci containing gene-trapped Rheb in F9/19A3 cells only partially overlap Sm foci (arrows) in ActD treated cells. Bottom row: Zap, present in unidentified speckles in F9/11C4 cells, partially disperses into the cytoplasm upon ActD treatment, with some protein moving to surround the nucleolar remnants. Scale bar, 5 µm.

 
For another 10% of clones, the sites of ßgal immunostaining were coincident with 4,6-diamidino-2-phenylindole (DAPI) stain. Depending on its intensity, this pattern of staining can be indicative of a fusion protein that is associated with either interphase chromosomes, or more specifically with the heterochromatin that is found in large pericentric blocks in mouse cells and appears as bright DAPI foci (15,16). An example of the latter staining pattern is shown for ES113, a novel Krüppel-associated box (KRAB)-containing protein in Figure 2. KRAB boxes are involved in transcriptional repression through their interaction with KAP-1, and some KRAB-containing proteins have been shown to co-localize with heterochromatin (17).

Four percent of clones showed a concentration of staining at the nuclear periphery, which was confirmed by co-staining with an antibody that recognizes B-type lamins (e.g. F9/17A4, a trap of the nuclear pore complex protein NUP153 in Figure 2).

Trapped proteins located in other nuclear bodies, foci or fine speckles were also identified. Some of these (e.g. the novel proteins trapped in ES32 and F9/19B6) concentrate in patterns different to those described for other known nuclear bodies, e.g. Cajal bodies, and so may represent novel sub-nuclear domains. The foci containing traps of Bard1 (F9/16A3 in Figure 2) and Nbs1 (F9/29A6), both of which contain BRCT domains (Table 1 section IV), might correspond to BRCA1 foci (18).

Some proteins were found not in the nucleus per se but associated with the chromosome or cell division apparatus, e.g. centromeres in F9/20D4 cells and the mitotic spindle in F9/11D4 cells (traps of CenP-E and clathrin heavy chain, respectively) (Fig. 4).



View larger version (61K):
[in this window]
[in a new window]
 
Figure 4. Cell cycle changes in fusion protein localization. Gene trap proteins that change their distribution during the passage from interphase through to metaphase and anaphase were detected with antibody recognizing ßgal in cells counterstained with DAPI. Top row: some of the novel nucleolar protein trapped in ES74 cells stays associated with mitotic chromosomes and coats segregating chromosomes at anaphase. Second row: the trapped version of CenP-E in F9/20D4 cells is cytoplasmic during interphase, associates with the centromeres of metaphase chromosomes, then remains behind at the spindle mid-zone from anaphase to telophase (27). Third row: the co-localization of trapped Hmgi-c with DAPI counterstain in ES446 cells suggests that the protein is associated with both euchromatin and heterochromatin in interphase cells. However, during mitosis the protein appears to concentrate specifically at the pericentric heterochromatin on segregating chromosomes (separate DAPI and ßgal images shown for clarity). Bottom row: the trapped version of clathrin heavy chain in F9/11D4 cells is predominantly perinuclear. However, during mitosis the protein co-localizes with tubulin on the spindle. Scale bars, 5 µm.

 
Eight of the fusion proteins, for which the subcellular localization of the endogenous protein was known, have completely mislocalized. These show a nuclear diffuse staining pattern and are listed in section VIII at the end of Table 1. Some are known components of nuclear sub-compartments, e.g. fibrillarin trapped in ES403 should show nucleolar staining, and can therefore provide information on motifs required for correct targeting to nuclear compartments. The mislocalized proteins are usually missing a large proportion of the endogenous protein after the gene trap and probably enter the nucleus by unmasking a cryptic NLS in the remainder of the protein. These are very likely to be knockouts of gene function, and some of these genes are known to be mutated in human syndromes, e.g. the thiamine transporter in thiamine-responsive megaloblastic anaemia (TRMA) with diabetes and deafness syndrome and the 70 kDa peroxisomal membrane protein in Zellweger syndrome (ES100 and F9/4D12 in section VIII of Table 1). Undoubtedly, some of the novel proteins identified in this screen will also be mislocalized relative to the wild-type protein but based on the proportion of the known proteins mislocalized this is likely to be a minority.

Dynamic changes in protein localization when transcription is inhibited
In vivo, many proteins move rapidly about the nucleus, and their apparent concentration into nuclear compartments reflects their steady-state association/disassociation with other nuclear proteins/complexes (19). Perturbing this equilibrium changes the apparent localization of such proteins. Hence, many proteins appear to alter their sub-cellular distribution in response to changes in transcription, during the cell cycle or accompanying differentiation. Analysing fusion protein dynamics can point toward the biological pathways in which the endogenous proteins function.

Many proteins involved in pre-mRNA processing concentrate in 20 to 40 nuclear foci or ‘speckles’ that correspond with ICGs and perichromatin fibrils detected by electron microscopy (3). Splicing is interrupted when transcription is blocked, and some splicing proteins [such as the serine/arginine-rich (SR) protein SC35 and ‘Sm’ snRNPs] then appear to concentrate into fewer, but larger, foci within the nucleus (9,20). We found that the trapped proteins from lines F9/2B4, WF9/5A7, F9/19C3 and F9/29D5 show the same behaviour as ‘Sm’ snRNPs in actinomycin D (ActD) treated cells (Fig. 3 and section II of Table 1). The trapped protein in F9/2B4 is the 100 kDa U5 snRNP (21) and the U5 snRNA has a similar behaviour (20). WF9/5A7 (Fig. 2) is the homologue of hCC1, a protein known to co-localize with human splicing factors (22). F9/29D5, a trap of plenty-of-prolines-101, was also identified amongst human proteins purified from ICGs (9).

Examination of gene trap proteins that are coincident with splicing components reveals additional complexity and heterogeneity in the morphology of the splicing compartment. The protein trapped in F9/21B2 cells is the homologue of human TRAP150 (Table 1 section II) and often appeared in long streaks in nuclei (Fig. 3, arrows), but moved into Sm snRNP-containing foci after treatment with ActD. Foci in F9/12C2 cells, although coincident with Sm antigens, are more rounded in shape and punctate. These foci enlarge and become fewer in number upon ActD treatment but they only partially overlap, and/or are adjacent to, Sm foci detected by Y12 staining (Fig. 3, arrows). The protein trapped in F9/12C2 is novel and contains an RNA recognition motif (RRM) (Table 1, section II). Similarly, the foci in F9/19A3, a trap of Rheb—a RAS-like GTPase, only partially overlap Sm foci in ActD treated cells (Fig. 3, arrows). The protein of unknown function trapped in ESKN60 cells also changed from a speckled pattern to engorged foci in response to ActD (Table 1, section IV).

Some nuclear proteins, including hnRNP A1 and a subset of SR proteins, continually shuttle in and out of the nucleus because they accompany the nuclear export of processed mRNAs. When transcription ceases these proteins accumulate in the cytoplasm (23,24). Other proteins such as U2AF, the U1 snRNP and p80 coilin form ring-like structures around the nucleolus when transcription is inhibited (20,25). We detected nucleo-cytoplasmic shuttling of Zap, a protein of unknown function trapped in F9/11C4 (Table 1, section IV), but a fraction of ZAP fusion protein also remains around the nucleolar remnants of ActD treated cells (Fig. 3). Fusion protein movements in response to ActD were also detected in other lines. In F9/21D6, a novel nucleolar protein (Table 1, section I) shows perinucleolar staining in ActD treated cells, the speckles seen in F9/19B6 cells disperse in treated cells, and the foci in ES32 cells form ring-like structures in the presence of ActD (Table 1, section IV).

Changes in fusion protein localization related to the cell-cycle or to differentiation
Many nuclear complexes disperse in mitotic cells, but some stay associated with segregating chromosomes. In ES38, the nucleolar protein Ubf had been trapped toward the C-terminus (Table 1, section I). Like the wild-type protein (26), the trapped protein remained associated with chromosomes throughout mitosis, concentrated in paired spots just below the centromeres of a few mouse chromosomes (most likely those carrying active arrays of rRNA genes). Other nucleolar proteins involved in pre-rRNA processing, including fibrillarin, are partially dispersed into the mitotic cytoplasm but with some protein coating the segregating chromosomes. This behaviour was seen for the novel nucleolar proteins that have been trapped in ES74 (Fig. 4), ES448 and F9/21D6 lines (Table 1, section I).

In ES446 cells we have trapped into the mouse gene encoding the high-mobility group protein Hmgi-c. The presence of AT-hook motifs in the protein sequence suggests that this may be a DNA binding protein (Table 1, section III). In interphase nuclei, the protein co-localizes with the DAPI stain suggesting that the protein distributes over both euchromatin and heterochromatin. However, during mitosis we have found that the protein concentrates at the pericentric heterochromatin of metaphase and anaphase chromosomes (Fig. 4).

Some proteins that are not nuclear associate with chromosomes or the chromosome segregation apparatus during mitosis and we also examined these (Table 1, section IV). In F9/20D4 cells the trapped protein CenP-E retains almost its entire C-terminal kinetochore binding domain (27) and recapitulates the dynamics of the normal protein (Fig. 4). We trapped the gene encoding clathrin heavy chain twice and the resulting fusion proteins are interrupted within the distal leg, or after the proximal leg, of the protein in F9/11D4 and F9/25A4 cells, respectively. Neither of these fusion proteins should be able to participate in clathrin trimerization to form triskeleton hubs and the F9/11D4 protein should not bind to clathrin light chains (28). Nevertheless in both cases, the fusion proteins were perinuclear (probably in Golgi-derived coated pits around the centrosome) and at the plasma membrane in interphase cells. We found that both the trapped clathrin fusion proteins, and wild-type clathrin (data not shown), co-localize with tubulin along the length of the mitotic spindle (Fig. 4). Tubulin is a component of coated pits and clathrin co-purifies with microtubules (29). It remains to be determined whether clathrin is present on the mitotic spindle through a passive interaction with tubulin, or whether it plays an active role in spindle function.

We identified proteins whose expression changes accompanying differentiation. In F9/12C4 we trapped the gene encoding Mis5 (a homologue of the human MCM6 replication licensing factor). Nuclear localization of this member of the minichromosome maintenance (MCM) family is dependent on complex formation with other MCMs that have NLSs and the trapped protein retains the zinc finger and ATPase motifs of the wild-type protein (Table 1, section IV). The variable levels of Mis5 protein in nuclear foci reflect cell cycle changes in the level of MCM proteins (30). Treating F9/12C4 cells with retinoic acid (RA) decreased the proportion of cells with abundant staining, consistent with an increase in cells that have exited the cell cycle in response to the induction of differentiation (data not shown). Similarly, the withdrawal of leukaemia inhibitory factor (LIF) and the addition of RA down-regulates the expression of the novel nuclear zinc finger protein trapped in ES205 cells (Table 1, section VII). mRNA levels from this gene are known to be reduced in F9 cells in response to RA-induced differentiation (31). In ES32 cells the novel trapped protein is present in both the cytoplasm and nucleus of undifferentiated cells but becomes concentrated into the nucleus in differentiated cells (Table 1, section IV).


    DISCUSSION
 TOP
 ABSTRACT
 INTRODUCTION
 RESULTS
 DISCUSSION
 MATERIALS AND METHODS
 REFERENCES
 
Novel mammalian proteins in the nucleolus and splicing speckles
If the most frequent patterns of discrete sub-nuclear localization in our screen are from compartments that contain the greatest number of different protein components then, apart from the chromosomes themselves, the nuclear compartments involved in RNA processing (the nucleolus and the splicing speckles) are also complex (Table 1).

The nucleolus is much more than just a site of ribosome biogenesis. It is involved in the biogenesis of small RNAs transcribed by RNA polymerase III, and it may be where some factors involved in cell cycle control and checkpoints locate (32). Nucleolar proteins already known to be mutated in human disease include TCOF in Treacher Collins (4) and ataxin 7 in a form of spinocerebellar ataxia (33). Several proteins located in the nucleolus are involved in cancers (32,34). Here we have identified eight novel nucleolar components (Table 1 section I) and the motifs that they contain suggest roles in RNA processing (see below).

We identified six proteins that are novel to the nuclear compartments occupied by concentrations of splicing factors (Table 1, section II). Components of splicing complexes, assembled in vitro from nuclear extracts of human cells, have been characterized by nanospray mass spectrometry and MALDI (8). In this way 21 known and 19 novel spliceosome-associated proteins were identified. However, only one of these (the U5 snRNP 100 kDa protein) corresponds with a protein co-localizing with splicing factors in our screen. The analysis performed by Neubauer et al. (8) was designed to isolate core-spliceosomal proteins that interact stringently with a pre-mRNA substrate. Therefore it may not have enriched for proteins with a more structural or regulatory role in splicing, or those that are loosely associated with the spliceosome. This may be the type of protein we detect here. Similarly, human proteins present in purified ICGs have been determined by HPLC and ion-trap electrospray mass spectrometry (9). Only one of those reported proteins corresponds with one that we have identified here concentrated in splicing speckles (a novel proline-rich protein in line F9/29D5). Only five of the proteins reported by Mintz et al. (9) are in common with those identified by Neubauer et al. (8). Although the sequences of more than 80 mammalian proteins that concentrate in the nucleus with splicing factors have now been reported in the literature, it is clear that we are far from understanding the full complexity of gene products that locate in this nuclear compartment. The multitude of proteins that appear to participate in, or regulate, splicing may reflect the importance of alternative splicing in generating protein complexity from mammalian genomes with surprisingly few genes.

Proteins associated with chromosomes in interphase and mitotic cells
We identified 10 mammalian proteins that visually co-localize with chromosomes in the nucleus and seven of these remain associated with mitotic chromosomes (Table 1, section III). Many of these proteins appear to be concentrated at the sites of constitutive heterochromatin. At least two of these proteins have a proven link with human genetic disease.

The gene trapped in F9/18D6 cells is the mouse homologue of the gene mutated in ATRX syndrome. Motifs present in ATRX suggest that the protein may be a chromatin-remodelling factor, and so finding the protein concentrated in heterochromatin was surprising. However, the wild-type protein has the same heterochromatic localization in human and mouse cells (16), and ATRX individuals have altered methylation patterns at the sequences that underlie heterochromatin (35). The site of gene trap integration within the ATRX gene, and the co-localization of trapped and wild-type proteins (16), suggests that either the plant homeodomain (PHD) or the coiled-coil region is targeting the protein to heterochromatin.

Another chromatin remodelling factor, ACF, has been trapped in CT146 cells and, like ATRX, is also concentrated at heterochromatin (36). The site of gene trap integration suggests that the WAC domain (36) is responsible for heterochromatin targeting. The WAC domain is named for its presence in William’s syndrome transcription factor (WSTF), ATP-utilizing chromatin assembly and remodelling factor (ACF) and chromatin binding protein 146 (cbp146). A gene within the common deletion region for Williams–Beuren syndrome (36) encodes a very closely related human protein with a WAC motif at a similar position and we predict that this may also be protein concentrated at heterochromatin.

In ES446 cells we trapped the gene encoding Hmgi-c. This protein has been implicated in adipocyte homeostasis since a deficiency of fat tissue is found in mice with targeted disruption of Hmgi-c and this is reponsible for the pygmy phenotype (37). In humans, translocation-induced fusions of HMGI-C are associated with lipomas (38). Fusions to HMGI-C are also found in a variety of other mesenchymal tumours and in chronic lymphocytic leukemia, and HMGI-C is one of the most commonly rearranged genes in human neoplasms (38). The role of HMGI-C in tumorigenesis is unclear. Hmgi-c expression correlates throughout fetal development with high proliferative activity (39). Here we show that the protein localizes to the centric heterochromatin in mitosis (Fig. 4), suggesting that it might have a role in chromosome segregation. Interestingly, the site of gene trap integration in ES446 is into intron 3 of Hmgi-c, after the three AT hooks but before the C-terminal acidic domain (40), equivalent to the translocation breakpoints in human cancers (38).

In CT45 cells we have trapped the mouse homologue of a recently described human gene CECR2, located in the segment of proximal HSA22q that is present in multiple copies in Cat eye syndrome (41). Our trap suggests that this bromodomain-containing protein locates with chromatin in both interphase and metaphase cells.

Motifs shared between proteins in sub-nuclear domains
Although many of the proteins we found are novel and of unknown function, they contain domains or motifs that point to potential biochemical activities or functions (Table 1). By analysing the distribution of motifs/domains amongst the proteins identified here, and amongst proteins of known sub-nuclear location in databases, it became clear that mammalian proteins located in the same nuclear compartment often contain common signatures. Conversely, some generally common protein motifs are rare or absent from the proteins concentrated at some sub-nuclear sites.

Two of the novel nucleolar proteins we trapped contain motifs characteristic of the DEAD/H box family of proteins (Table 1, section I). Many DEAD box proteins are RNA helicases but some may function as RNPases (42). We analysed more than 60 known mammalian nucleolar proteins from the literature and found that the DEAD box helicase was also the most common motif recurring amongst them (Table 2), even though it is only the 47th most common motif amongst human proteins (http://www.ensembl.org/IPtop500.html) and it is rare in the proteins that we trapped in other nuclear compartments (Table 1).


View this table:
[in this window]
[in a new window]
 
Table 2. Protein domains and motifs abundant in proteins that co-localize in the nucleus
 
Five of the proteins that we found concentrated in splicing speckles contain an arginine serine-rich (RS) domain (Table 1). This domain is common amongst proteins known to play a role in the regulation of pre-mRNA splicing (43) and is also abundant in the proteins purified in ICGs (9). However, few of the human proteins reassembled in vitro into spliceosomal complexes (8) appear to contain this domain. The other abundant motif present in all screens to identify splicing factors is the RRM (Table 2), which is also relatively abundant in the human proteome (8,9,43,44).

Three of the trapped proteins that visually co-localize with interphase or mitotic chromosomes contain a PHD finger (Table 1, section III). This C4HC3 zinc finger-like motif has been implicated in protein–protein interaction, and it may be specialized for a role in the chromatin-mediated control of gene expression (45). PHD fingers are not abundant in the human proteome (44) (http://www.ensembl.org/Iptop500.html). Apart from the chromodomain, the PHD finger was the most common recurring motif amongst chromosome-associated proteins in the literature (Table 2), but the only chromodomain protein that we identified was the diffusely located SRG3 component of a SWI/SNF chromatin-remodelling complex (Table 1, section VII). Two of the PHD-containing chromosomal proteins that we found also have more C-terminal bromodomains and the chromosomal protein in CT45 cells also contains a bromodomain (Table 1 Section III). Some bromodomains have been shown to bind to acetylated lysine residues in the N-terminal tails of histone H3 and H4 (46) and the bromodomain is not abundant in the human proteome (Table 2) (http://www.ensembl.org/Iptop500.html). The combination of a PHD finger with a bromodomain appears to be a good indicator of likely chromosomal association for novel proteins in human or mouse cells.

The most common motif we identified amongst the proteins diffusely distributed in the nucleus (Table 1, section VII) was the C2H2 Krüppel-type zinc finger (Table 2). We found only one protein each in the nucleolus, on chromosomes and in unidentified foci that contain this motif. Since this class of zinc finger is the second most abundant motif amongst human proteins (44), the fact that it is not commonly found within proteins located at other discrete sites within the nucleus is highly significant. Given the putative DNA binding ability of this class of zinc fingers it is especially surprising that it is not common amongst proteins with a chromosomal localization. It may be a signature of proteins that can bind DNA, but that do not stably associate with chromatin. One-third of proteins with a C2H2 zinc finger are thought to have an N-terminal KRAB. This vertebrate-specific motif is thought to be involved in mediating transcriptional repression (17). Four of the nuclear-diffuse proteins that we trapped (ES205, F9/30A6, ES510 and ES9) have KRAB boxes and all contain more C-terminal C2H2 zinc fingers (Table 1, section VII). However, two of the trapped proteins that co-localize with interphase chromosomes and heterochromatin, but that do not remain associated with chromosomes at mitosis (ES113, ES261), also have KRAB boxes (Table 1, section III). Complete cDNA sequences are not available in databases for these two genes, so it remains to be determined if they contain C2H2 zinc fingers beyond the KRAB boxes and the sites of gene trap integration. For both ES113 and ES261 (Table 1, section III) fusion protein associated with heterochromatin is seen only in a subset of cells, suggesting that interaction of fusion protein with components of heterochromatin is cell cycle or differentiation dependent. Our data suggest that sequences in addition to the KRAB box may be necessary for targeting to heterochromatin, and that localization to heterochromatin of mouse embryonic cells is only seen for some KRAB box-containing proteins, and only some of the time.

The groups of proteins that we identified in either the chromosomal, nucleolar, splicing speckle or diffuse sub-domains of the nucleus also differ in their physical parameters. The proteins associated with chromosomes were the largest (average molecular weight = 119 kDa), and surprisingly were not especially basic (average pI = 7.6). In contrast, proteins from the two compartments involved in RNA metabolism (nucleolus and slicing speckles), were smaller (73 and 92 kDa on average, respectively) and more basic (pI = 8.3 and 9.3, respectively). Proteins diffusely located across the nucleoplasm had a similar average pI to chromosomal proteins (7.3), but were considerably smaller (81 kDa). We found the same trends amongst proteins described in the literature as belonging to these compartments (data not shown). Different pI values for proteins located at different sub-cellular locations have been reported for the Drosophila proteome, and suggested to reflect different functional requirements of cytoplasmic, nuclear and membrane-associated proteins (47). Our analysis suggests that similar constraints also operate at different sites within the nucleus of mammalian cells.

Proteins targeted to the same sub-nuclear compartment often seem to share common characteristics and signatures. These combinations of motifs are more rare in other compartments. Therefore, it may become possible to predict new constituents of nuclear compartments from databases based solely on the combination of protein motifs that they contain and their physical parameters (48).

Sub-nuclear localization as a guide to protein function in vivo
For some known proteins, the sub-nuclear localization that we determined was not expected, and demands a rethink about the putative functions of such proteins. For example, human homologues of two of our proteins co-localizing with splicing factors have been reported to interact with thyroid hormone receptor or its associated complexes and so were considered to have roles in transcriptional regulation. CT37 (TRIP12) was isolated in a yeast two-hybrid screen with thyroid hormone receptor (49), and the protein trapped in F9/21B2 cells has been purified biochemically as a component (TRAP150) of a thyroid hormone receptor associated complex (50). However, our localization (Fig. 3) (15) suggests that these proteins may interact with splicing factors in the nucleus, and this is supported by the presence of an RS domain, commonly found in splicing regulators, in TRAP150 (Table 1, section II).

Sites of gene trap integration
We have independently trapped a number of genes more than once, suggesting that certain regions of the genome are more susceptible to vector integration. The most common location trapped in both ES cells and F9 EC cells was in the same intron of a putative erythroid differentiation regulator gene on the X/Y pseudoautosomal region (F9/7A1 etc in Table 1, section VII). There may be a chromatin structure in this region of the sex chromosomes in embryonic cells that makes them susceptible to integration/recombination. Other loci trapped multiple times include a KRAB zinc finger protein (ES9 etc), karyopherin ß2 (WF/2C9 and ES140), a novel protein that localizes to splicing speckles (F9/19D5 and ESKN319), p52 transcriptional coactivator (F9/3G5 and ES149), clathrin heavy chain (F9/11D4 and F9/25A4) and a novel protein with similarity to the Drosophila muscleblind protein (F9/9D3 and ES644) (Table 1). Whilst in the latter three examples the vector integrated into different regions of the gene, in the others integration was into the same intron and irrespective of the cell line used. Three gene trap events occurred in equivalent gene regions to human cancer-associated translocation breakpoints, e.g. intron 7 or 8 of the Ewing’s sarcoma gene (ES361 in Table 1, section VII) (51), intron 5 of p52 transcriptional co-activator (F9/3G5 in Table 1, section III) (52) and intron 3 of Hmgi-c (ES446 in Table 1, section III) (38). The properties of these regions that make them susceptible to chromosomal breakage may be conserved between mouse and humans.

Functional genomics is trying to bridge the gap between the number of gene sequences in databases and the number of gene products that have been functionally characterized in any way. One systematic approach to ascribing possible function is to determine the sub-cellular location of proteins. Here we have shown that a visual gene trap screen is a powerful way to identify large numbers of proteins located in sub-compartments of the mammalian nucleus. It is complementary to other approaches, such as biochemical purification of complexes, and to other visual screens that rely on the over-expression of tagged cDNAs (1014). A number of the known proteins that we trapped have a recognized or suspected role in human disease. These include Atrx (F9/18D6), Hmgi-c (ES446), Nibrin (F9/29A6), All-1 (F9/1D1), Pasg (F9/28A5), Wt1 (F9/23A3), Ewsh (ES361) and PRDM10 (ES34). In many cases, the site of gene trap integration suggests that the fusion protein is unlikely to be functional (Table 1). Gene trap cell lines bearing such mutations are available to the research community and may be useful for investigating molecular mechanisms, and the ES lines can be used to study the mutant phenotypes in whole animals.

It is clear from the large number of proteins that we and others have assigned to domains within the nucleus that we are far from understanding the full complexity of nuclear organization. However, emerging patterns of protein motifs married with sub-nuclear protein location promise that bioinformatics may have a prominent role to play in achieving this goal.


    MATERIALS AND METHODS
 TOP
 ABSTRACT
 INTRODUCTION
 RESULTS
 DISCUSSION
 MATERIALS AND METHODS
 REFERENCES
 
Cell culture and transfection
The gene trap screen was performed with pGT1, 2 and 3 as previously described (15), but with the following modifications. The ES cell line E14 (53) was maintained in the presence of LIF and electroporated as described by Tate et al. (15). NeoR E14 colonies were selected with 250 µg/ml Geneticin/G418 (Gibco BRL) for 10 days. F9 embryonic carcinoma cells were selected with 400 µg/ml G418 (17). A subset of F9 clones, prefaced with F9HIS/ in Table 1, were derived using pGT1-3 constructs into which a 6x HIS tag was inserted between the splice acceptor and ßgeo.

All cells were exposed to 5 µg/ml ActD for 2 h to identify fusion proteins that alter their sub-cellular distribution in response to the cessation of transcription (15). To identify fusion proteins that change their level of expression, or their sub-cellular distribution during differentiation, all E14 lines were grown on gelatinized slides both in the presence and absence of LIF for 2–3 days prior to staining. Some cell lines were also stained after a 48 h incubation with 10–6 M RA in differentiating medium for ES cells and 10–7 M RA in normal medium for F9 cells (Fig. 1).

Cytological examination of fusion proteins
NeoR colonies were first screened for ßgal activity with X-Gal. Lines with nuclear X-Gal staining were then analysed by immunofluorescence with antibodies specific for ßgal (15). To co-localize fusion proteins with known components of nuclear compartments immunofluorescence using ßgal-specific antibody was combined with antibodies; D77 recognizing fibrillarin (54), Y12 recognizing Sm antigens in splicing speckles (15) and Lamin B (Santa Cruz) at the nuclear periphery. Slides were counterstained with 0.5 µg/ml DAPI and examined on a Zeiss Axioplan fluorescence microscope fitted with a Chroma #83000 filter set. Images were captured with a Photometrics cooled charged coupled device (CCD) camera and Smartcapture software (Digital Scientific).

Determining the sequence of trapped genes
Total RNA was prepared using Bio/RNA-X-cellTM (Bio/Gene Limited) and 5' RACE products were generated as described previously (15), except that P456 (5'-CCGTGCATCTGCCAGTTTGAGGGGA) [lac3 in (55)] was used to prime cDNA synthesis instead of primer 78 of Tate et al. (15). After dialysis on 0.1 µm nitrocellulose disks (Millipore), 5' RACE products were sequenced directly (Fig. 1) using Big DyeTM terminator cycle sequencing and –40 lacZ primer (USB/Amersham). Databases were searched for sequence matches using the BLAST algorithm (http://www.ncbi.nlm.nih.gov/BLAST/). Domains, repeats or motifs, and other parameters/features of the trapped proteins, were examined using ProtParam (http://expasy.cbr.nrc.ca/cgi-bin/protparam), PSORTII (http://psort.nibb.ac.jp/), InterProScan, which is an integrated search of PROSITE, Pfam and PRINTS databases (http://www.ebi.ac.uk/interpro/scan.html) and the Simple Modular Architecture Research Tool (SMART) (http://smart.embl-heidelberg.de/).

A database of nuclear protein localization
To facilitate data storing and analysis an NPD was created in Microsoft Access 97 (to be described in detail elsewhere). The nuclear protein database stores details about the DNA and protein sequences of trapped genes and related genes, together with protein function and subnuclear localization. Gene Ontology (GO) Project annotations were used to describe function and biological process (56) (http://www.geneontology.org). A world wide web interface is being developed to provide public access to data from this screen. Details about gene and protein sequence, sub-nuclear protein location, and molecular and biological function are linked to supporting references, images and external databases.


    ACKNOWLEDGEMENTS
 
H.G.E.S. was supported by an EMBO fellowship and G.K.M. and L.V.F. are supported by a grant from the James S. McDonnell Foundation. G.D. is supported by a fellowship from MRC Canada. W.A.B. is a Centennial Fellow of the James S. McDonnell Foundation. We thank our colleagues at the MRC Human Genetics Unit for reading and criticism of this manuscript.


    FOOTNOTES
 
+ To whom correspondence should be addressed. Tel: +44 131 332 2472; Fax: +44 131 343 2620; Email:w.bickmore@hgu.mrc.ac.uk Back


    REFERENCES
 TOP
 ABSTRACT
 INTRODUCTION
 RESULTS
 DISCUSSION
 MATERIALS AND METHODS
 REFERENCES
 
1 Bridger, J.M. and Bickmore, W.A. (1998) Putting the genome on the map. Trends Genet., 14, 403–409.[Web of Science][Medline]

2 Lamond, A.I. and Earnshaw, W.C. (1998) Structure and function in the nucleus. Science, 280, 547–553.[Abstract/Free Full Text]

3 Misteli, T. (2000) Cell biology of transcription and pre-mRNA splicing: nuclear architecture meets nuclear function. J. Cell Sci., 113, 1841–1849.[Abstract]

4 Marsh, K.L, Dixon, J. and Dixon, M.J. (1998) Mutations in the Treacher Collins syndrome gene lead to mislocalization of the nucleolar protein treacle. Hum. Mol. Genet., 7, 1795–1800.[Abstract/Free Full Text]

5 Wilson, K.L., Zastrow, M.S. and Lee, K.K. (2001) Lamins and disease: insights into nuclear infrastructure. Cell, 104, 647–650.[Web of Science][Medline]

6 Koken, M.H., Reid, A., Quignon, F., Chelbi-Alix, M.K., Davies, J.M., Kabarowski, J.H., Zhu, J., Dong, S., Chen, S., Chen, Z. et al. (1997) Leukemia-associated retinoic acid receptor alpha fusion partners, PML and PLZF, heterodimerize and colocalize to nuclear bodies. Proc. Natl Acad. Sci. USA, 94, 10255–10260.[Abstract/Free Full Text]

7 Bell, P., Lieberman, P.M. and Maul, G.G. (2000) Lytic but not latent replication of Epstein-Barr virus is associated with PML and induces sequential release of nuclear domain 10 proteins. J. Virol., 74, 11800–11810.[Abstract/Free Full Text]

8 Neubauer, G., King, A., Rappsilber, J., Calvio, C., Watson, M., Ajuh, P., Sleeman, J., Lamond, A. and Mann, M. (1998) Mass spectrometry and EST-database searching allows characterization of the multi-protein spliceosome complex. Nat. Genet., 20, 46–50.[Web of Science][Medline]

9 Mintz, P.J., Patterson, S.D., Neuwald, A.F., Spahr, C.S. and Spector, D.L. (1999) Purification and biochemical characterization of interchromatin granule clusters. EMBO J., 18, 4308–4320.[Web of Science][Medline]

10 Bejarano, L.A. and Gonzalez, C. (1999) Motif trap: a rapid method to clone motifs that can target proteins to defined subcellular localisations. J. Cell Sci., 112, 4207–4211.[Abstract]

11 Fujii, G., Tsuchiya, R., Ezoe, E. and Hirohashi, S. (1999) Analysis of nuclear localization signals using a green fluorescent protein-fusion protein library. Exp. Cell Res., 251, 299–306.[Web of Science][Medline]

12 Rolls, M.M., Stein, P.A., Taylor, S.S., Ha, E., McKeon, F. and Rapoport, T.A. (1999) A visual screen of a GFP-fusion library identifies a new type of nuclear envelope membrane protein. J. Cell Biol., 146, 29–43.[Abstract/Free Full Text]

13 Misawa, K., Nosaka, T., Morita, S., Kaneko, A., Nakahat, T., Asano, S. and Kitamura, T. (2000) A method to identify cDNAs based on localization of green fluorescent protein fusion products. Proc. Natl Acad. Sci. USA, 97, 3062–3066.[Abstract/Free Full Text]

14 Simpson, J.C., Wellenreuther, R., Poustka, A., Pepperkok, R. and Wiemann, S. (2000) Systematic subcellular localization of novel proteins identified by large-scale cDNA sequencing. EMBO Rep., 1, 287–292.[Web of Science][Medline]

15 Tate, P., Lee, M., Tweedie, S., Skarnes, W.C. and Bickmore, W.A. (1998) Capturing novel mouse genes encoding chromosomal and other nuclear proteins. J. Cell Sci., 111, 2575–2585.[Abstract]

16 McDowell, T.L., Gibbons, R.J., Sutherland, H., O’Rourke, D., Bickmore, W.A., Pombo, A., Turley, H., Gatter, K., Picketts, D., Buckle, V.J. et al. (1999) Localisation of a putative transcriptional regulator (ATRX) at pericentric heterochromatin and the tips of acrocentric chromosomes. Proc. Natl Acad. Sci. USA, 96, 13983–13988.[Abstract/Free Full Text]

17 Matsuda, E., Agata, Y., Sugai, M., Katakai, T., Gonda, H. and Shimizu, A. (2001) Targeting of kruppel-associated box-containing zinc finger proteins to centromeric heterochromatin. implication for the gene silencing mechanisms. J. Biol. Chem., 276, 14222–14229.[Abstract/Free Full Text]

18 Wang, Y., Cortez, D., Yazdi, P., Neff, N., Elledge, S.J. and Quin, J. (2000) BASC, a super complex of BRCA1-associated proteins involved in the recognition and repair of aberrant DNA structure. Genes Dev., 14, 927–939.[Abstract/Free Full Text]

19 Phair, R.D. and Misteli, T. (2000) High mobility of proteins in the mammalian cell nucleus. Nature, 404, 604–609.[Medline]

20 Carmo-Fonseca, M., Pepperkok, R., Sproat, B.S., Ansorge, W., Swanson, M.S. and Lamond, A.I. (1991) In vivo detection of snRNP-rich organelles in the nuclei of mammalian cells. EMBO J., 10, 1863–1873.[Web of Science][Medline]

21 Teigelkamp, S., Mundt, C., Achsel, T., Will, C.L. and Lührmann, R. (1997) The human U5 snRNP-specific 100 kD protein is an RS domain-containing, putative RNA helicase with significant homology to the yeast splicing factor Prp28p. RNA, 3, 1313–1326.[Abstract]

22 Imai, H., Chan, E.K., Kiyosawa, K., Fu, X.D. and Tan, E.M. (1993) Novel nuclear autoantigen with splicing factor motifs identified with antibody from hepatocellular carcinoma. J. Clin. Invest., 92, 2419–2426.

23 Cáceres, J.F., Screaton, G.R. and Krainer, A.R. (1998) A specific subset of SR proteins shuttles continuously between the nucleus and the cytoplasm. Genes Dev., 12, 55–66.[Abstract/Free Full Text]

24 Nakielny, S. and Dreyfuss, G. (1999) Transport of proteins and RNAs in and out of the nucleus. Cell, 99, 677–690.[Web of Science][Medline]

25 Raska, I., Ochs, R.L., Andrade, L.E., Chan, E.K., Burlingame, R., Peebles, C., Gruol, D. and Tan, E.M. (1990) Association between the nucleolus and the coiled body. J. Struct. Biol., 104, 120–127.[Web of Science][Medline]

26 Dousset, T., Wang, C., Verheggen, C., Chen, D., Hernandez-Verdun, D. and Huang, S. (2000) Initiation of nucleolar assembly is independent of RNA polymerase I transcription. Mol. Biol. Cell, 11, 2705–2717.[Abstract/Free Full Text]

27 Chan, G.K.T., Schaar, B.T. and Yen, T.J. (1998). Characterization of the kinetochore binding domain of CENP-E reveals interactions with the kinetochore proteins CENP-F and hBUBR. J. Cell Biol., 143, 49–63.[Abstract/Free Full Text]

28 Pishvaee, B. and Payne, G.S. (1998) Clathrin coats – threads laid bare. Cell, 95, 443–446.[Web of Science][Medline]

29 Maro, B., Johnson, M.H., Pickering, S. and Louvard, D. (1985) Changes in the distribution of membranous organelles during mouse early development. J. Embryol. Exp. Morphol., 90, 287–309.[Web of Science][Medline]

30 Kimura, H., Ohtomo, T., Yamaguchi, M., Ishii, A. and Sugimoto, K. (1996) Mouse MCM proteins: complex formation and transportation to the nucleus. Genes Cells, 1, 977–993.[Abstract]

31 Faria, T.N., LaRosa, G.J., Wilen, E., Liao, J. and Gudas, L.J. (1998) Characterization of genes which exhibit reduced expression during the retinoic acid-induced differentiation of F9 teratocarcinoma cells: involvement of cyclin D3 in RA-mediated growth arrest. Mol. Cell. Endocrinol., 143, 155–166.[Web of Science][Medline]

32 Carmo-Fonseca, M., Mendes-Soares, L. and Campos, I. (2000) To be or not to be in the nucleolus. Nat. Cell Biol., 2, E107–E112.[Web of Science][Medline]

33 Kaytor, M.D., Duvick, L.A., Skinner, P.J., Koob, M.D., Ranum, L.P. and Orr, H.T. (1999) Nuclear localization of the spinocerebellar ataxia type 7 protein, ataxin-7. Hum. Mol. Genet., 8, 1657–1664.[Abstract/Free Full Text]

34 Mohaghegh, P. and Hickson, I.D. (2001) DNA helicase deficiencies associated with cancer predisposition and premature ageing disorders. Hum. Mol. Genet., 10, 741–746.[Abstract/Free Full Text]

35 Gibbons, R.J., McDowell, T.L., Raman, S., O’Rourke, D.M., Garrick, D., Ayyub, H. and Higgs, D.R. (2000) Mutations in ATRX, encoding a SWI/SNF-like protein, cause diverse changes in the pattern of DNA methylation. Nat. Genet., 24, 368–371.[Web of Science][Medline]

36 Poot, R.A., Dellaire, G., Hulsmann, B.B., Grimaldi, M.A., Corona, D.F.V., Becker, P.B., Bickmore, W.A. and Varga-Weisz, P.D. (2000) HuCHRAC, a human ISWI chromatin remodelling complex contains hACF1 and two novel histone-fold proteins. EMBO J., 19, 3377–3387.[Web of Science][Medline]

37 Zhou, X., Benson, K.F., Ashar, H.R. and Chada, K. (1995) Mutation responsible for the mouse pygmy phenotype in the developmentally regulated factor HMGI-C. Nature, 376, 771–774.[Medline]

38 Tallini, G. and Dal Cin, P. (1999) HMGI(Y) and HMGI-C dysregulation: a common occurrence in human tumors. Adv. Anat. Pathol., 6, 237–246.[Web of Science][Medline]

39 Hirning-Folz, U., Wilda, M., Rippe, V., Bullerdiek, J. and Hameister, H. (1998) The expression pattern of the Hmgic gene during development. Genes Chromosomes Cancer, 23, 350–357.[Web of Science][Medline]

40 Zhou, X., Benson, K.F., Przybysz, K., Liu, J., Hou, Y., Cherath, L. and Chada, K. (1996) Genomic structure and expression of the murine Hmgi-c gene. Nucleic Acids Res., 24, 4071–4077.[Abstract/Free Full Text]

41 Footz, T.K., Brinkman-Mills, P., Banting, G.S., Maier, S.A., Riazi, M.A., Bridgland, L., Hu, S., Birren, B., Minoshima, S., Shimizu, N. et al. (2001) Analysis of the cat eye syndrome critical region in humans and the region of conserved synteny in mice: a search for candidate gene at or near the human chromosome 22 pericentromere. Genome Res., 11, 1053–1070.[Abstract/Free Full Text]

42 Schwer, B. (2001) A new twist on RNA helicases: DexH/D box proteins as RNPases. Nat. Struct. Biol., 8, 113–116.[Web of Science][Medline]

43 Birney, E., Kumar, S. and Krainer, A.R. (1993) Analysis of the RNA-recognition motif and RS and RGG domains: conservation in metazoan pre-mRNA splicing factors. Nucleic Acids Res., 21, 5803–5816.[Abstract/Free Full Text]

44 International Human Genome Sequencing Consortium. (2001) Initial sequencing and analysis of the human genome. Nature, 409, 860–921.[Medline]

45 Capili, A.D., Schultz, D.C., Rauscher, F.J. and Borden, K.L.B. (2001) Solution structure of the PHD domain from the KAP-1 co-repressor: structural determinants for PHD, RING and LIM zinc-binding domains. EMBO J., 20, 165–177.[Web of Science][Medline]

46 Syntichaki, P., Topalidou, I. and Thireos, G. (2000) The Gcn5 bromodomain co-ordinates nucleosome remodelling. Nature, 404, 414–417.[Medline]

47 Schwartz, R., Ting, C.S. and King, J. (2001) Whole proteome values correlate with subcellular localizations of proteins for organisms within the three domains of life. Genome Res., 11, 703–709.[Abstract/Free Full Text]

48 Eisenhaber, F. and Bork, P. (1998) Wanted: subcellular localization of proteins based on sequence. Trends Cell Biol., 8, 169–170.[Web of Science][Medline]

49 Lee, J.W., Choi, H.S., Gyuris, J., Brent, R. and Moore, D.D. (1995) Two classes of proteins dependent on either the presence or absence of thyroid hormone for interaction with the thyroid hormone receptor. Mol. Endocrinol., 9, 243–254.[Abstract/Free Full Text]

50 Ito, M., Yuan, C.-X., Malik, S., Gu, W., Fondell, J.D., Yamamura, S., Fu, Z.-Y., Zhang, X., Qin, J. and Roeder, R.G. (1999) Identity between TRAP and SMCC complexes indicates novel pathways for the function of nuclear receptors and diverse mammalian activators. Mol. Cell, 3, 361–370.[Web of Science][Medline]

51 Plogastel, B., Zucman, J., Peter, M., Thomas, G. and Delattre, O. (1993) Genomic structure of the EWS gene and its relationship to EWSR1, a site of tumor-associated chromosome translocation. Genomics, 18, 609–615.[Web of Science][Medline]

52 Ahuja, H.G., Hong, J. Aplan, P.D., Tcheurekdjian, L., Forman, S.J. and Slovak, M.L. (2000) t(9;11)(p22;p15) in acute myeloid leukemia results in a fusion between NUP98 and the gene encoding transcriptional coactivators p52 and p75-lens epithelium-derived growth factor (LEDGF). Cancer Res., 60, 6227–6229.[Abstract/Free Full Text]

53 Hooper, M., Hardy, K., Handyside, A., Hunter, S. and Monk, M. (1987) HPRT-deficient (Lesch-Nyan) mouse embryos derived from germline colonization by cultured cells. Nature, 326, 292–295.[Medline]

54 Bridger, J.M., Boyle, S., Kill, I.R. and Bickmore, W.A. (2000). Re-modelling of nuclear architecture in quiescent and senescent human fibroblasts. Curr. Biol., 10, 149–152.[Web of Science][Medline]

55 Menichini, P., Viaggi, S., Gallerani, E., Fronza, G., Ottaggio, L., Comes, A., Ellwart, J.W. and Abbondandolo, A. (1997) A gene trap approach to isolate mammalian genes involved in the cellular response to genotoxic stress. Nucleic Acids Res., 25, 4803–4807.[Abstract/Free Full Text]

56 The Gene Ontology Consortium. (2000) Gene Ontology: tool for the unification of biology. Nat. Genet., 25, 25–28.[Web of Science][Medline]


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
J. Cell Sci.Home page
S. Briers, C. Crawford, W. A. Bickmore, and H. G. Sutherland
KRAB zinc-finger proteins localise to novel KAP1-containing foci that are adjacent to PML nuclear bodies
J. Cell Sci., April 1, 2009; 122(7): 937 - 946.
[Abstract] [Full Text] [PDF]


Home page
FASEB J.Home page
Z. Michailidou, R. N. Carter, E. Marshall, H. G. Sutherland, D. G. Brownstein, E. Owen, K. Cockett, V. Kelly, L. Ramage, E. A. S. Al-Dujaili, et al.
Glucocorticoid receptor haploinsufficiency causes hypertension and attenuates hypothalamic-pituitary-adrenal axis and blood pressure adaptions to high-fat diet
FASEB J, November 1, 2008; 22(11): 3896 - 3907.
[Abstract] [Full Text] [PDF]


Home page
Mol. Cell. Biol.Home page
A. Wagschal, H. G. Sutherland, K. Woodfine, A. Henckel, K. Chebli, R. Schulz, R. J. Oakey, W. A. Bickmore, and R. Feil
G9a Histone Methyltransferase Contributes to Imprinting in the Mouse Placenta
Mol. Cell. Biol., February 1, 2008; 28(3): 1104 - 1113.
[Abstract] [Full Text] [PDF]


Home page
Mol. Cell. Biol.Home page
H. G. Sutherland, K. Newton, D. G. Brownstein, M. C. Holmes, C. Kress, C. A. Semple, and W. A. Bickmore
Disruption of ledgf/psip1 results in perinatal mortality and homeotic skeletal transformations.
Mol. Cell. Biol., October 1, 2006; 26(19): 7201 - 7210.
[Abstract] [Full Text] [PDF]


Home page
J. Cell Sci.Home page
S. J. Royle and L. Lagnado
Trimerisation is important for the function of clathrin at the mitotic spindle
J. Cell Sci., October 1, 2006; 119(19): 4071 - 4078.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
S. K. Elkin, D. Ivanov, M. Ewalt, C. G. Ferguson, S. G. Hyberts, Z.-Y. J. Sun, G. D. Prestwich, J. Yuan, G. Wagner, M. A. Oettinger, et al.
A PHD Finger Motif in the C Terminus of RAG2 Modulates Recombination Activity
J. Biol. Chem., August 5, 2005; 280(31): 28701 - 28710.
[Abstract] [Full Text] [PDF]


Home page
Mol. Cell. Biol.Home page
D. Auboeuf, D. H. Dowhan, M. Dutertre, N. Martin, S. M. Berget, and B. W. O'Malley
A Subset of Nuclear Receptor Coregulators Act as Coupling Proteins during Synthesis and Maturation of RNA Transcripts
Mol. Cell. Biol., July 1, 2005; 25(13): 5307 - 5316.
[Full Text] [PDF]


Home page
Mol. Cell. Biol.Home page
D. Cazalla, K. Newton, and J. F. Caceres
A Novel SR-Related Protein Is Required for the Second Step of Pre-mRNA Splicing
Mol. Cell. Biol., April 15, 2005; 25(8): 2969 - 2980.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
N. Wiwatwattana and A. Kumar
Organelle DB: a cross-species database of protein localization and function
Nucleic Acids Res., January 1, 2005; 33(suppl_1): D598 - D604.
[Abstract] [Full Text] [PDF]


Home page
J. Cell Sci.Home page
A. M. Ishov, O. V. Vladimirova, and G. G. Maul
Heterochromatin and ND10 are cell-cycle regulated and phosphorylation-dependent alternate nuclear sites of the transcription repressor Daxx and SWI/SNF protein ATRX
J. Cell Sci., August 1, 2004; 117(17): 3807 - 3820.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
D. Bannasch, A. Mehrle, K.-H. Glatting, R. Pepperkok, A. Poustka, and S. Wiemann
LIFEdb: a database for functional genomics experiments integrating information from external sources, and serving as a sample tracking system
Nucleic Acids Res., January 1, 2004; 32(90001): D505 - 508.
[Abstract] [Full Text] [PDF]


Home page
Mol. Cell. Biol.Home page
K. Newton, E. Petfalski, D. Tollervey, and J. F. Caceres
Fibrillarin Is Essential for Early Development and Required for Accumulation of an Intron-Encoded Small Nucleolar RNA in the Mouse
Mol. Cell. Biol., December 1, 2003; 23(23): 8519 - 8527.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
G. Dellaire, R. Farrall, and W.A. Bickmore
The Nuclear Protein Database (NPD): sub-nuclear localisation and functional annotation of the nuclear proteome
Nucleic Acids Res., January 1, 2003; 31(1): 328 - 330.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
R. Mott, J. Schultz, P. Bork, and C. P. Ponting
Predicting Protein Cellular Localization Using a Domain Projection Method
Genome Res., August 1, 2002; 12(8): 1168 - 1174.
[Abstract] [Full Text] [PDF]


Home page
Mol. Cell. Biol.Home page
G. Dellaire, E. M. Makarov, JeffJ.M. Cowger, D. Longman, H. G. E. Sutherland, R. Luhrmann, J. Torchia, and W. A. Bickmore
Mammalian PRP4 Kinase Copurifies and Interacts with Components of Both the U5 snRNP and the N-CoR Deacetylase Complexes
Mol. Cell. Biol., July 15, 2002; 22(14): 5141 - 5156.
[Abstract] [Full Text] [PDF]


Home page
J. Cell Sci.Home page
T. Mistelli
Nuclear protein database (NPD): http://npd.hgu.mrc.ac.uk
J. Cell Sci., July 15, 2002; 115(14): 2805 - 2806.
[Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (52)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Sutherland, H. G.E.
Right arrow Articles by Bickmore, W. A.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Sutherland, H. G.E.
Right arrow Articles by Bickmore, W. A.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?