| Human Molecular Genetics | Pages |
Genome-wide mapping of unselected transcripts from extraembryonic tissue of 7.5-day mouse embryos reveals enrichment in the t-complex and under-representation on the X chromosome
Introduction
Results
Construction of a cDNA library and profiles of collected cDNAs
Genetic mapping of newly identified cDNAs
Statistical analysis of the distribution of 7.5 EPC genes on the mouse genome
Expression analyses of mapped genes by RT-PCR
Characterization of genes clustered in the t-complex region
Discussion
Three outcomes of the study
Uneven genomic distribution of 7.5 EPC expressed genes
Do dosage control and implantation requirements drive co-localization?
Materials And Methods
cDNA library and screening
Sequencing and sequence analyses
Gene mapping
Statistical analysis
Gene expression analyses by RT-PCR
Acknowledgements
Abbreviations
References
Genome-wide mapping of unselected transcripts from extraembryonic tissue of 7.5-day mouse embryos reveals enrichment in the t-complex and under-representation on the X chromosome
DDBJ/EMBL/GenBank accession nos 1051017 and 1054202
INTRODUCTION
A recent survey of mouse gene knockouts and spontaneous mutations concludes that implantation and subsequent development of the placenta are critical for the survival of the mammalian embryo in utero (1). In the human, clinicians have observed that about one to two thirds of human embryos die before birth, with the majority of these deaths occurring in the peri-implantation period (2,3). The key to the maternal-fetal interaction is the extraembryonic cells, which are derived from the conceptus but do not participate in formation of the embryo proper (1,4,5). Among these cells, trophoblast giant cells especially play multiple important roles as: (i) the leading edge of embryo invasion of the maternal endometrium; (ii) an immunological barrier protecting the embryo from the maternal immune response; and (iii) an endocrine organ, synthesizing and secreting steroid and peptide hormones (4-6).
Genomic imprinting plays an important role in the formation and function of extraembryonic tissues (7,8). Androgenotes carrying two paternally derived haploid genomes develop a relatively normal extraembryonic trophoblast and yolk sac, whereas the embryo itself develops very poorly (9,10). Complete human hydatidiform moles, for example, are androgenotes (11). In contrast, gynogenotes carrying two maternally derived haploid genomes show relatively normal development of the embryo itself, but the extraembryonic tissues develop poorly (9,10). In another example of parental-specific dosage control, the paternally derived X chromosome is preferentially inactivated in extraembryonic tissues of placental mammals (12), whereas in other somatic tissues X chromosome inactivation occurs randomly (13).
As a first step toward understanding the function of extraembryonic tissues at the implantation site, we employed systematic sequencing and mapping analyses of genes expressed in the upper half (extraembryonic portion) of the 7.5 days post-conception (d.p.c.) mouse embryo. The study reveals many interesting genes, many of them unexpectedly clustered in the mouse genome. Possible implications of this finding for extraembryonic tissue function and genomic imprinting are discussed.
RESULTS
Construction of a cDNA library and profiles of collected cDNAs
We constructed a cDNA library from the upper half of the 7.5 d.p.c. mouse embryo. Although the tissue consists of trophoblast giant cells, the ectoplacental cone (EPC, precursor of trophoblast cells, the largest mass in the sample) and the extraembryonic ectoderm (Fig.
Figure 1. Experimental strategy. (Left) Transverse section through entire uterus showing implanted 7.5 d.p.c. mouse embryo. The section is stained with hematoxylin and eosin. (Right) Schematic representation of the embryo. EPC, ectoplacental cone; Extra. Ectoderm, extraembryonic ectoderm; TGC, trophoblast giant cells. The frequency of individual cDNA clones included in this EST collection is a reflection of the expression level of these genes in the tissue source used for cDNA library construction (17). Table 1 shows the gene expression profiles of 3[prime]-ESTs which were identified as known genes listed according to the frequency with which they were found in this cDNA library. Well-recognized cDNA sequences with characteristic functions in extraembryonic tissue at this developmental stage were recovered at levels appropriate for the extraembryonic tissues: for example, prolactin-like protein A (Prlpa, 0.65% of the total), placental lactogen-I (Pl1, 0.60%), H19 (H19, 0.50%), proliferin (Plf, 0.40%), placental lactogen-I variant (RNU32679, 0.15%), Cea5 (MUSCEA5, 0.15%), connexin31 (MMCX31, 0.10%), endo B cytokeratin (MUSKTCEB, 0.10%), placenta and embryonic expression gene (Pem, 0.10%) and X-inactivation-specific transcript (Xist, 0.05%). Genes were also encountered which had not previously been shown to be expressed in extraembryonic tissues, e.g. adrenomedullin (Adm, 0.35%), Wnt6 (0.10%) and tropomyosin-4 (RNTM4, 0.10%). The presence of these genes at relatively high abundance suggests that they have a previously unrecognized importance in extraembryonic tissue development. Indeed, we have recently demonstrated by in situ hybridization and northern blot analyses that the Adm gene, which encodes a potent hypotensive peptide, is highly expressed in trophoblast giant cells at the implantation site (18). Interestingly, the expression profile also includes a group of genes previously detected as expressed in vascular endothelial cells, e.g. Adm (0.35%) and VE-cadherin (MMVECADH, 0.10%). This suggests some similar properties of extraembryonic trophoblast giant cells and vascular endothelial cells. It is consistent with a recent report that suggests such a similarity based on a comparable switch in integrin expression (19), even though the embryological origins of the two cell types are completely different (5,20). Localization of new genes on the mouse genetic map was performed as described (21). In brief, unique PCR primer pairs were designed from the 3[prime]-untranslated region (UTR) of each novel cDNA. The use of the 3[prime]-UTR of cDNAs as the target for PCR amplification increases the probability of finding sequence polymorphisms between C57BL/6J and Mus spretus (22). A total of 751 PCR primer pairs were synthesized. Approximately 500 of the primer pairs gave a unique PCR product of the expected size from both mouse strain DNAs in our standardized PCR assay, a success rate in line with previous results based on 3[prime]-UTR sequences from other cDNA sequences from GenBank as the source of markers (21). The original gene mapping method used primarily PCR-RFLP (22), but we have recently noted that ~35% of usable primer pairs can detect polymorphism between C57BL/6J and SPRET/Ei (M.spretus) by simple heteroduplex analysis (23). Most of the primer pairs described in this paper were mapped by this heteroduplex method. These primer pairs were used to type an interspecific backcross mouse panel (BSS panel of 94 animals [(C57BL/6JEi × SPRET/Ei) × SPRET/Ei]) from the Jackson Laboratory (24). The data were analyzed using the Map Manager program (25). Table 1.
Genetic mapping of newly identified cDNAs
Identification
Frequency
Identification
Frequency
Mus musculus breast heat shock 73 protein (hsc73) mRNA
16
Homo sapiens mRNA for gCIq-R
2
Mus musculus protein synthesis elongation factor Tu (eEF-Tu)
14
Haemophilus influenzae era, fdhE, holD, lepA, lepB, rimI
2
Rat prolactin-like protein A (rPLP-A) mRNA
13
Homo sapiens splicing factor, arginine/serine-rich 7 (SFRS7)
2
Mouse placental lactogen I (PL-I) mRNA
12
Mus domesticus fetus cerebral cortex mRNA
2
Mus musculus H19 mRNA
10
Mus domesticus mRNA for 14-3-3 [epsis]
2
Mouse ferritin light chain
10
Human capping protein [alpha] mRNA
2
Mouse heat shock protein 86 mRNA
9
Human cytochrome bc-1 complex core protein II mRNA
2
Mouse cytoplasmic [gamma]-actin gene
8
Human cytochrome c1 mRNA
2
Mouse proliferin mRNA
8
Human DNA sequence from cosmid V870H8
2
Mouse (clone M1) GTPase (Ran) mRNA
7
Human fetal brain: promyelogeneous leukocyte mRNA for TPRD
2
Mouse cytoskeletal mRNA for [beta]-actin
7
Human HLA-B-associated transcript 3 (BAT3) mRNA
2
Mouse mRNA for translational controlled 40 kDa polypeptide
7
Human hnRNP A2 protein mRNA
2
Human mRNA for ORF
6
Human mRNA (KIAA0071) for ORF (novel protein)
2
Mus musculus mRNA for testis-specific thymosin [beta]-10
6
Human mRNA for 19 kDa protein of signal recognition particle
2
Mouse 84 kDa heat shock protein mRNA
6
Human mRNA for KIAA0183 protein
2
Mouse glyceraldehyde-3-phosphate dehydrogenase mRNA
6
Human mRNA for ORF
2
Mouse nucleolin gene
6
Mus musculus gene for [alpha]-catenin
2
Rat major [alpha]-globin mRNA
6
Mus musculus AP-2.2 gene
2
Rat mitochondrial ATP synthase [beta] subunit mRNA
6
Mus musculus bgbp gene for [beta]-galactoside binding protein
2
Rat mRNA for adrenomedullin precursor
6
Mus musculus DNA for [alpha] globin gene and flanking regions
2
Mus musculus 45S pre-rRNA gene
5
Mus musculus gene for haem oxygenase (exon 5)
2
Mus musculus mRNA for ribosomal protein S3
5
Mus musculus mitochondrial 12S rRNA
2
Mus musculus mRNA for tropomyosin 5 (3[prime]-UTR)
5
Mus musculus mitochondrial malate dehydrogenase gene
2
Mouse 18S rRNA gene 5[prime]-end
5
Mus musculus mRNA for connexin31
2
Mouse [alpha]-tubulin gene M-[alpha]-2, 3[prime]-end
5
Mus musculus mRNA for L14 lectin
2
Mouse metallothionein II (MT-II) gene
5
Mus musculus mRNA for Rab7 protein
2
Mouse mRNA for elongation factor 1-[alpha] (EF 1-[alpha])
5
Mus musculus mRNA for ribosomal protein L37a
2
Mouse mRNA for ribosomal protein S6
5
Mus musculus mRNA for ribosomal protein L5, 3[prime]-end
2
Mouse nucleolar protein N038 mRNA
5
Mus musculus mRNA for secretin
2
Mus musculus calmodulin synthesis (CaM) cDNA
5
Mus musculus mRNA TNZ2 for p68 RNA helicase
2
Mus domesticus adult brain mRNA
4
Mus musculus NRF1 mRNA
2
Human initiation factor eIF-5A gene
4
Mus musculus rpS6 gene
2
Human mRNA (KIAA0045) for ORF
4
Mus musculus VE-cadherin gene ([lambda]5 clone)
2
Mus musculus mRNA for poly(A)-binding protein
4
Mouse argininosuccinate synthetase (Ass) mRNA
2
Mus musculus mRNA for TAX-responsive element-binding protein
4
Mouse calmodulin (Cam I) mRNA
2
Mouse A-X actin mRNA
4
Mouse cyclophilin mRNA
2
Mouse catalase mRNA
4
Mouse elongation factor 2 (ef-2) mRNA, 3[prime]-end
2
Mouse heat shock protein hsp84 mRNA
4
Mouse embryonal carcinoma cell mRNA for 3-hydroxyacyl CoA
2
Mouse mRNA for acidic ribosomal phosophoprotein PO
4
Mouse endo B cytokeratin mRNA
2
Mouse mRNA for G protein [beta] subunit homolog
4
Mouse gelsolin gene
2
Mouse mRNA for nuclear pore-targeting complex component
4
Mouse insulinoma (rig) mRNA
2
Mouse ribosomal protein S4 (Rps4) mRNA
4
Mouse integrin [beta]4 subunit mRNA
2
Murine mRNA for J1 protein, yeast ribosomal protein L3
4
Mouse LT lymphotoxin (LT) gene
2
Mus musculus acidic ribosomal phosphoprotein P1 mRNA
4
Mouse mRNA for [alpha]-enolase (2-phospho-d-glycerate)
2
Mus musculus CDC42 mRNA
4
Mouse mRNA for apg-2
2
Mus musculus ERCC2 gene
4
Mouse mRNA for CArG-binding factor-A
2
Mus musculus fat facets homolog (Fam) mRNA
4
Mouse mRNA for cysteine-rich glycoprotein SPARC
2
Rattus norvegicus mRNA for ribosomal protein L4
4
Mouse mRNA for monoclonal non-specific suppressor factor
2
Rattus norvegicus mRNA for ribosomal protein L41
4
Mouse mRNA for pyrubate kinase M
2
Rattus rattus mRNA for ribosomal protein L11
4
Mouse placenta and embryonic expression gene (pem) mRNA
2
Rattus rattus mRNA for ribosomal protein L23a
4
Mouse ribosomal protein L19
2
Rat mRNA for ribosomal protein L17
4
Mouse surfeit locus (Surf-3) processed pseudogene 2
2
Identification
Frequency
Identification
Frequency
Rat mRNA for ribosomal protein L18a
4
Mouse ubf gene for transcription factor UBF
2
Rat mRNA for ribosomal protein L7a
4
Mouse Wnt-6 mRNA
2
Sequence 5 from US patent 4758511
3
Mus musculus (clone pVZmSin3B) mSin3B mRNA
2
Cricetulus longicaudatus rrs1 gene for arginyl-tRNA synthetase
3
Mus musculus (clone R24) rds gene
2
Homo sapiens mRNA for activin [beta]-C chain
3
Mus musculus (clone: pMAT1) mRNA
2
Homo sapiens HnRNP F protein mRNA
3
Mus musculus [alpha]-globin mRNA
2
Human histone H2A.Z gene, upstream promoter sequence
3
Mus musculus B6D2F1 clone 2C#4 mRNA
2
Leucine aminopeptidase (cattle, kidney, mRNA, 2056 nt)
3
Mus musculus chaperonin mRNA
2
Mus musculus mRNA for calmodulin
3
Mus musculus common cytokine receptor [gamma] chain gene
2
Mus musculus mRNA for mitochondrial gene for subunit I
3
Mus musculus cytokeratin no. 19 mRNA
2
Mus musculus syndecan-1
3
Mus musculus folate-binding protein gene, 5[prime]-end
2
Mus musculus tdgf1 gene
3
Mus musculus Hsp70-related NST-1 (hsr.1) mRNA
2
Mouse 18S, 5.8S, 28S rRNA gene cluster (clone pMEB3)
3
Mus musculus LAF1 transketolase mRNA
2
Mouse [alpha]-tubulin isotype M-[alpha]-6 mRNA
3
Mus musculus laminin [alpha]5 chain (Lama5) mRNA
2
Mouse ATP synthase [alpha] subunit
3
Mus musculus lymphoid-specific transcription factor NFATc3
2
Mouse ERp99 mRNA encoding an endoplasmic reticulum
3
Mus musculus MT transposon-like element, clone MTi7
2
Mouse facilitated glucose transport protein mRNA
3
Mus musculus nuclear-encoded mitochondrial acyltransferase
2
Mouse Ig-related glycoprotein-70 mRNA
3
Mus musculus Paneth cell enhanced expression PCEE mRNA
2
Mouse lactate dehydrogenase A-4 (LDH-A) mRNA
3
Mus musculus phosphoglycerate kinase (Pgk1-ps1) processed
2
Mouse mRNA for cyclophilin (EC 5.2.1.8)
3
Mus musculus retinoblastoma-binding protein (mRbAp46) mRNA
2
Mouse mRNA for prothymosin [alpha]
3
Mus musculus ribosomal protein (Ke-3) mRNA
2
Mouse mRNA for ubiquitin
3
Mus musculus ribosomal protein L9 (musl9) mRNA
2
Mouse replication-dependent histone H2A.1 gene
3
Mus musculus translation initiation factor 4E (eIF-4E)
2
Mus musculus Cea5 mRNA
3
Mus sp. nucleic acid-binding protein mRNA
2
Mus musculus large ribosomal subunit protein mRNA
3
Phosphoglycerate mutase type B subunit (rats, liver, mRNA)
2
Mus musculus ribosomal protein L8 (RPL8) mRNA
3
Rattus norvegicus (Sprague-Dawley) ribosomal protein S23 mRNA
2
Mus musculus ribosomal protein S26 (RPS26) mRNA
3
Rattus norvegicus C15 mRNA
2
Mus musculus RNA helicase mRNA
3
Rattus norvegicus genes for 18S, 5.8S and 28S rRNAs
2
Mus musculus translation initiation factor (Eif4g2) mRNA
3
Rattus norvegicus mRNA for elongation factor 1[alpha]
2
Rattus norvegicus (Sprague-Dawley) ARL5 mRNA for ARF-like protein
3
Rattus norvegicus mRNA for ribosomal protein S9
2
Rattus norvegicus mRNA for ribosomal protein S21
3
Rat alternative brain Ca2+-ATPase mRNA
2
Rat brain Ca2+-ATPase mRNA
3
Rat clathrin heavy chain mRNA
2
Rat mRNA for [beta]-globin
3
Rat liver mRNA for proteasomal ATPase (TBP1)
2
Rat mRNA for ribosomal protein S3
3
Rat mitochondrial proton/phosphate symporter mRNA
2
Rattus norvegicus clone C426 intestinal epithelium
3
Rat mRNA for 14-3-3 protein [gamma]-subtype
2
Rattus norvegicus placental lactogen-I variant mRNA
3
Rat mRNA for ribosomal protein L23
2
42 kDa membrane glycoprotein (human, gastric carcinoma cells)
2
Rat mRNA for ribosomal protein L35
2
[beta]-HKA, H,K-ATPase [beta]-subunit (rat)
2
Rat mRNA for ribosomal protein S10
2
Bovine mitochondrial adenylate kinase (AK2B) mRNA
2
Rat mRNA for ribosomal protein S14
2
Bovine mRNA for actin2
2
Rat mRNA for tissue factor pathway inhibitor
2
Chinese hamster gene for polyubiquitin
2
Rat ribosomal protein L18 mRNA
2
Chinese hamster ovary (CHO) ribosomal protein S14 mRNA
2
Rat TM-4 gene for fibroblast tropomyosin 4
2
Chinese hamster P1 protein mRNA
2
Rattus norvegicus eukaryotic initiation factor 5 (eIF-5)
2
Cricetulus griseus [beta]-tubulin isotype I mRNA
2
Rattus norvegicus gene 33 DNA, exon 4 and 3[prime]-end
2
Cyclotella spp. rRNA large subunit
2
Rattus norvegicus taipoxin-associated calcium-binding protein
2
Differentially expressed homolog of HepG2 cDNA clone
2
Rattus rattus eukaryotic initiation factor (Eif-2) 67 kDa
2
Homo sapiens mitoxantrone-resistance associated mRNA
2
SCP-2, sterol carrier protein-2 {3[prime]-region} (mice, C57BL/6)
2
The locations of 155 new cDNAs are shown in Figure
Figure 2. Genetic map of the Jackson Laboratory BSS cross (data taken from July 1997 release) showing the distribution of 7.5 EPC genes. The map shows all positions that contain loci marked with ticks and lists the specific locations of genes derived from the 7.5 EPC library. The map was anchored by 354 Mit markers and many gene markers through the contributions of many researchers (see http://www.jax.org/resources/documents/cmdata/bkmap/BSS.html ). Unexpectedly, these novel 7.5 EPC genes showed an uneven distribution in the mouse genome (Fig.
Statistical analysis of the distribution of 7.5 EPC genes on the mouse genome
Because of the involvement of the X chromosome in extraembryonic tissue function (see Introduction), we originally hypothesized that many genes would map to the X chromosome. The results were quite the opposite. The probability of appearance of the 7.5 EPC genes on the X chromosome was compared with that of a reference population of all coding loci (2504) on the backcross panel. The reason for using all coding loci as a reference population (rather than the physical length of the map) is that distribution of genes in general is not uniform in the mouse genome (26). This sample should be relevant for both autosomes and the X chromosome and would correct for any chromosome-specific bias, such as a lower density of genes, as reported for the human X chromosome (27). As shown in Table 2, 4.7% (118/2504) of the reference population were localized on the X chromosome, while 1.3% (2/155) of the 7.5 EPC genes were localized on the X chromosome. To test the hypothesis, the odds ratio (OR) and the 95% confidence interval (95% CI) were calculated. Contrary to our original expectation, the X chromosome was significantly under-represented in 7.5 EPC genes (OR 0.264, 95% CI 0.031, 0.994).
Secondly, to investigate those chromosomes (or regions) where 7.5 EPC genes are abundantly distributed compared with the reference population, [chi]2 tests of 2×2 tables were conducted for each chromosome (Table 2). The Bonferroni correction was employed for this exploratory analysis, to avoid multiple testing problems. With the Bonferroni correction, a P-value of <0.0023 (0.05/22) was considered as statistically significant. By this criterion, chromosome 17 showed a higher density of EPC-expressed genes (P = 0.003). The proximal domain of chromosome 17 includes the t-complex region (28,29) and, when further statistical analysis was conducted by splitting chromosome 17 into t-complex (centromere to H2-M2) and non-t-complex regions, statistically significant clustering of 7.5 EPC genes in the t-complex was observed (P = 0.0005). In addition to the novel genes, we found three other previously defined genes (Tcp1, H2-Ke2 and H2-Ke4) in the cDNA collection that also map to this region (Fig. These results clearly indicate clustering of the 7.5 EPC genes on the mouse t-complex. Table 2.
Chromosome
All loci
Wsu loci
P-value
No. of loci
%
No. of loci
%
1
156
6.2
5
3.2
0.128
2
201
8.0
14
9.0
0.656
3
112
4.5
7
4.5
0.988
4
163
6.5
11
7.1
0.774
5
162
6.5
11
7.1
0.759
6
127
5.1
9
5.8
0.687
7
161
6.4
11
7.1
0.743
8
137
5.5
7
4.5
0.610
9
99
4.0
10
6.5
0.128
10
115
4.6
8
5.2
0.744
11
179
7.1
9
5.8
0.527
12
82
3.3
3
1.9
0.483
13
146
5.8
7
4.5
0.496
14
94
3.8
3
1.9
0.373
15
80
3.2
7
4.5
0.370
16
86
3.4
6
3.9
0.773
17
101
4.0
14
9.0
0.003
17 (t-complex)
52
2.1
10
6.5
0.0005
17 (outside t)
49
2.0
4
2.6
0.590
18
84
3.4
5
3.2
0.931
19
94
3.8
6
3.9
0.941
X
118
4.7
2
1.3
0.045
Y
7
0.3
0
0.0
>0.999
Total
2504
100.0
155
100.0
Expression analyses of mapped genes by RT-PCR
Figure 3. (A) Haplotype figure of the Jackson Laboratory BSS proximal chromosome 17 mapping data. Loci are listed in order with the most proximal at the top, with no order implied for co-segregating loci. The filled boxes represent the C57BL/6J allele and the open boxes represent the SPRET/Ei allele. Numbers at the bottom of each column of boxes are the total number of animals with that haplotype. (B) Detailed map of the Jackson Laboratory BSS chromosome 17 proximal region (left) compared with the composite map with important landmarks (right). (Left) Red, Wsu markers; black, anchor markers. (Right) red, mouse mutant loci; black, anchor markers; blue, known imprinted genes. To test whether these mapped genes are expressed specifically in the extraembryonic tissues, we performed semi-quantitative RT-PCR analyses of genes mapped in this study. The same amount of cDNA pools derived from extraembryonic tissues and embryonic tissues were sequentially diluted in 10-fold steps. A cDNA fragment was amplified from each diluted cDNA pool by PCR with gene-specific primer pairs and run on non-denaturing polyacrylamide gels. Figure Figure 4. Example of RT-PCR analyses. Data obtained for 150 cDNA species are summarized in Figure Figure 5. Summary of RT-PCR analyses. Because we have not selected cDNA clones which are uniquely expressed in the extraembryonic tissues, this cDNA collection should and does also include genes expressed in embryonic tissues. For example, 17.3% (26/150) of genes showed the same level of expression both in extraembryonic and embryonic tissues and 2.7% (4/150) showed 10-fold higher expression in the embryonic tissues than in the extraembryonic tissues. Furthermore, 12.7% (19/150) of genes were expressed at a very low level in both embryonic and extraembryonic tissues and yielded no RT-PCR products and 1.3% (2/150) of genes showed only embryonic expression. The lack of detectable expression of these genes in extraembryonic tissues may mean that some cDNA clones with very low expression levels were included by chance. Alternatively, these clones may be the result of slight contamination of the tissue sample used to make the clones with some extraembryonic cells. In summary, about two thirds of the genes showed either unique expression in the extraembryonic tissues or higher expression in extraembryonic than in embryonic tissues. This confirms the expectation that ESTs sampled from the extraembryonic tissues reflect the gene expression patterns in those tissues.
Characterization of genes clustered in the t-complex region
Where are the new `clustered' genes within the t-complex and are they actually related in any way? Figure
Because the mapped 7.5 EPC genes showed no significant sequence similarity among themselves, clustering obviously does not reflect descent from a duplicated ancestral gene (30), as with the Hox (31) and globin (32) gene clusters. To look more carefully for any sequence similarities, the 7.5 EPC cDNA clones that mapped to the t-complex were completely sequenced and no sequence motifs in common were found. Consistent with these results again, northern blot analyses also showed distinctive mRNA sizes and tissue distributions for each of them (data not shown). Taken together, these results suggest an unexpected t-complex clustering of many distinctive genes expressed in extraembryonic tissues.
DISCUSSION
At 7.5 d.p.c., dramatic developmental changes in embryonic tissues are laying down both organ anlage and the complex placental nourishment of the nascent fetus. At first sight it may nevertheless be unexpected that a considerable fraction of the genes recovered in this study were previously undetected and that they tend to cluster at some locations. Here we summarize the major findings and discuss the possible relationship of localization and regulation of cohorts of developmental genes.
Three outcomes of the study
Sequencing and mapping tissue-specific gene cohorts has become a standard and powerful entrée to systematic functional analysis. Here, with 3186 ESTs developed from the 7.5 d.p.c. extraembryonic zone and 155 previously undetected species mapped to 1 cM resolution in the genome, there were three significant outcomes.
(i) A significant fraction of the EST set showed no hits when compared with the 1 400 000 entries in dbEST. This suggests that genes specifically expressed during development are not found in the current large-scale EST projects, which have concentrated on mature tissues. Especially because of the technical and ethical/legal barriers to the study of human embryos, the mouse provides a key source of discovery of many genes uniquely important in the early development of placental mammals. Consequently, specialized EST projects like this one can aid in studies that range from positional cloning of disease genes to `cyberscreening' for genes of interest (33). An indication of the potential usefulness of the 7.5 d.p.c. cDNA collection comes from the recent successful hunt for the Peutz-Jeghers syndrome gene (34), which found the only EST match in the public databases, a protein kinase, among the set described here.
(ii) EST sequence data, now deposited in the NCBI dbEST (15) adds expression profile information for randomly selected cDNA clones from the early embryonic tissue. Information about the relative expression of genes in tissues is thus obtained without northern analyses.
(iii) Systematic mapping of the genes provides a genome-wide view of the distribution of genes expressed in the tissue, with some hints of evolutionary and co-regulatory processes that are discussed further below.
Uneven genomic distribution of 7.5 EPC expressed genes
By RT-PCR analysis, one third of the cDNAs were expressed in both embryonic and extraembryonic compartments and likely include housekeeping genes, but another one third were expressed only in extraembryonic tissue and the rest were expressed predominantly in the extraembryonic zone. These results substantiate both the efficacy of the dissection used to define the EPC and the developmental specificity of gene expression.
Figure
Clustering of coordinately expressed genes is of course common in bacterial operons (35). In mammalian genomes, however, gene clusters, where seen, are on a much larger scale (Mb rather than 5-20 kb) and may involve long-range chromatin interactions (36,37). Until now, two types of clusters have been seen. One, exemplified by the Hox (31) and globin (32) genes, is based on tandem duplication and subsequent divergent evolution of ancient lineages (30). A second, including the major histocompatibility complex (38) and sets of imprinted genes (7,39), involves genes that are coordinately regulated but structurally unrelated. The extent to which such clusters exist in mammals has been unknown, but this is, to our knowledge, the first report of a large cluster of genes whose defining common feature is their co-expression at a fixed time and location. Recently, however, Danieli et al. (40) have reported that genes expressed in skeletal muscle may be enriched on human chromosomes 17, 19, 21 and X. Thus, clustering could be part of the global structure of evolving chromosomes. On the other hand, the genes in the cohort examined here may be a collective exception to ordinary dispersal of most genes, based on the particular requirements of the developing embryo, discussed below.
Do dosage control and implantation requirements drive co-localization?
Possible mechanisms underlying clustering are suggested by features of the regions highlighted in this study and the key developmental functions of the genes involved.
First, the genes in regions like the t-complex are involved in functions like implantation, which are specific to placental mammals. In fact, all homozygous lethal mutants in the t-complex show abnormal phenotypes at or around the stage of implantation (28). An intriguing example is the tw73 mutant (tclw73), which is known to affect implantation (41). Some of the correlatively mapped new genes in Figure
Second, the regions that are enriched or deficient in EPC-expressed genes undergo tight regulation of gene dosage, by imprinting in the t-complex and other autosomal regions (7) and by dosage compensation on the X chromosome. Consistent with this possibility, apparent gene clusters co-localize with imprinted areas that include, in addition to the t-complex, proximal chromosome 2 and proximal and distal segments of chromosome 7 (Fig.
The t-complex also often exhibits haploinsufficiency in mutagenesis experiments (42). If a developmental pathway is particularly sensitive to dosage of the genes involved, this requirement might serve as a possible mechanism for haploinsufficiency (43). For example, nuclear transplantation experiments (9,10), analyses of complete hydatidiform moles (11) and analyses of embryos with uniparental disomy (44) demonstrate that, depending on the genomic segment involved, abnormal dosage of genes in imprinted regions results in overgrowth or deficient growth of extraembryonic tissues. One known imprinted gene, Mash2, specifically controls the growth of the placenta (45). In a formally similar manner, the extraembryonic tissues of mouse embryos with supernumerary X chromosomes develop poorly, resulting in death of the embryo in utero (46,47), and at least one locus on the X chromosome regulates the size of the placenta in interspecific crosses (48).
Thus, stringent control of extraembryonic tissue growth may be so critical for placental mammals that many genes expressed in extraembryonic tissue are localized in evolutionarily advantageous, dosage-sensitive genomic regions. The need for tight dosage control might also apply to growth of the embryo proper, because embryonal overgrowth is seen in mouse and human when the dosage of certain genes in the imprinted regions is increased or when a particular growth suppressing locus on the X chromosome is mutated (for human cases see refs 49,50).
Dosage control of genes included in these genomic regions provides additional options by control of the developmental time at which dosage is adjusted and on parental-specific contributions to the active gene complement. We note that there is an interesting coincidence of timing of the reimposition of imprinting and X-inactivation at this very stage of development of the EPC, which may have implications for the relationship between selective inactivation of genes and the implantation process. For imprinted genes, the experiments on nuclear transplantation (see above) have also shown that paternal alleles and maternal alleles are differentially obligatory for development of the embryo (9-11). Correlatively, X-inactivation is random in the somatic cells of eutherian mammalian embryos (13), but is uniquely paternal in the extraembryonic tissue (12). This pattern of parental contributions has given rise to considerable speculation about conflicts between maternal and paternal genomes during embryonic development (51). It now seems that the divergent roles of the two parental genomes may have evolved as an effective method for the specific control of gene expression in the regions where genes expressed in the cDNA collections reported here are over- or under-represented and that some of these genes may be mediators of this type of regulation.
MATERIALS AND METHODS
cDNA library and screening
Ten embryos at 7.5 d.p.c. were collected from timed pregnant C57BL/6J inbred mice (Jackson Laboratory) according to a standard procedure (52). Embryos were cut in two to separate the upper portion (ectoplacental cone and extraembryonic ectoderm) from the lower portion (embryonic ectoderm). Total cell RNA was extracted using a kit (Stratagene). Construction of the cDNA library was performed using a kit (Life Technologies) with ~50 ng total RNA mixed with 100 ng oligo(dT)-NotI primer (5[prime]-AATTCGCGGCCCGCTTTTTTTTTTTTTTT-3[prime]) for first strand cDNA synthesis in a 20 µl reaction. The cloning vector was pSPORT1 (Life Technologies) and cloning sites were SalI and NotI. Cloning of inserts of average size 1.4 kb was carried out without normalization (53), in order to provide information about the abundance of the individual cDNA species.
Sequencing and sequence analyses
Plasmid DNA from arrayed cDNAs was isolated using the QIAwell-8 System (Qiagen) on 3 ml LB overnight cultures. Eluted DNA (150-200 ng/µl) was used directly in cycle sequencing reactions. Sequencing reactions were performed using Applied Biosystems (ABI) M13 primers and PCR followed by analysis on an ABI 373A Automated DNA Sequencer. Approximately 400 bp of the 3[prime] and 5[prime] ends of cDNA clones were obtained.
Similar sequences were sought in the NCBI non-redundant sequence database (nr) and EST database (dbEST) (15) by the Blastn program and protein sequence database by the Blastx program (14) between 1 January and 31 March 1997. The 3[prime] sequences that did not show significant similarity to any of these entries (score <800) were selected and these clones were also sequenced from the 5[prime] end. These sequences were also compared with the database for any matches.
The putative ID for individual ESTs was determined by the following rules. When the Blastn score was >800, two sequences were identified as matched sequences and the hit was selected as putative identification. If the score was <300, the putative ID of the sequence was called `unknown'. If the score was >300 and <800 and the hit contained any of the words `ribosomal RNA', `repeat', `repetitive', `vector', `rRNA' or `mitochondrial' the hit was used as a putative ID. Otherwise, the putative ID of the sequence was classified as `unknown'. The rules were set rather empirically. For example, a score of 818 is equivalent to P = 2.0e - 60 for 78.00% of matches in 261 bp of sequence and a score of 819 is equivalent to P = 6.9e - 123 for 97.00% of matches in 171 bp of sequence. A score of 753 is equivalent to 2.40e - 118 for 98% identity in 156 bp of sequence and a score of 620 is equivalent to 5.60e - 46 for 100% identity in 124 bp of sequence.
To keep up with the rapidly growing number of ESTs, the sequence similarity searches were performed again more recently (1-22 April 1998). Thirty six percent (710/1989) of the 3[prime]-ESTs were still `unknown' in the annotated sequence database (nr) and 16% (326/1989) of them were `unknown' in the EST database (dbEST).
All the information, including the cDNA and PCR primer sequences, is available through our WWW server (http://mouse2.biosci.wayne.edu/ ). The EST sequence information was deposited in GenBank (2 May 1997) and has been publicly available since then through the NCBI (accession nos 1051017 and 1054202).
Gene mapping
The 751 PCR primer pairs that were designed and synthesized are available to the community through Research Genetics. Localization of genes on the mouse genetic map was performed as described with slight modifications (21). For genotyping, genomic DNAs (25 ng each) were amplified in 10 µl reactions using a Thermal Cycler Model 9600 (Perkin Elmer-Cetus). The reaction contained 10 mM Tris-HCl, pH 8.3, 50 mM KCl, 200 µM each dNTP, 1.5 mM MgCl2, 0.2 µM primer (up), 0.2 µM primer (down) and 0.32 U Taq polymerase (Perkin Elmer-Cetus). The samples were heated at 95°C for 1 min to ensure complete template denaturation. Then they were amplified by 35 or 40 cycles of denaturation (at 94°C for 30 s), annealing (at 58°C for 30 s) and extension (at 72°C for 1 min), followed by 3 min extension after the last cycle. The DNAs were run on 10% non-denaturing polyacrylamide gels (20 × 15 × 0.1 cm) at 250 V for 1 h. Gels were stained with 0.5 mg/ml ethidium bromide for 30 min. Photographs were taken with a UV transilluminator (254 nm). Genotypes were scored by reading the heteroduplex bands.
The raw mapping data for the cross are accessible through the Jackson Laboratory Backcross DNA Panel Mapping Resource WWW server (http://www.jax.org/resources/documents/cmdata/BSS.html ) and the Mouse Genome Database (accession no. MGD-JNUM-J41591).
Statistical analysis
Differences in the probability of appearance on each chromosome between the 7.5 EPC genes and a reference population (all coding loci) were analyzed using the [chi]2 test or Fisher's exact test when appropriate. Number of all coding loci was obtained by subtracting Wsu loci and special project loci focusing on chromosomes 2 and 16 from all coding loci in the Jackson BSS Map as of 21 July 1997. Bonferroni correction was employed for exploratory analyses to avoid the multiple testing problem. We determined the new [alpha] level as 0.05/no. of comparisons (0.05/22 = 0.0023). For reference, 2504 mouse loci were collected from the Jackson Laboratory BSS Interspecific Backcross database. The StatView statistical package (Abacus Concepts, Palo Alto, CA) was used to carry out these analyses.
Gene expression analyses by RT-PCR
The same primer pairs used for gene mapping were used to amplify gene-specific PCR products from each cDNA mixture. RNA samples without reverse transcriptase treatment did not show any PCR products, indicating that there is no genomic DNA contamination in the cDNA mixtures (data not shown). Therefore, all the PCR products are derived from cDNAs and represent the expression level of the genes in the tissues. To separate the embryo proper and extraembryonic tissues including trophoblast giant cells, extraembryonic ectoderm and visceral and parietal endoderm, the 7.5 d.p.c. mouse embryos were carefully dissected out from the decidua, an incision was made in Reichert's membrane with visceral and parietal endoderm on the mesometrial side and the embryo proper was cut out. Total RNAs were extracted from each tissue sample and treated with RNase-free Dnase and reverse transcribed into cDNA.
ACKNOWLEDGEMENTS
We thank David Schlessinger for discussions and critical reading of the manuscript, Susumu Ohno, Joe Nadeau and Karen Artzt for discussion and Michael Boehnke, Morton Brown and Alan Zonderman for invaluable suggestions for statistical analysis of gene clusters. We also thank Carrie James for mouse husbandry, Sudheer Tamula and Sunil Kosuru for computational analyses. H.N. was supported by a post-doctoral fellowship from Uehara Memorial Foundation. H.F. was on leave from Daiichi Pharmaceutical Co. Ltd. The Jackson Laboratory Backcross DNA Panel Mapping Resource is supported by grant HG00941 from NHGRI. This work was mainly supported by grant HD32243 from NICHD to M.S.H.K.
ABBREVIATIONS
d.p.c., days post-conception; EPC, ectoplacental cone; ESTs, expressed sequence tags.
REFERENCES
This article has been cited by other articles:
This page is run by Oxford University Press, Great Clarendon Street, Oxford OX2 6DP, as part of the OUP Journals
Comments and feedback: www-admin{at}oup.co.uk
Last modification: 12 Nov 1998
Copyright©Oxford University Press, 1998.
![]()
CiteULike
Connotea
Del.icio.us What's this?
![]()
![]()

![]()
![]()
![]()
J. S. Gilbert and M. J. Nijland
Sex differences in the developmental origins of hypertension and cardiorenal disease
Am J Physiol Regulatory Integrative Comp Physiol,
December 1, 2008;
295(6):
R1941 - R1952.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
J. G. Ledford, M. Kovarova, and B. H. Koller
Impaired Host Defense in Mice Lacking ONZIN
J. Immunol.,
April 15, 2007;
178(8):
5132 - 5143.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
S. T. Kosak and M. Groudine
Form follows function: the genomic organization of cellular differentiation
Genes & Dev.,
June 15, 2004;
18(12):
1371 - 1384.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
H. Ishikawa, A. Rattigan, R. Fundele, and P. S. Burgoyne
Effects of Sex Chromosome Dosage on Placental Size in Mice
Biol Reprod,
August 1, 2003;
69(2):
483 - 488.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
J. M. Tarrant, J. Groom, D. Metcalf, R. Li, B. Borobokas, M. D. Wright, D. Tarlinton, and L. Robb
The Absence of Tssc6, a Member of the Tetraspanin Superfamily, Does Not Affect Lymphoid Development but Enhances In Vitro T-Cell Proliferative Responses
Mol. Cell. Biol.,
July 15, 2002;
22(14):
5006 - 5018.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
M. Hemberger, J. C. Cross, H.-H. Ropers, H. Lehrach, R. Fundele, and H. Himmelbauer
UniGene cDNA array-based monitoring of transcriptome changes during mouse placental development
PNAS,
November 6, 2001;
98(23):
13126 - 13131.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
J. Schimenti
Monoallelic Gene Expression in Mice: Who? When? How? Why?
Genome Res.,
November 1, 2001;
11(11):
1799 - 1800.
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
T. S. Tanaka, S. A. Jaradat, M. K. Lim, G. J. Kargul, X. Wang, M. J. Grahovac, S. Pantano, Y. Sano, Y. Piao, R. Nagaraja, et al.
Genome-wide expression profiling of mid-gestation placenta and embryo using a 15,000 mouse developmental cDNA microarray
PNAS,
August 1, 2000;
97(16):
9127 - 9132.
[Abstract]
[Full Text]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
G. J. Kargul, R. Nagaraja, T. Shimada, M. J. Grahovac, M. K. Lim, H. Nakashima, P. Waeltz, P. Ma, E. Chen, D. Schlessinger, et al.
Eleven Densely Clustered Genes, Six of them Novel, in 176 kb of Mouse t-complex DNA
Genome Res.,
July 1, 2000;
10(7):
916 - 923.
[Abstract]
[Full Text]
![]()
![]()
![]()

![]()
![]()
![]()
M. Ko, J. Kitchen, X Wang, T. Threat, X Wang, A Hasegawa, T Sun, M. Grahovac, G. Kargul, M. Lim, et al.
Large-scale cDNA analysis reveals phased gene expression patterns during preimplantation mouse development
Development,
January 4, 2000;
127(8):
1737 - 1749.
[Abstract]
[PDF]
![]()
![]()
![]()

![]()
![]()
![]()
T. Wiltshire, M. Pletcher, S. E. Cole, M. Villanueva, B. Birren, J. Lehoczky, K. Dewar, and R. H. Reeves
Perfect Conserved Linkage Across the Entire Mouse Chromosome 10 Region Homologous to Human Chromosome 21
Genome Res.,
December 1, 1999;
9(12):
1214 - 1222.
[Abstract]
[Full Text]
![]()
![]()
![]()

![]()
![]()
![]()
Y. Sano, T. Shimada, H. Nakashima, R. H. Nicholson, J. F. Eliason, T. A. Kocarek, and M. S.H. Ko
Random Monoallelic Expression of Three Genes Clustered within 60 kb of Mouse t Complex Genomic DNA
Genome Res.,
November 1, 2001;
11(11):
1833 - 1841.
[Abstract]
[Full Text]
[PDF]
![]()
This Article ![]()
![]()
Abstract
![]()
FREE Full Text (PDF)
![]()
Alert me when this article is cited
![]()
Alert me if a correction is posted
![]()
Services ![]()
![]()
Email this article to a friend
![]()
Similar articles in this journal
![]()
Similar articles in ISI Web of Science
![]()
Similar articles in PubMed
![]()
Alert me to new issues of the journal
![]()
Add to My Personal Archive
![]()
Download to citation manager
![]()
Search for citing articles in:
ISI Web of Science (53)
![]()
Request Permissions ![]()
Google Scholar ![]()
![]()
Articles by Ko, M. S.
![]()
Articles by Nakashima, H.
![]()
Search for Related Content
![]()
PubMed ![]()
![]()
PubMed Citation
![]()
Articles by Ko, M. S.
![]()
Articles by Nakashima, H.
![]()
Social Bookmarking ![]()
![]()
What's this?