Comparative analysis and genomic structure of the tuberous sclerosis 2 (TSC2) gene in human and pufferfish
Comparative analysis and genomic structure of the tuberous sclerosis 2 (TSC2) gene in human and pufferfishMagitha M. Maheshwar, Richard Sandford1, Mark Nellist+, Jeremy P. Cheadle, Barbara Sgotto1, Mark Vaudin2 and Julian R. Sampson*
Institute of Medical Genetics, University of Wales College of Medicine, Cardiff, CF4 4XN, UK, 1Department of Medicine, Addenbrooke's Hospital, Hills Road, Cambridge CB2 2QQ, UK and 2Department of Genetics and Genome Sequencing Center, Washington University School of Medicine, St Louis, MO 63108, USA
Received September 9, 1995;Revised and Accepted October 27, 1995
Germ-line mutations of the TSC2 tumour suppressor gene have been identified in humans with tuberous sclerosis and in the Eker rat. Tuberin, the human TSC2 gene product, has a small region of homology with rap1GAP and stimulates rap1 GTPase activity in vitro, suggesting that one of its cellular roles is to function as a GTPase activating protein (GAP). We have undertaken a comparative analysis of the TSC2 gene in human and the pufferfish, Fugu rubripes. In addition to the GAP domain, three other regions of the proteins are highly conserved (peptide sequence similarity >80%). These regions are likely to represent further functional domains. To facilitate analysis of mutations within these domains we have determined the genomic structure of the human TSC2 gene. It comprises 41 exons, including exon 31 which was absent from the originally described spliceoform of the human TSC2 transcript and was identified following exon prediction from Fugu genomic sequence. These findings support the proposal of the Fugu genome as a tool for human gene analysis.
In humans, constitutional mutations affecting the TSC2 gene on chromosome 16 are found in individuals with the autosomal dominant trait tuberous sclerosis, MIM 191100 (1 ). Linkage studies in families with tuberous sclerosis (TSC) show that approximately 50% segregate a mutant TSC2 gene, while an unidentified gene which maps to chromosome 9 (designated TSC1) is implicated in the remainder (2 -5 ). With the exception of a recently described contiguous gene syndrome involving deletion of both the TSC2 gene and the adjacent PKD1 gene on chromosome 16 (6 ), phenotypes associated with TSC1 and TSC2 mutations appear indistinguishable. Both are characterized by development of unusual hamartomatous growths affecting many organs, especially the brain, skin, kidneys and heart (7 ). Loss of heterozygosity (LOH) across the TSC1 and TSC2 chromosomal regions in TSC associated hamartomas (8 -10 ) suggests that cell proliferation and differentiation become perturbed when both TSC1 or TSC2 alleles are mutated and that the genes act as tumour suppressor genes serving related functions. The wide range of tissues in which TSC associated hamartomas develop implies a fundamental role for both TSC genes in regulating cell proliferation and differentiation. It is likely that study of the TSC2 gene and its protein product tuberin (and when identified, the TSC1 gene and its product) will provide new insights into these processes.
The rat homologue of the TSC2 gene has recently been characterized (11 ) and a constitutional insertion mutation has been identified in the Eker strain (12 ,13 ), which exhibits a predisposition to renal cell carcinoma, splenic haemangioma and uterine leiomyoma or leiomyosarcoma (14 ). Analogous pathology is only occasionally seen in humans with TSC, suggesting that further studies of TSC2 might also provide insights into species-specific differences in tumorigenesis and/or phenotype-specific mutations.
The human TSC2 gene was identified by positional cloning (1 ). It maps to a gene-rich region of 16p13.3, approximately 2.25 Mb from the telomere and immediately adjacent to the PKD1 gene (15 ). The 5.5 kb transcript spans an estimated 43 kb of genomic sequence and encodes a novel, predicted ~198 kDa protein (tuberin). Fifty-eight amino acids near the carboxy terminus are homologous to the catalytic domain of the GTPase activating protein GAP3 (rap1GAP) (1 ). GAPs are regulators of the GTP binding and hydrolysing activity of the Ras superfamily of proteins that help to regulate cell growth, proliferation and differentiation. Further homologies between tuberin and other proteins have not been identified and alternative approaches will be required to elucidate other functional domains. Strategies which might prove useful in this regard include characterization of mutations at the TSC2 locus in humans and other species and identification of regions of the gene which exhibit a high level of conservation in distantly related species.
The haploid genome of the Japanese pufferfish, Fugu rubripes, comprises ~400 Mb (16 ), compared with ~3000 Mb in man, but is estimated to contain a similar number of genes. The compact nature of the Fugu genome reflects the small relative size of intergenic and intronic sequences and a paucity of repetitive DNA. These characteristics have led to the proposal of Fugu rubripes as an efficient model for vertebrate genome analysis (16 ). We have characterized the Fugu homologue of the TSC2 gene and have undertaken a comparative analysis of TSC2 in human and Fugu. Four regions of coding sequence, including the rap1GAP homologous region, showed a particularly high level of conservation. These may represent critical functional or structural domains toward which further studies, such as mutation analysis might be usefully directed. To facilitate the identification and analysis of potentially revealing mutations, such as missense mutations, we have determined the intron-exon structure of the human TSC2 gene.
Genomic clones containing the Fugu homologue of the TSC2 gene were identified as previously described (Sandford et al. manuscript submitted). Briefly, human TSC2 cDNA clones 4B2 and 4.9 (1 ) were used to screen a Fugu genomic library constructed in lambda 2001 (16 ). Three overlapping lambda clones were identified and fragments of these used to isolate a single cosmid, 295C6, from a gridded library containing 40 000 clones (G. Elgar, unpublished). This clone was also found to contain part of the PKD1 gene homologue and was sequenced in its entirety (Sandford et al. manuscript in preparation). Exon 1 was found to be interrupted by the vector cloning site. An overlapping cosmid, 52G18, was isolated and the sequence of exon 1 completed using specific walking primers.
The intron-exon structure of the Fugu homologue of TSC2 was predicted using the genomic sequence from cosmid 295C6. The BLAST algorithm (17 ) and direct comparison to the human predicted protein sequence after three frame translations with MacVectorTM were used to search for Fugu sequences with homology to the human TSC2 cDNA. Searches using the known consensus sequences for Fugu splice sites (18 ) were then made to predict positions of Fugu intron-exon boundaries, which were also analysed using the Netgene (19 ) and Grail (20 ) programs. Homology between Fugu genomic sequence and human tuberin sequence predicted 41 Fugu exons comprising a transcript of similar size to the previously published human TSC2 transcript (Fig. 1 ). GC content and Grail (20 ) analysis also predicted a further Fugu exon of 90bp (exon 31) for which no homologous sequence was present in the published human transcript (Fig. 2 ). Alternative splicing of exon 31 in Fugu was confirmed by RT-PCR using pooled RNAs.
Sequence data from genomic sub-clones of the cosmids LADS4 and ZDS5 which together span the human TSC2 locus (see Fig. 3 and Methods) were compared with the human TSC2 cDNA. The genomic sequences at points of divergence were compared with the 3' and 5' splice site consensus sequences defined by Shapiro and Senapthy (21 ), identifying 59 intron-exon boundaries. A further 19 boundaries were identified by direct sequencing of PCR products amplified from the genomic TSC2 locus. The 40 exons defined by these approaches accounted for all 5754 nucleotides of the published cDNA sequence. An additional, alternatively spliced exon (exon 31) which was not represented in the original TSC2 transcript was also identified. This exon was predicted by analysis of Fugu genomic sequence (see above). RT-PCR amplification of human lymphoblastoid RNA revealed no evidence for the exon, but 4A2, one of eight human fetal brain cDNA clones spanning this part of the transcript, contained an additional 69 bp of coding sequence homologous to the alternatively spliced exon in Fugu. Genomic PCR products spanning the novel exon were amplified from cosmid ZDS5 and the exon and flanking intronic regions were confirmed by direct sequencing. Thus a total of 41 human TSC2 exons were identified (Table 1 ). A combination of PCR, restriction digestion and Southern hybridization were used to estimate the sizes of 28 introns for which sequence data were incomplete, and to confirm the positions of exons on the genomic map (Fig. 3 ). Partial genomic sequence across the 41 exons and flanking intronic sequence has been deposited with GenBank (accession numbers L48517-L48546).
Table 1 . Intron-exon structure of TSC2 in human and fugu*Human exon 1 contains at least 19 nucleotides upstream of the translation initiation codon.In Fugu, a phase 0 intron, absent in the human gene, separates exons 33a and 33b. Human nucleotide numbers 1-3832 (amnio acids 1-1271) correspond to the previously published transcript (1). Thereafter they differ due to the present of the additional exon (exon 31).
Figure 3.The human TSC2 locus. Top, The positions of cosmid clones LADS4 and ZDS5 are shown above a genomic map indicating restriction sites for EcoRI (E). Part of the genomic region spanned by ZDS5 is duplicated more proximally on the short arm of chromosome 16 (vertical hatches). The genomic distribution and direction of transcription (arrows) of the TSC2 gene and the adjacent genes, OCTS2, OCTS3 and PKD1 are shown below the restriction map. Bottom, genomic distribution of 41 coding exons (vertical bars) of the human TSC2 gene.
The known TSC2 gene spans ~43 kb of genomic DNA in human and only 17.3 kb in Fugu. However, in both cases transcript size (~5.5 kb) including putative translation initiation codons and polyadenylation signals, and intron-exon organization are very similar. The difference in genomic coverage reflects smaller intron size in the Fugu (62-1928 bp in Fugu, and 79 to >4 kb in humans). The only structural divergence noted between the Fugu and human TSC2 genes was the presence of an additional Fugu intron. Optimal alignment indicated that this intron of 163 bp was inserted in phase 0 at amino acid number 1435 of the human transcript (WSASGE--DSRGQP), splitting exon 33, the largest of the human exons, into two Fugu exons denoted 33a and 33b. Otherwise both position and phase of predicted Fugu introns were identical to their directly determined human counterparts (Table 1 ).
After optimal alignment using the CLUSTAL program (22 ), the TSC2 transcripts in Fugu and humans showed 65% sequence homology at nucleotide level. GC content was 60% in man and 54% in Fugu. Optimal alignment at the protein level revealed 60% identity of amino acid residues, with 79% similarity if conservative changes were included. Comparison of the predicted Fugu and human peptide sequences is shown in Figure 4 . The PLOTSIMILARITY program from the GCG package was used to assess variation in similarity across both predicted proteins as a 75 amino acid residue window was moved along the aligned sequences. Four regions of high conservation were identified. These include the GAP-related domain (human residues 1593-1631) and the sequences flanking this, two small regions between residues 750 and 1100, and the N-terminal portion of the molecule.
Figure 4.Similarity plot produced from the alignment of the TSC2 predicted protein sequences. Similarity was determined over a window of 75 residues using the GCG package program, PLOTSIMILARITY. 100% similarity scores 1.5. The scale along the bottom of the plot represents the consensus sequence after alignment and is therefore larger than the individual sequences. The dotted line shows the average similarity.
Analysis of the human TSC2 predicted protein sequence has previously identified hydrophobic regions which could represent membrane spanning domains (1 ). Hydropathy of the Fugu and human predicted proteins was examined using the Kyte and Doolittle method (23 ) and both N-termini were found to be hydrophobic. Alignments of the two sequences were analysed for transmembrane domains using TMAP (24 ) and PredictProtein (25 -27 ), but none were reliably predicted.
Apart from the small region of tuberin showing sequence homology to rap1GAP, no similarities to known proteins have been found (1 ). Extensive searches for structural and functional protein domains were made with the aligned Fugu and human sequences (see Materials and Methods). Evolutionary conservation can increase the power of these analyses, but no similarity to other protein motifs or domains was found.
The comparative analysis we have undertaken in humans and Fugu has highlighted four regions of the TSC2 gene product which have remained highly conserved (>80% at the amino acid level) over some 400 million years of evolution. These include the previously recognized GAP homologous region (1 ) and an area of ~200 amino acid residues which surrounds this. It is likely that this conserved area as a whole contributes to any functional GAP domain. Preliminary in vitro biochemical evidence for (rap1) GAP activity has been demonstrated for tuberin (28 ), but additional cellular roles are expected. In this regard the three remaining regions that are highly conserved between human and Fugu deserve further investigation.
Prediction of intron-exon structure from genomic sequence at the Fugu TSC2 locus helped us to identify a novel, alternatively spliced exon in humans which was absent from the previously reported TSC2 transcript (1 ). The additional exon was identified by the presence of a GC rich genomic region 3' to exon 30 in the Fugu. By characterizing eight human fetal brain cDNA clones we confirmed that the additional exon (exon 31) was also present in some human transcripts. The exon corresponded to the alternatively spliced exon 31 of the recently characterized rat TSC2 gene (11 ). Sequencing of the exon 31 containing splice form showed that 67 of 69 nucleotides were identical to the homologous rat exon. It has already been established that exon 25 shows tissue specific alternative splicing in humans and mouse (29 ) and that it is alternatively spliced in the rat (11 ). Using RT-PCR we showed that exons 25 and 31 are also alternatively spliced in Fugu. Conservation of these splice forms is suggestive of underlying functional correlates which remain to be established.
The predicted genomic structure of the Fugu TSC2 gene was extremely similar to that determined for the human TSC2 gene. The position and phase of each of 41 introns interrupting the human TSC2 transcript were identical in Fugu, the only divergence in organization being the occurrence of an additional intron (phase 0) of 163 bp splitting the coding sequence of human exon 33 into Fugu exons 33a and 33b. Given this level of conservation it is unsurprising that intron position and phase are completely conserved in the recently characterized rat TSC2 gene, which has no intron splitting the homologue of human exon 33 (11 ). For a very few genes, such as the Huntington's disease gene (30 ) and the G6PD gene (31 ) genomic structure have been determined in both Fugu and humans. In these cases complete conservation of intron-exon boundaries has been found throughout protein coding regions. The overall size of the human genome (and other mammalian genomes) is ~7.5-fold larger than that of Fugu, reflecting the larger size of both intergenic and intronic sequences in mammals. However, relative difference in size varies between genes. The TSC2 gene was 2.5-fold larger in humans than Fugu, compared with 7.5-fold for the HD gene (30 ) and 3.75-fold in the case of the G6PD gene (31 ). In contrast, the level of nucleotide sequence conservation within the coding regions of these genes is very similar, at 65% for TSC2, 69% for the HD gene and 71% for the G6PD gene.
It is possible that additional, alternatively spliced protein encoding exons of the TSC2 gene remain to be found. However, we do not predict the existence of such exons from our analysis of genomic sequence in Fugu. It is more likely that one or more leader exons (non-protein encoding) exist 5' to the ATG codon identified as the probable translation initiation codon in man (1 ), mouse (Olsson et al. manuscript submitted), rat (11 ) and Fugu. A sequence corresponding to the splice site acceptor consensus lies ~30 bp upstream from this codon at the human TSC2 locus. As yet, the 5'UTR of the TSC2 gene has not been fully characterized in any organism, but delineation of this region, and the regulatory elements contained within it, is under investigation.
The data we present on genomic structure and sequence will assist with the identification of constitutional and somatic mutations at the human TSC2 locus. Some 5% of germ-line TSC2 mutations appear to be deletions detectable by pulse field gel electrophoresis or conventional Southern analysis (1 ,9 ). More detailed studies applying single strand conformation polymorphism (SSCP) and heteroduplex analysis may be useful both diagnostically and in the identification and characterization of functional domains of tuberin and vital residues within these. Unfortunately, the complex genomic structure of the human TSC2 gene with many small exons will make any exon screening approach to mutation identification labour intensive. As a wide spectrum of inactivating mutations might be expected, alternative approaches such as the protein truncation test (PTT) (32 ) may be more appropriate in a diagnostic setting. However, PTT does not detect missense mutations, which are potentially the most revealing in terms of function, and SSCP and related techniques will remain useful in the research-oriented laboratory.
The sequence of the Fugu TSC2 gene was obtained from the complete sequence of cosmid 295C6 and individual walking sequences from the adjacent cosmid 52G18. Sonicated cosmid 295C6 DNA was shotgun cloned into SmaI cut M13 vector. Sequence was obtained using both dye primer and dye terminator chemistries and automated detection with ABI 373 or 377 sequencers. Walking primers were used to confirm sequence on both strands and to obtain the complete sequence of exon 1 from cosmid 52G18.
Four cDNA clones spanning the human TSC2 transcript have been described previously, 2A-6, 4.9, 4B2 and 1A1 (1 ). Additional cDNAs were identified by screening a human fetal brain cDNA library (Stratagene) with the established TSC2 cDNAs. Previously undescribed cDNAs are 4.1, 4A2, 4B5, 1.1, 1/1, 1/3, 3.3, 4B4, 2A1, 2/1, 2B1 and 1B1. These were mapped relative to cosmids LADS4 and ZDS5 and to each other. Eight of the clones spanned the region of the gene predicted to contain an additional exon (exon 31) in Fugu. These were investigated for evidence of the additional exon by PCR using the primers 3699F (5'-TGCCCCTGCAGGAGCTGTCTA-3') and MP53 (5'-GCGCCTGTCCATGCCTAGC-3'). cDNA 4A2 was found to contain an additional 69 bp. The PCR product was sequenced directly.
PstI and TaqI restriction fragments from cosmids LADS4 and ZDS5 (1 ), were shotgun cloned into pBluescript (Stratagene). Recombinants were picked on to duplicate grids and transferred to Hybond membrane. Exon containing clones were identified by colony hybridization with TSC2 cDNA fragments according to the method of Grunstein and Hogness (33 ) and the positive genomic clones were assigned to their relative positions in the cDNA.
Initially, intron-exon boundaries were determined by single stranded sequencing of the PstI and TaqI shotgun clones using the M13 reverse primer (30 boundaries). Further sequencing of these clones using additional intragenic primers identified an additional 29 boundaries (primer sequences available upon request). The boundary sequences were completed by direct sequencing of PCR products generated from genomic clones at the TSC2 locus. PCR amplification was carried out in a 50 [mu]l total volume containing 0.2 mM of each dNTP, 25 pmol of each primer and 1 unit of Amersham Taq polymerase in standard PCR buffer (10 mM Tris-HCl, 50 mM KCl, 2 mM MgCl2). PCR conditions used for each primer set were similar (95oC 2 min followed by 30 cycles of 95oC 1 min, 55oC-60oC 1 min, 72oC 2 min). Direct DNA sequencing was carried out using the PCR product sequencing kit (Amersham), using exonuclease I and shrimp alkaline phosphatase for pre-enzymic treatment of the PCR products. The sequences at all splice sites were analysed for agreement with the 3' and 5' splicing consensus sequences using the method described by Shapiro and Senapathy (21 ).
An approximate genomic map was constructed by hybridization of TSC2 cDNA fragments to restriction digests of cosmid and human genomic DNA. These results were confirmed by hybridization of exon-containing shotgun clones to the same digests. Finer mapping was achieved by restriction digest analysis of the shotgun clones and PCR products derived from the TSC2 genomic locus. In many instances restriction mapping data were confirmed by sequence analysis.
Total RNA from Fugu pooled tissues and from human lymphoblastoid lines for use in RT-PCR experiments was isolated using guanidinium thiocyanate-phenol-chloroform extraction (34 ). First-strand cDNA was obtained from 1 [mu]g of RNA using the Promega Reverse Transcription System according to the supplied protocols. One-twentieth of the reaction volume was used in subsequent PCR reactions with Fugu or human specific primers.
Sequence analysis and exon prediction was carried out with MacVectorTM, GRAILII (20 ), and Netgene (19 ). Homology searches to identify sequence, motif and structural similarity were performed with Blastn, Blastp, PredictProtein (25 -27 ), TMAP (24 ), MOTIF (Prosite directory, ref. 35 ), and BLOCKS by direct access to the servers via the worldwide web. Sequence alignments were performed using the GCG package.
The authors would like to thank their colleagues in the European Chromosome 16 Tuberous Sclerosis Consortium and also Sydney Brenner, Sam Aparicio, Anne-Marie Frischauf and Peter Harper. This work was supported by the Tuberous Sclerosis Association (GB), the Medical Research Council, the London Law Trust and the Dutch Kidney Foundation.
1 The European Chromosome 16 Tuberous Sclerosis Consortium (1993) Identification and characterisation of the tuberous sclerosis gene on chromosome 16. Cell,75, 1305-1315.
2 Sampson, J.R., Janssen, L.A.J., Sandkuijl, L.A. and the Tuberous Sclerosis Collaborative Group (1992) Linkage investigation of three putative tuberous sclerosis determining loci on chromosomes 9q, 11q and 12q. J. Med. Genet.,29, 861-866.MEDLINE Abstract
3 Kwiatkowski, D.J., Armour, J., Bale, A.E., Fountain, J.W., Goudie, D., Haines, J.L., Knowles, M.A., Pilz, A., Slaugenhaupt, S. and Povey, S. (1993) Report on the second international workshop on human chromosome 9. Cytogenet. Cell Genet.,64, 94-106.
4 Povey, S., Burley, M.W., Attwood, J., Benham, F., Hunt, D., Jeremiah, S.J., Franklin, D., Gillet, G., Malas, S., Robson, E.B., Tippett, P., Edwards, J.H., Kwiatkowski, D.J., Super, M., Mueller, R., Fryer A., Clarke, A., Webb D. and Osborne J. (1994) Two loci for tuberous sclerosis: one on 9q34 and one on 16p13. Ann. Hum. Genet.,58, 107-127.MEDLINE Abstract
5 Janssen, B., Sampson, J., van der Est, M., Deelen, W., Verhoef, S., Daniels, I., Hesseling, A., Brook-Carter, P., Nellist, M., Lindhout, D., Sandkuijl, L. and Halley, D. (1994) Refined localisation of TSC1 by combined analysis of 9q34 and 16p13 data in 14 tuberous sclerosis families. Hum. Genet.,94, 437-440.MEDLINE Abstract
6 Brook-Carter, P.T., Peral, B., Ward, C.J., Thompson, P., Hughes, J., Maheshwar, M., Nellist, M., Gamble, V., Harris, P.C. and Sampson, J.R. (1994) Deletion of the TSC2 and PKD1 genes associated with severe infantile polycystic kidney disease-a contiguous gene syndrome. NatureGenet.,8, 328-332.
8 Green, A.J., Smith, M. and Yates, J.R.W. (1993) Loss of heterozygosity on chromosome 16p13.3 in hamartomas from tuberous sclerosis patients. Nature Genet.,6, 193-196.MEDLINE Abstract
9 Green, A.J., Johnson, P.H. and Yates, J.R.W. (1994) The tuberous sclerosis gene on chromosome 9q34 acts as a growth suppressor. Hum. Mol. Genet.,3, 1833-1834.MEDLINE Abstract
10 Carbonara, C., Longa, L., Grosso, E., Borrone, C., Garre, M.G., Brisigotti, M. and Migone, M. (1994) 9q34 loss of heterozygosity in a tuberous sclerosis astrocytoma suggests a growth suppressor-like activity also for the TSC1 gene. Hum. Mol. Genet.,3, 1829-1832.MEDLINE Abstract
11 Kobayashi, T., Nishizawa, M., Hirayama, Y., Kobayashi, E. and Hino, O. (1995) cDNA structure, alternative splicing and exon-intron organisation of the predisposing tuberous sclerosis (Tsc2) gene of the Eker rat model. Nucleic Acids Res.,23, 2608-2613.MEDLINE Abstract
12 Kobayashi, T., Hirayama, Y., Kobayashi, E., Kubo, Y. and Hino, O. (1995) A germline insertion in the tuberous sclerosis (TSC2) gene gives rise to the Eker rat model of dominantly inherited cancer. Nature Genet.,9, 70-74.MEDLINE Abstract
13 Yeung, R.S., Xiao, G-H., Jin, W-C., Testa, J.R. and Knudson, A.G. (1994) Predisposition to renal cell carcinoma in the Eker rat is determined by germ-line mutation of the tuberous sclerosis 2 (TSC2) gene. Proc. Natl Acad. Sci. USA,91, 11413-11416.MEDLINE Abstract
14 Everitt, J.I., Goldsworthy, T.L., Wolf, D.C. and Walker, C.L. (1992) Hereditary renal cell carcinoma in the Eker rat: a rodent familial cancer syndrome. J. Urol.,148, 1932-1936.MEDLINE Abstract
15 The European Polycystic Kidney Disease Consortium (1994) The polycystic kidney disease 1 gene encodes a 14kb transcript and lies within a duplicated region on chromosome 16. Cell,77, 881-894.
16 Brenner, S., Elgar, G., Sandford, R., Macrae, A., Venkatesh, B. and Aparicio, S. (1993) Characterisation of the pufferfish (Fugu) genome as a compact model vertebrate genome. Nature,366, 265-268. MEDLINE Abstract
17 Altschul, S.F., Gish, W., Miller, M., Myers, E.W. and Lipman, D.J. (1990) Basic local alignment search tool. J. Mol. Biol.,215, 403-410. MEDLINE Abstract
18 Elgar, G., Sandford, R., Aparicio, S., Venkatesh, B., Macrae, A. and Brenner, S. (1995) Comparative genomic analysis using the Japanese Puffer fish (Fugu rubripes). TrendsGenet., in press.
19 Brunak, S., Engelbrecht, J. and Knudsen, S. (1991) Prediction of human mRNA donor and acceptor sites from the DNA sequence. J. Mol. Biol.,220, 49-65. MEDLINE Abstract
20 Xu, Y., Mural, R. J., Shah, M. B. and Uberbacher, E. C. (1994) Recognising exons in genomic sequence using GRAIL II. In Genetic Engineering: Principles and Methods, Vol 15. Jane Setlow (Ed.). Plenum Press.
21 Shapiro, M.B. and Senapathy, P. (1988) RNA splice junctions of different classes of eukaryotes: sequence statistics and functional implications in gene expression. Nucleic AcidsRes.,15, 7155-7174.
22 Higgins, D.G. (1994) CLUSTAL V: multiple alignment of DNA and protein sequences. Methods Mol. Biol.,25, 307-318. MEDLINE Abstract
23 Kyte, J. and Doolittle, R. F. (1982) A simple method for displaying the hydropathic character of a protein. J. Mol. Biol.,157, 105-132.MEDLINE Abstract
24 Persson, B. and Argos, P. (1994) Prediction of transmembrane segments in proteins utilising multiple sequence alignments. J. Mol. Biol.,237, 182-192.MEDLINE Abstract
25 Rost, B. and Sander, C. (1993) Prediction of protein structure at better than 70% accuracy. J.Mol. Biol.,232, 584-599.
26 Rost, B. and Sander, C. (1994) Combining evolutionary information and neural networks to predict protein secondary structure. Proteins,19, 55-72.MEDLINE Abstract
27 Rost, B., Casadio, R., Fariselli, P. and Sander, C. (1995) Prediction of helical transmembrane segments at 75% accuracy. Protein Sci.,4, 521-533.MEDLINE Abstract
28 Wienecke, R., Konig, A. and DeClue, J.E. (1995) Identification of tuberin, the tuberous sclerosis 2 product-tuberin possesses specific rap1GAP activity. J. Biol. Chem.,270, 16409-16414.MEDLINE Abstract
29 Xu, L., Sterner, C., Maheshwar, M., Wilson, P.J., Nellist, M., Short, P., Haines, J., Sampson, J.R. and Ramesh, V. (1995) Alternative splicing of the tuberous sclerosis 2 (TSC2) gene in human and mouse tissues. Genomics,27, 475-480.MEDLINE Abstract
30 Baxendale, S., Abdulla, S., Elgar, G., Buck. D., Berks, M., Micklem, G., Durbin, R., Bates, G., Brenner, S., Beck, S. and Lehrach H. (1995) Comparative sequence analysis of the human and pufferfish Huntington's disease genes. Nature Genet.,10, 67-76.MEDLINE Abstract
31 Mason., P.J., Stevens, D.J., Luzzatto, L., Brenner, S. and Aparicio, S. (1995) Genomic structure and sequence of the Fugu rubripes glucose-6-phosphate dehydrogenase gene (G6PD). Genomics,26, 587-591.MEDLINE Abstract
32 Roest, P.A.M., Roberts, R.G., Sugino, S., van Ommen, G-J. and den Dunnen, J.T. (1993) Protein truncation test (PTT) for rapid detection of translation-terminating mutations. Hum.Mol. Genet.,2, 1719-1721.
33 Grunstein, M. and Hogness, D.S. (1975) Colony hybridisation: a method for the isolation of cloned DNAs that contain a specific gene. Proc. Natl Acad. Sci. USA,72, 8961.
34 Chomczynski, P. and Sacchi, N. (1987) Single-step method of RNA isolation by acid guanidinium thiocyanate-phenol-chloroform extraction. Anal. Biochem.,162, 156-159.MEDLINE Abstract
35 Bairoch, A. (1992) Prosite: a dictionary of sites and patterns in proteins. Nucleic Acids Res.,20, 2013-2018.MEDLINE Abstract
*To whom correspondence should be addressed
+Present address: Department of Clinical Genetics, Erasmus University, 3015 GE Rotterdam, The Netherlands
This page is maintained by OUP admin. Last updated Thu Oct 31 15:08:29 GMT 1996. Part of the OUP Journals World Wide Web service.Copyright Oxford University Press, 1996