Human Molecular Genetics, Vol 7, 919-932, Copyright © 1998 by Oxford University Press
MQ Zhang
To facilitate gene finding and for the investigation of human molecular
genetics on a genome scale, we present a comprehensive survey on various
statistical features of human exons. We first show that human exons with
flanking genomic DNA sequences can be classified into 12 mutually exclusive
categories. This classification could serve as a standard for future
studies so that direct comparisons of results can be made. A database for
eight categories (related to human genes in which coding regions are split
by introns) was built from GenBank release 87.0 and analyzed by a number of
methods to characterize statistical features of these sequences that may
serve as controls or regulatory signals for gene expression. The
statistical information compiled includes profiles of signals for
transcription, splicing and translation, various compositional statistics
and size distributions. Further analyses reveal novel correlations and
constraints among different splicing features across an internal exon that
are consistent with the Exon Definition model. This information is
fundamental for a quantitative view of human gene organization, and should
be invaluable for individual scientists to design human molecular genetics
experiments.
ARTICLES
Statistical features of human exons and their flanking regions
Cold Spring Harbor Laboratory, PO Box 100, Cold Spring Harbor, NY 11724, USA. mzhang@cshl.org
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
N. Vignier, S. Schlossarek, B. Fraysse, G. Mearini, E. Kramer, H. Pointu, N. Mougenot, J. Guiard, R. Reimer, H. Hohenberg, et al. Nonsense-Mediated mRNA Decay and Ubiquitin-Proteasome System Regulate Cardiac Myosin-Binding Protein C Mutant Levels in Cardiomyopathic Mice Circ. Res., July 31, 2009; 105(3): 239 - 248. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Sinha, S. Nikolajewa, K. Szafranski, M. Hiller, N. Jahn, K. Huse, M. Platzer, and R. Backofen Accurate prediction of NAGNAG alternative splicing Nucleic Acids Res., June 1, 2009; 37(11): 3569 - 3579. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. E. Loughlin, R. E. Mansfield, P. M. Vaz, A. P. McGrath, S. Setiyaputra, R. Gamsjaeger, E. S. Chen, B. J. Morris, J. M. Guss, and J. P. Mackay The zinc fingers of the SR-like protein ZRANB2 are single-stranded RNA-binding domains that recognize 5' splice site-like sequences PNAS, April 7, 2009; 106(14): 5581 - 5586. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. R. Guyon, J. Goswami, S. J. Jun, M. Thorne, M. Howell, T. Pusack, G. Kawahara, L. S. Steffen, M. Galdzicki, and L. M. Kunkel Genetic isolation and characterization of a splicing mutant of zebrafish dystrophin Hum. Mol. Genet., January 1, 2009; 18(1): 202 - 211. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Piskol and W. Stephan Analyzing the Evolution of RNA Secondary Structures in Vertebrate Introns Using Kimura's Model of Compensatory Fitness Interactions Mol. Biol. Evol., November 1, 2008; 25(11): 2483 - 2492. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. D. Fackenthal and L. A. Godley Aberrant RNA splicing and its functional consequences in cancer cells Dis. Model. Mech., July 1, 2008; 1(1): 37 - 42. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Ryabov and M. Gribskov Spontaneous symmetry breaking in genome evolution Nucleic Acids Res., May 1, 2008; 36(8): 2756 - 2763. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Gao, A. Masuda, T. Matsuura, and K. Ohno Human branch point consensus sequence is yUnAy Nucleic Acids Res., April 1, 2008; 36(7): 2257 - 2267. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. Majerciak, K. Yamanegi, E. Allemand, M. Kruhlak, A. R. Krainer, and Z.-M. Zheng Kaposi's Sarcoma-Associated Herpesvirus ORF57 Functions as a Viral Splicing Factor and Promotes Expression of Intron-Containing Viral Lytic Genes in Spliceosome-Mediated RNA Splicing J. Virol., March 15, 2008; 82(6): 2792 - 2801. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Sahashi, A. Masuda, T. Matsuura, J. Shinmi, Z. Zhang, Y. Takeshima, M. Matsuo, G. Sobue, and K. Ohno In vitro and in silico analysis reveals an efficient algorithm to predict the splicing consequences of mutations at the 5' splice sites Nucleic Acids Res., September 25, 2007; 35(18): 5995 - 6003. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Vilardell and A. Sanchez-Pla Hypothesis testing approaches to the exon prediction problem Bioinformatics, December 15, 2006; 22(24): 3003 - 3008. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Buratti, M. Baralle, and F. E. Baralle Defective splicing, disease and therapy: searching for master checkpoints in exon definition Nucleic Acids Res., July 19, 2006; 34(12): 3494 - 3510. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Swan, S. A. Richards, N. P. Duroudier, I. Sayers, and I. P. Hall Alternative Promoter Use and Splice Variation in the Human Histamine H1 Receptor Gene Am. J. Respir. Cell Mol. Biol., July 1, 2006; 35(1): 118 - 126. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Spena, M. L. Tenchini, and E. Buratti Cryptic splice site usage in exon 7 of the human fibrinogen B{beta}-chain gene is regulated by a naturally silent SF2/ASF binding site within this exon RNA, June 1, 2006; 12(6): 948 - 958. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Sinnreich, C. Therrien, and G. Karpati Lariat branch point mutation in the dysferlin gene with mild limb-girdle muscular dystrophy Neurology, April 11, 2006; 66(7): 1114 - 1116. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Schickel, C. Beetz, C. Frommel, G. Heide, A. Sasse, P. Hemmerich, and T. Deufel Unexpected pathogenic mechanism of a novel mutation in the coding sequence of SPG4 (spastin) Neurology, February 14, 2006; 66(3): 421 - 423. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Akerman and Y. Mandel-Gutfreund Alternative splicing regulation at tandem 3' splice sites Nucleic Acids Res., January 3, 2006; 34(1): 23 - 31. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Yamanegi, S. Tang, and Z.-M. Zheng Kaposi's Sarcoma-Associated Herpesvirus K8{beta} Is Derived from a Spliced Intermediate of K8 Pre-mRNA and Antagonizes K8{alpha} (K-bZIP) To Induce p21 and p53 and Blocks K8{alpha}-CDK2 Interaction J. Virol., November 15, 2005; 79(22): 14207 - 14221. [Abstract] [Full Text] [PDF] |
||||
![]() |
D Baralle and M Baralle Splicing in action: assessing disease causing sequence changes J. Med. Genet., October 1, 2005; 42(10): 737 - 748. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Kol, G. Lev-Maor, and G. Ast Human-mouse comparative analysis reveals that branch-site plasticity contributes to splicing regulation Hum. Mol. Genet., June 1, 2005; 14(11): 1559 - 1568. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. L. Rubin, C. F. Thorn, T. E. Klein, and R. B. Altman A Statistical Approach to Scanning the Biomedical Literature for Pharmacogenetics Knowledge J. Am. Med. Inform. Assoc., March 1, 2005; 12(2): 121 - 129. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. J. Kang, K. O. Choi, B.-D. Kim, S. Kim, and Y. J. Kim FESD: a Functional Element SNPs Database in human Nucleic Acids Res., January 1, 2005; 33(suppl_1): D518 - D522. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Buratti, M. Baralle, L. De Conti, D. Baralle, M. Romano, Y. M. Ayala, and F. E. Baralle hnRNP H binding at the 5' splice site correlates with the pathological effect of two intronic mutations in the NF-1 and TSH{beta} genes Nucleic Acids Res., August 6, 2004; 32(14): 4224 - 4236. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. D. Emes, M. C. Riley, C. M. Laukaitis, L. Goodstadt, R. C. Karn, and C. P. Ponting Comparative Evolutionary Genomics of Androgen-Binding Protein Genes Genome Res., August 1, 2004; 14(8): 1516 - 1529. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. C. H. Kerr, F. E. Holmes, and D. Wynick Novel Isoforms of the Sodium Channels Nav1.8 and Nav1.5 Are Produced by a Conserved Mechanism in Mouse and Rat J. Biol. Chem., June 4, 2004; 279(23): 24826 - 24833. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. C. C. Ryther, A. S. Flynt, B. D. Harris, J. A. Phillips III, and J. G. Patton GH1 Splicing Is Regulated by Multiple Enhancers Whose Mutation Produces a Dominant-Negative GH Isoform That Can Be Degraded by Allele-Specific Small Interfering RNA (siRNA) Endocrinology, June 1, 2004; 145(6): 2988 - 2996. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. H-F. Zhang and L. A. Chasin Computational definition of sequence motifs governing constitutive exon splicing Genes & Dev., June 1, 2004; 18(11): 1241 - 1250. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Sironi, G. Menozzi, L. Riva, R. Cagliani, G. P. Comi, N. Bresolin, R. Giorda, and U. Pozzoli Silencer elements as possible inhibitors of pseudoexon splicing Nucleic Acids Res., March 19, 2004; 32(5): 1783 - 1791. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Nogues, M. J. Munoz, and A. R. Kornblihtt Influence of Polymerase II Processivity on Alternative Splicing Depends on Splice Site Strength J. Biol. Chem., December 26, 2003; 278(52): 52166 - 52171. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. H-F. Zhang, K. A. Heller, I. Hefter, C. S. Leslie, and L. A. Chasin Sequence Information for the Splicing of Human Pre-mRNA Identified by Support Vector Machine Classification Genome Res., December 1, 2003; 13(12): 2637 - 2650. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Ohno, M. Milone, X.-M. Shen, and A. G. Engel A frameshifting mutation in CHRNE unmasks skipping of the preceding exon Hum. Mol. Genet., December 1, 2003; 12(23): 3055 - 3066. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Cocquet, E. De Baere, S. Caburet, and R. A. Veitia Compositional Biases and Polyalanine Runs in Humans Genetics, November 1, 2003; 165(3): 1613 - 1617. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. W. Soong, C. D. DeMaria, R. S. Alvania, L. S. Zweifel, M. C. Liang, S. Mittman, W. S. Agnew, and D. T. Yue Systematic Identification of Splice Variants in Human P/Q-Type Channel alpha 12.1 Subunits: Implications for Current Density and Ca2+-Dependent Inactivation J. Neurosci., December 1, 2002; 22(23): 10142 - 10152. [Abstract] [Full Text] [PDF] |
||||
![]() |
E H Stover, K J Borthwick, C Bavalia, N Eady, D M Fritz, N Rungroj, A B S Giersch, C C Morton, P R Axon, I Akil, et al. Novel ATP6V1B1 and ATP6V0A4 mutations in autosomal recessive distal renal tubular acidosis with new evidence for hearing loss J. Med. Genet., November 1, 2002; 39(11): 796 - 803. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. V. Postma, I. Denjoy, T. M. Hoorntje, J.-M. Lupoglazoff, A. Da Costa, P. Sebillon, M. M.A.M. Mannens, A. A.M. Wilde, and P. Guicheney Absence of Calsequestrin 2 Causes Severe Forms of Catecholaminergic Polymorphic Ventricular Tachycardia Circ. Res., October 18, 2002; 91 (8): e21 - e26. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. N. Hines, K. A. Hopp, J. Franco, K. Saeian, and F. P. Begun Alternative Processing of the Human FMO6 Gene Renders Transcripts Incapable of Encoding a Functional Flavin-Containing Monooxygenase Mol. Pharmacol., August 1, 2002; 62(2): 320 - 325. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. A. Schoenhard, M. Eren, C. H. Johnson, and D. E. Vaughan Alternative splicing yields novel BMAL2 variants: tissue distribution and functional characterization Am J Physiol Cell Physiol, July 1, 2002; 283(1): C103 - C114. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Chen, A. J. Gentles, J. Jurka, and S. Karlin Genes, pseudogenes, and Alu sequence organization across human chromosomes 21 and 22 PNAS, February 20, 2002; (2002) 52692099. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Clark and T. A. Thanaraj Categorization and characterization of transcript-confirmed constitutively and alternatively spliced introns and exons from human Hum. Mol. Genet., February 1, 2002; 11(4): 451 - 464. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y.-L. Yan, C. T. Miller, R. Nissen, A. Singer, D. Liu, A. Kirn, B. Draper, J. Willoughby, P. A. Morcos, A. Amsterdam, et al. A zebrafish sox9 gene required for cartilage morphogenesis Development, January 11, 2002; 129(21): 5065 - 5079. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Levine and R. Durbin A computational scan for U12-dependent introns in the human genome sequence Nucleic Acids Res., October 1, 2001; 29(19): 4006 - 4013. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Ando, N. J. Sarlis, J. Krishnan, X. Feng, S. Refetoff, M. Q. Zhang, E. H. Oldfield, and P. M. Yen Aberrant Alternative Splicing of Thyroid Hormone Receptor in a TSH-Secreting Pituitary Tumor Is A Mechanism for Hormone Resistance Mol. Endocrinol., September 1, 2001; 15(9): 1529 - 1538. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. A. Thanaraj and F. Clark Human GC-AG alternative intron isoforms with weak donor sites show enhanced consensus at acceptor exon positions Nucleic Acids Res., June 15, 2001; 29(12): 2581 - 2593. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Wolf, D. Mertens, C. Schaffner, C. Korz, H. Dohner, S. Stilgenbauer, and P. Lichter B-cell neoplasia associated gene with multiple splicing (BCMS): the candidate B-CLL gene on 13q14 comprises more than 560 kb covering all critical regions Hum. Mol. Genet., June 1, 2001; 10(12): 1275 - 1285. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Trexler, L. Bányai, and L. Patthy A human protein containing multiple types of protease-inhibitory modules PNAS, March 7, 2001; (2001) 61028398. [Abstract] [Full Text] |
||||
![]() |
M. Romano, R. Marcucci, and F. E. Baralle Splicing of constitutive upstream introns is essential for the recognition of intra-exonic suboptimal splice sites in the thrombopoietin gene Nucleic Acids Res., February 15, 2001; 29(4): 886 - 894. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. G. Fairbrother and L. A. Chasin Human Genomic Sequences That Inhibit Splicing Mol. Cell. Biol., September 15, 2000; 20(18): 6816 - 6825. [Abstract] [Full Text] |
||||
![]() |
H. Sun and L. A. Chasin Multiple Splicing Defects in an Intronic False Exon Mol. Cell. Biol., September 1, 2000; 20(17): 6414 - 6425. [Abstract] [Full Text] |
||||
![]() |
Z.-M. Zheng, J. Quintero, E. S. Reid, C. Gocke, and C. C. Baker Optimization of a Weak 3' Splice Site Counteracts the Function of a Bovine Papillomavirus Type 1 Exonic Splicing Suppressor In Vitro and In Vivo J. Virol., July 1, 2000; 74(13): 5902 - 5910. [Abstract] [Full Text] |
||||
![]() |
A.-M. Mallon, M. Platzer, R. Bate, G. Gloeckner, M.R.M. Botcherby, G. Nordsiek, M.A. Strivens, P. Kioschis, A. Dangel, D. Cunningham, et al. Comparative Genome Sequence Analysis of the Bpa/Str Region in Mouse and Man Genome Res., June 1, 2000; 10(6): 758 - 775. [Abstract] [Full Text] |
||||
![]() |
C. L. Lorson, E. Hahnen, E. J. Androphy, and B. Wirth A single nucleotide in the SMN gene regulates splicing and is responsible for spinal muscular atrophy PNAS, May 25, 1999; 96(11): 6307 - 6311. [Abstract] [Full Text] [PDF] |
||||
![]() |
Q. Wu and A. R. Krainer AT-AC Pre-mRNA Splicing Mechanisms and Conservation of Minor Introns in Voltage-Gated Ion Channel Genes Mol. Cell. Biol., May 1, 1999; 19(5): 3225 - 3236. [Full Text] [PDF] |
||||
![]() |
C.-a. A. Hu, W.-W. Lin, C. Obie, and D. Valle Molecular Enzymology of Mammalian Delta 1-Pyrroline-5-carboxylate Synthase. ALTERNATIVE SPLICE DONOR UTILIZATION GENERATES ISOFORMS WITH DIFFERENT SENSITIVITY TO ORNITHINE INHIBITION J. Biol. Chem., March 5, 1999; 274(10): 6754 - 6762. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. D. Schaal and T. Maniatis Multiple Distinct Splicing Enhancers in the Protein-Coding Sequences of a Constitutively Spliced Pre-mRNA Mol. Cell. Biol., January 1, 1999; 19(1): 261 - 273. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Chen, A. J. Gentles, J. Jurka, and S. Karlin Genes, pseudogenes, and Alu sequence organization across human chromosomes 21 and 22 PNAS, March 5, 2002; 99(5): 2930 - 2935. [Abstract] [Full Text] [PDF] |
||||
![]() |
Q. Wu and T. Maniatis Large exons encoding multiple ectodomains are a characteristic feature of protocadherin genes PNAS, March 28, 2000; 97(7): 3124 - 3129. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Trexler, L. Banyai, and L. Patthy A human protein containing multiple types of protease-inhibitory modules PNAS, March 27, 2001; 98(7): 3705 - 3709. [Abstract] [Full Text] [PDF] |
||||























