Human Molecular Genetics Advance Access originally published online on July 13, 2005
Human Molecular Genetics 2005 14(16):2415-2419; doi:10.1093/hmg/ddi243
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Prediction of pathogenic mutations in mitochondrially encoded human tRNAs
1Section on Ecology, Behavior and Evolution, Division of Biological Sciences, University of California at San Diego, 2218 Muir Biology Building, La Jolla, CA 92093, USA and 2Engelhardt Institute of Molecular Biology, Vavilova 32, 119991 Moscow, Russia
* To whom correspondence should be addressed. Tel: +1 7916 2566774; Fax: +1 858 534718; Email: kondrashov{at}ucdavis.edu
Received May 30, 2005; Revised June 24, 2005; Accepted July 1, 2005
| ABSTRACT |
|---|
|
|
|---|
Some mutations in human mitochondrial tRNAs are severely pathogenic. The available computational methods have a poor record of predicting the impact of a tRNA mutation on the phenotype and fitness. Here patterns of evolution at tRNA sites that harbor pathogenic mutations and at sites that harbor phenotypically cryptic polymorphisms were compared. Mutations that are pathogenic to humans occupy more conservative sites, are only rarely fixed in closely related species, and, when located in stem structures, often disrupt WatsonCrick pairing and display signs of compensatory evolution. These observations make it possible to classify
90% of all known pathogenic mutations as deleterious together with only
30% of polymorphisms. These polymorphisms segregate at frequencies that are more than two times lower than frequencies of polymorphisms classified as benign, indicating that at least
30% of known polymorphisms in mitochondrial tRNAs affect fitness negatively. | INTRODUCTION |
|---|
|
|
|---|
A variety of often severe genetic disorders, mostly neuromuscular and neurodegenerative, are caused by mutations in mitochondrial (mt) DNA (1
It is thought that the accurate identification of pathogenic mutations would enable researchers to describe their molecular and biochemical characteristics (7
,8
), which may lead to more successful treatment of the resulting pathologies (8
,9
). Currently, a mutation can be identified as pathogenic by a variety of different criteria (10
), one of which is the preference for the mutation to be found at an evolutionarily conserved site, which by itself is a poor predictor of pathogenic nature of a variant (11
13
). The other criteria, such as the requirement for the pathogenic mitochondria to be heteroplasmic, are not computational and often require extensive work in the laboratory (10
). Thus, the availability of an accurate computational approach should aid rapid and inexpensive identification of novel pathogenic mutations.
Previous comparisons of pathogenic mutations and phenotypically cryptic polymorphisms in human mt tRNAs showed that pathogenic mutations are more often located at conservative sites (11
14
) and stem structures (14
) and tend to disrupt WatsonCrick (WC) nucleotide pairing in stems (11
,14
). However, these observations turned out to be insufficient to predict the impact of a nucleotide substitution in an mt tRNA on the phenotype and fitness (11
,14
). Here, an evolutionarily based computational analysis of the differences between pathogenic and non-pathogenic substitutions is described.
| RESULTS |
|---|
|
|
|---|
Pathogenic mutations and cryptic polymorphisms were mapped on multiple alignments of the 22 mt tRNAs from 138 different mammals. Both nucleotides which act as mutations and as polymorphisms in humans may be found at orthologous sites of closely related species (15
|
There are 94 known variants (47 pathogenic mutations and 47 polymorphisms) in mt tRNA stems that disrupt WC pairing. For each variant, the number of species was counted in which a variant was found without the compensatory substitution, and the number of species in which a potentially WC pairing-restoring compensatory substitution was found without the variant. For example, in Figure 1A, both of these numbers are equal to 0 (variant A is always found opposite a T) and in Figure 1B, these values are 1 and 2 for variant G, and 4 and 0 for variant A. The sum of these two numbers, obtained separately for primates only and for all mammals, was used as a gauge of evolutionary independence of the interacting sites. A variant was classified as benign if this sum was greater than two for primates (four pathogenic mutations; 15 polymorphisms) or if this sum was greater than nine for all mammals, and the site was conserved in less than 100 of the 138 (80 out of 119 for tRNALys variants, see Materials and Methods) available mammals (one pathogenic mutation; five polymorphisms). All other variants were classified as deleterious (42 pathogenic mutations; 27 polymorphisms).
If a variant within an mt tRNA stem that does not disrupt a WC pair (two pathogenic mutation and 38 polymorphisms total) affects a site that is conserved in more than 128 of the 138 available mammals (108 out of 119 for tRNALys variants, see Materials and Methods), and the variable nucleotide was found in fewer than five mammals, it was classified as deleterious (two pathogenic mutation; four polymorphisms) and other variants (34 polymorphisms) were classified as benign. Together, these criteria place 94% pathogenic mutations localized in the stems, and 36% of such cryptic polymorphisms, into the deleterious category.
Without tertiary structure information, the only available data on tRNA loops are sequence conservation. Previous attempts to distinguish pathogenic mutations from polymorphisms relied on sequence conservation in all the available mammalian orthologs (11
). However, this dataset is too diverse, because mt tRNA loops are often not conserved even within primates (Fig. 1), and the use of non-primate mammals may obscure the pattern of conservation. In general, the use of closely related species is preferred, because of the possibility of functional or compensatory changes in more distant orthologs.
Two criteria were applied to 18 pathogenic mutations and 98 polymorphisms located outside of stems: (i) whether a variant is found in one of the five non-human Greater Ape species and (ii) the level of conservation in all of the 17 primate species of the site of a variant. A variant was classified as benign if it was present in at least one Greater Ape species (one pathogenic mutation; 42 polymorphisms) or if its site was not conservative, in the sense that its most common nucleotide was found in less than 15 out of 17 primate species (one pathogenic mutation; 35 polymorphisms). Otherwise, a variant was classified as deleterious (16 pathogenic mutations; 21 polymorphisms). This simple approach classifies as deleterious 89% of pathogenic mutations and only 21% of polymorphisms located outside of stem structures.
Among all the known variants within all human mt tRNAs,
70% of polymorphisms were classified as benign, whereas the rate of false negative predictions was low: only
10% of all known pathogenic mutations were classified as benign (Fig. 2). The classification of
90% of all pathogenic mutation as deleterious is a substantial improvement over previous results (11
,14
). Still, classification of
30% of phenotypically cryptic polymorphisms as deleterious may appear as a deficiency of the proposed analysis, should it represent the rate of false positive prediction. Previous analyses assumed that all segregating polymorphisms are selectively neutral (11
,14
). However, owing to a very high mutation rate of the mitochondrial DNA (16
), many segregating variants may be deleterious (17
). In addition, some polymorphisms may be sequencing errors or heteroplasmic variants.
|
Indeed, the average frequency of polymorphisms classified as deleterious (0.0014) is more than two times lower than that of the polymorphisms classified as benign (0.0032), which is highly significant (n1=131, n2=52, U=4401.0, P<0.0018, two-tailed Mann Whitney U-test). Most of the polymorphisms that were classified as deleterious are singletons, i.e. they are present in only one of the 2064 individuals from which the data on segregating variants have been obtained (18
30% of all polymorphisms in human tRNAs classified as deleterious (52/183), indeed, reduce fitness, despite the lack of obvious phenotypic manifestation. The probability that a polymorphism drawn at random from the human population is deleterious is
25% (340/1415). The high estimate is not surprising because some of the polymorphisms found in the human population used in this study are known to contribute to the progression of mitochondrial disease (17
|
Taken together, these observations suggest that the described method has a high rate of accuracy for distinguishing benign variants from severely and slightly pathogenic ones. To aid the identification of new pathogenic variants, this method was applied to all possible mutations of the 22 mt tRNAs (Supplementary Material). As expected, the mutations disrupting WC pairing in stems were predicted to have the highest probability of being deleterious (2218 deleterious mutations out of 2346 mutations total), whereas mutations in stems that to do not disrupt WC pairs have the lowest (104 out of 354). Mutations that are not located in stems have an intermediate probability of being deleterious (1095 out of 1767), most likely due to the inclusion of the highly conserved anticodon loop. Obviously, most lethal mutations should also be classified as deleterious by this approach.
| DISCUSSION |
|---|
|
|
|---|
Polymorphisms classified here as deleterious are not necessarily overtly pathogenic, because even a slight decrease in fitness is enough to reduce the frequency of a variant (19
Similar attempts to discriminate pathogenic mutations from polymorphisms were made for amino acid replacements in nuclear encoded proteins (22
,23
). However, the described analysis of pathogenic mutations in mt tRNAs is slightly more successful than that in proteins, possibly because of the difficulty of predicting patterns of compensatory evolution in proteins (24
), relative to tRNAs (15
). Many polymorphisms found in nuclear encoded proteins are deleterious (22
,23
); future application of the method described here for the identification of deleterious variants in the two rRNA genes encoded in the mitochondria and the application of protein-based methods (25
) for the identification of deleterious variants in 13 mitochondrially encoded proteins should lead to an estimate of the genetic load (26
) of the mitochondrial genome.
There are four reasons why the method described here appears to have a higher accuracy than those that were published previously (11
,14
). First, sites in mt tRNAs were segregated into three, rather than two, structural categories. Secondly, this method allows for the possibility of having a variable site that excludes a particular nucleotide, i.e. it allows for the possibility that A, T and G are neutral equivalents and C is deleterious. Thirdly, use patterns of compensatory evolution were used in stem sites that disrupt WC pairs. Finally, sequence conservation information is used only in closely related species. Further improvement of this method may be based fine-tuning the cutoff values that were used, or on the addition of other ways to distinguish pathogenic and polymorphic variants. For example, the use of tertiary structure information improves the prediction of pathogenic mutations in proteins (23
) and should help to improve the method of pathogenic mutation prediction for mt tRNAs as well. However, given that sequence similarity of closely related species is a good predictor of pathogenicity, the present method can be improved by the availability of mitochondrial sequences of other primate species, especially from Platyrrhini and Strepsirhini, of which only three species are currently available.
| MATERIALS AND METHODS |
|---|
|
|
|---|
A list of pathogenic mutations was taken from MitoMap (5
G) was excluded from the list of pathogenic mutations because of the raised doubts of its pathogenic nature (27
5% for pathogenic compensations in WC pairs (11
12%) in primate species and 15 out of 138 (
11%) in all mammals. Sequence conservation cutoff values were taken as
5% for non-WC pairs, as was done previously (11
20% was chosen for sites located outside of stem structures because of a substantially higher fraction of compensatory evolution in such sites (15| SUPPLEMENTARY MATERIAL |
|---|
|
|
|---|
Supplementary Material is available at HMG Online.
| ACKNOWLEDGEMENTS |
|---|
The author thanks P. Andolfatto, D. Bachtrog, N. Esipova, S. Makeev, A. Kondrashov, V. Ramensky, V. Tumanyan and P. Vlasov for a critical reading of the manuscript. The author is an NSF Graduate Research Fellow. This work was supported by a Contract of the Russian Ministry of Science and Education (02.434.11.1008 [EC] ) and a grant on Molecular and Cellular Biology from RAS.
Conflict of Interest statement. None declared.
| REFERENCES |
|---|
|
|
|---|
- De Vivo, D.C. (1993) The expanding clinical spectrum of mitochondrial diseases. Brain Dev., 15, 122.[CrossRef][ISI][Medline]
- Taylor, R.W. and Turnbull, D.M. (2005) Mitochondrial DNA mutations in human disease. Nat. Rev. Genet., 6, 389402.[Medline]
-
Zeviani, M. and Di Donato, S. (2004) Mitochondrial disorders. Brain, 127, 21532172.
[Abstract/Free Full Text] -
Jacobs, H.T. (2003) Disorders of mitochondrial protein synthesis. Hum. Mol. Genet., 12, R293R301.
[Abstract/Free Full Text] -
Brandon, M.C., Lott, M.T., Cuong Nguyen, K., Spolim, S., Navethe, S.B., Baldi, P. and Wallace, D.C. (2005) MITOMAP: a human mitochondrial genome database2004 update. Nucleic Acids Res., 33, D611D613.
[Abstract/Free Full Text] - Helm, M., Brule, H., Friede, D., Giege, R., Putz, D. and Florentz, C. (2000) Search for characteristic structural features of mammalian mitochondrial tRNAs. RNA, 6, 13561379.[Abstract]
- Florentz, C., Sohm, B., Tryoen-Toth, P., Putz, J. and Sissler, M. (2003) Human mitochondrial tRNAs in health and disease. Cell. Mol. Life Sci., 60, 13561375.[CrossRef][ISI][Medline]
- Wittenhagen, L.M. and Kelley, S.O. (2003) Impact of disease-related mitochondrial mutations on tRNA structure and function. Trends Biochem. Sci., 28, 605611.[CrossRef][ISI][Medline]
- Smith, P.M., Ross, G.F., Taylor, R.W., Turnbull, D.M. and Lightowlers, R.N. (2004) Strategies for treating disorders of the mitochondrial genome. Biochim. Biophys. Acta, 1659, 232239.[Medline]
- DiMauro, S. and Schon, E.A. (2001) Mitochondrial DNA mutations in human disease. Am. J. Med. Genet., 106, 1826.[CrossRef][ISI][Medline]
- Florentz, C. and Sissler, M. (2001) Disease-related versus polymorphic mutations in human mitochondrial tRNAs. Where is the difference? EMBO Rep., 2, 481486.[ISI][Medline]
- Schon, E.A., Bonilla, E. and DiMauro, S. (1997) Mitochondrial DNA mutations and pathogenesis. J. Bioenerg. Biomembr., 29, 131149.[CrossRef][ISI][Medline]
-
Sternberg, D., Chatzoglou, E., Laforêt, P., Fayet, G., Jardel, C., Blondy, P., Fardeau, M., Amselem, S., Eymard, B. and Lombés, A. (2001) Mitochondrial DNA transfer RNA gene sequence variations in patients with mitochondrial disorders. Brain, 124, 984994.
[Abstract/Free Full Text] - McFarland, R., Elson, J.L., Taylor, R.W., Howell, N. and Turnbull, D.M. (2004) Assigning pathogenicity to mitochondrial tRNA mutations: when definitely maybe is not good enough. Trends Genet., 20, 591596.[CrossRef][ISI][Medline]
- Kern, A.D. and Kondrashov, F.A. (2004) Mechanisms and convergence of compensatory evolution in mammalian mitochondrial tRNAs. Nat. Genet.. 36, 12071212.[CrossRef][ISI][Medline]
- Ballard, J.W.O. and Whitlock, M.C. (2004) The incomplete natural history of the mitochondria. Mol. Ecol., 13, 729744.[CrossRef][Medline]
-
Wallace, D.C. (1994) Mitochondrial DNA sequence variation in human evolution and disease. Proc. Natl Acad. Sci. USA, 91, 87398746.
[Abstract/Free Full Text] - Ingman, M., Kaessmann, H., Paabo, S. and Gyllenstein, U. (2000) Mitochondrial genome variation and the origin of modern humans. Nature, 408, 708713.[CrossRef][Medline]
- Kimura, M. (1983) The Neutral Theory of Molecular Evolution. Cambridge University Press, Cambridge, UK.
- Niemi, A.K. et al. (2003) Mitochondrial DNA polymorphisms associated with longevity in a Finnish population. Hum. Genet., 112, 2933.[CrossRef][ISI][Medline]
-
Tanaka, M., Takeyasu, T., Fuku, N., Li-Jun, G. and Kurata, M. (2004) Mitochondrial genome single nucleotide polymorphisms and their phenotypes in the Japanese. Ann. NY Acad. Sci., 1011, 720.
[Abstract/Free Full Text] - Sunyaev, S.R., Lathe, W.C., III, Ramensky, V.E. and Bork, P. (2000) SNP frequencies in human genes an excess of rare alleles and differing modes of selection. Trends Genet., 16, 335337.[CrossRef][ISI][Medline]
-
Sunyaev, S., Ramensky, V., Koch, I., Lathe III, W., Kondrashov, A.S. and Bork, P. (2001) Prediction of deleterious human alleles. Hum. Mol. Genet., 10, 591597.
[Abstract/Free Full Text] -
Kondrashov, A.S., Sunyaev, S. and Kondrashov, F.A. (2002) DobzhanskyMuller incompatibilities in protein evolution. Proc. Natl Acad. Sci. USA, 99, 1487814883.
[Abstract/Free Full Text] -
Ramensky, V., Bork, P. and Sunyaev, S. (2002) Human non-synonymous SNPs: server and survey. Nucleic Acids Res., 30, 38943900.
[Abstract/Free Full Text] - Eyre-Walker, A. and Keightley, P.D. (1999) High genomic deleterious mutation rates in hominids. Nature, 397, 344347.[CrossRef][Medline]
- McFarland, R., Taylor, R.W., Chinnery, P.F., Howell, N. and Turnbull, D.M. (2004) A novel sporadic mutation in cytochrome c oxidase subunit II as a cause of rhabdomyolysis. Neuromuscul. Disord., 14, 162166.[CrossRef][ISI][Medline]
-
Wheeler, D.L., Barrett, T., Benson, D.A., Bryant, S.H., Canese, K., Church, D.M., DiCuccio, M., Edgar, R., Federhen, S., Helmberg, W. et al. (2005) Database resources of the National Center for Biotechnology Information. Nucleic Acids Res., 33, D39D45.
[Abstract/Free Full Text] -
Thompson, J.D., Higgins, D.G. and Gibson, T.J. (1994) CLUSTALW: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res., 22, 46734680.
[Abstract/Free Full Text] -
Janke, A., Feldmaier-Fuchs, G., Thomas, W.K., von Haeseler, A. and Paabo, S. (1994) The marsupial mitochondrial genome and the evolution of placental mammals. Genetics, 137, 243256.[Abstract]
This article has been cited by other articles:
![]() |
J. Putz, B. Dupuis, M. Sissler, and C. Florentz Mamit-tRNA, a database of mammalian mitochondrial tRNA primary and secondary structures RNA, August 1, 2007; 13(8): 1184 - 1190. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||




