Human Molecular Genetics, 2001, Vol. 10, No. 16 1627-1630
© 2001 Oxford University Press
Peptide-N-glycanases and DNA repair proteins, Xp-C/Rad4, are, respectively, active and inactivated enzymes sharing a common transglutaminase fold
National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
Received March 5, 2001; Revised and Accepted June 12, 2001.
| ABSTRACT |
|---|
|
|
|---|
Yeast RAD4, its human ortholog Xp-C and their orthologs in other eukaryotes are DNA repair proteins which participate in nucleotide excision repair through a ubiquitin-dependent process. However, no conserved globular domains that might have shed light on their origin or functions have been reported for these proteins. By using sequence profile analysis, we show that RAD4/Xp-C proteins contain the ancient transglutaminase fold and are specifically related to the recently characterized peptide-N-glycanases (PNGases) which remove glycans from glycoproteins during their degradation. The PNGases retain the catalytic triad that is typical of this fold and are predicted to have a reaction mechanism similar to that involved in transglutamination. In contrast, the RAD4/Xp-C proteins are predicted to be inactive and are likely to only possess the protein interaction function in DNA repair. These proteins also contain a long, low-complexity insert in the globular transglutaminase domain. The RAD4/Xp-C proteins, along with other inactive transglutaminase-fold proteins, represent a case of functional re-assignment of an ancient domain following the loss of the ancestral enzymatic activity.
| INTRODUCTION |
|---|
|
|
|---|
Eukaryotes have a complex system of nucleotide excision repair (NER) (1,2) that has been the subject of extensive studies, particularly in connection with the inactivation of various components of this system in human diseases such as Xeroderma pigmentosum [XP (3)]. Studies with the yeast model system and with XP complementation groups have led to the identification of a variety of repair enzymes, including DNA helicases and ATPases, such as the ERCC2/3, and nucleases, such as ERCC1/XP-F and XP-G. Additional components of the excision repair system include Xp-C/Rad4, RAD23 and RAD7 that mediate DNAprotein and proteinprotein interactions. Most of these proteins contain recognizable, conserved globular domains which are consistent with the corresponding biochemical activities (4,5). Yeast Rad4 and its human ortholog XP-C play an important role in the recognition of DNA damage and recruitment of the TFIIH complex for excision repair (69), but contain no previously identified domains. Here we show that Rad4/XP-C is an inactive homolog of the recently identified peptide-N-glycanases (PNGases) that are involved in glycoprotein degradation (10) and that these proteins share a core transglutaminase fold.
| RESULTS AND DISCUSSION |
|---|
|
|
|---|
Iterative searches of the non-redundant protein sequence database (National Center for Biotechnology Information, NIH, Bethesda) using the PSI-BLAST program (11) revealed statistically significant sequence similarity between the Xp-C/Rad4 proteins and the PNGases. These searches also showed a statistically significant similarity between the Xp-C/Rad4, PNGases and the proteins of the transglutaminase superfamily (12) (Fig. 1). PFAM search tools that utilize Hidden Markov Models based on alignments from the PFAM database (13) also identified the transglutaminase domain in the yeast Rad4 and PNGases (E-values: 104103), but not in XP-C or other RAD4 orthologs. The presence of the transglutaminase fold in these proteins was further confirmed by carrying out sequencestructure threading using the hybrid fold recognition method (14), with the yeast Rad4 as a query. This resulted in the detection of PDB:1FIE as the best hit. The transglutaminase superfamily includes, in addition to the well-characterized transglutaminases such as the vertebrate clotting-factor XIIIA', several proteases and many uncharacterized proteins that are found in a broad range of prokaryotes and eukaryotes (12). The majority of the proteins of this superfamily are known or predicted to be active enzymes that utilize a catalytic triad comprised of a histidine, a cysteine and an aspartate (Fig. 1) and resembling the active site of papain-like proteases (12,15). The reactions catalyzed by these enzymes involve either the formation of amides by linking alkylamines to the glutamate side chains of proteins or hydrolysis of peptide bonds in the case of proteases (12,15). Thus, the finding of the transglutaminase fold in the PNGases is consistent with the reaction catalyzed by these enzymes that involves breakage of the amide bond between N-acetylglucosamine and an asparagine side chain (10). This is confirmed by the inactivation of the yeast PNGase by a mutation that eliminates the cysteine of the transglutaminase fold catalytic triad (Fig. 1) (10).
|
The RAD4/XP-C proteins are the closest homologs of the PNGases within the transglutaminase superfamily; these two groups of proteins share a unique feature, a conserved C-terminal extension, to the exclusion of all other members of this superfamily (Fig. 1). All animal Rad4/XP-C proteins and one of the paralogs from Schizosaccharomyces pombe contain a large, compositionally biased insert between strands 2 and 3 of the transglutaminase fold (Fig. 1). Examination of the multiple alignment shows that RAD4/XP-C proteins lack the (predicted) catalytic residues, suggesting that these proteins emerged early in the evolution of eukaryotes through a duplication of the PNGase, followed by elimination of the enzymatic activity due to the disruption of the active site triad (Fig. 1). At least two other cases of similar, independent, secondary inactivation of the transglutaminase superfamily proteins following their divergence from active ancestors were noticed, namely the erythrocyte protein-band 4.2 and a highly conserved family of potential cytoskeletal proteins, typified by mouse Ky protein (16) and yeast Cyk3 protein (17) and represented in eukaryotes and cyanobacteria (Fig. 1). Due to the general ability of the ancestral forms of these enzymes to interact with other proteins, some of their inactive descendants probably have been recruited for a non-catalytic interaction function. Notably, the PNGases are required for the early stage of proteasomal degradation of glycoproteins (10), whereas Rad4 has also been shown to interact with the proteasome via the atypical ubiquitin homolog, Rad23 (18,19). Thus, it appears that Rad4/XP-C has evolved from an ancestral N-deglycosylase or protease that could have been involved in proteasome-dependent degradation of chromosomal proteins. With the recruitment of the ubiquitin-dependent machinery for a non-proteolytic function in NER, Rad4 could have been recycled to function as an adaptor in proteinprotein interactions that are required for this form of repair.
The inactive transglutaminase is the only globular domain (20) in the RAD4/Xp-C proteins, which makes this domain a candidate for the role of the determinant of the specific proteinprotein interaction of these proteins. Consistent with this, the portion of RAD4 that encompasses the tranglutaminase domain is required for the interaction with the leucine-rich-repeat containing protein RAD7 (21).
Recruitment of inactivated proteases has been described as one of the evolutionary sources of eukaryotic transcription factors (22). The present observations indicate that evolutionary exaptation (recycling of proteins that have lost their original activity) of the transglutaminase fold for proteinprotein interaction has occurred on several independent occasions in contexts as different as NER complex and cytoskeletal organization.
| MATERIALS AND METHODS |
|---|
|
|
|---|
The non-redundant database of protein sequences (National Center for Biotechnology Information, NIH, Bethesda, MD) was searched using the BLASTP program (11). Profile searches were conducted using the PSI-BLAST program with either a single sequence or an alignment used as the query, with a profile inclusion expectation (E) value threshold of 0.01, and were iterated until convergence (11,23). Previously known, conserved protein domains were detected using the corresponding position-specific scoring matrices, which were constructed using PSI-BLAST (24). Multiple alignments of protein sequences were constructed using the ClustalW program (25) and protein secondary structure was predicted using a multiple alignment as the input for the PHD program (26). Structure visualization was done with Swiss PDB Viewer (27). Sequencestructure threading was performed using the hybrid fold prediction method that combines multiple alignment information with secondary structure prediction (14).
| FOOTNOTES |
|---|
+ To whom correspondence should be addressed. Tel: +1 301 594 2445; Fax: +1 301 480 9241; Email aravind@ncbi.nlm.nih.gov
| REFERENCES |
|---|
|
|
|---|
1 Prakash, S. and Prakash, L. (2000) Nucleotide excision repair in yeast. Mutat. Res., 451, 1324.[ISI][Medline]
2 Batty, D.P. and Wood, R.D. (2000) Damage recognition in nucleotide excision repair of DNA. Gene, 241, 193204.[ISI][Medline]
3 Cleaver, J.E., Thompson, L.H., Richardson, A.S. and States, J.C. (1999) A summary of mutations in the UV-sensitive disorders: xeroderma pigmentosum, Cockayne syndrome, and trichothiodystrophy. Hum. Mutat., 14, 922.[ISI][Medline]
4 Aravind, L., Walker, D.R. and Koonin, E.V. (1999) Conserved domains in DNA repair proteins and evolution of repair systems. Nucleic Acids Res., 27, 12231242.
5 Eisen, J.A. and Hanawalt, P.C. (1999) A phylogenomic study of DNA repair genes, proteins, and processes. Mutat. Res., 435, 171213.[ISI][Medline]
6 Yokoi, M., Masutani, C., Maekawa, T., Sugasawa, K., Ohkuma, Y. and Hanaoka, F. (2000) The xeroderma pigmentosum group C protein complex XPC-HR23B plays an important role in the recruitment of transcription factor IIH to damaged DNA. J. Biol. Chem., 275, 98709875.
7 Jansen, L.E., Verhage, R.A. and Brouwer, J. (1998) Preferential binding of yeast Rad4.Rad23 complex to damaged DNA. J. Biol. Chem., 273, 3311133114.
8 Guzder, S.N., Sung, P., Prakash, L. and Prakash, S. (1998) Affinity of yeast nucleotide excision repair factor 2, consisting of the Rad4 and Rad23 proteins, for ultraviolet damaged DNA. J. Biol. Chem., 273, 3154131546.
9 Wang, Z., Wei, S., Reed, S.H., Wu, X., Svejstrup, J.Q., Feaver, W.J., Kornberg, R.D. and Friedberg, E.C. (1997) The RAD7, RAD16, and RAD23 genes of Saccharomyces cerevisiae: requirement for transcription-independent nucleotide excision repair in vitro and interactions between the gene products. Mol. Cell Biol., 17, 635643.[Abstract]
10 Suzuki, T., Park, H., Hollingsworth, N.M., Sternglanz, R. and Lennarz, W.J. (2000) PNG1, a yeast gene encoding a highly conserved peptide:N-glycanase. J. Cell Biol., 149, 10391052.
11 Altschul, S.F., Madden, T.L., Schaffer, A.A., Zhang, J., Zhang, Z., Miller, W. and Lipman, D.J. (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res., 25, 33893402.
12 Makarova, K.S., Aravind, L. and Koonin, E.V. (1999) A superfamily of archaeal, bacterial, and eukaryotic proteins homologous to animal transglutaminases. Protein Sci., 8, 17141719.[Abstract]
13 Bateman, A., Birney, E., Durbin, R., Eddy, S.R., Howe, K.L. and Sonnhammer, E.L. (2000) The Pfam protein families database. Nucleic Acids Res., 28, 263266.
14 Fischer, D. (2000) Hybrid fold recognition: combining sequence derived properties with evolutionary information. Pac. Symp. Biocomput., 119130.
15 Pedersen, L.C., Yee, V.C., Bishop, P.D., Le Trong, I., Teller, D.C. and Stenkamp, R.E. (1994) Transglutaminase factor XIII uses proteinase-like catalytic triad to crosslink macromolecules. Protein Sci., 3, 11311135.[Abstract]
16 Blanco, G., Coulton, G.R., Biggin, A., Grainge, C., Moss, J., Barrett, M., Berquin, A., Marechal, G., Skynner, M., van Mier, P. et al. (2001) The kyphoscoliosis (ky) mouse is deficient in hypertrophic responses and is caused by a mutation in a novel muscle-specific protein. Hum. Mol. Genet., 10, 916.
17 Korinek, W.S., Bi, E., Epp, J.A., Wang, L., Ho, J. and Chant, J. (2000) Cyk3, a novel SH3-domain protein, affects cytokinesis in yeast. Curr. Biol., 10, 947950.[ISI][Medline]
18 Masutani, C., Sugasawa, K., Yanagisawa, J., Sonoyama, T., Ui, M., Enomoto, T., Takio, K., Tanaka, K., van der Spek, P.J., Bootsma, D. et al. (1994) Purification and cloning of a nucleotide excision repair complex involving the xeroderma pigmentosum group C protein and a human homologue of yeast RAD23. EMBO J., 13, 18311843.[ISI][Medline]
19 Schauber, C., Chen, L., Tongaonkar, P., Vega, I., Lambertson, D., Potts, W. and Madura, K. (1998) Rad23 links DNA repair to the ubiquitin/proteasome pathway. Nature, 391, 715718.[Medline]
20 Wootton, J.C. (1994) Non-globular domains in protein sequences: automated segmentation using complexity measures. Comput. Chem., 18, 269285.[ISI][Medline]
21 Wei, S. and Friedberg, E.C. (1998) A fragment of the yeast DNA repair protein Rad4 confers toxicity to E. coli and is required for its interaction with Rad7 protein. Mutat. Res., 400, 127133.[ISI][Medline]
22 Aravind, L. and Koonin, E.V. (1998) Eukaryotic transcription regulators derive from ancient enzymatic domains. Curr. Biol., 8, R111R113.[ISI][Medline]
23 Aravind, L. and Koonin, E.V. (1999) Gleaning non-trivial structural, functional and evolutionary information about proteins by iterative database searches. J. Mol. Biol., 287, 10231040.[ISI][Medline]
24 Schaffer, A.A., Wolf, Y.I., Ponting, C.P., Koonin, E.V., Aravind, L. and Altschul, S.F. (1999) IMPALA: matching a protein sequence against a collection of PSI-BLAST-constructed position-specific score matrices. Bioinformatics, 15, 10001011.
25 Thompson, J.D., Higgins, D.G. and Gibson, T.J. (1994) CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res., 22, 46734680.
26 Rost, B. and Sander, C. (1993) Prediction of protein secondary structure at better than 70% accuracy. J. Mol. Biol., 232, 584599.[ISI][Medline]
27 Guex, N. and Peitsch, M.C. (1997) SWISS-MODEL and the Swiss-PdbViewer: an environment for comparative protein modeling. Electrophoresis, 18, 27142723.[ISI][Medline]
28 Chavanne, F., Broughton, B.C., Pietra, D., Nardo, T., Browitt, A., Lehmann, A.R. and Stefanini, M. (2000) Mutations in the XPC gene in families with xeroderma pigmentosum and consequences at the cell, protein, and transcript levels. Cancer Res., 60, 19741982.
29 Li, L., Bales, E.S., Peterson, C.A. and Legerski, R.J. (1993) Characterization of molecular defects in xeroderma pigmentosum group C. Nat. Genet., 5, 413417.[ISI][Medline]
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
G. Zhao, X. Zhou, L. Wang, G. Li, C. Kisker, W. J. Lennarz, and H. Schindelin Structure of the Mouse Peptide N-Glycanase-HR23 Complex Suggests Co-evolution of the Endoplasmic Reticulum-associated Degradation and DNA Repair Pathways J. Biol. Chem., May 12, 2006; 281(19): 13751 - 13761. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Beatham, R. Romero, S. K.M. Townsend, T. Hacker, P. F.M. van der Ven, and G. Blanco Filamin C interacts with the muscular dystrophy KY protein and is abnormally distributed in mouse KY deficient muscle fibres Hum. Mol. Genet., November 15, 2004; 13(22): 2863 - 2874. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. SUZUKI, H. PARK, and W. J. LENNARZ Cytoplasmic peptide:N-glycanase (PNGase) in eukaryotic cells: occurrence, primary structure, and potential functions FASEB J, May 1, 2002; 16(7): 635 - 641. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Katiyar, T. Suzuki, B. J. Balgobin, and W. J. Lennarz Site-directed Mutagenesis Study of Yeast Peptide: N-Glycanase. INSIGHT INTO THE REACTION MECHANISM OF DEGLYCOSYLATION J. Biol. Chem., April 5, 2002; 277(15): 12953 - 12959. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||



