gm: .4: . . ‘su1. .. a 4. V: I... new. £2.33. . n.“ amwmflfimmmiy gum? ”(1‘4qiun‘1vqt—N , _ a ‘ H . r. .2 WP; Mafia“; .. 1. r Q a ”rd. nWwiwttfi Epwf. sand.“ 1 q . 23W V . El, 06 ‘ a t. a? dirt). 0% .. 133. ii . . .M. .3 ‘ z. k a...¢§.w..2v§: 1.1. .5; 3.. o. ‘JI i nxhfiflma fifipf i 1 33.714 (Iv... 531‘: (2.3.... 2!. :“v 1 )3. fight}. 3131.1 1. .‘\»..a.. a3; a? This is to certify that the dissertation entitled STRUCTURAL AND FUNCTIONAL STUDIES OF THE ENZYMES INVOLVED IN A BACTERIAL GDP-D-RHAMNOSE BIOSYNTHETIC PATHWAY presented by NICOLE A. WEBB has been accepted towards fulfillment of the requirements for the Ph.D. degree in Biochemistry and Molecular Biology ZZZZJZ/Zv\ Major Professor’s Signature /-:Z ' Z? —- (75" Date MSU is an Affirmative Action/Equal Opportunity Institution LIBRARY Michigan State University 44-—.-o-n_‘-.-I--d-o-u-n-u-o-n—b— PLACE IN RETURN BOX to remove this checkout from your record. To AVOID FINES return on or before date due. MAY BE RECALLED with earlier due date if requested. DATE DUE DATE DUE DATE DUE 2/05 p:/ClRC/DateDue.indd-p.1 STRUCTURAL AND FUNCTIONAL STUDIES OF THE ENZYMES INVOLVED IN A BACTERIAL GDP-D-RHAMNOSE BIOSYNTHETIC PATHWAY By Nicole A. Webb A DISSERTATION Submitted to Michigan State University in partial fulfillment of the requirements , for the degree of DOCTOR OF PHILOSOPHY Department of Biochemistry and Molecular Biology 2005 ABSTRACT STRUCTURAL AND FUNCTIONAL STUDIES OF THE ENZYMES INVOLVED IN A BACTERIAL GDP-D-RHAMNOSE BIOSYNTHETIC PATHWAY By Nicole A. Webb D-rhamnose is a rare 6—deoxy sugar found primarily in the lipopolysaccharide molecules of pathogenic gram-negative bacteria, where it is involved in host-bacterium interactions and the establishment of infection. The biosynthetic pathway of D—rhamnose may be a potential therapeutic target, as it is expected that inhibitors of the enzymes would not affect human metabolic pathways. The biosynthesis of D-rhamnose proceeds through the conversion of GDP-D-mannose by GDP-D-mannose 4,6-dehydratase (GMD) to GDP-4—keto-6-de0xy-D-mannose, which is subsequently reduced to GDP-D-rhamnose by the reductase (RMD). This study focuses on the structural and enzymatic characterization of the two enzymes. The X-ray crystal structure of GMD from Pseudomonas aeruginosa has been determined in complex with NADPH and GDP. GMD belongs to the NDP-sugar modifying subfamily of the short-chain dehydrogenase/reductase (SDR) enzymes, all of which exhibit bidomain structures and a conserved catalytic triad (Ser/T hr, Tyr-XXX- Lys). Although most members of this enzyme subfamily display homodimeric structures, this bacterial GMD forms a tetramer in the same fashion as the plant Arabidopsis thaliana GMD isoform, MURI. Based on the conservation of subunit interactions and sequences in GMDS, evidence suggests that the tetrameric form of this enzyme is the preferred, and perhaps functionally relevant. oligomeric state for most bacterial and eukaryotic GMDs. Initial X-ray analysis has been completed on RMD from P. aeruginosa as well as a homolog from the bacterium Aneurinibacillus thermoaerophilus. While crystals of RMD from P. aeruginosa diffracted to 3.7 A, crystals of RMD from A. thermoaerophilus diffracted to 1.8 A. Complete data sets were collected on crystals from both species. The P. aeruginosa RMD crystal belongs to the tetragonal space group P422 (or higher symmetry) (a = b = 182, c = 250 A), with 12 molecules in the asymmetric unit. The A. thermoaerophilus RMD crystal belongs to the triclinic space group P1 (a = 47, b = 56. c = 79 A, or = 72, [3 = 83, y = 76°), with 2 molecules in the asymmetric unit. Amino acid sequence homology details indicate that RMD is also a member of the SDR family, whose structures are quite similar. Therefore, full structure determination by the molecular replacement method using the coordinates from the most closely related SDR protein, GMD from P. aeruginosa. is in progress. As no structure is currently available for RMDs, crystallographic analysis of RMD should provide detailed information on the active site of the enzyme and facilitate structure-based inhibitor design. Finally, using a capillary-electrophoresis—based enzymatic assay, we have shown that recombinant GMD and RMD from P. aeruginosa are able to convert GDP-D-mannose to GDP-D-rhamnose, continuing the product structure by NMR analysis. Additionally, we present a method for the enzymatic synthesis of GDP-D-rhamnose, an important glycobiological building block not commercially available. Synthesis of GDP-D- rhamnose is a crucial prerequisite for (l) the identification and biochemical studies of corresponding glycosyltransferases, as well as (2) the determination of how this deoxysugar, as a component of cell wall glycoconjugates, contributes to the virulence of plant and human pathogens. for my husband, who has inspired me to achieve more than I ever dreamed possible for my daughter whose life has provided new meaning and purpose to mine iv The proper rewards are not simply tacked on to the activity/or which they are given. but are the activity itself in consummation. -C.S. Lewis. The Weight of Glory VI ACKNOWLEDGMENTS One of the pleasures of finally finishing my graduate program is the opportunity to show my appreciation to those who were influential in my career as a graduate student. I am grateful to many people in the academic community at Michigan State University. I would first like to express my gratitude to my advisor and mentor, Dr. R. Michael Garavito, whose guidance and encouragement provided me with an opportunity to grow as a scientist. My past and current colleagues have imparted much needed support: Ms. Amy Sharmen, Ms. Karen Poster-Verrill and Mr. Micheal Dumond shared with me their extensive knowledge of lab techniques, Dr. Anne Mulichak helped walk me through solving my first crystal structure, Dr. Ling Qin was always a great source of assistance in collecting data, and Dr. Christine Harman and Dr. Rachel Powers were an invaluable source of support both academically as well as personally. I also wish to thank my committee members, Dr. Christoph Benning, Dr. James Geiger, Dr. Jack Priess and Dr. John Wang, who provided a great deal of insight into my project. I could also not due without the kindness and computer-savvy of Dr. Kaillathe “Pappan” Padmanabhan. There are several members of the scientific community outside of Michigan State University I would also like to acknowledge. My collaborator on the functional studies was Dr. Karen Poon, a member of the lab of Dr. Joseph Lam, Department of Molecular and Cellular Biology. University of Guelph, Guelph, Ontario, Canada. Furthermore, the staff at the Advanced Photon Source, Argonne National Laboratory assisted in collecting data and data processing. Specifically I would like to thank Dr. Stephan Ginell at the vi Structural Biology Center Collaborative Access Team and Dr. Zdzislaw Wawrzak at the DuPont-Northwestern-Dow Collaborative Access Team. I am blessed with a supportive family. I would like to thank those members who guided my path to graduate school: my father— and mother-in-law, Dr. and Mrs. Charles and Philippa Webb for planting the idea and my parents, Mr. and Mrs. Philip and Beverley Cowan for their encouragement to attend. My parents have instilled in me the importance of maintaining a high level of integrity and pressed me to use my God-given talents to the best of my abilities. My parents-in—law have shown confidence in me even when I have had none. My siblings and their families have been understanding of the commitment level of the path I have chosen as well as the distance it has taken me, which has often led to my absence in family activities. Finally I am most grateful to my core family. When I first began at MSU, we were a family of two. Now we are three. My daughter Elizabeth, or as we call her “Bizzy”, has brought a new and unexpected level of joy to my life. Her sense of exploration and experimentation warms a scientist’s heart. Ultimately, I would like to express my appreciation for the friendship, encouragement and advice of my husband, David. I know very few people as intelligent, logical, decisive. and devoted. His unique outlook on life has not only influenced my graduate career but also my world-view, and I consider it a privilege that he accepted me as his wife. vii TABLE OF CONTENTS DEDICATION ........................................................................................................... iv ACKNOWLEDGMENTS .......................................................................................... vi TABLE OF CONTENTS ......................................................................................... viii LIST OF TABLES ...................................................................................................... x LIST OF FIGURES........ ............................................................................................ xi LIST OF ABBREVIATIONS .................................................................................. xiii INTRODUCTION Deoxysugars and their biosynthesis ......................................................................... 1 Prevalence of deoxyhexoses ................................................................................. 1 GDP-deoxyhexoses biosynthetic pathways .......................................................... 5 dTDP-deoxyhexoses biosynthetic pathways ......................................................... 6 CDP-deoxyhexoses biosynthetic pathways ........................................................... 7 Structural biology of the deoxyhexose pathway enzymes ..................................... 8 GDP-D-rhamnose biosynthesis .............................................................................. 11 Rhamnose in P. aeruginosa LPS ......................................................................... 1 1 GDP-D—rhamnose biosynthetic pathway ............................................................. 13 CHAPTER 1: CRYSTAL STRUCTURE OF GMD 1.1. Introduction .................................................................................................... 20 1.2. Experimental Procedures ................................................................................. 21 1.3. Results and Discussion .................................................................................... 25 CHAPTER 2: CRYSTALLIZATION AND INITIAL X-RAY ANALYSIS OF RMD 2.1 . Introduction .................................................................................................... 47 2.2. Experimental Procedures ................................................................................. 48 2.3. Results and Discussion .................................................................................... 52 CHAPTER 3: FUNTIONAL CHARACTERIZATION OF GMD AND RMD 3.1. Introduction .................................................................................................... 61 3.2. Experimental Procedures ................................................................................. 62 3.3. Results ............................................................................................................ 64 3.4. Discussion ...................................................................................................... 73 CHAPTER 4: PROTEIN ENGINEERING WITH THE GOAL OF IMPROVING PROTEIN EXPRESSION, PURIFICATION AND/OR CRYSTALLIZATION 4. 1 . Introduction .................................................................................................... 77 4.2. Experimental Procedures ................................................................................. 79 viii 4.3. Results and Discussion .................................................................................... 81 CHAPTER 5: FUTURE DIRECTIONS ..................................................................... 95 APPENDIX A: MBP-HuPLSCRl AND THE Abl-SH3 DOMAIN A]. Introduction ................................................................................................. 100 A2. Experimental Procedures .............................................................................. 101 A3. Results ......................................................................................................... 102 LITERATURE CITED ............................................................................................ 107 ix LIST OF TABLES Table 1. X-ray crystal structures of NDP-sugar modifying enzymes involved in 6- and 3,6-di-deoxyhexose pathways .................................................................................. 9 Table 2. GMD X-ray diffraction data and refinement statistics ..................................... 24 Table 3. RMD X-ray diffraction data ............................................................................. 51 Table 4. Gel filtration results of GMD and RMD ........................................................... 66 Table 5. NMR analysis of GDP-D-rhamnose ................................................................. 73 Table 6. Amino acid sequence of the linker region in pMal-C2 and pMal-C 2L3 ............ 84 LIST OF FIGURES Figure 1. Biosynthetic pathways for 6- and 3,6-di-deoxyhexoses ..................................... 3 Figure 2. Structures of 6- and 3,6-di-deoxyhexoses .......................................................... 4 Figure 3. SDR model of catalysis .................................................................................. 10 Figure 4. Lipopolysaccharide of P. aeruginosa .............................................................. 12 Figure 5. Biosynthetic pathway of GDP-D-rhamnose .................................................... 14 Figure 6. The alignment of GMD and RMD amino acid sequences from various species. .............................................................................................................................. 17 Figure 7. SDS-PAGE of samples from a typical GMD purification ................................ 25 Figure 8. Crystals of GMD ............................................................................................ 26 Figure 9. Overall Structure of GMD ............................................................................. 27 Figure 10. Electron density at the GMD active site ........................................................ 30 Figure 11. GMD active site interactions ......................................................................... 31 Figure 12. The mechanism proposed for GDP-D-mannose 4,6 dehydratase ................... 36 Figure 13. The alignment of GMD amino acid sequences from various species ............. 37 Figure 14. RR loop of GMD and MUR] ........................................................................ 44 Figure 15. SDS-PAGE of samples from a typical PaRMD purification .......................... 52 Figure 16. SDS-PAGE of samples from a typical AtRMD purification .......................... 53 Figure 17. Crystals of PaRMD ....................................................................................... 54 Figure 18. Crystals of AtRMD ....................................................................................... 55 Figure 19. Gel filtration analysis of GMD and RMD ..................................................... 65 Figure 20. CE analysis of GMD dehydratase activity from pH 5 to 10 ........................... 67 Figure 21. CE analysis of GMD/RMD reactions ............................................................ 68 xi Figure 22. CE analysis of the time—dependent GMD conversion of the keto-intermediate in the presence of NADPH ..................................................................................... 69 Figure 23. HPLC purification of the GMD/RMD reaction product followed by CE analysis .................................................................................................................. 71 Figure 24. NMR spectra for GDP-D-rhamnose. ............................................................. 72 Figure 25. SDS-PAGE of samples from a MBP-RMD purification and Factor Xa cleavage ................................................................................................................. 83 Figure 26. Crystal structures of MBP and MBP-fusion proteins ..................................... 87 Figure 27. Model of MBP-8H3 binding domain sitezAbl-SH3 domain complex. ........... 88 Figure 28. SDS-PAGE of samples from a typical MBP-8H3 and MBP-SH3-RMD purification ............................................................................................................ 89 Figure 29. SDS-PAGE of samples from a typical Abl-SH3 domain purification ............ 90 Figure 30. Gel filtration analysis of MBP-SH3-RMDzAbl—SH3 domain ......................... 91 Figure 31. Crystals of MBP-SH3-RMDzAbl-SH3 .......................................................... 92 Figure 32. SDS-PAGE of samples from a typical MBP-HuPLSCRl purification ......... 102 Figure 33. SDS-PAGE of MBP-HuPLSCRl/Abl-SH3 domain binding ....................... 104 xii ADP AGME CDP CF Ered FPLC GalE GDP GFS GMD GMER GTS HEPES HPLC IPTG LB LPS MES MPD MURI NAD(H) NADP(H) NDP Ni-NTA NMR LIST OF ABBREVIATIONS adenine diphosphate ADP-L-glycero-D-mannoheptose 6‘-epimerase cytidine diphosphate capillary electrophoresis cystic fibrosis deoxythymidine diphosphate CDP-6-deoxy-L-threo-D-glycero-4-hexulose-3-dehydrase E. reductase CDP-3,6-dideoxy-D-glycero-D-glycero-4-hexulose-5-epimerase CDP-D-glucose 4,6-dehydratase glucose-1 -phosphate cytidylyltransferase CDP-3,6-dide0xy-D-glycero-L-glycero-4-hexulose-4-reductase fast protein liquid chromatography UDP-galactose epimerase guanosine diphosphate GDP-fucose synthase GDP-D-mannose 4,6-dehydratase GDP-4-keto-6-deoxy-D-mannose 3,5-epimerase/reductase GDP-6-deoxy-D-talose synthetase 4-(2-hydroxyethyl)-1-piperazine-ethanesulfonic acid high-pressure/performance liquid chromatography isopropyl-beta-D-thyogalactopyranoside Luria broth lipopolysaccharide 4-morpholineethanesulfonic acid 2-methyl-2,4-pentanediol GMD isoform from Arabidopsis thaliana nicotinamide-adenine dinucleotide nicotinamide-adenine dinucleotide diphosphate nucleoside disphosphate nickel nitrilo-triacetic acid nuclear magnetic resonance xiii OD PCR PDB PEG PEP PL PLSCRI RMD leA leB/dTGDH leC leD RMSD SDR SDS-PAGE 8H3 SQDI T1 1 Tris UDP UV optical density polymerase chain reaction protein data bank polyethylene glycol pentaerythritol propoxylate phospholipid phospholipid scramblase 1 GDP-4-keto-6-deoxy-D-mannose reductase glucose- 1 -phosphate thymidylyltransferase dTDP-D-glucose 4,6-dehydratase dTDP—4-keto-6-deoxy-D-glucose 3,5-epimerase dTDP-4-keto-6-deoxy-L-mannose reductase root mean square deviation short chain dehydrogenase/reductase sodium dodecylsulfate polyacrylamide gel electrophoresis Src homology 3 UDP-sulfoquinovose synthase dTDP-4-keto—6-deoxy-L-mannose reductase tris(hydroxymethyl)aminomethane uridine diphosphate ultraviolet xiv INTRODUCTION Deoxysugars and their biosynthesis Prevalence ofdeoxyhexoses Lipopolysaccharide (LPS) molecules are major components of the outer membrane of the cell walls of gram-negative bacteria and are intimately involved in host-bacterium interactions and the viability of the pathogen. The permeability restrictions imposed by the outer membrane contribute to the intrinsic resistance of gram-negative bacteria to antibiotics. The LPS molecules, in particular, play a role in outer membrane stability and impede the destruction of bacterial cells by serum components and phagocytic cells. In addition, the LPS molecules may be involved in colonization or antigenic shifts that determine the course and outcome of the infection. LPS molecules share a common tripartite structure of a lipid A region, a core oligosaccharide region and an O-antigen region. The O-antigen, which is the most exposed region, consists of repeating units of varying monosaccharides. O-antigen can be a linear or branched homOpolymer or heteropolymer, and can be strain-specific or vary within a strain. A tremendous amount of effort has been exerted to determine the exact chemical structures of O—antigen due to its contribution to bacterial virulence. In general, the structures consist of neutral sugars, amino sugars, sugar acids and many different deoxysugars. More recently. studies have expanded to include the enzymology involved in the biosynthesis and assembly of O-antigen. Specifically, the biosynthesis of 6- deoxyhexoses has received heightened interest after a few of them have shown to be substantial LPS O-antigen components of several human pathogens. Humans lack metabolism for many deoxyhexoses found in bacterial cell walls and would most likely remain unaffected by inhibitors of bacterial cell wall metabolism. Since many of the currently available antibiotics target the same cellular process or even the same target enzyme, multiple-drug resistance of pathogens has become a serious problem. placing a high demand on new therapeutic targets and/or strategies. The biosynthetic pathways of 6-deoxyhexoses may be targets for novel antibacterial therapeutics. 6-deoxyhexoses are formed from a common monosaccharide by replacing the 6— hydroxyl group with a hydrogen atom, which can dramatically affect its biological function. The committed step in the 6-deoxyhexose biosynthetic pathway is the conversion of the precursor, the nucleotide diphosphate (NDP)-activated hexose, to an NDP—activated 4-keto-6-deoxyhexose by an NDP-sugar 4,6-dehydratase. The resulting NDP-4—-keto-6~deoxyhexose intermediate serves as a branching point for D- and L- deoxysugar pathways. The intermediate may undergo subsequent steps such as epimerization, reduction or methylation to produce mono-, di-, tri- and tetradeoxysugars and branched-chain deoxysugars. More than one nucleotidyl sugar can be associated with a given deoxyhexose. For instance, both dTDP—D-glucose and UDP-D-glucose serve as precursors for L-rhamnose. However, it appears that only GDP-D-mannose serves as a precursor for D-rhamnose. In addition to being present as constituents of bacterial LPS molecules, deoxyhexoses can also be found in macrolide antibiotics. The general pathways to several 6- and 3,6-di-deoxyhexoses are shown in Figure 1. The structures of those deoxyhexoses are shown in Figure 2. Glucose GLK Glucose-I-P "‘—— Glucose-S-P —’ Glucose-1-P Ep 1 PGI 1 leA l CDP-D-glucose Fructose-S-P dTDP-D-glucose Eodl PMI 1 leBl CDP-4-keto-6- Mannose-6-P dTDP4-keto-6— deow-D-glucose deoxy-D-glucose £1,531 PMM l leC 1 fed CDP-3,6-di- Mannose-1 -P dTDP-4-keto-6- deoxy-D-glucose deoxy- L-mannose abe GDP leD / rms\Eep, Ered l dTDP-D- GDP- D-mannose fucose CDP-D- CDP-L- abequose ascarylose GMD l dTDP L' dTDP-S-deoxy- rhamnose L-talose CDP-D-paratose GDP-4— keto-6- deox -D-mannose l M y CDP-D- GMER/ RMD GTS \ tyvelose GDP-6-d GDP-L-fucose eoxy- D-talose GDP-D-rhamnose Figure l. Biosynthetic pathways for 6- and 3,6-di-deoxyhexoses GLK, glucokinase, PGI, phosphoisomerase, PMI, phosphomannoisomerase, PMM, phosphomannomutase, GDP, manno-l-phosphate guanylyltransferase, GMD, GDP-D- mannose 4,6-dehydratase, GMER, GDP-4-keto-6-deoxy-D-mannose 3,5- epimerase/reductase, RMD, GDP-4-keto-6-deoxy-D-mannose reductase, GTS, GDP-6- deoxy-D-talose synthetase, PGM, phosphoglycomutase, leA, glucose-l-phosphate thymidylyltransferase, leB, dTDP-D-glucose 4,6-dehydratase, leC, dTDP-4-keto-6- deoxy-D-glucose 3,5-epimerase, leD, dTDP-4-keto-6-deoxy-L-mannose reductase, T11, dTDP-4-keto-6-deoxy-L-mannose reductase, gene product of fed, dTDP-4-keto-6- deoxy-D-glucose reductase, Ep, glucose-l-phosphate cytidylyltransferase, E01), CDP-D- glucose 4,6-dehydratase, El, CDP-6-deoxy-L-threo-D-glycero-4-hexulose-3~dehydrase, E3, E. reductase, Ecp, CDP-3,6-dideoxy-D-glycero-D-glycero-4-hexulose-S-epimerase, Ema, CDP-3,6-dideoxy-D-glycero-L-glycero-4-hexulose-4-reductase, gene product of r/bS, CDP-D-paratose synthase, GDP, guanosine diphosphate, dTDP, deoxythymidine diphosphate, CDP, cytidine diphosphate. OH Me O OH OCDP C BED-abequose Me O HO OH OCDP C DP-D-paratose e M 0g mm OCDP CDP-D-tylevose Me 0 OCDP HO OH C DP—l.-ascary0se Me 0 OGDP OH HO OH G DP-l .-fucose Me OH O HO / HO OGDP GDP—D-rhamnose OH Me OH O HO OGDP G DP—b—(leoxy-D—talose Figure 2. Structures of 6- and 3,6-di—deoxyhexoses OH Me O HO OdTDP OH dTDP-DTucose Me 0 OdTDP HO HO OH dTDP—l.—rhamnose Me 0 OdTDP OH HO OH dTDP-6-de0xy-1.-talose GDP-deox)thexoses biosynthetic pc‘ithways The 6-deoxyhexoses originating from the GDP-D-mannose precursor whose pathways have been studied include GDP-D-rhamnose, GDP-L-fucose and GDP-6-deoxy—D-talose (Figure 1, middle). The enzyme GDP-D-mannose 4,6-dehydratase (GMD) is responsible for catalyzing the conversion of GDP-D-mannose to GDP-4-keto-6-deoxy-D-mannose, which serves as the common intermediate to these 6-deoxyhexoses. Enzymes possessing GMD activity have been identified in bacteria [1-4|, plants [SI and animals [6, 7]. For GDP—D-rhamnose synthesis, a 4-reductase (RMD) that targets the 4—keto group of the intermediate has been identified in Pseudomonas aeruginosa [8] and Aneurinibacillus thermoaerophilus [3, 8|. D-rhamnose is found mainly as a component of the LPS of gram—negative bacteria, such as the plant pathogen Xanthomonas campestris [9| and the human pathogens Helicobacter pylori [10] and P. aeruginosa [l l l. The 3,5-epimerase/4- reductase (GMER) involved in the GDP-L-fucose pathway has been identified in bacteria [12], plants [13] and animals [14]. L-Fucose is a substantial cell wall component of bacteria, affects nodulation in rhizobial organisms [15. 16[ and is important in stem development and strength in plants [5]. In humans, L-fucose is present in the human ABO blood group antigens and Lewis (Le) glycans, playing a role in inflammation and immune response [17]. The enzyme involved in the GDP-6-deoxy-D-talose pathway has been identified in bacteria, where 6-deoxy-D-talose is a component of LPS [18, 19]. The GDP—4-keto-6—deoxy-D-mannose reductase (GTS) involved in GDP-6-deoxy-D-talose . biosynthesis has been characterized in Actinobacillus aetinomyeetemcomitans [18, 20]. In addition to 6—deoxyhexoses, GDP-4-keto-6-deoxy-D-mannose can also be converted to the dideoxy amino sugar GDP-D-perosamine by GDP-D-perosamine synthetase (beE), which has been identified in Vibrio clzolerae [ l9] dTDP-deoxyhexoses biosynthetic pathways The precursor dTDP-D-glucose leads to the following 6-deoxyhexoses: dTDP-L- rhamnose, dTDP—6—deoxy-L-talose and dTDP-D-fucose (Figure 1, right). Similar to the GDP-D-mannose pathways, these deoxyhexoses are synthesized from dTDP—D-glucose via dTDP-4-keto—6—deoxy-D-glucose with the assistance of dTDP—D-glucose 4,6- dehydratase (leB). The 4—keto-6—deoxyhexose again serves as a common intermediate in the pathways to these deoxyhexoses. The enzymes involved in the biosynthesis of dTDP-L—rhamnose have been studied extensively from Salmonella enterica [21—26] and Escherichia coli [27-32]. The catalytic action of leB is related to GMD, dTDP-4—keto- 6-deoxy-D-glucose 3,5-epimerase (leC) performs a double epimerization at position C3 and C5 like GMER, and the direct analog to dTDP-4-keto-6-deoxy-L-mannose reductase (leD) is RMD. L-rhamnose is a known component of cell walls of several pathogens such as P. aeruginosa, Mvcobacterium tuberculosis, V. cholerae, Enterococcus feacalis and Streptococcus mutans as reviewed in Maki et al. [33]. RmIB-C are also involved in the pathway leading to 6-deoxy—I--talose. The remaining enzyme in the pathway is a dTDP-4—keto-6-deoxy-L-mannose reductase like RmID; however, it provides the stereoselectivity to reduce the intermediate to dTDP-6—deoxy-L—talose [34]. The gene for dTDP—6-deoxy-I.-talose biosynthesis (product of the gene Ill) and its corresponding protein has been identified in A. actinomyceremcomirons, one of the few species of bacteria where 6-deoxy—L-talose is a component of the cell wall [34]. Finally, in dTDP- D-fucose synthesis, the dTDP—4-keto-6—deoxy-D-glucose reductase (product of the gene fcd) catalyzes the reduction of intermediate [35]. D-fucose is located in the cells and capsule structures of a limited number of gram-negative and gram-positive bacteria. CDP-deoxyhexoses biosynthetic pathways The deoxyhexoses originating from the precursor CDP-D-glucose shown in Figure I (left) are actually 3,6-dideoxyhexoses. Since 6-deoxyhexoses serve as the universal precursors for higher reduced di— and tri-deoxyhexoses, the initial enzymatic steps are the same. The enzymes involved in the pathway leading to L-ascarylose have been well studied from the bacteria Yersinia pseudotuberculosis [36]. Again the committed step is the conversion of CDP-D-glucose to the 4-keto-6-deoxy intermediate by the CDP-D- glucose 4,6-dehydratase (E01,). The 4—keto-6—deoxyhexose is converted to the 3,6- dideoxyhexose by sequential activities of CDP-6-deoxy-L-threo-D-glycero-4—hexulose-3- dehydrase (El) and El reductase (E3). The 3,6-dideoxyhexose can undergo a C5 epimerization catalyzed by GDP-3,6-dideoxy-D-glycero-D-glycero-4—hexulose-5- epimerase (Ecp) and a C4 reduction catalyzed by CDP-3,6—dideoxy-D-glycero—L-glycero- 4—hexulose-4-reductase (Emd) to yield CPD-I.—ascarylose. The 3,6-dideoxyhexose can also undergo other subsequent epimerization and/or stereospecific reductions to yield paratose, abequose and tyvelose. Two reductases, CDP-D-paratose synthase (encoded by the rbe gene) from Salmonella typhi [37] and S. enterica [38] and CDP-D-abequose synthase (encoded by the abe gene) from S. enterica [39] have been identified. Finally, tyvelose epimerase, which is involved in the last step of the conversion of CDP-D- paratose to CDRD-tyvelose, has been isolated from S. typhi and its structure has been reported [40]. Each of these 3,6—dideoxyhexoses is known to be components of cell walls and capsules of gram-negative and gram—positive bacteria. Structural biology of'the deoxyhexose pathway enzymes Of the enzymes involved in the deoxyhexose pathways mentioned above, several X- ray crystal structures are available (Table 1). Interestingly, despite functional diversity and low sequence identity, many of the enzymes exhibit strikingly similar three- dimensional (3D) structures. The monomers fold into two domains: the N-terminal NAD(H) or NADP(H) cofactor-binding domain and the C-terminal NDP-sugar substrate- binding domain. The N-terminal domain consists of an alternating a/B motif known as a Rossmann fold, which is commonly associated with dinucleotide binding. The cleft between the two domains is the site of dinucleotide cofactor binding, sugar nucleotide binding and where catalysis occurs. These structural characteristics place the enzymes in the NDP-sugar modifying subfamily of the short chain dehydrogenase/reductase (SDR) protein family. This is a large and diverse family of proteins. requiring NAD(H) or NADP(H) as their cofactor to carry out a variety of reactions including reductions, oxidations, dehydrations or epimerizations. Although members of the NDP-sugar modifying subfamily of the SDR family share amino acid sequence identities of only 15-30%. distinct sequence motifs allow for tentative functional assignment of residues. For instance, a common N-terminal glycine- rich region is a part of the nucleotide-binding region; this allows for the close packing of the cofactor to the protein backbone. Also, the conserved catalytic triad of Thr/Ser and Tyr-XXX—Lys allows one to build a model of catalysis for the initiating step (Figure 3). Mutagenesis and kinetic studies on SDR proteins has revealed the importance of the catalytic triad member Tyr, which acts as an active site base deprotonating the O4 hydroxyl of the substrate as the C4 position is oxidized by NAD/NADP. The catalytic Lys lowers the pKa of Tyr by stabilizing it in its negatively charged state [32, 41]. The role of the Ser/T hr catalytic triad member is less clearly defined. It has been proposed to help orient the substrate in the active site and/or facilitate proton transfer [42-45]. The oxidation at the C4 position prepares the sugar for further reactions by acidifying the hydrogen atoms at positions 3 and 5. The 4-keto intermediate acts as a springboard for other SDR reactions. Table 1. X-ray crystal structures of NDP-sugar modifying enzymes involved in 6- and 3,6—di-deoxyhexose pathways Enzyme Substrate Product Protein Number Family of Species GDP-D-mannose 4.6- GDP-4-keto-6- dehydratase GDP—D-mannose deoxy-D- SDR 4 (GMD) mannose GDP::;;::§°** GDP-4-keto—6- . - deoxy-D- GDP—L—fucose SDR 1 epimerase/reductase mannose (GMER) dTDP-D-glucose 4,6 dTDP-4-keto-6— dehydratase dTDP-D- glucose SDR 4 (leB) deoxy-D-glucose dTDP-4—keto-6deoxy- D-glucose 3,5- dTDP-4-keto-6— dTDP-4—keto-6— .new . deoxy-L- epimerase l epimerase deoxy-D- glucose . (mm C) mannose family dTDP-6-deoxy-L-lyxo- dTDP-4-keto-6— dTDP-L- 4-hexulose reductase deoxy-L- rhamnose SDR 1 (leD) mannose CDP-D-glucose 4.6— CDP-4— keto-6- dehydratase CDP-D-glucose SDR l deoxy-D-glucose (BOD) CDP—D—tyvelose 2- epimerase CDP—D-paratose CDP-D-tyvelose SDR 1 (AW) TYT Tyr (0 HO O-NDP O-NDP NAD(P)+ NAD(P)H Figure 3. SDR model of catalysis The SDR catalytic triad member Tyr deprotonates the 04 position of the sugar as a hydride is transferred from the C4 position of the sugar to NAD(P). Nearly all the members of the SDR family exist as dimers where the monomers interact, forming a four-helix bundle at the interface involving 2 helices of each monomer. However, there are exceptions. leD exhibits a new Mgz’ldependent dimerization mode and the GMDS seem to be following a new tetramerization pattern. So although we can make assumptions based on the highly conserved sequence motifs of some of these enzymes, we have already seen exceptions to the rule that makes the structure/function relationship of these enzymes an important topic to be studied. Furthermore, the fact that leC represents a new class of epimerases unrelated to the 10 SDR enzymes raises questions about other enzymes in the biosynthetic pathways of deoxyhexoses. GDP-D-rhamnose biosynthesis Rhamnose in P. aeruginosa LPS The focus of this dissertation is on structure/function studies of the enzymes involved in the biosynthetic pathway of the NDP-activated 6-deoxyhexose, GDP-D-rhamnose, from the bacterium P. aeruginosa. This opportunistic pathogen causes infection in patients with impaired immune systems such as burn wound victims, cancer patients and especially those with cystic fibrosis (CF). Due to its intrinsic resistance to many antibiotics, infections by this bacterium are difficult to treat; therefore efforts have been made to understand the factors involved in host-bacterium interactions and the establishment of infection. The LPS, which plays a significant structural role in the outer membrane, contributes to the pathogenesis of P. aeruginosa [46]. It is involved in protecting the bacterium from phagocytosis [47] as well as serum-mediated killing [48]. As previously mentioned, the LPS of gram-negative bacteria is tripartite in nature: the hydrophobic lipid A region, the core oligosaccharide region and the O-antigen or O polysaccharide region (Figure 4). P. aeruginosa contains two variant forms of O-antigen known as A-band and B-band. The serotype-specific B-band is a heteropolymer composed of many different monosaccharides, while the common A-band LPS is a homopolymer of D-rhamnose arranged as trisaccharide repeat units linked Oil—>2, Oil—>3, Oil—>3 [8]. Interestingly, studies have shown that B-band O-antigen is absent or expressed in smaller amounts in chronic P. aeruginosa isolates from CF patients, while 11 the level of A-band O-antigen is maintained [8]. Furthermore, there is also a correlation between the presence of anti-A-band antibodies in CF patients with both lower pulmonary function and increased duration of P. aeruginosa infections [8]. Other organisms expressing the A-band O-antigen include two bacterial species known to be pathogenic in CF patients, Burkholderia cepacia and Stenotrophomonas (Xanthomonas) maltophilia, as well as Pseudomonas syringae pv. morsprunorum C28 and P. syringae pv. cerasi 435 [8]. As a result, the biosynthesis of A-band LPS has gained interest as a potential therapeutic target. A i B-band l l O-antigenf A-band Core i region . Lipid A l Outer Leaflet 5E §§§§§§3§§§ Inner >5 s‘gg s >§><2 Phosphollpidi §$<§Z§<>§§E $3 : ._ \ V I ‘2 2 ($223? I "‘5 Leaflet Figure 4. Lipopolysaccharide of P. aeruginosa' (A) Schematic of the outer membrane and (B) chemical structure of A-band O—antigen of P. aeruginosa. Images in this dissertation are presented in color. 12 GDP-D-rhamnose biosynthetic pathway The source of the D-rhamnose molecules found in P. aeruginosa A-band LPS is the nucleotide-activated GDP-D—rhamnose. Markovitz proposed the biosynthetic pathway to GDP-D-rhamnose in the 19605 [49]. The reaction proceeds through two steps: GDP-I)- mannose 4,6-dehydratase (GMD) converts GDP-D-mannose to GDP-4-keto-6-deoxy—D- mannose, and the reductase (RMD) subsequently reduces the intermediate to GDP-D- rhamnose (Figure 5). The first step in the reaction is particularly important, as GMD is also part of the GDP-L—fucose pathway. Defects in GDP-L-fucose synthesis, particularly in GMD activity, have been linked to stem shoot development in plants [5]. In addition. deficiencies in the biosynthesis of GDP-L-fucose in humans have resulted in the rare immune disorder leukocytes adhesion deficiency type II (LADII) [50]. Characterization of GMD has been reported from bacterial sources H. pylori [4], E. coli [2], Klebsiella pneumoniae [l] and A. thermoaerophilus [3], plant source Arabidopsis thaliana (known as MURI) [5], and mammalian sources porcine thyroid [6] and human [7, 51]. GMD amino acid sequences are quite similar and the molecular mass of the monomers generally lies between 40-55 kDa. Somoza et al. determined the 3D structure of GMD from E. coli and definitively confirmed the structural relationship of GMD to the NDP- sugar modifying subfamily of SDR enzymes [52]. However, GMDs from various species tend to differ in their multimerization. Like SDR members, dimeric structures have been reported for E. coli [52], K. pneumonie [1] and human GMDs [51]. Nevertheless, the more recently published structure of GDP-D-mannose 4,6-dehydratase MURl from A. thaliana [44] suggests it exists as a tetramer. Furthermore, there are reports that suggest l3 that H. pylori GMD is a tetramer [4] and that porcine thyroid GMD may be a hexamer l6I- OH OH H O O GDP-D-mannose O-GDP OH NA DP+ GMD NADPH OH OH ,O O GDP-4-keto-6-deoxy- D-mannose O-GDP CH3 N ADPH RMD ( NADP+ OH OH OH O GDP-D-rhamnose O-GDP CH3 Figure 5. Biosynthetic pathway of GDP-D-rhamnose GDP-D-mannose 4,6-dehydratase (GMD) catalyzes the conversion of GDP-D-mannose to the intermediate GDP-4-keto-6-deoxy-D-mannose. The 4—reductase (RMD) catalyzes the reduction of the intermediate to GDP-D-rhamnose. A BLAST search with the amino acid sequence of P. aeruginosa GMD demonstrates that it most closely matches dTDP-glucose 4,6-dehydratases (like leB), RMDs and UDP-glucose 4-epimerases, which catalyze the interconversion of UDP-galactose and UDP-glucose. These enzymes are also members of the NDP-sugar modifying subfamily 14 of the SDRs and contain a catalytic triad that includes Ser/Thr and Tyr—XXX~Lys as well as a characteristic glycine-rich region at the N—terminus. Mutagenesis studies on E. coli GMD have confirmed the role of Ser/Thr-Tyr-Lys in catalysis [52]. Like the initial catalytic step of the SDR enzymes, Tyr deprotonates the O4 hydroxyl of the hexose ring while NADP oxidizes the C4 position forming the 4—keto intermediate. The ensuing dehydration step requires another active site base. Evidence suggests the second active site base is a conserved Glu, which effectively implies that a catalytic quartet is required for dehydratases [52]. To complete the action of oxidation and reduction, it has been demonstrated in a bacterial GMD that the mannose C4 hydride is transferred from NADPH back to the hexose C6 position to produce the 4—keto-6-deoxy intermediate product [53]. Much fewer studies have focused on the second step of the GDP-D-rhamnose biosynthetic pathway. In fact, RMD has only been characterized in the nonpathogenic, gram-positive bacteria A. thermoaerophilus [3] and in P. aeruginosa in a coupled reaction with H. pylori GMD [54]. The amino acid sequences of these RMDs indicate the molecular masses of the monomers are ~35 kDa. Currently there are no 3D structures available and the state of multimerization is unknown. A BLAST search with RMD demonstrates its relatedness to SDR members: GMDs, dTDP-glucose 4,6-dehydratases and UDP-glucose epimerases. The amino acid sequence of RMD most closely aligns with that of its biosynthetic pathway mate GMD (Figure 6). The conservation of the SDR residues is initially apparent: Gly-XX-Gly-XX-Gly and the catalytic triad of Ser and Tyr-XXX-Lys (corresponding to Gly9—XX-Gly12-XX—Gly15 and Thr126 and Tyr150- XXX-Ly3154 of P. aeruginosa GMD). If RMD fits into the “functional-mold” of the 15 SDR enzymes, the catalytic triad member Tyr would be expected to activate the substrate by protonation facilitating the cofactor to reduce the 4—keto-6-deoxysugar to GDP-D- rhamnose. This reaction mechanism has been suggested for the related enzyme dTDP—6- deoxy-lyxo-4—hexulose reductase (leD), which is involved in dTDP-L-rhamnose biosynthesis [26]. It is interesting to note that one of the catalytic residues of GMD, Glu128 (GMD of P. aeruginosa), is not conserved across RMDs. In GMD catalysis, Glu128 is proposed to abstract a hydrogen atom from the hydroxyl group at C5 of the sugar. The C5 atom is not expected to be involved in the reduction as RMD functions by reducing the keto group at the C4 position. I Figure courtesy of Dr. Joseph Lam, Department of Molecular and Cellular Biology, University of Guelph, Guelph, Ontario, Canada. 16 Figure 6. The alignment of GMD and RMD amino acid sequences from various species Conserved residues are highlighted in blue boxes; similar residues are highlighted in light blue. 17 Oxm0.0mm>mzm¢¢m<¢aum4< Hozwwfix>wom¢dzw>m¢flm>w >mZUmHHOZOJUAmm<4JMm>¢ 4mAO>Z>B>3m<¢UKm4¢ ¢mmm2>>H3>4>JK¢¢U m¢>Sfld>¢ dwzwwmmme>BszdwaMQ>< dmzommmm>Z>BH3IU>>Kfl>m Z>BH3$O>QKQ>O and Az>m >IJH JZQH 4&44 qmqh HmAH AKAH HZB> Jmh> ....mq¢mm¢0>qu>$IU¢Om ....qn43mema<4 ........ HmOHZEmemmqum>mwoxonqnmeqwmoem Hmoezmmmmmm>Hom>m>oxmaanmaq>moaooeam HamezHmmmmmxaom>m>642nAsmaqwmonooe>w. Z>ZHOZ m>ZHmm HJZHOJ H<>>Q< >0Q>D< HOQQ< A49>OO AOD>>O mod mma A>mzzmmbm wxeJZmQx m4 >mmmhmmm>x >mmmmmm>< quOm3m9¢ F>m023m<0 mm :wZZfiXUdmamugflBUOXme >>mmmm>ma>m4mm¢mm....Oddfln WWmEmmJOZmflm>mmmmdeZ$wmmmmOHBQHmUQQZWQ....m>4ud finmemmHamNSHmJ<4m¢>OL>>DOmmH>thUmmOm¢....¥J¢UJ LWWmQImmHPMmOQ.mm..EmubzmmmOW¥>BmOmQHhmmm>¢md flWmmmwmmBFN¥OmHmO..>x $Jmmhm¢0>mx>mZHAOUB..¥>43 wwwmmzmmH7NZOmHmJ..>m >4mmhm40>mmHMMmQOJH..m24wd .mmemeHPM¥OmHmO..>JL%JmmBm¢O>mmfixme04m..demd fwmmmwmmFPNXOmEm>..>x mzmmmmewmxdflmme....ma4mfi mmmmmwmmFZMDOmméo..Hd m2mmbm40>m¢fimmmm0....mH4mA and ova mna man man IBH¢< I>>IHQ4¢¢UJ¢OWJOmdeHDUm>me¢Im ........... mmxdd ImH>meHDWH>¥¥>mOmQZHQdeZW.>Zm ........... >BOO¢ $H>¢Dm.meZJDOAmDmeJDWmIQ>...m ........... ImOdd Z>>mflfixH>QH3mmqmmIquJ4¥Z>ZEQQH»HEZ IWOdU Z>Hm9mx>mZHHK>AUHWDBJQU>$J¥EZUMHmdomzqumm Im04< ZWHmPNXBhflHAmHJmeQHZ.medmmmmxmmmmdflqumfl ImZ>mQflO>MMAHmBAZMBQmJDOWIth.XQZUBmmflowafl mmOdO Z>>mQ.OHXXHHd>dmmdDBJQOm>ABH¥QX ........ dmm mmO<4 Z>>mOQO¢¥H>€m0>m04042900m>0HQOm ...... Hoamm we on we an A>¢OAIxO>mOm43 HZ>ZHaHmBmmHJU>>M>OxMJQded WmOSOOb H >mquBz oooooooooooooooooooooooooo HJ¢memm>>BmQZmEmUOm>memOmm .............. HQ4H¥WXZ ......................... HJ—dN/me ........................... HJ—desz ........................... mean 1... u . EWUQHOG>EGZ¢ ,..... HQHNHNmE ........................... ....dxdmzzmmhwm>m>ZOmHQ224J>xO>mO<>fl fiddmz ............................ a >mme2mmm¢mmxme>m>wxmnaEm4q>manoa>m“ dmzmbflmmmm<>dom>m%0&MJJ4¥¢J%QUDOOEH mv . . on ON >1H4mmrfinz .......................... ca w DEM MdOZHmMm.m 92m 0mm402mmmfi.< Q2“ dmOZHUDmm4.m n20 mHmmOQHmdflfl QED Zdlbfl Q80 HMOQNQ.H 920 HAOU.M DIG OMM£OSMMNH.¢ DID (mOZHODmM¢.m 92m Mmmmmeszq ................. mmm>mwm30ququm0xHBH.mmx.OBBQIdmdmmUm>mmOm4mmzmdmDO mgmOO&¢DEWU¢D>J>¥mmxfiqmqoma>22x>dxmm0>.Ommioq>mx4xmdDUOQZO>mdQMm>mOQH ............ mwdzmZHmde>Q¢m>me>demd>.max.quonXBUDGOAmD>mBmmwwx40> ..m>mZKABNOUQOAWJZXOUquawmdzox>qm¥daw.mm>.Udmxmdwemawqqao>mbmmmwmmoH .mquH¢>Q>Ummxaqmmxxddma024>2mm>zmeBH.max:qummdx8m00445m>m ............... xaoxmHJDZOZQ>EZB>QOmmmemmZ:0A¥ZH¢XmD>mdQMmmme> ................ >¢mH>mmmzwzqddeBQquB.xmm0940m24mzzwmdmmHamm—ZmDmDH ................. mmw>mmao4m>EEqumemH.mmKBOA>mO¢x¢mZOAQ>Q>m¢mmmmdeH wan man man wow wow rHs>mH> oooooooooooooooooooooooooooooooooooo mmmqZ¢Z4AAJDQ>OOHmBOHomU>Zo o o o>> >qum oooooooooooooooooooooooooooooooooooo >OdH04qquqmmeom00mU>Zoooow> m>>ox3 .................................. zqwqwomm>oammm>emwmme<> ..... >w B>m>xoew ................. xumo>mzmzxom3>HonHmammxm>mmm>mm>me¢H ..... >m m>>HfiUBMJImJZJOAm¥HAUHWXHOBZOmMJQHUHmmkm2x>mDm>mEB¥DHdH ..... >w €HH>DOQ¥ ....... >OQ¢DIOB>m>>HUfimm>0fi0mmmdxHOA04¢mOm>m>O>UB¢H ..... >m >>>omw .................................. zqw>mwm¢Hmummm>mmzmwe> xqmom» oooooooooooooooooooooooooooooooooo ng>mmm¢HOUEDm>FHB>wB<> IIIII >> men men omu m0m0>¢ou>flm>oqmmu440420>MQOQ40¥0HQm>QBmmw>¢mQ70>xHHQMOmemHQ>HOMhUAmOUmOH1W €0¢m0mmaama>:0>aqmfiww>DHQO>mAmZth¥OUmOHa. DQmmmOOAZAZde>N.U¢mORfiwmdodzudmax.BOJU>¥Hquhszmmmmmq QMQWQZOJZQZE¢M>MQx4mURAmX¢QAZUAmmU.meUA>H34>mmmmeh>hz>Qx¢mwzmwxdodzaqwqu.QBJZ>bhm0m>mmmWT OmmOMOOJZZZOEx>y.x¢IUZ&nJmDZzUA>JU.mmaOOthOmmmmmT moaxmooqzqzzom>s.xooemee>mmHommmmmxx.nmeu aHomm u. .n QQéXQOOJZJZZWQ04mOZFJ¥¢D>ZUAEAW.OOXOAXHm4>MmH>hmH0mgammmE ova wan on“ man mom mad and t 92m mfiwszwm.m 92m OMMQOEMNEB.< QSm 4mOZHODmm<.m DIG mHmmOQHmddd n80 debm GIG Hmonhm.fl 020 HAOU.M GEO ON”¢OSMNNB.4 DID QmOZHODmM4.m DEM mxooo fo 0 espresso E Ill 0 IO IO 30-0 IO All IO 0 mmoccwEé- m :0 new: L .. A +no¥0wqmmhm¢0>mx>mZHQOUH. mfizmzoafimd>m0>dmmkmfiomeBKMmeAH. QBFMXOQHmO>dO>QmmHm¢O>mmBMMmQOdh. HQBFWMOQ. mmBmOmZNWWOdOM¥M>FMOmDHBIW mu>x0azmmmmdowmx Z>¢mdz>€ Zodwzo>m ZU¢LJU>< - 3| '8 QXEZUWHE¢OmZ¥>Qmm Jmmmxxmommqaquxo JEh.¥&ZUHE&QO>HIQ AXZQ¢M2>ZIQQHVHIZ >JUBQQHJDCw HJZWmQPZQDw BAZMHQWJDDa c.mH¥>mm0¢OAZ>HmHm¥>mZHH . m>¥>$m0<<>xw2d042w>mam0>mma m><>mm0<¢qz>>mom¥H>DH mB¢>hmO¢OAZ>>QOOH¥MH 4>JmIJQHJDOm JHH¥D¥ ........ Jim . m¢0>mm0¢¢dzw>mOQO¢¥H> >OHQUWHO ...... Qmm ooa om on or om om dAnquddnflfiqqdddnfiq .Allll deDDHQddNDfiq AIIIII dd .8 a «a a H0m>m>0¥mddmmdqmeQOOBHOBHA¢>zmmMUZNUDUmUM<...mmomémdmdz.. dom>m>042dqymm00009>OBHA¢H¥mKZ ......................... H0m>m>0xmqqmmOBHA4>wa ......................... J0m>m>0xmaqqxdd>q¢mmBE .......................... on ON OH H Sad ill 3333 i as a 8 a nannouanau¢ afluosu.< onoaamuoa.m $38392 afluosu.¢ oaoaamsuou.m adadm auoflhm.m “HOU.M aanaouanaud canonu.¢ uaouamfluoa.m 38 mnowuomaounH Hofimuuma ............. Q¢$>Emm>JmQh<>.mmz324¥0M<¥FUDOOJmD>mB .mwmzmaHflOUDOQquxoumXQoymJZQ¥>qdeQ>.mm>30am¥m<¥fimm©gqqa>mfi mm4Qwomquqmm¥x<¢MJDZ¢>me>EmKQBH.mmxBDAXmE¢¥BmOOJQBm>mQ .mOOM¢OZ>OQ>Mmmxzzx>4¥mm0>.Oaxzwd>m¥<¥m¢QOOQZQ>w< .............. quxmHADEQZQ>ZEB>QOmmmH¥mmZSUQXZHQ>m< ................ mMm>mmJQEZMHJmQQmB.MQXZGA>NOQ>W¢ own oam oom omn dddflqddUHQdANQdddflqugq ddfiqqddfiq .Llllll c_5 as N_u amwwxqa>e>m>xo ................. meomo>mzmzxomz>aeonmqamxm>amm>mm>mwed amasmmoam>>HouqumquaxqmxquHmxaoezomquawammamzx>aem>mhexoe< amawmmo>¢aa>o0m ....... x>omaomoe>m>>aoxmm>weommquHoqo¢¢<2m>aom>mwo>we¢ mMmeODHm>>QM3 .................................. ZJquOmm>QAmmm>FIOmmH¢ amaaxmo>>>>om> .................................. zqo>m0adamoamm>mmzmwe< mMmmdeHMQmDMV .................................. QAO>Imm4HOUEfim>BBB>UB< oow ohm own omm ‘III dddddddddqq All __n we o_n H>aoanmmzoqquzdm>wox¢mozomxaan209mmumqowq»me>mmmme9>mzwouaHmaoqzamzzx>wog«mozomxamazoawnuoeqzwaaamaaaeaxme>aemom>mmmm2quw H>aamEOmooqzaxozx>woa¢mozomqmnzzoqwnumanuo¢Hzmemommmmm22mqaw >>womaxmuonzn.a¢m>>awmaozommannzoaqueano>xHawaaxeme9>mzmommnmmm2aqHo som>>om¢aoznmoooeaxme>mmHomxnmmmZEAHo >>>on4xmooqzara m>>oozoqmnmooxoaxHm4>aoe>xme>mmHomqnmmmzaqu own omm own oam oou oma oma .LIII ddflmqqddenfiq AIIIII .OII dddflfiqqdd&flnfiqq .All on he an an as mnofluumumunH Hufian dufism anoH>m.m auoo.m namoooanuua oauonu.< amonamsuou.m ndadm akOahm.= aHou.m mauaouanaua oauonu.¢ ouonamauoa.m nufium aHoH%Q.m aaou.u nauoooanuua ofiuonu.< amonamfiuoo.m 6.283 .2 2:»: 39 In the oxidation step, the NADP cofactor abstracts a hydride from the C4 hydroxyl of the mannose ring. The mannose modeled in the active site of GMD reveals a distance of 3.5 A between the C4 of the nicotinamide ring and C4 of the hexose. The angle formed by the N1-C4 atoms of the nicotinamide ring and C4 of the hexose is 92°. Both values are consistent with the positioning of the cofactor and substrate in other related enzymes such as MURl [44], SQDI [43] and human GalE [45], where the distances range from 3.4 A to 3.7 A and the angles fall in the range of 81° to 96°. To complete the oxidation step, an active site Tyr removes a proton from the O4 hydroxyl forming the 4-keto intermediate. The catalytic TyrlSO in GMD is in proper position (distance of 2.7 A) relative to the mannose model for it to directly attack the O4 hydroxyl. Early studies of E. coli GalE [55] and dTGDH [21] showed that the distance of Tyr to the O4 hydroxyl is too great for it to act directly as the base. The Ser/T hr catalytic triad member had been proposed to act as a proton shuttle to complete the oxidation step. However, more recent crystallographic studies of human GalE [45], dTGDH [42], SQDI [43] and MURI [44], showed that Tyr is within proper hydrogen bonding distance to directly attack the O4 hydroxyl. The Ser/T hr instead may orient the substrate in the active site and/or facilitate proton transfer. The presence of hydrogen bonds between the O4 and O6 hydroxyl and the Thr126 hydroxyl suggests that both these roles may be accomplished. To further facilitate the oxidation step, the catalytic Lys may stabilize Tyr in its negatively charged state. Studies suggest that Lys lowers the pKa of Tyr [32], which is normally between 9-12. The measured pKa of Tyr in E. coli dTGDH [32] and E. coli GalE [41] is 6.4 and 6.1 respectively. The distance between the phenolic oxygen of Tyr150 and the amide nitrogen of LyslS4 is 4.4 A, a distance too far for hydrogen 40 bonding, but within the range of electrostatic interactions to effectively lower the pKa of Tyr150. The 4—keto intermediate acts as a springboard to other SDR reactions. In the case of GMD, the ketone functionality serves to acidify the proton at C5 and permits the dehydration from C6 to form the GDP-4—keto-5,6—ene intermediate. The presence of the “cue” intermediate has recently been detected in the homologous dTGDH reaction [29]. For dehydration to occur, another active site base must be present to abstract a proton from the C5 position. Studies of E. coli GMD [52] and dTGDH [30] showed that a glutamic acid might fulfill this requirement. The corresponding Glu128 side chain in GMD is within 3.6 A of the C5 carbon of the mannose model, a position that would enable it to deprotonate C5. To complete the dehydration reaction, the C6 hydroxyl must be protonated by an active site acid. An Asp residue has been proposed based on structural analysis of dTGDH [42] and supported by mutagenesis experiments [27, 30]. The corresponding GMD residue, Ser127, is within 2.8 A of the modeled position of the hexose O6 hydroxyl, and may assume a similar role. Alternatively, Glu128 is within 2.6 A, suggesting the possibility of this side chain playing a dual role in the dehydration step, acting as both a general base and a general acid, as has been suggested in dTGDH and MURI [30, 44]. Whereas the dehydration mechanism described here is the step-wise water elimination mechanism as seen in D135N and D135A mutants of dTGDH, dehydration may also occur through a concerted mechanism as seen in wild type dTGDH [31 [. Further kinetic studies would need to be completed to determine which mechanism of dehydration GMD actually utilizes. 41 The final step of the GMD reaction involves a hydride transfer from NADPH back to the hexose C6 position. The distance between the nicotinamide C4 atom and the C4 and C6 atoms of the hexose moiety (3.5A and 3.8A respectively) suggests that only modest rotation of the hexose ring would be required to complete the hydride transfer. Interestingly, because NADP is regenerated, the cofactor may remain bound through each catalytic cycle. To finalize the reduction step, an active site acid is required for proton addition to the C5 position of the hexose. Proposed residues to fulfill this role based on structural analysis of dTGDH include the catalytic Tyr, Glu or Asp [42]. Of the corresponding residues in GMD, Tyr150 and Ser127 (aligning with Asp) are >44 A to the C5 position of the hexose model. Glu128 is 3.6 A away, but would move even further with the rotation of the hexose ring towards the nicotinamide ring. However, Thr126 of the catalytic triad and Asnl79 are positioned such that they may be able to fulfill the role as the general acid to complete the reaction. 1.3.6. Structural Comparisons. The secondary structural elements between P. aeruginosa GMD and MURI superimpose well with a root mean square deviation (rmsd) of 1.2 A over Cu atoms. The secondary structural elements between P. aeruginosa GMD and E. coli GMD do not superimpose as well (rmsd of 3 A) because E. coli GMD has no substrate or cofactor bound in the active site. The main difference between the three is an area of disorder present in MURI residues 76-81 and E. coli GMD residues 35-55 which corresponds to a region that is highly variable in size and sequence among GMDs (Figure 13). This stretch in P. aeruginosa GMD is shorter and well ordered, forming a short helix ((12) between [32 and [33 of the Rossmann fold. Immediately preceding this region is a Gly-XX-ArgArg sequence that is conserved among all GMDs sequenced thus far 42 except for P. aeruginosa GMD. In contrast, P. aeruginosa GMD exhibits an arginine shift resulting in the sequence Gly—XXX-ArgArg. The two positively charged arginine residues mark the beginning of the RR loop that closes over the adenosyl phosphate end of the cofactor, are important in cofactor binding and are involved in the tetramer interface for P. aeruginosa GMD and MURI. The RR shift causes an interesting rearrangement of interactions. In MURI the first arginine of the sequence, Arg60, adopts a parallel stacking arrangement with Arg60* of the neighboring monomer, simultaneously packing against the adenine ring and coordinating the 2’phosphate. In GMD the second arginine of the sequence Arg36 also adopts a parallel stacking arrangement with Arg36* of the neighboring monomer. However, due to the shift by two residues, the side chain is oriented almost perpendicular to the adenine ring (Figure 14). Despite the rearrangement in this region, Arg36 in P. aeruginosa GMD still maintains the electrostatic interactions to coordinate the 2’phosphate. Based on the role that the cofactor plays in the tetramer interface of P. aeruginosa GMD and MURl, cofactor binding might also be expected to assist in ordering part of the RR loop in E. coli GMD and may be essential for tetramer formation. Other differences between P. aeruginosa GMD and E. coli GMD include the positioning of the loop between the [34 strand and the (14 helix. This loop in the apo E. coli GMD occupies a portion of the cofactor-binding site and would have to move as much as 6 A to make room for the cofactor. Secondly, the smaller substrate—binding domain in E. coli GMD adopts a more open conformation. The difference is apparent when comparing the rmsd of 2.4 A with P. aeruginosa GMD for the C-terminal domain, as opposed to just 1.8 A for the N-terminal domain. In addition to the more open 43 conformation of its C-terminal domain, E. coli GMD has an extended loop between the (18 helix and the BI] strand. The loop, which inserts an additional 27 residues compared to P. aeruginosa GMD, is variable in size among GMDs. ArgSS' , i‘ ArgGO 5‘s"! Ar961' . Figure 14. RR loop of GMD and MURl Stereoview of the superposition of the RR loops from GMD (highlighted in color) and MURl (blue). GMD side chains are labeled in green, MURl side chains are labled in blue, asterisks denote side chains from opposite monomer. One of the intriguing features about P. aeruginosa GMD is its oligomeric state, as it deviates from the canonical homodimeric structures seen in most other related enzymes of the N DP—sugar modifying subfamily of the SDRs. The structures of all other members of this subfamily, including E. coli GMD, have been observed as dimers, with the exception of the previously mentioned tetrameric tyvelose epimerase [40] and ADP-L- glycero—D L _‘ 5 r' , which is a pentamer [74]. P. aeruginosa GMD and A. thaliana MURI can be seen as a dimer of canonical SDR dimers, which then 44 generates a new set of subunit interactions. An important subsequent question is whether a significant number of tetramer interactions between GMD and MURl are conserved. As previously mentioned, the RR loop of residues Arg35-Arg43 that is so intimately involved in the tetramer interface and the cofactor binding sites, makes protein-protein interactions as well as protein—cofactor interactions to the neighboring monomer. The Arg35 to Ser85* and Ser37 to Arg43* hydrogen bonds are conserved in GMD and MURl. Furthermore, these residues are highly conserved among the GMD sequences. Also within the RR loop is hydrogen bonding between Ser38 and Arg43*. Although the residue corresponding to Ser38 is an Asn in MURl, the interaction is conserved. The sequences of several GMDs reveal that in most cases, a Ser is present in this position. Away from the cofactor binding site overlap, Asp62 hydrogen bonds to the amide nitrogen atoms of Val95* and Thr96*. Asp62 is highly conserved among GMDs. One of the more interesting interactions involves residue Arg68, which is moderately conserved across several GMD sequences. This residue is involved in hydrogen bonding to the main chain carbonyl oxygen of Asn92*, an interaction also seen in MURI. The same Arg in GMD and MURl extends towards the diagonally related monomer to hydrogen bond to Glu] 10 and Argl 13, both highly conserved residues among GMD sequences. 1.3.7. Conclusions. In summary. the GDP-D-mannose 4,6-dehydratase MURI isoform from Arabidopsis thaliana was shown to be a tetramer, while the first GMD structure to be determined, E. coli apo-GMD, was observed as a dimer. This raised the question as to whether or not prokaryotic and eukaryotic GMDs differed in oligomeric state. We have determined the structure of P. aeruginosa GMD with NADPH and GDP bound in the active site and found it to exist as a tetramer. The tetramer arises from the 45 dimerization of the canonical dimer seen for most members of the SDR superfamily, but in a manner where the cofactor binding sites closely interact across the new interface. The residues involved in the tetramer interactions are well conserved between the prokaryotic GMD and the eukaryotic MURI. Moreover, a high degree of sequence conservation among the residues within the tetramer interface is also observed across a broad range of GMDs. These observations suggest that the tetramer may be a more common oligomeric state for GMDs than previously thought. 1Portions of Chapter 1 were previously published, Webb et al.. (2004) Protein Science. 46 CHAPTER 2: CRYSTALLIZATION AND INITIAL X-RAY ANALYSIS OF RMD 2.1. Introduction Genes coding for a particular bacterial polysaccharide are usually found in clusters. In the case of both A. thermoaerophilus and P. aeruginosa, the rmd gene is located adjacent to the gmd gene. The corresponding RMD enzymes have been characterized in both the gram-negative bacterium P. aeruginosa and gram-positive bacterium A. thermoaerophilus [3, 54]. A sequence alignment of the two RMDs shows 33% identity and 54% similarity between pairs. A BLAST search with RMD shows significant similarities to GDP-mannose dehydratases, dTDP-glucose dehydratases, UDP-galactose epimerases and GDP-fucose synthetases, all members of the NDP-sugar modifying subfamily of the SDRs. No RMD analogue has yet been found in the human genome using P. aeruginosa RMD as a probe, which is in agreement with the fact that humans lack rhamnosylation. Thus, the second step in GDP-D-rhamnose biosynthesis appears to be a potential target for enzyme inhibition to prevent the synthesis of D-rhamnose- containing LPS molecules in pathogenic bacteria. Structural information would aid in the design of inhibitors. Since no structure is currently available for RMDs, efforts were made to determine the X-ray structure of RMD from P. aeruginosa. As this proved to be a difficult task, an ancillary effort was made to determine the structure of RMD from A. thermoaerophilus, which has shown to be similar in function to the P. aeruginosa RMD. Reported here are the cloning, expression, purification, crystallization and initial X- ray analysis of the RMD crystals from both bacterial species. The amino acid sequence of RMD most closely aligns with its biosynthetic pathway mate GMD. If the residues align at the tertiary level, then we would expect that the core structure would mimic that 47 of GMD, or other SDR enzymes, since the three-dimensional structures tend to be highly similar. Crystallographic analysis should allow us to see if RMD meets our expectations or breaks from the mold of the typical SDR protein. Preliminary data suggests that P. aeruginosa GMD may be an appropriate search model for the molecular placement method of phase determination using data from A. thermoaerophilus RMD crystals. indicating our hypothesis may be true. 2.2. Experimental Procedures 2.2.1. Cloning and Expression of RMD. The rmd gene from P. aeruginosa, originally isolated by Lightfoot and Lam [75], was amplified by PCR while incorporating a BamHI site over the natural start codon and a SalIII site over the natural stop codon. The new PCR fragment was ligated into the pQE30 vector (Qiagen), placing a 6x-His tag at the N-terminus. A glycerol stock of E. coli M15 cells harboring the recombinant plasmid was used to inoculate a starter culture of 50 ml Luria Bertani (LB) broth. Following cultivation at 37°C for 16 h, the starter culture was transferred to 1 L LB broth. Once an optical density at 600 nm of 0.5—0.6 was reached, the cells were induced with 0.25 mM IPT G and grown for an additional 16 h at room temperature before harvesting by centrifugation. Cells were re-suspended in Lysis Buffer (50 mM HEPES, 300 mM NaCl, 5 mM imidazole pH 8.0). The rmd gene from A. thermoaerophilus, originally isolated by Kneidinger et al. [3], was also amplified by PCR while incorporating a BamHI site at the N-terminus and a Kpnl site at the C-terminus. The PCR fragment was then ligated into the pQE80 vector (Qiagen), which placed a 6x-His tag at the N-terminus. The new plasmid was 48 transfonned into E. coli BL21 DE3 cells. The same protocol as stated above was used to express the protein in shake flasks. 2.2.2. Purification of RMD. The same protocol as in section 1.2.3 was used to purify RMD resulting in the N-terminally His-tagged RMD from P. aeruginosa (PaRMD) or the N -terminally His-tagged RMD from A. thermoaerophilus (AtRMD). 2.2.3. Crystallization of RMD. Purified PaRMD or AtRMD at 10 mg/ml were used to set up crystallization screens using the microbatch method with the Impax I-S robot (Douglas Instruments). A total of 198 conditions were screened by combining 1 ul of purified protein with 1 ul precipitating solution. Crystallization conditions were refined using the hanging drop vapor diffusion method. Prior to data collection, the PaRMD crystals were treated with 0.1% glutaraldehyde, briefly transferred to a cryoprotectant solution (10% PEG 10K, 100 mM Hepes pH 7.6, 7.5% glycerol, 2.75 M sodium formate. 5 mM GDP-D-mannose and 2 mM NADP) and flash-frozen in liquid nitrogen. The AtRMD crystals were flash-frozen in liquid nitrogen directly from the crystallization drop. 2.2.4. Cryoprotection Techniques. Through personal correspondence with Terese Bergfors [76], a technique for streak-seeding for cryoconditions was created. Purified PaRMD at 10 mg/ml was used to set up crystallization screens (Crystal Screen 1 and 2 from Hampton Research) using the hanging drop vapor diffusion method. A 1:1 mix of protein and precipitating solution in a drop size of 4 ul was placed on the cover slip and allowed to equilibrate over 500 pl of precipitating solution for one day. All drops were then streak-seeded with their respective protein crystals in an effort to determine possible cryoprotectants. This process was accomplished by touching a cat whisker to the side of 49 a previously obtained protein crystal in order to gather nuclei. Then the cat whisker was streaked through the drops on the cover slips whether there was precipitation or not. The drops were then monitored for their ability to grow crystals. Those conditions that supported any crystal growth, whose precipitating solution contained a component used for cryoprotecting, provided a starting point for optimizing the final cryosolution. A technique was devised from an idea that stemmed from a method listed in the CCP4 bulletin board (we-'w.ccp4.ztc.uk) titled “Summary of oil and cryo-protectant combo”. After crystal(s) were obtained in a hanging drop vapor diffusion screen, an o- ring greased with vacuum grease was placed on the cover slip around the drop containing the crystal(s). The resulting reservoir was then filled with paraffin oil, covering the drop. The crystal(s) were then directly looped out of the drop and flash-frozen, providing a layer of oil as the cryoprotectant. 2.2.5. Data Collection and Processing. X—ray diffraction data from both the PaRMD and AtRMD crystals were collected on a MAR CCD detector at the Advanced Photon Source beamline 5—ID (DND), Argonne National Laboratory. During data collection the crystals were held at 100 l( in a cryostream and radiation was used at a 1.0 A wavelength. The data were processed using XDS software [77]. The AtRMD crystal diffraction data were used to calculate a self-rotation function using the AMoRe programs [66] from the CCP4 suite [67]. See Table 3 for statistics. 2.2.6. Preparation of Phasing Models. The secondary structure of P. aeruginosa and A. thermoaerophilus RMD were predicted using the PSIPRED secondary structure prediction method [78] using the PSIPRED server [79]. The fold recognition server 3D- PSSM was used to predict tertiary structure by threading [80]. The tertiary structure was 50 also predicted using SWISS—MODEL [81], which uses comparative modeling. The coordinates were checked visually and compared to other structures of SDR members. Based on the secondary and tertiary structure prediction as well as sequence alignments, molecular replacement models were constructed using P. aeruginosa GMD (PDB code IRPN), E. coli GMD (PDB code 1DB3) and Arabia'opsis MURI (PDB code 1N7H) as starting models. All differing residues were changed to alanines and portions were removed in which 1) amino acid sequences differed significantly and/or 2) the predicted structure differed. The main structural feature of the SDR enzymes, the Rossmann fold, was maintained. Table 3. RMD X-ray diffraction data PaRMD AtRMD Space group P422 (potentially P4212, 1" P4122, P412|2, P4222, P42212, P4322, P43212) Unit cell parameters a=b=182, c=250 3:46.88, b: 55.74, c=79.24, (A, °) a=B=y=90 =72.54, [3432.95, y=75.61 Resolution range (A) soc—3.7 30.0- 1 .8 No. observed reflections 209,657 215,748 No. unique reflections 44,486 64,129 Completeness (%) 93.9 (96.7)* 96.5 (95.2)* Rmerge (%) 7.5 (21.8)* 10.6 (48.1)* Average l/0(l) 5.8 (1.2)* 13.4 (3.7)* VM (A Da 1) 2.46 2.66 Molecules per asymmetric unit 12 2 Estimated solvent content (%) 50.1 53.8 *Indicates statistic from highest resolution shell. 51 2.3. Results and Discussion 2.3.1. Expression and Purification. RMD from both P. aeruginosa and A. thermoaerophilus were expressed in the pQE system as N—terminally His—tagged protein (PaRMD and AtRMD, respectively). Both constructs were readily purified to 90—95% by Ni—NTA column, and further purified to 95-98% by an anion exchange column. A typical protein yield from the PaRMD preparations was 9-12 mg of purified protein per liter of cell culture; the yield for AtRMD was less, only 2—3 mg of purified protein per liter of cell culture. SDS—PAGE gels of samples from a typical purification of PaRMD are shown in Figure 15 and AtRMD samples are shown in Figure 16. The gene products appear to run on the SDS—PAGE gels as a 35 kDa protein, which closely matches the molecular weight based on the amino acid sequence of 34.8 kDa for PaRMD and 35.9 kDa for AtRMD. 2 M L F1 F2 F3 F4 F5 F6HF-7 F8 F9 Figure 15. SDS-PAGE of samples from a typical PaRMD purification (A) load (L), flow through (FT), wash (W) and fractions (F) from the Ni-NTA column; (B) load (L) and fractions (F) from the anion exchange column. Molecular weight marker (M) standards shown in kDa to the left of each gel. PaRMD runs as expected at ~35 kDa. 52 LFTWFI F2 F3 F4 F5 MLFI F2 F3F4F5 193 a. -‘ 102 60 41 27 20 15 Figure 16. SDS-PAGE of samples from a typical AtRMD purification The left-hand side of the gel shows the load (L), flow through (FT), wash (W) and fractions (F) from the Ni-NTA column; the right-hand side of the gel shows the load (L) and fractions (F) from the anion exchange column. Molecular weight marker (M) standards shown in kDa to the left of the gel. AtRMD runs as expected at ~35 kDa. 2.3.2. Crystallization. The initial screening of crystallization conditions at room temperature for PaRMD resulted in protein crystal clusters as shown in panels A, B and C of Figure 17. The crystals in panels A and B were obtained in the following conditions: 50:50 mix of protein sample and well solution where the protein sample contained 10 mg/ml protein, 5 mM GDP and 2 mM NADP and the well solution contained 10% PEG 8K, 100 mM sodium/potassium phosphate pH 6.0, 200 mM NaCl. Although any sight of protein crystals in a screen is encouraging, it is clear from the inspection of these crystals under a low-powered light microscope that they are severely twinned. Crystallization conditions such as growth temperature, the pH of the buffer and the molecular weight of the PEGs were modified to find the optimal crystal growing condition. Single crystals were obtained simply by replacing 5 mM GDP with the substrate analog 5 mM GDP-D- mannose to the protein sample. Examples of these crystals can be seen in panels D and E. The crystals in panel D are hexagonal in shape and about 0.2 x 0.2 x 0.1 mm in size. 53 Panel E shows thick rod-shaped crystals that were generally 0.15 mm in thickness and 0.4-0.5 mm in length. Figure 17. Crystals of PaRMD Conditions giving crystals in panels A and B were optimized to yield single crystals shown in panels D and E by the addition of GDP-mannose instead of GDP. The conditions to grow the crystals in panel C were optimized by changing the buffer and the molecular weight of PEG to yield single crystals like that shown in panel F. The PaRMD crystals in panel C of Figure 17 were obtained in a different screening condition: 50:50 mix of protein sample and well solution where the protein sample contained 10 mg/ml protein, 5 mM GDP-D-mannose and 2 mM NADP and the well solution contained 12% PEG 20K and 100 mM MES pH 6.5. By optimizing the PEG molecular weight and using a different buffer, single, non—twinned crystals were obtained in the following conditions: 50:50 mix of protein sample and well solution where the 54 protein sample contained 10 mg/ml protein, 5 mM GDP-D-mannose, 2 mM NADP and the well solution contained 19% PEG 10K and 100 mM HEPES pH 7.5. An example of these bipyramidal-like crystals, which were generally 0.4 x 0.4 x 0.4 mm in size, is shown in panel F. A microbatch screen for crystallization conditions using AtRMD resulted in small, plate-like crystals shown in Figure 18. The crystallization conditions were as follows: 50:50 mix of protein sample and well solution where the protein sample contained 10 mg/ml protein, 5 mM GDP-D-mannose and 2 mM NADPH and the well solution contained 35% pentaerythritol propoxylate (PEP) (5/4 PO/OH), 100 mM MES pH 6.5 and 200 mM MgC12. Larger plate-like crystals were grown using the hanging drop vapor method. Figure 18. Crystals of AtRMD Small, plate-like crystals of AtRMD obtained in a microbatch screen. The PaRMD crystals tended to grow within a week whereas the AtRMD crystals tended to take 3-4 weeks to grow. Reproducibility was a problem not only with protein originating from different purifications, but also with protein from the same batch. In addition, the crystals were prone to crumbling and dissolving, necessitating an arduous 55 search for a suitable cryoprotectant and/or cryoprotection technique that would not damage the crystals. 2.3.3. Cryoprotection. Generally before X-ray data collection, crystals are briefly transferred to cryoprotected mother liquor then flash-frozen to minimize radiation damage during the experiment. The cryoprotectant ideally prevents the formation of ice and maintains the crystallographic integrity of the crystal during the freezing process. The selection of the cryoprotectant(s) as well as the percent composition involves some trial and error. The PaRMD crystals grew in PEGs, which are known ice-preventing agents. The X-ray data of the crystals that were frozen directly from the drop revealed the formation of ice by the observed powder diffraction rings or “ice rings” on the X-ray diffraction screen. Data from crystals that had been transferred to a drop with a higher percentage of the respective PEG revealed poor-quality diffraction, potentially indicating poor cryoconditions. Thus an assay of several cryoprotectants, concentrations and a test of mixtures were required to determine the best cryocondition. To reduce the evaluation time it took to find the appropriate cryoprotectant, a screen of crystallization conditions was streak-seeded. First, a crystal screen is prepared and allowed to sit over night. Then a cat whisker is stroked over the surface of a previously obtained crystal to dislodge and collect the nuclei. Finally each drop of a screen is streaked with the cat whisker. The crystal trays are then monitored for any crystal growth, whether large or small, twinned or single, needles or chunks. If crystals appear, then it is assumed that the components of the precipitating solution would be more helpful than harmful to the crystal lattice. With any luck, one of the components is a cryoprotectant, giving a good starting point in the search for the appropriate cryosolution. Nevertheless streak-seeding PaRMD for 56 cryoconditions yielded no positive results, so several of the more common cryoprotectants were tested (i.e. sugars, MPD, ethylene glycol, glycerol and lower- molecular-weight PEGs). However, in each case, the crystals crumbled before flash- freezing. Paraffin oil was also investigated as a suitable cryoprotectant, since it does not require the transfer of the crystal from one drop to another, which could disturb the crystal packing. A previous study has shown that crystals grown in PEG solutions are stripped of the thin film of mother liquor when passed through oil [83]. So a low—tech technique was created that would minimize manual manipulation of the PaRMD crystals. The cover slip on which the crystals were grown was turned drop side up. A greased o-ring was placed over the 4 ul drop and the reservoir was filled with paraffin oil. The crystal was then directly loop out of the drop, swiped through the oil and flash—frozen in liquid nitrogen. This method has the added advantage of providing extra handling-time as it protects the drop from drying out. Even though it seemed to be a less invasive procedure, the crystals were irreversibly damaged. In the end, the best cryocondition for the PaRMD crystals was an initial treatment with glutaraldehyde, then the use of a combination of the same molecular weight PEG in which the crystals were grown, along with glycerol and sodium formate as well as GDP— D-mannose and NADP. Glutaraldehyde is known to cross-link proteins through the amino group of lysine, adding stability to fragile crystals for data collection. The final conditions served to improve the quality of the diffraction by reducing spot twinning and mosaicity. 57 The AtRMD crystals also proved to be very fragile. However, mesh litholoops (Protein Wave Corp.) provided support for the thin plates during freezing. Since the crystals grew in a high percentage of PEP, a known cryoprotectant, the crystals were frozen directly from the crystallization drop. Fortunately, this proved to be a suitable cryoprotectant and no further optimization was necessary. 2.3.4. Data Collection and Processing. A complete data set was collected from one of the bipyramidal PaRMD crystals, like the one shown in panel F of Figure 17. These crystals diffracted to 3.7 A, while all other PaRMD crystals diffracted to 8-10 A at best. Although the signal-to-noise ratio is 1.2 in the final resolution shell (22 is desired), the completeness (96.7%) and Rmerge (21.8%) are respectable (see Table 3). These crystals belong to the P422 space group; it is unclear based on the systematic absences if the unit cell possesses symmetry elements that would place it in a higher symmetry space group. The number of molecules in the asymmetric unit was therefore estimated based on the current working space group (P422), unit cell parameters (a=b=182 A, c=250 A, a=B=y=90°) and molecular weight (34,825 Da) using the Matthews probability calculation [84]. The median Matthews coefficient (VM) is 2.52 A3/Da with the vast majority of proteins falling into the 2-3.5 A3/Da range; the median solvent content is 47% with the vast majority of proteins falling in the 35-65% range [85]. For PaRMD a Matthews coefficient of 2.46 A3/Da yields 12 molecules in the asymmetric unit with a solvent content of 50.1%. However. certainly within reason is a VM of 2.96 or 2.11 A3/Da, which would allow for 10 or 14 molecules respectively in the asymmetric unit with a solvent content of 58.4 or 41.8% respectively. 58 Though the resolution of the PaRMD data was high enough to determine phases, the data would not provide much detail beyond the backbone trace. While attempts were made to grow crystals in different conditions with the hope obtaining a different space group, these hopes were not realized. Since it is difficult to give up on crystals that have already been obtained, attempts were made to save the crystals and improve the resolution by post-crystallization treatments. Two separate annealing procedures were used: macromolecular crystal annealing (MCA) [86] and in situ annealing [86, 87]. MCA was able to improve the diffraction of the PaRMD crystals from 8-10 A to 6-7 A. In situ or flash-annealing was able to help decrease the ice-rings, but had little affect on the diffraction of the PaRMD. A “last resort” procedure, as described by the author of the paper in which the procedure was presented, was fast desiccation of protein crystals [88]. Although it was reported that poorly diffracting crystals showed “spectacular” improvement of diffraction with this procedure, the same results did not occur with any of the PaRMD crystals. This was the point at which efforts were directed at the RMD homolog from A. thermoaerophilus. Up until the point of X-ray data collection, the RMD protein from P. aeruginosa and A. thermoaerophilus behaved in a similar fashion. Expression and purification were comparable, even the crystals had fragility in common. However, once a single frame of X-ray diffraction data was collected on the AtRMD crystals, the difference was readily apparent, as they diffracted to 1.8 A. A complete data set was collected on the AtRMD crystals (see Table 3 for statistics). The crystal falls into the P1 space group and have a much smaller unit cell than that of the PaRMD crystals. As a result, based on a Matthews coefficient of 2.66 A3/Da, the number of molecules per asymmetric unit is 2, with a 59 solvent content of 53.8%. A self-rotation function indicates that the molecules in the asymmetric unit are related by a non-crystallographic twofold axis. suggesting AtRMD exists as a dimer. 2.3.6. Initial Phasing Results. Structure determination by the molecular replacement method using coordinates from various homologous structures was initially attempted with data from the PaRMD crystals. With the use of the programs AMoRe [66] and Phaser [89] from the CCP4 suite [67], no clear solution was evident in any of the possible P422 space groups. On the other hand, using a dimer as a search model based on the known structure of GMD from P. aeruginosa (PDB code lRPN), a solution for a dimer in the asymmetric unit was obtained using the Phaser program from the CCP4 suite [67]. Currently efforts are underway to obtain experimental phases for comparison to those determined by molecular replacement. 2.3.7. Conclusions. To summarize, the process of crystal structure determination of RMD from P. aeruginosa and A. thermoaerophilus has been initiated. The cloning, expression, and purification were successful, yielding plenty of protein pure enough for crystallization trials. Single crystals were obtained of protein from both bacterial species and X-ray diffraction data has been collected. While PaRMD crystals diffracted weakly to only 3.7 A, AtRMD crystals diffracted to 1.8 A. Full structure determination by the molecular replacement method using the coordinates from GMD from P. aeruginosa is in progress. Our hope is that the AtRMD structure will serve as a successful search model for the PaRMD data, allowing a comparison of the two structures. 60 CHAPTER 3: FUNTIONAL CHARACTERIZATION OF GMD AND RMDl 3.1. Introduction Rhamnose is a deoxyhexose found in glycoconjugates of bacteria and plants but not in humans. L-rhamnose is the more common isoform, found in cell walls and capsules of a variety of bacteria. On the other hand, D-rhamnose has mainly been described as a constituent of the LPS of gram-negative bacteria such as plant pathogens X. campestris [9], and human pathogens, for example P. aeruginosa [11], H. pylori [10] and Campylobacter fetus [90]. D-rhamnose molecules are arranged as repeating trisaccharide units in the A-band LPS O-antigen of P. aeruginosa. The LPS molecules of this opportunistic pathogen contribute to its virulence in immunocompromised patients including those with cystic fibrosis, cancer and burn-wounds. Interestingly, the same D- rhamnose polysaccharide structure seen in A-band LPS of P. aeruginosa has also been detected in other opportunistic pathogens B. cepacia and S. maltophilia that are problematic in cystic fibrosis patients [8]. The source of D-rhamnose comes from the nucleotide activated GDP-D-rhamnose, which is made in two steps: GDP-D-mannose 4,6—dehydratase (GMD) first converts GDP-D-mannose to GDP—4-keto—6—deoxy-D-mannose, which is subsequently reduced GDP-D-rhamnose by the reductase (RMD). The genetics of GDP-D-rhamnose biosynthesis in P. aeruginosa has been extensively studied. Lightfoot and Lam determined that the P. aeruginosa gmd gene coded for a protein that functions as a GDP- D-mannose dehydratase based on amino acid homology to other dehydratases. In addition, paper chromatography experiments showed the conversion of ”C-GDP-D- mannose to GDP-D-rhamnose from supernatants of an A” strain carrying the gmd gene on 61 a plasmid [91]. Rocchetta et al. confirmed the role of P. aeruginosa RMD in the pathway from work that revealed that rmd knockout mutants lacked A-band O-antigen [92]. Amino acid homology of RMD shows that it is similar to enzymes that modify sugars at the C4 and C6 positions. Maki et al. verified the role of P. aeruginosa RMD in the conversion of the 4-keto-6-deoxy intermediate to GDP-D-rhamnose by co-expressing the Helicobacter pylori gmd and P. aeruginosa rmd genes in Saccharomyces cerevisiae [54]. However. GMD from P. aeruginosa has not yet been functionally characterized. Furthermore, the entire pathway has only been functionally characterized in A. thermoaerophilus, where D-rhamnose is a constituent of the surface layer glycoprotein of this bacterium [3]. In this study we developed a capiMary—electrophoresis (CE) -based enzymatic assay to confirm the involvement of the enzymes GMD and RMD from P. aeruginosa in the biosynthesis of GDP-D-rhamnose. To supplement the X-ray analysis experiments (Chapter 2), assays were also completed on RMD from A. thermoaerophilus. We demonstrate dehydratase activity in GMD and reductase activity in both GMD and RMD. Furthermore, we present a method for the enzymatic synthesis of GDP-D-rhamnose, an important glycobiological building block. 3.2. Experimental Procedures 3.2.1. Cloning, Expression and Purification of GMD and RMD. Procedures outlined in Section 1.2.1. and 1.2.3. were used to produce purified GMD. Likewise. procedures stated in Sections 2.2.1. and 2.2.2. were used to produce purified RMD from both P. aeruginosa and A. thermoaerophilus. (Unless otherwise noted, discussion will center on RMD from the bacterial source P. aeruginosa.) 62 3.2.2. Determination of the Oligomeric State by Gel Filtration. A HiPrep 16/60 Sephacryl S-200 High Resolution gel filtration column (Amersham Biosciences), calibrated with high and low molecular weight calibration kits (Amersham Biosciences), was used to determine the oligomerization of both GMD and RMD. A 1 ml sample was prepared using protein in 20 mM Tris pH 8.5 and 300 mM NaCl with or without 10 mM GDP-D-mannose or 5 mM NADP. Samples were applied to the column and protein elution was detected at 280 nm. Molecular weights of the samples were estimated based on the calibration curve of K8,. [(Ve-Vo)/(Vt-Vo)| vs. log molecular weight (where Ve is the elution volume for the protein, V0 is the void volume and Vt is the total bed volume). 3.2.3. Assay of GMD and RMD activity. The standard reaction buffer was 40 mM Tris-HCI (pH 7.5) and 10 mM MgCl2 with a total reaction volume of 35 to 60 uL. Typically, the initial GMD reactions contained 1.0 uM GDP-D-mannose and 0.1 mM or 0.01 mM NADP. The reaction was initiated by the addition of l to 10 ug of purified protein and incubated at 37°C. The completion of the GMD reaction was judged by CE analysis (~1—2 h). For the coupled assay, 1 to 10 ug of RMD was added directly to the reaction tube along with 0.1 mM or 0.01 mM NADPH. For the sequential assay, GMD was removed by filtration using a microcon YM- 10 filter cartridge; the filtrate containing the labile 4-keto intermediate was used as a substrate for the RMD/NADPH reaction. For the determination of the pH optimum for GMD, the following reaction buffers were used: MES (pH 5, 5.5, 6.5), Tris (pH 7.0, 7.5, 8.0) and Bis-Tris propane (pH 9.0, 10.0). 3.2.4. Analysis of Reaction Products by Capillary Electrophoresis. CE analysis was performed with a P/ACE MDQ Glycoprotein System (Beckman Coulter) with UV detection. A 75 um x 50 cm bare silica capillary was used with the UV detector mounted 63 at 50 cm of the capillary. The CE analysis was run in 25 mM sodium tetraborate (pH 9.5) at 25°C, and sample was introduced by pressure injection for 8 sec. The separation was performed at 22 kV, and the sugar-nucleotide substrate and product were detected at 254 nm. Peak integration was performed using the Beckman 32 Karat Software. 3.2.5. Purification of Sugar Nucleotide Product via HPLC. A preparative scale enzymatic reaction containing 30 umol of GDP-D-mannose and 1.5 mg of purified GMD was incubated for 2 h at 37°C in 40 mM Tris pH 7.5 and 10 mM Mngl. Protein was subsequently removed from the completed reaction by unltrafiltration through a Centriplus YM-3 cartridge (Millipore). The filtrate was subjected to an Econo-Pac High Q anion exchange column over a 0-500 mM triethylammonium bicarbonate buffer (pH 8.0) gradient. Fractions that contained GDP-D-rhamnose were pooled, acidified to pH 4.8 with AG 50W-X4 resin (Bio-Rad) and lyophilized in preparation for NMR analysis. 3.2.6. NMR Analysis of Reactions Product. The lyophilized sample containing GDP-D—rhamnose was resuspended in 160 pl of 99% D20 and analyzed by NMR spectroscopy. All spectra were acquired using a Varian Inova 500 MHz spectrophotometer equipped with a Z-gradient triple resonance (‘H, l3C, 3'P) probe. The experiments were performed at 15°C with suppression of the water resonance. The methyl resonance of acetone was used as an internal reference at 6,, 2.225 ppm and 6C 31.07 ppm. The COSY. TOCSY, HMQC and 3'P HSQC experiments were used to assign the resonances [93]. 3.3. Results 3.3.1 Protein Expression, Purification and Determination of Oligomeric Status. The results for GMD and RMD cloning, expression and purification are summarized in 64 Section 1.3.1. and 2.3.1. respectively. Gel filtration analysis of recombinant GMD suggested that it exists as a tetramer in similar buffer conditions used for crystallization (Chapter I) as well as functional studies (Figure 19a). In contrast, recombinant RMD appears to exist as a dimer (Figure 1%). See Table 4 for elution volumes. > ii; 2A ‘1 I . ‘ A r i i ' l, i I .‘ , 2 f l i l . 2 ‘ l I ‘ r I l . I‘ ~ I i ' i ‘ l I I 8 . l V l . . ‘. . ./' ‘. l . .' ‘ . I ' L. . ‘ i l i ‘ ~ 1 V l '. . ' , ; j", i ' . ‘ l l l ‘5 l ' . I l i l ‘ I . f , . . V . ‘ , l v, ,- l i l . . l “ u l '2 3 h . .7 "V A‘ f 2 l, \‘. \. 1' r ‘ ‘ ~ ‘ \ '.' (,1 i ,4 I / \\.;\'r- . 1’ /I . l ,- . I l l l l ‘7‘ ire-“M. ‘ (NW w l l l \ 171"“ ""T . l l I '. , . l. T‘ t 4 ‘1‘ ‘r'h—e . = l l 3 3 ' I ! ' l l l t , l i I l .I l 2 . : , . , I l l .—. . , , ‘ » I lA ’I It I l i . l 5 , ‘ i l l l ' l ‘ I . i I i l x. I . l ‘ I| ' I . i j l l i i ll 1 I i . ' l l , l ‘ t t l l I « : I i ., ‘ l j l \ l t l | l , _. . J t ' y‘ l . . ‘ I . ~ ‘ l l ; 3 l v .‘ 1 n l ‘ ‘- u I i , i ' - -. 19 ‘1 l‘ r, f '1 x I l' \\ /’ \ i '. \ ‘ 1' ‘ \ J .I \ , ,‘ __ \\ N. l .‘ ‘ ‘l I '/ \ ' 2“. \ ‘\ .‘ / ‘ 'l 4- " \- 1/ 33“.. _ \ a i’ " \\ I l . ‘ ,1 ,‘i x; t '- 4 ~ JT/N' ~. —-L -I’ . ' 4 l "~- I I l l ' - ' ' ‘ . l , . t; *‘Jm-zzr—fm i‘_ ; L»: in- ii;- “L- . .i/ l lNT'h‘MW ' ' ' ' l ' l ' 5 l 1 l I I I l l . l l l . , I Figure 19. Gel filtration analysis of GMD and RMD . I 1 63'4"... 1.1:: -. -.- flti- - F:t' . A.- .42 -:. 'i‘. " " lit—taltdfl" I I (A) Gel filtrations analysis of GMD and standards, and (B) RMD and standards. Peaks: (1) ferritin, (2) catalase, (3) aldolase, (4) albumin, (5) ovalbumin, (6) chymotrypsinogen A, (7) ribonuclease, (8) GMD, (9) RMD, (10) RMD/GDP-mannose/NADPH. 65 Table 4. Gel filtration results of GMD and RMD Peak Sample Molecular Weight Elution Volume, Ve (Da) (ml) 1 Ferritin 440,000 36.5 2 Catalase 232,000 42.7 3 Aldolase 158,000 45.5 4 Albumin 67,000 49.8 5 Ovalbumin 43,000 55.8 6 Chymotrypsinogen A 25,000 69.4 7 Ribonuclease 13,700 75.7 8 GMD 48.] 9 RMD 52.9 10 RMD/NADPH/GDP-man 56.5 3.3.2. GMD catalyzes the dehydration of GDP-D-mannose. CE analysis of the GMD-catalyzed reaction containing GDP-D-mannose in the presence or absence of NAD or NADP resulted in the appearance of a new peak (Figure 20). This new peak putatively corresponds to the formation of the intermediate product of GMD, GDP-4-keto-6-deoxy- D-mannose. No standard is available for this intermediate product, which is known to be labile making it difficult to analyze. The apparent dehydratase reaction of GMD proceeded quantitatively and irreversibly and required no exogenous cofactor for activity. To test the pH optimum for GMD activity, the reaction was completed under pH range of 5 to 10; GMD showed optimum activity between pH 7 and 8 (Figure 20). 66 k) >u— 1 [>5 0 > >> [2 F > 1: 3:9 11 ’l j a.u. A254 J 6.0 Time (min) Figure 20. CE analysis of GMD dehydratase activity from pH 5 to 10 (1) GDP-D-mannose, (2) putative GDP-4-keto—6-deoxy-D-mannose. 3.3.3. RMD catalyzes the reduction of GDP-4-keto-6-deoxy-D-mannose. Due to the instability of the intermediate product of GMD, the 4-keto-6-deoxy-D-mannose intermediate was prepared in situ. First the GMD-catalyzed reaction was incubated for 2 h, and the enzyme was removed by filtration. The filtrate containing the 4-keto- intermediate was followed by CE analysis after incubation with NADPH (Figure 21f), NADP (Figure 21e), NADPH/RMD from P. aeruginosa (Figure 21c) and NADPH/RMD from A. thermoaerophilus (Figure 21b). No new peak was observed in the control reactions with just NADPH or NADP, ruling out the possibility that reduction of the labile intermediate was either spontaneous or by any reducing power of the buffer. However, a new peak, the putative GDP-D-rhamnose. was observed upon the addition of NADPH/RMD from either bacterial species. The novel peak was verified by spiking the 67 final reaction with GDP-D-mannose (Figure 21a). Similar results were observed in the coupled reactions (data not shown). The apparent reductase activity of RMD proceeded quantitatively and required exogenous NADPH. No reverse reaction was observed. A L ii A) + GDP-D-man : B) NADPH/AtRMD E 4‘4 . 1 <2 . C) NADPH/PaRMD . ,l H. .3 l________;(~jl j l D) NADPH/GMD . A ILL , E) NADP . L A F) NADPH Time (min) Figure 21. CE analysis of GMD/RMD reactions The GDP-4-keto-6-deoxy intermediate was treated with the components noted; the final reaction was spiked with GDP-D-mannose (A). Arrows: (1) GDP-D-mannose, (2) GDP- D-rhamnose, (3) GDP-4-keto—6-deoxy-D-mannose, (4) NADP, (5) NADPH. 3.3.3. ls GMD bifunctional? Interestingly, CE analysis revealed that the same peak corresponding to GDP-D-rhamnose was observed when GMD/NADPH was added to the filtrate (containing just the 4-keto-intermediate) (Figure 21d) as when RMD/NADPH was added, indicating GMD displays bifunctionality in vitro. To test this observation further, the conversion of the 4~keto-intermediate by GMD was followed in a time-dependent fashion at 0 h (Figure 22d), 1 h (Figure 22c) and 2.5 h (Figure 22b), revealing the 68 quantitative conversion of the 4-keto-intermediate to GDP-D-rhamnose. The novel peak was again verified by spiking the reaction with GDP-D-mannose (Figure 22a). The reductase activity of GMD was dependent upon the addition of exogenous NADPH to the reaction and no reverse reaction was detected. ‘— e ‘— {—5 {—6. M 1%.... ' ‘ A) +GDP-D-man 31 - 3. l