nag»? 5.- % zvsomr J 15.5.; .J. . swag?» . V? . fluviri. . if}. , u». ,. z in a, . 19.96% . 4.1.1.4..“ , , . 3.4.»... ‘ ”Wmfia , ‘ «a: . r... . Muzak; hrs: .{EE ‘6... b I. . V in. t e :4. up; . . = {131.55. 11:443.... \ tn nah-(mu hi. :52: :w “in". ..!».m 22.... uh. . uéaizm". 4r 3‘! . 37: )Ib .. 9.6.1.. havmzwu. h av: . 1 o .I 49% WW. I \:runtufl3nn. . .1 . figmé .M $115.55 I 1|.v .5 ‘53,“. v“ 2‘ 2. 17 ‘ a! . A uninhai. r at? SH #5.}...53 Qua—3“ - w... _-,, ..I\& .7 £1} . I» . r, {fail r LIBRARY Michigzm State University This is to certify that the dissertation entitled OVER-EXPRESSION, PURIFICATION, CHARACTERIZATION AND CRYSTALLIZATION OF RECOMBINANT SNAPC VARIANTS presented by Andrej Hanzlowsky - -.—c-.-.- -.-.-.--A_._ has been accepted towards fulfillment of the requirements for the Doctoral j degree In Chemistry ’ Major 5/1/76) fesso7$ignature Date MSU is an Affirmative Action/Equal Opportunity Institution PLACE IN RETURN BOX to remove this checkout from your record. To AVOID FINES return on or before date due. MAY BE RECALLED with earlier due date if requested. DATE DUE DATE DUE DATE DUE 2/05 p:/ClRC/DateDue.indd~p.1 OVER-EXPRESSION, PURIFICATION, CHARACTERIZATION AND CRYSTALLIZATION OF RECOMBINANT SN APc VARIANTS By Andrej Hanzlowsky A DISSERTATION Submitted to Michigan State University In partial fulfillment of the requirements For the degree of DOCTOR OF PHILOSOPHY Department of Chemistry 2006 ABSTRACT OVER-EXPRESSION, PURIFICATION, CHARACTERIZATION AND CRYSTALLIZATION OF RECOMBINANT SNAPC VARIANTS By Andrej Hanzlowsky In humans, the small nuclear RNA (snRN A) genes are transcribed by either RNA polymerase II or III. Different protein-protein and protein-DNA interactions are responsible for polymerase specificity and a large number of transcription factors are involved in this process. The small nuclear RNA activating protein complex (SNAPC) is required for transcription initiation for both classes of snRNA genes. SNAPC is a five subunit complex responsible for recognition of the proximal sequence element (PSE) on the DNA and interacts with multiple transcription factors that guide the transcription process. Co-expression of SNAPC subunits in baculovirus resulted in small quantities of material that is active for both the DNA binding and transcription. On the other hand, expression of the subunits in bacteria and reconstitution of the complex resulted in material capable of DNA binding, but not capable of transcription initiation. Here we present a method for efficient expression and purification of a mini SNAPC (mSNAPc) complex, composed of SNAPl90(l—505), SNAPSO, SNAP43, and SNAP19 in bacteria. The complex was co-expressed in E. coli and was active for both DNA binding and transcription initiation from snRNA genes. Furthermore, the amount of recovered material is greatly increased compared to any other previously described method. The larger amounts and increased purity of the complex obtained by this new approach enabled structural studies and analysis of zinc content in mSNAPc. We determined that the SNAPSO subunit of the complex contains a zinc finger domain that is involved in DNA binding of the complex. Crystallization conditions were identified, yielding protein crystals that diffracted both electrons in a transmission electron microscope and x-rays. Increased production of this complex is a step forward in understanding how the SNAPC recognizes DNA, and how it interacts with a multitude of transcription factors. ACKNOWLEDGMENTS I sincerely thank my advisor, Prof. James H. Geiger for his guidance and support throughout the research. I would like to thank our collaborator Prof. Bill Henry for providing DNA materials and for his countless discussions regarding this research. I would like to thank my co-workers Blanka, Justin, Suzan, Xiaofei and Fang for their direct involvement in this project. I would also like to thank all present and past members of Dr. Geiger’s and Dr. Henry’s research groups for their help and advice. I would like to thank Dr. Kathleen Foley for introducing me into the world of cloning. I must sincerely thank Gauri W. Jawdekar for performing all DNA binding and transcription initiation assays, Dr. Alicia Pastor-Leicha for her help with the transmission electron microscopy, Dr. Kathryn Severin for her help with flame atomic absorbtion, David Szymanski for assistance with the ICP-MS, Dr. Aizhuo Liu for his assistance with NMR, and Janet Haun for her advice on non-research related subjects. iv TABLE OF CONTENTS ACKNOWLEDGMENTS ................................................................................................. iv TABLE OF CONTENTS .................................................................................................... v LIST OF FIGURES ......................................................................................................... viii LIST OF TABLES ............................................................................................................ xv LIST OF ABBREVIATIONS .......................................................................................... xvi 1. Introduction ................................................................................................................... l 1.1. RNA polymerase and promoters .............................................................................. 2 1.1.1. RNA polymerase I ............................................................................................ 2 1.1.2. RNA polymerase II ........................................................................................... 3 1.1.3. RNA polymerase III .......................................................................................... 6 1.2. Composition of human RNAP transcription initiation complexes .......................... 9 1.2.1. RNA polymerase I transcription initiation complex ......................................... 9 1.2.2. RNA polymerase II transcription initiation complexes .................................. 10 1.2.3. RNA polymerase III transcription initiation complexes ................................. 11 1.3. SNAPC .................................................................................................................... 13 1.3.1. The composition of SNAPC ............................................................................ 13 1.3.2. The SNAP complex and chromatin ................................................................ 15 1.4. Proteins that interact directly with the SNAP complex ......................................... 17 1.4.1. TBP is associated with SNAPC through multiple interactions ........................ 17 1.4.2. Oct-1 POU domain relieves self inhibitory properties of SNAPC towards DNA binding ............................................................................................................. 18 1.4.3. p53 associates with SNAPC and TBP ............................................................. 19 1.4.4. Rb associates with SNAPSO and SNAP43 ...................................................... 19 2. Results and discussion ................................................................................................ 21 2.1. SNAP subunits - overview ..................................................................................... 21 2.1.1. SNAP19 .......................................................................................................... 21 2.1.1.1. Over-expression, purification and crystallization of SNAP19 ......... 22 2.1.1.2. Cloning, over-expression, purification and crystallization of SNAP19(1-85) ...................................................................................................... 24 2.1.1.3. Nuclear magnetic resonance analysis of SNAP19 ............................ 25 2.1.2. SNAPl90(1-505) ............................................................................................ 30 2.1.2.1. Over-expression and purification of SNAPl90(l-505) .................... 31 2.1.2.2. Crystallization of SNAPl90(l-505) ................................................. 32 2.1.2.3. Crystallization of SNAP190(1-505)/SNAP19 complex ................... 32 2.2. Over-expression and purification of SNAPC subunits ........................................... 33 2.3. Cloning of plasmids for co-expression .................................................................. 39 2.4. Over-expression of mSNAPc variants ................................................................... 43 2.5. Comparison of the individual expression of SN APc subunits to the co-expression system ........................................................................................................................... 46 2.6. Purification of mSNAPc ........................................................................................ 48 2.7. DNA binding activity of mSNAPc variants ........................................................... 51 2.8. mSNAPcy4 is capable of recruiting TFIIIB to the promoter ................................. 53 2.9. Transcription initiation mediated by mSNAPc ...................................................... 56 2.10. Zinc finger domain in SNAPSO ........................................................................... 57 2.10.1. ICP-MS and FAAS analysis of mSNAPc variants ....................................... 60 2.10.2. Ab-initio modeling of SNAPSO C-terminus ................................................. 63 2.10.3. Mutagenesis .................................................................................................. 63 2.10.4. DNA binding activity of SN AP50 mutants .................................................. 65 2.10.5. Transcription initiation assay ........................................................................ 66 2.11. Crystallization of mSNAPc variants .................................................................... 67 2.12. X-ray diffraction of mSNAPc crystals ................................................................. 70 2.13. Evaluation of mSNAPc crystals using transmission electron microscopy (TEM) ....................................................................................................................................... 70 2.14. Diffraction of mSNAPc crystals using TEM ...................................................... 73 2.15. Purification optimization of mSNAPcy4 ............................................................. 77 . Methods and materials ............................................................................................... 91 3.1. Preparation of expression plasmids ....................................................................... 91 3.1.1. Cloning of the SNAPSO subunit ..................................................................... 91 3.1.2. Cloning of the SNAP43 subunit ..................................................................... 91 3.1.3. Cloning of the SNAP19 subunit ..................................................................... 92 3.2. Preparation of expression host ............................................................................... 92 3.2.1. Preparation of competent cells for heat shock ................................................ 92 3.2.2. Preparation of competent cells for electroporation ......................................... 93 3.2.3. Preparation and use of a glycerol stock .......................................................... 93 3.2.4. Transformation into the cloning bacterial host ............................................... 93 3.2.5. Transformation into the expression bacterial host Codon Plus RIL ............... 94 3.2.6. Transformation into the co-expression bacterial host ..................................... 94 3.2.7. Electroporation ................................................................................................ 95 3.3. Growth media ......................................................................................................... 95 3.3.1. Preparation of culture plates ........................................................................... 95 3.3.2. LB growth media ............................................................................................ 96 3.3.3. TB growth media ............................................................................................ 96 3.3.4. M9 minimal media .......................................................................................... 96 3.4. Over-expression ..................................................................................................... 97 3.4.1. Over-expression of individual SNAP subunits ............................................... 97 3.4.2. Co-expression of mSNAPcy3 and mSNAPcy4 ............................................... 97 3.5. Purification ............................................................................................................. 98 3.5.1. GST purification ............................................................................................. 98 3.5.2. GST purification in the absence of a detergent ............................................... 99 3.5.3. Ion-exchange purification ............................................................................... 99 vi 3.5.4. Gel filtration purification .............................................................................. 100 3.6. SDS-PAGE .......................................................................................................... 100 3.7. Western blotting ................................................................................................... 101 3.8. DNA purification ................................................................................................. 102 3.9. TEM analysis of mSNAPc crystals ...................................................................... 102 3.9.1. Cross-linking of mSNAPc crystals ............................................................... 102 3.9.2. Preparation of the TEM grid ......................................................................... 103 3.9.3. TEM analysis ................................................................................................ 103 3.10. Quantification of zinc in SNAP50 ..................................................................... 104 3.10.1. Determination of protein concentration ...................................................... 104 3.10.1.1. UV absorbance in 6 M urea ............................................................ 104 3.10.1.2. Standardized Bradford assay ........................................................... 104 3.10.2. Flame atomic absorption ............................................................................. 105 3.10.3. ICP-MS ....................................................................................................... 105 4. References ................................................................................................................... 107 vii LIST OF FIGURES Figure 1: The upstream control element (UCE, -180 to -107) and the control promoter (CP, -45 to +20) are the regulatory regions required for efficient targeting by RNA polymerase I. ....................................................................................................................... 3 Figure 2: mRN A promoters are often characterized by the presence of a TATA box. Other promoter regions are often present upstream or downstream of the transcription start site. .............................................................................................................................. 4 Figure 3: H. sapiens U1 snRNA promoter region determined by the presence of both the DSE and the PSE, but by the absence of the TATA box. ................................................... 5 Figure 4: A. thaliana snRNA recognized by RNAP H contains the upstream sequence element (USE) and a TBP recruiting TATA box7. ............................................................. 5 Figure 5: snRN A promoter region from D. melanogaster with two promoter elements recognized by RNA polymerase H7. ................................................................................... 6 Figure 6: X. laevis SS RNA gene contains three elements in the internal control region (ICR) which can be divided into box A (+50 to +60), an intermediate element (IE, +67 to +72) and box C (+80 to +90). A simple run of T residues determines the end of the genes. ............................................................................................................................................. 7 Figure 7: X. laevis tRNA'eu internal promoter region consisted of boxes A (+8 to +19) and B (+52 to +62)8. .................................................................................................................. 7 Figure 8: H. sapiens U6 snRNA with gene external promoter regions. The DSE (-240 to - 215), the PSE (~65 to -48) and the TATA box (-32 to -25) are all located upstream of the transcription start site8 ......................................................................................................... 8 Figure 9: The S. cerevisiae U6 gene with a combination of gene internal A-box (+21 to +31) and gene external promoter elements; The TATA box (-30 to -23) upstream from the trapscription start site and B-box downstream (+234 to +244) from the U6 coding region . ................................................................................................................................ 9 Figure 10: H. sapiens RNA polymerase I transcription from an rRNA promoter with SL1, a TBP containing complex and UBF 1, a multi-subunit transcription factor targeting the USE and the CP regions respectively. Placement of TFIs on the figure is arbitrary. ....... 10 Figure 11: H. sapiens polymerase II transcription initiation complex: (A) an example of a mRNA promoter with a TATA box, recognized by the TBP or by the TBP containing TFIID complex and (B) TATA—less U1 snRNA type of promoter, with the PSE sequence recognized by the SNAP complex. ................................................................................... 11 viii Figure 12: H. sapiens polymerase III transcription initiation complex: a U6 snRNA promoter (A) with the PSE sequence, targeted by the SNAP complex, and the TATA box, targeted by the TFIIIB like complex, assembled from TBP, Brf2 (BrfU) and del. (B) TATA-less tRNA type of promoter with boxes A and B targeted by TFIIIC. TFIIIB is required for transcription initiation. .................................................................................. 12 Figure 13: Protein—protein/DNA contact map of SN APc. The complex is composed of five subunits SNAP190 interacts with TBP using the TBP recruitment region (TRRl) from amino-acids 34 to 84 and with Oct-1 using the OIR (Oct-1 interacting region) from amino-acids 888 to 912. RSRR is the arginine and serine rich area of SNAP190, a target for phosphorylation. The unusual Myb domain is responsible for DNA binding of SNAP190 (adopted from Hernandez et al.28). .................................................................. 15 Figure 14: Proposed model of SNAPC recruitment to the U6 promoter. SNAPc is recruited to the promoter by interaction with Oct-1 and TBP is recruited to the TATA box by interaction with SNAPC. The N-terminal domain of SNAP190, SNAP43 and SNAP45 are involved in the interaction with TBP. SNAP190 contains three main domains; the N- terminal region, the Oct-1 interacting region (OIR) and the C-terminal region responsible for the interaction with SNAP45. SNAP190 and SNAP50 interact with the PSE region of the DNA. Positioning of the nucleosome is crucial for Oct-l dependent activation. ....... 16 Figure 15: TBP contains a highly conserved C-terminal domain and an N-terminal domain that can be divided into three regions. Region II is a stretch of glutamines and the number of glutamine residues varies between organisms ................................................. 18 Figure 16: SNAP19 purified over GSH resin. Protein and beads suspension after 10 min (CLl), 20 min (CL2), 2 h (CL3) and 6 h (CL4) thrombin cleavage. Cleaved SNAP19 (ELI and EL2, two successive fractions of the eluted material). ..................................... 22 Figure 17: Ion-exchange purification of SNAP19. Consecutive fractions containing highly purified SNAP19 (IE1, IE2 and IE3). 15 mg of SNAP19 purified by affinity chromatography were diluted 3 fold, to bring the salt concentration below 50 mM and loaded onto Source-Q ion-exchange resin (15 mL). The protein material was eluted from the column by a salt gradient (0-70%, 1 M KCl). 8-10 mg of highly purified SNAP19 were recovered in two fractions (IE2, IE3) ....................................................................... 23 Figure 18: Commassie blue stained SDS-PAGE of GST purified SNAP19(1-85) (lane 2) and wild type SNAP19 (lane 3). ....................................................................................... 24 Figure 19: 1H NMR spectra of SNAP19. .......................................................................... 26 Figure 20: HSQC (heteronuclear single quantum coherence transfer) spectra of SNAP19. ........................................................................................................................................... 28 Figure 21: 1H NMR spectra of SNAPl90(84-133) peptide, SNAP19 and the complex obtained by mixing the two proteins ................................................................................. 30 ix Figure 22: SNAPl90(l-505) purification. GST-SNAPl90(1-505) immobilized on the GSH agarose (lane 2) and cleaved and eluted material (lanes 3 and 4) ............................ 31 Figure 23: Purification of (A)SNAP190(1-505) and (B)SNAP43 using GST affinity chromatography. The proteins were purified from equal culture volumes under identical conditions. The majority of SNAP43 can not be recovered and the material that eluted off the beads sticks to the reaction vessel in a short time. The material can be recovered by boiling in the sample buffer for SDS-PAGE. ................................................................... 34 Figure 24: Purification of (A) SNAP50 showing the strong presence of an unidentified protein that can not be removed by subsequent purification and (B) SNAP19 using GST affinity chromatography. The proteins were purified from equal culture volumes under identical conditions. .......................................................................................................... 35 Figure 25: Electro mobility shift assay (EMSA) of the recombinant mSNAPc (mrSNAPc) assembled from individually expressed and purified subunits, and endogenous SNAPc obtained from HeLa cells (monoQ fractions). Increasing amounts of mSNAPc that was assembled from subunits individually expressed in E. coli (approximately 1, 2.5, and 7.5 ng; lanes 5—7, respectively) were added to EMSA reactions containing dsDNA probes containing a wild-type (wt) or mutant (mu) PSE, as indicated. Reactions loaded in lanes 3 and 4 contained 7.5 uL of partially purified endogenous SNAPC (approximately 0.3 ng SNAPC/mL). Reactions containing only the probe DNAs are shown in lanes 1 and 2 (experiment performed by Craig Hinkley) 58. ................................................................... 36 Figure 26: In-vitro U6 transcription initiation assay of the recombinant mSNAPc. Increasing amounts of mSNAPc were added to human U6 in vitro transcription reactions for which the HeLa whole cell extract (WCE) was treated with oc-SNAP43 antisera to remove endogenous SNAPC, as shown in lanes 6—8. Lane 4 shows the decreased signal for the correctly initiated U6 transcription upon removal of endogenous SNAPC. Lane 5 shows the U6 signal dependent upon addition of endogenous SNAPC obtained from biochemical fractionation of a HeLa cell nuclear extract. Note, this signal is comparable to the mock depleted WCE (lane 3) or WCE alone (lane 1)“. ......................................... 38 Figure 27: The pGST190(1-505) plasmid with pET origin of replication and ampicillin resistance. The SNAPl90(l—505) sequence follows the sequence for the GST tag and the thrombin recognition sequence. ........................................................................................ 40 Figure 28: pCDF43/50-1 plasmid with CDF origin of replication and streptomycin resistance. SNAP50 gene is in MCSI and SNAP43 is in MCSII ...................................... 42 Figure 29: pRSFl9-l plasmid with RSF origin of replication and kanamycin resistance with SNAP19 gene in MCSI and un-utilized MCSII. ...................................................... 43 Figure 30: Analysis of expression for SNAPC subunits in the individual expression and co-expression approach by Western blot. GST labeled protein were visualized using 0t- GST antibodies and the SN APc subunits were visualized using antibodies directed against the subunit of interest. .......................................................................................... 47 Figure 31: Purification of co-expressed mSNAPcy4 using the GST affinity resin and thrombin cleavage. The mSNAPcy4 was purified under the same conditions as the individual subunits. ........................................................................................................... 49 Figure 32: Recovery of the co-expressed material compared to individually expressed and purified subunits. SNAP19 improves complex stability and subunit recovery. Subunit recovery for partial SNAPC assembled without and with co-expressed SNAP19 (mSNAPcy3 and mSNAPcy4, respectively) was estimated by SDS—PAGE and Coomassie blue staining. Note the increased recovery of SNAP43 and SNAP50 in mSNAPcy4 (lane 7) compared to mSNAPcy3 (lane 6). Lanes 2—5 contain aliquots of the individually expressed SNAPC subunits for reference. Note that the unrecovered SN AP43 is used to show the migration of SNAP43. ....................................................................... 50 Figure 33: mSNAPcy4 facilitates TBP promoter recruitment. Increasing amounts of mSNAPcy4 (3 and 10 ng) were added to DNA binding reactions containing radiolabeled probes that harbor a wt PSE and wt TATA box (lanes 1—4), mu PSE and wt TATA box (lanes 5—8), wt PSE and mu TATA box (lanes 9—12), or mu PSE and mu TATA box (lanes 13—16). Lanes 4, 8, 12, and 16 contain 50 ng of recombinant full-length human T BP in addition to 10 ng of mSNAPcy4. Reactions containing only the DNA probes are shown in lanes 1, 5, 9, and 13. Positions of the mSNAPcy4 and mSNAPcy4 lus TBP complexes are indicated (experiment was performed by Gauri W. J awdekar) 8 .............. 52 Figure 34: mSNAPcy4 is capable of recruiting TFIHB to the promoter. Coordinated DNA binding by SNAPC and TBP facilitates higher order complex assembly with Brf2 and de1. DNA binding reactions were performed with the indicated combinations of SNAPCy4 and Brf2-TFIIIB subunits. These results suggest that preinitiation complex assembly follows the order SNAPc>TBP>Brf2>de1 (experiment was performed by Gauri w. Jawdekar)” ........................................................................................................ 54 Figure 35: SNAPC stimulates DNA binding by Brf2-TFIIIB. Electrophoretic mobility shift assays were performed using a U6 probe containing a wild-type mouse U6 PSE and a wild-type TATA-box (AC probe). DNA binding was carried out in the absence (lanes 1-8) or presence (lanes 9-12) of wild type SNAPCy4. Reactions containing individual TBP, Brf2, and de1 (1-470) subunits are shown in lanes 2-4. Reactions containing pair wise combinations of TBP with Brf2, TBP with de1 (1-470), and Brf2 with del (1- 470) are shown in lanes 4, 5, and 8. DNA binding by the complete Brf2-TFIIIB complex in the absence of SNAPC is shown in lane 7. Additional reactions were performed with SNAPC alone (lane 9) or in combination with Brf2-TFIIIB subunits (lanes 912), as indicated. Lane 1 shows migration of the probe alone. The relative positions of the various SN APcy4/Brf2-TFIIIB complexes are shown on the right (experiment was performed by Gauri W. J awdekar)5 9. ................................................................................ 55 Figure 36: mSNAPcy4 functions for U1 snRNA transcription by RNA polymerase H. HeLa cell nuclear extract was either mock depleted with a preimmune rabbit sera (lane 2) or a-SNAP43 antisera (lanes 3—10) to deplete endogenous SNAPC. Extracts were then used for human U1 in vitro transcription assays. The Ul-specific signal was diminished upon removal of endogenous SNAPC, as shown in lane 3. Increasing amounts of xi mSNAPcy4 (0.08, 0.25, 0.75, 2.5, 7.5, 25, and 75 ng) reconstituted the correctly initiated transcription from a human U1 reporter, as shown in lanes 4—10. Lanes 1 and 2 show the U1 signal obtained from either untreated or mock depleted reactions (experiment was performed by Gauri W. J awdekar)5 8. ................................................................................ 56 Figure 37: mSNAPcy4 functions for U6 snRNA transcription by RNA polymerase III. In vitro transcription of human U6 snRNA was carried out using HeLa cell nuclear extract that was treated as in Figure 36. Lane 2 shows the reduced U6 signal upon removal of endogenous SNAPC. Increasing amounts of mSNAPcy4 (0.08, 0.25, 0.75, 2.5, 7.5, 25, and 75 ng) reconstituted the correctly initiated transcription from a human U6 reporter as shown in lanes 3—9. Approximately 75 ng of GST was added to the transcription reaction shown in lane 10 (experiment was performed by Gauri W. J awdekar)5 8. ........................ 57 Figure 38: Sequence alignment of human SNAP50 C-terminal amino acids (301-411) with corresponding regions from SN AP50 homologues of other species. Putative zinc fingers similar to TFIIIA (Hstmex3C) and steroid receptors (szcxzzcsz) are indicated as region 1 and region 2, respectively. The sequence of highly conserved amino acids derived from this alignment corresponds to Lx4GX6Hx3CxHx20-23YPx1 1- 1ZCXZCXIng3-4Cx2CFx3Hx1-4G. This alignment was performed using the Clustal W program. Homo sapiens (NP003075), Cannisfamiliaris (XP853813), Bos taurus (AAX08912), Mus musculus (NP084225), Rattus norvrgicus (NP001013230), Drosophila melanogaster (NP724647), Drosophila pseudaoobscura (EAL25490), Trypanosoma brucei (XP827295), Arabidopsis Ihaliana (AAO30067), Caenorhabditis elegans-l (NP500819), Caenorhabditis elegans-Z (NP497807), Plasmodium falciparum (), Danio rerio (XP694501), Leishmania major (XP843572), Anopheles gambiae (XP310411), Dictyostelium discoideum (XP644064), Entamoeba histolytica (XP653151). This comparison was prepared by Gauri W. J awdekarS 9. ................................................. 59 Figure 39: Flame atomic absorption spectroscopic analysis of mSNAPcy3 and mSNAPcy4. Unlabeled data points represent the absorption obtained for standard zinc solutions. ........................................................................................................................... 61 Figure 40: Mutations in the SNAP50 zinc finger domain do not disrupt SNAPC assembly. Approximately 20 ng of each of the SNAPcy4 complex containing substitution mutations in HA-SNAPSO were affinity purified first using glutathione agarose to pull down GST- SNAP190 (1-505) followed by immunoprecipitation with (it-SNAP43 antibodies. Associated wild type or mutant HA-SNAPSO was detected by a-HA Western analysis (lanes 6-22). A titration of wild type SNAPcy4 using 8, 4 and 2% of the input material is shown in lanes 1-3, respectively. Lanes 4 and 5 contain wild type SNAPcy4 recovered with the protein-G agarose beads alone or with pre-immune serum, respectively. The bottom panel represents 4% of the input material directly analyzed by oc-HA Western analysis (experiment was performed by Gauri W. Jawdekar)”. ...................................... 64 Figure 41: Mutations in the SNAP50 zinc finger domain disrupt DNA binding by SNAPC. Increasing amounts (3 and 10 ng) of SNAPcy4 with wild type (lanes 2 and 3) or mutant HA-SNAPSO (lanes 4-35) containing the indicated substitution mutations was tested in an EMSA for binding to a radiolabeled DNA probe containing a high affinity xii PSE and TATA box (AC probe). Lane 1 shows the probe alone with no added proteins. Mutations H3l3-A, C317-A, and H319-A resulted in reduced DNA binding, whereas mutations C354-A, C357-A, C380-A, and C383-A completely abolished DNA binding activity. Mutation H388—A also exhibits weakened DNA binding activity (experiment was performed by Gauri W. J awdekar)5 9 .......................................................................... 65 Figure 42: U1 and U6 in-vitro transcription initiation assay of wt or mutant HA-SNAPSO containing mSNAPcy4. Mutations that resulted in reduced DNA binding also showed reduced transcription initiation activity for both types of promoters, with the exception of the H388-A mutation, which resulted in weakened U1 activity, but wild type U6 activity of transcription initiation59. HeLa cell nuclear extract was depleted with u-SNAP43 antibodies to immunodeplete endogenous SN APc. In vitro U1 and U6 transcription was then tested in the absence (lanes 1) or presence of purified SNAPcy4 (5 ng) containing wild type SNAP50 (lane 3) or mutant SNAP50 with the indicated alanine substitutions (lanes 4-19). Addition of GST alone did not reconstitute either U1 or U6 transcription as shown in lane 2 (experiment was performed by Gauri W. J awdekar)5 9. .......................... 67 Figure 43: Crystals of mSNAPcy4 as observed under the light microscope grown in 9% PEGSOOOMME, 100 mM Tris pH 8.5, 100 mM NaCl and 100 mM MgC12. ................... 68 Figure 44: Optimized crystals of mSNAPcy4 as observed under the light microscope grown in 8% PEGSOOOMME, 100 mM Tris pH 8.4, 100 mM NaCl and 100 mM MgC12. ........................................................................................................................................... 69 Figure 45: Needle clustered crystals of mSNAPcy4 under the TEM have a heavily branched appearance and the branches are even further sectioned ................................... 71 Figure 46: Crystals of mSNAPcy4 under the TEM. Close-up on one of the branches showing a heavily fragmented formation. ........................................................................ 72 Figure 47: Circular diffraction pattern typical for a powdered material ........................... 73 Figure 48: TEM image of mSNAPcy4 single crystals as observed under TEM. ............. 74 Figure 49: TEM diffraction images obtained from a single crystal. Image B was obtained from a crystal rotated approximately 0.5 degree with respect to the crystal from image A. ........................................................................................................................................... 75 Figure 50: TEM image of mSNAPcy4/DNA crystals ....................................................... 76 Figure 51: Diffraction pattern from a single branch obtained under TEM. ...................... 77 Figure 52: A representative elution profile for ion-exchange purification (Source-S) of mSNAPcy4. The solid line represents the relative absorption intensity of the eluted protein material and the dotted line represents the salt gradient. ...................................... 79 Figure 53: Gel filtration chromatograms of mSNAPcy4 purified with affinity chromatography (A); mSNAPcy4 rerun fraction 10 (B); mSNAPcy4 rerun fractions 12 xiii and 13 (C). The first peak in fractions 9 and 10 corresponds to 650 kDa and is also the void. The second peak corresponds to a protein with an apparent molecular weight of 150 kDa according to the gel filtration standards (Biorad, data not shown). .......................... 83 Figure 54: Gel filtration chromatogram of mSNAPcy4 (A) supplemented with zinc (B), iron (C) or EDTA (D). ...................................................................................................... 86 Figure 55: Gel filtration of mSNAPcy4 using Sephacryl-300 size exclusion beads. (A) Biorad molecular weight standards (a=670 kDa, b=158 kDa, c=44 kDa, d=17 kDa), (B) GST purified mSNAPcy4, (C) fraction 9—11 from the first gel filtration rerun using the same column, (D) fractions 9-11 from the second gel filtration rerun (e: void peak, f: mSNAPcy4 heterotetramer). Fractions were 5 mL. .......................................................... 88 Figure 56: Crystals of mSNAPcy4 grown in 100 mM Tris pH 8.5, 10% PEG4000, 100 mM NaCl and 100 mM MgC12. Protein material used to obtain the crystals was prepared by successive gel filtration purification steps. .................................................................. 90 xiv LIST OF TABLES Table 1: Quantification of zinc and nickel using ICP-MS and FAAS for mSNAPcy3 and mSNAPcy4. Two isotopes “Zn (3) and 68Zn (b) were measured for ICP-MS. Two independently prepared samples of mSNAPcy4 were used in the analysis: (1) extensively purified complex used for crystallization and (2) void fraction from gel filtration purification of the complex. The protein concentration was determined by absorbance measurement in 6M urea and by the Bradford assay. ....................................................... 62 Table 2: Combinations of antibiotics used for preparation of agar plates. AMP, KAN, STR and CAM represent ampicillin, kanamycin, streptomycin and chloramphenicol respectively. ...................................................................................................................... 96 Table 3: Antibiotic concentrations used in individual and co-expression systems .......... 98 XV A/ADE Amp BOG C Cam C/CYT C terminal DNA DNase DSE dsDNA DTT EDTA LIST OF ABBREVIATIONS alanine adenine ampicillin n-octyl-beta-D-glucopyranoside base pair cysteine chloramphenicol cytosine carboxy terminal aspartic acid Dalton deoxyribonucleic acid deoxyribonucleosidase distal sequence element double stranded DNA dithiothreitol glutamic acid Diaminoethanetetraacetic acid xvi EMSA F FAA(S) G/GUA GST GSH HCl Hepes HSQC ICP—MS Kan KCl kDa electrophoretic mobility shift assay phenylalanine flame atomic absorption (spectroscopy) glycine Guanine Glutathione S-Transferase glutathione histidine hydrochloric acid N—[2-hydroxyethyl] piperazine-N’-[ethane sulfonic acid] heteronuclear single quantum coherence transfer isoleucine inductively coupled plasma mass spectroscopy Isopropyl-B-D-Thiogalactopyranoside lysine kanamycin potassium chloride kilo Dalton xvii MCS mg 118 mL MW mRNA mSNAPc mrSNAPc NaCl "8 NMR N -terminal OIR PCR leucine methionine multiple cloning site milligram microgram milliliter molecular weight messenger RNA mini small nuclear RNA activating protein complex mini recombinant small nuclear RNA activating protein complex asparagine sodium chloride nanogram nuclear magnetic resonance amino terminal Oct-l interacting region proline polymerase chain reaction xviii pol PEG PMSF ppb ppm PSE Rb RNA RN AP rRNA S siRN A SNAP SNAPC snRNA SDS-PAGE Str polymerase polyethylene glycerol phenylmethylsulfonylfluoride parts per billion parts per million proximal sequence element glutamine arginine retinoblastoma protein Ribonucleic Acid Ribonucleic Acid Polymerase Ribosomal RNA serine small interfering RNA small nuclear RNA activating protein small nuclear RNA activating protein complex small nuclear RNA sodium dodecyl sulfate — polyacrylamide gel electrophoresis Streptomycin xix T/T HY TBP TEM TFI TFII TFIII Tris tRN A UV VIS WCE threonine Thymine TATA (box) Binding Protein transmission electron microscopy Transcription Factor I Transcription Factor 11 Transcription Factor III 2-Amino-2-(hydroxymethyl)-1,3-propendiol transfer RNA uracil ultraviolet valine visual tryptophan whole cell extract tyrosine XX 1. Introduction Genetic information of all living organisms, with the exception of retro viruses, is stored on the DNA. Transcription, a process in which the information on the DNA is transferred to the RNA, is achieved by accurate synthesis of RNA from the DNA template by different RNA polymerases. One RNA polymerase exists in prokaryotes and it is solely responsible for the transcription of the genomic information of the organism. In contrast, eukaryotes contain four classes of polymerases. The four classes differ in localization, template specificity, and susceptibility to activators and inhibitors. RNA polymerase I, located in the nucleoli, is responsible for transcription of ribosomal RNA and transcribes the genes for 188, 5.85 and 28S ribosomal RNA (rRNA). RNA polymerase 11, located in the nucleoplasm, is responsible for transcription of all messenger RNA (mRNA) and some small nuclear RNA (snRNA), such as U1 and U2 snRNA. RNA polymerase III, also located in the nucleoplasm, is responsible for transcription of SS ribosomal RNA, the transfer RNA (tRNA) and also the U6 snRNA. RNA polymerase IV was recently identified in plants and is responsible for transcription of short interfering RNAs (siRNAs) which were implicated in gene silencing and also mitochondrial RNA1’2’3. Strict control of transcription is required for healthy cell existence and is achieved by targeting regions of DNA called promoters, located upstream or downstream from transcription start sites. A promoter is a region on the DNA that is capable of recruiting transcription factors and eventually the RNA polymerase. An enhancer is a region on the DNA that is responsible for recruitment of transcription factors that influence transcription, but do not directly recruit the RNA polymerase. Different transcription factors bind to the promoter region and can initiate transcription by helping the polymerase to specifically bind to the DNA and start the transcription process. Other transcription factors function as repressors and inhibit the binding of the polymerase to the DNA, and thus reduce or abolish transcription. Some factors, called basal transcription factors, are always present in the cell4. Some are incapable of DNA binding and can, only by specific external stimuli, overcome this inability. When the inability to bind the DNA is eliminated, these factors become active and can then activate or repress the transcription process. The organism uses such mechanisms to adapt to a change in external conditions or to a different developmental stage, by expressing the proteins required to adapt to the new environment. A different type of transcription factor is expressed only in response to external stimuli and can then regulate transcription. In the absence of the stimuli these transcription factors are present in only small concentrations . .4 or are not present at all1 3 . 1.1. RNA polymerase and promoters 1.1.1. RNA polymerase I RNA polymerase I (RNAP I) is unique among the nuclear RNAs in that it is responsible for transcription of only one set of genes, the ribosomal RNA (rRNA) and has to be able to recognize only one promoter structure. There are essentially two different genes for all the rRNAs in a tandem repeated form. A long precursor for rRNA is transcribed by RNA polymerase I and it is subsequently cleaved and processed into the individual ribosomal RNAs. Two control regions make up the promoter for this polymerase; the core promoter (CP) extending from -45 to +20 and the upstream control element (UCE) extending from -180 to -107 which increases the efficiency of the core promoter. Both promoter regions are unusually GC-rich. SL1 (H. sapiens), TIF-IB (M. muscullus) or CF (S. cerevisiae) are TBP associated multi-subunit complexes responsible for binding to the core promoter and are essential for RNAP I transcription. The upstream control element recruits another multi-subunit factor called the upstream binding factor or UBF (UBF-1, H. sapiens) and this factor greatly enhances SL1 recruitment and its . . . ‘3 specrficrty1 . -180 -45 +1 l" e —[]—l I l UCE CP Figure 1: The upstream control element (UCE, -l80 to —107) and the control promoter (CP, -45 to +20) are the regulatory regions required for efficient targeting by RNA polymerase 1. 1.1.2. RNA polymerase II RNA polymerase 11 (RN AP II) is responsible for transcription of genes coding for all messenger RNAs (mRNAs) and most small nuclear RNA (snRNA). The RNAP 11 promoter region is the most diverse compared to other polymerases recognized by other promoters. Many different factors bind to this promoter region and influence the transcription of the genes. This is understandable because of the need for a living cell to be able to quickly increase or decrease the expression of particular proteins in response to changes in the environment or progress in the developmental stage of the cell. A variety of promoter elements have been identified including the TATA box (TATAAAA), upstream elements such as the CCAAT box, the GC box (GGGCGG) and downstream elements. RNAP II promoters have one or more of these elements in different numbers and combinations (Figure 2)3. +1 l'" W TATA Figure 2: mRNA promoters are often characterized by the presence of a TATA box. Other promoter regions are often present upstream or downstream of the transcription start site. Human U1 small nuclear RNA promoter is an example of a TATA box-less promoter and differs from the U6 snRN A promoter by the absence of the TATA box. The two promoters serve as an interesting model in the understanding of RNAP specificity for transcription of genes with similar promoter regions. The U1 promoter contains two regions; a distal sequence element (DSE) and the proximal sequence element (PSE, 4.5.6 Figure 3) +1 n I” i] u l DSE PSE Figure 3: H. sapiens U1 snRNA promoter region determined by the presence of both the DSE and the PSE, but by the absence of the TATA box. The upstream sequence element (USE) in A. thaliana (Figure 4) and the PSEA in D. melanogaster (Figure 5) are similar to the human PSE. Factors that are recruited to these promoter regions are not well characterized, but it is possible that the polymerase specificity in these systems is determined by different conformations of the same factor, which are defined by the USE or PSEA sequence. Different conformations of a factor can than recruit other factors that can than bind to the promoter and regulate transcription7. +1 r" El—[l—I USE TATA Figure 4: A. thaliana snRNA recognized by RNAP 11 contains the upstream sequence element (USE) and a TBP recruiting TATA box7. +1 r1 FT u [H PSEA PSEB Figure 5: snRNA promoter region from D. melanogaster with two promoter elements recognized by RNA polymerase 117. 1.1.3. RNA polymerase III RNA polymerase III (RNAP III) transcribes genes that encode structural or catalytic RNA and are in general shorter than 400 base pairs (bp), which is also consistent with the elongation properties of RNA polymerase III. The RNA molecules transcribed by RNA polymerase III are involved in fundamental metabolic processes. These RNAs are components of the protein synthesis, mRN A splicing and tRNA processing apparatus. RNA polymerase 111 uses three main types of promoters. The type 1 RNA polIII promoter has only one example; the X. laevis SS promoter, which contains an internal control region (ICR). This promoter region consists of three elements; the A box (+50 to +60), the intermediate element (IE; +67 to +72) and the C box (+80 to +90; Figure 6). In S. cerevisiae SS genes, only C box is required for efficient transcription3‘4‘8’9. +1 +120 ICR I 7 7 , 7 I LRHJ W A IE c l—_ Figure 6: X. laevis SS RNA gene contains three elements in the internal control region (ICR) which can be divided into box A (+50 to +60), an intermediate element (IE, +67 to +72) and box C (+80 to +90). A simple run of T residues determines the end of the genes. Type 2 promoters are represented by the genes of Ad2 VAl and most tRNAs. The main hallmark of this promoter type is the presence of gene internal A and B boxes. These regions are conserved in tRNA genes between various species. The A-boxes of type 1 and 2 are interchangeable in X. laevis (Figure 7)”). This reflects the similarity in sequence rather than a conserved function, since the A-boxes of SS RNA and tRNA genes bind different transcription factorsl 1. +1 +8 +52 +80 1" E U TTTT A B Figure 7: X. laevis tRNAlcu internal promoter region consisted of boxes A (+8 to +19) and B (+52 to +62)8. The H. sapiens U6 snRNA gene contains a type 3 promoter and is characterized by the presence of gene external promoter regions only. The distal sequence element (DSE), proximal sequence element (PSE), and the TATA box are located upstream from the transcription start site. Representative members of this promoter type are the U6 snRNA genes, coding for the U6 snRNA component of the spliceosomelz‘l3 ‘14, the 78K gene whose RNA product was implicated in the regulation of the CDK9/cyclin T complex”, and the HI RNA gene, which codes the RNA component of hRNaseP and the RNA component of hRNase MRPIG'”. Interestingly in the vertebrate snRNA promoters, RNA specificity can be switched between RNAPH and RNAPIII and vice versa by deletion or generation of the TATA box element (Figure 8)”. -240 -65 -32 +1 +106 i] EJ—[l-l W" DSE PSE TATA Figure 8: H. sapiens U6 snRNA with gene external promoter regions. The DSE (- 240 to -215), the PSE (-65 to -48) and the TATA box (-32 to -25) are all located upstream of the transcription start sites. The S. cerevisiae U6 snRNA gene is a type 4 RNAPIII promoter and is a hybrid consisted of gene internal and gene external regions. The TATA box is located upstream of the transcription start site and the B-box is located downstream from the coding region. The A-box is located gene internally (+21 TO +31, Figure 9). All three of these . . . . . 9 promoter elements are required for transcrrptron 1n vrvo1 . -30 +1 +113 +234 I'" mu m. l]— TATA A B Figure 9: The S. cerevisiae U6 gene with a combination of gene internal A-box (+21 to +31) and gene external promoter elements; The TATA box (-30 to -23) upstream from the transcription start site and B-box downstream (+234 to +244) from the U6 coding regiong. 1.2. Composition of human RNAP transcription initiation complexes 1.2.1. RNA polymerase I transcription initiation complex The TATA box binding protein (TBP) is the only known transcription factor that seems to be almost universally required for transcription regardless of the promoter type and polymerase used to transcribe a gene. Interestingly TBP is required for transcription whether or not the TATA box element is present in the promoter. In humans and in the case of RNA polymerase I the necessary transcription factor SL1 (selectivity factor) is a TBP containing multi-subunit complex and is responsible for the recognition of the USE region of the promoter, while the upstream binding factor (UBFl) is responsible for recognition of the CP region (Figure 10) 1‘3, TF IS su RNAP I TBP UBF1 THs TFIS E::::}-——-{::::}-——-— use CP Figure 10: H. sapiens RNA polymerase I transcription from an rRNA promoter with SL1, a TBP containing complex and UBFl, a multi—subunit transcription factor targeting the USE and the CP regions respectively. Placement of TFIs on the figure is arbitrary. 1.2.2. RNA polymerase II transcription initiation complexes mRNA type promoter, which is transcribed by RNA polymerase II and contains the TATA box, can be recognized by TFIII) or TBP. Once TFIID or TBP are successfully recruited to the TATA box, other TAFus are then recruited to the promoter. First to follow TFIID is TFIIB, then the TFIIF-RNAP H complex, TFIIE and TFIIH. TFIIA can enter the initiation complex at any stage of the assembly and its main function is counteracting repressors that associate with TBP and reduce or prevent its association with the DNA (Figure 11)2°'2"22'23‘2“. 10 pse . RNAP II Figure 11: H. sapiens polymerase II transcription initiation complex: (A) an example of a mRNA promoter with a TATA box, recognized by the TBP or by the TBP containing TFIID complex and (B) TATA-less U1 snRNA type of promoter, with the PSE sequence recognized by the SNAP complex. 1.2.3. RNA polymerase III transcription initiation complexes The key player in polymerase III transcription initiation is the TFIIIB complex, because it contacts the polymerase directly. H. sapiens Brfl-TFHIB is composed of three subunits, the TATA-box binding protein, Brfl and de1. An example of Brfl-TFIIIB dependent initiation is transcription from the tRNA type of promoters, where the TBP containing TFIIIB is recruited to the DNA by positioning of TFIIIC to boxes A and B. TBP is required even though the promoter is TATA-less. In the case of U6 snRNA promoters, the Brf2-TFIIIB complex, is assembled from TBP, Brf2 and de1. This complex is recruited by binding of the SNAPC complex to the proximal sequence element upstream of the TATA box (Figure 12)1'3’25 '26. Figure 12: H. sapiens polymerase III transcription initiation complex: a U6 snRNA promoter (A) with the PSE sequence, targeted by the SNAP complex, and the TATA box, targeted by the TFIIIB like complex, assembled from TBP, Brf2 (BrfU) and del. (B) TATA-less tRNA type of promoter with boxes A and B targeted by TFIIIC. TFIIIB is required for transcription initiation. 12 1.3. SNAPC 1.3.1. The composition of SNAPC The small nuclear RNA activating protein complex (SNAPC) is a basal transcription factor responsible for transcription initiation for polymerase II and III snRNA systems. This transcription factor is at least a five subunit complex, composed of SNAP190, SNAP50, SNAP43, SNAP45 and SNAP196'27'28. TBP was found to often co- purify with the SNAP complex purified from HeLa cells”. The largest subunit of SNAPC, SNAP190, is the backbone of the complex. This subunit interacts with all other subunits of the complex. The N-terminal part of SNAP190 is the most important region and interacts with SNAP50, SNAP43 and SNAP19 (Figure 13). The N-terminus is responsible for the DNA binding activity of the protein and also interacts with TBP. The unusual DNA binding domain of SNAP190 resembles the Myb DNA binding motif and is composed of four full (Ra, Rb, Rc and Rd) and one half (Rh) repeats. An exceptionally serine rich area following an arginine rich region in the N- terminal part of SNAP190 was shown to be a target for phosphorylation. This modification had an important impact on the ability of the complex to bind DNA and to initiate transcription by changing the electrostatic properties of this region of SNAP1903O’3 '. Oct-1 is recruited to the complex by interaction with SNAP190 in the area encompassing amino acids 869-91232‘33'34‘3SSNAP43 recruits SNAP50 to the largest subunit of the complex and was shown to also interact with SNAP19 and TBP (Figure 13)6. SNAP50 is, like SNAP190, responsible for DNA binding of this transcription factor judged from the DNA cross-linking experiments. An unusual zinc binding domain was predicted in the C-terminal part of SNAP50. This zinc-containing domain is believed to 13 be involved in the DNA binding properties of SNAP506. The smallest subunit of the SNAP complex is SNAP19. An exceptionally charged C-terminal tail and a leucine zipper motif characterize this subunit. SNAP19 is not required for the formation of the complex between SNAP190, SNAP43 and SNASO at higher concentrations, but it seems to be needed for efficient complex formation at lower, physiologically relevant concentrations”. SNAP45 interacts with the C-termini of SNAP190 and TBP. A SNAP complex missing SNAP45 (SNAPC—SNAP45) showed a diffusive band in the EMSA, which migrated slower than complete SNAPC, suggesting that the C—terminal domain of SNAP190 in a SNAP complex missing SNAP45 may assume multiple conformations. Addition of recombinant SNAP45 to the SNAPC-SNAP45 resulted in a band that migrated at the same position as the complete SNAPC. This suggests that SNAP45 was incorporated into the complex and due to direct protein-protein interactions, the C- terrninal domain of SNAP190 assumed a more rigid conformation”. The stoichiometry of the subunits in the complex is unknown and so is the size of the complex”. 14 1 I SNAP50 411 I ; """""" I 164 263 1L ISNAP43 I I368 1.90 1I SNAP45 am 1 eeeee ‘ ~~~~~~ ‘\.\ l E I! . ‘w L L MYI’ a) "0'3 ' 1409 LE . ‘ m 7'13: 7 7 503 1281 1393 73484133 263 ITBPI Figure 13: Protein-protein/DNA contact map of SNAPC. The complex is composed of five subunits SNAP190 interacts with TBP using the TBP recruitment region (TRRI) from amino-acids 34 to 84 and with Oct-1 using the OIR (Oct-1 interacting region) from amino-acids 888 to 912. RSRR is the arginine and serine rich area of SNAP190, a target for phosphorylation. The unusual Myb domain is responsible for DNA binding of SNAP190 (adopted from Hernandez et al.28). 1.3.2. The SNAP complex and chromatin The two critical promoter elements on the DNA, the proximal sequence element (PSE) and the distal sequence element (DSE) are separated by 150 base pairs. The separation of the two promoter elements is conserved in several organisms. SNAPC binds the PSE and the transcription activator Oct-1 binds to the DSE, resulting in transcription activation. It was also shown that SNAPC dependent Oct-1 activation is not observed 15 when naked DNA with separated DSE and PSE is used in in-vitro assays”. The Oct-1 POU domain is capable of increasing SNAPc recruitment to the promoter if DNA between the DSE and PSE is spaced closer together. It was also observed that reconstruction of the nucleosome, by addition of purified histones, leads to increased activation of the Oct-1 induced SNAPc recruitment and transcription activation. DNA foot printing revealed that a nucleosome core is positioned between the two regions. The localization of the DNA wrapped around the histones is defined by the presence of the PSE and the DSE and the proteins that bind to the two regions, SNAPc and Oct-l. Interestingly, the two proteins come in close contact with the nucleosome core, since 146 base pairs of the DNA are needed to form the nucleosome38'39’40. It is thus possible that SNAPc or Oct-1 come into direct contact with the nucleosome, not only with the PSE and the DSE regions, hence stabilizing the formation of the pro-initiation complex (Figure 14). Pg; SNAP19(\\ /“ Y f::i::‘:-(""’<>IR J ‘\31‘e‘”"/‘ \—r~”” ,/ ”‘\ \ l \ /.,,-/+~~\ e / \ISNAP45 / NUCLEOSCIDME 319‘ SNAP43 / It. I) VCORl; \x \ /' z m >' (I) 5 8 (I) > 8 O ' O p. U 2 I— U e (n a) t: (11 cu c 2 (9 c: 3 2 (9 t: D ZSOkD -+ ZSOkD F— 150 ' ~ 150 -—- 100 ’1- 100 a 75 -U 75 -" - — I".- 37 .- 37 - ‘ 2 3 4 1 2 3 4 Figure 23: Purification of (A)SNAP190(1-505) and (B)SNAP43 using GST affinity chromatography. The proteins were purified from equal culture volumes under identical conditions. The majority of SNAP43 can not be recovered and the material that eluted off the beads sticks to the reaction vessel in a short time. The material can be recovered by boiling in the sample buffer for SDS-PAGE. 34 o 8 0x A g 22 9: a W < Z " < Z 0. Z W CL 2 V) < . < . Z A > 2 J3 > v: > 8 V.» > 8 I; 3 .7. Q) 2 o a: D E o a: D zsoko — $2ng a 150 "" ,00 u 100 .— 75 - 75 a... .. -GST—SNAPSO so - SO - — -' ' -SNAPSO h 25 . -- - 'GST 37 - 20 4. - 'SNAP19 ’ 1s a "' 1 2 3 4 1 2 3 4 Figure 24: Purification of (A) SNAP50 showing the strong presence of an unidentified protein that can not be removed by subsequent purification and (B) SNAP19 using GST affinity chromatography. The proteins were purified from equal culture volumes under identical conditions. It is apparent that the isolated subunits share a few common characteristics. They are prone to thrombin degradation, which is most likely due to structurally unstable regions like exposed loops. Secondly, some subunits like SNAP43 and SNAP50 and to some extent SNAP190( 1-505), were hard to recover after thrombin cleavage. SNAP43 is virtually impossible to recover from the glutathione resin and different approaches were tested to cope with this problem, but were unsuccessful. Only small amounts of SNAP43 can be successfully eluted from the beads and the quantity of the protein is only sufficient 35 for biological, but not for structural studies and the majority of the cleaved material remained immobilized on the resin or adhered to the reaction vessel. The mini SNAP complex, made from SNAPl90(1-505), SNAP50 and SNAP43 (mSNAPc) can be assembled from individually expressed and purified subunits. Although the production of such material is at best inefficient, the amounts obtained and the quality of the material were sufficient to perform DNA binding studies using U1 and U6 DNA probes. The complex was capable of binding to the PSE region of the U1 or U6 DNA, as can be judged by the retardation of the mobility of radiolabeled DNA as was the endogenous SNAPc isolated from HeLa cells. DNA binding was specific since it was abolished when mutant PSE was used (Figure 25)”. monoQ mtS NAPC I 1 I fl r I wt mu W1 mu wt ————’ mu w - S NAPc ”an- - mtS NAPC .7 Free Jprotn Figure 25: Electro mobility shift assay (EMSA) of the recombinant mSNAPc (mrSNAPc) assembled from individually expressed and purified subunits, and 36 endogenous SNAPc obtained from HeLa cells (monoQ fractions). Increasing amounts of mSNAPc that was assembled from subunits individually expressed in E. coli (approximately 1, 2.5, and 7.5 ng; lanes 5—7, respectively) were added to EMSA reactions containing dsDNA probes containing a wild-type (wt) or mutant (mu) PSE, as indicated. Reactions loaded in lanes 3 and 4 contained 7.5 uL of partially purified endogenous SNAPc (approximately 0.3 ng SNAPc/mL). Reactions containing only the probe DNAs are shown in lanes 1 and 2 (experiment performed by Craig Hinkley) 58. This complex was also tested for the ability to restore transcription from SNAPc depleted nuclear extract. Surprisingly, the complex was unable to restore transcription significantly for either U1 or U6 types of promoters (Figure 26). Endogenous SNAPc isolated from HeLa cells (monoQ fractions) was in contrast capable of both DNA binding and transcription initiation. The amounts of the material obtained from this expression system are significantly lower than the amounts that can be obtained form bacterial expression”. 37 E 5 a S w L ‘0 g 5 WCE a—SNAPC depleted O r l + E g w L” L” c + mrS NAP: s—’ .w u o ? 5‘: '5 - E -~ --—-a -U6 5' Figure 26: In-vitro U6 transcription initiation assay of the recombinant mSNAPc. Increasing amounts of mSNAPc were added to human U6 in vitro transcription reactions for which the HeLa whole cell extract (WCE) was treated with on- SNAP43 antisera to remove endogenous SNAPc, as shown in lanes 6—8. Lane 4 shows the decreased signal for the correctly initiated U6 transcription upon removal of endogenous SNAPc. Lane 5 shows the U6 signal dependent upon addition of endogenous SNAPc obtained from biochemical fractionation of a HeLa cell nuclear extract. Note, this signal is comparable to the mock depleted WCE (lane 3) or WCE alone (lane 1)”. We hypothesized that the SNAPc components are unable to fold properly when expressed separately and have exposed surfaces that render the material unstable and sticky. Also, since co-expression using the baculo virus expression system yielded fully active mSNAPc, we thought that co-expressing the subunits in E. coli could potentially 38 yield quantities sufficient for biological and structural studies. We also hypothesized that the material obtained from a co-expression system would, unless specific secondary modifications are needed, yield fully active complex similar to the baculovirus system, but with increased expression levels. 2.3. Cloning of plasmids for co-expression The goal of simultaneously expressing 3 or 4 subunits in the same cell can be achieved by either using 3 or 4 plasmids with different antibiotic resistance, or by using a polycistronic plasmid with several expression cassettes. We decided to use a combination of the two approaches, since the number of compatible plasmids with different antibiotic resistance is limited. Also, the strategy to ligate several genes in to the same plasmid, targeting many different multiple cloning sites (MCS) is complicated. First, a theoretical purification plan was devised that would enable rapid and efficient extraction and purification of the co-expressed complex. Since all subunits were previously cloned in a GST expression plasmid we decided to keep the existing plasmid with SNAPl90(1-505) in the cloning site following the GST and thrombin recognition sequences (Figure 27). Other subunits would be placed on different plasmids with different antibiotic resistance. First, SNAP43 was cloned into pCDFDuet-l to form pCDF43-1 and SNAP50 was cloned into pRSFDuet-l to form pRSFSO-l. Since pCDF43-l and pRSFSO-l carry the information for streptomycin and kanamycin resistance. respectively, it is possible to transform these two plasmids into the same E. coli cell with the pGST-190 plasmid carrying the ampicillin resistance already present. pCDFDuet-l was also used to insert the SNAP50 gene in to the multiple cloning site I (MCSI) and following that the SNAP43 39 gene was ligated into the second multiple cloning site (MCSII), resulting in the pCDF43R/50—l expression plasmid (Figure 28). The SNAP19 gene was cloned in to pRSFDuet-l to yield pRSF19-l (Figure 29). Plasmids were sequentially transformed into E. coli, following calcium treatment to make the cells competent (to make mSNAPcy3 pGSTl90(l-505) was transformed first, then pCDF43R/50-l, and finally, to make mSNAPcy4 pRSFl9-l was transformed). After successful insertion of the plasmid in to the bacterial host, the expression was performed at 16°C and the proteins were extracted, purified using the GST affinity column and analyzed (see Methods and materials for details). mm a, T7 7% pGST190(1-505) ’6! 019d Figure 27: The pGST190(l-505) plasmid with pET origin of replication and ampicillin resistance. The SNAPl90(l-505) sequence follows the sequence for the GST tag and the thrombin recognition sequence. 40 Different combinations of which gene is present in which cloning site and the type of plasmid used were used and tested to check for the maximum efficiency of the co- expression system. And although all subunits were expressed well in almost every case, the different co-expression systems varied in the apparent stoichiometry of the subunits. Good expression and recovery levels were obtained using the pGSTl90(l-505) and pCDF43/50-l, with ampicillin and streptomycin resistances, respectively. The complex (SNAPl90(l-505), SNAP50, and SNAP43) obtained using this co-expression system is referred as mSNAPcyB. Better expression and recovery levels were obtained using the pGSTl90(l-505), pCDF43/SO-1 and pRSFl9-1 with ampicillin, streptomycin and kanamycin resistances, respectively. The complex (SNAPl90(1-505), SNAP50, SNAP43, and SNAP19) obtained using this co-expression system is referred to as mSNAPcy4. 41 Noo I BamH I Sac I m del T7 Sac | (BamH l/Bgl ll) Kpn I pCDF43/50-1 6104 bp 800.00 Figure 28: pCDF43/50-l plasmid with CDF origin of replication and streptomycin resistance. SNAP50 gene is in MCSI and SNAP43 is in MCSII. 42 T7 pRSF19—1 4092 bp '00 Figure 29: pRSF19-l plasmid with RSF origin of replication and kanamycin resistance with SNAP19 gene in MCSI and un-utilized MCSII. 2.4. Over-expression of mSNAPc variants Once the plasmids were sequentially inserted in to the E. coli BL21- CodonPlus(DE3)-RIL competent cells, the cultures were grown to the desired density and over-expression was induced at different temperatures to determine the V optimal expression conditions. 16°C was determined to be the temperature that yielded the most material. The expression at increased temperatures resulted in a drastic reduction of the target protein material. Both the mSNAPcy3 and mSNAPcy4 systems expressed all the desired proteins as was determined by SDS-PAGE and Western blot analysis. 43 Different strains of E. coli were tested in search of the best co-expression host, like BL21(DE3), CodonPlus-RIL, CodonPlus-RP, and pLysS (Stratagene). The BL21(DE3) competent cells (genotype: E. coli B F' dcm ompT hst(rB” 1113—) gal 7L(DE3)) are an all-purpose strain for high-level protein expression. The BL21(DE3) pLysS competent cells (genotype: E. coli B F' dcm ompT hst(rB" m3“) gal 1(DE3) [pLysS Cam']) contain a pLysS plasmid with chloramphenicol resistance and provide tighter control of protein expression of toxic proteins. The BL21-CodonPlus(DE3)-RIL competent cells (genotype: E. coli B F' onsz hst(rB’ m3") dcm+ Tetr gal MDE3) endA Hte [argU ileY leuW Cam']) contain a plasmid encoding rare codons for arginine, isoleucine and leucine (argU (AGA, AGG), ileY (AUA) and leuW (CUA)) and provides chloramphenicol resistance. The BL21-CodonPlus(DE3)-RP competent cells (genotype: E. coli B F” ompT hst(rB' m3-) dcm+ Tetr gal MDE3) endA Hte [argU proL Cam']) contain a plasmid encoding rare codons for arginine and proline (argU (AGA, AGG), proL (CCC)) and provides chloramphenicol resistance. mSNAPcy4 was successfully expressed in all types of E. coli with similar amounts of the protein obtained, but with the highest amount of the mSNAPc being expressed in the CodonPlus-RIL cells, as judged by immobilization and recovery of the complex using the glutathione resin. It was observed from the amino-acid sequence and the codon analysis that several subunits, especially SNAPl90(1-505) and SNAP50, contain rare codons and could potentially lead to lower expression levels due to codon bias. Next, different cell culture growth media were tested in order to further optimize the expression levels of partial SNAP complexes. LB and TB media, as well as M9 44 minimal media were tested by immobilization and recovery of the protein from the glutathione resin. M9 minimal media resulted in hardly detectable levels of the complex, as judged by Commassie blue staining of proteins separated by SDS-PAGE. LB and TB yielded the highest levels of expressed material. The over-expression in TB had a two fold increase compared to the expression levels for the cells grown in LB medium. The density of the cells grown in TB media was about twice that of the cells grown in LB media. Overall, the expression levels per cell are comparable for the two growth medias (data not shown). The time of [PT G induction was also studied to further improve the expression levels of the mSNAP complexes. Different times of induction, defined by the cell density in the growth culture, were tested in the range of 0.1 to 1.4 optical density (ODooo). After the induction the cell culture was grown for 12—16 h at 16°C and the amount of the expressed material was evaluated using the GST purification scheme. It was determined that the expression levels of the protein are in direct proportion to the cell density at the time of induction. When the cell culture was induced with IPTG at an OD600 of 1.4, almost double amount of the mSNAPc was obtained as compared to the cell culture induced at an ODooo of 0.7. The amount of cells obtained, when the culture was induced at an OD600 of 1.4 was also doubled (data not shown). It seems that the mSNAPc components are non-toxic to E. coli and the expression level per cell is constant when grown at identical conditions and independent of the induction point. The only consequence of the different induction point is the final cell density and therefore the final amount of the produced material and not the actual amount of protein per cell as is usually the case when the proteins are even slightly toxic to the cell. 45 The temperature of induction was studied. And the complex over- expressed only at 16°C. Expression at 24°C or 38 °C resulted in diminished expression levels of mSNAPc. 2.5. Comparison of the individual expression of SNAPc subunits to the co-expression system Cell cultures expressing the mSNAPcy4 or individual subunits were grown under identical conditions and the expressed material was analyzed by Western blotting method using the antibody directed against the specific subunit of interest. The SNAP50 and SNAP43 subunits expressed well in both the individual and co-expression systems. Interestingly the amount of leaky expression for these two proteins is relatively high in the co-expression system. The individual expression systems yielded no expressed protein before induction (Figure 30, panels A and B). The level of the protein in the insoluble or soluble fractions is similar for the two systems. The relatively high amount of the material in the insoluble fraction is most likely due to insufficient rinsing of the material and may not be a measure of the total material actually in the insoluble form. The amount of the target proteins is also comparable between the two expression systems, suggesting that expression was not drastically improved. Nevertheless, the expression levels for both SNAP43 and SNAP50 are elevated in the co-expression system compared to the individual expression. 46 8 8 a: 0 Q 0 U _ a i? :3 % § 8 3 g ,5 a o 2 'E 8 8 a 5 2 a a a E ‘-= w r ‘ - -J -GST-SNAP43 l - -—-GST-SNAP50 T CW ' ‘ "‘_—iw".—T_‘ ‘- O - -i - SNAP43 in L“: - - -l-SNAP50 in ‘ “’“WW‘” " mSNAPc./4 mSNAPc/4 A B 3 8 8 8 g 2 3 8 £3 2 ‘0 o .2 D “O o 2 .o .9 2 o .2 E 3 0 2 5 E E 8 5 E ‘5” (1°) l m‘osr-SNAmgom-wm i .. -u- -GST—SNAP19 [1 __-L________ ~ _ GST-SNAP190(1-505) in i _ l : ___. z} mSNAP“ . -SNAP19In --—~~- ** mSNAPcy4 C D Figure 30: Analysis of expression for SNAPc subunits in the individual expression and co-expression approach by Western blot. GST labeled protein were visualized using Ot-GST antibodies and the SNAPc subunits were visualized using antibodies directed against the subunit of interest. The expression levels of GST-SNAPl90(l-505) are to some extent elevated in the co-expression system compared to individual expression. Leaky expression was not observed for this protein and the material is mostly in the soluble fraction of the cell extract (Figure 30, panel C). In contrast the amounts of SNAP19 in the co-expression system were not detectable in the cell extract using Western blot. The levels of this protein are significantly reduced compared to the individual expression system (Figure 30). SNAP19 is the smallest of the subunits and to achieve complex formation where the 47 subunits are equimolar, a smaller amount of SNAP19 is needed compared to the other subunits, so a complete mSNAPcy4 complex could still be assembled. 2.6. Purification of mSNAPc Since the SNAPl90(l-505) contains a cleavable linker and a GST tag, the soluble culture extract harboring mSNAPCy3 or mSNAPcy4 was clarified and allowed to bind to the glutathione resin to immobilize the expressed complex. Other subunits were not tagged, yet they were efficiently immobilized thru the GST-190(l—505). Different purification conditions were tested to explore different approaches to achieve highest purity of the material in the first step of purification. Parameters like salt, detergent and glycerol concentration were varied and the purity of the immobilized material was evaluated with SDS-PAGE and silver staining. It was determined that the detergent (Tween-20) is absolutely critical for the extraction of the complex from E. coli since virtually no material was immobilized when Tween-20 was not present as judged by SDS-PAGE and Commassie blue staining. The Tween-20 could be replaced with n-octyl- beta—D-glucopyranoside (BOG) with slightly reduced extraction levels in this purification step. The salt concentration was varied from 150 to 500 mM with the highest purity achieved when higher salt is used. Lower salt concentration leads to increased presence of impurities. The bound material can be efficiently removed from the beads by glutathione elution or by thrombin cleavage of the linker connecting the SNAPl90(l- 505) and the GST tag. In comparison to the GST purification step of some SNAPc subunits, like SNAP43 or SNAP50, the material was almost fully recovered from the resin (Figure 31). Furthermore, the relative intensities of the bands observed on the 48 Coomassie stained SDS-PAGE suggest that the subunits are equimolar, although in some cases the intensity of SNAP50 was weaker than the intensities of SNAPl90(1-505) or SNAP43. The co-expressed material is also more stable during thrombin cleavage as relatively high concentration of thrombin can be used for extended periods of time without ill effects on the complex composition. When the optimal cleavage conditions were investigated, no truncated products were observed even after three-day cleavage with 200 units of thrombin at 4°C (6 L cell culture). On the other hand, the individually expressed subunits were extremely sensitive during thrombin cleavage and prone to over digestion. More importantly, the co-expressed material was successfully recovered from the resin, which was not the case for SNAP43 where relatively small amounts of the protein were recovered (Figure 32). w; *3 J ' 1 Y. 5’ < K ‘1 z < Z ‘9 z ‘9 c a c g C >' U 0 o [2 s a E O x D zsoko 150 100 ‘5'." so -7", - ' :SNAPSO SNAP43 37 - 25 - .— -csr 20 - -SNAP19 IS . I 2 3 4 Figure 31: Purification of co-expressed mSNAPcy4 using the GST affinity resin and thrombin cleavage. The mSNAPcy4 was purified under the same conditions as the individual subunits. 49 SNAP190(1-5051 unrecov SNAP43 mSNAPC/B mSNAPcyd SNAP50 SNAP19 - - ~ - —- SNAP 'l 90(‘1—505) ~ "0 -' SNAP50 - . —SNAP43 - -- SNAP 19 Figure 32: Recovery of the co-expressed material compared to individually expressed and purified subunits. SNAP19 improves complex stability and subunit recovery. Subunit recovery for partial SNAPC assembled without and with co- expressed SNAP19 (mSNAPcy3 and mSNAPcy4, respectively) was estimated by SDS—PAGE and Coomassie blue staining. Note the increased recovery of SNAP43 and SNAP50 in mSNAPcy4 (lane 7) compared to mSNAPcy3 (lane 6). Lanes 2—5 contain aliquots of the individually expressed SNAPc subunits for reference. Note that the unrecovered SNAP43 is used to show the migration of SNAP43. The recovered material was further purified using ion-exchange chromatography. Cation and anion (SourceQ and SourceS, Pharmacia) exchange resins were tested in a wide array of purification conditions. The complex had the tendency to bind to the ion- exchange resin, but the recovery of the protein material was often accomplished only with severe losses (data not shown). Optimal purification conditions were eventually 50 determined and the detergent Tween-2O was again essential for this purification. Surprisingly, addition of magnesium ions to the purification buffer resulted in even better recovery rates. The protein does, however, elute off the column over a broad range of salt concentrations and the purity is not greatly improved compared to the GST purification step alone. The co-expression and GST purification scheme using the GSH resin works well with both mSNAPcy3 and the mSNAPcy3 and is a major improvement compared to individually expressed and purified subunits or the baculo—virus co-expression system, which resulted in low amounts of material or material of poor purity. 2.7. DNA binding activity of mSNAPc variants The mSNAPcy4 was tested for DNA binding activity for both U1 (Figure 33 lane 11) and U6 (Figure 33 lanes 2, 3) promoters in an electrophoretic mobility shift assay (EMSA) and was similar to the assembled recombinant mSNAPc active for DNA binding to both types of promoters”. 51 PSE [TATA was JAIA lPSE‘w-mu-mw—W PROBE r 1 ---+---+---+---+TBP - 4‘-.4 'A'AmSNAPC‘M «mSNAPcy4/TBP 4—mSNAPcy4 12 3 4 5 6 7 8 910111213141516 Figure 33: mSNAPcy4 facilitates TBP promoter recruitment. Increasing amounts of mSNAPcy4 (3 and 10 ng) were added to DNA binding reactions containing radiolabeled probes that harbor a wt PSE and wt TATA box (lanes 1—4), mu PSE and wt TATA box (lanes 5—8), wt PSE and mu TATA box (lanes 9—12), or mu PSE and mu TATA box (lanes 13-16). Lanes 4, 8, l2, and 16 contain 50 ng of recombinant full—length human TBP in addition to 10 ng of mSNAPcy4. Reactions containing only the DNA probes are shown in lanes 1, 5, 9, and 13. Positions of the mSNAPcy4 and mSNAPcy4 plus TBP complexes are indicated (experiment was performed by Gauri W. J awdekar)5 8. The complex was also tested for the ability to recruit TBP to the promoter and was capable of doing so in the case of the U6 promoter (Figure 33 lane 4) but was unable to recruit TBP in the absence of the TATA box, as is the case in a U1 type of snRNA 52 promoter (Figure 33 lane 12). TBP alone was not capable or binding to the DNA in the absence of the SNAP complex, under the experimental conditions, regardless of TATA box presence. mSNAPcy4 binds to the promoter region of the DNA specifically and mutations in the PSE region abolish the interaction”. 2.8. mSNAPcy4 is capable of recruiting TFIIIB to the promoter The co-expressed mSNAPcy4 was tested for its ability to recruit Brf2-TFIIIB to the U6 promoter. Human TBP, Brf2 and de1(l-470) were mixed with mSNAPc and the labeled U6 DNA probe. In the absence of mSNAPc, Brf2-TFIIIB did not bind to the DNA (Figure 34, lanes 2-7) or did so weakly (Figure 35, lanes 5 and 7). The Brf2-TFIIIB complex bound the probe in the presence of mSNAPc. Migration of mSNAPc was altered in the presence of TBP (Figure 34, lane 9), but not by addition of Brf2 or de1 alone (Figure 34, lanes 10 and 11), suggesting that TBP is essential for subsequent recruitment of Brf2 and del and that mSNAPc through direct protein-protein interactions stabilizes the TBP/DNA interaction. A supershift was also observed when Brf2 was added to the mSNAPc/TBP/DNA complex, but not when del was added in the absence of Brf2 (Figure 34, lanes 12 and 13). The presence of both Brf2 and del(l-470) resulted in a band migrating slower than mSNAPc/TBP/DNA and mSNAPc/TBP/Brf2/DNA (Figure 34, lane 15, Figure 35, lane 12 ). Specific protein-protein contacts are involved in the formation of the mSNAPc/Brf2-TFIIIB/DNA complex and the complex is assembled in the order mSNAPc-DNA-TBP-Ber-de15°. 53 mSNAPcy4"'-"'++++++++ de1 (1-470) ‘ ' ' + ' + + ' ' ' + ' + + + 8er - + - + + - + - + + + TBP + - + + + + - + + - + mSNAPc/TBP/Ber/del — mSNAPchBP/Brf2 . ‘ —\—mSNAPcffBP _\- mSNAPc ’g Free Probe 12 3 4 5 6 7 8 9101112131415 Figure 34: mSNAPcy4 is capable of recruiting TFIIIB to the promoter. Coordinated DNA binding by SNAPc and TBP facilitates higher order complex assembly with Brf2 and de1. DNA binding reactions were performed with the indicated combinations of SNAPCy4 and Brf2-TFIHB subunits. These results suggest that preinitiation complex assembly follows the order SNAPc>TBP>Brf2>del (experiment was performed by Gauri W. J awdekar)”. 54 - mSNAPcy4 + mSNAPcy4 de1(1-47o)--- +- +++ ---+ Br12 - - + - + - + + - - + + TBP ' + ' ' + + + ' ' + + +7 mSNAPc/TBP/BerIdel TBP/Brf2/de1 mSNAPcrrBP/Brf2 " mSNAPcfl'BP TBP/Ber mSNAPc 12345678 9101112 Figure 35: SNAPc stimulates DNA binding by Brf2-TFIIIB. Electrophoretic mobility shift assays were performed using a U6 probe containing a wild-type mouse U6 PSE and a wild-type TATA-box (AC probe). DNA binding was carried out in the absence (lanes 1-8) or presence (lanes 9-12) of wild type SNAPCy4. Reactions containing individual TBP, Brf2, and de1 (1-470) subunits are shown in lanes 2-4. Reactions containing pair wise combinations of TBP with Brf2, TBP with de1 (1-470), and Brf2 with de1 (1-470) are shown in lanes 4, 5, and 8. DNA binding by the complete Brf2-TFIIIB complex in the absence of SNAPc is shown in lane 7. Additional reactions were performed with SNAPc alone (lane 9) or in combination with Brf2-TFIIIB subunits (lanes 9-12), as indicated. Lane 1 shows migration of the probe alone. The relative positions of the various 55 SNAPcy4/Brf2-TFIIIB complexes are shown on the right (experiment was performed by Gauri W. J awdekar)”. 2.9. Transcription initiation mediated by mSNAPc mSNAPcy4 was then tested for its ability to initiate transcription from both U1 and U6 promoters in an in-vitro transcription initiation assay and was, in contrast to the recombinant mSNAPc (assembled from individual subunits), active for transcription initiation from U1 (Figure 36) and U6 (Figure 37) promoterssg. U (D .‘g a-SNAPC depleted a) F I "C? +mSNAPCy4 - E - F «*4 M * H...“ «who H 1 2 3 4 5 6 7 8 9 10 Figure 36: mSNAPcy4 functions for U1 snRN A transcription by RNA polymerase II. HeLa cell nuclear extract was either mock depleted with a preimmune rabbit sera (lane 2) or a-SNAP43 antisera (lanes 3—10) to deplete endogenous SNAPc. Extracts were then used for human U1 in vitro transcription assays. The U1- specific signal was diminished upon removal of endogenous SNAPc, as shown in lane 3. Increasing amounts of mSNAPcy4 (0.08, 0.25, 0.75, 2.5, 7.5, 25, and 75 n g) reconstituted the correctly initiated transcription from a human U1 reporter, as shown in lanes 4—10. Lanes 1 and 2 show the U1 signal obtained from either 56 untreated or mock depleted reactions (experiment was performed by Gauri W. J awdekar)5 8. Ot-SNAPC depleted r I _ - +mSNAPcy4 - a *“*G'I 12 3 4 5 6 7 8 910 + GST -U6 5' Figure 37: mSNAPcy4 functions for U6 snRNA transcription by RNA polymerase III. In vitro transcription of human U6 snRNA was carried out using HeLa cell nuclear extract that was treated as in Figure 36. Lane 2 shows the reduced U6 signal upon removal of endogenous SNAPc. Increasing amounts of mSNAPcy4 (0.08, 0.25, 0.75, 2.5, 7.5, 25, and 75 ng) reconstituted the correctly initiated transcription from a human U6 reporter as shown in lanes 3—9. Approximately 75 ng of GST was added to the transcription reaction shown in lane 10 (experiment was performed by Gauri W. J awdekar)5 8. 2.10. Zinc finger domain in SNAP50 The C-terminal region of SNAP50 is cysteine and histidine rich and is a potential Zn finger domain based on amino-acid sequence, which might be involved with protein- DNA or protein-protein contacts. DNA binding domains sometimes consist of multiple repeats of related zinc finger motifs. The SNAP50 zinc finger domain however contains an unusual arrangement of six histidine and nine cysteine residues that can be grouped 57 into region I that resembles a TFIIIA—like C2H2 zinc finger and region 11 that resembles a glucocorticoid-like C2C2 zinc finger. Sequence alignment of the C-terminal region (301-411) of SNAP50 with the SNAP50 homologues from different mammals, fish, insects, worms, plants, slime mold and parasites, revealed two highly conserved potential zinc finger domains (Figure 38). Interestingly the residues from region I are highly conserved, with the exception of C312, which is conserved in mammals, but not in lower organisms. The fact that SNAP50 is highly conserved in lower organisms suggests that this subunit plays an important role in the transcription initiation mechanism. Region 2 is invariantly conserved in all SNAP50 homologues and the four conserved cysteine residues are similar to steroid receptor (szszszzC) type zinc fingers. Overall the consensus motif Lx4Gonx3CxHx20- 23YPx1I-1szsz18Px3-4Cx2CFx3Hx1-4G is unlike any other family of zinc fingers described”. 58 m 3325 n “5:8: w BEE z 5.85.: a 83.85: c 32.0 c 533833“ a nagging» a £32; H n arena N u 23:55.5 “2 a Sena N. 53.2 > $33.3 u &.§%E m 5.25.32 3 EB > "5:3: E guy umm finexrfliamgficflow.Emsr E amnwozagmoe E mamzfifigg... a“ .35., E . .m§.mnmxgh.w_umc._.3p5 a“ M.* H““* iEmlmmfibH :n E r....m......m.m.o.593:¢ ...... e ......... we ............ 0:0 .................. m..30..nm..o=....m ................... Figure 38: Sequence alignment of human SNAP50 C-terminal amino acids (301- 411) with corresponding regions from SNAP50 homologues of other species. 59 Putative zinc fingers similar to TFIIIA (Hstmex3C) and steroid receptors (szszszzC) are indicated as region 1 and region 2, respectively. The sequence of highly conserved amino acids derived from this alignment corresponds to LX4GX6HX3CXHX20-23YPX1l—12CX2CX18PX3-4CX2CFX3HXl-4G- This alignment was performed using the Clustal W program. Homo sapiens (NPOO3075), Cannis familiaris (XP853813), Bos taurus (AAX08912), Mus musculus (NP084225), Rattus norvrgicus (NP001013230), Drosophila melanogaster (NP724647), Drosophila pseudaoobscura (EAL25490), Trypanosoma brucei (XP827295), Arabidopsis thaliana (AAO30067), Caenorhabditis elegans-l (NP500819), Caenorhabditis elegans-2 (NP497807), Plasmodium falciparum (), Danio rerio (XP694501), Leishmania major (XP843572), Anopheles gambiae (XP310411), Dictyostelium discoideum (XP644064), Entamoeba histolytica (XP653151). This comparison was prepared by Gauri W. J awdekarsg. 2.10.1. ICP-MS and FAAS analysis of mSNAPc variants SNAP50 contains a potential zinc finger domain. Since SNAP50 is the only subunit of SNAPc containing a Zn finger motif, the SNAP50-containing mSNAPcy4 and mSNAPc-73 were used in the metal analysis study to confirm the presence of a zinc atom. The complex was co-expressed in E. coli and extensively purified. The material was digested in 2% nitric acid for the inductively coupled plasma mass spectrometry (ICP- MS) or ashed for flame atomic absorption (FAAS) analysis. 1 zinc atom per molecule of the complex was determined with the ICP-MS method, where two isotopes 66Zn and 68Zn were quantified. The FAAS determination of zinc was in good agreement with the ICP— 60 MS study. Both mSNAPcy4 and mSNAPcy3 complexes were determined to contain 1 Zn atom per molecule of the complex, as they yielded 0.90 and 0.89 moles of zinc per mol of the complex respectively (Figure 39, Table 1). 1200 . 1000 - 800 . 600d Ila“) (ppb) 400 r 200 ~ 0 0.005 0.01 0.015 0.02 0.025 absorbance Figure 39: Flame atomic absorption spectroscopic analysis of mSNAPc'y3 and mSNAPcy4. Unlabeled data points represent the absorption obtained for standard zinc solutions. Two separately prepared samples were used for ICP-MS analysis and yielded 1.1 and 1.2 equivalents of zinc and are in agreement with previously determined zinc content that utilized the flame atomic absorption. Nickel was also measured during this experiment and the signal for this metal was on the level of the background noise and 61 insignificant. Nickel is therefore not present in the protein sample. Although two potential metal binding domains were identified in the C-terminal region of SNAP50 by ab-initio modeling, the presence of only one zinc atom was determined experimentally. It is possible that a second metal is still present and required when the complex interacts with the DNA (Table 1). Table 1: Quantification of zinc and nickel using ICP-MS and FAAS for mSNAPcy3 and mSNAPcy4. Two isotopes 66Zn (3) and (’an (b) were measured for ICP- MS. Two independently prepared samples of mSNAPcy4 were used in the analysis: (1) extensively purified complex used for crystallization and (2) void fraction from gel filtration purification of the complex. The protein concentration was determined by absorbance measurement in 6M urea and by the Bradford assay. Sample [Zn] ppb [Ni] ppb [fig/{:13} n(Zn)/n(prt) n(Ni)/n(prt) mSNAPcy4 (1) 31250.4 - 2.62 1.16 - b1268.5 - 2.62 1.14 - g - 17.9 2.62 - 0.02 E: mSNAPcy4(2) a1415.5 - 2.90 1.18 - b1443.6 - 2.90 1.17 - - 35.5 2.90 - 0.03 ‘2 mSNAPcy3 1519.7 - 3.86 0.89 - E mSNAPcy4 1827.3 - 4.95 0.90 — 62 2.10.2. Ab-initio modeling of SNAP50 C-terminus Since an experimental structure of SNAP50 is not available, computational methods were employed by Dr. Michael Feig to predict the structure of the C-terminal domain of SNAP50 so that the mechanism of SNAP50 function could be better understood. Because of the unavailability of a structural homolog, an ab-initio approach using the predicted secondary structure and the amino acid sequence was taken. This method is possible because the SNAP50 finger domain region is sufficiently short. Interestingly, one of the structures produces by this ab-initio method had a convincing arrangement of cysteine and histidine residues for coordination of one zinc atom, however some of the conserved residues could, although unlikely, also coordinate another metal atom like zinc, nickel or iron”. 2.10.3. Mutagenesis In order to examine the function of the zinc finger domain in SNAP50 the cysteine and histidine residues were changed to alanine. The presumption that the zinc finger domain is responsible for the DNA interacting function of SNAP50, was tested by the DNA binding assay. HA-SNAPSO with selected cysteine or histidine single point mutation to alanine was co-expressed with GST-SNAPl90(l-505), full length SNAP43 and SNAP19. The recombinant complexes were immobilized to the glutathione resin, washed extensively and released from the beads by the cleavage of the linker with thrombin. Since the presumption was that the mutation would only affect the binding of the complex to the DNA and not the actual formation of the complex, GST pulldown and co-immuno precipitation experiments were performed to confirm the existence of the 63 mSNAP complex. The complex was purified using the glutathione beads in the first step of the purification. Since only SNAPl90(l-505) contains a GST tag, all the other subunits were obtained through this subunit. Next, the partially purified complex, containing SNAPl90(1-505), SNAP43, HA-SNAPSO and SNAP19, was treated with anti-SNAP43 antibody cross-linked to agarose beads, thus allowing specific immobilization of the complex through SNAP43 only. The obtained material was tested for the presence of HA-SNAPSO by western blot targeting the HA tag on SNAP50. The presence of HA-SNAPSO, proves that different point mutations of the C-terminus of SNAP50 did not change the consistency of the complex (Figure 40) 59. rr-SNAPAS 1P Q3 V"r?'8’:<;g<<<<<<2555' °td.Curve c E 0- '4 J- 6 N ' x N " 8§~8*886 828.888.8888 maaoorozxrozoooooazx 1 2 3 4 5 6 7 8 9 10 11 12 13 .14 15 16 17 18 19 20 21 22 Figure 40: Mutations in the SNAP50 zinc finger domain do not disrupt SNAPc assembly. Approximately 20 ng of each of the SNAPcy4 complex containing substitution mutations in HA-SNAPSO were affinity purified first using glutathione agarose to pull down GST-SNAP190 (1-505) followed by immunoprecipitation with a-SNAP43 antibodies. Associated wild type or mutant HA—SNAPSO was detected by OL-HA Western analysis (lanes 6-22). A titration of wild type SNAPcy4 using 8, 4 and 2% of the input material is shown in lanes 1-3, respectively. Lanes 4 and 5 contain wild type SNAPcy4 recovered with the protein-G agarose beads alone or with pre-immune serum, 64 respectively. The bottom panel represents 4% of the input material directly analyzed by OL-HA Western analysis (experiment was performed by Gauri W. J awdekar)”. 2.10.4. DNA binding activity of SNAP50 mutants The mSNAPcy4 harboring HA-SNAPSO with different point mutations was tested for DNA binding activity. A PSE-containing DNA probe was used for the assay. Point mutations in the evolutionarily conserved ngCxH motif, H3l3—A, C317-A and H3l9-A resulted in reduced DNA binding as compared to the wild type. Interestingly point mutations C354-A, C357-A, C380-A and C383-A in the conserved CXZCx23Cx2C domain exhibited complete abolition of the DNA binding activity of the complex. Furthermore the mutation of the last evolutionarily conserved residue in the C-terminal area of SNAP50 the H388-A also resulted in reduced DNA binding activity (Figure 41) 59. mSNAPcy4 1 fl C-xg-C-----H-x3-C--x--H-xic-H---H-x2--C-\'12-H-x6-C-x2-C-x:9-C-x2~C-x2-C ----- x4 ----- H m C3028 C312-A 14313». C317-A P3197? 433371 H331-A casd-A H347-A c.354-A C357~A C377-A C380—A 3383A Rae‘s-A 1—358-A AAA‘AA‘AA‘AAAA‘A A “a...“ - qm 1* fl " 12 3 4 5 6 7 6 91011'213‘4151617181920212223242525272829333132333435 Figure 41: Mutations in the SNAP50 zinc finger domain disrupt DNA binding by SNAPc. Increasing amounts (3 and 10 ng) of SNAPcy4 with wild type (lanes 2 and 3) or 65 mutant HA-SNAP50 (lanes 4-35) containing the indicated substitution mutations was tested in an EMSA for binding to a radiolabeled DNA probe containing a high affinity PSE and TATA box (AC probe). Lane 1 shows the probe alone with no added proteins. Mutations H313-A, C317-A, and H319-A resulted in reduced DNA binding, whereas mutations C354-A, C357-A, C380-A, and C383-A completely abolished DNA binding activity. Mutation H388-A also exhibits weakened DNA binding activity (experiment was performed by Gauri W. J awdekar)”. 2.10.5. Transcription initiation assay Next, the wild type or mutant HA-SNAPSO containing mSNAPcy4 was tested in an in-vitro transcription initiation assay. HeLa cell nuclear extract was first depleted of mSNAPc with anti-SNAP43 antibodies and then transcription was tested in the presence of purified mSNAPCy4 containing either wild type or mutant HA-SNAPSO. As expected the mutations that showed abolished DNA binding activity also resulted in abolished transcription initiation for both U1 and U6 types of promoters. The addition of the mSNAPc with SNAP50 point mutants C354-A, C357-A, C380-A and C383-A in the conserved szCx23Cx2C domain were completely unable to restore transcription for both types of promoters (Figure 42; lanes 13, 14, 16 and 17). Interestingly, the mutations on the Hx3CxH motif, H313-A, C317-A and H319-A, resulted in complete inability to restore transcription from the U1 snRNA promoter. Transcription, albeit weakened, was still evident for the U6 system (Figure 42; lanes 6, 7 and 8). The SNAP50 H388-A mutant was capable of wild type transcription from the U6 promoter, but its activity was severely reduced in the U1 system (Figure 42; lane 19; experiment was performed by Gauri W. J awdekar)5 9. 66 mSNAPc/4 < <|Z < < <( < < < < <( < < < < < < }_ (\I or c6 K 0') (:3 J- 4: K tr K K c3 (0 to oo 0 ~— .— .— - <9 6) to v to to K 60 co co 00 (I) g co co co co co co co co m to co co co co co to - O O O I 0 I I I 0 I O O O O O (I I flue-5:3 ‘fl‘m‘ Iii _’ ~93 4115‘ . fill-*‘m 2* five ‘7 ‘IJ .‘-= 1' ‘ 1 2 3 4 5 6 7 8 9 10 11 12 1314 15 1617 1819 Figure 42: U1 and U6 in-vitro transcription initiation assay of wt or mutant HA- SNAP50 containing mSNAPcy4. Mutations that resulted in reduced DNA binding also showed reduced transcription initiation activity for both types of promoters, with the exception of the H388-A mutation, which resulted in weakened U1 activity, but wild type U6 activity of transcription initiation59. HeLa cell nuclear extract was depleted with 01- SNAP43 antibodies to immunodeplete endogenous SNAPc. In vitro U1 and U6 transcription was then tested in the absence (lanes 1) or presence of purified SNAPcy4 (5 ng) containing wild type SNAP50 (lane 3) or mutant SNAP50 with the indicated alanine substitutions (lanes 4-19). Addition of GST alone did not reconstitute either U1 or U6 transcription as shown in lane 2 (experiment was performed by Gauri W. Jawdekar)”. 2.11. Crystallization of mSNAPc variants The mSNAPcy3 and mSNAPcy4 complexes obtained from the GST and ion exchange purification schemes was each concentrated to approximately 4.5 mg/ml and screened for crystallization using various crystallization screens, namely the Crystal Screens I and 11 (Hampton Research) and the PEG/pH grid screen. At room temperature, needle clusters were obtained in a variety of crystallization conditions. Most conditions 67 had magnesium chloride present as a salt additive and a PEG as a precipitating agent. The crystals grew in a wide range of pH; from pH 7 to pH 10. The crystals did not appear to be single under the light microscope and the size was insufficient for X-ray diffraction studies (Figure 43). 111 1111111111111“ 1 111111113. 111111 1111““ 11‘11'1““‘ 1111 1111‘ 1‘11 111111111111111“““‘1‘1“111111111111“‘31 11111111111 3 111111111 111 1111111111111111111111111111111""“‘ 1‘1 . ‘ . 1111 111 " ‘1 WW 1‘ 31111 1’ “ ‘ 111. ‘ 111“ 3311‘ 1111- 11111111111'1‘1““‘ , 3“ “11'" ‘ 11.3,.1. “‘ ‘ y1".‘11311111 , :311‘11‘1 1111111 11111111 111111111111 1 ‘ ‘11 33‘ 1.1 11111111: ~1‘Ii131h‘ ‘ ‘ 1. 11111-111 . 1111111111”1 11 1111111111 11:1" 11' 11 3 1 .1‘111111‘1 11'“ 111 1 11"“11 1111“" 1‘ “11 1.1111 111111111111111111111111 1111111111111111111111111111111111 11 1111111111 131111111111111111111111111111“!11 1 1 1 ' 31.11 11111111111! 111111111”111l|1111111|111111111 Figure 43: Crystals of mSNAPcy4 as observed under the light microscope grown in 9% PEGSOOOMIVIE, 100 mM Tris pH 8.5, 100 mM NaCl and 100 mM MgCl2. 68 Figure 44: Optimized crystals of mSNAPcy4 as observed under the light microscope grown in 8% PEGSOOOMME, 100 mM Tris pH 8.4, 100 mM NaCl and 100 mM MgC12. Optimizations of the crystallization condition eventually lead to a condition where the crystals were larger, but the quality was not sufficient for X-ray crystallographic studies due to small size and morphology (Figure 44). Additive screening and replacement of all components of the crystallization condition with their analogs were tried. The effect of temperature on the crystallization was also investigated, by attempting to grow the crystals at 4°C. The crystals failed to appear at lower temperature even after extended periods of time. Further crystallization optimizations were unsuccessful, therefore micro and macro seeding experiments were performed, again without success. 69 2.12. X-ray diffraction of mSNAPc crystals A needle cluster was cryogenically protected and tested for diffraction at the synchrotron facility. Rings at low resolution were observed, similar to a powder diffraction pattern. The result was surprising, since the crystals forming the needle cluster were relatively small. The conclusion that can be drawn from this experiment is that the crystals, albeit small and non-single, still diffract strongly (data not shown). A single crystal of larger dimensions and better quality could potentially diffract strongly and to high resolution. 2.13. Evaluation of mSNAPc crystals using transmission electron microscopy (TEM) Since all crystallization optimization efforts did not result in improved crystal size and quality, and neither did the seeding experiments, we decided to observe the crystals under the TEM and investigate the morphology of crystals. The crystal structure of a protein of this size can not be determined using electron diffraction, but some interesting crystal parameters, like the unit cell dimensions, could. The crystals were loaded on the EM grid and observed under TEM. Some crystal like formations were observed, but they were destroyed by the powerful electron beam. In order to be able to observe the crystals under the TEM, they were first cross linked with a formaldehyde and glutaraldehyde mixture, washed with water and then evaluated with the electron microscope. 7O The crystals observed were surprisingly not needle like, but appeared rectangular in nature and needle formations were built out of the small rectangles. What appeared to be needle clusters under the light microscope were actually large aggregates of small crystals (Figure 45, Figure 46). IIIIIIIIIIIIIIII. . I II IIIIIIIIII In” E NIIIIIIIII “I“ II III IIII‘I I V miIIII iIIIIllIn I. I I 'IIIIII I‘IIIIIIIIII I I I ‘IIIIII M .IIIIIIIIII _ " III M‘ IIIIIIIIIIII I IIIIIIIIIIII lIIlIIII . "‘“""““""""'f“““I v IIIIIIIIIIIIIIII nil" IIIIII‘III I I' IIIII II IIIIIIIIIII IIII :‘ I“ II I IIIIDIIIIII “III I I II “"II-I‘II 1II 'III'IIII III III I “"1th IIIIIIII- III MI?“IIIIIIIIIIII‘III‘I‘III‘I‘INI‘ I ' Figure 45: Needle clustered crystals of mSNAPcy4 under the TEM have a heavily branched appearance and the branches are even further sectioned. 71 I I III . III III I III III-W III“ III II III III“ III IIIIII III” III III IIIIIIIII WIIIIIIIIIIIII IIIIIIIII IIIIIIIII PM“ “1‘ IIII'I'II III!“ II‘III IIII‘II‘IIIIlIl‘lfi‘I.I m‘lm‘MI‘IiIII‘ III I IIIIIIIIIIIIIWII IIII‘III ‘ I I I‘ II M" ' : III III IIIIIII II I‘ IIIIIIIIIII IIII I‘III‘I‘I‘.IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII II I .I - é .‘IIIIIIYII “WI . I Figure 46: Crystals of mSNAPcy4 under the TEM. Close—up on one of the branches showing a heavily fragmented formation. 72 I‘II» III ”llhlllllfi'lfllwil " "'I'IIIIMIIIIIIIIMM 17,. ‘ fillrywl”, i '1 . A \ I .. ‘ Figure 47: Circular diffraction pattern typical for a powdered material. When observed in diffraction mode, these crystal clusters diffracted with powder diffraction like pattern as can be seen by the appearance of the circular pattem (Figure 47). 2.14. Diffraction of mSNAPc crystals using TEM A few small single crystals were found under the TEM (Figure 48) and the diffraction aperture was set to select only one crystal. These crystals diffracted like a 73 single crystal and a different diffraction pattern was observed when the grid with the crystal was rotated for approximately 0.5 degrees (Figure 49). . ,mum: w ." “(mull-n! ' l». .Il"""|‘ .1 .Illlil‘m’e‘ ' “1w"- ‘l‘EI‘ pulp l" .liill muratw‘” NW Kill“ ”30%” 11.“,11llll“‘31“.lulml M . 1 .1? “a“"Qt“;mw{HW'UHMWJ", »1 MW 1;,Liullv1l'lll‘ ‘ w 1 liiil‘u'llml 1 mu .5 Figure 48: TEM image of mSNAPcy4 single crystals as observed under TEM. 74 A B Figure 49: TEM diffraction images obtained from a single crystal. Image B was obtained from a crystal rotated approximately 0.5 degree with respect to the crystal from image A. Next, mSNAPcy4 was mixed with PSE-containing DNA and crystallized under similar conditions. The crystals had a similar appearance as the crystals of the complex alone as observed under the light microscope. Crystals were cross-linked and analyzed with electron microscopy. The appearance of these crystals was improved compared to the crystals of mSNAPcy4 alone, but the crystals were still fragmented and not single. Branch like single crystals were observed, emerging from a stalk (Figure 50). When the electron beam was focused on a single branch, a diffraction pattern, typical for a single crystal, was obtained (Figure 51). 75 IutmIIIIIII HMWIIIIII ‘ IIIII‘.‘ 1II1‘ ' II 1 I"I III.“ I Ii'Ii1 ‘II‘IIIIIIII -"I ||IIIIIIIII‘ ‘ IIIINI IIII-II IIII. .[IIIIIl II III” .IlIIII‘IIII IIIIIII I III I II III ..I III“ IIIIIIII‘I' I II'“ III‘ IIIII-I II II' IIIIIIIIIIIIIIIIIIIIIIII IIIIIIIIImI IIIII|IIIIIII IIIIN'IIIIIIIIIII‘IIuII ‘ IIIIIIIII‘II IIIII‘IIIIII :II III" IIIIIIIIII III I IIIIIII"I JIIIII'I‘II , "IIIIII ‘IIIIIIIIIIIIIIH IIIIIII I ‘ II [IIII’II ,NIII' IIIHPI II'IIIIIIII I. MIMIIIIIIIIIIII I I IIIIIIIIII'IIIIIIIIIIIIIIIIIIIII I III 1 II III II III" IIIIIIIIII I IIIIIIII IIIII IIIIIIIIIIIII I-IIIIIIIII'” IIIIII IIIIIIIIIIIIIII"llllwl‘lIl‘IlIml:“m “m“ i . ‘ l IIIIIIIIIIII Illlmwmmmm ‘III‘II: m" IIIIIIIIIIIII'IIIIIII IIIII IIIIIIIIIIIIIIIIIIIIIIII I II I IIIII II I IIIIII IIIII IIIIIIIIIIIIWWIIIIIIII WIIIIIIIIII I‘III IIIIIIII III IIII‘ IIIII WIIIIIIIII I IIIIIIIII II III III . IIIIIIII ‘ I 'II1III"“I III‘IIlI .‘II I» ,1 ‘I ‘I‘ 1. III II‘ “I Figure 50: TEM image of mSNAPcy4/DNA crystals. 76 III Figure 51: Diffraction pattern from a single branch obtained under TEM. 2.15. Purification optimization of mSNAPcy4 Although the material that was used for crystallization and activity assays is relatively pure according to the SDS-PAGE stained with Commassie, the inability to grow high quality crystals suggests that further purification of the complex is required. Both anion and cation exchange resins were used in the attempts to prepare a more homogeneous material. The success of such purification was rather disappointing, since the protein eluted in an extremely broad range of the salt gradient. Modifications to the buffer used in this purification, with variations in pH of the buffer and additives, were 77 made, without much improvement in the purity of the complex. Due to the fact that the complex is eluted from the column in such a broad range, the efficiency of the ion- exchange purification is low. The complex is eluted in approximately 50% of the fractions, or more specifically in a salt gradient from 150 mM to 650 mM (Figure 52). 78 100% B — — -—I ' — —-I 0 _ .7: o ‘- -' _§~ . 2 I: .‘ 5‘3 .9 ,' £- 35 .' g) i 83 SO°/B - —"3 ‘55 o 3 m8 . 0° B I /° llllllllllllllllllllll 1 5 1O 15 20 ‘ Fraction Figure 52: A representative elution profile for ion-exchange purification (Source- S) of mSNAPcy4. The solid line represents the relative absorption intensity of the eluted protein material and the dotted line represents the salt gradient. 79 Since ion exchange chromatography using SourceQ or SourceS resin (Pharmacia) failed to accomplish an improvement in the purity of the material, gel filtration chromatography was employed next. Partially purified mSNAPcy4 obtained from the affinity purification was concentrated either by centrifugation or by ammonium sulfate precipitation to enable efficient detection during the purification run. The complex was purified in the absence of the detergent using the size exclusion gel filtration and two peaks were observed in the chromatogram. The first peak that eluted in the void contained mSNAPcy4. Interestingly, the second peak with the apparent molecular weight of 150 kDa contained the complex as well. The majority of the protein material submitted to gel filtration eluted in the first peak and only a fraction (approximately 5%) of the total material was eluted in the second peak. SDS-PAGE analysis of all fractions confirmed the presence of the complex, with some impurities being equally represented in both peaks (data not shown). The presence of two peaks confirmed that the complex is not homogenous and an experiment was designed to determine how the removal of components from one peak would affect the behavior of the other. The significant difference in the apparent size of the two peaks and the fact that the first peak eluted close to the void volume suggested that the complex might be forming aggregates. Several gel filtration runs were performed and fractions were combined from the runs and concentrated to approximately 5 mg/mL. Crystallization experiments were set up with these fractions to determine which of the complexes is required for successful crystallization. Interestingly crystals formed first in the material from fraction 12, which corresponds to the second peak with the material of the apparent size of 150 kDa. mSNAPcy4 obtained only with the use of affinity 80 chromatography was used as a positive control and also yielded crystals, but slightly later than fraction 12. More interestingly fractions 9 and 10 never yielded crystals and fraction 13 yielded crystals that were of better quality than those obtained from fraction 12 or the control. It is obvious that the material with the apparent size of 150 kDa (second peak) is required for crystal formation. Unfortunately this material is also a minor part of the total material obtained from the affinity chromatography, which consisted mostly of the aggregated material. Furthermore the separation of the two peaks is poor, since the second peak is only a shoulder of the first peak on the gel filtration run (Figure 53; A). In order to obtain homogeneous material for crystallization, large quantities of mSNAPcy4 would need to be purified and then the two major components would have to be separated on the gel filtration column. The separation would also be further complicated due to the fact that the two peaks are not well resolved and several sequential gel filtration experiments, each enriching the complex in the second peak, would need to be performed. Such a purification scheme would be relatively complicated and would yield little material suitable for crystallization from a large amount of the mixture. A pilot experiment was performed to determine the efficiency of the separation of the two peaks and also to check whether the two forms of the complex are in equilibrium. Fraction 10, which corresponds to the peak in the void and is presumably an aggregated form of the complex was concentrated and analyzed on the gel filtration column. The material eluted in the void volume as before, without any apparent presence of the seCond peak. Fractions 12 and 13, which correspond to the second peak that is presumed to be a hetero tetramer, were pooled and analyzed in the same manner. This material yielded a chromatogram where the first peak is under-represented compared to 81 the second peak resulting in material which is highly enriched compared to the previous gel filtration separations (Figure 53). Removal of the fractions containing the aggregate again would most likely yield a highly homogenous hetero tetramer mSNAPcy4. 82 C I I I I I 1 I I I I I I » 1“ WM 7 8 9 1011121314151617 Figure 53: Gel filtration chromatograms of mSNAPcy4 purified with affinity chromatography (A); mSNAPcy4 rerun fraction 10 (B); mSNAPcy4 rerun fractions 12 and 13 (C). The first peak in fractions 9 and 10 corresponds to 650 kDa and is also the 83 void. The second peak corresponds to a protein with an apparent molecular weight of 150 kDa according to the gel filtration standards (Biorad, data not shown). Due to the fact that the un-aggregated mSNAPcy4 is under-represented (approximately 5% of injected protein material) in the starting mixture and the laborious purification scheme yielding low amounts of homogenous material (approximately 5% of the total injected protein material), the amount of un-aggregated mSNAPc must be increased. In order to change the ratio of the two peaks a few approaches were taken. First, since the ICP-MS metal analysis of the gel filtration fraction 9, which corresponds to the aggregate peak, yielded 0.5 mol of iron per 1 mol of protein for fraction 9 and 0.3 equivalents of iron for the mixture (data not shown), it seems plausible that the iron is responsible for aggregate formation. The aggregate fraction is enriched in iron and it is possible that the iron binds to the putative second metal binding site in SNAP50. Since the mixture contains less iron, it is also possible that the un-aggregated material contains less iron. The ab-initio modeling suggested that zinc, nickel or possibly iron could occupy the tentative metal binding site in SNAP50. Therefore, supplementing iron into the mixture of the two forms of the complex could result in complete aggregation of the complex and in contrast supplementation with zinc or nickel could result in a reduction of the aggregated material. Chelating the iron with EDTA could also potentially increase the amounts of the desired form of the complex. To test this assumption, mSNAPcy4 samples obtained from affinity chromatography were incubated with different metal ions or with EDTA and then analyzed with size exclusion gel filtration. Interestingly, none of the 84 additives tested had any relevant effect of the ratio of the two peaks. The effect of the nickel supplementation could not be tested, since nickel was reduced upon mixing with the sample. The resulting elementary nickel could potentially damage the gel filtration column and was therefore not tested. 10 mM and 100 mM metal ions or EDTA were tested, but again resulted in no visible effect on the ratio of the two peaks suggesting that the formation of the aggregate is not related to metal content G:igure 54). 85 .Oh’—._.— —O—'—. - - .x-r-t-auvwavav'ra‘mz—a . V(mL) 6 7 8 910111213141516 Figure 54: Gel filtration chromatogram of mSNAPcy4 (A) supplemented with zinc (B), iron (C) or EDTA (D). 86 None of the additives used in the gel filtration separation had any effect on the relative ratio between the aggregate and the hetero-tetramer. In order to efficiently separate the two peaks, a gel filtration matrix with larger pores (Sephacryl-300) was used. All aspects of the purification remained the same as with Sephadex-ZOO gel filtration media used previously. The pores on this gel filtration matrix are approximately 50% larger compared to Sephadex-ZOO and the hypothesis was that the complex should migrate slower and therefore separate from the void peak. The hetero-tetramer peak actually separated and the purification yielded almost baseline separation. Fractions containing the hetero—tetramer were pooled, concentrated and injected to the same gel filtration column again. This time the two peaks were separated completely and the aggregate peak was present as a minor component of the binary mixture. It should be noted that both gel filtration runs were performed in a buffer without the detergent Tween-20 and the material could be efficiently concentrated using the Centriprep spin concentrators. The peak that corresponds to the void could only be concentrated in the presence of the detergent (Figure 55). 87 o o 0. o. ... .'°'o .00‘ '9Ooo|-000|0". Fraction 123456 7891011121314151617181920 Figure 55: Gel filtration of mSNAPcy4 using Sephacryl-3OO size exclusion beads. (A) Biorad molecular weight standards (a=670 kDa, b=158 kDa, c=44 kDa, d=17 kDa), (B) GST purified mSNAPcy4, (C) fraction 9-11 from the first gel filtration rerun using the same column, (D) fractions 9-11 from the second gel filtration rerun (e: void peak, f: mSNAPcy4 heterotetramer). Fractions were 5 mL. 88 Highly purified and homogenous material that was obtained by two successive runs on Sephacryl—3OO gel filtration matrix and was concentrated in the absence of the detergent with high recovery rates. Crystallization trials were started using the sparse matrix formulations and many conditions were found where the protein crystallized. Compared to previous preparations of the complex, the new approach resulted in protein material that gave crystals of improved quality. Although the crystals were not significantly larger the quality was greatly improved. Crystals no longer formed needle clusters but resembled single needle-like or orthogonal crystals (Figure 56). The number of successful crystallization hits was also increased drastically compared to the previous preparations of mSNAPCy4. Furthermore, the purification protocol was modified and Tween-2O is not required in any step of the purification. Although the amount of protein material that is immobilized on the GSH resin is reduced compared to the purification scheme where Tween-2O is used, the homogeneity of the complex is improved. The relative amounts of heterotetramer are comparable in the two purification approaches, but the amount of the aggregated material is somewhat reduced when no Tween—20 is used. Due to improved separation of the two peaks in the gel filtration step when Sephacryl-BOO is used the recovery of the heterotetramer is also improved. 89 111 1 1 w which” 1111"“ 1“ 111' 1111' ,1. I“ 1.1 ”1‘“ 1M: ngx I‘ 11‘ u ' 1 1"“ I“ 1 1. ‘ ' '111'1 11111131111'] 11 1 1 111" 11 1‘." .1 1111111 \11“\ \‘1‘ 1* 111 ”‘11, 14'}- :— .«i I II I ll 1‘111“,{,‘11111\11 ~— J #1";- . ;;_ fi‘i/~ ,_. . [fizz-:yz-L -:;-o 9 4, _ 4:}; ,.- V ‘1 - . . Jr" .. m4",- ' fl. ’4 ,— m" — —r ”f gig—1"; ,— ’2. .— 5.; fl 1 11‘ 1111. _ 11111111 1‘11 %\111\‘|'\m’ 1H3" ‘11 1 “I 1111‘ let" 11111\k1112m3111llls111111111313 11 11111111111111111111111111111 l 11 m1 1“ 1111x11111ll‘1mulxl1‘111 £1151“ l\ll I '1 1 1 1. 1 1 1 111111111 5“ ‘7' . . 1.11.11 1 , ‘1"‘1‘1111 11.111111111111111 ‘— a d/.‘ ,v-' {I I, “2. __._;==- ,— ,—-__. '7': 3:1 ’-’- fl .5357; d. i" w. 5‘. ’ J. ‘ ‘1 ‘ ' ' ._ i . . ’. 1. .- w “”1111 11111 ‘ 1111 3,.——- ’J" ,g pr. 4.4,» '— Figure 56: Crystals of mSNAPcy4 grown in 100 mM Tris pH 8.5, 10% PEG4000, 100 mM NaCl and 100 mM MgC12. Protein material used to obtain the crystals was prepared by successive gel filtration purification steps. 90 3. Methods and materials 3.1. Preparation of expression plasmids 3.1.1. Cloning of the SNAP50 subunit Using the primers 5’-TCAGCCATGGCTGAAGGAAGC-3’ and 5’— AGAGCTCTTAATFAAAGGTFCCAGG-3’, which correspond to the 5’- and 3’- noncoding region of human SNAP50, respectively, we amplified the SNAP50 DNA from a pGSTSO plasmid by standard PCR reaction. The PCR product corresponding to SNAP50 was cleaved and inserted into the first multiple cloning site of the pCDFDuet-l polycistronic expression vector (Novagen) via Ncol and Sac] cloning sites. The expression vector pCDFSO-l was transformed into E. coli DHSOt competent cells, and plated on LB plates supplemented with streptomycin (20 ug/mL). 3.1.2. Cloning of the SNAP43 subunit Using the primers 5’-GGAATTCCATATGGGGACTCCTCCCGGCCTGCA-3’ and 5’-GAGGATCCTCAGTGTT'ITCTCCTCTTC’ITGGATGC -3’, which correspond to the 5’— and 3’-noncoding region of human SNAP43, respectively, we amplified the SNAP43 DNA from a pGST43 plasmid by standard PCR reaction. The resulting PCR product corresponding to SNAP43 was digested with NdeI and BamHI and inserted into the pRSFSO-l expression vector via NdeI and Hg”! cloning sites. The expression vector pCDF43/50-1 was transformed into E. coli DHSOt competent cells, and plated on LB plates supplemented with streptomycin (20 ug/mL). 91 3.1.3. Cloning of the SNAP19 subunit Using the primers 5’-CTCACCATGGTGAGCCGGCTTC-3’ and 5’- AAGGATCCTTAGGAATCTGATTCTTC-3’ which correspond to the 5’- and 3’- noncoding region of human SNAP19, respectively, we amplified the SNAP19 DNA from a pGSTl9 plasmid by standard PCR reaction. The resulting PCR product corresponding to SNAP19 was inserted into pRSFDuet-l (Novagen) via Neal and BamHI cloning sites. The expression vector pRSF 19-l was transformed into E. coli DHSOL competent cells, and plated on LB plates supplemented with kanamycin (SO ug/mL). 3.2. Preparation of expression host 3.2.1. Preparation of competent cells for heat shock The E. coli BL21-CodonPlus(DE3)-RIL were chosen for the over-expression of SNAP subunits and for co-expression. The cells were made competent by calcium treatment and the procedure is done on ice; 50 mL culture was grown until the OD600 (optical density) was 0.3 to 0.4 and the culture was cooled on ice for 5 minutes. The cell pellet was collected by centrifugation, resuspended in 4 mL ice-cold FSB (Frozen Storage Buffer; 10 mM potassium acetate, 100 mM KCl, 50 mM CaC12 and 10% glycerol; pH 6.2) and kept on ice for 20 minutes. The cell pellet was again collected by centrifugation and resuspended in 3 mL FSB. Aliquots of the desired volume were prepared, frozen on dry ice and stored at -80°C. 92 3.2.2. Preparation of competent cells for electroporation The procedure for preparation of electro competent cells was done on ice. Cells were spun at 3000xg. 50 mL cell culture was grown until the OD600 of 0.5. The cell pellet was collected by centrifugation and resuspended in 25 mL of milliQ water twice. The cells were harvested by centrifugation and resuspended in 10 mL 10% glycerol in milliQ. The cells were spun again and resuspended in 0.25 mL of 10% glycerol in milliQ. The resulting suspension of cells was stored in aliquots and frozen at -80°C. 3.2.3. Preparation and use of a glycerol stock Cell colonies harvested from freshly grown plates were used to inoculate 50 mL of LB media supplemented with the appropriate antibiotic. The cells were grown for 12- 16 h at 37°C. Glycerol was added to the culture to bring the concentration of glycerol to 20%. Cells were incubated for l h at 37°C, aliquoted and stored at -80°C for long term storage. To use a glycerol stock, an aliquot was thawed at room temperature and 100 uL to lmL of the cells were used to start a 50 mL LB culture, which was then grown for 12-16 h for further use. Glycerol stocks prepared this way are viable up to at least 2 years. 3.2.4. Transformation into the cloning bacterial host DHSa competent cells were thawed on ice and 1-5 uL of the DNA was added, mixed gently and the cells were placed on ice for 10 minutes. The cells were then placed 93 in a heated bath at 37°C for 40 seconds and then immediately placed on ice for 2 minutes. 900 uL of LB was added and the cells were incubated at 37°C for l h while shaking. 100 uL of this mixture was plated on agar plates supplemented with the appropriate antibiotic. 3.2.5. Transformation into the expression bacterial host Codon Plus RIL BL21(DE3) Codon Plus RIL competent cells were thawed on ice and 1-2 uL of the DNA were added, mixed gently and the cells were placed on ice for 20 minutes. The cells were then placed in a heated bath at 42°C for 20 seconds and then placed immediately on ice for 2 minutes. 900 uL of LB was added and the cells were incubated at 37°C for 1 h while shaking. 100 uL of this mixture was plated on the agar plates supplemented with the appropriate antibiotic. Chloramphenicol at 50 ug/mL is required for the cells to retain the Codon Plus Rm plasmid. 3.2.6. Transformation into the co-expression bacterial host The cells that were used for co-expression contained a single plasmid after the first transformation and were made competent by calcium treatment. Next, the cells were thawed on ice, mixed with 1-2 uL of plasmid DNA and placed on ice for 15 minutes. The cells were placed on a heated plate at 42°C for 45 seconds and then immediately returned on ice for 2 minutes. 900 uL of LB was added and the cells were incubated at 37°C for 1 hour and then plated on agar plates supplemented with the appropriate antibiotic mixture. The order in which the plasmids are transformed into the cells is irrelevant and eXpression levels are not dependant on the order in which the plasmids were transformed 94 (data not shown). Usually the pGSTl90(l-505) was transformed first, followed by pCDF43R/50-l (to make mSNAPcy3 producing cells) and finaly pRSTl9-1 (to make mSNAPcy4 producing cells). 3.2.7. Electroporation Electro-competent cells were thawed on ice and 20 uL of the cells were mixed with 1-2 uL of the DNA. The mixture was transferred into a sterile electroporation cuvete. The cells were then pulsed with 2000 V using a pulse electroporator (Biorad), 480 uL of LB was added and the cells were incubated at 37°C for one hour and plated on agar plates supplemented with the appropriate antibiotic mixture. 3.3. Growth media 3.3.1. Preparation of culture plates To make 1 L of agar media for plates, 15 g of Tryptone, 5 g of yeast extract, 5 g of NaCl and 15 g of agarose were mixed and dissolved in 1 L of water. The solution was autoclaved. When the autoclaved solution was sufficiently cooled (60°C) the appropriate antibiotic was added and the media was poured into Petri-dishes and allowed to cool and harden. Prepared culture plates were stored at 4°C until use. 95 Table 2: Combinations of antibiotics used for preparation of agar plates. AMP, KAN, STR and CAM represent ampicillin, kanamycin, streptomycin and chloramphenicol respectively. Concentration of antibiotics used for preparation of agar plates [uglmL] Antibiotic used Ampicillin Chloramphenicol Streptomycin Kanamycin AMP 100 - - - KAN - - - 100 STR - - 20 - AMP/CAM 100 50 - - AMP/CAM/ STR 50 50 15 - AMP/CAM/ KAN 50 50 - 15 AMP/CAM/ KAN/STR 50 50 10 10 3.3.2. LB growth media To make 1 L of LB media, 15 g of Tryptone, 5 g of yeast extract and 5 g of NaCl were mixed and dissolved in 1 L of water. The solution was autoclaved. Before use appropriate antibiotic was added (Table 2). 3.3.3. TB growth media To make 1 L of TB media, 12 g Tryptone, 24 g yeast extract and 4 mL glycerol was mixed and dissolved in 900 mL of water and autoclaved. Before use 100 mL of salt solution (2.31 g KH2PO4, 12.54 g K2HPO4) was added to the broth. 3.3.4. M9 minimal media To make 1 L of M9 minimal media 4 g of (NH4)ZSO4 or 1 g NH4Cl, 13.4 g of NazHPO4, 6 g KH2PO4, 4 g NaCl and MgSO4 were dissolved in water. 200 uL of the 96 trace elements solution (10 mM FeCl3, 10 mM ZnClz and 100 mM CaC12) was added and the mixture was autoclaved. Before use 20 mL of 20% autoclaved glucose, 20 mL of the Basal Eagle vitamin medium and the appropriate antibiotic were added. 3.4. Over—expression 3.4.1. Over-expression of individual SNAP subunits Cells harboring the plasmid with the sequence for the desired protein were grown in LB medium supplemented with the appropriate antibiotic (100 ug/mL ampicillin). The 50 mL cultures were started from a freshly plated plate or from 3 glycerol stock and were allowed to grow at 37°C for 12-18 h. The cell culture was then transferred to 1 L LB containing flasks and allowed to grow at 37°C until 0.6 to 0.8 optical density. The cultures were induced by the addition of 1 mM IPT G and grown at the appropriate temperature (16°C for SNAPl90(l-505), SNAP50 and SNAP43, and 23°C for SNAP19). The cells were grown for 12-18 h. The cells were collected by centrifugation and stored at -20°C until needed. 3.4.2. Co-expression of mSNAPcy3 and mSNAPcy4 Co-expression was performed virtually the same way as the expression of individual subunits. The only difference was the amount and the type of antibiotic used (Table 3). 97 Table 3: Antibiotic concentrations used in individual and co-expression systems Concentrations of antibiotics used for growth in LB or TB medium [uglmL] Protein construct in BL2 1 (DE3) Codon Plus RIL Ampicillin Chloramphenicol Streptomycin Kanamycin GST-SNAP190( 1 -505) 100 - - _ GST-SNAP50 100 - - _ GST-SNAP43 100 - - - GST-SNAP19 100 - - _ MSCy3 50 50 20 - MSCy4 50 50 20 20 3.5. Purification 3.5.1. GST purification 3L of cells harboring the over-expressed protein material were thawed on ice, resuspended in 80 mL TEMGT-250 (25 mM Tris pH 7.9, 2 mM EDTA, 12.5 mM Mng, 10% glycerol, 0.1% Tween-20, 250 mM KCl and 3 mM DTT) and sonicated (3x 1 minute pulse, 1 minute pause). The cell extract was clarified by centrifugation, mixed with the glutathione (GSH) agarose beads (Amersham Pharmacia) and allowed to shake for 12 h at 4°C. GSH beads were washed with TEMGT-250 by centrifugation, transferred to a glass column equipped with a frit and washed with TEMGT-250 until the flow-thru showed no detectable protein matter by Bradford assay. 20 mL of TEMGT-250 were mixed with the resin and 50 u of thrombin were added to remove the GST tag. Cleavage was allowed to proceed to completion (12-20 h) and the protein material was eluted off the beads by several consecutive TEMGT—250 washes. This method was used to purify the individual subunits and the co-expressed material. 98 3.5.2. GST purification in the absence of a detergent 3L of cells harboring the over-expressed protein material were thawed on ice, resuspended in 80 mL TEG-250 (25 mM Tris pH 7.9, 2 mM EDTA, 10% glycerol, 250 mM KCl and 3 mM DTT) and sonicated (3x 1 minute pulse, 1 minute pause). The cell extract was clarified by centrifugation, mixed with the glutathione (GSH) agarose beads (GE) and allowed to shake for 12 h at 4°C. GSH beads were washed with TEG-250 by centrifugation, transferred to a glass column equipped with a frit and washed with TEG- 250 until the flow-thru showed no detectable protein matter by Bradford assay. 20 mL of TEG-250 were mixed with the resin and 50 u of thrombin were added to remove the GST tag. Digestion was allowed to proceed for 6-8 h, while monitoring the cleavage progress by SDS-PAGE. Once the protein was completely digested 1 mM PMSF was added to stop the digest and the protein material was eluted off the beads by several consecutive TEG-250 washes. 3.5.3. Ion-exchange purification The protein material obtained after the GST purification step was diluted with water, until the salt concentration was below 50 mM (5 fold) and loaded to ion exchange resin (Source-Q or Source-S, GE). The protein was then eluted with a 50 mM to l M NaCl salt gradient (50 mM or 1 M NaCl, 20 mM Tris pH 7.5, 10% glycerol, 2 mM EDTA, 12.5 mM MgC12, 0.1 % Tween—20 and 3 mM DTT). The purification was performed with 2 mL of resin, 5-8 mg of protein material, flow rate was 4-6 mIJmin, and the FPLC instrument used was by Amersham Pharmacia. 99 ”'1'. -LJ 3.5.4. Gel filtration purification The partially purified protein material was concentrated to a desired concentration (0.5-5 mg/mL) and injected onto the gel filtration matrix. A variety of buffers were tested for this purification with relatively successful results. The formulation of a gel filtration buffer that works well is 10 mM Tris pH 7.5, 150 mM NaCl, 10% glycerol, 0.05% Tween-20 and 3 mM DTT. For the Superdex-200 (GE) and Sephacryl-300 (GE) the flow rates were 1 mIJmin and 0.5 mUmin respectively. 3.6. SDS-PAGE Sample solution was mixed with an equal volume of 2xSPRS buffer (30 mM Tris HCl, 2.5% glycerol, 6% SDS, 0.005% Bromophenol Blue, 0.025% B-mercaptoethanol) and placed into a bath with boiling water for 1-2 minutes to denature the proteins. The samples were loaded in the well portion of the stacking gel (4% acrylamide) and separated at 220 V for 50-60 minutes. The resolving portion of the gel was 9-15 % acrylamide, 1% SDS in Tris HCl pH 7.5. Proteins on the gel were stained with Commassie (4% methanol, 10% acetic acid and 1% Commassie blue in water; Commassie blue must be dissolved in methanol first, then mixed with acid and finally diluted with water due to the low solubility of the dye in water). The gel was destained to reveal the positions of the protein material with DESTAIN (5% acetic acid, 20% methanol in water). 100 Destained gels were soaked in water for 30 minutes, then soaked in 5% glycerol for 30 minutes and dried between cellophane sheets for 24-48 h, to preserve the gels for long term storage. 3.7. Western blotting Protein samples were separated using SDS-PAGE. Two filter paper sheets were placed on the bottom of the transfer cell (Biorad). Semi-dry buffer (50 mM Tris, 40 mM glycine, 0.037% SDS and 20% w/v methanol) was added and any air was removed by rolling with a cylinder. The gels were then placed on the prepared filter paper and more semi-dry buffer was added. The membrane was cut to fit the gel, soaked in the semi-dry buffer and placed on top of the gel carefully. 6 sheets of filter paper were soaked in semi- dry buffer and placed on the membrane. Any air was removed by rolling the cylinder. The transfer was allowed to continue for 50 minutes at 12 V. The membrane was removed and blocked with 4% milk solution in TBST (20 mM Tris pH 7.6, 0.15 M NaCl and 0.2 % v/v Tween-20) buffer. The membrane was then transferred to the primary antibody solution (antibody for the protein of interest in 4% milk solution in TBST) for 1 h. The membrane was rinsed with TBST buffer twice by soaking the membrane in the buffer for 10 minutes. The membrane was then transferred to the secondary antibody solution (secondary antibody in 4% milk solution in TBST) for l h and again rinsed with TBST as before. Solutions 1 and 2 were mixed in equal ratio and the membrane was soaked in this solution for 1 minute. The membrane was then placed on a filter paper, wrapped in Saran wrap and the bands were visualized on a photographic paper. 101 Primary antibody solution was prepared by a 1000 fold dilution of the antibody stock in 4% milk solution in TBST. The secondary antibody solution was prepared by a 3000 fold dilution of the stock antibody. Rabbit and mouse antibodies were used for SNAPc subunits and GST, respectively, as primary antibodies. Horse a-rabbit and OL-mouse were used as secondary antibodies. Antibodies a-SNAP190 #234, OL-SNAPSO CSH303, OL-SNAP43 C848, 0t- SNAP19 CS543, a-mouse and oz-rabbit were a generous gift from Dr. R. W. Henry. 3.8. DNA purification DNA was separated on 1% agarose gel, stained with ethydium bromide and observed under UV light. Bands corresponding to the desired DNA product were excised with a spatula and the DNA was extracted from the gel slice with QIAGEN gel extraction kit, following the manufacturer’s instructions. 3.9. TEM analysis of mSNAPc crystals 3.9.1. Cross-linking of mSNAPc crystals A 1:1 mixture of paraformaldehyde and glutaraldehyde was added directly to the hanging drop containing protein crystals to a final concentration of 1% aldehyde and 1% glutaraldehyde. The drop was returned on t0p of the well for 1 h. Cross-linked crystals were then washed with water by addition of water and removal of the solvent with a pipette, to remove buffer, salt and precipitant components of the crystallization reagent. 102 3.9.2. Preparation of the TEM grid Glass strips were thoroughly washed with a detergent and wiped dry. A few drops of 25% Formvar solution were added to the glass strip to form a thin film, without creating thick areas. The prepared film was allowed to dry completely (2-5 min). A razor blade was used to mark and cut the edges on the film. The glass strip with film was then carefully placed in a bowl with water and if successful the film was lifted from the glass and floated on the surface. 300 mesh copper grids were then carefully placed on the floating film and a paper was placed on top to sandwich the grids between the film and the paper. The prepared grids on the paper were removed from the water and dried. 3.9.3. TEM analysis Cross-linked crystals were crushed with small crystallization tools to create a suspension with a large number of small crystals. A drop (1-5 ul) of the crushed and cross-linked crystals was placed on a Formvar grid (300 mesh copper) and excess liquid was removed with a filter paper. The grid was then placed in the sample holder and inserted in the instrument. Samples were analyzed with JEOL 100CX transmission electron microscope. Voltage on the electron gun was 100,000 V. Condenser aperture #2 and objective aperture #2 was used. In diffraction mode diffraction aperture #2 or #3 was used. The grid was viewed in low magnification mode (magnification 5,000) to identify an area on the grid with desired properties such as intact membrane, absence of precipitate, absence of dark areas, and presence of crystalline material. The crystals were 103 then observed at increasingly higher magnification and focus was adjusted at every magnification increase until the desired images were obtained. Images were recorded with a digital camera installed in the electron microscope and controlled with a MegaViewII digital camera controller. Images were viewed and manipulated with the AnalySIS software package. 3.10. Quantification of zinc in SNAP50 3.10.1. Determination of protein concentration 3.10.1.1. UV absorbance in 6 M urea The approximate protein concentration was first determined with Bradford assay as per manufacturer’s directions. The sample was then mixed with urea in such manner that the measured absorbance was in the range of 0.1 to 0.6 where the assay is linear. In the protein concentration range of approximately 2 to 4 mg/mL (for mSNAPc) the expected absorbance was in the linear range if 50 uL of the sample was mixed in 450 uL of 6.5 M urea. 3.10.1.2. Standardized Bradford assay Working Bradford solution (20%) was prepared fresh before use, by diluting the Bradford Assay stock in water. To measure the concentration, the protein (1-20 uL) is mixed in 1 mL of the working Bradford solution and absorbance at 600 nm was measured. A linear plot was obtained with the BSA standard solution (Biorad) in the range of concentrations from 0.05 to 0.8 mg/mL. The protein concentration was then determined from the plot. It is important that the absorbance of the sample in the 104 Bradford solution is within 0.1 and 0.6 to remain in the linear range. The plot points were prepared in duplicate and the protein was measured in triplicate. 3.10.2. Flame atomic absorption The concentration of the protein sample was determined by UV absorbance measurement or by the standardized Bradford assay. The sample was transferred to a crucible, dried and ashed at 260°C in an oven and concentrated nitric acid was added to the brown powder and heated until all liquids evaporated. The process was repeated until only white powder remained in the crucible. Solution was reconstituted by the addition of a known volume of 5% nitric acid. To prepare the zinc standards, the mass of zinc metal (Spectrum, 99.9 %) was measured carefully and the zinc was dissolved in concentrated nitric acid and brought to the desired volume. Standards in the concentration range from 10 to 1000 ppb were made by serial dilution of the stock zinc solution with 5% nitric acid. The samples and standards were analyzed on the flame atomic absorption instrument (Varian SpectrAA-200) equipped with a zinc hollow cathode lamp operating at 213.9 nm. The flow rate was 1 mL/min. 3.10.3. ICP-MS The protein concentration was determined with the standardized Bradford Assay using BSA as a standard. The sample solutions (protein, buffer) were transferred to a Teflon vial and brought to dryness on a hot plate. Concentrated nitric acid was added and the sample was placed on the hot plate and allowed to hydrolyze for 30 min. After cooling the sample was diluted with water to bring the acid concentration to 2% and 2% 105 nitric acid was added to bring the solution to the desired volume. Standards containing nickel and zinc in the concentration range 0-1000 ppb were prepared in 2% nitric acid from a 1000 ppm stock solution (Trace grade). The samples and standards were mixed with indium and bismuth solution as internal standards. The samples were run on an ICP- MS instrument (Micromass) with flow rate 0.5 ml/min. 66Zn, 68Zn and 60Ni isotopes were measured and quantified. The responses of zinc and nickel were corrected according to indium and bismuth response. 106 4. References 1. Paule M. R., White R. J., Survey and summary: transcription by RNA polymerases I and III, Nucleic Acids Research, 28, 1283 (2000). 2. Herr A. J ., Jensen M. B., Dalmay T., Baulcombel D. C., RNA Polymerase IV Directs Silencing of Endogenous DNA, Science, 308, 118 (2005). 3. Lee T. 1., Young R. A., Regulation of gene expression by TBP-associated proteins, Genes Dev., 12, 1398 (1998). 4. Lobo S., Hernandez N., in Transcription, Mechanisms and Regulation (Conway R. C., and Conway J. W., eds) pp. 127-159, Raven Press, Ltd., New York (1994). 5. Bai L., Wang 2., Yoon J .-B., R. G. Roeder., Cloning and characterization of the beta "1; subunit of human proximal sequence element-binding transcription factor and its involvement in transcription of small nuclear RNA genes by RNA polymerases H and 111, Mol. Cell. Biol., 16, 5419 (1996). 6. Henry R. W., Ma B., Sadowski C. L., Kobayashi R., Hernandez N., Cloning and characterization of SNAP50, a subunit of the snRNA-activating protein complex SNAPc, EMBO J., 15, 7129 (1996). 7. Wang Y., Stumph W. E., RNA Polymerase II/III Transcription Specificity Determined by TATA Box Orientation, Mol. Cell. Biol, 18, 1570 (1998). 8. Schramm L., Hernandez N., Different human TFIIIB activities direct RNA polymerase 1H transcription from TATA-containing and TATA-less promoters, Genes Dev., 16, 2593 (2002). 9. Challice J. M., Segall J ., Transcription of the 5 S rRNA gene of Saccharomyces cerevisiae requires a promoter element at +1 and a l4-base pair internal control region, J. Biol. Chem, 264, 20060 (1989). 10. Ciliberto G., Raugei G., Costanzo F., Dente L., Cortese R., Common and interchangeable elements in the promoters of genes transcribed by RNA polymerase 111, Cell, 32, 725 (1983). 11. Braun B. R., Kassavetis G. A., Geiduschek E. P., Bending of the Saccharomyces cerevisiae SS rRNA gene in transcription factor complexes, J. Biol. Chem, 267, 22562 (1992). 107 12. l3. 14. 15. l6. l7. 18. 19. 20. 21. 22. 23. Krol A., Carbon P., Ebel J. P., Appel B., Xenopus tropicalis U6 snRNA genes transcribed by Pol III contain the upstream promoter elements used by Pol II dependent U snRNA genes, Nucleic Acids Res., 15, 2463 (1987). Das G., Henning D., Wright D., Reddy R., Upstream regulatory elements are necessary and sufficient for transcription of a U6 RNA gene by RNA polymerase III, EMBO J., 17, 503 (1988). Kunkel G. R., Pederson T., Upstream elements required for efficient transcription of a human U6 RNA gene resemble those of U1 and U2 genes even though a different polymerase is used, Genes Dev., 2, 196 (1988). M Murphy 5., Tripodi M., Melli M., A sequence upstream from the coding region is required for the transcription of the 78K RNA genes, Nucleic Acids Res., 14, 9243 (1986). Baer M., Nilsen T. W., Costigan C., Altman S., Structure and transcription of a a human gene for H1 RNA, the RNA component of human RNase P, Nucleic Acids Res., 18, 97 (1989). Topper J. N., Clayton D. A., Characterization of human MRP/T h RNA and its nuclear gene: full length MRP/T h RNA is an active endoribonuclease when assembled as an RNP, Nucleic Acids Res., 18, 793 (1990). Lobo S., Hernandez N., A 7 bp mutation converts a human RNA polymerase II snRNA promoter into an RNA polymerase III promoter, Cell, 58, 55 (1989). Echenlauer J. B., Kaiser M. W., Gerlach V. L., Brow D. A., Architecture of a yeast U6 RNA gene promoter, Mol. Cell Biol, 13, 3015 (1993). Orphanides G., Lagrange T., Reinberg D., The general transcription factors of RNA polymerase II, Genes Dev., 10, 2657 (1996). Kuhlman T. C., Cho H, Reinberg D., Hernandez N., The General Transcription Factors 11A, 11B, IIF, and [IE Are Required for RNA Polymerase II Transcription from the Human U1 Small Nuclear RNA Promoter, Mol. Cell. Biol, 19, 2130 (1999). Hampsey M., Molecular Genetics of the RNA Polymerase II General Transcriptional Machinery, Microbiol. Mol. Biol. Rev., 62, 465 (1998). Smale S. T., Kadonaga J. T., The RNA polymerase 11 core promoter, Annu. Rev. Biochem, 72, 449 (2003). 108 25. 26. 27. 28. 29. 30. 31. 32. 33 34. . Wu S. —Y., Kershnar E., Chiang C. —M., TAFII-independent activation mediated by human TBP in the presence of the positive cofactor PC4, EMBO J., 17, 4478 (1998). Nikitina T. V., Tishchenko L. 1., RNA polymerase HI transcription machinery: Structure and transcription regulation, Mol. Biol, 39, 161 (2004). Ghavidel A., Schultz M. C., Casein kinase II regulation of yeast TFIIIB is mediated by the TATA-binding protein, Genes Dev., 11, 2780 (1997). Yoon J. B., Roeder R. G., Cloning of two proximal sequence element-binding transcription factor subunits (gamma and delta) that are required for transcription of small nuclear RNA genes by RNA polymerases II and III and interact with the TATA-binding protein, Mol. Cell. Biol, 16, l ( 1996). Hernandez N., Small Nuclear RNA Genes: a Model System to Study Fundamental Mechanisms of Transcription, J. Biol. Chem, 276, 26733 (2001). Henry R. W., Sadowski C. L., Kobayashi R., Hernandez N., A TBP-TAF complex required for transcription of human snRN A genes by RNA polymerase II and III, Nature, 374, 653 (1995). Wong M. W., Henry R. W., Ma B., Kobayashi R., Klages N., Matthias P., Strubin M., Hernandez N ., The large subunit of basal transcription factor SNAPc is a Myb domain protein that interacts with Oct-1, Mol. Cell. Biol, 18, 368 (1998). Gu L., Esselman J ., Henry R. W., Cooperation between Small Nuclear RN A- activating Protein Complex (SNAPC) and TATA-box-binding Protein Antagonizes Protein Kinase CK2 Inhibition of DNA Binding by SNAPC, J. Biol. Chem, 280, 27697 (2005). Mittal V., Ma 8., Hernandez N., SNAPc: a core promoter factor with a built-in DNA- binding damper that is deactivated by the Oct-l POU domain, Genes & Dev., 13, 1807 (1999). . Ford E., Strubin M., Hernandez N., The Oct—1 POU domain activates snRN A gene transcription by contacting a region in the SNAPc largest subunit that bears sequence similarities to the Oct-1 coactivator OBF-l, Genes & Dev., 12, 3528 (1998). Hovde S., Hinkley C. S., Strong K., Brooks A., Gu L., Henry R. W., Geiger J., Activator recruitment by the general transcription machinery: X-ray structural analysis of the Oct-l POU domain/human U1 octamer/SNAP190 peptide ternary complex, Genes & Dev., 16, 2772, (2002). 109 35. 36. 37. 38. 39. 40. 41. 42. 43. 44. 45. 46. 47. Sytina E. V., Pankratova E. V., Transcription Factor Oct-1: Plasticity and Multiplicity of Functions, Mol. Biol, 37, 637 (2003). Henry R. W., Mittal V., Ma B., Kobayashi R., Hernandez N., SNAP19 mediates the assembly of a functional core promoter complex (SNAPc) shared by RNA polymerases II and III, Genes & Dev., 12, 2664 (1998). Sadowski C. L., Henry R. W., Kobayashi R., Hernandez N., The SN AP45 subunit of the small nuclear RNA (snRNA) activating protein complex is required for RNA polymerase H and III snRNA gene transcription and interacts with the TATA box binding protein, Proc. Natl. Acad. Sci., 93, 4289 (1996). Kunkel G. R., Hixsom J. D, The distal elements, OCT and SPH, stimulate the formation of preinitiation complexes on a human U6 snRN A gene promoter in vitro, Nucl. Acids Res., 26, 1536 ( 1998). Zhao X., Pendergrast P. S., Hernandez N., A positioned nucleosome on the human U6 promoter allows recruitment of SNAPc by the Oct-1 POU domain, Mol. Cell, 7, 539 (2001). Spector D. L., The dynamics of chromosome organization and gene regulation, Annu. Rev. Biochem, 72, 573 (2003). Hinkley C. S., Hirsch H. A. Gu L., LaMere B., Henry R. W., The Small Nuclear RNA-activating Protein 190 Myb DNA Binding Domain Stimulates TATA Box- binding Protein-TATA Box Recognition, J. Biol. Chem, 278, 18649 (2003). Mittal V., Hernandez N., Role for the amino-terminal region of human TBP in U6 snRN A transcription, Science, 275, 1136 (1997). Das D., Scovell W. M., The Binding Interaction of HMG-l with the TATA-binding Protein/TATA Complex, J. Biol. Chem, 276, 32597 (2001). Ford E., Hernandez N., Characterization of a Trimeric Complex Containing Oct-1, SNAPc, and DNA, J. Biol. Chem, 272, 16048 (1997). Jin S., Levine A. J., The p53 functional circuit, J. Cell Sci., 114, 4139 (2001). K0 L. J ., Prives C., p53: puzzle and paradigm, Genes Dev., 10, 1054 ( 1996). Levine A. J ., p53, the Cellular Gatekeeper for Growth and Division, Cell, 88, 323 (1997). 110 48. 49. 50. 51. (1995). 52. 53. 54. 55. 56. 57. Stein T., Crighton D., Boyle J. M., Varley J. M., White R. J., RNA polymerase III transcription can be derepressed by oncogenes or mutations that compromise p53 function in tumours and Li-Fraumeni syndrome, Oncogene, 21, 2961 (2002). Gridasova A. A., Henry R. W., The p53 Tumor Suppressor Protein Represses Human snRN A Gene Transcription by RNA Polymerases H and III Independently of Sequence-Specific DNA Binding, Mol. Cell Biol, 25, 3247 (2005). Hirsch H. A., Gu L., Henry R. W., The Retinoblastoma Tumor Suppressor Protein Targets Distinct General Transcription Factors To Regulate RNA Polymerase HI Gene Expression, Mol. Cell Biol, 20, 9182 (2000). Weinberg R. A., The retinoblastoma protein and cell cycle control, Cell, 81, 323 Horowitz J. M., Park S. H., Bogenmann E., Cheng J .C., Yandell D.W., Kaye D. J., Minna J .D., Dryja T. P., Weinberg R. A., Frequent Inactivation of the Retinoblastoma Anti-Oncogene is Restricted to a Subset of Human Tumor Cells, Proc. Natl. Acad. USA, 87, 2775 (1990). Horowitz J. M., Yandell D.W., Park S. H.,Canning S., Whyte P., Buchkovich K., Harlow E., Weinberg R. A.,Dryja T. P., Point mutational inactivation of the retinoblastoma antioncogene, Science, 243, 937 (1989). Novitch B. G., Mulligan G. J ., Jacks T., Lassar A. B., Skeletal muscle cells lacking the retinoblastoma protein display defects in muscle gene expression and accumulate in S and G2 phases of the cell cycle, J. Cell Biol, 135, 441 (1996). Onadim Z., Hogg A., Baird P. N., Cowell J. K., Oncogenic Point Mutations in Exon 20 of the RBI Gene in Families Showing Incomplete Penetrance and Mild Expression of the Retinoblastoma Phenotype, Proc. Natl. Acad. USA, 89, 6177 (1992). Hirsch H. A., Gu L., Henry R. W., The Retinoblastoma Tumor Suppressor Protein Targets Distinct General Transcription Factors To Regulate RNA Polymerase III Gene Expression, Mol. Cell Biol, 20, 9182 (2000). Hovde S. L., personal communication. 58. Hanzlowsky A., Jelencic B., J awdekar G. W., Hinkley C. S., Geiger J. H., and Henry R. W., Co-expression of multiple subunits enables recombinant SNAPC assembly and function for transcription by human RNA polymerases II and III, Protein Expr. Purif., 48, 215 (2006). 111 59. Jawdekar G. W., Hanzlowsky A., Hovde S. L., Jelencic B., Feig M., Geiger J. H., Henry R.W., The unorthodox SNAP50 zinc finger domain contributes to co- operative promoter recognition by human SNAPc, J. Biol. Chem, 281, 31050 (2006). 112 U utijjjjjjjjttrill