.. .. 5...... . . . . 5:» . . . m u... . . "3...... frat, 4: . a 2.5.3... bazaar? . A . . . . ‘ . .2... éi _ _ r h. .. . V . A. .. . ‘ A .. r. , . . , 90%.: 2:. ‘ 2.... g5taiial w .. 71x n 5.9%.). H15... 9 t 1 1.. 412.. . n... ‘53... A. .1 n17.39 21¢: 3 \L 722,19 :1 3951.53) ‘1‘ 1....5 ,\ .Jxvuv ' v .- 1 3:... :9... 1. l...1..xl..:.x in : x . :1... u . .1055}... yt‘ix‘: a... x ’35::: a 0.9:! 3:14... a . 4.. .iu Tl-li’i‘I'S QCDCH LIBRARY E‘fiichigan S'éafig University 4-" This is to certify that the dissertation entitled VACUOLAR TRANSPORT 0F ctVSS-BEARING PROTEINS: A GENETIC APPROACH presented by Sridhar Venkataraman has been accepted towards fulfillment of the requirements for Ph.D. degree in Biochemistry and Molecular Biology Major professor 1L/11459l Date MS U i: an Affirmative Action/Equal Opportunity Institution 0-12771 PLACE IN RETURN BOX to remove this checkout from your record. To AVOID FINES return on or before date due. MAY BE RECALLED with earlier due date if requested. DATE DUE DATE DUE DATE DUE 6’01 c-JCIFlC/DateDue.p65-p.15 VACUOLAR TRANSPORT OF ctVSS-BEARING PROTEINS: A GENETIC APPROACH By Sridhar Venkataraman A DISSERTATION Submitted to Michigan State University in partial fulfillment of the requirements for the degree of DOCTOR OF PHILOSOPHY Department of Biochemistry and Molecular Biology 2001 ABSTRACT VACUOLAR TRANSPORT OF ctVSS-BEARING PROTEINS: A GENETIC APPROACH By Sridhar Venkataraman Plants respond to stress and pathogen attack by synthesizing and storing defense- related proteins in the vacuole. Many of these proteins are targeted to the vacuole by virtue of sorting information residing in a C-terminal vacuolar sorting signal (ctVSS). While many studies have been conducted to identify the nature of these sorting signals, very little is known about the factors they interact with or the factors involved in the transport mechanism that delivers these proteins to the vacuole. A genetic approach has been used in Arabidopsis thaliana (I...) Heynh, to identify loci involved in the ctVSS- bearing protein transport pathway. Mutants were screened from a collection of EMS- treated Arabidopsis thaliana ecotype Columbia seeds expressing rat-preputial {5-D glucuronidase with a ctVSS from tobacco chitinase and barley lectin. Mutants cvsI and cst (for ctVSS vacuolar sorting) were identified and characterized by imrnunoelectron microscopy. In these mutants, barley lectin was found to be partially mislocalized in the apOplast along the cell wall and in intercellular spaces of root sections. Such results were not observed in leaf sections, indicating the identification of root specific mutants. The mutant locus of cvsl was mapped to a contig of BACs between AtSOl9l and DFR on Chromosome V. Further analyses will have to be carried out to clone the cvsI mutant locus. However, the research presented describes identification of a factor potentially involved in transporting ctVSS bearing proteins to the vacuole. To my parents Vasantha and R. Venkataraman and my wife Meera iii ACKNOWLEDGMENTS My graduate studies have involved many colleagues and faculty members who have been part of my development as a student. First and foremost, I would like to thank my advisor Natasha for having supported me all these years and helped me conduct my research. She has been instrumental in teaching me how to be critical in reviewing scientific work. I would also like to thank Dr Kenneth Keegstra for helping me join the PRL. I owe a lot to him. My committee members, Dr Kroos, Dr Preiss and Dr Smith, have also helped me through these years as a PhD student. A number of colleagues have taught me a lot in these years: Ahmed Faik, John Froehlich, Brett Brzobohaty, Philipp Kapranov, Krzysztof Szczyglowski and Sergei Mekhedov. Coming to the Raikhel Lab: Past - Jim “Jim is always right” Dombrowski, Maor “Bapeti” Bar-Peled, Sharif “Jim 11” Ahmed; and present — Diane Bassham, Enrique Rojo, Valya Kovaleva, Rodrigo Sarria-Millan, Haiyan Zheng, Ivan Delgado, Emily Avila-Teegarden, Curtis Wilkerson and Sybil Myers. I am especially grateful to Jim Dombrowski to have initiated my project. These past few months would have been impossible without the help of Tony Sanderfoot (the Universal Thesis Master) and John Scott-Craig (Seminario Oratorio Major). Finally my family members: my wife Meera, my parents Vasantha and R. Venkataraman, Latha, Zoltan and now Milan. I must not forget some of my friends who have been around when I needed them: Mundath Narayanan Menon, Penmetsa Ramachandra Varma, Hemant Varma and Satyashayee Misra. iv TAB list list Abbi. [RUDE TABLE OF CONTENTS List of tables ....................................................................................... viii List of figures ......................................................................................... x Abbreviations ....................................................................................... xii CHAPTER 1 Introduction: Protein targeting to the vacuole: the ctVSS story ................................. 1 Plant Vacuole ................................................................................. 2 Storage proteins and the PSV ............................................................... 2 Protein targeting to the vacuole in plants .................................................. 4 The endoplasmic reticulum ................................................................ 6 Golgi and the plant secretory pathway .................................................... 7 Sorting determinants ........................................................................ 8 ssVSS ............................................................................... 9 ctVSS .............................................................................. l 1 Sorting machinery ......................................................................... 12 Statement of purpose ............................................................................... 14 CHAPTER 2 Development of a reporter line for use in a screen to identify mutants impaired in ctVSS- sorting in Arabidopsis ............................................................................... 15 Introduction ................................................................................. 16 Results and Discussion ................................................................... 18 Conclusion ................................................................................. 32 Materials and Methods .................................................................... 33 C HA. . EMS CHAT Champ CHAP Map; Um I 9M dM‘tl. CHAPTER 3 EMS mutagenesis and characterization of ctVSS-missorting plants ........................ 39 Introduction ................................................................................. 40 Results and Discussion .................................................................... 41 Conclusion .................................................................................. 54 Materials and Methods ..................................................................... 54 CHAPTER 4 Characterization of mutants and selection of cvsI ............................................. 56 Introduction ................................................................................. 57 Results and Discussion ................................................................... 57 Conclusion ................................................................................. 63 Materials and Methods ................................................................... 64 CHAPTER 5 Mapping cvsI to a contig of BACs ................................................................ 67 Introduction .................................................................................. 68 Results and Discussion .................................................................... 68 Conclusion ................................................................................. 80 Materials and Methods ................................................................... 82 CHAPTER 6 Identification of ORFs and complementation of cvsl .......................................... 86 Introduction ................................................................................. 87 Results and Discussion ................................................................... 88 Conclusion ................................................................................. 93 vi (HA; \lappi APPEE Bndi we T5335; Edd Materials and Methods .................................................................... 94 CHAPTER 7 Conclusions and Future .................................................................... 95 Conclusions ......................................................................... 96 The Future ........................................................................ 100 REFERENCES .................................................................................... 104 APPENDIX A Mapping and Linkage analysis made simple: A web-based approach ..................... 115 Abstract .................................................................................... 1 16' Introduction ............................................................................... 1 17 Results ...................................................................................... 117 References ................................................................................. l 22 HTML code ............................................................................... 123 APPENDIX B Bradford analysis made simple: A web-based approach .................................... 130 Introduction ............................................................................... 131 Results and Conclusion .................................................................. 131 References ................................................................................. l3 1 HTML Code ............................................................................... 133 APPENDIX C ...................................................................................... 137 Tables: Genes present on BACs MBB18, MKDlO, K15E6, MXF12, K3K3 and MUL8s and the proteins encoded vii Alec. LIST OF TABLES Table 1.1 List of representative vacuolar sorting signals ....................................... 9 Table 2.1 B-glucuronidase activity in protein extract from primary transforrnant .........23 Table 2.2 Frequency of T2 progeny surviving kanamycin selection ............................ 24 Table 2.3 Reporter segregation in 20 T3 progeny pools of line CRBT3-6 .................. 26 Table 2.4 Reporter segregation in 20 T3 progeny pools of line CRBT3-9 .................. 27 Table 2.5 Evaluation of reporter levels in tertiary transformants ............................. 28 Table 2.6 Kanamycin resistance in quaternary transformants ................................ 29 Table 2.7 Reporter level estimation in quaternary transformants ............................ 30 Table 3.1 Pool-wise summary of number of mutants identified .............................. 4-4 Table 3.2 List of mutants identified and the pools they were identified from .............. 45 Table 3.3 Evaluation of M3 lines by Rat-GUS secretion assay and EM ..................... 46 Table 5.1 lO-Sample mapping analysis .......................................................... 70 Table 5.2 46-Sample mapping analysis .......................................................... 72 Table 5.3 Fine map between AthPHYC and LFY3 ............................................ 73 Table 5.4 Markers used to map cvsI within between ATSOl91 and DF R ................. 79 Table 5.5 Fine mapping analysis of cvsI between AtSOl9l and DFR ..................... 81 Table 6.1 List of BACs that map close to locus cvsl .......................................... 88 Table 6.2 List of genes neighboring cvsI ....................................................... 90 Table C.l Genes present on the BAC MBB18 and the proteins encoded ................. 138 Table C.2 Genes present on the BAC MKDlO and the proteins encoded ................. 139 Table C.3 Genes present on the BAC K15E6 and the proteins encoded .................. 140 Table C.4 Genes present on the BAC MXF12 and the proteins encoded .................. 141 viii Tab; lab}. Table C.5 Genes present on the BAC K3K3 and the proteins encoded ................... 142 Table C.6 Genes present on the BAC MUL8 and the proteins encoded .................. 143 ix LIST OF FIGURES Figure 1.1 Model of the plant secretory pathway .............................................. 5 Figure 1.2 Vacuolar soluble proteins and their targeting signals ............................ 8 Figure 2.1 Verification of Rat-GUS activity in primary transformants ..................... 21 Figure 2.2 B-glucuronidase activity in protein extract from primary transformant ........ 22 Figure 2.3 Western blot to verify the expression of the barley lectin (BL) ................. 23 Figure 2.4 Verification of BL in CRBT4 seedlings by western analysis .................... 30 Figure 2.5 Western analysis of protein extracts from different seedling tissues ........... 33 Figure 2.6 Diagram of plasmid pMOGCRB ................................................... 34 Figure 2.7 Diagram of plasmid pMOGRGD ................................................... 35 Figure 3.1 Mutant Screen on vertical plates .................................................... 42 Figure 3.2 Immunolocalization of BL in section of root of mutant E1 1C .................. 48 Figure 3.3 Immunolocalization of BL in section of root of mutant El 1C .................. 49 Figure 3.4 Immunolocalization of BL in section of root of mutant El 1C .................. 50 Figure 3.5 Immunolocalization of BL in section of leaf of mutant El 1C .................. 51 Figure 3.6 Non-immune control: section of root of mutant E11C ........................... 52 Figure 4.1 Northern analysis of root RNA from mature plants .............................. 58 Figure 4.2 Western analysis of barley lectin (BL) in mutant and control tissues .......... 60 Figure 4.3 Examination of floral phenotype of cvsI ........................................... 62 Figure 5.] Alignment of BAC contig between AtSOl91 and DFR on chromosome V ...74 Figure 5.2 Alignment of BAC MBB18 (Col, 28295 — 28460) with Clone co954 (Ler) ..75 Figure 5.3 Sequence alignment of a section of BAC MKM21 (C01) and cl 103 (Ler) ...76 Figure 5.4 Sequence alignment of a section of BAC K15E6 (C01) and c951 (Ler) ....... 77 Figure 5.5 Alignment of BAC MUL8 (Col, 48930 — 49068) with Clone c1765 (Ler) ...78 Figure 5.6 BAC contig spanning cvs] locus .................................................... 82 Figure 7.1 Model of the ctVSS sorting pathway ................................................ 99 xi ABE BAT BCL BL | BPR 351i CAP CCV cDX CM cm COP! COPl Cohi CTAE dVS§ CV5 CB7 03 DEPC DMSt DXA dXTB E flu": EDTA EMS ER FY GTP Ha 1C3 Ler LY it it 11;. MES MOPS mm Mt ABBREVIATIONS AP-l BAC BCIP BL BP80 BSA CAPS CCV cDNA cM Col COPI COPII Cotyl CTAB ctVSS CVS CW Da DEPC DMSO DNA dNTPs E. coli EDTA EMS ER F1 F2 F V GTP Het MES MOPS mRNA MU Clathrin-associated adaptor complex 1 Bacterial artificial chromosome 5-Bromo-4-chloro-3-indolyl phosphate p-Toluidine Salt Barley lectin Binding Protein 80 Bovine serum albumin (fraction V) Cleaved amplified polymorphic sequences Clathrin-coated vesicles copy deoxyribo nucleic acid centi Morgan Arabidopsis thaliana ecotype Columbia Coat-protein (I) coated vesicles Coat-protein (II) coated vesicles Cotyledon Cetyltn'methylammonium bromide C-terminal vacuolar sorting signal ctVSS vacuolar sorting (gene or mutant) Cell wall Dalton Diethyl pyrocarbonate Dirnethyl sulfoxide Deoxyribo nucleic acid Deoxynucleotide triphosphate Escherichia coli Ethylenediaminetetraacetic acid disodium salt Ethyl methane sulfonate Endoplasmic reticulum First filial generation Second filial generation (usually self-fertilized progeny of F 1) Fused vacuole Guanosine 5'-triphosphate Sodium salt Heterozygous ltercellular spaces Arabidopsis thaliana ecotype landsberg erecta Lytic vacuole Primary mutagenized generation Secondary mutagenized generation (usually self-fertilized progeny Of Ml) Tertiary mutagenized generation (usually self-fertilized progeny 0f M2) 2-(N-Morpholino)ethanesulfonic acid 3-(N-Morpholino)propanesulfonic acid Messenger ribonucleic acid Methyl umbelliferone xii a. ORF PAC PBS PCR PHA PMSF PSV PsVSS PT PVC Rat-GUS Rat-GUS-ctVSS Rat-GUS-Delta RF RFLP RNA RR Buffer SDS SDS-PAGE SDV SNARE SP SSLP SsVSS Open reading frame Precursor accumulating vesicles Phosphate-buffered saline Polymerase chain reaction Phytohemagglutinin Phenylmethylsulfonyl fluoride Protein storage vacuole Physical structure vacuolar sorting signal Potato inhibitor terminator Prevacuolar compartment Rat preputial B-glucuronidase Rat preputial B-glucuronidase fused to tobacco basic chitinase ctVSS Secreted Rat preputial B-glucuronidase Recombination frequency Restriction fragment length polymorphism Ribonucleic acid RNA resuspension buffer Sodium dodecyl sulfate Sodium dodecyl sulfate poly acrylamide gel electrophoresis Smooth dense vesicles soluble N-ethyl maleirnide sensitive factor adaptor protein receptor Signal peptide Simple sequence length polymorphism Sequence specific vacuolar sorting signal Secretory vesicle Primary transformants Secondary transformants Tertiary transformants Quaternary transformants Fifth generation transformants Tris Borate EDTA Tobacco basic chitinase Transfer DNA Tris EDTA Trans-Golgi network Tonoplast intrinsic protein Vacuole Volume/volume Vacuolar protein aggregates Vacuolar protein sorting weight/volume Wheat germ agglutinin Wild type 5-Bromo-4-chloro-3-indolyl beta-D-glucuronide Sodium salt xiii l'At YAC Yeast artificial chromosome xiv Chapter 1 Protein targeting to the plant vacuole: the ctVSS story Plant {BIKE com;- encltw bark; 1W3 . exist ; 199$. Vaflltt' Cult-m 19W. PSV l} TIP_ ,, ”la-hm mdjCal: INTRODUCTION Plant vacuoles Plant vacuoles were first observed to be large spaces devoid of cytoplasmic matter and hence termed vacuole. This large organelle occupies 30 to 90 % of the plant cell depending on the type of cell under observation. Functionally the vacuole is a large repository of inorganic ions, organic acids, sugars, secondary metabolites, plant defense compounds, stress-related solutes, detoxified substances, enzymes and storage proteins enclosed by the vacuole membrane (tonoplast) (Marty, 1999). Recent evidence from barley root cells, barley aleurone cells and tobacco cell (Paris et a1, 1996, Swanson et a1, 1998 and Di Sansebastiano et al, 1998) indicates that two or more types of vacuoles may exist in a plant cell, each characterized by different pH optima (Di Sansebastiano et a1, 1998) and different aquaporins in the respective tonoplasts (J auh et a1, 1998). The low pH vacuole is termed the lytic vacuole (LV) due to the presence of low pH optima hydrolytic enzymes, whereas the near-neutral vacuole, the protein storage vacuole (PSV) (Marty, 1999). The LV is analogous to the yeast vacuole and the mammalian lysosome while the PSV is unique to plants (Marty, 1999). The lytic vacuole has been reviewed in detail by Marty (1999) and will not be discussed. Storage proteins and the PSV The protein storage vacuole has been found to have a tonoplast decorated with Ot- TIP, an aquaporin found in seeds and root tips. It differs from the lytic vacuole in its inability to be stained with neutral red, a dye that accumulates in acidic compartments, indicating a neutral pH (Di Sansebastiano et a1, 1998). This compartment accumulates O'J- ’YJ ‘11 ‘W‘y- AJ.A‘ . storage proteins such as globulins and prolamins, proteins characterized by their relative solubility in aqueous or ethanolic buffers. Globulins are found in all dicots and many monocots. Globulins are of the vicilin type (78 globulins) or the legumin type (11S globulins). Common 78 globulins include vicillin (from Pisum sativum), phaseolin (from Phaseon vulgaris) and conglycinin (from Glycine max). The legumin category of proteins includes glycinin from Glycine max. Prolamins are predominantly found in the endosperm of monocots such as wheat and rice. Apart from storage proteins, the PSV also contains many ancillary storage proteins such as barley lectin and bean phytohemagglutinin as well as hydrolase inhibitors such as tat-amylase inhibitors (Marty, 1999). These proteins may serve a defense role in plants against herbivory, insect and pathogen attack (reviewed by Marty, 1999; Herman and Larkins, 1999). The ability of plants to synthesize and store proteins has made them very important as a primary source of protein for animal species. In modern times, these aspects of plants have been utilized in developing engineered proteins in plants to improve the nutritive value of plants and their products (Lawrence et a1, 1994). These aspects have also been exploited in developing plants with therapeutic properties. The Escherichia coli enterotoxin B protein has been expressed in potatoes as an oral vaccine against E. coli-induced diarrhea (Mason et a1, 1998). Similarly, the hepatitis B surface antigen has been expressed in potatoes as an oral vaccine against hepatitis B (Richter et al, 2000) and the Guy 13 monoclonal antibody against Streptococcus mutans (agent of dental caries) was expressed in tobacco (Cabanes-Macheteau et al, 1999). While most of these developments are still in their infancy, the prospects of plant based therapies for common ailments are exciting. Prot 1530; punt.” Vesit . cars:- test} (301;; the pr litl’ac ldfipett Golgi , l l “OCH-Ii Protein targeting to the vacuole in plants Plant proteins destined for the vacuole are synthesized as precursors by ribosomes associated with the endoplasmic reticulum (ER), and are co-translationally inserted into the ER lumen (see Figure 1.1). In the ER lumen, they undergo several modifications such as the proteolytic processing of the signal peptide, N-linked glycosylation, disulfide bond formation, and protein folding. These steps are regulated by a “quality control” step following which these proteins are thought to be packaged into COPII vesicles bound for the Golgi complex (see Pirnpl et a1, 2000). Certain proteins such as cereal prolamins and pumpkin ZS albumin are packaged into protein bodies or precursor accumulating (PAC) vesicles bound for the protein storage vacuole. Afier the COPII vesicles deliver their cargo to the cis-Golgi, proteins further undergo maturation processes such as complex mannosylation of the glycan as they move through the medial and trans-Golgi via a COPI vesicle mediated manner (see Pirnpl et al, 2000). Eventually proteins arrive at the trans- Golgi network (TGN) where they are sorted according to localization signals residing on the protein. Proteins with vacuolar targeting signals are sent to the vacuole via the prevacuolar compartment (PVC) in clathrin-coated vesicles or smooth dense vesicles (depending on the targeting signal). Proteins, which lack a vacuolar targeting signal or a Golgi retention signal, are delivered to the extracellular space via the default pathway of exocytosis. O x O ER SV COPII COPI \ 2' \ TGN \ 1" ES". . /' \ Golgi \ Complex SDV @ (D / PSV PAC [3 III Figure 1.1 Model of the plant secretory pathway Soluble proteins are synthesized by ribosomes associated with the Endoplasmic Reticulum (ER). Once in the lumen of the ER they are processed and eventually packaged into a) COPII vesicles containing cargo such as barley lectin or sporamin (yellow) or b) Precursor Accumulating Vesicles (PAC) as in the case of pumpkin ZS albumin (red). COPII vesicles deliver their cargo to the Golgi complex where they are processed as they proceed by a COPI vesicle mediated process (dark green). PAC vesicles, in contrast, deliver their cargo (red) to the Protein Storage Vacuole (PSV). Proteins processed in the Golgi network reach the TGN where they are sorted. l) ssVSS bearing proteins (blue) are packaged into Clathrin-Coated Vesicles (CCV) en route to the prevacuolar compartment (PVC). These proteins ultimately reach the Lytic Vacuole (LV). 2) Proteins bearing a ctVSS such as barley lectin or a psVSS such as legumin (brown) are packaged into Smooth Dense Vesicles (SDV) en route to the PSV. In vegetative cells, the PSV and LV oflen fuse to form one large vacuole (FV). 3) Proteins which lack a Golgi retention signal or a vacuolar targeting signal are packaged into secretory vesicles and their cargo (green) are secreted into the apoplast. The CO-lf. sin. 30 it ah}; The Endoplasmic reticulum Storage proteins are synthesized on the rough endoplasmic reticulum (ER) and are co-translationally inserted into the ER lumen. This entry into the ER is specified by a signal peptide residing at the N-tenninus of the nascent polypeptide and consists of 16 to 30 residues organized in three domains: a positively charged amino terminus, followed by a hydrophobic region and a polar region terminated by a cleavage site (von Heijne, 1985). The signal peptide is cleaved upon entry into the ER. Once in the ER lumen, proteins undergo various modifications such as N-linked glycosylation, disulphide bond formation, oxidation of prolines to hydroxyprolines, trimming of the glycan and protein folding (reviewed in Galili et a1, 1998). There is significant evidence to indicate the existence of a quality control step in the ER resulting in the reverse translocation of misfolded proteins to the cytosol (reviewed in Vitale and Denecke, 1999). Proteins such as pumpkin 2S albuminand monocot prolamins exit the ER via PAC vesicles or protein bodies respectively and are transported directly to the vacuole (Okita and Rogers, l996; Galili et a1, 1998; Hara-Nishimura et all, 1998; Choi et a1, 2000). The transport of soluble proteins from the ER to the Golgi has been found to occur via COPII vesicles in yeast (for review see Barlowe, 1998). This mechanism involves Sec12p (GTP-exchange protein), Sarlp (a GTPase), Se023p, Sec24p, Sec13p and Sec31p (for review see Barlowe, 1998). Similarly, in plants the ER to Golgi traffic is attributed to a homologous COPII system. This hypothesis was supported by the identification of plant homologues of the yeast proteins. The homologues identified are AtSARl (Bar-Peled and Raikhel, 1997; Takeuchi et a1, 2000), AtSEClZ (Bar-Peled and Raikhel, 1997), and AtSEC23 (Movafeghi et a1, 1999). QTOIL‘E (Lem mans. prom 211. It, Piotr} Golgi and the plant secretory pathway The plant Golgi body can be discerned as a stack of membrane-bound compartments (cistemae) associated with a large network of tubules and vesicle buds called the trans-Golgi network or TGN. The number of Golgi stacks varies depending on the plant species and can range from one stack per Chlamydomonas cell to thousands in giant fiber cells in cotton (for review Andreeva et a1, 1998). Plant Golgi bodies carry out a number of functions including protein glycosylation, lipid biosynthesis and polysaccharide synthesis. BaSed on the localization of Golgi enzymes, polysaccharides, and various modifications that proteins undergo in the Golgi stacks, the Golgi cistemae are classified into cis-, medial- and trans-Golgi (Dupree and Sherrier, 1998). Typically, proteins are delivered from the ER to the cis-Golgi, proceed through the medial- and trans-Golgi undergoing various modifications. The modifications include protein glycan al,3-fucosylation (Lerouge et al, 1998), protein glycan Bl,2-xylosy1ation (Lerouge et a1, 1998), proteolytic processing (Jiang and Rogers, 1999) and complex mannosylations of the glycan (Lerouge et a1, 1998). Intra-Golgi traffic in plants has been proposed to occur via a COPI vesicle-mediated manner (Andreeva et a1, 2000, Pirnpl et a1, 2000) or via cistemal maturation as in the case of scales in algae (Brown, 1969) and procollagen in mammalian cells (Bonfanti et a1, 1998). Eventually, proteins arrive at the TGN en route for two broad destinations: the extracellular space via the default pathway of exocytosis, or the vacuole via a sorting determinant-mediated pathway. It can be said that in the absence of a positive sorting determinant, soluble proteins are secreted via the default pathway. Currently, very little is known about the vesicular machinery of exocytosis. However it is clear that the process is med; lIlCt'l‘. The; 50ml med; mediated by calcium ions and annexins and that these vesicles are delivered to the plasma membrane by microfilaments (Carroll et a1, 1998, Battey et a1, 1999, Sutter et al. 2000). The presence of a positive sorting determinant directs proteins to the vacuole (V itale and Raikhel, 1999). Sorting determinants Sorting determinants can be broadly classified into two categories, propeptide mediated sorting and mature protein domain-mediated (psVSS) sorting (Figure 1.2). In Ls? fissvss | I Sweet potato sporamin {3: lama... hu]ctVSS] Barley lectin LE | | psVSS I ! Bean phytohemagglutinin Figure 1.2 Vacuolar soluble proteins and their targeting signals SP = Signal peptide, ssVSS = sequence specific vacuolar sorting signal, ctVSS = C- terrninal vacuolar sorting signal, psVSS = protein structure vacuolar sorting signal. the case of propeptide-bearing proteins, the propeptide is eventually cleaved and is absent fi'om the mature protein in the vacuole. The study of a number of vacuolar hydrolases, lectins, seed storage proteins and protease inhibitors indicates that vacuolar sorting is mediated by the presence of a propeptide signal at the N-terminus (ssVSS) of the protein Net ll‘. CL ssVS 6. 1 art prefer r o" p.816! IT‘t—Jm Ana—l ss\'55 Slfial 3‘1. (adjacent to the signal peptide) as in the case of sweet potato sporamin and barley aleurain (Figure 1.2) or at the C-terminus (ctVSS) of the protein (Figure 1.2) as in barley lectin (BL), tobacco chitinase (TC), kiwi actinidin (Paul et al, 1995), bean phaseolin and brazil nut ZS albumin (reviewed by Venkataraman and Raikhel, 1998; Matsuoka and Neuhaus, 1999; Vitale and Raikhel, 1999). The analysis of the psVSS has been reviewed in detail by Neuhaus and Rogers (1998) and will not be covered in this review. ssVSS Recently Matsuoka (2000) summarized the properties of the ssVSS as to consist of an Xl-Xz-I/L-X3-X4 motif where X1 lacks a small hydrophobic side chain and Asn is preferred, X2 is a non acidic amino acid, X3 is any amino acid and X4 is a large and preferably hydrophobic amino acid (Table 1.1). Although these signals were originally Table 1.1 List of representative vacuolar sorting signals Protein ssVSS Sweet potato sporaminl . . RFNPIRLPTTHEPA . . Barley aleurain' . .SSSSFADSNPIRPVDTDRAAST. . Arabidopsis aleura' 2 . .ANIGFDESNPIRMVSDGLREVC. . Protein ctVSS Barley lectinl . .VFAEAIAANSTLVAE Tobacco chitinasel - - GLLVDTM Bean phaseollinl . .AFVY Tobacco osmotinI . . CPYGSAHNETTNFPLEMPTSSTHEVAK Kiwi actinidin3 . . QNHPKPYSSLINPPAFSMSKDGPVGVDDGQRYSA ssVSS = sequence specific vacuolar sorting signal; ctVSS = C-terminal vacuolar sorting signal. Letters in bold face represent amino acids forming the conserved motif. 1 = reviewed by Matsuoka and Neuhaus (1999). 2 = Ahmed et a1 (2000). 3 = Paul et al (1995) tder. Sig: tine. nut I NAB. that 1 601g: bails} 5.00m: 51mm" lira f imam lll Son identified at the N-terminus of cargo proteins, Koide et a1 (1997) demonstrated that the signal is also functional if artificially placed at the C-terminus of a cargo protein. This finding may explain the properties of the C-terminally located targeting signals of Brazil nut ZS albumin (Kirsh et al, 1996), the pumpkin ZS albumin (Shimada et a1, 1997) and NaPI (Miller et al, 1999). These C-terminally located signals have a XIXZI/LX3X4 motif and bind the ssVSS-specific vacuolar sorting receptor BP80 or its orthologs (Matsuoka, 2000). Jiang and Rogers (1998) have demonstrated that the cleavage of the ssVSS occurs in the PVC and that the mature protein is then transported to the vacuole. To differentiate between proteins with a vacuolar sorting determinant and those that lack a sorting determinant, the existence of receptors at the trans-Golgi or trans- Golgi network was postulated. Using synthetic peptides corresponding to the ssVSS of barley aleurain and the non-functional mutant ssVSS, Kirsch et al (1994) purified from pea clathrin coated vesicles, an 80 kDa protein (BP80). This protein was found to interact with the aleurain ssVSS with a kd of 37 nM in a pH dependent manner. Later the Raikhel group cloned AtELP (Ahmed et al, 1997), the Arabidopsis ssVSS receptor, and demonstrated that this protein is localized in the TGN and interacts with synthetic peptides representing ssVSSs in vitro (Ahmed et al, 2000). The role of AtELP as an ssVSS receptor was further strengthened by observing the co-localization of AtELP and sporamin in Arabidopsis cells. However AtELP did not co-localize with BL in Arabidopsis cells indicating that the receptor was specific for ssVSS bearing proteins in viva. Synthetic peptides representing the cytosolic domain of AtELP were found to interact with mammalian AP-l adaptor complexes in-vitro confirming the role of AtELP in sorting ssVSS-bearing proteins and delivering them into clathrin-coated vesicles 10 IV '5 mlSSL‘ secret Greer filncti. rest»; Vacuo (Sanderfoot et al, 1998). AtELP was also co-localized with AtSYPZl (a syntaxin found on the PVC) indicating that this receptor shuttles between the TGN and the PVC (Sanderfoot et al, 1998). ctVSS While a consensus motif has been identified for the ssVSS sorting determinants, ctVSS signals do not possess a common motif (Table 1.1) at the level of their primary structure. This type of sorting determinant is found in a number of lectins, storage proteins, hydrolases and defense related proteins. To date ctVSSs from barley lectin (Dombrowski et al, 1993), tobacco chitinase (Neuhaus et al, 1994), kiwi actinidin (Paul et al, 1995), bean phaseolin (Frigerio et a1, 1998) and tobacco osmotin (Sato et a1, 1995) have been characterized. The deletion of this determinant from the protein results in the missorting to the apoplast, and conversely the addition of this signal to heterologous secreted proteins such as cucumber chitinase (Neuhaus et al, 1991), hen egg white lysozyme (Neuhaus et al, 1995), rat preputial B-glucuronidase (Neuhaus et al, 1995), Green Fluorescent Protein (Di Sansebastiano et a1, 1998) and sporamin with a non- functional ssVSS (Matsuoka et al, 1995) resulted in the vacuolar localization of the respective chimeric proteins. It has been postulated that the ctVSS may be cleaved in the vacuole by an aspartic proteinase (Amidon et a1, 1999). Extensive site directed mutation analyses have been performed on the tobacco chitinase ctVSS: 'GLLVDTM' (Neuhaus et a1, 1994) and the BL ctVSS 'VFAEAIAANSTLVAE' (Dombrowski et al, 1993). The terminal methionine of the TC-ctVSS could be deleted without effect, while all other residues were required. Replacing 2LLV4 with SSS or 4VDTM7 with LLLL did not reduce 11 I he b}? THC L simn Secret sauna 50m. bten t Sonin ssl'gg. the sorting efficiency while replacing the terminal M with a G reduced vacuolar sorting by 50%. In the case of the BL-ctVSS as little as lVFA3 was shown to be functional in vacuolar targeting, while the addition of two G residues at the C-terminus of the motif abrogated vacuolar targeting. Although glycosylation is not necessary for vacuolar targeting in plants, translocating the glycosylation site from a -7 position to a -1 position in the ctVSS of BL resulted in secretion of the protein (for review Dombrowski and Raikhel, 1996). This is attributed to a steric hindrance rather than a role for the glycan. The lack of a consensus motif within the ctVSSs indicates that a structural feature of the ctVSS might be responsible for the selectivity. It has been suggested that the ctVSS of barley lectin adopts an amphipathic alpha helix structure (Dombrowski and Raikhel, 1996). However, such a structural feature has not been demonstrated to be necessary for ctVSS-mediated vacuolar sorting. Currently, no receptor has been identified for the ctVSS pathway. It has been shown that if ctVSS proteins are expressed to high levels, then the proteins can be secreted (Neuhaus et a1, 1994; Frigerio et al, 1998) indicating that the pathway is saturable. Frigerio et a1 (1998) suggest the existence of a specific machinery involved in sorting ctVSS bearing proteins onward to the vacuole. However, such proteins have not been identified. Sorting machinery While a large body of evidence has been accumulated regarding the transport of ssVSS-bearing proteins, very little is known about the pathway transporting ctVSS- bearing proteins. However, there are several differences between the two pathways. 12 ct\'S (163110 C0352 \BTIO‘ Sét‘fel obser VBC 110 Chara. Other 1 1‘. I B0163. Matsuoka et a1 (1995) demonstrated that both barley lectin and tobacco chitinase are secreted from tobacco protoplasts when treated with 33 M of wortmannin, whereas sporamin is correctly targeted to the vacuole. Though wortmannin is an inhibitor of phosphatidyl inositol-3 kinase in mammalian cells, its specific role in the plant secretory pathway has not been elucidated. It is also known that ctVSS-bearing proteins have not been found in clathrin-coated vesicles that contain aleurain (an ssVSS-bearing protein). Finally, the final destinations for these proteins are different as the ssVSS-bearing proteins are delivered to the lytic vacuole (decorated with 'y-TIP, an aquaporin), while ctVSS-bearing proteins are delivered to the PSV (decorated with OL-TIP). These evidences demonstrate that the ctVSS and ssVSS-mediated pathways are different. The lack of a consensus motif in the ctVSS and the variation in the length of the ctVSS among the various known ctVSS sequences has hampered the understanding of this part of the secretory pathway. In summary, the study of vacuolar protein transport has come a long way since observing spaces in the cell that were devoid of cytoplasmic contents. A number of vacuolar proteins have been identified and their targeting motifs dissected. The characterization of these sorting signals led to the identification of sorting receptors and other proteins required for this machinery. However, the lack of a consensus sequence has hampered the understanding of the machinery involved in sorting ctVSS-bearing soluble proteins to the vacuole. l3 .‘lrait to 6. mute: defee throw "Statement of purpose" The research presented aims at identifying proteins involved in ctVSS-mediated protein transport. So far it is known that a class of soluble proteins reach the vacuole by virtue of their ctVSSs. In view of the general features of the various secretory pathway models proposed, it is conceivable to expect a variety of proteins involved in the pathway. Some of these might be receptors recognizing ctVSS-bearing cargo and sorting them into vesicles bound for the vacuole. This pathway may also involve a specialized class of vesicles decorated with a complement of proteins. The fate of these vesicles may be regulated by certain factors. To characterize the pathway, a genetic approach has been undertaken in Arabdopsis thaliana to identify factors involved in the pathway, described in Chapters 2 to 6. Through a mutant screen cm] and cst were identified as ctVSS vacuolar sorting mutants. cm] was found to be a single gene recessive mutation causing a leaky sorting defect in root cells. This mutation was mapped to a contig of bacterial artificial chromosomes between AtSOl9l and DFR on chromosome V. 14 Chapter 2 Development of a reporter line for use in a screen to identify mutants impaired in ctVSS-sorting in Arabidopsis 15 1)" C01. ..| .\l. h | | '23 the t: CASE that :1 Vanity tnt'ol'. Plant r item It W018}: (Haw INTRODUCTION The study of the secretory pathway in yeast has advanced due to the large collection of vps mutants impaired in vacuolar protein transport (Raymond et a1, 1992). More than 50 genes have been identified whose products play important roles in the pathway. The identification of the plant homologues of yeast and mammalian secretory pathway proteins has advanced the understanding of this mechanism in plants. This approach was initiated by complementing the yeast pep12 mutant with an Arabidopsis cDNA library resulting in the identification of AtPEPlZ (later renamed SYPZl) (Bassham et al, 1995), the first plant syntaxin to be identified. Since then AtSYP4l and AtSYP42 (Bassham et al, 2000), AtVPS45 (Bassham et al, 2000), AtVTIla and b (Zheng et a1, 1999), have been identified and characterized. It is likely that these proteins are involved in the ssVSS pathway. On the other hand, wortmannin sensitivity studies (Matsuoka et a1, 1995) and ctVSS mutation studies (Dombrowski et al, 1993, Neuhaus et al, 1994) have shown that the ctVSS sorting pathway may be different from that of the ssVSS-sorting pathway. The ctVSS mediated pathway appears to be unique to plants as Gal and Raikhel (1994) showed that the yeast vacuolar targeting machinery does not transport barley lectin to the yeast vacuole. Taken together the ctVSS-mediated pathway may be unique to plants and may involve proteins hitherto unknown in other systems. To identify proteins involved in targeting ctVSS-bearing proteins to the vacuole, a plant mutant screen was envisaged. A number of candidate ctVSS-bearing proteins have been identified in Arabidopsis such as AtOsmotin (Capelli et a1, 1997), myrosinase-binding protein (Takechi et al, 1999), vacuolar invertases such as AtFruct3 and AtFruct4 (Haouazine-Takvorian et al, 1997), and a vacuolar chitinase, AtChib (Samac et a1, 1990). 16 Ht. let: an; 1h; tmr' toe Vat". the. ”1110:. 16ml: “Ola: 53le However, the role of the putative ctVSS in these proteins has not been characterized. Barley lectin (BL), a root cap and developing embryo protein has been expressed in Arabidopsis and extensively characterized (Dombrowski, 1995; Ahmed et al, 2000). In Arabidopsis thaliana ecotype RLD, constitutively-expressed BL was localized in the vacuole by immunoelectron microscopy and the precursor protein was shown by pulse-chase analysis to be proteolytically cleaved to the mature size. It was also demonstrated that BL lacking a ctVSS was secreted and localized in the apoplast. Another reporter active in plant vacuoles is rat preputial B-glucuronidase (Rat-GUS, Nishimura et al, 1986) when fused to the ctVSS of tobacco chitinase (Neuhaus et a1, 1995). Rat-GUS is a homotetrameric protein consisting of 75 kDa (648 amino acids) subunits, synthesized in rat cells as a precursor with a 15 amino acid C-terminal extension. This extension bears a glycosylation site for the phosphomannosyl glycan required for targeting to the lysosome via the mannose-6 phosphate receptor in animal cells (Johnson et al, 1990). The deletion of the C-terminal extension (Rat-GUS-Delta) resulted in the secretion of the protein in tobacco cell suspension protoplasts and a reduction in activity. The level of expression, however, was not affected. With the addition of the ctVSS from tobacco chitinase (Rat- GUS-ctVSS), the protein was efficiently directed to tobacco protoplast vacuoles (Neuhaus et al, 1995). To identify mutants impaired in the ctVSS pathway in the absence of known endogenous reporter, a transgenic approach was used whereby mutants would be identified which mislocalized heterologous ctVSS-bearing proteins. As the mutant screen involved chemical mutagenesis (see Chapter 3 for details), two reporters were used, one as a primary reporter to screen mutants visually and the other as a secondary reporter to 17 \‘Cl‘f. mur. 53D" “‘1 , BL ' -‘DV JAM map: I l hon mu: due verify the mutants by electron microscopy. Such an approach would ensure that cis- mutants (defective only in the targeting signal of one reporter) would be eliminated by the secondary screen. To screen a large collection of mutants, a visual screen was devised with Rat-GUS-ctVSS as the primary reporter as its enzymatic activity is easily detected in vivo. To perform the secondary screen, BL was chosen as a reporter. The localization of BL could be performed by immunoelectron microscopy. Since this screen required a stable line expressing both reporters, it was necessary to develop a line with a single insertion locus that was homozygous for the reporter construct. This would facilitate mapping and positional cloning at a later stage. The development of a transgenic homozygous single locus seed line for EMS mutagenesis is presented. RESULTS AND DISCUSSION While important results have been obtained describing protein sequences necessary for sorting soluble proteins to the vacuole, little is known of the machinery that carries out the sorting in plants. The goal of this study was to develop a line of Arabidopsis plants stably transformed with a vacuolar reporter construct for creating ctVSS-sorting mutants. Mutants with aberrant sorting pathways would then be expected to mis-direct their cargo to the apoplast via the default pathway of exocytosis. In order to screen for mutants defective in the transport machinery, a reporter construct was developed containing cDNAs encoding two reporter proteins: Rat-GUS-ctVSS and BL. Rat-GUS-ctVSS served as the primary rePorter in a visual screen and BL as a secondary reporter to verify the trans nature of the mutation. Two reporters were used in the screen to eliminate false positives that would arise due to chemically mutagenized vacuolar targeting signals (cis—mutations). As a control, a 18 U37: 1101.. Rat- VCClt . C0111.“ transgenic line was developed (Rat-GUS-Delta) which lacks a vacuolar targeting signal and would be secreted into the apoplast. Rat-GUS-ctVSS/BL Reporter construct and Rat-GUS-Delta construct A binary vector construct was created in pMOG800 (Mogen International, The Netherlands), with Rat-GUS-ctVSS driven by a double 358 promoter and BL driven by a second double 35S promoter resulting in pMOGCRB. Agrobacterium tumefaciens (GV3101 pMP90) cells were transformed with this construct and used to infect Arabidopsis thaliana ecotype Columbia plants by vacuum infiltration (Bent et al, 1994). In parallel, Rat- GUS-Delta (Neuhaus et al, 1995) driven by a 35S promoter was cloned into a pMOG800 vector and used to similarly transform Arabidopsis thaliana ecotype Columbia plants. This control line helped establish the screening assay. Primary transformants and evaluations Eleven Arabidopsis thaliana ecotype Columbia plants were independently infiltrated with a suspension of Agrobacterium cells carrying the pMOGCRB construct according to Bent et al (1994). About 3000 seeds per parent were screened on kanamycin media and an average of 15 plants per parent were found resistant to the antibiotic after 2 weeks of treatment. Since the insertion of the T-DNA is random, the activity of the transgene is known to vary from plant to plant depending on the site of insertion. Also, Within a single independently transformed plant, several transformation events could occur giving rise to different levels of expression of the transgene. Leaves of kanamycin resistant seedlings were assayed for Rat-GUS activity. Briefly, leaves were removed from three 19 seedlings per transformed parent and wounded to allow the entry of the Rat-GUS substrate, 5-bromo-4~chloro-3-indolyl B-D-glucuronide (X-GlcU), into the cell. The activity was evaluated visually. The results were compared with wild type plants (Figure 2.1a, b, c and d). Although wild type Arabidopsis has been reported to contain B-glucuronidase activity (Wozniak and Owens, 1994), wild type Arabidopsis plants did not pick up any stain after 2 hours of staining at 37 0C. In contrast, pMOGCRB plants stained very strongly. Activity was detected in trichomes and leaf veins. This confirmed that Rat-GUS was active when expressed in plants as previously shown by Neuhaus et al (1995). Among independent transformants, seedlings were selected visually for high B—glucuronidase activity. These plants were subsequently transferred to soil in growth chambers. It is likely that some of the lines have multiple insertions of the T-DNA and therefore exhibit higher levels of activity. At this stage however, it was difficult to evaluate the number of insertions based solely on a visual comparison of the level of activity. The histochemical assay was complemented by a quantitative fluorometric assay (adapted from Jefferson, 1987). B-Glucuronidase activity could be detected only in pMOGCRB plants (Figure 2.2 and Table 2.1) using 10 11g total protein per assay. The reaction appeared to proceed in a linear manner with respect to time. To confirm the presence of the secondary reporter, protein extracts were analyzed for the presence of BL. BL was detected by western blotting using anti-WGA sera (Ahmed et a1, 2000). At a dilution of 1:1000, the antibody cross-reacted with a single product of the expected size (Figure 2.3). 20 Panel A Panel C Panel B Panel D Figure 2.1 Verification of Rat-GUS activity in primary transformants A single leaf was harvested from each of 3 kanamycin resistant seedlings per independent transformant and wounded with a sterile micropipet tip (to allow the dye to enter the cells) in a microtiter plate. The leaf was then incubated overnight in 100 ml of assay buffer (0.1 M sodium acetate pH 4.8, 1 mg/ml BSA, 2 mM X-GlcU, 0.5 mM potassium ferrocyanide and 0.5 mM potassium ferricyanide) at 37 0C (modified from Guerineau and Mullineaux, 1993). Panel A) Leaf samples fi'orn kanamycin resistant primary transformant seedlings of 8 independent Columbia plants transformed with pMOBCRB stained for Rat-GUS activity (blue). Panel B) Leaf samples from wild type Columbia seedlings stained for Rat-GUS activity. Panel C) Close up of one microtiter well with a pMOGCRB primary transformant leaf stained for Rat-GUS activity (blue). Panel D) Close up of one microtiter well with a wild type leaf stained for Rat-GUS activity. The comparison of panels C and D indicate the strong blue stain in pMOGCRB transformants unlike wild type leaf samples. Images in this dissertation are presented in color. 21 16 TE :9 14 -~ 0 b n 12 + E \ MOGCRB : 10 T P . E 2 a ~~ . 2 .2 WI' = 6 -tr-— 3 .0 E a '5. 5 O 2 _. ._.<.HEAD><.T1TLE>PCReactor <./TITLE> <.SCRIPT>function PCR (Intro, NR,Prim,B10,prog) { if (BIO == "Sep0") { BlOX = NR * 2 B 10Xreal = Math.ceil (B 10X) K20 = "10 X PCR Buffer = " + B10Xreal + " ul " MG = NR * 0.6 MGreal = Math.ceil (MG) K30 = "50 mM MgC12 = " + MGreal + " ul " taq = NR * 0.4 Primer = NR " 0.25 dntp = NR "‘ 0.25 dd = (NR *19)-(BlOX + MG + taq + Primer + Primer + dntp) ddreal = Math.ceil(dd) } if (BIO = "Sep") { BlOX = NR * 2 B l 0Xreal = Math.ceil (B 10X) K20 = "10 X PCR Buffer == " + BlOXreal+ "111 " MG = NR * 1.2 MGreal = Math.ceil (MG) K30 = "50 mM MgC12 = " + MGreal + " ul " taq = NR * 0.4 Primer = NR * 0.25 dntp = NR * 0.25 dd = (NR *19)-(B10X + MG + taq + Primer + Primer + dntp) ddreal = Math.ceil(dd) } if (B10 = "Combo") { BlOX = NR * 3.2 B l 0Xreal = Math.ceil (B 10X) K20 = "6.25 X PCR-Mg Buffer = " + B10Xreal + " ul " K30 = H N taq = NR * 0.4 Primer = NR * 0.25 dntp = NR * 0.25 dd = (NR * l9) - ( BlOX + taq + Primer + Primer + dntp) ddreal = Math.ceil(dd) } if (prog == "svss1>4") 123 { prog = " I Cycling Program = " + prog Sl= "I Step 1:94 C for 7 min" 82: " I Step 2: 94 C for 30 sec" S3= " I Step 3: 55 C for 15 sec" S4= " I Step 4: 72 C for 30 sec" SS= " I Step 5: Back to Step 2 40 times" S6= " I Step 6: 72 C for 7 min" S7= " I Step 7: 4 C forever" } if (prog == "SVSSP3") { prog = " I Cycling Program = " + prog Sl= "I Step 1:94Cfor2min" SZ= " I Step 2: 94 C for 30 sec" S3= " I Step 3: 55 C for 15 sec" S4= " I Step 4: 72 C for 30 sec" SS= " I Step 5: Back to Step 2 40 times" S6= " I Step 6: 72 C for 7 min" S7= " I Step 7: 4 C forever" } if (prog == "SVSSP5") { prog = " I Cycling Program = " + prog Sl= "I Step 1:94 C for 5 min" SZ= " I Step 2: 94 C for 1 min" S3= " I Step 3: 55 C for l min" S4= " I Step 4: 72 C for 1 rrrin" SS= " I Step 5: Back to Step 2 40 times" S6= " I Step 6: 72 C for 7 min" S7= " I Step 7: 4 C forever" } TexSep = Intro + "\n\nName of Primer = " + Prim + " I Number of reactions = " + NR + "\nWater = " + ddreal + " 111" + prog + "\n" + K20 + 81 + "\n" + K30 + S2 + "\ndNTPs (10 mM each) = " + dntp + " 111" + S3 + "\n" + Prim + " Forward = " + Primer + " 111" + S4 + "\n" + Prim + " Reverse = " + Primer + ',' 111" + SS + "\nTaq Enzyme = " +taq+" ul"+S6+"\n ----- > 19 ulperRXN" +S7+ "\n+ lul Template DNA" document.PCRFonn.TextBox.value = TexSep } function F indMarker (marker) { fullQuery = "httpz/lgenome-www3.stanford.edu/cgi-bin/Webdriver 124 ?MIval=atdb_locus_max&name=" + escape (marker) location.href=fullQuery } function MasMix () { location.href = "http://www.msu.edu/~venkata1/sslpmas.htm" } function RFcM () { location.href = "http://www.msu.edu/~venkata1/cminvort.htm" } function labl () { location.href = "http://www.msu.edu/~venkata1/labsslst.htm" } function Eckers () { location.href = "http://genome.bio.upenn.edu/SSLP_info/SSLP.html" } function EckerMaps () { location.href = "http://genome.bio.upenn.edu/SSLP_info/SSLP_map.html" } function AraSer O { location.href = "http://genome-www.stanford.edu/Arabidopsis/AT-arabgen.html" } function LocatMarker (Smarker) { fulQuery = "http://genome-www3.stanford.edu/cgi-bin/AtDB/RImap?locus=" + escape (Smarker) location.href=firlQuery } function Chisqhet(Intro,C,H,L,Al) { C = C * l H = H * 1 L = L * 1 T = C + H + L CHI=((4*C*C)+(2*H*H)+(4*L*L)-(T*T))/T if (A1 = 0.1) {CHITAB = 4.605} if (A1 = 0.05) {CHITAB = 5.991} if (A1 = 0.01) {CHITAB = 9.210} if (CHI >= CHITAB) {R = "Failed"} if (CHI <. CHITAB) 125 {R = "Passed"} CHISHO = (Math.floor (1000 "' CHI))/1000 document.PCRForm.Hourf.value = Intro + "\n0bserved Segregation Ratio ofCol to Het to Ler= " + C + "z" + H + "z" + L + "\nExpected ratio 'of Col : Het : Ler if unlinked is = " + T/4 + "z" + T/2 + "z" + T/4 + "\n The Chisquare value is " + CHISHO + "\nand has " + R + " the null hypothesis at alpha = " + Al } function CalculateLink (Intro, HC, Het, HL) { N=(HC *1)+(HL*1)+(Het*l) RF = ((Het * 1) + (HL * 2))/(2 "' N) M1 = (Math.log(l- (2*RF))) * (-50) Mrounded = (Math.round (100 * Ml))/100 if (Mrounded <. l) {Mrounded = "0" + Mrounded} if (RF >= 0.5) {Mrounded = "Undefined & Unlinked"} if (RF >= 0.3) {Linko = "Not Linked"} if (RF <. 0.3) { Linko = "Linked"} SD = Math.sqrt((RF * (l - RF))lN) R2F = RF - SD M2 = (Math.log(l- (2*R2F))) "‘ (-50) M2R = (Math.round (100 * M2))/ 100 if (M2R <. 1) {M2R = "0" + M2R} SDcM = SD SDcMR = (Math.round (100 * SDcM))/ 100 if (SDcMR <. 1 ) {SDcMR = "0" + SDcMR} RF = (Math.round (100 * RF))/100 if (RF <. 1 ) {RF = "O" + RF} document.PCRForm.result0.value = RF document.PCRForm.resultl .value = Mrounded document.PCRForm.result2.value = Linko document.PCRForm.result3.value = SDcMR C = HC * 1 H=Ha*l L = HL * 1 T = C + H + L CHI=((4"'C*C)+(2*H*H)+(4*L*L)-(T*T))/T if (CHI >= 9.21) {R = "failed the null hypothesis at alpha = 0.01"} 126 if ((CHI <. 9.21) && (CHI >= 5.991)) {R = "failed the null hypothesis at alpha = 0.05"} if ((CHI <. 5.991) && (CHI >= 4.605)) {R = "failed the null hypothesis at alpha = 0.1"} if (CHI <. 4.605) {R = "passed the null hypothesis at alpha = 0.1"} CHISHOT = (Math.floor (1000 * CHI))/1000 CHISHO = CHISHOT if (CHISHOT <. 1) CHISHO = "0" + CHISHOT document.PCRForm.Hourf.value = Intro + "\n\nObserved Segregation Ratio of Col to Hetto Ler=" +C+":"+H+":"+L+ "\nExpected ratio of Col : Het : Ler if unlinked is = " + T/4 + ":" + T/2 + ":" + T/4 + "\nThe Chisquare value is " + CHISHO + "\nand has " + R + N." + "\nThe Recombination frequency is = " + RF + " +/- " + SDcMR + "." + "\nThe mapping distance is = " + Mrounded + " cM." + "\nThe lower limit (RF - SD) mapping distance is = " + M2R + " cM." } today = new Date () document.write ("<.H1>SSLP Package <./Hl><.D"I>This program estimates materials to be used for a PCR SSLP reaction and the analysis of the results<.DT> <.F ONT SIZE = 1> A freeware product of Iyer-Venk: Developed by Meera Iyer and Sridhar Venkataraman<./D'I><./FONT><.HR>Date: " + today) <./SCRIP'I> <.BODY> <.FORM NAME = "PCRForm"><.FONT FACE = "tahoma" SIZE =2><.DT>Details of the experiment <.IN PUT TYPE = "text" NAME = "ibox" SIZE= 50> <.DT>Number of standard SSLP - 20 ul Reactions <.INPUT TYPE = "text" NAME = "NumR" SIZE= 3> Name of Primer <.INPUT TYPE = "text" NAME = "Prim" SIZE= 8> <.dt>Buffer and Mg <.SELECT NAME= "Bufl 0Mg"> <.0PTION VALUE = "Sep0">Separate 10 X PCR buffer and MG (1.5 mM final)<./0PT10N> <.0PTION VALUE = "Sep">Separate 10 X PCR buffer and MG (3 mM final)<./0PTION> <.0PT ION VALUE = "Combo">Combo PCR buffer and MG (6.25 X and 18.75 mM)<./0PT10N> <./SELECT> Cyling Program <.SELECT NAME= "Prog"> <.0PTION VALUE = "SVSSP3">SVSSP3<./0PTION> <.0PTION VALUE = "SVSSP4">SVSSP4<./0PT10N> <.0PTION VALUE = "SVSSPS">SVSSP5<./0PT10N> 127 <./SELEC'I> <.dt><.INPUT TYPE = "button" VALUE = "Polymerize" onClick = " PCR (document.PCRForm.ibox.value, document.PCRForm.NumR.value, document.PCRForm.Prim.value,document.PCRForm.Bufl 0Mg.options [document.PCRForm.Bufl 0Mg.selectedlndex].value,document.PCRForm.Prog.options [documentPCRForm.Prog.selectedIndex].value)"><./FONT> <.INPUT TYPE = "button" VALUE = "Master Mixes" onClick = " MasMix 0" target = "_new_window_"><./FON'l> <.IN PUT TYPE = "button" VALUE = "Explain" onClick = " F indMarker (document.PCRForm .Prim.value)" target = "_new_window_"><./FONT><.INPUT TYPE = "button" VALUE "Locate on RI Map" onClick = " LocatMarker (document.PCRForm.Prim.value)" target = "_new_window_"><./FONT> <.IN PUT TYPE = "button" VALUE = "cM <.=> RF conversions" onClick = " RFcM 0" target = "_new_window_"><./FON'I> <.dt><.INPUT TYPE = "button" VALUE = "GoTo Ecker's SSLP page" onClick = " Eckers() " target = "_new_window__"><./FONT> <.IN PUT TYPE = "button" VALUE = "Ecker's SSLP Map page" onClick = " EckerMaps 0 " target = "_new_window_"><./FON'I> <.INPUT TYPE = "button" VALUE = "Raikhel's SSLPs" onClick = " labl 0" target = "_new_window__"><./FONT> <.FONT SIZE =3><.dt><.BR><.textarea name="TextBox" rows=15 cols=70><./textarea><./BR><.D'I> <.p><.b><.font size = 3> Analysis of Data <./font><./b> <.dt><.FONT SIZE =3> Details of the experiment <.INPUT TYPE = "text" NAME = "boxO" SIZE= 50><./dt> <.DT>Number of homozygous lines in Columbia Ecotype <.INPUT TYPE = "text" NAME = "HomCol" SIZE= 5> <./D'I> <.DT>Number of heterozygous lines <.INPUT TYPE = "text" NAME = "Het" SIZE= 15> <./DT> <.DT>Number of homozygous lines in Lansberg Ecotype <.INPUT TYPE = "text" NAME = "HomLer" SIZE= 5> <./DT> <.dt><.INPUT TYPE = "button" VALUE = "Evaluate" onClick = "CalculateLink (document.PCRForm.box0.value,document.PCRFonn.HomCol.value, document.PCRForm.Het.value, document.PCRForm.HomLer.value)"> <.P>Recombination Frequency (RF) <.INPUT TYPE = "text" NAME = "resultO" VALUE="" SIZE=15> <.DT>Standard Deviation in "~RF" <.INPUT TYPE = "text" NAME = "result3" VALUE="" SIZE=15> <./D'I> <.DT>Linked (<. 0.3 RF)/ Unlinked (> 0.3 RF) <.INPUT TYPE = "text" NAME = 128 "result2" VALUE="" SIZE=15> <./D'l> <.DT>Mapping Distance (cM) <.INPUT TYPE = "text" NAME = "resultl" VALUE="" SIZE=30> <./D'I> <.BR> <.textarea name="Hourf" rows=12 cols=70><./textarea><.BR> <.FONT FACE="Tahoma"><.P>You are visitor no. <.1MG SRC="http://counter.digits.corn/wc/-d/4/svsslp" ALIGN=middle WIDTH=60 HEIGHT=20 BORDER=0 HSPACE=4 VSPACE=2> <.A HREF="http://www.digits.com/"><.FONT SIZE=2>http://www.digits.com<./FON'I><./A> <.P> Should you find this program useful in your analysis, please cite "Sridhar Venkataraman and Natasha V. Raikhel, Arabidapsis Mapping Package using SSLP and CAPS data, 1999" <.P><.A HREF="mailtozvenkatal@pilot.msu.edu"><.FONT FACE="Times,Times New Roman">Please send Comments to Sridhar Venkataraman<./A><./D'I> <.DT><.A HREF="http://www.msu.edu/~venkatal/index.htm"><.FONT FACE="Times, Times New Roman">Back to the main page<./A><./DT><./FORM> <.strong>A search of all messages contained in the BioSci Arabidapsis Genome mailing or newsgroup.<.lstrong> <.FORM ACT10N="http://genome-www.stanford.edu/cgi-bin/AtDB/AT-arabgenbiosci search.cgi" METHOD=POST> <.TABLE><.TR><.TD><.INPUT NAME="search" VALUE="" size=50> <.TD><.INPUT TYPE="image" SRC="http:l/genome-www.stanford.edu/Excite/pictures/ AT-search_button.gif' NAME="searchButton" HEIGHT=20 WIDTH=75 ALT="Search" BORDER=0><./TD><./TR><./TABLE> <.IN PUT TYPE="hidden" NAME="sp" VALUE="sp"> <./FORM> <.FORM NAME= "main" METHOD="POST" ACT10N="http://www.williamstone.com/ primers/calculator/calculator.cgi"> Primer Evaluation at www.williamstone.com <.dt> <.td><.IN PUT TYPE="text" NAME="fprimer" SIZE=40><./td> <.td><.SELECT NAME="forient"><.0PTION SELECTED VALUE="0">5' to 3' <.0PTION VALUE="1">3' to 5'<./SELECT><./td> <./tr><./table><./font><./td> <.INPUT TYPE="submit" NAME="nextbutton" VALUE="Eva1uate Primer"> <.IN PUT TYPE="hidden" NAME="next" VALUE="evaluate"> <./FORM> <./BODY> <./HTML> 129 APPENDIX B Bradford Analysis made simple: A web-based approach http://www.msu.edu/~venkata1/bradfordhsa.htm 130 INTRODUCTION A key step in protein studies is to quantify the concentration of protein present in a given solution. A variety of assays are available for determination of protein concentrations such as the Lowry procedure, the Biuret method, the bicinchonic acid procedure and the Bradford assay (see Dunn, 1989 for review). The Bradford assay is a convenient assay to measure concentrations of protein in the range of 200 11ng and 1,400 ug/ml. This assay is based on the binding of Coomassie Brilliant Blue G-250 to primarily basic (especially arginine) and aromatic amino acid residues (Bradford, 1976). The reaction can be detected using colorimetrically at 595 nm. RESULTS AND CONCLUSION To measure the concentration of a protein sample, the absorption of light at 595 nm is compared to the absorption of known concentrations of human serum albumin (HSA). To facilitate the calculation of the concentrations an HTML code has been written. This code carries out a regression analysis to determine the slope and y-intercept of the standards used in the assay. These results are then used to calculate the concentration of the test sample. The use of this program has facilitated the evaluation of protein concentrations in a convenient manner. REFERENCES Bradford MM (1976) A rapid and sensitive method for the quantitation of rnicrogram quantities of protein utilizing the principle of protein-dye binding. Anal Biochem 72: 248-54. 131 Dunn MJ (1989) Determination of total protein concentration. In Harris ELV and Angel S (eds.) Protein purification methods: a practical approach. Oxford University Press, Oxford, UK. pp 10-17 132 HTML code for Bradford Analysis available at www.msu.edu/~venkata“bradfordhsa.htm <.HTML> <.HEAD> <.TI'I‘LE>BradFord<./'1‘ITLE> <.SCRIP'I>function Brad (Intro,NS,Xl,X2,X3,X4,X5,X6,X7,X8,X9,X10,Yl,Y2,Y3,Y4,Y5,Y6,Y7,Y8,Y9,Y10, NU,VU,AU,Nl,N2,N3,N4,N5,N6,N7,N8,N9,N10,U1,U2,U3,U4,U5,U6,U7,U8,U9,U10) { ARRAYNBIG = new Array(N 1,N2,N3,N4,N5,N6,N7,N8,N9,N10) ARRAYXBIG = new Array(Xl,X2,X3,X4,X5,X6,X7,X8,X9,X10) ARRAYYBIG = new Array(Y l,Y2,Y3,Y4,Y5,Y6,Y7,Y8,Y9,Y10) ARRAYUBIG = new Array(U1,U2,U3,U4,U5,U6,U7,U8,U9,U10) ARRAYUX = new Array(NU) SIGX = 0 for (a = 0; a <. NS; a++) { SIGX = SIGX + (1 * ARRAYXBIG[a]) } SIGX2 = 0 for (i=0; i <. NS; i++) { SIGX2 = SIGX2 + (ARRAYXBIG[i])*(ARRAYXBIG[i]) } SIGY = 0 for (b = 0; b <. NS; b++) { SIGY = SIGY + (1 * ARRAYYBIG[b]) } SIGXY = 0 for (j=0; j <. NS; j++) { SIGXY = SIGXY + (ARRAYXBIG[j])*(ARRAYYBIG[j]) } B = (SIGXY - ((SIGX) * (SIGY))/NS)/ (SIGX2 - (SIGX * SIGX)/N S) A = (SIGY/N S) - (B * SIGX/N S) for (k=0; k <. NU; k++) { ARRAYUX[k] = (Math.round ((ARRAYUBIG[k] - A) * 10000/(B * VU)))/10000 } . STDTAB = "\n\nStandard's Table" + "\n " + "\nIHSA (ug/ul)I OD 595 I" + "\n " for (p=0; p <. NS; pH) { 133 TX = ARRAYXBIG[p] if (ARRAYXBIG[p] <. 1) { TX = "o" + ARRAYXBIG[p]} TY = ARRAYYBIG[p] if (ARRAYYBIG[p] <. 1) { TY = "o" + ARRAYYBIG[p]} STDTAB = STDTAB + "\nI " + TX + " I " + TY + n I" } UNKTAB = "\n " + "\n\nUnknown's Table" + "\n ----------- + "\n Sample Vol = " + VU + " 111." + "\nISampleI OD 595 | Cone (ug/ul)| Vol (111) for " + AU + " ug protein" + "\n -------------- for (CF-0; q <- NU; q++) { TU = ARRAYUBIG[q] if (ARRAYUBIG[q] <. l) { TU = "0" + ARRAYUBIG[q]} TUX = ARRAYUX[q] VTUX = (Math.round (10 * AU / TUX))/10 if (ARRAYUX[q] <. l) { TUX = "0" + ARRAYUX[q]} UNKTAB = UNKTAB + "\nI " + ARRAYNBIG[q] + " I " + TU + " I " + TUX + " l H + VTUX +1! I N } document.ecoliForm.Brader.value = Intro + STDTAB + UNKTAB + "\n } today = new Date () document.write ("<.H1>Bradford test by Regression Analysis<./Hl><.DT>This program evaluates protein concentrations based on 0D595<.D'I><.F ONT SIZE = 2>Protocol <.dt> 1) Add 5 ml Bradford reagent (Bio-Rad 500-0006) to 20 ml H20. <.dt> 2) Start the Spec and set at CD 595 nm. <.dt> 3) Aliquot 1 ml of the diluted reagent to sample tubes, add 0,2,4,6,8 and 10 ul of <.dt> 1 mg/ml HSA (BSA can also be used but HSA is more linear and reproducible) for standards and say 3 ul of protein extract for analysis. <.dt>4) Quantitate and calculate below <.dt> <.F ONT SIZE = 1> A freeware product of Iyer-Venk: Developed by Meera Iyer and Sridhar Venkataraman<./DT><./FON'I><.HR>Date: " + today) <./SCRIPT> <.BODY> <.FORM NAME = "ecoliForm"> <.P><.F ONT FACE="Tahoma" SIZE=3>Details of the experiment <.INPUT TYPE = "text" NAME = "Intro" SIZE= 50> <.DT>Number of Standards (including blank) <.INPUT TYPE = "text" NAME = "NoS" SIZE= 3> 134 <.DT>Standard Blank (0 ug/ul HSA) <.INPUT TYPE = "text" NAME = "SIP" SIZE= 4> 0D <.INPUT TYPE = "text" NAME = "SOD1" SIZE= 6> <.DT>Standard l (ug/ul HSA)<.INPUT TYPE = "text" NAME = "SZP" SIZE= 4> 0D <.INPUT TYPE = "text" NAME = "SOD2" SIZE= 6> <.DT>Standard 2 (ug/ul HSA)<.IN PUT TYPE = "text" NAME = "S3P" SIZE= 4> 0D <.INPUT TYPE = "text" NAME = "SOD3" SIZE= 6> <.DT>Standard 3 (ug/ul HSA)<.INPUT TYPE = "text" NAME = "S4P" SIZE= 4> 0D <.INPUT TYPE = "text" NAME = "SOD4" SIZE= 6> <.DT>Standard 4 (ug/ul HSA)<.INPUT TYPE = "text" NAME = "SSP" SIZE= 4> 0D <.INPUT TYPE = "text" NAME = "SODS" SIZE= 6> <.DT>Standard 5 (ug/ulHSA)<.1NPUT TYPE = "text" NAME = "S6P" SIZE= 4> 0D <.INPUT TYPE = "text" NAME = "SOD6" SIZE= 6> <.DT>Standard 6 (ug/ul HSA)<.INPUT TYPE = "text" NAME = "S7P" SIZE= 4> 0D <.INPUT TYPE = "text" NAME = "SOD7" SIZE= 6> <.DT>Standard 7 (ug/ul HSA)<.INPUT TYPE = "text" NAME = "S8P" SIZE= 4> 0D <.INPUT TYPE = "text" NAME = "SOD8" SIZE= 6> <.DT>Standard 8 (ug/ul HSA)<.INPUT TYPE = "text" NAME = "S9P" SIZE= 4> 0D <.INPUT TYPE = "text" NAME = "SOD9" SIZE= 6> <.DT>Standard 9 (ug/ul HSA)<.INPUT TYPE = "text" NAME = "S10P" SIZE= 4> 0D <.INPUT TYPE = "text" NAME = "SODIO" SIZE= 6> <.P> <.DT>Number of Unknowns <.INPUT TYPE = "text" NAME = "NoU" SIZE= 3> Volume of Unknowns (ul) <.INPUT TYPE = "text" NAME = "VoU" SIZE= 4> Required amount of Unknowns (ug)<.INPUT TYPE = "text" NAME = "AU" SIZE= 4> <.DT>Unknown l <.INPUT TYPE = "text" NAME = "UlP" SIZE= S> OD 595 <.INPUT TYPE = "text" NAME = "UOD1" SIZE= 6> <.DT>Unknown 2 <.INPUT TYPE = "text" NAME = "U2P" SIZE= 5> OD 595 <.INPUT TYPE = "text" NAME = "UOD2" SIZE= 6> <.DT>Unknown 3 <.INPUT TYPE = "text" NAME = "U3P" SIZE= 5> OD 595 <.INPUT TYPE = "text" NAME = "UOD3" SIZE= 6> <.DT>Unknown 4 <.INPUT TYPE = "text" NAME = "U4P" SIZE= 5> OD 595 <.INPUT TYPE = "text" NAME = "UOD4" SIZE= 6> <.DT>Unknown 5 <.INPUT TYPE = "text" NAME = "USP" SIZE= 5> OD 595 <.INPUT TYPE = "text" NAME = "UOD5" SIZE= 6> <.DT>Unknown 6 <.INPUT TYPE = "text" NAME = "U6P" SIZE= 5> OD 595 <.INPUT TYPE = "text" NAME = "UOD6" SIZE= 6> ' <.DT>Unknown 7 <.INPUT TYPE = "text" NAME = "U7P" SIZE= 5> OD 595 <.INPUT TYPE = "text" NAME = "UOD7" SIZE= 6> <.DT>Unknown 8 <.INPUT TYPE = "text" NAME = "U8P" SIZE= 5> OD 595 <.INPUT TYPE = "text" NAME = "UOD8" SIZE= 6> <.DT>Unknown 9 <.INPUT TYPE = "text" NAME = "U9P" SIZE= 5> OD 595 <.INPUT TYPE = "text" NAME = "UOD9" SIZE= 6> <.DT>Unknown 10 <.INPUT TYPE = "text" NAME = "UlOP" SIZE= 5> OD 595 <.INPUT TYPE = "text" NAME = "UOD10" SIZE= 6> 135 <.P><.INPUT TYPE = "button" VALUE = "Submit" onClick = "Brad (document.ecoliForm.Intro.value, document.ecoliForm.NoS.value,document.ecoliForm. S 1P.value, document.ecoliForm.$2P.value, document.ecoliForm.S3P.value, document.ecoliForm.S4P.value, document.ecoliForm.SSP.value, document.ecoliForm.S6P.value, document.ecoliForm.S7P.va1ue, document.ecoliForm.S8P.value, document.ecoliForm.S9P.value, document.ecoliForm.S 10P.value, document.ecoliForm. SOD 1 .value, document.ecoliForm.SOD2.value,document.ecoliForm.SOD3.value, document.ecoliForm.SOD4.value, document.ecoliForm.SODS.value, document.ecoliForm.SOD6.value, document.ecoliForm.SOD7.va1ue, document.ecoliForm.SOD8.value, document.ecoliForm.SOD9.value, document.ecoliForm. SOD l 0.value,document.ecoliForm.NoU.value,document.ecoliForm. VoU.value,document.ecoliForm.AU.value, document.ecoliForm.U 1 P.value, document.ecoliForm.U2P.value, document.ecoliForm.U3P.value, document.ecoliForm.U4P.value, document.ecoliForm.USP.value, document.ecoliForm.U6P.value, document.ecoliForm.U7P.value, document.ecoliForm.U8P.value, document.ecoliForm.U9P.value, document.ecoliForm.U 10P.value, document.ecoliF orm.UOD1 .value, document.ecoliForm.UOD2.value, document.ecoliForm.UOD3.value, document.ecoliForm.UOD4.value, document.ecoliForm.UODS.value, document.ecoliForm.UOD6.value, document.ecoliForm.UOD7.value, document.ecoliForm.UOD8.value, document.ecoliForm.UOD9.value, document.ecoliForm.UOD10.value)"> <./FON"I> <.BR> <.textarea name="Brader" rows=50 cols=80 ><./textarea> <.BR> <.D'1> <.P><.A HREF="mailtozvenkata1@pilot.msu.edu"><.FONT FACE="Times,Times New Roman">Please send Comments to Sridhar Venkataraman<./A><./DT> <.DT><.A HREF="http://www.msu.edu/~venkata1/index.htm"><.F ONT FACE="Times,Times New Roman">Back to the main page<./A><./DT> <./FORM> <./BODY> <./HTML> 136 APPENDIX C 137 Table C.1 Genes present on the BAC MBB18 and the proteins encoded Clone Similarity 'MBB18.2 [Unknown Protein MBB 18.3 IUnknown Protein MBB18.4 IUnknown Protein BBl8.5 'Unknown Protein MBB18.6 Tryptophan synthase beta chain yrosinase binding protein-like, similar to jasmonate induced 'MBB l 8.7 rotein yrosinase binding protein-like, similar to jasmonate induced MBB 18.8 rotein B 18.9 IUnknown Protein MBB18.10 Isimilar to protein kinase 'MBB18.1 1 nknown Protein MBBI8.12 Similar to heat shock transcription factor HSF 30 MBB 18.13 IUnknown Protein [MBB 18.14 irnilar to heat shock transcription factor HSF 30 lMBB18.15 nknown Protein BB 18.16 IUnknown Protein BB18.17 DS-box protein-like lMBB18.18 Cytochrome B-S6l MBB18.19 Similar to guanine nucleotide exchange factor BB 1 8.20 nknown Protein MBB 1 8.21 [Unknown Protein MBB18.22 IUnknown Protein MBBl8.23 [Unknown Protein MBBI8.24 [Unknown Protein MBB18.25 Similar to non-LTR retroelement reverse transcriptase Annotation by the Kazusa DNA Research Institute (Chiba, Japan) and The Institute for Genomic Research, (TIGR, Rockville, MD, USA). 138 Table C.2 Genes present on the BAC MKD10 and the proteins encoded Clone imilarity 'MKD10.2 nknown Protein MKD10.3 IProline oxidase precursor MKD10.4 IUnknown Protein [MKD10.5 Similar to Salt-inducible protein IMKD10.6 [Unknown Protein IMKD10.8 IUnknown Protein IMKD10.9 Similar to pollen coat protein IM S-adenosyl methioninezsalycylic acid KD10. 10 carboxlrnethyltranferase-like protein IMKD10.11 [Unknown Protein IMKD10.12 JUnknown Protein Annotation by the Kazusa DNA Research Institute (Chiba, Japan) and The Institute for Genomic Research, (TIGR, Rockville, MD, USA). 139 Table C.3 Genes present on the BAC K15E6 and the proteins encoded blone Similarity K15E6.1 Similar to bZIP transcription factor K15E6.2 Unknown Protein K15E6.3 Amino acid transporter-like Kl 5E6.4 Cysteine-tRN A 1i gase K15E6.5 Unknown Protein K15E6.6 IDisease resistance protein-like K15E6.7 DNA-binding protein-like K15E6.8 Unknown Protein K15E6.9 Unknown Protein K15E6.10 Unknown Protein K15E6.11 Unknown Protein K15E6. 12 FmE protein-like K15E6.13 TRNA-IlefiAT) K15E6. l4 xalate oxidase (gemrin proteinL-like protein K15E6.15 Similar to non-LTR retroelement reverse transcriptase K15E6. l6 Oxalate oxidase (gerrnin protein)-like protein K15E6.17 Similar to retroelement pol K15E6. 18 Oxalate oxidase (germin protein)-like protein K15E6.19 Similar to oxalate oxidase K15E6.20 Oxalate oxidase (germin protein)-like protein K15E6.21 Cytochrome P-450 K15E6.22 Similar to retroelement pol K15E6.23 nknown Protein K15E6.24 Receptor protein kinase-like protein Annotation by the Kazusa DNA Research Institute (Chiba, Japan) and The Institute for Genomic Research, (TIGR, Rockville, MD, USA). 140 Table C.4 Genes present on the BAC MXF12 and the proteins encoded 12.2 12.3 12.4 12.5 12.6 12.7 12.8 12.9 12.10 12.11 12.12 12.13 12.14 12.15 12.16 12.17 12.18 12.19 -1ike ' kinase nknown Protein ' kinase-like ' kinase-like -like ltransferease-like imilar to Ac-like ' like ltransferease-like imilar to retroelement 1 like like nknown Protein like ' ' kinase-like Annotation by the Kazusa DNA Research Institute (Chiba, Japan) and The Institute for Genomic Research, (TIGR, Rockville, MD, USA). 141 Table C.5 Genes present on the BAC K3K3 and the proteins encoded K3.2 oxidase K33 nknown Protein K3.4 oxidase K3.5 ' ' to retroelement K3.6 ' ' K3.7 Protein K3.8 nknown Protein K3.9 nknown Protein K3.10 Protein K3.] 1 Protein K3.12 nknown Protein K3. 1 3 nknown Protein K3.l4 Protein K3.15 LTR retroelement reverse K3. 1 6 nknown Protein K3. 1 7 ' ' ' K3. 1 8 K3 . 19 ' like K3.20 ' like K3.2] ' like K3.22 ' like K3.23 P Annotation by the Kazusa DNA Research Institute (Chiba, Japan) and The Institute for Genomic Research, (TIGR, Rockville, MD, USA). 142 Table C.6 Genes present on the BAC MUL8 and the proteins encoded Protein Protein imilar to S60 S-locus Protein kinase-like imilar to ' nknown Protein bokr'os'uth'wiv; lo .10 .11 nknown Protein .12 ' ll .13 nknown Protein .14 nknown Protein .15 imilar to CHP-rich Zinc F ' .16 imilar to CHP-rich Zinc F' .17 nknown Protein .18 nknown Protein .19 nknown Protein .20 formation .21 -SNARE At-VTIla .22 ' .23 nknown .24 P K3.23 Annotation by the Kazusa DNA Research Institute (Chiba, Japan) and The Institute for Genomic Research, (TIGR, Rockville, MD, USA). 143 IIIIIIIIIIIIIIIIIIII IlIllI BIIIIIIJIZQIOM