{Fiat-25;. ‘1" lllllllllijlfllamllllljjllfflllllfllfllflllll ”gram... University THE GENOMIC ORGANIZATION, STRUCTURE AND EXPRESSION OF THE CHICKEN H2b HISTONE GENE FAMILY presented by 0 David Kilgore Grandy has been accepted towards fulfillment of the requirements for . Doctor of Philosophy degree in Microbiology and Public Health 6 a r professor O . dodgson MS U is an Affirmative Action/Equal Opportunity Institution 0-12771 Date 3-1L-85 MSU LIBRARIES —— v RETURNING MATERIALS: Place in book drop to remove this checkout from your record. FINES will be charged if book is returned after the date stamped below. dfi284fig; 35997 ’87 ass Ifi"->~‘-.-V‘wo‘ 'l" fie? 3257 'l THE GENOMIC ORGANIZATION, STRUCTURE AND EXPRESSION OF THE CHICKEN H2b HISTONE GENE FAMILY By David Kilgore Grandy A DISSERTATION Submitted to Michigan State University in partial fulfillment of the requirements for the degree of DOCTOR OF PHILOSOPHY Department of Microbiology and Public Health 1985 To Laurel and Madeline ii ACKNOWLEDGMENTS I would first like to thank Dr. Jerry Dodgson for providing a very pleasant and stimulating environment in which to learn and work. The unselfish sharing of his knowledge, time and financial resources has made the completion of this research project, in my 4% years in the Department of Microbiology and Public Health, possible. I wish him the best in the future. I should also thank Dr. J.D. Engel and Dr. B.J. Sugarman for sharing their ideas and unpublished data with me. I would like to thank my committee members, past and present, who have offered guidance and assistance during my tenure as a graduate student: Drs. Conrad, Fluck, Fritsch, Kung, Magee, Miller, Patterson and Robbins. I am especially grateful to M. Fluck who has stuck it out with me to the end and P.T. Magee, who almost did. I would like to acknowledge the support, shared ideas and friendship of the members of the Dodgson lab: K. Barringer, P. Bates, P. Boyer, D. Browne, D. Brush, P. Creature, M. Federspiel, J.D. Lee, W. Lewis, S. Stadt and C. Yoshihara. A special note of thanks is due my parents, H. Bruce and Betty Grandy, for literally and figuratively making me what I am today; my grandfather C.C. Grandy who encouraged my early interest in natural science; and my other surviving grandparents J. Grandy, M. Kilgore and L.D. Kilgore for their interest and support. iii Finally I thank Laurel for putting up with the trials and tribula- tions of the most hectic four years of our lives; for her love and support; for typing a large portion of this manuscript; and for Madeline (who was a doll in spite of all the commotion). I would also like to thank Hilary, Maxx and Maynyrd for offering playful diversions when they were needed. Chapter 2 is reprinted from Gene Expression (PP.445-455,1983) with the permission of the publisher, A.R. Liss. iv TABLE OF CONTENTS LIST OF TABLES ..... . .............. . ................................ vii LIST OF FIGURES ................................................... viii DEFINITION OF HISTONE ............................................... x INTRODUCTION........ ................................................. 1 The Histone Proteins.................. ......... ............ ..... 2 Biochemistry.. ..... ........... ......................... ....2 Histone protein function....................... ..... .......3 Histone protein variants........................ ........... 7 Secondary modifications of histone proteins.... ..... ......11 The Histone Genes.................................. ........ ....13 Organization....................... ...... ....... ...... ....13 Sea urchin............................... ..... .......13 Drosophila melanogaster.................. ..... .......18 Fungi........................... ..... ................20 Tetrahymena sp. and Artemia sp.. ..... ...... ..... .....22 Amphibia........................ ..... ... ..... ........22 Higher eucaryotes................... ..... .. ...... ....23 Plants.................................... ........ ...28 Fine structure................................... ..... ....29 Histone genes-5' flanking sequences..................30 Histone gene coding regions..........................33 Histone genes-3' flanking sequences..................34 Regulation of histone gene expression........ ....... ......36 Literature Cited...............................................42 CHAPTER 1. COMPLETE NUCLEOTIDE SEQUENCE OF A CHICKEN HZb HISTONE GENE........... ...................................................... 49 Abstract.............. ...... .. .......... .... .................. .49 Introduction................ ........ ...... ....... ..............50 Experimental Procedures..................... ..... .. ......... ...52 Material.......................... ..... ......... ....... ...52 Preparation and characterization of pKRla-1.3.............52 DNA sequencing............................................53 Results and Discussion.........................................53 Preparation of a chicken H2b gene plasmid.................53 DNA sequence analysis of pKRla-l.3 ...... ..................55 Sequence of gene-flanking regions............. ..... .......57 Acknowledgments.............................. .............. ....60 References..... ............... .. ........ .. .............. .. ..... 61 V PAGE CHAPTER 2. THE CHICKEN H2b GENE FAMILY .............................. 63 Abstract..... .......................................... . ....... 63 Introduction. 0 O O O I O O O O O O O O O O O O O O O O O O O O O O O O O I O O O O O O O O O O O O O O O I O O .64 Results and Discussion... ...... .... ....... .....................65 Analysis of the complete chicken H2b gene family..........65 Construction of a chicken histone H2b gene-containing plasmid....................... ..... . ....... ..... ........ 67 DNA sequence analysis of pBR2e-3.5........................67 Sequence conservation in the 5' gene-flanking regions Of H2b-la and H2b-26............... oooooooooo .0069 Acknowledgments............................. ........ .. ......... 73 References............................................ ......... 74 CHAPTER 3. GENOMIC ORGANIZATION, STRUCTURE AND EXPRESSION OF THE CHICKEN H2b HISTONE GENE FAMILY ......... . ........................... 76 Abstract............ ..... ........... ............. .. ............ 76 Introduction........ ..... ........... ............... .... ..... ...78 Materials and Methods..........................................81 Materials..................................... ...... ......81 Isolation and purification of recombinant DNA molecules...81 Filter hybridization......................................82 Restriction enzyme site mapping...........................83 DNA sequencing............................................83 Sl mapping analysis.......................................84 Results........................................................84 Estimate of the number of chicken H2b histone genes.......84 Isolation of the chicken H2b histone genes................88 DNA sequencing strategy of the H2b histone genes..........92 Chicken H2b coding sequences..............................95 5' sequences flanking the H2b histone genes..............100 81 mapping of the pKRla-l.3 transcript...................104 Chicken H2b histone gene 3' flanking sequences...........105 Identification of nonallelic sibling H2b histone genes...108 Discussion....................................................ll3 References. ........... ..... ............. . ...... ..... .......... 123 CONCLUSIONS ........................................................ 126 vi TABLE INTRODUCTION I CHAPTER 3 I LIST OF TABLES PAGE Classes of histones....... ....... .....4 Comparison of the H2b histone exons and the proteins they predict with respect to pKRla-1.3.........................101 vii LIST OF FIGURES FIGURE PAGE INRODUCTION 1 Comparison of the genomic organization of histone genes from a variety of species ................................... l6 2 Genomic organization of the chicken histone genes ............ . ................ 25 3 Genomic organization of a mouse histone gene cluster on MM221 ............. 27 4 Genomic organization of the human histone genes.. ........................... 27 5 Histone gene 5' homology blocks ........... 31 CHAPTER 1 1 Restriction maps of ACHla and pKRla-1.3...54 2 The complete nucleotide sequence of the chicken H2b gene ..... . .................... 58 CHAPTER 2 1 Identification of the chicken H2b histone gene family ..... . ..... . ...... . ............ 66 2 Restriction maps of ACHZe and pRR2e-3.5...68 3 The nucleotide sequence of two chicken H2b histone genes ....... . ................. 7O 4 Sequence and spacing homologies found in the 5' flanking regions of two chicken H2b genes ............... . ................. 72 5 A conserved nine nucleotide sequence unique to H2b genes.... ........ .... ....... 72 CHAPTER 3 l Genomic Southern blot of EcoRI/HindIII digested chicken red blood cell DNA ....... 87 2 Restriction enzyme cleavage maps of the ten chicken H2b histone gene-containing phage ..................................... 91 viii PAGE Restriction enzyme maps of the chicken H2b-containing subclones... ....... 94 Chicken H2b exon sequence comparison......97 Distribution of nucleotide base changes observed in the eight chicken H2b histone genes with respect to pKRla-1.3...98 DNA sequence analysis of the 5' flanking regions of eight chicken H2b histone geDESooooooooooo ...... ......OOOO... ...... 103 S mapping analysis of the pKRla—1.3 1 . transcrlpt... ...... 0.0.000000000000000107 DNA sequence comparison of the 3' flank- ing regions of nine chicken H2b histone geneSoooo00000000000000.0000...one...0000110 Alignment of the phage maps which contain nonallelic, sibling H2b histone genes....................................112 ix HIS'° TONE - n.[H.Hm~chm ~sm.m~ mMH mac ~-H Nan m.~ Nm.mH a.o Nm.m~ NOH zoo fiHHH.mmvm: umm Na oom.o~ nH< Nofi Nm.mm m.~ Nm.mH w.¢ No.q Nwa mom I How NHH q~m.m~ mNH mH< NOH Nm.~N m.~ Nmfi m.~ Nm.o Nofi zoo ANQHH.n~mVn~: oom.o~ sac NHH coo.mH mH< Nsa Nam m.H Nan m.~ Nm.a NHH man . saw NHH 00¢.MH mNH mH< Nma mom m.H Nm.mH N.H Nm.m “Hg zoo AanHH.~mNmV\Ch3c 7F H28 H28 H28 H2b L ‘ ‘V \1 \1/ v 11711 )\Ch2d H28 H2b H1 [1‘ .IL . ..91 v1 J) Ac“ H2b H2aH2b 7F lit—4.? <7 0 _S? v TT 1) Xensa H28 H2b == H2a _ = H4 \U=EcofiI‘ ,‘llecoRlv- =HINDIIIA= KPNI' ;/1\=BAMHI‘ -= Direction oi linker Transcription Figure 2. 92 restriction enzyme(s) and the H2b-containing fragment was then gel puri- fied and subcloned into the appropriate restriction sites of the plasmid pBR322. The plasmid DNA was prepared and then digested with a variety of restriction enzymes, The electrophoretic and blotting data obtained for each clone was organized into a fine structure restriction map. These maps are diagrammed in Figure 3 for the eleven subclones obtained. The H2b-containing sequences present on A Chl have not been subcloned. (c) DNA sequencing strategy for the H2b histone genes The fine structure restriction maps generated for each subclone were used to develop a sequencing strategy for the individual H2b genes. The method of Maxam and Gilbert (1980) as modified by Smith and Calvo (1980) was used to determine the DNA sequence for each of the H2b genes subcloned. The horizontal arrows in Figure 3 indicate the extent of the region sequenced and the direction in which the sequence was read. The arrows above the clone map indicate that the DNA strand with the same sense as the mRNA was read and the arrows under the clone identify se- quences read which correspond to the template strand of the DNA molecule. From this sequence data the exact location of each H2b gene was identified as was the direction in which they are transcribed. This di- rection is indicated in Figure 3 by the open horizontal arrows. Some sequence data was also obtained for a few H2a and H1 genes (Browne, 1984) which are closely linked to particular H2b genes. The location and transcriptional orientation of these genes are indicated on the maps. The DNA sequence results indicate that in the four instances where an H2b gene and an H2a gene are closely linked to each other, they are in- variably transcribed in a divergent fashion. This interesting arrangement fixes the promoter region for both genes in the 340 bp region which 93 Figure 3. Restriction enzyme maps of the chicken H2b-containing subclones. The open horizontal arrows mark the various genes and point in the direction of transcription. The horizontal arrows above the map indicate the extent of the DNA sequence read which corresponds to the mRNA. The horizontal arrows below the map indicate the extent of the DNA sequence determined for the tem- plate strand. . 94 —-s ‘= ——> O .. ' KR1 -1.3 02 h—Z: p a <—<—— —4 .. l a E ‘1 pRR4a-2.2 , C- . b H2b - 1 mam-4.4 T l e— ! “—5 H1 ,P H2a H2b pFiR2e-3.5 E pRRac-5.4 ——L__, «a H2b 1; "’ . o 1 6; (——— l T TTTVIA i? pRRac- -.3 s '_'L__> 1w 1' TPPP2°'4-° 6 6 (— ——-> .l 11 fl__’pPP2d-2 o E: HINFI E i: HINCH ——9 5: 1(er ’1 L126 , pBBA-3.0 i: ssrn 3 = BAMHI = HlNOlII /r s e T ’ "= BGLI , (l {gar—‘1‘” ”BRA-5'4 ’ = Xhol ' s__° :: BSTEH I: AVAI = PSTI 39923 L T <1 an 1135' HRSa-3.3 1= EcoRI m ° £= EcoRI A LINKER 2: 3m; Figure 3. 95 separates the two genes. Eight H2b genes have been completely sequenced. In addition all of the coding sequences of the pPP2d-4.0 gene which is incomplete due to the insertion of an EcoRI linker in the original phage construction have been obtained. Partial DNA sequencing (mainly flanking sequences, see below) has been performed on the pRR4a-2.2 and pRRlOa-4.4 H2b genes. (d) Chicken H2b coding sequences The H2b coding regions were identified based on sequence comparisons with the two previously characterized genes in pKRla-1.3 and pRR2e-3.5 (Grandy et al., 1982; Grandy et al., 1983). Figure 4 presents a compil- ation of nine (eight complete plus that available for the pPP2d-4.0 gene) H2b gene coding sequences. The H2b exon in pKRla-l.3 was chosen as the prototype against which the others were compared. An analysis of these sequences reveals that all of the chicken H2b genes that have been completely sequenced are uninterrupted and code for an H2b protein composed of 125 amino acids. The coding region of each H2b gene is 375 nucleotides long. These exons tend to be rich in G and C residues which can account for 58%(pBBA-3.0) to 64%(pRR3c-3.5) of each coding region. This GC bias is reflected in the number of CpG di- nucleotides which can number as many as 40 in certain H2b genes, whereas CpG is generally relatively rare in eucaryotic DNA, although less so within coding sequences (Salser, 1978). Each of the nine H2b coding regions is unique, but they are all found to share extensive sequence homology to the pKRla-1.3 H2b exon ranging from 92% to 99.7%. Most of the differences in sequence are silent, falling in the third or, occasionally, the first or second codon position. The distribution of these nucleotide substitutions with respect to the pKRla-1.3 96 Figure 4. Chicken H2b exon sequence comparison. The gene on pKRla-1.3 was used as the prototype against which the other sequences were compared. The amino acid substitutions are indicated where they occur. Nine unique H2b genes are represented. The gene on pPP2d-4.0 contains the sequence 3' of residue 106. pKRla-l. pRRZe-3. pRRBc-S. pRREc-3. pPPZd-Z. pBBA- 3. OWUIJ-hlflw pKRla-l. pHRSa-3. pBBA- 3. a n - “—13 - N Hasn‘t Jo JSOWKJ pKRla-l. pBBA- 3. ow pKRla-l. pPPZd-Z. pBBA- 3. OWN pKRla-l.3 pBBA- 3.0 pKRla-l.3 pKRla-l. pRRZe-B. pRR3c-5. pRR3c-3. pPPZd-Z. pPPZd-4. cum-hunt.) pKRla-l. pRR2e-3. pRR3c-S. pRR3c-3. pPPZd-Z. pPPZd-4. pHRSa-3. pBBA- 3. owouma-tnw pKRla-l. pRRZe-3. pRR3c-5. pPPZd-d. (3:th O pBBA- 3. Figure 4. UI ATG 15 Lys AAG 30 -y- AAG 4S Leu CTG 60 as 91'] GGC Ser 75 Gly GGC 90 Thr AC6 105 Glu GAG 120 Lys AAG (‘1 '1 Donn—i O 16 Lys AAG J - ' I 4.; 3'5 AAG Arg 46 Lys AAG 61 Ile ATC 76 Glu GAG 91 Ser TCG 106 Leu CTG 121 Tyr TAC a) (- Glu GAG Se: AGC GCG Ala 47 Gln CA6 62 Me: ATG 77 Ala GCG 92 Arg C66 107 Ala GCC 122 Thr ACC Ile 3 ro CCG 108 Lys AAG 123 Ser 4 Ala GCC I: —4—+4 94 Ile ATC 109 His CAC 124 Ser 97 5 Lys AAG 20 Lys AAG I) r4 (n >05 (1 CD (.) L.) 50 Pro CCC 65 - Phe TTC 80 _ Leu CTG 95 Gln CAG 110 Ala GCG 125 Ly s AGC TCC AAG 6 Ser TCC 21 Thr ACC (T) (1) O\ r111 13(1) (.1 31 Asp GAC 66 Val GTC 81 Ala GCG 96 Thr ACA mmmmm 111 Val GTC 3T Tyr TAC 52 Thr ACG 67 Asn AAC 82 His CAC 97 Ala GCC 112 Set TCC G Pro CCC TCG 53 Gly GGC 68 Asp GAC 83 Tyr TAC 98 Val GTG 113 Glu GAG Ala GCC 114 Gly GGT non (WHO 85 . Lys AAG 100 Leu CTG 115 Thr ACC ll Lys AAG 116 Lys AAG 12 Lys AAG 117 Ala GCG 13 Gly GGC 118 Val GTC 14 Ser TCC cw —4—+4 29 Arg ~ CGC ‘ A A 44 GTE S9 Met ATG 74 Ala GCC 89 Ile ATC 104 Gly GGC 119 Thr ACC 98 .uauoo wmwcmno wafiwoo vfium ocfiam mamas mumu«wcfi mxmflumumm any .m.~lm~mxa cu uomammu nu“? mmcww accumas nmm cmxofiso uswwm msu a“ vm>ummno mmwcmno ammo mwauomausa mo coauanfiuumfin .m muswwm coxm nmm ca mmswfimmu wwwuomaosc mo acmscwfimmm HmowumEDZ mum com ocm omm oom owm com oqm CNN com owH Goa oq~ ONH cod cm 00 De om U