TRANSCRIPTIONAL REGULATION IN DROSOPHILA -- FROM GENOME TO GENE By Yiliang Wei A DISSERTATION Submitted to Michigan State University in partial fulfillment of the requirements for the degree of Biochemistry and Molecular Biology Ð Doctor of Philosophy 2016 ABSTRACT TRANSCRIPTIONAL REGULATION IN DROSOPHILA -- FROM GENOME TO GENE By Yiliang Wei Transcriptional regulation plays a major role in gene expression, and is critical for development and diseases. To understand the molecular mechanisms of transcriptional control, I took advantage of the classic gene regulation model, Drosophila melanogaster, and developed approaches from both a genome wide angle and zooming in to the specific gene. On the genome level, I carried out genome-wide studies to characterize binding profiles for Drosophila retinoblastoma family proteins. I identified novel, yet conserved roles for retinoblastoma proteins in regulation of signaling pathways and ribosomal biosynthesis. I discovered that the retinoblastoma cofactors might be an important regulator of cellular growth through control of ribosomal gene expression, an unrecognized feature of this tumor suppressor protein that is apparently evolutionarily conserved in mammals. On the specific gene level, I characterized the cis-regulatory landscape of the Drosophila insulin receptor gene, a gene that is frequently involved in many diseases such as diabetes, AlzheimerÕs, and cancer. I identified many dynamically and redundantly regulated enhancers embedded in the large insulin receptor gene locus, and demonstrated that a ÒhousekeepingÓ gene can be subject to control by extraordinarily complex regulatory circuitry.!"""! This thesis is dedicated to my parents. Thank you for always supporting me. !"#!ACKNOWLEDGMENTS I would like to express my sincere gratitude to my advisor Prof. David Arnosti for his continuous support through my PhD study, and for his patience, kindness, motivation, and immense knowledge. His guidance not only helped me with research and writing of this thesis, but also shaped me into an independent scientific researcher with critical and creative thinking. The training and lessons I received from Dr. Arnosti is truly a life-long benefit. I would also like to thank Dr. Henry, for being a passionate mentor and a good friend. I would like to thank my committee members, Dr. Burton, Dr. Xiao, Dr. Floer, and Dr. Knott, for their helpful guidance. I would like to thank all former and present members of the Arnosti and Henry laboratories, for their contribution and inspiration to my study. I would deeply thank my parents, Gang Wei and Chunyan Yu, for their constant support and love. Lastly, I would like to thank Michigan State University. As written in its motto - 'Advancing Knowledge, Transforming Lives', MSU has truly ÒtransformedÓ my life, and I will always be proud to be a Spartan. !#!TABLE OF CONTENTS LIST OF FIGURES........................................................................................................viii KEY TO ABBREVIATIONS..........................................................................................xii CHAPTER 1.......................................................................................................................1 Introduction........................................................................................................................1 Mechanism of transcription control...................................................................................1 Transcriptional regulation and transcription factors..............................................1 Cis-regulatory elements Ð Promoters and insulators..............................................2 Cis-regulatory element Ð Enhancers.......................................................................3 Identification of enhancers.....................................................................................4 Transcriptional regulation in development and normal physiology...................................5 Transcriptional regulation in development.............................................................5 Transcriptional regulation in physiology...............................................................7 Transcriptional control and evolution................................................................................8 Transcription and disease.................................................................................................10 Trans-factors and disease.....................................................................................10 Cis-elements and disease......................................................................................11 Retinoblastoma family proteins........................................................................................12 Insulin signaling pathway and regulation of insulin receptor gene..................................14 Insulin signaling pathway and insulin receptor....................................................14 The Drosophila insulin receptor...........................................................................15 Transcriptional regulation of InR.........................................................................18 Thesis preview..................................................................................................................19 REFERENCES.................................................................................................................21 CHAPTER 2.....................................................................................................................32 Genome-wide analysis of Drosophila Rbf2 protein highlights diversity of RB family targets and possible role in regulation of ribosome biosynthesis.....................................32 Abstract............................................................................................................................32 Introduction......................................................................................................................34 Results..............................................................................................................................38 Genome-wide Rbf1 and Rbf2 association............................................................38 Rbf2-alone targets include most ribosomal protein genes...................................46 Rbf2 shows differential repression activity on ribosomal protein gene promoters..............................................................................................................51 Rbf1 represses ribosomal protein gene expression in vivo..................................53 Enrichment of BEAF-32 motif in Rbf bound promoters.....................................57 !#"! Some ribosomal protein gene promoters exhibit non-canonical Rbf recruitment............................................................................................................67 Association with ribosomal protein gene promoters is a conserved character for the RB family.......................................................................................................72 Discussion........................................................................................................................75 Material and Methods.......................................................................................................79 ChIP-exo...............................................................................................................79 Read mapping, peak finding, visualization, and annotation.................................79 De novo motif searching......................................................................................80 Validation of ChIP-exo peaks..............................................................................80 Gene ontology (GO) analysis...............................................................................81 Reporter constructs and luciferase assay..............................................................82 RNAi and ChIP....................................................................................................82 RNA-seq...............................................................................................................83 Data set preparation for STAP analysis...............................................................84 Testing for motif association with ChIP enrichment............................................84 Motif streghth assessment....................................................................................85 REFERENCES.................................................................................................................86 CHAPTER 3.....................................................................................................................95 Complex cis-regulatory landscape of the insulin receptor gene reflects dynamic regulation of a ÒhousekeepingÓ gene................................................................................95 Abstract............................................................................................................................95 Introduction......................................................................................................................97 Results............................................................................................................................103 Genomic rescue construct identifies regulatory regions of InR.........................103 Evidence of tissue-specific enhancers in InR introns.........................................106 Identification of active enhancers located within InR introns............................115 Dynamic regulation of InR enhancers by dFOXO.............................................125 Most dFOXO response fragments may be indirectly regulated by dFOXO......125 Ecdysone induces similar responses to dFOXO.................................................129 Impact of RB binding site on InR promoter and enhancers...............................134 Transcriptional circuitry of the InR gene revealed by precise-mapping of CREs...................................................................................................................139 Combinatorial interactions of InR regulatory elements.....................................145 Analysis of population- and species-level variation..........................................149 Discussion......................................................................................................................155 Material and Methods.....................................................................................................163 Fly strains and reporter analysis.........................................................................163 Generation of transgenic flies.............................................................................163 qPCR analysis of mRNA from transgenic flies..................................................164 Genome data visualization.................................................................................164 Luciferase reporter library..................................................................................165 Cell culture and transfection...............................................................................165 FOXO ChIP-qPCR.............................................................................................167 !#""! SNP analysis of population variation.................................................................167 Indel analysis of population variation................................................................168 Analysis of sequence conservation between species..........................................168 REFERENCES...............................................................................................................170 CHAPTER 4...................................................................................................................184 Future Directions............................................................................................................184 REFERENCES...............................................................................................................190 APPENDICES................................................................................................................194 APPENDIX A: Role of Rbf1/E2F1 in regulation signaling pathway gene promoters...........................................................................................................195REFERENCES...................................................................................................199 APPENDIX B: Rbf1 phosphorylation and stability...........................................201 REFERENCES...................................................................................................204 APPENDIX C: Role of Rbf1 and Rbf2 C-terminus in protein stability and activity................................................................................................................206 REFERENCES...................................................................................................212 !#"""!LIST OF FIGURES Figure 1-1: Overview of insulin signaling pathway in Drosophila..................................17 Figure 2-1: Rbf2 binds to a large number of unique targets in the Drosophila genome..........................................................................................................40 Figure 2-2: Distribution of distance of Rbf1/2 peaks to the nearest Transcription Start Sites (TSS)....................................................................................................41 Figure 2-3: Annotations of genomic position of Rbf1/2 peaks........................................42 Figure 2-4: ChIP-qPCR for Rbf1, Rbf2, dE2F1, dE2F2, and dDP on chromatin from 12-18 hr embryos...............................................................................................43 Figure 2-5: Motifs enriched in sequences associated with Rbf1- and Rbf2-bound regions of the genome that were located near TSS....................................................45 Figure 2-6: Enriched targeting by Rbf1 and Rbf2............................................................47 Figure 2-7: Association of Rbf proteins, E2F proteins, and dREAM complex proteins with ribosomal protein gene promoters........................................................49 Figure 2-8: Transcriptional responses of Rbf targeted genes in reporter gene assays.....52 Figure 2-9: Global analysis of ribosomal protein gene expression in larval wing imaginal discs with overexpression of Rbf1................................................................54 Figure 2-10: Rbf1-mediated reduction of ribosomal protein gene expression in wing imaginal disc tissue.......................................................................................55 Figure 2-11: Enrichment of specific DNA binding protein motifs under peaks for Rbf1 and Rbf2 genome-wide.................................................................................59 !"$!Figure 2-12: Enrichment of 127 motifs on selected cell cycle gene promoters...............61 Figure 2-13: Enrichment of 127 motifs on selected signaling pathway gene promoters......................................................................................................62 Figure 2-14: %&'"()*+&,!-.!/01!*-,".2!-&!2+3+(,+4!(5,-6372*"(!'"8-2-*73!9+&+!6'-*-,+'2::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::;'+!0?/@A!%&'"()*+&,!-.!/01!*-,".2!-&!2+3+(,+4!*",-()-&4'"73!'"8-2-*73!9+&+!6'-*-,+'2::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::;B!="9>'+!0?/;A!C-''+37,"-&!8+,D++&!E8.0F!E8./F!7&4!G%H=?<0!C)IJ!2"9&732::::::::::::::::::::;@ Figure 2-17: RNAi depletion reveals E2F/DP-dependent and Ðindependent Rbf2 binding in cultured cells.............................................................................................69 Figure 2-18: Distinct qualities of motifs associated with Rbf1+Rbf2 bound promoters, vs. those bound solely by Rbf2...........................................................................71 Figure 2-19: Retinoblastoma corepressor association with ribosomal protein genes is a conserved feature of RB proteins..................................................................73 Figure 3-1: An 80 kbp region contains cis regulatory information necessary for genomic rescue of InR mutants..................................................................................104 Figure 3-2: Number and genotype of survivors from InR BAC rescue experiment......105 Figure 3-3: Enhancers identified by STARR-seq, DHS-seq and FAIRE at InR gene locus............................................................................................................109 Figure 3-4: Regulatory landscape of the InR locus........................................................110 Figure 3-5: Images of larvae and adult flies expressing UAS-GFP under the control of indicated Janelia Gal4 driver lines..............................................................113 Figure 3-6: InR introns contain cis-regulatory elements that respond to dFOXO and/or ecdysone signaling......................................................................................118 !$!Figure 3-7: RNA polymerase II and H3K4Me3 signals at InR locus............................122 Figure 3-8: Preferential activation of T1 promoter element by InR intronic enhancers 2 and 3............................................................................................................123 Figure 3-9: Requirement for EcR protein for 20E response in Kc cells........................124 Figure 3-10: In vivo occupancy of dFOXO provides evidence for direct and indirect regulation of InR.........................................................................................127 Figure 3-11: EcR is not required for dFOXO activity on InR enhancers.......................132 Figure 3-12: Ecdysone signaling involves direct activation and indirect repression of InR enhancers.....................................................................................................133 Figure 3-13: Promoter-proximal Rbf1 binding site role in transcriptional regulation....................................................................................................136 Figure 3-14: K"..+'+&,"73!2+&2","#",5!-.!!"#!L/!6'-*-,+'!7&4!$%&'!6'-*-,+'!,-!4%0=/::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::/'5!79-F!,)+!6'"&("63+2!-.!,'7&2('"6,"-&73!(-&,'-3!)7#+!8++&!+2,783"2)+4!85!2,>4"+2!"&!87(,+'"73!7&4!+>O7'5-,"(!-'97&"2*2!RP7(-8!7&4!Q-&-4F!/S;/T:!H('-22!733!4-*7"&2!-.!3".+F!,'7&2('"6,"-&73!'+9>37,"-&!"&#-3#+2!KUH!8"&4"&9!,'7&2('"6,"-&!.7(,-'2!R,'7&2?7(,"&9!.7(,-'2T!7&4!,)+"'!(-?.7(,-'2!"&,+'7(,"&9!D",)!26+("."(!KUH!2+V>+&(+2!R("2?'+9>37,-'5!+3+*+&,2T!,-!'+9>37,+!,)+!87273!,'7&2('"6,"-&!*7()"&+'5F!D)"()!"&(3>4+2!,)+!*>3,"2>8>&",!EUH!6-35*+'72+!+&W5*+!7&4!.7(,-'2!,)7,!"&,+'7(,!D",)!6-35*+'72+!7,!*7&5!6'-*-,+'2:!L)+!6'-(+22!-.!,'7&2('"6,"-&73!'+9>37,"-&!"&!+>O7'5-,+2!"&#-3#+2!(-*63+$!*+()7&"2*2F!"&(3>4"&9!4"'+(,!8"&4"&9!8+,D++&!,'7&2('"6,"-&73!7(,"#7,-'2!7&4!2>'.7(+2!-.!,)+!87273!*7()"&+'5X!KUH!3--6"&9!8+,D++&!4"2,73!'+9>37,-'5!2",+2!7&4!,)+!,'7&2('"6,"-&73!2,7',!2",+2!-.!'+9>37,+4!9+&+2X!()'-*7,"&!'+*-4+3"&9F!&>(3+-2-*+!6-2","-&"&9F!7&4!,)+!2,"33!6--'35!>&4+'2,--4!7(,"-&!-.!&-&(-4"&9!EUH!R&(EUHT!RY++!7&4!Z->&9F!0[/O7'5-,"(!,'7&2('"6,"-&!7(,"#7,-'2!'+9>37,+!,'7&2('"6,"-&!"&","7,"-&!85!'+('>","&9!(-7(,"#7,-'2F!2>()!72!,)+!*+4"7,-'!(-*63+$F!6<[[F!7&4!-,)+'!D"4+35?7(,"#+!.7(,-'2!RY++!7&4!Z->&9F!0[/2+!2",+2!RE7)3!+,!73:F!0[/[T:!H,!,)+!()'-*7,"&!3+#+3F!HLJ?4+6+&4+&,!()'-*7,"&!'+*-4+3"&9!(-*63+$+2!7'+!'+('>",+4!.-'!9+&+!7(,"#7,"-&!,-!'+*-#+!&>(3+-2-*+2!7&4!('+7,+!7((+22!.-'!,'7&2('"6,"-&!*7()"&+'5!7&4!'+9>37,-'5!.7(,-'2!RC376"+'!7&4!C7"'&2F!0[[ST:!L'7&2('"6,"-&!.7(,-'2!732-!'+('>",!)"2,-&+?*-4".5"&9!+&W5*+2!,-!744!-'!'+*-#+!26+("."(!()+*"(73!*-4"."(7,"-&2!7,!)"2,-&+!,7"32:!L)+2+!*-4"."(7,"-&2!.>',)+'!('+7,+!2>'.7(+!.-'!-,)+'!,'7&2('"6,"-&!.7(,-'2!RJ-',+37!7&4!%2,+33+'F!0[/[T:!Cis-regulatory elements Ð Promoters and insulators Another important aspect of transcriptional regulation is mediated through DNA elements, which include basal promoters, enhancers, and boundary elements or insulators (Arnosti, 2003). Basal promoters are typically ~100 bp in size and are located directly at the transcription initiation sites. Basal promoter regions may contain Initiator (Inr) sites overlapping +1 site of transcriptional initiation, 5Õ TATA-box sequences positioned at -30bp and 3Õ promoter element such as the DPE (Ohler et al., 2003). The TATA-box interacts with TATA-binding protein (TBP), an important part of basal transcription machinery that helps the binding of RNA polymerase and other basal transcription factors. TBP is part of the multisubunit general transcription factor TFIID, which contains TBP-associated proteins (TAFs). The Inr is bound by RNA polymerase and TAFs, while the DPE serves to provide additional contacts with TAFs to help anchor TFIID (Arnosti, 2003). However, not all basal promoters contain the same elements. The so-called ÒhousekeepingÓ genes lack these elements in their promoters, but instead are enriched with DRE and other motifs, while in mammals CpG-rich promoters feature dispersed initiation patterns that are not guided by these canonical !3"&!7&4!L\E!2"9&73"&9!"&!()*+*,-./0:!H88'+#"7,"-&2A!4IYJ2F!()*+*,-./01"&2>3"&?3"O+!6+6,"4+2!/?1X!I&EF!"&2>3"&!'+(+6,-'X!JI<]F!6)-26)-"&-2","4+!8+'->2!2(3+'-2"2!(-*63+$X!B%?GJF!B%?8"&4"&9!6'-,+"&X!4=\_\F!()*+*,-./01.-'O)+74!8-$F!2>8?9'->6!\:!L)+!9+&+2!6'-*-,+'2!,)7,!)7#+!8++&!"4+&,"."+4!722-("7,+4!D",)!E8./!.'-*!C)IJ?2+V!7'+!"&4"(7,+4!"&!'+4!RH'()7'57!+,!73:F!0[/0T:! !!!/M!Transcriptional regulation of InR Although the frequency of T2D and its link to cancer has renewed interest in studying the role of IR in cancer progression, the mechanisms that regulate IR and IGFR levels still remain unclear in mammalian system. However, the Drosophila system has provided key insights into regulatory mechanisms affecting expression of the insulin receptor gene. This model system has provided strong clues that transcriptional regulation plays a key role in regulating insulin receptor gene expression. Previous studies have shown that the Drosophila forkhead protein FOXO (dFOXO) and ecdysone receptor (EcR) directly target the InR gene, and effectively regulate its gene transcription in response to nutrient and steroid hormone (Puig et al., 2003; Gauhar et al., 2009). Our research has focused on identifying the genome-wide targets of a cancer suppressor, Rbf1 in developing fly embryos, and for the first time Rbf1 was found strongly associated with the InR promoter, and to significantly repress its activity in cells (Acharya et al., 2012; Raj et al., 2012). A shared feature for the insulin receptor gene in Drosophila and human is that it contains large introns. In Drosophila, these introns are nearly 40 kbp, while they span nearly 200 kbp in humans (Casas-Tinto et al., 2007). These large introns contain many putative enhancers, as indicated by STARR-seq and other methods, suggesting the gene is subjected to complex regulatory control (Kaplan et al., 2011; Li et al., 2011; N‘gre et al., 2011; Thomas et al., 2011; Arnold et al., 2013; McKay and Lieb, 2013). In both the fly and humans, only small (~2 kb) regions of the insulin receptor gene have been previously characterized for cis-regulatory activity (Leal et al., 1992; Lee and Tsai., 1994; Garc™a-Arencibia et al., 2005; Casas-Tinto et al., 2007; !/S!Calle et al., 2008). Considering the overall size of the gene, the cis-regulatory control of the mammalian insulin receptor gene remains largely uncharacterized. Thesis preview In this thesis, I explore molecular characterization of transcriptional controls that apply to understanding the genome-wide activities of a conserved transcriptional corepressor, the RB protein, as well as detailed and specific cis- regulatory controls of one of the targets of RB regulation, the gene encoding the insulin receptor. I identified novel functions of Drosophila RB family proteins (Rf1 and Rbf2), transcription cofactors and tumor suppressors, whose human counterparts are widely involved in many type of cancers. Besides their well-known roles in cell cycle regulation, I found that their target genes include many involved in ribosomal synthesis and signaling pathways. These novel roles of Rbf1 and Rbf2 appear to be conserved in mammals. Further analysis of the ribosomal targets revealed a noncanonical role for Rbf regulation. The RB regulation of ribosomal proteins and signaling pathways would potentially give a means to integrate growth control and cell cycle control, which may have significant implications to understanding a more general role of RB in cancer and other diseases. One especially intriguing target of Rbf1 in signaling pathways is the insulin receptor gene. The InR gene is highly conserved; it is critical for metabolism and growth, and essential for development. As a ÒhousekeepingÓ gene, InR contains unexpected long intragenic regions, embedded with many putative cis-elements. This feature appears to be conserved in mammalian insulin receptor gene, indicating this ÒhousekeepingÓ gene is, in fact, subjected to complex transcriptional control. A major part of my thesis is !0[!dedicated to identify and characterize the cis- regulatory elements associated with the InR gene using reported-based assays, and the mapping their responses to dFOXO and ecdysone, two key transcriptional components regulating gene expression in response to nutrient status and growth. My detailed mutagenic studies of the active enhancers identified specific elements and motifs required for enhancer activity. The dynamic regulation of multiple enhancers within this gene by dFOXO and ecdysone indicates these enhancers may play a role in temporal, spatial, and fine-tuning control of InR gene expression. My study indicates that this gene is subject to a complex transcriptional circuit extending far beyond the previously described simple model of the FOXO-feedback loop mechanism. This study will be a fundamental guideline for designing genetic assays to understand the transcriptional regulation of the insulin receptor gene, which will give a better understanding of the role of insulin receptor in metabolism, growth control and cancer. !0/! REFERENCES !00!E%=%E%UC%N!!Acharya, P., Negre, N., Johnston, J., Wei, Y., White, K. P., Henry, R. W. and Arnosti, D. N. (2012). Evidence for autoregulation and cell signaling pathway regulation from genome-wide binding of the Drosophila retinoblastoma protein. G3 (Bethesda). 2, 1459Ð1472. Acharya, P., Raj, N., Buckley, M. S., Zhang, L., Duperon, S., Williams, G., Henry, R. W. and Arnosti, D. N. (2010). Paradoxical instability-activity relationship defines a novel regulatory pathway for retinoblastoma proteins. Mol. Biol. Cell 21, 3890Ð3901. Alic, N., Andrews, T. D., Giannakou, M. E., Papatheodorou, I., Slack, C., Hoddinott, M. P., Cocheme, H. M., Schuster, E. F., Thornton, J. M. and Partridge, L. (2011). Genome-wide dFOXO targets and topology of the transcriptomic response to stress and insulin signalling. Mol. Syst. Biol. 7, 502. Arnold, C. D., Gerlach, D., Stelzer, C., Boryn, L. M., Rath, M. and Stark, A. (2013). Genome-wide quantitative enhancer activity maps identified by STARR-seq. Science 339, 1074Ð1077. Arnosti, D. N. (2003). Analysis and function of transcriptional regulatory elements: insights from Drosophila. Annu. Rev. Entomol. 48, 579Ð602. Arnosti, D. N. and Kulkarni, M. M. (2005). Transcriptional enhancers: Intelligent enhanceosomes or flexible billboards? J. Cell. Biochem. 94, 890Ð898. Belfiore, A. and Malaguarnera, R. (2011). Insulin receptor and cancer. Endocr. Relat. Cancer 18, R125Ð47. Brogiolo, W., Stocker, H., Ikeya, T., Rintelen, F., Fernandez, R. and Hafen, E. (2001). An evolutionarily conserved function of the Drosophila insulin receptor and insulin-like peptides in growth control. Curr. Biol. 11, 213Ð21. Calle, C., Maestro, B. and Garcia-Arencibia, M. (2008). Genomic actions of 1,25-dihydroxyvitamin D3 on insulin receptor gene expression, insulin receptor number !0'+!0?B!R(-&,`4T!The selected targets are grouped in different colors based on whether they are bound by Rbf1/Rbf2 (from this study) or dE2F2 (Korenjak et al., 2012). !B@! Figure 2-5: Motifs enriched in sequences associated with Rbf1- and Rbf2-bound regions of the genome that were located near TSS. MEME-ChIP was used to identify de novo overrepresented motifs. Top three motifs (ranked by E-value) were shown. E2F-like sequences were not enriched in Rbf2-associated regions, except where Rbf1 was also present. !B;!Rbf2-alone targets include most ribosomal protein genes We analyzed the nature of genes bound by Rbf1, Rbf2, or both Rbf1/Rbf2 using the DAVID gene ontology annotation database (Huang et al., 2009a and 2009b). Consistent with the known importance of RB proteins for cell cycle regulation, genes bound by both Rbf1/Rbf2 were significantly enriched for this category. In contrast, cell cycle related genes were not enriched in the set of genes bound solely by Rbf2; instead, the most significantly enriched category was that of ribosomal proteins (Figure 2-6A). The Rbf1-only group showed no significant enrichment of any gene class in this analysis (data not shown). To further characterize this enriched feature, we manually inspected Rbf1 and Rbf2 peaks on each of the 94 cytoplasmic ribosomal protein gene promoters (CRP) and 75 mitochondrial ribosomal protein gene promoters (MRP), observing that Rbf2 associated with a majority of the ribosomal protein gene promoters (Figure 2-6B). We also compared our results with the previous dREAM complex ChIP-chip study (Georlette et al., 2007), and found that some, but not all dREAM complex components co-occupy with Rbf1/2 on ribosomal protein gene promoters (Figure 2-7). In our earlier study (Acharya et al., 2012), Rbf1 was found to bind multiple genes encoding components of conserved signaling pathways. In the current study, we found that Rbf2 also associates with a significant number of signaling pathway gene promoters (Figure 2-6B). ChIP-qPCR assays were performed on selected cell cycle, signaling pathway, and ribosomal protein targets, confirming the enrichment found in the ChIP-exo experiments (Figure 2-6C). Thus, Rbf2 appears to occupy a greater fraction of noncanonical targets such as signaling pathway and ribosomal protein genes, compared to Rbf1, which is present together with Rbf2 at many canonical cell cycle related genes. !B1! Figure 2-6: Enriched targeting by Rbf1 and Rbf2. (A) Genes bound by Rbf2 alone or by both Rbf1 and Rbf2 were functionally annotated using the DAVID database (Huang et al., 2009); values indicate enrichment scores. (B) Promoters of cell cycle related genes (annotated by flybase.org, The Interactive Fly), signaling pathway genes (Acharya et al., 2012), cytoplasmic ribosomal protein genes, and mitochondria ribosomal protein genes !BM!Figure 2-6 (conÕt) (Marygold et al., 2007) were manually inspected for Rbf1 and Rbf2 binding sites. (C) To validate ChIP-Seq results, manual ChIP of Rbf1 and Rbf2 on chromatin from 12-18 hr embryos was carried out on selected cell cycle (PCNA, DNApol!-50), signaling pathway (InR), cytoplasmic ribosomal protein (RpL37a, RpS29, RpS19b, RpS27, RpL7), and mitochondrial ribosomal protein (mRpS12/tko, mRpL22, mRpL1) targets using anti-Rbf1, anti-Rbf2, and pre-immune serum. An intergenic region (INT) was used as negative control. !BS! Figure 2-7: Association of Rbf proteins, E2F proteins, and dREAM complex proteins with ribosomal protein gene promoters. Rbf1 and Rbf2 data were from this study, dE2F1, dE2F2, and dREAM data were from previous studies (Georlette et al., !@[!Figure 2-7 (conÕt) 2007; Korenjak et al., 2012). Bound promoters were indicated in dark red color, unbound promoters were indicated in yellow color. ChIP data is from embryo (em), larva (la), and Kc cells (Kc). !@/!Rbf2 shows differential repression activity on ribosomal protein gene promoters To determine the regulatory significance of Rbf1 and Rbf2 binding at ribosomal protein promoters, we selected several genes for further functional characterization. Six promoter-proximal regions from cytoplasmic and mitochondrial ribosomal protein genes were cloned into a luciferase reporter, and the effects of Rbf1, Rbf2, dE2F2, or a combination of Rbf2 plus dE2F2 were tested in Drosophila S2 cells. As expected, transcription from the PCNA-luc reporter was repressed by Rbf1, dE2F2, and Rbf2/dE2F2 (Figure 2-8). In contrast, none of the ribosomal protein gene promoters were repressed by Rbf1, even though these particular promoters have robust Rbf1 signals in the embryo. Notably, overexpression of Rbf2 alone repressed the mRpS12/tko promoter ~25%, with repression increasing to ~50% with co-expression of dE2F2. Overexpression of dE2F2 alone decreased RpL37a promoter activity by about one-third, with a modest but reproducible ~15-20% repression observed on RpS29 and mRpL22 promoters. These latter promoters were not sensitive to Rbf1 or Rbf2 overexpression alone. The mRpL1 or RpS14b promoters were not repressed to any extent by any of the overexpressed proteins, and in fact transcription of these reporters was mildly stimulated. Thus, unlike the classical RB cell cycle target PCNA, whose expression dynamically varies during cell growth, regulation of these non-canonical ribosomal protein gene promoters is more restrained. This behavior is consistent with the similarly modest but reproducible regulation of these genes under growth-limiting or stress conditions (Gasch et al., 2000; Causton et al., 2001; Gershman et al., 2007; Miller et al., 2011). As central mediators of global protein expression, small changes in ribosomal protein expression !@0!are predicted to have significant and pleiotropic effects (Steffen et al., 2012; Xue and Barna, 2012; Woolford and Baserga, 2013; Hasygar and Hietakangas, 2014). Figure 2-8: Transcriptional responses of Rbf targeted genes in reporter gene assays. Rbf1, Rbf2, dE2F2, or Rbf2/dE2F2 were overexpressed in cells containing reporters with promoter regions of indicated genes. ChIP occupancy by Rbf1 and Rbf2 is shown along with gene structure. Rbf1 showed repression activity only on PCNA. Rbf2 and/or E2F2 significantly repressed PCNA, mRpS12/tko, RpL37a, mRpL22, and had modest repression on RpS29. Activity of mRpL1 and RpS14b promoters was not significantly repressed by any treatment. The increase in expression may be due to indirect effects, particularly for RpS14b, which is not bound by these proteins in ChIP assays. (*p-value < 0.05) ! !@Gal4 driver; three biological replicates were conducted and analyzed by RNA-seq, as described in Materials and Methods. !@1!Enrichment of BEAF-32 motifs in Rbf bound promoters Our analyses of the Rbf and E2F genomic binding profiles revealed many Rbf2 target genes that were not bound by E2F factors, and therefore we tested whether there was evidence for other transcription factors associated with Rbf2 bound regions on target promoters. To identify relevant motifs, we used the STAP program, which correlates ChIP signal intensity with presence of overrepresented motifs for known transcription factors (He et al., 2009). Globally, a few motifs showed strong correlation with Rbf1 and Rbf2 peaks, including the E2F and DREF motifs that we previously demonstrated to be enriched at Rbf1 binding sites (Acharya et al., 2012) (Figure 2-11). Viewed as separate classes, those genes annotated as Òcell cycle relatedÓ, ÒsignalingÓ and Òcytoplasmic ribosomal proteinÓ also showed a strong enrichment for the E2F motif (Figure 2-12~14). Promoters from cell cycle and ribosomal protein genes were also enriched in a variety of other motifs, presumably related to their unique regulation (Figure 2-12, 2-14~15). However, genes representing conserved signaling pathways were not strongly enriched for additional motifs, likely because the divergent promoter sequences have very diverse regulatory properties (Figure 2-13). Unexpectedly, we found motifs for BEAF-32, an insulator binding protein, significantly correlated with both Rbf1 and especially with Rbf2 peaks (Figure 2-11). BEAF-32 binding sites measured in Drosophila 0-8-hr old embryos significantly overlap with Rbf1 and Rbf2 peaks (Figure 2-16A, B), with co-occupancy found for one-third of the Rbf2 and just over one-quarter of Rbf1 sites (Figure 2-16B) (Yang et al., 2012). A similar overlapping was also observed for BEAF-32 binding sites in S2 cells (Figure 2-16B) (Schwartz et al., 2012). Other insulator proteins, such as CP190 also co-occupy Rbf2 binding sites similar to BEAF-32, while the overlapping between Rbf2 and CTCF was less significant (Figure 2-16B) (Schwartz !@M!et al., 2012). Focusing specifically on ribosomal protein gene promoters, BEAF-32 binding sites were significantly enriched, especially on Rbf2-bound genes (Figure 2-16C). !@S! Figure 2-11: Enrichment of specific DNA binding protein motifs under peaks for Rbf1 and Rbf2 genome-wide. STAP results from 127 motifs were plotted in circular diagrams. The first histogram below the circumference shows Pearson correlation scores of individual motifs with Rbf1 ChIP-exo peak intensity, and the inner histogram in the circle shows the Rbf2 data. Strong enrichment for E2F, DREF, and BEAF-32 motifs is noted for both Rbf1 and Rbf2. The scale is from -0.5 to 0.5 with baseline of 0 in the !;[!Figure 2-11 (conÕt) middle, high scores (>0.19) are indicated in dark orange color (implying strong correlation), medium scores are in orange, and negative correlations are in green. The histogram outside the circumference shows the score differences between Rbf1 and Rbf2. The histogram is of light green color by default. Correlation-difference values lower than 0.04 are in blue color, implying those motifs are correlated with both Rbf1 and Rbf2 at similar level; correlation-difference values higher than 0.14 are in dark red color as seen in Figure 2-12~15, implying those motifs correlate with Rbf1-binding, but not Rbf2-binding, and vice versa. The Pearson correlation scores were calculated on the whole fly genome. For data in the four functional classes (cell cycle, signaling, and ribosomal protein genes - cytoplasmic and mitochondrial), see Figure 2-12~15. !;/! Figure 2-12: Enrichment of 127 motifs on selected cell cycle gene promoters. Inner wheel, Rbf2; middle wheel, Rbf1, outer wheel, differences between Rbf1 and Rbf2. !;0! Figure 2-13: Enrichment of 127 motifs on selected signaling pathway gene promoters. Inner wheel, Rbf2; middle wheel, Rbf1, outer wheel, differences between Rbf1 and Rbf2. !;UAS rbf1 and PenGal4>UAS GFP flies. Total RNA was isolated using TRIzol (Invitrogen) followed by cleanup steps using RNeasy Mini kit (Qiagen). 1-4 "g total RNA from three biological replicates was collected. Library preparation and sequencing was conducted by the Research Technology Support Facility (Michigan State University) using an Illumina HiSeq2500. All standard libraries were created using Illumina TruSeq kits and reagents following the manufacturerÕs protocols. In brief, polyA mRNA was isolated from total RNA, chemically fragmented, and then reverse transcribed to form double stranded cDNA. The cDNA was then end repaired, A-tailed, adapter ligated and amplified to create the final library. A bead-based size selection was performed to target final library molecules with a mean size of 500 base pairs. All libraries were then quantified on a Qubit Fluorometer (Life Technologies) and run on an Agilent BioAnalyzer to determine final size and purity of the library. Final concentration was then determined by qPCR using the KAPA Illumina Library Quantification Kit (KAPABiosystems). Libraries were appropriately diluted and loaded onto the flow cell for sequencing on the Illumina HiSeq2500 following the manufacturerÕs protocols. RNA-seq reads were mapped using TopHat v2.0.13 and analyzed using Cufflinks v2.2.1 (Trapnell et al., 2012). Analyzed !MB!results were visualized using R v2.15.3 with CummeRbund package as described (Trapnell et al., 2012). Data set preparation for STAP analysis For all (15,829) D. melanogaster genes, their locations and DNA sequences from 500bp upstream to Transcription Start Site (TSS) were retrieved from Flybase and UCSC database (dmel-5.48 Flybase release). For the four functional groups: Cytoplasmic Ribosomal Protein (CRP) genes, Mitochondrial Ribosomal Protein (MRP) genes, Cell Cycle Genes (CCG) and Signaling Pathway Genes (SPG) the same data was extracted and processed separately. The quantitative ChIP enrichments were calculated from the .wiggle files computed by MACS v1.4.2 (Zhang et al., 2008) by taking a maximum average signal over a sliding window within the 500bp upstream of the TSS both for Rbf1 and Rbf2 ChIP experiments. Position Weight Matrices (PWMs) of 127 motifs of Transcription Factors Binding Sites (TFBS) compiled from literature were used. Testing for motif association with ChIP enrichment The STAP program was used to test which TFBS affinity scores correlate with ChIP enrichment for the DNA sequences upstream of the TSS (He et al., 2009). For individual motif analysis, STAP was run with default parameters (sequence file, data file and motif file) with the option of co-operative binding set to 0 for each of the 127 motifs. The Pearson correlation between predicted binding and observed binding (in the cases of both Rbf1 binding and Rbf2 binding) for each of the 127 motifs was plotted using Circos (Krzywinski et al., 2009). !M@!Motif strength assessment Using a pipeline programmed in Python, MAST (Bailey and Gribskov, 1998) was run for each of the 127 motifs on the database of 15829 sequences to obtain each motif's occurrences, with maximal p-value=0.0005 and E-value=10000. All motif occurrences for each TFBS were extracted from the mast output file. Then we divided the sequences into two groups: co-bound by Rbf1+Rbf2 and bound by Rbf2 only. This procedure was repeated for the genome wide set of sequences as well as the ribosome associated sequences only. We compared the distribution of the strength of non-overlapping binding sites reported by MAST (as p-values). Negative logarithms (-log10) of those p-values (the lower p-value the stronger value, hence the reverse logarithm) were plotted as histograms for both "Rbf1+Rbf2" and "Rbf2-only". Mann-Whitney rank test was performed on the observed two groups with the threshold of one-sided p-value<0.05. !M;! REFERENCES !M1!REFERENCES Acharya, P., Negre, N., Johnston, J., Wei, Y., White, K. P., Henry, R. W. and Arnosti, D. N. (2012). Evidence for autoregulation and cell signaling pathway regulation from genome-wide binding of the Drosophila retinoblastoma protein. G3 (Bethesda). 2, 1459Ð1472. Acharya, P., Raj, N., Buckley, M. S., Zhang, L., Duperon, S., Williams, G., Henry, R. W. and Arnosti, D. N. (2010). Paradoxical instability-activity relationship defines a novel regulatory pathway for retinoblastoma proteins. Mol. Biol. Cell 21, 3890Ð3901. Ambrus, A. M., Islam, A. B. M. M. K., Holmes, K. B., Moon, N. S., Lopez-Bigas, N., Benevolenskaya, E. V and Frolov, M. V (2013). Loss of dE2F compromises mitochondrial function. Dev. Cell 27, 438Ð451. Bailey, T. L. and Gribskov, M. (1998). Methods and statistics for combining motif match scores. J. Comput. Biol. 5, 211Ð221. Bohla, D., Herold, M., Panzer, I., Buxa, M. K., Ali, T., Demmers, J., Krger, M., Scharfe, M., Jarek, M., Bartkuhn, M., et al. (2014). A functional insulator screen identifies NURF and dREAM components to be required for enhancer-blocking. PLoS One 9, e107765. Cao, L., Peng, B., Yao, L., Zhang, X., Sun, K., Yang, X. and Yu, L. (2010). The ancient function of RB-E2F pathway: insights from its evolutionary history. Biol. Direct 5, 55. Causton, H. C., Ren, B., Koh, S. S., Harbison, C. T., Kanin, E., Jennings, E. G., Lee, T. I., True, H. L., Lander, E. S. and Young, R. A. (2001). Remodeling of yeast genome expression in response to environmental changes. Mol. Biol. Cell 12, 323Ð337. Cavanaugh, a H., Hempel, W. M., Taylor, L. J., Rogalsky, V., Todorov, G. and Rothblum, L. I. (1995). Activity of RNA polymerase I transcription factor UBF blocked by Rb gene product. Nature 374, 177Ð180. !MM!Cecchini, M. J. and Dick, F. a (2011). The biochemical basis of CDK phosphorylation-independent regulation of E2F1 by the retinoblastoma protein. Biochem. J. 434, 297Ð308. Chicas, A., Wang, X., Zhang, C., McCurrach, M., Zhao, Z., Mert, O., Dickins, R. a, Narita, M., Zhang, M. and Lowe, S. W. (2010). Dissecting the unique role of the retinoblastoma tumor suppressor during cellular senescence. Cancer Cell 17, 376Ð87. Chinnam, M. and Goodrich, D. W. (2011). RB1, development, and cancer. Curr. Top. Dev. Biol. 94, 129Ð169. Classon, M. and Dyson, N. (2001). P107 and P130: Versatile Proteins With Interesting Pockets. Exp. Cell Res. 264, 135Ð47. Classon, M. and Harlow, E. (2002). The retinoblastoma tumour suppressor in development and cancer. Nat. Rev. Cancer 2, 910Ð917. Dick, F. A. and Rubin, S. M. (2013). Molecular mechanisms underlying RB protein function. Nat. Rev. Mol. Cell Biol. 14, 297Ð306. Dimova, D. K., Stevaux, O., Frolov, M. V and Dyson, N. J. (2003). Cell cycle-dependent and cell cycle-independent control of transcription by the Drosophila E2F/RB pathway. Genes Dev. 17, 2308Ð2320. Du, W. and Pogoriler, J. (2006). Retinoblastoma family genes. Oncogene 25, 5190Ð200. Du, W. (2000). Suppression of the rbf null mutants by a de2f1 allele that lacks transactivation domain. 379, 367Ð379. Dyson, N. (1998). The regulation of E2F by pRB-family?proteins. Genes Dev. 12, 2245Ð2262. Elenbaas, J. S., Mouawad, R., Henry, W. R., Arnosti, D. N. and Payankaulam, S. (2014). Role of Drosophila Retinoblastoma Protein Instability Element in Cell Growth and Proliferation. Cell Cycle 37Ð41. !MS!Fan, G. and Steer, C. J. (1999). The role of retinoblastoma protein in apoptosis. Apoptosis 4, 21Ð29. Felton-Edkins, Z. A., Kenneth, N. S., Brown, T. R. P., Daly, N. L., Gomez-Roman, N., Grandori, C., Eisenman, R. N. and White, R. J. (2003). Direct regulation of RNA polymerase III transcription by RB, p53 and c-Myc. Cell Cycle 2, 181Ð184. Frolov, M. V, Huen, D. S., Stevaux, O., Dimova, D., Balczarek-Strang, K., Elsdon, M. and Dyson, N. J. (2001). Functional antagonism between E2F family members. Genes Dev. 15, 2146Ð2160. Gasch, A. P., Spellman, P. T., Kao, C. M., Carmel-Harel, O., Eisen, M. B., Storz, G., Botstein, D. and Brown, P. O. (2000). Genomic expression programs in the response of yeast cells to environmental changes. Mol. Biol. Cell 11, 4241Ð4257. Georlette, D., Ahn, S., MacAlpine, D. M., Cheung, E., Lewis, P. W., Beall, E. L., Bell, S. P., Speed, T., Manak, J. R. and Botchan, M. R. (2007). Genomic profiling and expression studies reveal both positive and negative activities for the Drosophila Myb MuvB/dREAM complex in proliferating cells. Genes Dev. 21, 2880Ð96. Gershman, B., Puig, O., Hang, L., Peitzsch, R. M., Tatar, M. and Garofalo, R. S. (2007). High-resolution dynamics of the transcriptional response to nutrition in Drosophila: a key role for dFOXO. Physiol. Genomics 29, 24Ð34. Giacinti, C. and Giordano, a (2006). RB and cell cycle progression. Oncogene 25, 5220Ð5227. Gjidoda, A. and Henry, R. W. (2013). RNA polymerase III repression by the retinoblastoma tumor suppressor protein. Biochim. Biophys. Acta 1829, 385Ð392. Gutzat, R., Borghi, L. and Gruissem, W. (2012). Emerging roles of RETINOBLASTOMA-RELATED proteins in evolution and plant development. Trends Plant Sci. 17, 139Ð148. Hasygar, K. and Hietakangas, V. (2014). p53- and ERK7-Dependent Ribosome Surveillance Response Regulates Drosophila Insulin-Like Peptide Secretion. PLoS Genet. 10, e1004764. !S[!He, X., Chen, C.-C., Hong, F., Fang, F., Sinha, S., Ng, H.-H. and Zhong, S. (2009). A biophysical model for analysis of transcription factor interaction and binding site arrangement from genome-wide binding data. PLoS One 4, e8155. Heinz, S., Benner, C., Spann, N., Bertolino, E., Lin, Y. C., Laslo, P., Cheng, J. X., Murre, C., Singh, H. and Glass, C. K. (2010). Simple combinations of lineage-determining transcription factors prime cis-regulatory elements required for macrophage and B cell identities. Mol. Cell 38, 576Ð89. Hernando, E., Nahle, Z., Juan, G., Diaz-Rodriguez, E., Alaminos, M., Hemann, M., Michel, L., Mittal, V., Gerald, W., Benezra, R., et al. (2004). Rb inactivation promotes genomic instability by uncoupling cell cycle progression from mitotic control. Nature 430, 797Ð802. Hirsch, H. a, Gu, L. and Henry, R. W. (2000). The retinoblastoma tumor suppressor protein targets distinct general transcription factors to regulate RNA polymerase III gene expression. Mol. Cell. Biol. 20, 9182Ð9191. Hirsch, H. A., Jawdekar, G. W., Lee, K.-A., Gu, L. and Henry, R. W. (2004). Distinct mechanisms for repression of RNA polymerase III transcription by the retinoblastoma tumor suppressor protein. Mol. Cell. Biol. 24, 5989Ð5999. Huang, D. W., Sherman, B. T. and Lempicki, R. a (2009). Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat. Protoc. 4, 44Ð57. Huang, D. W., Sherman, B. T. and Lempicki, R. A. (2009). Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists. Nucleic Acids Res. 37, 1Ð13. Julian, L. M., Palander, O., Seifried, L. A., Foster, J. E. G. and Dick, F. A. (2008). Characterization of an E2F1-specific binding domain in pRB and its implications for apoptotic regulation. Oncogene 27, 1572Ð1579. Keller, S. A., Ullah, Z., Buckley, M. S., Henry, R. W. and Arnosti, D. N. (2005). Distinct developmental expression of Drosophila retinoblastoma factors. Gene Expr. Patterns 5, 411Ð421. !S/!Korenjak, M., Anderssen, E., Ramaswamy, S., Whetstine, J. R. and Dyson, N. J. (2012). RBF binding to both canonical E2F targets and noncanonical targets depends on functional dE2F/dDP complexes. Mol. Cell. Biol. 32, 4375Ð87. Korenjak, M., Kwon, E., Morris, R. T., Anderssen, E., Amzallag, A., Ramaswamy, S. and Dyson, N. J. (2014). dREAM co-operates with insulator-binding proteins and regulates expression at divergently paired genes. Nucleic Acids Res. 42, 8939Ð8953. Korenjak, M., Taylor-Harding, B. and Binn”, U. (2004). Native E2F/RBF complexes contain Myb-interacting proteins and repress transcription of developmentally controlled E2F target genes. Cell 119, 181Ð193. Krzywinski, M., Schein, J., Birol, I., Connors, J., Gascoyne, R., Horsman, D., Jones, S. J. and Marra, M. A. (2009). Circos: an information aesthetic for comparative genomics. Genome Res. 19, 1639Ð1645. Larminie, C. G., Cairns, C. A., Mital, R., Martin, K., Kouzarides, T., Jackson, S. P. and White, R. J. (1997). Mechanistic analysis of RNA polymerase III regulation by the retinoblastoma protein. EMBO J. 16, 2061Ð2071. Latorre, I., Chesney, M. A., Garrigues, J. M., Stempor, P., Appert, A., Francesconi, M., Strome, S. and Ahringer, J. (2015). The DREAM complex promotes gene body H2A.Z for target repression. Genes Dev. 29, 495Ð500. Latorre, I., Chesney, M. a, Garrigues, J. M., Stempor, P., Appert, A., Francesconi, M., Strome, S. and Ahringer, J. (2013). The DREAM complex promotes gene body H2A . Z for target repression. 495Ð500. Lewis, P. W., Beall, E. L., Fleischer, T. C., Georlette, D., Link, A. J. and Botchan, M. R. (2004). Identification of a Drosophila Myb-E2F2/RBF transcriptional repressor complex. Genes Dev. 18, 2929Ð2940. Machanick, P. and Bailey, T. L. (2011). MEME-ChIP: motif analysis of large DNA datasets. Bioinformatics 27, 1696Ð1697. Marygold, S. J., Roote, J., Reuter, G., Lambertsson, A., Ashburner, M., Millburn, G. H., Harrison, P. M., Yu, Z., Kenmochi, N., Kaufman, T. C., et al. (2007). The ribosomal protein genes and Minute loci of Drosophila melanogaster. Genome Biol. 8, R216. !S0!Miller, C., Schwalb, B., Maier, K., Schulz, D., Dmcke, S., Zacher, B., Mayer, A., Sydow, J., Marcinowski, L., Dılken, L., et al. (2011). Dynamic transcriptome analysis measures rates of mRNA synthesis and decay in yeast. Mol. Syst. Biol. 7, 458. Morris, E. J. and Dyson, N. J. (2001). Retinoblastoma protein partners. Adv. Cancer Res. 82, 1Ð54. Nicolay, B. N., Gameiro, P. a, Tschıp, K., Korenjak, M., Heilmann, A. M., Asara, J. M., Stephanopoulos, G., Iliopoulos, O. and Dyson, N. J. (2013). Loss of RBF1 changes glutamine catabolism. Genes Dev. 27, 182Ð96. Norton, J. D., Deed, R. W., Craggs, G. and Sablitzky, F. (1998). Id helix-loop-helix proteins in cell growth and differentiation. Trends Cell Biol. 8, 58Ð65. Raj, N., Zhang, L., Wei, Y., Arnosti, D. N. and Henry, R. W. (2012). Ubiquitination of retinoblastoma family protein 1 potentiates gene-specific repression function. J. Biol. Chem. 287, 41835Ð41843. Reynolds, M. R., Lane, a N., Robertson, B., Kemp, S., Liu, Y., Hill, B. G., Dean, D. C. and Clem, B. F. (2014). Control of glutamine metabolism by the tumor suppressor Rb. Oncogene 33, 556Ð66. Rhee, H. S. and Pugh, B. F. (2011). Comprehensive genome-wide protein-DNA interactions detected at single-nucleotide resolution. Cell 147, 1408Ð1419. Rubin, S. M., Gall, A. L., Zheng, N. and Pavletich, N. P. (2005). Structure of the Rb C-terminal domain bound to E2F1-DP1: A mechanism for phosphorylation-induced E2F release. Cell 123, 1093Ð1106. Schwartz, Y. B., Linder-Basso, D., Kharchenko, P. V, Tolstorukov, M. Y., Kim, M., Li, H.-B., Gorchakov, A. A., Minoda, A., Shanower, G., Alekseyenko, A. A., et al. (2012). Nature and function of insulator protein binding sites in the Drosophila genome. Genome Res. 22, 2188Ð2198. Steffen, K. K., McCormick, M. A., Pham, K. M., MacKay, V. L., Delaney, J. R., Murakami, C. J., Kaeberlein, M. and Kennedy, B. K. (2012). Ribosome deficiency protects against ER stress in Saccharomyces cerevisiae. Genetics 191, 107Ð118. !S10 bp) is in some cases >50%; the distribution of some of these variations is skewed toward specific geographic regions, possibly a result of selection for specific InR regulatory properties. Direct observation of diverse metazoan genomes confirms that the general structures of intron-rich InR genes are conserved, however conservation of putative regulatory regions is often difficult to discern, as illustrated in Figure 3-20A. To test for conservation within the functional elements we identified within the InR introns, we measured conservation in the locus using pairwise alignments from the UCSC genome browser between D. melanogaster and other species in the Drosophilidae family (Rosenbloom et al., 2015). D. simulans is ~1.4 million years diverged, while D. grimshawi is 40 million years diverged (Obbard et al., 2012). We measured the average BLASTZ score per 100 bp for regions of interest (Figure 3-20B). The degree to which regulatory sequences are conserved is quite variable between regions and between species, especially at greater evolutionary distances. Neither genome-wide survey of chromatin marks as represented in Figure 3-3 nor the overall levels of nucleotide conservation reveal the functional properties of this locus, emphasizing the necessity for fine-scale functional measurements as carried out here. !/@/! Figure 3-20: Genetic variation at InR locus across different species and among different populations. (A) Indel/SNP frequencies at the InR locus across different D. melanogaster populations. 340 genomes from 24 African D. melanogaster populations were analyzed. TajimaÕs D score indicates proportion of high and low frequency SNPs. Low frequency indels, present in less than 30 genomes (8% frequency), are not listed. !/@0!Figure 3-20 (conÕt) The InR intron fragments that are either constitutively active or respond to dFOXO overexpression or 20E treatment are indicated with bars above the InR gene structure. Conservation at the InR locus across different Drosophila and other insect species are shown below the gene structure. The species include D.simulans, D.sechellia, D.yakuba, D.erecta, D.ananassae, D.pseudoobscura, D.persimilis, D.willistoni, D.virilis, D.mojavensis, D.grimshawi, A.gambiae, A.mellifera, and T.castaneum (from top to bottom). Species conservation is obtained from USCS genome browser. (B) Sequence conservation indicated by BLASTZ scores at individual intron fragment across different Drosophila species, plotted on a log10 scale. Fragments that are either constitutively active or respond to dFOXO overexpression or 20E treatment are indicated with red dot. (Loc., the InR locus; TU, the InR transcription unit; Ex., the largest InR exon, containing protein coding region). !/@