.3. A , .
35.6. .4 2.2!

‘. ..... .3. .3, .

v 1
.,. 7751
w 4

«z

Af

. 13.1%}
1:4; 2:; .. ,

42....

WA». Wm.
0.117... 31.».

 

4.
"MEWS?
$5...
3%.}...
73.. . .W» .
’51.??? M

£8."

Ari...

59%.: s . ‘ , . , .
bar.ch . , . ‘ , 3n: .1 in}... :. 1 C .. . ‘ ‘ V ‘ .
Lu; . , , ‘ . . . .euwﬂwﬁ.§kamh ‘ : ,.. am}... Lhaﬂtnw. 5mm}. pm.

_. v..A. .

 

m
(‘1

,’ .,/ \_.
1/)» \

5&75357‘i

This is to certify that the
dissertation entitled

RELATING STRESS TO STRAIN AT THE
LEVEL OF GENE EXPRESSION

presented by

Stephen Jan Callister

has been accepted towards fulfillment
of the requirements for the

Doctoral degree in Environmental Engineering

 

 

1 am (Di/manua-

 

Major Professor’s Signature

”18/200”

 

Date

MSU is an Afﬁrmative Action/Equal Opportunity Institution

 

W;

Michigan State
University ,

 

 

PLACE IN RETURN BOX to remove this checkout from your record.
To AVOID FINES return on or before date due.
MAY BE RECALLED with earlier due date if requested.

 

DATE DUE

DATE DUE

DATE DUE

 

DECHO '8 2005

 

 

 

 

 

 

 

 

 

 

 

 

 

6/01 c:/CIRC/DateDue.p65-p.15

 

RELATING STRESS T0 STRAIN AT THE LEVEL OF GENE
EXPRESSION

By

Stephen Jan Callister

A DISSERTATION
Submitted to
Michigan State University
in partial fulﬁllment of the requirements
for the degree of
DOCTOR OF PHILOSOPHY

Department of Civil and Environmental Engineering

2004

ABSTRACT
RELATING STRESS To STRAIN AT THE LEVEL OF GENE EXPRESSION
By

Stephen Callister

The stress strain paradigm was investigated at the level of gene
expression for Saccharomyces cerevisiae using tools developed and adapted
from various ﬁelds including Ecology, Mathematics, and Engineering. Tools
presented in this dissertation were developed to calculate strain, i.e., an
aggregate measure of gene expression response, and to establish its
relationship to stress, i.e. the perturbation causing the response. Strain for the
Environmental Stress Response in S. cerevisiae was calculated using a Moment
of Area and compared across eight environmental perturbations. Additionally,
relative expression for a set of 169 genes making up the glycolysis, glycerolipid
metabolism, cell cycle, and MAPK signaling biochemical pathways was observed
for a series of applied osmotic stresses representing mild, hyper, and sever
osmotic Shock. To add biological signiﬁcance to the stress strain paradigm,
stability and its associated aspects such as resistance, which is the ability of a
system to withstand the perturbation, resilience, which is rate at which a system
returns to steady state, and reactivity, which is the rate at which a system departs
from steady state were applied at the elvel of gene expression adding biological
signiﬁcance to the observed strain. Results showed that with increased

perturbation magnitude a decrease in resilience, reactivity, and resistance

occurred. An overall decrease in stability resulted with increased strain as
measured by the Moment of Area. An exponential relationship between
perturbation magnitude and strain was observed on an individual gene basis,
biochemical pathway basis, and aggregate gene basis. This dissertation presents

the ﬁrst application of relating stress to strain on a gene level.

Copyright by
Stephen Jan Callister
2004

DEDICATION

To my Father and Mother for instilling in me the desire to comprehend and
achieve beyond my perceived expectations. Also, to Charlene for her patience
and encouragement.

ACKNOWLEDGMENTS

The body of research contained in this dissertation was not possible
without the help and guidance of my advisor Dr. Syed A. Hashsham. It has been
an honor to work with him and I have thoroughly enjoyed observing his scientiﬁc
and engineering creativity. His commitment and expressed excitement to this
research has more than once carried me through periods of disappointment, and
his willingness to take the time and give valuable criticism helped keep this
research focused on the objectives.

I also need to express gratitude to my research committee, Dr. Bruce E.
Dale for his ﬂexibility and willingness to not “stand in the way” during the last few
months of this research, Dr. Susan J. Masten for her perspective and realistic
advise, and Dr. James M. Tiedje for his patience in helping an engineer learn and
develop the molecular biology tools necessary for this research.

Mr. Sean M. Spellman gave hours of help with experimental set up and
sampling. RNA extractions were performed with the help of Mr. John J. Parnell.
Instrumentation set up and optimization was not possible without Dr. Annette P.
Thelen of the Genomics Teaching and Support Facility. To these individuals I
express my Sincere gratitude for rolling up your sleeves and jumping in to keep
things moving efﬁciently.

I need to express my appreciation to my parents, Jan and Jane Callister.
My father is credited for introducing me to and instilling in me the desire to try

and comprehend, at least to a small degree, my surroundings. Whether it was

vi

observing the stratigraphy of the Colorado Plateau, scanning the horizon on top
of Notch Peak, or fossil hunting in the Great Basin Desert he taught me as a boy
to never be afraid to ponder the answers associated with the difﬁcult
interrogatives “How?” and “Why?”. My mother is credited for teaching me that
patience in oneself, hard work, trial and error, and faith are the required
ingredients for accomplishing a difﬁcult task. During the pursuit of this Ph.D. she
continually reminded me of the long term beneﬁts associated with what I was
undertaking, which can often be difﬁcult to discern after the nth experimental
failure.

Finally, I owe the deepest gratitude to my wife, Charlene. She is the only
one, among all that have been mentioned, who has directly taken on the burden
of supporting me in this pursuit while delaying her own educational goals,

maintaining our home, and raising our two daughters. Thank you Charlene!

vii

TABLE OF CONTENTS

LIST OF FIGURES ...........................................................................
LIST OF TABLES .............................................................................

CHAPTER 1: INTRODUCTION
1.1 Background and Signiﬁcance .....................................................
1.2 Stress, Strain, and Stability ........................................................
1.3 Choice of Monitoring Technology for Strain Quantiﬁcation ...............
1.4 Choice of the Model Microorganism and Stress .............................
1.5 Objectives ..............................................................................
1.6 Hypotheses ............................................................................

CHAPTER 2: LITERATURE REVIEW
2.1 Pulse and Press Perturbations ...................................................
2.2 Stability and Generalized Expression Response Envelopes .............
2.3 Measures of Stability used in Ecology ..........................................
2.4 The Moment of Area as a Tool to Calculate Strain for the
Generalized Response Envelope ................................................
2.5 Equations used to Calculate the Aggregate Response Envelope .......
2.6 Physiological Response of S. cerevisiae to Osmotic Shock ..............
2.7 Transcriptional Response of S. cerevisiae to Osmotic Shock ............
2.8 Regulation of Speciﬁc Transcriptional Respnse to Osmotic Shock .....
2.9 Monitoring of mRNA using Reverse Transcriptase Real-Time PCR....
2.10 Current Analysis Tools for Gene Expression Data ........................

CHAPTER 3: MATERIALS AND METHODS
3.1 Sorting of Gene Expression Data for the ESR ...............................
3.2 Clustering of Relative Gene Expression Data ................................
3.3 Calculation of Strain from the Response Envelope .........................
3.4 Calculation of Strain for Large Data Sets ......................................
3.5 Selection of Genes and Primer Design .........................................
3.6 Experimental Approach for Quantitative Stress-Strain Response ......

CHAPTER 4: RESULTS
4.1 Application of the Moment of Area to Calculate Strain .....................
4.2 Development of Calibration Curves from cDNA Targets ..................
4.3 Growth of S. cerevisiae under the Applied Osmotic Stress ...............
4.4 Reactivity, Resilience, and Resistance in Response to Osmotic
Stress ....................................................................................
4.5 Calculated Strain for the Applied Stresses using the Moment of Area.

viii

X

xiii

1
4
7
9
12
13

17
19
22

24
29
30
32
36
37
41

47
48
48
49
51
53

65
73
76

77
82

CHAPTER 5: DISCUSSION

5.1 Strain as an Aggregate Response of the Transcriptome to Stress...... 89

5.2 Stability Parameters: Resilience, Reactivity, and Resistance ............ 90

5.3 Moment of Area as a Measure of Strain ....................................... 93

5.4 Relationship between Stress and Strain: The Modulus of Stability...... 96
CHAPTER 6: CONCLUSION AND FUTURE PERSPECTIVES

6.1 Conclusions ........................................................................... 99

6.2 Suggested Future Research ...................................................... 101

APPENDIX A: COMMON METRICS USED FOR MEASURING
SIMILARITY (NEARNESS), DISSIMILARITY, AND CONFIDENCE ............ 104

APPENDIX B: MATHCAD PROGRAM USED FOR CALCULATING THE
MOMENT OF AREA FOR LARGE DATA SETS ..................................... 106

APPENDIX C: DESCRIPTION OF GENES GROUPED ACCORDING TO
BIOCHEMICAL PATHWAY USED TO STUDY RELATIVE GENE
EXPRESSION TO OSMOTIC SHOCK .................................................. 110

APPENDIX D: GENES EXHIBITING A SIGNIFICANT INCREASE OR
DECREASE IN EXPRESSION STEPPING FROM 1.0 M TO 1.2 M NACL

BIBLIOGRAPHY .............................................................................. 123

LIST OF FIGURES

Some of the ﬁgures in this dissertation are in color.

Figure 2.1. (a) Pulse perturbations in which an instantaneous change either by
addition or removal of an environmental condition occurs followed by a return
to the pre-perturbed condition. Pulse perturbations are a function of
magnitude and duration. (b) Press perturbations in which change in an
environmental condition is maintained. Press perturbations are a function of
magnitude only. ........................................................................... 18

Figure 2.2. Generalized relative expression patterns in response to
environmental perturbation. An induction (a) or repression (b) in transcription
followed by a return to pre-perturbed expression levels. An induction (0) or
repression (d) in transcription followed by an establishment of new relative
expression levels. Patterns (e) and (f) could represent one of the other
expression patterns given a longer time-scale. Pattern (9) represents no
change in relative expression in response to perturbation. .................... 20

Figure 2.3. (a) Relative expression of Saccharomyces cerevisiae exhibiting
asymptotic and neighborhood stability to hyper-osmotic and temperature
perturbations (b) Application of the stability parameters to asymptotic and
neighborhood stability. .................................................................. 23

Figure 2.4. The Moment of Area encompasses the the stability parameters and
is measure of overall stability. Components of the Moment of Area applied to
asymptotic stability are shown. Components include the response envelope,
perturbation axis, moment arm, and center of mass. ........................... 27

Figure 2.5. Symbols applied to the response envelope showing how it
encompasses the stability parameters reactivity, resilience, and resistance.
.................................................................................................. 29

Figure 2.6. Physiological and speciﬁc transcriptional responses to osmotic
shock. Upon encountering an osmotic shock cell growth ceases until suitable
glycerol accumulation, triggered by the high osmotic glycerol (HOG) pathway,
allows for growth to resume. ........................................................... 36

Figure 2.7. Graphical representation of different comparison metrics used for
analysis of microarray data. (a) Pearson’s correlation in which the angle
between the mean of the gene expression vectors x and y is used to calculate
Similarity. (b) Pearson’s correlation around zero or often known as the
standard correlation, in which Similarity for the gene expression vectors x and
y is calculated as the angle from zero. (c) Euclidean distance is calculated as
the distance between the expression vectors x and y. .......................... 43

Figure 3.1. Schematic of the MathCadTM program used to calculate the
Moment of Area for large data sets. The program is divided into four sections.
Section one uses linear regression to ﬁt the data to a statistical mode. Section
two calculates the statistics associated with the regression analysis. Section
three ﬁnds the components leading to the Moment of Area calculation.
Section four creates an output table containing all of the results. ............ 50

Figure 3.2. Design approach used for conducting osmotic Shock experiments.
RNA was extracted from roughly one hundred eighty samples checked for
genomic DNA contamination and reverse transcribed into cDNA. Standard
curves for real time PCR were developed in parallel to the RNA extractions. In
all, roughly 18,000 PCR reactions were carried out in 384 well optical plates.

Figure 4.1. Comparison of Moments of Area for the eight perturbations. (a)
Using the cumulative equation to describe aggregate response of the ESR
associated with the set of two-clusters and single cluster approach. (b) Using
the displacement equation to describe the aggregate response of the ESR
associated with the set of two-clusters and Single cluster approach. ....... 67

Figure 4.2. Standard curve development results. a) Results from the ﬁrst
approach using small lengths of DNA ranging from 101 bp to 106 bp as
template for ampliﬁcation. Good reproducibility was observed, but large
separation between lengths occurred at the 104 and 106 dilutions. b) Results
from the second approach using longer lengths of DNA encompassing the
Shorter amplicons. Amplicons were grouped closer together for all dilutions
compared to the ﬁrst approach. ...................................................... 74

Figure 4.3. ODeoo measurements normalized by initial absorbance for
Saccharomyces cerevisiae. The vertical line represents the onset of the
perturbation. Retarded growth was seen followed by decreased growth rates
with increasing perturbation magnitude. ............................................. 76

Figure 4.4. Aggregate response envelopes of 153 out of 169 genes showing
expression to osmotic shock. The aggregate response was calculated using
the cumulative response equation described in the Materials and Methods
section. Resistance decreases until the 1.2 M NaCl perturbation. For this
perturbation and the 1.4 M NaCl perturbation resistance increased. ........ 78

Figure 4.5. Comparison of fold changes in relative gene expression stepping
from 1.0 M to 1.2 M NaCl. The largest fraction of genes at 1.0 M NaCI
showed at least a 2-fold increase in expression. Whereas, the largest fraction
of genes at 1.2 M NaCI Showed at least a 2-fold decrease in expression.
Percentages associated with the pathways indicate percent contribution in
terms of response for the given fraction of genes. 81

xi

Figure 4.6. Comparison of Moments of Area plotted against perturbation
magnitudes for (a) individual genes, (b) gene associated pathways, and (c)
the aggregate response of selected genes described in this research. 85

Figure 4.7. Modulus of stability values for individual genes normalized by the
aggregate value. Genes greater than 1.0 are less sensitive than genes below
1.0. More genes showed greater sensitivity to the stress imposed by osmotic
shock than less sensitivity compared to the aggregate. ........................ 88

Figure 5.1. Plot of moment of area versus perturbation magnitude in terms of

NaCl molar concentration. The domain of stability is depicted between the
lower bound of strain, Stmin, and upper bound of strain, Stmax. ................ 97

xii

LIST OF TABLES

Table 1.1. Description of eight perturbations administered by Gasch et al.,
(2000), in order to study relative gene expression in Saccharomyces
cere visiae. ................................................................................. 1 1

Table 3.1. Genes and their associated primers used in developing standards
for relative quantiﬁcation of mRNAS. Two approaches were used in
developing the standard curves. In the ﬁrst approach ampliﬁcation was
attempted from small DNA templates associated with the genes ranging from
101bp to 106 bp. In the second approach, larger DNA templates were used
encompassing the smaller amplicons in hopes improving ampliﬁcation. 64

Table 4.1. Residual comparison relative to their respective Moments of Area
for the eight perturbations. Relative residuals for the cumulative equation
were identical between the cluster approaches. This was not observed for the
displacement equation. .................................................................. 68

Table 4.2. Residual areas associated with the aggregate response equations
and cluster approaches. The residual areas are additive when applying the
cumulative equation. This property was not associated with the displacement
equation. .................................................................................... 70

Table 4.3. Comparison of estimated stability parameters and Moments of Area
for the eight perturbations. The differences in Moments of Area can be
evaluated in terms of the individual stability parameters. ..................... 72

Table 4.4. Calculated stability parameters corresponding to the aggregate
cumulative response envelopes Shown in Figure 4.4. .......................... 79

Table A.1. Additional metrics used for relating Similarity and dissimilarity of
relative gene expression data obtained from microarray studies. ........... 105

Table 0.1. Descriptions of genes, and their accompanying primer sets,
encoding proteins for the Glycolysis/GluconeogeneSiS pathway. ........... 111

Table C.2. Descriptions of genes, and their accompanying primer sets,
encoding proteins for the Glycerolipid Metabolism pathway. ................ 112

Table C.3. Descriptions of genes, and their accompanying primer sets,
encoding proteins for the Mitogen Activated Kinase (MAPK) Signaling
pathway. .................................................................................. 114

Table C.4. Descriptions of genes, and their accompanying primer sets,
encoding proteins for the Cell Cycle pathway. .................................. 116

xiii

Table 0.1. Genes exhibiting a signiﬁcant increases or decreases in relative
expression at the time of maximum displacement. Comparison between the
1.0 M and 1.2 M NaCl perturbation magnitudes. ............................... 119

xiv

CHAPTER 1

INTRODUCTION

1.1 Background and signiﬁcance

High throughput genomic technologies have dramatically increased our
ability to develop models to quantitatively predict the behavior of microorganisms
in natural and engineered systems (Schena et al.,1995; Lockhart, et al., 1996;
DeRisi et al., 1997). Many variations of the technology exist today, including the
glass slide based cDNA and oligonucleotides microarrays (DeRisi et al., 1997;
Agilent, Palo Alto, CA), in Situ synthesized oligonucleotide arrays (Affymetrix,
Santa Clara, CA; Xeotron, Houston, TX; NimbleGen, Madison, WI), and bead-
based systems (Spiro et al., 2000). Typically, whole genome expression studies
using the above technologies revolve around measuring relative changes in
messenger RNA of thousands of genes together (referred to as genome
expression pattern, response, or strain) to physical, chemical, or biological
perturbations (referred to as perturbation or stress). Knowledge of this response
is believed to be important in making new discoveries in many ﬁelds including
oncology- to develop drugs for the control of cancerous cells (Perou et al., 2000;
Swami et al., 2003), biotechnology- to produce proteins of commercial value, and
environmental engineering- to study the degradation of xenobiotic compounds
during bioremediation (Beliaev et al., 2002). In parallel to these, other low-
throughput but more quantitative technologies such as quantitative real time
polymerase chain reaction (RT-PCR) have also progressed towards higher

capacities (Hernandez et al., 2000; DeFrancesco, 2003). Quantifying the relative

abundance of mRNA from thousands of genes is more economically feasible
using the microarray platform. Quantifying the absolute abundance of a few
hundred mRNAs is better accomplished by RT-PCR.

Since the inception of microarray technology, approximately a decade
ago, more than one thousand studies have been reported covering dozens of
whole genomes of various microorganisms (Wodicka et al., 1997; Cho et al.,
1998; Hecker and Engelmann, 2000; Hinchliffe et al., 2003). Many of the data
analysis tools applied to these studies were initially developed using experiments
performed on the Sacchaormyces cerevisiea (yeast) genome because of
experimental data publicly available from the laboratories of Pat Brown and
Jospeh DeRisi (DeRisi et al., 1997; Gollub et al., 2003). The sharing of gene
expression data and results, although initially lagging, has taken on new meaning
in the ﬁelds of bioinformatics, genomics, and microbiology. A number of
mathematical tools now exist to cluster and analyze the deluge of genome
expression data under varying environmental conditions and extract qualitative
(and often quantitative) information (Eisen et al., 1998; Xia and Xie, 2001; Autio
et al., 2003; Laws et al., 2003).

A primary goal of data analysis for experiments involving environmental
perturbation and genomic response is to establish (to the extent possible)
predictive rules for the behavior of a microorganism for the system of interest
(reactor, human body, soil, aquifer, surface water etc.). The predictive rules of
behavior can be established on many organizational levels including community,

population, proteome, and transcriptome. On a transcriptome level, none of the

analysis tools at present have the capability to ﬁnd a quantitative aggregate
measure of relative gene expression for a group of genes; whether it be a cluster
of genes, a pre-defined gene set, or whole transcriptome. The aggregate
measure of gene response is referred too as strain throughout this dissertation. A
measure of strain is needed to compare the effect of different types and
magnitudes of perturbations on the transcriptome. This information in turn may
be useful in the direct control and management of gene expression and indirectly
in the control and management of the microorganism. Past studies related to the
existence of quantitative relationships, if any, between strain and applied
perturbation (referred to as stress subsequently) have not been undertaken,
because tools to calculate an aggregate measure of expression did not exist.
This posed some problems for the research presented in this dissertation,
because no published information on the aspect of relating stress to strain in a
quantitative manner on a transcriptome level was found. Hence, many
comparisons were made to studies that closely resembled the objectives and
hypotheses of this study. It is of course possible to draw parallel from dose-
response studies and toxicity evaluations at the organism level, but none of the
studies related to analysis of gene expression patterns have adopted the
approach presented here.

It is well known that many physical, chemical, social, and economic
systems behave according to a stress-strain paradigm. Young’s modulus of
elasticity (extension in the length of a wire per unit area per unit of applied load)

used to measure the strength of a material is an example of this type of

relationship important to comparing materials used in civil engineering.
Ecologists routinely study ecosystems as a perturbation-response problem and
characterize it as less or more stable. Economists forecast (to the extent
possible) the change in economic activity due to an event such as a decrease in
the interest rate. The hole in ozone layer is an example of the strain caused by
intentional and unintentional release of chloroﬂuoro-carbons (CFCS). The
foundation for this research is that “within limits”, such a stress-strain relationship
also exists for genomic systems and can be quantitatively studied by employing
aggregate measures of “stress and strain”. Such quantitative relationships will
have potential utility in control and management of microorganisms in all areas of

science.

1.2 Stress, Strain, and Stability

An aggregate measure of strain can be obtained from application of
stability principles from ecology. Ecologists analyze and measure the response of
a system by its response envelope (Nuebert and Caswell, 1997). A response
envelope is simply the curve describing the behavior of a state variable in
response to an applied stress. It is generally parabolic in shape but other
responses are possible. Gene expression patterns can also be considered as
response envelops. In ecology, the state variable is often the biomass or number
of species, or the abundance of a nutrient. In gene expression studies, the
response envelope will be attributable to the relative abundance of mRNAs
associated with individual genes. Indeed, in genomics, not all envelopes are of

the traditional parabolic shape. Five other common shapes of response are:

inverse of a parabola, a step-up response, a step-down response, and no
response at all to the stress, implying no detectable deviation from pre-perturbed
expression following the onset of a stress. It is obvious that genomic responses
are more complex and may require additional tools for analysis compared to the
response envelopes that are generally considered in ecology. However, as
presented later in this dissertation, the mechanistic mathematical equations
developed to analyze the response envelope can be applied to genome
expression patterns of many types.

“Stability" is a system-associated property extracted from the response
envelope. Its origins are rooted in the physics of perturbed motion (Merkin,
1997). In ecology, this property has been used in different manners (Grimm et
al., 1992), requiring a speciﬁc deﬁnition for this research. Stability is the ability of
a system to remain at equilibrium following a perturbation (Holling, 1973;
Harrison, 1979; Sennhauser, 1991). Although equilibrium is commonly equated
to stability in an ecological sense the use of steady state is more appropriate in

the case of relative gene expression. Steady state is deﬁned in this research as:

M

Steady State 2 = O. (1 .1)

Where, Ax(t) represents the difference between the perturbed, xp(t), and pre-
perturbed expression, xe(t), of an individual gene, or system of genes. In this
case, a system is deﬁned as a set of related genes found within a gene cluster or

predeﬁned group, such as those making up the glycolyis pathway for

Saccharomyces cerevisiae. Certain aspects of stability, or stability parameters,
are commonly used to quantify this property. The two most common parameters
are resistance and resilience. Resistance is a measure of the magnitude of
response, i.e., the maximum change in the parameter of interest in response to
perturbation. Resilience is often described either as the time required for the
system to return to its pre-perturbation state or rate at which the system returns
to its pre-perturbed state. The exact deﬁnition used in past studies has often
depended on the author’s prerogative. Resilience applied to gene response
envelopes is described in more detail in Chapter 2. Since resistance and
resilience are both quantitative terms (differing in units, however), they can be
combined appropriately to yield a single measure for a single response envelope.
And, for many response envelopes, strain calculated from the aspects of stability
can be added together to describe the system as a whole (Grimm et al., 1992).
Therefore, a value of strain made up of thousands of gene response envelopes is
theoretically feasible. Such a summation of stability parameters calculated as
Moments of Area for Six response envelopes was used to describe the functional
stability of an anaerobic microbial community (Hashsham et al., 2000). The
research presented, proposes to use the same concept of calculating the
Moment of Area of response envelopes extended to genome expression
patterns.

It is well known that only a certain percentage of genes respond to an
applied stress. These are generally those genes that are related to a group of

proteins useful in averting the effect of the applied stress. It may belong to a

single pathway or spread over a few pathways. It is apparent that an aggregate
measure of expression response (in terms of strain) iS possible, but comes at the
cost of losing resolution, i.e., information about the type of genes or pathways
responding to that stress is lost. Such loss of resolution is common in all
calculations that result in an index. The beneﬁt of calculating an index becomes
obvious when we consider that two very different types of perturbation (e.g.,
temperature and hydrogen peroxide) can be compared to each other via this
index. This would not have been possible by cluster analysis or any other
mathematical analysis that is currently in use in data analysis. It is possible,
however, to calculate the strain for one gene, one pathway, a group of genes
responding to the stress, or the whole genome. By keeping the gene identities
included in an analysis constant, the comparison of strains may be made more

biologically meaningful.

1.3 Choice of Monitoring Technology for Strain Quantiﬁcation

After some initial analysis, it became clear that the cost of testing the
applied concepts was prohibitive on microarrays. Experiments on genome
expression patterns yield relative expression ratios. These ratios must be
obtained preferably in triplicate for good data analysis. Adequate description of
time series data often requires a minimum of ﬁve to seven temporal points,
judiciously selected at times that represent important changes in the expression
ratio. In addition, the experiments needed to be conducted at several magnitudes

of stress to be linked to the magnitude of strain. In terms of microarrays,

considering six different magnitudes of stress including the control, and applying
. a factor of safety of 40% extra because some microarrays may not yield good
results, at least one hundred microarrays will be needed. Combining the cost of
labeling and hybridization buffer, the total cost was estimated at more than
$50,000 in supplies. Therefore, an alternative approach was adopted. This
approach used quantitative real-time PCR and reverse transcription of the
mRNA. Quantitative real-time PCR is often used to validate expression ratios
obtained from microarray experiments because it provides better quantitative
information and can be conducted to yield absolute amounts of the mRNA.
However, RT-PCR becomes uneconomical and more time consuming for more
than a few hundred genes. The strategy of making quantitative RT-PCR
economical stems from limiting genes to a smaller set that has been shown to be
relevant to the type of stress. Yet, the cost saving strategy for a smaller number
of genes can also be achieved for the microarray platform, especially if the
microarray is printed in house. Therefore, the determining factor between a
microarray of a smaller set of genes and RT-PCR was the reliability in
determining the magnitude of strain. RT-PCR quantiﬁcation of mRNA is more
reliable with the possibility of absolute quantiﬁcation. Hence, RT-PCT of a set of
genes responding to the model stress was chosen as the best option for this

research.

1.4 Choice of the Model Microorganism and Stress

The type of microorganisms or stress that can be used for validation of the

stress-strain relationship is abundant. Therefore, organism selection for the

purpose of this study was carried out based on the following two characteristics:

i)

The microorganism’s whole genome expression pattern should be
well studied with respect to the type of stress and response so that
an informed selection can be made about the set of genes that are
relevant to the stress. This was essential because of the use of RT-
PCR based quantiﬁcation instead of the microarray-based
quantiﬁcation.

Initial data on whole genome expression patterns must be available
in the public domain so that mathematical tool development for
quantifying the strain can be accomplished before conducting a

more extensive stress-strain experiment.

S. cerevisiae satisﬁes the above two requirements. It is one of a few

organisms in which the effects of and response to osmotic shock have been

extensively studied (Hohmann and Mager, 1997). Brieﬂy, upon encountering

osmotic shock, growth ceases and the cell’s volume decreases until the internal

production of glycerol reduces the water potential sufﬁciently for growth to

resume. Glycerol is the only compatible osmolyte produced by S. cerevisiae and

its production in response osmotic shock results from transcription of GPD1 and

a host of supporting genes (Hohmann and Mager, 1997; Martijn et al., 1999,

2000 & 2001; Mager and Siderius, 2002).

Increased levels of mRNA for GPD1 have been observed with greater
osmotic Shock, suggesting a relationship between the magnitude of the
perturbation and response. The potential dependence between perturbation
magnitude and response of this gene suggests that other genes related to
glycerol production, or to maintaining cellular viability, should also exhibit greater
response with greater osmotic Shock. Currently, no quantitative description of this
relationship exists for relative gene expression of S. cerevisiae in response to

osmotic shock.

S. cerevisiae has been extensively used as a model organism to study the
whole genome response to a number of stresses. Gasch and co—workers
collected data on the relative gene expression of S. cerevisiae to a variety of
environmental perturbations (Gasch et al., 2000). Eight of twelve perturbations,
which are summarized in Table 1.1, were selected for addressing hypothesis 1
described below. Relative mRNA abundance was monitored by Gasch et al.,
(2000), over time using a full-scale genome microarray developed by Pat Brown
and colleagues at Stanford University (Shalon et al., 1996; DeRisi et al., 1997).
This microarray contained the PCR ampliﬁcation products of approximately 6400
distinct genes along with positive controls for normalization and negative controls
to monitor hybridization quality. The data from these experiments is publicly
available from the Stanford Microarray Database (Gollub et al., 2003).
Hierarchical clustering of differentially expressed genes (relative expression
greater than 2-fold change) revealed two clusters of a combined total of

approximately 900 genes. This set of genes was labeled by Gasch et al.,(2000),

Table 1.1. Description of the eight perturbations administered by Gasch et
al.,(2000) in order to study relative gene expression in Saccharomyces.
cerevisiae.

   

EnvirMmentubaon
(Cellular Stress)

Description

 

1. Temperature Shock from 25° to 37° C

Cells were grown at 25°C collected by centrifugation,
and suspended in media at 37°C. This temperature
was maintained throughout the experiment.

 

2. Temperature shock from 37° to 25°

Cells were grown at 37°C collected by centrifugation,
and suspended in media at 25°C. This temperature
was maintained throughout the experiment.

 

3. Hyper-osmotic shock

Cells were grown to an 00600 of 0.6 and
supplemented with media at containing 2 M sorbitol.
Final concentration of sorbitol was 1.0 M.

 

4. Hypo-osmotic Shock

Cells were grown in the presence of 1.0 M sorbitol,
collected by centrifugation and suspended in media
without sorbitol.

 

5. Hydrogen peroxide shock

Cells were grown to early log phase and H202 added
for a ﬁnal concentration of 3.0x10“ M. Culture volume
and concentration were maintained throughout the
experiment.

 

6. Menadione shock

Menadione bisulfate previously suspended in water
at a concentration of 1.0 M was added to the cell
culture for a ﬁnal concentration of 1 mM.

 

7. Diamide shock

Diamide was added to the cell culture to a ﬁnal
concentration of 1.5x10’3 M.

 

8. DTT shock

Cells were grown at 25°C and dithiothreitol was

 

added for a ﬁnal concentration of 2.5x10'3 M.

as the Environmental Stress Response (ESR). One cluster of approximately 600
genes showed repression in transcription followed by a return to relative
expression prior to the perturbation. The other cluster of approximately 300
genes exhibited an induction in transcription also followed by a return to relative
expression prior to onset of the perturbation. The ESR was induced as a
transitional response to maintain cellular conditions optimal for growth and
survival. Many of the genes responding to these perturbations were also
conﬁrmed by Causton et al., (2001), and labeled as the Common Environmental

Response.

11

The above study by Gasch et al., (2000), and Causton et al., (2001),
would serve as an excellent starting point to explore if genome expression
patterns can be summarized in terms of strain. However, it would be important
that the numerical values of strain retain meaningful differences for comparison
of stress and developing the stress-strain relationships. It became clear that data
from various types of stresses obtained by Gasch et al., (2000), would be useful
for this purpose. However, it was also clear that the previously performed
experiments were not speciﬁcally designed for the purpose of testing the
existence of a stress-strain relationship. Therefore, a series of controlled
experiments would need to be performed to fully explore the validity of the
hypothesized relationships. The following sections describe the objectives and

hypotheses for this study.

1.5 Objectives

Two objectives of this study are put forth. Objective 1): To develop a
mathematical tool to calculate strain, i.e., an aggregate measure of gene
expression response, from the response envelopes of a set of genes responding
to the applied stress, and Objective 2): To study the relationship between stress
and strain at the level of gene expression for a set of selected genes in S.
cerevisiae under a series of applied osmotic stresses.

Completion of the ﬁrst objective is necessary before the second objective
can be met. The ﬁrst objective uses existing data from Joe DeRisi’S laboratory

(Gash, et al., 2000) for initial tool development. This tool is then applied to the

12

experimental data obtained as part of this research to accomplish the second

objective.

1.6 Hypotheses
The following three hypotheses will be tested as part of this study. They
are essentially sub-hypotheses emanating from the more fundamental question

of whether stress iS related to strain in biological systems.

Hypothesis 1 — The magnitude of strain for the set of genes labeled as ESR in
S. cerevisiae, can be calculated using the Moment of Area method and will be a
function of the type of stress. It will also be relatively insensitive to various

methods employed to calculate it.

Since all response envelopes have areas, calculation of strain is trivial.
Whether this calculated area retains the resolution with respect to various types
and magnitudes of stresses is key to determining the usefulness of this
approach. Proof of this ﬁrst hypothesis determines the fate of the second
hypothesis that relates to the magnitude of stress. Also, the mathematical tools

developed to test the ﬁrst hypothesis are essential for the other two.

Hypothesis 2 —The magnitude of strain is related to the magnitude of stress for
the set of genes responding to that stress. This relationship is most likely non-
linear and exists within a range given by minimum and maximum values of the

stress.

13

This range exists because strain may not be detectable below a certain
lower value of stress due to limitations posed by the detection limit of the
monitoring tools. Similarly, above a certain value of the stress an organism may

not be able to respond in any manner except inactivity or death.

Hypothesis 3 — Moment of Area is a better measure of the strain than resistance
or resilience alone because it combines and emphasizes delayed response of
the microorganisms under higher magnitudes of stress and it gives equal weight
to induction and repression.

This last hypothesis emanates from the need to reconcile the numerous
parameters that have been used to study stability of a system in ecology. For
example, resistance, resilience, reactivity, declivity, time of return, time of
maximum response, and many other parameters represent various aspects of
the response envelope (Grimm et al., 1992). The details of these parameters, the
subtle differences, and their utility in obtaining strain are presented in Chapter 2.
Testing this last hypothesis iS also essential because of the need to analyze
response envelopes that are generated by induction as well as repression. For
example, Gasch et al., (2000), observed two clusters of genes within the ESR,
although in opposite directions, one representing induction and the other
repression. Both returned to the expression levels prior to the perturbation. The
third hypothesis tests if both clusters can be combined to make a single cluster

using the moment of the area approach. Assuming that the stability of the ESR is

additive, then the Moment of Area calculated from the set of both clusters should

quantitatively agree with the Moment of Area from the Single cluster.

are:

Five tasks were completed in order to test the above hypotheses. They

Review the current technologies used to group related genes
according to patterns of relative gene expression in response to
environmental perturbation. This was essential to device methods
to calculate Moment of Area and stability parameters.

Evaluate various parameters of stability theory in ecology, and
adapt the most suitable option to obtain strain from response
envelopes of relative gene expression.

Develop mathematical tools to automatically calculate the stability
parameters from the response envelopes of relative gene
expression.

Conduct stress-strain laboratory experiments using S. cerevisiae
and at least ﬁve different magnitudes of osmotic stress (selected
because maximum information is available for this type of stress on
yeast). The experiment involved growing S. cerevisiae, applying
the stress, and collecting mRNA for quantiﬁcation by reverse
transcription and real-time PCR.

Use the mathematical tool developed in Task 4 to calculate strain

and relate it to the corresponding stress.

IS

Completion of the above tasks was carried out in three phases. In the ﬁrst
phase, a review of the current qualitative analysis techniques used to analyze
relative genome expression was conducted. This phase was important to
develop an understanding of how expressed genes are grouped in a systematic
manner. The second phase evaluated various methods to calculate stability
parameters and applied the best option to develop mathematical tools for
calculating strain from response envelopes of whole genome expression
patterns. Existing data from Gasch et al., (2000), on yeast was used for this
second phase. The third phase involved experimentation and application of the
mathematical tools developed in phase 2. S. cerevisiae was subjected to varying
magnitudes of stress (osmotic shock), messenger RNAS were harvested and
quantiﬁed using quantitative PCR. The response envelopes obtained by this
experiment were analyzed to calculate strain and study its relationship with the
applied stress.

The remaining chapters of this dissertation are arranged in the following
manner. Chapter 2 describes the pertinent literature available on stress, strain,
and stability. Chapter 3 presents the materials and methods employed in this
study including the mathematical tools. Chapters 4 and 5 present the results
obtained and the associated discussion. Finally, conclusions are presented in
Chapter 6. Appendices are provided to document the MathCad program used for
conducting calculations, list of genes used in the analysis with corresponding

pathways.

16

CHAPTER 2

LITERATURE REVIEW

2.1 Pulse and Press Perturbations

A perturbation is responsible for causing the transcriptional response
necessary to counteract the resulting cellular stress. Two distinct types of
experimental perturbations described by Bender et al., (1984), are: “pulse” and
“press”. Figure 2.1 graphically depicts both pulse and press perturbations. Pulse
perturbations are instantaneous changes in an environmental condition followed
by a return to steady state. Press perturbations are changes in an environmental
condition that are continually maintained over the period of the study or for a
speciﬁed time. Responses to pulse perturbations are commonly of short duration
compared to the duration of press perturbations, which invoke long-term
responses to the permanent change (lnchausti, 1995). Currently, perturbation
experiments focusing on gene expression have focused on the press type
(Siderius et al., 1997; Gasch et al., 2000; Causton et al., 2001; Alexandre et al.,
2001; Zhang et al., 2002).

Quantiﬁcation of the magnitude and duration of a perturbation is difﬁcult
for those occurring naturally, and often only the effects (direct and indirect) are
quantiﬁed rather than the perturbation (Bender et al., 1984; Yodzis, 1988).
Experimental perturbations resulting from artiﬁcial manipulation of the
environment or community are quantiﬁable. Quantiﬁcation of the pulse

perturbation is a function of both the magnitude and duration. Because the

duration of press perturbation may continue to inﬁnity (or the period of study),
quantiﬁcation is a function of magnitude only.

The administration and quantiﬁcation of press perturbations is simpler, but
inherent problems can result in selecting the time scale for observing the
response (Yodzis, 1988). Conclusions based on press perturbations must be
based within the boundaries of the observed response. On the other hand, pulse
perturbations can be more difﬁcult to administer and quantify, but the time scale
for observations is easier to determine. Observations in the case of pulse
perturbations are made until the environmental condition or community has

returned to its pre-perturbed state.

1?)

le.d) me)

 

 

 

 

 

 

Magnitude
Magnitude

----——---
\
\

‘=

 

 

 

 

Duration Duration

Figure 2.1. (a) Pulse perturbations in which an instantaneous change either by
addition or removal of an environmental condition occurs followed by a return
to the pre-perturbed condition. Pulse perturbations are a function of magnitude
and duration. (b) Press perturbations in which change in an environmental
condition is maintained. Press perturbations are a function of magnitude only.

18

2.2 Stability and Generalized Gene Expression Response Envelopes

Stability has generally been deﬁned as the ability of a system to remain at
equilibrium following a perturbation (Holling, 1973; Harrison, 1979; Sennhauser,
1991, Grimm et al., 1992). It is worth noting that in ecological literature, the
concept of stability is often elusive, confusing, and subjective. Hence many other
deﬁnitions can also be found. For the purpose of this research, the above
deﬁnition will be used with the exception that “equilibrium” will be substituted with
“steady state”. A system that exhibits no displacement from steady state in terms
of an observed parameter following a perturbation is deemed perfectly stable.
However, once a system is displaced from steady state, it can either return to its
pre—perturbed condition, or establish a new steady state. The former is known as
asymptotic stability, and was ﬁrst described by the Russian mathematician,
Liapunov in the early 20th century (referenced in Harrison, 1979; Merkin, 1997).
The establishment of a new steady state is known as neighborhood stability and
results from the perturbation permanently altering the environment (Lewontin,
1969; Sutherland, 1981; Murray, 1993; Hernandez, 2003). These two types of
stability have been extensively discussed and debated by theoretical and
empirical ecologists over several decades with demonstration to a few well-
characterized ecosystems (Pim, 1982).

The above deﬁnitions associated with stability are applicable in general to
any response envelope. Hence, they may be applied to the response envelope of

gene expression pattern(s) for a single gene or a group of related genes that

19

 

 

 

 

1-

 

 

 

 

 

 

Relative Gene Expression

{)1 9)

 

 

 

 

Time

Figure 2.2. Generalized relative expression patterns in response to
environmental perturbation. An induction (a) or repression (b) in transcription
followed by a return to pre-perturbed expression levels. An induction (c) or
repression (d) in transcription followed by an establishment of new relative
expression levels. Patterns (e) and (f) could represent one of the other
expression patterns given a longer time-scale. Pattern (9) represents no
change in relative expression in response to perturbation.

20

result from an environmental perturbation. Figure 2.2 summarizes the
generalized patterns of relative gene expression. Response a represents a
transcriptional activation until suitable physiological conditions of the cell are
restored followed by transcriptional repression at which point remaining mRNAs
are translated or degraded to pre-perturbed expression levels. Contrasting
response a, is response b in which transcriptional repression is followed by
activation to achieve pre-perturbed expression levels. Responses a and b might
occur when the perturbation momentarily changes environmental conditions after
which the environmental conditions returns to its pre-perturbation state.
Responses c and d denote induction and repression, respectively, until a new
equilibrium expression level is achieved to maintain suitable physiological
conditions. These responses might occur when the perturbation results in the
establishment of new environmental conditions. It is unclear whether responses e
and f will ultimately look like responses a and c or b and d, respectively, due to a
shortened time scale of observation. Because all systems tend to a recognizable
equilibrium following small perturbations (Pimm, 1982), the continuing trend of
responses e and f is unlikely. Finally, response 9 represents no change in

expression from the resulting perturbation.

Evidence of these generalized expression patterns was demonstrated by
Wen et al. (1998) while studying the temporal gene expression of the central
nervous system. Euclidean clustering identiﬁed ﬁve basic “waves” of gene
expression, comparable to responses a, c, e, f and 9 that characterized central

nervous system development. Evidence for responses 3 and b were presented

2]

by Gasch et al. (2000) following the study of gene expression in S. cerevisiae to

a number of perturbations including temperature and osmotic shock.

2.3 Measures of Stability used in Ecology

Figure 2.3a shows asymptotic stability for a cluster of genes responding to
osmotic shock and neighborhood stability for a cluster of genes responding to a
shift in temperature from 37° to 25° C (Gasch et al., 2000). These two sets of
response envelopes are similar to the generalized response envelopes
presented in Figure 2.1 a and c. The stability parameters resilience, and
resistance (Figure 2.3b) can be used to describe speciﬁc aspects of asymptotic
and neighborhood stability. An additional stability parameter, reactivity, is also
shown in Figure 2.3b. Until now, these aspects of stability have predominately
been applied to analyze response envelopes of parameters at the organism level
(Viragh, 1989; Mittelbach et al., 1995; Grandpre and Bergeron, 1997). Application
at the genome expression level will be attempted in this research for the ﬁrst
time.

Resilience and reactivity have units of inverse time and describe the
relative change in abundance of mRNAs governed by the rates of transcription,
translation, and translocation in Eukaryotes. The more resilient a cluster of
genes, the more rapidly it returns to pre-perturbation state following the
perturbation. Further, those genes that did not respond at all, as represented by
the generalized response envelope i in Figure 2.2, are considered perfectly

resilient. Mathematically, resilience is deﬁned as (Neubert and Caswell, 1997):

22

 

a)

 
 

\

MI“ I”: I

l.
.
r1
////

 

 

I
(I
I
i
/

 

    

 

 

b)

Reactivity |.

Relative Gene Expression

 
 

Resilience

Reactivity

Resistance
Resistance

 

 

 

 

 

 

Time

Figure 2.3. (a) Relative expression of Saccharomyces cerevisiae exhibiting
asymptotic and neighborhood stability to hyper-osmotic and temperature
perturbations (b) Application of the stability parameters to asymptotic and
neighborhood stability.

23

Resilience (1/time) a

[ 1 dAX(t)]. (2.1)
“me x(t) dt
Where, Ax(t) represents the difference between the perturbed, xp(t), and pre-
perturbed expression, xe(t), state of the system. According to Equation 2.1, as t
approaches co, the greater the negative value of dAx(t)/dt, the more rapidly xp(t)
approaches x9(t) and the greater the stability of the system in terms of resilience
(Harrison, 1979, DeAngelis, 1980; Nuebert and Caswell, 1997).

Reactivity is a less commonly described stability parameter, but is

complementary to resilience. Mathematically, reactivity is deﬁned as (Nuebert

and Caswell, 1997):

 

Reactivity(1/time) E _1_ dAX(’)]. (2.2)

limto —>tmax[x(t XV)

Where, to—>tmax is the limit taken from the time at which the perturbation begins,
to, to the maximum displacement from steady state, tmax. The larger the positive
value of dAx(t)/dt the more rapidly xe(t)—>xp(t) aS to——>tmax. AS with resilience, a
larger reactivity value indicates greater stability in that the system reacts more
quickly to remove the affects of the perturbation. The more reactive a cluster of
genes the more rapidly the cluster reacts to the perturbation to restore steady
state expression. A comparably larger resilience and reactivity denote greater

stability (Harrison, 1979; Sutherland, 1981; Nakajima, 1992; Ives, 1995).

24

Resistance is deﬁned as the maximum displacement from the pre-
perturbed state (Harrison, 1979; Pimm, 1984) and is deﬁned mathematically aS

(Nuebert and Caswell, 1997):

Resistance (unitless) a [My ]. (2.3)
max X 0

Resistance is the ability of the gene cluster to withstand the perturbation. It
has been compared to the concept of buffering capacity in ecology (Grimm et al.,
1992). On a molecular scale, the less resistant a gene cluster the larger the

magnitude the displacement.

2.4 The Moment of Area as a Tool to Calculate Strain for Generalized
Response Envelope

A dilemma originates when dealing with multiple response envelopes and
multiple stability measures (i.e., number of parameters). Consider a case where
resistance is high but reactivity and resilience are small because the Slopes of
each limb of the response envelope are small. A system could be more stable
with respect to one parameter and less stable with respect to another making an
overall determination of stability difﬁcult. The problem is compounded when
trying to compare multiple response envelopes Therefore, a single measure of
stability that integrates multiple envelopes and multiple stability parameters is

required. Hashsham et al., (2000), presented the Moment of Area as a measure

25

of overall stability incorporating resilience, resistance, and reactivity. They proved
the utility of this measure in comparing the functional stability of anaerobic
reactor communities perturbed by glucose. Mathematically, the Moment of Area

is deﬁned as:
i
Moment of Area (timez) -=- ZAI-ti . (2.4)
0

Where, A,- is the area under the response envelope for the ith compound and t; is
the moment arm of the envelope (Figure 2.4). Modifying the original deﬁnition, A,-
can represent the area of the response envelope and t,- its corresponding
moment arm, for a single gene, an aggregate of clustered genes, or set genes
involved in a biochemical pathway.

The moment arm is calculated as the distance from the center of mass of
the response envelope to the perturbation axis, a vertical line extending from the
starting point of the perturbation (Hashsham et al., 2000). The center of mass
represents a point of symmetry on which the stability parameters act, similar to
describing the center of mass of an object on which distributed forces act. AS a
component of the Moment of Area, the moment arm accounts for possible lag
time from the onset of the perturbation to the gene cluster response, effect of the
shape of different ampliﬁcation envelopes, and emphasizes the response that
occurs over a longer duration. Units associated with different types of
perturbations, such as temperature or concentration does not appear in the

Moment of Area. Therefore, differences between Moments of Areas are a

26

 

| Perturbation

8 1(— Response Axrs
'6 . Envelo e I

g p I Moment
5. I Am

x
LLI I

a, I

c I

In
(D ' Center of
m I Mass
.2 l

“a I
'0': I
m |
\
H... ‘ s
_ 1—_ :23”. — - - : 3;-

 

 

 

 

 

Duration

Figure 2.4. The Moment of Area encompasses the the stability parameters
and is measure of overall stability. Components of the Moment of Area applied
to asymptotic stability are shown. Components include response envelope,
perturbation axis, moment arm, and center of mass.

reﬂection of differences in stability, which allow for comparisons between i)
commonly expressed genes across different environmental perturbations, ii) the
response of a set of genes to different magnitudes of the same perturbation, or
iii) the variations just described but with different genomes. AS with the other
stability parameters a larger Moment of Area indicates a lower degree of stability.

The response envelope encompasses all the stability parameters
described previously. To demonstrate this, consider the asymptotic stable
response of a single gene, i. Dividing the response of i at tmax, the area of the

response envelope for each half can be found by integrating (Figure 2.5):

27

A.=

I

dt+ j

. dt. (2.5)
(to) tmax X,- (to)

 

 

tmax xpi(t)_xei(t) “’0 XPIII)‘Xei(t)

to X

Where, xp,-(t) is the perturbed expression, er(t) is the expression prior to the
perturbation and x,-’(t) describes the initial expression prior to the perturbation of
gene i. If expression is at steady state prior to the perturbation, then x,-'(t)= XeI(t).
Making the substitution, Ax;(t,)= xp,-(t)- xei(t), taking the anti-derivative of Equation

2.5, and assigning the appropriate limits results in Equation 2.6.

[ 1 )dAxl-(t): [ 1 mm} [ 1 dAXI-(t)]
x;(t) dt limt0—>tmax xii“) dt limtmax-—>t,Jo XII“) dt .

Equation 2.6 describes the ampliﬁcation envelope of gene i as the overall

 

 

 

rate of change of relative expression governed by reactivity for to < t _< tmax and
resilience for tmax _< t < to... If the relative response of gene i exhibited
neighborhood stability, then there is no return to pre-perturbed steady state.
Hence the resilience portion of Equation 2.6 is non-existent and the overall rate

of change of relative expression is governed by reactivity only.

28

 

 

 

 

 

 

 

 

 

 

 

 

 

 

tmax tmax

Figure 2.5. Symbols applied to the response envelope showing how it
encompasses the stability parameters reactivity, resilience, and resistance.

2.5 Equations used to Calculate the Aggregate Response Envelopes

The above exercise for a Single gene can be expanded to consider an
aggregate response of a group of genes, with the Moment of Area being the sum
of aggregate response envelopes for multiple gene sets or clusters multiplied by
their respective moment arms. Two approaches for calculating the aggregate
response are shown mathematically by Equations 2.7 and 2.8 The cumulative
response equation (Equation 2.7) sums the variance of each gene over all genes

in the set or cluster at each measured time point according to:

 

i [x i(t)—x;(t):r
cap-Z " . .
1 [’90)]2

29

 

(2.7)

Where, X,,,-(t) is the perturbed gene expression and X'i(t) is the expression prior to
the perturbation. The relative displacement equation is Similar to equation 2.7
with the exception that the square root is taken after summing the variances

according to:

 

on: :[xpm—xxt)? (2.8)

: [MOT

Equation 2.8, designated in this dissertation as the relative displacement
equation, was initially proposed to compare the resilience of energy ﬂow among
different ecosystems (O’Neill, 1976), and has found applications in determining
the stability of nutrient cycling (DeAngelis, 1980; DeAngelis et al., 1989;
DeAngelis 1992; Cottingham and Carpenter, 1994). Its application to the stability
of gene expression as an aggregate measure is novel and represents a new

quantitative tool for analyzing microarray data.

2.6 Physiological Response of S. cerevisiae to Osmotic Shock

S. cerevisiae, like all other unicellular organisms, has developed
mechanisms to counteract changes in transmembrane water potential due to
natural ﬂuctuations in external osmolytes. Because of its commercial importance,
the response of S. cerevisiae to osmotic Shock has consistently been a focus of
attention. Exposure to salt solutions, used in de-watering processes during mass

production, subject S. cerevisiae to high osmotic pressures (Attﬁeld, 1997).

30

During dough fermentation the osmotic pressure experienced by S. cerevisiae
has been Shown to nearly double (Benitez et al., 1996). Thus, development and
identiﬁcation of new yeast strains with increased osmotolerance is an ongoing
research interest.

Three physiological events occur in S. cerevisiae upon encountering
sudden increases in osmotic conditions. In the ﬁrst event, a decrease in turgor
from loss of intercellular water results in the reduction of the cell volume
(Blomberg, 2000). Depending on the severity of the osmotic Shock the cell
volume has been Shown to decrease by as much as 30% to 60% its original
volume (Benitez et al., 1996; Hohmann, 1997; Klis et al., 2002). The ability of the
cell wall to withstand such a dramatic change is mainly attributed to the presence
of the elastic properties associated with the membrane protein 81,3-Glucan,
which has a Shape comparable to a wire spring (Klis et al., 2002).

In the next event, growth ceases until adaptation to the new environment.
For S. cerevisiae adaptation primarily occurs by accumulation of glycerol in the
cytosol through internal production and by restricting transport to the outside
through the glycerol-speciﬁc channel protein, fps1p (Andreishcheva and
Zvyagilskaya, 1999; Ilomberg, 2000). Trehalose, another osmolyte, has also
been observed to accumulate in the cytosol. However, the level of accumulated
trehalose compared to glycerol indicates that it does not have a substantial
impact on osmotic resistance (Blomberg, 2000). The accumulation of inorganic
osmolytes such as KCI and NaCl to counteract osmotic stress has not been

observed in S. cerevisiae, and is rarely observed in any other unicellular

31

organism, because of severe impacts on metabolic function due to ionic
properties associated with these inorganic compounds (Yancey et al., 1982).

In the ﬁnal physiological event, the cell volume expands and growth
resumes. Yet, the imposed stress does have its costs. The cell volume does not
return to its pre-stressed state, growth rate is retarded, and population viability is
impacted. The extent of these costs varies with S. cerevisiae strains. Hohmann
and Meger, (1997), indicate that for the commonly studied W303-1A and YPH
strains growth has been observed at 1.7 M NaCl; whereas, some strains Show no
growth in as little as 0.35 M NaCl. In terms of viability, most strains exhibit a 10%
survival rate after addition of NaCl to a ﬁnal concentration ranging from 1.0 M to
1.38 M (Blomberg, 1997). However, viability has also been shown to depend on
the stage of growth. Yeast populations are less viable after an osmotic Shock
during early log phase growth compared to an osmotic Shock during late log
phase growth. The reason for increased sensitivity during early log phase has not
been clearly explained, but could be a result of thermodynamic or genetic

considerations.

2.7 Transcriptional Response of S. cerevisiae to Osmotic Shock

The transcriptional response of S. cerevisiae to osmotic Shock can be
grouped into two categories: the general stress response, which is associated
with the expression of a number of stress genes across many different
perturbations, and the speciﬁc stress response associated with expression of
genes to a Speciﬁc perturbation. In the case of osmotic Shock, if NaCI is used as

the osmolyte generating the perturbation, a Slightly expanded speciﬁc response

32

is observed compared to the speciﬁc response generated following the addition
of sorbitol. This occurs because metal toxicity associated with Na’, generates an
additional response compared to an equivalent stress caused by addition of
sorbitol. In spite of this, NaCI is generally preferred over sorbitol because of
lesser amounts required to generate the genomic response. A 1.2 M NaCl is
equivalent to 1.8 M sorbital for generating a water activity of 0.960 (Hohmann
and Mager, 1997).

The general stress response in S. cerevisiae is connected to a cis-
regulatory element common to genes showing expression to heat shock,
oxidative stress, nutrient starvation, and other encountered stresses. This cis-
regulatory element was ﬁrst identiﬁed by comparison of promoter regions for
C‘IT1, (a catalase), DDR2 (involved in DNA damage repair), and HSP12 (heat
Shock chaperon) as a ﬁve base pair consensus sequence, CCCCT, and was
subsequently named the stress response element (STRE) (Ruis and Schiiller,
1995). Targeting this consensus sequence are the transcriptional factors
Msn2/Msn4p that are negatively regulated by the cAMP-PKA Signaling pathway.
A search of the yeast genome for this element resulted in 186 potentially
regulated STRE genes with roughly 50 of these known to be functionally related
to stress response (Kobayashi et al., 1993; Marchler et al., 1993; Martinez-Pastor
et al., 1996; Eustruch, 2000.).

Despite the presence of the STRE, genes containing this element often
show different patterns of expression (Estruch, 2000). This has been explained

by the presence of additional stress induced regulatory sequences in the

33

promoter regions. For example, Gasch et al., (2000), observed that expression of
the TRX2 cluster was induced for a variety of stresses, but super-induced upon
change of the cellular redox potential. Genes in this cluster contained an
additional promoter sequence targeting the transcription factor Yap1p,
responsible for inducing transcriptional response under oxidative stress
(Fernandes et al., 1997). Therefore, the STRE in combination with additional
condition speciﬁc promoter sequences result in enhanced expression of genes
related to the speciﬁc stress. This redundancy adds to the assurance that the
appropriate genes necessary to mediate the effects of the stress are transcribed.
The use of_high throughput technologies for studying gene expression has
of lately expanded the notion of the general stress response to beyond those
genes solely containing the STRE. Gasch et al., (2000), observed differential
expression in roughly 1200 genes (constituting ~19% of the genome) to different
stresses. Out of this, the expression of 180 genes was signiﬁcantly impacted by
either Msn2/Msn4p double mutants or over-expression of Msn2/Msn4p. Similarly,
Causton et al., (2001), observed differential expression in roughly 10% of the
genome, with 47 of 216 induced genes having the STRE Site. Additionally, of the
283 genes exhibiting repression, 133 contained a previously unknown consensus
sequence, GATGAG, which occurred within 600 bp of the transcription start Site.
All of this points to a much more complicated general stress response composed
of multiple and redundant transcriptional regulatory Sites, the products of which
may be directly and indirectly involved In mediating the effects imposed by the

stress.

34

The main mechanism for the accumulation of glycerol results from
induction of GPD1, GPP2, and a host of other supporting genes with numbers
estimated from 200 to 1000 (Mager et al., 2002). Translation of GPD1 and GPP2
results in a NAD+ dependent glycerol-3-phosphate dehydrogenase and a
glycerol-3-phosphatase, respectively. As shown in Figure 2.6, dihydroxyacetone
phosphate (DHAP) produced from the isomerization of glyceraldehyde-S-
phosphate during glycolySis is catalytically reduced by Gpd1p to glycerol-3-
phosphate (Gly-3-P). Gly-3-P is then dephosphorylated by Gpp2p producing
glycerol.

Two homologous genes for GPD1 and GPP2 have also been identiﬁed
and named GPD2 and GPP1, respectively. Although GPD1 and GPD2 produce
glycerol-3-phosphate dehydrogenases their physiological assignments have
been shown to differ. Evidence for this was demonstrated by protein deletions in
which Gpd1pA mutants exhibited normal growth under anaerobic conditions, but
poor glycerol production at high osmolarity (Larsson et al., 1993). Whereas,
Gpd2pA mutants exhibited poor growth under anaerobic conditions when glycerol
was needed for redox regulation, but normal growth under osmotic shock
(Hohmann and Mager, 1997). Differential expression of GPP1 and GPP2 have
been demonstrated in the presence of increased external osmolytes indicating

functional Similarity (Norbeck et al., 1996; Martijn et al., 2000).

35

Osmotic HOG Pathway

Sh ck /

h 1
Growth Arrest i s o

 

 

 

   
 
  
 

    

ﬂ

 

  
 
   
   

 

Water Outﬂow, Glycerol
Cell volume
decreases ssk2 gpp1, gpp2
I Gly-3-P
NAD+
Glycerol Accumulation gpd1
(mrnutesvto hours) NADH
” Cell volume
increases
cholysns

\\.

Growth Resumes

 

 

 

 

 

K sko1

Figure 2.6. Physiological and specific transcriptional responses to osmotic
shock. Upon encountering an osmotic shock cell growth ceases until suitable
glycerol accumulation, triggered by the high osmotic glycerol (HOG) pathway,
allows for growth to resume.

 

2.8 Regulation of Specific Transcriptional Response to Osmotic Shock

GPD1 contains three transcriptional activation sites demonstrating its
redundancy for osmotic shock. One site contains the stress response element
connected to the general stress response, another site corresponds to the Msnp1
transcriptional factor of which little is known as to its regulation. The ﬁnal
transcriptional activation Site corresponds to the Hot1p activator. This activator is
regulated by the high osmotic glycerol pathway (HOG), which is one of four
known pathways associated with the mitogen activated phosphorylation kinase

(MAPK) signaling pathway. Another important pathway that is related to osmotic

36

shock, and also a part of MAPK, is the PKC pathway, which is responsible for
cell wall integrity (Hohmann and Mager, 1997). All pathways associated with
MAPK have a characteristic cascade of kinase activity.

As demonstrated in Figure 2.6, following an osmotic perturbation a two
component osmo-sensor (Sln1p & Ypd1p) in the cell wall activates the control
kinase (Ssk1 p) by dephosphorylation, which in turn directly interacts with the
MAPKKK (Ssk2p/Ssk22p). A cascade of kinase activity proceeds with activation
of MAPKK (Pszp) and activation by dual phosphorylation of the MAPK(Hog1p).
Active Hog1p is translocated across the nuclear membrane where it directly
interacts with Hot1p. Although Hot1p has not been directly observed to bind with
the ORF encoding GPD1, glycerol production has been shown to dramatically
decrease with a HOT1 mutant (Rep et al., 1999). Hog1p has also been shown to
regulate other transcriptional factors including Sko1p, which activates
transcription of ENA1 encoding an ATPase involved in Na+ extrusion
(Andreishcheva and Zvyagilskaya, 1999). Alternative activation of Hog1p was
demonstrated bypassing the control kinase and MAPKKK by way of the synthetic

high-osmolarity sensitive sensor, Sho1p (Maeda et al., 1995).

2.9 Monitoring of mRNA using Reverse Transcriptase Real-Time PCR
Real-time PCR allows for either the absolute or relative quantiﬁcation of

mRNA by monitoring the PCR amplification process in real-time using a

ﬂuorescent reporter. The mRNA is ﬁrst converted to cDNA by reverse

transcription. This cDNA is used as template in the ampliﬁcation process. As the

37

PCR reaction progresses ampliﬁed DNA increases exponentially over the
number of selected reaction cycles until one or more components of the reaction
mixture become limiting. At this point the rate of ampliﬁcation decreases and the
ampliﬁcation curve levels off. Quantiﬁcation occurs by extending an arbitrary
horizontal line through the linear portion of the ampliﬁcation curve, which is an S-
shaped curve depicting ampliﬁcation of the target with increasing cycle number;
and ﬁnding the corresponding reaction cycle value (x-value) deﬁned as the cycle
threshold, Ct. The cycle threshold is compared to a standard curve equating the
initial amount of target to Ct. Samples containing a larger initial amount of target,
or larger copy number, have smaller Ct values. Whereas, samples with a
relatively smaller amount of initial target, or copy number have larger Ct values.

Absolute quantiﬁcation has been demonstrated in studies measuring the
abundance of isolates from mixed communities and mRNAs in gene expression
studies (Rantakokko—Jalava and Javala, 2001; Ayala-del-Rio, 2002; VVIckert et
al., 2002). For abundance of isolate(s), standards composed of quantiﬁed target
sequences are used to prepare the standard curve. For mRNA quantiﬁcation,
however, the standard curve must account for the efﬁciency of reverse
transcription of mRNA to cDNA (Bustin, 2000). This is accomplished by
generating target DNA sequences then in-vitro transcribing them back to RNA.
The RNA is then reverse transcribed to cDNA along with the RNA of the
unknowns.

Measuring the relative abundance of gene expression is less complicated

and two approaches are commonly used. In the ﬁrst approach, a standard curve

38

is created, but the absolute abundance of target sequence does not need to be
determined. Only the dilution number associated with each standard in the curve
is required. The dilution number of the unknown is determined from the standard
curve and divided by the calibrator i.e., the dilution number of the gene in which
the unknown will be relative too. The other approach of measuring relative gene
expression from real-time PCR data uses a mathematical computation known as
the delta delta Ct (2440’) computation (Livak and Schmittgen, 2001). The MC!
refers to the difference between ACT” - ACm. Where, ACT” is the difference
between the cycle threshold of a reference, known as an endogenous control,
and unknown sample, q, and AC“), is the difference between the cycle threshold
of the endogenous control and cycle threshold of the gene selected as the
calibrator. This computational method relies on the assumption that the
ampliﬁcation efﬁciencies of the reference, calibrator, and unknown are
approximately equal. An endogenous control must be included for both absolute
and relative quantiﬁcation to account for minor differences in the amount of
starting template of unknown samples to be compared. A gene that does not
exhibit a change in expression for the condition studied may serve as an
endogenous control. Common endogenous controls include ribosomal DNA,
GAPDH, and B-actin (Toshihide et al., 2000).

Monitoring real time ampliﬁcation commonly occurs by way of one of two
generalized approaches. The ﬁrst approach uses DNA binding dyes that directly
react with double stranded DNA. SYBR green | is the most commonly used DNA

binding dye for monitoring PCR ampliﬁcation (Karsai et al., 2001). This dye emits

39

a ﬂuorescent signal upon intercalating with double stranded DNA. As the amount
of ampliﬁed DNA increases the ﬂuorescence emitted by SYBR green I is
proportional. However SYBR green I is unable to discriminate between target
and non-target ampliﬁed DNA resulting from non-speciﬁc binding of primers.
Therefore, results are sensitive to false positives. Similarly, the presence
genomic DNA contamination in reverse transcribed samples can result in high
background ﬂuorescence. Identiﬁcation of false positives can be accomplished
by generating a melting temperature (Tm) curve (Ririe et al., 1997). The Tm is the
temperature at which the double stranded DNA of the amplicon separates and is
dependent of the nucleotide composition. A peak is generated that should
correspond to the speciﬁc Tm of the amplicon as the temperature of the reaction
mixture is raised. Additional peaks, or broad peaks indicate the presence and
signiﬁcance of non-target ampliﬁed DNA.

The second approach depends on hybridization of speciﬁc probes
containing ﬂuorescent reporter systems to the amplicon of interest. Three
techniques make up the second approach for monitoring ampliﬁcation. The
Taqman assay (Perkin Elmer and Applied Biosystems) utilizes a third probe
speciﬁc to a region on the amplicon bound by the speciﬁc template primers. This
probe contains a ﬂuorescent reporter on the 5’ end and a ﬂuorescent quencher
on the 3’ end. During the annealing and extension step the probe hybridizes to
the amplicon and 5’-exonuclease activity of the DNA polymerase cleaves the
probe separating the reporter from the quencher allowing for the ﬂuorescent

emission of the reporter to be monitored. The Molecular beacons (Stratagene)

4O

assay also relies on a reporter and quencher system except the probe containing
the two dyes forms a hairpin loop due to self-complimentarity on both ends. In
the hairpin loop, the dyes are close enough so that the emission from the
reporter is quenched. At the correct annealing temperature, conformational
transition separates the ends of the probe, which then binds to its complimentary
sequence. The separation of the reporter and quencher is large enough so that
emission is no longer quenched. The ﬁnal technique relies on two separate
hybridization probes one of which contains a ﬂuorescein dye on the 3’ end and
the other contains an acceptor ﬂuorophore on the 5’ end. The excitation
spectrum of the acceptor ﬂuorophore overlaps the emission spectrum of the
ﬁuorescein donor resulting in ﬂuorescence resonance energy transfer when the
3’ end is brought into close contact of the 5’ end. The resulting emission is a red
ﬂuorescence. When the two probes are separated, only the background
ﬂuorescein donor emission is present. This last technique results in the greatest
target speciﬁcity because of the combined speciﬁcities of the template primers

and hybridization probes (Bustin, 2000).

2.10 Current Analysis Tools for Gene Expression Data

Current tools for analysis of relative gene expression data rely heavily on
systematic methods originally designed for inferring evolutionary history from
hierarchy relationships (Planet et al., 2001). The use of systematic methods,
applied to expressed genes, assumes that meaningful insights into molecular
function can be inferred from similar patterns of expression (Planet et al., 2001).

This assumption is also valid when describing the stability of clusters of genes to

41

environmental perturbation. Because all systematic methods contain biases, it is
important to have a basic understanding of the systematic method selected when
grouping genes for stability analysis.

All data obtained from gene expression studies is grouped by placing each
gene in a row and each condition studied, or each observed time point for a
single condition in a column. This forms the gene expression matrix in which
expression is studied by comparing proﬁles of genes by comparing rows, or
proﬁles of samples by comparing columns (Brazma and Vilo, 2000; Altman and
Raychaudhuri, 2001). Each entry in the expression matrix is a gene expression
vector thought of as a point in m-dimensional space (Brown et al., 2000). Once
the expression matrix has been established, analysis proceeds by calculating
similarity, dissimilarity, or conﬁdence between expression vectors using a
statistical metric, then establishing relationships among the expression vectors
using clustering.

Among the numerous metrics available for calculating similarity,
dissimilarity, or conﬁdence, Pearson’s correlation, including its modiﬁcations, and
Euclidean distance have found broad application to studying relative gene
expression. Pearson’s correlation is a measure of directional similarity between
two gene expression vectors in which each vector is treated as unit length
(Sherlock, 2000). The directional similarity can often be taken around the mean

of the two vectors (Figure 2.7a), or around another arbitrarily deﬁned reference

42

b) c)

Response

 

9x

 

 

 

 

9y

 

 

 

Time or Condition

Figure 2.7. Graphical representation of different comparison metrics used for
analysis of microarray data. (a) Pearson’s correlation in which the angle
between the mean of the gene expression vectors x and y is used to calculate
similarity. (b) Pearson’s correlation around zero or often known as the
standard correlation, in which similarity for the gene expression vectors x and
y is calculated as the angle from zero. (c) Euclidean distance is calculated as
the distance between the expression vectors x and y.

line (Figure 2.7b). Although Pearson’s correlation excels at capturing similarity in
terms of shape without regard to magnitude it is not robust to outliers (Eisen et
al., 1998; Sherlock, 2000). Heyer et al., (1999), found that this lack of robustness

resulted in false positives; genes receiving a high score of similarity in terms of
function or co-regulation, but in reality are dissimilar.

Euclidean distance is a measure of dissimilarity and takes into account
both direction and magnitude (Legendre and Legendre, 1998). As shown in
Figure 2.7c, it is calculated as the distance between two points making up the
hypotenuse of an arbitrary right triangle using the Pythagorean formula. It is often
overly robust by giving high dissimilarity scores to expressed genes with the
same shape, but different magnitudes (Heyer et al., 1999). Alternative metrics to

Pearson’s correlation and Euclidean distance are described in Appendix A. There

43

is no “absolute” for choosing the best metric, rather consideration should be
given to whether the shape of the expression pattern or the magnitude is of more
importance (Brazma and Vilo, 2000).

The result of calculating a similarity, dissimilarity, or conﬁdence value for
each pair of genes for all genes in the gene expression matrix, is a n2 matrix,
known as a similarity matrix. Values in the similarity matrix are clustered so that
important inferences about gene function, and regulation can be made. Most
clustering analysis is categorized as supervised or unsupervised. Supervised
clustering uses learning algorithms such as support vector machines (SVM)
(Brown et. al., 2000), Bayesian networks (Long et al., 2001), and others to deﬁne
classiﬁers that represent certain groups of functional genes. Functionality of
unknown expressed genes is determined by comparing the classiﬁers to
expression data.

Unsupervised clustering is performed without a priori knowledge to
classiﬁcation and consists of hierarchical and non-hierarchical methods.
Hierarchical algorithms form a similarity tree by grouping related genes into
clusters and connecting related clusters at nodes. Nodes of related clusters are
connected using branches, with the length of a branch indicating the distance
between adjoining clusters. Algorithms building a similarity trees in the fashion
are described as agglomerative, meaning the tree is formed from top to bottom.
Spellman et al., (1998), in a seminal example of applying agglomerative
clustering to microarray analysis, associated clusters of genes with their

corresponding transcription factors for S. cerevisiae during its cell cycle.

44

The relevancy of genes to their associated clusters when progressing
higher up the similarity tree is one disadvantage of agglomerative clustering
(Sherlock, 2000). When genes are clustered together, an average expression
proﬁle of the proﬁles making up the cluster is used to represent that cluster
(Eisen et al., 1998). As a cluster grows from top to bottom, the average
expression proﬁle of the cluster may not accurately reﬂect the proﬁles contained
in the smaller clusters (Sherlock, 2000). To compensate for this, Alon et al.,
(1999), demonstrated the use of a divisive clustering algorithm, or a “bottom to
top” approach of building a similarity tree. In this approach, genes are randomly
assigned to two clusters and then re-assigned to maximize the probability of
each cluster. Each cluster is then divided and the processes of re-assignment
continues until each cluster is made up of a single expression proﬁle.

Non-hierarchical clustering is useful for grouping gene expression data,
collected over time, into families of patterns associated with important events.
Common algorithms include K-means (Hartigan, 1975; Heyer et al., 1999), Self-
organizing maps (Tamayo et al., 1999; Sherlock, 2000), CAST clustering (Ben-
dor et al., 1999), and QT clustering (Heyer et al., 1999). An important example of
non-hierarchical clustering was demonstrated by Cho et al., (1998), in
characterizing genes associated with the periodic events of DNA replication,
chromosome segregation, and mitosis for S. cerevisiae. Unlike hierarchical
clustering, the user must specify values inﬂuencing the number of resulting
clusters in the output. This gives the user more control over the clustering

process and alleviates the problem of relevancy associated with agglomerative

45

hierarchical clustering. However, the user should have some external criteria or
notions as to the number of clusters desired (Planet et al., 2001). If the number of
resulting clusters is too few, important groups of genes are not separated. If the
resulting number of clusters is too many, genes that should be grouped together
are separated. User control associated with non-hierarchical clustering makes
this approach ideally suited for grouping similar responses according the
characteristic patterns of gene expression in response to environmental

perturbation.

46

CHAPTER 3

MATERIALS AND METHODS

3.1 Sorting of Gene Expression Data for ESR

Raw gene expression data for the eight environmental perturbations listed
in Table1.1 was obtained publicly from the Stanford Microarray Database
(www.dnachip.org). ESR genes and their corresponding expression values were
located and removed from the raw data by comparison to a downloadable list of
these genes, also publicly available on the Stanford Microarray Database.
Relative expression was calculated and zero transformed according to (Eisen,

1998;

Relative expression = CHZDN — CH1D . (3.1)

CH 10

 

The symbols used in equation 3.1 correspond to the data format
associated with the software, GeneScanm. Where, CH2DN is the background
subtracted and normalized emission from ﬂuorescently labeled nucleotides;
labeled with the carbocyanine dye derivative, Cy-3, incorporated into cDNA
during reverse transcription. Fluorescent emission from this ﬂuorophore
represented the transcriptional response of ESR to the perturbations. CH1D is
the background subtracted emission from cDNA containing incorporated
nucleotides labeled with another derivative of the ﬂuorescent carbocycanine

dyes, Cy-5, and corresponds to transcription of genes prior to the perturbation.

47

3.2 Clustering of Relative Gene Expression Data

To separate genes according to the two clusters observed by Gasch et al.,
(2000), relative expression data was imported into GeneSpringTM software
(Silicon Genetics, Redwood City, CA) and a similarity matrix created using a
modiﬁed Pearson’s correlation (Legendre and Legendre, 1998). The non-
hierarchical K-means clustering algorithm was used to sort genes into patterns of
induction and repression (Brazma and Vilo, 2000). Clustering results were
exported to ExcelTM and manually sorted to remove genes with missing data
points. These results were veriﬁed against results obtained by both Gasch et al.,

(2000), and Causton et al., (2001).

3.3 Calculation of Strain from the Response Envelope

Least squares regression was used in conjunction with the general linear
model, y(t)=,Bo+ ﬂ1t1+ﬂ2tz+nﬁktk+a to estimate the partial slopes of the response
envelopes. The area of each ampliﬁcation envelope was determined by
integrating the general linear model, from the start of the perturbation to the last
observation. The center of mass for each ampliﬁcation envelope was also

determined from the general model according to (Fishbane et al., 1993):

b b
It - Y(t)dt J(y(t))2 dt
x coordinate = §3_—’ y coordinate = 0.5 a b (3.2)
MW MW!
8 a

48

Where, y(t) is the speciﬁc model for each response envelope, b is the time of the
last observation, and a is the starting time of the perturbation. The aggregate
response equations (Equations 2.7 and 2.8) were used to calculate the relative
displacement as described previously. Results of the two methods were

compared for the ESR genes.

3.4 Calculation of the Strain for Large Data Sets

To expedite the process of calculating the Moment of Area for a large
number of response envelopes a program was developed using MathCad”. A
ﬂow diagram for the calculation approach is displayed in Figure 3.1. The program
was made up of four parts. Data representing ampliﬁcation envelopes of cluster
aggregates, or individual genes were imported as a text ﬁle to part one. Here,
linear regression takes place using the least squares method to calculate the
coefﬁcients (partial slopes) of the regression model (Longnecker and Ott, 2001).
These coefﬁcients were then exported as a text ﬁle, and used in parts two and
three.

In part two, a statistical analysis of the regression model was done by
calculating the sum of squares of the residuals, sum squares of regression, and
total sum of squares. The above sum of squares was used to calculate the
residual standard deviation, and coefﬁcient of determination. These statistical
measures are used to evaluate the error associated with the regression model,
goodness of ﬁt, and predictive capability, respectively. Statistical equations to

calculate these parameters can be found in Longnecker and Ott, (2001).

49

 

 

 

 

 

 

 

 

 

Regression Analysis
(Least Squares Method) ‘

 

 

 

Export
5 Coefﬁcients (—
E...a.a.t>stt ......

 

 

3

 

Calculate Area
of
Ampliﬁcation
Envelope

 

 

SS(Total)

 

 

 

l N

SS(Regression)

 

 

SS(Residual)

 

 

Y

Coef. Determination

(

 

 

 

I Residual Std. Dev.
Statistic Calculated:

 

 

 

 

 

 

 

 

 

 

 

 

Find Center of Mass 3

x-coordinate and
y-coordinate

 

O
I
O
O
C
C
O
I
v
C
O
O
O
O
C
I
O
O
O
l
O
O
O
O
O
C
I
C
O
O
C
I
I
1
v
C

 

 

 

 

 

 

 

 

 

 

 

Calculate Moment
Arm

 

 

 

 

 

 

 

 

 

Calculate Moment
of Area

 

 

 

 

 

 

 

 

 

Create Output
Table

 

 

 

 

 

__) Export Output
Table as .txt*

 

 

 

 

 

Figure 3.1. Schematic of MathCadTM program used to calculate the Moment
of Area for large data sets. The program is divided into four sections. Section
one uses linear regression to ﬁt the data to a statistical mode. Section two
calculates the statistics associated with the regression analysis. Section three
ﬁnds the components leading to the Moment of Area calculation. Section four
creates an output table containing all of the results.

50

In part three, the coefﬁcients of regression were used to ﬁnd the area, x-
coordinate, and y-coordinate of the center of mass according to Equation 3.2,
and moment arm of the response envelopes. Using the areas and moment arms,
the Moments of Area were calculated using Equation 2.4. Results from parts one,
two, and three were exported to part four of the program where an output table
was generated and exported as a text ﬁle. A detailed description of each part of

the MathCadTM program is found in Appendix B.

3.5 Selection of Genes and Primer Design

One hundred sixty nine genes were selected for observation to osmotic
shock. The selection criteria were based on: 1) whether the genes had previously
exhibited signiﬁcant expression to osmotic shock from published microarray data
(Gasch et al., 2000; Martijn: et al., 2000; Causton et al., 2001), 2) whether the
genes had been previously described in connection to osmotic shock, but did not
exhibit signiﬁcant expression indicated by microarray results (Hohmann and
Mager, 1997), and 3) whether the genes were connected with one of the four
biochemical pathways noted as important to osmotic shock. These pathways
included one regulatory pathway-the MAPK signaling pathway, and three
metabolic pathways- glycolysis/gIuconeogenesis, glycerolipid metabolism, and
cell cycle. Identiﬁcation of genes associated with the four biochemical pathways
were obtained from the Kyoto Encyclopedia of Genes and Genomes (KEGG)
database (Kanehisa and Goto, 2000).

In addition to the 169 genes, RDN18-1 that encodes 18S rRNA was

included as an endogenous control for normalization. Components of the

51

ribosome in eukaryotes are often used for normalization (Toshihide et al., 2000).
However, post-transcriptional modiﬁcations can make designed primers
ineffective for ampliﬁcation of cDNA (Venema and Tollervey, 1999). Therefore,
YBR011C (IPP1) that encodes a pyrophosphatase was also included as an
endogenous control (Kurilova et al., 1993). Changes in expression of YBR011C
to shifts osmotic shock have not been observed, and its use for normalization
has been demonstrated (Norbeck and Blomberg, 1997; Martijn et al., 1999).

Primers for each gene were designed using Primer ExpressTM 2.0 (Perkin
Elmer Applied Biosystems, Foster City, CA). Design parameters included a
melting temperature ranging from 58° C to 62° C, GC content ranging from 45%
to 55%, length ranging from 20 bp to 22 bp, and amplicon length (size of the
ampliﬁed PCR product) ranging from 100 bp to 110 bp. Short amplicons are
ideally suited for quantitative PCR because of better ampliﬁcation efﬁciency due
to an increased likelihood for complete denaturing. This allows effective binding
of primers to their target sequences. PCR efﬁciency also increases due to shorter
extensions times required by polymerization (Bustin, 2000). If possible, at least
one primer was designed to cross an intron; hence, increasing primer speciﬁcity
for the target (Wrckert et al., 2002). Additional design parameters not listed here,
but included in the software were set according to the manufacturer’s
suggestions. Appendix C describes the 169 genes, including primers, according
to the four biochemical pathways mentioned above.

Speciﬁcity of primers was evaluated by: 1) using stand alone BLAST and

the yeast nucleotide database contained in GenBank (www.ncbi.nlm.nih.gov), 2)

52

ethidium bromide staining of the ampliﬁed products using gel electrophoresis,
and 3) analyzing a dissociation curve for each primer pair following RT-PCR
using an ABI Prism 7900HT (Perkin Elmer Applied Biosystems, Foster City, CA).
Primers showing fewer than 3 mis-matches to non-target sequences as indicated
by BLAST, were discarded. Primers resulting in the formation of multiple
amplicons as indicated by double bands on gels following electrophoresis were
also discarded. Finally, dissociation curves with multiple peaks were matched to

their primer sets and disregarded in the ﬁnal data analysis.

3.6 Experimental Approach for Quantitative Stress-Strain Response

An overview of the experimental approach used to study osmotic shock in
S. cerevisiae is outlined in Figure 3.2. Osmotic shock was induced using a 5.0 M
NaCl solution added to batch cultures resulting in press perturbations of
increasing magnitude (0.5, 0.7, 1.0, 1.2, and 1.4 M). Previous experiments
showed a ﬁnal concentration of 0.4 M NaCl was required to induce osmotic
shock; whereas a 1.4 M NaCl concentration resulted in no resumption of growth
(Wuytswinkel et al., 2000). Biological replication of each perturbation was
conducted in triplicate. In addition, negative controls were included for each
perturbation to monitor gene expression under non-perturbed growth conditions.

An average of eight time point samples, including time zero, were
collected for each replicate and negative control for the six perturbations resulting
in roughly 180 samples to be processed in preparation for relative gene
expression quantiﬁcation. Total RNA was extracted and quantiﬁed followed by

detection of genomic DNA contamination using no RT control procedures. If the

53

 

Growth Perturbation
Experiment

 

 

 

 

I 5 Perturbations X 3 Replicates = 18 Experiments

 

 

 

Stored Samples

 

 

 

 

 

RNA Extraction

I.

No RT Control Yes Add. DNase

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

Gemmic DNA? Treatment PCR Ampliﬁcation 3 Calibration
of Standards Curves
Reverse
Transcription . _ .
Dilution Series 5 Paints/Calibration

 

 

 

 

 

 

 

 

 

Real Time PCR
Plate Preparation

l

Real Time PCR ‘

 

 

 

 

 

 

 

 

 

 

L Repeat real time PCR for remaining replicates.

 

 

Preliminary Analysis

 

 

 

 

Figure 3.2. Design approach used for conducting osmotic shock experiments.
RNA was extracted from roughly one hundred eighty samples checked for
genomic DNA contamination and reverse transcribed into cDNA. Standard
curves for real time PCR were developed in parallel to the RNA extractions. In
all, roughly 18,000 PCR reactions were carried out in 384 well optical plates.

54

presence of genomic DNA was signiﬁcant, additional enzymatic treatment
occurred until the presence of genomic DNA could not be detected using no RT
controls. Total RNA was then reverse transcribed to cDNA, puriﬁed, and
quantiﬁed in preparation for real time PCR.

Real-time PCR was conducted on six of the eight time point samples
using SYBR l green as the ﬂuorescent reporter to quantify gene expression
relative to accompanying calibration curves. Each time point sample for each
perturbation acted as template for the 169 genes shown previously to be
differentially expressed to osmotic shock. Preliminary analysis showed success
in the experimental design for the ﬁve perturbations. In all, more than 18,000
PCR reactions were carried out in roughly 50 384 well plates, which included

calibration standards, negative controls, and no template controls.

Growth of S. cerevisiae and application of osmotic stress. S.
cerevisiae strain W303 was obtained from the American Tissue and Culture
Collection (ATCC 200060). Cells were grown in batch culture at 30° C in YDP
medium containing 2% bacto-peptone (DIFCO, Detroit, MI), 1% yeast extract
(DIFCO, Detroit, MI), and 2% glucose. A 10x solution containing the peptone and
yeast extract was prepared separate from a 10x solution of glucose and then
combined following sterilization to prevent a darkening of the medium.

Application of osmotic shock to S. cerevisiae proceeded by harvesting an
overnight culture by centrifugation at 3500 rpm for approximately ﬁve minutes.

Cells were suspended in YDP without glucose and diluted to an ODsoo of 0.2.

55

The cell suspension was divided into four equal parts of 360 ml and 40 ml of a
1.11 M glucose solution was added. Final ODsoo of the cell suspension with
glucose was approximately 0.16. The four cell suspensions were placed on an
environmental orbital shaker set at 100 rpm and 30° C. Growth was monitored
using a Shimazdu UV-160 spectrophotometer until early log phase growth
(approximate ODeoo 0.4). An appropriate volume of a 5 M NaCl solution was
added to three of the cell suspensions to create a ﬁnal NaCl concentration of
either 0.5 M, 0.7 M, 1.0 M, 1.2 M, or 1.4 M. The fourth cell suspension not
receiving any NaCl served as a negative control for gene expression not related

to osmotic shock.

Harvesting of cells during osmotic shock. Just prior to the onset of
osmotic shock a 15 ml sample was collected from each cell suspension and used
to represent gene expression at time 0. During the perturbation cells were
harvested a regular intervals from all cell suspensions until growth resumed,
indicated by three consecutive increases in ODsoo. To avoid gene expression not
related to osmotic shock, harvesting proceeded according to established
protocols (Causton et al., 2001.). Brieﬂy, 15 ml volumes were withdrawn from
each cell suspension and pelleted by centrifugation at 3500 rpm for 3 minutes at
room temperature. The medium was decanted and pellets were immediately
plunged into liquid nitrogen then stored at —80° C until RNA extraction, which

occurred within one month of harvesting to avoid degradation.

56

RNA extraction. Total RNA extraction of the stored samples proceeded
according to modiﬁcations of the hot acid phenol extraction method developed by
Schmitt et al., (1990). While being kept on ice, cell pellets were suspended in 400
pl AE buffer (50 mM sodium acetate, 10 mM EDTA [pH 5.3]) and transferred to
1.5 ml centrifuge tubes. Immediately, 40 ul of a 10% SDS solution was added
followed by addition of an equal volume of double distilled phenol at pH 5.3
(Ambion, Austin, TX) equilibrated in AE buffer and prepared within one week of
use. Samples were mixed, incubated at 65° C for 6 minutes, then rapidly chilled
in a dry ice ethanol bath until the appearance of phenol crystals. Following the
chilling step, samples were allowed to thaw slightly then centrifuged at room
temperature for 2 minutes at 14,000 rpm. The aqueous phase was transferred to
a new tube and extracted again with acid phenol.

To the transferred aqueous phase, an equal volume of
phenol/chloroform/lsoamyl alcohol (Sigma, St. Louis, MO) was added, mixed by
vortexing and centrifuged at room temperature for 5 minutes at 14,000 rpm. The
aqueous phase was transferred to a new tube. DNase I (Ambion, Austin, TX)
was added to the above aqueous sample to achieve a 10 units per ml solution
and the sample was incubated at 37° C for 30 minutes. To inactivate the enzyme,
an additional acid phenol extraction was performed and the aqueous phase was
transferred to a new tube.

Total RNA was precipitated by addition of 0.1 volume 3 M sodium acetate,
[pH 5.3], and 2.5 volumes ice-cold 95% ethanol. Samples were placed at -20° C

overnight. RNA was harvested by centrifuging at 4° C for 30 minutes at 14,000

57

rpm. The supernatant was decanted and RNA washed once in ice cold 80%
ethanol, and centrifuged at 4° C for 20 minutes at 14,000 rpm. The supernatant
was decanted and the RNA pellet was allowed to dry, then suspended in 50 ul
Tris-EDTA [pH 8.0] (Ambion, Austin, TX). A 5 pl sub-sample was removed and
reserved for RNA quantiﬁcation, and to check for the presence of genomic DNA
contamination.

All solutions and supplies used for RNA extractions were either purchased
RNase free or treated with DEPC (diethyl pyrocarbonate) according to standard
protocols (Fedorcsak and Ehrenberg, 1966). All glassware was Placed in a 180°
C furnace for at least 8 hr to destroy residual RNase (Sambrook et al., 1989).
Additionally, prior to each extraction, a hood dedicated to RNA extraction was
cleaned to reduce or destroy residual RNase activity using either a 10% bleach

solution or RNase wipes (Ambion, Austin TX).

Check for Genomic DNA Contamination. To check for genomic DNA
contamination in the extracted RNA samples, a no RT control was performed.
Brieﬂy, 20 ng of RNA template obtained from each of the RNA sub-samples and
added to a reaction cocktail (Karsai et al., 2002) containing 10 mM Tris-HCI (pH
8.5), 50 mM KCI, 2 mM MgCl2, 0.15% Triton x-100 (Sigma, St. Louis, MO) and 40
uM dNTPs (Promega, Madison, WI). One unit of taq polymerase (Promega,
Madison, WI) was added to this mixture along with 12 pmoles of both forward
and reverse primers targeting a 104 bp region of the RDN18-1 (18$ rDNA) open

reading frame. Design of the forward primer (5’-

58

TGAACCCAATCATCCAAGACAC-S’) and reverse primer (5’-
TI'GTGGGAAAGCACCATAGTTG-3’) followed guidelines described in the gene
selection and primer design section (Section 3.6).

Final reaction volume of the above components was 25 pl, which
underwent PCR following a protocol described previously (Ayala-del-Rio, 2002).
Brieﬂy, it consisted of a pre-dwell for 2 minutes at 94° C followed by a denaturing
step for 30 seconds at 94° C, an extension step for 30 seconds at 55° C, and an
elongation step for 1 minute at 72° C. The denaturing, extension, and elongation
steps were repeated 40 times. A ﬁnal post-dwell of 1 minute at 72° C occurred
following completion of the repeated steps. A positive control containing genomic
DNA template instead of RNA template, and negative control lacking any
template were also included for quality control.

Ampliﬁed products from PCR were detected using electrophoresis on a
3% (wt/vol) agarose gel (Gibco BRL) in 1x TAE (Tris-acetate-EDTA). The
presence of a band of the same size compared to the positive control indicated
genomic DNA contamination. It was suspected that poor digestion of genomic
DNA resulted from the presence of NaCl, which can signiﬁcantly reduce activity
of the DNase I enzyme. Total RNA in contaminated samples was precipitated as
described above and suspended in 500 pl DEPC treated water to which 55 pl
DNase buffer (Ambion, Austin, TX) and 6 units of Turbo DNase I (Ambion,
Austin, TX) were added. This modiﬁed enzyme is more robust to NaCl than
traditional DNase I. The mixture was incubated at 37° C for 30 minutes and

enzyme inactivated with a phenol/chloroform/isoamyl alcohol extraction.

59

Precipitation, harvesting, and quantiﬁcation of the treated samples proceeded as
described previously. RNA samples free of genomic DNA were reverse
transcribed using a High Capacity cDNA Conversion Kit (Applied Biosystems,
Foster City, CA) and puriﬁed using Qiagen PCR puriﬁcation columns (Qiagen,
Valencia, CA). Puriﬁed cDNA was quantiﬁed by measuring absorbency at 260

nm and stored at -80° C.

Real-time PCR. The reaction cocktail used for SYBR“ Green I real time
PCR was adapted from Karsai et al., (2002). It consisted of 1x SYBRTM Green I
buffer described above. To the reaction cocktail was added 1.0 unit of Taq
Polymerase, 0.375 pl of SYBRTM Green I (Molecular Probes, Eugene, OR)
diluted from 10,000x to 1000x prior to addition, 2.3 pM 6-ROX (6-carboxy-x-
rhodamine, Molecular Probes, Eugene, OR), 10 ng cDNA, and 12.5 pmoles of
forward and reverse primers.

Thirteen pl of the reaction cocktail minus primers was distributed to 384
well optical plates (Applied Biosystems, Foster City, CA) using a Biomek 2000
workstation (Beckman Coulter, Fullerton, CA). Two pl of each primer were also
added to each well using the workstation. PCR was performed on an ABI
7900HT (Applied Biosystems, Foster City, CA) using the reaction conditions

described in the previous section.

60

Production of Taq. Because the consumption of Taq was expected to be
cost—prohibitive, it was produced in the laboratory using Thermus aquaticus DNA
polymerase expressed in Escherichia coli INValphaF’ (lnvitrogen, San Diego,
CA) following protocols established by Engelke et al., (1990), and Pluthero,
(1993). E. coli containing the taq polymerase gene on pTaq plasmid under the
control of the lac promoter was grown in LB broth containing 80 mg l’1 ampicillin.
Induction of Taq polymerase occurred by addition of IPTG (iso-propyl-1-thio—B-D-
galactopyranoside, Sigma, St. Louis, MO) to a concentration of 125 mg I". Cells
were harvested, lysed, and soluble proteins separated by centrifuging. Soluble
proteins containing the polymerase were salted out using ammonium sulfate and
dissolved in 50 mM Tris-HCI [pH 7.9], 50 mM glucose, and 1 mM EDTA. Crude
puriﬁcation of the protein extract occurred by dialysis using a Spectra/Por 50,000
MWCO membrane (Spectrum Labs, Rancho Dominquez, CA) against a 50%
(vol/vol) glycerol solution with two changes in dialysis solution every 12 hours.
The puriﬁed extract was diluted in half with a storage buffer solution containing
50 mM Tris-HCI [pH 7.9] 50 mM KCI, 0.1 mM EDTA, 1 mM dithiothreitol, 0.5 mM
PMSF (phenylmethylsulfonylﬂuoride, Sigma, St. Louis, CA), and 50% (vol/vol)
glycerol. The diluted extract was stored at —20° C. The Taq polymerase
contained in the puriﬁed extract was evaluated against purchased Taq

polymerase (Promega, Madison, WI) to estimate the activity units per micro-liter.

6|

Calibration curves for real-time PCR. Common practice using real-time
PCR includes a standard curve on each plate for each unknown amplicon. This
was not economically feasible for the 170 genes selected for this research.
Rather, it was assumed that SYBRTM green I could not discriminate between
amplicons of different genes, but could discriminate between amplicon length,
i.e., the longer the amplicon the more dye intercalates between the two DNA
strands resulting in greater ﬂuorescence per amplicon. Two approaches were
used for development of standard curves. In the ﬁrst approach, primers were
used to generate PCR products of length 101 bp, 104 bp, and 106 bp
representing the smallest, average, and largest amplicon lengths for the 170
genes. The PCR products were puriﬁed using 10,000 MWCO spin columns
(Millipore, Bedford, MA), and quantiﬁed by measuring absorbance at 260 nm.
Puriﬁed products were then adjusted to the same absorbance. A 1:10 dilution
series was created from each puriﬁed and adjusted PCR product to be used as
the starting template for the standard curves. Four dilutions, 10°, 102, 10“, and
10°, from each series were evaluated for linearity and agreement in cycle
threshold (Ct).

It seemed plausible that poor ampliﬁcation from the short pieces of DNA,
generated in the ﬁrst approach might occur. Therefore, in the second approach
additional primers were designed (Table 3.1) that encompassed the primers used
in the ﬁrst approach. Ampliﬁed PCR products using these new primers were
roughly 1000 bp. These PCR products were puriﬁed using Qiagen PCR

puriﬁcation columns (Qiagen, Valencia, CA), quantiﬁed, then adjusted as

62

previously described. A 1:10 dilution series from each product was created and
six dilutions, 10°, 10", 102, 10°, 10°, and 10°, from each series was used as
starting template for the standard curves. Ampliﬁcation of these templates

occurred satisfactorily using the original primers designed in the ﬁrst approach.

63

.303 a... 06:3 man :6: 880$an 336$ 53 3 35.028 mamaamam 8.. 35:5 acmznsomzoz oﬁ 3mz>m. ._.<<o
munqomosmm Ema cmma .: 35.098 Em mamaqma oc2mm. _: Sm 3m" muuﬁomo: mangomzo: 53 £63an :03 mam:
02> 83n_m8m mmmogmaa E2: So 933 38:6 :03 i: on 8 3m on. _: :6 $83 munamo? .mamq 02> $32m~mm
imam Emma 383833. 2.6 m3m=9 m3.=oo:m 8 .3.3<m m3._somzo:.

1.1:..on 341.3." Rename: .. ,-

 

 

 

 

 

 

 

 

 

 

 

 

0mm Gone 6 D8330: 33qu 3.2.39 3: mo<mam panama 3: >328:
3.8 «B 3.3 «8 383 BE
<0m3~<< p05 38630076686 00>o>>>ooaom>aomao>>>> mo OOO>>OOO>AOmjm>AOA>> mo :5
Eammm
<Oroos<< cams ozomuzmnasmmaam jo>ammoojoo>om>>o> mo AOOOO>>400>OHOAOAOAO>O mo .2:
- $3386
«628; poao: 0* 8m 0040000>aomjo>jo>> mm 000m>>400>>oooj>joo mm as
«02>

 

 

,-1236...».moqwmgsm>uuqomnr .

 

 

 

 

Ohm Dose 6 083.30: mogma p13? #3 $39.2... 32.39 «.3 >328:
3.2.3 5 3.3 5 583 3E
<Om3m<< U05 6380:096on OOO>>OO>AO>>OH>OQ _ _ .0 mm OO>OO>04>O>O>O>AOOO>0>H mm Smo
xmzmmm
<05an bows ozomuzmzasmmzzm OOAOO>>OHO>OAOOAOOAO>A mm >OH>HOOAOO>OAO>>OOHOO> mm 30.x
mkazmmm
453030.. =up. 38638638 OOOOAO®>>03>OO>>OQJ mo j®>>OOOOO>4>OQAmOA>O mo sows

 

 

 

 

 

 

 

 

CHAPTER 4

RESULTS

4.1 Application of the Moment of Area to Calculate Strain

Moment of Area results for the environmental stress response are
described using two comparisons. Both comparisons are used to demonstrate
that the Moment of Area is relatively insensitive to the method used to calculate
it. In the ﬁrst comparison, Moments of Area resulting from the relative cumulative
response equation (Equation 2.7), and the relative displacement equation
(Equation 2.8) are compared. These equations were used to describe the
aggregate response of the ESR. Their comparison is made to determine which
equation is better suited for this purpose and to fulﬁll the ﬁrst objective of
development of tools for describing strain.

The second comparison looks at the cluster approach used to calculate
the Moment of Area. Recall that Gasch, et al., (2000), observed that the ESR
was comprised of two clusters, one representing induction and the other
repression. To demonstrate the relative insensitivity of the Moment of Area, the
Moment of Area resulting from combining both clusters into one (single cluster),
then calculating the aggregate response envelope using either equations 2.7 or
2.8 is compared to the summed aggregate response envelopes of each cluster

multiplied by their respective moment arms (set of two clusters).

65

Comparison of equations used to calculate the aggregate response
envelope. Moments of Area for the ESR perturbed by the eight stresses
(described in Table 1.1) are shown in Figure 4.1. Figure 4.1a compares the
Moments of Area associated with the aggregate response envelopes calculated
using the cumulative response equation (i.e., Equation 2.7) and Figure 4.1b
compares the Moments of Area resulting from the displacement equation (i.e.,
Equations 2.8). Error bars in Figures 4.1a and 4.1b represent residual error
propagated through the Moment of Area calculation. Both aggregate methods
resulted in qualitative agreement of the ESR to the different perturbations, but
differed by an order of magnitude in their calculated values. Smaller values
associated with the displacement equation were expected due to the square root
being taken after summation of relative expression values rather than before
summation of relative expression values as with the case of the cumulative
equaﬁon.

Table 4.1 lists the coefﬁcients of variation calculated from the residual
standard deviation for each perturbation divided by their respective Moments of
Area. This was done to evaluate which aggregate response equation was more
suited to the regression model used for linear regression. For all perturbations,
except osmotic shock and temperature shock from 25° C to 37° C, the
contribution of the residual error to the Moment of Area was greater for the
aggregate response envelope described by the displacement equation than the
aggregate response envelope described by the cumulative equation. In the case

of the two exceptions, the contribution of the residual error to the calculated

66

 

 

 

 

 

 

3)
5e+7
- ESR composed of two clusters
ESR composed of one cluster
“A
.g 4e+7
E, 1e+7// //
x .
"8‘
*5 Key:
«2, 1-Hyper-osmotic shock
E E 2-Hypo-osmotic shock
; .r 3-Temperature shock
T E if: from 250-370 0
0 _ . ,. .
4-Temperature shock
1 2 3 4 5 6 7 8 from 37°—25°C
b 5-Hydrogen Peroxide
) Exposure
3.5e+6 6-Menadione Exposure
- ESR composed of two clusters 7-Diamide Exposure
3_0e+6 — ESR composed of one cluster
‘3‘ 8-DTT Exposure
5: 2.5e+6 -
E
3; 2.0e+6 ~/ /
775 // //
g / /
X
0)
.g 5.0e+5 -
._<_1.

 

 

 

Figure 4.1. Comparison of Moments of Area for the eight perturbations. (a)
Using the cumulative equation to describe aggregate response of the ESR
associated with the set of two-clusters and single cluster approach. (b) Using
the displacement equation to describe the aggregate response of the ESR
associated with the set of two-clusters and single cluster approach.

67

Table 4.1. Comparison of residuals relative to their respective Moments of Area
for the eight perturbations. Relative residuals for the cumulative equation were
identical between the cluster approaches. This was not observed for the

dis lacement eo uation.

 
      

     

      

 

 

 

 

 

 

 

 

 

 

 

 

 

0 0
0)
§ 1‘2 K? s
‘5 Q To ?\ 'R
m ”i N to 9
9 a? '3. e m 0: ~
8: q) 0 3 5 Q‘ m g
< to .o to ‘3 E 0) .9 m
3 o; a. 0 m m ‘o ,‘D ‘5
4.. e 0 6 Q. Q. 5’ to g .9
:3 a E s. .5, E. a s .«o .s
o <( O I l~ i~ :1: 2 Q Q
Stat 0“ Cumulative equation 0.35 0.02 0.23 0.15 0.04 0.10 0.08 0.05
0..
clusters Displacement equation 0.34 0.14 0.21 0.43 0.17 0.18 0.12 0.18
Single Cumulative equation 0.35 0.02 0.23 0.15 0.04 0.10 0.08 0.05
cluster Dis lacement uation 0.26 0.01 0.22 0.47 0.10 0.15 0.13 0.19

Moment of Area was approximately the same between the two aggregate
response equations. Therefore, the regression model was more suitable for the

cumulative equation than the displacement equation.

Effect of clusters. Within Figures 4.1a and 4.1b a comparison between
the two-cluster approach to represent the ESR versus the single cluster
approach (i.e., both clusters combined into one) to represent the ESR is made. In
Figure 4.1a the Moments of Area and the residuals associated with the two
approaches exhibited 99% agreement. This demonstrated that when the
cumulative equation was used to calculate the aggregate response envelopes of
the ESR, residual error from linear regression had little impact on the Moments of
Area.

However, a different result was observed for Moments of Area for the two

cluster approaches in Figure 4.1b. Here, the use of the displacement equation to

68

calculate the aggregate response envelopes resulted in worse agreement,
ranging from 71% to 84%, between the Moments of Area. The lack of agreement
can be explained by observing the residual error associated with linear
regression propagated to the area of the aggregate response envelopes (i.e., the
A,- term in the Moment of Area equation). Table 4.2 compares this propagated
residual error (residual area) for the single cluster and set of two clusters. Once
again, the summed residual areas of the set of two clusters and the residual area
of the single cluster are equal, conﬁrming that the Moment of Area is less subject
to error associated with linear regression when using the relative cumulative
response equation. However, the summed residual areas of the set of two
clusters do not agree with the residual area of the single cluster using the
displacement equation. In fact, the summed residual areas of the set of two
clusters were consistently larger than the area of the single cluster. This helps
explain why the Moments of Area associated with the set of two clusters in
Figure 4.1b were consistently larger than the Moments of Area associated with
the single cluster.

Hence, linear regression of the aggregate response envelopes calculated
from the displacement equation resulted in a worse ﬁt than linear regression of
the aggregate response envelopes calculated using the cumulative response
equation. This difference in ﬁt may result from taking the square root after

summation of the relative expression values required by the displacement

69

Table 4.2. Residual areas associated with the aggregate response equations
and cluster approaches. The residual areas are additive when applying the
cumulative equation. This property was not associated with the displacement

e uaﬁon.
Cumulative Equation Displacement Equation I

 

 

 

 

 

 

 

 

 

Residual Area . Residual Area .
. Resrdual Area for . Resrdual Area for
Appmac“ f°' 333°“ Diamide (min) ”RE???” Diamide (min.)
Cluster l gRepressed) 301 0 1 064 480 37
Cluster 2 (Induced) 5479 5300 367 380
Cluster 1 & 2 Summed 8489 4384 847 417
Single Cluster Approach 8489 4384 482 361

equation, which tends to “ﬂatten” the parabolic shape of the aggregate response
envelopes. This “ﬂattening” could result in larger differences between the
predicted responses using the regression model and measured responses. Thus,
a different regression model may be more suitable for the ﬁtting the aggregate
response envelopes calculated using the displacement equation. This was not
investigated because Moments of Area in Figures 4.1a and 4.1b exhibited
qualitative agreement, regardless of the aggregate response equation. However,
because the regression model in combination with the cumulative response
equation was better suited for Moments of Area, these results were carried on to

the next analysis.

Comparison of strain for various perturbations. For the magnitudes of
the perturbations tested, the stress imposed on the ESR by exposure to DTT
resulted in the largest Moment of Area and hence least stability compared to the
Moments of Area for the other perturbations. This comparison cannot easily be
made in a quantitative manner without the approach being developed here.

Hypo-osmotic shock resulted in the smallest Moment of Area, and hence the

70

greatest stability of the ESR. Normalization by the Moment of Area of hypo-
osmotic shock showed that the stability of the ESR was roughly the same for
hypo-osmotic shock and temperature shock of 25° to 37° C, over twice as stable
compared to osmotic shock. Additionally, the normalization showed that the ESR
for hypo-osmotic shock was ﬁve times as stable compared to temperature shock
37°-25° C, six times as stable compared to diamide exposure, approximately
seven times as stable compared to hydrogen peroxide exposure, and twelve
times as stable compared to menadione exposure. The stability of the ESR to
hypo-osmotic shock was ninety times more stable than to DTT exposure.

Such large differences in stability can be evaluated in terms of the stability
parameters. Calculated values for the stability parameters are displayed in Table
4.3. These values were found by taking the natural logarithm and ﬁtting the data
to the equation y(t)=,80 + ,61x, where ,61 is either the value for resilience or
reactivity. In terms of resistance, the ESR was most resistant to the hypo-osmotic
shock and least resistant to diamide exposure, which is also consistent with their
Moments of Area. In terms of reactivity and resilience, the ESR was most
reactive and resilient to osmotic shock, but least reactive to DTT exposure and
least resilient to menadione exposure. It is interesting to note that although the
ESR proved most stable to hypo-osmotic shock it was neither the most reactive
nor most resilient. However, it was the most resistant suggesting that this
attribute of stability contributed most to the overall stability. Likewise, the ESR

showed the smallest stability to DTT exposure even though it was neither the

71

Table 4.3. Comparison of calculated stability parameters and Moments of Area
for the eight perturbations. The differences in Moments of Area can be evaluated

 

 

 

 

 

 

 

 

 

 

 

 

 

 

in terms of the individual stabili carameters.
.9 o
‘6 =0 9 E B
E g a a m -‘=
0 Q t t
3 8 63x $8, ° ﬂ =§ £3 E
8 <5 Qm QN 5’ >< m g .9
9. 9. 81:. St 2% 6 s2 6
I I l~ N l~ no I O. E Q Q
Reactivity(1/min) 0.163 0.085 0.137 0.074 0.066 0.035 0.086 0.005
Resilience(1/min) -0.034 -0.015 -0.015 -0.012 -0.015 -0.003 -0.024 0005
Resistance (unitless) 69.4 13.9 27.7 63.7 78.1 52.6 118.7 32.9
Moment ofArea (min2) 1.5x106 5.1x105 8.3x105 3.0x106 3.7x106 7.0x10“3 3.2x10° 4.7x107

 

 

least resilient nor resistant. In this case however, the ESR exhibited the smallest
reactivity, by an order of magnitude compared to the other perturbations

suggesting it contributed most to the overall stability.

Contribution of clusters to the Moment of Area. The magnitudes of the
temperature perturbations and the magnitudes of the osmotic perturbations were
the same, but in opposite directions. The stability of the ESR resulting from a
decrease in 12° C was less than the stability of increasing the temperature by 12°
C. With the osmotic perturbations, the ESR was less stable with increased
osmolarity due to the addition of 1 M sorbitol and more stable upon removal of 1
M sorbitol. The difference in stability can be explained by different contributions
of the clusters originally observed by Gasch et al., (2000), to the overall stability.
The calculated percent contribution of the two clusters to the overall Moment of

Ar ea for hyper-osmotic shock showed that the induced cluster contributed
r0'49th 67% compared to the 33% contributed by the repressed cluster. The

oF’lilosite was observed for the hypo-osmotic shock. The induced cluster

72

contributed roughly 32% and the repressed cluster contributed 68%. This pattern
was not observed for the repressed and induced clusters associated with the
temperature perturbations. Here, the repressed clusters contributed most to the
overall Moments of Area for both shifts in temperatures, i.e., roughly 58% for the
temperature shift from 25° C to 37° C, and roughly 80% for the temperature shift

from 37° C to 35° C.

4.2 Development of Calibration Curves from cDNA Targets

Calibration curves were developed using standards composed of three
target lengths, 101 bp, 104 bp, and 106 bp. Two approaches were employed in
generating amplicons from the targets using RT-PCR. The ﬁrst approach
involved ampliﬁcation directly from the short targets described above. The other
involved ampliﬁcation of the short targets from larger DNA fragments roughly,
1000 bp, which encompassed the sequence of short targets. Figure 4.2a shows
calibration curves resulting from ﬁrst approach and Figure 4.2b shows the
calibration curves resulting from the second approach. Approximate copy number
is shown above each dilution. Both approaches showed good reproducibility per
size of template, except for the 10“ dilution of RND18-1, which encodes the 185
rRNA. Differences in the Ct values for this dilution ranged from roughly 15 to 25.

No apparent explanation for this poor reproducibility was discovered. For the ﬁrst

73

 

 

 

 

 

 

 

 

 

 

 

35
3O 3 104
O
25 n O 105
w 20 _
C) 1
15 d
10 ‘ o 106bp
O 101bp 10'
5 - v 104bp(RDN18-1) 9

 

 

 

 

10'7 10'6 10‘5 10“ 10‘3 10‘2 10’1 10° 101

 

b) 35 DllUthl‘l
30° 102
I 3'3
25~ 10‘
a o
0 U 105
.H m 106
0 204 o
15~ . 10’
o
106
O 106bp
o 101bp
54 v 101bp(YBR01IC)

 

 

 

10-7 10*" 10-5 10-4 10-3 10-2 10-1 10° 101
Dilution

Figure 4.2. Standard curve development results. (a) Results from the ﬁrst
approach using small lengths of DNA ranging from 101 bp to 106 bp as
template for ampliﬁcation. Good reproducibility was observed, but large
separation between lengths occurred at the 10“ and 10*5 dilutions. (b) Results
from the second approach using longer lengths of DNA encompassing the
shorter amplicons. Amplicons were grouped closer together for all dilutions
compared to the ﬁrst approach.

74

approach (Figure 4.2a), the 10'3 dilution containing 10° copies represented the
upper bound of ampliﬁcation for the given PCR components and conditions. No
ampliﬁcation occurred for the 10'1 dilution containing 1010 copies. The upper
bound of ampliﬁcation for the second approach (Figure 4.2b) was not reached
within the range of dilutions tested. However, it appears from visual extrapolation
that ampliﬁcation could have occurred with template copies greater than 107.
Disagreement between the amplicon sizes increased with dilution for both
approaches used to generate template. However, comparison of the two
approaches showed that a better agreement between amplicon sizes existed for
the second approach. For the 104 and 10° template copy numbers of the ﬁrst
approach, roughly 10 cycle thresholds separated the 106 bp and 104 bp
amplicons. In comparison, the number of cycle thresholds of the second
approach separating the same amplicon sizes for the 104 copy number was
insigniﬁcant and less than 4 for the 10° copy number. Signiﬁcant difference in
cycle thresholds of the second approach was not observed until 103 and 102 copy
numbers. A statistical F-test comparing the slopes and y-intercepts (calculated
for the 100 through 10'5 dilutions), between the different amplicon sizes, failed to
rejected the null hypothesis (p-value S 0.05) indicating that there was insufficient
evidence that one of the slopes and intercepts was statistically different from the
others. Thus, it was concluded that the slight difference in amplicon lengths
would not statistically inﬂuence the resulting cycle threshold values of the

unkowns.

75

3.0 I 77777 z,z,z.z,,,z,izz,z-

 

 

 

 

I . I
I . I
2.5 I I
I o
I I 7 7 a 9 1
C I
g 2.0 I I I . 0.5MNaCl I
S o 00.7MNaCI
6 I l o
g 15 I o. . IA1.0MNaC|
5 7 h' o A I X1.2MNaCI
Do A. 0’ A XX xxx)“ I>K14MNC|
o 1-0 ammxxx I ' a
I . I .Control
I ° ' T T T T
O
0.5 3.. I
‘ I
I I
0.0 ,1, 7 z z z, ,,, ,. , , ,1
0.0 150 300 450 600

Time (min)
Figure 4.3. OD600 measurements normalized by absorbance at perturbation
onset for S. cerevisiae. The vertical line represents the onset of the

perturbation. Retarded growth was seen followed by decreased growth rates
with increasing perturbation magnitude.

4.3 Growth of S. cerevisiae under the Applied Osmotic Stresses

Growth of S. cerevisiae following the ﬁve osmotic stresses is shown in
Figure 4.3. Absorbance values presented for each perturbation were normalized
by their readings at the time of perturbation. Each perturbation occurred at
roughly 180 minutes indicated by the vertical line. The effects of the perturbations
were demonstrated by an arrest of growth following the perturbation, and a

resumption of growth but at a retarded rate. The time of arrested growth

76

increased with perturbation magnitude and the resumed growth rate decreased.
At 0.5 M NaCl growth arrest either did not occur or was not detected by the
measurement time scale. The rate of resumed growth was 1.47 x 10'2 i 4.5 x 10'
3 min", slightly less than the unperturbed growth rate (1.63 x 10'2 min"). At 1.4 M
NaCl, resumption of growth was not observed for 3 1/2 hrs. following onset of the
perturbation. The new growth rate, 2.58 x 10’3 :t 5.6 x 104 min", was an order of

magnitude less than the growth rate of the control.

4.4 Reactivity, Resilience, and Resistance in Response to Osmotic Stress
Using the developed tools described in Sections 4.1, gene expression
data was analyzed to determine the amount of strain for the ﬁve levels of applied
stresses (0.5, 0.7, 1.0, 1.2, and 1.4 M NaCl). Only 153 genes out of the original
169 were used in describing the aggregate response envelope. Cycle threshold
values could not be determined by real-time PCR for the remaining sixteen
genes and therefore they were not included in the analysis. The aggregate
response envelopes shown in Figure 4.4 were calculated for each perturbation
using the cumulative response equation. Asymptotic stability, i.e., a return to pre-
perturbed expression following perturbation, was displayed in response to each
perturbation. It was interesting to note that the time associated with the
occurrence of maximum displacement increased with an increase in the
magnitude of stress. Maximum displacement for the 1.4 M NaCI stress occurred
at a time that was approximately 6-fold longer than the maximum displacement

for the lowest stress tested (0.5 M NaCl). To see if the stability parameters also

77

 

2500

+ 0.5 M NaCl
—0— 0.7 M NaCl
+ 1.0 M NaCI

V 1.2 M NaCI
—I— 1.4 M NaCI

 
   
  

2000 -

1500 d

1000 3

500 -

Cumulative Response [Arfus x (rfus)'1]

 

 

 

l I I I I I

0 50 100 150 200 250 300 350

Time (min.)

Figure 4.4. Aggregate response envelopes of 153 out of 169 genes showing
expression to osmotic shock. The aggregate response was calculated using
the cumulative response equation described in Chapter 3. Resistance
decreases until the 1.2 M NaCI perturbation. For this perturbation and the 1.4
M NaCI perturbation resistance increases.

78

Table 4.4. Calculated stability parameters corresponding to the aggregate
cumulative res onse envelo es shown in Fi ure 4.4.

 

 

 

 

 

[NaCI] Reactivity (min' ) Std. Dev. Resilience (min' ) Std. Dev. Resistance Std. Dev.
0.5 1.2x10'1 2.2x10'2 6.4x10'2 1.9x10'2 1198 513
0.7 7.2x10’2 1 .7x10'2 -5.0x10‘2 2.5x10’2 1800 679
1.0 5.2x10‘T 2.6x10’3 -3.8x10'2 6.7x10’3 2194 210
1.2 3.0x10’" 6.0x10’3 -1.2x10‘2 1.5x10“ 972 272
1.4 7.9x10'3 3.4x10‘2r -7.Ox10'3 3.8x10‘3 1224 109

 

 

 

 

 

 

reﬂected this trend, reactivity, resilience, and resistance values were also
calculated. These values are listed in Table 4.4. Reactivity associated with the
0.5 M NaCI perturbation was two orders of magnitude greater than the reactivity
associated with the 1.4 M NaCI. Resilience also decreased with increased
perturbation magnitude. Resilience for the 0.5 M NaCl was roughly one order of
magnitude larger than resilience for the 1.4 M NaCI stress. Recall that a
comparatively larger negative value for resilience and a comparatively larger
positive value for reactivity denote greater stability associated with these aspects.
Thus, the aggregate response envelope was more stable in terms of reactivity
and resilience for the 0.5 M NaCI stress than the other stresses. Recall that for
resistance a comparatively larger value denotes lesser stability. Thus, stability in
terms of resistance, decreased for the 0.5, 0.7, and 1.0 M NaCI stresses but not
for the 1.2 and 1.4 M NaCI stresses. For these last two stresses, the resistance
suddenly increased and was similar in magnitude to the resistance displayed by
the 0.5 M NaCI stress.

The increased resistance stepping from 1.0 M to 1.2 M NaCI
corresponded to a 55% net reduction in the aggregate response at the time of

maximum displacement indicating a decrease in contribution of genes and

79

possibly of a shift in the make-up of signiﬁcantly expressed genes. Forty nine
genes exhibited at least a 2-fold decrease in expression at the time of maximum
displacement from 1.0 M to 1.2 M NaCI. These genes accounted for roughly 83%
of the reduction in aggregate response. Within this percentage, genes making up
the MAPK signaling pathway showed the largest reduction, ~45%, followed by
the glycolysis pathway, ~44%, cell cycle pathway, ~8%, glycerolipid metabolism
pathway, ~3% (Figure 4.5).

Of interest is the number of genes in each pathway and the pathway's
contribution in the observed reduction in aggregate response. The MAPK
signaling pathway included 11 of the 49 genes. However, the cell cycle pathway,
which included 25 of the 49 genes, contributed least among the pathways. Even
more dramatic are the 6 genes of the glycolysis pathway, which accounted for
the second highest contribution in aggregate response reduction indicating the
signiﬁcant role of these genes. One of which was identiﬁed as ALD3 (YMR169C)
encoding an aldehyde dehydrogenase known to respond to osmotic shock and
the general stress response mechanism of S. cerevisiae (Norbeck and Blomberg,
2000). Another gene within this pathway was identiﬁed as PDCB (YGR087C), an
isozyme of pyruvate decarboxylase (Eberhart et al., 1999). From the MAPK
signaling pathway was identiﬁed CTT1 (YGR088W), which exhibited the second
largest reduction in resposne and is known to encode a catalase that also plays a
role in the general stress response (Vlﬁesser et al., 1991).

The gross reduction in aggregate response at the point of maximum

displacement was offset by an increase in relative expression. Twenty ﬁve genes

80

Number of Genes (x)

 

Figure 4.5. Comparison of fold changes in relative gene expression stepping
from 1.0 M to 1.2 M NaCl. The largest fraction of genes at 1.0 M NaCI showed
at least a 2-fold increase in expression. Whereas, the largest fraction of genes
at 1.2 M NaCl showed at least a 2-fold decrease in expression. Percentages
associated with the pathways indicate percent contribution in terms of

1.0M

 

1.2M

 

MAPK 19%

Glycolysis 15%

 

l—)

Glycerolipid 10%

 

x > 2-fold

Cell Cycle 56%

 

increase

(_J

2-f0ld > X

 

> 1-fold

 

T

1-fold > X

 

> 0.5-fold

 

Glycolysis 44%

 

r—-)

Glycerolipid
3%

 

MAPK 45%

 

x < 0.5-

 

 

fold
(_____J

 

 

Cell Cycle 8%

 

 

response for the given fraction of genes.

8!

 

exhibited a 2-fold increase in expression going from 1.0 M to 1.2 M NaCI
(Appendix D). Here, 7 of the 25 genes from cell cycle pathway contributed
roughly 56% to this increased response, followed by MAPK signaling, 3 genes
contributing ~23%, glycolysis, 5 genes contributing 15%, and glycerolipid
metabolism, 6 genes contributing 10%. Within the cell cycle pathway, the largest
contributing genes where connected with regulating cell growth including FUSB
(YBL016W), which is required for cell arrest during cell conjugation (Cherkasova
et al., 1999), and NET1 (YJL076W), which encodes a nucleolar protein involved

in mitosis regulation (Shou et al., 1999).

4.5 Calculated Strain for the Applied Stresses using the Moment of Area
The inconsistent behavior between the aspects of stability does not allow
for an overall comparison of stability for the aggregate gene responses
associated with the perturbation magnitudes. Therefore, the Moments of Area
were calculated for the aggregate response envelopes composed of the 153
genes yielding cycle threshold values according to the procedures described in
Chapter 3. Comparatively larger Moment of Area values denote lesser overall
stability. Thus, for the aggregate response envelopes displayed in Figure 4.5,
overall stability decreased with an increase in the magnitude of stress. This was
not surprising for perturbation magnitudes less than 1.2 M NaCI. However, the
decreased overall stability despite the increased resistances for the 1.2 and 1.4
M NaCI perturbations was surprising. Evaluation of the components making up

the Moment of Area revealed the inﬂuence of the moment arm on the overall

82

stability, which gives additional weight to response envelopes occurring over a
larger time frame.

The 153 genes were divided according to biochemical pathways, i.e.,
glycolysis, glycerolipid metabolism, MAPK signaling, and cell cycle pathways,
and the contribution of each pathway on the strain was evaluated. For the 0.5 M
NaCI perturbation, glycolysis and the MAPK signaling pathways were roughly
equal in their contribution to the Moment of Area and represented the largest
contribution followed by the glycerolipid and cell cycle pathways. The largest
contribution to the Moments of Area for the 1.2 M NaCl and 1.4 M NaCI stresses
came from the glycerolipid pathway. A transition from glycolysis as the largest
contributor to the glycerolipid metabolism pathway becoming the largest
contributor was observed for the 0.7 M NaCl and 1.0 M NaCI. Glycolysis
contributed least to the Moment of Area associated with the 1.4 M NaCI

perturbation.

4.6 Relationship Between Stress and Strain for Gene Expression Patterns
The Moment of Area calculated above for each magnitude of stress is
plotted in Figure 4.6. It showed an exponential increase with increase in the
magnitude of stress. This relationship was also observed on an individual gene
basis (Figure 4.6a), and for each group of genes making up the four pathways
(Figure 4.6b). Data in Figure 4.6c was linearized by taking the natural logarithm
of the Moments of Area. The slope of the linearized data was found by linear

regression. This slope is deﬁned as the modulus of stability, which predicts the

83

strain associated with changes in stress. A modulus of stability of 4.14 :l: 0.12
[(minz) x M"] was calculated for the aggregate response composed of the 153
genes. This predicts that for every unit change in osmotic stress a change in 4.14
units of strain occurs. Hence, a comparatively larger modulus of stability value
denotes a greater change in strain per unit change in stress. Comparison of
modulus of stability values to a deﬁned standard conveys a degree of sensitivity
to stress relative to the standard. The modulus of stability is of course speciﬁc to
many things including the microorganisms, the type of stress, the set of genes
included in the analysis, and the manner in which it was calculated.

The modulus of stability for genes grouped according to biochemical
pathway ranged from 5.31 :l: 0.33 [(minz) x M"] for the glycerolipid pathway to
3.92 :i: 0.21 [(minz) x M“] for glycolysis. This suggests that the stability associated

with the transcriptional response of genes making up the glycerolipid metabolism

84

 

1.2e+6
1.0e+6 4 0
8.0e+5 ‘
6.0e+5 «
4.0e+5 ‘

2.0e+5 4

 

0.0 1

 

1.8e+7 . , . T
1.6e+7 - b)

1.4e+7 ~—
1.2e+7 4
1.0e+7 — + Glycolysis
-O— MAPK Signaling
8.0e+6 ‘ -v— Cell Cycle
6.0e+6 - ‘V— GIYcerolipid
4.0e+6 —
2.0e+6 4

0.0 a
5.0e+7 i i i r i

4.0e+7 - f

3.0e+7 —

 
   

 

Moment of Area [(Arfus x (rfus)") x minzl

2.0e+7 —

1.0917 ‘ I

0.0~* ’

 

 

 

I I I I I

0.4 0.6 0.8 1.0 1.2 1.4 1.6
Perturbation Magnitude (M)
Figure 4.6. Comparison of Moments of area plotted against perturbation

magnitude for (a) individual genes, (b) gene associated pathways, and (c) the
aggregate response of selected genes described in this research.

85

pathway was more sensitive to changes in osmotic stress compared to the
aggregate modulus of stability. Whereas, the stability associated with the
transcriptional response of genes making up the glycolysis pathway was roughly
equal in sensitivity to the modulus of stability of the aggregate. Of interest is the
cell cycle pathway which had a modulus of stability (4.81 i 0.51 [(minz) x M"])
greater than the aggregate set of genes and almost equal to the glycerolipid
pathway, yet contributed the least to the aggregate Moment of Area for each
perturbation. This suggests that while the cell cycle demonstrated greater overall
stability per perturbation, it was more sensitive to osmotic shock compared to the
other glycolysis and MAPK signaling pathways.

Modulus of stability values for individual genes are shown in Figure 4.7
normalized by the aggregate modulus of stability. Only those genes that
exhibited a r2 value greater than 0.90 were included in this ﬁgure (roughly 110
out of 153 genes). Genes that were less sensitive to stress relative to the
aggregate modulus of stability are plotted below 1.0. Whereas, genes that were
more sensitive to stress relative to the aggregate modulus of stability are plotted
above 1.0. A larger number of genes exhibited greater sensitivity to osmotic
shock than lesser sensitivity. Of those genes that showed lesser sensitivity most
were part of the glycolysis pathway. The gene, PGM2, showed the least
sensitivity to osmotic stress among the genes plotted. However, no single gene
stood out among those exhibiting greater relative sensitivity to osmotic stress. A
few genes per pathway falling below 1.0 are labeled in Figure 4.7. Among these,

ALD3, which encodes an aldehyde dehydrogenase, proved more sensitive to

86

 

osmotic stress compared to the aggregate for both the glycolysis and glycerolipid
pathways. CDC24, which encodes a GTP/GDP exchange factor showed slightly
more sensitivity than STE7, which encodes a MAP kinase kinase. CDC20, a cell
division control protein showed slightly more sensitivity than its neighbor, MAD2.
A description of all the expressed genes indicated in Figure 4.7 can be found in

Appendix C.

87

Modulus of Stability [(minz) x M"]

Figure 4.7. Modulus of stability values for individual genes normalized by the
aggregate modulus of stability value. Genes greater than 1.0 are relatively
more sensitive to stress than the aggregate modulus of stability. Genes below
1.0 are less sensitive to stress than the aggregate modulus of stability. More
genes showed greater sensitivity to the stress imposed by osmotic shock than

2.5

 

-8 N
01 O
l I

.5
o

p
08 ALD3 MNT3
I

 

.0
0|
1

 

0.0

 

 

C0924 STE? cogzojADz
A503 1/ I I
O O
V I I O
’ 'v v I \i 0 ° 0
5 v "Wyn-u . I. 3...},
V
o C. V‘ I Q o O
Q '1',“ l‘." A O ’°+.9
PYK1 Q .
. TEC1 K m
.TCtB. V YJL218 PDZYEL032W BUBZ
PGK1 ALD4 DAK1 . Glycolysis
C v MAPK signaling
PGM2 I Gycerolipid
0 Cell cycle

 

less sensitivity compared to the aggregate.

88

 

CHAPTER 5

DISCUSSION

5.1 Strain as an Aggregate Response of the Transcriptome to Stress

This research is the ﬁrst to demonstrate the use of the cumulative
response and relative displacement equations as a means of describing the
aggregate response of a set of genes to stress. This response could be from a
group of genes deﬁned either as a cluster, or a speciﬁc functional set such as the
genes making up a biochemical pathway. Describing the aggregate response of
a group of genes has previously been reported in terms of a mean expression
ratio. Gasch et al., (2000), demonstrated the use of the mean expression ratio to
show the reciprocal responses of the ESR cluster to the osmotic and temperature
perturbations. Additionally, hierarchical clustering algorithms often rely on this
mean to build similarity trees from top to bottom (Eisen et al., 1998). However,
using the mean response to represent the group of genes relies on the
assumption that each expression proﬁle within the group is a representative of
the group rather than an individual component.

In comparison, the cumulative response and relative displacement
equations used here to describe the aggregate response of the genes making up
the ESR and four biochemical pathways, assume that each expression proﬁle is
an equally weighted contributor to the group of genes. Rather than measuring the
central tendency, these equations describe the total response of the cluster or

set of genes. Treating each expression proﬁle as a contributor rather than a

89

representative has the disadvantage of being more sensitive to large erroneous
expression proﬁles. The mean ratio of the group, in such a case, is pulled toward
the erroneous proﬁle, whereas the erroneous proﬁle is propagated to the total
response with the cumulative or displacement equations.

Because of this sensitivity, the demonstration of experimental and
instrument reproducibility takes precedence. Good instrument reproducibility
associated with real time PCR has been demonstrated in the past (Bustin, 2000),
and is often used to validate gene expression measured using dense microarrays
(Talaat et al., 2002; Yuen et al., 2002). The research approach used in this
dissertation was similar to those studies in the sense that previously published
microarray data was used to select a speciﬁed number of genes, then real-time
PCR was used to measure their relative mRNA abundance. Because the Taq
polymerase used in this research was produced in house, and PCR conditions
were altered from those suggested by the equipment manufacturer, instrument
reproducibility was tested and found to be excellent, as shown in Chapter 4.
Rather than validating expression proﬁles using microarrays, experimental
replication was deemed a more suitable use of resources in order to give
statistical signiﬁcance expression proﬁles used to generate the aggregate

responses.
5.2 Stability Parameters: Resilience, Reactivity, and Resistance

The biological signiﬁcance associated with the aggregate responses to the

environmental stresses was demonstrated by comparison of their stability

90

parameters for eight different types of environmental perturbations (Figure 4.1).
Because the magnitudes associated with these perturbations varied, it is entirely
possible that under different perturbation magnitudes, stability in terms of the
stability parameters and Moments of Area could be different than those
estimated in Table 1.1. Normalization by the perturbation magnitudes introduces
non-equivalent units, i.e., molar concentration and temperature, reducing the
number of possible comparisons among the perturbations. However, if the
perturbations could be represented in terms of equivalents of stress, then a more
accurate picture of stability among the different perturbations for the ESR could
be formulated using the analysis and developed tools demonstrated in the
Results Chapter. This opens the possibility of addressing questions related to
controlling gene expression by interchanging different perturbations, i.e., can the
same aggregate response be generated or observed by very different types of
perturbations.

Because the method of calculating strain presented here has never been
used in genomics, no direct comparison to similar studies can be made. Indirect
comparisons are possible, however, between the results obtained from this study
and published research describing the expression of individual genes such as
GPD1 to stress. Rep et al. (1999), observed that the time of occurrence of
maximum relative expression of GPD1 increased, with an increase in NaCl
concentration. It was also accompanied by a decrease in the rate of induction
and rate of feedback, terms describing reactivity and resilience in this

dissertation. In contrast, Rep at al., (1999), did not report any estimation of the

91

kinetics associated with these rates. A lag time following the onset of the
perturbation and the start of induction with an increase in the magnitude of
osmotic shock is also known to occur (Rep et al., 1999, Wuytswinkel et al.,
2000). The reported lag time prior to induction or repression was not directly
observed in this research, perhaps due to the schedule of temporal sampling.

The decrease in reactivity and resilience with an increase in the
magnitude of perturbation observed for a single gene as well as for a larger set of
genes suggests that it is a general property of relative gene expression to
osmotic shock. In the case of GPD1, explanations for decreased stability in terms
of reactivity and resilience could stem from direct inhibition of the transcriptional
factors; Hot1p and Msn2/Msn4p, or indirect inhibition by an impeded high-
osmolarity glycerol signaling pathway (HOG pathway). Evidence of the latter was
presented by Wuytswinkel et al., (2000). Hog1p with a green ﬂuorescent fusion
protein (GFP) was observed to accumulate in the cytoplasm with increased
osmolarity indicating hampered translocation across the nuclear membrane. It
was further demonstrated that the phosphatases responsible for inactivating
Hog1p were inhibited, retarding the reintroduction of Hog1p back to the
cytoplasm. The nuclear accumulation of activated Hog1p in mutants unable to
dephosphorylate this protein proved lethal under osmotic shock (Wuytswinkel et
al., 2000).

Hindered translocation and enzymatic activity resulting in impeded
transcription of GPD1 can possibly be extrapolated to the transcription of many

genes suggested by the decreased reactivity and resilience of the aggregate

92

response. Indeed, Wuytswinkel et al. (2000), hypothesized that this translocation
and hindered phosphatase problem might extend beyond the transcriptional
response of GPD1 to the general stress response. Supporting this hypothesis,
Yancey et al. (1982), in an earlier review on the molecular effects of osmotic
shock discussed the proportional increase in the Michaelis constant (Km) with
increased NaCl concentration on enzyme activity.

The three main stability parameters (resistance, resilience, and reactivity)
can be helpful in identifying the effect of such stresses on the general stress
response. These parameters may also be useful in identifying candidate genes

that are most relevant to nullify the effect of a given stress.

5.3 Moment of Area as a Measure of Strain

Relative gene expression is often compared by measuring the fold-change
in their expression (Causton et al., 2001; Rep et al., 2000; Yuen et al., 2002).
This is an appropriate comparison for relative gene expression at a single time
point. In this case, such an approach is less appropriate for time course
experiments because the relative expression of genes is dynamic. Hence, a fold
change comparison describes only one aspect of the response, in this case
resistance, and does not adequately describe the total response. This was
demonstrated by comparing the observed resistances of the aggregate
responses in Figure 4.5 and Table 1.1 and their respective Moments of Area with
increased osmolarity. An increase in resistance (i.e., decrease in fold-change)

occurred between the 1.0 M NaCl and 1.4 M NaCI perturbations, yet the

93

Moments of Area increased because they encompassed the total response. The
advantage of measuring the overall response using the Moment of Area also has
a disadvantage. The disadvantage in using the Moment of Area can be the
cumbersome calculation (at least at present) and the effect of residual error
associated with regression techniques can make precise calculations and
comparisons of overall stability difﬁcult.

The amount of residual error was critical for deciding whether the relative
cumulative or displacement equations were better suited for calculating the
aggregate response envelopes and is directly related to the regression model
selected to describe the envelope. Because residual error results from the
difference in the predicted value and the measured value, it can be minimized by
the choice of an appropriate regression model. In the case of the linear
regression model used to describe the aggregate response envelopes calculated
from the displacement equation, hypothesis testing for lack of ﬁt, i.e., whether the
linear model was appropriate, was not possible with the single experimental
observations made for response of the ESR. It is possible that different
regression models, even non-linear ones, would have more adequately
described the aggregate response envelopes. This would have been especially
true if the aggregate response envelopes exhibited neighborhood stability, which
was not observed for the ESR or genes associated with the different biochemical
pathways.

lntuitively, it seems that increased activity of the glycolysis, glycerolipid,

MAPK signaling, and cell cycle pathways would be required with osmotic stress,

94

and any stress for that matter, in order to meet additional cell maintenance and
energy needs. In terms of gene expression, this would be implied by an increase
in transcriptional response of associated genes. Hirayma et al. (1995), observed
an increased transcriptional response of genes making up glycolysis to increased
osmotic shock, supporting this observation. In terms of stability, the decreased
overall stability corresponding to progressively larger Moments of Area seems
logical considering the taxation on resources and energy requirements to repair
cell wall damage and produce glycerol. However, the observed change in
contribution of individual genes as well as genes associated pathways to the
overall stability is not easy to explain without a better understanding of their
transcriptional regulation.

It is interesting to note that on a pathway level, the observed shift from
glycolysis as the largest contributor to the Moment of Area to the glycerolipid
pathway indirectly correlates to the three categories of osmotic shock
experienced by S. cerevisiae W303. Mild osmotic shock in terms of NaCl
concentration, occurs from 0.4 M to 0.7 M, while hyper-osmotic shock occurs
from 0.7 M to 1.2 M. Finally, severe osmotic shock occurs at NaCI concentrations
greater than 1.2 M with an upper limit of possibly 1.7 M. Growth of S. cerevisiae
W303 has been observed even at 1.7 M NaCl concentration (Hohmann and

Mager, 1997; Wuytswinkel et al., 2000).

95

5.4 Relationship between Stress and Strain: The Modulus of Stability

The exponential relationship between the Moment of Area and
perturbation magnitude observed for the aggregate response of genes supports
the division of mild, hyper, and severe osmotic shocks. Up to 0.7 M NaCI the
amount of stress per unit strain (slope) remains close to zero. Between 0.7 M
NaCI and 1.2 M NaCI the slope transitions from close to zero to rapidly
increasing. At concentrations greater than 1.2 M NaCI the slope approaches
inﬁnity. The range of NaCl concentrations for which the observed change in slope
transitions from zero to approaching inﬁnity represents a domain of strain that
can be deﬁned by upper and lower boundaries.

An upper boundary must exist for which an additional increase in
perturbation results in complete cellular failure either from complete dehydration
or cell wall failure. Blomberg, (2000), estimated this concentration at roughly 2.0
M NaCl for most S. cerevisiae strains using plating techniques. Although no gene
expression studies for concentrations greater than 1.4 M NaCI have occurred, it
is logical that the aggregate gene response would occur over a longer time
period and its maximum displacement would become increasingly smaller with
magnitudes of stress greater than 1.4 M NaCl. Theoretically, the aggregate
response becomes non-existent right before 2.0 M NaCI, which roughly coincides
with the concentration at which cellular failure happens. These theories are
based on the observed trend in aggregate response envelopes described in

Figure 4.5. Figure 5.1 theoretically depicts this upper boundary as a vertical line

96

 

5.0e+7

4.0e+7 ‘

3.0e+7 ‘

2.0e+7 “

1.0e+7 ‘

 

 

 

Moment of Area [(Aifus x (rfus)'1) x minz]

 

0.4 0.6 0.8 1.0 1.2 1.4 1.6
Perturbation Magnitude [NaCI]

 

‘

Domain of Stability
Stmin 81'

‘max

 

 

Figure 5.1. Plot of moment of area versus perturbation magnitude in terms of
NaCl molar concentration. The domain of stability is depicted between the
lower bound of strain, Stmin, and upper bound of strain, Stmax.

for the strain of the aggregate set of 153 genes, which is labeled as Stmax
(maximum strain)

Two scenarios are possible for deﬁning this upper boundary. In the ﬁrst
scenario, a collapse in the Moment of Area, from “transcriptional arrest”, at a
concentration greater than 1.4 M NaCI would result in an undeﬁned modulus of
stability. The concentration at which this occurs would represent the upper
boundary of stability for the gene response of S. cerevisiae to osmotic shock. It

seems logical that “transcriptional arrest” would proceed the threshold of cell

97

death. However, whether this “transcriptional arrest” is an instantaneous
occurrence or gradual is uncertain on a single cell level. It seems more likely that
on a population level, transcriptional arrest would not be instantaneous reﬂecting
the different ﬁtnesses of cells within the population as evidenced by the dramatic
reduction in, yet remaining cell viability following severe osmotic shock.

In the second scenario, the upper boundary would be represented as an
asymptote in which the amount of strain occurring at this magnitude of osmotic
shock is undeﬁned. The amount of strain would infinitely increase as this
asymptote is approached. A weakness of this scenario is the apparent
disconnect between the strain becoming undeﬁned and the need for increasing
transcriptional response as the asymptote is approached. This weakness seen
from a kinetic viewpoint suggests that the rates of transcription and translocation
responsible for the required transcriptional response would soon be unable to
meet the demands for maintaining cell viability.

A lower boundary of strain must also exist for which physiological
conditions are altered just enough so that a transcriptional response is required
altering, at least temporarily, transcriptional equilibrium. This lower boundary
represents the transition from perfect stability to either asymptotic or
neighborhood stability. The lower boundary is theoretically depicted in Figure 5.1,
and labeled as Stmin (minimum strain). The experimental determination of the
concentration of NaCI causing this initial transcriptional response is currently
dependent on the capability of equipment to resolve small changes in mRNA

abundance.

98

CHAPTER 6

CONCLUSIONS AND FUTURE RESEARCH

6.1 Conclusions

The objectives of this study were to develop mathematical tools to quantify
strain at a transcriptome level and to demonstrate these tools in a model
microorganism. Developing new tools to increase our understanding of gene
expression in response to environmental perturbation plays an important role in
genomics. It also indirectly plays an important role in areas that are utilizing the
information learned through genomics. For example, the ability not only to
quantitatively monitor, but also quantitatively describe the response of
environmentally important genes to stress has applications in the area of
biotechnology, biological process engineering, drug development, pathogen
disinfection, and remediation. Because many of the approaches used to fulﬁll
these objectives are new, or have been adapted from other ﬁelds, it was
necessary to demonstrate them in a well characterized model microorganism
such as S. cerevisiae. Vlﬁth this organism in mind, combined with the deﬁned
objectives, the major ﬁndings are summarized below:

1. Gene expression response to environmental stresses can be

described by established deﬁnitions of stability.
2. Overall stability calculated from the Moments of Area of individual

response envelopes of genes are additive and statistically equate with

99

the overall stability of the aggregate response made up of the
individual genes.

. The deﬁned environmental stress response of S. cerevisiae exhibited
the most stability to hypo-osmotic shock, and the least stability to DTl'
exposure for the magnitudes of the applied perturbations.

. Stability in terms of the parameters reactivity, resilience, and
resistance decreased with increasing osmotic shock. Overall stability
also decreased with increasing osmotic shock.

. Genes respnsible for glycolysis in S.icerevisiae contributed most to the
Moment of Area at a perturbation magnitude of 0.5 M NaCI. However,
a transition was observed as perturbation magnitude increased. At 1.4
M NaCI, genes belonging to the glycerolipid pathway contributed most
to the Moment of Area.

. An exponential relationship between overall stability and osmotic shock
was observed on an individual gene response, pathway response, and
aggregate response for S. cerevisiae.

. The modulus of stability was deﬁned as the slope of the relationship
between stability and perturbation magnitude, which describes the
sensitivity of gene response to stress.

. Genes belonging to the glycerolipid pathway for S. cerevisiae were
most sensitive to stress from osmotic shock. Whereas, genes
belonging to the glycolysis pathway were least sensitive to stress from

osmotic shock.

100

The research presented in this dissertation combines concepts from
different ﬁelds including ecology, engineering, and mathematics and apply it to
genomics. An aim has been to preserve these concepts in their original form.
However, because of the compexity of the genome some concepts have been
slightly adapted to better ﬁt our current understanding. For example, the
displacement equation was origanally intended to describe the resilience of
nutrient cycles not an aggregate response of a set of expressed genes. Also, the
stability parameters, particularly resilience, vary throughout the literature in
deﬁnition and calculation method. Indeed, one can become easily and rapidly
confused from reading theoretical papers on ecological stability. For the stability
parameters, the most adaptable deﬁnition was applied to observed gene
expression. Application of concepts from other disciplines to genomics is not
new, however. Clustering algorithms so often used to sort related genes
originated from systematic biology. It is expected that the presented concepts,
will serve as a basis for expanding our understanding of the biological

signiﬁcance of gene expression to environmental perturbation.

6.2 Suggested Future Research

Future research directions utilizing the concepts presented in this
dissertation are potentially many. A few are brieﬂy described below.

Two types of perturbations were deﬁned as press and pulse and the
stability of gene response associated with press perturbations investigated.

However, the stability of gene response associated with pulse perturbations was

101

not investigated. This lack of investigation stemmed from a deﬁciency of
published gene expression data to this type of perturbation. However, such
perturbations are common. For example, the stability of genes important to waste
water treatment could be investigated in response to a simulated shock loading
of the reactor. Further, modeling the relationship between the pulse perturbation
and aggregate gene response could be adapted and tested from previously
proposed ecological models. An exactly similar experiment conducted with a
pathogen (e.g., Escherichia coli O157:H7) and one of the many possible
disinfectants will be of great value in providing information and modeling the
response of the same organism in the water distribution system to varying
concentrations of the disinfectant. Similar arguments can be advanced for
remediation.

lntuitively, gene response associated with the establishment of a new
equilibrium expression should occur, especially with constitutively expressed
genes. Quantiﬁcation of this neighborhood stability was proposed but not
demonstrated in this research. This is because the aggregate responses that
were observed demonstrated or tended to return to pre-perturbed expression
despite the environmental change imposed by the press perturbation. However, it
has been observed with GPD1 and other related genes responding to osmotic
shock that following a return to pre—perturbed expression, expression slightly
increases possibly due to adaptation. The overall stability of genes previously
exposed to osmotic shock compared to the stability of genes never having been

exposed to osmotic shock could be investigated.

102

Finally, the application of high throughput genomic technologies to mass
microorganism identiﬁcation is currently being evaluated. The Moment of Area
and modulus of stability concepts could easily be applied at the organism scale
to describe the stability of microbial communities. If the challenges of validating
organism speciﬁcity can be overcome then the stability of individual organisms
within the community as well as the stability of the aggregate community could
be evaluated to different environmental perturbations. Until speciﬁcity issues are
resolved the stability of a community “ﬁngerprint” could be observed to different

perturbations.

103

APPENDIX A

COMMON METRICS USED FOR MEASURING SIMILARITY (NEARNESS),
DISSIMILARITY, AND CONFIDENCE

Metrics and associated equations for deﬁning similarity, dissimilarity, or
conﬁdence between expressed genes are displayed in the following table. Where
G represent the primary data of a gene for condition or time point iover a series
of m1 conditions and time points, with X and Y representing gene expression
vectors for comparison. For time series, in which expression proﬁles are
compared by rows in the expression matrix, the selected metric is normalized by
the number of time points in the series.

104

Table A.1. Additional metrics used for relating similarity and dissimilarity of

relative ene ex ression data obtained from microarra studies.
Metric Equation Category Source

 

 

 

 

 

 

 

 

 

 

 

Legrendre and
Euclidean _ "' 2 Distance Legendre, 1998;
D(X’Y)_ ;(Xi—Yi) Wenetal.,1998
m Pielou, 1984;
Manhattan _ _ Distance Xia and Xie,
D(X,Y)-§IXI YII 2001
m Pielou, 1984;
Chord Distance Xia and Xie,
D(X,Y)=Z(VX,-\/Z)2 2001
i=1
m X. — Y Y. — 7
Pearson or 5(X,Y)=Z[ ' II ' I
M°°'f'°° ‘=' (DX (DY Nearness Eisen et al., 1998
Pearson (See )2
footnote a) m G. _ E
(D = '
. I; I—m
Similar to Pearson’s with the exception that . .
Spearman objects are first ranked according to their Nearness Xla 23881X'e'
measured values.
Jackknife
(I) (2) (m) I Heyer et al.,
See footnote . Nearness
( b) J,_, = mmISX, ...S,,, ,...S,,, ,..5,,,, ) 1999
Zmin(X,. , 7,.)
R PerCem C(X, Y) = 100 — 100 ‘=‘ Conﬁdence Pielou, 1984
emoteness m

 

Zmax(X..Y.-)

i=l

 

 

aEisen et al., (1998) uses Goﬁse. instead of the mean of G over m observations. Gags... represents a
reference or standard value in which to compare with G,.
t"l’he Jackknife correlation is really just the minimum value from a correlation set obtained by

deleting the ith observation and calculating a new value, SW”.

1 ()5

APPENDIX B

MATHCAD PROGRAM USED FOR CALCULATING THE MOMENT OF AREA
FOR LARGE DATA SETS

The following program was used to calculate the Moment of Area for large
data sets. The program can be divided into four areas: 1) linear regression
analysis, 2) statistical analysis, 3) Moment of Area calculation, and 4) generation
of and output table containing results. A detailed explanation of commands used
for the program can be found in the software manuals.

l06

MOA :=

 

Alc— READPRN "C:/test/test.txt" )

"Regression Analysis: ﬁnding coefﬁcients"

for al 6 (1.. (cols(A1)— 1))

cIan" Iblalbi
dlai" Iblaibi‘
elalé— iblal,
flalt— ibl

glale— bel

 

81,0

Clc—augment(c1,dl)
D1+—augment(el,fl)

Eli—augment(Cl,Dl)
Flt—augment(E1,g1)

Gl+—(FI)T

bla'c—linﬁtIA1<0>,Al

0,:
31.0.3.0

”4,0

<l> ‘
3 ,AI

WRITEPRN"C:/test/coefﬁcients.txt" ) :=Gl

"Regression Analysis: multiple regression statistics"

Hli—submatn'x(Gl,0,(rows(Gl)— l),1,(cols(G])— 1))
for hl e 0.. (cols(Hl)— l)
for ile 0.. (rows(Al)- l)

jInf—mom T HILhi'Ali

klii,hi‘—j]hi

llo—submatn'x(Al,O,(rows(A1)— I),1,(eols(Al)— 1))

"Regression Analysis: multiple regression statistics:SS(Residual)"

—->
Jlt—(kl-ll)2

for 116 0.. (cols(ll)— l)

mini—”£11“>

T

nlo—ml

"Regression Analysiszmultiple regression statistics: residual standard deviation)"

————)
0114—— _n_]___.
(il+l)-4

107

MathCad Program (continued)

 

"Regression Analysiszmultiple regression statistics: SS(Total)"
for “IE 0.. (cols(.ll)— 1)

<1”)
mnmh'le
il+l
I'lllt—mllT
for 1126 0.. (cols(nll)-— 1)

for 1136 0.. (rows(ll)— l)
mlzmc—Ill

nlzll3,112‘_m]2112

<ll2>
2

.2
"”0,ii2’

113,112-
nl3nzc—an

ml31—n13T

"Regression Analysiszmultiple regression statistics: SS(Regression)"

 

————)
nl4e—(ml3—nl)

"Regression Analysis:multiple regression statistics: coefﬁcient of determination R"2"

‘3 i
n154— Im13— nlI
\ m13 /

"Regression Analysiszmultiple regression statistics: F test statistic calculated"

n161—

 

(i1+1)— 4
"Center of Mass"
for ole 0.. (rows(Al)— l)

 

ph—AIOLO
for ql e 0.. (cols(Gl)— I)

up]

rlqlt—‘I Glo‘ql+Gll’ql-H-Glz'ql-x2+Gl3,ql-x3+(314‘qI-x4 dx
0
0p]

slqlc— x- Glow+G11,qi'x+Glz,qi'x2+GI},qi'x3+Gl4,qi'x4.dx
0
Op]

thlt—I (310m+GILqI-xl-(112.q|~x2+GILqI-x‘i-rGIMII-x4 de
O

 

108

MathCad Program (continued)

"'Center of Mass: x-coordinate"

sl 1
ul 4-—_q_.

q] rl
ql

"Center of Mass: y-coordinate"

tl 1

VI l<—0.5-—q
q r1

ql

 

r11«—submatrix(rl, 1,(r0ws(r1)- l),0,cols(rl)— 1)
ullc—submatrix(u1,1,(r0ws(u1)- l),0,cols(u1)-— 1)
v11«—submatrix(v1,1,(r0ws(v1)— l),0,cols(v1)-— 1)
"End"

"Moment Arm"

for W] e 0.. (rows(ul)— l)

  

xl «—
wl

 

, wl,0
x1k—submatn'x(x1,1,(r0ws(x1)—l),0,cols(xl)—1)
"End"

"Moment of Area"

for yl e 0.. (rows(xl)- 1)

zlyli—rlyl-xlyl

zll<—submatrix(zl,l,(rows(zl)—1),0,cols(zl)-l)
"End"

"Organizing Results"

 

ch— augment(m1, r1 1)

L14— augment(K1, ul 1)

Mlc—augment(Ll, v1 1)

N11—augment( M l , xl l)

011-— augment( N1, 21 1)

PI«— ( "Residual Standard Deviation" "Area" "x" "y" "Moment Arm" "Moment of Area" )
Qlt—stack( P1,0I)

ol 1

 

109

APPENDIX C

DESCRIPTION OF GENES GROUPED ACCORDING TO BIOCHEMICAL
PATHWAY USED TO STUDY RELATIVE GENE EXPRESSION T0 OSMOTIC
SHOCK.

Genes studied in response to osmotic shock are listed in the following
Tables. Description of genes and related information was obtained from the
Saccharomyces cerevisiae genome database (SGD) located at
www.yeastgenome.org. Diagrams for each pathway can be obtained from the

Kyoto encyclopedia of genomes and genes (KEGG) located at
www.genome.ad.jp/kegglkeggz.html.

110

 

 

  

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

4.020 0.4. 00001330 9. @0300. 0:0 30: 0000300350 01303 0040. 0:80.30 0340.30 42 30
0_<0o_<0.0\0_coo:00 0300.0 003,23. .
n H"
00403020 03:0qu 00:0 00002.30: 0.3530 003030 00020300 «0.23 4.3 E0030 00020200 303 33
2030 2030 00334.08:
35092 009103;: 3:206 5884 mom: 8 4>0300>>>000>>00300>00 000030300>0>>0>0>30> a
42.800 >00; 082-09» $59208 40000.: 30>0>00400>0>0>3000 3>000>00>0300030>>>0> %
”00.00.8000 5.0—0.30300 \ 0.0000 4- m0.m.4 om 0.4 0.0 000>00>3030>0040>>> 30030000004>4>4>>00>04 4
9:800063000306 400303000 monmu; w 30>03>>0003>>0>04000>> >0300030044>0300>0> luml
0585308 m0»: N 304>00>00>00>>3>>0oa 00000309308350 -03
630363888 5308 many.» 0 00>0>>>0030030040>>>> 00030003003030?) I0Mi
03000382003040 35000 monmbb 4 >0000>0>440043>0>0030> mo 004>>>0>>00400>030300 4
42303038 83368308 mos»: 3.4.4 30333003009000 8 3300>0>0000030>0403 8
€503.30: 646-030.: 0_no:o_
6.880 3: 388383600386835 momma; 4 00000030330039 I81300000>0>04003004>> a
<Ummmo<< >mo4o 35208 0000335000 mom»: 4 0300>>000>30>00>>0> mm 30>>030>00>00>>0>000 mo
<mwod<< 2.00050 2838 83338800 mos»; 0 >00>00>>03030040003 mo >003000003>>0003>30 mo
<mm4um<< 00»; 35.208 00303003000 m4 833203. m0; b.» 4 >0>>300>0>3000441030>00 mo 3000>30400>3>000>0>0 mo
06:0 0:93: i
30800 $03. 1.9 56:88. mo”: 4 >0>300303000>>0>>00 mm 000>>0.3>03>>0000>0040> 00
00840 .0000 2256 8869208 8230 0 mos; .4 a 004>0030>>330000030 8 3003>00030>0000>03>0 on
0280 30x0. 008 383.83% 06:86:06 mosh; ; 30000400330034.0303 00 cEEF>>00>>0300>0>>0 mo
<0m~soo .033 0-030030304342000 momma; 2 0300004400,...000300454.
59800 has; 4296002830300 madame moss.» 8 0003030093930 00 90003030030930 8
#330 $3 46688.405.038338 moriu : 0030033000303>0003I mo 0>030>>030>>300000>> 00
2380 002.0 36832863508 mods.» N 00090030033303? 0» 00300>33030>000003> a
3.380 2.8 056.34% 836883022533 m0;.~.4 0 00030303300300 00 00304030403004004003 103
<53 30 >00>000003>>0>00033 E 0>00>00300>>>>003003 8
042.0800 0x00 8-88-058: $5308 mowed? 000>>00oq>0303040300l an 0>004>>0>0000>0>300>0 o.»
$000054 3.00854 8208 6.048 0885 4 mosmé: 3>30000>>00>00>3300 1% >3030300>>000030030 8
<2ro§<< .53. 00>» 3:208 00308002000 mm .00300203 mounag .4» 000303000>>>30303>>00 mo 00>3>300>>>3004000>00 mo
<Oromm<< Team 03000300200308 35000 monm.s.m.4 0>00>00300>00>0>0>>0>3 mm 0003 3: 2.160. _GCC>._. 001
50032 I353 183% 8363888 W! 0.4 .0 _>4.0300000403030>040 mo 1 03030>3>00>00>000>>00 ml
<23 40 72084 «0 93926003400 00308003000 m0; .m.._ .A _j400000>0._.43>00>._.04.0 mo

mm _40>>0>0>._.004040>00>00

 

Ill

300.0 0.». 000002.030 o." 00:00. 0:0 30.3 008300330 03.303 00.0. 0:80.30 080.30 81:0 0:808:30 30802.03
003.20%

 

 

    

 

 

 

 

                            

 

 

 

   

 

 

 

 

 

 

 

“‘ IN"
9083020 00030030 0030 00003000: 003030 00020300 «0.0.0 33 00.080 000.0000 «000.0 33
2030 2030
8.0800 000.. 000. 8880806 gasscmamaama 03.303000030303000 I010 0>030>>>>>00>nﬂ0>00>> 8'
80.85. 80.02,. 0.308.028.0000 0000.30 >>003000>00>>0>00303 8 030>>>>0030400003300 8
<05an 000. 3832.00.02.30 $35000 3030000300>00>>0> 03 30000>300>030303030>0 8
c.<003o..w-u:00u:0.0 00308003000 m0.......m >>300300>>>0>30300030 mm >30>30300000>00440440 mo
z>o+
.ﬂQOS02837938306 00883300309003 3.603030000300303 3
E05888 llll
0000.09.65.0087.0.8087? m0.m..\.m.u ﬂ03030>>>>>>000>00>00 mo 0000303434030030030 mo
<0...mm<< 003 833000300 00308003000 m0”..n.... 3030§3003000>000 03 >>300>0>CCCCC3C>030>3 mo

6.503.030: 6:00:03 0.00:0. 4.. .. ..

LFL

 

 

 

 

 

 

 

 

<Ur~000 >>UA 050.20 02.0.88. 00308003000 440300000>>>03>03>000
1.0.8000 30..N 3026.808. .5000 >>00>0>300>30>30300>00
<Um33<< mx: 0308.03.30 x3000 >>>3030>0>>000>>00>00
45.000. 0 2.50 89000.20 0.8.30 2600:0800 m0.0.. .0; .0'0Wo>0300>0030>>0>3>3

 

0.2.0-. .u-30::o0<=8:08800 mow}. .- 000>>>30>300003>3>00 mm 00>0>0000>>00030>>3> mo

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

   

 

 

 

 

 

 

 

 

 

3003020500330 033000 m0.N.~.m.m >34000>0>0>000>0>0>0 mo >0000033030033>>30300 mo
o.<88.-0-u:o00:0.000 m0.0.0.... 003300>0>>00. . _0>3000 mo _ _ . _0>0003>0..40>0000>> mo
<mwouw<< 2.00. >rom 0.00300 00308003000 m0...~...0 >00>00>>030333003000>3 mo >003000003>>000>3340 mo
<mrommo >>Om 05030 02.0.88. 00308003000 00>000>00. . . .0>033003 mo 000>300>>G€ .C.C>03033 mo
E0.
<05me 00.... 06:0-..0-303800830403000 0030>>340>>30000300 mu 00000>0>3030030>>>>> 03
<05030 .523» 0.30-.0.3032000203030800 000>00443>00>30003m3> mo . .C.CCC>>>>>0>3>000030 mm
<0283<< ZED. 03230680308. 03005203083000 .. mm 000. . . . _ . .0033003000 03
<0mio<< _.umUN 0803020500330 00008931000 mm 0>>00000>3300>344>0>03 mm
<0mmo~0 003.. 003.. 08.30-080.088 03005203083000 30000>>03>3003003>>>00 mo 030000>>0>300>>>>0>30> mm
003 -
<Irou~0 0C3. 0.808. .8000 30030w0>>30003>0030>3>> mm 0>00>>>>03>0000>00>0> mo
<Im.nw<< mn3. 03030.033008008203000.000 30003>0030. . . .000300 mo 300030>3>30300000 mu
m3Iv3v . - ;Il|‘ ll
<....o§<< 2.230 W63;0-3038002030308000 >>30>>0300>>00030300>0 mo 0>300303030>00444000 mo
$5000 0c3~ _o.<00_.o.-u-u:o00:0~0 00308002000 >>30>>>30000030>300> . mu 300>>0>>>000>3>300300 mm

 

 

112

ngm 0.». 803.26%

 

 

 

 

  

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

   

 

 

 

 

   
  

 

mkmNmszo mamzqqu 09$ D3326: m:~.<3m ‘63qu 93538 3.23 3: mm<m>mm @3538 353 3:
ZmSm Zm3¢ 003386:
558$ 558$ ucnmzé m8€¢m=m§mw¢ 3 1x1 fqum >4>OOOO>OOAO>HOO>>30> mm 40>o>oaoo>ooaoo>>qoo> mo
. . . 690:
$9080 mOxN uanﬁoSE anﬁmwodmsﬁammammm- mom‘s; .- f: .- >>>>QOOOQ>OOA>A>OOHOAO mo >Od>o>40>>CCECCCC>>j mm
85.638 2.55 8385.8 néaosbo)
icing \ U-u-:<q«ox<mn<_ 00>
<xmo$o mud: 3832.38 0 man: 55 EC: _ COFFEE: :0 mm OOAOO>OQ>OO>AO>AOJO>4 mo
<9:qu ox: 030:3 {ammo momma; .mw 4>>AO>>OOOOOO>>>OO>O mu O>joooa>>>ao>4>amoooo mm
<§rouo<< 023 azﬁaémoonosm 5.5mm mounagmo 0004004000H0>>0>>>>>> mm >ao>>>00000040404>004 mo
ismwouo mxmu ﬂuvamﬂmécom: $333,». moﬂmk.‘ .uA ooo>>oooa>0401049400 mn o>004>>0>oooo>o>>amm>o mm
<25uoo on: Emn<_c_<oma_ n30:333033233333 moHNQaN 043.002 _ _ _Oo>o>oo>oo> wm 4004400>04400>0>j400 mm
<Oroww<< ovon. QuUu m_<88_-m-u:omu:m8 3398330 mOHZPm >oo>aoooo>>>oao>joo mu 4.0502 _ _ 50400000>40 wu
:z>o+~ I l
<O§No<< moi. oo< om_onmm-5ac8a “.885 S m_ao\xm8 mo; 9‘..- >>OAOOOHO>>OOO>40>VOO 40>400>00004>Ojo>ooj mo
L L 33?
<Omu§<< >6» mamzﬁa magaqommammm mo; ,N; .u 3,404.3.0000040440499040 OAOHO>>4>OO>OO>OOO>>OO mm
ﬂurmomo £322 8 9528:8302. 93?: >400004000>>3040>> ¢ mm mo
crow r L. .
$024 8<< 23 GDP...mmo<_c_<83_:_=om=o_ u- >OOQ>>OO>AOOO>>O>>>04 >>>OO>>>O>0>AO>>QOO>OO mm
30m :25 Emamﬂmamm
H

  

 

H3

400.0 0.0. 0000202000 0* @0000. 0:0 50... 0000300350 04.304 008. 0000050 0400050 84 50 2:800: 0,9200“.
x5000 3305 0600.50 005.20%

 

 

 

 

 

 

 

 

 

 

 

 

 

 

     

 

 

 
 

 

 

 

 

 

 

 

 

 

 

  

 

 

 

   

   

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

II“ J"
9083020 0800040 0000 00000000: 0.00050 003040 00000000 3.23 43 m0<0400 00000000 35:. 43
2030 2030 0033.00.00
5392.2 000§| 040mg” 0038000012 00030 10033003300300.» 1%
<wro..0<< 0cmu 3.800:-0Q.<0.00 0885 5:000 A.<.>.u mOHNQ. .- 0>>040>0>0000>0>0040>0 0n 00000040>04>j4000>0> 00
.5000
0580 0x0. 038.”. :38 0.58 . .28 : mom: 0- 0>0>4>4400>>0>00>40000 a140o>>000>04>4>000>>00> lama
<mmomm<< 4m0. "400004.053. 4000.08. 0‘ 4E 008800.03 40>>>40400>40>00>>>000 ﬂ>004440004>0444040000 a
<mx~oo<< wmz; 0.00. can 03060000 300.084 _ I 0. Z >_ I 000000>00 ml 40>4004>>0>>>000>>0>00 4
440503 0cm. 30.00156: 08.05 044000>>004>00400400>0 mm 040>000000>0>j>0>>>> um
<0... um<< 04mg 00:92:: 0885 .5000 0* 250 .5000 W009. .- 0>00>>0000>>>>40>>004 um 00>404>4>>400>40000400 mo
.5000 ‘03..
<9.»qu <00. 2<000300N03 0:000:0«0.0< 504300.00 monmgun 400>40>00>40>44000>4 mm 3440>000400>004040 aw
<Um.ow<< % 03083020 0.000. $003020: 005<<0< 0040004))0340004000, lad!
885
<mx. . .0 m<<EV 2.04. ”40:00:26: .0008. >3004000040>>0004Q4 %
<mmim0 5<0.<00 5 50 100. 363-00.00.01? 000400400>00>00404>40> Imml
0.9.0. . L .. 003<<0< II
00083000 0.030 00084 4000084 4004000>4>0>0300>>00 mo 00400000>0>440>40>4>>4 mm
_ 300000600: $55000 moﬁmbg .HX >>04400040>>00>>00>>00 mm >00>0040444>>00>40000 4
023353850 0385 :38 a 50 mo»: .- 04000>>404>00>40>000>|a 40040440000300.3430 a
mex «03..
<0momm<< _004. 000.000 < m0...:‘..0 4400>00401404000>4>040 mo 40>040>>0>CE _ .0656). 00
$0830 m4m~o m0:=0>300=50-0_.205 5:000 m0..~.u.. .- 00>j>00>>00>00>0400>> % .....0.3.000>00>400>400> %
5.5800 003. 000. 000050 30.02.000.350 0865 0.030-. _0>04>0000>00>00>>>>004 mm 0040>13400040>00Q34 mm
0003. 26. 295: LIII I
5.2600 94» 004539400350 .<.>0. r5000 m0um.u.. 0- 0000>0>>0>>0>>40>j400 mm 4000>004040>40400>40>4 mo
$408005 04m . N 200000030: 0008. 4>000>>>000>>4004400> 00 >>>40040000000>4>>>> ﬂ
5530 2.2. 00030.04? 3000300003 $0.03 0865 monmawﬁ >04>00>>>>> 40>40>0000 mm 0>004>0>>44004400>0400 mo
<05me 00x. 0045030400350 0885 5:000 m0..N.~.-.- 0>00>00>>440040>00>0>0 0» 4.9000010000304040 a
ll 005.95.00.00. '1 ||
<0:an .000» 28050 0605 50000 0* 50 .<.>.u x5000 mop: .- 440>>>>00404000>0004 mu >000000>>400044>44>>0 mm
.5000 ‘03..
5.0540 02». 0<0.5-0000w.a03 .5000 53.0.84 8x: O>ﬁ44000>00440>4000> m4 440>400000>>0>00j40 m4
$2.300 M4mu 00083000 > .0084 4000084 0>00>00>4440040>00>m>4 mo 14004>>OQQO>0>40>4>400 um

 

 

 

 

114

400.0 Ow. 80:13:03

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

96833.0 @8038. 00:0 0302.02.00 @550 002503 @3508 35.» .03 mm<m>wm @8508 3.33 a
2030 2030 0033.09.00

<xxomm<< 2...“: 3393-..!“ 038.: >>oo>>o>aooodo>4>o>oo> lmmldOOQ>O>4>oo>QjAOO>>Od 4

<rmoomo mmxg 0.330.012 3060300302 $683 088.: >4>>OOO>>OOO>404000> mu >OOQ>030>04>>400000 I

<5; 85 1004 3.80000920an 088.: Eamwm momma A . 00040044953134.0000 mu OOO>O>AOOO>>O>>040>040 mm

<C-~mo 000$ 00.0.6.0 8 30 74.0 magma? 0* mmméa deOO>CCC 2.; a JO>>>>0 mm O>AO>>AOAOOOA>OOO>OO> a

365m

<rmw$<< 0x9. 020‘ 04.933068: $3508 ” . 4 . >jo>o>00040>0004004> mo an>>q>oa040>>oojooo> ﬂ

<rmum~<< mam: was: 088.: 5:03 .3023 .2 Em moumuga >>oojo>0400>jo>ooo>o mo Hadogaooogc_ iron: mo
3036 £000.55 00353 II

55520 Grog 5336.503630 Emmm maxi; .u O>OOOHO>40§>OOOO>O mo 00440049540000.3on mo

<58qu ZmZN £80m 23003020 Seemsq 088.: 44>40>OOOOO>>Q>3>OOO mm 3000>400300>>>0040 |%

15.8qu .3023 0:830 "Samozgoaa 09222 0* £030- O>ODO>>>4>OOJ>OQOO>AO mo jo>j>mo>jooo>oooo #1
m 0050 0:00

<2romu<< Emma %cm_-m0mnsgq 086.: 23min. monwg‘ubm ooao>>00>oj>qoooo>jo mo O>Oooom>dojo>4>oaoo>o ml»

30m :mBmm
<2roomo m>mN 9.00. 0040-2:03o 0665 00>>>AO>>00040030440 mm doooooodooaoaoqgoa Ia.
042m

<z_.~30 m2: wz: 0885 335020 65“.: may j>004>00>>j>00000004 mo o>000040404>>>400300 .Mol.

<zmow40 mme 3132360350 0383 5:000 momma}- 4000>HOOOHO>>00494>00 mo OOJ>OO>A>AOO>HOOOO>> mm

<Omoomo m5; wroa 083.: 06880.. ojooo>4>03000>>oo>o mo >>>>4404nwo>>ooooooao mu

<Om3m<< mama mcmazm acoﬁonaméaaim 086.: 003 0>40400040000404>0>>>0 mm oo>jOHO>OJOOHOO>>O>> a!
mecca:

<OmNS<< 2.52 8130\380050 088.: 5:30 monk; .- O>>4OHOOO>>>400>>OOO u mu O>>oqooo>4400>0040>>04 la

52.0500 0.9 :20 Eammmémmoamﬁn 0385 >OOOH>QO>0O>>QAOOO>>HQ mo doo>qoooojo>>>OHoody Imml

$2.800 NE: 323 8000308081.? 088.: >o>400>00004>>>400>004 mo >j>4>400004000>00>00 mo

<03mm<< mIOA 9.0-0.3050 066.: 0‘ 30 30 magma? oo>ooo>0>>>oo>jo>>o>> mm >OH>AOOOQJOOOOO>>AOA mo
0* 03.58 8213

ll

 

 

 

 

 

 

 

300.0 0.0. 000000008 0.. 00:00. 0:0 50: 0000300350 01303 008. 0:00050 08850 81:0 00.. 08.0

005.20“.

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

   

 

 

 

 

 
 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

00083020 080003 00:0 0000:0000 m:~.<30 00303 00020300 3.20.» 3.3 30880 00000000 «0.20.0 ﬂ
2030 2030 0033.008: -
<>romao ram. 000540 08:0:00 808. >>Oooo>>ooojo>>004> 03 >00004000>404>>0>0>0>0 mm
50,580 9.20. 02”.. 9.0.5. 0.8000030 o>>>0>00004>00030>40 mo O>>ooo>>>400400>0>>>40 1%
<<I..

59.00.00 000.0 00.. 0.<.0.0: 0038. 088.: .0 manna. .- 4>>CCC. _ .G>000>0......Oo> 00 03>40>>o>010400000> ﬂ

<mro.m<< .ucmnw 3.800:-00».<080 088.: .5000 :52... manna: .- 0>>jo>o>oooo>o>ooao>o mm ooooojo>04>3000>o> mo
x5000

<0mo0wo .0100 8080.088 00.0 0:000:0800 0805003 mouu; .0.» HOOO>>OAOOO>OO>403 mu 40014400>OO>400>>0004 [010'
000

<00. . no O<Om A0.8000 80800.0: 300.08.. 088.: 0.00 oo>>003>>400>00> mm 400>H>0000000>404>04>> a!

<00. 000 Im: 508:0 0.35020 850.5. ooaoo>>>>aooo>qooaoj 00 >joo>o>ojooo>>oqomo q

<00:me 02m. 0<0..:.0000:00:. .5000 800.08.... 0095.. >OO>40>04>40>00004400> mo 004040>04>000000>0>>4> mm

<00. um<< z.m0.. @200. 80:80 8.. 02> 003000 50:80 >>j>000>00400400>>>0 mm o>040>>ooooqojooa>>>0 mo

mmm. 0:00.605. 8000:000 5 0... m5... 52.0 m.

0:0 032. 5 3.80.0. 0.3.83 8 0.0-.50000

<mm.mo<< 00000. 902.0 0<0_.:-0000:00:. 088.: .5000 00.0.3.0 mOHNN... O>O>CCC _>_ 5300090040.» mo >>0>oo>>>>00>000004> mu
0000::

<wm~§<< 01.3 08000.0 00...:0>:8o:.:0-0..085 x5000 momma; .- mjoo>ojo>1000>0040 mo 00040036404040.3900; a.

<omomo<< menu 00020008.: 0.0-208 .0030800 mOHmb. .m 040>>>O>OAOOO>>HOOOOA mo OO>O>O§OOO>HOO>>10 q

80:80..

[Snows—O ...C.u. 00.00000 80800.0: 800.080. 088.: _ OO>O>>A4000>>>>O>AOHOO mm Doo>>>>>j4000>40400>0 Imml
3.8.0 0.338030 333.50... 83088808003083 lam» 8838883000300 8:
00.. 02.0.0: 0038. 0885 v. mo.~.~.. .. O>>0040>>j>0030>0000> mo >>00>00>>4>>40>4000>00 [Imml
00350::885058050 088.: .5000 mOHNQ. .- OOO>OO>O>>OO>04>QJOOA mm >>OOOO>HO>OO>O>H>HO>O> 40-
. _ . 00.. E .. x5000 .
800.082 0:95.. 83 00000 0885V 00009940). . 53000.04... 0.x jooo>>>>o>oojooo>o mw
x5000

<00. .00 00.. 08.0 8050.2 joo>>0>om>>o>40>o>ooo mo Agjooo>040>aoo>04o>0 00

45.0300 8:080:80. 8083 0>040>4>>00000>00>40>> mo 400000>00304>400>>0 mo

Jammie 02> 800: 0:00.605. 088.: 00>Oqooao>ooja>ooo>>a mm Amaojoo>0>aooo>ooj40 a
00...:0>:80:.:0-0_.085 x5000 mouN: .. 4003>>>oaoo>001004004 mo O>>OA>OOOOHO>O>A4>OOOO d
35.0: 8300030 3058880 088.: w >OO>FGaOO>OOAO>jo>>H mm oojoo>04>>>>aooooo>o 1%
20:00:08: 808.. 04>00>400>>400§CC Fr.» 0» FHOOHQOOOHQEOOQAO... mo
00.. 08.0 0:00.605. 088.: O>OOOOO>>OO>OO>4>>>OH mo >000400000§>>0>>040 mo
2.338030 08602.9. 038... 3008003:v . 000. .580 00 88033000800101», la
8.. 02.0.2. 838. 038... 80803030800833.0401 x 00 0009808330380 .8:

 

 

 

 

 

 

 

116

4008 0.... .00:..:000.

 

 

 

 

 

 

 

 

        

 

 

 

 

 

 

    

 

 

020830...." 08:00.... 00:0 00008.8: m:~<30 0020... 0000080 «000.. 03 00.8.00 0000080 «000..
2030 2030 0033.008:

$2.800 0.92.. 0.5 00.000.20.50 0.0.0.: x5000 m9»... .. 0>00>000..>..0000>>>0._4

8.00000 0100 0:000:08 $083 000...<0 800.08... >0004>>>0>00>>>0000>
0.0.05 010..

<Qroo00 00.... 0000208000050 00.208. 0.. >00- 40400000040340.0130;
00.:08 000030... 0:00.605. 83008:. 030>0400>jo>0400>00>
00.. 0.<.0.0: 0038. 0.0.0.: 00264040000300.0020,
.:<0.<00 5 80:00.8: AOO>0040>304400>0040
00.50.582.50 0.0.0.: .0800 mom»... .- 00>00>0>>004040>>40044
.:<0.<00 5 800.0..0: 0. 0050.0 00.0 0000. 00>>>j>0>>00000>004>>
000.808:
020.5. 002000050 . .>_G _ CFC .C>00000>>.:
9.0.5. 0-...00 >402... .GCC . C. 50.4.0000...
0000:..0. 3.8..0 0050.0 008 08.0.: 000040>j4>000>400>>>
0:00.605. 00..:0..:..00:.:0-0885 x5000 mow... .- .....0...0000>>>...0..>0000>

 

 

 

 

 

 

 

   

 

0<0_.:.0000:00:. x5000 550.8.

H>>H>>AO>HO>OOOOOOO>

  

 

800.20... 800.080 00.8. 0350 00.00

OO>>>HOAOOO>>OHOHOOOH

   

 

 

00508-000030... 0:00.605. 0.0.0.:

jO>O>OOHOOOjOJQOAO

 

 

  

 

 

 

 

0:.0300030 00080020: 0.0.0.: 00>O>j>oaoo>0000>0>>
00.00_.0:00 0.8:. 03030.5 .>o>.1..0>>cccrcr . C . CC) .
020.500.8803 x5000 5508. .02.. 0>j4000>0030>4000>

 

 

 

 

 

 

 

 

3.80.0 .::.0.8. 0.0.0.: x5000 m0.~.u.. ..
00.900030. 00003:... 0.08.: - 0080:
0:392:
800.80 8. 0.00000 80800.0: 0:0 8.
0.00000 0:0 00.8: 20:000..
5.0.0.2 10... 0.00008 00..:0..:80:.:0-0.085 x5000 m0.~.u...-
<.._uooa<< 0000 0:05 8000:...0: 00308.. 08.05. 00003.

<5»..on m5.

 

0

   

 

 

 

0000.88 0:0 5:58. 0. .:0 8.0.5.
L . 0.08.: x5000 00000

OHOOOOOHOOEAOJO

 

 

 

 

<5». 80

00000

  

    

800.80 8. 35.0:8300030 30583300

 

0:0 550.8: 0. 0:.0300030. 02>
8 .808:

 

 

>. .C. .CCCC>.CC.CC:C

 

 

 

 

 

4020. 03. 603.253

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

III [In
0533020 03:qu 00:0 00003.28: mz~<30 3ng 00020300 308 43 mm<0am 00020300 3.23 ﬂu
2030 2030 00337.06:

5.4335 9.? 80:? 033000050 40>>04>0>0014>o>400000 ﬁ40040>004>4>000j40400 a
1.330 ZmOu 0~.000050 0:00.693 0385 40>00>4>00400>4000>0>4 mo 0>0003400>OO>4>>000 la
$50800 4m_<: 040.0598 03.03 2 50 80 0:00:03? 404440>0040>0>00400>0> mo 0444040000>00>>>>4>00> Imml
<_<__.omm<< Own: 0463 3083.20: 0030.2 088.3. mecca: 0044000))004>>400>400 mo 40000>04000040>0>>>4> mo

.
$5483 0 000m 268.: 5:000 530: 8202030 B 30 monnkgu >440>000>>>>>00>00>0>0 um 4000>>>004>0>4>0400004 %

0 «:0 NE 0250»
553800 25.: 2.4.5000 30:00“ 0330:0300 moo; ohm >40>>04>4004000>040000 mm 0>000400400440>>44004 %
ismommo wcwn 00: 30.0 0200" 0385 >4044>o040400>0004004 mo 004>>>400440400>>00400 mlo.
552805 dumb mi:m<mﬁzmémseommﬁzogsmﬁ $35000 moumhggw 004040>004404>00>40004 mm O)>0§0000004>>00> m4

E II: II
<Zr~mo<< 0.05 . 1008 20:? 90.000050 >>>CC _ G _ CG>4>00>4000> mm 0o0>04040ﬂ40440004>404 mo
<Oxo~m<< mcmu. 00: 98.0 02.00, 0885 40>4>004>04000>00ﬁ400> mo 40>00000o>>40>014>4 ﬂ

0304 m

<Omwmm<< x)? 4 02> 003000 0309093 00:22 068.: 0OO40>N000400>044>0>>0 mm 40>O>0000>40>00>40>> uu
14385 VIOmm 295003303 0885 £300 manna; - 30004>44000040>>0>0 mm 40>>400>40400>04>00400 lmdl
<vﬁmuo 3903 00333: 0883 5:000 . _ _ _ _C_ _CC_GC 5.9400400 l mm 400040>40000>4>0>040> mo
13.805 009 02> 00330 0309003" 088.: >0>000000>040>>>0>o>> mo 044400>04400>000>0>0>0 q
<tr~mm0 9.2» 96:? 045000050 3004o>00>00>04o>>>0>0 mo 00>000040>j4014004>0> 'mlol
<2": A d<< 0030 00: 906 08.05 x5000 6580 8 000% mOUNNT >403404000040>>000 . m4 >>40004o>00440>4>40000 0'0-

 

 

 

 

 

 

 

118

APPENDIX D

GENES EXHIBITING A SIGNIFICANT INCREASE OR DECREASE IN
EXPRESSION STEPPING FROM 1.0 M T0 1.2 M NACL

Genes exhibiting a decrease in relative expression of at least 0.5 and
those genes exhibiting an increase in at least 2-fold expression stepping from 1.0
M to 1.2 M NaCl are displayed in the table below. Relative expression values
were obtained from the ratio of the 1.2 M NaCI data and the 1.0 M NaCl data.
Negative values indicate a decrease in expression stepping from 1.0 M to 1.2 M
NaCI. Positive values indicate an increase in expression stepping from 1.0 M to
1.2 M NaCl. Highlighted genes are discussed in the results section.

119

 

Table 0.1. Genes exhibiting a signiﬁcant increases or decreases in relative
expression at the time of maximum displacement. Comparison between the 1.0
M and 1.2 M NaCI erturbation ma nitudes.

 
  

        
      
 

     

Systematic Standard Gene Description Pathway *Change in

Name Name Response
from 1.0 M to

1. 2 M NaCI

 

“Genes exhibiting a decrease in relative expression of at least 0.5 stepping from 1.0 M to 1.2 M
NaCI.

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

YBR136W MECt 53$? $622313?” '"duced Cell Cycle -o.77
YER173W RAD24 Cell cycle checkpoint protein Cell Cycle -5.04
YGL003C CDH1 ﬁgggg'géi‘tsggﬁfgggvam °‘ APC‘ Cell Cycle -5.os
YGL086W MAD1 Spindle assembly checkpoint component Cell Cycle -9.77
YGR092W DBF2 Serine/threonine protein kinase Cell Cycle -0.77
YGR113W DAM1 Essential mitotic spindle pole protein Cell Cycle -2.13
YGR1 88C BUB1 ﬁgzcszlépolnt senne/threonlne-proteln Cell Cycle -2.64
YIL046W MET30 gigjﬂrfgifgu'ates sum" am'"° ““5 Cell Cycle -1 .35
YJL030W MADZ Spindle-assembly Checkpoint protein Cell Cycle -3.94
YJL074C SMC3 Chromosome segregation protein Cell Cycle -0.99
YJL21OW PEXZ Peroxisomal assembly protein - peroxin Cell Cycle -1.15
YJR053W BFA1 Unknown Cell Cycle -1.58
YJROQOC GRR1 33:32:23? $315332; if?“ and f°' Cell Cycle -2.3a
weovew S:§:i’§éif§:il2iit§:;§;$33?” ewcee .0...
YLR21OW CLB4 Cyclin, GZ/M-speciﬁc Cell Cycle -1.27
YLR288C MECB GZ-speciflc checkpoint protein Cell Cycle -3.83
YML064C TEM1 GTP-binding protein of the ras superfamily Cell Cycle -5.81
YML065W ORCt gamiﬁwgmm “mp'ex ”“3"” Cell Cycle -207
YMROOlC cocs gzgfgg‘zmabsfuﬂfrhxmwms at the Cell Cycle -1 .27
YMROSGC lMlH1 M-phase inducer phosphatase Cell Cycle -3.72
YNL289W PCL1 Cyclin, G1/S—speciﬁc Cell Cycle -1.77
YOR368W RAD1? DNA damage checkpoint control protein Cell Cycle -5.94
YPL153C RADS3 serlthr/tyr protein kinase Cell Cycle -2.90
YPL194W 0001 DNA damage checkpoint protein Cell Cycle -37.48
YPR111W DBF20 Cell cycle protein kinase related to DBF2P Cell Cycle -9.62
YAL0540 ACS1 Acetyl-CoA synthetase Glycolysis -5.31
YGR087C PDC6 Pyruvate decarboxylase isozyme 3 Glycolysis -99.94
YLR377C F BP1 Fructose-1 .6-bisphophatase Glycolysis -8.79
YMR1690 ALD3 Aldehyde dehydrogenase (NAD(P)+) Glycolysis 479.33
YMR3030 FKS3 1,3-beta-glucan synthase Glycolysis -1.89
YPL017C YPL017C Dihydrolipoamide dehydrogenase Glycolysis -38.20
YDR147W EKl1 Ethanolamine kinase Glycerolipid -5.46
YGR170W PSDZ Phosphatidylserine decarboxylase Glycerolipid -2.36

 

 

120

 

 

 

 

Table D1. Continued

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

121

 

 

Systematic Standard Gene Description Pathway *Change in
Name Name Response
from 1.0 M to
1.2 M NaCI
YHL032C GUT1 Glycerol kinase Glycerolipid -0.69
YlL155C GUT 2 Glycerol-3-phosphate dehydrogenase Glycerolipid -2.74
YKR031C SPO14 Phospholipase D Glycerolipid -3.71
YML07OW DAK1 Dihydroxyacetone kinase Glycerolipid ~28.28
YOR374W ALD4 Aldehyde dehydrogenase Glycerolipid -5.54
YBR200W BEM1 Bud emergence mediator MAPK -1 .33
Two-component phosphorelay MAPK
YDL235C YPD1 intermediate -6.46
serine/threonine protein kinase of the MAPK
YGRO40W KSS1 MEKK family -1.41
YGR088W CTT1 Catalase MAPK -626.33
Cuanine nucleotide-binding protein alpha- MAPK
YHR005C GPA1 1 subunit -2.13
Tyrosine protein kinase of the MAP MAPK
YJL128C PBSZ kinase kinase family -o_53
YKR095W MLP1 MLosin-like protein MAPK -1.19
Osomolarity two-component system MAPK
YLROOBC SSK1 protein -0.73
Ser/T hr protein kinase involved in the MAPK
YLR362W STE11 mating signalling pathway -1.60
YMR037C MSN2 Stress responsive regulatory protein MAPK -2.32
Putative transcriptional activator of alpha- MAPK
YMR043W MCM1 speciﬁc genes -0.76
"Genes exhibiting an increase in relative expression of at least 2-fold stepping from 1.0 M to
1.2 M NaCI.
YBL016W Fuss m::g;n'a°"va‘ed pm‘e'" “"359 (MAP Cell Cycle 32.46
YBR093C PH05 :RPespgfssible acid phosphatase precursor Cell Cycle 1.05
YBR135W CKSt Eggﬂggtdepewem kinase 'egu'am'y Cell Cycle 10.33
YDR507C GIN4 serine/threonine-protein kinase Cell Cycle 0.43
Phosphate system positive regulatory
YFR0340 PHO4 protein PHO4 Cell Cycle 2.78
YGL201 C MCMB Involved in replication Cell Cycle 0.83
YJL076W NET1 Establishes silent chromatin Cell Cycle 25.82
YBR196C 23211 Phosphatidate cytidylyltransferase Glycolysis 0.58
. 1-acyl-sn-glycerol-3-phosphate .
YDROSOC SLC1 acyltransferase Glycolysrs 2.35
YDR380W PH08 Repressible alkaline phosphatase Glycolysis 12.34
YER178W MNTZ Alpha-1 ,3-mannosyltransferase Glycolysis 0.58
YGR24OC MUQl Choline-phosphate cytidylyltransferase Glycolysis 1.20
Ethanolaminephosphotransferase .
YKL0600 EPT1 (ETHPD GlycolySls 2.43
YBROZQC GPP1 Glycerol biosynthesis Glycerolipid 0.35
YDL05ZC PGl1 Glucose-G-phosphate isomerase Glycerolipid 0.30
YDR481C TPl1 Triosephosphate isomerase (TIM) Glycerolipid 1.44
YGL257C AR010 Pyruvate decarboxylase Glycerolipid 0.54
Pyruvate dehydrogenase E1 component, . .
YGR007W PDA1 al ha subunit Glycerollpld 7.87

 

      

 

 

 

 

 

 

 

 

 

 

 

 

Systematic Gene Description Pathway *Change in
Name Name Response
from 1.0 M to
1.2 M NaCI
YHR123W PFK1 6-phosphofructokinase Glycerolipid 1 .69
YlL053W FBA1 Fructose-bisphosphate aldolase Glycerolipid 1.57
MA...
YFL026W STE2 Pheromone alpha factor receptor MAPK 11.29
YKL178C STE3 Pheromone A factor receptor MAPK 1.05
YPL089C RLM1 Serum response factor-like protein MAPK 0.40
YPR165W RHO1 giggl'iﬁg‘ﬁo‘t’gg?" °f "'8 "‘° SUDfam'” MAPK 11.38

 

*The Change in response is the difference between the relative expression for the 1.0 M and 1.2
M NaCI perturbations. Negative values indicate a decrease in the response.

“Relative expression values were obtained from the ratio of the 1.2 M NaCI data and the 1.0 M
NaCI data.

122

BIBLIOGRAPHY

Alexandre, H., V. Ansanay-Galeote, S. Dequin and B. Bondin. (2001). Global
gene expression during short-term ethanol stress in Saccharomyces
cerevisiae. FEBS Lett. 498298-103.

Alon, U., N. Barkai, D. A. Notterman, K. Gish, S. Ybarra, D. Mack and A. J. Levin.
(1999). Broad patterns of gene expression revealed by clustering analysis of

tumor and normal colon tissues probed by oligonucleotide arrays. Proc. Natl.
Acad. Sci. USA 96:6745-6750.

Altman, R. B. and S. Raychaudhuri. (2001). Whole-genome expression analysis:
challenges beyond clustering. Curr. Opin. Struct. Biol. 11:340-347.

Andreishcheva, E. N. and R. A. Zvyagilskaya. (1999). Adaptation of yeasts to salt
stress (Review). Appl. Biochem. Microbiol. 35:217-228.

Attﬁeld, P. V. (1997). Stress tolerance: the key to effective strains of industrial
baker’s yeast. Nat. Biotechnol. 15:1351-1357.

Autio, R., S. Hautaniemi, P. Kauraniemi, O. Yli-Harja, J. Astola, M. Wolf and A.
Kallioniemi. (2003). CGH-Plotter: MATLAB toolbox for CGH-data analysis.
Bioinformatics 19:1714-1715.

Ayala-deI-Rio, H. (2002). Long-term effects of phenol and phenol plus
thrichloroethene application on microbial communities in aerobic sequencing
batch reactors. Ph.D. dissertation. Michigan State University, East Lansing,
MI.

Beliaev, A. 8., D. K. Thompson, T. Khare, H. Lim,_C. C. Brandt, G. Li, A. E.
Murray, J. F. Heidelberg, c. s. Giometti, J. Yates 3'“, K: H. Nealson, J. M.
Tiedje and J. Zhoui. (2002). Gene and protein expression proﬁles of
Shewanella oneidensis during anaerobic growth with different electron
acceptors. OMICS 6:39-60.

Bender, E. A., T. J. Case and M. E. Gilpin. (1984). Perturbation experiments in
community ecology: theory and practice. Ecology 65:1-13.

Ben-Dor, A., R. Shamir and Z. Yakhini. (1999). Clustering gene expression
patterns. J. Comput. Biol. 6:281-297.

123

Benitez, T., J. M. Gasent-Ramirei, F. Castrejon and A. C. Codon. (1996).
Development of new strains for the food industry. Biotechnol. Prog. 12:149-
163.

Blomberg, A. (1997). The osmotic hypersensitivity of yeast Saccharomyces
cerevisiae is strain and growth media dependent: quantitative aspects of the
phenomenon. Yeast 13:529-539.

Blomberg, A. (2000). Metabolic surprises in Saccharomyces cerevisiae during
adaptation to saline conditions: questions, some answers and a model. FEMS
Microbiol. Lett. 182:1-8.

Brazma, A. L. and J. Vilo. (2000). Gene expression analysis. FEBS Lett. 480:17-
24.

Brown, C. S., P. C. Goodwin and P. K. Sorger. (2001). Image metrics in the
statistical analysis of DNA microarray data. Proc. Natl. Acad. Sci. USA
98:2622-267.

Bustin, S. A. (2000). Absolute quantiﬁcation of mRNA using real-time reverse
transricption polymerase chain reaction assays. J. Molec. Endroc. 25:169—
193.

Causton, H. C., B. Ren, S. S. Koh, C. T. Harbison, E. Kanin, E. G. Jennings, T. |.
Lee, H. L True, E. S. Lander and R. A. Young. (2001). Remodeling of yeast

genome expressionin response to environmental change. Mol. Biol. Cell.
12:323-337.

Cherkasova V., D. M., Lyons and E. A. Elion. (1999). FusBp and Kss1p control
G1 arrest in Saccharomyces cerevisiae through a balance of distinct arrest

and proliferative functions that operate in parallel with Far1p. Genetics
15:989-1004.

Cho, R. J., M J. Campbell, E. A. VVlnzeIer, L. Steinmetz, A. Conway, L. Wodicka,
T. G. Wolfsberg, A. E. Gabrielian, D. Landsman, D. L. Lockhart and R. W.

Davis. (1998). A genome-wide transcriptional analysis of the mitotic cell cycle.
Mol. Cell. 2:65-73.

Cottingham, K. L. and S. R. Carpenter. (1994). Predictive indices of ecosystem
resilience in models of north temperate lakes. Ecology 75:2127-2138.

124

DeAngelis, D. L. (1980). Energy ﬂow, nutrient cycling, and ecosystem resilience.
Ecology 61 :764-771.

DeAngelis, D. L., S. M. Bartell and A. L. Brenkert. (1989). Effects of nutrient and
food-chain length on resilience. Amer. Nat. 134:778-805.

DeAngelis, D. L. (1992). Dynamics of nutrient cycling and food webs. Chapman
and Hall, New York.

DeFrancesco, L. (2003). Real-time PCR takes center stage. Anal. Chem.
75:175A-179A.

DeRisi, J. L., V. R. lyer and P. 0. Brown. (1997). Exploring the metabolic and
genetic control of gene expression on a genomic scale. Science 278:680-686.

Eberhardt l, H. Cederberg, H. Li, S. Konig, F. Jordan and S. Hohmann. (1999).
Autoregulation of yeast pyruvate decarboxylase gene expression requires the
enzyme but not its catalytic activity. Eur J Biochem 262:191-201

Eisen, M. B., P. T. Spellman, P. 0. Brown and D. Botstein. (1998). Cluster
analysis and display of genome-wide expression patterns. Proc. Natl. Acad.
Sci. USA 95:14863-14868.

Engelke, D. R., A. Krikos, M. E. Bruck and D. Ginsburg. (1990). Puriﬁcation of
Thermus aquaticus DNA polymerase expressed in Escherichia coli. Anal.
Biochem. 191:396-400.

Estruch, F. (2000). Stress-controlled transcription factors, stress induced genes
and stress tolerance in budding yeast. FEMS Microbiol. Rev. 24:469-48.

Fedorscak, l. and L. Ehrenberg. (1966). Effects of diethyl pyrocarbonate and
methyl methanesulfonate on nucleic acids and nuclease. Acta. Chem.
Scand. 20:107.

Fernandes, L., C. Rodrguez-Pousada and K. Struhl. (1997). Yap, a novel family
of eight bZlP proteins in Saccharomyces cerevisiae with distinct biological
functions. Mol. Cell Biol. 17:6982-6993.

Fishbane, P. M., S. Gasiorowiz and S. T. Thornton. (1993). Physics for scientists
and engineers. Prentice Hall, Englewood Cliffs, New Jersey.

I25

Gasch, A. P., P. T. Spellman, C. M. Kao, O. Carmel-Harel, M. B. Eisen, G. Storz,
D. Botstein and P. 0. Brown. (2000). Genomic expression programs in the
response of yeast cells to environmental changes. Mol. Biol. Cell. 11:4241-
4257.

Gollub, J., C. A. Ball, G. Binkley, J. Demeter, D. B. Finkelstein, J. M. Hebert, T.
Hernandez-Boussard, H. Jin, M. Kaloper, J. C. Matese, M. Schroeder, P. 0.
Brown, D. Botstein and G Sherlock. (2003). The Stanford Microarray
Database: data access and quality assessment tools. NAR 31: 94-96.

Grandpre, L. and Y. Bergeron. (1997). Diversity and stability of understorey
communities following disturbance in the southern boreal forest. J. Ecol.
85:777-784.

Grimm, V., E. Schmidt and C. Wlssel. (1992). On the application of stability
concepts in ecology. Ecol. Model. 63:143-161.

Harrison, G. W. (1979). Stability under environmental stress: resistance,
resilience, persistance, and variability. Amer. Nat. 113:659-669.

Hartigan, J. A. (1975). Clustering algorithms. John Wlley and Sons, New York.

Hashsham, S. A., A. S. Fernandez, S. L. Dollhopf, F. B. Dazzo, R. F. Hickey, J.
M. Tiedje and C. S. Criddle. (2000). Parallel processing of substrate
correlates with greater functional stability in methanogenic bioreactor
communities perturbed by glucose. Appl. Environ. Microbiol. 66:4050—4057.

Hecker, M and S. Engelmann. (2000). Proteomics, DNA arrays and the analysis
of still unknown regulons and unknown proteins of Bacillus Subtilis and
pathogenic gram-positive bacteria. J. Med. Microbiol. 290:123-134.

Hernandez, J. A. 2003. Stability properties of elementary dynamic models of
membrane transport. Bulletin of mathematical biology 65:175-197.

Hernandez, A., A. Figueroso, L. A. Rivas, V. Parro and R. P. Mellado. (2000).
RT-PCR as a tool for systematic transcriptional analysis of large regions of
the Bacillus subtilis genome. Microbiology 146:823—828.

Heyer, L. J., S. Kruglyak and S. Yooseph. (1999). Exploring expression data:
identiﬁcation and anlysis of coexpressed genes. Genome Res. 921106-1115.

126

Hinchliffe, S. J., K. E. lsherwood, R. A. Stabler, M. B. Prentice, A. Rakin, R. A.
Nichols, P. C. Oyston, J. Hinds, R. W. Titball and B. W. Wren. Application of
DNA microarrays to study the evolutionary genomics of Yersinia pestis and
Yersinia pseudotuberculosis. Genome Res. 9:2018-29.

Hirayama, T., T. Maeda, H. Saito and K. Shinozaki. (1995). Cloning and
characterization of seven cDNAs for hyperosmolarity-responsive (HOR)
genes of Saccharomyces cerevisiae. Mol. Gen. Genet. 249: 1 27-1 38.

Hohmann, S and W. H. Mager. (1997). Yeast stress response. Chapman and
Hall, New York, New York.

Holling, C. S. (1973). Resilience and stability of ecological systems. Annu. Rev.
Ecol. Syst. 421-23.

lnchausti, P. (1995). Competition between perennial grassed in neotropical
savanna: the effects of ﬁre any hydric-nutritional stress. J. Ecol. 83:231-243.

Ives, A. R. (1995). Measuring resilience in stochastic systems. Ecological
Monographs 65:217-233.

Kanehisa, M. and S. Goto. (2000). KEGG: kyoto encyclopedia of genes and
genomes. Nucleic Acids Res. 28:27-30.

Karsai, A., S. Mi‘lller, S. Platz and M. T. Hauser. (2001). Evaluation of a home-
made SYBR green l reaction mixture for real-time PCR quantiﬁcation of gene
expression. Biotechniques 32:790-796.

Klis F. M., M. Pieternella, K. Hellingwerf and B. Stanley. (2002). Dynamics of cell
wall structure in Saccharomyces cerevisiae. FEMS Microbiol. Rev. 26:239-
256.

Kobayashi N. and K. McEntee. (1993). Identiﬁcation of cis an trans components
of a novel heat shock stress regulatory pathway in Saccharomyces
cerevisiae. Mol. Cell Biol. 13:248-256.

Kurilova, S. A., N. N. Vorobjeva, T. l. Nazarova, and S. M. Avaeva. (1993).
Expression of Saccharomyces cerevisiae inorganic pyrophosphatase in
Escherichia coli. FEBS Lett. 333:280-282.

127

Larsson K., P. Eriksson, R. Ansell and L. Alder. (1993). A gene encoding sn-
glycerol 3-phosphate dehydrogenase (NAD+) complements an
osmosensitive mutant of Saccharomyces cerevisiae. Mol. Microbiol.
10:1101-1111.

Laws, R. J., T. L. Bergemann, F. Quiaoit and L. P. Zhao. (2003). SignalViewer:
analyzing microarray images. Bioinformatics 19:1716-1717.

Legendre, P. and Legendre L. (1998). Numerical Ecology. 2"“ Edition. Elsevier
Science, The Netherlands.

Lewontin, R. C. (1969). The meaning of stability. Brookhaven Symp. Biol. 22:13-
24.

Livak, K. J. and T. D. Schmittgen. (2001). Analysis of relative gene expression
data using real-time quantitative PCR and the Z'MCT method. Methods
25:402-408.

Lockhart, D. J., H. Dong, M. C. Byrne, M. T. Follettie, M. V. Gallo, M. S. Chee, M.
Mittmann, C. Wang, M. Kobayashi, H. Horton and E. L. Brown. (1996).
Expression monitoring by hybridization to high-density oligonucleotide
arrays. Nat. Biotech. 14:1675-1680.

Long, A. D., H. J. Mangalam, B. Y. P. Chan, L. Tolleri, G. W. Hatﬁeld and P.
Baldi. (2001). Improved statistical inference from DNA microarray data using

analysis of variance and a bayesian statistical framework. J. Biol. Chem.
276: 1 9937-1 9944.

Longnecker, M. and R. L. Ott. (2001). An introduction to statistical methods and
data analysis. Duxbury Press, Paciﬁc Grove, CA.

Maeda, T., M. Takekawa and H. Saito. (1995). Activation of yeast PBSZ MAPKK
by MAPKKKS or by binding of an SH3-containing osmo-sensor. Science
269:554-558.

Mager, W. H. and M. Siderius. (2002). Novel insights into osmotic stress
response of yeast. FEMS Yeast Res. 2:251-257

Marchler G., C. Schuller, G. Adam and H. Ruis. (1993). A Saccharomyces
cerevisiae UAS element controlled by protein kinase A activates transcription
in response to a variety of stress conditions. EMBO J. 12:1997-2003.

128

Martinez-Pastor, M. T., G. Marchler, C. Schuller, A. Marchler-Bauer, H. Ruis and
F. Estruch. (1996). The Saccharomyces cerevisiae zinc ﬁnger proteins
Msn2p and Msn4p are required for transcriptional induction through stress
response element (S.T.R.E.). EMBO J. 15:2227-2235.

Merkin, D. R. (1997). Introduction to the theory of stability. Springer-Verlag, New
York.

Mittelbach, G. G., A. M. Turner, D. J. Hall and J. E. Rettig. (1995). Perturbation
and resilience: a long-term whole-lake study of predator extinction and
reintroduction. Ecology 76:2347-2360.

Murray, J. D. (1993). Mathematical biology. Springer-verlag, New York, New
York.

Nakajima, H. (1992). Sensitivity and stability of ﬂow networks. Ecol. Mod. 62:123-
133.

Neubert, M. G and H. Caswell. (1997). Aternatives to resilience for measuring the
response of ecological systems to perturbations. Ecology 78:653-665.

Norbeck A., A. K. Pahlmann, N. Akhtar, A. Blomberg and L., Alder. (1996).
Puriﬁcation and characterization of two isoenzymes of DL-glycerol 3-
phosphatase from Saccharomyces cerevisiae. Identiﬁcation of the
corresponding GPP1 and GPP2 genes and evidence for osmostic regulation
of Gpp2p expression by the osmosensing MAP kinase signal transduction
pathway. J. Biol. Chem. 271:13875-13881.

Norbeck, J. and A. Blomberg. (1997). Two dimensional electrophoretic
separation of yeast proteins using a non-linear wide range (pH 3-10)
immobilized pH gradient in the ﬁrst dimension: reproducibility and evidence
for isoelectric focusing of alkaline (pl>7) proteins. Yeast 13:1519-1534.

Norbeck, J. and A. Blomberg. (2000). The level of CAMP-dependent protein
kinase A activity strongly affects osmotolerance and osmo-instigated gene
expression changes in Saccharomyces cerevisiae. Yeast 16:121-37.

O’Neill, R. V. (1976). Ecosystem persistence and heterotrophic regulation.
Ecology 57:1244-1253.

129

Perou, C. M., T. Sorlie, M. B. Eisen, M. van de Rijn, S. S. Jeffrey, C. A. Rees, J.
R. Pollack, D. T. Ross, H. Johnsen, L. A. Akslen, O. Fluge, A.
Pergamenschikov, C. Wllliams, S. X. Zhu, P. E. Lonning, A. L. Borresen-
Dale, P. 0. Brown and D. Botstein. (2000). Molecular portraits of human
breast tumours. Nature 406:747-752.

Pimm, S. L. (1982). Food Webs. Chapman and Hall, New York, NY.

Pimm. S. L. (1984). The complexity and stability of ecosystems. Nature 307:321-
326.

Planet, P. J., R. DeSalle, M. Siddall, T. Bael, I. N. Sarkar and S. E. Stanley.
(2001). Systematic analysis of DNA microarray Data: ordering and
interpreting patterns of gene expression. Genome Res. 11:1149-1155.

Pluthero, G. (1993). Rapid puriﬁcation of high-activity Taq DNA polymerase.
Nucleic Acids Res. 21 :4850-4851.

Rantokokko—Javala, K. and J. Javala. (2001). Development of conventional and
real-time PCR assays for detection of Legionella DNA in respiratory
specimens. J. Clin. Microbiol. 39:2904-2910.

Rep, M, J. A. Albertyn, J. M Thelvelein, B. A. Prior, and S. Hohmann. (1999).
Different signalling pathways contribute to the control of gpd1 gene
expression by osmotic stress in Saccharomyces cerevisiae. Microbiology
145:715-727.

Rep, M., M. Krantz, J. M. Thevelein and S. Hohmann. (2000). The transcriptional
response of Saccharomyces cerevisiae to osmotic shock. J. Biol. Chem.
275:8290-8300.

Rep, M., M. Proft, J. Remize, M. Tamas, R. Serrano, J. M. Thevelein and S.
Hohmann. (2001). The Sacchromyces cerevisiae Sko1p transcription factor
mediates HOG pathway-dependent osmotic regulation of a set of gene
encoding enzymes implicated in protection from oxidative damage. Mol.
Microbiol. 40: 1 067-1 083.

Ririe, K. M., R. P. Rasmussen and C. T. Wlttwer. (1997). Product differentiation
by analysis of DNA melting curves during the polymerase chain reaction.
Anal. Biochem. 245:154-160.

I30

Sambrook J., E. F. Fritsch and T. Maniatis. (1989). Molecular cloning: a
laboratory manual. Cold Springs Harbor, Plainview, New York.

Schena, M., D. Shalon, R. W. Davis and P. 0. Brown. (1996). Quantitative
monitoring of gene expression patterns with a complementary DNA
microarray. Science 270:467-470.

Schmitt, M. E., T. A. Brown and B. L. Trumpower. (1990). A rapid and simple
method for preparation of RNA from Saccharomyces cerevisiae. Nucleic
Acids Res. 18:3091-3092.

Sennhauser, E. B. (1991). The concept of stability in connection with the gallery
forests of the chaco region. Vegetatio. 94:1-13.

Shalon D., S. J. Smith and P. 0 Brown. (1996). A DNA microarray system for
analyzing complex DNA samples using two-color ﬂuorescent probe
hybridization. Genome Res. 6:639-6445.

Sherlock, G. (2000). Analysis of large-scale gene expression data. Curr. Opin.
lmmunol. 12:201-205.

Shou W, J. H. Seol, A. Shevchenko, C. Baskerville, D. Moazed, Z. W. Chen, J.
Jang, A. Shevchenko, H. Charbonneau and R. J. Deshaies. (1999) Exit from
mitosis is triggered by Tem1-dependent release of the protein phosphatase
Cdc14 from nucleolar RENT complex. Cell 97:233-44.

Siderius, M., E. Rots and W. H. Mager. (1997). High-osmolarity signalling in
Saccharomyces cerevisiae is modulated in a carbon-source-dependent
fashion. Microbiology UK 143:3241-3250.

Spellman, P. T., G. Sherlock, M. Q. Zhang, V. R. lyers, K. Anders, M. B. Eisen,
P. 0. Brown, D. Botstein and B. Futcher. (1998). Comprehensive
identiﬁcation of the cell cycle-regulated genes of the yeast Saccharomyces
cerevisiae by microarray hybridization. Mol. Biol. Cell. 923273-3297.

Spiro A., M Lowe and D. Brown. (2000). A bead-based method for multiplexed
identiﬁcation and quantiﬁcation of DNA sequences using ﬂow cytometry.
Appl. Environ. Microbiol. 66:4258-4265.

Sutherland J. P. (1981). The fouling community at beaufort, north carolina: a
study in stability. Amer. Nat. 118:499-519.

131

 

Swami S., N. Raghavachari, U. R. Muller, Y. P. Bao and D. Feldman. (2003).
Wtamin D growth inhibition of breast cancer cells: gene expression patterns
assessed by cDNA microarray. Breast Cancer Res. Treat. 80:49-62.

Talaat A. M., S. T. Howard, W. Hale IV, R. Lyons, H. Garner and S. A. Johnston.
(2002). Genomic DNA standards for gene expression proﬁling in
Mycobacterium tuberculosis. Nucleic Acids Res. 302104-112.

Tamayo, P., D. SIonim, J. Mesirov, Q. Zhu, S. Kitareewan, E. Dmitrovsky, E. S.
Lander and T. R. Golub. (1999). Interpreting patterns of gene expression
with self-organizing maps: methods and application to hematopoietic
differentiation. Proc. Natl. Acad. Sci. USA 96:2907-2912.

Treger, J. M., A. P. Schmitt, J. R. Simon and K. McEntee. (1998). Transcriptional
factor mutations reveal regulatory complexities of heat shock and newly

identiﬁed stress genes in Sacchromyces cerevisiae. J. Biol. Chem.
273:26875-26879.

Toshihide S., P. J. Higgins and D. R. Crawford. (2000). Control selection for RNA
quantiﬁcation. Biotechniques 29:332-337.

Venema J., and D. Tollervey. (1999). Ribosome synthesis in Saccharomyces
cerevisiae. Annu. Rev. Genet. 33:261-311.

Viragh, K. (1989). An experimental approach to the study of community stability:
Resilience and Resistence. Acta Bot. Hung. 35299-125.

Wen, X., S. Fuhrman, G. S. Michaels, D. B. Carr, S. Smith, J. L. Barker and R.
Somogyi. (1998). Large-scale temporal gene expression mapping of central
nervous system development. Proc. Natl. Acad. Sci. USA 95:334-339.

chkert, L., S. Steinkrﬂger, M. Abiaka, U. Bolkenius, O. Purps, C. Schnabel and
A. M. Gressner. (2002). Quantiﬁcation monitoring of the mRNA expression
pattern of the TGF-B-isoforms ([31, [32, [33) during transdifferentiation of
hepatic stellate cells using a newly developed real-time SYBR green PCR.
Biochem. Biophys. Res. Commun. 295:330-335

VVleser R., G. Adam, A. Wagner, C. Schuller, G. Marchler, H. Ruis, Z. Krawiec
and T. Bilinski. (1991). Heat shock factor-independent heat control of
transcription of the C'l'l'1 gene encoding the cytosolic catalase T of
Saccharomyces cerevisiae. J Biol Chem 266:12406-11.

132

Wodicka, L., H. Dong, M. Mittmann, M. H. Ho and D. J. Lockhart. (1997).
Genome-wide expression monitoring in Saccharomyces cerevisiae. Nat.
Biotech. 15:1359-1367.

Wuytswinkel, O. V., V. Reiser, M. Siderius, M. C. Kelders, G. Ammerer, H. Ruis,
and W. H. Mager. (2000). Response of Sacchaormyces cerevisiae to severe
osmotic stress: evidence for a novel activation mechanism of the HOG MAP
kinase pathway. Mol. Microbiol. 372382-397

Yancey P. H., M. E. Clark, S. C. Hand, R. D. Bowlus and G. N. Somero. (1982).
Living with water stress: evolution of osmolyte systems. Science 217:1214-
1222.

Yodzis, P. (1988). The indeterminancy of ecological interactions as perceived
through perturbation experiments. Ecology 69:508-515.

Yuen, T., E. Wurmbach, R. L. Pfeffer, B. J. Ebersole and S. C. Sealfon. (2002).
Accuracy and calibration of commercial oligonucleotide and custom cDNA
microarrays. Nucleic Acids Res. 30:48-56.

Xia, X. and Z. Xie. (2001). AMADA: analysis of microarray data. Bioinformatics
17:569-570.

Zhang, L., Y. Zhang, Y. Zhou, S. An, Y. Zhou and J. Cheng. (2002). Response of

gene expression in Saccharomyces cerevisiae to amphotericin B and
nystatin measured by microarrays. J. Antimicrob. Chemo. 49:905-915.

I33

EEEEEEEEEEEEEEEEEEEEEEEEEEEE

lllcylllllllllllllllllllllllllllllllllllllll