THE EVOLUTION OF DIGITAL COMMUNITIES UNDER LIMITED RESOURCES
By
Bess Linden Walker

A DISSERTATION
Submitted to
Michigan State University
in partial fulﬁllment of the requirements
for the degree of
DOCTOR OF PHILOSOPHY
Computer Science
Ecology, Evolutionary Biology and Behavior
2012

ABSTRACT
THE EVOLUTION OF DIGITAL COMMUNITIES UNDER LIMITED
RESOURCES
By
Bess Linden Walker
Schluter (1996) describes adaptive radiation as “the diversiﬁcation of a lineage into species
that exploit a variety of diﬀerent resource types and that diﬀer in the morphological or physiological traits used to exploit those resources”. My research focuses on adaptive radiation in
the context of limited resources, where frequency-dependence is an important driver of selection (Futuyma & Moreno, 1988; Dieckmann & Doebeli, 1999; Friesen et al., 2004). Adaptive
radiation yields a community composed of distinct organism types adapted to speciﬁc niches.
I study simple communities of digital organisms, the result of adaptive radiation in environments with limited resources. I ask (and address) the questions: How does diversity,
driven by resource limitation, aﬀect the frequency with which complex traits arise? What
other aspects of the evolutionary pressures in this limited resource environment might account for the increase in frequency with which complex traits arise? Can we predict community stability when it encounters another community, and is our prediction diﬀerent for
communities resulting from adaptive radiation versus those that are artiﬁcially assembled?
Community diversity is higher in environments with limited resources than in those with
unlimited resources. The evolution of an example complex feature (in this case, Boolean
EQU) is also more common in limited-resource environments, and shows a strong correlation
with diversity over a range of resource inﬂow rates. I show that populations evolving in
intermediate inﬂow rates explore areas of the ﬁtness landscape in which EQU is common,
and that those in unlimited resource environments do not. Another feature of the limited-

resource environments is the reduced cost of trading oﬀ the execution of building block tasks
for higher-complexity tasks. I ﬁnd strong causal evidence that this reduced cost is a factor
in the more common evolution of EQU in limited-resource environments.
When two communities meet in competition, the fraction of each community’s descendants making up the ﬁnal post-competition community is strongly consistent across replicates. I ﬁnd that three community-level factors, ecotypic diversity, community composition,
and resource use eﬃciency can be used to predict this fractional community success, explaining up to 35% of the variation.
In summary, I demonstrate the value of digital communities as a tractable experimental
system for studying general community properties. They sit at the bridge between ecology
and evolutionary biology and evolutionary computation, and oﬀer comprehensible ways to
translate ideas across these ﬁelds.

For Jay Genglebach, who unknowingly spurred half a year of productivity simply by
assuming I was a productive person.

iv

ACKNOWLEDGMENTS

This dissertation has been brought to you by Swedish Fish R , which fueled the last couple
weeks of writing.
I am deeply grateful to the many people who have aided and supported me through
my graduate studies. My mother Ruth E. W. Walker has been a curious ear, supportive
shoulder, and helpful editor throughout this process. Thanks for years of love, tolerance,
and support are owed to Kaben Nanlohy, whose unique history with Avida has made him
an excellent sounding board for many pieces of this research.
I would not have gotten very far without the input of my academic committee. My
advisor, Dr. Charles Ofria, has been the best advisor anyone could wish for: intellectually
engaging, warm-hearted, and patient. His contribution to this work cannot be overstated.
I would also like to thank the rest of my committee for their contributions to my research:
Dr. Richard Lenski, Dr. Philip McKinley, and Dr. William Punch. Their diverse perspectives
have added depth to this interdisciplinary eﬀort.
My time at Michigan State has been spent in the company of my fellow Devolab students,
in whom I have been very lucky. Every one of them has been friendly and supportive, and
they have formed the core of my friendships here. I would especially like to thank Sherri
Goings, who is one of my collaborators for the research presented in Chapters 3 and 4. Her
skill for developing incisive tests is one I envy and have beneﬁtted from.
I owe a great deal to Dr. Tim Cooper, whose preliminary work on the outcome of
community competition forms the solid foundation of my research in Chapter 5. His summary
report of that work has been invaluable, both in guiding my research toward promising paths
and in steering it away from already-explored blank alleys.
v

This material is based in part upon work supported by a National Science Foundation
Graduate Research Fellowship, by Michigan State University under a University Distinguished Fellowship, and by the National Science Foundation under Grant No. CCF-0643952
and Cooperative Agreement No. DBI-0939454.

vi

TABLE OF CONTENTS

List of Tables . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

x

List of Figures . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xiii

Chapter 1 Introduction . . . . . . . .
1.1 Study System . . . . . . . . . . . .
1.2 Communities . . . . . . . . . . . .
1.3 Complex Features . . . . . . . . . .
1.4 Links to Evolutionary Computation

.
.
.
.
.

.
.
.
.
.

.
.
.
.
.

.
.
.
.
.

.
.
.
.
.

.
.
.
.
.

.
.
.
.
.

.
.
.
.
.

.
.
.
.
.

.
.
.
.
.

.
.
.
.
.

.
.
.
.
.

.
.
.
.
.

.
.
.
.
.

.
.
.
.
.

.
.
.
.
.

.
.
.
.
.

.
.
.
.
.

.
.
.
.
.

.
.
.
.
.

.
.
.
.
.

.
.
.
.
.

.
.
.
.
.

1
1
2
3
5

Chapter 2 The Avida Digital Evolution Platform
2.1 Avidians and the Avida World . . . . . . . . . . .
2.2 Self-Replication in Avida . . . . . . . . . . . . . .
2.3 Boolean Logic Tasks . . . . . . . . . . . . . . . .
2.4 Limited Resources . . . . . . . . . . . . . . . . .
2.5 Phenotypes and Ecotypes in Avida . . . . . . . .
2.6 The Ecological Period . . . . . . . . . . . . . . .

.
.
.
.
.
.
.

.
.
.
.
.
.
.

.
.
.
.
.
.
.

.
.
.
.
.
.
.

.
.
.
.
.
.
.

.
.
.
.
.
.
.

.
.
.
.
.
.
.

.
.
.
.
.
.
.

.
.
.
.
.
.
.

.
.
.
.
.
.
.

.
.
.
.
.
.
.

.
.
.
.
.
.
.

.
.
.
.
.
.
.

.
.
.
.
.
.
.

.
.
.
.
.
.
.

6
7
8
9
11
12
13

Chapter 3 Evolutionary Potential is Maximized at Intermediate Diversity
3.1 Diversity as a Potential Factor in the Evolution of Complex Features . . . .
3.2 Experimental Environments . . . . . . . . . . . . . . . . . . . . . . . . . . .
3.2.1 The unlimited-resource environment . . . . . . . . . . . . . . . . . .
3.2.2 The limited-resource environment . . . . . . . . . . . . . . . . . . . .
3.2.3 Speciﬁc conﬁguration details . . . . . . . . . . . . . . . . . . . . . . .
3.3 Diversity Peaks at Intermediate Productivity . . . . . . . . . . . . . . . . . .
3.4 The Evolution of EQU Peaks at (Higher) Intermediate Productivity . . . . .
3.5 Populations at Intermediate Diversity Evolve EQU More Often Even When
It Is Not Rewarded . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
3.5.1 Preventing the ﬁxation of EQU . . . . . . . . . . . . . . . . . . . . .
3.5.2 Testing the EQU-granting mutation encounter rate . . . . . . . . . .
3.6 Low Individual Distinct Task Counts Limit the Evolution of EQU . . . . . .
3.7 Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

28
28
29
30
31

Chapter 4
4.1

Reduced Cost of Trade-Oﬀs in Limited Resource Environments
Promotes the Evolution of Complex Features . . . . . . . . . . .
Task Loss Without Gain . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
4.1.1 Testing for task loss mutations: technical restrictions . . . . . . . . .

15
17
18
19
19
22
24
25

vii

33
34
35

4.1.2
4.2

4.3

Deleterious “loss without gain” mutations do not detectably aﬀect the
evolution of EQU . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Task Trade-Oﬀs: Loss Accompanied by Gain . . . . . . . . . . . . . . . . . .
4.2.1 Trade-oﬀs at the evolution of EQU . . . . . . . . . . . . . . . . . . .
4.2.2 Eliminating the possibility of a neutral or detrimental trade-oﬀ for EQU
4.2.3 Eliminating the possibility of a neutral or detrimental trade-oﬀ when
trading “up” . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

35
36
37
39
39
41

Chapter 5 Predicting the Outcome of Competitions Between Communities
5.1 Measures of Community-Level Properties . . . . . . . . . . . . . . . . . . . .
5.1.1 Measuring community success . . . . . . . . . . . . . . . . . . . . . .
5.1.2 Measuring community diversity . . . . . . . . . . . . . . . . . . . . .
5.1.3 Measuring community composition . . . . . . . . . . . . . . . . . . .
5.1.4 Measuring community breadth . . . . . . . . . . . . . . . . . . . . . .
5.1.5 Measuring community resource use eﬃciency . . . . . . . . . . . . . .
5.2 Generation of Test Communities . . . . . . . . . . . . . . . . . . . . . . . . .
5.3 Community Competition . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
5.3.1 Self-competition . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
5.3.2 Competition results are consistent . . . . . . . . . . . . . . . . . . . .
5.3.3 Evolved communities have a nearly-linear dominance hierarchy . . . .
5.4 Correlations with Community Success . . . . . . . . . . . . . . . . . . . . . .
5.4.1 Community diversity . . . . . . . . . . . . . . . . . . . . . . . . . . .
5.4.2 Average ecotypic generality . . . . . . . . . . . . . . . . . . . . . . .
5.4.3 Community breadth . . . . . . . . . . . . . . . . . . . . . . . . . . .
5.4.4 Resource use eﬃciency . . . . . . . . . . . . . . . . . . . . . . . . . .
5.5 Linear Models . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
5.6 Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

42
43
43
43
43
44
45
45
47
47
48
50
51
52
53
55
58
61
66

Chapter 6 Conclusions and Future Work . . . . . . . . . . . . . .
6.1 Conclusions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
6.2 Future Work . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
6.2.1 Building blocks: features of lesser complexity . . . . . . . .
6.2.2 Competitions between evolved and assembled communities
6.2.3 Ecotype-level community descriptors . . . . . . . . . . . .

.
.
.
.
.
.

.
.
.
.
.
.

.
.
.
.
.
.

.
.
.
.
.
.

.
.
.
.
.
.

.
.
.
.
.
.

68
68
69
69
70
71

Appendix A Instruction Set . . .
A.1 No-ops . . . . . . . . . . . .
A.2 Flow control . . . . . . . . .
A.3 Single argument math . . .
A.4 Double argument math . . .
A.5 Biological operations . . . .
A.6 I/O and Sensory operations
A.7 Deﬁnitions . . . . . . . . . .

.
.
.
.
.
.
.
.

.
.
.
.
.
.
.
.

.
.
.
.
.
.
.
.

.
.
.
.
.
.
.
.

.
.
.
.
.
.
.
.

.
.
.
.
.
.
.
.

74
74
74
75
76
76
77
77

.
.
.
.
.
.
.
.

.
.
.
.
.
.
.
.

.
.
.
.
.
.
.
.

.
.
.
.
.
.
.
.

.
.
.
.
.
.
.
.

viii

.
.
.
.
.
.
.
.

.
.
.
.
.
.
.
.

.
.
.
.
.
.
.
.

.
.
.
.
.
.
.
.

.
.
.
.
.
.
.
.

.
.
.
.
.
.
.
.

.
.
.
.
.
.
.
.

.
.
.
.
.
.
.
.

.
.
.
.
.
.
.
.

.
.
.
.
.
.
.
.

.
.
.
.
.
.
.
.

.
.
.
.
.
.
.
.

.
.
.
.
.
.
.
.

.
.
.
.
.
.
.
.

.
.
.
.
.
.
.
.

.
.
.
.
.
.
.
.

Appendix B Boolean Logic . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

78

Appendix C How two deleterious EQU-granting mutations ﬁxed on the line
of descent . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

86

Bibliography . . . . . . . . . . . . . . .

89

ix

. . . . . . . . . . . . . . . . . . .

LIST OF TABLES

Table 2.1

NAND-count-based task reward values. The symbol “¬” denotes
negation, while semicolons separate symmetrical functions. See Appendix B for a fuller treatment. . . . . . . . . . . . . . . . . . . . .

10

Table 3.1

Task rewards in Lenski et al. (2003). An organism that performs a
task has its current execution rate multiplied by the amount of the
task’s reward; it can be rewarded for each task only once per lifetime. 19

Table 3.2

Hybrid task rewards, based both on task complexity and resource
availability (AT ASK denotes the number of resource units an organism uses from the T ASK’s pool). An organism that performs a task
has its current execution rate multiplied by the amount of the task’s
reward; it can be rewarded for each task only once per lifetime. . . .

22

Detailed phenotypic diversity measures during the evolution of EQU.
Shannon diversity of viable phenotypes in 200 runs each of the inﬁniteand 100-unit-inﬂow environments is measured at three points: the
ﬁrst appearance of EQU in the population, its ﬁxation on the line of
descent, and the end of the run (100,000 updates). P-values compare
the diversity of the two treatments using a two-tailed Mann-Whitney
U test with a sequential Bonferroni correction. . . . . . . . . . . . .

24

Number of populations out of 200 that evolved EQU at intermediate
inﬂow rates. We performed a chi-squared test to determine if the
evolvability of EQU for at least one inﬂow rate diﬀered signiﬁcantly
from the rest (p<.005, χ2 = 13.156, 3 degrees of freedom). With this
conﬁrmed, we calculated the signiﬁcance of each ratio’s diﬀerence
from 171/200 (the result at RIN F LOW = 100) with Fisher’s exact
test, two-tailed, and corrected with the sequential Bonferroni correction; the n=2 correction was applied to both the RIN F LOW = 30
and RIN F LOW = 300 data, since they can be ordered arbitrarily. . .

27

Number of distinct resources used per individual. The 5th, 50th, and
95th percentiles for each environment are displayed. We compared
the distributions from the uncapped 100-inﬂow and unlimited environments to the uncapped 10-inﬂow environment using a two-tailed
Mann-Whitney U test; for both, p
10−12 . . . . . . . . . . . . . .

30

Table 3.3

Table 3.4

Table 3.5

x

Table 3.6

Distinct tasks and the evolution of EQU with a three-task cap. The
5th, 50th, and 95th percentiles for each environment are displayed.
The number of populations that evolved EQU in each treatment is
also given; p-values compare the capped treatment to the uncapped
treatments within each environment using two-tailed Fisher’s Exact
tests with a sequential Bonferroni correction. . . . . . . . . . . . . .

31

Table 4.1

The EQU-81 reward structure. Any trade-oﬀ for a more complex
task is guaranteed to be beneﬁcial. An organism that performs a
task has its current execution rate multiplied by the amount of the
task’s reward; it can be rewarded for each task only once per lifetime. 40

Table 5.1

Linear models predicting the fractional community success (f cs) of
evolved communities based on diﬀerential community diversity (d),
generality (g), and resource use (in)eﬃciency (e). The ﬁve models
with AIC diﬀerences below three are displayed; AIC and BIC weights
(considering only these ﬁve models) are displayed. Factors that are
insigniﬁcant at p = .05 inside each model are italicized. . . . . . . .

63

Linear models predicting the fractional community success (f cs) of
randomly assembled communities based on diﬀerential community
generality (g), task coverage (c), and resource use (in)eﬃciency (e).
These are the four models with AIC diﬀerence less than three; AIC
and BIC weights (when considering only these models) are displayed.
Factors that are not signiﬁcant at p = .05 inside each model are
italicized. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

64

Table 5.2

Table 5.3

Linear models predicting the fractional community success (f cs) of
ecologically complete assembled communities based on diﬀerential
community generality (g), task coverage (c), and resource use (in)eﬃciency
(e). These are the two models with AIC diﬀerence less than three;
AIC and BIC weights (when considering only these models) are displayed. Factors that are not signiﬁcant at p = .05 inside each model
are italicized. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 65

Table 5.4

Linear models predicting the fractional community success (f cs) of
specialist-preferred ecologically complete assembled communities based
on diﬀerential community task coverage (c) and resource use (in)eﬃciency
(e). These are the two models with AIC diﬀerence less than three;
AIC and BIC weights (when considering only these models) are displayed. Factors that are not signiﬁcant at p = .05 inside each model
are italicized. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 66

xi

Table B.1

Table B.2

Table B.3

Table B.4

Table B.5

Table B.6

Table B.7

Table B.8

Table B.9

Table C.1

Table C.2

Truth table of binary NAND. This function is false only when both
A and B are true. . . . . . . . . . . . . . . . . . . . . . . . . . . . .

78

Truth table of unary NOT. This function is false when the input A
is true. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

79

Truth table of binary AND. This function is true only when both A
and B are true. . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

79

Truth table of binary ORN. This function is false only when A is false
and B is true. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

80

Truth table of binary OR. This function is false only when both A
and B are false. . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

81

Truth table of binary ANDN. This function is true only when A is
true and B is false. . . . . . . . . . . . . . . . . . . . . . . . . . . .

81

Truth table of binary NOR. This function is true only when both A
and B are false. . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

82

Truth table of binary XOR. This function is true only when A and
B are diﬀerent. . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

83

Truth table of binary EQU, sometimes known as XNOR. This function is true only when A and B are the same. . . . . . . . . . . . . .

84

Task performance snapshot of Population 1089 before, at, and after
the detrimental ﬁxation of EQU. The EQU-granting mutation trades
a 21 task (NOT) and a 24 task (NOR) for the 25 EQU task; the task
trade-oﬀ net gain is 0. The next mutation on the lineage regains the
24 NOR task. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

87

Task performance snapshot of Population 1191 before, at, and after
the detrimental ﬁxation of EQU. The EQU-granting mutation trades
both 22 tasks (AND, ORN) and a 24 task (NOR) for the 25 EQU
task and a 23 task (ANDN); the task trade-oﬀ net gain is 0. The
next mutation on the lineage trades the 23 ANDN task for the 24
NOR task. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

87

xii

LIST OF FIGURES

Figure 2.1

Figure 3.1

Figure 3.2

Figure 3.3

Figure 3.4

The Avidian genetic program and virtual hardware. Circles on the
left represent instructions in a portion of the genome. The green box
represents the virtual hardware that executes the genetic program,
with three registers, two stacks, and input/output capabilities. The
instruction pointer is currently at the input instruction; the ﬂow,
read, and write heads are not shown. (From Beacon 101: Experimental Evolution with Digital Organisms. Used with permission.)
For interpretation of the references to color in this and all other ﬁgures, the reader is referred to the electronic version of this dissertation.

8

A representative pattern of mutational distance over time in a population evolving in an unlimited-resource environment. The x-axis
spans 100,000 updates, while the y-axis counts the mutational steps
from each currently living genotype to the original ancestor. Color
values give the natural log of the number of organisms at a particular
mutational distance. The population as a whole never experiences
adaptive radiation, and so each member genotype remains similarly
distant from the common ancestor. . . . . . . . . . . . . . . . . . . .

21

A representative pattern of mutational distance over time in a population evolving in a limited-resource environment with the hybrid
reward system (Table 3.2) and RIN F LOW = 100. The x-axis spans
100,000 updates, while the y-axis counts the mutational steps from
each currently living genotype to the original ancestor. Color values give the natural log of the number of organisms at a particular
mutational distance. The population experiences negative frequencydependent selection due to the hybrid reward system, and divergent
lineages emerge, each with a diﬀerent mutational distance to the common ancestor. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

23

Diversity distributions across inﬂow rates, measured as the phenotypic Shannon entropy of all viable organisms in the population. Data
for inﬂows 1, 3, 1000, 3000, and inﬁnite are drawn from 20 populations; inﬂows 10, 30, 100, and 300 from 200 populations. . . . . . . .

25

Number of populations for which some genotype in the ﬁnal population can perform EQU across a range of resource inﬂow rates. Data
for all inﬂows is drawn from 20 populations. . . . . . . . . . . . . .

26

xiii

Figure 3.5

Figure 4.1

Figure 5.1

Figure 5.2

Figure 5.3

Figure 5.4

Figure 5.5

Figure 5.6

Number of populations which evolved tasks requiring fewer NAND
operations than the EQU task. The ﬁrst seven tasks (only NOT is
shown) showed results across resource inﬂows that were qualitatively
similar to the NOT task shown here, with low numbers of populations
evolving the task at RIN F LOW = 1 and very high numbers for all
other inﬂow rates. The XOR task shows results qualitatively similar
to the EQU task, with XOR being more likely to evolve at intermediate values of RIN F LOW . Data for all inﬂows and tasks is drawn
from 20 populations. . . . . . . . . . . . . . . . . . . . . . . . . . .

27

Shannon diversity vs. ﬁxation time of EQU. (A) Diversity of each
population that evolved EQU versus time at which EQU ﬁxed on the
line of descent. Blue markers represent populations evolved in the
unlimited-resource environment, while red represent those evolved
in the (100-inﬂow) limited-resource environment. Symbols denote
the ﬁtness eﬀect of the EQU-granting mutation as evaluated in the
unlimited-resource environment. Beneﬁcial mutations are marked
with +, neutral with , and detrimental with ; the legend indicates
how many mutations in each treatment fall into these categories. Histograms show the distribution of (B) the ﬁxation of EQU over time
and (C) the phenotypic diversity when EQU ﬁxes. Appendix C explains how two of the unlimited-resource populations were able to ﬁx
detrimental EQU-granting mutations. . . . . . . . . . . . . . . . . .

38

Community diversity vs. fractional community success for evolved
communities in competition. Spearman’s ρ = 0.5129. . . . . . . . . .

52

Community generality vs. fractional community success for evolved
communities in competition. Spearman’s ρ = −0.5207. . . . . . . . .

54

Community generality vs. fractional community success for randomly
assembled communities in competition. Spearman’s ρ = 0.4471. . . .

55

Community generality vs. fractional community success for ecologically complete assembled communities in competition. Spearman’s
ρ = 0.4178. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

56

Community breadth vs. fractional community success for randomly
assembled communities in competition. Spearman’s ρ = 0.4558. . . .

57

Community breadth vs. fractional community success for ecologically
complete assembled communities in competition. Spearman’s ρ =
0.3419. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

58

xiv

Figure 5.7

Community breadth vs. fractional community success for specialistpreferred ecologically complete assembled communities in competition. Spearman’s ρ = 0.3422. . . . . . . . . . . . . . . . . . . . . . .

59

Community resource use eﬃciency vs. fractional community success
for evolved communities in competition. Spearman’s ρ = −0.4525. .

60

Community resource use eﬃciency vs. fractional community success
for randomly assembled communities in competition. Spearman’s
ρ = −0.6293. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

61

Community resource use eﬃciency vs. fractional community success for ecologically complete assembled communities in competition.
Spearman’s ρ = −0.5003. . . . . . . . . . . . . . . . . . . . . . . . .

62

Community resource use eﬃciency vs. fractional community success
for specialist-preferred ecologically complete assembled communities
in competition. Spearman’s ρ = −0.3949. . . . . . . . . . . . . . . .

63

Figure B.1

The NAND logic gate in a circuit diagram. . . . . . . . . . . . . . .

78

Figure B.2

The NOT function implemented with NAND in a circuit diagram.
Feeding the input A into both inputs of a NAND gate produces ¬A.

79

The AND function implemented with NAND gates in a circuit diagram. AND is implemented with two NAND gates. . . . . . . . . .

80

The ORN function implemented with NAND gates in a circuit diagram. We apply De Morgan’s law to see that the computed result,
¬(¬A ∧ B), is equivalent to A ∨ ¬B, the ORN function. . . . . . . .

81

The OR function implemented with NAND gates in a circuit diagram. We apply De Morgan’s law to see that the computed result,
¬(¬A ∧ ¬B), is equivalent to A ∨ B, the OR function. . . . . . . . .

81

The ANDN function implemented with NAND gates in a circuit diagram. Implementing ANDN requires three NAND gates. . . . . . .

82

The NOR function implemented with NAND gates in a circuit diagram. Implementing NOR requires four NAND gates; we apply De
Morgan’s law to observe that the output, which simpliﬁes to ¬A∧¬B,
is equivalent to ¬(A ∨ B), the NOR function. . . . . . . . . . . . . .

83

Figure 5.8

Figure 5.9

Figure 5.10

Figure 5.11

Figure B.3

Figure B.4

Figure B.5

Figure B.6

Figure B.7

xv

Figure B.8

Figure B.9

The XOR function implemented with NAND gates in a circuit diagram. Implementing XOR requires four NAND gates; we simplify
the result by following the derivation backward to obtain (A ∧ ¬B) ∨
(¬A ∧ B), the XOR function. . . . . . . . . . . . . . . . . . . . . . .

84

The EQU or XNOR function implemented with NAND gates in a
circuit diagram. Implementing EQU requires ﬁve NAND gates; the
result simpliﬁes to (¬A ∧ ¬B) ∨ (A ∧ B), the EQU function. . . . .

85

xvi

Chapter 1
Introduction
In this introductory chapter, I provide a brief overview of my study system and the two
central topics of this thesis, communities and complex features: the formal details of these
concepts are more fully addressed in each chapter as they arise. In Chapter 2, I provide a
detailed description of my study system, the Avida digital evolution platform, and deﬁne
some experimental environments and terminology used throughout this thesis. In Chapter 3, I examine the relationships between resource limitation, community diversity, and the
evolvability of complex features. In Chapter 4, I explore other negative frequency-dependent
eﬀects of resource limitation on the evolvability of complex features. I show that the reduced
cost of trading the ability to perform a common task for a rare one increases the frequency of
the evolution of a complex feature. In Chapter 5, I investigate community-level predictors of
success in competition with another community. Finally, in Chapter 6, I describe extensions
that will carry this work forward.

1.1

Study System

I use the digital evolution software Avida (Ofria et al., 2009), which allows precise manipulation of resource supply and a complete record of the course of evolution. The Avida system
consists of a collection of digital organisms, each with a simple circular genome composed of
instructions from an assembly-like Turing-complete instruction set.

1

By executing the instructions in its genome, each organism is capable of self-reproduction;
during this process, copy mutations may be introduced into the oﬀspring’s genome. Because
the genetic instructions are drawn from a Turing-complete language, the organisms are also
theoretically capable of any Turing-computable task1 . The organisms have access to integers
that they can manipulate; the researcher can choose to reward certain manipulations, known
as tasks, with additional CPU cycles. These additional CPU cycles allow the organism to
execute its genome more quickly and thus increase ﬁtness. By tying the reward for each
task to a separate depletable resource, we can achieve adaptive radiation driven by negative
frequency-dependent selection.
Chapter 2 contains a more detailed explanation of this study system.

1.2

Communities

As in nature, negative frequency-dependent selection in Avida drives the evolution of ecologically stable communities that consist of multiple genotypes exploiting diﬀerent niches
(Cooper & Ofria, 2002). These communities are of interest in this thesis for three reasons. First, by using communities, results in Avida are more comparable to many real-world
phenomena; the single-niche populations we often study are more tractable than communities, but correspond closely only to monocultures and laboratory experiments. Second, the
evolution of a community under negative frequency-dependent selection would seem to be
inherently diﬀerent from the evolution of a single-niche population under no such diversifying selection; I study this diﬀerence in the context of the evolution of complex features
(see Section 1.3 below). Third, the interaction between community members is of interest; I
1 That

is, they are theoretically capable of any computation that any other traditional
computer can perform.
2

examine this speciﬁcally in the context of competition between communities by using both
evolved and artiﬁcially assembled communities.
In Chapter 5, I demonstrate that the outcome of such competitions is quite consistent.
What, then, makes one community more likely to out-compete another community when
the two come into contact? In nature, such community-community clashes are rare, though
likely to be important; when we study a community’s success in competition, it is usually
in reference to one or perhaps a few invasive species. Using a digital evolution system,
we can expand questions about resistance to invasion (many vs. one) to questions about
success in competition with another fully-evolved community (many vs. many). For example,
diverse communities tend to be invasion-resistant (Case, 1990; Kennedy et al., 2002; Tilman,
2004; Zavaleta & Hulvey, 2004; Maron & Marler, 2007); is diversity also a predictor of how
well a community will survive when two communities meet? Or is success in community
competition driven by some other factor, such as specialization or resource utilization?
Chapters 3 and 4 examine the evolution of complex features in communities; Chapter 5
investigates community success in competition for both evolved and artiﬁcially assembled
communities.

1.3

Complex Features

Digital organisms are little more than computer programs, and therefore have no complex
organs in the classical sense (e.g., the lensed eye). Avidian organisms do, however, have
concrete behaviors. Among these is the ability to perform certain mathematical tasks. We
call the more diﬃcult of these tasks “complex” in the same way that certain organs such as
the eye are complex. In this the case of these digital organisms, doing the mathematical tasks

3

requires coordination of several subtasks and many locations in the genome, and removing
any piece of the whole computational structure means the organism is now completely unable
to do the task. I use the broad term “complex feature” to encompass both complex organs
and complex mathematical tasks.
There is strong evidence that evolution by natural selection can and does produce complex
features in the natural world; for an excellent review, see Gregory (2008). Certainly it does so
in Avida (Lenski et al., 2003). My work investigates a diﬀerent question: given that complex
features can and do evolve, are there simple ecological selective pressures that cause them
to evolve more often, or more quickly? Conversely, are there pressures that decrease the
likelihood of their evolution? What are these pressures? How and why do they aﬀect the
evolution of complex features?
I have focused on a set of speciﬁc questions about the evolution of complex features in
multiple-niche communities. Why might the evolution of complexity be diﬀerent in a community than in a single-niche population? First, the negative frequency-dependent nature
of a limited-resource environment encourages adaptive radiation and maintains diversity
(Tilman, 1982; Schluter, 1996; Chesson, 2000; Hall & Colegrave, 2007; Abrams et al., 2008),
resulting in broader exploration of the ﬁtness space (Friedrich et al., 2009). Even na¨
ıvely, we
may expect that exploring more possibilities will result in a complex feature appearing more
frequently. Second, mutations causing the loss of access to depleted resources have a reduced impact, thus facilitating the crossing of ﬁtness valleys (Weissman et al., 2009). Third,
the evolution of any feature often requires trade-oﬀs (Roﬀ and Fairbairn 2007, Lenski et al.
2003). When an adaptive trait corresponds to an untapped resource, its high availability
mitigates the loss of traits tied to depleted resources, making the trade-oﬀ worth-while. I
explore these hypotheses in Chapters 3 and 4.
4

1.4

Links to Evolutionary Computation

The ﬁeld of evolutionary computation focuses on evolving solutions to complex problems.
Although the core idea comes from evolutionary biology, until recently the computational
ﬁeld has tended to be limited in its adoption of new biological ideas, and vice versa. Preserving diversity on the path to complex solutions is a major research area in evolutionary
computation (Friedrich et al., 2009). The research I describe below brings these results, designed for encouraging the evolution of solutions to complex problems, back into the context
of evolutionary theory, and reveals their relevance to questions in evolutionary biology about
the evolution of complex features.

5

Chapter 2
The Avida Digital Evolution Platform
In the Avida digital evolution platform (Ofria et al., 2009), self-replicating computer programs must compete for limited space in a virtual world. Avida is a true instantiation of
open-ended evolution, often producing new and unexpected survival strategies. Digital organisms in Avida are subject to errors (mutations) as they copy their genetic programs, as
well as selective pressures for survival and rapid replication. As in nature, almost all mutations are detrimental or neutral, but occasional beneﬁcial mutations do occur (Lenski et al.,
1999). These rare beneﬁcial mutations typically allow digital organisms to either metabolize
additional resources, boosting the energy available for oﬀspring production, or else improve
eﬃciency, reducing energy requirements.
Avida has been under development for almost two decades and has been used to study,
among a host of other topics, complexity (Lenski et al., 1999; Adami et al., 2000; Lenski
et al., 2003; Ofria et al., 2008), robustness (Lenski et al., 1999, 2006; Elena et al., 2007),
ﬁtness landscapes (Wilke et al., 2001), adaptive radiation (Cooper & Ofria, 2002; Chow
et al., 2004), and sexual reproduction (Misevic et al., 2006). The software is open-source
and available without cost at http://avida.devosoft.org.
Some sections of this chapter are adapted from Walker & Ofria (2012).

6

2.1

Avidians and the Avida World

We refer to digital organisms in the Avida system as “Avidians”. An Avidian consists
of two basic components: a genetic program and the virtual hardware that executes that
program. The genetic program is a series of computer instructions drawn from an alphabet of
simple assembly-like Turing-complete instruction set (see Appendix A). Through mutations,
these instructions can be substituted with other instructions, deleted, or be inserted into
the genetic program; they form the genome of the organism. The simulated hardware, or
virtual CPU, supplies the mechanisms to execute the genomic instructions. Four heads (the
instruction pointer, ﬂow head, read head, and write head) allow looping, conditional logic,
and self-replication. Three registers and two depth-10 stacks provide storage for any data
the organism is manipulating; their use is controlled by the genetic instructions.
The world of Avida consists of either a toroidal1 or bounded grid of cells, set to 60 × 60
in this work; each cell is either empty or holds one Avidian. In the work presented in this
thesis, organisms remain in the same grid cell for their entire lives. Each cell also contains
three random integer inputs that the resident Avidian may use as input for the calculations
it performs; these values are randomly chosen per cell, and are independent values from one
cell to another.
Time in Avida is measured in updates; each update corresponds to an average number
of 30 instruction executions per organism in the population. Organisms running quickly will
execute more than 30 instructions per update, while slow organisms will execute fewer.
Populations in Avida may begin with any ancestor organisms chosen by the researcher.
In this thesis, unless otherwise stated, an initially empty world is seeded with 3600 clones of
1 i.e.,

when an organism moves oﬀ of one side of the grid, it appears on the other side.

7

Figure 2.1: The Avidian genetic program and virtual hardware. Circles on the left represent
instructions in a portion of the genome. The green box represents the virtual hardware that
executes the genetic program, with three registers, two stacks, and input/output capabilities.
The instruction pointer is currently at the input instruction; the ﬂow, read, and write heads
are not shown. (From Beacon 101: Experimental Evolution with Digital Organisms. Used
with permission.) For interpretation of the references to color in this and all other ﬁgures,
the reader is referred to the electronic version of this dissertation.

an asexual ancestor organism capable only of self-replication.

2.2

Self-Replication in Avida

By executing its genome, each (viable) Avidian is capable of self-reproduction; during this
process, copy mutations may occur in the oﬀspring’s genome. Both insertion and deletion
mutations may also occur in the oﬀspring’s genome when it is born.

8

Organisms reproduce into any of the nine cells surrounding and including the Avidian
itself, preferring empty cells if any are available. Because the Avidian world grid can accommodate only a ﬁxed number of organisms (e.g., 3600), and newly born organisms overwrite
current organisms, the basic evolutionary pressure in Avida is the competition for space,
or equivalently, the need to reproduce before being overwritten. The predominant selective
pressure is therefore the pressure to reproduce more quickly, either by reducing the number
of instruction steps necessary for reproduction or by completing tasks which increase the
rate (relative to other organisms) at which the organism executes instructions.
In the experiments described in this thesis, the world grid begins ﬁlled with Avidians.
The only way for an Avidian to be removed from the grid is for it to be overwritten by
another Avidian or to die. Organisms in Avida die if they live for “too many” (in this thesis,
30 × their genome length) instruction cycles without replicating.

2.3

Boolean Logic Tasks

As noted above (Section 2.1), each organism has access to three 32-bit integers that it can
manipulate; the researcher can choose to reward certain manipulations with additional CPU
cycles. These additional CPU cycles allow the organism to execute its genome more quickly
and thus increase ﬁtness. In this thesis, Avidians are rewarded for performing Boolean
bitwise logic functions (see Table 2.1).
When an Avidian outputs a 32-bit integer, the system checks to see if that integer is the
solution to any of nine Boolean bitwise logic tasks, using any of the integers available in that
cell. If so, the organism is rewarded by an increase in metabolic rate2 , allowing it to use
2 Metabolic

rate was called “merit” in previous Avida works.

9

more CPU cycles per update. In essence, its metabolism increases with respect to its peers.
In my work, the magnitude of these rewards is based on the diﬃculty of the task. There
are many ways to evaluate diﬃculty; the convention in Avida is to rank tasks by the amount
of code required to implement them. Since the only logical operator in the Avidian instruction set is nand, we approximate the implementation diﬃculty of each logical task by
counting the minimal number of logical NAND operations necessary to compute it (Table
2.1).
Task name

Boolean operation

Task value (V)

NOT
NAND
AND
ORN
OR
ANDN
NOR
XOR
EQU

¬A; ¬B
¬(A ∧ B)
A∧B
(A ∨ ¬B); (¬A ∨ B)
A∧B
(A ∧ ¬B); (¬A ∧ B)
¬A ∧ ¬B
(A ∧ ¬B) ∨ (¬A ∧ B)
(A ∧ B) ∨ (¬A ∧ ¬B)

1
1
2
2
3
3
4
4
5

Table 2.1: NAND-count-based task reward values. The symbol “¬” denotes negation, while
semicolons separate symmetrical functions. See Appendix B for a fuller treatment.

Rewards may be applied in an “additive” or a “power” structure. To increase the diﬀerences in rewards, we use 2V rather than V to modify metabolic rate. In the additive reward
structure, performance of a task with value V is rewarded by adding 2V to the organism’s
metabolic rate. In the power reward structure, the organism’s metabolic rate is multiplied by
2V . Details of the particular reward structures used in this thesis are given in the chapters
in which they are discussed.

10

2.4

Limited Resources

The Avida default is to always give the same reward for the same task. However, it is also
capable of linking depletable resources to tasks; resource amounts can then modulate task
rewards. These resources are implemented in a chemostat-like manner: a constant amount
of each resource ﬂows into the world, is depleted as it is used by organisms, and ﬂows out
at a constant rate. Each task has a separate linked resource, and all are non-spatial; all
organisms access the same resource pools.
I refer to any such environment as a limited-resource environment, although, of course,
the degree to which resources are limited depends on inﬂow rate (RIN F LOW ), outﬂow rate
(ROU T F LOW ), and whether Avidians are performing tasks to deplete the resource pools.
The default Avida inﬂow and outﬂow rates for each resource pool are 100.0 units of resource
per update and 1% of the available resource per update, respectively; inﬂow and outﬂow
happen smoothly throughout the course of the update.
A task with value V (see Table 2.1) using resource amount A receives a reward of 2V ×A .
Avidians use 0.25% of the relevant resource pool (with a cap of one unit of resource); more
resource in the pool therefore means a greater reward for performing the task. In eﬀect,
one can think of an environment without limited resources as one that has inﬁnite inﬂow,
allowing the organisms to always use 1.0 units of resource when performing tasks: I refer to
this as an unlimited-resource environment.
This system of limited resources creates negative frequency-dependent selection based
on the amount of each resource available, and allows the evolution of ecologically stable
communities consisting of multiple coexisting ecotypes (Cooper & Ofria, 2002).

11

2.5

Phenotypes and Ecotypes in Avida

A phenotype in Avida, as in natural biology, is a collection of traits describing how an
organism’s genotype is expressed during the organism’s lifetime. In most cases, we simplify
this measurement to how many times the organism can perform each of the Boolean logic
tasks over the course of its lifetime. This set of values gives information about both the
organism’s ability to perform complex functions and the drain it imposes on the world’s
resource levels.
When two or more genotypes have similar phenotypes, we refer to them as being in the
same ecotype. In environments where each distinct task is rewarded only once (Chapters 3
and 4), we deﬁne an ecotype as the group of genotypes which share the same phenotype.
In Chapter 5, where distinct tasks can be rewarded more than once, ecotypes are determined by the hcluster package (Eads, 2008) by agglomerative clustering of log-transformed
task vectors. I use the centroid linkage method with Euclidean distance. The hierarchical
clustering is collapsed into ﬂat clusters with a threshold of 1.0. These settings come from
unpublished work by Christopher Strelioﬀ, another Avida researcher. The results with these
settings appear reasonable; that is, in test communities, genotypes with very similar phenotypes are clustered together, and genotypes with obviously diﬀerent phenotypes fall into
diﬀerent clusters.
At several points in this thesis I also refer to the binary task signature of an organism.
This categorization is simply whether or not the organism can perform each task, a true/false
variant of the phenotype – in fact, exactly equivalent to the phenotype in environments which
reward each distinct task only once. In my experiments with environments which reward
each distinct task more than once (see Chapter 5), it is possible but not common for two or

12

more ecotypes to have the same binary task signature if they diﬀer strongly in the number
of times they do at least one task.

2.6

The Ecological Period

At the end of any period of time during which mutations can occur (an evolutionary period ),
the Avida population consists of many genotypes. Some are incapable of reproducing themselves (“inviable”); others are newly born mutants that have a much lower ﬁtness than their
competitors, but haven’t had time to be overwritten yet. Still others are near-duplicates
phenotypically, coexisting simply because they are both more ﬁt than the background. A
community with so many genotypes to consider is diﬃcult to analyze; while some subset of
these genotypes may be all that is needed to form a community that is ecologically stable,
it is diﬃcult to pick that subset out.
Therefore, for many experiments I include an ecological period when the evolutionary
period ends. Typically evolutionary and ecological periods each last 100,000 updates, for a
total of 200,000 updates. In this period, all mutations (substitution, insertion, and deletion)
are turned oﬀ. Competition in this period is purely ecological, allowing the community
to settle out into a core set of ecotypes; all of the inviable and low-ﬁtness genotypes are
weeded out early, and ﬁt genotypes compete in an environment where small ﬁtness diﬀerences
dramatically reduce the probability for phenotypically-identical genotypes to coexist.
In some cases genotypes may be so phenotypically similar that an ecological period of
100,000 updates is not suﬃcient to drive one to extinction, which is why we characterize
the phenotypes of the ﬁnal population either in terms of clustered ecotypes or genotypes
grouped by binary task signature. However, the smaller set of genotypes remaining after an

13

ecological period makes this process far more tractable and the results intuitively veriﬁable.

14

Chapter 3
Evolutionary Potential is Maximized
at Intermediate Diversity
When Darwin ﬁrst posed his theory of evolution by natural selection, critics denounced its
ability to produce traits of signiﬁcant complexity (Mivart, 1871). Darwin responded by
noting that traits of lesser complexity can be used as building blocks and that most complex
problems can be solved in many ways, only one of which needs to be discovered by evolution
(Darwin, 1872). Evidence for these ideas has accumulated in the historic record, where
the evolutionary origins of a range of complex traits are now well understood (Sage, 2004;
Gregory, 2008; Flowers et al., 2010), as well as in theoretical proofs (Melendez-Hevia et al.,
1996; Nilsson & Pelger, 1994) and in computational experiments (Lenski et al., 2003; Graham
et al., 2007). However, the dynamics that drove the evolution of these complex traits are
not always clear.
Lenski et al. (2003) have investigated the evolutionary origin of complex features in Avida,
using Boolean EQU as the speciﬁc complex task under study. This is the most complex of
the one- and two-input Boolean operations to calculate, requiring at least ﬁve logical NAND
operations. An Avidian organism requires at least 19 coordinated instructions to perform
EQU, including the execution of at least ﬁve nand instructions. The ancestor starts out with
none of these instructions in its genome; Lenski et al. found that in the 23 of 50 populations

15

that evolved EQU in their experiments, EQU evolved in anywhere from 51 to 721 mutational
steps.
In practice, the evolution of EQU is dependent on rewarding building blocks: the oneand two-input Boolean tasks of lower complexity. When Lenski et al. evolved populations
in environments where EQU was the only rewarded task, none of the populations evolved
EQU. However, they also found that the evolution of EQU does not depend on any particular
building block or pair of building blocks. In fact, EQU can evolve in many diﬀerent ways;
all 23 of Lenski et al.’s EQU-evolving populations evolved building blocks in diﬀerent orders
and organized them diﬀerently in their genomes.
Computational experiments like Lenski et al. (2003) have focused on single-niche populations with minimal ecological forces at play. The theory can be logically extended to
multiple-niche communities that more closely mimic complex natural environments. In my
initial investigations, I designed a limited-resource environment with RIN F LOW = 100 in
parallel to Lenski et al.’s unlimited-resource environment (see Section 3.2, below, for details). I tested whether limited resources aﬀect the rate of complex trait evolution. In
the unlimited-resource treatment, EQU evolved in only 67 of 200 populations by the end of
100,000 updates1 , whereas 167 of 200 evolved EQU in this same period in the limited-resource
treatment, a highly signiﬁcant diﬀerence (p < 10−23 , Fisher’s exact test, two-tailed).
This result is strong evidence that limited resources promote the evolution of novel complex features, but it does not elucidate the speciﬁc mechanisms that are responsible for this
eﬀect. In this chapter, I examine diversity as a possible mechanism.
Some sections of this chapter have been adapted from Walker & Ofria (2012).
1 For

the treatments in this chapter and Chapter 4, the median number of generations
in 100,000 updates was 27,545.85. Th’e ﬁfth percentile was 14,322.845 generations, and the
95th was 31,601.475 generations.
16

3.1

Diversity as a Potential Factor in the Evolution of
Complex Features

Diversity in a population is often cited as a major facilitator for the evolution of new complex
features (Friedrich et al., 2009). The intuition behind this dynamic is that if a population
is exploring multiple regions of a ﬁtness landscape, more opportunities exist to ﬁnd new
functionality. Since limited-resource environments possess multiple niches (Gavrilets & Vose,
2005; Hall & Colegrave, 2007), facilitating diverse populations and increasing exploration of
the ﬁtness landscape (Sareni & Kr¨henb¨hl, 1998; Friedrich et al., 2009), I postulate that
a
u
this diversity is a driving factor of the more frequent evolution of EQU in a limited-resource
environment in Avida. In this chapter, I explore the eﬀect of multiple limited resources on
phenotypic Shannon diversity and, in turn, on the evolvability of the EQU complex task.
Of the many types and measures of diversity, we choose to examine phenotypic Shannon
diversity. We choose phenotypic over genotypic diversity because we are interested in how
limited resources promote task acquisition; since tasks deﬁne an organism’s phenotype, and
only phenotype aﬀects the environment, phenotypic diversity is the natural choice. (Note
that, since the environments in this chapter reward each distinct task only once, grouping
genotypes by phenotype, ecotype, or binary task signature is equivalent: of these possible terms, I choose to refer to phenotypic diversity since it is most likely to be intuitively
understood.)
We choose to measure phenotypic diversity as the Shannon entropy of these groupings
because Shannon entropy eﬀectively balances the two main interesting qualities in diversity:
the range of possible results and the evenness in their distribution. To be precise, we calculate

17

phenotypic Shannon diversity as:

H=−

P (xi ) log(P (xi ))

(3.1)

i

where xi is each genotype group, and P (xi )) is the number of organisms in that group divided
by the number of organisms total in the population.
We show that Shannon diversity peaks at intermediate levels of resource availability to
the population, and we map the evolvability of the EQU task on this availability-diversity
gradient. While the evolvability of EQU is highest at intermediate availabilities, it does not
peak at the same resource inﬂow level as phenotypic Shannon diversity, and it is more robust
than diversity in its response to inﬂow level. These results indicate that while phenotypic
Shannon diversity may play into the evolution of complex features, the selective pressures
caused by diversity cannot be the only pressures behind such evolution.

3.2

Experimental Environments

Resource inﬂow and availability is a major factor aﬀecting ecosystem diversity (Tilman,
1982; Chesson, 2000; Hall & Colegrave, 2007; Abrams et al., 2008; Cardinale et al., 2009).
I explore the eﬀect of the availability of multiple limited resources on phenotypic Shannon
diversity, and use this availability-mediated diversity gradient to examine the relationship
between Shannon diversity and the evolution of complex features.

18

3.2.1

The unlimited-resource environment

In their investigation of the evolutionary origin of complex features, Lenski et al. rewarded
digital organisms once for each distinct Boolean task performed. The value of each task
corresponded to its complexity as approximated by the minimum number of Boolean NAND
operations necessary for its performance (see Table 3.1), using a power reward system as
discussed in Section 2.3.
Function name Reward
×21
×21
×22
×22
×23
×23
×24
×24
×25

NOT
NAND
AND
ORN
OR
ANDN
NOR
XOR
EQU

Table 3.1: Task rewards in Lenski et al. (2003). An organism that performs a task has its
current execution rate multiplied by the amount of the task’s reward; it can be rewarded for
each task only once per lifetime.

3.2.2

The limited-resource environment

Chow et al. (2004) have previously performed an investigation into the relationship between
resource inﬂow and diversity in Avida. They measured diversity as species richness. Since the
digital organisms are asexual, Chow et al. used a clustering algorithm based on phylogenetic
distance to determine which genotypes belonged to the same “species”. Species richness in
this setup was the result of negative frequency-dependent selection due to nine depletable
resource pools, each linked to a task. RIN F LOW units of resource ﬂow into each resource pool

19

at a constant rate over each update, and 1% of each pool ﬂows out, modeling a chemostat.

Inf low : RT ASK = RT ASK + RIN F LOW
Outf low : RT ASK = 0.01 × RT ASK

(3.2)
(3.3)

Chow et al. used the same set of Boolean computational tasks as Lenski et al., but linked
each task to a separate resource pool. The amount of resource in a resource pool (RT ASK )
determines the beneﬁt to be gained by performing the associated task; they did not incorporate the diﬃculty (via NAND-count) of tasks into their environmental design. An individual
organism depletes AT ASK units of resource from the task-linked pool when performing a
Boolean task, reducing the amount available to the rest of the population and resulting in
negative frequency-dependent selection (Cooper & Ofria, 2002). Rewarding an organism for
the performance of a task again consists of multiplying its current execution count by the
amount of the reward.

AT ASK = 0.0025 × RT ASK
Depletion : RT ASK = RT ASK − AT ASK
Reward : ×2AT ASK

(3.4)
(3.5)
(3.6)

In the environments of Chow et al., high inﬂow rates result in populations that converge
on a single genotype specialized on replication eﬃciency: these rarely perform more than
one or two of the simpler Boolean tasks. This is because the authors do not incorporate the
diﬃculty of the task into the task’s reward; at high resource abundance, there is little to
no pressure to seek new resources, and thus no reason to do diﬃcult tasks. On the other

20

hand, the environments of Lenski et al. reward tasks based on diﬃculty. However, without
the negative frequency-dependence of Chow et al.’s environments, populations converge to
a single generalist genotype that performs all tasks (Figure 3.1).

Mutational Distance Over Time (Seed 1002)
7.2
6.4
350

5.6

300

4.8

250

4.0

200

3.2

150

2.4

100

1.6

50

ln(# of organisms)

# of Mutational Steps From Ancestor

400

0.8

0

0

200

600
400
Time in Updates ×100

800

1000

0.0

Figure 3.1: A representative pattern of mutational distance over time in a population evolving
in an unlimited-resource environment. The x-axis spans 100,000 updates, while the y-axis
counts the mutational steps from each currently living genotype to the original ancestor.
Color values give the natural log of the number of organisms at a particular mutational
distance. The population as a whole never experiences adaptive radiation, and so each
member genotype remains similarly distant from the common ancestor.

In studying the eﬀect of resource supply on both phenotypic Shannon diversity and the
evolution of complex features, it is useful to create environments in which both the diﬃculty
of the task and its rarity in the population (via the availability of its associated resource)
contribute to the reward that organism receives for performing that task. Moreover, since we
21

use Lenski et al.’s unlimited-resource environment as a control, our environment should diﬀer
from theirs as little as possible. To this end, we have devised a limited-resource environment
starting with Lenski et al.’s diﬃcultly-dependent reward scheme, but where linked resource
pools mediate the amount of the reward as in Chow et al.; Table 3.2 describes this hybrid
reward scheme; Figure 3.2 shows a typical example of how genetic distances from the ancestor
diverge over time in the hybrid reward system.
Task

# NAND

Depletion

Reward

NOT
NAND
AND
ORN
OR
ANDN
NOR
XOR
EQU

1
1
2
2
3
3
4
4
5

AN OT
AN AN D
AAN D
AORN
AOR
AAN DN
AN OR
AXOR
AEQU

×21×AN OT
×21×AN AN D
×22×AAN D
×22×AORN
×23×AOR
×23×AAN DN
×24×AN OR
×24×AXOR
5×AEQU
×2

Table 3.2: Hybrid task rewards, based both on task complexity and resource availability
(AT ASK denotes the number of resource units an organism uses from the T ASK’s pool).
An organism that performs a task has its current execution rate multiplied by the amount
of the task’s reward; it can be rewarded for each task only once per lifetime.

3.2.3

Speciﬁc conﬁguration details

Our experiments use a development version of Avida 2.12.3 with the default instruction
set (inst-heads.cfg). The executable was complied from publicly-available source code (at
avida.devosoft.org); the speciﬁc revision is tagged “walker-complexfeatures”.
We set the per-site copy mutation rate to 0.0025, while we left the per-divide rates of
insertion and deletion mutations at the Avida default value, 0.05 each (meaning that there
was a 5% chance of a single insertion and a 5% chance of a single deletion at each organism’s
22

Mutational Distance Over Time (Seed 2004)

7.2
6.4

350
5.6
300

4.8

250

4.0

200

3.2

150

2.4

100

1.6

50

ln(# of organisms)

# of Mutational Steps From Ancestor

400

0.8

0

0

200

600
400
Time in Updates ×100

800

1000

0.0

Figure 3.2: A representative pattern of mutational distance over time in a population
evolving in a limited-resource environment with the hybrid reward system (Table 3.2) and
RIN F LOW = 100. The x-axis spans 100,000 updates, while the y-axis counts the mutational
steps from each currently living genotype to the original ancestor. Color values give the natural log of the number of organisms at a particular mutational distance. The population
experiences negative frequency-dependent selection due to the hybrid reward system, and
divergent lineages emerge, each with a diﬀerent mutational distance to the common ancestor.

birth). The ancestor organism is a modiﬁcation of the default ancestor that ships with
Avida, default-heads.org, to reduce its genotype from length 100 to length 50 by removing
50 lines of “blank tape” no-op instructions; we cut it down from the current default length
because the ancestor organism in Lenski et al. (2003) is also 50 lines long. (Chow et al. used
a 100-length ancestor.)

23

3.3

Diversity Peaks at Intermediate Productivity

Of the resource inﬂow rates we examined, the intermediate RIN F LOW of 10 (Figure 3.3) had
the highest diversity; observing the highly unimodal trend of these data, we conclude that
diversity in this system peaks somewhere between an RIN F LOW of 3 and 30. At lower inﬂow
rates, Shannon diversity drops oﬀ sharply; with too-low resource levels, each pool supports
too few organisms to make any substantial impact on diversity. The number of phenotypes
may remain high, but the Shannon entropy of the population as a whole is low. At higher
inﬂow rates, diversity drops more gradually as resources become so plentiful they might as
well be unlimited. Indeed, at inﬂow levels of 1000 and above, negative frequency-dependent
pressures are eﬀectively removed. This result is consistent with the results in other studies
of the eﬀects of resource supply on diversity (e.g. Kassen et al., 2000; Chow et al., 2004; Hall
& Colegrave, 2007)
Median diversity
At ﬁrst appearance of EQU
At ﬁxation of EQU
At end of run

∞ Inﬂow 100-unit Inﬂow
.034
.314
.314

.962
.948
.877

Signiﬁcance
p = 4.334 × 10−56
p = 5.515 × 10−29
p = 2.732 × 10−53

Table 3.3: Detailed phenotypic diversity measures during the evolution of EQU. Shannon
diversity of viable phenotypes in 200 runs each of the inﬁnite- and 100-unit-inﬂow environments is measured at three points: the ﬁrst appearance of EQU in the population, its
ﬁxation on the line of descent, and the end of the run (100,000 updates). P-values compare
the diversity of the two treatments using a two-tailed Mann-Whitney U test with a sequential
Bonferroni correction.

24

Diversity Distributions over Resource Inﬂow Rates
1
3

RIN F LOW

10
30
100
300
1000
3000
inﬁnite
0
0.5
1
1.5
2
2.5
3
3.5
4
Phenotypic Shannon entropy of viable organisms
Figure 3.3: Diversity distributions across inﬂow rates, measured as the phenotypic Shannon
entropy of all viable organisms in the population. Data for inﬂows 1, 3, 1000, 3000, and
inﬁnite are drawn from 20 populations; inﬂows 10, 30, 100, and 300 from 200 populations.

3.4

The Evolution of EQU Peaks at (Higher) Intermediate Productivity

We are now equipped to examine the evolvability of complex features on this resource inﬂow
gradient, and to observe how it relates to the corresponding Shannon diversity gradient. In
this case, we measure the evolvability of complex features by the proportion of populations
that have evolved EQU by the end of 100,000 updates. At 20 populations per treatment
(Figure 3.4), it is clear that the evolvability of EQU is highest at intermediate productivities.
Indeed, between the intermediate inﬂow levels of 10 and 300 units per resource per update,
the evolvability of EQU seems robust to increasing resource supply and decreasing phenotypic
25

Shannon diversity.

RIN F LOW

Evolution of EQU over Resource Inﬂow Rates
1
3
10
30
100
300
1000
3000
inﬁnite

0
5
10
15
20
# of populations that evolved EQU by 100,000 updates
Figure 3.4: Number of populations for which some genotype in the ﬁnal population can
perform EQU across a range of resource inﬂow rates. Data for all inﬂows is drawn from 20
populations.

To determine whether only complex tasks are sensitive to resource supply levels, we also
examined the evolvability of the other 8 tasks rewarded in this environment (Figure 3.5).
As a general trend, these tasks indicate that more complex tasks are more sensitive to the
resource supply level.
We focused on the inﬂow rates in which populations were most successful in evolving
EQU (10, 30, 100, and 300), and performed 10 times as many experimental runs at each
to gain a higher resolution (Table 3.4). At this resolution, we saw that the evolvability of
EQU is not truly unaﬀected by the variation of resource supply and Shannon diversity in
this inﬂow range. The number of populations evolving EQU by the end of 100,000 updates
is signiﬁcantly higher at the 100 unit inﬂow rate than at the 10, 30, or 300 unit inﬂow
rates. While these data do not indicate the precise RIN F LOW at which the evolutionary
potential of EQU peaks, it is clearly a diﬀerent – and greater – RIN F LOW than that at
which phenotypic diversity reaches its peak.

26

RIN F LOW

Evolution of NOT over Resource Inﬂow Rates
1
3
10
30
100
300
1000
3000
inﬁnite

0
5
10
15
20
# of populations that evolved NOT by 100,000 updates

RIN F LOW

Evolution of XOR over Resource Inﬂow Rates
1
3
10
30
100
300
1000
3000
inﬁnite

0
5
10
15
20
# of populations that evolved XOR by 100,000 updates
Figure 3.5: Number of populations which evolved tasks requiring fewer NAND operations
than the EQU task. The ﬁrst seven tasks (only NOT is shown) showed results across resource
inﬂows that were qualitatively similar to the NOT task shown here, with low numbers of
populations evolving the task at RIN F LOW = 1 and very high numbers for all other inﬂow
rates. The XOR task shows results qualitatively similar to the EQU task, with XOR being
more likely to evolve at intermediate values of RIN F LOW . Data for all inﬂows and tasks is
drawn from 20 populations.

RIN F LOW

10

30

100

300

#pops/200

141

152

171

152

p-value

< 0.00001 < 0.045

N/A < 0.045

Table 3.4: Number of populations out of 200 that evolved EQU at intermediate inﬂow rates.
We performed a chi-squared test to determine if the evolvability of EQU for at least one
inﬂow rate diﬀered signiﬁcantly from the rest (p<.005, χ2 = 13.156, 3 degrees of freedom).
With this conﬁrmed, we calculated the signiﬁcance of each ratio’s diﬀerence from 171/200
(the result at RIN F LOW = 100) with Fisher’s exact test, two-tailed, and corrected with the
sequential Bonferroni correction; the n=2 correction was applied to both the RIN F LOW = 30
and RIN F LOW = 300 data, since they can be ordered arbitrarily.
27

3.5

Populations at Intermediate Diversity Evolve EQU
More Often Even When It Is Not Rewarded

The Shannon diversity of phenotypes in the 100-inﬂow limited-resource environment is substantially higher (Figure 3.3, Table 3.3; also see Figure 4.1), and both diversity and the
evolution of EQU peak within a range of intermediate resource rates. However, these data
demonstrate only correlation, and it is clear that the evolvability of EQU does not peak in
the same environmental conditions as phenotypic diversity.
To tease out the eﬀects of diversity itself, rather than some co-correlate, on the evolvability of EQU, we observed that if diversity is responsible for the more frequent evolution
of EQU via expanded exploration of the ﬁtness landscape, we expect mutations to lead to
EQU more frequently, even when EQU is a) unrewarded and b) prevented from ﬁxing.

3.5.1

Preventing the ﬁxation of EQU

To test this hypothesis, we created treatments in which EQU was unrewarded for three
important inﬂow rates: inﬁnite inﬂow (unlimited resources), 10-unit inﬂow (the inﬂow region
of highest diversity), and 100-unit inﬂow (the region of highest evolvability for EQU). We
prevent the appearance of EQU-granting mutations in the population by testing each new
organism before it is added to the population. We evaluate it in a test environment; if it
can perform EQU, we record that an EQU-granting mutation happened, and replace it with
a new mutation drawn from the original mutation distribution. We continue this test-andreplace (or “resampling”) process until the new organism cannot perform EQU; it is then
added to the population.
We choose to resample such mutations rather than simply sterilizing the new organism
28

because such sterilization increases the lethal load of mutations; all EQU mutations become
lethal. By resampling, we distribute the imbalance across all mutational eﬀects: the resampled mutation may be beneﬁcial, neutral, detrimental, or lethal at the same rate as any
non-resampled mutation. Further explanation of resampling and its eﬀects on mutational
load can be found in Covert III (2011).
Testing the new organism in a test environment is not ﬂawless. Such a test will catch
organisms that perform EQU deterministically, but not necessarily those which perform it
stochastically. Stochastically performing EQU is a rare behavior in our experiments, but
we do see it. To remove stochastically-performing EQU mutations from the population,
we change the reward for performing EQU to always multiply the performing organism’s
metabolic rate by 0. If any organism performs EQU, it is eﬀectively sterilized. Because such
mutations are not caught in the test environment, they are not recorded as EQU-granting
mutations. This sterilization induces a selective pressure for organisms to avoid portions of
the mutational landscape in which it is simple to evolve stochastic EQU performance. However, since there is nothing that diﬀerentially rewards stochasticity across the environments,
we believe that this does not aﬀect the overall trend.

3.5.2

Testing the EQU-granting mutation encounter rate

EQU-granting mutations were replaced in the unlimited-resource populations at a median
of 3.6 per 1000 updates, signiﬁcantly less frequently than 28.6 times in 100-unit inﬂow
populations (p < 10−8 , Mann-Whitney U test, two-tailed). The 10-unit inﬂow populations
encountered EQU-granting mutations only 2.9 times per 1000 updates, again signiﬁcantly
less often than the 100-unit inﬂow populations (p < 10−13 , Mann-Whitney U test, twotailed). Not only do populations in the unlimited-resource and 10-unit inﬂow environments
29

fail to ﬁx EQU as often as those in the 100-unit inﬂow environment, mutations in these
environments do not produce EQU as often in the ﬁrst place.

3.6

Low Individual Distinct Task Counts Limit the Evolution of EQU

We hypothesize that this greater resource availability and lesser phenotypic diversity represent environments where more-abundant resources allow the desperate scramble for survival
to relax slightly, allowing organisms to accumulate a collection of building blocks necessary
for complex tasks.
Lenski et al. showed that the evolution of EQU relies on simpler tasks as building blocks,
implying that fewer tasks performed by each individual might constrain the evolution of
EQU. We therefore examined task performance in each of the three environments. Although
the populations in the 10-inﬂow environment performed the same range of task types as
populations in other environments, each individual performed signiﬁcantly fewer distinct
tasks (Table 3.5).
RIN F LOW

# Tasks per individual
5th %ile

10
100
unlimited

50th %ile

95th %ile

1
4
5

2
6
6

3
8
7

Table 3.5: Number of distinct resources used per individual. The 5th, 50th, and 95th
percentiles for each environment are displayed. We compared the distributions from the
uncapped 100-inﬂow and unlimited environments to the uncapped 10-inﬂow environment
using a two-tailed Mann-Whitney U test; for both, p
10−12 .

To test the importance of the number of distinct tasks each individual performs, we con30

structed variations for each of the three environments in which organisms were rewarded for
a maximum of three distinct tasks, as summarized in Table 3.6. With this restriction, the
number of populations acquiring EQU in the limited-resource environment was not signiﬁcantly diﬀerent than in the severely-limited environment (131/200 vs. 122/200, p=0.4068,
Mann-Whitney U test). We conclude that the average number of tasks performed by an
individual limits the evolvability of EQU, even when the population is highly diverse.
RIN F LOW

# Tasks per individual

# Populations

5 %ile
10
100
unlimited

50 %ile

95 %ile

evolving EQU

1
3
3

2
3
3

3
3
3

122/200
131/200
17/200

Signiﬁcance
p = 0.5350 (vs. 129/200)
p = 4.711 × 10−4 (vs. 167/200)
p = 7.578 × 10−9 (vs. 67/200)

Table 3.6: Distinct tasks and the evolution of EQU with a three-task cap. The 5th, 50th,
and 95th percentiles for each environment are displayed. The number of populations that
evolved EQU in each treatment is also given; p-values compare the capped treatment to the
uncapped treatments within each environment using two-tailed Fisher’s Exact tests with a
sequential Bonferroni correction.

3.7

Conclusion

I conclude that although wider exploration of the ﬁtness landscape is certainly responsible
for more frequent evolution of EQU, diversity cannot be the only driver of this phenomenon.
While the evolvability of complex features is indeed high at peak Shannon diversity, it seems
that complex features require more productive environments than those which produce the
highest diversity to evolve most often.
Building block tasks, however, appear to be essential to the evolution of EQU. In addition to encouraging adaptive radiation and maintaining diversity, the negative frequencydependent selection in the limited resource environments makes task rewards dependent on
31

the population’s current task performance. In Chapter 4, I investigate how this environmentdependent ﬂux in task rewards mitigates the cost of trading one task for another, and how
the decreased penalty for switching out tasks plays into the evolution of EQU.

32

Chapter 4
Reduced Cost of Trade-Oﬀs in
Limited Resource Environments
Promotes the Evolution of Complex
Features
In Chapter 3, I discussed the close relationship between diversity and the potential for
evolving complex features in a limited-resource environment. However, increased diversity
is only one eﬀect of the negative frequency-dependent selection active in that environment:
tying task rewards to resource pool levels not only enables the development of stable multiniche communities, but also switches the emphasis of task value from task complexity to
task rarity among the members of the community.
Mutations that cause the loss of access to depleted resources have a reduced impact in
such an environment, which facilitates the crossing of ﬁtness valleys (Weissman et al., 2009).
In Section 4.1, I examine whether such mutations – that is, those in which at least one task
is lost, but no tasks are gained – have an eﬀect on the evolution of the EQU complex feature.
Extending this idea, I examine task trade-oﬀ mutations, in which the mutation causes
simultaneous task loss and task gain. In the unlimited-resource environment, such a trade-oﬀ

33

is deleterious if the new task’s reward is lower than the rewards any lost tasks. Moreover,
because the contribution of the lost tasks is always substantial, such a trade-oﬀ is never
ideal: in the unlimited-resource environment, ﬁnal populations usually consist of a single
dominant genotype which can perform all tasks (Lenski et al., 2003). However, limited
resources produce a frequency-dependent reward structure where losing access to multiple
depleted resources for a single untouched resource is almost always advantageous; the lost
tasks were not contributing signiﬁcantly to the organism’s ﬁtness. I hypothesize that this
eﬀect promotes novelty – in practice, it promotes the evolution of complex tasks.
Following the results of Chapter 3, I examine only one limited-resource environment in
this chapter, the 100-inﬂow variant that proved most friendly to the evolution of EQU.

4.1

Task Loss Without Gain

Deleterious mutations can help populations cross ﬁtness valleys and are thus important for
adaptive evolution (Covert III, 2011). We hypothesize that the reduced harm of task loss
when the associated resources are depleted may contribute to the more frequent evolution of
EQU in the limited-resource environment. Mutations causing task loss with no concurrent
gain immediately preceded the introduction of EQU in 42 of the 167 lineages that produced
this task in the limited-resource environment, but in only three of 67 such lineages in the
unlimited-resource environment. In fact, two of the task losses in the limited-resource lineages were actually beneﬁcial: the loss of the task was accompanied by a substantial increase
in eﬃciency.

34

4.1.1

Testing for task loss mutations: technical restrictions

In order to determine when a mutation causes a “loss without gain” (LwG) mutation, we ran
each genotype of the lineage in a test CPU and recorded which tasks it did, then compared
those tasks to the tasks of its predecessor. The test CPU method has limits: namely, it
cannot guarantee detection of tasks that are performed stochastically. This limitation results
in false positives: mutations tagged as LwG that are not true losses, merely stochastically
performed tasks that did not show up in the test CPU.
We have discounted those LwG mutations where we are convinced they are false: namely,
those that both have a beneﬁcial eﬀect, which we do not expect from such a mutation,
and do not decrease the gestation time of the genotype, which might otherwise account
for the beneﬁcial eﬀect. The ﬁtness data and gestation time data used in the screen are
taken directly from the actual population record, and are therefore a faithful measure of
what happened during the actual evolution of the lineage. Technical limits prevent us from
outputting the tasks performed during evolution: for stochastic tasks, diﬀerent organisms of
the same genotype may perform diﬀerent tasks. We track lineages by genotype rather than
by organism, so we have no way of knowing which tasks the actual ancestors on the lineage
performed.

4.1.2

Deleterious “loss without gain” mutations do not detectably
aﬀect the evolution of EQU

To isolate the importance of this particular type of deleterious mutation, we evolved 200
additional populations in the unlimited-resource and limited-resource treatments, sterilizing
any mutant that experienced task loss without simultaneously gaining at least one other

35

task; see Covert III (2011) for a detailed description of this process in the general case of
deleterious mutations.
In the unlimited-resource environment, 55/200 populations evolved to perform EQU,
and in the limited-resource environment 160/200 populations did so. While both results
are less than the original treatments, we were not able to detect a signiﬁcant diﬀerence in
either (Fisher’s Exact test, two-tailed, 67/200 vs. 55/200 p = 0.2322, 167/200 vs. 160/200
p = 0.4375). While deleterious task loss without gain mutations do occur on the path to
evolving EQU, we were unable to ﬁnd evidence to establish them as an important factor in
the success of populations in the limited-resource environment.

4.2

Task Trade-Oﬀs: Loss Accompanied by Gain

A common assumption is that trait evolution is restricted or biased by ﬁtness trade-oﬀs.
(Roﬀ & Fairbairn, 2007) Trade-oﬀs have been demonstrated in the Avida system, where the
evolution of new complex tasks is often associated with the loss of simpler tasks (Lenski
et al., 2003). I show that limited resources facilitate evolutionary task trade-oﬀs, reducing
the cost of sacriﬁcing a common trait associated with a depleted resource in exchange for a
less common alternative, that is, decreasing the associated ﬁtness trade-oﬀ. This dynamic,
in turn, promotes the evolution of rare tasks, which in practice are typically more complex.
For example, assume that many organisms are performing the NAND task. Resource levels for a performed task tend to settle roughly between 30 and 100 units of resource available,
so also assume roughly 100 units of the NAND-linked resource exist in the environment. Any
particular organism can use .25% of the resource available (capped at 1.0 unit of resource),
so the performance of NAND would be rewarded by a metabolic rate multiplication of only

36

21×(100×.0025) , or 1.189. On the other end of the scale, the ﬁrst organism to perform EQU
would have access to an untapped resource pool with well over 400 units of resource, thus
hitting the hard consumption cap of 1.0 unit. This untapped pool results in a metabolic rate
increase of 25×(1.0) -fold; that is, by 32-fold – the same amount as in the unlimited-resource
environment.
As mentioned earlier, in the unlimited-resource environment, a task loss must be compensated for with new tasks of equal or greater combined value. Moreover, losing even a
single task always loses at least a ×2 multiplier to metabolic rate. The limited-resource
environment relaxes this restriction. The opportunity cost for losing a widely-used resource
is less – as calculated above, the organism may lose as low a metabolic rate multiplier as
×1.189. As a result of this, the community’s average multiplier is lower, so the relative
reward for accessing an unexploited resource is greater. This eﬀect allows many mutations
that involve task trade-oﬀs to be beneﬁcial in the limited-resource environment even though
they would have been detrimental in the unlimited-resource environment.

4.2.1

Trade-oﬀs at the evolution of EQU

We identiﬁed the 167 mutations that ﬁrst conferred EQU in the populations from the limitedresource environment, each of which was beneﬁcial when it occurred. However, had these
mutations happened in the unlimited-resource environment, 50 (29.94%) of them would have
been either neutral (9) or deleterious (41) (Figure 4.1) and thus would be unlikely to ﬁx the
ability to perform EQU. It seems likely that discouraging the ﬁxation of EQU in this manner
is responsible for at least some of the unlimited-resource environment’s poor performance in
evolving EQU.

37

# Populations
Shannon Diversity (Phenotypes, Viable Only)

Shannon Diversity vs. Time to Fixing EQU

14
B
12
10
8
6
4
2
0
1.4 A

Unlimited
ﬁtness gain (65)
ﬁtness loss (2)
Limited
ﬁtness gain (117)
no change (9)
ﬁtness loss (41)

C

1.2
1.0
0.8
0.6
0.4
0.2
0.0

0

2

4
6
Time (Updates ×104 )

8

10 0 5 10 15 20 25 30
# Populations

Figure 4.1: Shannon diversity vs. ﬁxation time of EQU. (A) Diversity of each population
that evolved EQU versus time at which EQU ﬁxed on the line of descent. Blue markers
represent populations evolved in the unlimited-resource environment, while red represent
those evolved in the (100-inﬂow) limited-resource environment. Symbols denote the ﬁtness
eﬀect of the EQU-granting mutation as evaluated in the unlimited-resource environment.
Beneﬁcial mutations are marked with +, neutral with , and detrimental with ; the legend
indicates how many mutations in each treatment fall into these categories. Histograms show
the distribution of (B) the ﬁxation of EQU over time and (C) the phenotypic diversity when
EQU ﬁxes. Appendix C explains how two of the unlimited-resource populations were able
to ﬁx detrimental EQU-granting mutations.

38

4.2.2

Eliminating the possibility of a neutral or detrimental tradeoﬀ for EQU

To test the hypothesis that some EQU-granting mutations do not ﬁx in the unlimitedresource environment because of task trade-oﬀ costs, we designed a variant reward system.
We raised the reward for performing EQU from a ×25 metabolic rate multiplier to ×221 .
The total multiplier from all other tasks remains ×220 . Therefore, with this reward system,
EQU-granting mutations are beneﬁcial even if they cause the loss of all other tasks.
Out of 200 populations using this reward structure in an unlimited-resource environment, 100 could perform EQU at the end of 100,000 updates, a signiﬁcant increase from
67 populations with the original reward structure (p = 0.006883, Fisher’s Exact test, twotailed, sequential Bonferroni correction), although still signiﬁcantly fewer than the 169 from
the limited-resource environment with the original reward structure (p = 1.507 × 10−11 ,
Fisher’s Exact test, two-tailed, sequential Bonferroni correction). We conclude that removing the possibility of a detrimental eﬀect when sacriﬁcing other tasks for EQU increases the
evolvability of EQU.

4.2.3

Eliminating the possibility of a neutral or detrimental tradeoﬀ when trading “up”

However, this variant reward structure removed the detrimental eﬀect of trade-oﬀs only for
EQU. Lenski et al. (2003) showed that EQU requires the lower-diﬃculty tasks as building
blocks, and data from Section 3.6 also indicated that the presence of other tasks increases the
probability of EQU evolving. We therefore extended our hypothesis: the limited-resource
environment is better suited to evolving EQU not just because it provides an incentive to
39

trade other tasks for EQU, but in fact because it provides an incentive for any trade-oﬀ of
a common task for a rare task.
We tested this extended hypothesis by designing another variant reward structure, called
EQU-81, in which any trade-oﬀ for a more complex task is always beneﬁcial (Table 4.1). The
values associated with the nine tasks are designed such that each increase in task complexity
(still approximated by the necessary number of NAND operations) confers a value V greater
than the sum of all previous values combined, that is, the ×2V multiplier is twice as large as
the product of all multipliers for less complex tasks. Using this reward structure, 136/200
populations in an unlimited-resource environment evolved EQU, signiﬁcantly more than the
67/200 populations with the original reward structure (p = 8.478 × 10−11 , Fisher’s Exact
test, two-tailed, sequential Bonferroni correction) and the 100/200 populations in the EQU21 reward structure (p = 2.877×10−03 , Fisher’s Exact test, two-tailed, sequential Bonferroni
correction). However, this number of populations is still signiﬁcantly fewer than the 167/200
in the limited-resource environment with the original reward structure (p = 3.015 × 10−3 ,
Fisher’s Exact test, two-tailed).
Function name Reward
×21
×21
×23
×23
×29
×29
×227
×227
×281

NOT
NAND
AND
ORN
OR
ANDN
NOR
XOR
EQU

Table 4.1: The EQU-81 reward structure. Any trade-oﬀ for a more complex task is guaranteed to be beneﬁcial. An organism that performs a task has its current execution rate
multiplied by the amount of the task’s reward; it can be rewarded for each task only once
per lifetime.
40

We conclude that providing an incentive to trade oﬀ simple tasks for more complex ones is
an important aspect of the limited-resource environment, responsible for some of its greater
propensity to evolve EQU. However, the trade-oﬀ cost mitigation cannot fully account for
the greater frequency of EQU evolution in limited-resource environments.

4.3

Conclusion

We conclude that both diversity and ease of trade-oﬀs for rare tasks play into the greater
evolution of complexity in limited-resource environments. A limited-resource environment –
as long as it is not so severely limited as to restrict the areas of the ﬁtness landscape that
may be explored – acts as a negative frequency-dependent pressure to maintain both of these
traits, signiﬁcantly increasing the probability of complex tasks evolving.

41

Chapter 5
Predicting the Outcome of
Competitions Between Communities
Thus far, I have focused on the evolution communities from the perspective of the origin
of new traits, with some examination of community diversity. In this chapter I will further
examine how these evolved communities stack up against each other, with a particular focus
on determining if certain community-level traits can predict the outcome of competitions.
Interaction among individuals is of vastly diﬀerent character in a system of many niches
than in a single niche, especially when some limiting factors (such as space or resources)
are shared across well-deﬁned niches, requiring that individuals compete both within niches
and across them. Thus, the communities formed by these interactions are true ecologies,
exhibiting properties on the community level as well as the individual level.
For the multiple niche environment I again turn to a limited resource setup. In this
chapter, I use the default limited-resource environment that ships with Avida. In contrast
to the limited-resource environment used in Chapters 3 and 4, the reward system in this
environment is additive, as explained in Section 2.3. Since metabolic rate does not increase
exponentially in such a system, Avidians may to receive a reward for up to 25 performances
of a particular task. The phenotype is therefore not equal to the binary task signature; for
these communities, I group genotypes into ecotypes with the clustering algorithm described

42

in Chapter 2.

5.1
5.1.1

Measures of Community-Level Properties
Measuring community success

I measure the success of a community A in an A vs. B competition as the fraction of individuals in the ﬁnal merged community (at the end of the competition) descended from member
genotypes of the original community A, the fractional community success. In a competition
between community A and community B, if community A ends up with the larger fraction
of descendant organisms in the ﬁnal community, community A has “dominated” community
B; if community B has no descendant organisms in the ﬁnal post-competition community,
community A has “excluded” community B.

5.1.2

Measuring community diversity

Just as community-level diversity proved to be valuable in the evolution of complex traits,
it is also a likely factor in community success in competition: after all, in many vs. one
competitions (i.e. invasion), more diverse communities have repeatedly proven more resistant
(Case, 1990; Kennedy et al., 2002; Tilman, 2004; Zavaleta & Hulvey, 2004; Maron & Marler,
2007).

5.1.3

Measuring community composition

Community composition describes the degree to which a community is composed of specialist
or generalist ecotypes. I use average ecotypic generality to measure community composition;

43

it averages how generalist the community’s member ecotypes are. I calculate this value as the
weighted average of the task diversity of the community’s member ecotypes. Each ecotype’s
task diversity is, in turn, the weighted average of the task diversity its member genotypes,
which is the Shannon entropy of their phenotypes (i.e. task performance count vectors; see
Section 2.5).
Low values indicate specialist genotypes, while high values indicate generalists. For
example, an organism that does NAND 90 times and performs other tasks has 0 entropy; on
the other hand, an organism that does all nine tasks but only 10 times each has an entropy
of 2.1972 (in nats).

5.1.4

Measuring community breadth

Community breadth describes how well the community as a whole ﬁlls its available niches.
Task coverage is the Shannon entropy of the community-level task performance vector, that
is, the sum of the phenotypes of all member organisms. Lower numbers indicate that the
community as a whole covers a small portion of the available niches, while higher numbers
indicate communities that cover most niches evenly. I would expect low community breadth
to be a poor strategy in competition, since it means this community is underexploiting some
of the available niches. High task coverage, on the other hand, tells us nothing about the
community’s composition; it might be the result of a complete community of specialists (with
low average ecotypic generality) or a community consisting of a single complete generalist
(with high average ecotypic generality).

44

5.1.5

Measuring community resource use eﬃciency

I measure community resource use eﬃciency via diﬀerential total-resource use. When comparing communities A and B, I calculate this measure as the diﬀerence between the total
amounts of resource remaining in each community when resources have been drawn down.
In this case, a higher total amount of resource remaining in isolation indicates a community
that is not eﬃciently using its available resources. The metric thus ranges from negative values when community A is more eﬃcient, and so has a lower total resource level, to positive
values when community B is more eﬃcient.
Task coverage, the measure of community breadth, may also be thought of as a measure
of resource use eﬃciency: high task coverage means the community is using all resources.
However, It is not a good measure for directly comparing the resource use eﬃciency of two
communities: it measures only the degree to which a community uses all resources equally,
not the level to which it is capable of drawing down those resources.
In this chapter, I consider only diﬀerential total-resource use as a measure of resource
use (in)eﬃciency. These values can sometimes be very large if only one of the competitors
is capable of doing a particular task.

5.2

Generation of Test Communities

In order to study how community structure is tied to predictability of community success,
I have performed community competition experiments with four types of communities: one
evolved, and three constructed.
1. Evolved communities are seeded with a single ancestor organism, capable of selfreplication but not of performing any Boolean tasks. I ran 100 communities for 100,000
45

updates in the additive limited-resources environment, then for a further 100,000 updates with mutations turned oﬀ so that the community structure could settle out.
These 100 evolved communities also provide the genotype pool from which the other
communities are assembled.
2. True random communities are assembled by randomly assigning the genotypes of the
original 100 evolved communities to 100 new communities (without replacement). The
only constraint is that each assembled community must have at least 3 genotypes. To
satisfy this constraint without introducing any other form of bias, I simply re-ran the
assembly algorithm until this property held true. (In practice, this took at most 100
tries.)
3. Ecologically complete communities are generated one-by-one from the genotypes of
the original evolved communities such that each new community can perform all tasks.
When assembling each of the 100 ecologically complete communities, the nine tasks are
ordered randomly. For each task, a genotype that can perform that task is randomly
selected (without replacement) from the pool of potential genotypes. This process
continues for each unincluded task until the assembled community contains at least
one genotype that performs each task. Once the core of each community has been
assembled, any genotypes remaining in the pool are randomly assigned across the
assembled communities. In practice, this method was always able to assemble the full
100 communities because the source pool had substantial redundancy.
4. Specialist-preferred ecologically complete communities use the previous assembly technique, but specialist genotypes are preferred during community construction. In practice, this meant that communities that were assembled ﬁrst tended to be dispropor46

tionately composed of specialists.

5.3

Community Competition

To compete two communities, I create a Avida world with 7200 cells and place both communities into it (original communities contain 3600 organisms each). All the organisms of each
community are marked with an inherited tag, allowing me to distinguish their descendants
at the end of the competition.
I start the experiment such that the two populations are kept separate, with separate
resource pools. The environments are the same as those in which the communities evolved.
The communities replicate in isolation for 10,000 updates, allowing them to draw down
resources to their equilibrium levels. At the end of those 10,000 updates, the two communities
are joined, and the two sets of resource pools are combined into one global set of pools. The
organisms are now free to replicate into any cell in the combined world, and the communities
interact through this replication and through drawing on the combined resource pools.

5.3.1

Self-competition

In order to test this community competition method, I competed each of the 100 base
communities against a copy of itself. Each copy of the community was provided a unique
tag for tracking purposes. As community A and community B had identical composition, I
expected to see community A dominate 50% of the time. I observed community A dominate
in 52 competitions, and community B in 48 competitions. At a p-value of > 0.8875 (Fisher’s
Exact Test, two-tailed), this ratio does not appear substantially diﬀerent than the expected
50/50 split. This result implies that the competition method is likely to be mechanically

47

sound, not promoting either community simply for its identity as the A (or B) community.

5.3.2

Competition results are consistent

How well does the result of a single competition between two communities predict the results of replicate competitions between those two communities? That is, how well does
the dominance result of a single competition predict the dominant community in replicate
competitions?
I created 50 independent competitor pairs from the 100 evolved base communities. I
replicated these 50 competitions 10 times each. I observed how many pairs had inconclusive
dominance – that is, not all replicates dominated by the same original community. If a single
competition were always a perfect predictor of all competitions between a pair, I expected to
see 0/50 instances of inconclusive dominance. For those competitor pairs with inconclusive
dominance, I examined to what degree the competitions were inconclusive. I found that of
50 the competitor pairs, only one had exhibited inconclusive dominance, indicating that the
dominance result of a single competition between a pair is highly predictive of the dominance
outcomes of all competitions between that pair.
Competitions between assembled communities showed similar consistency. In ﬁfty independent competitions between the randomly assembled communities (again with 10 replicates per competition), only 3/50 displayed inconclusive dominance. In ﬁfty competitions
between the ecologically complete assembled communities, 1/50 were inconsistent; 2/50 competitions between independent pairs of specialist-weighted ecologically complete communities
were inconsistent.
I examined the inconclusive competitions more closely. For the replicates of the competitor pair of evolved communities with inconclusive dominance, all ten ﬁnal communities
48

consisted of A’s descendants and B’s descendants in nearly equal proportions: the replicate
with the greatest imbalance had 55.14% of the ﬁnal population descended from B, and the
one with the least imbalance had 50.85% descended from A.
Five of 50 competitions between independent pairs of randomly assembled communities
were inconclusive. In two of them I observed the same roughly 50% dominance as described
above. In the remaining three, success seems to have been dependent on some tipping point:
either one competitor community dominates by some percentage (in these cases, community A), or the other community (B) completely excludes it. If indeed dominance in these
competitions is dependent on some stochastic event, it seems possible that extending the
competition period would result in the exclusionary community eventually winning every
competition. If the stochastic event is historically contingent or capable of aﬀecting only
the early organization of the combined community, however, these competitions are unpredictable and simply have two stable states.
Seven of 50 competitions between independent pairs of ecologically complete assembled
communities displayed inconclusive dominance. Only one was the easily explicable type, with
every dominance result near 50%. Three displayed the “tipping point” pattern described
above, appearing to be dependent on some stochastic event to determine the winner. Of the
remaining cases, one displayed consistent results of roughly 75% dominance by community
B in 9/10 replicates; in the odd one out, community A dominated by 50.16%. It seems
possible that this could be caused by an unlikely loss of a key member of community B. In
the ﬁnal two cases, the winning community dominated by at least 65%, but both communities
were winners over the ten replicate populations. Without detailed examination of the data
from these runs, it is diﬃcult to say what could cause such an eﬀect: perhaps these two
competitions are truly unpredictable.
49

Nine of 50 competitions between independent pairs of specialist-preferred ecologically
complete assembled communities were inconclusive. Two displayed roughly 50% dominance
in each competition; the remaining seven appear to be of the type in which the outcome is
determined stochastically. Three of these stochastically-determined competitions are truly
unpredictable: in each replicate, the dominating community completely excluded the other
community. There is thus no chance that extending these three runs would have led to a
conclusive result.
For competitions between evolved communities I observed no qualitatively inconclusive
results. It is therefore reasonable to do only one competition per competitor pair when
competing evolved communities, and let that result approximate the results of all competitions for that pair. This is especially true when using a non-binary measure of dominance
such as fractional community success, described above, because it reveals these qualitatively
consistent results.

5.3.3

Evolved communities have a nearly-linear dominance hierarchy

In order to make predictions about dominance, factors must exist that determine which
community will win in a competition. As long as these factors are not cyclic in nature, there
must be a dominance hierarchy. From 20 additional evolved base communities I constructed
all possible non-self competitions and then counted the number of intransitive dominance
triads, e.g., community A dominates community B, and community B dominates community C, yet community C dominates community A. Each competition consisted of only one
replicate; as observed above, for competitions of evolved communities, one replicate provides

50

a reliable approximation of the outcome.
I observed 6 intransitive triads out of an expected 285 (if dominances are assigned randomly) and a maximum possible 330 (Kendall & Smith, 1940). This result shows a strongly
linear dominance hierarchy (p

0.001, Appleby K-test (Appleby, 1983) with χ2 = 165.75,

df = 26), which indicates that a predictor of community dominance exists.
I did not test for dominance hierarchies among the assembled communities; there are no
such guarantees that they can be predicted.

5.4

Correlations with Community Success

In this section, I examine how the candidate predictors correlate with fractional community
success in competitions between evolved communities. If I have chosen reasonable predictors,
I expect to see a strong correlation with at least one of them, since the presence of a nearly
linear dominance hierarchy indicates that a predictor should exist. I examine the same set of
candidate predictors for the assembled communities, asking: if we know which community
measures correlate with success for evolved communities, what does this tell us about their
correlations with success for assembled communities?
To test for correlations between our hypothesized predictors and fractional community
success, I used the non-parametric Spearman’s rank correlation ρ (Savicky, 2009; R Core
Team, 2012). Because this is a non-parametric test relying on rank, and in some cases the
data contains tied values, the p-values obtained and given below are not guaranteed to be
exact. However, the ρ values are high enough that I am conﬁdent of signiﬁcance at the
p = 0.05 level. All p-values for correlations within the same set of competitions have been
corrected via the sequential Bonferroni method with n = 4.

51

5.4.1

Community diversity

In competitions between evolved communities, I found signiﬁcant correlation between a community’s fractional success and the diﬀerence between its ecotypic diversity and that of its
competitor (p < 0.0006, Spearman’s rank correlation test, ρ = 0.5129, S = 10143.86). Figure 5.1 graphs this relationship for each of the 50 competitions between evolved communities.

0.8
0.6
0.4
0.2
0.0

Fractional Success of Community A

1.0

Community Diversity vs. Fractional Success

-1.0

-0.5

0.0

0.5

1.0

Diﬀerential Ecotypic Entropy
Figure 5.1: Community diversity vs. fractional community success for evolved communities
in competition. Spearman’s ρ = 0.5129.

In the case of competitions between assembled communities, the data could not determine
whether there was signiﬁcant correlation for any of the three types. For randomly assembled
communities, p > 0.08, ρ = 0.2452, and S=15719.48. For ecologically complete communities,
52

p > 0.9, ρ = 0.001468, and S = 20794.43; for specialist-preferred assembled communities,
p > 0.3, ρ = −0.1232, and S = 18248.76.
Two of the three assembled communities are explicitly constructed to be ecologically
complete, and a stable ecologically complete community is likely to have higher diversity than
a stable but incomplete community. I therefore I suspected that it might be competitions
involving non-complete evolved communities driving the correlation between success and
diversity for evolved communities. I created a subset of the ﬁfty competitions for which both
competitor communities were ecologically complete (n = 40) and examined the correlation
between success and community diversity on just this subset. It remained signiﬁcant (p =
0.000274, ρ = .553, S = 4768.342, uncorrected). I conclude that in the competitions between
evolved communities, diversity is truly correlated with community success; the relationship
is not being driven by outlier non-ecologically-complete competitors.

5.4.2

Average ecotypic generality

Fractional community success in competition for evolved communities was also signiﬁcantly
correlated with the average ecotypic generality of the community (p < 0.0006, Spearman’s
rank correlation test, ρ = −0.5207, S = 31669.38); with a negative ρ, this indicates a positive
correlation between fractional success and average ecotypic specialization (Figure 5.2).
I also found a signiﬁcant relationship in the competitions between the randomly assembled
communities (p < 0.003, ρ = 0.4471, S = 11513.54), but with a positive ρ, indicating
that in these competitions those communities with generalist member ecotypes were more
likely to dominate (Figure 5.3). Random communities with generalists are more likely to be
ecologically complete, so this result is not inconsistent.
However, all ecologically complete assembled communities are guaranteed to have been
53

0.8
0.6
0.4
0.2
0.0

Fractional Success of Community A

1.0

Community Generality vs. Fractional Success

-1.0

-0.5

0.0

0.5

1.0

1.5

2.0

Diﬀerential Community Generality
Figure 5.2: Community generality vs. fractional community success for evolved communities
in competition. Spearman’s ρ = −0.5207.

so before their 50,000-update ecological settling-out period, but ecotypic generality is also
positively correlated with success for these competitions (p < 0.035, ρ = 0.4178, S =
31669.38; see Figure 5.4). I conclude that there may be an eﬀect of community origin
(i.e. evolved as a community vs. assembled from a pool of genotypes) on how generality
plays into success in community competition.
In competitions between the specialist-weighted assembled communities, there was insuﬃcient evidence to reject the null hypothesis that average ecotypic generality was not
correlated with community success (p > 0.1 after correction, ρ = 0.2917, S = 13911.04).

54

0.8
0.6
0.4
0.2
0.0

Fractional Success of Community A

1.0

Community Generality vs. Fractional Success

-1.5

-1.0

-0.5

0.0

0.5

1.0

Diﬀerential Community Generality
Figure 5.3: Community generality vs. fractional community success for randomly assembled
communities in competition. Spearman’s ρ = 0.4471.

5.4.3

Community breadth

In competitions between randomly assembled communities, community breadth measured
as task coverage was signiﬁcantly correlated with community success (p < 0.003, Spearman’s
rank correlation test, ρ = 0.4558, S = 11332.49; see Figure 5.5). I also found correlation
between success and task coverage for ecologically complete assembled communities (p <
0.035, ρ = 0.3419, S = 13705.12; see Figure 5.6) and for specialist-preferred ecologically
complete assembled communities (p < 0.05, ρ = 0.3422, S = 13698.7; see Figure 5.7).
This correlation is intuitive for randomly assembled communities; since there is no im55

0.8
0.6
0.4
0.2
0.0

Fractional Success of Community A

1.0

Community Generality vs. Fractional Success

-1.5

-1.0

-0.5

0.0

0.5

1.0

1.5

Diﬀerential Community Generality
Figure 5.4: Community generality vs. fractional community success for ecologically complete
assembled communities in competition. Spearman’s ρ = 0.4178.

plicit or explicit requirement for them to be ecologically complete, much less cover all tasks
equally, the lucky communities that ended up with good task coverage are likely to be more
successful. In the case of both types of ecologically complete assembled communities, there
is still no requirement that the assembled community cover all tasks equally well – as long as
at least one member can perform each task at least one time, the community is considered
complete. This correlation thus implies that a balanced occupation of niches is valuable in
competition even when both communities nominally occupy all available niches.
There was not suﬃcient evidence for correlation between success and task coverage in

56

0.8
0.6
0.4
0.2
0.0

Fractional Success of Community A

1.0

Community Breadth vs. Fractional Success

-1.5

-1.0

-0.5

0.0

0.5

1.0

1.5

Diﬀerential Task Coverage
Figure 5.5: Community breadth vs. fractional community success for randomly assembled
communities in competition. Spearman’s ρ = 0.4558.

competitions between evolved communities (p > 0.07, ρ = −0.2538, S = 26110.99); evolution
in a limited-resource environment provides an implicit negative frequency-dependent pressure
to occupy all open niches as equally as possible. Each community therefore covers all the
tasks its members are capable of fairly evenly, and the diﬀerence between communities that
are not ecologically complete and those that are, while part of this metric, is not enough to
make task coverage a signiﬁcant correlate in this case.

57

0.8
0.6
0.4
0.2
0.0

Fractional Success of Community A

1.0

Community Breadth vs. Fractional Success

-1.5

-1.0

-0.5

0.0

0.5

1.0

1.5

Diﬀerential Task Coverage
Figure 5.6: Community breadth vs. fractional community success for ecologically complete
assembled communities in competition. Spearman’s ρ = 0.3419.

5.4.4

Resource use eﬃciency

Diﬀerential total unused resource showed a signiﬁcant correlation with fractional community
success in competitions between the evolved communities (p < .003, ρ = −0.4525, S =
30250.19), although the correlation appears to be driven by outliers (Figure 5.8).
I therefore also evaluated the correlation between total unused resource and community
success for the subset of competitions in which both evolved communities were ecologically
complete, so that no untapped resources aﬀected the diﬀerence in total unused resource. For
this subset, ρ is lower, and the p-value, 0.0198, is higher. Nevertheless, p is still signiﬁcant,
58

0.8
0.6
0.4
0.2
0.0

Fractional Success of Community A

1.0

Community Breadth vs. Fractional Success

-1.5

-1.0

-0.5

0.0

0.5

1.0

1.5

Diﬀerential Task Coverage
Figure 5.7: Community breadth vs. fractional community success for specialist-preferred
ecologically complete assembled communities in competition. Spearman’s ρ = 0.3422.

and would remain so if this test were incorporated into the sequential Bonferroni correction,
indicating that the extreme outlier unused resource diﬀerences are not driving the entirety
of this correlation.
The correlation was also signiﬁcant for the randomly assembled communities (p < 7.3 ×
10−6 , ρ = −0.6293, S = 33930.46), between the ecologically complete assembled communities (p < 0.001042, ρ = −0.5003, S = 31242.75), and between the specialist-preferred
assembled communities (p < 0.02, ρ = −0.3949, S = 29048.04): resource use eﬃciency is
correlated with community success in competition no matter what the origin of the commu-

59

0.8
0.6
0.4
0.2
0.0

Fractional Success of Community A

1.0

Community Eﬃciency vs. Fractional Success

-20000

-10000

0

10000

20000

Diﬀerential Total Unused Resource
Figure 5.8: Community resource use eﬃciency vs. fractional community success for evolved
communities in competition. Spearman’s ρ = −0.4525.

nity (Figures 5.9, 5.10, and 5.11).
As the measure of diﬀerential resource use eﬃciency actually has negative values when the
dominant community draws resource levels lower than its competitor, i.e. is more eﬃcient, it
is not surprising to see a negative ρ for these correlations: it indicates a positive correlation
between fractional community success and the ability to draw resources down to a lower
level than the competitor community.

60

0.8
0.6
0.4
0.2
0.0

Fractional Success of Community A

1.0

Community Eﬃciency vs. Fractional Success

-60000

-40000

-20000

0

20000

40000

Diﬀerential Total Unused Resource
Figure 5.9: Community resource use eﬃciency vs. fractional community success for randomly
assembled communities in competition. Spearman’s ρ = −0.6293.

5.5

Linear Models

Of the predictors I have considered, it seems that diversity, generality, and resource use
eﬃciency may play into the success of evolved communities in competition. I therefore
constructed all one- and two-factor linear models combining these predictors (including twofactor models with interaction). I additionally constructed one containing all predictors
and one with all predictors all interactions. I calculated the AIC and BIC values for each
model (Bolker & Team, 2012; R Core Team, 2012); Table 5.1 shows the ﬁve models with
AIC diﬀerences less than three, as well as the weights assigned to each model. I conclude
61

0.8
0.6
0.4
0.2
0.0

Fractional Success of Community A

1.0

Community Eﬃciency vs. Fractional Success

-40000

-20000

0

20000

40000

60000

Diﬀerential Total Unused Resource
Figure 5.10: Community resource use eﬃciency vs. fractional community success for ecologically complete assembled communities in competition. Spearman’s ρ = −0.5003.

that diﬀerential resource eﬃciency and community generality together can predict fractional
community success (that is, not just predict which community will win, but by how much).
The adjusted R2 value for this model is 0.3425, indicating that the model predicts 34.25%
of the variation in fractional community success. Community diversity is seems largely
unhelpful for prediction unless community generality is not known.
Generality, task coverage, and eﬃciency each showed correlation with fractional success
for the competitions between randomly assembled communities and between ecologically
complete assembled communities. I therefore constructed the same families of linear models

62

0.8
0.6
0.4
0.2
0.0

Fractional Success of Community A

1.0

Community Eﬃciency vs. Fractional Success

-40000

-20000

0

20000

40000

Diﬀerential Total Unused Resource
Figure 5.11: Community resource use eﬃciency vs. fractional community success for
specialist-preferred ecologically complete assembled communities in competition. Spearman’s ρ = −0.3949.

Model

AIC weight BIC weight

fcs = 0.558 + −0.303g + −0.0000260e
f cs = 0.566 + 0.1591d + -0.212g + −0.0000235e
f cs = 0.556 + −0.306g + −0.0000257e + -0.00000151ge
f cs = 0.573 + −0.384d + −0.0000242e + -0.0000187de
f cs = 0.572 + 0.345d + −0.0000189e

0.355
0.289
0.133
0.127
0.0972

0.536
0.167
0.0770
0.0735
0.157

Table 5.1: Linear models predicting the fractional community success (f cs) of evolved communities based on diﬀerential community diversity (d), generality (g), and resource use
(in)eﬃciency (e). The ﬁve models with AIC diﬀerences below three are displayed; AIC and
BIC weights (considering only these ﬁve models) are displayed. Factors that are insigniﬁcant
at p = .05 inside each model are italicized.

63

for each: a null model, three one-factor models, three two-factor models without interaction
and three with interaction, and two full models, with and without all two- and three-way
interactions. For each community type, I examined the AIC and BIC values of all the models;
I present those models with AIC diﬀerence of less than three. In the case that the preferred
model as judged by BIC was not one of these, I present that as well.
Table 5.2 presents four models of fractional community success in competition for randomly evolved communities. Resource use eﬃciency is the only factor in the favored model,
and the only signiﬁcant factor in the models with the next three highest AIC weights. I conclude that resource use eﬃciency is the most important predictor of fractional community
success for these communities. The model including only the eﬃciency factor accounts for
38.66% of the variance (adjusted R2 ) in fractional community success.
Model

AIC weight BIC weight

f cs = 0.422 + −0.00000710e
f cs = 0.419 + -0.0528c + −0.00000822e
f cs = 0.422 + -0.00252g + −0.00000715e
f cs = 0.393 + 0.0518g + −0.00000665e + -0.00000228ge

0.472
0.216
0.174
0.139

0.735
0.130
0.104
0.0319

Table 5.2: Linear models predicting the fractional community success (f cs) of randomly
assembled communities based on diﬀerential community generality (g), task coverage (c),
and resource use (in)eﬃciency (e). These are the four models with AIC diﬀerence less than
three; AIC and BIC weights (when considering only these models) are displayed. Factors
that are not signiﬁcant at p = .05 inside each model are italicized.

For ecologically complete assembled communities in competition, only two models of
fractional community success had AIC weight less than three: these were the full model
and the full model with all interactions, presented in Table 5.3. However, the highest
weight model in the BIC table was the one-factor model including resource use eﬃciency,
f cs = 0.425 + −0.000008203e; this is also the only signiﬁcant factor in the full model with
interactions. This one-factor model accounts for 27.5% of the variation in fractional com64

munity success for ecologically complete communities, while the full model incorporating all
three factors explains 34.36%. I conclude that community generality, breadth, and eﬃciency
are all important in predicting success for the ecologically complete assembled communities.
Model

AIC weight BIC weight

f cs = 0.421 + 0.318g + −0.374c + −0.00000940e
f cs = 0.401 + 0.212g + -0.298c + −0.0000115e
+0.135gc + -0.00000809ge + 0.0000109ce + 0.00000183gce

0.794
0.206

0.994
0.00463

Table 5.3: Linear models predicting the fractional community success (f cs) of ecologically
complete assembled communities based on diﬀerential community generality (g), task coverage (c), and resource use (in)eﬃciency (e). These are the two models with AIC diﬀerence
less than three; AIC and BIC weights (when considering only these models) are displayed.
Factors that are not signiﬁcant at p = .05 inside each model are italicized.

Only community breadth (i.e., task coverage) and community resource use eﬃciency
were signiﬁcantly correlated with fractional community success for competitions between
specialist-preferred ecologically complete communities. I therefore constructed only a null
linear model, the two one-factor models, and full two-factor models with and without interaction. All models except the null model had AIC distance less than three; I present them
in Table 5.4. For such communities, both task coverage (breadth) and resource use eﬃciency
appear to be important predictors. However, the two single-factor linear models each predict
only about 12% of the variance in success on their own, and the models incorporating both
factors do no better, leading me to conclude that for these communities breadth and resource
use eﬃciency are roughly equivalent. Note that of all the community types, competitions
between the specialist-preferred communities were least consistent (above), so it is not surprising that the results of these competitions cannot be predicted as well as those between
other community types.

65

Model

AIC weight BIC weight

f cs = 0.553 + 0.208c
f cs = 0.449 + −0.00000613e
f cs = 0.553 + 0.136c + -0.00000315e
f cs = 0.532 + 0.134c + -0.00000341e + -0.00000212ce

0.508
0.349
0.118
0.025

0.381
0.262
0.231
0.127

Table 5.4: Linear models predicting the fractional community success (f cs) of specialistpreferred ecologically complete assembled communities based on diﬀerential community task
coverage (c) and resource use (in)eﬃciency (e). These are the two models with AIC diﬀerence
less than three; AIC and BIC weights (when considering only these models) are displayed.
Factors that are not signiﬁcant at p = .05 inside each model are italicized.

5.6

Conclusion

The fractional success of an evolved community in competition with another evolved community is both strongly consistent and predictable by community-level measures: in this
case, the diﬀerential community generality and resource use eﬃciency between the two can
predict up to 34.25% of the variation in fractional success. In fact, diﬀerential resource use
eﬃciency is a valuable predictor in competitions between any two communities of the same
type, both evolved and assembled.
Community generality is a predictor for evolved communities and two types of assembled
communities, but biases the competition in diﬀerent ways between the evolved and assembled
communities. In linear models predicting the success of evolved communities in competition,
the coeﬃcient of generality is negative. However, in most linear models predicting the
success of assembled communities in competition, the coeﬃcient of generality is positive;
when it is negative, the generality term is insigniﬁcant in the model. I observed the same
trend with the sign of Speaman’s ρ when calculating correlation between community success
and generality: this provides strong evidence that diﬀerential community generality biases
competitions between evolved communities diﬀerently than it does those between assembled

66

communities, and suggests that there is something fundamentally diﬀerent in their structures.
In nature, we often do not have access to detailed data about every member of a community, but we can estimate community-level measures such as the ones laid forth in this
chapter. The predictors I have examined here are in common use for characterizing the competition of two populations or the success of an invasive species. I have shown here that they
are also informative measures for competitions between entire communities and can predict
what fraction of the resulting post-competition community will be descended from each competitor. In this chapter I have shown only correlation relationships; while community-level
properties are diﬃcult to manipulate cleanly even in computational systems like Avida, I
have shown that experiments which might be able to tease out a causal relationship between
any one of these properties and fractional community success are worth facing that diﬃculty.

67

Chapter 6
Conclusions and Future Work

6.1

Conclusions

I have examined the diversity, task reward structure, and competition of digital communities,
the result of adaptive radiation due to negative frequency-dependent selection in limitedresource environments. They are more diverse than those evolving in environments where
the only resource is space to replicate, and evolve complex features more readily. Tying the
rewards of complex and building-block tasks to resource levels additionally reduces the cost
of trading simple tasks for complex ones, and I have shown that this has a strong causal
relationship with the evolution of complex features.
Additionally, I have shown that the results of competitions between communities, especially evolved communities, are consistent and predictable. These results establish a
strong basis for future work in this area that will further explore the relationships between
community-level and ecotype-level characterizations of communities and how that relates to
their ability to survive in competition. Moreover, investigations at the ecotype level will
allow us to understand how two communities combine when the outcome of competition is
not a simple exclusion of one community by another.
Systems of limited resources have shown signiﬁcant promise in evolutionary computation
(Goings & Ofria, 2009; Goings, 2010; Goldsby, 2011), with applications in multi-objective
optimization and in problem decomposition. A building-block task or “hint” sub-solution can
68

be easily rewarded simply by incorporating a resource associated with it. In systems where
solutions control their own reproduction, limited resources allow the simple construction
of complex landscapes, with an implicit ﬁtness function that changes smoothly to reﬂect
the current solution set. Some of the extensions to this work, discussed below, focus on
investigating how ﬂexible limited resource systems can be in terms of building-block tasks
and even dynamic addition or removal of rewards.
The role of diversity in encouraging the evolution of optimal solutions to complex problems has long been acknowledged in the ﬁeld of evolutionary computation (Sareni & Kr¨henb¨hl,
a
u
1998; Friedrich et al., 2009), and methods of diversity preservation such as ﬁtness sharing
(Goldberg & Richardson, 1987) employ negative frequency- or density-dependent mechanisms. On the other hand, I have found no literature addressing how diversity or negative
frequency-dependent selection drive the evolution of biological complex features. This comparative lack suggests that other evolutionary computation strategies for solving complex
problems may be worth further investigation as pressures driving the evolution of complex
traits in nature, returning inspiration to the ﬁeld of evolutionary biology from which they
arose. While evolutionary computation has often borrowed ideas from evolutionary biology,
its unique focus on using evolution to solve complex real-world problems has led to unique
and valuable insights into the evolution of complex features.

6.2
6.2.1

Future Work
Building blocks: features of lesser complexity

In Chapter 3, I found some evidence that the sensitivity of the evolution of a Boolean task
to resource inﬂow levels increases as the complexity of the task increases. In Chapter 4,
69

I showed that a reward system that eliminates detrimental trade-oﬀs of all tasks for more
complex tasks results in more evolution of EQU than simply eliminating the detrimental
trade-oﬀ for EQU itself. In the study of the evolution of complex tasks, it seems na¨ to
ıve
neglect the explicit study of building-block tasks. At what level of intermediate complexity
do these tasks begin to respond to the same pressures that drive the evolution of those we
have labeled “complex”? Do entirely diﬀerent pressures drive the evolution of low-complexity
tasks? If so, how do those pressures aﬀect the evolution of high-complexity tasks?
In Avida and in those evolutionary computation systems in which they can be identiﬁed,
we provide the building block tasks, often at several intermediate levels of complexity. How
does the ﬁnal evolution of a complex feature, or the speed of the evolution of a complex problem, react to diﬀerent collections of building blocks: for example, only the lowest-complexity
tasks, or only tasks just below the target in complexity? In evolutionary computation, we
often seek the evolution of multiple complex features which may have (or seem to have)
disjoint building blocks. How does the presence of lower complexity tasks unrelated to the
target task aﬀect the evolution of the target task? Can building blocks be included and
excluded dynamically? In Avida we could implement this by explicitly setting a task’s value
to 0 on the ﬂy, or by stopping inﬂow to a linked resource pool.

6.2.2

Competitions between evolved and assembled communities

In Chapter 5, I have only examined competitions between communities constructed with
the same implicit or explicit criteria. Are competition results still predictable, either by
community-level or ecotype-level descriptors, when the competitor communities diﬀer in construction criteria? This test may be of particular interest in competitions between evolved
and assembled communities. Will evolved communities be inherently better because the in70

dividuals evolved together? They certainly behave diﬀerently than assembled communities:
community generality seems to play a diﬀerent role in each. Within-type, evolved communities composed of specialists do well, whereas in assembled communities, those made up of
generalists have an advantage. What is the role of community generality / specialization
when the two types compete? Second, the interactions between evolved and assembled communities have application to several ﬁelds: not only to ecology, but also to computer security
where artiﬁcial immune systems face viruses and other malware.

6.2.3

Ecotype-level community descriptors

In Chapter 5, I established that whole-community properties can be used to construct linear
models that account for around 35% of the variation in community success in competition.
Considering the high consistency of the results of these competitions, especially for evolved
communities, it seems likely that more of the variation is explainable. The next obvious
set of predictor candidates are those that deal with individual ecotypes in the community:
are some simply better than others, no matter the circumstance? If so, can we detect them
without testing them in competition? Perhaps they share a common set of tasks. Perhaps
they might be particularly conservative with task execution, not wasting instruction cycles
performing more tasks than they will be rewarded for.
Or are certain types of community organization more successful than others? Even in
this simple nine-resource environment — or ten-resource, since space is the fundamental
resource in Avida — it seems unlikely there is one ideal Platonic community, but are there
certain community compositions that perform better than others? Does evolution tend to
reliably produce such communities? Can we reliably construct them? Of particular interest
and application in the natural world, especially the management of invasive species, can we
71

reliably strengthen or weaken communities with artiﬁcial additions or removals?

72

APPENDICES

73

Appendix A
Instruction Set

A.1

No-ops

nop-A The no-operation instructions do nothing when executed. However, they can modify
the behavior of other instructions immediately preceeding them, or act as labels in
the genome. When modifying other instructions, nop-A stands for register AX or
the instruction pointer, nop-B stands for register BX or the read head, and nop-C
stands for register CX or the write head. Note that only one no-op can modify each
instruction.
nop-B
nop-C

A.2

Flow control

if-n-equ Compares register ?BX? with its complement; if they are not equal, the next
instruction (after any modifying no-op instruction) is executed. If they are equal, the
next instruction is skipped.
if-less Compares register ?BX? with its complement; if ?BX? is the lesser value, the next
instruction is executed. If not, the next instruction is skipped.

74

if-label Reads the no-op sequence following it, interpreting it as a label, and tests if
the complement label was the most recently copied sequence of instructions. If so,
it executes the instruction following the label; if not, it skips that instruction. (This
is commonly used by organisms in determining whether they are done producing an
oﬀspring copy.)
mov-head Jumps the ?IP? to the memory location of the ﬂow head.
jmp-head Reads the CX register and jumps the ?IP? forward in the organisms genome by
that amount.
get-head Copies the memory location of the ?IP? into the CX register.
set-flow Moves the ﬂow head to the memory location in the ?CX? register.

A.3

Single argument math

shift-r Shifts the bits in the ?BX? register right by one.
shift-l Shifts the bits in the ?BX? register left by one, shifting in a 0 and truncating any
bits beyond the 32-bit capacity of the register.
inc Increments the contents of the ?BX? register by one.
dec Decrements the contents of the ?BX? register by one.
push Places the contents of the ?BX? register at the top of the active stack, leaving the
?BX? register unchanged.
pop Removes the top element from the active stack and places it into the ?BX? register.
75

swap-stk Toggles which stack is active. Other instructions act only on the active stack.
swap Swaps the contents of the ?BX? register and its complement register.

A.4

Double argument math

add Adds the contents of the BX and CX registers, placing the result in the ?BX? register.
sub Subtracts the contents of the CX register from the contents of the BX register, placing
the result in the ?BX? register.
nand

A.5

Biological operations

h-copy Reads the organisms memory at the position of the read head and copies it to the
position of the write head. If copy mutations are turned on, a random (chosen from the
full set of instructions with equal possibility) instruction may be placed at the write
head instead.
h-alloc Allocates additional memory for the organism, up to the maximum it is allowed to
use for its oﬀspring.
h-divide Divides the oﬀspring from the parent. The parent keeps its memory up to the
location of the read head; the oﬀsprings memory is initialized to everything between
the read head and the write head.

76

A.6

I/O and Sensory operations

IO Outputs the contents of the ?BX? register, checking for any tasks that have been performed, then places a new input into the same ?BX?.
h-search Reads the no-op sequence following it, interpreting it as a label, and ﬁnds the
location of a complement label in the organisms memory. The ﬂow head is moved to
the location of the complement label, the BX register is set to the distance between
the instruction pointer and the complements location, and the CX register is set to
the size of the label. If there is not a no-op sequence, the ﬂow head is moved to the
instruction following h-search, and the BX and CX registers are set to 0.

A.7

Deﬁnitions

complement The complement of nop-A is nop-B, of nop-B is nop-C, and of nop-C is nop-A.
?BX? If not followed by no-op, this instruction acts on register BX by default. If followed by
a no-op, nop-A indicates register AX, nop-B indicates register BX, and nop-C indicates
register CX.
?IP? If not followed by a no-op, this instruction acts on the Instruction Pointer by default.
If followed by a no-op, nop-A directs the instruction to act on the instruction pointer,
nop-B indicates the read head, and nop-C indicates the write head.

77

Appendix B
Boolean Logic
All Boolean logic functions can be written in terms of the Boolean NAND function. This
Appendix presents the Boolean logic functions used in the chapters above, their logical
formulae, and their English meaning. Truth tables and circuit diagrams are also included:
the circuit diagrams show both the function’s representation using a full range of logic gates
and implemented with only NAND logic gates.
A NAND B is shorthand for ¬(A ∧ B), a logical statement which reads “It is not true
that both A and B are true.” That is, in plain English, if A NAND B is true, either A is
not true, or B is not true, or both are not true. Table B.1 gives the truth table for NAND
when A and B are each one bit; that is, either 0 (false) or 1 (true). Figure B.1 shows a
representation of a NAND gate as it appears in a circuit diagram.
A
0
0
1
1

B
0
1
0
1

A NAND B
1
1
1
0

Table B.1: Truth table of binary NAND. This function is false only when both A and B are
true.

A
B

¬(A ∧ B)

Figure B.1: The NAND logic gate in a circuit diagram.

78

NOT A is written as ¬A in logical notation, and reads “A is not true.” Table B.2 gives
the simple truth table for NOT.
A NOT A
0
1
0
1
Table B.2: Truth table of unary NOT. This function is false when the input A is true.
Implementing NOT in terms of NAND is simple once we observe that A ∧ A is equivalent
to A, and therefore ¬A is equivalent to ¬(A ∧ A). This expression simply represents a NAND
operation where the two inputs are the same.
A
A

¬(A ∧ A) = ¬A

Figure B.2: The NOT function implemented with NAND in a circuit diagram. Feeding the
input A into both inputs of a NAND gate produces ¬A.

A AND B is written as A ∧ B in logical notation, and reads “Both A and B are true.”
Table B.3 gives the truth table for AND, and Figure B.3 shows the symbol for an AND logic
gate and its implementation using two NAND gates.
A B
0 0
0 1
1 0
1 1

A AND B
0
0
0
1

Table B.3: Truth table of binary AND. This function is true only when both A and B are
true.

To implement AND in terms of NAND, observe that we know how to implement both
NAND and NOT in terms of NAND. We simply compute NAND and use another NAND to
negate the result, producing AND using two NAND operations.
79

A
B

¬(A ∧ B)

¬(¬(A ∧ B)) = (A ∧ B)

Figure B.3: The AND function implemented with NAND gates in a circuit diagram. AND
is implemented with two NAND gates.

A ORN B is shorthand for the logical statement A ∨ ¬B. “ORN” is short for “or not”,
and this statement reads “Either A is true or B is false.” Note that by switching the names
of A and B, ORN can also mean “Either B is true or A is false.” Table B.4 shows the truth
table for ORN: notice that only one of the “A is true” and “B is false” statements must hold
for the whole statement to be true.
A B
0 0
0 1
1 0
1 1

A ORN B
1
0
1
1

Table B.4: Truth table of binary ORN. This function is false only when A is false and B is
true.

Even though ORN is phrased in terms of “or” rather than “and”, it can be implemented
with NAND operations. First, apply De Morgan’s law: A ∨ ¬B is equivalent to ¬(¬A ∧ B).
Observe that this is simply the NAND function, but with one input negated. It takes one
NAND to implement negation, so the total number of NAND operations is two. The circuit
diagram of this implementation is given in Figure B.4.
A OR B, in logical notation, is A ∨ B. It reads “Either A is true, or B is true, or both
of them are true.” Table B.5 shows the truth table for OR.
In order to express OR in terms of NAND, we apply De Morgan’s law: A∨B is equivalent
to ¬(¬A ∧ ¬B), that is, the NAND function with both inputs negated. This three-NAND
implementation is presented in Figure B.5.
80

A
A

¬A
¬(¬A ∧ B) = A ∨ ¬B

B

Figure B.4: The ORN function implemented with NAND gates in a circuit diagram. We
apply De Morgan’s law to see that the computed result, ¬(¬A ∧ B), is equivalent to A ∨ ¬B,
the ORN function.
A B
0 0
0 1
1 0
1 1

A OR B
0
1
1
1

Table B.5: Truth table of binary OR. This function is false only when both A and B are
false.
A ANDN B is shorthand for the logical expression A ∧ ¬B, which reads “A is true and
B is false.” The truth table for ANDN is shown in Table B.6.
A
0
0
1
1

B
0
1
0
1

A ANDN B
0
0
1
0

Table B.6: Truth table of binary ANDN. This function is true only when A is true and B is
false.
A
A

¬A
¬(¬A ∧ ¬B) = A ∨ B

B
B

¬B

Figure B.5: The OR function implemented with NAND gates in a circuit diagram. We apply
De Morgan’s law to see that the computed result, ¬(¬A ∧ ¬B), is equivalent to A ∨ B, the
OR function.

81

ANDN is simply the AND function with one of its inputs negated. We implement it in
terms of NAND by using one NAND for the input negation and two to implement AND, for
a total of three NAND operations. The circuit diagram laying out this implementation is
presented in Figure B.6.
A
B
B

¬(A ∧ ¬B)

¬(¬(A ∧ ¬B)) = A ∧ ¬B

¬B

Figure B.6: The ANDN function implemented with NAND gates in a circuit diagram. Implementing ANDN requires three NAND gates.

A NOR B is written ¬(A ∨ B) in logical notation, and reads “Neither A nor B is true,”
that is, “Both A and B are false.” The truth table for NOR is shown in Table B.7.
A B
0 0
0 1
1 0
1 1

A NOR B
1
0
0
0

Table B.7: Truth table of binary NOR. This function is true only when both A and B are
false.
To implement NOR in terms of NAND, we use the already existing implementations for
OR and NOT. Three NAND operations are used to implement OR, and a ﬁnal NAND is
used to negate the output, producing NOR by using four NAND operations.
A XOR B is written A ⊕ B in logical notation, which reads ”Either A is true or B is true,
but not both,” or equivalently, “A and B are diﬀerent.” Table B.8 gives the truth table for
XOR.
XOR can more more awkwardly phrased “Either A is true and B is false, or A is false
and B is true.” It is impossible for both statements to be true at the same time. If we
82

A
A

¬A
¬(¬A ∧ ¬B)

B
B

¬(¬(¬A ∧ ¬B)) = ¬(A ∨ B)

¬B

Figure B.7: The NOR function implemented with NAND gates in a circuit diagram. Implementing NOR requires four NAND gates; we apply De Morgan’s law to observe that the
output, which simpliﬁes to ¬A ∧ ¬B, is equivalent to ¬(A ∨ B), the NOR function.
A
0
0
1
1

B
0
1
0
1

A XOR B
0
1
1
0

Table B.8: Truth table of binary XOR. This function is true only when A and B are diﬀerent.
expand this statement with De Morgan’s law, a na¨ implementation requires ﬁve NAND
ıve
operations. However, XOR requires only four NAND operations, and this is easy to show
with a little symbolic manipulation:

A⊕B
(A ∧ ¬B) ∨ (¬A ∧ B)
(A ∧ ¬A) ∨ (A ∧ ¬B) ∨ (¬A ∧ B) ∨ (B ∧ ¬B)
(A ∧ (¬A ∨ ¬B)) ∨ ((¬A ∨ ¬B) ∧ B)
(A ∧ ¬(A ∧ B)) ∨ (¬(A ∧ B) ∧ B)
¬(¬(A ∧ ¬(A ∧ B)) ∧ ¬(¬(A ∧ B) ∧ B))
¬(¬(A ∧ X) ∧ ¬(X ∧ B))

83

Notice that ¬(A ∧ B), shortened to X, is repeated, but we will only have to calculate it
once. That uses one NAND operation. The equation in X is a simple NAND of two smaller
NAND operations: in total, four NAND operations.
¬(A ∧ ¬(A ∧ B))

A

¬(¬(A ∧ ¬(A ∧ B)) ∧ ¬(¬(A ∧ B) ∧ B))

¬A ∧ B
B

¬(¬(A ∧ B) ∧ B)

Figure B.8: The XOR function implemented with NAND gates in a circuit diagram. Implementing XOR requires four NAND gates; we simplify the result by following the derivation
backward to obtain (A ∧ ¬B) ∨ (¬A ∧ B), the XOR function.

A EQU B is read quite simply as “A is equal to B,” or “A and B are the same.” The
simple truth table is presented in Table B.9. This concept of equality or matching, while
quite simple for us, requires at least ﬁve NAND operations in a computer.
A B
0 0
0 1
1 0
1 1

A EQU B
1
0
0
1

Table B.9: Truth table of binary EQU, sometimes known as XNOR. This function is true
only when A and B are the same.

However, implementing EQU in terms of NAND is simple if we consider the functions
already implemented: EQU is simply the negation of XOR, requiring one extra NAND operation for a total of ﬁve NANDs. This implementation appears in Figure B.9. Equivalently,
EQU can be expressed as “Both A and B are true, or both A and B are false.” Because
we discount the possibility that A and B can be both true and false at the same time, it is
redundant to add “but not both.” Since EQU is now expressed in terms of both “and” and
84

“or”, we employ De Morgan’s law to express it only in terms of “and”: (A ∧ B) ∨ (¬A ∧ ¬B)
is equivalent to ¬(¬(A ∧ B) ∧ ¬(¬A ∧ ¬B)). This implementation also requires ﬁve NAND
operations.
¬(A ∧ ¬(A ∧ B))

A

(A ∧ ¬B) ∨ (¬A ∧ B)

¬A ∧ B
B

¬(¬(A ∧ B) ∧ B)
¬((A ∧ ¬B) ∨ (¬A ∧ B))

Figure B.9: The EQU or XNOR function implemented with NAND gates in a circuit diagram. Implementing EQU requires ﬁve NAND gates; the result simpliﬁes to
(¬A ∧ ¬B) ∨ (A ∧ B), the EQU function.

85

Appendix C
How two deleterious EQU-granting
mutations ﬁxed on the line of descent
As shown in Figure 4.1, two of the EQU-achieving populations evolved in the unlimitedresource environment of Chapters 3 and 4 ﬁxed a detrimental EQU-granting mutation on
the line of descent. (These are runs 1089 and 1191.) While unusual, this is not out of
the question. I examined these mutations in detail, and determined why these detrimental
mutations were able to ﬁx.
First, both mutations were nearly neutral. The ﬁtness ratio of the detrimental mutation
ﬁxed in population 1089 was 0.997674, and that of population 1191 was 0.999416. In both
cases, the net task trade-oﬀ eﬀect on the merit multiplier was 0 (Table C.1, Table C.2).
Presumably the mutation also increased gestation time slightly. Second, the next mutation
on each lineage is super-compensatory. Population 1191’s compensatory mutation doubled
its ﬁtness; population 1089,s resulted in a more-than-15-fold increase in ﬁtness!

86

Pop. 1089 NOT NAND

AND

ORN OR ANDN

NOR XOR EQU

pre-EQU
EQU
post-EQU
Table C.1: Task performance snapshot of Population 1089 before, at, and after the detrimental ﬁxation of EQU. The EQU-granting mutation trades a 21 task (NOT) and a 24 task
(NOR) for the 25 EQU task; the task trade-oﬀ net gain is 0. The next mutation on the
lineage regains the 24 NOR task.

Pop. 1191 NOT NAND

AND

ORN OR ANDN

NOR XOR EQU

pre-EQU
EQU
post-EQU
Table C.2: Task performance snapshot of Population 1191 before, at, and after the detrimental ﬁxation of EQU. The EQU-granting mutation trades both 22 tasks (AND, ORN)
and a 24 task (NOR) for the 25 EQU task and a 23 task (ANDN); the task trade-oﬀ net
gain is 0. The next mutation on the lineage trades the 23 ANDN task for the 24 NOR task.

87

BIBLIOGRAPHY

88

BIBLIOGRAPHY
Peter A. Abrams, Claus Rueﬄer and Gary Kim. 2008. Determinants of the strength of
disruptive and/or divergent selection arising from resource competition. Evolution, 62,
1571–1586.
Christoph Adami, Charles Ofria and Travis C. Collier. 2000. Evolution of biological complexity. Proceedings of the National Academy of Sciences of the United States of America,
97, 4463.
Michael C. Appleby. May 1983. The probability of linearity in hierarchies. Animal Behaviour, 31, 600–608.
Ben Bolker and R Development Core Team. bbmle: Tools for general maximum likelihood
estimation, 2012. URL http://CRAN.R-project.org/package=bbmle. R package version
1.0.5.2.
Bradley J. Cardinale, Helmut Hillebrand, W. S. Harpole, Kevin Gross and Robert Ptacnik. 2009. Separating the inﬂuence of resource ‘availability’ from resource ‘imbalance’ on
productivity–diversity relationships. Ecology Letters, 12, 475–487.
Ted J. Case. December 1990. Invasion resistance arises in strongly interacting species-rich
model competition communities. Proceedings of the National Academy of Sciences, 87,
9610–9614.
Paul Chesson. 2000. Mechanisms of maintenance of species diversity. Annual Review of
Ecology and Systematics, 31, 343–366.
Stephanie S. Chow, Wilke O. Claus, Charles Ofria, Richard E. Lenski and Christoph Adami.
July 2004. Adaptive radiation from resource competition in digital organisms. Science, 305,
84–86.
Tim F. Cooper and Charles Ofria. Evolution of stable ecosystems in populations of digital
organisms. In Proceedings of VIII International Conference on Artiﬁcial Life, pages 227–
232, 2002.
Arthur W. Covert III. On the beneﬁcial eﬀects of deleterious mutations. PhD thesis,
gradworks.umi.com, 2011.
89

Charles Darwin. The origin of species by means of natural selection, or, The preservation
of favored races in the struggle for life. John Murray, London, 6th edition, 1872.
Ulf Dieckmann and Michael Doebeli. July 1999. On the origin of species by sympatric
speciation. Nature, 400, 354–357.
Damian Eads, 2008. URL http://scipy-cluster.googlecode.com/. hcluster: Hierarchical Clustering for SciPy.
Santiago F. Elena, Claus O. Wilke, Charles Ofria and Richard E. Lenski. March 2007.
Eﬀects of population size and mutation rate on the evolution of mutational robustness.
Evolution, 61, 666–674.
Timothy J. Flowers, Hanaa K. Galal and Lindell Bromham. 2010. Evolution of halophytes:
multiple origins of salt tolerance in land plants. Functional Plant Biology.
Tobias Friedrich, Pietro S. Oliveto, Dirk Sudholt and Carsten Witt. 2009. Analysis of
diversity-preserving mechanisms for global exploration. Evolutionary Computation, 2003.
CEC’03. The 2003 Congress on, 17, 455–476.
Maren L Friesen, Gerda Saxer, Michael Travisano and Michael Doebeli. 2004. Experimental
evidence for sympatric ecological diversiﬁcation due to frequency-dependent competition in
Escherichia coli. Evolution.
Douglas J. Futuyma and Gabriel Moreno. 1988. The evolution of ecological specialization.
Annual Review of Ecology and Systematics, 19, 207–233.
Sergey Gavrilets and Aaron Vose. December 2005. Dynamic patterns of adaptive radiation.
Proceedings of the National Academy of Sciences of the United States of America, 102,
18040–18045.
Sherri Goings. Natural niching: Applying ecological principles to evolutionary computation.
PhD thesis, Michigan State University, September 2010.
Sherri Goings and Charles Ofria. 2009. Ecological approaches to diversity maintenance
in evolutionary algorithms. Artiﬁcial Life, 2009. ALife ’09. IEEE Symposium on, pages
124–130.
David E. Goldberg and Jon Richardson. Genetic algorithms with sharing for multimodal
function optimization. In Proceedings of the Second International Conference on Genetic
Algorithms and Their Application, pages 41–49. L. Erlbaum Associates Inc., October 1987.
90

Heather J. Goldsby. The Evolution of Division of Labor in Digital Organisms. PhD thesis,
Michigan State University, Michigan State University, February 2011.
Lee Graham, Franz Oppacher and Steﬀen Christensen. September 2007. Irreducible complexity in a genetic algorithm. Proceedings of the 2007 IEEE Congress on Evolutionary
Computation, pages 3692–3697.
T. Ryan Gregory. 2008. The evolution of complex organs. Evolution: Education and
Outreach, 1, 358–389.
Alex R. Hall and N Colegrave. January 2007. How does resource supply aﬀect evolutionary
diversiﬁcation? Proceedings of the Royal Society B: Biological Sciences, 274, 73–78.
Rees Kassen, Angus Buckling, Graham Bell and Paul B. Rainey. August 2000. Diversity
peaks at intermediate productivity in a laboratory microcosm. Nature, 406, 508–512.
M. G. Kendall and B. Babington Smith. 1940. On the method of paired comparisons.
Biometrika, 31, 324.
Theodore A. Kennedy, Shahid Naeem, Katherine M. Howe, Johannes M. H. Knops, David
Tilman and Peter Reich. June 2002. Biodiversity as a barrier to ecological invasion. Nature,
417, 636–638.
Richard E. Lenski, Jeﬀrey E. Barrick and Charles Ofria. 2006. Balancing robustness and
evolvability. PLoS Biol, 4, e428.
Richard E. Lenski, Charles Ofria, Travis C. Collier and Christoph Adami. 1999. Genome
complexity, robustness and genetic interactions in digital organisms. Nature, 400, 661–664.
Richard E. Lenski, Charles Ofria, Robert T. Pennock and Christoph Adami. May 2003.
The evolutionary origin of complex features. Nature, 423, 139–144.
John Maron and Marilyn Marler. 2007. Native plant diversity resists invasion at both low
and high resource levels. Ecology, 88, 2651–2661.
Enrique Melendez-Hevia, Thomas G. Waddell and Marta Cascante. 1996. The puzzle of
the Krebs citric acid cycle: Assembling the pieces of chemically feasible reactions, and
opportunism in the design of metabolic pathways during evolution. Journal of Molecular
Evolution, pages 293–303.

91

Dusan Misevic, Charles Ofria and Richard E. Lenski. February 2006. Sexual reproduction
reshapes the genetic architecture of digital organisms. Proceedings. Biological sciences /
The Royal Society, 273, 457–464.
St. George Mivart. On the Genesis of Species. Book on Demand, January 1871.
Dan-Eric Nilsson and Susanne Pelger. April 1994. A pessimistic estimate of the time
required for an eye to evolve. Proceedings of the Royal Society B: Biological Sciences, 256,
53–58.
Charles Ofria, David M. Bryson and Claus O. Wilke. Avida: A Software Platform for
Research in Computational Evolutionary Biology. In Maciej Komosinski and Andrew
Adamatzky, editors, Artiﬁcial Life Models in Software, pages 3–35. Springer London, London, 2009.
Charles Ofria, Wei Huang and Eric Torng. 2008. On the gradual evolution of complexity
and the sudden emergence of complex features. Artiﬁcial Life.
R Core Team. R: A Language and Environment for Statistical Computing. R Foundation for
Statistical Computing, Vienna, Austria, 2012. URL http://www.R-project.org/. ISBN
3-900051-07-0.
D. A. Roﬀ and D. J. Fairbairn. 2007. The evolution of trade?oﬀs: where are we? Journal
of Evolutionary Biology, 20, 433–447.
Rowan F. Sage. 2004. The evolution of C4 photosynthesis. New Phytologist.
Bruno Sareni and Laurent Kr¨henb¨hl. September 1998. Fitness sharing and niching
a
u
methods revisited. IEEE Transactions on Evolutionary Computation, 2, 97–106.
Petr Savicky.
pspearman:
Spearman’s rank correlation test, 2009.
http://CRAN.R-project.org/package=pspearman. R package version 0.2-5.

URL

Dolph Schluter. November 1996. Ecological causes of adaptive radiation. The American
Naturalist, 148, S40–S64.
David Tilman. Resource Competition and Community Structure. Princeton University
Press, Princeton, NJ, 1982.

92

David Tilman. July 2004. Niche tradeoﬀs, neutrality, and community structure: a stochastic
theory of resource competition, invasion, and community assembly. Proceedings of the
National Academy of Sciences of the United States of America, 101, 10854–10861.
Bess L. Walker and Charles Ofria. Evolutionary Potential is Maximized at Intermediate
Diversity Levels. In International Conference on the Simulation and Synthesis of Living
Systems, pages 116–120. MIT Press, July 2012.
Daniel B. Weissman, Michael M. Desai, Daniel S. Fisher and Marcus W. Feldman. 2009.
The rate at which asexual populations cross ﬁtness valleys. Theoretical Population Biology,
75, 286–300.
Claus O. Wilke, Jia Lan Wang, Charles Ofria, Richard E. Lenski and Christoph Adami.
2001. Evolution of digital organisms at high mutation rates leads to survival of the ﬂattest.
Nature, 412, 331–333.
Erika S. Zavaleta and Kristin B. Hulvey. November 2004. Realistic species losses disproportionately reduce grassland resistance to biological invaders. Science, 306, 1175–1177.

93