COMPUTATIONAL METHODS TO INVESTIGATE CONNECTIVITY IN
                    EVOLVABLE SYSTEMS
                                 By
                        Acacia Lee Ackles
                       A DISSERTATION
                           Submitted to
                   Michigan State University
            in partial fulfillment of the requirements
                         for the degree of
           Integrative Biology - Doctor of Philosophy
         Ecology, Evolution, and Behavior - Dual Major
                                2022


                                          ABSTRACT
        COMPUTATIONAL METHODS TO INVESTIGATE CONNECTIVITY IN
                                   EVOLVABLE SYSTEMS
                                               By
                                       Acacia Lee Ackles
    Evolution sheds light on all of biology, and evolutionary dynamics underlie some of the
most pressing issues we face today. If we can deepen our understanding of evolution, we can
better respond to these various challenges. However, studying such processes directly can
be difficult; biological data is naturally messy, easily confounded, and often limited. For-
tunately, we can use computational modeling to help simplify and systematically untangle
complex evolutionary processes. The aim of this dissertation is therefore to develop innova-
tive computational frameworks to describe, quantify, and build intuition about evolutionary
phenomena, with a focus on connectivity within evolvable systems. Here I introduce three
such computational frameworks which address the importance of connectivity in systems
across scales.
    First, I introduce rank epistasis, a model of epistasis that does not rely on baseline
assumptions of genetic interactions. Rank epistasis borrows rank-based comparison testing
from parametric statistics to quantify mutational landscapes around a target locus and
identify how much that landscape is perturbed by mutation at that locus. This model
is able to correctly identify lack of epistasis where existing models fail, thereby providing
better insight into connectivity at the genome level.
    Next, I describe the comparative hybrid method, an approach to piecewise study of com-
plex phenotypes. This model creates hybridized structures of well-known cognitive substrates
in order to address what facilitates the evolution of learning. The comparative hybrid model
allowed us to identify both connectivity and discretization as important components to the
evolution of cognition, as well as demonstrate how both these components interact in differ-
ent cognitive structures. This approach highlights the importance of recognizing connected


components at the level of the phenotype.
    Finally, I provide an engineering point of view for Tessevolve, a virtual reality enabled
system for viewing fitness landscapes in multiple dimensions. While traditional methods
have only allowed for 2D visualization, Tessevolve allows the user to view fitness landscapes
scaled across 2D, 3D, and 4D. Visualizing these landscapes in multiple dimensions in an intu-
itive VR-based system allowed us to identify how landscape traversal changes as dimensions
increase, demonstrating the way that connections between points across fitness landscapes
are affected by dimensionality.
    As a whole, this dissertation looks at connectivity in computational structures across a
broad range of biological scales. These methods and metrics therefore expand our computa-
tional toolkit for studying evolution in multiple systems of interest: genotypic, phenotypic,
and at the whole landscape level.


Copyright by
ACACIA LEE ACKLES
2022


To Peggy Shupp–or, as I know you, Miss Peggy. Everything I have reaped from academia
           was sowed in your Montessori classroom some twenty years ago.
                                          v


                                ACKNOWLEDGEMENTS
    0 THE FOOL On the long journey through graduate school, there have been an im-
measurable number of people who saw a twenty-one-year-old with little concept of how to
debug code and helped shape her into a professor of computer science.
    I THE MAGICIAN To Arend Hintze, whose support of scientific failure shaped my
outlook through my graduate career.
    II THE HIGH PRIESTESS To Emily Dolson, who always let me cause a problem,
celebrate our wins, and cry in her office.
    III THE EMPRESS To Mom and all the times you let me leave the nest.
    IV THE EMPEROR To Dad and long summers filled with riddles.
    V THE HIEROPHANT To Charles Ofria, larger than life and down to earth.
    VI THE LOVERS To the strange little tangle of joy and love I’ve found myself in:
Shayne, and Parker, and Jess.
    VII THE CHARIOT To Jory Schossau, who taught me, with equal patience, how to
use vim and drive a car.
    VIII STRENGTH To the long, long list of people whose laughter kept me afloat—
Brielle and Crook and Mei and Luisa and DT and Giants and Kitsey and too many more to
name.
    IX THE HERMIT To Elliott Whitson. There’s no one else I’d rather be in a folie à
deux with.
    X THE WHEEL To the undergraduates I’ve been fortunate enough to mentor these
last few years: Sarah Albani, Aria Killebrew-Bruehl, and Lanea Rohan.
    XI JUSTICE To Grampy and Grammy and their magical home for wayward souls.
    XII THE HANGED MAN To Clifford Bohm, for the new worlds I got to play in.
    XIII DEATH To Harmony. I don’t ask much from you, but I asked for this. Thank
                                            vi


you.
   XIV TEMPERANCE To Elise Zipkin. Thank you for keeping the wheels on the track.
   XV THE DEVIL To Anya Vostinar. Let’s crash the next train.
   XVI THE TOWER To the plague years. Good riddance.
   XVII THE STAR To Katie. What else is there to say after seven years?
   XVIII THE MOON To the remaining members of the Hintze lab and Devolab: Clifford
Bohm, Douglas Kirkpatrick, Nitash CG, Vincent Ragusa, Alex Lalejini, Matthew Moreno,
Jose Hernandez, Kate Skocelas, and Austin Ferguson. Thank you for being there through
the different phases of my graduate career.
   XIX THE SUN To the members of the ECODE lab: Shakiba Shahbendegan, Austin
Mills, Anna Catenacci, and Aria Kingstrom. Thank you for the warmth the last year.
   XX JUDGEMENT To my committee members, Ingo Braasch and Christoph Adami,
who let me spin this together and was there weaving with me all the way.
   XXI THE WORLD Here’s to the rest of the journey with you all.
                                            vii


                                TABLE OF CONTENTS
LIST OF TABLES . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .       xi
LIST OF FIGURES . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .       xii
Chapter 1    Introduction . . . . . . . . . . . . . . .     . . . . . . . . . . . . . . . .   1
   1.1 Why Study Evolvable Systems? . . . . . . . . .       . . . . . . . . . . . . . . . .   2
   1.2 Connectivity in Evolvable Systems . . . . . . .      . . . . . . . . . . . . . . . .   2
   1.3 Digital Evolution and Artificial Life . . . . . . .  . . . . . . . . . . . . . . . .   4
   1.4 Thesis Statement . . . . . . . . . . . . . . . . .   . . . . . . . . . . . . . . . .   5
   1.5 Contributions . . . . . . . . . . . . . . . . . . .  . . . . . . . . . . . . . . . .   5
       1.5.1 Genotype: Rank Epistasis . . . . . . . .       . . . . . . . . . . . . . . . .   5
       1.5.2 Phenotype: Comparative Hybrid Method           . . . . . . . . . . . . . . . .   6
       1.5.3 Landscape: Tessevolve . . . . . . . . . .      . . . . . . . . . . . . . . . .   7
Chapter 2 Rank Epistasis: A rank based model of genome connectivity .                         8
   2.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .   8
   2.2 Methods . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .    9
       2.2.1 Rank Epistasis Algorithm . . . . . . . . . . . . . . . . . . . . . . . .         9
       2.2.2 Comparisons of Baseline Assumptions . . . . . . . . . . . . . . . . . .        11
       2.2.3 Identifying Degrees of Epistasis . . . . . . . . . . . . . . . . . . . . .     13
               2.2.3.1 Canonical NK Landscapes . . . . . . . . . . . . . . . . . . .        13
               2.2.3.2 Variant NK Landscapes . . . . . . . . . . . . . . . . . . . .        13
       2.2.4 Data and Code Availability . . . . . . . . . . . . . . . . . . . . . . .       15
   2.3 Results and Discussion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .   16
       2.3.1 Rank epistasis correctly detects the absence of epistasis on a mixed
               additive/multiplicative landscape . . . . . . . . . . . . . . . . . . . .    16
       2.3.2 Rank epistasis correlates with K on Canonical NK Landscapes . . . .            16
       2.3.3 Disentangling K on Variant NK Landscapes . . . . . . . . . . . . . .           18
               2.3.3.1 Rank epistasis measures different epistatic levels on Half-and-
                        Half landscape . . . . . . . . . . . . . . . . . . . . . . . . .    20
               2.3.3.2 Rank epistasis measures apparent epistasis on Mixed landscape        20
   2.4 Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 22
   2.5 Author Contribution Statement . . . . . . . . . . . . . . . . . . . . . . . . .      23
   2.6 Acknowledgements . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .     23
Chapter 3    The comparative hybrid approach to investigate                    cognition
             across substrates . . . . . . . . . . . . . . . . . . . . .      . . . . . . . 24
   3.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24
   3.2 Methods . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .  . . . . . . . 27
       3.2.1 MABE: Modular Agent-Based Evolver . . . . . . . . . .            . . . . . . . 27
       3.2.2 Brains: RNNs and Markov Brains . . . . . . . . . . . . .         . . . . . . . 27
                                            viii


             3.2.2.1 Recurrent Artificial Neural Networks: . . . . . . . . . . . . .       28
             3.2.2.2 Markov Brains: . . . . . . . . . . . . . . . . . . . . . . . . .      29
             3.2.2.3 Comparison of Brains . . . . . . . . . . . . . . . . . . . . .        29
             3.2.2.4 Hybrid Brains . . . . . . . . . . . . . . . . . . . . . . . . . .     30
      3.2.3 Worlds . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .   31
             3.2.3.1 NBack World . . . . . . . . . . . . . . . . . . . . . . . . . .       31
             3.2.3.2 PathFollow World . . . . . . . . . . . . . . . . . . . . . . .        33
             3.2.3.3 BlockCatch World . . . . . . . . . . . . . . . . . . . . . . .        34
             3.2.3.4 Experiment Details . . . . . . . . . . . . . . . . . . . . . . .      34
  3.3 Results and Discussion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .   35
      3.3.1 Primary Results Analysis . . . . . . . . . . . . . . . . . . . . . . . . .     35
      3.3.2 Cross-Brain Examination: Discretization . . . . . . . . . . . . . . . .        38
             3.3.2.1 Discretization Helps Performance on BlockCatch and Path-
                       Follow . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .  38
             3.3.2.2 Discretization or Sparsity Alone Hinder, But Together Help
                       in RNN Hybrids Performance on NBack . . . . . . . . . . .           38
      3.3.3 Other Factors To Consider . . . . . . . . . . . . . . . . . . . . . . . .      38
             3.3.3.1 Potential Influence of Encoding . . . . . . . . . . . . . . . .       39
      3.3.4 Information Integration . . . . . . . . . . . . . . . . . . . . . . . . . .    39
  3.4 Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 41
  3.5 Acknowledgements . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .     42
  3.6 Author Contributions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .   42
Chapter 4   Tessevolve: An interface for visualizing evolutionary                fitness
            landscapes in 4D . . . . . . . . . . . . . . . . . . . . . . .       . . . . . 43
  4.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 43
  4.2 Domain Background . . . . . . . . . . . . . . . . . . . . . . . . . .      . . . . . 45
      4.2.1 Evolutionary Fitness Landscapes . . . . . . . . . . . . . . .        . . . . . 46
      4.2.2 Loss of Information & Intuition . . . . . . . . . . . . . . . .      . . . . . 47
      4.2.3 Holey fitness landscapes . . . . . . . . . . . . . . . . . . . .     . . . . . 48
  4.3 Related Work . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .   . . . . . 49
      4.3.1 Graph-Based Visualizations . . . . . . . . . . . . . . . . . .       . . . . . 49
             4.3.1.1 Network Models . . . . . . . . . . . . . . . . . . .        . . . . . 49
             4.3.1.2 Hypergraphs . . . . . . . . . . . . . . . . . . . . .       . . . . . 51
      4.3.2 VR-Enabled Visualizations . . . . . . . . . . . . . . . . . . .      . . . . . 52
             4.3.2.1 2D Landscapes in VR . . . . . . . . . . . . . . . .         . . . . . 52
             4.3.2.2 Breaking into the Fourth Dimension . . . . . . . .          . . . . . 53
  4.4 Domain Requirements and System Overview . . . . . . . . . . . . .          . . . . . 54
      4.4.1 Requirements . . . . . . . . . . . . . . . . . . . . . . . . . .     . . . . . 54
      4.4.2 System Overview . . . . . . . . . . . . . . . . . . . . . . . .      . . . . . 54
  4.5 Implementation of Tessevolve . . . . . . . . . . . . . . . . . . . . .     . . . . . 55
      4.5.1 Data Model . . . . . . . . . . . . . . . . . . . . . . . . . . .     . . . . . 55
      4.5.2 Data Generation and Pre-Processing . . . . . . . . . . . . .         . . . . . 56
      4.5.3 Virtual Reality Engine . . . . . . . . . . . . . . . . . . . . .     . . . . . 56
      4.5.4 Design Choices . . . . . . . . . . . . . . . . . . . . . . . . .     . . . . . 57
                                            ix


      4.5.5  Visualization Interface .   . . . . . . . . . . . . . . . . . . . . . . . . . 58
      4.5.6  Dimension Comparisons       . . . . . . . . . . . . . . . . . . . . . . . . . 60
             4.5.6.1 2D Landscapes       . . . . . . . . . . . . . . . . . . . . . . . . . 60
             4.5.6.2 3D Landscapes       . . . . . . . . . . . . . . . . . . . . . . . . . 60
             4.5.6.3 4D Landscapes       . . . . . . . . . . . . . . . . . . . . . . . . . 61
  4.6 Expert Feedback . . . . . . . .    . . . . . . . . . . . . . . . . . . . . . . . . . 61
      4.6.1 Ease of Interpretation .     . . . . . . . . . . . . . . . . . . . . . . . . . 61
      4.6.2 Ease of Use . . . . . . .    . . . . . . . . . . . . . . . . . . . . . . . . . 63
      4.6.3 Constructive Feedback .      . . . . . . . . . . . . . . . . . . . . . . . . . 63
  4.7 Discussion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 64
      4.7.1 Landscape Ruggedness .       . . . . . . . . . . . . . . . . . . . . . . . . . 64
      4.7.2 Valley-Crossing in 2D vs.    3D vs. 4D     . . . . . . . . . . . . . . . . . . 65
  4.8 Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 66
  4.9 Acknowledgements . . . . . . .     . . . . . . . . . . . . . . . . . . . . . . . . . 68
Chapter 5   Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .     69
  5.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 69
  5.2 Contributions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .  69
  5.3 Future Directions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .  70
      5.3.1 Information-theoretic extensions to existing metrics . . . . . . . . . .       70
             5.3.1.1 Rank epistasis on information-theoretic networks . . . . . .          71
             5.3.1.2 Adding informational network flow to the Comparative Hy-
                       brid Method . . . . . . . . . . . . . . . . . . . . . . . . . .     71
      5.3.2 Expanding our ability to interface with biological data . . . . . . . .        72
             5.3.2.1 Refining intuition in Tessevolve . . . . . . . . . . . . . . . .      72
             5.3.2.2 Creating audioscapes in VR . . . . . . . . . . . . . . . . . .        72
             5.3.2.3 Spatial models of genome structure . . . . . . . . . . . . . .        73
  5.4 Final Thoughts . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .   73
BIBLIOGRAPHY . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .       74
                                             x


                                    LIST OF TABLES
Table 3.1: Differences (p-value) between mean performance on tasks (A) NBack, (B)
           PathFollow, and (C) BlockCatch. Green cells indicate significant differ-
           ence. P-values corrected using Tukey method for a family of 7 estimates.
           * = p ≤ 0.05; ** = p ≤ 0.01; *** = p ≤ 0.001. . . . . . . . . . . . . . . . 37
                                           xi


                                    LIST OF FIGURES
Figure 2.1: Schematic of additive and multiplicative genome for length n = 6. The
            first n/2 sites are summed and the last n/2 sites are multiplied, then the
            two halves are summed for the final fitness. Image by Katie Gleason. . .       12
Figure 2.2: Schematic for Half-and-Half landscape with first half K = 1, second half
            K = 2. Filled boxes represent sites of the genome. Empty rectangles
            underneath are guides for which sites are part of the same evaluation
            block. Image by Katie Gleason. . . . . . . . . . . . . . . . . . . . . . . .   14
Figure 2.3: Schematic for Mixed landscape with evens K = 1, odds K = 2. Lines
            indicate how even-valued sites and odd-numbered sites are extracted to
            form a seperately evaluated genomes. Filled boxes represent sites of the
            genome. Empty rectangles underneath are guides for which sites are part
            of the same evaluation block. Image by Katie Gleason. . . . . . . . . . .      15
Figure 2.4: Comparison of rank epistasis metric to additive and multiplicative versions
            of metric ϵ as described in (109). Each x-axis tick represents one position
            in the genome, i.e., the ϵ or ω detected for that site. Note that ϵ and ω
            are not usually directly comparable; however, since ω = 0 and ϵ = 0 both
            indicate no epistasis, here ω is graphed on the same Y-axis. Dashed line
            at Y=0 indicates the true level of interactivity in genome. Dots represent
            mean and bars represent 95% CIs for 100 replicates. . . . . . . . . . . . .    17
Figure 2.5: Rank epistasis metric ω on a canonical NK landscape. Each x-axis tick
            represents one position in the genome, i.e., the ω detected for that site.
            Dots represent mean and bars represent 95% CIs for 100 replicates. . . .       17
Figure 2.6: Rank epistasis metric ω on a Half-and-Half NK landscape. Top bars indi-
            cate the K value set for the first N = 50 values. Right bars indicate the
            K value for the second N = 50 values. Each x-axis tick within subplots
            represents one position in the genome, i.e., the ω detected for that site.
            Dots represent mean and bars represent 95% CIs for 100 replicates. . . .       19
Figure 2.7: Rank epistasis metric ω on a mixed NK landscape. Top bars indicate the
            K value set for even-valued. Right bars indicate the K value for odd-
            valued sites. Each x-axis tick within subplots represents one position in
            the genome, i.e., the ω detected for that site. Dots represent mean and
            bars represent 95% CIs for 100 replicates. . . . . . . . . . . . . . . . . . . 21
Figure 3.1: Schematics showing structures of A) a generic brain, B) an Recurrent
            Artificial Neural Network (RNN) and C) a Markov Brain. . . . . . . . . .       28
                                              xii


Figure 3.2: Schematic of hybrids. Orange (top) are Markov-based; purple (bottom)
            are RNN-based. Arrows indicate a single change away from the canonical
            brain structure. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30
Figure 3.3: Depictions of the worlds on which the brains were evolved. (A) NBack.
            Visualization of the data flow required to perform the NBack 8 task of
            recalling the previous 8 given bits. Here i represents input, m represents
            memory, and o1 through o8 represent outputs. (B) PathFollow. Exam-
            ple map from the PathFollow task. Agents start on the green start square
            facing in the direction of the path. Black indicates path locations with
            food, where the agent must move forward and which the agent is rewarded
            for visiting. Yellow indicates locations with randomized turn signals which
            the agent must associate with either left of right turn. Red indicates the
            end of the path. Agents that reach the end of the path receive an addi-
            tional reward if they have visited all ”food” locations. (C) BlockCatch.
            Spatial layout of the BlockCatch task. Blue indicates the agent; dark blue
            indicates the location of the sensors. Bright yellow indicates the area vis-
            ible to the sensors, while darker yellow indicates area that is not visible
            in the agent’s current location. Green (blank) and red (hatched) depict
            blocks that should be caught or missed and arrows indicate possible lateral
            motion of blocks as they fall. . . . . . . . . . . . . . . . . . . . . . . . . . 32
Figure 3.4: Performance of computational structures across tasks. Color hue indicates
            the brain variant used; color saturation indicates distance in number of
            changes from canonical structures, with more distant hybrids being less
            saturated. Note the varied scales on the y-axis, as we are interested here
            only in comparative performance within worlds rather than across worlds.         36
Figure 4.1: Original fitness landscape schematic from Wright (1932). Plus signs rep-
            resent peaks while minus signs represent valleys. Dotted lines indicate
            changes in altitude (that is, changes in fitness). . . . . . . . . . . . . . . . 46
Figure 4.2: Network model of a fitness landscape. Lines indicate genotypes which are
            adjacent in evolutionary space. Colors indicate fitness according to key.
            Arrows denote traversal of the landscape. Adapted from Wu et al., 2016.
            Image by Katie Gleason. . . . . . . . . . . . . . . . . . . . . . . . . . . .    50
Figure 4.3: Hyperspace graph paper example in 3D. Each corner of the cube is “un-
            folded” to create a flat grid structure. Arrows are for demonstration pur-
            poses to indicate how adjacent corners in the cube are positioned in the
            unfolded grid. Greyscale represents fitness, where more saturated greys are
            higher fitness. Adapted from Wiles et al., 2003. Image by Katie Gleason.         51
                                              xiii


Figure 4.4: 2D landscape visualized in virtual reality. Black line represents a lineage
            traversing the landscape. . . . . . . . . . . . . . . . . . . . . . . . . . . .   52
Figure 4.5: Illustration of the challenges of having a continuous 3D landscape visual-
            ization. (A) A landscape visualization where all genotypes are displayed.
            (B) The poor view from inside A. (C) A landscape visualization where
            genotypes are displayed at fixed, discrete intervals. (D) The view from
            inside C. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 57
Figure 4.6: Tessevolve web interface. Changing the settings in the settings drop-down
            panel and clicking ‘Draw’ allows users to load an updated landscape. The
            FAQ provided in the top right contains detailed information about the
            display. Drop-down menus allow users to select the combinations of pa-
            rameters they are interested in seeing; the Draw button loads that con-
            figuration into the JavaScript DOM. The configuration of settings shown
            here is used in all other figures unless otherwise stated. . . . . . . . . . .    59
Figure 4.7: Ruggedness in Vincent (top) and Composite Fitness 1 (bottom) landscape.
            Note how properties of ruggedness transfer across dimensions. (A,D) 2-
            Dimensional landscape. (B,E) 3-Dimensional landscape. (C,F) Adjacent
            slices of 4-Dimensional landscape. . . . . . . . . . . . . . . . . . . . . .      64
Figure 4.8: Representative valley-crossing dynamics in 2D, 3D, and 4D. In each case,
            the Shubert function is the underlying evolutionary landscape, and popu-
            lations mutate every generation with a standard deviation of 0.0001 and
            tournament size 16. The only difference in each is case is the number of
            dimensions. (A) 2D lineage where no valley-crossing occurs. (B) 3D lin-
            eage where valley-crossing occurs once across a large valley. (C) 4D lineage
            where the lineage stays primarily near one peak, but hops around nearby
            peaks frequently. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .   65
                                               xiv


Chapter 1
Introduction
      Evolutionary theory is the through line of biology. At all scales from base pairs to
ecosystems, evolution is a powerful driving force and organizing principle for biological phe-
nomena. Understanding evolutionary processes is essential to understand the world in which
we live and to address large-scale biological challenges such as disease prevention and climate
change mitigation.
      Evolution can be challenging to study, however, in part because evolvable systems1 are
made of constrained, connected components. Those connections are what make evolved
systems more than just simplistic input-output machines. Studying these connections in
actively evolving systems allows us to make inferences about the fundamental properties
behind evolutionary trajectories.
      Since natural systems are subject to numerous confounding factors and stochastic pro-
cesses, it is often more tractable to study evolution via a simulation or model. However, no
single model can capture all the complexities of any system; if it could, it would be a direct
copy providing no further insight. Imagine, for example, a map with perfect resolution of
every street, every bend in the road, every pothole or sidewalk crack, every blade of grass.
Such a map would be, by necessity, the size of the world. Therefore, simplifications must be
   1
     Evolvable systems here refers to any system (digital, biological, theoretical) that adheres to and is shaped
by principles of evolution: mutability, heritability, and selection.
                                                        1


made: streets as smooth lines, buildings as vague shapes. Similarly, to untangle evolutionary
processes, we must approach them from a variety of angles and with a suite of imperfect yet
informative models.
     The aim of this dissertation is therefore to develop innovative computational frameworks
to describe, quantify, and build intuition about evolutionary phenomena, with a focus on
connectivity within evolvable systems. Here I introduce the theoretical backing and method-
ological process for three new analysis techniques dealing with such evolutionary phenomena
at the genotype, phenotype, and landscape level. These techniques are: a metric for epistatic
interaction; an approach to the study of cognitive structures; and a visualization platform
for multidimensional fitness landscapes. Together, these approaches represent a novel con-
tribution to to the study of connected evolvable systems at a variety of scales.
1.1      Why Study Evolvable Systems?
     This dissertation approaches the study of evolution in a broad, theoretical sense rather
than in a specific model system (hence the term evolvable systems). The goal of this ap-
proach is to study evolution not in any particular instance, but to study it as a process that
underlies a wide variety of present-day scientific challenges. Evolutionary processes govern,
for example, ecological systems’ responses to human activity (74), antibiotic resistance and
vaccine efficacy (22; 53), and the emergence of viral variants (88; 19). If we can deepen
our understanding of evolution in a general sense, we can better respond to these various
challenges. In particular, I focus here on what I argue is one of the key drivers of complex
evolutionary processes: connectivity.
1.2      Connectivity in Evolvable Systems
     Connectivity is a term I use throughout this dissertation to refer broadly to the fact that
in any complex evolvable system, some components of the system impose constraints on other
                                                2


components. These connected systems therefore can all be thought of in terms of networks,
each with unique structural impact on their respective systems. Connectivity between genes
creates epistasis (Chapter 2); connectivity between information structures creates cognition
(Chapter 3); connectivity between mutational optima creates ridges in fitness landscapes
(Chapter 4). It is this connectivity which creates evolutionary phenomena worthy of deeper
study–after all, an unconstrained system is an unlimited problem solver.
     There are numerous existing approaches to studying connectivity in evolvable systems.
One approach is to look at evolutionary robustness, or how the failure of one component
affects the performance of the system (97; 64). Robustness is frequently studied as a driver
of genetic variation, particularly in metabolic networks (40; 33). A second common approach
is modularity: the study of systems consisting of densely connected hubs which are sparsely
connected to each other (98; 18; 68). Modularity in evolution is a key driver of evolution
at all scales, from protein interaction networks (39) to morphological innovation (82) eco-
evolutionary dynamics (30). In all of these cases, however, connectivity is under study as an
aspect of a more specific structural phenomenon.
     This dissertation takes a very broad view of connectivity which draws from each of
the areas listed above. I am interested in studying connectivity not as a feature of any
particular system or phenomenon, but as an underlying factor of what makes evolvable
systems evolvable.
     One challenge of studying connectivity in evolution is that natural systems are subject
not only to the inherent constraints of connectivity itself but also to the historical constraints
of past connections (10). Therefore, to study connected systems at a theoretical level, it is
necessary to study evolution not just as it has happened in existing systems, but as it happens
in controllable systems: evolution in action.
                                                3


1.3        Digital Evolution and Artificial Life
      While I am interested in studying questions of connectivity and complexity, often in
natural systems it is difficult to separate the complexity you are interested in from the
complexity that arise from stochastic processes outside experimental control. Fortunately,
we can use computational modeling to help simplify and systematically untangle complex
evolutionary processes.
      Using computational models, we can study evolution as an algorithm. The mathematical
rules which govern evolution in biological systems can be directly translated to computational
systems. We can represent organisms as data structures, and evolutionary forces as an
algorithmic process to evaluate, mutate, and replicate those structures in a process termed
digital evolution or artificial life (80). I will use here the term artificial life as it is the broader
term, but both are applicable.
      Artificial life is a rapidly-growing and relatively recent field; for some reviews of artificial
life as a field and its current applications, see (80; 67; 56; 6). The field has produced insights
into eco-evolutionary processes such as adaptation (103), genetic complexity (109), sexual
selection (11), cooperation (96), and more. At the cornerstone of many of these approaches
is the desire to model and understand biological evolution by means of digital evolution.
      Creating informative models of evolving systems is an open problem of great interest to
the fields of digital evolution and artificial life. Incredible progress has been made towards
these models, including for example the construction of an artificial cell (32), the development
of self-organizing bio-robots (57), and steps towards indefinitely scalable architecture (3).
Each of these systems contains some model of adaptive biology or evolution; it is that
general-purpose evolutionary model that makes them powerful tools for problem solving.
Crucially, while these models operate at very different scales from cells to systems, each is
made of connected components which together form a complex network of interactions. It
is this complex network which allows evolutionary systems to operate as more than the sum
of their parts, whether those parts are artificial organelles or large networks of computer
                                                    4


hardware. Connectivity in digital systems is therefore an important component to creating
rich, informative models of evolutionary processes.
     I introduce in this dissertation three connectivity-focused approaches to studying digital
evolution which seek to expand our repertoire of explanatory and exploratory models of
evolution. Each of these models operates at different scales, but each address problems of
connectivity in their respective evolvable systems.
1.4       Thesis Statement
     Connectivity in evolvable systems both constrains and complexifies those systems. De-
veloping new methods and metrics to study these complex interconnected features of evolv-
able systems is essential to our understanding of adaptation and evolvability.
1.5       Contributions
     This thesis is organized into three chapters, each introducing a novel method or metric
to investigate some aspect of connectivity in evolvable systems.
1.5.1      Genotype: Rank Epistasis
     In Chapter 2, I introduce rank epistasis, a new tool for assessing levels of genome
connectivity without baseline interaction assumptions. Epistasis refers to multiple genes
interacting to create a single phenotype, and it is a key component of complex genetic
architectures (55; 99; 105; 38; 83). If loci are epistatic, the fitness effects of individual
mutations at each loci may not fully determine their combined fitness effect. Determining
how strong epistatic interactions are at individual sites or for individual genes is therefore
an important problem in evolutionary biology if we are to understand fitness landscapes
(20; 99). However, challenges arise when measuring epistatic load in part because existing
                                                5


metrics require a baseline expectation against which to compare epistatic interactions, which
may be incomplete or incorrect (85).
     We therefore developed a new rank-based epistasis metric, rank epistasis, which makes
no assumptions about baseline interactions. We find that this metric is able to correctly
detect lack of epistasis where existing metrics fail, as well as distinguish between different
levels of epistatic activity.
1.5.2      Phenotype: Comparative Hybrid Method
     In Chapter 3, I introduce the Comparative Hybrid Method to investigate how connec-
tivity in computational structure impacts computational performance on a holistic level.
Understanding the structure and evolution of cognition is a topic of broad scientific inter-
est. Computational substrates are ideal for conducting investigations into this topic because
they can be incorporated in rapidly evolving Artificial Life systems and are easy to manipu-
late. However, design differences between currently existing digital systems make it difficult
to identify which manipulations are responsible for broad patterns in evolved behavior, and
current methods rarely look at how the underlying structural differences between systems in-
fluence cognition (43; 87). Identifying influential cognitive components is further confounded
if we are trying to disentangle how multiple features interact.
     Therefore, we developed an approach we termed the Comparative Hybrid Method to sys-
tematically analyze components from two evolvable digital neural substrates. We identified
elements of the logic and memory storage architectures in each substrate, then altered and
recombined properties of the original substrates to create hybrid substrates. We identified
both connectedness of the internal network and discreteness of memory as an important de-
terminants of performance across our test conditions, which provided greater understanding
of how phenotypic components create performance differences in computational structures.
                                                6


1.5.3     Landscape: Tessevolve
      In Chapter 4, I introduce Tessevolve, a new platform for intuitively investigating how
the mutational network connectivity between different areas of a fitness landscape scales
with dimensionality. Evolutionary fitness landscapes are a topographical representation of
the many parameters organisms must balance for evolutionary success, and are a popular
visualization tool for building intuition about the nature of evolution (106). However, these
relief-map inspired visualizations are traditionally limited to only two evolutionary parame-
ters, which can severely mislead our intuition about the full nature of the landscape (48; 5).
      To help combat this problem, we have developed Tessevolve, a web-enabled virtual
reality (VR) based platform for evolutionary landscape visualization in 2, 3, or 4 dimensions.
In this chapter we describe the full visualization pipeline for Tessevolve from evolutionary
landscape data to VR visualization. We render the evolutionary landscape as a series of
spheres whose coordinates represent evolutionary parameters and whose color represents
fitness. For 2D and 3D landscapes, the entire landscape is displayed at once. For 4D
landscapes, users can scroll through 3D slices of the landscape to watch the colors shift as
a representation of the fourth dimension. In each dimension, landscapes are overlaid with
phylogenetic lineage data to display changing dynamics as dimensions increase. We find that
the ability to view landscapes in 2D, 3D, and 4D under the same visualization scheme leads
to a stronger intuition about the effect of dimensionality on landscape connectivity therefore
traversability.
                                               7


Chapter 2
Rank Epistasis: A rank based model
of genome connectivity
Authors: Acacia L. Ackles, Emily L. Dolson, and Charles Ofria.
This chapter is adapted from an upcoming publication to be submitted to PLOS Computa-
tional Biology.
2.1       Introduction
      Epistasis, or the interaction between two or more genes to create an unexpected pheno-
type, is a genetic phenomenon that creates much of the complex structure we see in genome
evolution. Often, we are interested in epistasis not just for its own benefit, but rather as
an explanation for how a population might shift from one stable genotype to another stable
genotype. Both increased theoretical understanding (99; 83) and long-term empirical studies
(55; 105; 38), have led us to greater understanding of the significant role epistasis plays in
evolution and adapation.
      One major challenge to studying epistatic interactions, however, is detecting epistatic
interactions. While the impact of epistasis on genome function is well-established, there is
still ongoing uncertainty as to how to measure such impact (20; 23; 62; 75). Determining how
                                               8


strong epistatic interactions are at individual sites in a genome–or, indeed, whether sites are
epistatic at all–is essential to understanding evolutionary landscapes (99). However, such
detection can be confounded if we do not have an accurate baseline expectation for how
genes “should” interact in the absence of epistatic activity.
      Current measures of epistasis tend to set the assumption that genes will interact either
additivly (109) or multiplicativly (28), and measure epistasis based on the deviation from this
proposed interaction. However, these assumptions each result in qualitatively different kinds
of interactions being detected as epistatic (85). Therefore, if the wrong baseline is chosen, or
if some loci interact additively and others multiplicatively, loci may appear epistatic upon
measurement despite having no true epistatic effect.
      In order to address these challenges, we propose a rank-based epistasis metric that avoids
assumptions about the nature of epistatic interactions, focusing instead on how perturba-
tions to one site affect the ordering of fitness contributions from other sites. We find that
this approach correctly identifies lack of epistasis in an example case where both additive
and multiplicative existing metrics fail. Further, it is able to identify different degrees of
interactivity within genomes without making assumptions about the nature of these inter-
actions.
2.2       Methods
      Here we describe the rank epistasis algorithm, as well as a proof-of-concept experiment
to test its efficacy and an illustrative comparison between rank epistasis and traditional
metrics.
2.2.1      Rank Epistasis Algorithm
      To evaluate a target locus with rank epistasis, do the following:
    1. Choose a reference genome. In digital systems, this may be the maximally performing
                                                9


   organism; in biological systems, this may be the wild type. This reference genome will
   be used as the baseline around which epistasis is measured, so it should be a genome
   representative of the general population of interest.
2. For each site in the reference genome, measure the fitness effect of each one-step mu-
   tation across all non-target loci. For large genomes (or limited testing) it may be more
   tractable to sample a fixed sub-set of mutations. This step will give us a baseline for
   evaluating how single mutations at each site affect fitness of the entire genome.
3. Rank the single-step mutants by fitness. The resultant list is the reference ranking.
   Ties are broken by taking the average rank of the tied candidates.
4. Apply a mutation to the target locus and repeat step 2. This generates the set of all two-
   step mutants which include the target locus, so that we can evaluate how a mutation
   at that locus interacts with the existing mutations at all other loci.
5. Rank the two-step mutants by fitness. The resultant list is the target ranking. Ties
   are again broken by averaging across rank. This process creates a ranking which reflects
   the relative fitness of all two-step mutants where one of the mutations is at the target
   locus, so that we may compare it to the relative fitness of one-step mutants without
   the target locus.
6. Calculate Wilcoxon Signed-Rank Sum between the two rankings. Using the reference
   and target rankings as ranked lists paired by reference locus, calculate the Wilcoxon
   Signed-Rank Sum of the two lists. The Wilcoxon statistic, here designated ω, is used as
   the final measure of epistasis for the target locus, referred to as that locus’s epistatic
   load. This ω reflects the difference between the two ranked lists of genomes: one
   containing all single-step mutants except at a single site, and one containing all two-
   step mutants which include that site. The difference between how these genomes are
   ranked reflects how much the mutation at the target site perturbs the fitness effects of
                                             10


       other genes in the genotype.
    7. Repeat step 2-6 for all distinct possible mutations at the target locus. We can then
       average ω across all mutations to measure how heavily the target locus interacts with
       other loci in the genome.
      This procedure allows us to calculate the epistatic load at a single locus, without needing
to first make a baseline assumption about the type of interactions among loci. This per-
locus epistasis metric can be aggregated to measure epistasis across a whole organism (as is
common in other epistatic measurements (28; 31)), or for a single locus across a population
at a point in time.
2.2.2      Comparisons of Baseline Assumptions
      Current metrics for epistasis take a baseline assumption of either additivity or multi-
plicativity; however, choosing the wrong assumption will cause the metric to detect epistasis
where none exists. Further, in genomes with both additive and multiplicative components, it
is impossible to choose a correct metric as the assumption will be at least partially incorrect.
Therefore, we created a mixed additive and muliplicative model genome to evaluate rank
epistasis compared to traditional models with baseline assumptions.
      In this case, the genome was a vector of length n = 20 of integers between 1 and 4,
inclusive (see Fig 2.1). The first 10 integers are summed together, and the last 10 values
are multiplied together. The total fitness is the sum of these two values. None of these sites
interact epistatically; changing a single site does not affect the fitness contributions of any of
the other sites. However, under a baseline assumption that sites should be additive, muta-
tions in the sites which multiply together will appear to create a larger than expected fitness
effect. The reverse is true for a baseline assumption of multiplicativity when evaluating the
additive sites; they will appear to create a smaller than expected fitness effect. Therefore,
we expect traditional metrics which take a baseline for the entire genome to incorrectly iden-
                                                11


Figure 2.1: Schematic of additive and multiplicative genome for length n = 6. The first n/2
sites are summed and the last n/2 sites are multiplied, then the two halves are summed for
the final fitness. Image by Katie Gleason.
                                            12


tify positive epistasis (under an additive model) or negative epistasis (under a multiplicative
model). To determine whether rank epistasis successfully solves this problem, we compared
rank epistasis with the metric ϵ, which has both an additive and multiplicative form (109).
2.2.3      Identifying Degrees of Epistasis
     A good epistasis metric should be able to distinguish between different degrees of epis-
tasis. To confirm that rank epistasis can achieve this goal, we tested it on a landscape
where the true degree of interactivity at each site could be calculated. We used Kauffman
NK landscapes (50) for this purpose. NK landscapes are commonly used epistatic models
because they are tunably rugged : in NK landscapes, N refers to the number of sites in the
genome while K refers to the number of sites each site interacts with. Therefore, we can
tune how interactive (i.e. how epistatic) the genome is by changing the parameter K.
     We tested our metric on a canonical NK landscape, where all sites have equal K, and
on two variant NK landscapes, where K varied per site.
2.2.3.1    Canonical NK Landscapes
     The canonical landscape allows us to establish how the rank epistasis metric tracks
known degrees of interactivity on different genomes. Genomes in this case are bitstrings of
length N = 100. We then allowed the population to evolve on a given NK landscape for
t = 10, 000 generations and selected the most fit individual as a reference genome for the
rank epistasis algorithm. For our canonical NK landscape, we tested the metric on values
K = 0, 1, 2, 4, 8.
2.2.3.2    Variant NK Landscapes
     The variant landscapes allow us to establish how it tracks different degrees of interac-
tivity within the same genome. As with the canonical landscape, genomes were bitstrings of
lenth N = 100 which evolved for t = 10, 000 generations, and the most fit individual was se-
                                              13


Figure 2.2: Schematic for Half-and-Half landscape with first half K = 1, second half K = 2.
Filled boxes represent sites of the genome. Empty rectangles underneath are guides for which
sites are part of the same evaluation block. Image by Katie Gleason.
lected as the reference genome. Variant landscapes were tested on all pairwise combinations
of K = 0, 1, 2, 4, 8.
Variant 1: Half-and-Half For the first variant, which we call half-and-half, we evaluate
the first 50 sites on one NK landscape and the last 50 sites on a different landscape. Each
landscape is independently generated. This allows us to set a different K for each half of
the genome. At the boundary, evaluation wraps around to the genome half with the same
K value; e.g., if K = 2, site 49 interacts with sites 0 and 1, whereas site 100 interacts with
sites 50 and 51 (Fig 2.3.
Variant 2: Mixed For the second variant, which we call mixed, we evaluate odd-valued
sites on one landscape and even-valued sites on a second landscape. Each site still inter-
                                              14


Figure 2.3: Schematic for Mixed landscape with evens K = 1, odds K = 2. Lines indicate
how even-valued sites and odd-numbered sites are extracted to form a seperately evaluated
genomes. Filled boxes represent sites of the genome. Empty rectangles underneath are
guides for which sites are part of the same evaluation block. Image by Katie Gleason.
acts with the adjacent K sites, regardless of odd or even values. In this case, evaluation
wraps at the boundary of the genome rather than each landscape. As above, landscapes are
independently generated.
2.2.4     Data and Code Availability
     Experiments on the NK landscape were conducted using the Modular Agent-Based
Evolver framework (MABE2), an open-source modular digital evolution framework. The
source code for MABE2 is available at https://github.com/mercere99/MABE2. Experi-
ments for the additive and multiplicative comparison were performed in Python (v3.10.4).
     Statistical analysis and data visualization was performed in R (86) with the tidyverse
library (100). Implementation of paired Wilcoxon is from rstatix (49). All data is available
                                               15


at https://doi.org/10.5061/dryad.5dv41ns84. All code for data generation, analysis,
and visualization can be found at https://doi.org/10.5281/zenodo.6611759.
2.3       Results and Discussion
2.3.1      Rank epistasis correctly detects the absence of epistasis on
           a mixed additive/multiplicative landscape
     We find that rank epistasis correctly detects lack of epistasis on a deceptive additive
and multiplicative fitness landscape where other metrics fail (Fig 2.4).
     Here, the additive model detects positive epistasis, since some sites are actually multi-
plicative. Under its baseline assumption, the whole genome appears epistatic because the
average double-mutant has a much higher fitness than expected, due to the fact that some
mutations are multiplied rather than summed. On the other hand, the multiplicative model
detects negative epistasis, because some sites are actually additive. Under its baseline as-
sumption, double-mutants have much lower fitness than expected, since some mutations are
summed rather than multiplied.
     Rank epistasis, however, correctly detects no epistasis across the entire genome. Rank
epistasis does not assume additive or multiplicative interaction; it measures only relative
perturbations in the rankings between single and double mutants. Without baseline assump-
tions, the metric is able to correctly determine that no epistasis is present in the genome,
addressing a long-standing problem in detection of epistasis (20; 85).
2.3.2      Rank epistasis correlates with K on Canonical NK Land-
           scapes
     On these landscapes, every site in a particular genome has the same degree of interac-
tivity, so we expect roughly the same ω across sites. We also expect that identical landscapes
                                             16


   Epistasis Detected
                           0
                                                                                                                                                        Metric
                                                                                                                                                           additive
                         −20                                                                                                                               multiplicative
                                                                                                                                                           rank
                         −40
                                       0   1    2   3   4    5   6        7   8        9   10        11   12    13   14   15   16   17   18   19
                                                                              Genome Position
Figure 2.4: Comparison of rank epistasis metric to additive and multiplicative versions of
metric ϵ as described in (109). Each x-axis tick represents one position in the genome, i.e.,
the ϵ or ω detected for that site. Note that ϵ and ω are not usually directly comparable;
however, since ω = 0 and ϵ = 0 both indicate no epistasis, here ω is graphed on the same Y-
axis. Dashed line at Y=0 indicates the true level of interactivity in genome. Dots represent
mean and bars represent 95% CIs for 100 replicates.
                         3000
Epistasis Detected (ω)
                                                                                                                                                                  K
                         2000
                                                                                                                                                                       0
                                                                                                                                                                       1
                                                                                                                                                                       2
                                                                                                                                                                       4
                                                                                                                                                                       8
                         1000
                               0
                                   0           10       20           30           40            50             60         70        80             90
                                                                                       Genome Position
Figure 2.5: Rank epistasis metric ω on a canonical NK landscape. Each x-axis tick represents
one position in the genome, i.e., the ω detected for that site. Dots represent mean and bars
represent 95% CIs for 100 replicates.
                                                                                            17


with higher K will generate a higher ω. We find that the rank epistasis metric ω strongly
correlates with K when tested on traditional NK landscapes and correctly stratifies different
values of K when averaged across replicates (Fig. 2.5).
     Of note is that ω and K are not linearly correlated; for example, the average difference
in measured ω from K = 0 to K = 1 is much larger than that from K = 1 to K = 2. In
fact, as K increases, the difference in ω between consecutive values of K decreases.
     One possible explanation for this effect is that the number of peaks in an NK landscape,
and therefore the effect of perturbation on the genome, scales exponentially with K. In an
NK landscape, each group of K sites can take on 2K+1 possible fitness values. Therefore, for
an NK landscape with K = 2, each group of K sites has 8 possible fitness values, while each
group in a landscape where K = 4 has 32 possible fitness values. NK landscapes therefore
quickly reach a threshold where any mutation is likely to significantly change a genotype‘s
fitness; indeed, it has been shown that when K > 1 it is exponentially difficult to find even
local optima due to this effect (51). Since rank epistasis is a measure of how perturbation
affects the rank ordering of possible mutants, genomes on NK landscapes quickly reach
a point where they are close to maximally perturbed even at small K. This means the
detected rank epistasis for K = 4, for example, may not always be significantly different
than for K = 8 (see Fig 2.5). Despite these limitations, on average rank epistasis is still able
to distinguish low interactivity from high.
2.3.3      Disentangling K on Variant NK Landscapes
     Rank epistasis successfully detects varying levels of per-site epistasis within the same
genome on both variant landscapes, though the value of ω changes across genomes. In gen-
eral, we find that rank epistasis on these variant landscapes produces a measure of apparent
epistasis.
                                               18


                                              0                              1                              2                              4                              8
                         3000
                         2000                                                                                                                                                              0
                         1000
                            0
                         3000
                         2000                                                                                                                                                              1
                         1000
Epistasis Detected (ω)
                            0
                                                                                                                                                                                               Positional K
                         3000
                                                                                                                                                                                                  0
                         2000                                                                                                                                                              2      1
                                                                                                                                                                                                  2
                         1000
                                                                                                                                                                                                  4
                            0
                                                                                                                                                                                                  8
                         3000
                         2000                                                                                                                                                              4
                         1000
                            0
                         3000
                         2000                                                                                                                                                              8
                         1000
                            0
                                0 10 20 30 40 50 60 70 80 90   0 10 20 30 40 50 60 70 80 90   0 10 20 30 40 50 60 70 80 90   0 10 20 30 40 50 60 70 80 90   0 10 20 30 40 50 60 70 80 90
                                                                                               Genome Position
Figure 2.6: Rank epistasis metric ω on a Half-and-Half NK landscape. Top bars indicate
the K value set for the first N = 50 values. Right bars indicate the K value for the second
N = 50 values. Each x-axis tick within subplots represents one position in the genome,
i.e., the ω detected for that site. Dots represent mean and bars represent 95% CIs for 100
replicates.
                                                                                                                  19


2.3.3.1     Rank epistasis measures different epistatic levels on Half-and-Half land-
            scape
     In the Half-and-Half landscape, there is clear separation of the measured ω between the
two “halves” with different values of K. However, this ω is not consistently linked to values
of K across different landscapes. For example, when K = 4 is the higher value of K in the
landscape, the average ω for K = 4 is around 3000. In contrast, when K = 4 is paired with
K = 8 the ω for sites where K = 4 is closer to 2000. However, it is still lower than the
detected ω for sites where K = 8. This result reinforces that ω represents a relative epistatic
value, rather than an absolute one.
     Additionally, on both halves of the landscape, ω appears unexpectedly lower at the
boundary between halves, even for a low K transitioning to higher K. One explanation for
this effect could be that sites at the boundary are “caught” between landscapes; since they are
evolving across independent landscapes, it is difficult to find local optima on both landscapes
at once. Therefore, the genomes may evolve to be more robust to mutation around these
sites, finding lower fitness regions where mutations do not cause high perturbation of fitness.
Thus, ω measures lower than surrounding sites. This phenomenon, termed “survival of the
flattest”, is well-documented in the literature (103; 31). This observation suggests that rank
epistasis may identify interesting features of genetic architecture beyond interactivity, such
as evolved mutational robustness.
2.3.3.2     Rank epistasis measures apparent epistasis on Mixed landscape
     In the Mixed landscape, separation of K values is less clear. As the higher K increases,
the difference in ω between lower and higher K values generally decreases; one exception
is in the comparison of the mixed K = 0/K = 1 landscape vs. K = 0/K = 2 landscape
(Fig 2.7).
     It is likely that the overall effect of reduced separation of ω is due to a similar effect
as described in the results for the canonical landscape; since evaluation takes place across
                                                20


                                              0                              1                              2                              4                              8
                         3000
                         2000
                                                                                                                                                                                           0
                         1000
                            0
                         3000
                         2000
                                                                                                                                                                                           1
                         1000
Epistasis Detected (ω)
                            0
                                                                                                                                                                                               Positional K
                         3000
                                                                                                                                                                                                  0
                         2000
                                                                                                                                                                                           2      1
                         1000
                                                                                                                                                                                                  2
                                                                                                                                                                                                  4
                            0
                                                                                                                                                                                                  8
                         3000
                         2000
                                                                                                                                                                                           4
                         1000
                            0
                         3000
                         2000
                                                                                                                                                                                           8
                         1000
                            0
                                0 10 20 30 40 50 60 70 80 90   0 10 20 30 40 50 60 70 80 90   0 10 20 30 40 50 60 70 80 90   0 10 20 30 40 50 60 70 80 90   0 10 20 30 40 50 60 70 80 90
                                                                                               Genome Position
Figure 2.7: Rank epistasis metric ω on a mixed NK landscape. Top bars indicate the K
value set for even-valued. Right bars indicate the K value for odd-valued sites. Each x-axis
tick within subplots represents one position in the genome, i.e., the ω detected for that site.
Dots represent mean and bars represent 95% CIs for 100 replicates.
                                                                                                                  21


multiple landscapes, the highly rugged landscapes for large K dominates the level of pertur-
bation detected. In fact, ω here can be seen as a measure of the “true” K for each site, as
odd- and even-valued sites are evaluated on landscapes of different values for K but interact
with the same number of adjacent sites due to overlap.
     This measurement of “true K” also explains the behavior in the landscapes with K =
0/K = 1 compared to K = 0/K = 2. In the former case, each site where K = 1 directly
interacts with a site where K = 0, bringing the true interactivity of every site to K = 1.
In the latter case, evaluating every other site with K = 2 causes the K = 0 sites to fall
completely inside the evaluation block, and interact with only one of the K = 2 sites at a
time. In higher K the landscape staggering does not match the size of the evaluation block
and the effect is lost.
2.4       Conclusion
     Rank epistasis is a promising new metric to detect interactivity at specific sites in a
genome without the need for a baseline assumption of additivity or multiplicativity; there-
fore, unlike existing metrics, there is no risk of selecting the wrong baseline assumption and
falsely detecting epistatic activity where there is none. Additionally, rank epistasis is able
to accurately detect a lack of epistasis when both additive and multiplicative baseline in-
teractions are present, which existing metrics are unable to account for. The metric is also
able to reveal areas of high mutational robustness within a genome, and relative differences
in the epistasis at each site in the genome.
     One major difference between rank epistasis and existing metrics is that it does not
give an indication of positive or negative epistasis, only the degree of epistasis and how
much perturbation is present in the ordering of possible mutants. Other metrics may be
more appropriate if the sign of epistasis is of particular interest; however, if detection of site
interaction and the overall density of the mutational landscape is the primary goal, rank
                                                22


epistasis can be a strong choice of metric. Additionally, it can be used to supplement sign
epistasis metrics to ensure the measured epistasis is not an artefact of baseline assumptions.
     Here we do not provide biological applications for rank epistasis as it is beyond the
scope of this paper. However, this metric could be easily applied to any sufficiently complete
biological dataset in the same way it is applied here to computational ones. It is our hope
that using rank epistasis in this way to detect genome connectivity in biological systems
will lead to a fuller understanding of the complex ways in which genome structure dictates
function.
2.5      Author Contribution Statement
     All authors drafted and edited this paper. ALA wrote the code, conducted the experi-
ments, and analyzed and interpreted the results. ELD and CO provided theoretical guidance.
CO conceived of the original idea for the study.
2.6      Acknowledgements
     The authors thank Christoph Adami, Wolfgang Banzhaf, Austin Ferguson, and Connor
Grady for assistance on the design of this study. The authors also thank artist Kathleen
(Katie) Gleason for her contributions to the figures. This material is based in part upon
work supported by the National Science Foundation under Cooperative Agreement No. DBI-
0939454 and by a National Science Foundation Graduate Research Fellowship to ALA. Any
opinions, findings, and conclusions or recommendations expressed in this material are those
of the author(s) and do not necessarily reflect the views of the National Science Founda-
tion. This work was conducted on the ancestral, traditional and contemporary lands of the
Anishinaabeg – Three Fires Confederacy of Ojibwe, Odawa and Potawatomi peoples.
                                             23


Chapter 3
The comparative hybrid approach to
investigate cognition across substrates
Authors: Sarah Albani*, Acacia L. Ackles, Charles Ofria, and Clifford Bohm
This chapter is adapted from (7), which underwent peer review and appeared in the pro-
ceedings of the 2021 Artificial Life Conference.
* Undergraduate Author
3.1      Introduction
     The concept of cognition is a subject of study in both neuroscience and artificial intel-
ligence. Studying cognition can lead to a greater theoretical understanding of what allows
cognitive processes to function, as well as an understanding of how to harness these processes
for problem-solving. Much of machine learning was inspired by the effective mechanisms
found in natural systems, which may explain many of their functional similarities. However,
while natural brains are effective at producing robust forms of general cognition, human-
designed digital structures have not yet demonstrated general intelligence. On the one hand,
it may be that we have not gained a sufficient understanding of biological intelligence to im-
plement it in silico. On the other hand, perhaps modern computer systems simply lack some
                                               24


fundamental properties (such as massive parallelism) which make a direct implementation of
a biological algorithm for intelligence infeasible. There are a number of disparate approaches
to AI based different understandings of biological systems (e.g. ANN (89), HTM (41), spik-
ing neural nets (36)) or built up from mathematical or logical foundations (e.g. Avida (76),
Markov Brains (42), Signal GP (59), Tangled Program Graphs (52)). A real problem in the
field is an inability to directly compare these various systems and to draw concussions across
the various approaches. This makes can make it difficult apply discoveries in one area of AI
research to others and to relate research results from AI and neuroscience. Therefore, from
an Artificial Life perspective, we want to expand beyond biomimetic approaches to produc-
ing machine intelligence and move towards a more holistic view that includes the underlying
evolutionary processes that produced general cognition in nature.
      Evolution took billions of years to produce agents that engage in intelligent behavior;
we would prefer not to wait that long. While evolution in natural systems is driven by an
underlying stochastic process, we are not restricted by the rate of random occurrences of
beneficial mutations. We can take a more directed approach by systematically studying the
evolution of cognition in digital systems. Ideally, we want to provide hands-on guidance
to produce AI in a much shorter time, while simultaneously harnessing the creative and
constructive potential of evolution to allow our digital systems to reach their full utility.
To bootstrap this process, we must conduct a systematic study of potential evolutionary
building blocks and underlying representations that are both computationally efficient and
effective for evolving cognition.
      Previous work has often compared whole neural architectures’ performance on a par-
ticular task of interest. For example, algorithm performance has been studied in detail on
tasks such as general classification (104; 91; 9), text recognition (54), disease prediction
(94), and ecological modeling (21), to name a few. However, due to the numerous details
related to design and parameterization of any particular structure, performance comparisons
can not necessarily reveal why one structure outperforms another, nor investigate how their
                                                25


component parts may account for the results.
     Some work, particularly in algorithm optimization, has dealt with combining compo-
nents of multiple structures to create efficient computational hybrids. One such technique
is the “Buffet Method”, which allows a computational structure to access an array of dif-
ferent types of information processing sub-structures during the evolutionary process(43)
and shows that the proportion of sub-structures used in the evolved solutions is task de-
pendent. Another method known as “Auto ML” is designed to evolve a substrate from
basic mathematical principles (87). While these methods have been successful in optimizing
task performance, they rarely look “under the hood” at the exact variables responsible for
cognitive success.
     Our goal here is to demonstrate an approach for comparing cognitive substrates that
allows us to identify which aspects of each substrate confer strengths in evolving solutions
to cognitive processing tasks. Here, “substrate” refers to the virtual hardware underlying
each digital representation—that is, the stuff from which each model of cognition is built.
This concept of comparing two brains by testing hybrids based on these differences is what
we have termed the Comparative Hybrid Approach. In this approach, we first identify the
minimal number of differences between each substrate and then develop hybrid versions
which represent intermediate forms. These are evaluated on different cognitive tasks in
order to determine if some of the identified differences can account for performance variance.
While our current focus is on only two such systems, a wider application of this approach
will allow us to isolate properties of different types of components across more substrates
and ultimately provide clearer guidance for designing new types of evolvable cognition.
     This method is similar to knock out experiments in biology, where some aspect of a
system is disabled and tests are conducted to identify changes to the system’s response as a
whole.
     The conceptual backing for this method could be extended to more complex systems,
from comparisons between more divergent digital structures to, simulations of accurate
                                               26


biologically-based models, or even actual biological systems (although this would require
the ability to manipulate biological structures in a controlled manner).
     In this work we compare Markov Brains and Recurrent Artificial Neural Networks
(RNNs). In particular we identify differences in the logic that each brain has access to,
how the logic units are connected (sparsity), and how memories are stored (discretization).
We find that, while we observed performance differences on each axis of change, discretization
in particular was the strongest indicator of task performance.
3.2       Methods
3.2.1      MABE: Modular Agent-Based Evolver
     For the experiments in this work we used MABE, the Modular Agent Based Evolver
framework (12). We have configured MABE so that agents have a genome (a collection of
heritable and mutable data) and a brain (a computational structure that receives input and
generates output) specified by the genome. MABE manages the evolution of populations
of agents by repeatedly evaluating agents in worlds (a fitness-bearing task) and using the
results to select parents whose mutated offspring make up future generations. The advantage
of the MABE framework is that the brains and worlds are completely modularized and
therefore can be easily exchanged without altering other system properties, allowing for fast,
direct comparisons. This “plug-and-play” feature makes MABE well suited to not only our
current cross-brain and cross-world examination, but also to the proposed future extensions
of this work.
3.2.2      Brains: RNNs and Markov Brains
     In MABE, a brain is a process that converts a list of values T0 to a new list of values
T1 (Fig 3.1A). We define the T0 values as inputs (i.e. sensor readings or the state of the
task) combined with the brain’s memory and we define the T1 values as outputs (i.e. values
                                              27


Figure 3.1: Schematics showing structures of A) a generic brain, B) an Recurrent Artificial
Neural Network (RNN) and C) a Markov Brain.
that determine behavior) combined with new memory values (which will be provided to the
brain on the next update). Each time we want the brain to act (i.e. ask the brain “what
will you do now?”) we set the T0 values, allow the brain to run its internal process, and read
the resulting T1 values.
     For this work, we compare two well-studied computational structures: RNNs and
Markov Brains. These structures can be implemented in a number of ways; we describe
here our particular implementations. It is important to note that these particular imple-
mentations are designed not for speed of computation, but for ease of structural analysis.
     In all of the experiments described in this work, both the RNNs and Markov Brains
have eight memory values.
3.2.2.1    Recurrent Artificial Neural Networks:
     RNNs are a well-studied digital neural architecture; see (108) for a review focused on
RNNs and learning. Typically, RNNs are used in consort with back propagation or other
machine learning methods; here, we are instead using neuroevolution. In our particular
implementation, the internal structure of the RNN (Fig 3.1B) is a list of nodes, with one
node for each T1 value. Each node has a bias, and a weight for each T0 value. When the
RNN updates, each node adds its bias to the summation of the product of each T0 value and
its node-specific weight for that value, and applies tanh (a sigmoid function) to the result
                                             28


(which maps the value to the continuous range [−1, 1]). The resulting values are assigned
to their respective output and memory values. The genome is used to determine the node
weights and biases via a direct encoding.
3.2.2.2    Markov Brains:
      Markov Brains are a relatively new computational structure that have shown promise
in studying evolution of learning (42; 26; 90). The internal structure in the Markov Brains
(fig:3.1C) consists of wires and gates. Markov Brains have been studied with a number of
gate types, but here we used gates with binary lookup tables that have from 2 to 4 in-wires
and 2 to 4 out-wires. In-wires connect T0 values to gates and out-wires connect gates to
T1 values. When the Markov Brain updates, each gate uses its lookup table to convert
input wire values to output values. Each T1 value is the sum of the values provided by the
output wires connected to it. An indirect encoding method is used to convert genome values
into gates. Gates are defined by a “start codon” (a specific set of genome values) with the
following values determining number of inputs, number of outputs, the wiring of inputs and
outputs, and the lookup table.
3.2.2.3    Comparison of Brains
      Using the definition of the two brains, Markov Brains and RNNs, we can, in three steps,
alter the description of the Markov Brain to that of the RNN, or visa versa.
      Two of the alterations involve the internal processing and wiring of the brain. To alter
the definition of a Markov Brain to that of an RNN we would first change the data processing
unit from binary lookup table gates to summation and threshold nodes. Then, we would
trade out the sparse wiring of the Markov brain for the fixed fully connected weighted wiring
of the RNN. This identifies two obvious differences inherent to the internal architectures:
the method of data processing and sparse vs dense connectivity.
      The third required change relates not strictly to the internal architecture, but to the
                                               29


Figure 3.2: Schematic of hybrids. Orange (top) are Markov-based; purple (bottom) are
RNN-based. Arrows indicate a single change away from the canonical brain structure.
potential range of memory values. The binary lookup table gates in Markov brains are
limited to binary values, while RNNs are able to operate on continuous value ranges. As
a result, the 8 binary (i.e. discrete) memory values in the Markov Brain represent a much
smaller set of possible states than the 8 continuous range memory values in the RNN. In fact,
the third change—changing memory from discrete to continuous—is implicit and automatic,
as the summation and threshold nodes operate on continuous value ranges.
3.2.2.4    Hybrid Brains
     We implemented a number of features in RNNs and Markov Brains to allow us to create
“hybrid” brains based on each brain type (see Fig 3.2). These hybrids are representative of
the architectural changes described above which theoretically transform one brain type into
the other, and allow us to make those changes in a step-wise fashion, one change at a time.
     For Markov brains, we implemented an RNN gate that can stand in place of the canonical
lookup table gate. These gates have the same node summation and threshold properties as
the RNNs. The wiring of the RNN gates is sparse and determined by the same genetic
encoding as the lookup table gates with the exception that these gates have from 1 to 8
inputs and a single output. This replacement constitutes the change to the logical operator
described above. To approximate the difference in the range of memory values, we added
a setting so that memory is either bitted (i.e. mapped to binary such that values of 0 or
                                                30


less are set to 0 and values greater then 0 are set to 1) as in canonical Markov brains, or
continuous as in canonical RNNs. These changes allow us to test three Markov-based brain
variants: a canonical Markov Brain, a Markov-RNN hybrid with discrete memory (i.e. RNN
gate with bitted memory), and a Markov-RNN hybrid with continuous memory (i.e. RNN
gate without bitted memory).
     In RNN brains, we directly varied the sparsity of their wiring and the discretization of
memory. To implement sparsity, we biased the genome conversion so that wires with weight
0 were as common as wires with non-zero values. This meant that the inital population of
randomly generated sparse brains would have on average half as many active connections as
fully connected brains. As this ratio was only established at initialization, and not enforced
thereafter, it could change as a result of evolution. To implement the discretization of
memory, we added a bitting setting as in the Markov Brains. We tested four RNN variants: a
canonical RNN, a sparse RNN (i.e. biased weights), a discretized RNN (i.e. bitted memory),
and a sparse and discretized RNN (i.e. both biased weights and bitted memory).
3.2.3     Worlds
     Since our work is motivated by understanding learning in the context of computational
architectures, we tested each of our hybrid brains on three learning-based tasks with diverse
characteristics. One task dealt primarily with short term memory (NBack), the second with
simple information integration and lifetime memory (PathFollow), and the third with more
complex information integration and lifetime memory (BlockCatch). We provide an in-depth
analysis of the required logic and information integration required for each task and thoughts
about how this relates to the results in the Discussion section titled Information Integration.
3.2.3.1    NBack World
     NBack world (Fig 3.3) presents a simple short-term memory task. In this task, agents
are presented a bit stream, one bit at a time, and must respond with the last 8 bits they
                                              31


Figure 3.3: Depictions of the worlds on which the brains were evolved. (A) NBack. Visual-
ization of the data flow required to perform the NBack 8 task of recalling the previous 8 given
bits. Here i represents input, m represents memory, and o1 through o8 represent outputs.
(B) PathFollow. Example map from the PathFollow task. Agents start on the green start
square facing in the direction of the path. Black indicates path locations with food, where
the agent must move forward and which the agent is rewarded for visiting. Yellow indicates
locations with randomized turn signals which the agent must associate with either left of
right turn. Red indicates the end of the path. Agents that reach the end of the path receive
an additional reward if they have visited all ”food” locations. (C) BlockCatch. Spatial
layout of the BlockCatch task. Blue indicates the agent; dark blue indicates the location
of the sensors. Bright yellow indicates the area visible to the sensors, while darker yellow
indicates area that is not visible in the agent’s current location. Green (blank) and red
(hatched) depict blocks that should be caught or missed and arrows indicate possible lateral
motion of blocks as they fall.
                                               32


were provided. We tested each agent 10 times with 18 inputs presented per test. Only the
outputs associated with the last 10 inputs were evaluated. Agents received a score based
on the ratio of the correct outputs to the total number of outputs. NBack is based on
a common cognitive task in neuropsychology to test working memory via neuroimaging,
though its efficacy in human subjects is under review (81; 70; 45).
3.2.3.2    PathFollow World
     In PathFollow world (Fig 3.3B), agents must learn to respond to a variable environmen-
tal clue. Agents exist on a grid and are able to sense symbols only on the space they are
currently occupying. Agents must use those symbols to navigate a path defined by food and
randomized turn signals. Agents can move forward and backwards, and turn left and right
45 degrees. Each location in the world contains a symbol that marks that location either as
start, end, empty, food, turn signal 0, or turn signal 1. Agents start on the start location
oriented towards the next step on the path. Visting a food location provides reward (used
to determine reproductive success) and indicates a continuation of the path. If the agent
steps off the path they receive a small penalty. The two turn signals indicate turns in the
path, but these signals’ meaning are reset every time an agent starts a new path. That is,
signal 0 may indicate a left turn or a right turn, and signal 1 will indicate the opposite of
the direction of signal 0. In order to perform well in Path Follow world, agents must form
an association between the turn signals and their meaning by trial and error, and then must
remember these associations for the remainder of that path. Agents are scored on their
ability to reach the end of the path while visiting all locations and minimizing time off the
path. PathFollow world is a simplified version of the task presented in (84); in the previous
work, the turn signals were not restricted to a binary choice.
                                              33


3.2.3.3    BlockCatch World
      BlockCatch world (Fig 3.3C) is a task that requires both lifetime memory and complex
information integration. Agents are rewarded based on whether they catch or avoid a block
that falls towards them (63). The world is a 20 × 20 grid, and agents are paddles that move
left and right at the bottom of the world. Agents are 6 units wide and have 4 sensors that
look up and can detect if there is an object directly above them, but not the distance to the
object. The sensors are positioned such that the agent has a 2 unit wide blind spot over its
center. Blocks of either size 2 or 3 begin at the top of the world and fall toward the agent
one at a time. The blocks fall by moving horizontally one space, either left or right, and
vertically down one space at a time. If a block is size 2 the agent should ”catch” the block
(i.e. be positioned such that block intersects some part of the agent when the block has fallen
to the level of the agent), and if a block is size 3 the agent should ”miss” the block. Agents
must integrate sensor input over time both to determine the block shape (to decide whether
to catch or avoid) and the direction the block is moving (to correctly position themselves).
They therefore must also remember the block size and direction once they have detected
these features and behave accordingly. Agents are tested on every combination of block size
and direction and receive a score that is the ratio of correct results over all tests.
3.2.3.4    Experiment Details
      Each described brain variant was evolved on each described world 100 times (i.e. 100
replicates of each experiment) for 200,000 generations. In all cases, we used populations of
100 agents and tournament selection (size 5). Agents’ genomes were vectors of integer values
in the range [0,255] with initial length 5,000. On reproduction, offsprings’ genomes could be
altered by insertion and deletion mutations of size 128 to 512 at a per site rate of 0.00002
and point mutations at a per site rate of 0.005. Genomes could not mutate to be shorter
then 2,000 sites or longer than 20,000 sites.
      Markov Brain genomes were seeded in the first generation of each experiment with six
                                                34


gate “start codons” for either lookup table gates or RNN gates depending on the brain
variant.
     All source code for the project, including analysis and visualization scripts, can be found
at https://doi.org/10.5281/zenodo.4765681.
3.3       Results and Discussion
     We present here results from an application of the Comparative Hybrid Approach where
we compare Markov Brains and RNNs as a demonstration of the method’s qualitative an-
alytical power. By identifying architectural differences between brain types and creating
intermediate “hybrids” of the two computational structures, we gain the ability to analyze
how performance varied across hybrid structure and task. We are then able to examine
whether our assumptions about the architectural differences separating the substrates map
to the quantitative difference separating their performance.
3.3.1      Primary Results Analysis
     We saw that Markov brains generally suffered significant performance loss when their
logic lookup table gates were replaced with RNN-like threshold gates with continuous values.
However, performance was stable or even increased when those threshold gates were used in
consort with discrete memory. This indicates that discretization, rather than the logic lookup
table structure, accounts, at least in part, for the high performance of Markov Brains on
the tasks we investigated. The results are not as clear when we consider the RNN variants.
While we can clearly see that the RNN variant which is both sparse and discretized out
perform canonical RNNs in all three tasks, changes in independent task performance do not
clearly indicate whether it is sparsity or discretization that is the primary factor—in fact,
the results indicate that the benefits conferred by each feature are task dependent.
     Overall, from these results, we tentatively conclude that discretizing outputs results in
                                               35


                                                                  NBack
               1.0                                                         ●
                                                                                                              ●
                                                                                                              ●
                                                                                                              ●
                                                                           ●                                  ●
                                                                                                              ●
                                                                                                              ●
                                                                                                              ●
                                                                                                              ●
                                                                                                              ●
                                                                                       ●
                                            ●
                                            ●                                                                 ●
                              ●             ●                                          ●
                                                            ●              ●                                  ●
                                                            ●                          ●
                                                            ●                                                 ●
                                                            ●              ●
                                                                                                              ●
                                                            ●                                                 ●
               0.9                                          ●
                                                            ●
                                                                                                              ●
                                                            ●
                                                            ●
                                                            ●
                                                            ●
                                            ●
                                                            ●
                                                            ●
               0.8
                                                                                                      ●
                                                            ●
               0.7
                                                                PathFollow
              0.75                                                                                            ●
                                                                                                              ●
                                                                                                              ●
Final Score
                                                                                                              ●
                                                                                                              ●
              0.50                                                                                            ●
                                                                                                              ●
                                                                                                              ●
                                                                                                              ●
                                                                                                              ●
                                                                                                              ●
                                                                                                              ●
                                                                                                              ●
                                                                                                              ●
                                                                                                              ●
                                                                                                              ●
                                                                                                              ●
              0.25
                                                                                                              ●
                                                                                                              ●
              0.00
                                                            BlockCatch
               1.0
                              ●             ●               ●              ●
                              ●                             ●              ●
                              ●                             ●
                              ●                             ●              ●
                              ●                             ●              ●
                                                            ●
               0.9                                                         ●
                                                                           ●
                                                            ●
               0.8                                                                                    ●
                                                                                                      ●
                                                                                                      ●
                         kov
                                           d
                                          tte           N
                                                         N               d
                                                                       ize          tiz              se   N
                                                                                                              N
                     ar               bi             R                                 ed        ar       R
                                                                  re t          re              sp
                     M                N         kov             di             sc
                                  N             ar                sc           di           N
                                  R                                                         N
                         ko                 M             se               N            R
                           v                            ar              N
                         ar                          sp                R
                     M                              N
                                                N
                                                R
                                            Computational Structure
Figure 3.4: Performance of computational structures across tasks. Color hue indicates the
brain variant used; color saturation indicates distance in number of changes from canonical
structures, with more distant hybrids being less saturated. Note the varied scales on the
y-axis, as we are interested here only in comparative performance within worlds rather than
across worlds.
                                                                                                                  36


                                                         Significance: NBack
                                   RNN
            Markov      Markov                  RNN         RNN
   A                              sparse/                             RNN
         RNN bitted      RNN                 discretized   sparse
                                discretized
                                                 ***         ***
 Markov      1.000       0.943     1.000                             0.999
                                               <0.001       0.002
                                                 ***         ***
 Markov RNN bitted       0.971     1.000                             1.000
                                               <0.001       0.003
                                                 ***
 Markov RNN                        0.957                     0.06    0.998
                                               <0.001
                                                 ***          **
 RNN sparse/discretized                                              1.000
                                               <0.001       0.002
                                                             ***      ***
 RNN discretized
                                                           <0.001    <0.001
                                                                       **
 RNN sparse
                                                                      0.01
                                                      Significance: PathFollow
                                   RNN
            Markov      Markov                  RNN         RNN
   B                              sparse/                             RNN
         RNN bitted      RNN                 discretized   sparse
                                discretized
                          ***       ***          ***         ***      ***
 Markov      0.557
                        <0.001    <0.001       <0.001      <0.001    <0.001
                          ***       ***          ***         ***      ***
 Markov RNN bitted
                        <0.001    <0.001       <0.001      <0.001    <0.001
                                    ***           *          ***      ***
 Markov RNN
                                  <0.001        0.045      <0.001    <0.001
                                                 ***         ***      ***
 RNN sparse/discretized
                                               <0.001      <0.001    <0.001
                                                             ***      ***
 RNN discretized
                                                           <0.001    <0.001
                                                                       **
 RNN sparse
                                                                     0.002
                                                     Significance: BlockCatch
                                    RNN
            Markov      Markov                    RNN          RNN
   C                               sparse/                            RNN
          RNN bitted      RNN                  discretized    sparse
                                 discretized
                                                                 *
 Markov       1.000       1.000     1.000         1.000               0.087
                                                               0.029
                                                                 *
 Markov RNN bitted        0.999     1.000         1.000               0.057
                                                               0.017
 Markov RNN                         1.000         1.000        0.055  0.146
                                                                 *
 RNN sparse/discretized                           1.000               0.090
                                                               0.029
                                                                 *
 RNN discretized                                                      0.051
                                                               0.015
 RNN sparse                                                           1.000
Table 3.1: Differences (p-value) between mean performance on tasks (A) NBack, (B) Path-
Follow, and (C) BlockCatch. Green cells indicate significant difference. P-values corrected
using Tukey method for a family of 7 estimates. * = p ≤ 0.05; ** = p ≤ 0.01; ***
= p ≤ 0.001.
                                                                           37


generally higher performance than biasing towards network sparsity. However, we note that
all versions of Markov Brains tend to have higher sparsity compared to all versions of RNN,
and so we can not discount that sparsity plays a significant role in explaining Markov Brains’
generally higher performance.
3.3.2      Cross-Brain Examination: Discretization
3.3.2.1    Discretization Helps Performance on BlockCatch and PathFollow
     In two of the tasks, BlockCatch and PathFollow, discretizing the RNN memory resulted
in significant performance gains, while sparsity of the network had little effect. In addition,
the effect of combining sparsity and discretization resulted in performance that was lower
than that of the RNNs that were only discretized. In all cases, hybrid Markov Brains (i.e.
with RNN gates) display higher performance when discretized (i.e. bitted) in these same
tasks. Moreover, we were surprised to see that in PathFollow, the Markov ANN bitted
hybrids outperform the canonical Markov Brains.
3.3.2.2    Discretization or Sparsity Alone Hinder, But Together Help in RNN
           Hybrids Performance on NBack
     The advantages of discretization do not carry across all types of tasks as the results
from NBack attest. The RNN NBack results are clearly different than the other two tasks in
multiple ways; we see a performance gain only in the combined sparse-discretized network
where as discretized-only and sparse-only RNN hybrids experienced performance loss.
3.3.3      Other Factors To Consider
     Based on the contradictory results observed in relation to the NBack task, we turn to
the examination of other specific brain features and task details for insights that might help
to explain these results. The following are some conjectures on possible explanations for our
                                              38


observed results, and should be seen as theoretical.
3.3.3.1    Potential Influence of Encoding
     It is possible that only considering the difference between the RNN and Markov Brain
architectures is insufficient to explain the differences in performance. Even though both
structures are built from the same type of genomes with the same mutational properties, the
process which is used to map the genomes to the two types of brains is quite different. RNNs
use a direct encoding method where a contiguous region at the beginning of the genome is
read and each position in the genome relates to a particular feature in the brain (a bias or a
weight). In contrast, Markov Brains rely on biologically inspired “start codons” that mark
coding regions. The RNN encoding means that the number of coding sites in the genome is
constant, but because the number of start codons in a Markov Brains genome can vary, the
number of coding sites can also vary. This results in a fixed number of effective mutations
in RNNs but a variable number of effective mutations in Markov Brains. Because of these
differences in encoding, the ways that brains are affected by mutation and the resulting
evolutionary pathways may be very different. Further consideration of this is outside of the
scope of this work, but we stress that the encoding methods could account for differences
in observed performance and that future work should consider comparisons of encoding
methods.
3.3.4      Information Integration
     One factor that we believe could be critical to understanding these results and the re-
sults of our proposed method in general relate to the cognitive and memory requirements
of the tasks being used. In two of the tasks, BlockCatch and PathFollow, we saw a gradi-
ent of performance across hybrids as we generally expected. The alterations of the RNNs
caused them to perform more similarly to the Markov Brains (high performance), and the
alterations of the Markov Brains caused them to perform more similarly to the RNNs (lower
                                              39


performance). However, this was not the case in the NBack task.
     If we consider all three tasks from the standpoint of the cognitive abilities that each task
requires, we can see that NBack only requires simple feed forward memory, while the other
two tasks require long term memory and some level of information integration. Information
integration is the ability to arrive at an answer that requires more than simple memory.
     NBack can be solved by an agent that simply stores the incoming bit string in a “delay”
buffer and delivers these stored values to the correct outputs over time. Each output relies
only on a single prior input.
     In PathFollow, agents must be able to integrate information over time. When the agent
starts on a new path, they do not yet know which turn signal is associated with which turn.
They must use trial and error to establish if they are in a “0 means left and 1 means right”
path or a “1 means left and 0 means right” path. This requires that they make a guess
when they arrive at the first turn signal. If they stay on the path then they “know” that
they guessed correctly, but if they step off the path then they must not only remember that
they guessed incorrectly, but also must navigate back onto the path. During this process
(of getting back onto the path) their sensors will not provide them with any useful input,
so they must remember what part of the path recovery process they are executing. Finally,
integration over sensors is needed. In order to detect a turn signal, agents must look at at
least two inputs: the “turn signal” input, and either the “on turn” input or both the “on
forward” and “off path inputs”.
     Finally, Block Catch requires that agents identify the size and direction of falling blocks.
The identification of the left/right direction requires information integration (i.e. at least
the integration of sensor states from consecutive time steps), while the identification of block
size can be achieved by integration over time and sensor space or just over time.
     From this it follows that the optimal logic and wiring for NBack should be simple and
sparse. In fact there is no advantage in NBack to being able to integrate or even alter the
input values. Arguably, a “pass though” gate which simply moves a single value from T to
                                                40


T+1 should be optimal. It is unclear why canonical RNNs perform well on NBack versus the
discretized-only and sparse-only hybrids but we conjecture that these intermediate hybrids
resulted in less navigable fitness landscapes.
     PathFollow and Block Catch both require more complex models of memory and more
complex methods of manipulating data than NBack. We believe that this accounts for the
primary variance between the results of NBack and the results of PathFollow and Block
Catch.
     While this study was not designed to detect how information integration may impact
optimal cognitive structures, our results indicate that such integration should be considered
in future work.
3.4      Conclusion
     We present these results as an illustrative example of the Comparative Hybrid Method
and how it can be applied to analyze key components of digital brain architectures. By
creating hybrids of existing examples of neuro architectures, we are able to test computa-
tional structures in a piece wise manner on a collection of tasks of varying complexity. The
comparative performance of the unmodified and hybrid brains can be used to make infer-
ences about how computational components and structure may relate to task success and
ultimately to understanding fundamentally why the unmodified brains behave differently.
     In practice, this technique can be extended to any two or more structures whose per-
formance you are interested in comparing. All that is required is an intuition for what
architectural differences separate the two computational structures, and an implementation
of hybridized versions altering each of those components in turn. While such a task may
be in some cases practically unwieldy, it will in many cases be simpler than the alterna-
tive of analytically examining the entire brain architecture at once and altering all possible
parameters in order to identify key components.
                                               41


     In theory, the effects of the comparative hybrid model could extend into biological
systems. As the ability to accurately simulate neuron models progresses, our method could
be used across a continuum of neuron models and with the right advances in bio-engineering,
the biological brain itself. Ultimately we believe that the Comparative Hybrid Method will
allow us to single out key variables of cognition that could provide a path to a better
understanding of intelligence, whether it be biological or computational.
3.5      Acknowledgements
     This material is based in part upon work supported by a National Science Foundation
Graduate Research Fellowship to A.A. and by the National Science Foundation under Coop-
erative Agreement No. DBI-0939454. Any opinions, findings, and conclusions or recommen-
dations expressed in this material are those of the author(s) and do not necessarily reflect the
views of the National Science Foundation. Michigan State University provided computational
resources through the Institute for Cyber-Enabled Research. Michigan State University oc-
cupies the ancestral, traditional, and contemporary Lands of the Anishinaabeg–Three Fires
Confederacy of Ojibwe, Odawa, and Potawatomi peoples. The University resides on Land
ceded in the 1819 Treaty of Saginaw.
3.6      Author Contributions
     C.B. conceived of the presented idea and study design. S.A. provided empirical back-
ing for the theoretical design. C.B. developed components of the code base for the project.
A.A. and C.B. performed computation and data collection. S.A., A.A., and C.B. performed
data analysis and interpretation. S.A. drafted the initial manuscript. C.O. provided signif-
icant guidance on the manuscript. S.A., A.A., C.O., and C.B. revised and approved of the
final manuscript.
                                               42


Chapter 4
Tessevolve: An interface for
visualizing evolutionary fitness
landscapes in 4D
Authors: Acacia L. Ackles and Emily Dolson
This chapter is adapted from an upcoming publication in revision to be submitted to IEEE
VIS 2023.
4.1      Introduction
     Evolution, once firmly in the domain of biologists, is now understood to be a fundamental
algorithmic process with wide applicability across fields. Computer scientists and engineers
in particular have recently made many advances towards understanding and harnessing evo-
lution. Evolutionary dynamics have been applied, for example, to develop self-driving cars
(1), design radio antennae for satellites (61) , and advance the state of the art for machine
learning (95).
     Across these various domains, experts are frequently interested in understanding not
only how evolution has proceeded in the past, but where it might continue in the future.
                                              43


Such insights facilitate better understanding of the capabilities and limitations of their evolv-
ing system of interest. In these cases, experts are interested in understanding the search space
of their particular problem and how evolution traverses that space (hereafter called “evo-
lutionary space”). Strong intuition for the topological properties of a given search space
promote accurate inferences and predictions about how evolution will proceed in that space.
     To facilitate this intuition, search spaces are often represented as fitness landscapes. In
these representations of evolutionary space, high-fitness areas of the search space are rep-
resented as peaks and low-fitness areas are represented as valleys. This visual metaphor is
ubiquitous and powerful, and its usefulness lies in its familiarity; it draws upon topograph-
ical representations which most people will have encountered in multiple other contexts.
However, this familiarity can be misleading as it encourages us to think about landscapes in
terms of only three spatial dimensions, whereas true evolutionary landscapes can have vastly
more dimensions.
     Indeed, mathematical work suggests that the fitness landscape metaphor has mislead
evolution researchers. High-dimensional fitness landscapes may have qualitatively different
properties that lead evolution to behave very differently than it would in a low dimensional
landscape (5). This problem has lurked in the background of evolutionary theory research for
over a decade (48). It is frequently stated as a caveat, but low-dimensional fitness landscapes
continue to be used routinely for their immense intuition-building power.
     Building intuitions with a data visualization known to potentially produce misleading
intuitions is not ideal. To begin remedying this problem, we developed Tessevolve, a web-
enabled virtual reality (VR) based tool for visualizing fitness landscapes in 2D, 3D, and 4D.
This platform enables users to begin building intuition for how fitness landscapes change as
they gain dimensions. Additionally, it allows them to plot evolutionary data on top of the
fitness landscape, to understand how the higher numbers of dimensions affect evolutionary
dynamics. While four dimensions is still far fewer than most real world fitness landscapes
contain, the ability to compare three dimensional landscapes to four dimensional landscapes
                                                44


enables a substantial advance in our ability to comprehend the effects of adding dimensions.
     To our knowledge, Tessevolve is the first 4D visual representation of the evolution of a
lineage and its surrounding landscape. Furthermore, for both 3D and 4D landscapes, domain
experts found Tessevolve more intuitive and easier to use than existing visualization methods.
Here we present the development and use cases for Tessevolve in detail, and highlight some
of the insights made possible with this novel visual interface.
4.2       Domain Background
     A central goal in evolutionary research is to understand the trajectory evolution took in
the past and predict the trajectories it is likely to take in the future. Both the reconstruction
of the past and prediction of the future require a strong working model of the evolutionary
search space; that is, what combinations of traits might an organism have, and how do those
combinations lead to reproductive success or failure? Consider as a simplifeid example the
fitness of a tree; two traits of interest might be the depth and breadth of its root system. One
might be able to quantify the fitness impacts of these two traits and write an optimization
function to find the maximum. Finding the maximum fitness, however, is of limited use
when you don’t know if evolution can actually find the necessary combination of traits; an
extremely deep and extremely wide root system might be helpful, but also energetically
infeasible. Thus, it is frequently much more important to understand the trajectory that
evolution will take through the problem space. Such knowledge is particularly critical for
efforts to prevent the evolution of dangerous traits (e.g. antibiotic resistant bacteria) by
precisely controlling the trajectory of evolution (44). It is therefore useful to have some way
to visualize the entire domain of the traits of interest as they relate to fitness.
                                                 45


Figure 4.1: Original fitness landscape schematic from Wright (1932). Plus signs represent
peaks while minus signs represent valleys. Dotted lines indicate changes in altitude (that is,
changes in fitness).
4.2.1      Evolutionary Fitness Landscapes
     The current predominant method of visualizing the evolutionary search space is the
evolutionary fitness landscape. This visual metaphor, created by Sewall Wright (106), depicts
two traits as x and y coordinates with fitness as a third z coordinate, often represented as
height (Figure 4.1). The result is indeed a landscape-like appearance, which brings a sense
of intuitive familiarity to the reader; higher fitness areas are easy to spot, and movement
around the space is easy to imagine.
     Such a familiar depiction has made fitness landscapes a nearly ubiquitous visual
metaphor. Evolution research across fields frequently uses the terms “peaks” and “valleys”
to describe areas of high and low fitness respectively. Furthermore, populations moving from
low-fitness regions to high-fitness regions are said to be “hill-climbing”, while populations
moving through low-fitness regions are “valley-crossing”. These terms and the image of the
underlying landscape are descriptive and easily understandable; unfortunately, they can also
be severely misleading as to the true nature of evolutionary space.
                                               46


4.2.2      Loss of Information & Intuition
     One obvious shortcoming of the topographical fitness landscape is that it depicts only
two traits and therefore encourages imagination of the evolutionary space in only three di-
mensions. However, most real-world evolutionary spaces are extremely high-dimensional;
any trait an organism possesses might be considered a dimension of the evolutionary land-
scape. Some conceptualizations go even farther, treating the identity of the nucleotide at
each position in an organism’s DNA sequence as a separate dimension. Intuition about
movement across low-dimensional spaces does not always translate well to intuition about
similar movement across high-dimensional spaces (5); therefore, relying on 2D visual land-
scape metaphors can mislead us about certain evolutionary dynamics.
     One such evolutionary dynamic of particular interest is the aforementioned valley-
crossing problem. In the fitness landscape metaphor, both local and global maxima are
represented as peaks. At one of these peaks, any small mutation of a trait will necessarily
shift the population to an area of lower fitness, and evolutionary pressure will then return
the population to the peak. How then are populations able to move from local maxima to
global maxima when they are effectively buffeted away from moving through the intervening
areas of low fitness?
     Research inspired by intuitions from low-dimensional landscapes tends to approach this
problem with the implicit assumption that valley crossing requires 1) multiple mutations
to occur at once, or 2) a sequence of individuals with deleterious (i.e. fitness-reducing)
mutations to survive long enough to have an offspring that lands on the opposite side of
the valley. Much effort has been devoted to understanding what conditions allow these
scenarios. However, researchers skilled in understanding the mathematical properties of
high-dimensional landscapes have suggested that this is the wrong question (48? ). In
high dimensional spaces, they argue, the probability of a valley existing in all dimensions
simultaneously is very low. In this view, populations do not need to cross valleys and
instead likely drift across neutral “ridges” between the peaks. This phenomenon is sometimes
                                                47


referred to as an “extradimensional bypass” – a path along one dimension that connects two
peaks (16).
     In this case, it would seem that attempts to visualize fitness landscapes as three-
dimensional surfaces are not aiding scientific discovery and are in fact hindering our un-
derstanding. This issue has been well-known for over a decade and is severe enough that
some domain experts have called for an end to the fitness landscape metaphor altogether
(48), but the metaphor remains popular. Many people think most clearly about problems
when they have a thorough mental image of them. Asking them to abandon that mental
image without providing a better replacement is not an ideal solution. Thus, we suggest
that the most beneficial next step for the community is to build an improved intuition for
the ways in which low-dimensional fitness landscape visualizations may mislead us. If the
fitness landscape metaphor is to continue being used, evolutionary scientists need a visceral
understanding of where it is useful and where it falls short. Here, we seek to do exactly that.
4.2.3      Holey fitness landscapes
     Another influential way of thinking about high-dimensional fitness landscapes, is the idea
of holey fitness landscapes (34). This concept is inspired by the idea that, in realistic fitness
landscapes, nearly all high-fitness genotypes are mutationally adjacent to other genotypes
of comparable fitness. Thus, it may be useful to abstract a high dimensional landscape by
compressing the space of possible fitnesses to two: viable and not viable. The resulting
search space can be visualized as a plateau (the viable genotypes) with holes cut out of it
(the not viable genotypes). Evolution in such a space is a matter of neutral diffusion along
the plateau, and can be modeled as a percolation process (35).
     While this model has been criticized both for abstracting away important dynamics
and for being misleading in its own way (48), it has so strongly influenced thinking on
how best to conceptualize high-dimensional fitness landscapes that we would feel remiss in
not mentioning it here. Although we have yet to explicitly incorporate this thinking into
                                             48


Tessevolve, it suggests a variety of valuable future directions.
4.3      Related Work
     There is clearly demand for a multidimensional alternative to the fitness landscape
metaphor which can help develop the intuition for evolutionary space provided by such a
simple visualization. Multiple approaches to this multidimensional problem currently exist;
however, they suffer from problems with scaling and in some cases fail to provide intuition
about the movement of lineages across the space over evolutionary time.
     Here we present the current standard in fitness landscape visualization in more than two
dimensions, as well as inspirations for our own extension into 3D and 4D landscape-based
visualizations in virtual reality.
4.3.1     Graph-Based Visualizations
     The current standard for visualizations beyond two dimensions is to present a graph-
based model. The most typical representation, which we term network models, is to represent
the search space as sparsely connected network. An alternative model termed Hyper-Space
Graph Paper instead represents the space as a grid of systematically arranged trait combi-
nations.
4.3.1.1    Network Models
     Network models are perhaps the most popular way to represent higher dimensional evo-
lutionary search spaces. In these models, nodes represent genotypes while edges represent
proximity; connected nodes are adjacent genotypes. (65). Directed edges in these visual-
izations can represent either the observed behavior of a lineage traversing the space, or the
direction of increasing fitness. Lineage trajectories are also sometimes represented by the
color of edges and node outlines (77). Fitness can be represented visually in a multitude of
                                               49


Figure 4.2: Network model of a fitness landscape. Lines indicate genotypes which are adja-
cent in evolutionary space. Colors indicate fitness according to key. Arrows denote traversal
of the landscape. Adapted from Wu et al., 2016. Image by Katie Gleason.
ways: as labels on the nodes (77), node size (65; 44), or colorings of the network (107; 46). An
example of a network model with fitness mapped to node colorings is provided in Figure 4.2.
     A common use of network models is to represent landscapes composed of presence/ab-
sence data for a small number of genes (often four). In evolutionary medicine, a number
of empirically-derived landscapes of this form exist (71; 78). With four genes to consider,
there are 16 possible genotypes. These 16 genotypes can be arranged in an intuitive pattern
where all mutationally-adjacent genotypes are adjacent to each other. Such a landscape is
four dimensional (indeed, these visualizations are sometimes referred to as tesseracts). How-
ever, this intuitive visual representation is only possible because each of the dimensions is
constrained to the values zero and one.
     Overall, network models do not suffer the information loss problem of 2D and 3D topo-
logical representations; every candidate genotype is represented as a node in the network.
Proximity in evolutionary space is also intuitive, represented by edges connecting nodes.
However, intuition is limited by the size of the network, which is determined by the number
of traits as well as the number of values each trait can take on. For very large networks,
visualization quickly becomes messy as the number of edges grows rapidly. Additionally,
dynamics of interest such as ruggedness valley crossing are difficult to visualize; however, see
(77) for one visual option. Finally, a particularly difficult limit of network models is that
                                               50


Figure 4.3: Hyperspace graph paper example in 3D. Each corner of the cube is “unfolded” to
create a flat grid structure. Arrows are for demonstration purposes to indicate how adjacent
corners in the cube are positioned in the unfolded grid. Greyscale represents fitness, where
more saturated greys are higher fitness. Adapted from Wiles et al., 2003. Image by Katie
Gleason.
they are by necessity discrete; continuous-valued landscapes or lineages cannot be visualized
with a finite number of nodes.
4.3.1.2    Hypergraphs
     Hyperspace Graph Paper (HSGP) is an alternative graph-based model that takes ad-
vantage of recursive structure to aid in intuition. HSGP represents binary strings of length
n as the corners of an n-dimensional hypercube, then “unfolds” that cube to produce a flat
grid representation (102). Adjacent genotypes have self-similar placement within sub-grids,
and fitness is represented via shading (Figure 4.3).
     One advantage of HSGP visualizations, as in the network models, is that they are
lossless; each point in fitness space directly translates to a position in the grid, so no infor-
mation is discarded. Unlike network models, edges are not explicitly drawn, reducing the
visual noise for larger landscapes. For HSGP models it is possible to view global and local
maxima at a glance, even in problems with multiple optima (101). However, they still share
one of the major shortcomings of network models: they are limited to discrete landscapes,
and in particular to landscapes which can be binary-coded. Additionally, since adjacency in
evolutionary space does not map directly to adjacency in the visualization, viewing lineage
traversals across the landscape is messy and unintuitive.
     Network-based models do not suffer from the same dimensionality loss as landscape-
                                                51


Figure 4.4: 2D landscape visualized in virtual reality. Black line represents a lineage travers-
ing the landscape.
based models, but this ability to expand to multiple dimensions does not necessarily mean
those expansions are intuitive. Many of the limitations of network-based models are due
to either attempting to combine a lot of information into limited spatial dimensions (i.e. a
2D image) or due to their lack of spatial structure (i.e. poor representation of proximity in
genotype space vs. fitness space). Therefore, we turn to more spatially-based models to build
intuition for higher dimensions.
4.3.2     VR-Enabled Visualizations
     Virtual reality is increasingly used to push the boundaries of data visualization. Previous
studies have shown the use of VR can support better intuition for spatial data (8), provide
a faster overview for big data visualizations (79), and provide deeper insight into the true
structure of imaging-related data (27). However, the use of VR to study evolutionary data
generally, and fitness landscapes specifically, has been extremely limited.
4.3.2.1    2D Landscapes in VR
     There is no prior work on using VR to visualize higher-dimensional landscapes, but
VR has been used to visualize 2D fitness landscapes (25; 24). Here, as in the topological
                                                52


representation, the genotype is represented as a point on the cartesian plane. The spatial
capabilities of VR are leveraged to show fitness as height, analogous to a 3D relief map
(Figure 4.4. This use of VR avoids the problems inherent to projecting 3D graphs onto 2D
pages or planes.
     The ability to visualize landscapes in their full spatial context allows for greater in-
sight into evolutionary dynamics across the landscape; for example, lineages can be traced
“through” peaks rather than “over” them (25). However, this visualization is still limited
to 2D landscapes and therefore is subject to many of the same issues as the topographical
projection, albeit with improved interpretibility and visibility.
4.3.2.2     Breaking into the Fourth Dimension
     It is relatively simple to imagine how one might represent a 3-dimensional landscape in
a 3-dimensional space, but it is more difficult to imagine representations for a 4-dimensional
landscape. The fourth dimension essentially must be projected into the 3-dimensional space,
casting a 3-dimensional “shadow”. The viewer can then see slices of the fourth dimension
by viewing 3D space. However, these slices cannot be viewed concurrently, so we require a
way to step through successive slices.
     One option is to display 3D slices in a grid format, as in one medical application termed
“Bento Box” (47). This approach allows users to simultaneously view multiple slices, which
can be helpful for comparative analysis. However, since this approach requires rendering all
slices to the screen at once, it is computationally costly for the graphical software.
     Another approach is to allow the viewer to scroll through three-dimensional slices, anal-
ogous to an MRI scan. Slices of a fourth-dimensional object are displayed one at a time,
with the viewer able to scroll through the slices at will. This allows for smoother rendering
and user control, while sacrificing the comprehensive view from the Bento Box approach.
     Using a combination of inspiration from 2D fitness landscapes in VR and existing tech-
niques to visualize 4D landscapes, we can identify key components to visualize 3- and 4-
                                               53


Dimensional fitness landscapes in virtual reality.
4.4       Domain Requirements and System Overview
     Based on the above literature, conversations with domain experts, and our own experi-
ence as active researchers in the field, we identified five key requirements for a new VR-based
fitness landscape visualization and incorporated them into our final design.
4.4.1      Requirements
     The following requirements were identified as essential to the design to meet the goal of
gaining spatial insight into how fitness landscapes behave across dimensions.
R1: Spatial Insight. Experts must get a sense of depth, distance, and scale within a
three-dimensional space in order to build spatial intuition.
R2: Ease of Access. Experts must have access to the visualization even without virtual
reality hardware in order to increase reach and usability.
R3: Ease of Interpretation. Experts must be able to interpret the visualization without
drawing upon mathematical background knowledge.
4.4.2      System Overview
     Tessevolve (tesseract + evolve) is a web-enabled three dimensional fitness landscape
visualization built to address the domain requirements listed above. With this visualization,
we seek to expand intuitive understanding of crucial domain concepts such as landscape
ruggedness and lineage traversal. We address R1 (Spatial Insight) by implementing the
visualization in an established virtual reality framework which handles depth and distance.
This framework is hooked into a web-enabled interface so as to be accessible through any
                                                54


internet connected device to address R2 (Ease of Access). Finally, we solicit and report
qualitative feedback from experts on the intuitiveness of design to address R3 (Ease of
Interpretation).
4.5       Implementation of Tessevolve
     Here we describe in depth the data model, design choices, and implementation features
involved in the development of Tessevolve. Our visualization makes use of 3D projections
in virtual reality to scale up the fitness landscape metaphor into multiple dimensions. We
use the emerging digital evolution platform MABE2 to generate evolutionary data, the AL-
ife Data Standards Python package to process lineage information, and the open-source
platforms A-Frame, D3.js, and Bootstrap to create easily accessible web and VR interfaces
for the final data visualization. The result is a fully open source visualization of complex,
multidimensional phylogenetic and evolutionary data. The full GitHub repository for the
project, including all scripts involved in data generation, pre-processing, and visualization,
is available at https://github.com/alackles/tessevolve (2), and the platform itself is
accessible at https://alackles.github.io/tessevolve. Note that for all figures in this
paper, due to the nature of the images, they would likely better be viewed at the live demo
linked above.
4.5.1      Data Model
     Currently, there is no broadly accepted standard for fitness landscape data. Our data
model therefore accepts data in CSV format, with initial columns x0, x1, x2, x3 repre-
senting the traits of interest, and a column fitness which represents the fitness associated
with those trait values. Columns x2 and x3 are optional dependent on the dimensionality
of the underlying landscape (i.e., a 3D landscape would only have columns x0, x1, x2,
fitness).
                                               55


     We were also interested in visualizing lineages’ evolution (phylogenetic data) across these
fitness landscapes. Since in this case the data we visualized was from computationally evolved
organisms, we use the ALife Data Standard for phylogenetic data (58). In this standard,
information about an organisms’s traits is recorded every time a new lineage forms. In the
future, Tessevolve could be expanded to accept phylogenetic data in other formats such as
the biologically-standard Newick format.
4.5.2      Data Generation and Pre-Processing
     To generate the fitness landscapes used for our web demo of Tessevolve, we turned
to the CEC’2013 Benchmark functions which provide both Python and C++ (among
other languages) implementations of multiple functions across dimensions (60). The fit-
ness landscape data itself was generated using the Python implementations, while the
C++ implementations were incorporated into the digital evolution platform MABE2
(https://github.com/mercere99/MABE2) to generate lineage data. We then used the ALife
Standards Python package to process the lineage data into a standardized CSV format for
the visual front-end (58).
4.5.3      Virtual Reality Engine
     Virtual reality allows for users to take maximal ability of the brain’s ability to build
intuition about physical spaces. By being immersed in the visualization, they can develop a
visceral understanding of how it behaves. However, to promote accessibility, it is ideal for
VR visualizations to also be viewable in a web browser, as a fallback. By allowing web users
to manipulate the visualization with their mouse, such a format can still provide a far better
experience than a static 2D projection.
     To this end, we chose to implement Tessevolve using the webXR specification, which
supports browser-based VR scenes that can be viewed in a browser in addition to on a VR
headset. Specifically, we used Mozilla’s AFrame framework (73), which facilitates building a
                                                56


Figure 4.5: Illustration of the challenges of having a continuous 3D landscape visualization.
(A) A landscape visualization where all genotypes are displayed. (B) The poor view from
inside A. (C) A landscape visualization where genotypes are displayed at fixed, discrete
intervals. (D) The view from inside C.
VR visualization out of html components. This structure allows it to play nicely with d3.js
(15), which we used to connect the data into the visualization.
4.5.4      Design Choices
     The most crucial design decision we had to make was how to display the 3D landscape
in a way that was both intuitive and informative. In particular, we needed to decide how
users would be able to see multiple points in a landscape simultaneously. For example, a
true 3D analogue to the traditional 2D landscape would be akin to an aquarium tank, where
every spot in the 3D space conveys some visual information about the fitness at that point.
However, such a design would completely block the vision of the viewer when they are inside
the landscape; there is no space to see “around” any component to investigate other regions
(Figure 4.5)
     We therefore chose to display discrete, evenly-spaced points on the landscape rather
                                               57


than a continuous mesh. These points are displayed as spheres at 90% opacity, allowing
for both distant and nearby inspection of the landscape. While this technique somewhat
reduces the resolution of the fitness landscape, it greatly increases its interpretability.
     Another challenge was how to indicate the fitness at each point when all three of our
spatial dimensions are used to convey information about trait values. For this purpose,
we use a color scale, as is commonly implemented to visualize imaginary functions. This
method should be easily recognizable to domain experts due to its frequent use in other
mathematical visualizations. We chose to use the viridis color scheme plasma variant as it
is both colorblind friendly and maps “cool” colors to low values and “warm” colors to high
values, further increasing familiarity.
     We also wanted to display lineage data on landscapes. We represent newly born taxa
as cubes to distinguish from the landscape’s spheres, and consecutive taxa are connected
by a simple line. The color scheme for the fitness of each taxon is the same as that of the
landscape.
4.5.5     Visualization Interface
     Tessevolve is available through any modern web browser at https://alackles.github.
io/tessevolve.
     The front-end interface for Tessevolve is a simple interactive web page (Figure 4.6).
The top banner contains brief information about the site, displays for additional info, and
a settings panel. Under the settings panel is an embedded A-Frame scene which can be
adjusted to fill the screen or, in a VR headset, sent to VR mode.
     Under the hood, the site stores pre-processed landscape and lineage data for several
evolutionary conditions and replicates. When users adjust the settings in the settings panel
and click the Draw button to cue the site to reload, the embedded scene clears the old data
and loads the new.
     The result is a 3D projection of the selected landscape. Users on the web version can
                                              58


Figure 4.6: Tessevolve web interface. Changing the settings in the settings drop-down panel
and clicking ‘Draw’ allows users to load an updated landscape. The FAQ provided in the
top right contains detailed information about the display. Drop-down menus allow users to
select the combinations of parameters they are interested in seeing; the Draw button loads
that configuration into the JavaScript DOM. The configuration of settings shown here is
used in all other figures unless otherwise stated.
                                              59


navigate around the projection with the WASD or arrow keys and change viewing direction
using the mouse; in VR, they can move with the left joystick or by walking around the space.
4.5.6     Dimension Comparisons
     Users can view the same landscape and its associated lineage in 2D, 3D, or 4D. Each
dimension has its own unique design choices to help build intuition about extension into
higher dimensions.
4.5.6.1    2D Landscapes
     2D landscapes are displayed as a vertical grid of spheres in 3D space. Trait values are
mapped to vertical and horizontal coordinates, while fitness is mapped to the Plasma color
scheme (FIG). These landscapes are simple enough that they could be represented using the
traditional relief map technique, where fitness is used as a third dimension; see for example
Figure 4.4. However, to keep visualization consistent across dimensions and maintain the
analogy from 2D to 3D and 3D to 4D, 2D landscapes here are instead shown as if they were
a single “slice” of a 3D landscape.
     Lineage data is displayed as cubes connected by lines. Each cube represents the birth of
a new taxon—that is, in this case, a novel trait value. Cube positions represent trait values
and color represents fitness; lines connect consecutive taxa.
4.5.6.2    3D Landscapes
     3D landscapes are displayed as an evenly-spaced mesh of spheres in 3D space. As in
2D, traits and fitness are mapped to position and color respectively in both landscapes
and lineages. The only difference between the 2D and 3D landscape visualizations is the
additional dimension.
                                              60


4.5.6.3     4D Landscapes
      4D landscapes required more complex design choices to account for the fourth dimension,
which cannot be displayed concurrent with the other three. Here, the user sees one 3D “slice”
of the landscape at a time, and can move through the slices by scrolling. Each slice represents
a different value for the fourth dimension, and fitness is computed based on the total trait
values across all four dimensions. The result is that the spheres’ positions remain fixed,
while the colors change across the landscape to represent how fitness is changing over each
slice. This representation was loosely inspired by Bosch’s 4D toy-box application (14).
      Lineage data also adjusts depending on the current 3D slice. The connections between
nodes are always displayed, but the colors of the nodes are only displayed when that node
falls on the currently-viewed slice. Otherwise, nodes are greyed out and transparent to
indicate that they are not present in this slice.
4.6       Expert Feedback
      We solicited feedback from three domain experts whose primary disciplines were in three
different fields in which evolutionary fitness landscapes are common: one in evolutionary and
theoretical biology, one in machine learning and evolutionary computation, and one in digital
evolution and artificial life. They were asked to rate the ability to use and interpret data in
Tessevolve as easier, harder, or about the same as current methods they are familiar with,
and then to provide open-ended responses about their experience with the platform. Expert
feedback was generally positive, especially regarding interpretability across dimensions and
the intuitive use of the visualization tool.
4.6.1      Ease of Interpretation
      Two of the three experts rated Tessevolve as easier to interpret than current methods
for both 3D and 4D data, with one pointing out that so few methods even exist for 3D and
                                                 61


4D data that they currently avoid attempting to visualize such data. These experts also
provided qualitative feedback about the ease of interpretation:
       Expert 1: I had a hard time answering some of the questions [about comparative
       ease of use] since I don’t generally try to visualize 3D or 4D fitness landscapes
       since I don’t have tools to do so easily.
       Expert 2: Overall I was pleasantly surprised by how easy it was to view both
       landscapes and lineages in 2D and 3D! I believe the 4D mode would become
       more useful as time goes on and you are better able to wrap your brain around
       it. That said, even with just a few moments I was able to see the effect of
       the fourth dimension on some landscapes, so it may not take long to build that
       intuition.
     One expert found the 3D data harder to interpret than current methods, and was unsure
about the interpretability of 4D data. However, their feedback on the tool indicated they
were unable to find the explanation of positionality and color mappings provided on the
website:
       Expert 3: Moreover, I think you might add a line or two with the syntax of
       the visualisation; I am not a big expert in landscape visualisation, but I found it
       non-intuitive.
     This syntax was at the time provided in the FAQ, but based on this feedback we moved
it to the front matter of the website.
     All three experts found 2D data harder to interpret or about the same as current meth-
ods. This makes sense, as current methods such as topographical fitness landscapes are
explicitly designed to handle 3D data, while Tessevolve explicitly chooses to sacrifice 2D
interpretability in favor of improved ease of comparison to multiple dimensions.
                                                 62


4.6.2     Ease of Use
     Two of the experts agreed that the controls for Tessevolve were intuitive and that it ran
well in the browser and in the Oculus headset:
      Expert 1: I’m very impressed how well it runs in the browser and the controls
      were intuitive.
      Expert 2: I could definitely use this in my own work, as visualizing lineages in
      3D is no easy task, even though it can really help build your understanding of the
      system. The VR support in particular is really useful for actually understanding
      the space in 3D.
     Expert 3 had some trouble rendering the 2D landscapes, but we and the other experts
were unable to replicate this difficulty; they did not provide other comments related to ease
of use.
4.6.3     Constructive Feedback
     All three experts provided some constructive feedback on the visualization that we
incorporated into our final design.
     As noted, one expert had trouble interpreting the landscape’s design, and we attribute
this to the fact that this information was in a pop-up menu in the FAQ. We therefore moved
this information to the front of the page. We similarly moved information about the color
scheme to the front page based on feedback from Expert 2 regarding uncertainty about
whether the scheme transferred to the lineage data:
      Expert 2: One thing I would suggest would be a guide to the color system for
      both the landscape and the lineage. It appears that the landscape shifts from
      purple to yellow as fitness increases, but the lineage colors are more difficult to
      deduce.
                                               63


Figure 4.7: Ruggedness in Vincent (top) and Composite Fitness 1 (bottom) landscape. Note
how properties of ruggedness transfer across dimensions. (A,D) 2-Dimensional landscape.
(B,E) 3-Dimensional landscape. (C,F) Adjacent slices of 4-Dimensional landscape.
      Given that experts generally found Tessevolve easier to use for multidimensional data,
and critical comments were related to the front-facing interface rather than the visualization
itself, this initial round of expert feedback indicates Tessevolve can be an effective tool for
3D and 4D landscape data visualization.
4.7       Discussion
      In addition to the expert comments on ease of use and interpretability, we found that
Tessevolve provides novel insight about which landscape dynamics may change as dimensions
increase, and which may remain constant. In particular, we found that Tessevolve allowed us
to see that, for the landscapes used here, landscape ruggedness remains relatively unchanged
as dimensions increase, but the ability to perform valley-crossing greatly increases.
4.7.1       Landscape Ruggedness
      Landscape ruggedness is a measure of how quickly fitness changes across a landscape.
In smooth landscapes, points that are close together in evolutionary space will have similar
fitness values. However, in very rugged landscapes, close points could have very different
fitness values.
      The visual, color-based display of Tessevolve allowed us to see that landscapes which are
rugged in lower dimensions also tend to be rugged in higher dimensions (Figure 4.7). This is
                                               64


Figure 4.8: Representative valley-crossing dynamics in 2D, 3D, and 4D. In each case, the
Shubert function is the underlying evolutionary landscape, and populations mutate every
generation with a standard deviation of 0.0001 and tournament size 16. The only difference
in each is case is the number of dimensions. (A) 2D lineage where no valley-crossing occurs.
(B) 3D lineage where valley-crossing occurs once across a large valley. (C) 4D lineage where
the lineage stays primarily near one peak, but hops around nearby peaks frequently.
mathematically unsurprising but previously difficult to visually demonstrate. Here, a smooth
color gradient either across space (for 2D or 3D) or on scrolling (for 4D) easily maps to the
concept of a smooth fitness gradient; similarly, colors that change very rapidly for adjacent
spheres or slices are visually easy to notice and therefore easy to draw intuition about. This
visual depiction of landscape ruggedness across dimensions helps reinforce intuition about
how similar in fitness two similar trait states might be in higher-dimensional landscapes,
even when we only have lower dimensional analogues.
4.7.2      Valley-Crossing in 2D vs. 3D vs. 4D
     In contrast to landscape ruggedness, one landscape feature that changed dramatically
across landscapes was ease of landscape traversal and particularly ease of valley-crossing.
As noted earlier, valley crossing is of particular interest because it helps to explain how
                                               65


populations transition from local optima to global optima. Here, the ability to visualize a
lineage evolving across both a 2D and 3D landscape in multiple replicates demonstrates a
quantitative difference in landscape traversal between replicates. For one set of parameters
displayed in Figure 4.8, valley crossings from optimum to optimum occurred in 0 out of 10
replicates, while in 3D landscapes, they occurred in 8 out of 10 replicates. In all eight of
these transitions, there was only a single valley crossing observed in the 1000 generations,
and these peaks tended to be reasonably far apart (approximately half the distance of the
entire landscape).
     In 4D, however, rather than a single valley crossing from a one local optima to another,
lineages tended to “bounce” around closer peaks multiple times across their evolutionary
history. While there were some “major” valley crossing as in 3D, the dominant dynamic was
these more “minor” crossings.
     The dynamics underlying these major vs. minor valley crossings are not obvious from
the visualization, nor are they intended to be; however, this visualization provides the insight
that examining these large or small crossings might change across dimensions.
4.8      Conclusion
     Here we introduced Tessevolve, a platform for visualizing fitness landscapes in 2D, 3D,
and 4D. The concept was inspired by a lack of existing tools for qualitatively understanding
how fitness landscapes change as additional dimensions are added. It was designed to be
intuitive, customizable, and extensible. Based on expert feedback, we conclude that Tesse-
volve is an effective and easy-to-use tool for gaining a stronger intuitive understanding of
complex fitness landscapes.
     Experts found that when viewing 3D and 4D landscapes, Tessevolve was generally easier
to use and interpret than existing platforms they are familiar with. They also reported that
the landscape visualization was easy to interpret, even when given minimal information
                                               66


beyond the built-in interface. We incorporated their feedback about adding a clearer and
more apparent key to the visualization components to improve Tessevolve’s usability and
overall design.
     In the future, Tessevolve could incorporate additional sources of information for addi-
tional dimensionality. In particular, leveraging the audio input capabilities of VR headsets
could allow us to expand into more dimensions, or replace color as a fitness indicator to
make the visualization more accessible to blind or low-vision researchers.
     Another improvement we are considering, which may be particularly helpful for large
and complex landscapes, would be to add a mode inspired by the holey fitness landscape
concept (see Domain Background). In this mode, the user would enter a fitness threshold
that they were interested in viewing. Tessevolve would then render 3D shapes representing
the regions of the search space with fitnesses below that threshold. This approach would
allow the user to quickly build intuition about a larger region of the search space, at the cost
of reducing the resolution at which fitness is displayed.
     The generally positive feedback from domain experts indicates that Tessevolve is a valu-
able new tool for evolutionary biology and computational evolution. While the number of
dimensions Tessevolve can display is still limited, the ability to compare properties across
dimensions allows for insight into how dimensionality affects evolutionary dynamics. In par-
ticular, Tessevolve reveals the ways landscape ruggedness and landscape traversal intersect
with the number of landscape dimensions. It is our hope that Tessevolve, and other VR-
based visualizations, can continue to push the boundaries of human ability to comprehend
complex fitness landscapes. It has been decades since the problems with building intuition
from low-dimensional visual fitness landscape metaphors were first pointed out. Perhaps
now we can finally begin building a better visual metaphor.
                                              67


4.9      Acknowledgements
     The authors sincerely thank artist Kathleen (Katie) Gleason for her contribution to
the figures for this paper. We thank Charles Ofria, Clifford Bohm, and Vincent Ragusa for
helpful comments on early versions of Tessevolve. This material is based in part upon work
supported by a National Science Foundation Graduate Research Fellowship to A.A. and
by the National Science Foundation under Cooperative Agreement No. DBI-0939454. Any
opinions, findings, and conclusions or recommendations expressed in this material are those
of the author(s) and do not necessarily reflect the views of the National Science Foundation.
This work is also supported by Michigan State University. Michigan State University, where
this work was conducted, occupies the ancestral, traditional, and contemporary Lands of the
Anishinaabeg–Three Fires Confederacy of Ojibwe, Odawa, and Potawatomi peoples. The
University resides on Land ceded in the 1819 Treaty of Saginaw.
                                              68


Chapter 5
Conclusion
5.1       Introduction
     In this dissertation, I introduced three new methods and metrics for investigating con-
nectivity in evolvable systems at the genotype, phenotype, and landscape levels. Each of
these new methods provided insight into their respective systems that existing techniques
obfuscated or failed to detect. Together, these new tools represent both quantitative and
qualitative improvements towards our understanding of key evolutionary processes.
5.2       Contributions
     In summary, this dissertation makes the following contributions:
   • In Chapter 2, I developed a rank-based metric for measuring genome connectivity
      that does not rely on underlying assumptions of baseline interaction. This model was
      able to correctly identify when loci were non-interactive in a case where existing metrics
      failed. Rank epistasis therefore allows us to better detect when epistasis is present,
      leading to more accurate understanding of interactivity within genomes.
   • In Chapter 3, we introduced the comparative hybrid method, an analytic tool for
                                               69


       comparing computational cognitive structures. Using this method, we showed that
       connectivity and discretization are important components in the evolution of associa-
       tive learning. These results demonstrate the value of a piecewise approach to analysis
       of evolving systems.
    • In Chapter 4, I describe Tessevolve, a platform for visualizing and exploring fitness
       landscapes in up to four dimensions. Tessevolve expands our current ability to view and
       manipulate landscapes visually and allows us to view how patterns of ruggedness and
       mutational landscape connectivity change across dimensions. This platform represents
       a novel approach to the visualization of 3D and 4D fitness landscapes.
     As a whole, this dissertation contributes to our understanding of connected systems
across multiple scales. it provides new tools and sets up a framework for thinking about
connected systems in a digital evolution context, and lays groundwork for future studies on
the computational study of evolvable networks and systems.
5.3       Future Directions
     I have so far provided overviews of three distinct approaches to evolutionary phenomena
as well as representative examples of phenomena studied with each metric. These metrics
dealt primarily with investigating the underlying connected structures of their respective
evolvable systems. Here I present two branches of future directions to further investigate
these systems: introducing information theory to current methods, and expanding our ability
to interface with our data with novel metrics.
5.3.1      Information-theoretic extensions to existing metrics
     The metrics introduced here have produced promising results for studying connectivity
in evolvable systems. Further building upon these metrics would allow us to work towards a
                                               70


more complete picture of the underlying physical and mathematical constraints on our sys-
tems of interest. In particular, the quantitative and broadly applicable nature of information
theory is a promising direction for metric expansion.
5.3.1.1    Rank epistasis on information-theoretic networks
     The effect of epistasis on genotype structure can be quantified using information theory
via statistical epistasis networks (72; 66). In essence, epistatic loci or genes are nodes on
the network, while edges represent interaction. These edges can be derived from the rank
epistasis process by tracking which genotype rankings are altered for each double-mutant.
     Then, information theory as developed on networks can be applied to these epistatic
networks; in particular, those information-theoretic metrics which were developed to quan-
tify evolutionary networks (92; 4; 17). By viewing epistatic interactions as a network, we
effectively get all the tools of network information theory to apply to epistasis.
5.3.1.2    Adding informational network flow to the Comparative Hybrid Method
     In introducing the Comparative Hybrid Method, I discussed how the method gives a
high-level, qualitative understanding of the impact of structure on function. However, quan-
tifying this impact is also essential for deeper study of connectivity and constraint within
computational architectures. One direction for such quantification is analysis of informa-
tional network flow, or how information traverses the computational cognitive networks to
create output. Such a technique has been used in to investigate cognitive networks in the
past and could readily be applied to comparative hybrid structures in the future (13). By
analyzing the informational network flow of hybrids versus their canonical counterparts, we
could understand how changing structure on the phenotypic level changes information at the
genotypic level, expanding our understanding of their mapping.
                                               71


5.3.2      Expanding our ability to interface with biological data
     In the past, our ability to interface with data has been limited to charts, tables, for-
mulae, and figures–all of which are two-dimensional. However, biology moves and lives in
three dimensions, and data about biology exists in many more dimensions than that. There-
fore, leveraging emerging technology to expand our spatial and dimensional understanding
of connections between system components will be essential to future breakthroughs in evo-
lutionary theory.
5.3.2.1    Refining intuition in Tessevolve
     While Tessevolve is a powerful prototype in its current form, feedback from colleagues
and reviewers suggests several immediate improvements to its design. We are first interested
in adding the ability to display only regions of fitness above a certain threshold value; this
would allow users to scroll through the visualization in 4D and watch the fitness region
grow or shrink to identify peaks and valleys. We are similarly interested in expanding the
visualization of phylogenies through multiple dimensions through a similar fluid growing and
shrinking of phylogeny space throughout dimensions.
5.3.2.2    Creating audioscapes in VR
     The 3D visual landscape provided by virtual reality is a powerful tool for exploring
fitness landscapes. An immediate next step in this research is to leverage the immersive
sound capabilities of VR headsets to map landscape information to audio output. Such data
sonification has been used in astrophysics to reveal patterns not obvious from mathematical
or visual models alone (37). By leveraging audio as well as visual input, we can further
expand the number of available dimensions in our audiovisual model. Sonfiication would also
expand the use case for these VR models to scientists who are blind/low-vision, colorblind,
or otherwise may benefit from audio data interpretation over visual.
                                              72


5.3.2.3    Spatial models of genome structure
     Finally, in the next few years of my research I hope to merge the development of new
frameworks with novel 3D data interfacing and develop multidimensional spatial models of
genomes and genetic networks. Genome models in artificial life and evolutionary computa-
tion are typically one-dimensional (i.e. vectors), but biological genomes evolve and interact in
three-dimensional space. Much like how protein structure dictates function, understanding
the 3D form of genomes is essential to understanding complex processes such as gene regula-
tion and modification (69; 93). Therefore, I would create novel models of genetic structures
and networks to investigate how evolutionary forces are shaped by space and dimensionality.
Such modeling could range from comparing evolutionary dynamics of digital genomes of dif-
ferent dimensionality to creating entirely new evolving genome models within 3D physics em-
ulators. Similar work has been undertaken in protein folding (Finkelstein and Galzitskaya),
but has not yet been extended to genome structure. These spatial models could hopefully
provide insight into what it is like to be where science can only go in theory: inside the
genome itself.
5.4       Final Thoughts
     When we think about evolutionary biology we often think of finches and apes, birds and
bees. But evolution in the abstract is more a process than it is a feature of any particular
system or model, including those discussed and introduced here. Recognizing evolution as
a process, and developing new approaches to study the intricacies of that process, gives us
broad insight into the rich natural world in which we live. It is through this insight that we
can begin to make connections out of the chaos.
                                               73


BIBLIOGRAPHY
      74


                                       BIBLIOGRAPHY
[1] AbuZekry, A., Sobh, I., Hadhoud, M., and Fayek, M. (2019). Comparative Study of
   NeuroEvolution Algorithms in Reinforcement Learning for Self-Driving Cars. European
   Journal of Engineering Science and Technology, 2(4):60–71. Number: 4.
[2] Ackles, A. (2022). alackles/tessevolve: submission.
[3] Ackley, D. and Small, T. (2014). Indefinitely Scalable Computing = Artificial Life Engi-
   neering. In Artificial Life 14: Proceedings of the Fourteenth International Conference on
   the Synthesis and Simulation of Living Systems, pages 606–613. The MIT Press.
[4] Adami, C., Qian, J., Rupp, M., and Hintze, A. (2011). Information content of colored
   motifs in complex networks. Artificial Life, 17(4):375–390. eprint: 1104.0025.
[5] Agarwala, A. and Fisher, D. S. (2019). Adaptive walks on high-dimensional fitness land-
   scapes and seascapes with distance-dependent statistics. Theoretical Population Biology,
   130:13–49.
[6] Aguilar, W., Santamarı́a-Bonfil, G., Froese, T., and Gershenson, C. (2014). The Past,
   Present, and Future of Artificial Life. Frontiers in Robotics and AI, 1.
[7] Albani, S., Ackles, A. L., Ofria, C., and Bohm, C. (2021). The Comparative Hybrid Ap-
   proach to Investigate Cognition across Substrates. In ALIFE 2021: The 2021 Conference
   on Artificial Life, page 9.
[8] Ambinder, M. S., Wang, R. F., Crowell, J. A., Francis, G. K., and Brinkmann, P. (2009).
   Human four-dimensional spatial intuition in virtual reality. Psychonomic Bulletin & Re-
   view, 16(5):818–823.
[9] Binkhonain, M. and Zhao, L. (2019). A review of machine learning algorithms for identifi-
   cation and classification of non-functional requirements. Expert Systems with Applications:
   X, 1. Publisher: Elsevier Ltd.
[10] Blount, Z. D., Borland, C. Z., and Lenski, R. E. (2008). Historical contingency and the
   evolution of a key innovation in an experimental population of Escherichia coli. Proceedings
   of the National Academy of Sciences, 105(23):7899–7906.
[11] Bohm, C., Ackles, A. L., Ofria, C., and Hintze, A. (2020). On Sexual Selection in the
   Presence of Multiple Costly Displays. page 8.
[12] Bohm, C., C G, N., and Hintze, A. (2017). MABE (Modular Agent Based Evolver):
   A framework for digital evolution research. In Proceedings of the Fourteenth European
   Conference on Artificial Life, pages 76–83. The MIT Press.
                                                75


[13] Bohm, C., Kirkpatrick, D., Cao, V., and Adami, C. (2022). Information Fragmentation,
   Encryption and Information Flow in Complex Biological Networks. Entropy, 24(5):735.
   Number: 5 Publisher: Multidisciplinary Digital Publishing Institute.
[14] Bosch, M. T. (2020). N -dimensional rigid body dynamics. ACM Transactions on
   Graphics, 39(4).
[15] Bostock, M., Ogievetsky, V., and Heer, J. (2011). D³ Data-Driven Documents. IEEE
   Transactions on Visualization and Computer Graphics, 17(12):2301–2309.
[16] Cariani, P. A. (2002). Extradimensional bypass. Biosystems, 64(1):47–53.
[17] Carpi, L. C., Rosso, O. A., Saco, P. M., and Ravetti, M. G. (2011). Analyzing complex
   networks evolution through Information Theory quantifiers. Physics Letters, Section A:
   General, Atomic and Solid State Physics, 375(4):801–804. Publisher: Elsevier B.V.
[18] Clune, J., Mouret, J.-B., and Lipson, H. (2013). The evolutionary origins of modularity.
   Proceedings of the Royal Society B: Biological Sciences, 280(1755):20122863. Publisher:
   Royal Society.
[19] Cobey, S., Larremore, D. B., Grad, Y. H., and Lipsitch, M. (2021). Concerns about
   SARS-CoV-2 evolution should not hold back efforts to expand vaccination. Nature Reviews
   Immunology, 21(5):330–335.
[20] Cordell, H. J. (2002). Epistasis: what it means, what it doesn’t mean, and statistical
   methods to detect it in humans. Human Molecular Genetics, 11(20):2463–2468.
[21] Crisci, C., Ghattas, B., and Perera, G. (2012). A review of supervised machine learning
   algorithms and their applications to ecological data. Ecological Modelling, 240:113–122.
[22] Davies, J. and Davies, D. (2010). Origins and Evolution of Antibiotic Resistance. Mi-
   crobiology and Molecular Biology Reviews, 74(3):417–433.
[23] de Visser, J. A. G. M., Cooper, T. F., and Elena, S. F. (2011). The causes of epistasis.
   Proceedings of the Royal Society B: Biological Sciences, 278(1725):3617–3624.
[24] Dolson, E., Lalejini, A., Jorgensen, S., and Ofria, C. (2020). Interpreting the Tape of
   Life: Ancestry-based Analyses Provide Insights and Intuition about Evolutionary Dynam-
   ics. Artificial Life, 26(1):1–22.
[25] Dolson, E. and Ofria, C. (2018). Visualizing the tape of life: exploring evolutionary
   history with virtual reality. In Proceedings of the Genetic and Evolutionary Computation
   Conference Companion, pages 1553–1559, Kyoto Japan. ACM.
[26] Edlund, J. A., Chaumont, N., Hintze, A., Koch, C., Tononi, G., and Adami, C. (2011).
   Integrated information increases with fitness in the evolution of animats. PLoS Compu-
   tational Biology, 7(10):1002236. Publisher: Public Library of Science eprint: 1103.1791.
                                               76


[27] El Beheiry, M., Doutreligne, S., Caporal, C., Ostertag, C., Dahan, M., and Masson, J.-B.
   (2019). Virtual Reality: Beyond Visualization. Journal of Molecular Biology, 431(7):1315–
   1321.
[28] Elena, S. F. and Lenski, R. E. (1997). Test of synergistic interactions among deleterious
   mutations in bacteria. Nature, 390(6658):395–398.
[Finkelstein and Galzitskaya] Finkelstein, A. V. and Galzitskaya, O. V. Physics of protein
   folding. 1(1):23–56.
[30] Fletcher, R. J., Revell, A., Reichert, B. E., Kitchens, W. M., Dixon, J. D., and Austin,
   J. D. (2013). Network modularity reveals critical scales for connectivity in ecology and
   evolution. Nature Communications, 4(1):2572. Number: 1 Publisher: Nature Publishing
   Group.
[31] Franklin, J., LaBar, T., and Adami, C. (2019). Mapping the Peaks: Fitness Landscapes
   of the Fittest and the Flattest. Artificial Life, 25(3):250–262.
[32] Frischmon, C., Sorenson, C., Winikoff, M., and Adamala, K. P. (2021). Build-a-Cell:
   Engineering a Synthetic Cell Community. Life, 11(11):1176. Number: 11 Publisher:
   Multidisciplinary Digital Publishing Institute.
[33] Félix, M.-A. and Wagner, A. (2008). Robustness and evolution: concepts, insights and
   challenges from a developmental model system. Heredity, 100(2):132–140. Number: 2
   Publisher: Nature Publishing Group.
[34] Gavrilets, S. (1997). Evolution and speciation on holey adaptive landscapes. Trends in
   Ecology & Evolution, 12(8):307–312.
[35] Gavrilets, S. (2010). High-Dimensional Fitness Landscapes and Speciation. In Pigliucci,
   M. and M{ü}ller, G. B., editors, Evolution – the Extended Synthesis, pages 45–80. The
   MIT Press.
[36] Ghosh-Dastidar, S. and Adeli, H. (2009). Spiking neural networks. International journal
   of neural systems, 19(04):295–308. Publisher: World Scientific.
[37] Gibney, E. (2020). How one astronomer hears the Universe. Nature, 577(7789):155.
[38] Gupta, A. and Adami, C. (2016). Shared Information between Residues Is Sufficient
   to Detect Pairwise Epistasis in a Protein. PLOS Genetics, 12(12):e1006471. Publisher:
   Public Library of Science.
[39] Han, J.-D. J., Bertin, N., Hao, T., Goldberg, D. S., Berriz, G. F., Zhang, L. V., Dupuy,
   D., Walhout, A. J. M., Cusick, M. E., Roth, F. P., and Vidal, M. (2004). Evidence
   for dynamically organized modularity in the yeast protein–protein interaction network.
   Nature, 430(6995):88–93. Number: 6995 Publisher: Nature Publishing Group.
                                               77


[40] Handorf, T., Ebenhöh, O., and Heinrich, R. (2005). Expanding Metabolic Networks:
   Scopes of Compounds, Robustness, and Evolution. Journal of Molecular Evolution,
   61(4):498–512.
[41] Hawkins, J., Ahmad, S., and Dubinsky, D. (2011). Hierarchical Temporal Memory
   (HTM) Whitepaper. Numenta. Inc, Tech. Rep.
[42] Hintze, A., Edlund, J. A., Olson, R. S., Knoester, D. B., Schossau, J., Albantakis,
   L., Tehrani-Saleh, A., Kvam, P., Sheneman, L., Goldsby, H., Bohm, C., and Adami, C.
   (2017). Markov Brains: A Technical Introduction.
[43] Hintze, A., Schossau, J., and Bohm, C. (2019). The Evolutionary Buffet Method. In
   Banzhaf, W., Spector, L., and Sheneman, L., editors, Genetic Programming Theory and
   Practice XVI, pages 17–36. Springer, Cham. Section: 2.
[44] Iram, S., Dolson, E., Chiel, J., Pelesko, J., Krishnan, N., Gungor, O., Kuznets-Speck,
   B., Deffner, S., Ilker, E., Scott, J. G., and Hinczewski, M. (2021). Controlling the speed
   and trajectory of evolution with counterdiabatic driving. Nature Physics, 17(1):135–142.
[45] Jaeggi, S. M., Buschkuehl, M., Perrig, W. J., and Meier, B. (2010). The concurrent
   validity of the N-back task as a working memory measure. Memory, 18(4):394–412.
[46] Jimenez, J. I., Xulvi-Brunet, R., Campbell, G. W., Turk-MacLeod, R., and Chen, I. A.
   (2013). Comprehensive experimental fitness landscape and evolutionary network for small
   RNA. Proceedings of the National Academy of Sciences, 110(37):14984–14989.
[47] Johnson, S., Orban, D., Runesha, H. B., Meng, L., Juhnke, B., Erdman, A., Samsel, F.,
   and Keefe, D. F. (2019). Bento Box: An Interactive and Zoomable Small Multiples Tech-
   nique for Visualizing 4D Simulation Ensembles in Virtual Reality. Frontiers in Robotics
   and AI, 6:61.
[48] Kaplan, J. (2008). The end of the adaptive landscape metaphor? Biology & Philosophy,
   23(5):625–638.
[49] Kassambara, A. (2021). rstatix: Pipe-Friendly Framework for Basic Statistical Tests.
[50] Kauffman, S. and Levin, S. (1987). Towards a general theory of adaptive walks on
   rugged landscapes. Journal of Theoretical Biology, 128(1):11–45.
[51] Kaznatcheev, A. (2019). Computational Complexity as an Ultimate Constraint on
   Evolution. Genetics, 212(1):245–265.
[52] Kelly, S. and Heywood, M. I. (2017). Multi-task learning in atari video games with
   emergent tangled program graphs. In Proceedings of the Genetic and Evolutionary Com-
   putation Conference, pages 195–202.
[53] Kennedy, D. A. and Read, A. F. (2018). Why the evolution of vaccine resistance is less
                                                78


   of a concern than the evolution of drug resistance. Proceedings of the National Academy
   of Sciences, 115(51):12878–12886.
[54] Khan, A., Baharudin, B., Hong Lee, L., and Khan, K. (2019). A Review of Machine
   Learning Algorithms for Text-Documents Classification. Journal of Advances in Informa-
   tion Technology, 1(1):4–20.
[55] Khan, A. I., Dinh, D. M., Schneider, D., Lenski, R. E., and Cooper, T. F. (2011).
   Negative Epistasis Between Beneficial Mutations in an Evolving Bacterial Population.
   Science, 332(6034):1193–1196.
[56] Kim, K.-J. and Cho, S.-B. (2006). A Comprehensive Overview of the Applications of
   Artificial Life. Artificial Life, 12(1):153–182.
[57] Kriegman, S., Blackiston, D., Levin, M., and Bongard, J. (2020). A scalable pipeline
   for designing reconfigurable organisms. Proceedings of the National Academy of Sciences,
   117(4):1853–1859. eprint: https://www.pnas.org/doi/pdf/10.1073/pnas.1910837117.
[58] Lalejini, A., Dolson, E., Bohm, C., Ferguson, A. J., Parsons, D. P., Rainford, P. F.,
   Richmond, P., and Ofria, C. (2019). Data Standards for Artificial Life Software. In
   ALIFE 2019: The 2019 Conference on Artificial Life, pages 507–514. MIT Press.
[59] Lalejini, A. and Ofria, C. (2018). Evolving event-driven programs with SignalGP. In
   Proceedings of the Genetic and Evolutionary Computation Conference, pages 1135–1142.
[60] Li, X., Engelbrecht, A., and Epitropakis, M. G. (2013). Benchmark Functions for
   CEC’2013 Special Session and Competition on Niching Methods for Multimodal Function
   Optimization. Technical report, RMIT University, Australia.
[61] Lohn, J. D., Hornby, G. S., and Linden, D. S. (2005). An Evolved Antenna for De-
   ployment on Nasa’s Space Technology 5 Mission. In O’Reilly, U.-M., Yu, T., Riolo, R.,
   and Worzel, B., editors, Genetic Programming Theory and Practice II, volume 8, pages
   301–315. Springer-Verlag, New York. Series Title: Genetic Programming.
[62] Mackay, T. F. C. (2014). Epistasis and quantitative traits: using model organisms to
   study gene–gene interactions. Nature Reviews Genetics, 15(1):22–33. Number: 1 Pub-
   lisher: Nature Publishing Group.
[63] Marstaller, L., Hintze, A., and Adami, C. (2013). The Evolution of Representation in
   Simple Cognitive Networks. Neural Computation, 25(8):2079–2107.
[64] Masel, J. and Siegal, M. L. (2009). Robustness: mechanisms and consequences. Trends
   in Genetics, 25(9):395–403.
[65] McCandlish, D. M. (2011). Visualizing Fitness Landscapes. Evolution, 65(6):1544–1558.
[66] McKinney, B. A. and Pajewski, N. M. (2012). Six degrees of epistasis: Statistical
                                                 79


   network models for GWAS. Frontiers in Genetics, 2(JAN):1–6.
[67] McMullin, B. (2004). Thirty years of computational autopoiesis: A review. Artificial
   Life, 10(3):277–295. Publisher: MIT Press 238 Main St., Suite 500, Cambridge, MA
   02142-1046 USA journals-info@mit.edu.
[68] Melo, D., Porto, A., Cheverud, J. M., and Marroig, G. (2016). Modularity: genes,
   development and evolution. Annual review of ecology, evolution, and systematics, 47:463–
   486.
[69] Mendizabal, I., Keller, T. E., Zeng, J., and Yi, S. V. (2014).                 Epigenet-
   ics and Evolution.       Integrative and Comparative Biology, 54(1):31–42.           eprint:
   https://academic.oup.com/icb/article-pdf/54/1/31/30793676/icu040.pdf.
[70] Miller, K. M., Price, C. C., Okun, M. S., Montijo, H., and Bowers, D. (2009). Is the N-
   Back Task a Valid Neuropsychological Measure for Assessing Working Memory? Archives
   of Clinical Neuropsychology, 24:711–717.
[71] Mira, P. M., Crona, K., Greene, D., Meza, J. C., Sturmfels, B., and Barlow, M. (2015).
   Rational Design of Antibiotic Treatment Plans: A Treatment Strategy for Managing Evo-
   lution and Reversing Resistance. PLOS ONE, 10(5):e0122283.
[72] Moore, J. H., Gilbert, J. C., Tsai, C. T., Chiang, F. T., Holden, T., Barney, N., and
   White, B. C. (2006). A flexible computational framework for detecting, characterizing,
   and interpreting statistical patterns of epistasis in genetic studies of human disease sus-
   ceptibility. Journal of Theoretical Biology, 241(2):252–261.
[73] Mozilla (2022). A-Frame.
[74] Nadeau, C. P. and Urban, M. C. (2019). Eco-evolution on the edge during climate
   change. Ecography, 42:1280–1297.
[75] Niel, C., Sinoquet, C., Dina, C., and Rocheleau, G. (2015). A survey about methods
   dedicated to epistasis detection. Frontiers in Genetics, 6.
[76] Ofria, C. and Wilke, C. O. (2004). Avida: A software platform for research in compu-
   tational evolutionary biology. Artificial life, 10(2):191–229. Publisher: MIT Press.
[77] Ogbunugafor, C. B. and Eppstein, M. J. (2017). Competition along trajectories governs
   adaptation rates towards antimicrobial resistance. Nature Ecology & Evolution, 1(1):0007.
[78] Ogbunugafor, C. B., Wylie, C. S., Diakite, I., Weinreich, D. M., and Hartl, D. L.
   (2016). Adaptive Landscape by Environment Interactions Dictate Evolutionary Dynamics
   in Models of Drug Resistance. PLOS Computational Biology, 12(1):e1004710.
[79] Olshannikova, E., Ometov, A., Koucheryavy, Y., and Olsson, T. (2015). Visualizing Big
   Data with augmented and virtual reality: challenges and research agenda. Journal of Big
                                                80


   Data, 2(1):22.
[80] O’Neill, B. (2003). Digital evolution. PLoS Biology, 1(1). Publisher: Public Library of
   Science.
[81] Owen, A. M., Mcmillan, K. M., Laird, A. R., and Bullmore, E. (2005). N-Back Working
   Memory Paradigm: A Meta-Analysis of Normative Functional Neuroimaging Studies.
   Human Brain Mapping, 25:46–59.
[82] Parsons, K. J., Márquez, E., and Albertson, R. C. (2012). Constraint and Opportunity:
   The Genetic Basis and Evolution of Modularity in the Cichlid Mandible. The American
   Naturalist, 179(1):64–78.
[83] Payne, J. L. and Wagner, A. (2019). The causes of evolvability and their evolution.
   Nature Reviews Genetics, 20(1):24–38. Number: 1 Publisher: Nature Publishing Group.
[84] Pontes, A. C., Mobley, R. B., Ofria, C., Adami, C., and Dyer, F. C. (2020). The
   evolutionary origin of associative learning. American Naturalist, 195(1):E1–E19. ISBN:
   0000000229159.
[85] Puniyani, A., Liberman, U., and Feldman, M. W. (2004). On the meaning of non-
   epistatic selection. Theoretical Population Biology, 66(4):317–321.
[86] R Core Team (2019). R: A Language and Environment for Statistical Computing. R
   Foundation for Statistical Computing, Vienna, Austria.
[87] Real, E., Liang, C., So, D. R., and Le, Q. V. (2020). Automl-zero: Evolving machine
   learning algorithms from scratch. In Proceedings of the 37th International Conference on
   Machine Learning, volume 119, pages 8007–8019. ISSN: 23318422 eprint: 2003.03384.
[88] Rochman, N. D., Wolf, Y. I., Faure, G., Mutz, P., Zhang, F., and Koonin, E. V.
   (2021). Ongoing global and regional adaptive evolution of SARS-CoV-2. Proceedings of
   the National Academy of Sciences, 118(29):e2104241118.
[89] Rosenblatt, F. (1958). The perceptron: a probabilistic model for information storage
   and organization in the brain. Psychological review, 65(6):386. Publisher: American
   Psychological Association.
[90] Sheneman, L. and Hintze, A. (2017). Evolving autonomous learning in cognitive net-
   works OPEN. Scientific Reports, 7:16712.
[91] Singh, A., Thakur, N., and Sharma, A. (2016). A review of supervised machine learning
   algorithms. In Proceedings of the 10th INDIACom; 2016 3rd International Conference
   on Computing for Sustainable Global Development, INDIACom 2016, pages 1310–1315.
   Bharati Vidyapeeth, New Delhi as the Organizer of INDIACom - 2016.
[92] Solé, R. V. and Valverde, S. (2004). Information Theory of Complex Networks: On
                                              81


   Evolution and Architectural Constraints. pages 189–207.
[93] Sotelo-Silveira, M., Montes, R. A. C., Sotelo-Silveira, J. R., Marsch-Martı́nez, N., and
   Folter, S. d. (2018). Entering the Next Dimension: Plant Genomes in 3D. Trends in Plant
   Science, 23(7):598–612.
[94] Uddin, S., Khan, A., Hossain, M. E., and Moni, M. A. (2019). Comparing different
   supervised machine learning algorithms for disease prediction. BMC Medical Informatics
   and Decision Making, 19(1):1–16. Publisher: BMC Medical Informatics and Decision
   Making.
[95] Vikhar, P. A. (2016). Evolutionary algorithms: A critical review and its future prospects.
   In 2016 International Conference on Global Trends in Signal Processing, Information
   Computing and Communication (ICGTSPICC), pages 261–265.
[96] Vostinar, A. E. and Ofria, C. (2019). Spatial Structure Can Decrease Symbiotic Coop-
   eration. Artificial Life, 24(4):229–249.
[97] Wagner, A. (2007). Robustness and evolvability in living systems, volume 24. Princeton
   university press.
[98] Wagner, G. P., Pavlicev, M., and Cheverud, J. M. (2007). The road to modularity.
   Nature Reviews Genetics, 8(12):921–931.
[99] Weinreich, D. M., Lan, Y., Wylie, C. S., and Heckendorn, R. B. (2013). Should evo-
   lutionary geneticists worry about higher-order epistasis? Current Opinion in Genetics &
   Development, 23(6):700–707.
[100] Wickham, H., Averick, M., Bryan, J., Chang, W., McGowan, L. D., François, R.,
   Grolemund, G., Hayes, A., Henry, L., Hester, J., Kuhn, M., Pedersen, T. L., Miller, E.,
   Bache, S. M., Müller, K., Ooms, J., Robinson, D., Seidel, D. P., Spinu, V., Takahashi,
   K., Vaughan, D., Wilke, C., Woo, K., and Yutani, H. (2019). Welcome to the tidyverse.
   Journal of Open Source Software, 4(43):1686.
[101] Wiles, J. and Tonkes, B. (2003). Mapping the Royal Road and Other Hierarchical
   Functions. Evolutionary Computation, 11(2):129–149.
[102] Wiles, J. and Tonkes, B. (2006). Hyperspace Geography: Visualizing Fitness Land-
   scapes beyond 4D. Artificial Life, 12(2):211–216.
[103] Wilke, C. O., Wang, J. L., Ofria, C., Lenski, R. E., and Adami, C. (2001). Evolution
   of digital organisms at high mutation rates leads to survival of the flattest. Nature,
   412(6844):331–333.
[104] Williams, N., Zander, S., and Armitage, G. (2006). A preliminary performance com-
   parison of five machine learning algorithms for practical IP traffic flow classification. Com-
   puter Communication Review, 36(5):7–15.
                                              82


[105] Wiser, M. J., Ribeck, N., and Lenski, R. E. (2013). Long-Term Dynamics of Adaptation
   in Asexual Populations. Science, 342(6164):1364–1367.
[106] Wright, S. (1932). The roles of mutation, inbreeding, crossbreeding, and selection in
   evolution. Proceedings of the Sixth International Congress of Genetics, pages 356–366.
[107] Wu, N. C., Dai, L., Olson, C. A., Lloyd-Smith, J. O., and Sun, R. (2016). Adaptation
   in protein fitness landscapes is facilitated by indirect paths. eLife, 5:e16965. Publisher:
   eLife Sciences Publications, Ltd.
[108] Yu, Y., Si, X., Hu, C., and Zhang, J. (2019). A Review of Recurrent Neural Networks:
   LSTM Cells and Network Architectures. Neural Computation, 31:1235–1270.
[109] Østman, B., Hintze, A., and Adami, C. (2011). Impact of epistasis and pleiotropy
   on evolutionary adaptation. Proceedings of the Royal Society of London B: Biological
   Sciences, page rspb20110870.
                                               83