33%wa . tfikfia 4.. .95. , a.” .fia. a... “myam‘ .fi. 1; .li‘a‘IvSUu- . a . 2. as ., . 2., : a. k...“ .1 1 .45. a. , .. 3.3 V i: .25? .. THESIS zeal LIBRARY Michigan State University This is to certify that the thesis entitled A COMPARISON OF COHORT GA WITH CANONICAL SERIAL AND ISLAND-MODEL DISTRIBUTED GA'S presented by Huafeng Pei has been accepted towards fulfillment of the requirements for Master's fiegreeinikfitflfll Eng Major professor 0-7539 MS U is an Affirmative Action/Equal Opportunity Institution PLACE IN RETURN BOX to remove this checkout from your record. TO AVOID FINES return on or before date due. MAY BE RECALLED with earlier due date if requested. DATE DUE DATE DUE DATE DUE 11m C‘mmS-p14 A COMPARISON OF COHORT GA WITH CANONICAL SERIAL AND ISLAND- MODEL DISTRIBUTED GA’S By Huaieng Pei A THESIS Submitted to Michigan State University in partial fulfillment of the requirements for the degree of MASTER OF SCIENCE Department of Electrical and Computer Engineering 2000 ABSTRACT A COMPARISON OF COHORT GA WITH CANONICAL SERIAL AND ISLAND- MODEL DISTRIBUTED GA’S By Huafeng Pei This thesis considers the cohort genetic algorithm, a new type of genetic algorithm introduced by Holland. The cohort GA differs in several ways from the traditional canonical serial GA and island-model distributed GA, and was developed as a means of reducing “hitchhiking” - premature convergence of currently low-significance loci located near loci at which good building blocks are found early in the search process. The objective of this work is a comparison of one version of the cohort GA with canonical serial and island-model distributed GA’s on the basis of their abilities to reduce hitchhiking. The comparison is done on two test functions: royal road function and hyperplane-defined function. It is experimentally shown that even though theoretically the cohort GA can reduce hitchhiking, the particular version of the cohort GA tested is prone to another form of premature convergence and performs worse than the other GA’s. It is also shown that a small change in the placement of offspring among cohorts in the cohort GA may dramatically change its performance. This suggests that further work on the cohort GA may be fruitful. ACKNOWLEDGEMENTS I would like to express my sincere gratitude to my advisor Dr. Erik Goodman, for his guidance, support and patience during all of my graduate studies. Thanks to my committee members, Dr. Bill Punch and Dr. Robert Schlueter for their support and valuable suggestions. Thanks also go to Prof. John Holland and his student Theodore C. Belding at University of Michigan for their valuable information and suggestions. I am much obliged to Prof. Min Pei and Dr. Subbiah Kannappan for their help and advice during my stay at MSU. I would like to thank my writing group members, Dr. Renate Snider, Dr. Eric Torng, Julide Celik, Andrew Chen, Yong Chen and Georgies Theocharous for their careful proof-readings of my thesis. I am also grateful to the many good friends whose help I could always count on. Finally, and most importantly, I would like to thank my parents and the rest of my family for their love and understanding. iii TABLE OF CONTENTS LIST OF TABLES vi LIST OF FIGURES viiii INTRODUCTION 1 CHAPTER 1 BACKGROUND 3 1.1 Building block hypothesis and Royal Road functions 3 1.2 Hitchhiking 6 1.3 Cohort GA 8 1.4 Canonical Serial GA and its distinction from cohort GA 11 1.5 Island-model distributed GA and its distinction from Cohort GA 12 CHAPTER 2 EXPERIMENTAL DESIGN 15 2.1 Experiment objective 15 2.2 Testbed — Royal Road function and HDF function 15 2.3 Performance Criteria 24 2.4 Cohort GA implementation detail 24 2.4 Modifications made to the original cohort GA 26 2.5 Implementation details of canonical serial GA and island-model distributed GA 29 CHAPTER 3 RESULTS 3.1 Comparison of results using the RR function 3.2 Comparison results using HDF CHAPTER 4 CONCLUSIONS APPENDIX GLOSSARY REFERENCES 33 33 50 58 61 63 LIST OF TABLES Table 1: The average number of function evaluations until the optimum is found and the standard deviation of this average. The number in parentheses is the percentage of runs that the GA achieved that level. Average number shown without parentheses means hundred percent of runs achieved that level. Blank entries indicate that the level was never achieved in 300,000 function evaluations 37 Table 2: Results of the original cohort GA. The average number of function evaluations to achieve level 1 and the standard deviation of this average 39 Table 3: Results of cohort GA with new placement implementation. The average number of function evaluations to achieve level 1 and the standard deviation of this average 39 Table 4: Results of cohort GA with new placement implementation. The average number of function evaluations to achieve level 2 and the standard deviation of this average 40 Table 5: Results of cohort GA with new placement implementation. The average number of function evaluations to achieve level 3 and the standard deviation of this average 40 Table 6: The average number of function evaluations until the optimum is found and the standard deviation of this average. The number of cohorts is 20. The stopping criterion for this test is the total number of function evaluations exceeding 500,000 41 Table 7: The average number of function evaluations until the optimum is found and the standard deviation of this average. The number of cohorts is 35. The stopping criterion for this test is the total number of function evaluations exceeding 500,000 41 Table 8: The average number of function evaluations until the optimum is found and the standard deviation of this average. The population size is 1600. The subpopulation size in the island-model distributed GA is 200 and the number of subpopulations is 8 42 vi Table 9: The average number of function evaluations until the optimum is found and the standard deviation of this average. The population size is 4000. The subpopulation size in island-model distributed GA is 200 and the number of subpopulations is 8 45 Table 10: Maximum fitness values and cohort sizes recorded in a type cohort GA run at some number of circles. Original offspring placement implementation—45 Table 11: Maximum fitness values and cohort sizes recorded in a type cohort GA run at some number of circles. New offspring placement implementation 46 Table 12: The average number of function evaluations to achieve level 1 and the standard deviation of this average 48 Table 13: The average number of function evaluations to achieve level 2 and the standard deviation of this average 48 Table 14: The average number of function evaluations to achieve level 3 and the standard deviation of this average 49 Table 15: Comparison of the maximum fitnesses reached by three GA’s after 2500 function evaluations and the sum of the differences in intron parts. Population size is 200. The small table shows the t-test results. The numbers in parentheses represent the source of the data. For example, (1, 2) means the data from the 20 runs on canonical serial GA and canonical serial GA with niching. The cell with bold font indicates the difference is significant at 0.05 level 52 Table 16: Comparison of maximum fitness reached by three GA’s after 6000 function evaluations and the sum of the difference in intron part. The small table shows the t-test results 53 Table 17: Comparison of maximum fitness reached by two cohort GA’s after 2500 function evaluations and the sum of the differences in intron part 55 Table 18: Comparison of maximum fitness reached by two cohort GA’s after 6000 function evaluations and the sum of the differences in intron part results 56 vii LIST OF FIGURES Figure 1: Royal Road Function R1. The fitness function R1(x) (where x is a bit string) is computed by summing the coefficients 05 corresponding to each of the given schemas (s) of which x is an instance. For example, R1(1111111100...0) = 8andR1(111111111111111100....0)=16 4 Figure 2: Royal Road Function R2. The fitnessfunction R2(x) (where x is a bit string) is computed by summing the coefficients 08 corresponding to each of the given schemas (3,) of which x is an instance. For example, R2(1111111100...0) = 8andR2(111111111111111100...0)=32 5 Figure 3: Royal Road Function R4. Each 3; is an order 8 schema. The desired schemas are s1to $8 and combinations of them. A String that is not an instance of any desired schema receives fitness 1.0. Every time a new level is reached, a small increment u = 0.2 is added to the fitness. For example, strings at level 1 (that are instances of at least one level-1 schema) have fitness 1+0.2 7 Figure 4: Reproduction process of cohort GA. a) Initial Cohorts. b) Offspring of the individuals in cohort 1 will go to cohort j, with 2