LIBRARY Michigan State University PLACE ill RETURN BOX to remove this checkout from your record. TO AVOID FINES return on or before date due. DATE DUE DATE DUE DATE DUE I W V—Wl J MSU it An Affirmative Action/Equal Opportunity institution cmm1 A STUDY OF SINGLE- AND MULTI-LEVEL LOGISTIC REGRESSION MODELS USING REAL AND COMPUTER SIMULATED DATA BY Mohamed Abdulla Kamali A DISSERTATION Submitted to Michigan State University in partial fulfillment of the requirements for the degree of DOCTOR OF PHILOSOPHY Department of Counseling, Educational Psychology and Special Education 1992 éW-aszg Copyright by MOHAMED ABDULLA KAMALI 1992 ABSTRACT A STUDY OF SINGLE- AND MULTI-LEVEL LOGISTIC REGRESSION MODELS USING REAL AND COMPUTER SIMULATED DATA BY Mohamed Abdulla Kamali This study provides a comparative analysis of the advantages and disadvantages associated with computer programs utilizing single- (i.e., SPSS) and multilevel logistic regression (MLR) estimation methods (i.e. , VARCL, MULTILOGIT) . Real and computer simulated data were employed in this study. Five different models of different complexity were investigated using real data. The simulated model included both a random intercept and random regression coefficient. The investigation considered random effects with both normal and t-distributions, various sample sizes of subjects within- groups, and different values of the random regression slope variance. The findings drawn from running the SPSS, VARCL, and MULTILOGIT estimation programs using real data were: (1) The estimated regression coefficient for MULTILOGIT generally had a larger absolute value than both VARCL and SPSS. (2) The standard error estimates for both the within- and between- school variables regression coefficient for VARCL and MULTILOGIT were close and much larger than the SPSS estimates, Mohamed Abdul 1a Kamali while the MULTILOGIT estimates were slightly larger than the VARCL estimates. (3) The estimate of variance-covariance components of the random effects for MULTILOGIT and VARCL were close. However, the MULTILOGIT estimates were generally larger absolute value than the VARCL estimates of the variance- covariance components. (4) There are several limitations of the MULTILOGIT program making its operation very restrictive. The conclusions resulting from. the SPSS, and ‘VARCL estimation programs utilizing simulated data were: (1) Both the VARCL and SPSS estimates of 7’s were found to be significantly negatively biased and inconsistent. ( 2) The SPSS estimates of the standard error of macro parameters were significantly biased and inconsistent, while the VARCL estimates of the standard error of macro parameters were unbiased. (3) The probability of type I error rate under a true null hypothesis for the tests of the macro parameters 7’s were much smaller for VARCL than SPSS. However, both estimation method give unacceptable type I error rate (i.e., p > .05). (4) The VARCL estimates of I”, 1" and 1d parameter were significantly negatively biased. However, the magnitude of the bias and MSE declined as the number of units within each group increased. (5) The VARCL estimates of the standard error for V100, V1", and to, were significantly biased. However, the magnitudes of bias, and MSE were reduced as the sample size within each group increased. Dedicated to my father, mother, brothers, sisters, and my wife and daughter for their continuous love, and blessing. ACKNOWLEDGEMENTS I wish to thank Dr. Stephen W. Raudenbush, my academic advisor and chairperson of my doctoral dissertation for his guidance, insightful comments, and understanding. Working with him contributed greatly to my knowledge, development and understanding of applied statistics. Special thanks are also extended to Dr. Dennis Gilliland for his thoughtful and technical counseling, particularly during the final phases of the study. I also wish to thank the other members of my committee, Dr. William Schmidt and Dr. Habib Salehi. I especially appreciate the support of the President and the Board of Trusteee of the United Arab Emirates University for sponsoring my studies and giving me this unique opportunity to obtain a doctoral degree. Special thanks are given to my friend Ivan Filmer at Michigan State University for his help with editing this dissertation. I wish to thank my father, mother, brothers, sisters and all the other members of my family for their love and support. Special thanks are also conveyed to my brother Adel Kamali for his help in computer programming during the summer of 1991. Finally, I wish to express my deepest gratitude and thanks to my wife and daughter, Noor, for all their love, vi support, patience, and the many sacrifices they made so that this study could be completed. vii LIST OF LIST OF CHAPTER I. II. III. TABLE OF CONTENTS FIGURESOOOOOOO ....... O OOOOOOOOOOOOOOOOOOO 0.... TABLES.. ........................ .. ............ STATEMENT OF THE PROBLEMOOOOOOOOOOOO0.0.0.0.... Introduction...... ...................... . ..... Problem Statement......... ..... . ...... ........ Purpose of the Study.......................... The Need for the Study........................ Research Question ..... ........................ Multilevel Binary Models Used in the Study.... The Real Data Models...... ...... ........... The Simulated Model........................ Demonstrating the Model....................... The Real Data ............................. The Simulated Data......................... Research Procedures........................... Summary..... ..... .... ......................... REVIEW OF LITERATURE ................. .......... Logistic Regression Model..................... Multilevel Linear Model........ ........ ....... Multilevel Logistic Regression Model.......... Development of Multilevel Logistic Regression MOdeIOOO0......0......OOOOOOOOOOOOOOOOOOOOOO summarYOO ....... CO. ..... OOOOOOOOOOOOOOOOOOOOOO METHODOOOOOOOOOO0.0000000000000000000000000.... Introduction.................................. The Pilot Study............................... The VARCL Program.......................... The ML3 Program............................ The MULTILOGIT Program..................... The SPSS Program........................... The Finding of the Pilot Study................ Characteristics of the Real Data.............. Multilevel Logistic Regression Models......... viii p HHWOOQQQGUIUUH CO H N 12 15 17 19 3O 32 32 33 34 36 38 41 41 42 43 IV. MLR Model MLR Model MLR Model MLR Model MLR Model 1 ................................ 2.. OOOOOOOOOOOOOOOOOOOOOOOOOOOOOO 3.............. ..... ........ ..... 4................................ 5................................ Comparing Two MLR Models Using the VARCL Method of Estimation ..... ................... Rationale for Excluding the MULTILOGIT Program Characteristics of the Simulated Model........ The Design of the Simulated study............. Procedure Used to Generate the Simulated Data. Statistical Comparison of the Estimation Methods..................................... Summary.... ............ . ...................... RESULTS... ..... . ......... ... .............. ..... IntrOduCtion ..... ................... .......... Results of the The The The The The The Results Results Results Results Results Results Real Data Analysis............. of of of of of of MLR Model 1................. MLR Model 2. ...... .......... MLR Model 3 ........ ......... MLR Model 4................. MLR Model 5................. Comparing Two MLRM Using VARCL Program ........................... Results of the Simulated Data Analysis........ Comparison of 7's Between the SPSS and VARCL Estimation Methods................. Comparison of the Standard Error of the 7's Between the SPSS and VARCL Estimation Methods.................................. The Effect of n on 7’s..................... The Effect of n on the Standard Error of 7. The Effect of the Random Effects Distribution on 7’s...................... The Effect of the Random Effects Distribution on the Standard Error of 7's The Effect of the RRS Variance Magnitude on ’8...................................... The Effect of the Magnitude of the RRS Variance on the Standard Error of the 7's Checking the Accuracy of the Variance- Covariance Component of the Random Effects Estimate Using VARCL............. Checking the Accuracy of the Estimated TOO Obtained by the VARCL Estimation Method.. The Effect of n on 100..................... The Effect of the Magnitude of the RRS Variance on the Estimated 100............ ix Page 44 45 46 47 48 49 51 53 56 58 62 63 65 65 65 65 68 70 71 73 76 78 78 91 99 100 103 104 107 109 109 111 111 113 Page The Effect of the Random Effects Distribution on the Estimated 100........ 113 Checking the Accuracy of the Estimated Standard Error of VTOO obtained from VARCL Estimation procedure............... 114 The Effect of n on Estimated Standard Error of Vroo ...... ....... ........ ...... ...... 114 The Effect of the Magnitude of the RRS Variance on Estimated Standard Error of VTOO...................... .......... ..... 114 The Effect of the Random Effects Distribution on Estimated Standard Error of «100........................... ..... 116 Checking the Accuracy of the Estimated 111 Obtained by the VARCL Estimation Method.. 116 The Effect of n on The Estimated 111....... 116 The Effect of the Magnitude of the RRS Variance on the Estimated 111............ 118 The Effect of the Random Effects Distribution on the Estimated 111........ 118 Checking the Accuracy of the Estimated Standard Error of V111 obtained from VARCL Estimation procedure............... 118 The Effect of n on Estimated Standard Error of V711 ................................. 118 The Effect of the Magnitude of the RRS Variance on Estimated Standard Error of V111...... .............. ................ 120 The Effect of the Random Effects Distribution on the Estimated Standard Error of V111..... ..... .................. 120 Checking the Accuracy of the Estimated 101 Obtained by the VARCL Estimation Method.. 120 The Effect of n on Estimated 701........... 122 The Effect of the Magnitude of the RRS Variance on the Estimated 101............ 122 The Effect of the Random Effects Distribution on the Estimated rol........ 122 Checking the Accuracy of the Estimated Standard Error of 701 obtained by VARCL Estimation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 2 3 The Effect of n on Estimated Standard Error Of 101 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 123 The Effect of the Magnitude of the RRS Variance on the Estimated Standard Error of 101................................... 123 The Effect of the Random Effects Distribution on the Estimated Standard Error of rol... ..... .. ...... ............. 123 Summary........ ..... .......................... 125 V. CONCLUSION..................................... Introduction.................................. Conclusions Based on the Real Data Analysis... Conclusions Based on the Simulated Data Analysis.................................... Implications of the Findings.................. The Consequences of the Conclusions of the Real and Simulated Data Analyses............ Suggestions for Future Research............... APPENDICES 3-1. 3-2. 3-5. 3-6. AN EXAMPLE OF VARCL PROGRAM "BASIC INFORMATION FILE" SPECIFIED FOR THIS STUDY.............. AN EXAMPLE OF MULTILOGIT PROGRAM "COMMAND FILE" SPECIFIEDFORTHISSTUDY.................... THE DESCRIPTIVE STATISTICS FOR THE REAL DATA AT BOTH THE STUDENT- AND SCHOOL-LEVEL....... A COPY OF THE GAUSS PROGRAM USED TO GENERATE DATA FOR THE GROUP PREDICTOR. . . . . . . . . . . . . . . . A COPY OF THE GAUSS PROGRAM USED TO GENERATE DATA FOR THE WITHIN-GROUP PREDICTOR......... A COPY OF THE GAUSS PROGRAM USED TO GENERATE THE DATA SET FOR THE CELL (ND,n10,RRSS)..... HISTOGRAM FREQUENCY FOR ESTIMATED yOO BY THE SPSS ESTIMATION METHOD...................... HISTOGRAM FREQUENCY FOR ESTIMATED 700 BY THE VARCL ESTIMATION METHOD..................... HISTOGRAM FREQUENCY FOR ESTIMATED 701 BY THE SPSS ESTIMATION METHOD...................... HISTOGRAM FREQUENCY FOR ESTIMATED 701 BY THE VARCL ESTIMATION METHOD..................... HISTOGRAM FREQUENCY FOR ESTIMATED 710 BY THE SPSS ESTIMATION METHOD...................... HISTOGRAM FREQUENCY FOR ESTIMATED 710 BY THE VARCL ESTIMATION METHOD..................... xi Page 126 126 126 127 131 137 138 140 141 142 146 147 148 151 151 152 152 153 153 Page 4-4. HISTOGRAM FREQUENCY FOR ESTIMATED 711 BY THE SPSS ESTIMATION METHOD...................... 154 HISTOGRAM FREQUENCY FOR ESTIMATED 711 BY THE VARCL ESTIMATION METHOD..................... 154 5-1. THE FIRST AND SECOND DERIVATIVE OF THE LOGISTICFUNCTION........................... 155 BIBLIOGRAPHY............ ........................ ...... 156 xii LIST OF FIGURES Figure Page 3-1 The design of the six cells of the simulated study ..... ...... ..... . ............ . ..... .... 59 xiii LIST OF TABLES Table 2-1 Several multilevel binary analysis techniques. 4-1a.--Estimated regression coefficient and standard error (given in parentheses) for MLR model 1 using different estimation methods............ 4-1b.--Estimated variance of the intercept random effects and S.E. for Vroo (given in parentheses) for MLR model 1 using the VARCL and MULTILOGIT estimation methods............. 4-2a.--Estimated regression coefficient and standard error (given in parentheses) for MLR model 2 using different estimation methods............. 4-2b.--Estimated variance-covariance components of the random effects and S.E. for V100, V100, and 101 (given in parentheses) for MLR model 2 using the VARCL and MULTILOGIT estimation methods....................................... 4-3a.--Estimated regression coefficient and standard error (given in parentheses) for MLR model 3 using different estimation methods............ 4-3b.--Estimate variance-covariance components of the rando effects and S.E. for Vtoo,.VIoo, and 101 (given in parentheses) for MLR model 3 using the VARCL and MULTILOGIT estimation methods...... ......... ........................ 4-4a.--Estimated regression coefficient and standard error (given in parentheses) for MLR model 4 using different estimation methods............ 4-4b.--Estimated variance-covariance components of the random effects and S.E. for Vroo, Vroo, and 101 (given in parentheses) for MLR model 4 using the VARCL and MULTILOGIT estimation methods....................................... xiv Page 20 66 67 68 70 71 72 72 73 Table Page 4-5a.--Estimated regression coefficient and standard error (given in parentheses) for MLR model 5 using different estimation methods............ 74 4-5b.--Estimated variance-covariance components of the random effects and S.E. for V100, V100, and 101 (given in parentheses) for MLR model 5 using the VARCL and MULTILOGIT estimation methods....................................... 76 4-6.--Estimated regression coefficient, and standard error (given in parentheses) for the model A having random intercept and random regression slope and model B having random intercept and fixed regression slope using VARCL estimation method......................................... 77 4-7.--The true value, Mean, S.E., MSE, and bias for estimated 7's of the SPSS and VARCL estimation procedures..................................... 80 4-8.--The true value, Mean, S.E., MSE, and bias for estimated yoo by cell identification for the SPSS and the VARCL estimation procedure........ 81 4-9.--The true value, Mean, S.E., MSE, and bias for estimated yol by cell identification for the SPSS and the VARCL estimation procedure........ 82 4-10.--The true value, Mean, S.E., MSE, and bias for estimated 710 by cell identification for the SPSS and the VARCL estimation procedure....... 83 4-11.--The true value, Mean, S.E., MSE, and bias for estimated 711 by cell identification for the SPSS and the VARCL estimation procedure....... 84 4-12.--Estimated 700 and estimated 100 for different models using VARCL estimation methods......... 87 4-13.--Estimated yoo and estimated 100 using VARCL program....................................... 89 4-14.--The true value, estimated 7's, for three replications having the fully fixed model using the SPSS estimation method.............. 90 4-15.--The true value, Mean, S.E., MSE, and bias for estimated standard error for macro parameters by the SPSS and VARCL estimation procedures... 93 XV Table 4-16.--The true standard error, Mean, S.E., MSE, and bias for estimated standard error of 701 by cell identification for the SPSS and the VARCL estimation procedure.......................... 4-17.-—The true standard error, Mean, S.E., MSE, and bias for estimated standard error of ylo by cell identification for the SPSS and the VARCL estimation procedure.......................... 4-18.--The true standard error, Mean, S.E., MSE, and bias for estimated standard error of 711 by cell identification for the SPSS and the VARCL estimation procedure.......................... 4-19.--The probability of type I error rates for tests of macro estimators under a true null by the SPSS and VARCL estimation procedures...... 4-20.--The probability of type I error rates for tests of macro estimators under a true null by cell identification for the SPSS and the VARCL estimation procedure.......................... 4-21.--The true value, Mean, S.E., MSE, and bias for estimated 7's by the number of subject within each group for the SPSS and VARCL estimation procedure..................................... 4-22.--The true value, Mean, S.E., MSE, and bias for estimated standard error for macro parameters by the number of subject within each group for the SPSS and VARCL estimation procedures...... 4-23.--The true value, Mean, S.E., MSE, and bias for estimated 7's by the distribution of the random effects for the SPSS and VARCL estimation procedure.......................... 4-24.--The true value, Mean, S.E., MSE, and bias for estimated standard error for macro parameters by the distribution of the random effects for the SPSS and VARCL estimation procedure....... 4-25.--The true value, Mean, S.E., MSE, and bias for estimated 7's by the magnitude of random regression slope variance for the SPSS and ‘VARCL estimation procedure.................... xvi Page 94 95 96 97 98 101 102 105 106 108 Table Page 4-26.--The true value, Mean, S.E., MSE, and bias for estimated standard error for macro parameters by the magnitude of random regression slope variance for the SPSS and VARCL estimation procedure..... ......... .... ..... .............. 110 4-27.--The true value, Mean, S.E., MSE, and bias for estimated 100 by cell identification for the ‘VARCL estimation procedure.................... 112 4-28.--The true value, Mean, S.E., MSE, and bias for estimated standard error for 4100 by cell identification for the VARCL estimation procedure..................................... 115 4-29.--The true value, Mean, S.E., MSE, and bias for estimated 111 by cell identification for the VARCL estimation procedure.................... 117 4-30.--The true value, Mean, S.E., MSE, and bias for estimated standard error for 4111 by cell identification for the VARCL estimation procedure..................................... 119 4-31.--The true value, Mean, S.E., MSE, and bias for estimated 101 by cell identification for the ‘VARCL estimation procedure.................... 121 4-32.--The true value, Mean, S.E., MSE, and bias for estimated standard error for 101 by cell identification for the VARCL estimation procedure....... ..... ......................... 124 4-33.--A summary of several statistics for different parameter by the SPSS and the VARCL estimation procedure..................................... 125 xvii CHAPTER I STATEMENT OF THE PROBLEM Introduction One of the major difficulties in quantitative research in education is the departure from the normality of errors in statistical models. This departure may affect estimates, confidence intervals, and statistical conclusions. For example, researchers are frequently confronted with analyzing an observed behavior that is dichotomously scored, where '1' indicates an occurrence:of the response, and ’0' indicates the absence of a response. Models for such outcomes cannot have normal errors. A second concern in educational research has been the appropriate analysis of multilevel data. For the past decade, there have been advances in multilevel data analysis with normally distributed outcomes. These have led to the development of several computer programs capable of analyzing data that have two or more levels of hierarchy. Some examples of such computer programs are GENMOD by Benjamin Hermalin based on Mason, Wong and Entwistle (1983); HLM by Bryk, Raudenbush, Seltzer and Congdom (1988) based on Raudenbush and Bryk (1986); ML3 by Rabash, Prosser and Goldstein based on Goldstein (1986); and VARCL by Longford based on Aitkin and Longford (1986) (all cited in Kreft and Kim, 1990). However, 2 there have been concerns regarding the violations of the normality assumptions of the residuals in these programs. This is a concern especially for multilevel data that are dichotomously scored. Some examples of multilevel data structures having binary outcomes that are common in the field of education are: 1. Student nepetitign, where a value of "1" indicates that the student has repeated a grade, while a value of "0" indicates that the student has never repeated a grade; 2. §tug§nt pegsistence in scngol (dropouts vs. non- dropouts); 3. Stude t status a ' a m s. non-nastegy; 4. u co r vs ' ct s o t iLQE; and 5. Stnggnt_n§;gngnng§ in college. In all of these above-mentioned examples, there is a need to estimate the effect of both. the student. and school characteristics on student performance. Unfortunately, the logistic regression model, which was specifically designed for analyzing binary outcomes, is not capable of taking into account the inherent hierarchical structure of the data. This has led to the advancement of several different approaches that take into account both (a) the binary response, and (b) multilevel data structure. However, the 3 applications of these estimation methods in the field of educational research have been limited. P b e S atem n Several multilevel binary estimation methods have been proposed that take into account the multilevel data structure and binary outcomes. However, the advantages and the disadvantages of these proposed multilevel binary estimation methods have not been investigated.' Knowledge of the advantages and the disadvantages of these methods will help researchers make informed decisions about their applications in the field of education. This study will identify the limitations of three different estimation methods that take into account both (a) the binary response and (b) multilevel data structure. In addition, the estimation method for the binary response of single level data will be investigated. The accuracy and statistical properties of the estimates for the multilevel logistic regression (MLR) model estimation methods and the single logistic regression model estimation procedure will be evaluated on real and computer simulated data. u se 0 Stu The purpose of this study was to analyze and compare the single and mmltilevel logistic regression model estimation 'methods using four computer programs. The first.three computer programs were based on different estimation methods designed 4 for hierarchical data with binary outcomes, while the fourth program was based on single-level data with binary outcomes. The four computer programs used in this study were as follows: 1) Generalized Least Square method (Goldstein, 1990) using the ML3 program; 2) Quasi-likelihood method (Nelder and Pregibon, 1987) using the VARCL program; 3) Empirical Bayes estimation method (Wong and Mason, 1985) using the MULTILOGIT program; and 4) Maximum Likelihood estimation method using the SPSS program for single level data. The following statistics were used to compare the four different estimation methods using data obtained from a national survey in Thailand: 1) the estimated regression coefficients for both student-level and school-level variables; 2) the estimated standard errors of the regression coefficients for both student-level and school-level variables; and 3) the estimated variance-covariance components of the random effects. In addition, simulated data was generated to evaluate the parameter estimations obtained by using the four computer programs. The following criteria were used to gauge the accuracy of each estimation method: 1) the difference between the estimated regression coefficient and the true regression coefficient of 5 both within- and between-school variables; 2) the difference between the estimated variance component estimates and its true value; and 3) the difference between estimated standard error and the true standard error for: (a) within- and between-school regression coefficients, (b) the variance-covariance components of the random effects. The above analyses were based on data generated according to three factors with two levels within each factor. These factors are the number of students within-school (small vs. large), the magnitude of the random slope variance (small vs. large), and distributions (both normal and t-) of the random effects terms. This resulted in a 22+2=6 design matrix (the t- distributed random effects were investigated only under large random regression slope variance magnitude). e eed t Stud Hopefully this study will not only identify the advantages, disadvantages, and estimation accuracy of existing multilevel binary estimation methods; but also inform researchers about the effect of student sample Size within a school and the magnitude of the random regression slope. This could provide a researcher with the basis to decide which method is more appropriate in analyzing school-related data with various sample sizes when dealing with binary outcomes. 6 It is hoped that this study will narrow the gap between methodologists and practitioners in the field of multilevel binary data analysis. It is further hoped that this study will also emphasize the need for researchers to concentrate on new areas of research in the development of multilevel binary data analysis, rather than duplicating an already developed approach with minimal changes. Reseanch Questinn The following research question guided the analysis of the data for this study: Is ~there a difference in the accuracy of parameter estimation between i) the Generalized Least Squares, ii) the Quasi-likelihood, iii) the Empirical Bayes, and. iv) the maximum likelihood (i.e. , SPSS program) estimation methods, in relation to the multilevel logistic regression model? This research was applied to both real and simulated data. However, the accuracy'of'parameter estimates for the simulated.data was evaluated for sixty groups according to the following conditions: (i) a small sample of 10 students and a large sample of 60 students within a school, (ii) the magnitude of .005% and 17.6% of the intercepts variance for the random regression slope, and (iii) the normal distribution and the t-distribution of the random effects, 05 and U”. 7 Multilevel Binary Models Used in the Study In this study the multilevel logistic regression model was applied to both real and simulated data. e ea at o e s The real data analysis was based on data obtained from a national survey in Thailand. This data was collected in 1988 by the National Education Commission of Thailand using a multistage cluster sample design. The entire sample consisted of 411 school principals, 3808 teachers, and 9768 sixth-grade students. Thus, the sample included both student- and school- level variables. For more information.about the sample and the sample design, please refer to Raudenbush and Bhumirat (1989) . Because of MULTILOGIT computer program limitations, the real data analysis was based on 59 schools only. In order to compare the four computer programs, the same number of schools had to be used in each real data analysis. Five different Multilevel Logistic Regression (MLR) models, from a simple to more complex models, were considered in real data analysis. These five models are shown in Chapter 3. ted The simulated analysis used the random intercept and random regression coefficient model. The simulated model contained one student-level variable and one school-level variable for the between-school model. 8 The within-group model was represented as where cm is the latent outcome variable which has been transformed to the log-odds by aij=log(0fi/ (1-0ij) ) , 0ij is the predicted probability of the subject (or student) i obtaining a value of 1 if he goes to the j-th group (or school), satisfying P(ilij=1)=9ij and P(Yij=0)=1-0ij. This is assuming that Yij's have a Bernoulli distribution with parameter 05 ( E(Yij)=0ij, ifij I 9i," Bernoulli (Oij) ) ; )g is the within-group level predictor for student 1 in the school j; B Bu were random logistic regression coefficients ojl across groups. Blj had both small and large magnitudes of variability between groups. The between-group model was represented as Boj = 70° + 701 Zj + U01- (1.2) Bij = 71° + 7n Zj + Uij (1°13) where 0g is the random effect where k=0 or 1, each with a mean of zero, and some variance Var(Uh) = Tu- For any pair of random effects 3 and p, Cov(k,k.) = 1a.; Eiis the predictor for the school level; 7“ is the overall intercept, and 7“ are the regression coefficients that capture the effects of school-level 9 variables on the school regression coefficient Bkj. Thus the combined model was represented as “a = You + 'Yoi Zj + 710 xij + 711 (Z; * Xij ) + (U05 + U1; xij) (1-4) Demgnstnating the Mgdel The models were demonstrated using real and simulated data. 1. The Real Data: the analysis was based on 59 schools which were selected randomly from 411 schools. This data contained seven student variables (i.e. sex, dialect, SES, pre-primary education, repetition, having breakfast daily, having lunch daily) and five school level variables (i.e., urban/rural, central, north, south, mean SES). However, only one student variable (i.e., SES) was used. This was due to the inability of the MULTILOGIT program to operate because of the small number of students within each school in the sample. The number of students within each school ranged between 8 and 37. This inability of the MULTILOGIT program is considered as one of its major weaknesses. The sample used for the study did not have any missing data. This was to ensure that the data remained the same when analyzed using different computer programs. 2. The Simulated Qatn: An independent variable was produced having random regression slopes of .005% and 17.6% of the intercept's variance. In addition, the simulated data 10 contained data sets of 10 and 60 subjects (students) within a group (school) and, the normal and t-distribution of the random effects, Uq and U“. Finally, a single school-level variable (a) was also produced. This resulted in a simulated model having both a random intercept and a random regression slope. Research Engcegunes Several techniques for analyzing multilevel binary data have been identified and a summary is shown in Table 2-1. Letters were sent to each of the researchers listed in Table 2-1 requesting their programs. The analyses in this study were based on three multilevel binary analysis programs (i.e., ‘VARCL, ML3, MULTILOGIT' programs) that. were obtained. In addition, a single-level regression model estimation method (i.e., SPSS Program) was carried out. Summany In spite of the development of several proposed methods of analyses for multilevel data with binary outcomes, the popularity of these programs and their applications in the field of educational research are limited. In addition, each of these programs has its own strengths and weaknesses. This study was aimed at conducting an analysis of several promising multilevel binary estimation methods consisting of the advantages of both the logistic regression model and the multilevel linear model. Often educational researchers are 11 interested in analysis that takes into account the multilevel structure of the data and the nature of the binary responses of the students. The analysis of this study was based on comparing three multilevel and a single-level binary estimation method on real and simulated data. For the real data analysis, five different multilevel logistic regression (MLR) models (ranging from simple to more complex) were used. While the simulated model included both.a:random intercept.and random regression coefficient, the generated (simulated) data considered the normal and t-distribution of the random effects, Uoj and U“. In addition, the effect of small and large sample sizes of students within-school was investigated. Finally, a small and large magnitude of the random regression slope was also investigated in the simulated data. CHAPTER II REVIEW OF LITERATURE This review'of the literature‘will present an overview of the development of the logistic regression model, the multilevel linear model, and finally, the advancement of the multilevel logistic regression model. Logistic Benzession Model Concerns regarding the distribution of normal errors in the case of data with binary outcomes have led to the development of the logistic regression model. This model was specifically designed for analyzing binary data (Cox, 1970). For single level data with binary outcomes where (Y9 takes the values "0" and "1" the expected value of Yiis E(Yi) = P(Yi = 1) = 0i (201) where Q represents the probability of‘m equal to 1 (probability of success), and 1-fi represents the probability of'm equal to 0 (probability of failure). If the researcher wishes to investigate the dependence of 0i on the independent variables (X1 X2 . . . xv) , one possible way is to employ the ordinary linear regression technique where 12 13 the model may be written as y. ,--= B. + 31x“ + 132x2i + ...+ 13pxpi + ei (2.2) where Bo represents the intercept; and B,,B2,...,BP represent the regression coefficients that characterize the relationship between the independent variables, mefi, . . . ,x,,, and the dependent variable, 31,. The two basic assumptions of the linear regression model represented by Equation (2.2) are: (a) 6i (error term) is a random variable with mean zero and variance 0’, that is E(ei)=0, V(ei)=02; and (b) 6i and 51- are not correlated, i¢jso that Cov(ei,ej)=0; thus the variance of ifi = 02 and Y, and Y,- where i¢j are not correlated. A further assumption which is not necessary for estimation, but is required in order to apply statistical tests such as the t- or F-tests, is that ei is a normally distributed random variable with mean zero and variance 0’, that is 5, ~ N(0,az) (Draper and Smith, 1966). Thus 5i and 6’- are not only uncorrelated but also independent. However, the literature has cited several inadequacies and limitations of the linear regression model (Cox, 1970; Cox and Snell, 1989; Scheffe, 1959; Dunteman G., 1984; Hosmer and Lemeshow, 1989; Weisberg S., 1985; McCullgh and Nelder, 1989; Hanushek and Jackson, 1977; Clogg C., 1990; Efron, 1975; Anderson, 1980; Bull and Donner, 1987; Haberman, 1974, 1977). The main disadvantages of the linear regression model have been attributed to the violations of assumptions that Yi's are normally distributed with mean 0i and variance 02 , and 0i is l4 linearly dependent on Xi’s. The limitations and the disadvantages of the above linear model could be summarized as follows: 1. It is quite possible that the predicted values of Q will exceed one or be a negative value. 2. Since Yitakes only the values 0 and 1, then Yfism and variance of Yi= &(l-Q). This violates the assumption of the least squares estimate that variance (L) = a2 (i.e., the assumption of homoscedasticity). Using the least squares estimate could give us an unbiased estimate of B" but it is not an efficient estimator. This has led to the development of the logistic regression model which addressed the above problems by transforming the probability of success into a continuous variable that can take any value on the real line (-w,w). The logistic regression model is represented as follows: Logit (0i) = Log (oi/(1-0,))= 13. + 13.xli +...+ raspxpi (2.3) The logistic regression model is a sensible method for regression analysis of dichotomous data for two primary reasons. First, from a mathematical point of view, it is an extremely flexible and easily-used function. Second, it lends itself to a substantively meaningful interpretation (Hosmer and. Lemeshow, 1989). It is the interpretation of the logistic regression coefficients that is the fundamental reason why logistic regression has proven to be such a powerful analytic tool for research (Breslow and Day, 1980; Alba R., 1987). 15 However, there are some disadvantages of the logistic regression model when dealing with data sets involving two or more levels of hierarchy. Assuming that the B's (regression slopes) are all fixed effects ignores the school (or group) effect on the variability between regression slopes. In fact, the concerns regarding usage of the single-level logistic regression model level are similar to the concerns about using the single-level regression model when analyzing data sets involving continuous outcomes and two or more levels of hierarchy. Multilevel Linea; Model There has been much educational research concerning the ability of a single-level regression model to deal with the hierarchical structure of data. In fact, most educational data can be seen as hierarchical where the lower level units are nested within.the upper level units. For example, students are nested within classes, classes are nested within schools, schools are nested within districts, districts are nested within counties, and counties are nested within states. Single-level analyses of data have led 'to several concerns regarding the unit of analysis and the violation of random sampling procedures (Langbein, 1977; Burstein, 1980; Kreft, 1987; Haney, 1980; Robinson, 1950; Alker, 1969; Hannan, 1971; Glass and Smith, 1979; Raudenbush and Bryk, 1988). This has led to the development of the multilevel linear model. 16 Within the field of educational research, this model not only illustrates the effect of student variables on the outcome but also the effect of school variables on both the aggregated student-dependent variable and the estimated within—school regression coefficients. This model may be represented by two equations. For the within-school (i.e. group) model, we estimate a separate regression equation for each school: yii = 130,. + aux”). + ..... + aux“). + rij rij ~ N(0,02) (2.4) where i=1,2,...,nistudents in school j, j=1,2,...N schools, and k=1,2,..,k independent variables within schools. In this model‘fijis the response for student i in school j, Xkij is the value of student-level independent variables k, and rij is the random error. However, the assumption of r3 ~ N(0,a?) is violated due to the binary nature of the outcome through Bk- are regression variable . Coefficients B,- J J coefficients that characterize the relationship within school j, and Bo is the intercept for each school. The between-school (i.e. group) model is given by a”. = 7:. + ynz”. +...+ 76ij + U”. ukj ~ N(O,1) (2.5) where Ur J the random effects k=0,1,...,k are assumed to be multivariate normal, each with a mean of zero, and some variance Var(Uh-) = 1“. For any pair of random effect it and r, Cov(k,k.) = 1a.. z,,...,zp are independent school variables, yd, 17 is the overall intercept, and 7m"°'VhK are the regression coefficients that capture the effects of school-level variables on the school regression coefficient Bkj adjusted for student intakes. The key assumptions of the multilevel linear model are (a) the errors, I}, are normally distributed; and (b) within- group regression coefficients (B's) are assumed to be multivariate normally distributed. Both will be violated if the outcome in the within-group model is dichotomous (Leonard T., 1972b). Mnltilevel nglstlc Bennesgign Mgggl In the case of binary response data, there have been concerns regarding the violations of normality assumptions of the residuals. These concerns have been indicated by several researchers (e.g. Mason et al. 1984; Clogg et al. 1990; Leonard, 1972a, 1972b, 1975; Anderson and .Aitkin, 1985; Stiratelli et al. 1984; Wong and Mason, 1985; Raudenbush and Bryk, 1986; Raudenbush, 1988; Goldstein, 1987; Braun, 1989; Lindely and Smith, 1972 in discussion p. 24). Recently, Longford (1990) has expressed this concern by stating: Normal distribution of the random terms in multilevel analysis is an important restrictive assumption. Much of the observational data in the social sciences are inherently discrete, and in the extreme, binary (e.g., Yes/No responses to survey questions). For such data the normal linear multilevel analysis is not appropriate not only because of the violationlof the assumption of normality, but also because we usually wish to use a 18 nonlinear scale such as the logit for binomial data, logarithm for Poisson data, etc. It is therefore desirable to have an extension of the multilevel methods for a wider class of distributional assumptions, which would at the same time be an extension of the methods for regression' analysis of independent non-normally distributed data. (p. 2) Mason et a1, (1984) have.also indicated a similar concern for estimation methods that account for discretion: The methodology presented in this chapter by no means exhausts the subject of multilevel estimation. There is a need for estimation procedures to handle discrete micro response variables. (p. 100) After comparing four major computer packages for multilevel linear regression techniques (i.e. GENMOD, HLM, ML2 and.‘VARCL), Kreft. et (al. (1990a, 1990b) found. that. the assumption of linearity in existing techniques and the assumption of normality of residuals were the limitations of some existing multilevel techniques. In fact, the inadequacy of the multilevel linear model due to the violation of the normality assumption of the residuals could contribute to the following concerns: 1. Inadequacy in estimates of the within-school (or group) model variance, 0’. Since Yb- takes only the values 0 and 1, then Viiz‘Ya and variance of if,Ii = OECL - Bfi). This violates the assumption of the estimate that variance(Y,)=a’ could effect the l9 estimated standard error of the within-school coefficient (B's) when the hypothesis, Ho: B=0 is tested. The above concerns have also been stated by Raudenbush and Bryk (1986): There has been little empirical work on the consequences of violating normal distribution assumptions in HLM, but we suspect that problems are most likely to occur in estimates of the model variances, 0’ and 1, and in hypothesis- testing application. (p. 14) 2. It is possible that since‘fijtakes values 0 and 1, the obtained fitted values of the regression parameter for the linear regression model would not satisfy the condition that 0 s E(Y|X,Z) = 05 s l. The research literature has also shown several different approaches to overcome the above concerns and to take into account both the binary response and multilevel data structure. (These approachestare summarized in'Table 2-1). The majority of these approaches are based on the idea that new techniques should consist of the advantages of both the logistic regression model and the multilevel linear model. e o e M ' eve 's 'c Re s' Initial concerns regarding using multilevel linear model analysis (Equations 2.4 and 2.5) in the case of binary outcomes were indicated by Leonard (cited in a commentary by Lindley and Smith (1972), where he stated: I would like to make a few remarks about the possible extension of the excellent ideas expressed in this paper to 20 situations where the exchangeable parameters cannot be considered to be normally distributed. In such circumstances, a good procedure is usually to transform the parameters in such a way that the normality assumption is more realistic for the new parameters. (P- 24) Table 2-1.--Several multilevel binary analysis techniques. Author Methods of estimation Methodological Reference Leonard T. Bayesian* Lenoard (1972a, 1972, Chamberlain G. Wong G. & Mason N. Stiratelli et al. Anderson D.& Aitkin M. Clogg C. et al. Longford N. Goldstein H. Korn E. & Whittmore A. Maximum Likelihood Empirical Bayes Maximum Likelihood for fixed effect & variance component. Empirical Bayes estimate of random effect. Maximum Likelihood Bayesian* Quasi-likelihood Generalized Least Square Maximum Likelihood* 1975) Chamberlain (1980) Wong & Mason (1985) Stiratelli et al. (1984) Anderson & Aitkin (1985) Clogg C. et al. (1990) Nelder & Pregibon (1987) Goldstein H. (1990) Horn & Whittmore (1979) * The goal of these methods is to combine the regression coefficients across groups into single coefficients for each of the covariates (i.e.borrowing strength). Leonard suggested the use of Log-odds transformation (where E(Yi)= 0i fOr i=1'2'ee’n' Y, being independent and binomial distributed with parameter 0,) where a, =log(03/(1-0i )) , with the assumption that ai ~ N(u.,a’), where u is uniformly 21 distributed and 02 possesses an inverse x? distribution when 02 is known. The main point for this transformation was to estimate.0L adjusted for each group and the overall mean. Using the Bayesian estimation.procedure, Leonard (1972a, 1972b, 1975) extended his ideas for binary data with an application to the prediction of college (i.e. group) success rates (Y3). In this case, student college grades corresponded to the pass/fail situation with i=1,2,...,ninumber of student within-college (or group) , and j=1, . . ,N colleges. By combining the available information (Xuj's, student independent ‘variables, student test scores on k different scales previous to college entry) from all the colleges to obtain predictors, more reliable results were produced than if the predictors were based only on information from one college. Leonard assumed that Yij's are mutually independent and have a Bernoulli distribution with parameter 0,5 (E(Yij)=0fi, YiJ-Ioir Bernoulli(0ii) ) . For the within-college model, a separate logistic regression equation was estimated for each college (the symbols have been modified in order to be consistent with previously used symbols) aij = 805 + Buxfij +.....+ Bijuj (2.6) where cm is the latent outcome variable which has been transformed to the log-odds by afi=log(0ij/(1-0ij)); 93 is the predicted probability of the student i obtaining a degree if he goes to the j-th college, 22 satisfying P(i(ij=1)=0ij and P(Yij=0)=1-0~ given the Yij, ij’ binary outcome for student i (i.e., pass/fail) with college j; Bo through Bu are within-college level logistic regression coefficients; and Km is the within-college level predictor k for student i in the school j. This is assuming that the within- college logistic regression coefficient Bj's are exchangeable. In addition, Leonard made two assumptions regarding the prior distribution for the vectors of the logistic regression coefficients (i.e. Bq,...,BU). (a) Given pa and H9, the (80,-, ...,Bkj) are independent and have multivariate normal distributions with common mean vector u.B and precision matrix H3. (b) the mean vector pg is uniformly distributed over (K) dimensional real space. Also WHB is independent of unand has a Wishart distribution with W degrees of freedom and parametric matrix 25% Leonard applied a Bayesian approach (estimating the joint posterior modes for Bq,...,Bn) with Newton's iterative procedure in order to obtain within-college coefficients. However, he encountered a problem in finding a starting value for the within-college coefficients (the st). In addition, .his model did not include school (or college) level variables at the second stage of the model. But the aim of the model was 23 to combine the available information (i.e. within colleges) in all colleges to obtain predictors which were more reliable than if the predictors were based.only on the information from one college (i.e. borrowing strength). However, several researchers have indicated a concern regarding the use of the approximate normal distribution for the posterior distribution (Laird, 1978; Laird and Louis, 1982; Geisser, 1984). As Laird and Louis (1982) indicated: The normal approximation has been used (see Leonard, 1975; Laird, 1977), but no indication of its validity was given in these papers . For the censored exponential, the normal approximation fails to account for the skewness of the gamma; for the 99.9 percent confidence interval it produces a negative left endpoint. (p. 199) Chamberlain (1980) studied a random effects model for binary outcomes.in which.the intercepts were assumed to follow a distribution (i.e, random intercept for the within-group model), while other logistic regression coefficients were fixed across groups (in Ihis study the groups were the individuals) in order to capture group differences. Thus, for the within-group model (similar to Equation 2.6, with assumed fixed regression slope’s, B,, ...,Bk) . a-- = 130,. + 13,xIii +.....+ six“). (2.7) While the between group model is Boj = 7m + 7°1le+.....+ ‘yq’xpj + U05 (2.8) (Assuming that 05 are independent and identically distributed.) 24 In the above analysis, Chamberlain’s concern was the within-group estimator (B,,...,Bk). Thus he used a random intercept (i.e., Bo) :model in order to capture omitted variables that were group specific. Maximum likelihood procedures were used to estimate the model's parameters. Wong and Mason (1985) introduced a multilevel binary model called a "Hierarchical Logistic Regression Model" which combined the advantages of the multilevel linear'model and the logistic regression model when dealing with binary outcomes. They used the logistic regression model as the within-group model (i.e. within-school) and the multilevel linear model (Equation 2.5) for the between-group model. Thus, the within- group model is similar to Leonard’s (1972b, 1975) model (Equation 2.6), where a separate logistic regression coefficient was estimated for each group. The between-group model (Equation 2.5) represented.the:effect.of group variables on the estimated logistic regression coefficients for each group. This allowed the specification of the effect of the upper level (i.e., groupumembership) on the lower level of the hierarchy. In fact, the major difference between Equation 2.3 (the fixed effect model) and the above two-stages model of Wong and Mason (also known as the mixed model) is the presence of the error terms in the between-group model (i.e., UQ;UU,...,UH). Thus, if the between-group error terms are suppressed, the multilevel logistic regression model becomes a logistic regression model. 25 In deriving their mixed model, Wong and Mason have made two main assumptions: (a) within-group regression coefficients (83’s) are assumed to be normally distributed over group membership; (b) there is flat prior in the between-group coefficients given by 7 ~ N(m,E) , 2:" --->0. In addition, it is assumed "...that the njare large enough to permit estimation of all Br" (p. 514). In fact, the above hierarchical logistic regression model could be viewed as a classical discrete mixed model with fixed effects, 7, and random effects, Uh. Empirical Bayes estimation procedures were used to estimate the parameters of the model where 1 was estimated by the indirect Maximum Likelihood estimator using the EM alogarithm (Dempster et al. 1977, 1981). This was because of the difficulties in direct numerical maximization of the likelihood. Approximate posterior interval estimates were used to estimate 7’s and st. In spite of the advantages of the proposed Wong and Mason model which takes intOTaccount.the‘multilevel structure of the data and the nature of the binary student response, there are several concerns. These concerns are summarized as follows: (a) The above model requires a large sample sizewithin each group in order to permit the estimation of all Bj's (Wong and Mason, 1985 p.514). This is often not the case in the field of education where the number of students within each school is small. In their study, Wong and Mason used the countries as the unit of the analysis in the second stage of their model (i.e. between-group model); 26 (b) Another concern that is also indicated by Wong and Mason is that, "Extensive exploration of the data using the computational procedure described here is costly for large data sets, because of the slow convergence of the EM algorithm for variance and covariance component problems. " (p. 522) (c) Raudenbush (1988) also indicated some concerns regarding the estimation procedures of Wong and Mason (1985): ...the data are binomial distributed conditional on the logistic regression coefficients for each country. These "random coefficients" are then assumed normal. Since the normal is not the conjugate prior for the binomial, the exact form of the posterior is intractable, but the authors provide a normal approximation which facilitates inference. (p. 98) Stiratelli, Laird and‘Ware (1984) introduced a different estimation procedure for a more general mixed model (similar to the Wong and Mason model) where they also assume that the logistic parameter for each group to be normally distributed in the population. Their estimation is based on the Maximum Likelihood estimation of fixed effects (the 7's) and Maximum Likelihood using the EM algorithm for variance components and empirical Bayesian estimation of the random effects (the Ug's). In fact, the approach of Stiratelli et al. is a generalization of Horn and Whittemore (1979) that assumes a logistic regression model with normally distributed random coefficients (i.e., random-effect.model). Korn and Whittemore 27 used a maximum likelihood estimation procedure that is based on a separate, logistic regression for each group. However, Horn and Whittemore's concern was to combine logistic regression coefficients across groups into a single logistic regression coefficient for each of the covariates (similar to Leonard, 1972b). Therefore, they did not include any group variables into their model. Similarly, Clogg et al. (1990) introduced a simple Bayesian method in order to combine the logistic regression model across different regressions in a single equation. However, this estimation procedure is based on the maximum Posterior estimation that assumes Jeffrey’s prior (i.e., noninformative prior, see Box and Tao, 1973 p.41; Rubin and Schenker, 1987) for the logistic regression model (i.e., B’s). Anderson and Aitkin (1985) derived a Maximum Likelihood estimation procedure in order to estimate the parameters in multilevel logistic and probit models. The logistic regression model was used, where:the interviewee was considered as lower- level and the interviewer as upper-level of the hierarchy model. Their estimation is method based on the Bernoulli model for the binary response with the underlying assumption that the dependent variable is normally distributed. In addition, it was assumed that the random intercepts (random effect) had a normal distribution, and there was a fixed effects with its associated covariates. Anderson and Aitkin concluded that the proportion of the variance of the dependent variable that is explained by 28 variance component is nearly double the ANOVA estimate. They suggested that the use of ANOVA methods needs to be examined closely. Longford (1988) used a quasi-likelihood estimation procedure based on the Nelder and Pregiborn (1987) estimation method. This is an extension of the quasi-likelihood estimation method "...to allow the comparison of variance function as well as those of linear predictors and link functions." (p. 221). To obtain quasi-likelihood estimates, there is the need to define the quasi-likelihood function which is only to specify the relationship between the mean and the variance of the observation. But in order to define a likelihood function there is the need to specify the form of the distribution of the observation (Wedderburn, 1974). In fact, maximum quasi- likelihood estimates have many properties parallel to those of maximum likelihood estimates (Wedderburn, 1974; McCullagh, 1983). Several assumption have been considered: (a) the usual assumptions of the normality of the random effects; (b) the non-normal error distribution; (c) the random effects, U5, k=0,1,...,k, are assumed to multivariate normal each with a mean of zero, and some variance, Var(Ukj) = 1“. For any pair of random effects, , and p, Cov(k,k.) = 1a.; (d) the assumption that the mean, 0m, is related to linear predictors by a logit link function. 29 Thus, using "logit" as a link function will result in obtaining logistic regression coefficients having random slopes. Therefore, an estimate of within- and between-school parameters can be obtained (using the quasi-likelihood method) for a multilevel binary model (similar to the Wong and Mason model). Goldstein (1989) proposed a multilevel nonlinear model when modelling discrete data. Here the within-group model for the binary outcome is specified with two dummy variables (k=2, xlij and XE). Thus the within-group logistic regression model is similar to Equation 2.6. The between-group model assumes the EU (k=2) to be random similar to Equation 2.5. Similar to other models. Goldstein assumed that; (a) the predictors are fixed, (b) the upper- level random terms U6 and Ulj have a joint distribution with mean.0 and can.be represented in.a'variance covariance matrix. As indicated for the within-group model, this model deals only with dummy variables for the within-group model by applying the iterative generalized least squares "IGLS" estimation method (Goldstein, 1986). A real example of the above case that is provided in the ML3 manual (Prosser et al. 1991) is as follows: "...a sampled person working in factory j might be in one of eight job status categories, level 2 unit here are the factories, and level 1 units are the categories." (p. 22). Thus, the logit (05) is considered as the dependent variable, where 0ii is defined as the proportion of individuals in the job status 30 category 1 in factory j that answered "Yes" to a "Yes" or "No" question. Braun (1989) suggested a different estimation method for the hierarchical logistic regression model (i.e., the Wong and Mason model). Braun suggests first obtaining the ordinary logistic regression estimates (Equation 2.6), B»i of Bi, along with the estimated of of these estimates ( 8, ~ N(Bi,ai’)). Following this the empirical Bayesian estimates of Bi can be derived from Bi ~ N(Bi,o,’) and the between-group model (Equation 2.5). S nngmar y Concerns regarding the distribution of normal errors in the case of data with binary outcomes have led to the development of the logistic regression model. However, there are some disadvantages of the logistic regression model when dealing with data sets involving two or more levels of hierarchy. For the past decade the concerns regarding the appropriate analysis of multilevel data structure have led to the studies of several methods of estimation for multilevel linear models with normally distributed outcomes.- However, several researchers have expressed concern regarding the use of multilevel linear model analysis when the normality assumption of the residuals is violated, specifically in the case of binary outcomes. This has led to development of several different approaches that take into account both the binary response and multilevel data structure 31 (see Table 2-1). However, the popularity of these estimation methods and their applications in the field of educational research are limited. CHAPTER III METHOD Intnoguction This chapter has been divided into three sections. The first section deals with the pilot study carried out on all four methods of estimation using the computer programs of ML3, VARCL, MULTILOGIT, and SPSS. These four computer programs will first be presented by describing the requirements for operating each program and indicating the initial advantages and disadvantages of each of them. The second section will address the real data. A brief description of the real data will be presented. This will be followed by presenting the five multilevel logistic regression (MLR) models, and the two MLR models using the VARCL method of estimation. The final section will address the simulated data. First, a description of the simulated model will be presented. Second, an account of the selected values for the conditions of interest that were used in the simulated model will be given. Third, the procedure used to generate simulated data will be described. Finally, the statistics used to evaluate the accuracy and properties of both VARCL and SPSS estimation methods will be presented. 32 33 The Pilot Study A pilot study was first conducted on the SPSS program which takes into account structure of the outcome for single- level data. Subsequently, a study was conducted on the other programs VARCL, ML3, MULTILOGIT which take into account the multilevel structure of the data. A random sample of 20 schools was first drawn out of 411 schools from the real data (i.e., the Thailand data). A total of 406 students were found in the sample. Three dichotomous variables were also selected. These were (a) student repetition as dependent variable, (b) student sex as student- level covariate, and (c) school location (urban versus rural) as school-level (or group-level) covariate. Dichotomous variables were selected as covariates because the ML3 program requires that the two covariates be dichotomously coded. The purpose of the pilot study was to run these four computer programs using the sample data in order to observe the advantages and disadvantages of these programs before conducting any further real or simulated data analysis. As such, the obtained estimates of these programs were not compared in this pilot study. In the following account the researcher will introduce each of the computer programs, describe the requirements for operating each program, discuss the advantages and disadvantages for each program, and state the concerns of each program for further analysis in this study. 34 For simplicity, each estimation method will be identified by the name of the program. The programs will be identified as follows: the Maximum Likelihood estimation method as the SPSS program, the Quasi-Likelihood estimation method as the VARCL program, the Generalized Least Square estimation method as the ML3 program, and lastly, the Empirical Bayesian estimation method as the MULTILOGIT program. T e VARCL P 0 am The VARCL program was first initiated by Aitkin and Longford (1986) and maintained by Longford. It is designed for the fitting of mixed linear models with nested random effects on data involving hierarchies of nesting. The analysis using the VARCL program in this study was based on its microcomputer version. The researcher was also able to obtain the mainframe version of the program. It is useful to note that the interface of the VARCL program combines both an interactive and a batch feature of operation. To run the VARCL program, the user needs to identify three input files, namely: (1) the.basic information file, (2) the data file for student-level variables and the interaction term between student variables and school variables, and (3) the data file for school-level variables. The following information should be furnished to the basic information file: line 1: the research title, line 2: the number of levels of nesting (two or three levels 35 of nesting), line 3: the number of units for both students and groups (schools), line 4: the number of variables for both student- and school- levels, line 5: the maximum number of iterations, frequency of report of convergence, and precision (a choice up to 4 decimal places, .0001), line 6: the name of the unit-level (i.e., student), and name of the school-level (i.e., school), line 7: the name of the file containing the student data, line 8: the format of the student data, line 9: the name of the file containing the school data, line 10: the format of the school data. The rest of the lines contain the name of the variables together with the number of its categories (this is equal to 1 for continuous variables), and finally, the number of subjects within each group. An example of the basic information file specified for this study is found in Appendix 3-1. The specification of the model part and both the fixed and random effects of the model was done interactively. The VARCL estimates converged to give the estimate of the parameters of the pilot data. The analysis of the results of the pilot revealed the following minor disadvantages of the VARCL program: 36 1) The model specification for the VARCL program is different from the MULTILOGIT program. The VARCL program macro (i.e. , school variable) variable could not be specified as a predictor of the micro regression coefficient. However, the same MULTILOGIT and VARCL combined model can be obtained. 2) The independent covariates variables values had to be coded as "1"'s and "2"'s instead of "0"’s and "1"’s. 3) The user of the program has to specify the random effects twice to obtain the estimate of the random part of the model. By running the VARCL program using the pilot data, it was found that the program uses standard logistic regression estimates (the same estimate up to four decimal places) as its initial estimate. However, the MULTILOGIT program requires that initial estimates be given for each specified model (within-school regression model) and for each school in the sample. It was also observed that the VARCL program converged to the estimates very rapidly. In addition, VARCL program was friendly and easy to use by combining both an interactive and a batch feature of operation. The manual for VARCL contained not only the information about the procedure to create an VARCL batch file and mixed model specification, but also provided many examples to assist the investigator. 111W The ML3 software program is used for two and three-level multilevel data analysis by Rabash Prosser and Goldstein, 37 based on Goldstein (1987). The researcher was able to obtain both the microcomputer and the mainframe version of the program. However, the analysis was based on the microcomputer version. The ML3 program operates interactively. The user is required to identify a single data file that contains both the student and school variables, identifying each level by an identification code. The ML3 program is easy to use, and the furnish manual was sufficient, containing information about the estimation method, multilevel model specification, and procedure to operate the program. Some major disadvantages of the ML3 program were revealed during the pilot analysis of the pilot data. ML3 requires that both the levels of the variables be dichotomous. In addition, ML3 requires the specification of the number of students (nfl in each sex by URB/RRL (urban or rural the school location variable) cell categories, and number of students from each of the nij cells who repeated. These specific requirements made the running of the ML3 program very cumbersome. In spite of detailed model specifications for the ML3 program design, the estimates of the ML3 programs in the pilot study did not converge. This may have been caused by having one urban school (with a total of 28 students) and 19 rural schools (with a total of 378 students) within the 20 schools randomly drawn. Since the ML3 program requires dichotomous covariate variables at both levels and design specification, it was 38 decided to drop out the ML3 estimation method from further data analysis. This is because both the design and covariates specifications would be different for real and simulated data. For example, the simulated independent variables for student and school level would be continuous variables, while the real data would contain both continuous and dichotomous variables. In addition, the ML3 program‘was comparatively much slower to run . Ine MQLTILOGII Program The MULTILOGIT program was written by Albert F. Anderson, of the Population Studies Center at the University of Michigan, from instructions provided by George Y. WOng and William M. Mason. The program executes the multilevel logistic regression model that is proposed by Wong and Mason (1985). The program is only available in the mainframe at the University of Michigan. To run the program, several manuals were required to explain how to operate the University of Michigan computer terminal system (MTS), and secondly how to use the HTS file editor. In addition, a PCTIE program needs to be purchased in order to allow the microcomputer to operate as a terminal to the university of Michigan network host. The PCTIE command also allows the transfer of files between the microcomputer and MTS. In order to run the MULTILOGIT program, the user is required to specify four input files: (1) a micro (student) data input file, (2) a macro (school) data input file, (3) a 39 coefficient input file, and (4) a command file. These micro and macro data files contain the student- and school-level data sets. The coefficient input file contains the classical within-group (school) logistic regression coefficients for each school in the macro data. These coefficients will be used to generate starting values for the iterative algorithm. The command file performs the following functions: (1) It defines the multilevel logistic regression model, (2) provides terminating conditions for the algorithm, and (3) specifies input and output files (a copy of a command file is found in Appendix 3-2). The command file operates as a MULTILOGIT batch file. Initially, there were some problems running the MULTILOGIT program, because the manual set-up specifications were a little different. When the program was finally run on the pilot data, estimates of the parameters were obtained. Other than this initial problem, the program was easy to use, and the researcher had only to deal with the command file in order to change the multilevel logistic regression model. In addition, the program converged rapidly when used on the mainframe. The supplied manual contained sufficient information on how to write a command file, and run the program. However, some limitations of the MULTILOGIT program were found and summarized as follows: 1. The micro (student) data file could only include:9 distinct micro variables (not counting the micro intercept). 40 2. The program could only read 5 macro variables (not counting the macro intercept). 3. The maximum number of schools (group) that could be used was 59. 4. The program required classical within-school logistic regression coefficients for each school (20 schools in the pilot study) in the analysis. These values had to be supplied by the researcher in order to generate the starting values for the iterative algorithm. 5. The MULTILOGIT program assumed that all micro regression (intercept and slopes) were random coefficients. In other words, the MULTILOGIT program did not accept the fixing of any within-school regression coefficient. 6. The model specification for the MULTILOGIT program was different from the VARCL program. However, the same combined model for the VARCL and the MULTILOGIT program was obtained. The MULTILOGIT program specifies the school (or group) variables to be used as regressors in a between-school regression model in which the dependent is the slopes coefficient or intercepts. The difference between the model specifications of the VARCL and the MULTILOGIT program will be clarified later when the real data is analyzed using the different models for three estimation procedures, VARCL, MULTILOGIT and SPSS. In addition to the above disadvantages, the cost of running the MULTILOGIT program on the mainframe computer of 41 the University of Michigan was also a major financial concern. The SPSS Program The SPSS program is a multi-purpose statistical package. It available on the mainframe at Michigan State University and also as a microcomputer version (both forms of SPSS were used in the analysis). The SPSS uses the single—level logistic regression model ignoring the hierarchical structure of the data. In other words, it assumes that the logistic regression parameter (slopes and intercepts) have fixed effects, ignoring the group (school) effect on the variability between slopes and intercepts. This model specification is considered as the major disadvantage of this program. In the SPSS program the maximum-likelihood method of estimation is used to obtain the estimates of the logistic regression model parameters. In addition, since the model is nonlinear, an iterative algorithm is used for parameter estimation. The Finding of the Pllo; Study The pilot study revealed several limitations, advantages and disadvantages of the four computer programs. The ML3 program requires dichotomous covariate variables at both levels and has an inconvenient design specification. Because of this, a decision was made to exclude the ML3 estimation method from further data analysis. The maximum number of schools (group) that can be used (59) was a serious 42 limitation of the MULTILOGIT program. In addition, the MULTILOGIT program requires the specification of the classical within-school logistic regression coefficients for each. of the schools in the analysis. This proved to be too cumbersome. Furthermore, the MULTILOGIT program always assumes that the micro regression coefficients are random. Finally, the cost of running the MULTILOGIT program on the mainframe computer at the University of Michigan proved to be a major financial concern especially when considered for use with simulated data. In fact, the cost of running the MULTILOGIT program at the University of Michigan mainframe computer center lead the researcher to run VARCL, SPSS, and MULTILOGIT on the real data first rather than the simulated data in order to»determine the real cost. This allowed the researcher to predict the extremely high financial cost of running the MULTILOGIT program on the simulated data. Charactegistics g: the Real Data The real data analyses were based on data from Thailand collected in 1988 under the sponsorship of the BRIDGES (Basic Research in Developing Educational Systems) project. A random sample of 59 schools (due to the limitation of the MULTILOGIT program on the maximum number of groups) consisting of 1244 sixth-grade students was utilized. The analysis was based on several models, from simple to more.complex, using two student variables: (1) the student repetition where "1" indicates 43 "ever" and "0" indicates "never", and (2) student socioeconomic status (i.e., SES). In addition to this, five school-level variables were also included: (1) the school location (urban vs. rural), (2) school SES (i.e., MEAN SES, the student SES was aggregated at the school level to measure the school SES), and (3) three geographic variables. These were allocated in terms of location of the school in the central, north or south regions of Thailand. These variables were chosen based on previous work which indicated that they were related to student repetition. The descriptive statistics for the real data at both the student- and the school-level are presented in Appendix 3-3. All the variables at both levels had to be centered in order to be able to compare the regression coefficients across the different approaches. This was because the VARCL program centered all the variables. Each of the three programs were run several times in order to ensure that the data loaded on to the program had been read accurately. ve 'st' e ress on o els Five different Multilevel Logistic Regression (MLR) Models, from simple to complex were analyzed. This was done in order to compare the following estimated statistics in the three estimation procedures: (a) the estimates of the regression1coefficients and their standard errors, and (b) the 44 variance and covariance of the random effects and their standard errors. An additional analysis was also performed using the VARCL program comparing the random intercept and fixed regression coefficient model with the combined random intercept and random regression coefficient model. This was done in order to show advantages of using one model over the other, and the ability of the ‘VARCL. program to 'test the variance and covariance of the random effects of the model (i.e., Ho:1oo=0, H°:1"=0, and Ho:1°,=0) . Each of these five multilevel logistic regression models will be presented in this chapter. The results of the real data analysis and the: comparisons between ‘the estimated statistics, using three methods of estimation, will be subsequently presented in chapter four. MLB_MQQ§l_1 The simplest MLR model considered in this study included no student-level and school-level independent variable as covariates. The within-school equation (or group) for MULTILOGIT and VARCL is represented as Logit (repetition),’- = B (3.1) oj The between-school equation (or group) for MULTILOGIT and VARCL is represented as Boj= 700 +001. (3.2) 45 where B0’. is the average of the Logit (repetition)ij in school j (Equation 3.2 shows that Ed varies around the grand mean 700 with variance 02(Uoj)=7m) . U“ is the random effect associated with school j. Thus the combined equation of MLR model 1 for MULTILOGIT and VARCl.was obtained by substituting Equation 3.2 into 3.1, Logit (repetition)ili = 70° + Uoj (3.3) The MLR model 1 for SPSS is simply represented as, Logit (repetition)ij = 700 (3.4) MLR model 1 for MULTILOGIT and VARCL is a useful way to estimate much of the variation that exists in the dependent variable between schools. It is clear that the MLR model for SPSS does not account for the between-school (or group) variation (compare Equation 3.3 with 3.4). ML3 Model 2 The second MLR model considered in this study included the student-level variable of student socioeconomic status (i.e., SES) in the within-school equation as a covariate. The school-level variable was also excluded. The within-school equation (or group) for MULTILOGIT and VARCL is represented as Logit (repetition)ij = Boj + B”- (SES)ij (3.5) The between-school equations (or group) for MULTILOGIT and VARCL is represented as 46 Boj = 70° + Uoj (3.6) 815 = 710 + Uij (3-7) where Boj is the adjusted school mean (i.e. , the raw school mean minus an adjustment for its SES mean); and is the effect of the student SES on the outcome within school j. In the case above, both the adjusted school mean, Bfi,tand the school regression coefficients, Bur vary across schools around their grand mean. Thus the combined equation of MLR.model 2 for MULTILOGIT and VARCL was obtained by substituting Equation 3.6 and 3.7 into 3.5, Logit (repetition)ij = 70° + 7,0 (SES)ij + U”- (SES)ij + U0). (3.8) The error term in Equation 3.8 is presented as (Uh. (325),). + um.) . The MLR model 2 for SPSS is simply represented as Logit (repetition)ij==70° + 7b (SES)ij (3.9) ML3 Model 3 The third MLR.model considered in this study was similar to the second model. The only difference was that the school- level variable (i.e., school SES, MSES) was included as a covariate. This model was specified differently for the MULTILOGIT and the VARCL programs. However, the combined MLR model 3 for both programs was identical. 47 The ‘within-school equation. for' MULTILOGIT is similar to Equation 3.5. The between-school equations for MULTILOGIT is represented as 130,. = 70° + 701 (MSES)j + Uoj (3.10) 31,- = 7:. + U],- (3.11) The within—school equation for VARCL is represented as Logit (repetition)ij = B01. 4» Blj (SES)ij 4i- 70, (MSES)j (3.12) The between-school equations for VARCL model 3 is similar to Equation 3.6 and 3.7. Thus, the combined MLR model 3 for MULTILOGIT (substituting Equation 3.10 and 3.11 into 3.5), and VARCL (substituting Equation 3.6 and 3.7 into 3.12) is derived as Logit (repetitionhj==7m + 7m (SES)ij+-7ol (MSESM + Uh (SES)ij + Um. (3.13) The MLR model 3 for SPSS is represented as Logit (repetition)fi==7m_+ 7m (SES)fi+-7d (MSESh (3.14) MLR Mggel 4 In the fourth MLR model, another school-level variable (i.e., school location, urban versus rural, URB/RRL) was included as a covariate regressed on the regression lepes (i.e., Bu)(only for the within-school model. This variable was added in order to compare the interaction coefficient that was associated with it (i.e., (URB/RRL)j * (SES),§). Thus, only the result associated with the URB/RRL variable will be discussed. The ‘within-school equation. for ‘MULTILOGIT is similar' to Equation 3.5. 48 The first between-school equation for MULTILOGIT, (associated with the random intercept) is similar to Equation 3.10, while the second between-school equation (associated with SES regression slope) is represented as B, J = 7m + 7n (URB/RRLH+ U” (3.15) The within-school equation for VARCL is represented as Logit (repetition)ij = Bo,- + B,,- (SES)ij + 70, (MSES)j + 7,, ((URB/RRL), * (5123),) (3.16) The between-school equation for VARCL, is similar to Equation 3.6 and 3.7. Thus, the combined MLR model 4 for MULTILOGIT (substituting Equation 3.15 and 3.10 into 3.5), and VARCL (substituting Equation 3.6 and 3.7 into 3.16) is derived as Logit (repetition),,==y0° + 7“ (SEShi+-7m (MSESH + 7,, ((URB/RRL), * (325),.) + 11,, (533),, + 00, (3.17) The MLR model 4 for SPSS is represented as Logit (repetition)fi:= 7m + 7“ (SES)fi-+ 7M (MSES)j + 7n ((URB/RRL)5* (SES)fi) (3°18) MLB_MQQ§1_§ In this model the school-level variables for geographical region were included. The within-school equation for MULTILOGIT, is similar to Equation 3.5. The first between-school equation for MULTILOGIT is represented as 49 so, = 70,, +70, (URB/RRL), + 702 (CENTRAL), + 70:, (NORTH), + 70, (SOUTH), + 70, (MSES), + DD, (3.19) While, the second between-school equation (associated with SES regression slope) is similar to Equation 3.15. The within-school equation for VARCL is represented as Logit (repetition)ij = Bo, + B,, (SES), + 70, (URB/RRL), + 702 (CENTRAL), + 70, (NORTH), + 70, (SOUTH), + 705 (MSES), + 711 ((URB/RRL),‘ * (SES)ij) (3.20) The between-school equation for VARCL, is similar to Equation 3.6 and 3.7. Thus, the combined MLR model 5 for MULTILOGIT (substituting Equation 3.15 and 3.19 into 3.5) and VARCL (substituting Equation 3.6 and 3.7 into 3.20) is derived as Logit (repetition), = 70° + 7,0 (SES),, + 70, (URB/RRL), + 7,2 (CENTRAL), + 70, (NORTH), + 7.. (SOUTH), + 7.5 (MSES), + 7,, ((URB/RRL), * (SES),,) + U,, (SES),, + O, (3.21) The MLR model 5 for SPSS is represented as Logit (repetition), = 70° + 7,0 (SES), + 70, (URB/RRL), + 7,2 (CENTRAL), + 70, (NORTH), + 7.. (SOUTH), + 70, (MSES), + 711 (“IRE/ML); * (SES)ij) (3.22) C 'n o odels Us'n eV C e d E ' at This analysis will Show the advantages of using one model over the other. This comparison is made possible because of the VARCL program’s ability to test the variance and covariance of the random effects. Two models, A and B, will be specified in this study. Model A having a random intercept and 50 random regression slope, and model B having a random intercept model and fixed regression slope. For model A the within-school, between-school, and the combined VARCL equation are the same as the VARCL MLR.model 5 (refer to equations 3.20, 3.6, 3.7, and 3.21). For model B the within-school equation is similar to Equation 3.20. The first between-school equation (associated with the random intercept) is similar to Equation 3.6, while the second between-school equation (associated with the SES regression slope) is represented as 13,, = 7:. (3.23) Thus, the combined. model B for ‘VARCL (substituting Equation 3.23 and 3.6 into 3.20) is derived as Logit (repetition),,==7°° + 7,0 (SESh,+-7m (URB/RRLM + 702 (CENTRAL), + 70, (NORTH), + 70, (SOUTH), + 7.5 (MSES), + ‘Yu ((URB/RRL); * (3133),) + U.,- (3-24) The only difference between the combined model A (i.e., Equation 3.21), and combined model B (i.e., Equation 3.24) is that model B suppresses the error term associated with SES, U” (SES),. The results of the analysis running the SPSS, VARCL and MULTILOGIT estimation methods on the five proposed multilevel logistic regression models and two proposed MLR models (A and B) using the VARCL method of estimation using real data (Thailand data) will be presented in chapter four. 51 gationale for Excluding the MULTILOGIT Program A more complicated multilevel logistic regression model was attempted by including more covariates in the within- school model. The results of running this new model were obtained for both the SPSS and VARCL methods of estimation. However, the MULTILOGIT program did not run with this new model. It registered "bomb out" indicating an error message. The investigation into why this occurred revealed yet another disadvantage of the MULTILOGIT program. In order for the MULTILOGIT program to run, a specification of the coefficient input file is required. This file contains the classical within-group (school) logistic regression coefficients that will be used to generate the starting values for the iterative algorithm. These regression coefficients are obtained by estimating the classical logistic regression coefficients separately for each group (school) using the SPSS program. Thus, for each of the five specified models in this study, the logistic regression coefficient for each of the 59 schools was obtained. This meant that each data line in the coefficient input file of MULTILOGIT was associated with a single school (59 different data lines for the 59 schools in each model) containing the intercept and the regression slope of the within-group logistic regression model. However, since the number of students in each of the 59 schools range from a minimum of 8 to a maximum of 37 students, the within-school logistic regression coefficient (intercepts 52 and slopes) estimates for the MULTILOGIT coefficient input file were estimated as zero. This caused the MULTILOGIT program to "bomb out". In fact, the concern regarding the number of subjects within each group was also mentioned in Wong and Mason (1985). Based on the results of the pilot study and real data analysis, it was decided to exclude the MULTILOGIT program method of estimation from the simulated data analysis. This decision was based on the following reasons: 1. The high financial cost of running the MULTILOGIT program on the University of Michigan Mainframe Computer Center. Despite running the program in the minimum charge time, which was generally between 2:00 a.m. to 7:00 a.m., the estimated cost of running the MULTILOGIT program on 1200 replications simulated data would be at least US$15,000.00. This figure was based on the cost of running the MULTILOGIT program on the real data and pilot data of this study. 2. The MULTILOGIijrogr m will not run in 600 out of 1200 replications of the simulated data. This is because the simulated condition for the number of subjects (students) within each group is considered as 10 (n=10). This will cause the within-group (school) logistic regression coefficients (intercepts and slopes) data for the MULTILOGIT coefficient input file to be estimated as zero. 53 3. For each of 1200 replications of the simulated data, 60 classical (i.e., SPSS estimates) within-group logistic regression coefficients need to be specified. These sixty data line estimates of the logistic regression coefficients for each replication point in the MULTILOGIT coefficient input file are due to the number of groups within each simulated replication. Obtaining all the coefficients would entail an enormous task. As a result of this, the analysis of the simulated data was conducted using only the two methods of estimation: (a) VARCL, designed for data involving hierarchies of nesting having a binary outcomes, and (b) SPSS, designed for single- level model having binary outcomes. Charncteristics of tne Simulated Model The simulated model was a two-stage multilevel logistic regression model having random intercept and a random regression coefficient. The with-group model is represented as a,, = so, + 13,, x, (3.25) The between-group model is represented as ' B0,- = 70° + 70, Z, + 00, (3.26) Bu = 7m + 711 zj + Uij (3-27) The generated data has, within each group (or school), the micro predictor, X,,, normally distributed with mean of zero and a variance of one (i.e., X,,~N(0,1)). Similarly, the 54 macro predictor, Z. is normally distributed with mean of zero J! and a variance of one (i.e. Z,~N(0,1)). In addition, U0, and U,, are mutually independent, as they are generated separately (i.e., 10, = 0). The random effects (i.e., U. o, and U,,) were generated having both a normal distribution (ND) and t-distribution (TD). The normal distribution of the random effects were investigated under both a large magnitude of the random regression slope variance, 17.6% (denoted by RRSL), and a small magnitude of the random regression slope variance .005% (denoted by RRSS) of the intercept variance. Therefore, the random effects with RRSL were generated having a normal distribution with mean of zeros and variance-covariance components as U0j 100 101 .85 .00 Var = = Ulj 110 111 .00 .15 The RRSS the random effects were also generated to have a normal distribution with mean of zeros and variance-covariance components as Uoj 100 101 .995 .00 Var = :- Ulj 1'10 Tl]. .00 .005 While, the t-distribution of the random effects were investigated only under a large magnitude of the random regression slope variance of the intercept variance. Thus the 55 random effects with RRSL were generated having a t- distribution with four degrees of freedom with mean of zeros and variance-covariance components as U0j 100 101 .85 .00 Var Ulj 110 111 .00 .15 In addition, the following values were chosen for 7's: 7” = -1.80, 7d = -1.20, 7m = -.50, and 7" = .75 . These values were chosen in order to represent realistic values of the situation. In fact, these values for 7's were obtained from the previous analysis of real data (i.e., the results of the fourth model of the real data analysis using three main estimation methods). Similarly, the Simulated true values for the variance-covariance components of random effects (i.e. , 10,, and 1“). Thus by substituting the 7's values into Equation 3.26 and 3.27, and further substituting the two equations (i.e., 3.26, and 3.27) into equation 3.25 the combined model is derived as a, = -l.8 - 1.20 z, - .50 x, + .75 (x,,*z,) + (0,, x, + U”) (3.28) The accuracy and properties of the parameter estimation for both VARCL and SPSS programs of the simulated data were evaluated under the moderate number of sixty schools, j=1,2,...60. In addition, the statistical properties of these estimation procedures were investigated under realistic values 56 for the three following conditions: 1. Mnmbgr of snbjects within eacn group (n). Simulating data with n = 10 and 60 units (subjects) within each group. These small and large values of the number of subjects within group was based on several studies (Bock, 1983; Aitken and Longford, 1986; Wong and Mason, 1985). 2. Magnitude of the nandom :ggression slope (RRS) variance. Specifying the RRS variance of .005% (small variance denoted by RRSS) and 17.6% (Large variance denoted by RRSL) of the intercept variance, the following values were chosen, 1m = .995 and 1” = .005 to obtain RRSS and 100 = .85 and 1" = .15 to obtain RRSL. These values were selected based on (Wong and Mason, 1985) and the results of the previous analysis of the real data. 3. 'st ’ ' d -d's ' ' e znndgn effects, U, and g”. esi e ' u t In order to establish the design of the simulated study, three conditions of interest (with two levels within each condition) had to be considered. This resulted in a design that consisted of a total of six cells see Figure 3-1. For simplicity, each cell was identified by the following notations: 57 (ND,n10,RR88) defining a normal distribution (ND) of the random effects, with 10 (signifying a small number) subjects within each cell (n10), and small random regression slope (RRSS) of the intercept variance. (ND,n10,RRSL) defining a normal distribution (ND) of the random effects, with 10 subjects within each cell (n10), and large random regression slope (RRSL) of the intercept variance. (ND,n60,RRSS) defining a normal distribution (ND) of the random effects, with 60 (signifying a large number) subjects within each cell (n60), and small random regression slope (RRSS) of the intercept variance. (ND,n60,RRSL) defining a normal distribution (ND) of the random effects, with 60 subjects within each cell (n60), and large random regression slope (RRSL) of the intercept variance. (TD,n10,RR8L) defining a t-distribution (TD) of the random effects, with 10 subjects within each cell (n10), and large random regression slope (RRSL) of the intercept variance. (TD,n60,RR8L) defining a t-distribution (TD) of the random effects, with 60 subjects within each cell (n60), and large random regression slope (RRSL) of the intercept variance. 58 Procedure used to Generate the Simulated Data A Gauss computer program was used with an IBM compatible 386/Mhz microcomputer to»generate the data for each of the six cells (combing the three conditions of interest and equations satisfying 3.25, 3.26 and 3.27). A math coprocessor was installed in the microcomputer to speed up the process. Since the analysis is based on a moderate number of 60 groups, a vector of 60 by 1 was first generated for the group predictor, 2,, having a normal distribution with a mean of zero and a variance of one. A copy of the program is shown in Appendix 3-4. In addition, a 600 by 1 vector was generated for the within-group predictor, X- This was because there were 10 g- subjects within each of the 60 groups resulting in 10 by 60 (i.e., 600) subjects of2&,covarite being generated, having a normal distribution within each group (or school) with a mean of zero and a variance of one. A copy of the program is found in Appendix 3-5. Similarly, a 3600 by 1 vector was also generated for X, and used where the number of subjects within each group was taken to be sixty, n=60. Note that both 2, and X, are considered as fixed variables. The random effects, U, and U,, of the equations 3.26 and 3.27 were also generated satisfying the conditions of interest (i.e., RRSS or RRSL, and ND or TD). The random effects, U,,, and U,,, were generated having a t- distribution with four degrees of freedom. Selecting four 59 Figure 3-1.--The design of the six cells of the simulated study The random regression slope of the intercept variance I l Small. RRSS Large, RRSL (.0051) (17.60) i J Number of n10 n60 n10 n60 subjects withini—— ‘—f—‘ each group L The distribution ND ND ND TD ND TD of the random * ‘—— (——‘ effects Cell (ND,n10,RRSS)(ND,n60,RRSS) (ND,n10,RRSL)(TD,n10,RRSL) (ND,n60,RRSL)(TD,n60,RRSL) identification' ' 200 replication within each cell. degrees of freedom would make the t-distribution deviate from the normal distribution. The distribution of the random effects was checked in order to ensure that the program was working properly. Thus, using the generated U,,,, U,,, z, and the assigned values of 7's (70,, = -1.80, 7“ = -1.20. 7h - -.50, and 7" = .75), together with equations 3.26 and 3.27, the Bq's and B,’s were computed for each group. Using the computed Ed’s, B,,'s for each group and the generated X, values for each subject within the same group derived from Equation 3.25, a, was computed for each unit (i.e., the individual subject) within each group (school). 60 Since the objective of the study was to obtain binary (0’s and 1's) outcomes for individual units, 0, was first obtained from a, by using the procedure 0, = e“‘5 / (1 + edj) , (0, = P(Y,=1) is the probability that the i-th micro observation will select the first category (i.e. , Y,=1) of the response variable. This was obtained by solving the equation, below. Since Ln [0, / (1- 0,)] = a, i,- / (1- 6,) = em" 0, = (1- 0,) edj 0,, = e"‘ij - 0, ea] 0, + 0, emij = ed} 0,, (1 + edi) = edi Thus 0,, = e"ij / (1 + e‘dj) = 1 / (1 + e'“‘") Finally, assuming that Y,|0, ~ Bernoulli (0,) , the binary scores for each unit (individual student outcome), 31,, was obtained. By drawing a number at random from a uniform distribution within the range of zero to one, Y, is assigned to value one, if the value of the random number is less than or equal to the 0,. If the random number drawn exceeds 0, , Y, is recorded as zero. Each of the above steps in this simulated study was checked thoroughly in order to be confident that the simulation program was doing what it was expected to do. 61 For each of the six cells, 200 replications were performed. However, due the space limitation in the hard disk of the researcher's microcomputer, the program was run twice with 100 replications each time in order to obtain the 200 replications for each cell. A total of 1200 replications were performed (200 replications for each of the six cells), generating two sets of data that could be used on the SPSS and VARCL programs. Fortunately, it was discovered from the real data and the simulated data analysis that the VARCL program used the SPSS estimates (the same values, up to four decimal places) as its initial values in order to obtain VARCL estimates. Thus, only one set of generated data was applied to VARCL program as the SPSS estimates could be obtained from the VARCL printout. The data obtained from each replication run were, ‘fi,unit outcomes (the dependent variable for each unit or student within a group), )g unit within group predictor, Z, group predictor, and x,*z, the interaction term between the within-group predictor and the group-level predictor. ~ 4 A copy of the GAUSS program that was used to generate a data set for the cell (ND,n10,RRSS) is found in Appendix 3-6. In addition, the statistics, 1“,, 1,, and 10,, for each replication were saved on separate files. This was done in order to compare their standard errors with the estimated standard error of variance-covariance of the random effects 62 from the VARCL estimation procedure later. The VARCL program was run on each set of the 1200 replications (data set) resulting in a printout of 1200 values consisting of the SPSS and VARCL estimates. These parameter estimates of the simulated model were used to evaluate both the estimation methods. The estimated parameters that were obtained for both the SPSS and VARCL procedures were saved on a single file. These statistics were later used to evaluate the accuracy and properties of both the VARCL and SPSS estimation procedures. The average time to obtain 100 replications (100 sets of data for the VARCL program) for n=10 was approximately 45 minutes and approximately 2 hours for n=60. The average time of running the VARCL program for one data set, where n=10, was approximately 1.4 minutes and approximately 4 minutes for n=60. Staristical Comparisgn of tne Estimation Metnggs To compare the two estimation methods the following statistics were computed : 1. The mean (average) of estimates, E(79 = 7 where'% is the VARCL and SPSS estimate for each of the parameters of interest. 2. The bias, Eofi -‘%) where‘n is the true parameter value. A 95% confidence interval of the bias (bias 1 1.96 x S.E.(bias)) was also constructed. In addition, the magnitude of the bias was compared to the true value. The 63 percent of this bias is obtained by dividing the absolute value of the bias by the absolute true value and multiplying the value obtained by 100. 3. The mean square error (MSE) of estimates. This statistic combined the bias and the dispersion of an estimator into a single quantity: MSE = Ev}, - 70’ = mm + Ev}. - 7.)” = mm + Bias2 4. The probability of type I error rate under a true null hypothesis (Ho: 7 = 7, ). This is determined by counting the frequency with which the test statistic, Z = (7 — 70/8.E.(7) in each replication, exceeds a specified critical value (at .05 significance level), and dividing by the total number of replications. The results of the analysis comparing the VARCL and SPSS estimation methods using simulated data will be presented in Chapter Four. Summerx The research study began with a pilot study in order to evaluate the multilevel and single-level logistic regression models as analyzed by four computer programs: ML3, VARCL, MULTILOGIT, and SPSS. As.a result of this, the ML3 program*was excluded from the analysis, and some concerns arose regarding the MULTILOGIT program. The computer programs were then run using the real data and a simulation exercise was also executed. Five multilevel logistic regression models and two MLR models using the VARCL method of estimation were 64 demonstrated for the real data analysis. As a result of the real data analysis, the MULTILOGIT program was excluded from the study. The procedure for running the simulation study using the GAUSS program was also explained. CHAPTER IV RESULTS lntrgductign The results of this study are presented in three sections. The chapter begins by presenting the results of the real data analysis. The second section will address the results of the simulation data analysis, comparison of the SPSS and VARCL estimation methods, and evaluation of the effect of the three conditions on both estimation procedures. The third section will deal with the accuracy of the VARCL estimation method and the properties of the estimates of the variance-covariance components of random effects and its standard errors with the true values. es ts o t e e a na 5 The results of running the SPSS, VARCL, and MULTILOGIT programs of estimation methods of the real data will be presented for each of the five specified multilevel logistic regression (MLR) models and the two MLR models using the VARCL estimation procedure (i.e., model A versus model B). W The results of the analysis indicate that the absolute value of intercept coefficient, 7m, for the MULTILOGIT 65 66 approach had the largest value (-1.92074) followed by the SPSS and VARCL approach with values of -1.7010 and -1.632833, respectively (See Table 4-1a). The difference of the 70,, estimate between the SPSS and VARCL estimation methods was very small. Table 4-1a.--Estimated regression coefficient and standard error (given in parentheses) for MLR model 1 using different estimation methods. Estimation Mgtnod M.L. Quasi-likelihood Empirical Bayesian (SPSS) (VARCL) (MULTILOGIT) Intercept,70° -l.7010 -l.632833 -l.92074 (.0785) (NA) (.1770906) Note: NA-not given by the program. In addition, the results indicate that the MULTILOGIT estimate of the standard error for 7w (intercept coefficient) is larger (.17709) than the SPSS estimate (.0785), while the VARCL program did not report the standard error for 7w. This is one disadvantage of using the VARCL program. The variance of the intercept of the random effects, 1“, was also compared using the MULTILOGIT and VARCL methods (the SPSS method did not report 10,, since the program does not account for between-school variation). The results shown in Table 4-1b indicate a slightly larger variance of the intercept of the random effects for MULTILOGIT (1.29436) than for VARCL (1.084792). This is because VARCL program uses approximation of the maximum likelihood estimate. 67 Table 4-1b.--Estimated variance of the intercept random effects and S.E. for V10° (given in parentheses) for MLR model 1 using the VARCL and MULTILOGIT estimation methods. Estinarion Mgtngd Quasi-likelihood Empirical Bayesian (VARCL) (MULTILOGIT) Interceptpr,° 1.084792* 1.29436 (.128574) (NA) Note (1) NA-not given by the program. (ii) *-significant at .05 (Ha 1,50, t-statistic =1.04lS/.1286=8.1007). The VARCL program also reported a standard error for the intercept standard deviation of the random effects, .128574 (standard error of 71”). This provided a significance test of the between-school variation. The null hypothesis here was: Ho: 10°=0. The t-statistic test of significance at p = .05 indicated that the null hypothesis should be rejected and that there were significant differences among schools with respect to their mean outcomes (i.e. Logit (repetition)fl, t=1.04153/.128574 = 8.1007 (mei- The hypotheses testing of variance-covariance components (i.e., H°:1,,,,=0, H°:1,,=0 and H°:1°,=0) of the VARCL results help investigators decide whether the regression intercepts and slopes for the within-group model should be specified as fixed or random, only when 1 is Significant. Note that the SPSS estimation method assumed that the regression intercepts and slopes were fixed, thus ignoring the variation of these regression coefficients among the groups. 68 By testing the above hypotheses the researcher is able to make a proper decision as to whether the regression coefficient among schools is fixed or random. This testing feature is found only in the VARCL program. es s o odel The results (Table 4-2a) indicate that.both the intercept coefficient, 70°, and its standard error (S.E.(yoon have a pattern similar to the MLR model 1. Table 4-2a.-—Estimated regression coefficient and standard error (given in parentheses) for MLR model 2 using different estimation methods. Estimntion Metnod M.L. Quasi-likelihood Empirical Bayesian (SPSS) (VARCL) (MULTILOGIT) Intercept,70° -1.8063 -l.750708 -l.98894 (.0873) (NA) (.1687909) SES,7h -.8841 -O.545613 -0.645738 (.1726) (0.197848) (.2290343) Note: NA-not reported by the program. Comparing the slope regression coefficients associated with SES, 7“, the MULTILOGIT and VARCL estimates of 7“ were found. to be (-0.6457) and (-0.5456), respectively. This indicates that they were quite close in value to each other, and the absolute value of MULTILOGIT estimate is slightly larger than VARCL estimate. While, the SPSS estimate was somewhat larger (-.8841). The results also indicate that the MULTILOGIT estimate of the standard error of 7“ was slightly larger (.2290) than the 69 VARCL estimate (0.1979). However, both their values were larger than the SPSS estimate (.1726) of S.E.(yh). The estimates variance-covariance components of the random effects produced by MULTILOGIT are slightly larger than VARCL estimates (see Table 4-2b). In addition, the VARCL approach provided a test (by providing an estimate and its standard error) of the hypothesis of no variation across schools in: (a) 80,, the adjusted school mean, Hon“, = 0; (b) B,,, the SES regression coefficient, Ho:1,, = 0; and (c) the covariance random effects between B, and B,,, Homo, = 0. The t-statistic test of significance at p =.05 level implies no variation across the SES regression coefficients (t = 1.3152), and no significance cgvnriance exists between the adjusted mean, 80,, and the SES regression coefficients, 8,, (t = -0.4518). The results of the test suggest that the ‘variation across the adjusted :mean is significant (t = 7.48886). This result is similar to that obtained in MLR model 1. Thus, in analyzing this data, the regression coefficient of the SES slopes for the within-group model might well be fixed (i.e., changing the Equation 5.7 into B, = 75). This decision cannot be made utilizing the MULTILOGIT program since it does not report any testing for the random effects portion of the model. 70 Table 4-2b.--Estimated variance-covariance components of the random effects and S.E. for 71w, V1”, and 1“ (given in parentheses) for MLR.model 2 using the VARCL and MULTILOGIT estimation methods. Egtimation Metnod Quasi-likelihood Empirical Bayesian (VARCL) (MULTILOGIT) Intercept 0.926374* 1.01563 (.128522) (NA) SES SLOPES —O.10108+ O.2114** .0572946 .358306 (.223707) (.34959) (NA) (NA) Note (i) NA-not given by the program. (ii) *-significant at .05 (Ho: 10,80, t-statistic=.96248/.1285=7.4889). (iii) **-not significant at .05 (Ho: 1,,80, t-statistic=.4598/.3496=1.3152). (iv) +-not significant at .05 (H°:1,,,=0, t-statistic=-.10108/.22371= -.452). e esults f M 0 e 3 The results of this analysis (Table 4-3a) show that the regression coefficient estimates of the within-school variable (i.e., intercept, 7m, SES slope 7m, and MSES slope 7“) using the SPSS and the VARCL approach are close in values. The absolute MULTILOGIT estimate, however, is larger than both the SPSS and the VARCL estimates. The figures in Table 4-3a illustrate that the standard error estimate for both the within- and the between-school variable regression coefficients (i.e., S.E.(ym), S.E.(yh), and S.E.(7M)) for the VARCL and the MULTILOGIT approach are closer in value to each other. Their estimates of standard error were larger than the SPSS estimate. The results of the variance-covariance components of the random effects estimates, their hypotheses tests, and the comparison between the different approaches for MLR model 3 71 Table 4-3a.--Estimated regression coefficient and standard error (given in parentheses) for MLR model 3 using different estimation methods. ima 'o et od M.L. Quasi-likelihood Empirical Bayesian (SPSS) (VARCL) (MULTILOGIT) Interceptnv°° -1.7875 -l.795934 -2.00627 (.0879) (NA) (.1655765) SES,7M -.2454 -0.294160 -O.399041 (.2202) (0.247753) (.2440238) MSES,7°l —1.2960 -1.315930 -1.32994 (.3164) (0.459423) (0.4655073) Note: NA-not reported by the program. are similar to the previous results for MLR model 1 and 2 (Table 4-3b). T e es ts 0 Mode 4 The estimates for the regression coefficients associated with interaction term, 7", for MULTILOGIT (.75059) and VARCL (.75448) are close in value (Table 4-4a). Both these estimates are larger than the SPSS (.6560) estimate. Table 4-4a again shows that the MULTILOGIT and VARCL estimates of standard error of 7” are larger than the SPSS estimate. The MULTILOGIT and VARCL estimates of 10°, and 1,, are close in value (Table 4-4b). However, again the MULTILOGIT estimates are larger than the VARCL estimates. 72 Table 4-3b.--Estimated variance-covariance components of the random effects and S.E. for V1”, V1”, and 1“ (given in parentheses) for MLR model 3 using the VARCL and MULTILOGIT estimation methods. st'mat et od Quasi-likelihood Empirical Bayesian (VARCL) (MULTILOGIT) Intercept 0.86048* 0.924759 (.129164) (NA) SES SLOPES 0.17049+ 0.4314“r 0.122416 .319217 (.256121) (.35782) (NA) (NA) Note (i) NA-not reported by the program. (ii) *-significant at .05 (H5 1330, t-statistic-.92762/.1292 I=7.1817). (iii) **-not significant at .05 (Ho: 1,,=0, t-statistics.65683/.3578-1.836). (iv) +-not significant at .05 (Ho: 10,80, t-statistics.l7049/.2561280.6657). Table 4-4a.--Estimated regression coefficient and standard error (given in parentheses) for MLR model 4 using different estimation methods. Entinntion Mernog M.L. Quasi-likelihood Empirical Bayesian (SPSS) (VARCL) (MULTILOGIT) Intercept,-y0° -1.7973 -1.767842 -2.08383 (.0875) (NA) (.1747097) SES,-yIo -.4097 -0.387944 -0.541050 (.2383) (0.244380) (.2478449) MSES,7d, -1.1891 -1.152334 -l.23976 (.3254) (0.456739) (0.469006) URB x SES,7” 0.6560 0.754480 0.750593 (.3568) (0.405380) (0.453603) Note: NA-not reported by the program. \ 73 Table 4-4b.--Estimated variance-covariance components of the random effects and S.E. for 71m, 71”, and 1“ (given in parentheses) for MLR.model 4 using the VARCL and MULTILOGIT estimation methods. st mat o et od Quasi-likelihood Empirical Bayesian (VARCL) (MULTILOGIT) Intercept 0.92129* 0.975108 (.127039) (NA) SES SLOPES 0.33158+ 0.1194** 0.215233 .243533 (.213225) (.31258) (NA) (NA) Note (i) NA-not reported by the program. (ii) *-significant at .05 (Ho: 100:0, t-statistic=.95984/.127l=7.5526). (iii) **-not significant at .05 (Ho: 1,,80, t-statistica.3455/.31268a1.105). (iv) +-significant at .05 (H; 1d=0, t-statistic= .57583 / .21323 a 2.701). esu s o R Mode 5 The results, shown in Table 4-5a, with respect to the 7m and/y,o parameters are identical to the results of MLRLmodel 3. However, comparing the regression coefficient estimates for the school-level variables (i.e. 7a, 7a, 7s: 7d , yfi, and 7“), the results suggest that for 7a, 7“,, 7s: and 7” the SPSS and VARCL estimates are close in value. The smallest estimates are observed in the SPSS approach, while the largest estimates are observed in the MULTILOGIT approach. The VARCL estimates are close to SPSS estimates, and the estimates of the standard errors of both the within- and the between-school regression coefficient variables for MULTILOGIT and VARCL are also very close in value. However,the MULTILOGIT estimates of the standard errors are consistently slightly larger than the VARCL estimates. For 74 Table 4-5a.--Estimated regression coefficient and standard error (given in parentheses) for MLR model 5 using different estimation methods. Est mat on Method M.L. Quasi-likelihood Empirical Bayesian (SPSS) t-STAT (VARCL) t-STAT (MULTILOGIT) t-STAT Intercept,7m-l.8357 -1.82266 -2.10500 (.0906) (NA) (.1726824) SES,7M -.3810 -l.63 -0.39l991 -1.62 -0.536057 -2.20 (.2340) (.24151) (.2437511) (URB/RRL),7,,| -.4444 -1.56 -0.580533 -1.23 -0.423996 -0.89 (.2844) (.473119) (.47761909) (CENTRAL),7a .0919 0.36 0.073548 0.17 0.0415828 0.09 (.2557) (.425883) (.44582059) (NORTH),7a .8605 4.14 0.867967 2.18 1.03390 2.46 (.2078) (.399045) (.42000833) (SOUTH),7M .3835 1.61 0.382027 0.93 0.412540 0.94 (.2389) (.412146) (.43958617) MSES,7¢ -.8753 -2.39 -0.902494 -1.70 -0.985701 -1.84 (.3670) (.531539) (.53678115) URB x SES,7" .6794 1.86 0.700078 1.60 0.740221 1.63 (.3644) (.438400) (.453797) Note: NA-not reported by the program. example, the MULTILOGIT estimates of the standard errors for 7m: 7a: 7a: 7a: 7a , 7a: and 7" are .244, .478, .446, .420, .440, .537, and .454 while the‘VARCL.estimates are .242, .473, .426, .399, .412, .532, and .438, respectively. Both the‘VARCL and the MULTILOGIT estimates of the standard error are also much larger than the SPSS estimates. The t-statistic computed for each of the regression coefficients, 7's, for the three estimation methods is shown in Table 4-5a. This t-statistic provides a test of significance of the regression coefficient of the model. The null hypothesis is Ho: 7, = 0. This test helps resolve whether there is a significant relationship between the micro (or macro) covariate variables and the dependent variables. In 75 this case, the objective of the t-statistic test is to compare the decisions made regarding the micro and macro covariate 'variables (i.e., either' rejecting or accepting the null hypothesis) in the three methods of estimations. The results of the covariate hypothesis testing for the micro variable show’ that. the SPSS and 'VARCL. estimation procedures each produced the same conclusion (except with respect to MSES variable) . However, a different conclusion was reached using the MULTILOGIT estimation method. For example, the t-test of the SES regression coefficient variable, 7“, of the SPSS (t = -1.63) and the VARCL (t = -1.62) estimation procedures indicate that the null hypothesis cannot be rejected, while the MULTILOGIT'(t.= -2.2) method of estimation rejected the null hypothesis. Every hypothesis was tested at a .05 level of significance (see Table 4-5a). The results of the covariate hypothesis tests for the macro variables show that the MULTILOGIT and VARCL estimation procedures reached the same conclusion unlike the SPSS estimation method. For example, at a .05 level of significance the t-test of .MSES regression coefficient variable, 7d, indicates that both the MULTILOGIT (t = -1.84) and VARCL (t = -1.70) estimation procedures could not reject the null hypothesis, while the SPSS (t = -2.39) method of estimation rejected the null (see Table 4-5a). The results of the variance-covariance components of the random effects estimates shown in Table 4-5b are similar to results of previous MLR models. As mentioned earlier, the 76 VARCL approach provides a variance-covariance components test of the random effects estimates. Again (as in MLR model 2) test results indicate no variation across groups in the SES within-school variable regression slopes (t = 1.1460). Thus, the SES slopes for the within-school model had to be fixed in order to compare the next two MLR models using the VARCL estimation method. Table 4-5b.--Estimated variance-covariance components of the random effects and S.E. for 71m, 71”, and 1" (given in parentheses) for MLR model 5 using the VARCL and MULTILOGIT estimation methods. Est m ' ethod Quasi-likelihood Empirical Bayesian (VARCL) (MULTILOGIT) Intercept 0.813675* 0.914446 (.125398) (NA) SES SLOPES 0.353929+ 0.15396** 0.228719 .214819 (.211922) (.34239) (NA) (NA) Note (i) NA-not reported by the program. (ii) *-significant at .05 (H5 TmFO, t-statistic=.902039/.12539887.l934). (iii) **-not significant at .05 (H; 1H=0, t-statistics.39238/.3424-1.15). (iv) +-significant at .05 (H; 1,30, t-statietics.59492/.21192=2.8073). es Co 'n ' V C The two MLR models analyzed were denoted as model A and model B. Model B was a random intercept model, while model A had a combined random intercept and a random regression slope for the SES variable (This is similar to the VARCL model in Table 4-5a). Id H! 1' 77 The comparison of these two models shows the effect of using one model over the other in decision making (i.e., rejecting or accepting the null hypotheses) regarding the effect of the student- or school-level variables on the dependent variable. the regression As can be seen from the results, coefficients and the standard error (S.E. (7’s)) estimates, 7’s, of model A and model B are different from each other. The t-statistic confirms this observation (see Table 4-6). Table 4-6.--Estimated regression coefficient, and standard error (given in parentheses) for the model A having random intercept and random regression slope and model B having random intercept and fixed regression slope using VARCL estimation method. MODEL T MODEL T A STATISTIC B STATISTIC Intercept,-y0° -l.82266 -1.812083 (NA) (NA) SESpy,o -0.391991 -1.623 -0.344946 -1.483 (.241509) (.232561) (URB/RRL),7M -0.580533 -1.227 -0.653884 -l.473 (.473119) (.444054) (CENTRAL),702 0.073548 0.173 0.031822 0.077 (.425883) (.413072) (NORTH),7¢ 0.867967 2.175 0.803747 1.950 (.399045) (.412093) (SOUTH),7“ 0.382027 0.927 0.245883 0.586 (.412146) (.419792) MSES,7¢ -0.902494 -1.698 -0.831582 -1.680 (0.531539) (.495013) URB x SES,7" 0.700078 1.597 0.657214 1.646 (.438400) (.399225) Note: NA-not reported by the program. The t-statistic for 7“ using model A suggests that Va is significantly different from zero at a .05 significance level (t = 2.175), while for model B the test indicates that 7, is 78 not significant (t = 1.95). Thus, the ability of the VARCL program to test the variance of the random effects is useful, whenever we want to account for the group membership effect. es t ' u e a First, the results of comparing the SPSS and VARCL estimation methods will be discussed with respect to: (a) the estimates of the macro parameters, 7's (7”, 7d, 7h, 7“), and (b) the estimates of the standard errors of the macro parameter. Second, the effect of the following three simulated conditions will be evaluated: (a) number of units within each group, (b) the magnitude of the random regression slope variance in contrast to the intercept variance, and (c) the distribution of the random effects. For both the SPSS and VARCL estimation procedures, the above three conditions will be considered with respect to the following statistics: (a) the macro parameters estimates, and (b) the estimates of standard errors of the macro parameters. Finally, the accuracy of the VARCL estimate of the variance-covariance components of the random effects and its estimate of the standard error will be discussed. nuns-1 - ’ 1'1: 6‘! 1"8 :10 VCR; T .11: 91 1191211299 The purpose of this analysis was to compare the SPSS (standard single logistic regression model using Maximum- 79 likelihood methods of estimation) and the VARCL (multilevel logistic regression model using Quasi-likelihood methods of estimation) properties of macro estimation. Table 4-7 shows the true value, the mean of all four macro parameters and their standard errors of estimate, MSE of estimate, and the bias of both the SPSS and the VARCL estimation method. The statistical values of the macro parameters were obtained using 1200 replications. Similar statistical values were used to compare the properties and the accuracy of both estimation procedures for macro parameters under different experimental conditions (i.e., six cells) having 200 replications within each cell, presented in Tables 4-8 through 4-11. The results of the analysis indicated that both the VARCL and the SPSS estimates of 7’s were statistically significantly biased at the significant level of p = .05. In fact, both estimation procedures underestimated the population parameters of 7's. On the average, the estimates of 70° for both estimation methods were 13 percent smaller than the true value. Similarly, 70, was 12 percent smaller, 7,, was 14 percent smaller and 7,, was 12 percent smaller than their true values. The VARCL and SPSS estimates of macro parameters were found, on the average, to be approximately equal for different statistics (i.e., mean, standard errors of estimate, MSE of estimate, and bias). A similar pattern of results was also detected with respect to estimates in the analysis of the real 80 Table 4-7.--The true macro parameter value, Mean, S.E?.,ZMSE, and bias for estimated 7's of the SPSS and VARCL estimation procedures‘. figrimnrion Method SPSS VARCL Macro Parameter 70° The True Value -1.800 -l.800 Mean of Estimate -l.569 -1.572 S.E. of Estimate .15 .149 MSE of Estimate .076 .074 Bias .231 .228 95% CI Bias .22,.24 .22,.24 Percent of Bias 13% 13% Macro Parameter 7d The True Value -l.200 -l.200 Mean of Estimate -1.062 -1.074 S.E. of Estimate .163 .163 MSE of Estimate .045 .043 Bias .138 .126 95% CI Bias .13,.15 .12,.14 Percent of Bias 12% 11% Macro Parameter 7,0 The True Value -0.500 -0.500 Mean of Estimate -.430 -.428 S.E. of Estimate .112 .106 MSE of Estimate .017 .016 Bias .070 .072 95% CI Bias .06,.08 .07,.08 Percent of Bias 14% 14% Macro Parameter 7” The True Value .75 .75 Mean of Estimate .662 .668 S.E. of Estimate .144 .140 MSE of Estimate .028 .026 Bias -.088 -.082 95% CI Bias -.10,-.08 -.09,-.07 Percent of Bias 12% 11% Note ' Observed standard deviation * From 1200 replications. of estimates 81 Table 4-8.--The true value, Mean, S.Efi, MSE, and bias for estimated 700 by cell identification for the SPSS and the VARCL estimation procedurefl Macro Parameter 70° 9211.12222111922122 Iflfllnlfliflflfifil 1H212121332L1 Estimation Method SPSS VARCL SPSS VARCL The True Value -1.800 -1.800 -1.800 -1.800 Mean of Estimate -l.555 -1.557 -l.586 -1.590 S.E. Of Estimate .175 .174 .169 .170 MSE of Estimate .091 .89 .075 .073 Bias .245 .243 .214 .210 95% CI Bias .22,.27 .22,.27 .19,.24 .19,.23 Percent of Bias 14% 14% 12% 12% 9.2112222221222122 260 SS Estimation Method SPSS VARCL SPSS VARCL The True Value -1.800 -1.800 -1.800 -1.800 Mean of Estimate -1.556 -1.560 -1.546 -1.553 S.E. Of Estimate .141 .137 .118 .121 MSE of Estimate .079 .076 .079 .076 Bias .244 .240 .254 .247 95% CI Bias .22,.26 .22,.26 .24,.27 .23,.27 Percent of Bias 14% 13% 14% 14% 9.9112222911122122 Wu 0 60 Estimation Method SPSS VARCL SPSS VARCL The True Value -l.800 -l.800 -l.800 -1.800 Mean of Estimate -1.587 -1.589 -1.582 -1.585 S.E. of Estimate .155 .156 .127 .125 MSE of Estimate .069 .069 .064 .062 Bias .213 .211 .218 .215 95% CI Bias .19,.24 .19,.23 .20,.24 .20,.23 Percent of Bias 12% 12% 12% 12% Note -200 replications were performed within each cell. ND -normal distribution of the random effects. TD -t-distribution of the random effects. n10 -10 subjects within each group. n60 -60 subjects within each group. RRSS-small magnitude of the random regression slope variance to the intercept variance (i.e, 1w=.995, 1"=.005). RRSL-large magnitude of the random regression slope variance to the intercept variance (i.e, 1w=.85, 1"=.15). 82 Table 4-9.--The true value, Mean, S.E‘., MSE, and bias for estimated yd by cell identification for the SPSS and the VARCL estimation procedurefl Macro Parameter 7m 99211.12223111221122 1HD1DLQLBB§§1 iflD1DLQ1BB§L1 Estimation Method SPSS VARCL SPSS VARCL The True Value -1.200 -1.200 -1.200 -1.200 Mean of Estimate -1.077 -1.086 -1.057 -1.070 S.E. of Estimate .193 .196 .180 .182 MSE of Estimate .052 .051 .053 .050 Bias .123 .114 .143 .130 95% CI Bias .10,.15 .09,.14 .12,.17 .11,.16 Percent of Bias 10% 10% 12% 11% 'de ' 'C lflELDQQLBB§§l 13212221BB§L1 Estimation Method SPSS VARCL SPSS VARCL The True Value -1.200 -1.200 -1.200 -1.200 Mean of Estimate -1.048 -1.057 -1.059 -1.077 S.E. of Estimate .153 .148 .133 .132 MSE of Estimate .046 .042 .037 .032 Bias .152 .143 .141 .123 95% CI Bias .13,.17 .12,.16 .12,.16 .11,.14 Percent of Bias 13% 12% 12% 10% £211 identificatign 0 0 Estimation Method SPSS VARCL SPSS VARCL The True Value -1.200 -1.200 -1.200 -1.200 Mean of Estimate -1.059 -1.068 -1.074 -1.085 S.E. of Estimate .176 .178 .133 .131 MSE of Estimate .051 .049 .034 .030 Bias .141 .132 .126 .115 95% CI Bias .12,.17 .11,.16 .11,.14 .10,.13 Percent of Bias 12% 11% 11% 10% Note ND -normal distribution of the random effects. TD -t-distribution of the random effects. n10 -10 subjects within each group. n60 -60 subjects within each group. RRSS-small magnitude of the random regression slope variance -200 replications were performed within each cell. to the intercept variance (i.e, 1w=.995, r"=.005). RRSL-large magnitude of the random regression slope variance to the intercept variance (i.e, 1m=.85, 1“=.15). 83 Table 4-10.--The true value, Mean, S.Efi, MSE, and bias for estimated 7“ by cell identification for the SPSS and the VARCL estimation procedurefi Macro Parameter 7“ Ce 'd ' °C ' 1321212138221 iHDLlengng Estimation Method SPSS VARCL SPSS VARCL The True Value -.50 -.50 -.50 -.50 Mean of Estimate -.437 -.437 -.427 -.424 S.E. of Estimate .133 .131 .138 .135 MSE of Estimate .021 .021 .024 .024 Bias .063 .063 .073 .076 95% CI Bias .05,.08 .05,.03 .05,.09 .06,.10 Percent of Bias 13% 13% 15% 15% Q211.ifi£fl£ifii£2£igfl ‘ 50 S 1E212221BB§L1 Estimation Method SPSS VARCL SPSS VARCL The True Value -.50 -.50 -.50 -.50 Mean of Estimate -.432 -.433 -.418 -.413 S.E. of Estimate .080 .052 .081 .078 MSE of Estimate .011 .007 .013 .014 Bias .068 .067 .082 .087 95% CI Bias .06,.08 .06,.08 .07,.09 .08,.10 Percent of Bias 14% 13% 16% 17% Cell identification (TD,n10,BBSL) LIQLQQQ‘BB§L1 Estimation Method SPSS VARCL SPSS VARCL The True Value -.50 -.50 -.50 -.50 Mean of Estimate -.436 -.434 -.429 -.427 S.E. of Estimate .139 .135 .078 .072 MSE of Estimate .023 .023 .011 .011 Bias .064 .066 .071 .073 95% CI Bias .04,.08 .05,.09 .06,.08 .06,.08 Percent of Bias 13% 13% 14% 15% Note -200 replications were performed within each cell. ND -normal distribution of the random effects. TD -t-distribution of the random effects. n10 -10 subjects within each group. 260 -60 subjects within each group. RRSS-small magnitude of the random regression slope variance to the intercept variance (i.e, 1m=.995, r"=.005). RRSL-large magnitude of the random regression slope variance to the intercept variance (i.e, rws.85, 1"=.15). 84 Table 4-11.--The true value, Mean, S.Efi, MSE, and bias for estimated 7" by cell identification for the SPSS and the VARCL estimation procedurefl Macro Parameter 7“ Qeii identification (ND,ni0,BBSS) (Np,niQ.BBSL) Estimation Method SPSS VARCL SPSS VARCL The True Value .75 .75 .75 .75 Mean of Estimate .677 .677 .654 .661 S.E. of Estimate .172 .167 .178 .184 MSE of Estimate .035 .033 .041 .042 Bias -.073 -.073, -.096 -.089 95% CI Bias -.1,-.05 -.1,-.05 -.12,-.1 -.11,-.1 Percent of Bias 10% 10% 13% 12% Ceil identification (ND.n60,RRSS) (ND,n60,BBSL) Estimation Method SPSS VARCL SPSS VARCL The True Value .75 .75 .75 .75 Mean of Estimate .649 .651 .656 .664 S.E. of Estimate .085 .071 .095 .084 MSE of Estimate .017 .015 .018 .014 Bias -.101 -.099 -.094 -.086 95% CI Bias -.1,-.09 —.1,-.09 -.11,-.1 -.1,-.o7 Percent of Bias 13% 13% 13% 11% Ceii idennification (TD,n10.RRSL) T 60 S Estimation Method SPSS VARCL SPSS VARCL The True Value .75 .75 .75 .75 Mean of Estimate .674 .683 .662 .669 S.E. of Estimate .195 .193 .089 .082 MSE of Estimate .044 .041 .016 .013 Bias -.076 -.067 -.088 -.081 95% CI Bias -.1,-.05 -.1,-.04 -.1,-.08 -.1,-.07 Percent of Bias 10% 9% 12% 11% Note -200 replications were performed within each cell. ND -normal distribution of the random effects. TD -t-distribution of the random effects. n10 -10 subjects within each group. n60 -60 subjects within each group. RRSS-small magnitude of the random regression slope variance to the intercept variance (i.e, 1m=.995, r"=.005). RRSL-large magnitude of the random regression slope variance to the intercept variance (i.e, rms.85, 1”=.15). 85 data. In addition, the standard errors for estimated 700, 70,, 7,0 and 7" were found to be quite close for both estimation procedures. However, the sampling distribution of these macro parameter estimates were different for both estimation methods (see Appendix 4-1 through 4-4). .A comparison of the MSE for both procedures indicated that the estimates of MSE for 7”, Va: 7“ and 7” were very close to zero for VARCL (.074, .043, .016 and .026) and slightly smaller than the SPSS estimates (.076, .045, .017 and .028). A similar conclusion may be deduced if the results of the macro parameters compared. 'were under different experimental conditions, see Table 4-8 through 4-11. The results in Table 4—8 through 4-11 implied that both the SPSS and VARCL programs estimates of four macro parameters were statistically significantly biased at the significant level of p = .05. The bias ranged between 9% and 17% smaller than the true value. Further investigation was carried out on the simulated program. First, all the commands of the simulated program and the transformation of the dependent variable into binary outcomes were rechecked. The sample size of the number of schools were increased and the programs were executed again. The ‘VARCL. estimates proved to be still significantly biased even with the increased sample size. The simulated program was then subdivided and each part analyzed separately. The distribution of the random effects was checked and found to be normal. The simulation was run for the fixed 86 multi-level logistic regression model excluding the random effects (Dd, U”) from the school level model. The findings indicated that the VARCL estimates of the ys were unbiased. The variance of U0’. and U”- were estimated as zero. These results indicated that the simulation program was working correctly with the fixed model. Three models were then run: (a) Model A (Random Intercept Logistic Regression Model) represented as Boj = ‘70. + U..- U.,-~N(0.T.o) ; (b) Model B (Fixed Intercept Logistic Regression Model) represented as a..=B u 2 8d = 7m ; and (C) Model C (Random Effects Intercept Logistic Regression Model) represented as a..=B d B.=U. 0] Uoj~N(0,T 00) This was done in order to isolate the effect of the independent variables of the school (29 and the students (X9 from the estimation parameters and help identify the source of the problem. The results in‘Table 4-12 using'Model B, the fixed model, showed that the VARCL estimates 7” and 1” were very close to the true ‘values. However, the results using the random intercept model, Model A, showed that the VARCL estimates, 87 Table 4-12.--Estimated 7m and estimated 1” for different models using VARCL estimation methods'. Estimated True Estimated Parameter Value Value MODEL A 7“ -l.800 -1.560 Std Dev. .019 Maximum -1.52 Minimum -1.59 1” .85 0.710 Std Dev. .048 Maximum .78 Minimum .64 MODEL § 7m -1.800 -l.789 Std Dev. .032 Maximum -1.74 Minimum -1.84 I” 0.00 0.001 Std Dev. .003 Maximum .01 Minimum .000 MOQ§L C 7” 0.00 -0.006 Std Dev. .021 Maximum .03 Minimum -.04 1” .85 0.605 Std Dev. .032 Maximum .65 Minimum .55 Note: ' 10 replications were used for each model. 88 both 700 and 700, were biased and under estimated the true values. These results were based on ten replications using 130 schools with 60 students within each school (N=7800) in each replication. rm was set to .85, 7” was set to -1.800 and U01' was centered (i.e., mean of zero) for these preliminary analyzes. Similar analyzes using model A were also performed for both the SPSS and VARCL estimation methods, where 7” was set to -1.00 (rather than -1.800 in the earlier analysis) and 1m was set to have different values: .04, 1.0, .30, .50, .70, .85. This was done because of the concern that earlier extreme magnitudes of the simulated value for 10°, and you may have caused the VARCL estimate to be biased and inconsistent. The results in Table 4-13 confirmed earlier findings. In fact, the results also indicated.that as the true value for rm increased from .04 to .85 the magnitude of bias for the VARCL estimate of both 7m and I” increased. Both estimates underestimated the true values. This result was based on 5 replications for each situation on a total of 7800 subjects in each replication ( j=130 groups, i=60 subjects). In fact, the real data analysis that was based on 59 schools consisting of 1244 students also showed that the VARCL estimates were of smaller magnitude than the MULTILOGIT estimates of Wong & Mason (see Table 4-1a and 4-1b). In addition, the results in Table 4-13 also indicated that the SPSS estimates of 7m,moved further away form the true value (i.e., the magnitude of bias increases) as the true 89 Table 4-13.--Estimated 7m and estimated 1” using VARCL programfi Estimation Methods Estimated Parameter VARCL SPSS VARCL SPSS VARCL SPSS Case 1 3 7m True Value -1.00 -1.00 -1.00 -1.00 -1.00 -1.00 Estimate -.986 -.986 -.966 -.966 -0.94 -.94 Stand. Dev. .035 .035 .014 .014 0.019 .019 Maximum -.93 -.93 -.951 -.951 -0.917 -.917 Minimum -1.04 -1.04 -.983 -.983 -0.964 -.964 100 True Value .04 .04 .100 .100 .300 .300 Estimate .037 NA .099 NA 0.291 NA Stand. Dev. .013 NA .026 NA 0.036 NA Maximum .06 NA .133 NA 0.332 NA Minimum .02 NA .063 NA 0.235 NA Case 5 6 7m True Value -1.00 -1.00 -1.00. -1.00 -1.00 -1.00 Estimate -0.900 -0.900 ‘-0.879 ‘-0.879 '-0.834 -0.834 Stand. Dev. 0.018 0.018 0.029 0.029 0.016 0.016 Maximum -0.872 -0.872 -0.838 -0.838 -0.817 -0.817 Minimum -0.919 -0.919 -0.915 -0.915 -0.854 -0.854 1“) True Value .500 .500 .700 .700 .85 .85 Estimate 0.420 NA 0.581 NA 0.609 NA Stand. Dev. 0.030 NA 0.042 NA 0.014 NA Maximum 0.459 NA 0.641 NA 0.619 NA Minimum 0.374 NA 0.543 NA 0.585 NA Note: ' 5 replications were used for each case with the exception of case number 1 where 10 replications were used. 90 value for rm increased from .04 to .85. The reason for this was because the model generated in the study did not fit the SPSS model. The SPSS program does not account for the effect of the random effects in the model. In fact, the SPSS estimates of 78 for the fully fixed model (generated by excluding the random effects, U5 and Ulj from the full random effects model, based on j=60 and i=60) were unbiased. see Table 4-14. The Fully Fixed model. The with-group model is represented as a-- = 20,. + 13.lj x.. u '1 The between-group model is represented as Boj = 700 + 701 zj Bu = 7m + 7n zj Table 4-14.--The true value, estimated 7’s, for three replications having the fully fixed model using the SPSS estimation method. Replication Estimated True First Second Third Mean of Parameter Value repl. repl. repl. ' Estimates yoo -1.800 -1.738285 -1.893364 -1.845619 -1.826 71o -.50 -0.416737 -0.627042 -0.401306 -0.482 111 .75 0.789271 0.762302 0.754291 0.769 701 -1.20 -1.156474 -1.291167 -1.189164 -1.212 91 The results of the above analyses indicated that perhaps the VARCL program is actually biased and inconsistent in estimating 7” and IN for the random effects model. Communication with Longford (1992). confirmed that a bias existed in the estimation of 7m by the VARCL program. The negative bias of the ML estimator of rm was partly due to the approximation of the maximum likelihood method. Therefore, the results of the analysis on real and simulated data using‘VARCL program should be looked at with caution as the program gives a negative bias estimator of both 7m and 1m. gomnarisnn of the Standard Ergo; n: tne 1's netween tne SPSS nnn VABCL Estimation Metnods One of the aims of the study was to compare estimates of the standard errors for the macro parameter, 7’s (i.e., 7a: 7“, and 7"), using the SPSS and the VARCL estimation methods. These three macro parameters were used in this analysis because the VARCL program printout did not report the standard error of the intercept coefficient, 7m. The estimates of the standard error of macro parameters were obtained for both estimation methods. The ' mean, the standard error, the MSE, and the bias of estimated standard errors of the three macro parameters (i.e., 7d, 7m, and 7") for each estimation method are shown in Table 4-15. In addition, its true value was obtained from the observed standard error of the estimated macro parameters from the 1200 replications shown in Table 4-7. Similar statistical values 92 were used to compare the properties and the accuracy of both estimation procedures for the estimated standard error of 7d, 7m, and 7” under six experimental conditions (see Tables 4-16 through 4-18) . The observed standard error of the three estimated macro parameters in Tables 4-9 through 4-11 were used as the true value of the standard error in the Tables 4-16 through 4-18. The:results showed.that, on the average, the estimates of the standard error of 7's for VARCL were consistently larger than the SPSS estimates of the standard error. In addition, the results in Table 4-15 showed, on the average, that the standard deviation for the estimates of the standard errors of 7's for VARCL and SPSS were close. The results also indicated that the VARCL. estimates of the standard error for 7d, 7m and 7” were less biased than the SPSS estimates. In fact, the 95% confidence interval for bias, shown in Tables 4-16 through 4-18, indicated generally that the VARCL estimate of the standard errors of the three macro parameters was unbiased. Furthermore, the MSE for the estimated standard error for both methods of 7M, 7“ and 7“ were also found to be quite close. However, the MSE for the estimated standard error of 7a was slightly smaller for VARCL than SPSS for each of the different experimental conditions, see Table 4-16. The probability of type I error was also investigated. The Z-score (Z = (Q - 7,)/S.E.(§)) was calculated for each replication, and the probability of the type I error rate 93 Table 4-15.--The true standard error', Mean, S.E., MSE, and bias for estimated standard error for macro parameters by the SPSS and VARCL estimation procedures*. a 'o th d SPSS VARCL Macro parameter 72 The True standard error .163 .163 Mean of Estimate .101 .160 S.E. of Estimate .044 .033 MSE of Estimate .006 .001 Bias -.062 -.003 95% CI Bias -.063,-.060 -.005,-.001 Percent of Bias 38% 2% Macro parameter 7“ The True standard error .112 .106 Mean of Estimate .090 .103 S.E. of Estimate .041 .040 MSE of Estimate .002 .002 Bias -.022 -.003 95% CI Bias -.024,-.020 -.005,-.001 Percent of Bias 20% 3% Macro parameter 7” The True standard error .144 .140 Mean of Estimate .112 .126 S.E. of Estimate .051 .051 MSE of Estimate .004 .003 Bias -.032 -.014 95% CI Bias -.034,-.030 -.016,-.012 Percent of Bias 22% 10% Note * 1200 replications were performed in each method. Lu ' The true value was obtained from the observed standard error of the estimated macro parameters by VARCL and SPSS estimation methods from the 1200 replications shown in Table 4-7. 94 Table 4-16.--The true standard error', Mean, S.E., MSE, and bias for estimated standard error of 72 by cell identification for the SPSS and the VARCL estimation procedure”. 0 SPSS VARCL .180 .182 .143 .186 .011 .019 .001 .000 -.037 .004 -004'-e035 0000' 001 Macro Parameter 7m 9211.12221111221122 1221212133221 Estimation Method SPSS VARCL The true standard error .193 .196 Mean of Estimate .144 .193 S.E. of Estimate .012 .020 MSE of Estimate .003 .000 Bias -.049 -.003 95% CI Bias -.05,-.04 -.01,0.00 Percent of Bias 25% 2% Cell identifigation (NQ,n60.BB§§) Estimation Method SPSS VARCL The true standard error .153 .148 Mean of Estimate .058 .139 S.E. of Estimate .003 .013 MSE of Estimate .009 .000 Bias -.095 -.009 95% CI Bias -.1,-.1 -.01,0.00 Percent of Bias 62% 6% W den ° ' t' W Estimation Method SPSS VARCL The true standard error .176 .178 Mean of Estimate .143 .186 S.E. of Estimate .011 .020 MSE of Estimate .001 .000 Bias -.033 .008 95% CI Bias -.031,-.03 0.0,.01 Percent of Bias 19% 5% 56% 21% 2% 60 s SPSS VARCL .133 .132 .058 .130 .003 .012 .006 .000 -.075 -.002 -.08,-.08 -.004,0.0 56% 2% (IQ,n6Q,BBSL) SPSS VARCL .133 .131 .058 .128 .003 .017 .006 .000 -.075 -.003 -.08,-.08 -.01,0.0 2% Note The true standard error value was obtained from the observed standard error of the estimated 7“ macro . parameter by cell identification for the SPSS and the VARCL estimation procedure shown in Table 4-9. ND -normal distribution of the random effects. TD -t-distribution of the random effects. -10 subjects within each group. -60 subjects within each group. n10 260 -200 replications were performed within each cell. RRSS-small magnitude of the random regression slope variance to the intercept variance (i.e, ins-995, 1"=.005). RRSL-large magnitude of the random regression slope variance to the intercept variance (i.e, 1m=.85, rn=.15). 95 Table 4-17.--The true standard error', Mean, S.E., MSE, and bias for estimated standard error of 7“ by cell identification for the SPSS and the VARCL estimation procedure”. Macro Parameter 7“ Cell identifigation (NQ,nlO,BBSS) (NQ‘le‘BBSLl Estimation Method SPSS VARCL SPSS VARCL The true standard error .133 .131 .138 .135 Mean of Estimate .130 .138 .131 .143 S.E. of Estimate .008 .011 .009 .014 MSE of Estimate .000 .000 .000 .000 Bias -.003 .007 -.007 .008 95% CI Bias -.01,0.0 0.0,.01 -.01,0.0 0.0,.01 Percent of Bias 2% 5% 5% 6% Ce ’ t' ‘ ' W88 122.252.113.322). Estimation Method SPSS VARCL SPSS VARCL The true standard error .080 .052 .081 .078 Mean of Estimate .050 .054 .050 .070 S.E. of Estimate .002 .004 .002 .008 MSE of Estimate .001 .000 .001 .000 Bias -.030 .002 -.031 -.008 95% CI Bias -.03,-.03 .002,.002 -.03,-.03 -.01,0.0 Percent of Bias 4% 4% 38% 10% W W10 8 1.111.222.3821). Estimation Method SPSS VARCL SPSS VARCL The true standard error .139 .135 .078 .072 Mean of Estimate .131 .143 .050 .069 S.E. of Estimate .008 .013 .002 .008 MSE of Estimate .000 .000 .001 .000 Bias -.008 .008 -.028 -.003 95% CI Bias -.01,0.0 0.0,.01 -.03,-.03 -.01,0.0 Percent of Bias 6% 6% 36% 4% Note The true standard error value was obtained from the observed standard error of the estimated 7“ macro parameter by cell identification for the SPSS and the VARCL estimation procedure shown in Table 4-10. -200 replications were performed within each cell. ND -normal distribution of the random effects. TD -t-distribution of the random effects. 210 -10 subjects within each group. n60 -60 subjects within each group. RRSS-small magnitude of the random regression slope variance to the intercept variance (i.e, 1m=.995, 1"=.005). RRSL-large magnitude of the random regression slope variance to the intercept variance (i.e, 1w=.85, 1“=.15). 96 Table 4-18.--The true standard error‘, Mean, S.E., MSE, and bias for estimated standard error of 7" by cell identification for the SPSS and the VARCL estimation procedure”. Macro Parameter 7” 22111222122212 N 10 S 0 Estimation Method SPSS VARCL SPSS VARCL The true standard error .172 .167 .178 .184 Mean of Estimate .163 .173 .162 .176 S.E. of Estimate .016 .018 .014 .019 MSE of Estimate .000 .000 .000 .000 Bias -.009 .006 -.016 -.008 95% CI Bias -.01,0.0 0.0,.01 -.02,-.01 -.01,0.0 Percent of Bias 5% 4% 9% 4% Cell igentificgtion (ND,n60,gB§§) (up,n60,ggsn) Estimation Method SPSS VARCL SPSS VARCL The true standard error .085 .071 .095 .084 Mean of Estimate .062 .068 .061 .083 S.E. of Estimate .003 .005 .003 .008 MSE of Estimate .001 .000 .001 .000 Bias -.023 -.003 -.034 -.001 95% CI Bias -.02,-.02 -.003,-.003 -.03,-.03 -.002,0.0 Percent of Bias 27% 4% 36% 1% 9211.12221111221122 11212121332L1 11212222832L1 Estimation Method SPSS VARCL SPSS VARCL The true standard error .195 .193 .089 .082 Mean of Estimate .163 .177 .062 .082 S.E. of Estimate .015 .018 .003 .008 MSE of Estimate .001 .001 .001 .000 Bias -.032 -.016 -.027 .000 95% CI Bias -.034,-.03 -.02,-.01 -.03,-.03 0.0,0.0 Percent of Bias 16% 8% 30% 0% Note The true standard error value was obtained from the observed standard error of the estimated 7“ macro parameter by cell identification for the SPSS and the VARCL estimation procedure shown in Table 4-11. -200 replications were performed within each cell. ND -normal distribution of the random effects. TD -t-distribution of the random effects. n10 -10 subjects within each group. n60 -60 subjects within each group. RRSS-small magnitude of the random regression slope variance to the intercept variance (i.e, 1w=.995, 1"=.005). RRSL-large magnitude of the random regression slope variance to the intercept variance (i.e, 1m=.85, 1"=.15). 97 under a true null hypothesis (Ho: 7 = 7, ) was determined by counting the frequency with which the z-score exceeded the critical value for .05 significance level and dividing by the total number of replications (i.e. 1200). The results in Table 4-19 showed that the probability of type I error rate under a true null for the VARCL tests of the macro parameters 7's were, on the average, relatively smaller than the SPSS error rates. However, both estimation methods gave unacceptable high type I error rates (i.e., p > .05). This was confirmed by further investigation of the probability of type I error rates under a true null hypothesis under the different experimental conditions (i.e., six cells), see Table 4-20. Table 4-19.--The probability of type I error rates' for tests of macro estimators under a true null by the SPSS and VARCL estimation procedures+. Macro Parameter Estinatinn Method SPSS VARCL 72 .438 .136 7“ .223 .146 7" .261 .165 Note + From 1200 replications were performed for each method. ° .05 significance level. Similar results were also reported in several other research studies that compared the single-level regression 98 Table 4-20.—-The probability of type I error rates+ for tests of macro estimators under a true null by cell identification for the SPSS and the VARCL estimation procedureh Cell igennificetion Estimation Method é§§§2121252§1L SgSS VARCL 7a .255 .075 . .275 .125 7m .085 .065 .110 .090 7n .125 .080 .125 .105 Qell identification (HQ,n§0,gBSS) (ND.DGQ.BB§L) Estimation Method SPSS VARCL‘ SPSS VARCL 7a .650 .200 .610 .165 7m .280 .220 .405 .245 7“ .410 .355 .370 .155 921L12g11111222122 112.2110 QSL) Estimation Method SPSS VARCL SPSS VARCL 7a .225 .090 .610 .160 Tn .075 .065 .385 .190 7" .155 .110 .380 .185 Note 4. -.05 significance level. - -200 replications were performed within each cell. ND -normal distribution of the random effects. TD -t-distribution of the random effects. n10 -10 subjects within each group. n60 -60 subjects within each group. RRSS-small magnitude of the random regression slope variance to the intercept variance (i.e, 1w=.995, r"=.005). RRSL-large magnitude of the random regression slope variance to the intercept variance (i.e, 1m=.85, 1”=.15). 99 model with the multilevel regression model (Walsh, 1947; .Aitkin, Anderson, and Hinde, 1981; Raudenbush and Bryk, 1989). These studies concluded that using the single level model instead of the multilevel model increased the probability of a type I error. In the case of the SPSS estimation procedure, this was due to the liberal t-statistic values caused by small standard error estimates for the regression coefficients when assuming the single-level logistic regression model. T e f ect o o 's The effect of the number of units within each group (i.e., n=10 versus n=60) for both estimation procedures was evaluated based on the following statistics: (a) the macro parameters estimates, and (b) the estimates of standard errors of the macro parameters. The 1200 replications were split into two categories based on the number of units (subjects) within each group. This resulted in 600 replications in the first category where n=10, and 600 replications in the second category where n=60. In addition, Tables 4-8 through 4-11, show the effects of different sample sizes on the four macro parameters under different experimental conditions. The effect of the number of units (subjects) within each group for both the VARCL and SPSS estimation methods on the macro parameter estimates was similar. As such, the following discussion of the effect of number of units on the macro parameters applies equally to both the VARCL and SPSS 100 estimation procedures. Examination of the bias estimates for 7m, 7d, 7m and 7”, on the average, indicated a sightly negative effect of an increasing number of units within each group on the above macro parameters, see Table 4-21. The VARCL bias of 7m, 7m, 7m and y" (.234, .127, .076 and -.089) when n=60 were larger (.221, .125, .068 and -.076) when n=10. However, the standard error and MSE of the 7”, Va: 7»: and 7" estimates were smaller when n=60 than when n=10. For example, with n=10, and 60 the standard error and MSE for VARCL 7m macro parameter estimate dropped from .134 and .023 to .069 and .010, respectively. Ine Effect 9f n on tne Estimateg §tanderd Ergo; gt 1 The results of the analyze are shown in Tables 4-22 and 4-16 through 4-18. 0n the average, the estimated standard error for the three macro parameters was found to be smaller when n=60 than when n=10, for both estimation procedures. In addition, the results also confirmed the earlier finding that, on the average, the estimated standard error of the 7's of VARCL was consistently larger than the. SPSS estimates. The results in Tables 4-16 through 4-18 and 4-22 indicated that generally, VARCL estimate of the standard error of macro parameters were unbiased. In addition, the results in'Table 4-22 indicated that, on the average, increasing the number of subjects within each 101 Table 4-21.--The true value, Mean, S.E., MSE, and bias for ' estimated 7's by the number of subject within each group for the SPSS and VARCL estimation procedure. st at'o ethod SPSS VARCL SPSS VARCL Number Of Subject Within Each Group+ 10 10 60 60 Macro Parameter 7m The True Value -1.800 -1.800 -l.800 -1.800 Mean of Estimate -1.576 -1.579 -l.561 -l.566 S.E. of Estimate .167 .167 .130 .128 MSE of Estimate .078 .077 .074 .071 Bias .224 .221 .239 .234 95% CI Bias .21,.24 .21,.23 .23,.25 .22,.24 Percent of Bias 12% 12% 13% 13% Macro Parameter 72 The True Value -1.200 -l.200 -1.200 -l.200 Mean of Estimate -1.064 -1.075 -1.060 -1.073 S.E. of Estimate .183 .185 .140 .137 MSE of Estimate .052 .050 .039 .035 Bias .136 .125 .140 .127 95% CI Bias .12,.15 .11,.14 .13,.15 .12,.14 Percent of Bias 11% 10% 12% 11% Macro Parameter 72 The True Value -.500 -.500 -.500 -.500 Mean of Estimate -.434 -.432 -.426 -.424 S.E. of Estimate .136 .134 .080 .069 MSE of Estimate .023 .023 .012 .010 Bias .066 .068 .074 .076 95% CI Bias .05,.08 .06,.08 .07,.08 .07,.08 Percent of Bias 13% 14% 15% 15% Macro Parameter 1” The True Value .75 .75 .75 .75 Mean of Estimate .668 .674 .656 .661 S.E. of Estimate .182 .181 .090 .079 MSE of Estimate .040 .039 .017 '.014 Bias -.082 -.076 -.094 -.089 95% CI Bias -.10,-.07 -.09,-.06 -.10,-.09 -.09,-.08 Percent of Bias 11% 10% 13% 12% Note: * 600 replications were performed for each estimation method. 102 Table 4-22.--The true standard error', Mean, S.E., MSE, and bias for estimated standard error for macro parameters by the number of subject within each group for the SPSS and VARCL estimation procedures. s at od SPSS VARCL SPSS VARCL Number Of Subject Within Each Group+ 10 10 60 60 Macro Parameter 7d The True standard error.183 .185 .140 .137 Mean of Estimate .143 .188 .058 .132 S.E. of Estimate .012 .020 .003 .015 MSE of Estimate .002 .000 .007 .000 Bias -.040 .003 -.082 -.005 95% CI Bias -.04,-.04 .001,.005 -.082,-.082 -.007,-.003 Percent of Bias 22% 2% 59% 4% Macro Parameter 1h The True standard error.136 .134 .080 .069 Mean of Estimate .131 .141 .050 .064 S.E. of Estimate .008 .013 .002 .010 MSE of Estimate .000 .000 .001 .000 Bias -.005 .007 -.030 -.005 95% CI Bias -.005,-.005 .005,.009 -.03,-.03 -.005,-.005 Percent of Bias 4% 5% 38% 7% Macro Parameter y" The True standard error.182 .181 .090 .079 Mean of Estimate .162 .175 .062 .077 S.E. of Estimate .015 .018 .003 .010 MSE of Estimate .001 .000 .001 .000 Bias -.020 -.006 -.028 -.002 95% CI Bias -.022,-.018 -.008,-.004 -.028,-.028 -.002,-.002 Percent of Bias 11% 3% 31% 3% Note * 600 replications were performed for each estimation method. ' The true value was obtained from the observed standard error of the estimated macro parameters by the number of subject within each group for the SPSS and VARCL estimation procedures shown in Table 4-21. 103 group had slight effect on the standard deviation and MSE of the VARCL estimated standard error for 7d, 7M and 7“. In addition, the results indicated that the SPSS estimated standard error of the macro parameters were statistically significantly biased. In fact, the absolute magnitude of bias increased as the sample size increased. For example, Table 4-22 indicated that, on the average, the percent of bias for ya, 7“ and 7” when n=10 were 22%, 4%, and 11% respectively. This increased to 59%, 38%, and 31% respectively when n=60. Finally, the results also indicated that for different sample sizes, the MSE for the standard error of 7m and 7" for both estimation methods were very close. However, for ya, the MSE of VARCL is smaller than the SPSS estimate. The e t o th andom ects D'st 'but'on 's The effect of having a normal distribution (ND) versus t- distribution (TD) of the random effects, 05 and U”, for both estimation procedures was evaluated on the basis of the following statistics: (a) the macro parameters estimates, and (b) the estimates of standard errors of the macro parameters. The 1200 replications were again split into two categories based on distribution of the random effects (i.e., ND vs. TD). This resulted in 800 replications in the first category where the distribution of the random effects was normally distributed, and 400 replications in the second category where distribution of the random effects was t- 104 distributed. This uneven balance of the replications between the two categories was caused by simulating a t-distribution of the random effects only under large values of the random regression slope variance (i.e., RRSL). Using the replications within each category, the true value, mean, standard error, MSE, and bias of estimated macro parameters for each estimation method are shown in'Table 4-23. In addition, Tables 4-8 through 4-11 show the effects of the random effects. distribution. on four ‘macro parameters by different experimental conditions. The results indicated that there were no»clear effects of the random effects distribution on estimation of three macro parameters. 72. 71. 7m and Yu- The Effect of the.3andom Effects Qistribution en the Eetineteg sgandand Error of 1's The averaged standard.deviation, the MSE, and the bias of estimated standard error of the three macro parameters for both the estimation methods were compared. This was done in terms of having normal distribution (ND) versus t-distribution (TD). The results are shown in Tables 4-24 and 4-16 through 4-18. The results indicated that having a normal distribution or a t-distribution of the random effects had no clear effect on the estimated standard error of the macro parameters. 105 Table 4-23.--The true value, Mean, S.E., MSE, and bias for estimated 7's by the distribution of the random effects for the SPSS and VARCL estimation procedure. Estimation Method SPSS VARCL SPSS VARCL The distribution of the random effects normal distribution+ t-distributionH Macro Parameter 7“ The True Value -1.800 -1.800 -1.800 -1.800 Mean of Estimate -1.561 -1.565 -1.584 -1.587 S.E. of Estimate .153 .153 .142 .141 MSE of Estimate .081 .078 .067 .065 Bias .239 .235 .216 .213 95% CI Bias .23,.25 .23,.24 .20,.23 .20,.23 Percent of Bias 13% 13% 12% 12% Macro Parameter 72 The True Value -1.200 -l.200 -1.200 -1.200 Mean of Estimate -1.060 -1.073 -1.066 -1.076 S.E. of Estimate .166 .167 .156 .156 MSE of Estimate .047 .044 .042 .040 Bias .140 .127 .134 .124 95% CI Bias .13,.15 .12,.14 .12,.15 .11,.14 Percent of Bias 12% 11% 11% 10% Macro Parameter 7h The True Value -.500 -.500 -.500 -.500 Mean of Estimate -.429 -.427 -.432 -.431 S.E. of Estimate .111 .105 .112 .108 MSE of Estimate .017 .016 .017 .017 Bias .071 .073 .068 .069 95% CI Bias .06,.08 .07,.08 .06,.08 .06,.08 Percent of Bias 14% 15% 14% 14% Macro Parameter 7” The True Value .75 .75 .75 .75 Mean of Estimate .659 .663 .668 .676 S.E. of Estimate .139 .136 .152 .148 MSE of Estimate .028 .026 .030 .027 Bias -.091 -.087 -.082 -.074 95% CI Bias -.10,-.08 -.10,-.08 -.10,-.07 -.09,-.06 Percent of Bias 12% 12% 11% 10% Note * 800 replications were performed for each estimation method. ** 400 replications were performed for each estimation method. 106 Table 4-24.--The true standard erraffl Mean, S.E., MSE, and bias for estimated standard error for macro parameters by the distribution of the random effects for the SPSS and VARCL estimation procedure. t' o et od SPSS VARCL SPSS VARCL The distribution of the random effects normal distribution+ t-distributionH Macro Parameter 7d The True standard error.166 .167 .156 .156 Mean of Estimate .100 .162 .101 .157 S.E. of Estimate .044 .032 .043 .035 MSE of Estimate .006 .001 .005 .001 Bias -.066 -.005 -.055 .001 95% CI Bias -.07,-.06 -.007,-.003 -.06,-.05 -.003,.005 Percent of Bias 40% 3% 35% 1% Macro Parameter 7h The True standard error.111 .105 .112 .108 Mean of Estimate .090 .101 .091 .106 S.E. of Estimate .041 .041 .041 .039 MSE of Estimate .002 .002 .002 .001 Bias -.021 -.004 -.021 -.002 95% CI Bias -.023,-.02 -.006,-.002 -.025,-.017 -.006,.002 Percent of Bias 19% 4% 19% 2% Macro Parameter 7“ The True standard error.139 .136 .152 .148 Mean of Estimate .112 .125 .113 .130 S.E. of Estimate .052 .052 .051 .050 MSE of Estimate .003 .003 .004 .003 Bias -.027 -.011 -.039 -.018 95% CI Bias -.03,-.023 -.015,-.007 -.045,-.033 -.022,-.014 Percent of Bias 19% 8% 26% 12% Note * 800 replications were performed for each estimation method. ** 400 replications were performed for each estimation method. ' The true value was obtained from the observed standard error of the estimated macro parameters by the distribution of the random effects for the SPSS and VARCL estimation procedures shown in Table 4-23. 107 e ec t S Va 'a ce a 'tude o ’s The effects of the magnitude of the random regression slope (RRS) variance to the intercept variance (RRSS versus RRSL) were evaluated for both estimation procedures on the basis of the following statistics: (a) the macro parameters estimates, and (b) the estimates of the standard errors of the macro parameters. The 1200 replications were split into two categories based on magnitude of the random regression slope variance (i.e., RRSS vs. RRSL). This resulted in 400 replications in the first category where the random regression slope variance was small and 800 replications in the second category where the random regression slope variance was large. As mentioned earlier, this uneven balance of replications between the two categories was caused by simulating a t-distribution of the random effects only under a large random regression slope variance (i.e., RRSL), see Table 4-25. In addition, Tables 4-8 through 4-11 show the effect of RRSS vs. RRSL on four macro jparameters under different experimental conditions with 200 replications performed within each cell. The results of the analyzes shown in Tables 4-8 through 4-11 indicated no clear effect of RRSS vs RRSL on the macro parameters, estimates you and 10,. However, the results in Tables 4-10 and 4-11 indicated that the macro parameters estimates, 7,0 and 7,, generally had a smaller MSE and bias under RRSS than RRSL. For example, when n=60 with RRSL, the Table 4-25.—-The true value, Mean, S.E., MSE, and bias for 108 estimated 7’s by the magnitude of random regression slope variance for the SPSS and VARCL estimation procedure. st mat o et d SPSS VARCL SPSS VARCL The Magnitude of Random Regression Slope Variance to the Intercept Variance SMALL+ SMALL+ LARGE++ LARGE++ Macro Parameter 7” The True Value -l.800 -1.800 -1.800 -1.800 Mean of Estimate -1.555 -1.559 -1.575 -1.579 S.E. of Estimate .159 .156 .145 .145 MSE of Estimate .085 .083 .072 .070 Bias .245 .241 .225 .221 95% CI Bias .23,.26 .23,.26 .22,.23 .21,.23 Percent of Bias 14% 13% 13% 12% Macro Parameter 72 The True Value -1.200 -1.200 -1.200 -1.200 Mean of Estimate -1.063 -1.072 -1.062 -l.075 S.E. of Estimate .174 .174 .157 .157 MSE of Estimate .049 .047 .044 .040 Bias .137 .128 .138 .125 95% CI Bias .12,.15 .11,.15 .13,.15 .11,.14 Percent of Bias 11% 11% 12% 10% Macro Parameter 7” The True Value -.500 -.500 -.500 -.500 Mean of Estimate -.435 -.435 -.428 -.425 S.E. of Estimate .110 .100 .113 .109 MSE of Estimate .016 .014 .018 .018 Bias .065 .065 .072 .075 95% CI Bias .06,.07 .06,.07 .06,.08 .07,.08 Percent of Bias 13% 13% 14% 15% Macro Parameter 7” The True Value .75 .75 .75 .75 Mean of Estimate .663 .664 .662 .669 S.E. of Estimate .136 .129 .147 .146 MSE of Estimate .026 .024 .029 .028 Bias -.087 -.086 -.088 -.081 95% CI Bias -.10,-.07 -.10,-.07 -.10,-.08 -.10,-.07 Percent of Bias 12% 12% 12% 11% Note * 400 replications were performed. ** 800 replications were performed. 109 MSE and percent of bias for estimated 7“ were .014 and 17%. The same estimates with.RRSS were slightly smaller at .007 and 13% respectively. The results in Table 4-26 indicated that SPSS estimate of 7m andyll had a slightly smaller bias and.MSE for RRSS than for RRSL. e et o the anitude e S V 'a ce n he Esnimated Egandard Error of 1’s The mean, standard deviation, MSE and bias of estimated standard errors of the three macro parameters for both the estimation methods were compared in terms of small versus large variancezof the random regression slope to the intercept variance (i.e., RRSS versus RRSL). See Tables 4-26, and 4-16 through 4-18. The results indicated that RRSS and RRSL had no clear effects on the estimated standard error of the macro parameters. In addition, the results again indicated clearly a smaller bias for 7d, 7m and 7“ with the VARCL estimates when compared with the SPSS estimates of the standard error of the same macro parameters in either condition (i.e., RRSS versus RRSL). Qnecking the Accnzecy of fine Vezience-Covaniance Connenen; e: do ec s stimate s'n V CL One purpose of this study was to investigate the accuracy of the VARCL estimation method in estimating the variance- 110 Table 4-26.--The true standard error', Mean, S.E., MSE, and bias for estimated standard error for macro parameters by the magnitude of random regression slope variance for the SPSS and VARCL estimation procedure. Estimation nethog SPSS VARCL SPSS VARCL The Magnitude of Random Regression Slope Variance to the Intercept Variance SMALL+ SMALL‘ LARGE++ LARGE** Macro Parameter 7d The True standard error .174 .174 .157 .157 Mean of Estimate .101 .166 .101 .157 S.E. of Estimate .044 .032 .043 .033 MSE of Estimate .007 .001 .005 .001 Bias _ -.073 -.008 -.056 .001 95% CI Bias -.08,-.07 -.012,-.004 -.06,-.05 -.002,.002 Percent of Bias 42% 5% 36% 0% Macro Parameter 7h The True standard error .110 .100 .113 .109 Mean of Estimate .090 .096 .091 .106 S.E. of Estimate .041 .043 .041 .038 MSE of Estimate .002 .002 .002 .001 Bias -.020 -.004 -.022 -.003 95% CI Bias -.02,-.016 -.Ol,.00 -.024,-.02 -.005,-.001 Percent of Bias 18% 4% 19% 3% Macro Parameter 7“ The True standard error .136 .129 .147 .146 Mean of Estimate .112 .120 .112 .130 S.E. of Estimate .052 .054 .051 .049 MSE of Estimate .003 .003 .004 .003 Bias -.024 -.009 -.035 -.016 95% CI Bias -.03,-.02 -.015,-.003 -.04,-.03 -.02,-.01 Percent of Bias 18% 7% 24% 11% Note * 400 replications were performed. ** 800 replications were performed. 'The true value was obtained from the observed standard error of the estimated macro parameters by the magnitude of random regression slope variance for the SPSS and VARCL estimation procedures shown in Table 4-25. 111 covariance components.of the random effects (i.e., 1w, 1”, and 721- In order to do this, several statistics were computed that took into account the difference between the VARCL estimated variance-covariance components of the random effects and its true value. Another purpose of the study was to investigate the effect of different combined conditions on the estimates of the variance-covariance components of the random effects. t e ccu ac o e 'm ted ta' e V539; Eetimation Eennod Table 4-27 shows the true value, mean, standard error, MSE, and bias statistics for 1w . These statistics were used to evaluate the accuracy of the VARCL estimation method of I” using the true values across six cells. The results suggested that the VARCL estimates of the 1m parameter were significantly biased. The negative sign of the bias implied that the VARCL underestimated I”. In fact, the percent of the bias for the six cells ranged between 15% and 26%. When the bias was arranged from the smallest to the largest values, the magnitude of the bias was smaller as the number of units within each group increased. Similar trends were observed when the MSE was ordered in terms of magnitude. e f t The bias and MSE for estimated rm under different sample sizes, while holding the other factors fixed (i.e., 112 Table 4~27.~~The true value, Mean, S.E., MSE, and bias for estimated 100 by cell identification for the VARCL estimation procedureh Qell identificetion The True Value Mean of Estimate S.E. of Estimate MSE of Estimate Bias 95% CI Bias Percent of Bias Cell identification The True Value Mean of Estimate S.E. of Estimate MSE of Estimate Bias 95% CI Bias Percent of Bias Cell identification The True Value Mean of Estimate S.E. of Estimate MSE of Estimate Bias 95% CI Bias Percent of Bias 1321212133221 13212121BB§L1 .995 .85 .754 .625 .277 .264 .134 .120 -.241 -.225 -.28,-.20 -.26,-.19 24% 26% D 50 S iflflinQQLBfifiLl .995 .85 .846 .699 .211 .171 .067 .052 -.149 -.151 -.18,-.12 -.17,-.13 15% 18% 0 0 s (TQ.2§O.BB§L) .85 .85 .633 .669 .358 .271 .174 .106 -.217 -.181 -.27,-.17 -.22,-.14 26% 21% Note n10 ~10 subjects within each group. n60 ~60 subjects within each group. RRSS-small magnitude of the random regression slope variance to the intercept variance. RRSL-large magnitude of the random regression slope variance to the intercept variance. ~200 replications were performed within each cell. ND ~norma1 distribution of the random effects. TD ~t~distribution of the random effects. 113 (ND,n10,RRSS) versus (ND,n60,RRSS), (ND,n10,RRSL) versus (ND,n60,RRSL), and (TD,n10,RRSL) versus (TD,n60,RRSL)} were consistently smaller for n=60 than for n=10 (see Table 4~27). For example, comparing the (ND,n10,RRSL) versus (ND,n60,RRSL) cells, the bias and MSE for the rm parameter dropped from ~.225 and .120 to ~.151 and .052, respectively. fecto te aitueo tie SVaiace n e Estimafed I” The results indicated that the estimated 100 parameter had a slightly smaller percent of bias for RRSS than for RRSL. This was done by comparing the (ND,n10,RRSS) versus (ND,n10,RRSL) and (ND,n60,RRSS) versus (ND,n60,RRSL) cells (see Table 4~27). ,e _,,ect o he Random -_ -cts Ifst 'outfon .1 -_ - .t-d ;m The results showed that the estimated 1” parameter had a smaller MSE for a normal distributiOn when compared to a t- distribution. By comparing the cells (ND,n10,RRSL) versus (TD,n10,RRSL) for n=10, the bias was about the same for both the random effects distributions (see Table 4~27). By comparing the cells (ND,n60,RRSL) versus (TD,n60,RRSL), with n=60, the bias was slightly smaller when the random effects distribution was normally distributed (~.151) as compared to when it was t~distributed (~.181) (see Table 4~27). 114 Qnecning tne Accnfecy of tne Estimated Stangerd Efner of 11w entained ffom VAECL Estimation procedufe Table 4~28 shows several statistics (i.e., true value, mean, standard error, MSE, and bias statistics) used to evaluate the accuracy of the VARCL estimation for the estimated standard error of V1” (notice that VARCL program reports the standard error for «7” rather than I”) using the true values across six cells. The results suggested that the VARCL estimates of the standard error for V100 were significantly biased, and the magnitude of the bias and.MSE was smaller as the number of the units within each group increased. Tne Effect ef n on Estinated Standafg Error ef 11“ While holding the other factors fixed, the bias, the percent of bias, the standard deviation, and the MSE for estimated standard error for «Iron under different sample sizes, were consistently smaller for n=60 than for n=10. see Table 4-28. ect . 1e '.- 5 ude . e 1:8 V. 'a. - .~ ,st'm.t-- mm... The results indicated a slightly larger percent of bias with RRSL than with RRSS for the estimated standard error of V1”, after holding the other factors fixed. see Table 4~28. 115 Table 4~28.-~The true value', Mean, S.E., MSE, and bias for estimated standard error for «1” by cell identification for the VARCL estimation procedure”. 921L1222111i221122 W0 S 122.212.288.221 The True Value .085 .084 Mean of Estimate .167 .171 S.E. of Estimate .011 .013 MSE of Estimate .007 .008 Bias .082 .087 95% CI Bias .08,.084 .085,.089 Percent of Bias 96% 104% 9211.12222121222122 iflflinéflififiéfii iflflin§Q1BB§L1 The True Value .096 .083 Mean of Estimate .100 .094 S.E. of Estimate .008 .007 MSE of Estimate .000 .000 Bias .004 .011 95% CI Bias .002,.006 .009,.013 Percent of Bias 4% 13% The True Value .121 .117 Mean of Estimate .173 .093 S.E. of Estimate .022 .011 MSE of Estimate .003 .001 Bias .052 ~.024 95% CI Bias .048,.056 -.026,-.022 Percent of Bias 43% 21% Note ' ~The true value was obtained form the standard deviation of the Vrm’s(i.e., square root of the true parameter, I”) for each corresponding cell. ~200 replications were performed within each cell. ND ~normal distribution of the random effects. TD ~t~distribution of the random effects. n10 ~10 subjects within each group. n60 ~60 subjects within each group. RRSS-small magnitude of the random regression slope variance to the intercept variance. RRSL-large magnitude of the random regression slope variance to the intercept variance. 116 e ect o the andom f ects istributio on st' at d Standard Error of VT” For n=10 the results in Table 4~28 indicated slightly a smaller bias and MSE but a slightly larger standard deviation on estimated standard error for «1” that had a t~distribution rather than one with a normal distribution. However, for n=60, the results indicated. a smaller 'bias, MSE and standard deviation for the normal distribution than the t~distribution. ghecging tne Accuracy of Estimated In o t ' e b t e V Estimation Method Table 4~29 contains several statistics used to evaluate the accuracy of the VARCL estimation method of 7“ using the true values across six cells. First, the results indicated that the VARCL estimates of 1” were significantly biased, and the percent of bias was very large in (ND,n10,RRSS) cell. However, the magnitude of bias was reduced by increasing the sample size. In addition, the results also indicated that the size of MSE was clearly affected by the number of units within each group. The larger the sample size, the smaller the MSE. Wu The results indicated that the MSE for estimated 1” was smaller when n=60 than when n=10. 117 Table 4~29.~~The true value, Mean, S.E., MSE, and bias for estimated T“ by cell identification for the VARCL estimation procedureh Cell identification D 10 SS N 0 S The True Value .005 .15 Mean of Estimate .057 .134 S.E. of Estimate .086 .168 MSE of Estimate .010 .028 Bias .052 ~.016 95% CI Bias .04,.06 -.04,.01 Percent of Bias 1040% 11% Cell identification ND n60 SS (ND.n60,EESL) The True Value .005 .15 Mean of Estimate .014 .130 S.E. of Estimate .019 .061 MSE of Estimate .000 .004 Bias .009 ~.020 95% CI Bias .007,.01 -.03,-.01 Percent of Bias 180% 13% Cell identification (TD,nl0,RR§L) (IQ,n60,EE§L) The True Value .15 .15 Mean of Estimate .131 .120 S.E. of Estimate .170 .066 MSE of Estimate .029 .005 Bias -.019 -.030 95% CI Bias -.04,.005 ~.04,-.02 Percent of Bias 13% 20% Note ~200 replications were performed within each cell. ND ~normal distribution of the random effects. TD ~t~distribution of the random effects. n10 ~10 subjects within each group. n60 ~60 subjects within each group. RRSS-small magnitude of the random regression slope variance to the intercept variance. RRSL-large magnitude of the random regression slope variance to the intercept variance. 118 The Effect of Magnituge of tne BBS yeriance on tne Esfimated In The cells (ND,n10,RRSS) versus (ND,n10,RRSL) and (ND,n60,RRSS) versus (ND,n60,RRSL) were compared (see Table 4~29). The results indicated that the percent of bias of estimated 1" was much larger with RRSL than with RRSS. See Table 4~29. The Effecf of Random Effects Eistfibution on Estimefeg 1" By comparing the cells (ND,n10,RRSL) versus (TD,n10,RRSL) and (ND,n60,RRSL) versus (TD,n60,RRSL) the results for n=60 indicated that the 1,, parameter estimate had a slightly smaller bias, percent of bias, MSE, and standard deviation when the random.effects.had.a:normal distribution than when it had a t~distribution (see Table 4~29). Qnecfiing the Accuracy of the Eetimated Sfandafg Erfo; ef 11“ ed om t e V RC st' at' n et od Table 4~30 shows the true value, mean, standard deviation, and bias of the VARCL estimated standard error for V1" across six cells, having 200 replications within each cell. e t on st te a or o n The results in Table 4~30 suggested that the VARCL estimates of the standard error for V1" were significantly biased, and the percent of bias was very large specially for 119 Table 4~30.~~The true value', Mean, S.E., MSE, and bias for estimated standard error for «1" by cell identification for the VARCL estimation procedure”. 9211.12222111221122 .LEQIDLQIBB§§1 .iflflinlfliBfiéLl The True Value .006 .032 Mean of Estimate .311 .309 S.E. of Estimate .165 .141 MSE of Estimate .120 .096 Bias .305 .277 95% CI Bias .28,.33 .26,.30 Percent of Bias 5083% 865% W9 'et' 'a'on W60 122.222.3821.). The True Value .006 .034 Mean of Estimate .134 .076 S.E. of Estimate .095 .013 MSE of Estimate .025 .002 Bias .128 .042 95% CI Bias .11,.14 .04,.044 Percent of Bias 2133% 124% 921W 110.212.3822). W The True Value .051 .054 Mean of Estimate .300 .079 S.E. of Estimate .169 .015 MSE of Estimate .091 .001 Bias .249 .025 95% CI Bias .23,.27 .023,.027 Percent of Bias 488% 46% I; -.. Note ' ~The true value was obtained form the standard deviation of the «rn's(i.e., square root.of the true parameter, r") for each corresponding cell. ~200 replications were performed within each cell. ND ~normal distribution of the random effects. TD ~t~distribution of the random effects. n10 ~10 subjects within each group. 260 ~60 subjects within each group. RRSS-small magnitude of the random regression slope variance to the intercept variance. RRSL-large magnitude of the random regression slope variance to the intercept variance. 120 small sample sizes. In addition, the magnitude of the bias, standard deviation, and MSE were reduced as the sample size within each group increased, holding the other factors as fixed. he ect the Ma 'tude e RR Va 'a e the Eetimated Standard Erfef of 11" Comparing (ND,n10,RRSS) versus (ND,n10,RRSL) and (ND,n60,RRSS) versus (ND,n60,RRSL) , the results suggested that increasing the magnitude of RRS variance to the intercept variance led to a smaller bias, MSE, and standard deviation estimate of the standard error for VT“. ,- _ ect o t e '. do _ _ects ITTt,f-u '- .1 ,- 2- '1- -. Sfendafd Erfef 9f 11” The results indicated a smaller percent of bias for the estimated standard error for V1" when the random effects had a t~distributed than when it was normally distributed, holding other factors as fixed. 9222151119 tne neeufacy nf fne Esfimeteg 1°, Obteineg my fne V C st a ' Met od Table 4~31 contains several statistics used to evaluate the accuracy of the VARCL estimation method of rd using the true value, across six cells. 121 Table 4-31.~~The true value, Mean, S.E., MSE, and bias for estimated 10, by cell identification for the VARCL estimation procedurefi Qell igentification (ND.le,EESS) (EQ,nlO,BESL) The True Value .000 .000 Mean of Estimate .033 .030 S.E. of Estimate .151 .158 MSE of Estimate .024 .026 Bias .033 .030 95% CI Bias .01,.05 .01,.05 Percent of Bias NA NA Cell igentification (NQ,n69,EESS) (EQ,n60,EESL) The True Value .000 .000 Mean of Estimate .019 .002 S.E. of Estimate .063 .078 MSE of Estimate .004 .006 Bias .019 .002 95% CI Bias .011,.03 -.009,.014 Percent of Bias NA NA Cell identification (1D,le,EESL) ilfllnéfliflfifiLl The True Value .000 .000 Mean of Estimate .020 .013 S.E. of Estimate .184 .084 MSE of Estimate .034 .007 Bias .020 .013 95% CI B135 -0005, 005 0001’ 002 Percent of Bias NA NA Note ~200 replications were performed within each cell. ND ~normal distribution of the random effects. TD ~t~distribution of the random effects. n10 ~10 subjects within each group. n60 ~60 subjects within each group. RRSS-small magnitude of the random regression slope variance to the intercept variance. RRSL-large magnitude of the random regression slope variance to the intercept variance. NA ~ Not Applicable. 122 T f ct 0 st' ted 1 The results in Table 4~31 indicated that the size of bias, standard.deviation, and MSE were clearly affected by the number of units within each group. The larger the sample size, the smaller the bias, standard deviation and MSE for the estimated rd. Tne Effect of the Eagnifude of tne BBS Verience on fne Estimateg I“ By comparing the cell (ND,n10,RRSS) versus (ND,n10,RRSL) , it was found that the effect of magnitude of the random regression slope was very small when n=10. See Table 4~31. Similarly, by comparing the cell (ND,n60,RRSS) versus (ND,n60,RRSL) the bias for estimated rd was smaller for RRSL (.002) than for RRSS (.019) when n=60. 1e ,1 - t - ;-nd-m ects I7 t_'buti-1 ., ,- , 1a ‘1 4 By comparing the cell (ND,n60,RRSL) versus (TD,n60,RRSL) (see Table 4~31), it was observed that the To, parameter estimate had a smaller bias, standard deviation, and MSE when the random effects had a normal distribution as compared to a t~distribution, for n=60. The type of the random effects distribution had no clear effect when n=10. 123 Checking tne Accufacy of tne Estimated Standafg Erfor of 1“ Obteinen by the VABCE Estimation Table 4~32 shows the true value, mean, standard deviation, and bias of VARCL estimated standard error for 1“ across six cells, having 200 replications within each cell. Ins Effect nf n on Estimated Stengefd EffOf 9f 1“ The results in Table 4~32 showed that the VARCL estimates of the standard error for I“ were significantly biased, with a large percent of bias. However, the magnitude of bias, standard deviation, and MSE became smaller as the sample size within each group increased from 10 to 60, holding the other factors as fixed. Ine Effect of the Magnitude ef tne ens Vafiance on fne Estimafed Stangard Enron of I“ The results indicated that for n=10 and n=60, increasing the magnitude of RS variance to the intercept variance led to a smaller bias and MSE estimate of the standard error for I“. e ct e Rando ect Distribut'o n‘th st' t a da or 1, The results in Table 4~32 indicated that for both n=10 and n=60, having a normal distribution of the random effects led to a slightly smaller bias, percent of bias, standard deviation, and.MSE‘estimated.standard error for 1d than having a t~distribution of the random effects. 124 Table 4~32.~~The true value‘, Mean, S.E., MSE, and bias for estimated standard error for 1w by cell identification for the VARCL estimation procedure". Qell identifieation. (NQ,n10,BBSS) lflflyleyBBELl The True Value .008 .045 Mean of Estimate .162 .162 S.E. of Estimate .023 .024 MSE of Estimate .024 .014 Bias .154 .117 95% CI Bias .15,.16 .11,.12 Percent of Bias 1925% 260% Cell identification (ND,n§9,BB§S) (Np,n60,BB§L) The True Value .010 .045 Mean of Estimate .052 .064 S.E. of Estimate .007 .011 MSE of Estimate .002 .000 Bias .042 .019 95% CI Bias .042,.042 .017,.02 Percent of Bias 420% 42% Bell identification (IQ,le,BBSL) 12212221BB§L1 The True Value .041 .039 Mean of Estimate .162 .062 S.E. of Estimate .026 .012 MSE of Estimate .015 .001 Bias .121 .023 95% CI Bias .12,.13 .021,.025 Percent of Bias 295% 59% Note ' ~The true value was obtained form the standard deviation of the rd's (i.e., square root of the true parameter, I“) for each corresponding cell. ~200 replications were performed within each cell. ND ~normal distribution of the random effects. TD ~t~distribution of the random effects. n10 ~10 subjects within each group. n60 ~60 subjects within each group. RRSS-small magnitude of the random regression slope variance to the intercept variance. RRSL-large magnitude of the random regression slope variance to the intercept variance. 125 Summafy A summary statistics of the key results that were discussed in this chapter is presented in Table 4~33. Table 4~33.~~A summary of several statistics for different parameter by the SPSS and the VARCL estimation procedure. Parameter Bias Consistency MSE Type I Error ' Rate Estimatinn Metned SPSS VARCL SPSS VARCL SPSS VARCL SPSS VARCL yoo Yes Yes No No ND ND NA NA 701 Yes Yes No No ND ND H - 71o Yes Yes No No ND ND H ~ 711 Yes Yes No No ND ND H ~ S.E.(yol) Yes No No Yes -~ S ~ ~ S.E.(ylo) Yes No No Yes ND ND ~ ~ S.E.(yll) Yes No No Yes ND ND ~ ~ Too NA Yes NA Yes NA .109 NA - 111 NA Yes NA No NA .013 NA ~ 101 NA Yes NA Yes NA .017 NA ~ S.E.(Vroo)NA Yes NA Yes NA .003 NA ~ S.E.(V111)NA Yes NA Yes NA .056 NA ~ S.E.(rol) NA Yes NA Yes NA .009 NA ~ Note MSE ~An average Mean Square Error across six cells. ND ~No difference between the two estimation methods. NA ~Not applicable. H ~Higher than the other estimation method. 8 ~Smaller than the other estimation method. CHAPTER V CONCLUSION lntfednctinn This chapter presents the conclusions of the analyzes of the study. The chapter begins by first presenting the conclusions based on the real data analysis. This will be followed by the conclusions based on the simulated data analysis. The implications of the findings will then be addressed. This is followed by a discussion of the consequences of the real and simulated data analysis conclusions. The final section of this chapter will present some suggestions for future research. Con lnsiens Based en fne Reel Qafe Anelysis The conclusions based on running the SPSS, VARCL, and MULTILOGIT estimation methods on real data are as follows: (1) The regression coefficient estimates for the within- school variable for the SPSS and VARCL approaches were close, while that the MULTILOGIT estimate had a larger absolute value than both the SPSS and VARCL approaches. However, there appears to be no consistent pattern with regard to the regression coefficient estimates for the school-level variables. 126 127 (2) The estimated standard. error of ‘the regression coefficient for both the within- and between-school variables for the VARCL and MULTILOGIT approaches using the real data were close. However, the MULTILOGIT estimates were slightly larger than the VARCL estimates. (3) The results.of real data analysis also indicated that the magnitude of the VARCL and MULTILOGIT estimates of the standard error of the regression coefficient were much larger than the SPSS estimates. (4) The variance-covariance components of the random effects estimate of MULTILOGIT and VARCL using the real data were close. However, the MULTILOGIT estimates were generally larger in absolute values than the VARCL estimates. o c 5' ns B sed o the S' te 's The following conclusions were based on running the SPSS and VARCL programs estimation procedures on simulated data that were generated for the multilevel logistic regression model (a random effects model with binary outcomes): (1) Both the VARCL and SPSS estimates of 7's were found to be significantly biased. The percentages of biased ranged between 10% and 17% lower than the true values. The VARCL and SPSS estimates of 1m, 7m , 7b and 7" parameter were found to be approximately equal for different statistics (i.e., mean, standard deviation, MSE, and bias). For both estimation :methods, increasing the number of units within each group (n) resulted in slightly increasing the bias of the estimated 128 macro parameters. However, increasing n led to a slightly smaller MSE of the 7”, 7M, 7M, and 7" estimates for both the VARCL and SPSS estimation methods. This reduction in MSE is caused by the smaller magnitude of the standard deviation of the estimated macro parameters as a result of increasing the sample size. There was also no clear effect of the random effects distributions (i.e., ND versus TD) on all four macro parameters for both estimation procedures. Finally, the VARCL estimate of the macro, 7m and 7”, parameter estimates had a slightly smaller bias and MSE for RRSS (having a small magnitude of random regression slope variance in contrast to the intercept variance) as compared to RRSL (having a large magnitude of random regression slope variance in contrast to the intercept variance). The magnitude of the random regression slope variance appeared to have no clear effect on the VARCL estimate of 7m and 7“ parameters. In addition, the results also indicated that the SPSS estimate of 7m and 7", had a slightly smaller bias and MSE for RRSS as compared to RRSL. While there was no clear effect on the SPSS estimation of the macro parameters, 7” and yd. Therefore, under the random effects model for binary outcomes, the VARCL estimates of the macro parameter was significantly biased and inconsistent. A similar result was obtained for the SPSS estimates of the macro parameters. In fact, the results in Table 4~13 indicated that the SPSS of 7” (from simple random effects model for binary outcome) moved further away from the true value (i.e., the magnitude of bias 129 increases) as the true value for rm,increases. This is because the generated data under the random effects model is different from the SPSS model assumptions. (2) On the average, the estimated standard errors of 7’s for VARCL were larger than the SPSS estimate of the standard errors. And the SPSS estimates of the standard error of macro parameters were Clearly significantly biased, while the VARCL estimates of the standard error of macro parameter were unbiased. This is due to the larger estimates of the standard errors for 7’s of VARCwahen compared to SPSS. In addition, in for both estimation methods, the random effects distributions and the magnitude of the random regression slope variance had no clear effect on the estimated standard errors of the estimated macro parameters. Increasing the sample size resulted in slightly smaller standard deviation and MSE of the VARCL estimates of the standard error of the three macro parameters (i.e., 70,, 7,0 and 7"). However, with the SPSS program, increasing the sample size resulted in estimates of the standard error of the three macro parameters that were slightly larger in bias and MSE. (3) The probability of type I error rate under a true null hypothesis tests of the macro parameters 7's were much smaller for the VARCL than the SPSS program. However, both estimation methods gave unacceptable high type I error rates (i.e., p > .05). (4) The VARCL estimates of 1m, 1" and 1“ parameters were significantly biased and underestimated the true values. 130 However, the magnitude of the bias and MSE was reduced as the number of units within each group increased. The magnitude of the regression slope variance (i.e., RRSL vs. RRSS) had no Clear effect on I” and 1m. Except for 1", the percentage of bias were smaller for RRSL.when compared to RRSS. Finally, the results also indicated that the estimated variance—covariance components of the random effects parameter had a slightly smaller bias, MSE, and standard deviation when the random effects had a normal distribution than when it had a t- distribution, explicitly for large n. (5) The VARCL estimates of the standard error for VT”, V1“, and 1m were significantly biased. However, the magnitude of the bias, standard deviation, and MSE were reduced as the sample size within each group increased from 10 to 60 (i.e., consistent). Increasing the magnitude of random regression slope variance to the intercept variance led to a smaller percentage of bias, MSE, and standard deviation estimate of the standard error for VT“, and 10,. However, there was slightly smaller percentage of bias for RRSS when compared to RRSL for the estimated standard error of «Tm. Finally, a large n (i.e., n=60) for the normally distributed random effects resulted in a slightly smaller bias, standard deviation, and MSE of the estimated standard error for Vim, and I“ when this was compared to the t~distributed random effects. 131 implications of the Bindings The first part of this chapter addressed the statistical accuracy of the computer estimation programs on real and simulated data. However, this section will address the implications of the findings by identifying the limitations, the advantages, and the disadvantages of running these programs. The usefulness of some of the reported statistics for the investigators in making critical educational decisions will also be discussed. The SPSS program estimation for a random effects model for binary outcomes indicated several disadvantages: (1) The SPSS estimates of 7's were found to be significantly' biased. and. inconsistent. The estimates underestimated the true value. (2) the SPSS estimates of the standard error of macro parameters were significantly biased and inconsistent. Increasing the sample size resulted in SPSS estimates of the standard error of the three macro parameters having a larger bias and MSE. (3) The SPSS estimates gave a large probability of type I error rate under a true null testing the macro parameters, 7’s. Similarly, there were some disadvantages in the using the current VARCL program: ‘ (1) The VARCL estimates of 7’s were found to be significantly' biased. and inconsistent. The estimates underestimated the true value. 132 (2) The VARCL estimates of the standard error of macro parameter proved to be unbiased and consistent. This meant that increasing the sample size resulted in the VARCL estimates of the standard error of the three macro parameters having a smaller bias and MSE. (3) The VARCL estimates gave a small probability of type I error rate under a true null testing of the macro parameters, 7's, relative to the SPSS estimates. However, the VARCL type I error rate was not small enough to be acceptable (i.e., p > .05). (4) The VARCL estimates of the 1m, 1" and TM parameters were significantly biased, and underestimated the true values. However, the magnitude of the bias and.MSE were reduced as the number of units within each group increased (i.e., they were consistent). (5) The VARCL estimates of the standard error for V1”, V1", and I“ were significantly biased. However, the magnitude of bias, standard deviation, and MSE were reduced as the sample size within each group increased. In other words they were consistent. The simulation study demonstrated that using the standard logistic regression estimation procedure for multilevel data with binary outcomes could lead to misleading results and conclusions. This is because the standard logistic regression estimates were found to be significantly biased and inconsistent for both the 7’s and the standard error of 7's. 133 The following explanation is given for the bias that exists when the fixed.model is used.toldevelop an estimate for the intercept (i.e., the random intercept logistic regression model, see model A on page 86, and the SPSS and VARCL estimates in Table 4~13) in a random effects model. Consider the logistic function yr: f(a) = e“ =.__1L__ 1+ 9“ 1+ 8" The first derivative of the logistic function (refer to Appendix 5—1) is as follows f”(«) = 9.“ (1+ 2’“)2 The second derivative of the logistic function is given as f”(a) _._ 9"‘(9-2‘I '1) (1+ e")‘ Let the a random variable, a, be expressed as follows a=7+u where u ~ N(0,05 auui y is a constant (i.e., intercept). Expanding f(a) (i.e., Taylor expansion) about 7 up to quadratic terms, 134 5’: f1“) * f(Y)'+IV(Y)(a -'y)-+.£Z%li(a,_ Y)2 Hence Y = flat) 9 £0) + my) (u) + 312(1in note that E(u) = 0, E(u’) = 02, and E(y) 1 f(y) + £91.02 For example, let 1 = -1.80, 0’=%.85. Thus E(y) is given as E(y) = .1419 + (.0872 / 2) (.85) = .1790. The logit (.1790) = ~1.52, where as logit (.1419) = ~1.80. Therefore, if E(y) is estimated by an unbiased estimate, the logit of this estimate will be about ~1.52 whereas the intercept, 7, is ~1.80. In fact, the similarity of the VARCL and the SPSS estimates of 7's (see Table 4~13) makes it highly likely that a similar reasoning will explain the bias of the VARCL estimates of 7’s. The explanation for the significantly biased estimates of the standard error of the regression parameters for the standard logistic regression estimation method, in case of multilevel data, is attributed to ignoring the parameter variance of the single level model in its estimate of the standard error of the regression parameters. The multilevel approaches account for both the parameter variance and 135 sampling variance in its estimate for the standard error of the regression parameters. Therefore, caution should be exercised when studying multilevel data with binary outcomes using the standard single logistic regression estimation procedure (i.e., the SPSS program) instead of the multilevel logistic regression estimation procedure. This is because of the high probability of a type I error for the standard single logistic regression estimation method (see Tables 4~19 and 4~20). This error was due to the liberal t~statistic values, caused partly by the small standard error estimates for the regression coefficients, and partly by the significantly biased estimates of 7's when assuming the single-level logistic regression model by using the standard logistic regression estimation method. In addition, the VARCL type I error rate (under a true null hypothesis, Ho: 7 = 7,) was not small enough to be ignored (i.e., p >.05). This was because the VARCL estimates of 7's were found to be significantly biased, inconsistent, and underestimated the true values. The results of the real data showed that: (a) the estimated regression coefficient for the MULTILOGIT had a larger absolute value than the VARCL estimate, (b) the estimated standard errors of the regression coefficient for MULTILOGIT were slightly larger than the VARCL estimates, and (c) the MULTILOGIT estimates for variance-covariance components of the multilevel logistic regression model were generally larger absolute value than VARCL estimates. Based on 136 the knowledge that the current VARCL program underestimated both the (a) the macro parameters, and (b) variance-covariance components of the random effects (Longford, 1992), the MULTILOGIT program may be more efficient program than the VARCL. However, there were several reasons that made operating the VARCL program more attractive than the MULTILOGIT program. These are summarized as follows: (1) The MULTILOGIT program had a limit in the number of micro and macro variables that could be included in an analysis. No such limitation was indicated by the VARCL program. (2) The MULTILOGIT program also had a limit of 59 groups (or schools) that could be used in the analysis. Again no such limitation exists for the VARCL program. (3) The MULTILOGIT program proved to be inconvenient to operate. This was essentially because the coefficient input file required the researcher to provide the estimates of the classical within-group logistic regression coefficients for each school in the analysis. On the other hand, the VARCL program generated its own initial estimates. (4) The MULTILOGPT program model specification always assumed that all the micro regression (intercept and slope) were random coefficients. The VARCL program, however, had the option to assume fixed or random regression coefficients among schools. In fact, the ability of the VARCL program to test the variance-covariance components of the random effects is 137 critical for the investigator in deciding whether to assume fixed or random regression coefficients. This facility is not available for the MULTILOGIT program user. (5) The inability of the educational researcher to run the more complicated MLRM with the MULTILOGIT program (i.e., by including more covariates in the within-school model) was due to the small number of students within each school. This insensitivity to the small number of subjects within each group was not observed with the VARCL program. (6) It was found to be financially very expensive to run the MULTILOGIT program at the University of Michigan Mainframe Computer Center. The personal computer version of the MULTILOGIT program is presently unavailable. T e Co e ue ces t e Co c us'o s f the a a ' u d ata na ses The argument in 'the last section indicated several statistical disadvantages in using the SPSS program for the random effects model having binary outcomes. Similarly, the VARCL indicated some disadvantages in estimation the (a) the macro parameters, and (b) variance-covariance components of the random effects. Therefore, based on the parameter estimation, the MULTILOGIT program may be more efficient than the VARCL programs. However, There are several reasons that has been indicated in the last section made operating the MULTILOGIT program very restrictive. Thus, if one were able to account for the existing bias in the current the VARCL 138 program, it would be perhaps more advantageous to choose the VARCL program over MULTILOGIT program. An exception would be if the researcher can accept the disadvantages and limitations of the MULTILOGIT'program (i.e., cost, sample size within.each group, limitation in the number of covariates, limitation in the number of groups, inability to provide standard logistic regression estimates coefficient of each group, inability of the researcher to statistically decide whether to fixed or assumed the random regression coefficient among groups). Suggestiens for Eutufe Reeearen This study suggests that future research in developing an new program that accounts for the disadvantages in both the VARCL and MULTILOGIT programs. In addition, to overcoming the above disadvantages, the new program should efficient for small number of subjects 'within each. group. This would represent a more realistic educational research situation. In fact, concerns regard a small number of subjects within each group on parameter estimations for binary outcome were indicated by Longford (1992). The MULTILOGIT program required a large number of subjects within each group in order to run. Finally, the new program may also consider a more simplified model due to the nature of the outcome variable (i.e. , binary outcome). Like one having a random intercept and a fixed regression coefficient slope model (model I) rather than one having a random intercept and a random regression coefficient slope model (model 11). In fact, several 139 researchers (Chamberlain, 1980; Horn and Whittemore, 1979) have recommended this for normally distributed outcomes. This would make the estimation procedure of model I less complex than the model II. Raudenbush (1988) has also pointed out the advantage for this by stating: A random intercept model has two computational advantages: (a) the number of microcoefficients reduces to one per group; and, therefore, (b) the variance- covariance matrix of the random effects (T in our notation) becomes diagonal, which simplifies estimation formulas (p. 106). Similarly, the simplicity of the random intercept model was also indicated by Wong and Mason (1985). In addition to these advantages, Shigemasu (1976) indicated concerns regarding the cost of computation for using model II saying that "... the model (i.e., model I) is expected to reduce substantially the cost computation" (p. 158). APPENDICES 140 APPENDIX 3-1 AN EXAMPLE OF VARCL PROGRAM "BASIC INFORMATION FILE" SPECIFIED FOR THIS STUDY SIMULATED LRM FOR 60 SCHOOLS HAVING 10 STUDENTS IN EACH SCHOOL 2 600 60 3 1 90 10 4 student school d:\vac_02dt.001 (f8.5,1x,f8.5,1x,f8.5) d:\schvar.dat (9x,F8.5) dep 2 xij 1 INTXiZZ 1 zzj 1 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 141 APPENDIX 3-2 AN EXAMPLE OF MULTILOGIT PROGRAM "COMMAND FILE" SPECIFIED FOR THIS STUDY \ This is the first heading line. \ This is the second heading line. * This is the first line of comments. * This is the second line of comments. * This is the last line of comments. 2 0.01 0.001 10 0.01 0.001 0 9 1sx0t:stu59mts.DAT 6 lsx0t:sch59mts.DAT 2 sthzmode13.DAT -TEMP 59 9 2 l 6 schoolOl schoo102 schoo103 schoo104 schoolOS schoo106 schoolO7 schoo108 schoo109 schoollO schoolll schoo112 schoo113 schooll4 schoollS schooll6 schooll? schoo118 schoo119 schoo120 schoo121 scho0122 schoo123 schoo124 schoo125 schoolz6 schoo127 schoo128 schoo129 school30 schoo131 school32 school33 schoo134 schoo135 schoo136 schoo137 school38 school39 school40 school4l school42 school43 school44 school45 school46 school47 school48 school49 schoo150 schoolSl schoo152 schoo153 schoo154 schoolSS schoolS6 schoo157 schoo158 schoo159 repeat subject intercept SES 0 0 gammaoo gammaol 142 APPENDIX 3-3 THE DESCRIPTIVE STATISTICS FOR THE REAL DATA AT BOTH THE STUDENT- AND SCHOOL-LEVEL Descriptive statistics For 59 schools used in real data analysis. W221 Variable Mean Std Dev Minimum Maximum N URB_RRL .00 .44 ~.25 .75 59 SOUTH .00 .39 ~.19 .81 59 BANGKOK .07 .25 .00 1.00 59 MSES .00 .46 ~.90 1.81 59 URB_RRL URBAN/RURAL AREA Value Label Value Frequency Percent Valid Percent ~.25 44 74.6 74.6 74.6 .75 15 25.4 25.4 100.0 Total 59 100.0 100.0 CENTRAL Valid Cum Value Label Value Frequency Percent Percent Percent ~.24 45 76.3 76.3 76.3 .76 14 23.7 23.7 100.0 Total 59 100.0 100.0 NORTH Valid Cum Value Label Value Frequency Percent Percent Percent ~.17 49 83.1 83.1 83.1 .83 10 16.9 16.9 100.0 Total 59 100.0 100.0 SOUTH Valid Cum Value Label Value Frequency Percent Percent Percent .81 11 18.6 18.6 100.0 Total 59 100.0 100.0 Mean SES (MSES) Mean ~.001 Std err .060 Median ~.146 Mode ~.902 Std dev .460 Variance .212 Rurtosis 3.982 s E Kurt .613 Skewness 1.672 S E Skew .311 Range 2.711 Minimum ~.902 Maximum 1.808 Sum ~.060 Valid cases 59 Missing cases 0 143 §tudent level: 1244 student were involved in this analysis. Valid Variable Mean Std Dev Minimum Maximum N SCHOOLID 98078.12 54720.88 10101.00 180550.0 1244 URB_RRL .06 .46 ~.25 .75 1244 CENTRAL .00 .43 ~.24 .76 1244 NORTH .03 .40 ~.17 .83 1244 SOUTH -.03 .37 ~.19 .81 1244 MSES .06 .48 ~.90 1.81 1244 SEX .00 .50 ~.50 .50 1244 DIALECT .00 .50 ~.49 .51 1244 LUNCH .00 .37 ~.84 .16 1244 SES .00 .68 ~1.72 3.28 1244 SCPPEDl .00 1.00 ~1.09 .91 1244 BRAKFAST .00 .39 ~.81 .19 1244 REPl .15 .36 .00 1.00 1244 SEX_MSES .00 .38 ~.94 1.77 1244 SCP_MSES .00 .43 ~.86 1.59 1244 URB_SES .00 .35 ~.89 2.32 1244 URB_RRL URBAN/RURAL AREA Valid Value Label Value Frequency Percent Percent ~.25 854 68.6 68.6 .75 390 31.4 31.4 Total 1244 100.0 100.0 CENTRAL . Valid Value Label Value Frequency Percent Percent ~.24 946 76.0 76.0 .76 298 24.0 24.0 Total 1244 100.0 100.0 NORTH Valid Value Label Value Frequency Percent Percent ~.17 999 80.3 80.3 .83 245 19.7 19.7 Total 1244 100.0 100.0 SOUTH Valid Value Label Value Frequency Percent Percent ~.19 1046 84.1 84.1 .81 198 15.9 15.9 Total 1244 100.0 100.0 Mean SES (MSES) Mean .056 Std err .014 Median Mode .537 Std dev .481 Variance Kurtosis 2.995 S E Kurt .139 Skewness S E Skew .069 Range 2.711 Minimum Maximum 1.808 Sum 70.096 Valid cases 1244 Missing cases 0 Cum Percent 68.6 100.0 Cum Percent 76.0 100.0 Cum Percent 80.3 100.0 Cum Percent 84.1 100.0 -.116 .231 1.508 -.902 144 SEX Valid Value Label Value Frequency Percent Percent ~.50 621 49.9 49.9 .50 623 50.1 50.1 Total 1244 100.0 100.0 DIALECT Valid Value Label Value Frequency Percent Percent ~.49 636 51.1 51.1 .51 608 48.9 48.9 Total 1244 100.0 100.0 LUNCH DO STUDENT HAVE LUNCH DAILY Valid Value Label Value Frequency Percent Percent ~.84 200 16.1 16.1 .16 1044 83.9 83.9 Total 1244 100.0 100.0 SES Mean Std err .019 Median Mode Std dev .680 Variance Kurtosis S E Kurt .139 Skewness S E Skew Range 5.003 Minimum Maximum Sum ~4.410 Valid cases Missing cases .0 SCPPEDl Valid Value Label Value Frequency Percent Percent ~1.09 569 45.7 45.7 .91 675 54.3 54.3 Total 1244 100.0 100.0 BRAKFAST Valid Value Label Value Frequency Percent Percent ~.8l 234 18.8 18.8 .19 1010 81.2 81.2 Total 1244 100.0 100.0 REPl EVER REPETITION Valid Value Label Value Frequency Percent Percent NEVER .00 1052 84.6 84.6 EVER 1.00 192 15.4 15.4 Total 1244 100.0 100.0 Cum Percent 49.9 100.0 Cum Percent 51.1 100.0 Cum Percent 16.1 100.0 -.213 .463 1.993 -1.719 Cum Percent 45.7 100.0 Cum Percent 18.8 100.0 Cum Percent 84.6 100.0 SEX MSES Mezn Mode Kurtosis S E Skew Maximum Valid cases scr_nsss Mean Mode Kurtosis S E Skew Maximum Valid cases URB_SES Mean “Mode Kurtosis S E Skew Maximum Valid cases ~.001 .317 3.069 .069 1.588 1244 .000 -.043 12.876 .069 2.322 1244 145 Std err Std dev S E Kurt Range Sum Missing cases Std err Std dev S E Kurt Range Sum Missing cases Std err Std dev S E Kurt Range Sum Missing cases .011 .378 .139 2.711 -3.094 .012 .432 .139 2.452 -.788 0 .010 .351 .139 3.209 .168 Median Variance Skewness Minimum Median Variance Skewness Minimum Median Variance Skewness Minimum -.040 .143 2.690 -.942 -.057 .186 1.358 -.864 -.043 .123 2.643 -.887 146 APPENDIX 3-4 A COPY OF THE GAUSS PROGRAM USED TO GENERATE DATA FOR THE GROUP PREDICTOR @= THIS IS FOR PROGRAM USED TO CREATING SCHOOL LEVEL VARIABLE FOR 60 SCHOOL:j=60,i=10,N=600 07/11/91 0 new; output file=c:\gauss\datakam\schvar.dat reset; j=60;n=600; i=10; [*creating ZJ’S */ ZlJ=ones(j,1); ZZJ=rndn(j,1); /* ZZj~N(0,1) */ 23 = z1j~22j; format /rd 8,5; print 23 ; output off; end; 147 APPENDIX 3-5 A COPY OF THE GAUSS PROGRAM USED TO GENERATE DATA FOR THE WITHIN-GROUP PREDICTOR @= THIS IS PROGRAM USED TO CREATE 10 STUDENT WITHIN EACH SCHOOL FOR 60 SCHOOL J=60,I=10,N=600 07/11/91 9 new; output file=c:\gauss\datakam\STVAR10W.DAT reset; j=60;n=600; i=10; /* generating X's for 600 student in 60 school */ so=1; DO WHILE SD <= J; x1= rndn(I,1); /*X1~N(0,1) */ SD = SD +1; == MX1=meanc(x1);stx1=stdc(x1);vax1=vcx(x1);==@ FORMAT /RD 8,5 ; PRINT X1;; ENDO; end; 148 APPENDIX 3-6 A COPY OF THE GAUSS PROGRAM USED TO GENERATE THE DATA SET FOR THE CELL (ND,n10,RRSS) @= j=60, i=10, N=600 @ New ,20000; j=60;N=600; i=10; /*creating BlJ’S AND B2J’S FOR 60 SCHOOLS*/ G gm10= ~1.80, gm11= ~1.20, gm20= ~.5, gm21= .75 , z1j a vector 60 * 1 of one's z2j a vector 60 * 1 of normal distribution with mean of 0 and variance of 1 6 /* creating gamma */ gm10= ~1.80; gm11= ~1.20; gm20= -.50; gm21= .75; load X[N,1] = \gauss\datakam\stVAR10W.dat; load 2[J,2] = \gauss\datakam\schvar.dat; le SUBMAT (2,0,1); 22j ,SUBMAT (2,0,2); H = reshape (X,j,i); /* generating alj and a2j */ f1="c:\\gauss\\datakam\\studinf.cel"; output file =“f1 reset ; rr=1; do while rr <= 100 ; taj0= -rndn(j, 1); a1j=0. 9975*taj0; /*a1j~N(0,.995) */ taj1=rndn(j,1); a2j=0. 07071*taj1; /*a2j~N(0,.005) */ if rr ==1; f2="c: \\gauss\\datakam\\spc_ 01dt. 001" f3="c: \\gauss\\datakam\\vac_ 01dt. 001" elseif rr == 2; f2= "c: \\gauss\\datakam\\spc_ 01dt. 002" f3="c: \\gauss\\datakam\\vac_ 01dt. 002" elseif rr ==3; f2="c: \\gauss\\datakam\\spc_ 01dt. 003" f3="c: \\gauss\\datakam\\vac_ 01dt. 003" elseif rr ==4; f2="c:\\gauss\\datakam\\spc_01dt.004" f3="c:\\gauss\\datakam\\vac_01dt.004" elseif rr ==5; ‘0 ‘0 ‘0 ‘0 ‘0 ‘0 ‘0 ‘0 149 f2="c: \\gauss\\datakam\\spc_ 01dt. 095" f3="c: \\gauss\\datakam\\vac_ 01dt. 095" elseif rr ==96; f2="c: \\gauss\\datakam\\spc_ 01dt. 096" f3="c: \\gauss\\datakam\\vac_ 01dt. 096" elseif rr ==97; f2="c: \\gauss\\datakam\\spc_ 01dt. 097" f3="c: \\gauss\\datakam\\vac_ 01dt. 097" elseif rr= =98; f2="c: \\gauss\\datakam\\spc_ 01dt. O98" f3="c: \\gauss\\datakam\\vac: 01dt. 098" elseif rr= =99; f2="c: \\gauss\\datakam\\spc_ 01dt. 099" f3="c: \\gauss\\datakam\\vac_ 01dt. 099" elseif rr= =100; f2="c:\\gauss\\datakam\\spc_01dt.100" f3="c:\\gauss\\datakam\\vac_01dt.100" endif; /* B1j equations , sz equations */ Blj le * gm10 + zzj * gm11 + a1j sz le * gm20 + 22j * gm21 + a2j ‘0 ‘0 L = B1J~BZJ; OUTPUT OFF; output file =“f1 OUTPUT ON ; ‘0 ‘0 ‘0 ‘0 ‘0 Q. ‘0 ‘0 ‘0 ‘0 ‘9 Q. Mb1j=meanc(b1j);stb1j=stdc(b1j);VAB1J=VCX(B1J); Mb2j=meanc(b2j);stb2j=stdc(b2j);VAB2J=VCX(B2J); Ma1j=meanc(a1j), sta1j=stdc(a1j), VAA1J=VCX(A1J); Ma2j=meanc(a2j), sta2j=stdc(a2j), VAA2J= VCX(A2J); A12=A1J~A2J; COVA12=VCX(A12); COVB12=VCX(L); Format /rd 8,5; PRINT; print "********** The run # PRINT RR; PRINT; print "Mean stand division variance print Mb1j~stb1j~VAB1J; print; print "Mean stand division variance print Mb2j~stb2j~VAB2J; print; print "Mean stand division variance print Ma1j~sta1J~VAA1J; print; print "Mean stand division variance **********"; of BIj"; of B2j"; of alj"; of a2j"; 150 print Ma2j~sta2j~VAA2J; PRINT; print "variance covariance matrix of alj and a2j"; PRINT COVA12; print; print "variance covariance matrix of b1j and b2j"; PRINT COVBlZ; print ; print " ------------------------ n; print ; print " -------------------------- u I. output off; output file =‘f2 reset; output file =“f3 reset; K = ones(i,1); /* generating dependent variable */ SD=1; DO WHILE SD <= j ; H1 = submat (H,SD,0); /*X1~N(0,1) */ X1 = Hl’; Y = K * B1j[SD,1] + X1 * B2J[SD,1] ; EYl = exp(Y); EY2 = (EYl + 1 ); EY = EY1./EY2; u=rndu(i,1); dep = ( u .<= ey ); schvar = K * 22j[SD,1]; INTX122=X1 .* SCHVAR ; SD = SD + 1; format /rd 8,5; print dep~X1~INTX122;; output off; output file=‘f2; output on; format /rd 8,5; print dep~x1~INTX122~schvar;; output off; output file=‘f3; output on ; ENDO; /* end the loop creating data for each school */ rr = rr +1 ; endo; /* the end of 100 replication */ end; 151 APPENDIX 4-1 HISTOGRAM FREQUENCY FOR ESTIMATED 7m BY THE SPSS ESTIMATION Count 1 0 2 4 16 29 54 136 203 236 225 171 80 33 Midpoint -2.250 -2.175 -2.100 -2.025 -1.950 -1.875 -1.800 -1.725 -1.650 -1.575 -1.500 -1.425 -1.350 -1.275 -1.200 -1.125 -1.050 METHOD. IOOOO+OOOOI0.00+...OIOOOO+OOOOIOCOO+OCOOI 0 80 160 240 320 HISTOGRAM FREQUENCY FOR ESTIMATED 7” BY THE VARCL ESTIMATION Count 0 1 1 6 14 32 53 137 212 243 220 161 78 31 Midpoint -2.250 -2.175 -2.100 -2.025 -1.950 -1.875 -1.800 -1.725 -1.650 -1.575 -1.500 -1.425 -1.350 -1.275 -1.200 -1.125 -1.050 METHOD. IOOOO+OOOOIOOOO+OOOOIOOOO+OOOOIOOOO+OOOOI 0 80 160 240 320 152 APPENDIX 4-2 HISTOGRAM FREQUENCY FOR ESTIMATED 7m BY THE SPSS ESTIMATION Count 2 3 3 11 22 54 121 165 195 217 192 126 60 21 Midpoint -1.700 -1.625 -1.550 -1.475 -1.400 -1.325 -1.250 ~1.175 -1.100 -1.025 -.950 -.875 -.800 -.725 -.650 -.575 -.500 METHOD I...O+OCOIIOOOO+OOOOIOOOO+O...I...O+.OOOI 80 160 240 320 HISTOGRAM FREQUENCY FOR ESTIMATED 7m BY THE VARCL ESTIMATION Count Midpoint -1.9 -1.8 -1.7 -1.6 -1.5 -1.4 -1.3 -1.2 -1.1 -1.0 -.9 -.8 -.7 -.6 -.5 -.4 -.3 METHOD IOOOO+OOOOIOOOO+IOOOOIOOO0+...OIOO00+...OI 0 80 160 240 320 153 APPENDIX 4-3 HISTOGRAM FREQUENCY FOR ESTIMATED 7m BY THE SPSS ESTIMATION METHOD Count 0 0 1 7 26 107 370 434 201 40 7.: OHOOONH Midpoint -1.1 -1.0 -.9 -.8 -.7 -.6 -.5 -.4 -.3 -.2 -.1 .0 I....+....I....+....I....+. .I....+ 400 ...I....+... O 100 200 300 HISTOGRAM FREQUENCY FOR ESTIMATED 7” BY THE VARCL ESTIMATION METHOD Count 1 8 6 24 38 86 159 239 258 172 111 58 18 16 2 3 1 Midpoint -.81 -.76 ~.71 -.66 -.61 -.56 -.51 -.46 ~.41 -.36 ~.31 -.26 -.21 -.16 -.11 -.OG -.01 :[O'OI+...CI.OD0+0...I....+..COIOCOC+OOCOI 0 80 160 240 154 APPENDIX 4-4 HISTOGRAM FREQUENCY FOR ESTIMATED 7” BY THE SPSS ESTIMATION METHOD Count 1 1 4 20 37 94 177 282 281 151 82 38 ... NUNQN Midpoint .100 .175 .250 : .325 .3. .400 _. ~475 _- ~55° _- -525 —3_ ~700 —=— -775 — - ~35° _ ° .925 _. 1.000 .: 1.075 : 1.150 1.225 1.300 I.0.0+.O..I....+....I....+O..OIOO..+....I...O+0.0.I 0 80 160 240 320 400 HISTOGRAM FREQUENCY FOR ESTIMATED 7” BY THE VARCL ESTIMATION METHOD Count 0 2 16 30 50 123 269 307 215 94 45 27 HOkaD Midpoint .150 .225 .300 .375 .450 .525 .600 .675 .750 .825 .900 .975 1.050 1.125 1.200 1.275 1.350 IOOOO+OOOOIOOOO+OOOOIOOOO+OOOOIOOO0+000010000+OOOOI 0 80 160 240 320 400 155 APPENDIX 5-1 THE FIRST AND SECOND DERIVATIVE OF THE LOGISTIC FUNCTION The logistic function I f(a) = e = ___1__ 1+ e“ 1+ e“ The first derivative of the logistic function f’(a) = (_1)(_e-a) _ e.“I (1+ 9")2 (1+ 453'“)2 The second derivative of the logistic function fun) = (1+ e")2(-e"') - e" 2(1+ e“) (-e"‘) (1+ e")‘ (1 + 29'“ + 9'“) (-e") + e‘“ 2(1 + 9'“) (1+ e'“)‘ ’ _e-a _ze-Za _e-3a + ze-Za +ze-3a (1+ e")‘ _e-a + e-3a (1+ e")‘ -c —2¢_ = e (e 1) >0 if «<0 (1+ e“)‘ BIBLIOGRAPHY BIBLIOGRAPHY Aitkin, M., Anderson, D., and Hinde, J. (1981). Statistical modeling of data on teaching styles. gourgei e: the Royal statisticei Societyl Setiee A. i44 (4), 419-461. Aitkin, M., and Longford, N. (1986). Statistical modelling issues in school effectiveness Studies (with discussion). on al e o a at's 'c S ciet Seties A, 142 (1), 1-43. Alba, R. D. (1987). Interpreting Parameters of Log-Linear Models. Seeieio ogieei netnegs & Reeeateh, i6 (1), 45- 77. Alker, H. R. (1969). A Topology of Ecological Fallacies, in M. Dogan and S. Rokkan (Eds. ), Seeiei Ecoiegy. Cambridge, MA: MIT Press. Anderson, J. A. (1980). Logistic Discrimination, in P. R. Krishnaiah and L. Kanal (Eds.), flandboot o; Statietiee, Vo . 2: C assif'cat' a te o 't' edu ti ef Dimension, New York: North-Holland. Anderson, D. A., and Aitkin, M. (1985). variance Component Models with Binary Response: Interviewer Variability. Jogtnei e: the Regal Stetisticai Seeiety. Seties B, 57(2), 203-210. Bock, R. D. (1983). The Discrete Bayesian. In H. Wainer and Messick (eds.), P inci les ode s cholo ica easureme t: e tsc ' t F ed 'c . . Hillsdale, N.J.: Lawrence Erlebaum Associates. Box, G. E. P., and Tiao, G. C. (1973). a es'a e e 'n Stetistical Anelysis. Reading, MA: Addison-Wesley. Braun H. (1989). Empirical Bayes (Methods: A Tool for Exploratory Analysis. t' ve s’s Educatienai pete, edited by D. Bock. New York: Academic Press. Breslow, N. E., and.Day, N. E. (1980). Statistical Methods in Cancer Research: Vol 1 MW Studies. Lyon, France: International Agency on Cancer. Bryk, A. S., Raudenbush, S. W., Seltzer, M., and Congdon (1988) . Ag Inttoduction te £21.: Cementet Eregtem and Uset'e Ggide. Vetsion 1.0, University of Chicago. 156 157 Bull, S. B., and Donner, A. (1987). The Efficiency of Multinomial Logistic Regression Compared with Multiple Group Discriminant Analysis. Johtnal e; the American Stetisticai Association, 82, 1118-1122. Burstein, L. (1980). The Analysis of Multilevel Data in Educational Data in Educational Research and Evaluation. Beyiew Qt Beseeteh in Educatieh, 8, 158-233. Chamberlain, G. (1980). Analysis of Covariance with Qualitative Data. Review of Eeehomie Studies, 57, 225-238. Clogg, C. C., Rubin, D. B., Schenker, N., Schultz, B, and Weidman, L. (1990). Multiple Imputation of Industry and Occupation Codes in Census Public-Use Samples Using Bayesian Logistic Regression. (In Press). COX. 0. R- (1970)- W London: Chapman and Hall. Cox, D. R. & Snell, E. J. (1989). The Ahelysis Q: ninety Qata. Second edition. London: Chapman and Hall. Dempster, A.P.; Laird, N.M.; and Rubin, D.B. (1977). Maximum Likelihood From Incomplete Data Via the EM Algorithm (with discussion. Journal t e Ro a Statistieel Soeiety, Series B. 32, 1-8. Dempster, A. P., D. Rubin, & R. Tsutakauwa (1981). Estimation in Covariant Component Models. W Stetisticai hssoeietienI 76: 341-353. Draper, N. and Smith, H. (1966). e ' Ana ’ . New York: Wiley. Dunteman, G. H. (1984). Introduction to Linear Models. Beverly Hills, California: Sage Publications. Efron, B. (1975) . The Efficiency of Logistic Regression Compared to Normal Discriminant Analysis. Jouzhel e: the Amerieah Statistieei AS§Q§i§tiQD. 70, 892-898. Geisser, S. (1984). 0n.Prior Distributions for Binary Trials. W148. 244-247- Glass, G. V., and Smith, M. L. (1979). Meta-analysis of the Research on Class Size and Achievement. Eli—catiehei Eyelgetion ahg Eeiicy Aheiyeie, 1, 2-16. Goldstein, H. I. (1986). Multilevel Mixed Linear Model analysis Using Iterative Generalized Least Squares. We. 11(1) . 43-56- 158 Goldstein, H. I. (1987) . Muitilegei h edeie in Educational end Social Research. London: Oxford University Press. Goldstein, H. I. (1990) . Nonlinear Multilevel models, with an Application to Discrete Response data. (Submitted for publication). Haberman, S. J. (1974). The Aheiysis ef Eteguehcy pete. Chicago: University of Chicago Press. Haberman, S. J. (1977) . Log-Linear Models and Frequency Tables With Small Expected Counts. Annals of Statistics, e, 1148- 1169. Haney, W. (1980) . Units and Levels of Analysis in Large Scale Evaluation. (New Directions :9; Methodeiegy Q: Soeie]. end Behavioral SciencesI 6, 1-15. Hannan, M.T. (1971) . Aggregatieh ahd Qisaggtegatieh in Socieiogy. Lexington, MA: D.C. Heath. Hanushek, E. A., and Jackson, J. E. (1977) . Statisticei Metheds to; Sociel Seiehtiste. New York: Academic Press. Hosmer, D. W., and Lemeshow S. (1989). W Begtession. New York: Wiley & Sons. Kreft, I. G. (1987). Methods and Models for the Measurement of School Effects. Dissertation, University of Amsterdam. Kreft, I. G., and Kim, K. (1990a). GENMOD, HLM, ML2 and VARCL, Four Statistical Packages for Hierarchical Linear Regression. in Theory and Mogel in Muitilevei Beseezch: Cohvetgence er Divergence ? edited by Eeden, P. , Hox, J. , and Hauer, J. Amsterdam:SISWO. Kreft, I. G., De Leeuw, J. , and Kim, K. S. (1990b). Comparing Four Different Packages for Hierarchical Linear Regression GENMOD,HLM, ML2 and VARCL", CSE Report # 310, Center for the Study of Evaluation, UCLA. Korn, E. L., and Whittemore, A. S. (1979). Methods for Analyzing Panel Studies of Acute Health Effects of Air Pollution. Bigmettigg, 35, 795‘804. Laird, N. M. (1978). Empirical Bayes Methods for Two-Day Contingency Tables. M, 581-590. Laird, N. M., and Louis, T. A. (1982). Approximate Posterior Distributions for Incomplete Data Problems. Jourhai e; the Royel Statisticel Societyl Series Bll 44, 190-200. 159 Langbein, L. I. (1977) . Schools or Students: Aggregation Problems in the Study of Student Achievement. W Studies Review Ahnuel, , 270-298. Leonard, T. (1972a). Bayesian Methods for Binomial Data. Riomettika, 59(3), 581. Leonard, T. (1972b). Bayesian Methods for Discrete Data. Act Iechnicai Ruiletih, Re.i0, 1-20. Leonard, T. (1975). A Bayesian Approach to the Linear Model with Unequal Variances. Iechhemetzice, i7(1), 95-102. Lindley, D. V., and Smith, A.F.M. (1972). Bayes Estimates for the Linear Model. our o t e Ro S ' c Seciety, Series R, 34, 1-41. Longford N. T. (1992). Private communication. Longford N. T. (1988). A Quasilikelihood Adaptation for Variance Component Analysis. Presented at the Annual Meeting of ASA, New Orleans, LA, August 1988. Longford N. T. (1990). Multilevel Modelling into the 1990s. Muitiievei Modelling Newsletter, 2(1), 2. Mason, W.M.; Wong, G.Y.; and Entwisle, B. (1984). Contextual Analysis Through the Multi-level Linear Model. Socioiogieei Rethedoiegy. San Francisco, CA: Jossey Bass, 72-103. McCullagh, P. (1983). Quasi—likelihood functions. BBL Statist. 1 , 59-67. McCullagh, P. , and Nelder J. A. (1983). Geneteiized Liheet Redei_. London: Chapman and Hall. Nelder, J. A., and Pregibon, D. (1987). An Extended Quasilikelihood Function. BiomettikaI 74, 221-232. Prosser R., Rasbash.J., and Goldstein H. (1991). ‘ML3 Software for Three-level Analysis Users’ Guide for V.2’, Institute of Education, University of London. Randenbush, S.W. (1988). Educational Applications of Hierarchical Linear Models: A Review", gourhel e: Rdueatienai statistics, 13 (2), 85-116. Randenbush, S. W., and C. Bhumirat (1989). Results of a National Survey of Primary Education in Thailand, Part I: School Size, Sector, Facilities, Equipment, Textbook and Teaching Materials as Predictors of Pupil Achievement. 160 Randenbush, S. W., and Bryk, A. S. (1986). A Hierarchical Model for Studying School Effects. Soc' 0 o EducationI 52, 1-17. Randenbush, S. W., & A. S. Bryk (1988). Methodological Advances in Analyzing the Effects of Schools and Classrooms on Student Learning. Reyieg_ef_Reeeezeh_in Educations, i5, 423- 476. Robinson, W.S. (1950). Ecological Correlations and ‘the Behavior of Individual. e 'ca S 'o 'c Rev'ew 1;, 351-357. Rubin, D. B., and Schenker, N. (1987). Logit-Based Interval Estimation for Binomial Data Using the Jeffreys Prior. pp-131-144 in C c. C1099. ed-. Sgsiglggigal_nethgdglegx 1987, Washington, D. 0.: American Sociological Association. Scheffe, H. (1959). The Anaiyeis ot Vatience. New York: Wiley. Shigemasu, K. (1976). Development and ‘Validation of a Simplified m-Group Regression Model. denrnei of Educational etatistiee, 1(2), 157-180. Stiratelli, R., Laird, N., and Ware, J. H. (1984). Random Effects Models for Serial Observations With Binary Response. RionetticsI 40, 961-971. Walsh, J. D. (1947). Concerning the Effect of the Intraclass Correlation on Certain Significance Test. We: Mathematieai Statisties, 18, 88-96. Wedderburn, R. W. M. (1974). Quasi-likelihood functions, generalized linear models and the Gauss-newton method. fiigmettika, fil, 439-47. Weisberg S. (1985). ° d ' e e r ' . New York: Wiley & Sons. Wong, G.Y., and Mason, W.M. (1985). The Hierarchical Logistic Regression Model for Multilevel Analysis. Jeurnai e: Americen Stetieticei Assecietien, 80, 513-524.