l ‘11 M NH" i‘} I‘XITI‘HENI‘ ‘1 V “\‘L m w W IAN; I##) .mNN A. LIBRARY Michigan State University ‘I This is to certify that the dissertation entitled THREE ESSAYS ON NONLINEAR MODELS FOR FRACTIONAL RESPONSE VARIABLES WITH TIME- VARYING INDIVIDUAL HETEROGENEITY presented by YOUNG GUI KIM has been accepted towards fulfillment of the requirements for the .PhD. degree in Economics V Major Professor’s Signature \J u Iy Q‘ILI :2 O O? I I Date MSU is an Affirmative Action/Equal Opportunity Employer PLACE IN RETURN BOX to remove this checkout from your record. To AVOID FINES return on or before date due. MAY BE RECALLED with earlier due date if requested. DATE DUE DATE DUE DATE DUE 5/08 KzlProleocaPresIClRCIDateDue.indd THREE ESSAYS ON NONLINEAR MODELS FOR FRACTIONAL RESPONSE VARIABLES WITH TIME-VARYING INDIVIDUAL HETEROGENEITY By Young gui Kim A DISSERTATION Submitted to Michigan State University in partial fulfillment of the requirements for the degree of DOCTOR OF PHILOSOPHY Economics 2009 ABSTRACT THREE ESSAYS ON NONLINEAR MODELS FOR FRACTIONAL RESPONSE VARIABLES WITH TIME-VARYING INDIVIDUAL HETEROGENEIT Y By Young gui Kim This dissertation consists of three chapters on nonlinear panel models for fractional response variables. The first chapter extends the framework of Papke and Wooldridge (2008) by allowing unobserved individual heterogeneity to vary over time. An interaction term between a time effect and an individual effect of Kiefer (1980) and Lee (1991), known as an interactive effect, is added to the model. This allows each cross-sectional individual to respond differently to time effects common to all individuals. Based on Papke and Wooldridge (2008), I discuss the pooled Quasi maximum likelihood estimator (QMLE) and the multivariate weighted nonlinear least squares (MWNLS) estimator to estimate the model in strictly exogenous case. The two-step procedure of Rivers and Vuong (1988) and the single-step estimator of Wooldridge (2007) are discussed in the endogenous case. The methods are applied to analyze the effect of school spending on math test pass rates of 4th graders in Michigan. According to the Monte-Carlo simulations, the fractional time-varying model gives the least root mean squared errors among all three models in a reasonable range of correlation between the regressors and individual heterogeneity. The second chapter continues studying fractional response models and considers binary endogenous explanatory variables (EEVs). Because EEVs are not continuous, I modify the bivariate probit model to derive a conditional mean fimction for a fractional response variable. The pooled quasi-limited information maximum likelihood estimator (QLIMLE) is proposed based on this mean function. The conditional mean function for a fractional response variable and the reduced form for a binary endogenous variable are estimated together to identify all parameters of interest [Wooldridge (2007)]. Also the average treatment effects (ATEs) of a binary EEV are discussed. The estimation method is applied to study the effect of fertility on women's fractions of working hours. The simulations show the root mean squared errors of the ATEs of the fractional bivariate model are less than both the linear and the fractional probit model. In the last chapter, I discuss the hurdle model for a fractional dependent variable. Binary endogenous explanatory variables and time-varying individual heterogeneity are considered as before. The hurdle model allows us to separate the determination of comer solutions (y=0 or y>0) from the decision of the amount of dependent variables conditional on y>0. This hurdle model allows us to use different sets of EEVs for the comer solution equation and the equation for the amount and to obtain the ATEs conditional on y>0. I study the fertility effect of the second chapter again by using the hurdle model. ACKNOWLEDGMENTS I cannot express enough thanks to my advisor, Professor Jeffrey M. Wooldridge for his enthusiastic advice and support. Without his advice and discussion, I could not have written this dissertation. He also provided me the data for the first chapter. I am very gratefill to Professor Timothy Vogelsang. His generous support made me possible to successfully finish the doctoral program. I would like to thank my committee members, Professor Leslie E. Papke and Professor Yuehua Cui for their valuable comments and constructive discussions. I would like to express graditute to my family for their understanding and encouragement. iv LIST OF TABLES 1 1.1 1.2 1.3 1.4 1.5 1.6 1.7 1.8 2.1 2.2 2.3 2.4 2.5 2.6 3.1 3.2 3.3 3.4 3.5 3.6 TABLE OF CONTENTS FRACTIONAL RESPONSE MODELS WITH TIME-VARYING vi INDIVIDUAL HETEROGENEITY 1 Introduction 1 Basic Model 3 Estimation methods with strictly exogenous variables 7 Estimation methods with endogenous explanatory variables 12 Test statistics for time-varying individual effects 15 Empirical application: math test pass rates 17 Monte-Carlo simulations 24 Conclusion 27 THE FRACTIONAL MODELS WITH BINARY EN DOGEN OUS EXPLANATORY VARIABLES AND TIME-VARYING INDIVIDUAL HETEROGENEITY 30 Introduction 30 The Model with binary endogenous explanatory variables 33 Estimation methods . 36 Application: fertility effect on working hours of women 40 Monte-Carlo simulations 45 Conclusion 48 THE HURDLE MODEL FOR FRACTIONAL RESPONSE VARIABLES WITH BINARY ENDOGENOUS EXPLANA- TORY VARIABLES AND TIME—VARYING INDIVIDUAL HETEROGENEITY 50 Introduction 50 The basic fractional hurdle model 51 Estimation methods under strict exogeneity 54 Estimation methods with binary endogenous explanatory variables ................. 56 Application: fertility effect on working hours of women 61 Conclusion 65 Appendix: A Tables for Chapter 1 66 Appendix: B Tables for Chapter 2 81 Appendix: C Tables for Chapter 3 BIBLIOGRAPHY vi 88 93 Table A.1 Table A2 Table A.3 Table A.4 Table A.5 Table A6 Table A7 Table A8 Table A.9 Table A.10 Table All Table A. 12 Table A.13 Table A.14 Table B. 1 Table 32 Table B.3 LIST OF TABLES Descriptive Statistics ......................................... 67 The Linear and the Fractional Model: Exogenous Spending ........ 68 The Fractional Time-varying Model: Exogenous Spending ......... 69 The Linear and the Fractional Model: Endogenous Spending ....... 70 The Fractional Time—varying Model: Endogenoussspending, Two-step Estimator .......................................... 71 The Fractional Time-varying Model: Endogenous Spending, Limited Information MLE ..................................... 72 The Scale Factors at Different Spending Levels (in 1995 and 2001) ...................................................... 73 Mean Squared Errors and Biases of the APEs ( Increasing 7), Pro = 0.1 — 0.5) ............................................. 74 Mean Squared Errors and Biases of the APEs ( Increasing 77, Pm = 0.6 — 0.9) ............................................. 75 Mean Squared Errors and Biases of the APEs ( Decreasing 77, Pm = 0.1 — 0.5 ) ............................................. 76 Mean Squared Errors and Biases of the APEs ( Decreasing 71, Pm = 0.6 — 0.9) ............................................. 77 Mean Squared Errors and Biases of the APES (pm 2 0.2) .......... 78 Mean Squared Errors and Biases of the APEs (pxc = 0.3) .......... 79 Mean Squared Errors and Biases of the APEs (ch = 0.4) .......... 80 Descriptive Statistics ......................................... 82 The Linear and the Fractional Model: Exogenous Fertility ......... 83 The Hactional Time-varying Model: Exogenous Fertility .......... 84 vii Table B.4 Table 35 Table B.6 Table C. 1 Table C .2 Table C.3 Table C4 The Linear and Fractional Model: Endogenous Fertility ........... 85 The Fractional Time-varying Model: Endogenous Fertility ......... 86 Mean Squared Errors and Biases of the ATEs .................... 87 The Fractional Hurdle Model: Exogenous Fertility ................ 89 The Fractional Hurdle Time-varying Model: Exogenous Fertility .................................................... 90 The Fractional Hurdle Model: Endogenous Fertility ............... 91 The Fractional Hurdle Time-varying Model: Endogenous Fertility .................................................... 92 viii Chapter 1 F RACTIONAL RESPONSE MODELS WITH TIME-VARYING INDIVIDUAL HETEROGENEITY 1.1 Introduction Some economic variables are measured as fractions, percentages, or proportions. Examples include market shares, 401(k) participation rates, test pass rates, expenditure shares, and cost shares. Many models and estimation methods have been proposed for fractional dependent variables. Fractional variables always takes values between 0 and 1 including two extreme values. A simple linear model is easy to be estimated and interpreted, but the fitted values could be greater than 1 or negative even though the dependent variable is defined in the unit interval. One might use the log-odd transformation to avoid this problem, but we may not recover the original dependent variable without further assumptions. Papke and Wooldridge (1996, 2008) proposed a new approach to modeling fi'actional response variables based on a correctly specified conditional mean function. I focus on their two estimation methods. The first one is the pooled quasi-maximum likelihood estimator (QMLE) based on the Bernoulli distribution. With large enough samples, models can consistently estimated without distributional assumption by their estimation methods [For detailed discussion, see Papke and Wooldridge (1996), Gourieroux, Monfort, and Trognon (1984)]. The second method is the weighted non-linear least squares (WNLS) estimator for a non-linear mean function. A multivariate WNLS (MWNLS) estimator is proposed to address serial correlations in the cross sectional dimensions of their panel data. In this chapter, I combine this framework with the time-varying individual effects of Kiefer (1980) and Lee (1991). In most panel analyses, unobserved individual heterogeneity has been assumed as a time-invariant factor. Kiefer (1980) and Lee (1991) introduced time-varying individual heterogeneity in a linear model by adding interaction terms between individual effects and time effects. I use this interaction term (sometimes called an interactive effect) to relax the constant individual heterogeneity assumption in our non-linear modell. This allows each individual to respond differently to the common time effects. One example where this can be useful is in a model of income where the productivity of an individual (unobserved ability) varies over the business cycle [Ahn, Lee and Schmidt (2007)]. In our application, the effect of school spendings on math test pass rates in Michigan is studied by using school district level data. The model with time-varying individual heterogeneity allows each school district to respond differently to the difficulty levels of the math tests. The remainder of the chapter is organized as follows: In section 2, I introduce the basic model. Section 3 and section 4 contain estimation methods under the assumption of strictly exogenous explanatory variables and endogenous explanatory variables, respectively. I provide three test statistics for the null hypothesis of no time-varying individual effect in section 5. I apply the estimation methods to the test pass rates for fourth graders in Michigan and conduct Monte-Carlo simulations in sections 6 and section 7. The last section concludes. 1.2 Basic Model Let us assume N randomly drawn cross-sectional units from the population and T While Lee (1991), and Ahn, Lee, Schmidt (2007) considered mulitple time-varying factors in a linear model, one time-varying factor in a nonlinear model is considered because of an identification problem. 2 observations for each cross-sectional observation. The number of cross-sectional units is assumed to be substantially larger than the number of time series observations (large N fixed T asymptotics). Let’s assume a correctly specified conditional mean function: E (yitixitvci) = 9 (33215 + 772%), (H) where i indexes a cross-sectional unit and t indexes time (2' = 1,2, ..., N, t = 1, 2, ..., T). yit denotes a fractional dependent variable for individual 2' and time t that is continuous between 0 and 1 (also Mr: is allowed to take 0 or 1 with positive probability), and wit is a 1 x K vector of explanatory variables. 0,- indicates unobserved individual heterogeneity (an individual effect) and 7it is its corresponding coefficient that is allowed to vary over time. 722: can be regarded as a time effect in that this coefficient is common to all cross-sectional units at time t. Therefore, 77tCz' is sometimes called an interactive effect [Bai (2005)]. This term is assumed to be linear additive in the model. To ensure that the fitted values take values between 0 and l, cumulative distribution functions (cdf) can be used for the mean function of yit- In particular, the standard normal cdf, (-), is used in this chapter because this gives a computationally simple estimator when the model contains unobserved heterogeneity and endogenous regressors. The main contribution of this paper is to allow the effect of unobserved individual heterogeneity to vary over time in the linear index. One might attempt to consider the most flexible individual efiect which varies freely over time (Cit)- However, there are two problems: First, it is impossible to distinguish a time-varying individual effect (Cit) from an idiosyncratic error (Hit) without further assumptions. Second, it seems unreasonable 3 to assume that individual heterogeneity, such as innate ability, changes over time. An alternative is an interaction term between a time effect (in) and a constant individual effect (Ci). We can interpret this interactive effects in two ways: the effect of individual heterogeneity is varying over time or time dummies have individual specific intercepts. This interactive effect model includes the constant individual effect model as a special case and still retains a manageable structure. I set 771 = 1 as a normalization. Because this is a nonlinear model, we are interested in the partial effects not the coeflicients. Dropping subscript i, we can obtain the effect of the kth regressor, as“, on yt from 6E(yt|:ct,c)/6:ct,k = fikqbcrtfi + UtC) if $t,k is continuous, or (:r:§1)fl + UtC) — @(rémfi + me) where mil) and 10:0) denote two different values of mt, k if 931:, k is discrete. As we can see, the partial efi‘ect depends on set and c. It is a good idea to evaluate this effect at interesting values of x’s or average the partial effects across individuals. However, we may not observe c. Therefore, a popular method is to calculate the average partial effects (APEs) by averaging the partial effects across the distribution of c. The APEs can be obtained from Ec[flk¢(a:tfi + ntc)] or Ermine + m6) —- mime + men. In order to identify both fl and APEs, we use two assumptions: E (yitlxia 02') = E (yrtll‘ita 02'), (12) where 3,; E (in, ..., xiT) is a set of explanatory variables in all time periods. The first assumption is that xit conditional on c,- is strictly exogenous. This assumption rules out dynamic models and feedback models. Also this implies that we do not consider traditional simultaneous models and the omitted variable problem. If appropriate 4 instrumental variables are available, this assumption can be relaxed so that explanatory variables are allowed to be endogenous. The second assumption is c,- = tb+§3,-€+a,,a,-|:1:,- ~ N(o,a?,), (1.3) where T,- (E T‘1 231:1 :r:,-t) is a 1 x K vector of time-averaged explanatory variables and a,- denotes the part of unobserved individual heterogeneity which is not correlated with 33,. 0,2, is the variance of c,- given 13,-. This Mundlak (1978)-Chamberlain (1980) device is adopted to restrict the distribution of c,- conditional on 33,-, which enables us to get the APEs by using the law of iterated expectations (LIE). This implies that constant individual heterogeneity is correlated with time-invariant parts of strictly exogenous explanatory variables. For the purpose of flexibility, one can consider more general functional forms for D(c,- |x,-) by adding squared and/or interaction terms. Also parametric modelling for Var(a,-|:r:,) or relaxing the normality assumption of a,|:c,~ can be considered, but I focus on (1.3) in this chapter. [For detailed discussion about more general specifications for D(c,-|:r:,~), see Papke and Wooldridge (2008).] Under assumptions (1.2) and (1.3), E (yrtlfcz', at) = ‘1) (WP + $715 + Ut’fif + mar) (14) nta,|:r:, ~ N (0, 77%02) . 53,-, must contain only time-varying elements so that there is no perfect multicollinearity between 23,-, and 5,. Adding time dummies to the model is desirable in order to allow different intercept for each time. Even though nttb is absorbed into the coefficients of time dummies, I do not consider time dummies explicitly for notational simplicity. By the properties of the normal distribution, we can rewrite the equation (1.4) as 77M + 113213 + 772317326 Err/mm.) 2 2 (1.5) 1 + 77, 0a 5 9 Watt + witflat + iiéat), where 't/Jat E 471% , fiat = 2 2 , and Eat E _’71_§2_2 denote 1+7], 0a 1+1], 0a 1+7], 0a scaled parameters. The subscript a means that all parameters are scaled by 0,2, and the subscript it means those parameters depend on time due to 77t- Because we can observe (1, :r:,-t,E,-), the scaled parameters Wat, fiat, Eat) are identified. Papke and Wooldridge (2008) proposed methods to estimate parameters and APEs consistently without distributional assumptions about D(y,-, |:c,, 0,). By the LIE, the average structural function is obtained as follows. Er, [PU/lat + Svtfiat + fitter” - (1-6) The APEs for each t can be obtained by differentiating equation (1.6) with respect to CL‘t,k. 513,- is redundent in the conditional mean firnction and c,- and :r:,-t are independent conditional on 52,-. With these two assumptions, we can use the arguments in Wooldridge (2002, sectional 2.2.5). Therefore, the APEs of a: k at time t can be consistently estimated by N-1 2,121 Bat,k¢ (that + 1,13,“; + sing“) for each t. A scale factor also can be obtained from (NT)"'1 2;:1 ELI ¢ (12201, + mitBat + iiéat) We can compare coefficients of the nonlinear models with those of linear models after multiplying the coefficients of the nonlinear model by this scale factor. 1.3 Estimation methods with strictly exogenous variables Let us define wit E (1, z,t,E,) and 7rt E (t/Jgt, 5:1,, €;t)’ for notational simplicity. Then, the conditional mean function is (w,t1rt). Before proceeding, I need to discuss two practical issues: normalization of 0,2, and recovering parameters of interest. For identification, 0a is set to l as normalization, but this does not affect the results because we have a nonlinear model. From now on, I will drop the subscript a. Our interest lies in 6 E (10’, 77’, 5’, 5’)' not 7r E (rt/1, 7rI2, ""7rf’1‘) , and there are specific relationships between 7t and 0 (For example, wt = 7,0/ 1 + 77?). Therefore, two approaches can be used: the classical minimum distance (CMD) estimator and estimation with constraints. First, the CMD estimator recovers 6 from 7r by minimizing the weighted differences in the relationships between 7r and 6. This CMD estimator uses the variance-covariance matrix for it as weight. The detailed explanation for the CMD estimator will be provided later. Second, one can estimate the model after imposing the relationships directly on the objective functions. Estimation methods with constraints are straightforward. Now focus on the ways to estimate 7r consistently. Given the conditional mean fimction, there are many possible estimators. One estimator is the pooled nonlinear least squares (PNLS) estimator. This estimator is consistent and x/N-asymptotically normal (T is fixed). However, the pooled NLS estimator is ineflicient because this assumes homoskedasticity, but this assumption does not hold. A reasonable model for the variance, V(y,-t |:r:,-), has the following form: V (yit I 3373) = 790021:th [1 - ‘1’ (witfltlL (1-7) where r is a constant. [Readers are referred to Papke and Wooldridge (1996) for detailed discussion] Based on (1.7), the weighted nonlinear least squares (WNLS) estimator can be an alternative. In this chapter, the pooled quasi-maximum likelihood estimator (QMLE) based on Bernoulli distribution is used instead of the WNLS estimator. The pooled QMLE is equivalent to the WNLS when the inverse of (1.7) is used as a weighting matrix , but the pooled QMLE is simpler because the NLS estimator needs preliminary results to obtain the weighting matrix. Also this estimator is strongly consistent even if the true distribution of y is not Bernoulli once the first moment is assumed to be correctly specified because the Bernoulli distribution is belong to the linear exponential family (LEF) [Papke and Wooldridge (1996), Gourieroux, Monfort and Trognon (1984)]. The pooled QMLE is obtained by maximizing the sum of the following log-likelihood function. I call this estimator the pooled fractional probit estimator (FPE) as Papke and Wooldridge (2008) did. git (“121;”) = yit10g[(w,-7r). Applying MWNLS estimator requires the variance-covariance matrix of 3),, Var(y,-|:r:,-). Along with (1.7), we need to form Cov(y,-t, y,3|:r:t),t # 3. Instead of using a parametric model for Var(y,- [33,), its working version proposed in the generalized estimating equations (GEE) literature is used. Also the standardized errors are assumed to have the same correlation (exchangeable correlation assumption). The common correlation is calculated by f) = [N T(T —— 1)]_1 Z,N___1 ELI 2,3753 é,té,-S, where éit = flit/«@(witfit) [I —- (w,t‘7l't)] and {fit E yit — (I)(wit7i't). ‘~’ means that all are evaluated at the preliminary consistent estimates from the pooled FPE. The MWNLS estimator solves N ngnZIyi‘mi(wiv7r)iII7i—1Iyi‘mi(wiv7I)I (1-9) i=1 V,- = Diag [6,(1—6,)]1/2C , (away [in-(1 — in] 1/ 2 where I7,- is the T x T matrix where its diagonal elements are from (1.7) and its off-diagonal elements from the ‘working correlation matrix’. C(23) is the T x T matrix where its diagonal elements are l and its off-diagonal elements are p [Papke and Wooldridge (2008), Liang and Zeger (1986)]. This MWNLS estimator is equivalent to the GEE when the same weighting matrix is used. Note that we cannot consider the serial correlations when we recover the parameters by the CMD estimator because pure cross-section data is available for each t. Therefore only constrained MWNLS (or constrained GEE) can be used to consider possible serial correlations. For testing hypotheses and using the CMD estimator, we need a consistent estimator of asymptotic variance matrix. Hypotheses include independence between the unobserved heterogeneity and regressors (H0 : g = 0) and time-invariant individual heterogeneity (H0 : 77t = 1). Because we do not assume that the conditional variance of y is correctly specified, a consistent estimator of asymptotic variance robust to heteroskedasticity and serial correlation has so-called sandwich form [Arellano (1987), Wooldridge (2002)]. With regularity conditions, the pooled FPE is x/N-asymptotically normal with the variance of REE (erMLE) = A-IBA—l/N, (1.10) where A = N-1 iii. 23;. (mic-m), B = N-1 2.1212; Eases, and is}, = Vw€,t(fr‘) [Wooldridge (2002, Theorem 12.3)]. In the case of the MWNLS estimator has the following asymptotic variance. —1 N AvarnMWNLs) = vamgvflvm. (1.11) i=1 N A A x zvflmgt/i—la,agig—1vnm, i=1 N —1 X(ZV7rfn;I/i—1V7rrh, /N, i=1 where Vwrh, is the Jacobian of the mean firnction, m,- is m,(w,-, 7r) evaluated at 7?, and u,(E y,- - m,(w,, 7%)) is a residual vector. With scaled estimates (it) and their asymptotic variances (E), the CMD estimator is used to estimate (9 from fr. We have (T + 2K) parameters of interest (6), but 7? is a 10 (T + 2TK) x 1 vector. Let’s suppose h(-) be a continuously differentiable function to represent relationships between 6 and 1t (7r = 12(6)). The CMD estimator minimizes the A distance between it and h(6). H31; [7} — h(0)]’§—1 [7r — h(6)]. (1.12) From the first order condition for 6 and a standard mean value expansion around 6, the following asymptotic distribution of 6 is obtained. W (a — 9) fl: N (0, ways—111(9)) , (1.13) where H (6) (E V9h(6)) is the (T + 2TK) x (T + 2K) Jacobian of h(6). The appropriate estimator of asymptotic variance of 6 is A A A I A A Avar(6) = (H E‘lH)-1/N, (1.14) where H is H evaluated at 6. When we use the estimators with constraints to get 6 directly, the robust asymptotic variance of 6 has the same form with (1.14). 1.4 Estimation methods with endogenous explanatory variables Now let’s relax the strict exogeneity assumption given appropriate instrumental variables. Time-varying individual heterogeneity is considered as above. One endogenous variable case is considered in this chapter, but considering multiple endogenous variables case is straightforward. The basic model with an endogenous explanatory variable is as follows: 11 E (Mull/£132: Zrt, Ci1,vit1) = E (yitliyith 261,011,1511) (1-15) = ‘1’ (0113/62 + 26151 + 771611 + Um) where yitg denotes an endogenous explanatory variable which is assumed to be correlated not only with individual heterogeneity (0,1) but also with unobserved omitted variable (1),“). 2,, (E (2,,1, z,t2)) denotes a set of exogenous explanatory variables and instrumental variables. Instrumental variables (2,,2) should not have significant effects on ym and must be at least partly correlated with 3,1,,2. The Mundlak(1978) -Chamberlain(l980) device is applied again (cillz, ~ N ($1 + E2'51"" ai11031))- E(yrt1|yit2, Zr. Tm) = (1’07th + alyit2 + 221151 + 77155161 + 7"211) (1-16) where rm denotes a composite error of man and 22,131. In this structural equation, mtg is assumed to be correlated with 11",“. The reduced form for y,t2 is assumed as mm = Wt + 22152 + 5162 + 73:52 E 101127 + Trt2~ (1-17) This equation can be consistently estimated by the pooled OLS estimator which gives the same estimates with the fixed effects estimator because the averages of explanatory variables are added. Endogeneity of y,t2 implies that mtg is correlated with both ail and 22,“. I assume that rm given rm is conditionally normal and this leads m1 = Pth‘tZ + 612:, eitIZz' ~ N(0, T?)- 12 Substituting r,t1 with Pth't2 + Git, E(yrt1|yz't2. 22', TM) = @(ntwt + alyitZ + Zit151 + with (1-18) +Pt7‘z't2 + err) mm + 0113622 + 3213161 + 61:26 1 + ptt‘z'tz 1 + 7% Because joint normality of (7,”, 73,2) is assumed, the final model (1.18) comes from properties of normal distribution. T1 is set to 1 as normalization. With a continuous endogenous variable, two possible approaches can be used to estimate the equation (1.18) consistently: the two-step estimation procedure and the limited information maximum likelihood estimator(LIMLE). The LIMLE is called the single step estimator compared to the two-step estimation procedure. The two-step procedure proposed by Rivers and Vuong (1988) is as follows. 1. Estimate the reduced form for yg (1.17) and get the residuals, 63,2. 2. Replace 7,152 with 53,2 as an additional regressor and estimate the structural model (1.18). Now all explanatory variables are observable and the omitted variable problem can be avoided by adding 72,2. Even though fag is added to the model, mg is still allowed not to be strictly exogenous. Therefore, I use the FPE instead of the MWNLS estimator. The MWNLS estimator requires the strict exogeneity as the generalized least squares (GLS) estimator in linear models does. The endogeneity (H0 : (0,; = 0) can be tested by using the Wald statistics. If the null hypothesis is rejected, we should adjust the asymptotic variance for inference because the final model (1.18) contains a generated regressor (Fug) from the 13 first step estimation. In other words, the asymptotic variance of W (6 — 6) depends on the asymptotic variance of x/N (7i — 7) [For detailed discussion, Wooldridge (2002) ch. 12.4 and Papke and Wooldridge (2008) Appendix 1]. This two-step procedure has some advantages. Along with computationally simplicity, the endogeneity can be easily tested because we do not have to adjust asymptotic variances under the null hypothesis. However, the asymptotic variances should consider generated regressors if the exogeneity is rejected, and this approach cannot be used if endogenous explanatory variables are not continuous. . Wooldridge (2007) proposed the quasi limited information MLE (QLIMLE) of nonlinear models without additional assumptions of the joint distribution of y1 and yg. Substituting mg with (ym — 212,127), we can rewrite the equation (1.18) as 77112 +ay- +z- 6 +17'z'f E(yit1I3/it2azi1wit2) = ‘I’I t 1 1 “2 m 1 t z 1 (1-19) 1+7? ”1+7? E (w,-t17r). Because I assume a correctly specified mean firnction and the Bernoulli is the member of the linear exponential family, the Bernoulli QMLE estimates the equation (1.19) consistently. To identify all parameters, equation (1.19) is estimated along with the equation (1.17). The the quasi-log-likelihood function is as follows: 121(9) = yit 108 Iq)(wz't17rt)I + (1 — yit)108 [1 - @(wz'tlfitll (121) —(1/2) loge?» — mm [(m + waif/032] , where 032 E Var(r,t2|w,t2). Compared to the two-step procedure, the QLIMLE 14 might be more efficient because both the structural equation and the reduced form for the endogenous variable are estimated together, and estimating valid standard errors is straightforward. Also this approach can be used when the models contain discrete or limited endogenous variables, while the two-stage estimator is invalid [Wooldridge (2007)]. 1.5 Test statistics for time-varying individual effects In this section, I discuss test statistics for testing constant individual heterogeneity (H0 : 17, = 1). Let us start from the basic estimating equation of Papke and Wooldridge (2008), E(y,,|:r,) = (I) (7,!) + 510,,6 + 53,0. To consider time-varying individual heterogeneity, consider a more general estimating equation, E (y,,|:r:,) = (130/1 + $63 + M + 5::(52'5 X Ttl) E 4’ (wz'fl + Mité >< T0) E mu, Where 716 X Tr indicates interaction terms between 33,7; and all time dummies. The degree of freedom does not depend on the number of regressors in this case. The first statistic is a regression-based test which is also known as a variable addition test. This is a special case of Ramsey’s RESET in that fitted values are used as additional explanatory variables. The procedure is as follows: 1. Estimate the null model and get the coefficients of E,- (E). 2. Add interaction terms (3,? x T,) between 5,? and time dummies. 3. Estimate the model of 37,, on 1, a set of all time dummies, :r,,, E,, (25,75 x T,). 4. Test whether all coefficients of the interaction terms are 0 (H0 : (it = 0). This test can be implemented by using usual statistical packages such as E-views or STATA. Also it is easy to make the test statistics robust to possible misspecified second moments. 15 The score or Lagrangian multiplier (LM) statistic is ideally suited for specification testing. Only restricted estimates are needed to implement the score test. In most cases, restricted models are quite easier to estimate. Let the restriction be c(6) = O, (6 E (7r’, 69’) and its Jacobian matrix be C'. With consistent estimates, each element of the statistic is evaluated at the restricted parameters, 6 = (71’, 0’)’. Because I do not assume correctly specified variance, the general form of the statistics should be used. I LM :11, £545) 547—15” [EAT—lééi—lé’] ‘1 Sis-(‘6') 2. X%T_1), i=1 i=1 (1.22) where S, (6) is the score function evaluated at the restricted estimates. A and B are defined in (1.10), respectively. the LM statistic should be 0 evaluated at the unrestricted estimates (6). If the restriction is valid, the LM statistic evaluated at the restricted estimates (6) also should be close to 0. The following procedure can used to produce the same test statistic with (1.22). Divide 6 into the (K — T + 1)><1 vector of 7r and the (T — 1)x1 vector of 6,. Let Vnm,, and V, t m,, denote the gradients with respect to 7r and 6,. 1. Run a multivariate regression thm,, on Vnm,, and obtain the 1 x (T — 1) residuals, f,,. Then make 11,,f,, by multiplying 11,, by each elements of i‘,,, where 11,, is the residuals from the restricted model. 2. Run the regression 1 on fr,,f,, and construct LM = NT — SSR, where SSR is the usual sum of squared residuals. This statistic has a limiting XCZZLI distribution. The last one is the minimum distance statistic which comes from the CMD estimator. Going back to the time-varying model, let’s suppose a vector of the restricted parameters be a E (w’, 1, 6’, 5')’, where 77, = 1 is imposed and g(-) shows the relationship between rt 16 and (1 [7r = 9(a)]. The minimum distance statistic is N [(5% — Nana—10; — 9(a))j — N [(5% — h(6))’§"1 (a — 14%)] 3 X%_,. (1.23) This statistic is based on the distance between two minimized objective firnctions. The ' intuition of the statistics is the same with that of likelihood ratio (LR) test. If the restriction is true, the distance must be close to 0. However, we need to get 6 and 62 to obtain this statistic like the LR test. 1.6 Empirical application: math test pass rates In 1994, Michigan reformed K-12 school funding system in order to equalize educational opportunities in terms of school spending. From a policy prospective, it is important to estimate the effect of school spending on student performance consistently. Papke (2005, 2008) found that increase in per-student spending has significantly positive effect on student performance measured as pass rates on fourth grade math test (the Michigan Educational Assessment Program, MEAP, test). In the paper, she allowed the endogeneity of spending and used foundation grant as an instrument variable for school spending. While Papke (2008) adopted the fixed effects estimator to control school heterogeneity in the linear model, Papke and Wooldridge (2008) provided a new approach to consider a nonlinear conditional mean function with unobserved heterogeneity. They also found a significantly positive effect of school spending on math test pass rates. I use the same data with Papke and Wooldridge (2008). Table (A. 1) provides simple descriptive statistics for key variables over the years of 1995 through 2001 used in this chapter. Papke and Wooldridge (2008) used 501 school districts, while 503 districts are used here. Equation (1.24) shows the mean firnction for math test pass rates with time-varying 17 unobserved school district heterogeneity. E(math4,,|-) = [T,6, + [31 log(avgre:rpp),, + figlunch,, (1.24) +53 log(enr0ll),, + €110g(avgrexpp), + 52W,- +€3Wi + 77M]. where math4,, denotes the fraction of 4th graders who pass the MEAP fourth grade math test in district 2' for year t. The sample includes 503 school districts and 7 years. T, is a set of time dummies to allow a different intercept for each year. Papke and Wooldridge (2008) used three explanatory variables: the 4-year averaged value of spending per pupil in real dollar (log(avgrexpp)), the fraction of students who are eligible for free or reduced lunch program (lunch), and the number of enrollment (log(enroll)). Because Papke (2005) found that not only the current spending but lagged spendings have significant effects on the performance, averaged real spending per pupil in first, second, third, and fourth grade (avgrerpp) is considered. Logarithm transformation implies a diminishing effect of spending and enrollment on the test pass rates. The purpose of this model is to find the effect of spending on the test pass rates, so all variables which might be correlated with spending and have significant effect on the test pass rates should be controlled. Two control variables are added into the model; lunch is used as a proxy for the poverty rate or economic well-being, and log(enroll) is for controlling school size. Our estimation procedure is as follows: (1) estimate equation (1.24) for each year by using the pooled FPE and get total 49 parameters (it). There are 7 parameters and each school district has 7 years of observations. (2) Recover the parameters of interest 18 6 = (6’,17’,B’,§’) by the CMD estimator from if. Also the model with the restriction of 7r = h(6) is estimated by both the pooled constrained FPE and the constrained MWNLS estimator. Table (A2) and (A.3) contain the estimates of Papke and Wooldridge (2008)’s and a linear model. Because all models are nonlinear, the APEs of all three explanatory variables are provided to compare these results with those of the linear model. The first column of the table (A.2) contains the results of the fixed effects estimator of the linear model, and the next columns are the replicated results of Papke and Wooldridge (2008). The table (A.3) shows the estimation results of the time-varying model (the model with time-varying individual heterogeneity). Three estimation methods are used: the CMD estimator after the pooled F PE for each time, the constrained FPE (the constrained fractional probit estimator), and the constrained MWNLS estimator. The constrained estimators indicate the estimators after imposing the relationships between 7r and 6 on the conditional mean function. The results show two things. First, the effect of school spending on the performance is still significant and positive after considering time-varying individual heterogeneity. The coefficients of log(avgre:cpp) of the time-varying model (0.949 ~1.103) are slightly bigger than those of Papke and Wooldridge (2008) (0.883 ~0.886), but it is meaningless to compare the coefficients directly. Comparing the APEs, I find the effect of the time-varying model (0.214 ~0.228) is smaller than those of Papke and Wooldridge (2008) (0.298). Under the time-varying heterogeneity assumption, increase in spending per student by 10 percentage points leads increase in test pass rates by 2.1 to 2.2 percentage points. These APEs of the nonlinear models are smaller than the coefficient of log(avgre:r:pp) in the 19 linear model. Second, f7, varies fi'om 0.84 to 1.24. Table (A2) contains the results of the LM test and the variable addition test. The constant heterogeneity hypothesis is reject at 5% significant level. Also the minimum distance statistic in the table (A.3) is 35 and its p-value is zero to four decimal places. Because the LM test and the variable addition test are to test a necessary condition for the time-varying heterogeneity, minimum distance statistics is greater. These three test statistics show that the assumption of constant individual heterogeneity is too restrictive. a . Under proposal A, each school district was given the foundation grant based on its spending level per student in 1994. As a result, all districts received per-student spending at least as much as a basic grant. For example, in 1995, school districts spent under $4200 in 1994 were given $4200 per student or an additional $250 per student. High revenue districts were held harmless in that they could receive at least as much amount as before. As discussed in Papke (2005) and Papke and Wooldridge (2008), the foundation grant variable is a non-smooth function of spending in 1994. After controlling for spending in 1994, it seems reasonable to assume that the foundation grant is not correlated with unobserved shocks that affect the test pass rate. On the other hand, it is easily show that the foundation allowance is partially correlated with the average expenditure per student because a district’s revenue is largely determined by the foundation amount. As discussed above, the estimating equation for math test pass rates has the contrl fiinction form, 20 E(math4,,|-) = [T,6,1 + alog(avgrexpp),, + dlllunch,, (1.25) +51210s(em‘oll)z't + €11Wz‘ + 513t10sIT€$pp94li +£12W + ptt‘z‘tz + eitia where log(re:z:pp94) denotes logarithm of real expenditure per pupil in 1994. Adding this variable allows us to assume tha the foundation allowance is unrelated to the error, e,,. The variable r,,2 is the reduced form error from the equation for log(avgre:r:pp) (1.26). The reduced form equation is log(avgr‘e:r:pp),, = T,6,2 + dlglunch,, + 621 log(enr0ll),, (1.26) +522t10s(T6$W94)r + 5231: 103(9mntlrt + éztmz' +£2210g(enroll), + 7,,2, where log(grant),, denotes logarithm of real foundation grant. I have two estimation strategies. First, estimate equation (1.26) and get residuals (73,2). Substitute r,,2 in (1.25) with the residuals and estimate the model by the pooled FPE (and the CMD estimator) or the constrained FPE. Second, equation (1.26) and equation (1.25) can be estimated at the same time by the pooled Quasi-LIMLE (the single step estimator). Table (A.4) contains three results: the first column shows the result of the fixed effects instrumental variable estimator, and the rest columns are the replicated results of Papke and Wooldridge (2008) using the two-step FPE and the quasi-LIMLE, respectively. School spending still has a significantly positive effect on the test pass rates, and the effects are much greater than those in the exogenous case. This implies that the effect of spending 21 has downward bias under the exogeneity assumption. The QLIMLE has smaller standard errors and bigger APEs than the two-step procedure. The coefficient of spending in the linear model is between them. In both methods, t-statistics reject the null hypothesis of the exogeneity of spending. The hypothesis of constant individual heterogeneity is rejected at 5% significant level. Table (A5) and table (A.6) show the results of the fractional model with time-varying individual heterogeneity. Table (A5) contains the results of the two-step procedure, while table (A.6) reports the results of the single-step estimator (QLIME). In both tables, the spending has a significantly positive effect on the math test pass rates. In the two-step procedure, both the CMD estimator and the constrained F PE report slightly different coefficients of the spending and the constrained F PE seems to be less efficient than the CMD estimator. In the single step estimator, both estimation methods give quite similar results and the constrained methods (CQLIMLE) looks more eflicient. The APEs of the fiactional model with time-varying individual heterogeneity are from 0.507 to 0.592, and are similar with the coefficient of the linear model. Under endogenous spending assumption, increase in school spending per student by 10 percent points results in increase in the math test pass rates by 5.6 percentage points. The exogeneity of spending is rejected at 5% level, and the constant individual heterogeneity is also rejected. Table (A.7) shows the scale factors at difierent spending levels. Using linear models, Papke (2005) found different effects of spending at different spending levels and different initial performance levels, by estimating separate models for various subgroups. Papke and Wooldridge (2008) used a more parsimonious approach to allowing nonconstant partial effects by applying the fractional probit model, but with constant coefficients on 22 the heterogeneity. The table contains the scale factors for nine cases: strictly exogenous and endogenous spending and the two-step procedure and the single-step estimator for endogenous spending. For each case, three results are reported: the fractional probit model of Papke and Wooldridge (2008), the CMD estimator and the FPE (or the CMWNLS estimator in exogenous case) of the time-varying model, for the earliest and latest years in the data, 1995 and 2001. From the results, we can see the effect of school spending is greater at school districts with lower spendings. Let us focus on the results of the QLIMLE for the time-varying model with endogenous spending because we already reject the time constant heterogeneity and exogeneity of spending. The APEs are 0.668 in 1995 and 0.526 in 2001 at 5th the percentile of spending, while the APEs are 0.456 and 0.327 in 1995 and 2001 at 95th percentile. We can find diminishing APEs of school spending as spending levels go up. The difference in the APEs in 1995 and 2001 of Papke and Wooldridge (2008) is 0.03 (0827-0797). When we allow time-varying heterogeneity, the difference becomes 0.142 (0.668-0.526). Generally, there are nontrivial differences in the APEs across different parts of the spending distribution when time-varying factors are allowed on the unobserved heterogeneity. 1.7 Monte-Carlo simulations We can study the small-sample properties of the various estimation methods, when the models are and are not correctly specified, via a simulation study. Because some of the models are time consuming to estimate, I use 500 Monte Carlo replications. Nine different sample size combinations are used: the number of cross-sectional units are 100, 300, and 500 and the number of time periods are 5, 7, and 10. Equation (1.27) shows the data generating process of an explanatory variable. 23 at = men/ham (1.27) Ci weft N N(011)r where £13,, denotes an explanatory variable and is allowed to be correlated with unobserved heterogeneity (c,). The variance of 27,, is set to 1. The data generating process of a fractional response variable is as follows. H Beroulli outcomes, 10,, h are generated with probabilities that depend on the covariate, the unknown heterogeneity, and unobservables u,,. The observed fractional response is the fraction of successes. pit = PIwz'th = 1) = @(VytTt + Exit + "tci + “it)auit ~ NIO. 1) (1-28) 1 H ya: = ‘1; 2: with, with N Bernoulli(p,,), where T, is time dumrfirizels. The data generating mechanism is the same as creating the fractional variable, y,,, from the Binomial distribution in which the probability of a success is defined as equation (1.28) and there are H trials. I use H = 1000. The normalization I use on the time-varying coefficients is 171 = 1. I set up two different populations: 17, is increasing by 0.1 and by —0.1 over time. 7y, is T x 1 vector of known numbers and I set 6 = 1. Let the correlation between a: and c be ch- Because variances of :r and c are set to 1, the correlation of :2: and c is the same with the coefficient, 7,36. Nine different correlations between :1: and c from 0.1 to 0.9 are considered. The larger p3,C is, the better the APEs of the fractional model with time-varying heterogeneity fit intuitively. 24 Only exogenous case is considered because both the endogeneity effect and time- varying individual heterogeneity might be mixed up so that we have difficulty in interpreting results. Six estimators are compared. The first one is the fixed effects estimator for the purpose of comparison. The next two are the pooled FPE and the MWNLS estimator proposed by Papke and Wooldridge (2008). The last three are the CMD estimator based on estimates of the pooled FPE for each time, the constrained F PE (CFPE), and the constrained MWNLS (CMWNLS) estimator. Because the model in the population is nonlinear, a closed form for the average paratial effects is difficult to obtain. In stead, the APEs in the population are approximated as APEt,pop = Em) [fies ("lytTt + 5n + 771:0 + 1%)] (1.29) 22 'l [3i ‘2]: d> VytTt + Exit N . 2 that is, I use a sample average across the distribution of 22,, but with the true values of the parameters inserted. Equation (1.30) shows the conditional mean function and the estimated APE: E(y,,|:c,) = ‘I’ITtTt + Eth’t + 6,573,) (1'30) - 1 N - - APE, = fitIV— Z ¢(EtTt + Erma + 5%), i=1 where, of course, these will differ by estimation method. To indicate different estimation methods, I write APE,,,. By comparing the root mean squared errors (RMSE) and biases between the APEs of the population and those of all estimators, I attempt to find the effect of considering time-varying individual heterogeneity on the APEs. 25 RMSE, = \J % iMPEu — APE,,p0p)2 (1.31) Table (A8) through table (A.14) cofrfalin the simulation results. Two cases are considered as explained above: an increasing individual effect and a decreasing individual effect. Table (A8) through (A.11) show the RMSEs and biases from 1022c =0.1 to 0.9 when N is 500. The results shows the RMSEs become bigger as pzc increases. When pm is large (ch > 0.5), all estimators have quite similar RMSEs. In large pm, the time-varying model gives slightly larger RMSEs than even a linear model. Note that high correlations between a: and c mean there is not enough variations in sc, which results in imprecise estimates and large RMSEs. So I restrict Pro to reasonable ranges (from 0.2 to 0.4). Table (A.12) through (A.14) report the results of different N and different T. In most cases, the time-varying model gives better results than not only the linear model but the model of Papke and Wooldridge (2008). The constrained MWNLS estimator, which accounts, albeit in a misspecified way, for serial correlation, generally shows the best results. For examples, when p5,, = 0.4 and 17, is increasing, the RMSEs of the time-varying model are 0.0208~0.0203 with N = 500 and T = 10. The RMSE of the linear model is 0.0270, and those of Papke and Wooldridge (2008) are 0.0270~0.0271. If we calculate the APEs at different percentiles, Papke and Wooldridge (2008) model must show lower RMSEs than the linear model. 1.8 Conclusion I allow unobserved individual heterogeneity to vary over time in a fractional response model. Based on the framework of Papke and Wooldridge (2008), the model is extended by adding an interaction term (an interactive effect) between a time effect and an individual 26 effect proposed by Kiefer (1980) and Lee (1991). This interaction term can be interpreted as a time-varying coefficient of individual heterogeneity or an individual specific intercept at each time. Therefore, each individual is allowed to respond differently to the common effects (the time effect). This setting contains the constant individual effect model as a special case and we still have a manageable form. The Mundlak-Chamberlain device is used to restrict of the distribution of individual effect conditional on explanatory variables. Based on a conditional mean function for fractional dependent variables, the pooled fractional probit estimator and the multivariate non linear least squares estimator are discussed as in Papke and Wooldridge (2008). Two cases are considered: strictly exogenous explanatory variables and continuous endogenous variables with appropriate instrumental variables. Especially, I adopt the single-step estimator of Wooldridge (2007) along with the traditional two-step estimator when the model contains endogenous explanatory variables. Also I mention three test statistics for time-varying individual heterogeneity. In particular, the variable addition test can be conducted by using usual statistical software programs such as the STATA even though the time-varying model cannot be estimated by those kinds of software. I apply the estimation methods to analyze the effects of school spending per student on students’ performance in terms of the math test pass rates in school district level. The time-varying individual heterogeneity model allows each school district to respond difl'erently to common time effects such as the difficulty level of the test. I find that the effect of school spending on test pass rates is still statistically significant after considering time varying individual heterogeneity and the APEs are quite similar in both the time-varying model and the time-constant heterogeneity model. However, when the 27 APEs are evaluated at different percentiles of spending and in different years, the APEs of the time-varying model are smaller that those of the time-constant model. The hypothesis of a constant individual effect is rejected by all three test statistics. The Monte-Carlo simulations are also conducted to find finite sample properties of estimators. Two cases are considered: an increasing and a decreasing individual effect using nine different samples with N=100, 300, 500 and T=5, 7, 10. Also different correlations between the regressor and individual heterogeneity from 0.1 to 0.9 are considered. When our interest is restricted to the reasonable ranges of the correlations between an explanatory variable and individual heterogeneity (0.2~0.4), the time varying model gives less RMSEs than both the linear model and the model with constant individual effects. Especially, constrained MWNLS estimator which considers a possible serial correlation shows the best results among all estimator. 28 Chapter 2 THE FRACTIONAL MODELS WITH BINARY ENDOGENOUS EXPLANATORY VARIABLES AND TIME-VARYING INDIVIDUAL HETEROGENEITY 2.1 Introduction In this chapter, I continue to study the fractional response model, but now I allow for the presence of a binary endogenous explanatory variable. I combine the problem of an endogenous binary explanatory variable with unobserved heterogeneity. As in the first chaper, I allow time-varying coefficients on the individual heterogeneity. In the first chapter, continuous EEVs are considered and an extension of the two-step control function approach proposed by Rivers and Voung (1998) is used. However, it is well known that there is no consistent two-step estimator when EEVs are not continuous in nonlinear models. Instead, the bivariate probit model is modified to derive a conditional mean function and the single step estimator. Recently, Wooldridge (2007) has shown that in some cases — fractional responses being a leading one — quas-limited information maximum likelihood can be used. Therefore, the main contribution of this chapter is to derive the conditional mean function for fiactional dependent variables with binary EEVs and to extend Wooldridge’s approach to the panel data case. The model here has many applications. For example, y,,1 could be the proportion of questions correctly answered on a test, and y,,2 an intervention, such as getting individual tutoring help. Or, 37,,1 can be the fraction of pension assets invested in the stock market and y,,2 and indicator for taking a “financial awareness” class. The empirical example I use here is to analyze the effect of fertility on female labor supply. Women are still 29 primarily responsible for providing child care and doing housework. Jacobsen (1994) pointed out that women spend four times as much time for child care as men. Therefore, it is not surprising that many studies has found a negative relationship between fertility and women’s labor supply. However, to interpret this causal relationship clearly, we have to consider the endogeneity of fertility in female labor supply equation. Using my model and estimation method, I can address two kinds of endogeneity: fertility and female labor supply are likely to be jointly determined and unobserved individual heterogeneity might affect both variables. To consider this endogeneity, simultaneous equation systems may be used to consider joint determination of fertility and labor supply. However, if there is a misspecification in any equation, this misspecification transmits its inconsistency to all system. In other words, even if our interest'lie in the labor supply equation and this model is correctly specified, inconsistent estimates might be obtained when the other model has a misspecification problem. An alternative is to use an instrumental variable estimation method with appropriate instruments for fertility in reduced form for labor supply. Rosenzweig and Wolpin (1980) used twins in the first birth as an instrument. Because twining occurs randomly, this would produce consistent estimates, but we have a sample size problem. There are only 87 twin mothers in Rosenzweig and Wolpin (1980). Angrist and Evans (1998) proposed a new instrumental variable, an indicator for whether the sexes of the first two children are the same, for fertility based on the parental preference on the mixed sex sibling in developed country2. Angrist and Evans (1998) compared two instrumental Park and Cho (1995), and Arnold and Zhaoxiang (1981) use parental preference for boys in developing countries. 30 variable results: twins at the first birth and the same sex. This allows them to find the effect of children of different ages. Carrasco(2001) extended Angrist and Evans (1998) to panel data, but he focuses on the same sex. Some studies use lagged dependent variables as proxies to control unobserved heterogeneity, while panel data allow us to control individual heterogeneity explicitly. Instead adding lagged dependent variables, I use Mundlak(1978) - Chamberlain (1980) device to restrict the distribution of unobserved heterogeneity. Another issue is definitions of labor supply and children variables. There are two major measures of labor supply variables: job participation (extensive margin) and working hours (intensive margin). Angrist and Evans (1998) used participation, weeks worked, and hours per a week, while Carrasco’s (2001) dependent variable is labor force participation. The effect of fertility on labor supply is not clear a prior because an income effect and a substitution effect coexist. However, in most cases mothers are likely to withdraw entirely from labor market or at least reduce their working hours to take care of their new born babies. The binary dependent models might underestimate the effect of the fertility in that the binary indicator does not change as long as mothers take part in the labor market even if they reduce working hours. In this paper, fraction of working hours to total available hours is used as a dependent variable. Many researchers use linear models for working hours, but working hours have two limits: maximum time limit and 0. Therefore, linear models cannot ensure the fitted values exist between these limits. Also constant effects of independent variables might be too restrictive. In this paper, I focus on the effect of new born babies because most of the effect of fertility depends on that more than, for example, on the number of children [Carrasco (2001)]. 31 3 2.2 The Model with binary endogenous explanatory variables As in the first chapter, I assume N randomly drawn cross-sectional units from the population and T observations for each cross-sectional observation. Again, large N and fixed T asymptotics is used. Let us assume a correctly specified conditional mean function as E(l/it1I3/z't2a Zitla Ci1> Uitl) = War/m + 26161 + mm + Urn), (2-1) where 2' indexes a cross-sectional unit and t indexes time (2' = 1,2, ..., N ,t = 1,2, ..., T). y,,1 denotes a fractional dependent variable which can take on any value between zero and one, including zero and one. The variable 37,,2 indicates a binary endogenous explanatory variable (EEV). In this chapter, I focus on one EEV case; in principle, it one can extend the approach to more than one EEV. 2,,1 is a set of strictly exogenous regressors. The interactive effect, 17,c,, is added to consider time-varying individual heterogeneity again. v,,1 is an unobserved omitted variable and is thought to be correlated with y,,2. Also the standard normal cumulative distribution function (cdf) is used for the mean function of y,,1 to make the expected value between zero and one. This particular choice also provides some computational advantages. Our main interest lies in the partial effect of a binary EEV. We can obtain this from ©(a + z,161 + 77,c1 + 27,1) — (z,161 + 771501 + 11,1) dropping the subscript z'. The partial effect depends on 2,1, cl, and 11,13. We can evaluate the effect at interesting values of 2’s or average the effects across individuals, but we may not observe c1 and 11,1. Therefore, a popular method is to calculate the average treatment effects (ATEs) by averaging the This partial effect is called treatment effect when the variable of interst is binary or discrete. 32 treatment effect across the distribution of c1 and v1. Therefore, ATEt = E(c1,v1)l‘1’(0t + Ztrfit + 772:61 + vtl) - ‘I’Iztlfit + 77:01 + ”all (22) To identify the parameters and the ATEs, the Mundlak(1978)-Chamberlain(l980) device is adopted to restrict the distribution of individual heterogeneity (c1). — ' 2 C,1|Z, v‘ N061 +z,C1+a,1,aa1), (2.3) _ T where z, E (2,1, ..., 2,17), Z, E l/T :tzl Z,,, and 031 E Var(a,1|z,). Z,, E (2,,1, Z,,2) is a set of all exogenous variables and z,,2 indicates a set of instrumental variables. Rewrite the mean function with the device as follows. Eiyitllyit2aziarit1) = @(W/Jr + ayrtz + 22321.31 + mirCr + Tm). (2-4) where r,,1 E (77,a,1 + v,,1) denotes a composite error. The endogeneity of y,,2 implies that r,,1 is correlated with y,,2. To consider the nonlinearity of yg, I use the probit model. There is no theoretical difference between the probit and logit model. However, because plugging probit fitted values for y,,2 into the structural model is not valid, we exploit the properties of the normal distribution. (1 refer to Wooldridge (2002) in the cross section case for probit.) The equation (2.5) shows the reduced form model for y,,2: E (yit2lzitv 012) = (“27162 + 622 + 22,-,2) (2-5) _ 2 Cr2|2r m N(¢2+Zr€1+ai2,0a2), 33 where c,2 indicates individual heterogeneity but is assumed invariant over time for simplicity. v,,2 is unobservable but assumed independent of 2,, and 0,2 for identification. 021 is the conditional variance of a,2. With the Mundlak (1978) - Chamberlain (1980) device, Eillz’tQIZz'» 73252) = (PU/12 + zitB2 + 5&2 + 7‘62) (2-6) = ‘I’Iwit27T2 + 711:2), 703152 E (lazz'tiz'), where 7,,2 E (a,2 + 2),,2) is defined to represent a composite error and is independent of z,,. 112,,2 and 7r2 are defined for notational simplicity. Because the endogeneity of y,,2 implies that (r,,1, r,,2) are correlated, a linear projection of r,,1 on 7,,2 and joint normality are assumed as: r,,1 = p,r,,2 + s,,, s,,| (2,, r,,2) ~ N (0, ”r? ) p, is allowed to vary over time because r,,1 contains a time-varying factor, the coeflicient. From joint normality of (r,,1, r,,2) and independency of 5,, on y,,2 and 2,, Uri/’1 + ayit2 + 221151 + UtEiCI + Writ? (2 7) E(yz‘t1Iyz't2,Zz'ta7‘rt2) = ‘13 (n+7,2 ‘1)(7W17 + avyirz + 2211617 + UtZiCIT + P11773112) ‘I>(wrt17rlt—r + Purim), where subscript 7 means all parameters are scaled by 7'. For notational simplicity, w,,1 E (1, y,,2, z,,1, 2,) and its corresponding coefficient vector, 7r”, are defined and I drop subscript 7' from now on. z, and r2 conditional on c1 and 121 are redundant in equation (2.1) and c1, TI, and y2 are independent conditional on (2,, r2). By the law of iterated expectations, 34 ATEt = E07137?) [‘1’ (mi/11 + a + Zt151 + 77225241 + PtTit2) (2-8) 41’ (mt/)1 + Ztlfit + with + pt'l‘z’tzll- From the properties of the normal distribution, we can rewrite (2.8) as “Ll-p? 4, nt¢1+zt151+nt5iC1 ] 1+p§ 2.3 Estimation methods The models with binary EEVs may not be estimated by the traditional two-step estimation, where generated regressors in the first step estimation are added as additional explanatory variables or used proxy variables for EEVs. In the nonlinear settings, this two-step estimator produces inconsistent estimates because we cannot pass the expectation through nonlinear functions. An alternative is to use the full information maximum likelihood estimator (FIMLE) afier assuming the joint distribution of (371,372) given 2. However, the F IMLE produces inconsistent estimates if the assumed joint distribution is not true in the population or there is misspecification in any equation. In this chapter, the bivariate probit model is modified to obtain a conditional mean function for yl and the Bernoulli quasi maximum likelihood estimator (QMLE) is used based on the conditional mean function. So, I call this the fractional bivariate probit estimator (FBPE). To get the conditional mean function, take expectation with respect to 7‘2. 35 E(yrt1|yit2,zz‘) = (2.10) yitZEIq)(wit17I1t+Pt7‘it2)l7‘it2 > —wz'1:27r 2] +(1—yit2)EI(p(wit17I1t+pt7'it2)l7"z't2 < —wz't27T21 Because Q is assumed normally distributed, wz't27r2 y' , E(yit1Iyrt2»Zr) E ——q,(wi:22,,2) ‘I’Iwz't17r1t+/Jt7‘2)¢(7‘2)d7‘ (2-11) 2 —OO ‘wit27r2 / @(witlfllt + Pt72)¢(7"2)dt‘ “—00 l-ww (Di-wit27r2) + (1’2Iwz't1771t, (23/7172 - llwz't27r2t, (2.1/11:2 - 1).0ti ‘I’IIZyz'tz — 1)wz’t27r2ti To identify all parameters, the structural equation (2.11) and the reduced form (2.6) can E 777.2,. be estimated together. Wooldridge (2007) argues that, because the Bernoulli log likelihood is in the linear exponential family, only the mean E (y,,1 |y,,2, 2,) needs to be correctly specified along with the probit model for 37,,2 conditional on 2,. Here I am just applying a pooled method, and so the argument is essentially identical. The quasi-log-likelihood firnction is ln€,, = yitl ln(mit) + (1 - yit1) 111(1 — mit) (2-12) +yit21niq)(wit27r2)i + (1 — 312732) lnll — @(wit27r2)i Where mit = E (yitllyit27 Zil- With consistent estimates, the ATEs of y,,2 can be obtained from equation (2.13). 36 A l N A A A A A _'A N 3:1 ‘/1 + p, 4, firth + ZnEl + 7%»?le ]. 1 + p? The endogeneity of 3,7,,2 can be easily tested by the two step estimator of the Rivers-Vuoung (1988). If some of the p, are different fi'om zero, we have to use the F BP estimator to account for endogeneity of y,,2. If p, is zero, the conditional mean firnction can be reduced to the simple mean function without endogeneity, that is ‘1’2Iwit1711t.(2%'t2 '— 1)wit27r2ta 0] : ‘I’IIwrtflltI‘I’lIQyz‘w “‘ 1)wit27r2ti _ E (D (217', 7f], . etch-,2 ‘1)wz‘t27I2ti 1<2y32 —1>w.t2w2t1 I z 1 I For inference, we need a consistent estimator for the asymptotic variance. Avar(7r)— — A—IBA_1/N,N (2.14) N T Where A = N4 2 Z —EIV%1D€itI )I B=N_1 Z Z 21 SrsITF (”lsz'tIW)’, i=1 t=1 i=1t1 3:1 and 5,,(7r) E V7, 1n 6,,(7r) [Wooldridge (2002) Theorem 12.3]. The gradient of the quasi-log-likelihood function for each (i, t), V7, 1n 6,,(7r), can be written as 51111321 _ yitl — mit (15Iwzt19’57r 104504131) (2 15) a ‘ -(1— MW —1) 1""1 ' 7T1, mzt mzt yit2 10712772 E yitl *- mz‘t g“ mitIl — mit) z (23/2152 - 1)wit27r 2 — (21/222 — llpt’wz‘tflr 1 l—pg where (b() indicates the standard normal probability density fimction (pdf) and I define ”11731 = 9,,1 to simplify notations for the Hessian matrix. 37 In the same way, am (it = Ilitl — mz‘t (291:2 - 1)¢Iwit27r2)¢(vit2) (97f 2 mitIl - mit) ‘I’II2yz't2 - 1)wit27r2i yit? - ‘I’I’wz'1127r 2) ®Iwit27r2) [1 - ‘thitzfiz _ yitl ‘- mit g_ + 36132 — (w,,27r2) mitIl - mu) ”2 ¢Iwit27r2l [1 - @(wz'tzflz 71212 = witl” 1 - (23/62 - 1)Pt(2yit2 - 1)wit27r2' (/1 — pi? Because 7r2 appears in both the structural equation and the reduced form for 372, there is an 513222 (2-16) )] [yin - (PIwitmll $212 )] uithitQ additional term. The score function for p, is 3111522 = 93131 — mit (2.17) apt mitIl — mit) X (231272 — 1)¢2Iwit17r1t3(2yit2 — 1)w,,27r 2» (23/222 — llptl “(23/222 - llwz't27r2l E yitl — mit 9'2: mitIl — mit) 2 '0’ where ¢2(-) denotes the bivariate standard normal pdf. Some components of V72r 1n 6,,(7r) are 621M —1 2 , —— = g- at, :13. , (2.18) Orlan’l 2:21; m,,(l —— m,,) ”1 z 1 “1 _ = 't 't 'tlr 6316p; i=1,=1 mitIl — mit) 2 1 z p z T 6211.16 —1 I _ = 9't19't2x't1x' - 6216762 1;; m'z'tI1— mit) z z 2 “2 The remaining elements of the Hessian matrix can be obtained in the similar way. The test for time-varying individual heterogeneity is conducted based on the more 38 general function for yl as in the first chapter. See the first chapter for detailed explanation about the more general function. Also the LM (score) test, the variable addition test, and the minimum distance test are conducted as before. 2.4 Application: fertility effect on working hours of women I apply the method to study the effect of fertility on working hours of women with two or more children. As I mentioned above, the same sex indicator is used as an instrumental variable. Therefore, the interpretation of the average treatment effects of a new-bom is the effect of fertility moving from the second child to the third one on fractions of working hours of women with two or more children. Panel Study of Income Dynamics (PSID) for 1986-1989 is used as in Carrasco (2001). I refer readers to Carrasco (2001) for detailed explanation about the data. The PSID is longitudinal survey over 5000 households since 1968 and consists of two independent subsamples: equal probability sample of US household and below 150% of the poverty line. The data in this paper includes both subsamples to increase sample size. The data contains 1442 married or cohabitating women with two or more children between the ages of 18 and 55 in 1986. Table (B.1) shows descriptive statistics for key variables. The women spent about 19% of their total hours to work and there is only one observation of 1. 14.3% of women had a new born baby and 37% of them had children aged between 2 and 6 years in 1986. Their average age is 31.302 and about 77.3% of them are black. As the ratio of women have new born babies decreases, the fraction of working hours to total hours of women increases. The conditional mean function for labor supply is 39 tht + afe'rt,, + kid26,t51 + hinc,,,Bg + 010,619,363 ”1+7? + 0961354 + ntdsefrz'C 1 + fltkid262'42 + UthinciC 3 + 10:73:12] EIf’UJhitI') E (Di (2-19) 2 1+7, where T, indicates a set of year dummies. f wh is the ratio of working hours to total available hours and f ert is a fertility variable which equals 1 if the mother has a child aged 1 at t+ 1, because it is not clear that a child who is a year old at the time of interview means a mother gives birth to the child at that year or previous year [Carrasco (2001)]. Two more exogenous variables are added: kid26 takes 1 if a woman has children aged between 2 and 6 years and hinc is a logarithm form of husband’s income. Mothers’ race (black) and age (age) are also controlled. The time averages of exogenous variables are added to control individual heterogeneity. dsea: is a same sex indicator, which equals 1 if the first two children have the same sex. This variable is used as an instrument variable for the fertility based on parental preference on mixed siblings. Because of indistinguishability and exogeneity of black and age, their time averages do not appear in the conditional mean function for f wh. To consider the endogeneity of f art, a linear projection of r,,1 = p,r,,2 + e,,, 5,,I2, ~ N (0, 7?.) is assumed. r,,1 denotes an unobserved omitted variable in the model for f wh and is assumed to be correlated with f ert. 7,,2 appears in the reduced form for f ert. Equation (2.20) shows the reduced form for f ert. 40 E ( fert,,|-) = (w,T, + 6dsex,, + kid26,,71 + hinc,,72 + blue/9,73 (2.20) +age,,”y4 + d—se-Eptl + 73326,).2 + Wflg + 7",,2). To obtain the conditional mean function for f wh, the bivariate probit model is used and the result looks like equation (2.11). The conditional mean function for f wh along with the reduced form for f art is estimated by Bernoulli QMLE. For the purpose of comparison, a linear model is also estimated and reported. Let us explain how to estimate the linear model before looking at estimation results. In strictly exogenous case, the fixed effects estimator is used. In binary EEVs case, I use the two-step estimator. yin = wz‘t17r1 + Um 312752 = 1011222712 + ”62 > 0), where y,,1 and y,,2 denote the fractional dependent variable and a binary EEV, respectively. Because the endogeneity of 37,,2 implies v,,1 = pv,,2 + 6,,1. I define a new regressor, yit2¢Iw¢t27r2)/ —wit27T2) + (1 — yit2)EIvit2 I Uitz < —w,,27r2). If we assume the probit model for y,,2, we can prove E(v,,1 | w,,1) = yit2¢Iwit27I2)/(I)Iwit27r2) - (1 - yit2l¢Iwit27I2V I1 - ‘I’Iwz't27r2ll- Therefore, we 41 estimate the probit model for 37,,2 at the first step, and the fixed effects estimator is used after adding the generated regressor [Freedman and Sekhon (2008)]. As Wooldridge pointed out, the fixed effects instrumental variable (FEIV) estimator can be used under weaker assumptions in that the FEIV does not require yg follows a probit. Because the linear model is used to compare the coefficients of the linear model with the ATEs of the fractional model and the fractional model assumes the probit reduced form for yg, I use this Heckrnan approach. One can use the plug-in method, where the fitted values for y,,2 at the first step, (I) E (w,,27r‘2), is used an IV for y,,2. Because E(y,,2 | w,,2) = (w,,27r2) holds and we have a linear model, this plug-in method also produces consistent estimates4. Table (3.2) and Table (33) show the estimates of various models for strictly exogenous fertility case. Table (B.2) reports the results of the linear and the fractional model. All three methods show significantly negative effect of fertility on fractions of working hours. The coefficient of fertility in the linear model is -0.021 and the ATEs of fertility in the fractional model are -0.023 in both the FPE and MWNLS estimator. Compared to women without new born babies, mothers with the new born reduce their fractions of working hours by 2.3% points. Also table (32) contains two test statistics for time-varying heterogeneity. While the LM test statistics accept the null hypothesis of constant individual heterogeneity (H0 : r7, = 1), the variable addition test rejects the hypothesis. Table (B.3) contains the results of the fraction model with time-varying individual heterogeneity. The first column and the third column are the estimates of the constrained According to the simulation results in this chapter, both two-step estimator and the plug-in estimator show quite similar results in term of the root mean squared erros, compared to the average treatment effects in the population. 42 pooled F PE and the constrained MWNLS where the restriction of 7r = h(6) is imposed directly on the model. The second column contains the estimates of the CMD, where the set of parameters of interest 6 is recovered from 7r by the classical minimum distance estimator. The ATEs are -0.022 ~-0.023 and these number are quite similar with those of the linear and the time-constant heterogeneity model. Also the GMM test statistic cannot reject the null hypothesis of r7, = 1. Table (B4) and table (B.5) contain the results of the linear, the fractional model, and the fractional time-varying model for endogenous fertility case. According to table (B.4), the effect of fertility on the fi'action of working hours is -0.142 in the linear model, while the ATEs of fertility in the fractional model is -0. 129. Also table (B.5) shows quite similar results with the ATEs of the fractional model in table (B4). The ATEs in table (B5) are -0. 122 ~ -0.126 depending on estimation methods. In the same way, we can interpret these results as follows: mothers with new born babies reduce their fiactions of working hours by 12.9 ~ 12.2 % points compared to women without. According to table (B.1), the mean values of f wh are around 0.2. Therefore, the ATEs of fertility seem not only statistically but also economically significant and meaningfirl. The exogeneity of fertility is rejected in both the fractional model (the estimate of the correlation between TI and r2 is 0.214 and its standard error is 0.058) and the fractional time-varying model (the p-value of the Wald test for endogeneity is 0.0293), but the time-constant individual heterogeneity cannot be rejected. 2.5 Monte-Carlo simulations Simulations are conducted to find finite sample properties of a linear model, inconsistent two-step procedure, and the single-step estimator. For the purpose of comparison, the 43 ATEs of a binary EEV are computed and their mean squared errors and biases are reported. The number of replication is 500 and I generate eight samples with N=300, 500, 700, 1000 and T=5, 7. The equation (2.21) shows the data generating process for a binary EEV. xit = II'YastTt + 7.22271 + 73663 + ”it > Oil (221) c,,u,, ~ N(0,1), where 3,, and 2,, denote a binary EEV and its instrumental variable, respectively. c, indicates unobserved individual heterogeneity and is assumed to be correlated with 2,,. Also T, (time dummies) are added to the model to allow different intercepts for each time. 2., = 72cc.- + (p — 730..., we ~ N(0,1). (2.22) Equation (2.22) is the data generating process for 2,,. Also 2,, is set to be correlated with C,. p = (DITytTt + A/yxxit + Ci + vit): (2-23) ”it E Putt + eita ez’t N N(0,1), H 1 . yz‘t E E Z yithiyith ~ Bernoulldp), h=1 where 1),, denotes an unobserved omitted variable and is correlated with 2,, through u,,. The fractional dependent variable, y,,, is generated based on the Bernoulli distribution with probability of 77. As before, I use H = 1000 Bernoulli draws. The average treatment effect is approximated as N 1 ATEt,p0p z N Z [(FITytTt + 73,73: + Ci + vit) _ @IVytTt + Ci + ”20] - (2-24) i=1 The equation (2.24) shows the ATEs for time t in population. As mentioned above, the root mean squared errors (RMSEs) and biases are compared based on the ATEs of three estimation methods. 1. The fixed effects estimator of a linear model yit E TytTt + Vyxfliz‘t + 53/in + pvz'tz- (2.25) PIxit = 1|2,) E @IaxtTt + axzzit + 5225i)- (1) Estimate the second equation of (2.25) by the probit estimator and generate a new regressor, 6,,2 E x,,63,,/,, -— (1 — fL'it)$it/(1 — ,,), where 65;, = ¢I5¥xtTt + (3222:2711 + Emit) and ‘i’z‘t = ‘PIflxtTt + 63332,, + 3x223)- (2) Replace vitz with 6,,2 and estimate the model by the fixed effects estimator. (3) get ‘y‘yx. 2. The Two-step fractional probit estimator (FPE) E (yitlxr‘ta 27;) E ‘I’IvytTt + 7372:3327. + (lg/zit + P712112) (2-26) (1) Estimate the same equation of (2.26) by the probit estimator and get 0,,2. (2) Substitute 22,,2 with ’17,,2 and estimate the first equation by the fractional probit estimator based on the conditional mean function. (3) Compute the ATEs from (2.27). A 1 N . A e _ M ‘(I’I’lytTt + (lg/252' + BEEN]- 45 3. Single-step fractional bivariate probit estimator (FBPE) (D2[wit17r1, (2:13“ — ”102127er (23311 _ Up]. (2.28) ‘I’Kzaiz‘t — 1)wz't27T2] (1) Estimate the equation (2.28) by the pooled QMLE. (2) Calculate the ATEs from E(yit|$itv 21') = (2.29). 'iytTt + ‘nyivz't + 53/in ‘rytTt + 5yz5i N A 1 ATEt,FBPE = 7V- 2 ‘1’ - (/1+ 2)? i=1 1+2)? (2.29) Table (B.6) contains simulation results. The numbers reported are the RMSEs and biases of three estimation methods. As N and T increase, the RMSEs decrease. The RMSES of the FBPE gives the best fits among three methods. For examples, the RMSE of the FBPE is 0.0238 and the RMSEs of the linear and the FBE are 0.0295 and 0.0294, respectively when N = 1000 and T = 7. The FPE and FE estimators are inconsistent in this setting, but the FPE performs somewhat better. The nonlinearity in the population seems to lead to this result. 2.6 Conclusion In this chapter, I extend the first chapter to a binary endogenous explanatory variable (EEV) case. Because the EEV is discrete, the traditional two-stage procedure of Rivers and Vuong (1988) produces inconsistent estimates. Alternatively, the full-information maximum likelihood estimator (FIMLE) can be used. However, if distributional assumption is not true in population or there is misspecification in any equation of the system, the FIMLE also is an inconsistent estimator. I modify the bivariate probit model to derive a conditional mean fimction for a fractional dependent variable with a binary 46 endogenous variable. Based on this mean function, the pooled quasi-limited information maximum likelihood estimator is proposed. For identification, the conditional mean function along with a reduced form for a binary endogenous variable are estimated [Wooldridge (2007)]. This single-step estimator gives simple asymptotic variances and possibly enhances efficiency of estimates. The estimation methods are applied to the effect of fertility on women’s fraction of working hours. Based on the parental preference on the mixed siblings, the same sex indicator of the first two children is used as an instrumental variable to consider the endogeneity of fertility. The ATEs of fertility are 0023 under exogenous fertility assumption and -0.129 under endogenous fertility assumption. The exogeneity of fertility assumption is rejected, but time-constant individual heterogeneity cannot be rejected. The ATEs of fertility imply that mothers with a new born reduce their working hours by 12.9% points compared to women without new born babies. Also simulations are conducted and the ATEs of a binary endogenous regressor are compared in a linear model, the inconsistent two-step fractional probit estimator (FPE), and the fractional bivariate probit estimator (FBPE). The FBPE gives the best fit among the estimation models and methods. 47 Chapter 3 THE HURDLE MODEL FOR F RACTIONAL RESPONSE VARIABLES WITH BINARY EN DOGENOUS EXPLANATORY VARIABLES AND TIME-VARYING INDIVIDUAL HETEROGENEITY 3.1 Introduction This chapter studies the hurdle model (or, the generalized Tobit model) for fractional variables. Also time-varying individual heterogeneity and binary endogenous variables (EEVs) are considered as in the second chapter. The fractional variables can take 0 or 1 with positive probabilities. This stack of 0’s or 1’s occurs because fractional variables cannot be defined outside the unit interval or are comer solutions (optimal choices). Our two applications in the first and second chapter show two different examples. In the first chapter, nontrivial proportion of school district reports 100% of test pass rates because test pass rates cannot exceed 1. In the second chapter, some mothers report 0 working hours and this stack of 0’s seems to be the result of optimal choices. The fractional hurdle model allows us to separate the determination of comer solutions (y = 0 or y > 0) from the decision of the amount of y conditional on y > 0, where y denotes a fractional dependent variable. By using the fractional hurdle model, we can allow different sets of explanatory variables for the equation for comer solutions and the equation for positive fractional dependent variables. Also, we can obtain both E (y | :13) and E (y | ac, y > 0), and their corresponding average treatment effects (ATEs) can be calculated. The motivation of the second chapter is that the effect of fertility might be underestimated when the binary model for job market participation is used because the binary model ignores women who are working but reduce their working hours after having 48 new babies. The ATEs of fertility in E (y, | x, y > 0) shows how much the binary model possibly underestimate the effect of new born babies. After deriving a conditional mean function, the single step estimator of Wooldridge (2007) is used. To obtain a conditional mean function the bivariate probit model is modified as before. The estimation methods are applied to analyze the fertility effect on women’s labor supply by using the same data as in the second chapter. Two average treatment effects (ATEs) of the fertility are obtained and compared with those in chapter 2. The first ATEs are obtained using the whole sample and the second ATEs are calculated conditional on the fractions of working hours are greater than 0. 3.2 The basic fractional hurdle model I assume N randomly drawn cross-sectional units and T observations for each cross- sectional unit. Also N is assumed to be greater than T (large N fixed T asymptotics). The basic hurdle model for fractional response variables consists of two equations. The first equation is (3.1), which indicates the probability that the fractional dependent variable, yit, takes 0 or positive values. P(dit = llivz't,cz'1)= 20(3/z‘t > lez‘tacil) = @(xit51+ Ci1)- (3-1) Cillxi ~ N(1/11 + Iii/3’2 + ai1)avar(aillxi) = 021, where 2' indicate a cross sectional unit and t indicate time (i = 1,2, ..., N ,t = 1,2, ...T). dit equals 1 when yitl takes positive values. $it and Gil denote a set of explanatory variables and individual heterogeneity, respectively. The individual heterogeneity is assumed invariant over time. One can easily consider time-varying heterogeneity by using 49 interactive effects model. The distribution of c1 is restricted by using the Mundlak(1978) - Chamberlain(1980) device. From the properties of the normal distribution, P(dit = llfliz') = ‘1’ (2/11 + aSit/31 + fir/32 + a2:1) (32) $1 + 3321.51 + 532'52 2 ‘/1+aa1 E @(witfll)2wit5(lixit1ji)i (I) where 33,; E (5132-1, ..., xiT) is a set of explanatory variables in all times and i E N “1 Sill xit is a set of time averages of suit. Because only scaled parameters are identified, I drop subscript a from now on and define wit and 7r1 for notational simplicity. The second equation is (3.3), which shows the mean function conditional on y > 0. E(yitlxita 0224121: > 0) = “5132251 + fltCz'Ql (3-3) Cizlivz' ~ NW2 + 5352 + 012, 0,212), where ntcig is added to consider time varying individual heterogeneity. Applying the Mundlak (1978) - Chamberlain (1980) device leads the equation (3.4). E(yit|5’3ia yit > 0) = (I) (mt/12 + 1172151 + 77221—5152 + 77:02'2) (3-4) 77th +1“ 51 +0535 _ — Q) 2t 2 2 z 2 : ©(wit7r2t)' 1+ "t aa2 Therefore, the conditional mean function for y is 50 E(yit|$z') = P(yit = Olmz‘) x 0 +p(yz‘t > Olivi) X E(yit|$iayit > 0) (3-5) ¢(wit7rl)<1>(wz't7r2t) = (1)2(wz't7711 witmta 0) E m(wz‘ta 7ft), where (1)2 indicates the bivariate standard normal cumulative density function (cdf). The conditional mean function turns out to be the product of two standard normal cdfs and we can rewrite the mean function by using the bivariate standard normal cdf with the correlation of 0. I will use this property later to obtain the average treatment effects (ATEs) of a binary explanatory variable. The partial effects of the kth explanatory variable, :1: k, can be obtained either fi'om taking derivatives of the conditional mean fimction with respect to :1: k if x k is continuous, or from the differences of the mean firnction evaluated at two values of 53k if £12k is discrete. As before, these partial effects depend not only observables but unobserved c1 and c2. Therefore, it is a good idea to calculate the average partial effects by averaging partial effects across the distributions of cl and 02. APEt = E(g;,c1,02)[13k¢($t51 + Ci1)‘1’($t51 + Utcz'2) (3—6) +Bk‘b($t51 + Ci1)¢($t51 + ntcz‘2)] ATEt = E(x,01,c2)[<1>(a:§1)51 + Ci1)<1>($,(g1)51 + 7725622) -‘I’(fv,(;0)51 + Ci1)®($§0)51 + 7mm], where APEt and ATEt stands for the average partial effects and the average treatment effects of 33k at t, respectively and 5125:) and 239) denote sets of explanatory variables for two different values of witk- Exploiting the Mundlak (1978) - Chamberlain (1980) device 51 and assuming the independency of (ail: Gig) conditional on 33,-, we can rewrite (3.6) as E ME = E5,- ——’“—2 ¢(wt771)q)(wt772t) (3.7) 1+0a1 E + ’3 2 ¢ 1+77t0a2 ATE. = E— [E(w§1’vr1)<1>¢>,-(1—i) ~ ]1/2 Conway [FE-(1 — .-> v.2 = May W1 — Emil/2 C(fiiwiag W1 — Eva->11”, where m,- E m(w,-, 7r) and (I),- E (w,-7r1). ‘~’ indicates that all elements are evaluated at the preliminary estimates. After estimating (3.2) and (3.5) by the nonlinear least squares (NLS) estimator, one can calculate the correlations (76f, 772’ ) by using the standardized residuals from the NLS estimates. See Papke and Wooldridge (2008) and the first chapter for detailed explanation. With consistent estimates, the ATEs can be obtained from (3.10). N A 1 ATEt = N E [@(w§1)ir1)(w§1)ir2t) — c(w§0)fr1)(w§0)fr2,) . (3.10) i=1 3.4 Estimation methods with binary endogenous explanatory variables Now relax the strict exogeneity assumption. I consider a binary endogenous explanatory variable (EEV) case with appropriate instrument variables. Estimation methods with continuous EEVs can be found in the first chapter. In continuous EEVs case, the two-step estimator by Rivers and Voung (1988) can be used and this estimator produces consistent estimates. However, there is no convenient two-step estimators but inconsistent plug-in estimator with binary EEVs (Wooldridge, 2007). The equation for corner solutions is defined as follows. 53 dit = 1(alyz't2 + zit1fi1+ 611 + Um > 0)adit 5 (31m > 0)- (3-11) Cillzz' ~ N(l/Jl + 52732 + 021,021), where z, E l/T 2:1 Zita zit = (2,11,11,12). zit contains both explanatory variables (Zitl) and instrumental variables (Zit2)- yit2 is a binary EEV and Um denotes an unobserved omitted variable that is assumed to be correlated with yitg. The general index fimction, 1(-), is used, but I assume the probit model for d to exploit the properties of the standard normal cumulative density fimction (cdf). From the Mundlak (1978) - Chamberlain (1980) device, dit = 1(1/11 + 0113/2752 + 2123151 + 2732 + a11+ vitl > 0) (3-12) = 1(101 + alyit2 + Zinfil + izfiz + Tm > 0) 1(wit17r1'i' Titl > 0)awit1 5(1azit1a2i)- The following equation shows the mean function conditional on y > 0. E(yit1|yz't2, Zitla 07:2, 11212, yitl > 0) = (1901231212 + 221151 + 77tcz'2 + Uit2)(3-13) - 2 Ci2|~Ti ~ N(I/Jz + 22:52 + W2, 0,12), where vitg denotes an unobserved omitted variable and is assumed to be correlated with yit1° In the same way, we can rewrite the mean function as 54 E(yit1|yit2a Ziarit2w3/z't1 > 0) = ‘I’lflth + 023/212 + 221151 (3-14) +77t32'52 + Tm] ©(wz’tl7r2 + 732:2), 731:2 5 W112 + vit2- Now let’s define the reduced form for yg. Because the reduced form is also a nonlinear function, the Mundlak(l978)-Chamberlain(1980) device is applied. yit2 = 1(Zz't7 + 613 + “213 > 0)- (3.15) c- lz- N(Y/J ‘- - 2 23 2t V‘ 3+ZZ<+aZ3i0a3)a We can rewrite (3.15) as yitz = 1(¢3 + Zit'Y + 52C + 731:3 > 0) (3-16) 5 1(wz‘t27r 3 + T213 > 0), W3 5 023 + vit3- The endogeneity of yit2 implies that r213 is correlated with both Titl but 73,52. Therefore, linear projections of TM and Titg on 73,53 are assumed. 2 Titl = P173153 + eit1,V(eit1|Zit) = T1- (3-17) 2 ”Fitz = p2trit3 + Baal/(Brawn) = T2t- After assuming joint normality (rm, mtg, 727.3), I modify the multivariate probit model to obtain the conditional mean function for y,“ by taking expectation. Equation (3.12) can be rewritten as follows. 55 llyit2,ziirit1) = ©(wz't17T1 + Tm) (3-13) wz’t17T1 + P1T1t3 1 + 7% llyz'tzzzwriw) = <1) @(wz'tlfll + 1017313)- Because only scaled parameters are identified, I drop subscript for simplicity. From the results of the second chapter, Er3(ditlyit2,zi) = Er3[q)(wit17r 1 + P1Tz't3)] (3-19) wit27r3 ——yit2 / (w- 7r + - P 7‘ t )QW )dT 1 —wit27r3 - 31212 f + (1’0”th + Plr't )¢(7“3)d7"3 (I)(_wit27r3) 00 z 7' 3 ‘1’2lwz't17r1, (23/212 - 1)wz't27T3, (231m — 1)p1] ‘I’l(2yz‘t2 - 1)’wz't27r 2] m21it(wz'tla wit2, 7T) In the same way, we can rewrite the equation (3.14) as E(yit1|yit2aZitafit21yitl > 0) = 9(wz't17r1 + TM) (320) = witifli + P2trit3 2 1 + Tu Taking expectation with respect to 73 leads 56 Er3(yit1lyit2,ziayit1 > 0) = Er3[@(wit17T 2t + Pztmsfl (3-21) wit27r3 yit2 / @(w —— 't17r2t + 92th: )¢(T3)d7‘3 @(wz't27r3) z z 3 —OO 1 "wit27r3 _ yit2 + ‘1)(w't17r2t+ p2t"'t3)¢(7‘3)d7“3 (I’(“wit27r3) z 1 —OO ©2lwz't17r2t,(2yit2 - 1)wz't27r3, (2.71212 - Um] (91(23/2'232 '— 1)wit27T2] m22it(wit1a 10212, 7f)- To obtain the conditional mean function for yl, E(yit1|yit2,zi) = I I I 10017;: = llyit2,zi) X E (yitlyz‘t2,zivyitl > 0) (3-22) ®2[wit1“11(2yit2 - 1)wit27r3i (231212 - 1)!)1] ‘I’[(29it2 — 1)wz‘t27r2] X ©2Iwz’t17r2ta (23/212 - 1)wz't2773a (23/212 — 1)P2t] (Plat/212 - 1)wit27r2] m21it(wit1’ 10212, 7r) X m22it(wit1, wit2a 7r) m2it(witlv wit2a 7T), where 7r E (alt, ”2t? 7r3)’. After estimating the model, we can test the endogeneity of y2 based on p1 and pgt. To identify all parameters and possibly enhance the efficiency of estimates, the equation (3.22) is estimated along with (3.19) and (3.16) by the pooled QMLE. 57 111321 = dit 111(m21it) + (1 - dit) 1n(1 — 7712121) (3.23) +yit1 111(m2z't) + (1 — yit1)1n(1 ‘ "1221) Han 1nl‘1’2(w7;127r3)] + (1 - 21212) 11111 - (I)(wit27r3)]- Now let us derive the ATEs of the binary EEV, 312. Two assumptions hold: yg is independent of (c1, 211,02, v2) conditional on (2,73) and (2., T3) are redundant in the structural mean function. From Wooldridge (2002 section 2.2.5), the ATEs can be obtained from differences in the following function evaluated at yitg = 0 and yitg = 1. Dropping the subscript z', E(cl,v1,c2,v2)[q)(a1yt2 + Ztlfil + 01 + 'Utl) (3-24) x 130123112 + 21151 + 77102 + ”Ut2)] = E(z,r1,r2)[q)(wt17r1 + Tt1)(p(wt17r2t + 712)] = E(z,r3) [©(wt1711 + P1Tt3)‘1’(wu7rzt + 9215713)] By exploiting the properties of normal cdfs after transforming the errors, we can rewrite (3.24) as follows and consistent estimator for the ATEs is N (1)- w(1) . . 1 111 7r 7r AT Et _N 21% t1 1 wtl 2t P1P2t (3.25) (0) 4921(me 7r12 wil 7r2t P1P2t where wall) and w(t1)den°te sets of explanatory variables with yztg = 1 and yth— — 0, respectively. 58 Testing endogeneity of y2 (H0 : p1 = pgt = 0) and time-constant individual heterogeneity (H0 : 77t = 1) are conducted in the same way in the chapter 1 and chapter 2. 3.5 Application: fertility effect on working hours of women I apply the estimation methods to study the effect of fertility on working hours of women with two or more children using the same data as in the second chapter. The main contribution of this chapter is to separate the determination of labor market participation from the determination of fractions of working hours of women. According to the results from the second chapter, the effect of a new born on mother’s working hours is significantly negative after controlling the endogeneity of fertility. This implies that women outside job market would not take part in the market after delivering babies. Therefore, focusing on women who remain in the market produces more meaningful implication. The estimation model for comer solutions is E(‘d,-, = 1|.) = chm/Jun + 61 f «271,-, + kid26,t,32 + hing-1553 (3.26) +black,54 + amt/35 + Mia,- + REE/37 + Wm; + rm), where f wh is the fraction of working hours to total hours and f ert is a fertility dummy, which is 1 if a woman has the new born at t+l. Two time-varying explanatory variables are added: kid26 is 1 if a woman has children aged between 2 and 6 and hinc is logarithm of husband’s income. dsea: is 1 if a woman has the same sex children and is used as an instrument variable (IV) for the fertility. This IV is based on parental preference on mixed siblings. Also black and age are controlled. However, because they are likely to be 59 independent on individual heterogeneity and we may not identify the coefficients, their time averages do not appear in the equation. Tt indicates a set of all time dummies to allow different intercepts. The mean function conditional on positive fractional variable is E(fwhit|fwhit > 0, ) = @(l/JZtTt + dlfertit + kid26it62 + hincz-t63 (3.27) +blackz-64 + ageit65 + ESE—£13,156 + E32621}; + 523752-68 + mtg). In this chapter, I use the same explanatory variables for both the determination equation for a corner solution and the mean fitnction conditional on a positive dependent variable. However, it is straightforward and sometimes desirable to allow different explanatory variables for each equation. In our application, some regressors might affect the decision of whether mothers participate the labor market, but might not affect the amount of hours worked. The probit model is assumed for fertility. E ( f ertitl) = (w3tTt + Vldsexit + kid26it'yg + hincing (3.28) +blackn4 + 0962175 + 33—65176 + miw + W178 + T113)- Table (C.1) contains the estimates of the fractional hurdle model under strictly exogeneity assumption. The first 3 rows reports the estimates of equation (3.26) and the next 3 rows are estimates of equation (3.27). The reported ATEs are fi'om equation (3. 10). I report two ATEs: the ATEs conditional on f wh > 0 and the ATEs. The ATEs conditional on f wh > 0 are obtained based on estimates of equation (3.27). The ATEs of fertility 60' is -0.022 in both the QMLE and the MWNLS estimator and the ATEs conditional on f wh > 0 are -0.014~-0.013. The ATEs are quite similar with those of the simple fraction model, but the ATEs conditional on f wh > 0 are about half of the ATEs. This difference occurs because some women drop out of the labor market after delivering babies and the ATEs conditional on f wh > 0 do not count of them. Both the LM test and the variable addition test accept the hypothesis of constant individual heterogeneity (H0 : 771: = 1). Table (C.2) reports the estimates of the hurdle fractional model with time-varying individual heterogeneity. As we can see, the GM test statistic also accepts the hypothesis. Therefore, it is not surprising that the ATEs are quite similar with those of the fractional model with constant heterogeneity. The next two tables (table (C3) and table (C.4)) contain the results under endogenous fertility assumption. The same sex indicator is used as an instrumental variable. In table (C3), the first 4 rows report estimates of equation (3.26) and the next 4 rows contain estimates of equation (3.27). In table (C3) and table (C4), the ATEs are obtained by using formula (3.25) and the ATEs conditional on f wh > 0 are obtained by using the formula in the second chapter based on estimates of equation (3.27). The ATEs are -0.129 and quite similar with those of the simple fractional model. Time-varying individual heterogeneity hypothesis is rejected and the endogeneity of fertility is accepted marginally. In table (C.4), we can interpret the results similarly in table (C3). The ATEs are -0.124 ~-0.117 depending on the estimation methods. the ATEs conditional on f wh > 0 are -0.07 ~-0.09. Even though the point estimates of 77,; are 1.131 ~0.037, their standard errors are too big. The time-varying heterogeneity is rejected by all test statistics. This might imply that we cannot estimate the models precisely because of their nonlinearity and the number of 61 parameters estimated. Therefore, let’s focus on the results of the model with time-constant individual heterogeneity. As a result, fertility is marginally endogenous in the model and individual heterogeneity is time-constant. The ATEs of fertility are significantly negative, but the values are quite similar in both the fractional model and the hurdle fractional model. the ATEs conditional on f wh > 0 imply that the women who are still working after delivering babies reduce their fraction of working hours by 9 % points. This fertility effect on working women is usually ignored in the binary model, because their binary indicators for labor market participation after having babies do not change. 3.6 Conclusion In this chapter, I consider the hurdle model for fractional dependent variables. Binary EEVs and time-varying individual heterogeneity are considered as before. With the hurdle model, we can separate the determination of comer solutions from the decision of the amount of dependent variables. In the application, some women report 0 working hour and these might be their optimal choices. By using the fractional hurdle model, we can allow different sets of explanatory variables for each equation and calculate the average treatment effects (ATEs) conditional on the positive dependent variable. These ATEs are ignored in the binary model. To derive a conditional mean fimction, the bivariate probit model is modified. Based on the derived conditional mean function, the pooled QMLE is used and the reduced form for the binary endogenous variable and the mean fimction for comer solutions are estimated together for identification. I propose how to compute the average treatment effects (ATEs) of binary endogenous variables. The estimation methods are applied to analyze the effect of fertility on women’s fraction of worked hours. The ATEs are -0.129 and the ATEs 62 conditional on f wh > 0 are -0.09 under endogenous fertility assumption. The endogeneity of fertility is accepted marginally, but the time-varying heterogeneity is rejected by all three test statistics. 63 Appendix A Tables for Chapter 1 64 .Szow 32 E :9& 8a $56:on 53% m $5395 98 9:828 23 mo @92mb 3: E aaamamae 5qu wvmmvfi wvovmfi 323 835 mama: aim: mmwofi 682 a: EH mm: H: a: m5 «3 E2 @8de wmcswvu made: www.mmmw madamomw ~35th $0.ng 5% .Em 65 ”wagon 89.2.8 H238 232m 832m 83on E88 ca: 38588 83 News meme 23 $3. $3 433 as: SS 23 83 23 Bed 83 33 52 ES 33 83 a; $3 83 $3 5% .Em wand 32 mag £2. £3 two 82 :82 ES: 833.: 232.: Swan: 8.3%: 82%: owmdmm: 08.5.: 52 «3.3% madam $38 $334 $.33. 25.23. 358% a: 83% Swag Spam Macao 83mm 228 48.4% 5% .Em 253$ www.mcmo $230 2988 $2.38 83me @333 $32 “$838 83 83 83 83 0:3 83 0de cs: Sad .83 $2 82 83 $3 23 £2 .83 mm; mm; a; $3 $3 a; 5% .3m 82 ES $3 £2 83 mag 23 Ea: 81:8 id. 88 88 82 82 32 .82 82 48> motmufim oZEComoQ :4 via. .83QTQ mqmvaoammtoo 2: 28 £8330an E 8838:: 2: 93 H H “E ” om ”28$ 3 wow: 88 88 55:68 @3828 98 moswsfim 35 A3 .8282an con 8on £88 @8988 wmmaafimaooa Sm mmE< 2: no“ 88.8 @8938 931 CL .888 Edvagm “250.“ Se 28:38an E 8828:: 23. A3 .mofiatg muoudcflaxm 23 mo 888.8 was ma... 88838 ma Z >22 23 98 man on... was SON nwsofiu 83 you 835:6 88:. 53:00 £8on :< “3.13838on8 ..88ES8 8.838 88— 58530: BEE?» 3382238 one 98 £38838 £303 3:030de 9: £88838 38mm 686 .2: 8% “53m am Z 32 was .mmm ,mm AS ”8qu 2.8.8 82: as 8:68 8595/ $8.3 82: 28.8 So: as 388$ 33 333 $88 33.3 $88 33.3 53 god 5:. $3. mood £2852 $8.3 some $8.3 $8.3 8.3.3 Rod- mad- :3. 82- £3. 8:3 58 some :03 some :38 83 38 $2. $8 22 E83832 mE< mmoO ma< “woo “woo v.38 8.8 H88 ”gcwvcoama 822:2 mam mm 882 $83 82535 as 838 .885 382 8%:on msccowoxm “~26on Ecouofim 05 can 885 22. ”N4 2an 66 .8388 858008800 2: 8.8 £83888 8 880,858 2: 88 H H or ” om 08o. 0o. 08: 2 0503.08 880qu 858822 AB 8:038:88 com 80b 808.8 @8888 800803002 80 mmm< 85 80m 8088 080880 23. A3 .8088 080830 $80.08 88 £85880 8 888:: 23. Q; .8303? 8088.3on 85 £0 88808 083 88. 0:0 Sow £3808: 83 80“ 038830 88:. 80800 8808 Ex Amv .0 885 00% 3888088: 859308 30 8205800 on... mm or Amv 58208088 £08858 88:3 082 088808 onwwoB 3883088 8880800 08. 08.0 £08838 0508 8803008. 8880800 2: £08858 808306 888888 0683 may 80“ 088.0 QmZ>>EO 080 £990 .920 A; ”8002 Aooooo ooooo 8%...20 82:82 383 0.3.3 383 33.3 08.3 8.23 mooo oooo mooo goo woo oomo 002582 aoooo 08.3 05.3 ASN.3 Goooo 08.3 oooo- oooo- oooo- mono- oooo- mono- SE: 0.83 03.3 0.8.3 8&3 0.8.3 383 oooo 2o; mono goo Emo ovoo 882382 08.3 383 0:3 on: a: £8 E 32.3 02.3 32.3 88 no? 38 o: 32.3 22.3 0:3 $8 No? no? 2. 0.8.3 68.3 62.3 Rog ooo£ m8 3. Aooooo 38.3 0.8.3 goo goo oooo o: 05.3 0.8.3 :83 oooo oooo go No mE< “800 mag. .800 mE< .800 38 8.0m 8H. ”pamwammom 82.820 88 020 882 $688088: $838883 £05 #0008 8803003 8002 8:8on msoaowoxm ”6002 warfigésfi. 1803005 05. ”m4 03$. 67 4830?: w:%:o:8boo 0:: 0:0 £09388: 5 803:5: 05 0:0 H H “E ” om H8: 8 :00: 0:0 :8: :owtwwa 0302.0.» 0:0 830330 33 3V .0:0300=:0: con 80c 8080 0:00:30 8:988:00: 0:0 0:88:08: 5 803:5: 09H Q; .88 £353» 82 :8 838:6 085 0:0 03.0?0». 03: :00Ba0n 0:80: :OSQEEE 0:0 30% :osavgam mo wofi 05 0:0 8Ew€0> 0:083:05 0:9~ A3 .58 swsofip 33 gm 852:3: 0:5 0:0 030E0> 0:: 80:58 0:20: :0300895 0:0 .32 :m 0:00:00 :0: w:€:0:m mo mod 535032 0:0 5:3 mo 88:08 0:5 0:: “Bow smsofiu 83 :8 8:883: 0:5 £038 20on =< Amv 0303:0080: d306E8 0093005 8:888 :0308:£E v0fiE= 60:: 0:: ~50 £30838 :52: 1:20.08“ 05 #30858 030E0> 3308385 800“? 85m 0:: :8 0:30 @4239 U50 .mErm SEE 3v ”8qu :38 3.2 8:68 0525). 28.8 8.8 $8.3 $0.: mesmsfimzq some 938 :38 $5. $3- 3.0- E 38.8 3:3 $3.8 ASN.8 83.3 mm; 05.0 33 08¢ 83 $2532 35.3 5:: :58 88.3 35.3 :3- and- 83. £2- $3. $5: $83 32.3 :38 638 some $2 $3 £3 35 mod 8883302 mm< .«000 mm< .0000 m0oO 0:0: 80: :88 5:008:09 0:250 80 @0335 28 09:02 10:28.0:m 80:5 #0022 wEwEmw 80:08:25 :0on 3:088m 05 0:0 80:5 0:2. 64 050B 68 83978 wamvaommwboo 2: 98 mwmmfiamudm E 838:: 05 ES 8 H “S H E: wamvcoam 80:8 ofi .8 3688ch $3 Rom mm 53 Bag A8 .8381” wfiwcommohoo 23 88 $838888 5 838:: 2: 98 H H “E H om $8 3 com: E osmsfim 853me 858832 A3 8:038:88 con 89a £88 @8983 839883003 mud $8388an 5 238:: BE. A3 .88 338:: no.3 H8 38826 was was mfidflg mEp 8953 £53 £283an 98 6mg E 883m 8a maximum m0 m2 .988va was :83 mo mewfigw was 2: .Som amsefi 83 8% 88:88 was 5588 £388 :< Amv .88“ @8388 2: Bob EH3 8:8 8 823mm“ 85 mo 820588 2: m“ “3 98 .8 was 8% 3888882 8533?: Mo 888508 2: mm S A8 38388me $08858 £08 1828me wwfimbmcoo ofi 98 88838 853m? 82888 Emma? 23 8% 98pm mmmo «:8 Q20 AS “882 38.8 $3: 888 ”3.: 883% 26.3 888 $23 082% 8:832 :38 $2- 83.8 33. 5 38.8 83- $38 83- mg $28 $5.7 $3.: 33. mg as: 83. 38.8 83. § 88.: Sam- 33.8 33. m3 888 fig- :38 33. mg 388 33. 83.8 $3. 5 68.8 8.3 $38 $3 $38 83 a8 3.2 82:83 38.8 83. :38 $2- $88 ~88- 88.8 «2.0- SS: 328 83 8:8 83 63.8 Boo 38.8 Eqm 33238? $28 $2 388 E3 E $.38 83 :28 £3 E $38 ES 838 83 8 85.8 E: 82.8 at: 3. 2.3.8 wad 68.8 £2 E :88 $3 688 awe «2 Em. 3m mm< tm 3m .mmoO Em Em mm< Em 3m 800 $8“ $88 ummH. ”€898an mam mwmpmuoBB Q20 083.039. @0532 888:5 33-02% .mficcommmmsonomowam :0on wEbQToEw—l Raouofim BE. ”m4 2an 69 63878 wcwvnoamotou 2: 0.8 $838988 8 #58:: 2t 88 8 H ”S ” om: @58QO 30:8 2: mo Emoqmwovcm “mop 8“ mm $3 3.35 A8 .8381“ wnmwaoamotoo 2: 08 $828088 8 8on8=8 2: 88 H H E u om $3 op 38 mw oBmSSm mocfimmv 888822 AB 88038288 com 8on 835 88¢:QO Umam‘maumuoon o8 mmmmfiqgam 8 838:: 8H. 3L .Som 8958: cam: 8* 888886 m8£ 88 @388.» mi: 88.3qu m883 828888 98 swam: 8 883m 58 wfiwammm mo m2 .3083?“ 88 :83 mo 8388 m83 2: 48m nwsofiu 83 8“ 888886 083 88800 2258 :< Amv .888 @85va one 8on 8.8”— 88m .8 #8862 8: mo 820288 2.3 mm “3 «:8 a m8$ .8... 3658838 #86368 mo 820580 Qt E “5 A3 5950888 #08838 @0953: 888888 80588888 @938: 688 388888 2: 98 88838 883me 888888 0633808858 80056:: 888888 8288888 @088: 2: 8m 98% MAESOO 98 QEOAEZE A3 “8qu 88.8 $8: 88.8 808 853% 285 88.8 55% 893% 8:852 838 $8. 388 88- § 838 $3- $88 8:- as 3:8 22. 88.8 .83- mg 98 mama- £38 83- § 32.8 :3- 388 Rod- mg 88.8 .038 $8.: .83- mg 33.8 88- 338 8%. § 88.8 mm; 88.8 $3 888 mg 838 $3 88532 :88 8:8 838 88.8 88.8 83. $88 ”3.8 SS: 808 End 838 08% 338 End 888 8% 383382 82.8 88 CS8 32 E 82.8 mm: 8.8 a: 8 A88 $2 $28 88 m: :88 $3 838 8: 8 £88 mad 3.8.8 888 m: :88 88. :88 83 S :m Em mm< Em— Em mooO Em Em mE< Em Em .mooO mead“ mama pmmrh “828030 8880 820825 8:82 uni—2 8238885 3:85 @888on msoaowowqm ”3802 wEbw>b8F antoahm BEL 64 £an 70 0803002808 com 80¢ 3080 0.80830 008833300: 08 060580808 8 800888 088 Amy ..G «J 0E3 338 033 80a 008330 08 3303 0800 000:9 A3 ”0302 :88 $3 :38 32 388 .32 A888 083 :28 End :88 83 83 Ag 8 038 A838 83 A88 8:3 :38 $3 :88 $8 88.8 33 £8 Ammo 8 m8; 888 33 :88 $30 :38 $3 ASN.8 83 A808 23 £8 ANS. 8 £3 :38 83 :88 and :88 33 :38 SS A308 53 8mm :81§.o A088 83 88.81088 :88 $3 $38 33 A208 53 :3 Somfimmxx mmafimm’w SomeHm< magnum—mad: Sommnfiafl maofimmx‘. .808 000 352880 020 320 32qu 98:80am 0:080w008m :88 3.3 $38 83 :38 82 :88 8g A398 :3 88.8 mmvd £3 :38 as; :88 no: :88 £2 :88 83 :88 :88 :88 32 £3 A58 23 :88 33 83.8 8% $28 32 :88 $3 A308 3.0 £8 :88 880 A88 088 A28 88 :58 82 A88 :3 ASN.8 03.0 53 A088 82 A028 33 A88 83 :28 $3 :88 83 A888 30.0 as Hoommmxx mmmflmmafl Hoommmafl mamHm—njw Hoommmxx mmmfim—mx‘ 0~Ec0080nm 0%.: 852800 020 080 3800008 030-038V 9580mm 0:080w008m :88 ES A088 ammo A038 3.3 :88 ammo A858 ammo Ammo. 8 mag 53 :88 .32. 68.8 mg :5 8 83 A80 8 $2 A8. 8 memo Am So. 8 and £2 :58 2:8 A898 $2 A88 8 33 :8 8 SS Ago 8 Ed 88.8 32 £8 :88 83 :38 82 6.88 83 A B 8 $20 Ago 8 $2. :88 wmmd 8mm A888 82 A208 33 A85. 8 22 Ana. 8 83 :8. 8 mag :00. 8 $2 :3 259.710: magma/x Hoommnfifl mmmfimm< Hoommm< mmmfimmxx 023:0ou0nw 8232 859880 020 080 m8080Qm 0:080w0xm A0008 9830870830 A88 25 32 8 £83 865% 32005 a £38m 28m 2: S< 038 A858 008883 as 9:80 71 .AOV 08000800000 3002308 000 AHV 030E0> 88000808 00; 0002503 00300800 00500808 0:: m: 080 Amy 00808300 ma Z >22 008080000 00: 000 mam 008050000 05 00008300 0000003 8088038 300000030 05 00008300 0000000 00002000 000380? 0000003038 000 00008300 3308 30003000“ 000 How 000% mu Z .320 .mmmO H220 Jm Z >32 ,m—na 030000800000 3002308 m8§0>083 £00 30008 0003008 00”. 000 Awoomv 03003003 000 0x83 00m 000% >8 000 gm AS ”00002 mmmod- mvmod 58d- momod Elod- uofiod Hoood- ooHod vmood- woood mAZBEOQ/B ommod- mvmod wwHodu nomod HEod- ooHod Hoood- mmaod omood- ozod mmmOd/B ommod- ovmod wwHod- womod HEod- oSod ooood- wmfiod mmood- wSod QEUQKH mmmod- womod «wad- Emod oEod- mvmod Hoood- oHNod umood- momod quggugm good. good mwfiod- onmod omfiod- mvmod Hoood- Smod Nmood- momod @905?“ ommod- nomod mwaod- onmod mmfiod- mvmod ooood- wfimod mmood- momod 00003 SHE mmmod- ommod Emod- owmod ommod- wmmod moHod- HSod good- :Sd qugzouxrb ammod- Homod mumod- owmod onod- ovmod 33d- moHod ooood- hiod mmmOIAKH Hmmod- mmmod «god- owmod ofimod- ovmod 38d- mofiod moood- o3od OZONPH ofimod- wvmod oomod- nomod mfimod- vomod mofiod- oamod ooood- wSod m42>>2fl>>n~ flood- wvmod oomod- nomod wfimod- vomod 38d- oflmod ooood- wSod mmmlam Emod- wvmod womod- homod Smod- momod HoHod- onod woood- oSod 00003 5H8 vowod- mvmod wwfiod- ooHod modd- mmfiod vwood- moHod vmood- Hoood m42>>2q>8 mmmod- wvmod nwfiod- womod umfiod- onod wwood- mmHod omood- voHod mmmO->B mmmod- nvmod owfiod- womod umHod- oofiod mwood- HmHod mmood- moHod Q20->,H Homod- ommod owfiod- onod nmfiod- oSod vwood- wmfiod omood- NSod mAZ>>2$>m ommod- mmmod owfiod- onod umfiod- mfiod «wood- wmfiod mmood- mzod mam$>nm 083- 80:. $03- 085 88.0- .80; 008.8 8030 $8.? 88. 08: mne 000 0m: 8:0 mm: 020 08 000 0.02 900 mm: 0802:0022 3 we no so do 80 A md I Hd H 30‘ 6 w0mm0000£ V mm: 08 m0 335 000 808m 000000m 000E d4 030B .on mumwcowosumn 126365 «:8 A3 @3828, 3398388 2: 588503 coma-£8.80 dosflsaom 23 mm 8i Amv .HoadESmo ma Z >32 8385300 2: «as mam wwfimsmaoo one .uopaasmm 8:3me EEEEE Rommmflo 2: £38838 wok-mag Swanson 83863 3383888 2: 838888 8888 888888 a: 88 8838 $2320 .8980 .920 .8823: .8: 88888838 882285 8885.888 883 8888 8888.8 2: 88 8883 882283 888 93.8 .88 88% >8- 88 38 3 H8882 7 88.? 888 888- 8888 88.? 888 888.? 888 82320.5- 888- 888 888- 888 88.? 888 88.? 888 maze->8- 88? 888 $8.? 888 88.? 888 88.? 888 QED->8. 88.? 888 28.? 888 888- 888 88.? 888 3232-38 88.? 888 288- 888 88.? 888 88.? 888 mam-8E 888- 888 888- :88 88.? 888 888- 888 58: Sue 88.? 888 8888- 888 88.? 888 88.? 8888 22320.5- 88? 888 888- 888 88.? :88 88.? 888 ammo->8 888.. 888 $88- 888 88.? £88 88.? 888 ago->8- 88? 888 888- 888 88.? 888 888- 888 $232.88 888- 888 888- 888 88.? 888 88.? 888 mam-25 888- R88 888- 8888 88.? 8888 88.? 888 $85 We 88.? 888 888- 8888 88.? 888 88.? 888 22320.5. 888.? 888 888. 8888 88.? 888 88.? 888 EEO->8- 88? 888 888.? 888 88.? 888 E88- 888 ago->8. 888- 888 888- 8888 88.? 888 :88- 888 3232-38 88.? 888 8888- 8888 88.? 888 :88- 888 mam-38 88.? 888 88.? 888 88.? 888 88.? 888 88: mug- Em mm: Em mm: 85 882 mam mm: 88882882 88 88 88 88 88 8.0 I ed H 8Q. a: @5882: v mm? of mo momma can muobm wohasvm :82 ”m.< Ban-H A3 588888: 8:83:88 88 A3 838.5» bog-8.398 2: 88382 805-3888 cos-£8809 8: E can A8 838838 maze/2 88.8888 2? 88 mam 888888 on» £88858 88888 858888 18683 85 £38838 88:3 H8880: 83883 8888388 23 £38838 8308 880388 2: So @888 mAZBEO .mE-mO .Q20 :5 Z >22 .mmm «388888: 182368 9888.883 8:? #808 18288: 2: 88 $83 828200?» 88 818A So 88% >8 88 gm CV 88qu 23d- mhmdd modd- wmmdd mEdd- ode odddd- ded dedd- moddd mAZBEO->B ommdd- whmdd wded- mmmdd oEdd- dtdd doddd- Nded wvddd- Eddd ammo->9 mwmdd- Rodd Hmmdd- dumdd midd- mdmdd mzdd- mded moddd- muddd QED->8 madd- mumdd Rad- ovmdd wmfidd- dedd odddd- oded wvddd- ded mAZ-sz-z/m mfimdd- :mdd ozdd- ovmdd umfidd- dfimdd odddd- oded dedd- whfidd mam-gm Sadd- mmmdd ded- ovmdd wmfidd- wfimdd vmdddu mafidd good- 35d H884 SHE Sodd- dedd nomdd- obmdd vdmdd- ofimdd vied- Vofidd dwddd- ddddd mAZBEO->B Sodd- mom-dd domdd- wnmdd odmdd- dflmdd Sudd- ondd dbddd- Nded mambo->5 ovmdd- mom-dd vwmdd- vomod dfimdd- vmmdd diod- mofidd muddd- 83d QED->8 ddmdd- dmmdd Homdd- owmdd dadd- mvmdd midd- wmfidd dedd- doHdd quazlgm M ddmdd- dmmdd Homdd- owmdd dadd- mvmdd mEdd- wdfidd $ddd- doHdd mam-3m ddmdd- wandd Homdd- owmdd wadd- dvmdd 33d- odfidd dwddd- uoadd 8884 ENE oomdd- oomdd wdmdd- :mdd nodd- ooHdd dedd- ode godd- ooddd maZBEO->B homdd- womdd wdmdd- dmmdd ooHdd- «Edd Sadd- omfidd good- ooddd ammo->9 onmdd- dwmdd wfimdd- Hmmdd doHdd- ozdd 53d- oded mvddd- owddd QED->9 wvmdd- dumdd momdd- dmmdd «odd- nwfidd dded- oEdd godd- Efidd quzfizugm wvmdd- dnmdd mdmdd- dmmdd «odd- nwfidd dded- oEdd mvood- 38d mam-3m 888.? 888 888- 888 88.? 888 888.? 8888 88.? 8:88 88: 8H8. 88 82 8m mm: 88 mm: 85 mm: 88 mm: 85838882 88 88 88 88 H8 88. A od I Hd H 8i .: mammweoon— v $58N 2: mo 88mm 98 888m 888m :82 A: .< 038% .A0v 38880888 888888 888 A.3 038t8> 280888808 888 88388 88888800 8888808 2: mm 09.8 Amv 808858 oak/:52 88888800 888 A088 mam 88888800 888 F808838 8088820 888888 80683 85 808838 8888 888808 88883 88883288 85 808838 8808 8803088 888 How 8888 ma Z 320 ,mmmO .QEO Jm Z 32 dam 288880888 888888 M8888>88E 883 8808 8805088 888 A088 Aweemv 83882003 888 888% Row 8888 >8 888 3% AS ”882 uwwed- eHved eHwed- owwed mwmed- Howed momed- vaed m42>>2q>re vaed- mmved :wed- eewed eemed- oowed eomed- eHwed mambo->5 eHved- wwved oewed- woved vowed- Sved omwed- ohwed QED->3 Hmwed- wowed wemed- wvwed wpmed- Hmwed vvmed- hemed mAZBE-Bm Hmwed- wowed wemed- wvwed mnmed- Hmwed vvmed- hemed mam-3m hmwed- enwed wewed- evwed oumed- omwed bvmed- eemed 8085 eHHH noved- weoed mwved- moved wewed- vmved mowed- whwed mAZEVEO-xxh mnved- :oed wwved- muved eeved- ewved wowed- vowed mambo->8 emoed- vooed meved- vmoed wvved- wpved eewed- omved QED->8 eoved- woved eHved- ewved wwwed- oeved wvwed- eowed mAZBE-gm eoved- woved eaved- ewved wwwed- oeved wvwed- eowed mam-3m eoved- eoved waved- ewved mwwed- oeved mvwed- wowed 8885 5H8 oHved- wvved whwed- Heved ewwed- oowed wemed- :wed mAZBEO-PH Hmved- moved vwwed- neved ovwed- Howed mewed- oned ammo->9 moved- meved Hmved- ovved obwed- mowed wmwed- ovwed QED->8 oewed- eaved vowed- enwed wmwed- ovwed eemed- eewed mAZBE-Bm oewed- eHved wowed- whwed wmwed- ovwed eemed- wewed mam-3m 88. 83.0 88.0- 88 88. 8258 $820- 88.0 888 mue 88 82 8 82 Em 82 88 82 80822852 8 8 S 8 28 Add I od H 8d 8 w8888fl v 5% 88 o0 885 88 88.5 88888m 882 ; fi< 0388- #35838 .0442; 3555.88 on... can mam $055528 2: 383838 85.5va Eda—.EE Eofimflu ofi £03858 8.853 82:18: EEmEB Enigma—:8 2: £35838 ”.503 130335 23 new 38.6 mAZEVEU .mmmO .QEO .AmZB—Z .mmm 56538qu 1323?: waEHg-oaz fits $608 1305me 9: 98 Awoomv $2530? 98 83mm .5“ 953.0. >.H «:8 3m CV 60qu woeed- ezod eSod- waged «Sod- domed Sood- eoHod mmHed- Hzed Hoaod- vowed mAZBEU-E eoood- waged wzed- «Edd Sadd- eHNod Heood- defied need. owHed HeHod- mowed mmmO->.H «Sod- mmfiod :Hod- 38d mwfiod- ommed ooood- wmfied vmfiod- owHed eoood- mmmed QED->8 meood- defied eZed- vowed Shed- wvmed Seed- eHmed defied. mowed uveHed- Seed mQZBE-Bm neoed- defied wSod- vowed «Sod- wvmed Seed. ofimod mmHed- mmmod moHod- Seed mam-3m veood- mefiod wSed- mowed mtod- named oeood- wamod mmfiod- mmmed eeeod- Emod 30:5 oHHH. 38.? vaod mmfiod- nEod Sfiod- eEed moHed- :Hed mmfied- mmfied floHod- defied mAZBEO->H 3.8.? meod Sled- mnaed eSed- Shed fiefiod- defied mmfiod- defied ~o~o.e- Gmod mmmU->.H oEod- moHod Sadd- wnfiod oZed- ”Sod Head- meHed wedd- owfied oeood- ovwed QED->8 mzod- onod emHed- eofiod Saod- mwfiod meHod- oamod omHod- mowed voHod- anod mAZBE-am «Sod. oned om~od- oowed oSod- edged Hedd- eHmed nmfied- mowed me~od- wwHod mam-3m HEod- defied meod- defied oZod- defied HeHod- damed mmfiod- mowed eeeod- oeHod 82:: NHB mofiod- mzod Hofiod- omfiod Sood- emfiod vwood- defied wwood- 33d meood defied m42320->.H Hoaod- Awmaod ooHod- mmHod oveod- defied vwood- mmHod wooed- defied Heoed boned mmmO->.H HoHod- mmfiod eeood- vowed good- defied eweod- HmHod eweod- aged ooood uomod QED->8 eoaod- died Hofiod- nEod wwood- waged vwood- wmfied veeed- defied Hoood- gadd mAZBE-Bm ooHod- died eoHod- uvfiod hvood- omHod «wood- wmfied vwood- oEod ooood emfiod manna-gm eeood- mgod neood- vied «wood- emfiod mwood- oEed mwood- aged wooed mmHod “mom: nHH eon eon oefi oom oom ooH Z mmmm Wm—Z mam WmE 92m mwz mflm mmz 3mm Mmz mmmm m—mE 603831 66on : wzmmamuoofl : mafia-83 And N 8.3 mm: 05 mo 835 ES muobm Ransom 532 ”Nd .< £de :038300 m1— Z 32 00:35.38 0:... v3 MAE “3:350:00 0:... £03833 00:3...va 5:833: 3063? 05 $038300 003:3 30:::o: 003303 03233::E 05 #03838 950:: 3:033um 0A» .8“ v3.5 mAZBEO .mmmo .QEO .QmZBE dam .mfi0:0wo:0u0: 3:63:03 m:§3>-0:5 3:? EUR: 3:033: 05 v3 Ameowv 0333003 “0:3 Siam :8 v3.5 >8 93 3mm CV ”00qu mzod- ozod oeHod- uefiod wwwod- owwod vied- wmfied wwwed. defied nmwod- :med mAZgEO-ow midd- ezod mofied- defied mwwod- wwwed HEod- defied mofied- mowed owwod- wvmod Nanny->8 mtod- mowed oeHod- wwwed vowed. oowed aged- ozod momod- oowed Hmwod- ommod GEO->8 mmHod- emwod n2o.o- Hmwed mwwed- wwwed oEod- mvwed moHod- wowed mmwod- mvmod mAZBE-gn— meod- eHwod nnHed- Hmwod mwwod- wwwed emHod- mvwod oofiod- flowed fimwed- $mod mam-3m mm~ed- meod wfiod- mmwod owwod- mowed mmHod- wwwed ooHod- oowed wmwod- ommod 30:5 oH-I-H vowod- onod mmHed- mowed ooHod- weHod owwod- wwwed weHod- oowed nmfiod- wowod mQZBEU-PH nowod- eHwod emHod- wowed HSed- oowed oawod- ovwed defied. owwod vmfiod- vnwod Whammy->8 efiwod- vmwod vewod- mwwod nmfied- wwwed awed. ovwed oodd- owwod 38d- vmwod QED->9. oedd- wwwed vaod- omwod headd- wwwod mfiwod- vowed aged- mvwod wnHod- wwwed mAZBE-gm eoHod- wwwed uvmfiod- emwod head- wwwed meod- vowed defied- wwwed omHod- wwwed mam-3m moHod- ovwed meod- ewwed oedd- wwwed :wed- mowed wwwed. wwwed emHod- wwwed 30:5 wHH wmaod- defied omHod- owHod moood- defied mmHod- meod emfiod- defied wvood- fixed mA§20->H omHod- wwHod enfiod- meod mooed- mted wmfied- moaod omHod- mmHod good- mowed ammo->8 eoHod- ovwed mmfiod- omHod Hoood- mtod wmfiod- mowed meed- meod wvood- mowed QED->8 vaod- wwwed vmfiod- mmfiod nwooed- defied meod- ended mmHed- thod wwood- vaod mAZBS-Bm ungod- wwwed vmfiod- mmfiod moood- defied meod- mzod mmHod- wtod ovood- emHod mam-3m meed- mmHod Hmfiod- wmfiod mmood- defied omHod- wwwed mmHod- eSod good- wgod 30:3 mHH. oom oom ooH oom oom oofi Z 35 MmE 3mm Mm; .035 Mmz 3mm mmz 3mm WmE 3am Mmz U0fi02>0fi02 : wfi30uo0fl : w:m30:o:m Amd H page mmE< 0:”— mo 335 «0:0 flotm cesium 302 ”2 .< 033:. :035300 mac/:52 00:35.38 05 0:0 mam 00:35200 0:: £03853 00:3va :SEEE: 306020 0:» £03838 003000 30:2:o: 003303 03203038 0:» £03838 950a 3:053b 0:» Hoe 6:30 m52>>20 damn—O .920 .5mZ>>2 .mmm .wum0:0m0:0a0s 3:03:05 m:E3>-0:5 new? 300:: 3:0535 0:: 0:0 Aweowv 0353003 0:0 ERAS :8 0:03 >8 0:0 him So “00:02 mmHod- wwwed Howod- mmwod mowed- Hmmod SHed- mowed ofiwod- mmwod ewmed- Emod mAZEO-xrfi mde- wmwod oowod- wwwed wwwod- ewmod mmHod- wowed Ewed- wwwed mwmod- owmod ammo->8 ~mwod- owwod owwod- mvwod onod- wwmod mmfiod- mowed Ewod- mvwod vwmod- wwmod Q20->.H. wwHod- wwwed mo~o.o- wowed oowod- Homed vmfiod- waod :wod- wowed vwmed- $mod mAZBE-Bm owfiod- wwwed meHod- eewod newed- ewmed meod- owwod oowod- oowed ewmod- owmod mam-3m meed- wwwed weHod- mmwod eewod- Hmmod wwHod- owwod wewod- wmwod ~wmed- Hemod 30:5 eHHH. wmwod- wwwed emwod- mmwod wHwod- wwwed uwwwed- omwed wvwod- omwed mowed- mvwod m52>>20->.H eowod- wwwed wwwed- mowed waed- mmwod wwwed- omwed wwwed- oowed vowod- wwwed mmmO-xrb wmwod- vemod wewod- mmwod ovwed- owwod wwwed- omwed wwwed- wowed vowod- owwod Q20->H Hmwod- omwed mmwed- waod Ewod- omwed eewod- wemod owwod- wmwod vowod- owwod m5232->>m Hmwod- owwod mmwod- fiwwod meod- omwed oowed- womod emwod- mmwod mowod- mmwod mam-3m Bwod- mmwod Hmwod- owwod oewod- oowed mowed- wemod omwod- Hmwed wofiod- omwed 30:5 wHB mowod- w~wed wowod- oned mmHed- meed meed- defied ooHod- mowed Seed- defied m57§>20->,H mowod- owwod wowod- wwwed meod- weHod meod- mowed emHod- eHwod mmood- wowed Mambo->8 meod- Hmwod Swod- mmwod emfiod- mowed ewfiod- mowed mmfiod- wHwod mmeod- oowed Q20->H mowod- owwod Mwowod- amwod omHod- omHod emHod- onod meod- meod emeod- mmfiod m57§>2u>>m mowod- owwod vowod- Hmwed mmfiod- omHod omfiod- Swod meod- meed mmood- defied mam-BAH Howod- wwwed flowod- omwed mde- ode vmfiod- Swod vmfied- meed fiwood- ooHod 30:5 nHH oom oom oefi oom oom ooH Z 35 mm2 3mm Wm2 3mm Wm2 32m mm2 3mm wm2 32m Wm2 0050220002 : w:m30:00m : w:m30:0:H Gd H 300 mmm< 05 do 835 0:3 muobm 00300m :302 ”E .< 030:. Appendix B Tables for Chapter 2 79 0:05.030: 5:350:30 03 0:005:03: :2 802:3: 05 mid 0030> 900:: 03 203:3: 05 ”0:02 22.3 22.3 22.8 $2.3 H o 2.3 $3 $3 H020 Em 2:9. 2: $2 82220 95 H20 20 m H 3% 525 $3.3 320V :32 3.8 o 232 802 H32 20.: 3582 02.882on 2E 0.2.3 $2.3 $2.8 32.3 H 0 H03 23 ES two a - m B? 22 22H 0208 2: :H H 23 $8.3 $8.3 $8.3 38.3 $ 2 20% 80mm 80% 8.0.2.0. 005 @822: 2: 0% SHE: 83.8 83.8 3:1: H o 2.3 23 2.3 2.3 253 2 .0208 2: 2 H 283 32.8 208 203 303 H 0 mod 23 HH.o 2.3 Hi E H 0 38 “$055: a: :0 ems 2: HH H E: 22.8 22.3 32.8 22.3 H o 82 2:0 $3 20 28; 380:5: 0:283 Si 08: a: $2 0me $2 22 8095mm: 0325/ 8:235 0>umtom0Q 3m 0302. 0 00 00:13-2 w:€:oam0:oo 05 0:0 200228.82 :m 052:5: 05 «0:0 H H :2 H oh 000... 3 :00: 0:0 000: 922300 052805. «0:: 0052030 3: 23 0:03:03: mfimmabmuoon oom Sea U0E030 0:0 mmm< 0:: mo 0:020 20:30? 05 0:0 0:020 0:02:30 00:20: 0:: 200230.82 5 $0252: 02% A8023 U:.0 630.3 2.8.x. mo m0w0.H0>0 02:3 0H: 0.02 2020:: 20:03.08“ 05 “:0 mm? 2320.5: £2 :8 00:38:30 :00» 0:62:00 20:2: :0 Amv 502200200: 2308300 005220 22002 80:2:0: REE?» 02283228 0:0 2808300 2503 20:05Q0d 05 :3 «0:30 Amzzfiz 9:0 .mmm CV 00qu 288.8 32.0 2.888 22.0 as 8225 0225/ 283.8 883 203.8 38% 232882: 88.8 28.8 88.8 28.8 8 H88- 33. H88- 33. :58. 2E 28.8 2.88 28.8 2208 2.8.8 Hood mood H88 23 H88 @802 2.88 288 2.8.8 :88 225.8 23. $3- 3;. god- H88- ti 28.8 28.8 28.8 83 83 08.9 8.5 25.8 25.8 88.8 22¢. 23.. 83. 02% 0%. H80 0%. H80 H80 SE UE25090: £232 000 38: 38% H520 H5222 EEVSQSE 30:5 #0602 3:328 mso:0woxm 20:02 20:0:me 05 0:0 80:5 0.: ”Wm 053; 81 .moflgd wfiwaoammboo 23 m3 £55533 E $3852 23 9:2 H w: ” om amen. 3 1%: mm 0383.5 220 A3 8:03:25“ wfimaabfioon oom Sod 3:830 mam mmm< 05 m0 888 Esta-3m 2: was Soto Eavamum $532 was $8588.83 5 338:: 2:- 9; 6:3 98 .883 3.me mo mowflga 08$ 2: was am? 59523 33 H8 8:556 30% mE-ficoo £355 :8 A8 .N was .8“ EEQmong-o: 1323?: mo 886580 23 mm 8: Amy .bgsooammu £38838 mayday-a- »mafl $0530: 3332» 3352:58 8595880 23 98 £36838 mod-mummy 82558 60633 2: Jog-«83mm EOE 3:030an wofiwbmaoo 2: H8 madam 152320 95 .QED .mmmo CV ”mm-.62 $53 888 :2 220 88.3 38.3 88.3 88.3 88.3 38.3 8.0- @88- 88- @88- 88- @88- 2% 38.3 :83 $8.3 38.3 38.3 38.3 838 :38 838 23¢ 83. :3. 83.4 38.3 98.3 $8.3 38.3 383 $83 80.0. 83- 838. RS- 888- 83- Ex 88.3 38.3. 38.3 838 88.8 88.8 28 :83 38.3 :83 83- 83- 83- 838 38.3 88.3 38.3 M88 388 r88 8 383 8.8.3 :83 83 83 £3 8 88.3 98.3 :83 83 $3 83 a: $2 88 $2 88 an? 88 8; 58238 32320 820 $20 833 82:02 bate-m msoaowoxm ”$on $193-02: H :28ch m of- ”m.m 03mm. 82 .mmggd 95000080000 05 08 £83000an E 3008:: 05 000 H H “E “ em 80... 00 wow: 08 0mg 005200 03-3-5» 0:0 mosmsfim 33 3; 000330000 9200053002 oom 80¢ 005030 20 00-0 who-:0 080006 ”74500 80 man-050050 E 2008:: 05H 9; 03:38.“ 00m 020m 00:02 05 E 800-8 2t 95 2:0: 08:03 #000303.“ How 0030000 ESQ-055m 05 E 0008 05 000333 00330200 2: 850000 Q A3023 0:0 .333 “Hm-£0 W0 mmwmugw 083 on» was 6008 $00500£ 0:0 was $3 :mzoufi owe 00m 86:56 30% mag-3:00 $0008 :0 A8 503000080 £30838 20,000 #00050de 23 000 0000858 0385? $888:me 300mm 083 0.: 00m 000% mam was >55 CV 8802 A853 880 “we 8:68 232:3 388.3 83.0 88 A283 s: 38.3 End q 383 38.3 88.3 8.8- 838. 88. “5.3 88.3 38.3 88.3 838. 83. 88.8- .883 :83 88.3 38.3 83- 83- 33- Ex 38.3 :83 N88 838 .85 0.83 88.3 «88. 83c- «83 mE< 00205000 pamwowcooO 33K unmccmaofl an: 380 BE 8822 anfiogrm H0003 #0002 baton mzoaowowcm ”—0002 3000005 98 8005 BE- ”vd 2an 83 .m030>.m m0€00000too 05 0.8 06050050 E 00038:: 05 0:0 H H 0: H ch ”.00”. 0» 000: mm 0303000 220 A8 .o M “Q ” om 00m 000”. 30.5 05‘ mm 0000 3m000m000m 3V 000330000 mafiadfimuoon oom 80¢ 00530.0 000 mmm< 05 m0 000000 00000000 05 000 000000 00000000 09500 000 0600300000 E 3090:: 0:9 93 .023 0:0 .805 .8000 m0 m0w§0>0 085 0:... 000 $3 5900:: owe 00m 009556 000% 0500000 E0005 :0 3v .0 0:5 00m 3M000w000u0£ 10:03:05 m0 E06E000 05 E 0: A8 0203000000“ "00008500 0000006 EEEEE #006,430 05 000 00008300 0505 BEEZQ 10003000.“ 05 00m 0030 QED 000 mama 2V ”00002 3033 83:. @3220 @523 3.8.2 02 3050020 :33 £3 E 383 $83 383 $.83 N83 Sod- Sod- Sod- 2% 833 $33 :83 883 $5.? $3- SS. 33. $5 383 323 0.83 :23 $3. 9.3. $3. $3- Ex $83 383 SS. mood ”3 $83 $83 33. 83. 02.3 :33 $33 $20 and § :53 $03 :3 30.0 .3 0.3.3 833 was ”2: S 0% 080 an? 080 Si 082300 020 0000 380 @232 bfitom msoaowowam A0002 wfib0>08F Ecouofim 2:. ”Wm 030B 84 0003050000 00200 083 0:0 0000500000000 “0 000800 05 000000 rm 000 2 QB 40008 03000 000E0>E 005008 05 0000 Emma 0000-0flwfim 00”. 0:0 030fl0> 00003000 00 m0 0m0am 0000 05 ...0 005050 3:300“ 000 m020> 00000 05 0min mam 0000.038 Amv 5030000000 #0008300 0505 0002039 3020000 05 000 00008500 00.05 “0000.000 000 ..8008300 03000». BE0EE$E 300m0 00x0 0% 00m 0005 mmmm 000 .mmm (Cam AS ”00002 85 353- £80 8:53 «38¢ «$85 $33 82 :83- $83 $83- 2300 £83 583 as. good- 23:. :83 088d 383 30.3. com 8:5. 933 $83- $035 £83 333 com fins 258d- game... £33 £83 $23. £956 82 $33- 38.0 883 £086 8.0.30 Sago CE 5803 80:2. 883 053 $230 :33 com. :83- 833 883 $03 3.de 333 can an.“ z 900 032 920 mm: E0 092 0000 30.2050 000 @335. >000 09:22 Edema—00¢ .000ch #0002 mmh< 05 00 momfim 000 £85 08003 0002 6d 038. Appendix C Tables for Chapter 3 86 008870 980800000000 00: 000 00005000000 8 0000880 000 0000 H H :0 ” cm 0000 00 000: 00.0 0000 80000000 03000000 080 080000000 33 3 0800800000 9080000000000 oom. 8000 008030 000 0mm< 00: 00 000000 000000000 00: 000 000000 000000000 005000 000 0000080000 8 0000808 09H A9023 080 .303 .0000. 00 00m000>0 080.0 003 080 .0w0 £005 6on Am0080 owmfi 000 0008800 000% 080800 000008 :0 A3 5030000000 .00008500 00000000 00000 00088000 00.5808 00000030008 0000 .00008500 00080005 8080008000000 00: 000 08000 Angz 0000 @320 AS ”00002 280.8 88.0 380.8 080.0 8:88 28:8 880.8 800.0 88.8 800.0 08128820 888 88.8 :88 88.8 88.8 :88 08.8 08.8 08.8- 088- 08.8- 088- 20.8 38.8 88.8 88.8 38.8 08.8 88.8 88 08.8 88 8.8 08.8 088 803 888 8.8.8 0088 38.8 88.8 88.8 08.8- 08.8. 088- 088- 888- 088- ti 833.80%. 080 080 8335 000 000.. 080 03 050580 88.8 08.8 88- 08.8 0:3 008.8 08.8 08.8 08.8 8E $8.8 08.8 80.8- 80.8- 0.0.0 “000 0000 8 A 83%: “80000000Q 0.0232 0.020 0280 09:22 08—00000 0:000woxm ”#0002 20080 800000000 05. ”0.0 050,—. 87 0080070 80000000000000 0000 000 0000000000000 8 000080000 0000 0000 H .|.. 000 0 00.00 0000 00 000: 00 000000000 0220 A3 000000000000 w008000000000, com 8000 008030 000 0mm< 0000 000 000000 00000000 00: 0000 000000 0000000000 000500 0.00 0000000000000 8 00000808 00H. A3 .0003 0000 .8080 000.0000. 00 00m000>0 0800 0000 0000 .0w0 000003 6me 009000000 $2 000 00088000 000% 08000000 000008 :0 Amv .0 0800 000 0800003000000 0000000008 00 0000005000 0000 00 000 A3 0000300000000 0000800000 000000000 00000 0008000000 008803 00000030008 00: 0000 .000080000 000000000 8008088 000000000 00: 00000080000 000800000 8080008000000 00: 000 000000 0520502 080 .020 £320 CV 000002 00000.8 0000.0 080 02020 88.8 008.8 88.8 008.8 88.8 008.8 08.00- 0.8.00- 08.00- 08.00- 088- 08.00- 20.20 008.8 008.8 008.8 88.8 88.8 008.8 08.00 08.00 08.00 0008 08.00 08.0. 800:0 $8.8 88.8 008.8 008.8 008.8 008.8 08.00- 08.8- 08.00- 08.00- 808- 008.00- 0.00 008.8 008.8 008.8 80.00 000.00 08.00 :0 0000.8 000.8 008.8 08.0 08.0 08.00 8 00008 008.00 008.8 000.0 000.0 08.0 8 000.. 080 000 080 002 080 800 H0800002080 88.8 008.8 88.8 88.8 008.8 08.8- 08.8- 08.8- 08.00- 08.00- 2.0.20 008.8 008.8 008.8 68.8 88.8 08.0. 808 08.00 808 08.00- 8050 88.8 008.8 008.8 88.8 008.8 0088- 80.00- 0000.00- 0008- 808- 0.00 a A 0005 0000 080 a A 00.05 0000 080 s A 85 00.0 080 8 A 850 H00000052080 007082 0500020280 0000 00000 0500020280 00500002 080000000 0000000w000m ”000002 8300070800. 000080 0000000000 000,—. ”NO 030,—. 88 0080070 9800000000000 080 000 000080000000 8 00080080 0000 800 H H 000 “ cm 0000 00 000: 000 0000 00000300 0300000. 800 0000000000 33 AE 00000000000 w8mm00000000' com 8000 00080000 000 800 000000 00000000 000500 000 000080000000 8 0000880 08H. AB 80020000 000 8000 008000 080 8 000000 080 000 0030 000 00000000 8 000000 080 00000 0000200000 000 8000 008000 080 8 000000 0000 800 SAbiH 00.0 00000000000 8 00000 080 000030000 0000000000000 080 000000 an 800 S A3023 800 .8080 .8000 00 00w000>0 0080 0:0 800 .0w0 .0303 6me 830080 .0me 0000 0008880 000% 0800000 00008 00% :v ”00002 608.3 8008 8808 2005 308.3 00088 080. $083 20 $083 80.8 00 88.3 883 00.83 88.8- 08.8- 888- 8.20 88.3 38.3 88.3 088- 08.8- 0088- 80.00 68.3 008.3 800.3 088- 808- 808- 0.00 3385 00.0 00.0 080 030 080880 @003 088 a 38.3 088- 0:00 08.3 088- 80.0 0000.3 088- 0.00 0000 8 A 008.30 ”0000800009 88808 0800900 88588 0080 8082 00000000000 0:000w000m ”00002 000080 1000000000 .0 05. ”0:0 030B 89 0080070 9880080000000 0000 000 00000000000000 8 0000080000 0000 800 H H 000 0 00000 0000 00 0000: 00 000000000 3:20 A8 .0 H 090 0 cm 00.0 0000 2005 0000 00 0000 08000000m080m A3 0000000000000 9805000000080 oom 8000 00080000 000 0mE< 0000 00 000000 0000000000 0000 00000 000000 00000000 000800 000 0000000000000 8 000080000 003. A3 .0003 00000 690.000 .8000 .00 00w000>0 0080 0000 00000 .000 £003 .mwg 003000000 00me 000 0008000000 000% 08000000 00008 0008 Amv .0 0080 000 0800000w0000000 00008308 00 0000000000000 080 00 000 Amy 080080000000 0000008000 000000000 000000888 000000000 0000 0000 000008000 000800002 88800080000000 0000 000 80000 D20 0000 @0020 CV 0000 Z 888.3 888 080 02020 80083 8888 080 000880080 883 088 000 888.3 88.3 88.3 888.3 88.3 88.3 0088- 888- 888- 0088- 888- 088- 8.08 0008.3 88.3 008.3 808.3 88.3 88.3 0088- 088- 888- 8088- 088- 8088- 88.0 88.3 88.3 80.3 88.3 88.3 800.3 0088- 0008- 088- 0.88- 0.008- 088- 0.0.0 808.3 88.3 0.88 088 08 88.3 88.3 0008 0008 800 0000.3 80.3 823000000 00< 080 888500.00 0000 080 002.0. 85008080 0020 0020 008080880 8900800 080000000 0000000w0000m ”00002 8800070800. 0:083 000000000000 000.0. 000.0 030.0. 90 BIBLIOGRAPHY 91 Ahn S.C., Y.H. Lee and P. Schmidt (2001), "GMM estimation of linear panel data models with time-varying individual effects," Journal of Econometrics 101. Ahn S.C, Y.H. Lee and P. Schmidt (2007), "Panel Data Models with Multiple Time- Varying Individual Effects," Journal of Productivity Analysis 27. Angrist, D. and Evans, W. N. (1998), "Children and Their Parents Labor Supply: Evidence From Exogenous Variation in Family Size," American Economic Review 88. Arellano, M. (1987), "Computing Robust Standard Errors for Within-Groups Estimators," Oxford Bulletin of Econometrics and Statistics 49. Carrasco, R. (2001), "Binary Choice With Binary Endogenous Regressors in Panel Data: Estimating the Effect of Fertility on Female Labor Participation,"Journal of Business and Economic Statistics 19. Chamberlain, G (1980), "Analysis of Variance with Qualitative Data," Review of Economic Studies 47. Freedman, D. and Sekhon, J (2008), "Endogeneity in Probit Response Models," Unpublished. U. C. Berkeley CA 94720 J acobsen, J .P., Pearce, J. and Rosenbloom, J .L (1999), "The Effects of Childbearing on Married Women's Labor Supply and Earnings Using Twin Births as a Natural Experiment," The Journal of Human Resources 34 Bay, J. (2005), "Panel Data Models with Interactive Fixed Effects," unpublished manuscript, New York University. Kiefer, NM. (1980), "A Time Series-Cross Section Model with Fixed Effects with an Intertemporal Factor Structure," unpublished manuscript, Cornel University. Lee, Y.H. (1991), "Panel Data Models with Multiplicative Individual and Time Effects: Application to Compensation and Frontier Production Functions," Ph.D. Dissertation, Michigan State University. Liang, KY. and SI. Zeger (1986), "Longitudinal Data Analysis Using Generalized Linear Models," Biometrika 73. Mundlak, Y. (1978), "On the Pooling of Time Series and Cross Section Data," Econometrica 46. Papke, LE. (2005), "The effects of Spending on Test Pass Rates: Evidence from Michigan," Journal of Public Economics 89. 92 Papke 2008 : Papke, LE. (2008), "The effects of Changes in Michigan's School Finance System," Public Finance Review 89. Papke, LE. and J.M. Wooldridge (1993), "Econometric Methods for Fractional Response Variables with an Application to 401 (K) Plan Participation Rates, " Journal of Applied Econometrics 89. Papke, LE. and J .M. Wooldridge (2008), "Panel Data Methods for Fractional Response Variables with an Application to Test Pass Rates," Journal of Econometrics 145. Rivers, D. and Q.H. Vuong (1988), "Limited Information Estimators and Exogeneity Tests for Simultaneous Probit Models," Journal of Econometrics 39. Rosenzweig, M. R. and Wolpin, K. I. (1980), "Life-Cycle Labor Supply and Fertility: Causal Inferences From Household Models," Journal of Political Economy 88. Terza, J .V. (1998), "Estimating Count Data Models with Endogenous Switching: Sample Selection and Endogenous Treatment Effects," Journal of Econometrics 84. Wooldridge, J .M. (1991b), "Specification Testing and Quasi-Maximum Likelihood Estimation," Journal of Econometrics 48. Wooldridge, J .M. (2002), Econometric Analysis of Cross Section and Panel Data. MIT press, Cambridge, MA. Wooldridge, J .M. (2005), "Unobserved Heterogeneity and Estimation of Average Partial Effects," in Identification and Inference for Econometric Models: Essays in Honor of Thomas Rothenberg. D.W.K. Andrews and J .H. Stock,(Eds.), Cambridge University Press, Cambridge. Wooldridge, J.M. (2007), "Quasi-LIML Estimation of Nonlinear Models with Endogenous Explanatory Variables," unpublished manuscript, Michigan State University. 93