.Wfimfiuwh J‘I' . u I. p r. 1. mm .t. 3. u 0‘7 ||| d4 3. Ad ”Hat . J; r. , .3“ . ‘15:}: "H222... 1. Lu ‘uvfimw . . . . ‘ .52 . ‘tnzi .. 5 1. K‘ 5a.?) .. ifr an»... 6 , v u; 3 .hv u p b I I : 2W u. :33 5.2. 1‘... . % .....o.u..u~.w.u . J. 1 t .P Q A. V .f A. , _ . u. ”gar“... «inn: 0. . in 5.; 2.33.: . z. .2 .12.. 4% h |.._1\ a); q ('41. 1' ”'53-.- . «oflnuu : . . at. {75. .525... A V g, I”. l: 1 0.. Jul! 5.1:! (I urns“ >u: .‘ {€19}; . «v.37 . {Wins . ..‘. m 2&0? .LIBRARY Michigan State University This is to certify that the dissertation entitled THREE TOPICS ON ELECTIONS presented by MAKOTO TANAKA has been accepted towards fulfillment of the requirements for the PhD degree in ECONOMICS ()élc/x.\ 7 ilk/L, Major Professor’s Signature 8 ) 290102? Date MSU is an affirmative-action, equal-opportunity employer PLACE IN RETURN BOX to remove this checkout from your record. TO AVOID FINES return on or before date due. MAY BE RECALLED with earlier due date if requested. DATE DUE DATE DUE DATE DUE 5108 Kthroj/AccaPreleIRC/Dateoue.indd THREE TOPICS ON ELECTIONS By Makoto Tanaka A DISSERATION Submitted to Michigan State University in partial fulfillment of the requirements for the degree of DOCTOR OF PHILOSOPHY Economics 2008 ABSTRACT THREE TOPICS ON ELECTIONS By Makoto Tanaka This dissertation consists of three chapters. In chapter one and two, I consider information transmission problem from politicians to voters about possible outcomes of policies in a two-period-and-two-elections model (Chapter one) and in an infinite period model with political parties as overlapping-generations organizations (Chapter two). In Chapter three, I analyze the choice of policy instrument by an incumbent politician who can use an inefficient tariff and a less inefficient production subsidy to help domestic manufacturers. Chapter Qne: “Learning Through Elections” I consider choices of policy platform at elections when there is uncertainty about outcomes of policies, and one party has information advantage over another party and voters. This ability might reflect the ability of the politician in the party. Since the platform choice is based on the information the politician has, the choice at the election transmits some information to voters. Then, in one possible equilibrium, I show a case that the higher the ability of the politician, the worse the information transmission from politicians to voters. Chapter Two: “Information transmission from overlapping political parties” I analyze an information transmission problem, again, but in infinite period with two political parties as overlapping—generations organizations. Policy outcomes depend on the true state of the world, which changes in each period. Both parties receive information about the true state. Platform choices are made by old politicians in parties who will retire soon. So, the next election does not restrict their opportunistic behavior directly. Still, I show that the OLG party structure restrains the opportunistic choice of platform by old politicians. Chapter Three: “The Choice of Inefficient Instruments in a Simple Retrospective Voting Mgdel with Vgter Abstention” Tariffs are more inefficient than production subsidies as the instrument to help domestic industry. Still, tariffs have been used. In this chapter, I propose one explanation about why governments use inefficient tariffs. The basic idea is the manipulation of voter abstention. If an incumbent politician decides to help a domestic industry, he also needs to determine how to distribute the cost of the help among voters. Assuming that some voters’ voting abstain partially depending on the utility they receive from the incumbent’s policy, the incumbent could affect the choice of voters to vote or not. Naturally, he will try to take advantage of it. I analyze if the incumbent has incentive to choose tariff in two cases: same tax rate for all voter groups, and different tax rates for groups. To My Mother, Yumiko Tanaka iv ACKNOWLEDGEMENT I have so many people I would like to thank for my dissertation. However, because of limitation of space, I have to minimize the number of people to thank. First of all, I would like to thank my committee members. I appreciate the help of my adviser Professor John D. Wilson. Without his guidance and suggestions, I could not have completed my dissertation. I thank Professor Anthony Creane for his caring. I really appreciate it when things were rough for me. I also appreciate the conversations I had with Professor Susan Zhu and Professor Eric Chang. Because of their help, I could improve my dissertation a lot. Regarding chapter three, I also want to thank Professor Nelson of Tulane University. The conversation I could fortunately have with him helped and encouraged me a lot while I was working on the chapter. I thank Jennifer Carducci for her professional job and help. She deserves more chocolate. I also thank Stephanie O. Smith for her help on the format of this dissertation. I appreciate her kind help and apologize for all of my formatting mistakes. She deserves more cakes. Finally, but not leastly, I would like to thank all of my friends. They helped and encouraged me, and sometimes inspired me with their way of life. I appreciate all the people I met in Michigan. TABLE OF CONTENTS LIST OF FIGURES ................................................................................ viii Chapter 1 Learning through Elections ............................................................... 1 1 .1 Introduction ............................................................................. 1 1.2 Model ..................................................................................... 3 1.3 Updates and Platforms ................................................................ 9 1 .4 Equilibrium ........................................................................... 1 6 1.5 Conclusion of Chapter 1 ............................................................ 20 Appendix of Chapter 1 ........................................................................ 22 Endnotes of Chapter 1 ......................................................................... 33 References ....................................................................................... 35 Chapter 2 Information Transmission from Overlapping Generations parties ................. 36 2.1 Introduction ........................................................................... 36 2.2 Model and Behavior of Politicians ................................................... 41 2.2.1 Model ........................................................................ 42 2.2.2 Behavior of Politicians ...................................................... 52 2.2.3 Definition of equilibrium ................................................... 56 2.3 Results .................................................................................. 58 2.4 Conclusion of Chapter 2 ............................................................. 62 Appendix of Chapter 2 ......................................................................... 68 Endnotes of Chapter 2 ......................................................................... 75 References ....................................................................................... 78 Chapter 3 The Choice of Inefficient Instruments in a Simple Retrospective voting Model with Voter Abstention ............................................................................... 81 3.1 Introduction ........................................................................... 81 3.2 Model ................................................................................... 83 3.2.1 Economy side ............................................................... 83 3.2.2 Political side .................................................................. 86 3.3 Results with two groups ............................................................... 91 3.3.1 Single Tax Case .............................................................. 91 3.3.2 Different Tax Rates ......................................................... 99 vi 3.4 Three groups: heterogeneity in workers ........................................... 103 3.4.1 The optimal tax rates ....................................................... 105 3.4.2 Will incumbent use tariff? ................................................. 107 3.5 Literature Review ..................................................................... 1 11 3.6 Conclusion of Chapter 3 ............................................................ 115 Appendix of Chapter 3 ...................................................................... 120 Endnotes of Chapter 3 ....................................................................... 124 References ..................................................................................... 1 26 LIST OF FIGURES Figure 1.1 Flow of Game ........................................................................... 8 Figure 1.2 ............................................................................................... 9 Figure 1.3 .............................................................................................. 11 Figure 1.4 .............................................................................................. 14 Figure 1.5 ............................................................................................. 17 Figure 1.6 ............................................................................................. 19 Figure 1.7 ............................................................................................. 28 Figure 1.8 ............................................................................................. 29 Figure 1.9 ............................................................................................. 31 Figure 2.1 ............................................................................................. 50 Figure 2.2 ............................................................................................. 51 Figure 3.1 ............................................................................................. 90 viii 1 Learning through Elections 1. 1 Introduction The objective of this chapter is the construction of a model of two consequent elections with uncertainty about policy outcomes. Tiaditionally, electoral competition models in Political Economy have assumed that there is no uncertainty about outcomes of government policies. Clearly, this is a strong assumption. Before the implementar tions of policies, people can have only beliefs about outcomes. Politicians, however, might have more information about outcomes than voters. Suppose that a politician has some private information about possible outcomes. Then, given the information asymmetry, what policy platform will the politician choose during an election? Since the politician chooses the platform based on the information he has, the choice of the platform can be considered as the information transmission from him to voters. Then, will the politician reveal the information honestly? In other words, will the politician lie to voters? This is the information transmission problem during the election. From 19903, many papers have analyzed the problem with various assumptions about uncertainty (Harrington (1993a), etc.) have. Observing platform choices by parties during an election, voters can update their beliefs about the uncertainty. How- ever, there is no reason we can expect this transmission of information to be perfect. Obviously, the information asymmetry enables politicians to take Opportunistic be- havior; they can choose a bad policy and lie to voters that it is a good policy. The common structure of papers in the literature is the following; there is the uncertainty about policy outcomes, which depend on the state of the world. A politician (or politicians) has some private information about possible policy outcomes. He chooses a policy. This choice may be his platform during an election (Heidhues and Lagerof (2003)), or the policy he implements before the election (Majumdar and Mukand (2004)). Then, at the election, voters vote for the candidate they prefer. The im- portant part is this; since the politician’s choice gives voters some information about the uncertainty, politicians can manipulate preferences of voters on policies and /or politiciansl. This situation gives voters two Opportunities of learning about the uncertainty (i.e., the state of the world); one is when voters observe the choices of parties during the election, and the other is when voters observe the outcome of the implemented policy after the election. In multi-election models, the voting decisions at elections depend on these two opportunities of learning. Thus, politicians have to take into consideration the impact of the election platform at one election on the election after that. To model this, I consider a two-period and two-election model. Of these two elections, the first election is for the information transmission. The second one is for the retrospective voting based on the result of the implemented policy after the first election, which guarantees that at least some information is transmitted to voters during the first election. In addition, I assume that one party has the information advantage not only over voters but also over the other party. This information advan- tage can be considered as showing the high ability of the politician in the party. The symmetry between parties during an election is a common assumption in electoral competition models without an incumbent. Though Heidhues and Lagerof (2003) and Bernhardt, Duggan and Squintani (2007) also assume the asymmetry between parties, the asymmetry in these papers means that parties have different information. The asymmetry here is stronger since only one party receives the information. This is a strong and uncommon assumption. However, the asymmetry between parties in the sense of this chapter is not unnaturalz. As a factor causing such asymmetry between parties during an election, "valence" has been incorporated into electoral competition models recently (Schofield (2005), etc.) So, the ability of the politician in this chapter can be considered as the valence factor in this chapter. Then, I show that this asymmetry has an interesting implication when the ability of the politician changes. The remainder of this chapter is organized as follows; section 1.2 describes the model. Section 1.3 describes the behaviors of voters and two parties. Section 1.4 shows the existence of equilibrium and other results. Section 1.5 discusses about this model and its results. All proofs are in the Appendix of this chapter. 1.2 Model I consider a society with two parties (Party 1 and Party 2) and voters in two periods. There are two possible policies, at and b. The society can implement only one policy in one period. There is no uncertainty about the outcome of the policy b. But the outcome of the policy a. is uncertain. A natural interpretation of these two policies is that the policy b is the continuation of a current policy, and the policy a is a new policy. As I explained in the introduction, no one is perfectly sure about the outcome of the new policy, there is the uncertainty about its outcome. In each period, there is an election. The society decides which policy it implements in the period by the election. Candidates from two parties (one from each party) announce one of two policies as their policy platform. Voters vote for the candidate they prefer, and the elected politician implements his platform. Candidates from two parties are new candidates. If a candidate loses the first election, the game is over for him. At the second election, the party that lost the first election put up a new candidate. The politician who won the election gets a fixed rent, which is normalized to one. Let 6 be the common discount factor. The objective of candidates is the maximization of expected sum of rents from two periods. They are risk neutral. The outcome of the policy b is the utility 6 6 (0, 1) to voters. The outcome of the policy a could be success or failure. The outcome depends on the 1 state of the world, w0 and w . If the true state is two, the outcome of the policy a is failure and the utility 0 to voters. If the state is 2111, the outcome is success and utility 1 to voters. No one in the society knows the true state. Instead, voters have belief q 6 (0,1) as the probability of w1 being the true state. This q is lower than 5. This structure of uncertainty is almost same to the Majumdar and Mukand (2004). But unlike the paper, one implementation of the policy a is enough to know the true state of the world. Party 2 candidate also has belief q, same as voters. Since q < 6, without further information about the uncertainty, the society should choose the policy b since the expected utility of the policy a is only q. But Party 1 candidate receives a signal about the probability of the true state being w1 before the first election. To make this model simple, I assume that he receives the probability of state to1 directly. Denote it as p. This means, the best policy choice of Party 1 candidate for voters is choosing the policy a if p > 5 and choosing the policy b if p < 6. If p = 6, his choice does not matter. This p is a random variable, and its distribution depends on the true state of the world. If the true state is wl, the cumulative distribution function is F (p) If wo, the function is C(p). For 1) E (0, 1), C(p) > F(p). Moreover, F and G are assumed to be differentiable, F’ (p) > 0 and G’(p) > 0 on (0,1), and limp__,0 F’(p)/G"(p) = 0, limp_,1F’(p)/G’(p) = 00, and F' (p) / G’ (p) is monotonically increasing (i.e., monotone likelihood ratio property.)3 These cumulative distribution functions can be considered as representing the ability of the politician. These properties mean that when the candidate receives the high value of p it is more likely that the true state is w1 than that the true state is mo, and vice versa. These distributions are common knowledge. So, there is an informational asymmetry between Party 1 candidate and vot— ers/ Party 2 candidate. Then, Party 1 candidate’s policy platform is considered as a signal from the candidate about the probability of the true state being wl. As the strategy of Party 1 candidate, I consider the following simple one; before the first election, the candidate chooses the value T E [0, 1] as the threshold value such that if he receives p > 7', he will announce the policy a, and if he receives p < T, he will announce the policy b. If T = p, he will or will not announce the policy a. Since p and B are the expected utilities of policies a and b, respectively, the value of T different from ,6 means inefficiency. If T < B, when p E [T, 6), Party 1 candidate would pr0pose the policy a even though he knows the policy b has higher expected utility for voters. If T > B, when p E ([3, T), the candidate would propose b even though the policy a has higher expected utility. With this strategy, T = 1 is interpreted as the candidate never announces the policy a. T = 0 is that the candidate always announces the policy (1. Since the choice of a when p > B and the choice of b when p < B is the best for voters, the best value of T for voters is 5. Party 1 candidate’s strategy during the second election is simple because of the assumption about the uncertainty. He announces the policy that voters have higher expected utility. In a case that he announced the policy a during the first election and won, then at the start of the second period, every member of society knows the true state. If the state is wl, it is clear that the policy a is better than the policy b. Then, the candidate should announce the policy a as the platform during the second election. If the state is 100, it is also clear that the policy b is better. In the case that he announced the policy b during the first election and won, he announces the policy b again during the second election. This is because his platform during the second election does not give any information to voters. This is intuitively clear. Since the candidate cares only about the office rent, not the outcome of policies, if announcing the policy a during the second election is beneficial for him with a certain value of p, then it is beneficial for the candidate with any value of p to announce the policy a during the second election because his expected utility during the second election does not depend on p. Thus, whatever the value of p, Party 1 candidate announces the policy d. Then, the platform a. does not give any information to voters. There is no reason for voters to update their belief. Thus they do not update their belief. Since the belief during the second election is the updated one through Party 1 candidate’s platform during the first election and it is lower than [5 (since Party 1 candidate announced the policy b during the first election), it is beneficial for the candidate to announce the policy b. The strategy of Party 2 candidate is as follows; during the first election, expecting the value of T, the candidate uses the mixed strategy. Denote as c the probability of his announcing the policy a during the first election. His strategy during the second election is same as Party 1 candidate. Since during the second election, it is clear that which policy should be proposed, I do not need to consider about the strategy of the new candidate put up by the party that lost in the first election. Knowing Party 1 candidate’s strategy, voters (and Party 2 candidate) try to infer the value of T rationally and update their beliefs about the state it)1 after observing Party 1 candidate’s platform, by the Bayesian rule. If Party 1 candidate announces the policy a, the updated belief is higher than the initial one. If the candidate announces the policy b, the updated belief is lower. At each election, voters vote for a party according to their period preference. The preference consists of three parts. The first part is the expected utility from the outcome of implemented policy. The second is a random noise. Thus, this model has an element of probabilistic voting model (Persson and Tabellini (2002)). All voters receive same noise during each election. This noise represents factors about parties and candidates other than the utility derived directly from the implemented policy, like the advent of charismatic candidate or the eruption of scandals. This noise 5 is an i.i.d. random variable with a distribution H. This s is voters’ bias for Party 1 and independent of the announced policies by candidates. It can take negative value as well as positive value. If it is negative, it means bias against Party 1. I assume that H (0) = 0.5 and H ’ (e) > 0 on its support, the support of H is connected and large enough, and H is symmetric. The third part is the assumption of the retrospective voting behavior on the implementation of the policy a by Party 1 in the first period. (A) : If the policy a is implemented by Party 1 after the first period and the outcome is success, voters vote for Party 1 candidate at the second election. If the outcome is failure, voters vote for Party 2 candidate. As I wrote above, if policy a was implemented in the first period, there is no un- certainty in the second period and both parties pr0pose same policy as their platform during the second election. So, the choice of the party to vote for does not matter for voters. Thus, voters can use the second election to reward or punish Party 1 (i.e., the retrospective voting.) It is natural to assume that the success increases the probability of winning the reelection and that the failure decreases the probability. This is the retrospective voting; voting based on past performance. This (A) is the simplified and extreme version of such reaction. This works as the incentive for Party 1 candidate to announce the policy a. Without this, as I will show later, there is no equilibrium. This voting reaction to the policy a is applied only to Party 1 during the second election. In other cases, voters vote for Party 1 candidate if, EU (Party 1 candidate/s platform) + e > EU (Party 2 candidate/s platform) where, EU means the expected utility ¢=> e > EU(Party 2Is platform) — EU(Party 1/3 platform) If the opposite inequality holds, voters vote for Party 2 candidate. If the equality holds, voters vote for Party 1 with probability 0.5. Since a is a random variable, the probability of Party 1 candidate winning the election is 1-- H(EU(Party 2'splatform) —- EU(party 1'splatform)) The flow of the game in the first period (see figure 1.1) First period Flow of Game 2 Signal p 9: d 5 P l h m arty c ooses ,_, g, platform Voters update E? E g 589 g . belief —' 3 g Q a g E! —" (b '"h "U (D D (p Si 53+ 3 :l : Party 2 chooses 5 g a co {3+ platform H l: :3 r» H. (D Party 1 chooses threshold 1 for its platform. Party 2 expects 1.‘ and chooses its platform. Figure 1.1: Party 1 candidate receives the signal p. Candidates announce platforms simultaneously. Voters observe platforms and update their belief. Voters receive noise 5, and vote for the party they prefer. The Elected party implements its platform. The outcome of the implemented policy is observed. If the policy was a, then every member’s belief about the state of world is updated. Second period. The equilibrium is defined as the profile of values of T and e such that, (1) Voters and Party 2 expect the value of T and update their belief rationally (Bayesian), (2) With the expectation of T, c is the optimal value for Party 2 candidate. (3) With the value of c and the expectation about the value of T by voters and Party 2 candidate, T is the optimal for Party 1 candidate in the sense that it is the optimal for a candidate with p > T to announce the policy a, and that it is optimal for a candidate with p < T to announce the policy b. (4) All expectations are correct. 1.3 Updates and Platforms In this section, I consider behaviors of voters and party candidates in turn. For notational simplicity, I abuse notations in denoting not only the threshold value choice by Party 1 candidate but also all expectations about it as T. I also treat T and p interchangeably when I consider belief updates. Voters. (see figure 1.2) ‘___qa(t) ‘_____qb(t) 12,13 Figure 1.2 Voters update their belief about the probability of the true state being wl after observing the platform of Party 1 candidate. Let T be the threshold value expected by voters. In the case that Party 1 candidate announces the policy a, then the updated belief is, = q{1 —F(r)} qU-JW0f+U-QM1-GUH This is well defined for T E [0, 1).This updated belief has the following properties. (LUQWU Lemma 1.1. q“(r)= q 1_G>que(o,1) 4+(1‘qlf_—F‘ lim7-_,1 qa(T) = 1 q (10 =_____=, q() q+1_q q qa’(T) > 0 for all T 6 (0,1) In the case that Party 1 candidate announces the policy b, the updated belief is, qFU) flWfl+ll-®Gfii (Lmq%d= This is well defined for T E (0, 1]. This updated belief has the following properties. Lemma 1.2. qb(T) = q C < q for T 6 (0,1) £1 + (1 — (1)—F“ 1iHIT—40 qb(T) : 0 b q 1 = ____._ = q() 9+0-a) q TWOZO Between qa(T) and qb(T), there is the following relations. From Lemma 1 and 2, the following is obvious. 10 Lemma 1.3. q"(’r) > qu _ a q"(T)-q (1-3) (1’0) —q (T) — qF+(1_q)G Party 1 candidate. (see figure 1.3) Party 1 candidate’s choice of T during the first election depends on three things; signal p, his expectation of voters’ expectation of T and Party 2 candidate’s mixed strategy 0 during first election. Let q“ and qb be Party 1 candidate’s expectation of voters’ updates (these come from the candidate’s expectation of voters’ expectation of T.) I consider the candidate’s choice of T depending on different values of c. Q“ qa(p) b Kq 1 (p) LP Figure 1.3 The case of c = 0: this is the case that Party 2 candidate announces the policy b during the first election. Then, the expected sum of rents of announcing the policy a 11 for Party 1 candidate with p is, {1 - H(-qa + {3)}[1 + 5p] The expected sum of rents of announcing the policy b is, {1— H(0)}i1 +6{1— H(0)}l = 31—5 It is beneficial for the candidate to announce the policy a if, {1-1‘1(—q“+fi)}l1+620]23-11E 2+6 a (1.4) => 1— m 2 H(-q H3) From this, the necessary minimum level of q“ for the Party 1 candidate with p to announce the policy a can be defined. Let g? (p) be the value of q“ defined from the above condition satisfied with equality. Then, the following result can be obtained. Since qa(0) = q < 5 and qa(1) = 1, this Lemma implies there is a crossing point The case of c = 1. This is the case that Party 2 candidate announces the policy a during the first election. Then, the expected sum of rents of announcing the policy a for Party 1 candidate with p is, {1- H(0)}l1 +6101 = % The expected sum of rents of announcing the policy b is, {1— H(—B+qb)}l1+6{1 — H(0)}i = 2T‘“’{1 — H(—fi+qb)} 12 It is beneficial for the candidate to announce the policy a if, 1+6 2+5 2102—2 {1-H(-6+qb)} 1+6p _ b > __ (1-5) =>H( 6+q)_1 2+6 Same as q‘l’, the necessary minimum level of qb for a candidate with p to announce the policy a can be defined from this condition. Let qll’(p) be the value of qb defined from the above condition satisfied with equality. For this qll’ (p), the following results can be obtained. Lemma 1.5. Unlike the case of qa(T) and q? (p), it is possible that there is no crossing point of qb(T) and q? (p) Between qi” (p) and q? (p), there is a following relation. Lemma 1.6. qi’fi?) 2 TM) The equality holds when p = 0.5. The case of c E (0, 1) is the combination of above two cases. Party 2 candidate. (see figure 1.4) Since Party 2 candidate has the same belief q as voters, it seems at first that he always announces the policy b. But he knows that the belief will be updated after Party 1 candidate announces its platform. Following the update, the probability of Party 2 candidate winning the first election by announcing the policy b changes. If Party 2 candidate believes that it is very likely that Party 1 candidate announces the policy a and that the updated belief after that is high enough, it might be beneficial for the candidate to announce the policy a. Notice that there is a trade-off between 13 the likelihood of Party 1 candidate announcing the policy a and the updated belief. Since Party 2 candidate does not have any private information about the probability of the true state being w], announcing the policy b is completely an Opportunistic behavior. But, since the outcome of the policy depends on the true state not on which candidate implements the policy, voters do not punish such an Opportunistic behavior in this model. a q2 [q 2(1) 1 / q_ 1,1) Figure 1.4 Again abusing the notation, I denote the value of T expected by Party 2 candidate as T. Then, Party 2 candidate thinks the probability of Party 1 candidate announcing the policy a during the first election is, q{1TF(T)}+(1-q){1-G(T)}= 1-qF(T)-(1-CI)G(T) 14 The expected sum of rents Of Party 2 candidate critically depends on the assumptions about what happens after Party 2 candidate winning the first election. I restate two related assumptions. The new candidate put up by Party 1 at the second election after the defeat Of the Party at the first election does not have any private informa- tion (actually it does not matter whether or not he has private information since no information can be transmitted credibly to voters during the second election.) Voters do not apply (A) to Party 2 candidate even if the candidate implements the policy a in the first period. Then, the expected sum Of rents for Party 2 candidate when the candidate an- nounces the policy a during the first election is, {1 — qF(T) - (1 - q)G(T)}H(0)[1+ 5151(0)] +{qF(T) + (1 - Q)G(T)}H(—,5 + qb(T))[1+ 611(0)] = [0.5 + {qr + <1 — qu}{H(-fi + A — 0.5}12—‘219 The expected sum of rents when the candidate announces the policy b is, {1 - qF(T) - (1 - q)G(T)}H(-q" + B)I1+ 6H(0)l +{qF(T) + (1 - q)G(T)}H(0)[1+ 511(0)] 2 + 6 = [H(_qa + 5) + {qF + (1 — q)G}{0-5 — H(€1a + 6)}17 Then, the candidate announces the policy a at the first election if, 0.5 + {qF + (1 — q)G}{H(—fi + qb) — 0.5} 2 H(-q" + [3) + W” + (1 - q)G}{0-5 - H(q" + 5)} (16) => W” + (1 - q)G}{H(-fi + q”) + H(-q" +13) - 1} 2 H(-q" + fl) - 0-5 Unlike two conditions for Party 1 candidate, this condition (1.6) has both q“ and qb. Using (1.3), this condition is converted into the one with only q“. 15 a (17) MP +(1— q)G}{H(—6 + q“ — (11,362”) + Ht—q“ + H) — 1} 2 H(-q" + B) - 0-5 Then, the minimum necessary level of q“ for a given T is defined as the value of q“ satisfyng this condition with equality. Denote the minimum level as q§(T). Since this condition is not easy to handle, I consider also the condition with a given value of qb. Given the fixed value of qb, the minimum necessary level of q“ for a given T is obtained from the condition satisfied with equality. Denote the minimum necessary level for a given qb as q‘2’(T; qb). Then, the following Lemma can be obtained. Lemma 1.7 For T E (0, 1), 3q§’(T;qb) > 8T (96130; (1") qu (ISO) 2 (1‘2"(T; (1’) (13(0; (1”) = (13(0) = B the value of T such that q§’(T) = 1 is lower than 1. 0 <0 1.4 Equilibrium Firstly, I show there is no equilibrium without voting behavior (A) in this model. Lemma 1.8. Suppose that voters vote during the second election according to their period preference without (A). Then, there is no equilibrium. Thus, (A) is a necessary condition in this model. Next, I state the Proposition of the equilibrium existence. Proposition 1.1. With (A), there is an equilibrium in the model. 16 There are four possible cases of the existence of equilibrium. Since these cases are rather complicated and do not give clear and intuitive characterizations of equilibrium, I concentrate on only one case that q‘2’(T) and qa(T) do not cross (see figure 1.5.) (I q , q [(120) 1 ./ CL ‘ q (r) [34 q— a i 4 q1(p’ 0 t, p t* 1 Figure 1.5 In this case, the equilibrium value of T and c are T* such that qa(T*) = qi’(T*) and c = 0. Party 2 candidate never proposes the policy a since qg(T) > qa(T); q"(T) is never high enough for Party 2 candidate to prOpose the policy. For this case, there is a simple sufficient condition. Let H (e), the distribution of e, be the uniform distribution (its support is assumed to be large enough.) Then, (1.7) becomes q 2 fl. Since 6’ > q, qé’ (p) is always larger than q“ (p) and Party 2 candidate never announces the policy a. Thus, the case like figure 1.5 is obtained. The equilibrium choice of T is determined at the crossing of qa(T) and q‘l’“ (p). Denote the equilibrium value of T as T* . Then, there is no guarantee that T* is equal to 6. T* could be higher or lower 17 than [5. Whichever it is, that means the inefficient choice of Party 1 candidate for voters. In this case, a more interesting result can be Obtained; the higher the ability of Party 1 candidate to discern the true state, the lower the equilibrium outcome value of T. I give a more formal explanation. As I wrote before, the cumulative distribution functions F and G can be considered as representing the ability of Party 1 candidate. Then, a high ability candidate is the one who is more likely to receive high values of p when the true state is w1 and less likely to receive high values of p when the true state is wO. Let F 8:67 and F&G be two different sets of distributions. Then, if 150051700) and C701) 2 0(1)) for any I) 6 (0,1) and if 170?) < F00) or @(p) > 0(1)) for some p 6 (0,1), then the candidate with F856} can be considered as having the higher ability than the one with F &G . Since voter’s updated belief is, qa(7.) = q{1 _ F(T)} q{1— Fng} + (1 - <1){1— 9(7)} — 1— C(T) (1+ (1 —Q)m the updated belief with the higher ability candidate is never smaller than the updated belief with the lower ability candidate, and strictly higher for some T. 18 (L a (11, q 1 a q (s=3) .J a ’3 q (8:2) (1‘ a S ta = i < > i <— i >l<>l< 1* l 1'" p 0 t 1: 0.5 1 Figure 1.6 In the figure 1.6, the two different curves of voter’s updated beliefs, q“ for s = 2 and q“ for s = 3 are drawn based on F(p) = p3 and G(p) = 1 -— (1 - p)3. With these function forms, higher value of 3 means higher ability. On (0, 1), the curve of q“ (s = 3) is above the curve of q“ (s = 2). Since the change of F and C does not affect q‘f, the equilibrium value of T moves to left. Since T* < B = 0.6 in figure 1.6, the outcome with s = 2 is inefficient as the policy b can guarantee the utility 6 to voters. The candidate with p E (T*, 6) exploits the voters’ trust on the ability of the candidate. Since T** < T*, with the higher ability candidate the inefficiency worsens in this case. I state this as a Proposition. Proposition 1.2. If H is a uniform distribution, the equilibrium exists with T" such that qa(T*) = qi’(T*) and c = 0. Moreover, in this case, the higher the ability of Party 1 candidate, the lower T*. 19 This proposition is not saying that having a higher ability candidate is always bad. If the ability of current candidate is so low that T* with him is to the right of B, changing him with a higher ability candidate could help voters with T* moving closer to [3. However, changing candidates with higher and higher ability candidates, T* eventually passes 6. After that, higher ability candidates make things worse for voters. 1.5 Conclusion Of Chapter 1 In this chapter, I considered the platform choice in a two-election—and—two—period model under uncertainty with one party having information advantage. Most papers in the literature of the policy choice with the uncertainty consider models with only one election. Since government offices usually allow multiple terms with multiple elections, candidates must consider about the future elections. Thus it is worth considering the policy choice of candidates considering future elections as well as the current election. Harrington (1993b) also considers a two-election model. However, the interest of the paper is whether politicians will keep their campaign platform after they win the election, which is different from this chapter since the interest of this chapter is on what platform a politician will choose. This chapter assumes strong assumptions. The assumption that only one party receives the signal is clearly a strong one. In the literature of the information trans- mission during the election, the more common assumption is that both parties receive signals (Heidhues and Lagerof (2003) and Bernhardt, Duggan and Squintani (2007), etc.) However, it is not clear that which is more natural assumption. During elec- tions, often one party has advantage over the other party (see the Rasmussen reports (June 21, 2008).) The assumption is a simplified description of such reality. The other strong assumption is that voters apply (A) only to Party 1 candidate. When Party 2 candidate implements the policy a and its outcome is failure, the voters does 20 not punish the candidate. Since Party 2 candidate’s choice of policy a is completely an Opportunistic behavior, this seems unrealistic. It seems very likely that voters punish the candidate strongly. However, it does not change results qualitatively. The equilibrium value of T depends on functions and parameters. So, there is no reason to expect T* = 5. If T* < [3, the candidate with p E (T*,B) announces the policy a while believing the policy is actually worse than the policy b. Thus, this could be considered as the inefficiency result. Similar inefficiency happens when T* > 6. Then the interesting result of this chapter that the higher the candidate’s ability the lower the equilibrium value of T in the case I explained in the previous section means that a higher ability candidate could make the inefliciency worse. This happens because the high ability of the candidate causes voters to update their belief higher, which gives more opportunity to the candidate to exploit. Though the result is interesting, to obtain more general result, it might be necessary to consider if the same result would happen even when two parties receive signals, which is a future research topic. 21 Appendix of Chapter 1 Lemma 1.1. 90(7) = q > q for T 6 (0,1) 1 - G q + (1 — 901—17: limT—d (10.0.) = 1 q a 0 = —-—-—-—- = _ q() q+1_q q qa'(T) > 0 for all T 6 (0,1) Proof. Since C(p) > F(p),1—E%g% < 1 for p 6 (0,1). Then, qa(T) = q > q for T 6 (0,1), and q “(0): ——i—-——— — q. From L’HOpital’s q+1-q 1—G F Inwandumnslaéil (fl = 00, limT_,1q"(T) = 1. The derivative of qa(T) with respect tOTis, al _ Q{1_q} 0,1 _—__F_ q (T) [q{1-F}+(1—q){1—G}12F{7G” —G 1 —F Since limT_,0— G =0 andi —G > 1 for T 6 (0,1), q“’ (T) is positive1 for low value —F of T. Suppose that q"’ (t);0 at T = t 6 (0,1). This implies -FT:1—_——é - 1< _.O dl—F F' G’l —F_ dG’l—F _1—FdG’ Then, d—T1—__G— 1—G’F—71———’—<0' Wit this]; [F —GdTF’+ G d 1— F d C" . . . G 1— F1 F72; 1 _ G < 0 since dTF7 < 0. G’This implies, if F71_——67 — 1<0at a certain value 01 F of T, for all T’ > T, the value of F7I———G_ — 1 < 0. Then, q"(T) is downward sloping for T’ 2 T. Since lim.,_,1 qa(T) = 1, this is a contradiction. Thus q"’ (T) > 0 for all T E (0, 1). The updated belief qa(T) goes up from q to 1 as increases from 0 to 1. I Lemma 1.2. qb(T) = q G < q for T 6 (0,1) q+(1-®F limT—->0 qb(T) = 0 b __ q = guy—0+0-Q) q leZO 22 q m+U-q%- F I m— — q. From limT__,0 5—7—8: 3— - O, limT__,0 qb(T) = 0. The deriv- . . . 1— q} F’ G . ative of qb T With respect to T 13 q I” T = (A F G' —— — 1 . Since ( ) ( ) [qF +(1— (1)012 [ ] G’ F G — 00 and f > 1 for T E (0, 1), qb’ (T) is positive for T close to 1. Moreover, Proof. From the definition, qb(T) = < q for T 6 (0,1). Clearly, $0)= I ‘7— 10:" G , F’ . G _ G’ , F’ , G limT_,0— G’ F— = 11m7-_,0 a7[lim7_,0 f — lim7-_.0 F7] = llm7-__,0 a7[hm7._,0 F — Fl limT_,0 37:] = 0. Suppose that qb’(t) = O at T = t 6 (0,1). This means 6%- — 1 = 0. d ’F F_’G d F’ d F’ d_% : %2_[—G' —]— "‘ O. Wlth thlS, 'd—'[-G—’%— — gd—‘b—i > 0 SlIlCG > 0. Since q b(0) = 0, this guarantees q b’(T) 2 0. Thus, the updated belief limT_,1 Then, dF’ dTG—’ qb(T) goes up from 0 to q as T increases from O to 1. I Lemma 1. 4. 8p <0 (11(0 )> B > CF11“) 2 +6 a Proof. (1.4) 1 — m 2 H(—q +5) The left hand side of (1.4) is the increasing function of p. The right hand side of (1.4) is decreasing function of q“ as H’ (s) > 0. Thus, from the implicit function 3__q1 _ 5{2 + (5} ’ 6p 4{1 + 6p}2H’(—q a + a) qi”(0) > B > qi”(1), substitute p = 0 into the left hand side of (1.4),1 < 0. To obtain _ (2 + 6) theorem, the derivative of qi‘(p) < 0.5. Since H (0) = 0.5, —qa + ,8 < 0. Thus, qi‘(0) > H. Substituting p = 1 into the right hand side of (1.4), H(—q‘1‘(1) + B) = 1 — —2+—6— > 0.5. Then, 5 > q‘1’(1). I 4(1 + 6) Lemma 1.5. aqll’ 3p— < 0 qWM>B>qflU Proof (15) H(—fi+ b) > 1 — 1 +61) ' ' q — 2+ 6 The left hand side of (1.5) is the increasing function of qb. The right hand side 23 of (1.5) is decreasing fimction of p. Thus, from the implicit function theorem, the 6 (2 + 5)H’(-b’ + qb) To get this result, substitute p = 0 into the right hand side of (1.5), 1 — -——L— > 0.5. < 0. Moreover, ql1’(0) > ,8 > q'l’(1). 8 derivative of qll’ (p), 5g: = — 2 + 5 Then, —5 + qb > 0 => qf(0) > 3. Substituting p = 1 into the right hand side of (1.5), 1 (5 1— 3:75 < 0.5. Then, H(—B + q’l’(1)) < 0.5 => q’1’(1)< 6. I Lemma 1.6. (1%?) Z q'f(p) The equality holds when p = 0.5. Proof. To show this result, solve (1.4) and (1.5) satisfied with equality for q‘f (p) 2 + 5 1 + 6p b a _____ _ -—1 _ __ b = —1 _ 311d 91(Pl- Then, (11(1)) l8 H (1 4(1+ 619)), and (11(1)) fl'l'H (1 2 + 6 )- _ 2+6 _ 1+6p . . . a __ b = _ 1 _ 1 _ Thus ql (p) q1(p) {H (1 4(1 +6p))+H (1 2 +5 )}. SinceHissymmetric and H’1(k) 2 (<)0 4:) k 2 (<)O.5, the sigh of the left hand side is same to the sign 2+5 1 +510 52 2 f—1————0.5- 1— —O.5 = 2—1 0’ { 4(1+6p) } { 2+6 } 4(1+6p)(2+6)(p ) => q‘1‘(p) Z q’l’(p) (the equality holds at p = 0.5.) I Lemma 1.7. For T 6 (0,1), 34%(T; (1”) 8T aqfofh; qb) 6gb (13(7) 2 43(7; 61") (13(0; qb) = (13(0) = B the value of T such that q§l(T) = 1 is lower than 1. Proof. (1.6) {qF+(1—q)G}{H(—fi+qb)+H(—q“+fi)—1} 2 H(—qa+fi)—0.5 5q§‘(T;qb) 2 8T >0 <0 The derivatives of q§"(T; qb) with respect to T and qb for T E (0, 1) are, {qF’ + (1 - q)G’}{1 - H(-fi + (1”) - H(-qa + 6)} {1 - qF - (1 - a)G}H’(—q“ + fl) The sign of this is same to the sign of 1 — H(—6 + qb) — H(—qa + 5). Since 24 [3 2 qb, q“ > B guarantees that the sign is positive. Unfortunately, it is possible that B 2 q“. Suppose this. Since H is symmetric, the sign is same to B — qb + q“ - [3 = a a a_bTh - - a_b: a,_q-{1-qF-(1-(I)G}q = q -q > q q. eSignis,q q q qF+(1—q)G qF+(1—q)G“0 Only case this is zero is q“ —- q = 0. This implies T = 0. It is easy to obtain W21 (T; (1") (W + (1 - Q)G}H'(—fi + q”) 3q§(r; qb) . . —- = - < 0. Th —— < O h aqb {1 — qF — (1 — q)G}H'<—qa + a) 9‘" aqb ‘mp es q‘2"(T) 2 qg’(T; q). If T = 0, the condition (1.6) becomes, 0.5 2 H(—qa +fi) => q“ 2 B. This does not depend on the value of qb. So, q§‘(0;qb) = q3(0) = B. If T = 1, the condition (1.6) becomes, H(-fi+qb)+H(-qa+fl)- 1 2 H(-q“+fi)—0-5 => H(—5+qb) 2 0.5 =>qb2B Sincefi > q 2 qb, this condition is never satisfied. This implies that the value of T such that q§(T) = 1 is lower than 1. I Lemma 1.8. Suppose that voters vote during the second election according to their period preference without (A). Then there is no equilibrium. Proof. No (A) means that the probability of winning second election after the implementation of the policy a by Party 1 candidate is same to the one after the implementation of the policy b. Thus, the expected second period rent is same for both cases. This implies the decision-making during the first election is same to the case there is only one period. Then, there is no reason for voters to update their belief. This is the reason of no equilibrium. The following is a formal proof. Suppose that there is an equilibrium without (A). Let T* and c* be the equilibrium value of T and 0. Let qa* and qb* be the updated beliefs at the equilibrium. The expected sum of rents for announcing the policy a is, (1 - C”‘)(1 - H(-qa* + 6)}[1 + 5(1- H(0)}l + 0*{1- H(0)}[1 + (5(1 - H(0)}l The expected sum of rents for announcing the policy b is, 25 (1 - C"‘){1 - H(0)}[1 + 5{1 - H(0)}] + C"‘{1 - H(-6 + qb*)}[1 + 5{1 - H(0)}l The candidate with p announces the policy a if, (1 - c"‘){1 - H(—q”* + a} + 6*{1- H(0)} 2 <1— c*){1 — mm} + c*{1— H<—/3 + q”*)} => (1 - c"‘){H(0) - H(-q“* + {3)} + 6*{H(-fi + 61’”) - H(0)} 2 0 => 11(0) — H(-q“* H3) +c*{H(—fi+qb*) +H(—q“* +6) - 1} 2 0 Since this condition does not depend on p, if this condition holds for some Party 1 candidate with p, it holds for any Party 1 candidate. This implies T = 0. Since qa*(0) < q < B, H(O) — H(—qa* + B) < 0. As shown in a previous footnote, H(—B + qb*) + H (—q"’* + B) — 150. Thus the condition does not hold. This implies that Party 1 candidate never announces the policy a. Then, it has to be T* = 1. In this case, c* has to be 0. Since H (O) — H (-q“(1) + B) > 0, the condition satisfied, a contradiction. Proposition 1.1. With (A), there is an equilibrium in the model. Proof. Since different sets of functional forms and parameters produce different kind of equilibrium, I cannot obtain a simple general result. So I am going to show the existence of equilibrium in all possible cases in turn. The first case is the simplest one. Since q3(0) = B and q§(T) reaches 1 before reaches 1, and q“(0) = q and qa(1) = 1, it is possible that q§(r) and qa(T) do not CI‘OSS. The Case 1: qg'(T) and qa(T) do not cross. Since q‘1‘(0) > B > q‘f(1) and qa(0) = q and limT_.,1 qa(T) = 1, q? ) and qa(T) always cross. Moreover, since they are monotone, the crossing point is unique. Let the point he (T*, q‘”). Then, T = T* E (0, 1) and c = 0 is the equilibrium. Since the assumption that q§(r) and q"'(T) do not cross implies q§(r) > qa(T), clearly it is never 26 beneficial for Party 2 candidate to announce the policy a during the first election. Thus c = O. This implies that (ll) (p) is not relevant to Party 1 candidate’s choice 0. of platform. Only qi”(p) is relevant. Since 951—qu < 0, for p < T*, the updated belief q“* is lower than the necessary minimum level of the updated belief qi‘ ). Thus, the candidate with p < T* does not announce the policy a. On the contrary, for the candidate p > T*, q‘” > q?(p). Thus the candidate with p announces the policy a. So, the set of T = T* and c = O is the equilibrium. The case of that qg(T) and qa(T) cross is more complicated. In the case, both qi"(p) and q? (p) are relevant to the choice of Party 1 candidate, but not the entire part of them. Let Q E {T 6 [0, 1] : q‘2‘(T) 2 qa(T)}. Then, on Q, q? (p) is relevant to Party 1 candidate’s choice of platform because on Q Party 2 candidate announces the policy a.4 On (20 E {T E [0, 1] : q§(r) < qa(T)}, qi”(p) is relevant to Party 1 candidate’s choice of its platform since Party 2 candidate does not announce the policy a. Depending on functional forms and parameter values, this 9 may be a convex set or may be a collection of disjoint convex sets. Let {2* C Q be a convex set such that w* E minw E {2* and w** E maxw E 0* are boundary points of 9. Since q§‘(T) reaches 1 before T reaches 1, w** < 1. It may be or may not be 9* = D. To show the existence of the equilibrium, I only need to consider 9*. So, in the following, I simply assume Q = 9*. The Case 2 (Figure 1.7): q§(T) and q“ cross, and qf(p) and qa(T) cross on QC and/or qll’(p) and qb(T) cross on Q. 27 a 1 “\q (I) \ q‘,’ qb(T), that q‘1’(p) and q“(p) do not cross on QC implies that qi’(w*) > qa(w*) and qa(w**) > q‘1’(w**). Then two cases are such that; the case of qb(w*) > q?(w*) and the case that q’l’(w**) > qb(w**). The Case 3 (Figure 1.8): q§‘(T) and q“ cross, and qb(w*) > q’l’(w*), no crossing between q‘1’(p) and qa(T) on QC nor between qll’(p) and qb(T) on Q. a a q ,q’0 q\2(p) \ E \\qa(r) 13— i q qbfilf‘ /qt1’(p) ql1’(p) ’ i O (0* g 1 r,p Figure 1.8 In this case, T = w* and some mixed strategy c 6 (0,1) is an equilibrium. Since a q1(w*) > qa(w*), if c = 0, announcing the policy a is not beneficial for Party 1 29 candidate. But qb(w*) > q’1’(w*) implies that if c = 1 it is beneficial for Party 1 candidate to announce the policy a. Since Party 1 candidate with p = 0.2” is not indifferent between announcing the policy a and announcing the policy b if c = O or c = 1, T = w* and c = 0 (or c = 1) is not an equilibrium. But there is a value c* 6 (0,1) such that T = w* and c = c* is the equilibrium. Let q"* E qa(w*) and qb* = qb(w*). Given some mixed strategy 0, the expected sum of rents of Party 1 candidate for announcing the policy a is, 1+6p (1 — c){1- H(—q“* + {3)}{1 + 6p} + c 2 The expected sum of rents for announcing the policy b is, 2 (1— a2 :1" 5 + (Fig—"p _ H(—B + qb*)} Party 1 candidate announces the policy a if, <1— c){1—- H<—qa* + 5)}{1 + 6p} + c1 1” 5” 2 (1 —c)2+" +33% — H(-fi+q”*)} : <1— cm - H<—qa* + 3)}{1 + 6p} — 33—9-1 +412” — 2:5{1 — H(—B+qb*)}] 2 0 With T = w*, the coefficient of (1 — c) is negative, and the coefficient of c is positive. Denote them as (—) and (+). Then, the condition is, (1 - C)(—) + C(+) = (—) + C{(+) - (-)l 2 0 From this, the value c* E (—+-_—(__—()_—) E (0, 1) is defined as the value that satisfies the condition with equality. Then the set of T = a2” and c = 0* is an equilibrium. Party 1 candidate with p = w* is indifferent between announcing the policy a and announcing the policy b. Since both q‘f (p) and qll’ (p) are decreasing functions of p, the values of (+) and (—) above increase with p. Thus, Party 1 candidate with p 2 w* announces the policy a. At same time, this means that Party 1 candidate with p < w* does not announce the policy a. For Party 2 candidate, since the fact that w* is a point of 30 9 means q‘2’(w*) = q“(w*), Party 2 candidate is indifferent between announcing the policy a and announcing the policy b. The Case 4 (Figure 1.9): q§(T) and q“ cross, and q’l’(w**) > qb(w**), no crossing between qi’ (p) and qa(T) on QC nor between qll’ (p) and qb(T) on Q. a a b q2(\p) \ \qa(t) .Q C" A c-i v /q11)(p) LP Figure 1.9 In this case, the combination of T = w** and some mixed strategy 0 6 (0,1) is an equilibrium. The proof is almost same to the Case 3. Since qa(w**) > q‘1’(w**), if c = 0, announcing the policy a is beneficial for Party 1 candidate. But qll’(w**) > qb(w**) implies that if c = 1 it is not beneficial for Party 1 candidate to announce the policy a. Since Party 1 candidate with p = w* is not indifferent between announcing the policy a and announcing the policy b if c = O or c = 1, then T = w” and c = 0 (or c = 1) is not an equilibrium. But there is a value c** E (0, 1) such that T = w** 31 b** _ an E qa(w**) and q = qb(w**). Given some and c = c** is the equilibrium. Let q mixed strategy c, the expected sum of rents of announcing the policy a for Party 1 candidate is, (1 - c){1 —— H(—qa** + 5)}{1 + 6p} + c1+ 6p The expected sum of rents of announcing the policy b is, 2 6 2 5 M j; +c—J25—{1-H(—B+q”**)} Party 1 candidate announces the policy a if, <1— c){1 — H(—qa** + 5)}{1+ 6p} + .1 + 6’ 2 (1 - c)2 + 6 + 03121?“ — H(—B + qb**)} => (1 -— c){{I — H(-q"'** + mm + 6p} — 11—61 +411,” — 33—511— Heal—gran 2 0 With T = w**, the coefficient of (1 — c) is positive, and the coefficient of c is negative. Denote them as (+) and (—-). Then, the condition is, (1- C)(+) + C(-) = (+) + C{(—) — (+)} Z 0 From this, the value c** = —-fi—)—— 6 (O, 1) is defined as the value that satisfies + _ _ the condition with equality. Thefi th(e s)et of T = w** and c = c** is an equilibrium. Party 1 candidate with p = w** is indifferent between announcing the policy a and announcing the policy b. Since both q‘f (p) and qll’ (p) are decreasing functions of p, (+) and (—) above increase with p. Thus, Party 1 candidate with p 2 w* announces the policy a. At same time, this means that Party 1 candidate with p < w* does not announce the policy a. For Party 2 candidate, since to“ = maxw 6 52 means q‘2‘(w**) = qa(w**), Party 2 candidate is indifferent between announcing the policy a and announcing the policy b. Thus the set of T = w** and c = c** is the equilibrium. Since above four cases cover all possible cases, the proof is completed. I 32 Endnotes of Chapter 1 1. The uncertainty is not restricted only to the policy outcomes in the literature. The uncertainty about the ability of politicians is often assumed (Majumdar and Mukand (2004), etc.) Cukierman and Tommasi (1998) and Schultz (2002) consider the uncertainty about the preference of politicians as well as the policy outcomes. 2. According to the Rasmussen reports (June 21, 2008), voters trust the Demo- cratic party more than the Republican party on ten key issues including Economy, National Security, and the war on Iraq. Since the natural interpretation of this "trust" is competency, this result could be considered as implying that voters assume the abil- ity gap between two parties. Though even this interpretation is actually inconsistent with the model of this chapter, at least this implies the meaning of considering the asymmetry between two parties. 3. As I wrote, this is a simplified, short cut way of modeling the signal. Following Majumdar and Mukand, this can be stated formally as the following; at first Party 1 candidate has the belief q same as voters. Then the candidate receives a random signal 3:, which is in [_:r_, is]. This random signal a: has a density 450 if the state is l w0 and a density ail if the state is wl. I assume that limxnzg (LO—{:2 = 00 and 1 ¢ (:1?) limxng Z—ELP—i = 0. Then from this signal, the candidate deduce the new belief p of a: the state being w1 using the Bayesian rule, q¢1(x) W) = q¢1($) _ we) q1($) + (1 - (1)9500?) (1)1(33) 4—5—— + (1 — (I) 1 ¢ (:8) a5 (I) Assuming O :1: is strictly increasing as a function of :13, :1: has a one to one corre- spondence. Then, from (151 and (to, two distribution F and G can be constructed. 4. Of course, on the bound of Q, Party 2 candidate is indifferent between announc- ing the policy a and announcing the policy b. So, the candidate might not announce the policy a on the bound of 9. Actually, the condition that they are indifferent is 33 necessary for the existence of an equilibrium. 34 References [1] Adams, James AND Samuel Merrill III (2008): “Candidate and Party Strategies in Two-Stage Elections Beginning with a Primary” American Journal of Political Science, 52(2), 334-359. [2] Bernhardt, Dan, John Duggan AND Francesco Squintani (2007): “Electoral com- petition with privately-informed candidates” Games and Economic Behavior, 2007, 58, 1-29. [3] Cukierman, Alex AND Mariano Tommasi (1998): “When does it take a Nixon to go to China?” The American Economic Review, 88(1), 180—197. [4] Harrington, JR., Joseph, E. (1993a): “Economic policy, Economic performance, and elections” The American Economic Review, 83(1), 27-42. [5] Harrington, JR., Joseph, E. (1993b): “The Impact of Reelection Pressures on the Fulfillment of Campaign Promises” Games and Economic Behavior, 5, 71-97. [6] Heidhues, Paul AND Johan Lagerof (2003): “Hiding information in electoral competition” Games and Economic Behavior, 42, 48- 74. [7] Majumdat, Sumon AND Sharun W. Mukand (2004): “Policy Gambles” The American Economic Review, 94(4), 1204-22. [8] Persson, Torsten AND Guido Tabellini (2002): Political Economics Eaplaining Economic Policy, The MIT press. [9] Rasmussen reports (2008): “Trust on Issues Democrats Trusted More on All Ten Electoral Issues” http: / / www.rasmussenreports.com / public_content / politics / mood__of_ america /trust_on_issues/trust_on_issues [10] Schultz, Christian (2002): “Policy biases with voters’ uncertainty about the economy and the government” European Economic Review, 46, 487-506. [11] Schofield, Norman (2005): “A valance model of political competition in Britain: 1992-1997" Electoral Studies, 24, 347-370 35 2 Information Transmission from Overlapping Generations Parties 2.1 Introduction This chapter considers a model of electoral competition between two overlapping- generations—organization political parties with uncertainty about policy outcomes. The interest is on how to restrain opportunistic behavior of politicians who are in their last period. Suppose that there is an information asymmetry between politicians and voters of which politicians can take advantage. In multi—period models, usually the concern about future elections can restrain opportunistic behavior of politicians to some degree (Austen-Smith and Banks (1989)). This is an intuitive result. However, what would happen if they are in their last period so that they do not need to worry about future elections? It seems that in such situations opportunistic behavior of politicians, i.e., moral hazard, is inevitable. However, unless political parties consisted only of members who would retire soon, not everyone in parties would be happy to see such opportunistic behavior. This suggests that there would be conflict of interests in parties that voters could exploit to restrain politicians. Though the problem of Opportunistic behavior by politicians in their last periods sounds very artificial since not all politicians in power are in their last period / term anyway, what I want to show is that the natural party structure consisting of politicians with various political life spans can give voters some leverage to restrain party bosses. The opportunistic behavior of politicians in their last period is a good example to show such possibility. 36 For this objective, I employ a model with OLG political parties that follows Alesina and Spear (1988) and Harrington (1992). Traditionally, literature of Political Economy has not paid much attention to the difference between politicians and political parties. Parties in many electoral compe- tition models (Calvert (1985), Wittman (1983), etc.) are nothing but labels put on candidates. Although there are also many papers in which parties are main players and different subjects from politicians, in those papers parties exist as independent and consistent entities (for example, Grossman and Helpman (1996).) However, par- ties in real world are organizations consisting of many individuals who often have different preferences5. Then, recognizing parties as organizations of individual politi- cians, not as independent entities, a natural question is what implication parties have on the behavior of politicians and election outcomes6. Recently, papers are appearing that consider this question. Snyder and Ting (2002) formalize the intuitive idea of political parties as brand names and shows how parties can work as signals about preferences of their candidates to voters. Cail- laud and Tirole (2002) model the intra-party competition among politicians in the same party and study what kind of impacts such competitions have on the trust of voters for politicians and on elections. Levy (2004) considers the endogenous party formation through stable coalition formations and shows that the existence of par- ties increases the number of credible election platforms, i.e., that the existence of parties makes some outcomes feasible that would not be possible otherwise. Most of papers are finite period models. Alesina and Spear (1988) and Harrington (1992), however, consider infinite period models in which parties are overlapping-generations- 37 organizations. They consider the problem of policy commitment. This chapter is one of such papers, and I consider the problem of the informa- tion transmission from politicians to voters. In short, this is a problem about if politicians lie or not. I show the party organization can help voters restrain oppor- tunistic behavior of the party boss. The situation I consider is this; there is the uncertainty about policy outcomes. No one is sure about outcomes of policies until policies are really implemented. This is a natural assumption for government policy decision-makings. The outcomes of almost any political decisions, especially big ones, are uncertain. Consider big regulation changes, or tax system changes. Maybe the decision of launching a war is the most evident example. In almost any policy decision- makings, there is the uncertainty about policy outcomes. However, it can be safely assumed that politicians have the informational advantage about possible outcomes over voters. In democracy, politicians choose policies and voters choose politicians. Thus, to achieve good policy outcomes, it is critically important to make sure that politicians honestly reveal their information to voters during elections. Given the in- formational asymmetry, however, this is not guaranteed. There are many papers on this problem (Harrington (1993), Majumdar and Mukand (2004) and Heidhues and Lagerlof (2003), etc.) in finite period models. Exact results depend on the structure of models. But the general conclusion is that it is possible that politicians choose policies of which they know outcomes are likely to be bad for voters since such poli- cies are their favorites or since choosing those policies increase the probability of their winning the next election. As an example, take Heidhues and Lagerlof (2003). They consider a one—period model in which two parties go to an election with their election 38 platforms. The outcome of implemented policy depends on the state of the world, and there is the uncertainty about the state of the world. Parties (or candidates) receive private signals about the state of the world before the election. Candidates have no preference on policies. Voters, however, have a belief on which state is likely. Since the objective of the candidates is winning the election, candidates have the incentive to follow voters’ belief. Thus, it is possible that candidates would propose a policy that they know is not good for voters. This result is intuitive and also individually rational for candidates. But, looking at this from the viewpoints of their parties, not from the viewpoints of individual candidates, this is clearly a bad result. Suppose that an elected politician in his last political term implements his proposed platform and a bad outcome realizes. Though the politician does not need to worry about the next election, the party of the politician will have to face voters in future elections. Then, it is likely that party will be disadvantaged in the next election, i.e., the lower probability of the party winning the election (because of the retrospective voting be— havior.) Thus, there is a conflict of interests in the party; the conflict between current candidates and other party members (especially candidates who will go to the next election). I consider a possibility of exploiting this conflict for voters’ benefit. Since this is an electoral competition model, not only parties but also how people vote is important. I assume voting depends on party reputations. By “party rep— utation,” I mean the reputation on a party that the party is revealing information honestly to voters or not. This party reputation depends, in part, on the behav- ior of party insiders. To understand how this works, suppose an election between two parties. Then, pick a voter who does not have any particular party affiliation. 39 Suppose also that the voter is given an opportunity of touring the headquarters of both parties. Then, in one party’s H.Q., the voter finds that all staff are shirking and complaining about their candidate. In other party’s H.Q., he finds that all staff are working hard and talking about what great works their candidate can do for the country. Then, during the election, for which party will he vote? It is very likely that he will vote for the party with hard-working staff. Next, suppose that a candidate from one party chooses a policy based only on his own self-interest and does not care about what will happen to the party in the future. Then, will the staff work hard for the candidate? It seems unlikely. This suggests a possible route of a signal from the party to voters and a possibility of restraining the opportunistic behavior of current candidates through the route, especially through future candidates. In the model, I assume that young politicians are supposed to make the election campaign effort for old politicians during elections. The effort causes negative utility to young politicians. But they still make the campaign effort since that helps their party and strong party will help them in future. If old politicians make the choice that hurts the party, however, the choice will reduce the young’s willingness to make the campaign effort. The campaign effort is assumed to be observable to voters. At first, I assume that voters require a certain level of effort (this will be changed later.) The required effort level is the required disutility level for young politicians during elections. Then, I consider about the reaction of young politicians to the opportunistic behavior of the party boss and if it can restrain such opportunistic behavior. In section 2.2, I explain the model, the behavior of politicians and the definition 40 of equilibrium. I state results in section 2.3. Section 2.4 is the conclusion with a brief literature review. All proofs and calculations are in the Appendix of this chapter. 2.2 Model and Behavior of Politicians Before explaining the model, I state a brief description of the flow in each period. At the start of each period, the nature chooses the state of the world in the period. Then, two parties receive the same signal about the state of the world. Old politicians in both parties choose their election platforms, and young politicians choose campaign effort levels. Observing effort levels, voters update their party reputations and vote for the party during the election based on the party reputations. The winner of the election implements the policy of its election platform. The outcome of the policy is observed by voters and they update party reputations again. In the reputation update, basically, party reputations get downgraded as punishment for not making campaign efforts (suggesting suspicious behavior of old politicians) and for choosing a bad policy (i.e. retrospective voting). These two punishments are instrument and incentive for young politicians. The reputations are carried over to the next period. To not have a bad party reputation in the next period, young politicians use the campaign effort as their instrument to restrain old politicians. The party reputation in this model means the belief by voters about if the party honestly reveals its information to voters (i.e., choosing the good policy as its platform given the information) or not during the election. Basically, old politicians want to abuse the good reputation and young ones want to keep the good reputation for their 41 election in the future. This “reputation” is different from the one in the literature of “reputation” like Tadelis (2002), which is a paper of an OLG reputation model. In this chapter, there is no “good” politician and no “bad” politician. Party reputations are about if the party is trustable or not. Also, even though reputations are updated as voters receive new information, the update is not Bayesian. Following Harrington (1992), the reputation and its update in this model are very simple, and are given by an assumption7. This is because assuming the Bayesian updating in this model makes the model unnecessarily complicated; more elaborate update systems naturally lead to more strategic behavior among politicians, even assuming non-strategic behavior of voters. Since such complication is not essential to the idea of this chapter, I assume a simple “reputation” structure in the following. 2.2.1 Model Society and its members: The society consists of voters and politicians of two parties (party 1 and party 2.) They are risk neutral. All voters are same and they do not have any particular affiliation with either party8. Voters care only about the current period utility; they do not try to maximize the expected utility over the infinite period. This may be because they are short-lived or myopic. In any interpretation, this assumption rules out the possibility of their employing complex strategies. Parties and politicians: Party 1 and 2 are modeled as overlapping-generations- organizations with an old politician and a young politician in each party in each 42 period. Following Alesina and Spear (1988), I call the old politicians at the period t as Presidential candidates (PCs) at the period t. They decide the party platforms of the period. Correspondingly, I call the young politicians at period t as Vice Presidential candidates (VPCS) at period t. VPCS make the campaign effort for their parties. The effort is observable to voters and causes the disutility to VPCS. I call politicians by the words “PC” and “VPC” in regardless of their being in an election or in office. A generation t politician of party i 6 {1,2} enters into the party at period t as a VPC and becomes a PC of party i at period t + 1. Then he retires and leaves the party. All politicians in a party are same except for their generations. Then, in the election at the start of each period, old politicians in two parties choose the election platform of their parties and young politicians make the campaign effort. Assuming the policy commitmentg, the winning party will implement its platform after the election. Policies and the state of the world: There are two possible policies: a1 and a2. Because of the policy commitment, the election determines which policy will be implemented in the period. However, the policy outcome depends on the state of the world, and there is the uncertainty about the true state. The number of possible states of the world is two: w1 and w2. The state of the world is chosen by Nature at the start of every period with probability % for both states. This is common knowledge. If the policy of (i E {1, 2}) is implemented in the state wi, then voters receive utility 1. If the policy ai is implemented in the state wj (j aé i), voters receive utility ——1. Thus, voters can tell the true state in the period after a policy is implemented. Information: Before the election in each period, parties receive a common signal 43 about the state of the world in the period. From the signal, politicians can derive the probability of the true state being wl. For the simplicity, I assume that parties receive the probability directly as the signal. Let r be the probability. This r is a random variable with the uniform distribution“). Voters do not observe the signal. Campaign effort: during elections, only VPCS can make the campaign effort. VPCS suffer the disutility c to make the effort c. Voters can observe the campaign effort. Since the probability of the realization of each state is common knowledge and voters can perfectly observe party platforms after they are proposed, the campaign effort does nothing on informing or deceiving voters about abilities of parties or platforms. The role of the effort in this model is that the level of the effort can work as a signal from VPCS to voters about the honesty of PCs. However, if c is too B high, VPCS never make the effort c. So, I set the upper bound of c at Z. Utilities of politicians: If maximizing the welfare of voters is the objective of politicians, parties would propose their platforms honestly, i.e., proposing policy a1 when they receive signal r > :— and a2 when r < %. But that is not the objective of politicians. Preferences of politicians are different from voters. Since voters do not observe the signal, politicians have incentive to lie to voters. Party i 6 {1,2} politicians receive (dis)utilities at three occasions in their two—period political life. Politicians’s (dis)utility: 1. Winning election: when Party i wins the election at period t, the PC of party i at the period receives a fixed benefit B > 1. 2. Implementing policy: PC of party i receives utility 1 when he implements 44 the policy al and receive —1 when he implements aj, j 71$ i. If party j politician implements a policy, party i politician receives zero utility“. VPCS receive zero utility fiom the implementation of policies. 3. Making campaign efforts: If VPCS make the campaign effort c during an election, this gives utility -—c to them. PCs do not make the campaign effort. Thus, PCs and VPCS do not receive same utility in same period”. The objective of politicians is the maximization of the sum of utilities from two periods. Since polit- ical life is just two periods, I do not use a discount factor for calculations of politicians’ utility. Although this specification is for simplifying the model, this reflects reality to some degree except the assumption that PCs do not make the campaign effort. What is important for the model is, however, that they receive different utilities in same period. In the model, Party 1 and Party 2 are mirror images of each other. PCs of Party I prefer the policy a1 to the policy a2 and PCs of Party 2 prefer a2 to al. Both parties receive the same probability r of w1 being the true state as a signal. Also each state of the world realizes with the same probability %. Since voters are assumed to have no affiliation with either party, they are basically indifferent between parties. However, they have preference on party strategies. If Party 1 is honest on the choice of platforms (r 2 :- —+ a1 and r < :— ——> a2) but Party 2 is dishonest (always proposing the policy a2), voters prefer honest Party 1 to dishonest Party 2. So, if they thought Party 1 was more honest than Party 2, they would have a derived preference for Party 1. Since party strategies are unobservable to voters, however, voters need a something 45 that can induce politicians to behave honestly. That “something” is the reputation on parties and the voting response based on it. Appropriate voting responses based on party reputations give incentives for VPCS to restrain the opportunistic behavior of PCs, though there are no honest politicians in the model. When new information about parties is obtained, the reputation on parties will be updated. I assume very simple strategies for parties and only two types of party reputations. Party reputation and Voting response: Voters have only two types of rep- utations on parties; Honest (H) or Suspicious (S). Honest means that voters think that the party is likely to pr0pose a1 when r > g and a2 when r < % (when r = %, choice does not matter.) Suspicious means that voters think that the party is likely to propose its preferred policy regardless of the signal the party receives. Notice that reputations are on parties, not on a particular politician, so this is not about politicians’ type. This reputation system is very coarse compared to the set of PC’S strategies, which will be explained later. Still, this coarse reputation system can improve voters’ welfare. During the election, voters prefer the party with Honest reputation to the party with Suspicious reputation. They will vote for the honest party over the suspicious party. Since all voters are assumed to be same, the honest party definitely wins. If both parties have same reputation (Honest and Honest, or Suspicious and Sus- picious), following the tradition of the electoral competition literature, both parties 1 have probability § of winning the election. Reputation update by voters: Voters cannot see signals. Instead, they use 46 the observable campaign effort of both parties and the policy outcome as proxies of the signal and the strategies. Thus, when they observe VPCs’ effort levels and when they observe the policy outcome so that they can tell the true state from the outcome, they update reputations. Zero effort chosen by VPC suggests that the PC of the zero effort party chose a policy based on his preference, not based on the signal received. After the policy outcome is observed, voters can tell the true state and which party chose the bad policy (of when wj, i # j.) Since parties receive a signal about the true state, the bad policy also suggests that the PC who chose the bad policy chose it based on his preference, not on the signal. Then, voters downgrade the reputations of parties that proposed the bad policy as its platform, regardless of whether the party won the election in the period or not. The update based on VPC’s effort levels can affect the election outcome in the period. The update based on the policy outcome can aflect the election outcome in the next period. Though these updates are given as assumptions exogenously, these are basically simple and intuitive. Reputation update: (1) Before the election, if VPC of one party makes the effort level less than c, the reputation of the party is updated to Suspicious. (2) After observing the policy outcome, the reputations of parties that proposed the bad policy are updated to Suspicious. Above two updates are about downgrading reputations. Since parties have pos- itive probabilities of choosing the bad policy even if they choose their policies com- pletely honestly, the opportunities for upgrading reputations are necessary. I assume 47 following two for upgrading. (3) Before the election, if (1) happens to one party and the reputation of the other party was Suspicious at the start of the period, the reputation of the other party is updated to be honest. (4) After the election, the reputation of the party that was Suspicious at the start of the period is updated to Honest. (5) If both parties have Suspicious reputation at the end of period, voters update their reputations to Honest. The party reputations are carried over to the next period. (3) strengthens the punishment of no effort for the case that only one party has Honest reputation. Without this, it is possible that the PC of the party with Honest reputation prefers no effort with his opportunistic choice to the effort level c with honest choice. (1) and (3) make sure that the party of which a VPC does not make the required campaign effort gets punished during the election. Combined with (2), (4) is essentially saying that the party that proposed the good policy (of when wi) get rewarded for the choice. (5) is for the case that both parties obtain Suspicious reputation. Since the fact that keeping Honest reputation increases the prospect of winning election is the reason that VPCS can restrain the opportunistic behavior of PCs, upgrading both parties to Honest is beneficial for voters than keeping two Suspicious parties. This updating system is just one example of many. Other systems, especially more complicated systems, might be able to achieve better results. However, the point is that even this simple system can improve voter welfare. 48 Party strategies: Politicians have two occasions to make decisions; once in each period of their political life. PCs choose party platforms. Observing PC’s choice, VPCS decide the level of the campaign effort to make in the period. Then, during each election, there is an intra-party game in each party between its PC and its VPC; though it is PCs who determine party platforms, VPCS can have a say in platform choices through their campaign effort choices. The entire strategy of a politician is the profile of decisions as a VPC and as a PC. I assume that the strategy is based only on party reputations and the signal in the current period. So, all politicians in the same party have the same strategy. I consider the strategy as follows. Take a PC of Party 1 at period t. Let (A, B) be a profile of Party 1’s reputation A and Party 2’s B (A, B E {H, S}, H for Honest and S for Suspicious) at the start of the period. Let r be the signal parties receive. Then, the strategy of the Party 1’s PCis, >1=> 1 r — a _2 <1 7‘ _ 2 1 :>(H,H).r2§—eoz>a1 1 0 2 r<——e =>a 2 1 (H,S):r2§—el=>a1 1 1 2 T<——E =>CL (S,H):a1 Thus, the strategy of Party 1 PCs is the choice of thresholds 50 and 51, or prob- abilities of lying, for cases of (H, H) and (H, S) (see figure 2.1.) 49 A A f T 1| D 0 I 1/2'8 1/2 |1 Figure 2.1 If r 2 %, then there is no reason for Party 1 PCs to lie. It is the happy coincidence of their preferred policy and the socially Optimal choice. If r < %, there is an incentive for PCs to lie to voters. Since they do not need to worry about the next election, if they could choose 50 and 51 freely, they would choose 80 = 51 = %. Since VPCS in the current period will be PCs in the next period, higher values of 50 and 51 could benefit VPC if they would have reputation H in the next period. However, such choice would hurt the prospect of VPCS having reputation H in the next period. Thus, if PCs choose to lie for very low r, VPCS will not make the required campaign effort c. The result of this zero effort is that the smaller expected utilities of PCs. To avoid such loss, PCs might choose values of 50 and 51 that are lower than —1-.As for the case of (S, H), as long as Party 2 makes the required campaign effort, Party 1 never wins the election. Thus, actual choice does not matter for PCs. One implication of this and assumptions about the reputation system is that if (S, H) then Party 1 VPCS will not make the positive campaign effort. This is because the behavior of VPCS does not matter for the reputation in the next period. This could be changed so that VPCS in such situation makes the positive campaign effort with apprOpriate 50 changes of reputation update assumptions. Since such change does not alter results qualitatively, I keep the current update system for the sake of simplicity. I do not consider the case (S, S) because this happening at the start of a period is ruled out by the reputation update. The strategies of Party 2 PC, éO and 9:1, can be obtained by changing (H, S) to (S, H), the inequality signs, negative sings of 50 and 51 to positive signs, and exchanging a1 and a2 (see Figure 2.2.) 0 1/2 1/2-3. 1 Figure 2.2 As for VPC’s strategy, since VPCS move after PCs in stage games, their decision is simply about if they should accept the PC’s choice and make the effort c demanded by voters or should not accept it and make zero effort (when voters demand the positive level c of the campaign effort, it is meaningless to choose an effort level between 0 and c.) So, the strategy of VPCS is simply to accept PC’s choice of platform only if it gives expected utility no less than the expected utility of not accepting it. Thus, the complete set of the strategy for Party 1 politicians can be denoted as (50,51) (the strategy for Party 2 politicians is (3:0, £1).) The sequence of the game in period t: The start of the period t. 51 If (S, S) then (H, H) Nature chooses the state of the world for the period. Both parties receive r. PC chooses platform. Intra-party game: VPC decides campaign effort level. Middle of the period. Voters observe party platforms and campaign effort levels. Voters update party reputations. ElectionzVoters vote according to party reputations. Suspicious reputation at the start of the period is updated to Honest. The winner of the election implements its platform. End of the period. Voters receive utility from the policy outcome. Voters update party reputations. Next period. 2.2.2 Behavior of Politicians In this subsection, I explain behaviors and conditions for PCs and VPCS in symmetric equilibrium. In each period, in both parties, a PC and a VPC play a stage game to decide the party strategy and the campaign effort level in the period. I derive the expected utilities of politicians in the equilibrium with VPCS making the required effort c when the party reputation is H. 52 Behaviors of PCs First, I calculate expected utilities of a Party 1 PC at the start of a period so he knows party reputations but has not received the signal r yet. Given an equilibrium strategy (50, 51) with VPC making the effort c when the party reputation is H, let VH, H be the expected utility of Party 1 PC at the start of a period with (H, H). Then, this is defined as, 1 B—1 1 8+1 B+250 VH,H=(‘2'—50) 2 +(‘2'+50) = 2 2 1 (g — so) of the first term is the probability of r < -2- -— 50 (r has the uniform 2. The probability of his winning 1 distribution.) If r < 5 —- 50, the PC proposes a 1 . . . . . . 1 election is 5, and he Will receive utility B — 1 if he W'lnS. (§ + 50) of the second term 1 1 is the probability of r 2 5 — 50. If r _>_ 2 — 50, he proposes al. The probability of his 1 winning election is 2’ and if he wins, he will receive utility (B + 1). If (H, S), 1 1 1 1 1 VH,S=(§-‘5 )(B-—l)+(-2-+E )(B+1)=B+2€ If (S, H), Party 1 cannot win as long as Party 2 VPC makes the positive effort. So, VS,H = 0 Expected utilities of Party 2 PC can be calculated similarly with (£0, £1). Since PCs are in their last period, they do not need to worry about the next election. . . . . 1 Thus, they always want to increase values of 50 and 81 to their maximum, i.e., —. The behavior of VPCS is more complicated since it involves calculations of expected utilities. 53 Behavior of VPCS Given an equilibrium strategy (50, 51) with VPC making the effort c when the party reputation is H, a Party 1 VPC’s expected utilities of accepting and not accepting PC’s choice are calculated as follows. Notice that when VPCS make decisions they already know the value of r in their periods. Since PCs’ choice when r > $- is honest, I do not need to consider the case of r > 1. So, I consider only the 2 1 case of rgé. (H, H) 1 1 1 1 1 (2.1) a : r[-2-VH,5 + EVH,Sl+(1 — T)[§VS,H + '2'VS,Hl — C =r(B+2el) —c This is the expected utility of accepting PC’s choice of a1 when (H, H). The true state is w1 with probability r and w2 with the probability 1 — r. With (H, H), both parties have probability :- of winning the election. Party 1 chooses a1 and Party 2 chooses a2 (because r < é.) After the outcome of the policy implemented is observed, voters can tell which party chose the bad policy (i.e., ai for the true state wj.) So, the party that chose the bad policy is punished with the bad reputation. I 1 l l (2.2) a2: Tl§VH,H + BVH’H] + (1 — T)[§VH,H + EVH’H] — C _ B+2e0 _ 2 This is the expected utility of accepting PC’s choice a2 when (H, H). In this case, both parties choose the same policy, so, the expected utility for VPCS does not depend on r. (2.3) 0: rVHfl + (1 — r)VS,H = g(B + 250) This is the expected utility of not accepting PC’s choice. Since the reputation of 54 Party 1 becomes Suspicious for no campaign effort, the only case that VPC can have positive expected utility at the start of the next period is when the choice of Party 2 turns out to be bad. In the case, the reputation of Party 2 would become Suspicious, too. At the start of the next period, their reputations would be updated to Honest. (H15) (2.4) a1: rVH,5 + (1 — r)V5,H —- c = r(B + 251) — c B+2e0 2 (2.6) 0: rVHfi + (1 — 7:)V5,H = g-(B + 250) (2.5) a2: TVHJJ + (l — TlvH,H - C = If Party 1 VPCS do not make the effort c when (H, S), reputations become (S, H) before the election. After the implementation of policy, this will be kept same or be changed to (H, H), depending the outcome of the implemented policy. (2.2) and (2.5) are same. Though (2.1) and (2.3) are also same to (2.4) and (2.6) for the same value of r, respectively, they should be treated as different conditions because they are for different threshold values. (2.1) and (2.3) are related to 50 and (2.4) and (2.6) are related to 51. In the case of (S, H), PCs always choose a1 and VPCS always make zero effort. So, I do not need to consider expected values of VPCS when (S, H). Given (H, H) and r, if (2.1) is lower than (2.3), VPCS never accept PCs’ choice of a1, since VPCS prefer making no campaign effort to accepting a1. Same for (2.2) and (2.3). So, what is important is the difference between (2.1) and (2.3), and between (2.2) and (2.3). Same thing can be said for (H, S). Thus, I subtract (2.3) from (2.1) and (2.2), and (2.6) from (2.4) and (2.5), and denote them as (2.1), (2.2), (2.4) and 55 (2.5), again. (2.1) g-(B — 250 + 451) — c (2.2) l—g—WB + 250) — c (2.4) 721(3 — 250 + 451) — c (2.5) 1421(3 + 250) — 0 Since B > 1, the derivatives of (2.1) and (2.4) with respect to r is positive. So, if (2.1) or (2.4) are non-negative at the threshold r0 E g — 50 or r1 E g — 51, it is positive for r higher than the threshold values. Thus, for any r higher than the threshold values of r, PCs always propose a1 and VPCS always accept it. Moreover, since the upper bound of c is g, (2.2) and (2.5) never become negative for any r and 50 in [0, g]. Because of this, I can ignore (2.2) and (2.5). This means VPCS always accept a2 if PCs choose a2. Thus I need to consider only (2.1) and (2.4). Since what I am looking for are threshold values of 50 and 51, I substitute the corresponding threshold values of r, r0 E g — so and r1 E g — 51, into above (2.1) and (2.4). To distinguish expressions with r and without r, I put “an” to expressions without r. (2.1)* — eO)(B — 250 + 451) — c ( ( [\DIF-‘MII-J [\DIHNIH (2.4)* — 51x3 — 250 + 451) — 0 2.2.3 Definition of Equilibrium Consistency and Deviation: Let (50*, 51*) be an equilibrium strategy with VPCS making the required effort c when the party reputation is H. Then, this (50*,51*) needs to satisfy the following condition. 56 1 (a) (Consistency) (2.1)* and (2.4)* are non-negative for thresholds r0 = 5 — 50* for (2.1)* and r1 = g — 81* for (2.4)*. If this condition does not hold, (50*, 51*) cannot be an equilibrium strategy with VPCS making the required effort c. The condition (a) is about the consistency of (50*,51*); if (a) does not hold, VPCS will not accept a1 for thresholds r0 and r1. Thus, (50 51) that does not satisfy (a) cannot be even an appropriate strategy with VPCS making the effort c, let alone being an equilibrium strategy. In such case, VH, H and VH, S cannot be calculated like above since the behavior of VPCS does not follow (50*, 81*). Then, when PCs prOpose an inconsistent strategy, I assume VPCS never make the required campaign effort c since they cannot tell making the effort is beneficial or not. For a successful deviation, the deviated values need to satisfy (a). Suppose that 50* < :- or 51* < :2- and that the current period Party 1 PC 0:1 of course _€0*( deviates from (50*, 51*) when (H, H); proposing a1 when r < r r 2 0.) Since the party reputation is (H, H), the PC can deviate only from 50*. Let r = g — '0. This deviation is equal to proposing 50 instead of 50*. If (a) holds for (3:0, 51*), the current period VPC will accept this deviation. If (a) does not hold, VPC will not accept al. This is obvious if (2.1)* is negative with (50, 51*). However, even if (1)* is not negative with (50, 51*), if (2.4)* is negative, the VPC will not accept a1 since (50, 51*) is not a consistent strategy. He cannot calculate VH,S> which is needed for the calculation of his expected utility. Same holds for (H, S). Thus, for (50*, 51*) to be an equilibrium strategy, it has to satisfy (a). For Party 2 with (é0*,é1*), similar 1 1 things can be said with thresholds to = 2 + 630* and 7‘1 = 5 + 5:1”. Now, I state the 57 definition of equilibrium. Equilibrium: Given the voting response of voters and the required campaign effort level c, Party 1 strategy (50*, 51*) and Party 2 strategy (50*, 5:”) are a pair of equilibrium strategies if the following conditions hold. 1) Given (220* , 9:1”), there is no consistent strategy (50, 51) that is Pareto superior to Party 1 PCs in the situation of (H, H) and (H, S) compared to (50*, 51*). 2) Given (50*, 51*), there is no consistent strategy (60, 5‘1) that is Pareto superior to Party 2 PCs in the situation of (H, H) and (S, H) compared to (é0*, 51*). Notice that the definition of equilibrium is not saying anything about voters, though it is them who choose the level of c. In the following, I consider only the symmetric equilibrium so that (50*, 51*) = (50*,é1’“). 2.3 Results First, I state about the existence of equilibrium. I consider only r_<_%. Given B > 1, since c has the upper bound 5, choosing a consistent strategy is always beneficial than choosing any non-consistent strategy for PCs. Since (2.1) and (2.4) have the campaign effort c, and positive c can make these values negative for small enough r. This means that VPCS prefer no campaign effort to accepting a1 for such small r. Since VPCS always accept a2, choosing a2 for such small values of r is more beneficial for PCs than accepting the loss of election by no campaign. Thus, PCs choose a2 for such small r. This means that in all equilibrium strategies VPCS make the required effort 0 when the party reputation is H. Moreover, since (0,0) makes (2.1)* and 58 B (2.4)* non negative for any c E [0, 41’ it is obvious that an equilibrium exists for any B c 6 [0,1] and B > 1. I state this as a lemma. B . . . . Lemma 2.1. For any B > 1 and c 6 [0,1], an equilibrium crists. Then, what kind of characteristics does the equilibrium have? I consider the easy case of c = 0 first. Since B > 1, and 50 and 51 take values in [0, g], it is easy to show that (2.1)* and (2.4)* are always non negative when c = 0. Thus, when c = 0, Party 1 VPCS always accept a1. So, Party 1 PCs always choose al. The same thing also holds for Party 2 politicians about a2. 011 :5 :— Lemma 2.2. If c = 0, the only equilibrium is e 2 Next, I consider if positive c can achieve better result for voters, i.e., if positive c can induce PCs to lie less often. Since both (2.1)* and (2.2)* become —c when 50 = 51 = g, the threshold values of r for positive c must be higher than zero. This means the threshold values of 50 and 51 are lower than %. The following proposition is about this, i.e., about the existence of equilibrium with 50 + 51 < 1; Party 1 PCS do not always lie when c is positive (same for Party 2 PCs when c is positive.) o o . . . . O 1 . . B Proposmion 2.1. The equilibrium with e + e < 1 exists with c 6 (0,1). Of course, 60 + 51 < 1 means the improvement of the voter welfare. I calculate 1 — 16(5051)2 2 + 4030)2 + 4(51)2 (the derivation of this is in the Appendix.) When 50 = 51 = 2’ the long-run average the long-run average voter welfare as the welfare criterion. It is voter welfare is zero. Since appropriate positive values of c can achieve 50 + 51 < 1, positive c can make the average welfare positive. So, requiring positive campaign efforts is beneficial for voters. Then, it is natural to ask if it is possible to achieve 0 = 51 = 0. It is obvious that if B is so large that the optimal result for voters, i.e., 8 utility gain/ loss from policy preference does not matter to politicians; they behave more honestly, and vice versa. The following Pr0position is about this point. B Proposition 2.2. If B < 2 and c 6 (0,;), there is no equilibrium with 50 = B 51 = 0. If B 2 2, the equilibrium with 50 = 51 = 0 exists for c = —4—. Thus, if B Z 2, voters can achieve the Optimal result by requiring c = g. If B E (1, 2), this is not possible. This impossibility result of 50 = 51 = 0 does not depend on the upper bound of c. Even if c could take values higher than %, 50 = 51 = 0 is impossible for B 6 (1,2). Then, when B 6 (1,2), what is the characterization Of the equilibrium? B Proposition 2.3. When B 6 (1,2) and c E (0, —4—), equilibria are characterized by, 1 —>512€0 0 __ E1>2e +2 B [\D 8 51: 2—B+2€0+\/(2——B+2€0)2—32c 8 1 1 All (50,51) 6 [0, 5] X [0, 5] that satisfy above three conditions are equilibrium. Given this characterization, what can be said about the Optimal level Of c when B E 1 — 16(5051)2 2 + 4(EO)2 + 4(51)2 (1, 2)? Since the long-run average voter welfare is , its indifference — k curve is 51 = ~1— ——1——02- — k (k is a fixed welfare level.) The closer the curve is 2 k + 4(5 ) 60 to the origin, the better for voters. Then, what value of c should voters require to achieve the highest welfare? The next Preposition is about this. Proposition 2.4. When B 6 (1,2), the best long-run average voter welfare 16 2 — B B 2 2 achievable is (B _ 2)2 + 32 with 50 = 0 and 51 = T by requiring c = Lg?)— Though the highest achievable average voter welfare by requiring positive 6 when 16 (B — 2)2 + 32 1 . . . . . 2’ it is still better than the average voter welfare 0 With requiring c = 0. However, B 6 (1,2) is , which is lower than the maximum average voter welfare the assumption that voters require some positive level Of the election campaign effort is unrealistic. Moreover, this requires that voters know the value of B. Though the (B + 2)2 t' f = calcula ion 0 c 32 is easy if B is known, there is actually an easier way to achieve the highest average voter welfare; simply, letting parties compete in the campaign effort level. Change the reputation update during the election from, (1) Before the election, if VPC Of a party makes the effort level less than c, the reputation Of the party is updated to Suspicious. to (1)’ Before the election, if VPC of a party makes the effort level less than the effort level the VPC Of the other party makes, the reputation Of the party is updated to Suspicious. Then, what is the result of this change? It is clear that PCs will choose 50 and 51 for which VPCS will make positive campaign efforts. Then, what 50, El and effort level will PCs and VPCS choose? 61 Proposition 2.5. With the reputation update (1)’, if B _>_ 2, the equilibrium is 2 — B 50 = 51 = 0. If B E (1,2), the equilibrium is 50 = 0 and £1: 8 Thus, by letting parties compete in the campaign effort level, the best that can be achieved by requiring positive c can be achieved automatically for voters. 2.4 Conclusion of Chapter 2 In this chapter, I considered a model of electoral competition between OLG political parties under the uncertainty about policy outcomes. Exploiting the natural conflict of interests within parties, I showed the possibility Of restraining the Opportunistic behavior Of politicians. Requiring young politicians Of making campaign eflorts, it is possible to deter the Opportunistic behavior Of politicians completely when B _>_ 2. When B E (1, 2), the restraint on politicians is not perfect. But it can still achieve higher voter welfare than not requiring the effort. The last Proposition shows the importance of the competition between parties, which can achieve the best possible outcome automatically for voters. However, even with the party competition, politi- cians might lie if B E (1, 2). (H, S) and (S, H) are situations that the trust Of voters is biased favorably to one party over the other. Such biased trust makes exploiting the trust too attractive for the trusted party when B is relatively small. Reputation systems and voting behaviors different from the ones in the chapter could achieve different results. But, since such possibility of abusing the trust is a part of the re- ward for young politicians Of restraining Old politicians, the similar exploitation of the voter trust is inevitable when B is small. 62 The idea of this chapter is based on the party reputation; young politicians make the campaign effort even though it gives them negative utility, because it helps their party and the strong party will help them in future. The “strong party” is the party with good reputation, and young politicians invest in it. Tadelis (2002) is a OLG reputation paper and considers a model in which retiring agents can sell their firm names, or reputations. The values of firm names depend on their reputations. To keep a good reputation so that they can sell firm names at high price at the end Of their life, even agents in their last period make costly effort. Similarity is clear, but difference is also clear. In this chapter, it is not Old politicians in their last period but young politicians with future who make costly efforts. SO, this chapter shows a different route to keep a good reputation. The reality is not the major virtue Of this model. Thus, the model depends on the strong assumptions. However, at same time, explicitly modeling parties as or- ganizations consisting of individuals is a more realistic way of modeling parties than treating the parties as consistent existences or simply treating parties only as labels on candidates. Loosening strong assumptions would change results quantitatively. But as long as there is the intertemporal conflict of interests and there is some action observable to voters, the qualitatively same result could be obtained. As I wrote in the Introduction Of this chapter, there are papers of the retrospective voting models, in which re-elections in the multi-period partially restrains the incumbent politicians from practicing selfish behaviors (Austen-Smith and Banks(1989).) Though this chap- ter could be counted as one of them, unlike usual retrospective voting models in which the incumbent will face the election on the results of his own policies directly, in the 63 model Of this chapter the incumbent (i.e., PC in Office) will not face the election. PCs face election only once to become the incumbent. Thus this chapter assumes out the possibility Of the self-restraint of the incumbent in the usual retrospective voting models. Still, improving voters’ welfare is possible. Though there is no direct retrospective voting on politicians, the party structure makes it possible for voters to use the retrospective voting13. This is different from the case Of repeated elections between two parties that are consistent entities. Though I consider a particular prob— lem Of information asymmetry, the same approach in this chapter can be applied to other moral hazard problems. What important is the conflict Of interests in a party and the existence Of a way for insiders to send signals to voters. Although OLG models are Often used in Political Economy literature (see Persson and Tabellini (2002)) and Alesina and Spear (1988) argue that “the ‘overlapping generations’ model can be usefully applied to the political arena,” as far as I know, Alesina and Spear (1988) and Harrington (1992) are only other papers Of electoral competition model with OLG political parties. Recent papers on how party structures affect politicians and elections consider static or finite period models (though Snyder and Ting (2003) consider infinitely repeated game.) As Alesina and Spear (1988) states, the OLG structure is a good way to capture the intertemporal conflict Of interests among party members. As I said, this chapter shares the OLG political party model with Alesina and Spear (1988) and Harrington (1992). The difference between their papers and this chapter is about what problem to analyze. Their papers analyze the problem Of policy commitment. SO, they consider about if politicians keep their choices during 64 elections, and I analyze about what choices they make during elections. Usually the policy commitment is assumed in electoral competition models (this chapter also assumes it.) As Alesina (1988) shows, however, the lack Of the assumption could have a big implication on credible platform choices. Then, Alesina and Spear (1988) and Harrington (1992) show that the policy commitment could be achieved as the equilibrium outcome, not as the assumption. Alesina and Spear (1988) consider a transfer scheme from young politicians to Old politicians for the purpose. The young politicians make the transfer as they need the Old politicians to keep their promises for their own future elections. Thus, the logic is based on the conflict Of interest among party members”. As one more related paper, I would like to discuss about Caillaud and Tirole (1999). Based on the conflict of interests among party members, Caillaud and Tirole (1999) explain why policy convergence does not happen in reality, unlike in many electoral competition models. It is a one-period model (though implicitly it assumes the second period) and has an Office-oriented party leader and ideological party rank- and-files. Because Of the information asymmetry between the party leader and voters, the leader might not work hard for voters. However, because of the conflict Of interests between the leader and rank-and-file party members, rank-and—file members can work as monitors on the behavior of the leader for voters. Such monitoring can increase the possibility of winning election. But, since only ideological members conduct such monitoring (since effective monitoring could be costly), only ideological parties can have such monitoring. If the results in the paper would be taken literary, they look like implying that 65 the higher level of election campaigns might be good for voters. Clearly this is a wrong interpretation. What the results really imply is that the party structure that helps the party boss actually binds his behavior. Since he needs his party for him, he has to take care of the party, which has the longer life span than the party boss. Otherwise, party subordinates would send a signal to let voters know what is going on inside the party. There would be some Obstacles to apply this result directly to actual election campaigns. Firstly, the campaign effort in this model is different from the actual campaign effort. In the model, the campaign effort is essentially the alias of VPCs’ disutility. The costs of actual election campaigns are not entirely burdened by young politicians who will go to the future elections. This obscures observability. Also it would lessen the extent of the conflict of interests. For example, consider a case that VPCS can enjoy some of campaign contributions from special interests. It might not eliminate the conflict of interests entirely. But if voters are not sure about the extent to which VPCS can receive the contribution, this would add uncertainty. Secondly, there might be the information asymmetry between party leaders and subordinates. In such case, the ability of party subordinates working for voters is diminished. However, as long as the asymmetry within the party is less severe than the asymmetry between the party boss and voters, the ability is not zero. Voters’ appropriate reaction to the actions from party insiders is also very impor— tant for the results. It does not seem possible to obtain similar results without using voters’ reaction but with cooperation in OLG organizations (Cremer (1986), Kandori (1992)”) Although the self-restraint through internal cooperation or inter-party co- operation might be important, I think, in electoral competition models, assuming a 66 voters’ appropriate reaction is a simpler and more intuitive way. 67 Appendix of Chapter 2 Lemma 2.1. For any B > 1 and c E [0, i], an equilibrium eaists. Proof. Let D E {(50,231) E [0, g] x [0, g] : (2.1)* 2 0 and (4)* Z 0 for (€0,el)}. This D is a set of consistent strategies, closed, and non-empty from the assumption c E [0, g]. If (50,51) ¢ D, the expected utility of PCs from this (50,51) is zero. If (50,51) 6 D, the expected utility of PCs is positive. Thus, PCS never choose (60, 51) ¢ D. Then, let P be the set of (£0, 51) E D which is strictly Pareto efficient. Since D is not an empty set, P is not empty. From the definition of equilibrium, this non-empty set P is the set of equilibrium strategies. I B Proposition 2.1. The equilibrium with 50 + 81 < 1 exists with c E (0, Z). Proof. (2.1)* éé— — 50)(B — 250 + 451) — c 1 1 (2.4)* 5% — 51x3 — 250 + 451) — c If e = 0, (1)* and (4)* are zero for 50 = 51 = --21-. Then, slightly increase c from 1 zero. At 80 = 51 = -2-, (1)* and (4)* become negative. However, from the previous Lemma, an equilibrium must exist. Since (5, 5) ¢ D and all equilibrium strategies are in D, the equilibrium with so + 51 < 1 must exist. I The derivation of long-run average voter welfare 0 Since I consider symmetric equilibrium, 5 = 60 and 51 = 61. Then, the transition probabilities are, 1_0 1+0 HH HH'2E ‘d 28r1—rd‘ I ‘ (, ):>(, ).f0 (1—r)r+f%_€0(-2-+ 2 )r+f%+807dr _§_,02 —, m 68 ULH):(HJWHSEDr§+@%2 ursnwaH)E (H,rn; [___€1 1+€1 rdr+ _1__1(—1 r) d)r+ rdr+ 1+ 1(—1-—)r )dr [0 2... 0 21 = Z +(51)2 (Hflfl&6flfl=¢fli$flm$H):g—(EB From these, the long-run probability of (H, H) at the start of period is derived as 1 + 4(51)2 . . . 2 + 4(80)2 + 4(51)2’ and the long-run probability of (H, S)&(S, H) is 1 + 4(50)2 . _ . 2 + 4(50)2 + 4 (131)2' Next, the expected voter welfare when reputation is (H, H) 13 2’50 2+50 1 1 f0 {1—r—r}dr+fl_ 0{—(r—(1—r))+—((1—r)—r)}dr E 2 2 + [1+8 (1 — r))dr 1 _ 5 _ 2(:0)20 The expected voter welfare when (H, S) or (S, H) is, l _l_ + 81 -—e “”0 (1—2r) dr+fl_1(2)r—1dr+f0 (1—2r) dr+f1+ E(QT-1W] _1_ 12 _2 2(5 ) Thus, the long-run average voter welfare is, 1+4n52 1—4a%2 1+2e%2 114622 2 + 4(50)2 + 4(51)2 2 2 + 4(50)2 + 4(51)2 2 _ 1 — 16(8081)2 _ 2 + 4(80)2 + 4(El)2 B Proposition 2.2. If B < 2 and c 6 (0,1), there is no equilibrium with 50 = B 51 = 0. If B 2 2, the equilibrium with 50 = 51 = 0 exists for c = Z. 1(1— 50)(B — 260 + 451) — c Proof. (21)* 2 2 69 1 1 (2.4)* 212 — 51)(B — 250 +451) — c Let (50*,51*) be an equilibrium strategy. I stated conditions (a) above that (50*, 51*) must satisfy to be an equilibrium. Actually, there are more conditions. 0* 1*). (a) (2.1)* and (2.4)* are non-negative for (e e (b) At least one of (2.1)* and (2.4)* is zero at (50*,51”). Denote this one as (25)" (this could be one of (2.1)* and (2.4)* or both.) (c) At least one of the following two holds. (c.1) 50* = 51* = —. (62) For at least one (z)*, the derivative of (z)* with respect to 50 and 61 are non-positive16. If (a) holds but (b) does not hold, both (2.1)* and (2.4)* would be positive. Then, PCS could deviate from (50*, 61*) with holding (a). Similar thing can be said to (c); if (c) does not hold, PCs could deviate successfully. Thus, condition (b) and (c) are necessary conditions for (60*, 51*) to be an equilibrium strategy. Then, substituting B B 50* = 51* = 0 into (2)“, I obtain c = Z the maximum of c. Actually, if CZZ’ both of (2.1)* and (2.4)* are zero for 50* = 51* = 0 (conditions (a) and (b) hold.) This 1* = 0 is an equilibrium. Since value of c, however, does not guarantee that 60* = e 50* = 51* = 0 cannot satisfy (c.1), it has to satisfy (c2). Derivatives of (2.1)* and (2.4)* with respect to and 50 and 81 at (50*,51*) are, 8(2.1)* _ 1 0 1 1 O 850 — 2(B 25 +45) (2—e)<0 6(2.1)* 0 851 = 1 — 2e 2 0 a 2.4 * 1 70 2.4 * 1 6(8 1) = ——2-(B—2z-:0-+—4el)+1—2e1 e 6(2.1)* 6(2.4)* 851 and 851 Thus, 50* = 61* = 0 cannot be an equilibrium when B < 2. This result does not Suppose that B < 2. Then, both at 50* = 51* = 0 is positive. B B depend on the upper bound of c, 4' Even if c can go higher than 71—, 50 = el = 0 could not be obtained. If c > %, all of (2.1), (2.2), (2.3) and (2.4) become negative 0 at e = 51 = 0. VPCS never accept of any of 50 and 51. Next, suppose that B Z 2 and that there is an equilibrium with (50*, 81*) -,£ (0,0) 0:1,E B for c = -4-. Because (5 1*) is Pareto superior to (0,0), if such equilibrium exists, B B 50* = 0 and 51* = 0 cannot be an equilibrium for c = —. Since c = —, the 0*_1* . . 1 . . 0* 1*) cannot satisfy (c.1) Since 5 — e = -2- and the pOSitive c equilibrium (5 ,5 make (2.1)* and (2.4)* negative; (a) cannot hold. SO, (c2) must hold. Then, looking at (2.1)* and (2.4)*, it is clear that if 50* > 51* then (2.1)* is zero, and if 50* < 51* then (2.4)* is zero. If 50* = 51* then both are zero. 1 consider these three cases. Case 1. 50* > 51* In this case, (2.1)* = 0 and (2.4)* > 0. This case is not consistent with the assumption that (60*, 51*) 1 a 2.1 * . . . . 50* < 2’ (0 1) = 1 — 250* > 0; condition (C2) is no satisfied. 6 . .. . . 1 IS an equihbrium. Since c > 0, 50* cannot be -2—. For Case 2. 51* > 60* 2.4 * In this case, (2.4)* = 0 and (2.1)* > 0. Since 3(6) ) < 0 (the equality holds only 6 1* 1 when 6 = 5 and this does not happen as c > 0) and * 061 is negative for B Z 2, (c2) might be satisfied. 1 (2 — 51*)(3 - 250* + 451*) — 2c = 0 71 B => ——4(el"‘)2 + (2 — B + 250*)51” — 50* (B and c are cancelled out as c = Z) 1: 2-B+250+\/(2+B+250)2—32c 17 8 =>€ Since 81* > 50*, \/(B — 2 — 2230*)2 — 16.6:1F > (B — 2 — 250*) + 850* 2 0. Ffom this, (B — 2 — 250*)2 — 1650* > (B -— 2 — 250*)2 + 1650*(B — 2 — 250*) + 64(50*)2 => 32(50*)2 +16(B — 1).:0 < 0 B -— 1 2 B - 1 So, 50* E (——2—, 0). However, since 50* => 3250* (50* + )<0 is not negative, this is a contradiction. Case 3. 50* = 51* In this case, (2.1)* =(2.4)* = 0. Let 5 = 50* = 51*, and substitute 5 into (2.1)* or (2.4)*. Then, %(%—e)(B+2e)—c=0 B 1— :>—2e2+(1—B)e+§—2c=—2e(e——2—B—)=0 2 Since < 0, it must be e = 0. Thus, (50*,51”) = 0. I B Proposition 2.3. When B 6 (1,2) and c 6 (0,1), equilibria are characterized H —>51250 2—B 20 51> + 8 {\D 8 51: 2—B+250+\/(2+B+250)2—32c 8 72 Proof. Let (50, 81) be an equilibrium. The proof of previous Proposition already showed that the case of 50 > 51 at equilibrium is not possible (this result does not depend on the value of B.) Thus, always 51 2 50. From this, (2.4)* = —(— — 2—B+250+\/(2+B+250)2—32c 8 . 1 Finally, since c > 0, 51 is less than 2' Then, (c2) in the proof of previous Proposition BQAY BQAY m2n* d = 860 an 851 650 1 624* 1 —(§ — 51)_<_0- For ( ) = —-—(B —— 250 + 451) + 1 — 251 to be non positive, 661 2 1> 250+2—B 51)(B — 250 +451) — c = 0. Solving this, 51 2 must hold for (2.4)*. SO, must be non-positive. e . Thus, any equilibrium satisfies above three conditions. Then, 1 1 pick (50*,51*) E [0, -2-] x [0, -2—] that satisfies above three. This (80*,81*) satisfies (a), (b) and (c) in the proof of previous Proposition. I check if PCs have an incentive to 24* 22 * _* deviate from (50*,51*). Since 6(651) _<_0 at (50*,51*), %% < 0 and 6:3) $0, 1 1 any point (60,81) 6 [0,-2] X [0,5] such that 60 Z 50*, 51 2 51* and (60*,51*) aé (50, 51) makes (2.4)* negative. Thus, PCs will not deviate to such points. Any points satisfying above three conditions are equilibrium. So, above three conditions are the B characterization of equilibria for B E (1, 2) and c E (0, —4—). I Proposition 2.4. When B 6 (1,2), the best long-run average voter welfare 16 2—B B 22 achievable is (B _ 2)2 + 32 with 50 = 0 and 51 = T by setting c = L%l-. 1 1 20 2—B Proof. Let E E {(50,51) 6 [0’2] x [0’2] : 51 2 5 +8 }. If (50,51) is not in E, that (50,51) cannot be an equilibrium. So, I consider only (50,51) in E. 1 — 16(5081)2 The long-run average voter welfare is , and its indifference curve 2 + 4(50)2 + 4(El)2 is 81 = 1 ——1:—]?—— — k (k is a fixed welfare level.) Since the marginal rate of 2 k + 4(50)2 d1 Ol—k substitution between 50 and 51 for voters is 3:7)- : —;—(£:222)—), the indifference 73 curve is downward sloping. Because the bottom side of E is upward sloping (on the l 1 bottom, :7)- = —) and (0,0) ¢ E, it is obvious that voter welfare is maximized at 4 2 — B the lower left corner of E. The corner is 50 = 0 and 51 = —. I check if this 2 — B can be an equilibrium for some value of c. Substituting 50 = 0 and 51 = -——, 8 1_2-B+250+\/(2+B+250)2—32c (B+2)2 , — 8 ——32 . This 2 _ c= Egg—makes (0,2 B 32 2 — B . . * . 1 1 . u 0 — maximize (2.4) in [0, 2] x [0, 5] when B 6 (1,2), the combination of e = 0 8 2—B B 22 2—B and 51 = 8 is the only equilibrium when c = Lfigli (0, 8 2 — B 2 0 2 B 2 0 2 — 32 1: + 5+\/(8+ + 5) CandE.Thus,any 1 1 B _2 2 other point in [0, 5] X [0, 2] cannot satisfy even condition for c = ( :2 l 2 — B) the lon run avera e voter welfare is 16 I ’ g g (B — 2)2 + 32' 5 holds only when c = ) an equilibrium strategy. Since 50 = 0 and 51 = ) is the only intersection of a curve 5 . Finally, at (0, Proposition 2.5. With the reputation update (1)’, if B 2 2, the equilibrium is 2_-2 8 Proof. PCS of both parties want to increase the campaign effort level to the 50 = 51 = 0. If B 6 (1,2), the equilibrium is 50 = 0 and 51: maximum. This maximum campaign effort level is same to the maximum campaign effort level that VPCS would accept with the reputation update (1). When B 2 2, since (0,0) is the only equilibrium with the update assumption (1) and its required B level of c is 4’ (0,0) is also the equilibrium result with the assumption (1)’. When 2 — B B E (1, 2), the combination of 50 = 0 and 51 = T maximizes the value of (2.4)* 1 1 250 2 — B in E E {(80,81) 6 [0,5] x [0,-é] : 51 2 ——+§——}. Thus, in both cases, the Optimal with the assumption (1) is obtained with the assumption (1)’. I 74 Endnotes of Chapter 2 5. There are many empirical research papers about special interest groups based on Grossman and Helpman (1996). They found a “puzzle” that the governments of US and other countries put much higher weights on consumer welfare than on contributions from special interests. Gawande and Hoekman (2006) also obtain such a result on agricultural special interests and argue that such results are due to the assumption of “unitary” government in Grossman and Helpman (they also talk about “policy uncertainty”, which is the uncertainty about legislative outcomes, not the uncertainty about the outcomes of implemented policies in this chapter.) 6. In the literature of the theoretical studies of legislative voting in Political Science, parties and politicians are different existences, and the function of parties are big subject (Aldrich & Rohde, Krehbiel & Meirowitz (2002), etc.) 7. Of course, no good/bad politician assumption and the non-Bayesian, simple reputation update are related to each other. Because of the non-Bayesian update, I do not need good / bad politicians (or some other assumption about characteristics of politicians.) Because of no good / bad politicians, a simple Bayesian update does not work in the model. 8. Alternatively, this can be considered as follows; there are three types of voters: voters who have the affiliation with party 1, voters who have the affiliation with party 2, and independent voters who have no affiliation with any party. Neither of party 1 and party 2 voter groups is a majority group. Thus, the election outcome depends on the independents. In other words, they are median voters. 75 9. See Alesina and Spear (1988) and Harrington (1992) for the policy commitment in OLG political parties models. 10. This is a simplified, short cut way of modeling the signal. Following Majumdar and Mukand (2004), this can be modeled as follows; at first politicians in both parties have the belief % as the probability of the true state being wl. Then politicians receive a common stochastic signal .73, which is in [_a_:, 5:]. This signal a: would be received with the density function Q51 if the state is wl and the density fimction qfi” if the state is w2. I assume that limxag (151 (:r)/¢2(:r) = 00 and liqug ¢1(:1:)/¢2(:r) = 0. Then, from this signal, the candidate deduces new probability 1‘ of the state being wl using the Bayesian rule, r(x) = 1 2¢1(x)/<§¢1/¢2 1, so capitalists are richer than workers. group a E {K , L}. All members of two groups have same utility function, U (:13, y); no disutility of labor for workers. Thus, the income tax does not cause distortions, same as lump-sum tax”. This utility function is an increasing, strictly concave and differ- entiable function in both arguments. Let y(I“, p) be the common demand function of good Y of a member of group a E {K , L} derived from the utility maximization of U (:r, y) with income I 0 (here abusing notation, I a is after-tax, disposable income) and the domestic price of good Y, p. From this, the demand for good X of a member of group a E {K, L} is :13“ = I“ — py(p,Ia). Here, deviating from the usual as- sumption in International Economics, I assume the common utility function U is not homothetic and that the income elasticity of good Y is less than one. The total and average demand of good Y is g E N K yK + N LyL. Since f (l) is the total domestic production of good Y, the import of good Y is defined as m E g —- f (l) The indirect 85 utility function is V(Ia,p) = U(Ia' — py(Ia,p),y(I“,p)), a E {K, L}. 3.2.2 Political side I assume that voters decide to vote for the incumbent or not based on the utility they received during the incumbent’s term compared to the utility they would receive under the policy of the challenger, which is assumed to be the free trade policy. So, this is the retrospective voting. The incumbent has to take into his consideration the reactions from his constituency. The objective of the incumbent politician is winning the election, i.e., the maximization of the probability of winning the election. The only decisions that incumbent can make before the election are the decision to help capitalists or not and how to finance the help. The challenger at the election proposes the free trade policy. His role in this model is just to provide an alternative to voters and makes the retrospective voting possible for them. He can be considered as a convinced free trader. The voting rates of groups are determined by two factors: the voting decision making, which is also based on voting costs and utility differences between the policies of the incumbent and the challenger, and stochastic terms. These two determine the actual number of votes cast and the election outcome. Voter abstention: Some people do not vote. The problem of the voter ab- stention, or the paradox of not voting (Feddersen (2004)), is a big issue of Political Economy, but not a topic of this chapter. So, here, I simply use a modified version of the Group Rule-utilitarian voting model (Feddersen and Sandroni (2006), Coate 86 and Conlin (2004).) If the incumbent decides to help capitalists, no potential voter is indifferent between two candidates. Every capitalist prefers the incumbent to the challenger, and every worker prefers the challenger to the incumbent. Then, they face two obstacles before actual voting for their preferred candidate. First, voting is costly. Take a potential voter in group a E {K , L}. Let d“ be the utility difference between his preferred candidate and the opponent. Let c be his voting cost, which varies among potential voters. Then, if the voting cost is so high that do” — c < 0, there is no reason for him to vote. Thus the voting cost is the first obstacle. The second obstacle is the fact that the probability of being the pivotal voter is essentially zero unless the number of actual votes is very low. Since the population is continuum in this model, such probability is zero. So, there is actually no reason for anyone to vote. Following the Group Rule-utilitarian voting literature, however, I assume that some people still vote when d“ — c 2 0.22 I call potential voters who do not vote even when d“ — c 2 0 as free-riders. They do not vote since they think other peOple will vote for them anyway. The fraction of such free-riders is unknown and stochastic. Let gba be the participation rate of group a E {K , L} among non free-ride potential voters. This qb depends on the voting cost function, c(n). This voting cost can be considered as consisting of time, money and other factors necessary for going to a voting booth. Different potential voters have different voting costs. Then, the voting cost function c(n) gives the voting cost of n-th voter in the group when every potential voters in group a are aligned from the lowest voting cost voter to the highest voting cost voter. So, n of potential voters in a group have voting cost less or equal to c(n). I assume that both groups have same cost function and that c’ > 0 and c” < 0. As 87 a simple example giving a cost function of this type, consider a voting district with one voting booth at the center of the district. Potential voters live uniformly in the district, and the voting cost of a potential voter is represented by the distance from his home to the booth. Then, voting function becomes concave like c. Further, I assume that c(O) < 0, so some people always want to vote, maybe due to the sense of civic duty or strong partisan feelings. Since both groups have the same voting cost function c(n), the number of votes is same for both groups when dK = dL = 0. Since N L > N K , this means qbK > qu at no utility difference. I assume that all actual capitalist voters vote for the incumbent and all actual working voters vote for the challenger even when they are indifferent between two candidates. Then, given the utility difference d“ > 0, the participation rate of the group a, 45“, is defined by, d“ — c(qbaNa) = 0 From this, as“ _ 1 Bid—0 _ Mariam) > 0 The participation rate d)“ of non free-ride potential voters in group a is determined by d“ and c. But the actual number of votes is determined by d)“ and stochastic terms about the fraction of free-riders. This stochastic terms maybe come from many factors like the conflict of the voters’ sense of the civic duty and the fact of the impossibility of being the pivotal voter, and all other uncertain factors not resolved until the election day, like weather, possible transportation troubles, abrupt schedule changes of potential voters, etc. I assume the following setup; a potential voter in group a 88 is not a free-rider with probability n“. Thus the potential voter is a free-rider with probability 1 — a“. For simplicity, all members in group a E {K , L} have same a“. This a“ is a random variable. K NK ¢K The number of votes from group K is ,u , and the number of the votes from group L is uLgbLNL. Thus, the election result is determined by ”K N K qbK — uLNquL. If ,uK N K 45K —— nLNLdJL > 0, the incumbent wins the election”. Since us are random variables, this is a random variable, too. Let H (,uK , 111’) be the cumulative distribution function of ,uK and ,uL. Then, the probability of the incumbent winning the election can be calculated as fflKNK¢K_#LNL¢L>0dH(luKa”Ll- This is the objective of the incumbent. Without knowing the exact function form of H, it is not possible to know the exact value of the probability. However, to analyze the incumbent’s choice, I do not need to know the exact form. Let A be the intersection NL L L K . K L Of[0,1]><[0,1]and{(/i 1/1' ).l1 2’1 NK¢K } in [1L X ,uK dimension (see figure 3.1.) 89 ”KzuLNL¢L/NK¢K M/ Figure 3.1 Clearly, the maximization of the probability of winning the election is equal to NL L NK¢K the maximization of the area of A. Moreover, since ”K = 111’ always holds at ”K = 0 and ML 2 0, the maximization of the area of A is equal to the minimization L L NK K . But this is equal to the maximization of —. NK¢K NL¢L of Thus, the incumbent does not need to care about the distribution function H. I state this as a claim. Claim 3.1 The incumbent tries to maximize his probability of winning the election, prNK¢K_uLNL¢L>OdH(HKa ,uL). This is equal to the maaimization of NK¢K NL¢L 90 3.3 Results with two groups Before stating results, let me briefly describe my strategy of obtaining results. Let the sum of the production subsidy and the tariff rate as H E s + r. In the following analysis, firstly I consider if the incumbent has an incentive to use a tax to give the production subsidy s with zero tariff. Next, I consider if the incumbent has an incentive to raise the tariff rate r for the given level of H E s + r. Such increase of T means fl: = —1. Since the objective of this chapter is to consider why the incumbent might want to use a tariff to give the help when he can give a production subsidy, this is an easy way to achieve the objective. Because of this order, the tariff does not cause additional distortions in production; it keeps the distortion in production at the level previously caused by the production subsidy, though it causes distortions in consumption. Since the tariff revenue is used to finance the production subsidy (with the tax revenue from the income tax), higher tariff means lower tax rate. Lower tax is good news for all people. But because the income elasticity of good Y is less than one, how good it is is different among people with different income levels. Regarding the tax rate, I consider two cases; the case of single uniform tax rate and the case of different tax rates for two groups. 3.3.1 Single tax rate The incumbent will help capitalists? In this section, I consider if the incumbent would use the tax to provide the production subsidy to capitalists. I assume, for now, that the incumbent does not use the tariff. Then, the utility differences of capitalists 91 and workers are defined as follows. Utility difference: :1: S _ :1: f _ f dK(t)=V((1——t)(p +1313” l,p*)—V(p f(l ) l 19*) dL(t) = V(1.p*) - V(1-t,p*) Notice that they are defined as the subtraction of the utility of the non-preferred candidate’s policy from the utility of the preferred candidate’s policy. When the government uses only the income tax to provide the subsidy, its budget constraint is sf (l ) = t(7r + N L ) Taking the derivative of this with respect to t, it is obtained that gig—n+NL fiom§£=0 ds , 217“” + Sf (”as dt ‘ Bl The equilibrium condition for production is, (p* + s) f ’ (l) - 1 = 0. From this, fl = __f’__ = .92 ds (10* +s)f” f” Thus, ds_ L _ (_f’fi_1_(rr+NL)f” Zi_(7r+N )(f S f”) _ff”-S(f’)3 f(l)—8 d d,{{(t)=l/IK —7V7r—K+—N—%t—} fromazlE =0 dt dflt): 1L3; K K Then, the numerator of the derivative of W with respect to t is, ds 1 7r f(l)— L 1L K K __ _ dt N o N NKc’(NK¢K)VI { NK + (1 t) K } 1 ds _NK KNL L_ Q5 NLd<¢LNL)V’ dt f L When t = 0, it should be that s = 0 and l = If, so _8_s|t:0 = W—j—iV-fi where If at f(lf) and 7rf are l and 71’ at free trade, respectively. Thus the above expression becomes, 92 1 NL 1 K vK__ NL L _NK KNL vL NKc’(¢KNK) I NK 4’ (’5 NLduLNL) 1 Moreover, when t = 0, dK = (11’ = 0. So, ¢KNK = gbLNL and c’(NKgbK) = c’ (N LgbL ) Thus, the expression becomes, K 1 V, NLNquLc’(NL¢L)[NK— NL] If this is positive, the incumbent would help capitalists with the production sub- sidy. Lemma 3.1. The incumbent will raise tax to help capitalists if the following condition holds at free trade vK vL W _ N_I, > 0 (3.1) VK VL Since N L > N K , it is likely that N K NI L > 0 unless the marginal utility of income decreases very fast. Especially if the marginal utility of income is constant, it is positive. Thus, the incumbent has an incentive to help capitalists. Though the logic of this result is a very simple and intuitive, this shows the importance of considering the voter abstention. The tax and substitution scheme I am considering here means that taking money from a larger group and giving it to a smaller group. Because of diflerences in group sizes, the amount of money giving to one person of the smaller group is larger than the amount of money taken from one person in the larger group. Thus, naturally, the money given to the smaller group can have bigger impact on number of votes than the money taken from the larger group. If 10 cents are taken from everyone in a group with 100 million people, it is unlikely that the 10 93 cents could cause the huge‘increase of votes from the group. However, if everyone can gain 100 dollars by going to a voting booth, that could cause a big increase of votes from a group with 100,000 people. Of course, this is same to the usual logic of special interests, so, anything but new (Persson and Tabellini (2002), pp.159).) However, if I assumed that everyone votes, this logic could not hold here since 100 million people would go to voting booths because of their loss of 10 cents! The logic would not hold anymore. Clearly, this first Lemma shows the possible impact of the voter abstention on the policy choices of politicians. How will incumbent help capitalists? In the previous subsection, I showed that the incumbent is likely to raise the tax and provide the production subsidy. However, he can also use a tariff to help capitalists; no need for him to stick only to the production subsidy. But, if he uses both instruments, he would need to consider how they would affect voting behavior of potential voters24. As I said before, I consider is (17' a given total level of H E s + r > 0, the higher tariff means the lower tax rate. The the change of the tariff and the subsidy such that = —1, i.e., dH = 0. Thus, for government budget constraint is now sf (l) = t(7r + N L ) + rm. Since dH = 0, higher tariff does not change the production level of good Y, the labor employment for the OZ 8 _ production of good Y and the profit. Thus, 5-; = 57’; = 0. Let I E 7r + N L be the 0 average and total income. Then, d7 = — I. T . The effect of higher tariff on utility: The higher tariff hurts workers, but it could benefit capitalists if the tariff is not too high. This is possible since the income elasticity of good Y is less than one. 94 Because of this, the consumption share of workers of good Y is higher than their income share in the total pOpulation. Thus, the shift from the tax to the tariff is beneficial for capitalists even if dH = 0. Given the level of H, the level of s is derived from r. Then, from the government budget constraint, the level of t can be derived from 7'. Thus, a new indirect utility function of r and H can be defined as (p*+s+r)f(l)—l NK VKU:H)=V((1—t)IK,p*+T)=V((1'—tl 40*“) VL(T:H)=V(1—t,p*+r) The derivative of VK with respect to r is, * VK yL yK IK (9m I I I N y 8T The sign of this depends on the inside of the bracket. It is natural to assume that ('9 37: < 0. However, since 1K > I L = 1 and the income elasticity of good Y is less than 31" yK one, 17' — 17 > 025. Thus, the inside of the bracket could be positive. Especially when r = 0, this is definitely positive. Thus, when H = s + 7' > 0, capitalists prefer the positive tariffs to zero tariff. The derivative of VL with respect to r is, “L VIL KLKyL :9 VT ldH=0=————N I I lI—L-Tfi-N—K‘éfgl The inside of the bracket is positive. Thus, this is negative. Not surprisingly, workers always prefer no tariff. The utility differences Then, the utility differences are defined as follows. dK(T:H)=VK(T:H)—VK(020) 95 dL(r : H) = VL(0 : 0) — VL(T : H) Their derivatives with respect to 7' are d5 = VTK and df’ = —VTL. K K The effect on N ¢ NL¢L K K The derivative of —L- with respect to r is, 1 NL¢ K K K L L_ K K L L L ———(NL,L)2[N caddTN i N ¢ N m] 1 LV_IK LLLL__ VIL KLKKIK 31K =.——— N I N -——N I N ——-—— 07” VIK LLK VIL KK ___... -———~——N +7"019m!“ f” I +an) f l] where c(oI) E c(No‘qfia) At T = 0, this becomes, L ___IKNKNLZI L 31K L¢L_ VI NK¢K) I(NL¢L)2 (IL IK —)(NKc’(K' ) NLC'(L) L K From the less-than-one income elasticity and the single tax rate t, %L — g—K- > 0 Thus, the sign of this 18 equal to the sign of m LgbL— mNKQSK. This can be rewritten into, K L 1 VI 1 VI * K K L L _ ( ) N Cb N d) [NK¢K NKc'(K) NL¢L NLc’(L)] Thus, the condition for the incumbent to raise the tariff from zero is, 1 VIK _ 1 V} NK¢K NKc’(K) NL¢L NLc’(L) ¢> NL NquL/VIL > C’(K) NK NKQK/VIK C/(L) >0 This condition can be denoted in terms of elasticities of number of voters with respect to the utility differences, and- L L L L K L N d WI > in g, and > End K K K K K K K L L rL N d /VI and N d /VI N d ”I 96 At first, this looks a bit weird. If dL is very small and dK is very high, then the left side of the left inequality would be very small and would make it diflicult for the condition to hold, even though it sounds like a good condition of the tarifl increase for the incumbent. Looking at the definition of the elasticity, however, it is clear that the sizes of ds do not matter since as have d inside, too. Moreover, unless the marginal utility of income decreases quite fast, the left side cannot be so small. Notice that Nada ° th ’ 1 ' ' t f ' Th NLdL/VIL s 088 0 am ms 0 o . e , V10 is e group a r g in er me me n NKdK/VIK ratio of the loss of workers over the gain of capitalists. If the transfer from workers is the to capitalists is done efficiently, the ratio should be one. However, because of the inefficiency (i.e., dead weight loss), the ratio is likely to be more than one unless the marginal utility of income decreases very fast. Especially if the marginal utility of income is constant, the left side should be more than one from the dead weight loss (this dead weight loss comes from the production subsidy since I am considering this at r = 0.) Then, the condition says if the voting response of workers to their utility loss is not so high compared to the voting response of capitalists then the incumbent would raise the tariff. One simple way to understand the condition is this; assuming that the constant marginal utility of income, the condition above is equal K NL/NK to end—d—K— At T = 0, the dead weight loss from the tarifl is negligible, > E n ‘15? so, the marginal transfer rate of the loss of workers to the gain of capitalists by the tariff at r = 0 is one; one dollar loss of group L means one dollar gain of group K. Suppose that every worker loses one dollar from the tariff increase. Because of L the group size difference, this means N—K- dollar gain for every capitalist. Thus, Zi—L— can be considered as the ratio of the change in the worker’s utility difference, and 97 NL/NK dK as the ratio of the change in the capitalist’s utility difference. Then the elasticities of votes are multiplied to them respectively. Thus, the condition simply requires that the increase of the votes from capitalists is larger than the increase of the votes from workers. Proposition 3.1. Suppose that the incumbent has decided to help capitalists. If K L ___—___— > _— NKdK/VIK NLdL/VIL at 'r = 0, then the incumbent will use the tariflr to help them ( maybe with the produc- tion subsidy.) NL ¢L NK¢K -—a—T-— = 0 defines the optimal value of the tariff for the incumbent, it is unlikely to be solvable analytically. So, I consider the different characterization of Though r the optimal tariff rate. Let 5d, 2 (12%. From (*), the condition for the optimal tariff is, NKag‘dJYNLhL — NKaKNthdI; = 0 KK LL ®N¢N¢[5KEK—EL5L]=0 7. nddr nddr Of course, this means that the impact of tariff on the voting from capitalists is same to the one on the voting from workers. Corollary The condition of the optimal tarifi' rate for the incumbent is, K K _ L L Erictgd’r _ Endedr 98 3.3.2 Different tax rates In the previous subsection, I assumed that the incumbent has only one tax rate for two groups. In this section, I assume different income tax rates for different voter groups. This changes the result obtained above hugely. I call tax rates of capitalists and workers as tK and tL, respectively. However, to keep the previous model unchanged as much as possible, I denote those two tax rates tK and tL in terms of t and tax burden share h E [0, 1]. So, tKrr = ht(7r + NL) #NL=(r—mnn+NL) Thus, the total income tax payment is still t(7r + N L), but capitalists pay the share h of that total income tax payment and workers pay the share 1 — h of it. Then, tK=mm+N% 7f tL = (1 — h)t(7r + NL) NL I assume that the incumbent can choose the tax burden share h freely. This means that in the previous subsection there was a restriction that tK = tL, which means h = {IV—L. Of course, the change in h affects utility gains and losses of capitalists II and workers. Thus, given the total income tax t(7r + N L), the incumbent will choose the rate h that maximizes {Xi—(15:: Since change in h does not change the total income tax payment, no change in s happens. So, I do not need to worry whether the change in h might change it. In the following, firstly, I consider what rate of h the incumbent would choose for a given level of t(7r + N L). Then, given that h, I consider if and how the incumbent 99 would provide the protection to capitalists. The determination of tax rates In this section, I assume 7 = 0, again. Thus, H = s. Then, given the tax rate t, the indirect utilities of capitalists and workers of h are given as follows. 7f— 7T L VK=V< ”32;“ ).p*) L_ _ 7, L VL(h:t)=V(N (1 152“ “V ).p*) From these, utility differences, ds, are derived. Then, the incumbent has to find NK K K K —. The derivative of _— NL¢L NL¢L the optimal level of h for him to maximize with respect to h is, ————l L [NchfifdffNquL — NK¢KNL¢5d£1 (NW) )2 _ M _ L L__V_IK_ K Ki ’” (NquL)2[ N ¢ NKd+N f NLdl The incumbent chooses the value of h such that the inside of the bracket is zero or corner solutions. However, the corner solution h = 1 does not make sense since this means that capitalists pay all of the production subsidy they receive (in this section I assume 7 = 0.) Thus, the possible choice by the incumbent is h E [0, 1). Lemma 3.2. Given t, if the incumbent can choose the tax burden share h freely, he will set h at the level such that VIK > VIL NKNK¢K6(K) ‘ NLNquLc’(L) 100 In terms of elasticity and: this condition is equal to K L End and NKdK/V,K — NLdL/VIL By assumption, N chLc' (L) = NK chc' (K) when t = 0. Thus, the condition VL VK becomes figfi at t = 0. So, unless the marginal utility of income decreases very fast, the strict inequality should hold. So, when t = 0, it is likely that h = 0 (strict inequality means that the inside of the bracket above is negative.) Moreover, KT L e 5 it becomes ——-—”d——— > ——"—d—, which is same to 3.2 that I obtained as the NKdK/VIK NLdL/VIL ( ) condition for the choice of the positive tariff in the previous subsection. Given the result about h above, then, will the incumbent decide to raise the tax to help capitalists? The Lemma 3.3 gives a simple condition for that. If L V V Lemma 3.3. If -—If > Ji- at t = 0, then the incumbent will choose the tax rate N N t > 0 with tax burden share h = 0. The proof is in Appendix. Since the objective of the tax is to provide the help to capitalists, h = 0 is a natural result. Notice that the condition in this Lemma is the same condition for the incumbent to raise tax in the case of the single tax rate. The logic for this result is also same. Moreover, I showed the same condition for the positive tariff holds. However, these same conditions imply one important difference about the tariff. Given h = 0, there is no incentive for the incumbent to raise the tariff rate. In the previous single tax rate case, capitalists prefer low but positive tariffs 101 to zero tariff since the positive tariffs mean lower tax burden for them. Though the higher tariffs have the negative eflect on their utility, as long as the positive effect of the lower tax outweighs it, capitalists prefer the higher tariffs. Since h = 0 means capitalists do not pay tax, there is no positive effect of the higher tariff for capitalists. So, the negative effect cannot be outweighed. Of course, the positive tariff rate means some transfer of the burden from workers to capitalists. So, it might be possible that workers would prefer the positive tariff to zero tarifl. Given h = 0, the derivative of worker’s indirect utility with respect to r at 7' = 0 is .. vL yL yK VTleH=0 = —FIL'NKILIK1— — —l IL 1K yL yK Because of the less-than—one income elasticity of good Y, it is W — I7? > 0. III" yK If the tax rate is low, the inside of the bracket, I—L — —I—K—, is still positive. If the tax yL yK rate on workers so high that If — 717 < 0 (remember I a is before-tax income), it could be positive. However, it still can be shown that positive tariffs will lower NK K VK vL ————L— because of I K 2 I L . Thus, there is no reason for Ma NKNK¢ c’(K) NLNLq’) c’(L) the incumbent to use the tariff. Proposition 3.2. Suppose that the incumbent can choose the tar burden share VK VL freely and that N176 > i at t = 0. Then, even when the incumbent use the production subsidy to help capitalists, he will not use the tarifi. The proof is in Appendix. In the previous single tax rate case, the condition K L and End . . . . . L gives the reason for the incumbent to raise the tariff since ___—___. > ___—___... K K L L NKd /V, N d /V, the tariff increase causes negligible dead weight loss at r = 0. By raising the tariff, not by raising the tax and subsidy, the incumbent can help capitalists with the lower 102 dead weight loss. However, if he could change the tax burden share h, this is a better instrument for the incumbent to provide the help since the burden share change does not cause any distortion at any level. Thus, when the incumbent can choose the tax rate freely so that no tax for capitalists is the Optimal for the incumbent, there is no need for the inefficient tariff. However, this result depends on the assumption that the workers are homogeneous. If they are heterogeneous, the incentive for the incumbent to raise the tariff might appear again. I consider this in the next section. 3.4 Three groups: heterogeneity in workers In this section, I extend the previous model so that it has three groups. Three groups consist of one capitalist group and two worker groups. The difference between two worker groups is the income level. Denote two worker groups as group M and group P, with their group sizes NM, N P and NM + N P = N L. I assume workers in group P has one unit of labor and workers in group M have 1 + _w_ w > 0, units of labor. NM’ Thus, workers in group AI are richer than workers in group P, though they are still poorer than capitalists. A natural interpretation is that group AI is middle class and group P is poor workers. The total income of two worker groups is N L + w. The model in previous section cannot incorporate three groups smoothly. To achieve that, I assume that the group M and P are subgroups of the group L. Let NM . . m E N77. For a given level of voting cost 0, I assume that among workers who have voting cost equal to or lower than c, m of them are group AI workers and 1 — m of them are group P workers. Horn this assumption, the following voting cost functions 103 for group M and P are defined. NM¢M Voting cost of group M for the N M ¢M -th voter: c( m ) = c(NquM) P P Voting cost of group P for the N P (bP -th voter: c(—1——95—-) = c(NquP) — m Since the voting rate (b is determined in the same way, d — c = 0, the derivatives of qbs of group M and P are defined as follows. a 1 53% = m for a = Al or P, where c’(a) = c’(NLng) M M P P Then, it is defined that ¢L E N ¢ + N ¢ . In addition, I assume that group NL M and P share same random variable ”L. This means the objective of the incumbent is still which is equal to W’ N M (W + N1D qu ' Though the total income of workers is now N L + to, not N L , with these new assumptions, the change from the previous model is minimum from the viewpoint of the incumbent about what he should do regarding capitalists. In this section, I do not consider if the incumbent decides to help capitalists or not. There should not be any qualitative difference from the results in the previous section about this. Thus, I consider only if the incumbent would use a tariff when he decides to help capitalists, and about its implication. In addition to above assumptions, I also assume that the preference is special one; U (11:, y) = :r + u(y), no income effect on good Y and the marginal utility of income is one. I keep the assumption that the incumbent can choose tax rates freely. 104 3.4.1 The optimal tax rates Before I check if the incumbent would use a tariff or not, I consider what is the optimal tax burden shares for the incumbent. Same as the subsection considering the same problem in the case of two groups, I assume 7' = 0 in this subsection. Let t“ be the income tax rate of the group a E (K, M, P}. Then, since the incumbent can choose the tax rate freely, given a total tax payment from three groups, it is obvious from the previous section that he imposes tK = 0 on capitalists. This is a natural result considering the objective of the tax. The implication of this is that KK the numerator of the derivative of W with respect to tK with corresponding changes in taxes on other groups so that no change in the total tax payment is N L (bLN K (bfdg; — N K ¢KN L (155’de < Oat tK = 0 (I ignore the case that the equality holds at tK = 0.) Since the group L now consists of two groups, it is not clear what dL is. However, since N K d5, is the change in the tax burden of capitalists as a group, NLdtLK /NKd{1{{has a clear interpretation as the ratio of marginal transfers from capitalists to workers. Since the marginal utility of income is fixed at one and the total tax payment from three groups does not change so that no change in subsidy and no change in the production of good Y, this ratio is one: NLdtLK /NKdtI,{( 2 (here I am considering this with r = 0.). Thus, NquLaf,‘ — NKhKagNLde/Ndef. > 0 => NLthg‘ > NquKaf; both dtlk and dig are negative. Substituting partial derivatives of gb into this, 1 1 NM NP NKNquKc’(K) > NLNL¢L{NLc’(M) + NLc’(P)} 105 85d {NM¢M5 51% NquP EPd NKdK > NLgbL NLdM NLgbL NLdP} <=> vK I > NKNKch’(K) Thus, a formula similar to the one in the previous section (i.e., vL 8K ELd I End 41) > —— 15 obtained. Then, what 18 theo timal NLNL¢La(L) NKdK/V,K NLdL/VIL) p tax burden share between group M and P for the incumbent? Notice that since the NK¢K NM (151” + NP¢P burden sharing among group M and P does not affect capitalists, the optimal share and since the tax objective of the incumbent is the maximization of h is the one such that minimizes N M ¢M + N P qu . Here, I define t and h, again, and I use them to denote tM and tP. Now, t is the tax rate on the total income of worker groups N L + w, and h is the tax burden share of group M. Then, ht(NL + w) = tM(NM + w) (1 - h)t(NL + w) = tPNP From these, I can obtain tM and tP in terms h and t. L NM+wL NP K K By substituting these into indirect utilities and taking a derivative of W, the next Lemma can be obtained. Lemma 3.4. If the incumbent can choose taa: rate freely, he will choose tarz: rates such that l I ATKNK MKC’( ) >NLNL¢LCI(M) The proof is in Appendix. 3.4.2 Will incumbent use tariff? Now I consider if the incumbent would use a tariff or not. Thus, I consider a situation that the incumbent is using the income tax to provide the production subsidy but K K not using a tariff. Then, if the derivative of W with respect to 'r at r = 0 is positive, the incumbent will use the tariff. Since the tax rate on capitalists is zero, there is no reason for capitalists to prefer positive tariffs to zero tariff. However, because of the heterogeneity in workers, workers in group M prefer positive tariffs to zero tariff. Even in the case of two groups with zero tax on capitalists, there is some possibility that workers would prefer positive tariffs to zero tariff if tax rate is very high. Here, such high tax is not required. The logic behind this is same as the one behind the case that capitalists prefer low but positive tariffs to zero tariff. Since workers in group M have higher income than workers in group P, their ratio . . yM . . . of good Y consumption to income, I—M—, is lower than the ratio of workers in group P. Thus, raising the tariff rate is equal to the transfer of the tax burden from group M to group P (as long as the tariff rate is low.) Moreover, since capitalists is not paying tax, the positive tariff is also equal to the transfer from group M to group K. Thus they have a reason to prefer low but positive tariffs to zero tariff. Of course, capitalists and group P workers do not share the reason. However, as long as workers in group M prefer positive tariffs, it might be possible for the incumbent to use tariffs 107 for his advantage. I am going to consider if this is really the case or not. Of course, I consider the change of T with dH = 0. The indirect utilities of three groups are defined as follows. K_ II NL+w NM+w VM=V l—ht ,* 1—htN +w Derivatives of them with respect to T at T = 0 with dH = 0 are, K K KVPK K VT =Vp =VI {f—IR-=—y T _ NM p <1—h) t > t . Particularly, t — 0. The pOpulation of voter groups is N K < NM < Np. Then, suppose that the incumbent is giving the help to capitalists without using a tariff. Then, the numerator of the derivative of NK¢K with respect to T at T = 0 is, 617}; (NM¢M+NP¢P)_NK¢K( 617M + d5 ) C’(K) 6’0”) 6’0”) If this is positive, the incumbent would increase the tariff from zero. Because of the logic for the single tax case in the precious section, higher tariffs reduce the burden of the help on capitalists. This increases dK , so d5 is positive. Thus, the first term is positive, i.e., c’tfi) (N M (bM +N P ng ) > 0. The same logic says the burden on the group P voters rises. So, at? is positive. The effect of the tariff increase on d9”! is ambiguous in general. From the logic, we know a part of the burden on capitalists is transferred to group M voters. But the same logic also says a part of burden on group M also goes to group P. The sign of div! depends on which effect is stronger than the other. However, if NM is much larger than N P and much smaller than N P , d4” would P be negative. Moreover, if limd_,0 c’(d) = 00 (like the Inada condition), T 0 -c’—(_P) = since t1D = 0. Thus, the above value is positive. The incumbent would use the tariff. The voter abstention and different group sizes give an incentive for the incumbent to redistribute resource / income. If he can use policy instruments that produce less dead weight loss, there would be no reason for him to use other instruments that produce more dead weight loss. However, if he cannot use policy instruments freely, he would use the inefficient instruments. In the setup of this chapter, if the regressive tax rate 110 system is not possible, that might induce the incumbent to use the inefficient tariff, as shown in the the example above. Since the reason that the regressive tax system is not possible in real world is the sense of fairness in the tax system, this might be able to be said as a trade-off between efficiency and fairness. However, the main problem is the fact that the voter abstention gives the incumbent the opportunity to manipulate votes. If all potential voters vote, the incumbent would not help capitalists. Here I state the main Theorem. Theorem 3.1 If the incumbent can choose tax rates freely so that he can choose even regressive tact: rates, he will never use the tariff to help the domestic industry. In this sense, the inefiiciency of the tariff is the price of the fairness in the tax system. 3.5 Literature Review There are other papers that consider why tariffs are used. Generally, however, what they do is the comparison of economic welfare between the case that a tariff is used and the case that a subsidy is used; they show situations where the tariff achieves higher welfare than the subsidy. Rodrik (1986) shows that since the tarifl has the character of public goods, the lobbying effort for the tariff is under-supplied, compared to the lobbying effort for the firm specific subsidy. So the distortion under the tariff could be lower than the one under the subsidy. Mayer and Riezman (1987) show that when the country is small and voters are heterogeneous in factor endowments, voters prefer the production tax cum subsidy to the tariff. So, if voters will vote on trade policy like in Mayer (1984), the tariff will not be chosen. But in cases that 111 the country is large or that voters are heterogeneous in both of factor endowment and consumption preference, they say it is possible for all or some voters prefer tarifl. Wilson (1990) shows that the efficient instrument can invite more lobbying pressures so the total distortion can be larger for the efficient instrument than for the inefficient one. Since politicians care about the level of total distortion which could affect voters’ sentiment toward politicians negatively, there is an incentive for politicians to restrict the possible policy instruments to the inefficient ones. Mayer and Riezman (1990) consider several cases in which some voters prefer tariff to subsidy, like the ease of heterogeneous voters, the case of risk averse voters in the uncertainty about tariff rate and tax rate, etc., though they do not show how such voter preferences will be reflected in the political process of choosing policy instruments. In this chapter, preferences on instruments are also different among different groups. The reason for such difference is similar to the one in one section of Mayer and Riezman (1990)26. Naito (2006) shows that given a reasonable information constraint on government, when government wants to design a tax system to redistribute income to unskilled workers, tariffs on unskilled labor intensive goods could be Pareto-improving. Though they do not consider about tariffs directly, Coate and Morris (1995) con- sider why inefficient instruments are used by government in a model with uncertainties about two things; one is about the outcome of public project, and another about the type of the incumbent politician. The incumbent politician has information about the project outcome and will face re-election. Good type politician will carry on the project only when the information he has tells him that the project has high proba- bility of producing good outcome. Bad incumbent wants to transfer money to special 112 interests. He can do that through the direct subsidy or the public project. Voters update their reputations on the type of politician based on what the incumbent does and what its outcome is. So, to increase the probability of winning the reelection, the bad type politician chooses the project over the direct subsidy (this gives him higher utility than the project) so that he can pretend a good type believing that the project has high probability of success. Acemoglu and Robinson (2001) consider a two-period model with a special interest group whose power source is the number of voters belonging to it. In multi-periods models, policy commitment by politicians is an important problem. If it is not guaranteed by an assumption, it needs to be guar- anteed by some mechanism (Harrington (1992)), or no policy commitment (Alesina (1988)). In the model of Acemoglu and Robinson (2001), the special interest prefers the inefficient policy to the efficient one since it distorts newcomers’ incentive toward joining to the special interest, which guarantees the power of the special interest in the second period and secures the commitment by the government to the transferring resource to the special interest in the period. Above, I mentioned papers in which there is no intrinsic difference in the difficulty of policy implementations. If there is difference in the difficulty of policy implemen- tations (for example, the cheaper collection cost of tariffs and the higher cost of other taxes), however, it is actually no wonder that governments would choose the pol- icy that is easier to implement. Gordon and Li (2005a) argues that governments of developing countries rely more on tariffs as their revenue sources than the ones of deveIOped countries because it is more difficult to monitor economic activities of do- mestic private firms in deveIOping countries than in developed countries. Gordon and 113 Li (2005b) claim that the model of Gordon and Li (2005a) fits actual data better than the model of Grossman and Helpman (1994). Even in current develOped countries, the tax collection used to be a difficult problem in the past (maybe still is.) Gard- ner and Kimbrough (1992) applied to US. this line of explanation about tariffs27. Though this chapter is different in many aspects from this collection cost type ex- planation, there is a common element; politicians choose the policy instruments that have lower political costs. I considered the maximization problem in this chapter. But votes for the challenger at the election and the loss of votes for the incumbent can be considered as the political costs of the policy instrument choices. Then, the same problem can be considered as a political cost minimization problem, like the expenditure minimization problem28. Then, difference in the nature of the cost could have a big implication for welfare (maybe even a benevolent planner might decide to use tariffs when the collection costs of taxes are high. But will the planner use tariffs for the political cost reason when there is no intrinsic cost difference between tariffs and taxes?) But from the viewpoint of each politician, whatever the source of the cost of a policy is (collection costs or loss of votes for him, etc.,) they are political costs of the policy he wants to implement. So, politicians must try to minimize those political costs. In this sense, I think this chapter and the collection cost explanation are in complimentary, looking at different type of costs, not contradicting each other as the explanation about why tariffs are used. As I wrote in previous sections, the idea of the paper is that the incumbent can take advantage of the voter abstention by redistribution costs and benefits of helping a domestic industry. From this point, this chapter is related to the literature 114 of the Vote Buying/Turnout Buying. Nitchter (2008) analyses turnout-buyings in Argentina; political party rewarding potential voters who favor the party to its rival party for voting for the party. Even though they favor the party, they might abstain from voting without rewarding. Vote Buying is the Opposite; rewarding voters who do not favor the party for voting for the party. Though the direct rewards for voting is different from what I consider in this chapter (choice of policy instruments), similar elements exist; the Optimal choice of policy instruments (or the optimal distribution of rewards) could increase votes for the incumbent from people who benefit from the policy of the incumbent and decrease votes for the Opponent from people who suffer from the policy. 3.6 Conclusion of Chapter 3 In this chapter, I considered why the incumbent might decide to use a. tariff to help domestic industry. What important in this chapter is the voter abstention. After the incumbent makes the decision about the help, he will face the election. So, he takes the election into his consideration when he chooses policy instruments. Because of the voter abstentions, he might decide to give the help. Then, if there is no restriction on the choice of efficient policy instruments (the tax and the production subsidy here), the incumbent would not use the tariff for the help. However, its outcome does not look politically feasible since it involves the regressive tax rates. So, if the regressive tax rates are not feasible because of some restrictions on the tax system, the incumbent would use the tariff to take advantage of the voter abstention. Because 115 of the voter abstention, there is a problem similar to the one under the externality; the undersupply of the good policy. Though no tariff is better for national welfare, this would not matter for the incumbent since he might not be rewarded fully for the good choice because of the voter abstention. So, the incumbent must try to take advantage of the voter abstention by putting more costs on people who do not vote. When there is the political restriction on the possible policy instrument choices, this could lead to the choice of tariffs. For the incumbent, it is the politically low cost distribution of the burden of his policy. The less-than-one income elasticity is just one example of the sources for the voter heterogeneity which the incumbent can exploit. Since it means normal goods, real world examples are easy to find. Moreover, what I need in this chapter is actually the consumption pattern difference among diflerent income groups, which is further easier to find. As an example related to trade restrictions, consider the voluntary export restriction of Japanese automobiles in 19803. In 19803, Japanese cars were mostly small to mid size cars, no luxury cars. Thus the rich’s expenditure share of Japanese cars in 19803 should have been much lower than the middle class and working class people’s share29. Though it is not related to trade, state lotteries also can be considered as such a case since it is an easy way for politicians to generate revenue without offending voters with higher taxes. Garrett and Coughlin (2008) estimate that the income elasticity of state lotteries is less than one. The model in this chapter has many moving parts, which might be confusing. So I briefly explain about the relationships of some important assumptions and results 116 here. The challenger as Free-trader: The incumbent might not be rewarded for free-trade policy, which discourages him from pursuing the free-trade policy. He is not be rewarded not because his challenger is a convinced free trader (there would be no difference between them if the incumbent chooses the free trade policy.) Even if the challenger was a protectionist, the results in this chapter still would suggest that the incumbent might have an incentive to choose the protectionist policy. If choosing the fiee trade policy against the protectionist challenger were politically beneficial, he would not have chosen helping the domestic industry in this model. Endogenous decision of abstention: This is the hinge of the idea in this chapter. Without this, the incumbent does not have the power of manipulating votes. What important is that abstention rates vary depending on the utility differences. Suppose that the some voters abstain, but the rates of election participation among groups are fixed. This is basically same to the case of no voter abstention with the different composition of population. To understand this, consider the following situation; ratios of group populations over total pOpulation are 50% for group P, 25% for group M and 25% for group K. Moreover, suppose that the participation rates are 50% for all groups. Then, ratios of actual voters over total actual voters are same to population ratios. So, this is equivalent to the case of no voter abstention with the half population size. Since group M and P always prefer challenger, the incumbent never decides to give the help. Even if participation rates are different among groups, it is still equivalent to a case of no abstention. If the participation rates are 10% for group P, 40% for group M and 100% for group K, it is same to a case of no 117 abstention with population ratios of 12.5% for group P, 25% for group M, and 62.5% for group K. In this case, the incumbent always gives the help. What important for this chapter is that voting behaviors are influenced by utility differences, which can be manipulated by the incumbent. Non-homothetic preference: The incumbent can manipulate voters’ utility diflerences because of the non-homothetic preference. If preferences are homothetic, he could not do that. However, what important is not the non-homothetic preference itself, but the existence of some factors that can give the manipulating power to the incumbent. The results in this chapter depend on strong assumptions. However, the basic logic is more general. For the logic, specific assumptions, like the less-than—one income elasticity, are not important. If there are more than one policy instruments and there is heterogeneity, similar results could be obtained. Because of the voter abstention the incumbent politician can put the cost of his policy on voter groups whose voting rates are inelastic. Similar to the argument of the Ramsey rule about minimizing the dead weight loss from commodity taxes, the incumbent can minimize the political cost of his policy (votes for his challenger) for the same level of political gain (votes for him) by putting cost on the voting—inelastic voter groups. Then it is natural to assume that the incumbent wants to use the most efficient policy instrument (e. g. the tax and production subsidy) for that purpose. However, political restrictions, like the requirement for fairness, might prevent the use of the most efficient policy instrument at the optimal level for the incumbent (e.g., regressive income tax rates.) Then, it 118 would be beneficial for the incumbent to use less efficient instruments. In this sense, the use of inefficient tarifl could be considered as the price of the fairness in the tax system. 119 Appendix of Chapter 3 K L Lemma 3.3 If -—1— > —I— at t = 0, then the incumbent will choose the tax rate NK NL t > 0 with tar burden share h = 0. Proof. Proof: The derivative of the capitalist’s indirect utility with respect to t is, L d8 —f— h(7r +N )_ht21?f I NK L N As I showed before, 43-— — (7r + N )f dt ff” — s(f’)3 . Thus, NL 1 — ht ” NK ff” - 8(f’) and, (1— —h)L respect to t is, VIK (1 — hon” * K K L L L _ _ K K L L L _V_I_ _ tff” N a N as (7r+N )sgndLNLu h)[1+ff,,_s(f,)3] V,L V,K < this becomes, Since the incumbent chooses h so that , egndLNL—aggdKNK * K K L L L VIK (1—htlff” (1- hltf " ( )ZN ¢ N 45 (MN )EKdKjl’rflfftsv'fi —h—1+h_ff”- s3l V _ II : NKt’KNLtL‘" + NL)eKdII{NK[f(f1” 28103 ‘ 1] When t = 0, the inside of the bracket is zero. This means when the incumbent chooses the corner solution h = 0 so that the strict inequality holds, the derivative of 120 NK K . . . . - __N L: L at t = 0 1S pOSItive. When the incumbent chooses a non—corner solution so L K th VI VI t = NLNL¢Lc’(L) NKNquKc’(K) t = 0. It is easy to prove that the only another value of t that makes the inside of the , the result means the derivative is zero at bracket zero is negative, and it is also easy to prove that t = 1 makes the inside —1. The inside is negative for any t E (0, 1]. This implies that if t > 0 and if the incumbent is choosing a a non-corner solution for h so that the equality holds, he would have V,K > V,L NKNK¢Kd(K) NLNL¢LcI(L) at t = 0, the incumbent has an incentive to raise the tax rate t. The only consistent an incentive to lower the tax rate. However, if combination of t and h is t > 0 and h = 0. As I showed before, the condition becomes K L .‘L>YI_att=o . NK NL ' Proposition 3.2: Suppose that the incumbent can choose the tar: burden share K vL freely and it holds that —I— > —I— at t = 0. Then, even when the incumbent use NK NL production subsidy to help capitalists, he will not use tariff. Proof. Proof: When h = 0, the government budget constraint is sf (1) = tNL + Tm. The derivative of tax rate t with respect to tariff T for the fixed H E s + T at dt f + m T=0isa7= NL. The indirect utilities of capitalists and workers are, 171% : H) = V(IK,p* +T) VL(T : H) = V(1—t,p* +T) Then derivatives of them with respect to T at T = 0 with dH = 0 is VTK = VpK = _ IKyK L . f + m V V! = VIL N1. + VI = —N’L (f + m — NLyL) 121 HereIassume VTL>0©f+m—NLyL>0 K K Then, the numerator of derivative of W with respect to T at T = 0 with dH = 0 is, KK LL ‘WK .KK If LL N N N + + —— N VK VL From I 2 I andf+m—NLyL>0, NKNchKc’(K) NLNL¢Ld(L) _VIK NKyK + VIL (f+m _ NLyL) NKNKch’(K) NLNL¢Ld(L) WK KK LL < + —N —N = I Lemma 3.4: If the incumbent can choose tact rate freely, he will choose tax rates such that 1 > 1 NKNK¢Kd(K) NLNL¢Ld(M) 1 _ 1 d( ) C’(P) Proof. Proof: The indirect utilities of workers in group M and P are defined as follows. NL+w NM+w VM=V1—-ht ,* << NM”) NM 1») _ L w _(1 (1)1131; + hp...) VP=Vu The derivatives of these with respect to h are, VM 2 _fl'fifl h NM h NP Since the help to capitalists hurt them, d5); = —V,:". Thus, the derivative of 122 NMch + NPng with respect to h is, NM t(NL+w)_ NP t(NL+w) NLc'(M) NM NLc’(P) NP NMoyth + Npafdf = _t(NL+w) 1 1 ‘ NL (6W) 3(7)) If h = 0, all tax burden falls on group P. It would be same to the free trade situation for group M but that would induce workers in group P to vote. Thus, c’ (P) < c’ (M) from c” < 0. If h = l, the Opposite would happen. Thus, the 1 incumbent should choose h so that W — d—(1P-) = 0. Substituting this into the inequality I obtain, 1 1 NM NP 1 NKNchKc’(K) > NLNL¢L{NLc’(M) + NLc’(P)} = NLNLch’(M) I 123 Endnotes of Chapter 3 18. According to CNN exit poll (CNN.com (2004)), 68% of voters of 2004 US. presidential election made $75,000 or less, and 82% of voters made $100,000 or less. 19. Though I used the word "lobbying", this is not a lobbying model, so, no campaign contribution from them. 20. This assumption can be changed so that the tariff revenue is distributed to people, not used to finance the subsidy. Such change makes model more complicated, but does not change qualitative results. 21. Of course, the production subsidy that is financed by the tax does cause distortions. 22. Group Rule-Utilitarian voting literature consider more complicated voting decision making (Feddersen and Sandroni (2006), and Coate and Conlin (2004)) than the one I employ here. Their decision-making is, as its name suggests, more group- based. 23. If uKNKgbK — ,uLNquL < 0, the challenger wins. If ,uKNKgbK — uLNLpL = 0, then both candidates have same probability of winning the election. 24. I do not consider the impact of the tariff on the amounts of export and imports. Though the tariff increase with dH = 0 does not change domestic productions, it certainly changes domestic consumptions, which changes the amounts of export and import. However, I simply assume that the change is small enough so that the country is still exporting good X and importing good Y. 25. Because of the income tax, the good Y consumption - income ratio is actually 124 ya / (1 — t)I“. Since the tax rate is same for both group, I can ignore (1 — t) here. 26. The reason is similar, but not same. In the paper of Mayer and Riezman (1990), the preference for tariff comes from the difference of income tax rate and the share of distribution of tariff revenue. In this chapter, the preference comas from the difference of income share and the consumption share of different consumers (in the single tax case.) 27. I thank Professor Wilson and Professor Nelson for putting my attention to those papers and this line of argument. 28. Actually, the previous version of this paper considers the minimization of votes for challenger for given level of votes for the incumbent. 29. Bordley and McDonald (1993) estimate that income elasticities of automobile in any segments (small, economy, luxury, etc.) are more than one. This is incon- sistent with the assumption of this chapter. However, it is natural to suppose that income elasticities of goods change as income level changes. Bordley and McDonald (1993) also estimate that the more expensive the automobile segment is, the higher the elasticity. This is consistent with the conjecture that the elasticity of Japanese economy cars in 80’s was very low for the rich. 125 References [1] Aldrich, John H. (1993): “Rational Choice and Turnout” American Journal of Political Science, 37, 246-78. [2] Alesina, Alberto (1988): “Credibility and Policy Convergence in a two-Party Sys- tem with Rational Voters” American Economic Review, 78, 796-805. [3] Bordley, Robert F. AND James B. McDonald (1993): “Estimating Aggregate Au- tomotive Income Elasticities fiom the Population Income-Share Elasticity” Journal of Business 65 Economic Statistics, 11(2), 209-214. [4] CNN.com. “Election 2004” , http: / / www.cnn.com / ELECTION / 2004 / pages / results / states / US / P / 00 / epolls.0 .html [5] Coate, Stephen AND Stephen Morris (1995): “On the Form of Transfers to Special Interests” Journal of Political Economy 103, 1210-1235. [6] Coate, Stephen AND Michael Conlin (2004): “A Group Rule—Utilitarian Ap- proach to Voter Turnout: Theory and Evidence”, American Economic Review, 94, 5, 1476—1504. [7] Dixit, Avinash (1986): “Gains from Ti‘ade without Lump Sum Compensation” Journal of International Economics,'21, 111-122. [8] Dixit, Avinash AND Victor Norman ( 1980): Theory of International Trade, Cam- bridge: Cambridge University Press. [9] Dixit, Avinash AND Thomas Roemer ( 2006): “Political Explanations of Inefficient Economic Policies - An Overview of Some Theoretical and Empirical literature” Presentation at IIPF conference “Public Finance: Fifty Years of the Second Best -and Beyond” in Paphos, Cyprus, August 2006. [10] Feddersen, Timothy J (2004): “Rational Choice Theory and the Paradox of Not Voting” Journal of Economic Perspective, 18, 99—112. [11] Feddersen, Timothy J. AND Alvaro Sandroni (2006): “A Theory of Participation in Election” American Economic Review, 96, 1271-1282. 126 [12] Gardner, Grant W. AND Kent P. Kimbrough (1992): “Tax Regimes, Tariff Revenues and Government Spending” Economica, February 1992, 59, 75-92. [13] Garrett, Thomas A. AND Cletus C. Coughlin (2008): “Inter-temporal Differ- ences in the Income Elasticity of Demand for Lottery Tickets” Federal Reserve Bank of St. Louis working paper series 2007—042b (revised version). [14] Gordon, Roger AND Wei Li (2005a): “Tax Structure in Developing Countries: Many Puzzles and a Possible Explanation” NBER Working paper No.11267. [15] Gordon, Roger AND Wei Li (2005b): “Puzzling Tax Structures in DeveIOping Countries A Comparison of two Alternative Explanations” NBER Working paper No.11661. [16] Grossman, Gene M. AND Elhanan Helpman (1994): “Protection for Sale” Amer- ican Economic Review, 84, 833-50. [17] Harrington, Jr., Joseph E. (1992): “The role of party reputation in the formation of policy” Journal of Public Economics, 49, l07-21. [18] Harsanyi, John C. (1980): “Rule Utilitarianism, Rights, Obligations and the Theory of Rational Behavior” Theory and Decision, 12, 115-33. [19] Hinich, Melvin J ., John Ledyard, AND Peter C. Ordeshook (1972): “Nonvot- ing and the existence of Equilibrium under Majority Rule” Journal of Economic Theory, 4, pp.144—153. [20] Lijphart, Arend (1997): “Unequal Participation: Democracy’s Unsolved Dilemma” American Political Science Review, 91, 1-14. [21] Mayer, Wolfgang (1984): “Endogenous Tariff Formation” American Economic Review, 970-85. [22] Mayer, Wolfgang AND Raymond Riezman ( 1987): “Endogenous choice of trade policy instruments” Journal of International Economics, 23, 377-81. [23] Mayer, Wolfgang AND Raymond Riezman (1990): “Voter preferences for trade policy instruments” Economics and Politics, 2, 259-73. 127 [24] Nichter, Simon (2008): “Vote Buying or Turnout Buying? Machine Politics and the Secret Ballot” American Political Science Review, 102, 19—31. [25] Naito, Hisahiro (2006): “Redistribution, production inefficiency and decentral- ized efficiency” International Tar: and Public Finance, 13, 625-640. [26] Persson, Torsten AND Guido Tabellini (2002): Political Economics Explaining Economic Policy, MIT Press [27] Rodrik, Dani (1986): “Tariffs, subsidies, and welfare with endogenous policy” Journal of International Economics, 21, 285-99. [28] Rodrik, Dani (1995): “Political Economy of Trade Policy” in Gene M. Grossman and Kenneth Rogofl, ed., Handbook of International Economics Vol.3, Elsevier Science B.V., Amsterdam, The Netherlands, 1457-1494. [29] Shachar, Ron AND Barry Nalebuff (1999): “Follow the Leader: Theory and Evidence on Political Participation” American Economic Review, 89, 525-47. [30] Wilson, John D. (1990): “Are Efficiency Improvements in Government Transfer Policies Self-Defeating in Political Equilibrium?” Economics and Politics, 2, 241-58. [31] Yang, C. C. (1995): “Endogenous Tariff Formation under Representative Democ- racy: A Probabilistic Voting Model” American Economic Review, 85, 956-63. 128 II]]]'I]]I]]]I]]]]T][[16]]!