ESSAYS IN MARKET DESIGN

By

Pallavi Pal

A DISSERTATION

Submitted to

Michigan State University

in partial fulllment of the requirements

for the degree of

Economics  Doctor of Philosophy

2020

ABSTRACT

ESSAYS IN MARKET DESIGN

By

Pallavi Pal

Chapter 1: Quality Dierentiation and Optimal Pricing Strategy in Multi-sided

Markets

This paper analyzes the generalized quality dierentiation model in multi-sided markets with

positive externalities, which leads to new insights into the optimal pricing structure of the

rm. We nd that quality dierentiation for users on one side leads to a decrease in the

price charged to users on the other side, thereby aecting the pricing structure of multi-sided

rms. In addition, quality dierentiation aects the strategic relationships among the choice

variables for the platform, so that the platform strategically uses quality dierentiation to

raise its prots.

Chapter 2: Dynamic Game with Multidimensional Type: The Case of Carbon-

Credit Market

A signicant problem with the carbon credit market that has become apparent in recent

years is that the market price has been far more volatile than originally envisioned. The

underlying problem is the ill-understood pricing anomalies in a repeated period dynamic

setting. In this paper, we drive the equilibrium price path in a dynamic setting and suggests

ways to overcome price instability. The model setup allows the rms to dier in terms of

their value for the carbon credit as well as the urgency of obtaining it. For example, a rm

with an early deadline for obtaining the carbon credits will have a higher demand urgency.

We nd that the equilibrium price is aected by future demand and supply expectations.

The ndings show that the cap or the supply limit for each period can be used to decrease

price instability. Currently, the government or the carbon credit seller decides a per period

limit on the supply, which decreases over time. However, this paper suggests that to curb

price uctuation the per period supply should be a function of expected future demand. We

show that correlating supply rate with expected future demand leads to a more stable price.

Chapter 3: Revenue-Maximizing Number of Ads per Page in the Presence of

Market Externalities

Firms use advertising as a medium to gain a competitive advantage, which is negatively

aected if the ad appears alongside their rival's ada form of externality. The multiple ad

display setting on search engines, such as Google and Yahoo!, introduces such externalities

in the market. In this paper, I estimate a structural model based on a novel data set of

Yahoo! ads to (i) quantify the eect of externality on an advertiser's willingness to pay

and (ii) simulate the revenue-maximizing number of ads for a search engine. First, I nd

that externality depends on the quality and quantity of competing ads. For example, an

advertiser's willingness to pay decreases by 18.5 percent due to the addition of a second

high-quality ad, but only by 0.15 percent due to the addition of a seventh low-quality ad.

Second, the counterfactual results suggest that the revenue-maximizing number of ads per

page diers across the ad product category, with the average being ve ads per page, and

implementing the suggested number of ads would lead to a 4.5 percent increase in revenue, on

average. These results provide evidence in support of recent changes in the online advertising

market; for example, Microsoft introduced a service called RAIS that provides advertisers

with an option of an exclusive ad display.

Copyright by
PALLAVI PAL
2020

ACKNOWLEDGMENTS

Firstly, I would like to express my sincere gratitude to my advisors Kyoo il Kim and Thomas

D. Jeitschko for the continuous support of my Ph.D study and related research, for their

patience, motivation, and immense knowledge. Besides my advisors, I would like to thank

the rest of my thesis committee: Hanzhe Zhang , Jay Pil Choi and Chenhui Guo, for their

insightful comments and encouragement. A special thanks to Mike Conlin, Ashish Agarwal

and David Croson who provided me an opportunity to discuss and give a fresh prospective

to my research.

I would like to acknowledge nancial support from the Department of Economics, Grad-

uate School, College of Social Sciences, and Council of Graduate Students, along with the

numerous fellowships received through the duration of these PhD years that have been instru-

mental in successful completion of this work. The administrative help and support received

from Lori Jean Nichols and Jay Feight has made navigating this PhD program easier than

it would otherwise have been.

I thank my fellow graduate students Riju Joshi, Sambojyoti Biswas, Haoyang Li, Bhavna

Rai, Katie Harris, Alyssa Carlson, Udita Sanga, Katie Bollman, Akanksha Negi, Tram Hoang

and Ruonan Xu for the stimulating discussions, for the sleepless nights we were working

together before deadlines, and for all the fun we had over the last few years. My warmest

gratitude goes to my family  Mom, Dad, Neha, Sid, and Parikshit as well as my friends

back in India  Ojal, Deeksha, Devangana and Ameya for inspiring me to follow my dreams

and supporting me spiritually throughout writing this thesis and my life in general. Finally,

I would like to thank my best friend and life partner Dhruv Gupta, with whom I shared the

joys and challenges of the PhD life and who motivated me to pursue PhD.

v

TABLE OF CONTENTS

LIST OF TABLES . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . viii

LIST OF FIGURES . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

ix

EGY IN MULTI-SIDED MARKETSâ€ 

CHAPTER 1 QUALITY DIFFERENTIATION AND OPTIMAL PRICING STRAT-
. . . . . . . . . . . . . . . . .
1.1
Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
1.2 Model setup . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
1.3 Model
1.3.1 Model with one quality case . . . . . . . . . . . . . . . . . . . . . . .
1.3.2 Model with two qualities case . . . . . . . . . . . . . . . . . . . . . .
1.4 Equilibrium results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
1.5 Discussion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
1.5.1 Quality dierentiation and optimal pricing strategy . . . . . . . . . .
1.5.2 Quality dierentiation and extra markup . . . . . . . . . . . . . . . .
1.6 Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

OF CARBON-CREDIT MARKETâ€ 

CHAPTER 2 DYNAMIC GAME WITH MULTIDIMENSIONAL TYPE: THE CASE
. . . . . . . . . . . . . . . . . .
Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
2.1
2.2 Model setup
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
2.3 Symmetric equilibrium . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . .
2.4 Applications and comparative statics : emission market
2.4.1 Comparison with price control strategies . . . . . . . . . . . . . . . .
2.5 Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

CHAPTER 3 REVENUE-MAXIMIZING NUMBER OF ADS PER PAGE IN THE

PRESENCE OF MARKET EXTERNALITIESâ€  . . . . . . . . . . . .
Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
3.1
3.2 Literature review . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
3.3 Market environment and theoretical Model . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . .
3.3.1 Overview of the search advertising market
3.3.2 Theoretical model . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
3.3.3 Consumers side . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
3.3.4 Advertising model . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . .
3.5 Data . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
3.5.1 Data details . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
3.5.2 Characterizing features of the data . . . . . . . . . . . . . . . . . . .
Identication . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

3.4.1 Bounds on exclusive ad value

3.4 Equilibrium bid analysis

3.6

vi

1
1
5
9
9
12
15
22
22
23
24

26
26
30
33
36
38
40

41
41
45
47
47
49
49
51
53
56
59
59
62
64

3.7.1
3.7.2
3.7.3

3.6.1 Advertisers value distribution . . . . . . . . . . . . . . . . . . . . . .
3.6.2 Consumers click behavior
. . . . . . . . . . . . . . . . . . . . . . . .
3.6.3 Externality estimation . . . . . . . . . . . . . . . . . . . . . . . . . .
3.6.4 Creating market using unsupervised machine learning
. . . . . . . .
3.7 Econometric Method . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Step 1: Estimate consumer click probability . . . . . . . . . . . . . .
Step 2: Estimate the externality eect on the advertiser's bid . . . .
Step 3: Estimate lower and upper bounds on advertiser's externality-
free value distribution . . . . . . . . . . . . . . . . . . . . . . . . . .
3.8 Results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
3.8.1 Consumers side . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
3.8.2 Externality eect . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
3.8.3 Non-linear eect of externality . . . . . . . . . . . . . . . . . . . . . .
3.8.4 Advertiser willingness to pay for an ad and prot margin . . . . . . .
. . . . . . . . .
3.9.1 Results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
3.10 Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

3.9 Counterfactual analysis: revenue-maximizing number of ads

64
65
66
67
69
70
71

73
74
74
75
76
77
78
79
80

APPENDIX A
APPENDIX B
APPENDIX C
APPENDIX D
APPENDIX E
APPENDIX F
APPENDIX G
APPENDIX H

83
APPENDICES . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
84
Figures for Chapter 1 . . . . . . . . . . . . . . . . . . . . . .
85
Proofs for Chapter 1 . . . . . . . . . . . . . . . . . . . . . .
93
Figures for Chapter 2 . . . . . . . . . . . . . . . . . . . . . .
Proofs for Chapter 2 . . . . . . . . . . . . . . . . . . . . . .
94
Figures for Chapter 3 . . . . . . . . . . . . . . . . . . . . . . 101
Tables for Chapter 3 . . . . . . . . . . . . . . . . . . . . . . 117
Proofs for Chapter 3 . . . . . . . . . . . . . . . . . . . . . . 125
Revenue-maximizing number of ads: looking at exible num-
ber of ads . . . . . . . . . . . . . . . . . . . . . . . . . . . . 132
Robustness Check . . . . . . . . . . . . . . . . . . . . . . . . 135
Data cleaning for Chapter 3 . . . . . . . . . . . . . . . . . . 137

APPENDIX I
APPENDIX J

REFERENCES . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 141

vii

LIST OF TABLES

Table F.1: List of variables available for consumers side data . . . . . . . . . . . . . . 117

Table F.2: List of variables available on advertisers side

. . . . . . . . . . . . . . . . 118

Table F.3: Summary statistics for consumers side data . . . . . . . . . . . . . . . . . 118

Table F.4: Summary statistics for advertisers side data . . . . . . . . . . . . . . . . . 119

Table F.5: Features of dierent categories . . . . . . . . . . . . . . . . . . . . . . . . 119

Table F.6: Externality co-ecient across categories . . . . . . . . . . . . . . . . . . . 120

Table F.7: Quality measure across product categories . . . . . . . . . . . . . . . . . . 120

Table F.8: Estimated average partial eect of position on click probability . . . . . . 121

Table F.9: Results for the revenue-maximizing number of ads per page . . . . . . . . 122

Table F.10: Identifying dierent categories

. . . . . . . . . . . . . . . . . . . . . . . . 123

Table F.11: Robust check: externality co-ecient across categories . . . . . . . . . . . 123

Table F.12: Robust check: dierent externality index denitions

. . . . . . . . . . . . 124

viii

LIST OF FIGURES

Figure A.1: Two-sided rm with quality dierentiation . . . . . . . . . . . . . . . . .

84

Figure C.1: Decrease in uctuation due to price ceiling and oor

. . . . . . . . . . .

93

Figure C.2: Price path for carbon credits for RGGI auction . . . . . . . . . . . . . .

93

Figure E.1: Sample of Yahoo! search result page . . . . . . . . . . . . . . . . . . . . . 101

Figure E.2: Timeline . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 101

Figure E.3: Snapshot of an advertiser's account on Yahoo!

. . . . . . . . . . . . . . . 102

Figure E.4: Percentage of clicks across pages . . . . . . . . . . . . . . . . . . . . . . . 103

Figure E.5: The popularity of the product categories for google in the past ve years

104

Figure E.6: No. of keywords specied by advertisers across days

. . . . . . . . . . . 104

Figure E.7: Average click rate and bid across ad positions

. . . . . . . . . . . . . . . 105

Figure E.8: Revenue per ad across bid . . . . . . . . . . . . . . . . . . . . . . . . . . 105

Figure E.9: Mean of bids at dierent winning positions . . . . . . . . . . . . . . . . . 106

Figure E.10: Mean click probability for each ad across position . . . . . . . . . . . . . 106

Figure E.11: Relation between the advertiser's bid and percentage of other high-

quality ads on the page

. . . . . . . . . . . . . . . . . . . . . . . . . . . 107

Figure E.12: Eye tracking Pattern . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 108

Figure E.13: Example of sub category within the laptop category . . . . . . . . . . . . 108

Figure E.14: Keywords for Cosine and Euclidean distance . . . . . . . . . . . . . . . . 109

Figure E.15: The predicted click probability across positions

. . . . . . . . . . . . . . 109

Figure E.16: The aect of choosing higher position on click probability . . . . . . . . . 110

Figure E.17: Ad-Quality . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 111

ix

Figure E.18: Cumulative distribution of Externality Index . . . . . . . . . . . . . . . . 112

Figure E.19: Externality coecient across categories . . . . . . . . . . . . . . . . . . . 113

Figure E.20: Eect of an additional ad on advertisers bid across dierent ad quality . 113

Figure E.21: Diminishing eect of an additional ad on advertisers bid . . . . . . . . . 114

Figure E.22: Upper bound for the cumulative distribution function of the ad value . . 114

Figure E.23: Cumulative distribution of advertiser's ad value . . . . . . . . . . . . . . 115

Figure E.24: Gain in revenue from using the alternate method in appendix(H)

. . . . 116

Figure E.25: Revenue-maximizing number of ads from alternate method suggested in

appendix(H) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 116

x

CHAPTER 1

QUALITY DIFFERENTIATION AND OPTIMAL PRICING STRATEGY IN

MULTI-SIDED MARKETSâ€ 

1.1

Introduction

There has been an increasing shift toward multi-sided rms in many industries. The early

pioneering models of multi-sided platforms were introduced by Armstrong (2006), Caillaud

and Jullien (2003), Parker and Van Alstyne (2005), and Rochet and Tirole (2003).

Im-

portantly, many real-world organizations determine how close or how far they are from a

multi-sided economic model based on changing industrial parameters.

In this paper, we

examine how platforms in multi-sided markets optimally use quality dierentiation as a

business strategy.

As many previous studies emphasize, it is important to establish how multi-sided plat-

forms dier from typical one-sided rms and how such dierences lead to new business

implications. Many markets that have traditionally featured one-sided rms now feature

more two-sided rms due to advanced technology; for example, the taxi industry only had

one-sided rms before Uber appeared. Firms typically begin with an one-sided model and

switch to a multi-sided model as they become more established. Doing so allows poten-

tial platforms to overcome the chicken-and-egg problem by rst providing complementary

goods themselves. For instance, Amazon started o as a pure retailer but has moved closer to

a two-sided model over time by enabling third-party sellers to trade directly with consumers

on its website. We can also nd many other examples in which a rm faces a strategic choice

of how many sides to pursue. For instance, in the personal computer market, Apple produces

its own hardware, whereas Microsoft leaves this to independent manufacturers. As a result,

Apple manages only a two-sided platform with consumers and software providers, while Mi-

â€ This work is joint with Soo Jin Kim and is unpublished.

1

crosoft manages a three-sided platform with consumers, software providers, and hardware

providers. Indeed, there is an increasing shift from traditional business structures in which

the rm focuses only on one side of users to platforms where the rm serves more than one

side of users.

Given this shift toward multi-sided business, the main strategic choice that we analyze

in this paper is the choice of the quality of interaction between users on two dierent sides.

We focus on platforms that enable interaction between two sides, for example, buyers and

sellers. The term interaction" covers various traditional interactions, such as those observed

in auction houses and those on internet sites in person-to-business transactions, for example,

Amazon. This also includes interactions or exchanges between application developers and

application users on software platforms such as computers (e.g., Apple, Microsoft); mobile

devices (e.g., iPhone, Samsung); and video games (e.g., Sony PlayStation, Xbox). These

markets usually have more than one type of quality access on the buyers' side. For example,

Amazon oers buyers two types of quality access; basic (low-quality) access is free, while

premium access (Prime membership, which is high-quality) is the paid service. It is easy to

see that the interactions or exchanges through the high-quality access have better quality

for example, Prime provides two-day shipping. Many ride-sharing services oer dierentiated

quality tiers for customers. Uber provides riders multiple types of quality services. Although

each region may have dierent availability, there are basically low-quality service at a cheaper

price (e.g. Uber X, Uber Pool ) as well as high-quality service at a little more expensive price

(e.g. Uber XL, Uber Select, Uber Black ). For instance, whereas trips with Uber Pool may

take longer time to nish by sharing the same car with other strangers, Uber Select or

Black is a private car sharing service with more luxurious car. Other ride-sharing services

oer similar quality dierentiation: Didi in China provides three quality tiers with Express,

Premier, and Luxe.

These are examples of quality dierentiation by rms serving two sides. Ultimately, this

paper provides business implications of quality dierentiation in the multi-sided platform

2

market by building a simple model of two-sided monopolists in a market with interactions

between buyers and sellers. The rm chooses the price and the quality of interaction for the

buyers' side.

The rst main nding of the paper is that a rm provides higher quality per dollar

to buyers as it serves more sides. That is, when a rm expands its business from one-

sided service (serving buyers/riders only) to multi-sided service as a platform (serving both

buyers/riders and sellers/drivers), it provides better quality to buyers. Intuitively, if a rm

oers a better quality price menu to buyers, it attracts more buyers. When it serves the

sellers' side at the same time, more demand from the buyers' side makes sellers on the other

side earn more revenue. The platform can extract those additional revenues on the sellers'

side, which incentivizes the platform to oer a better quality price ratio to buyers.

We also nd that the quality dierentiation on the buyers' side decreases the price charged

on the sellers' side compared to the price charged by a single-quality, two-sided rm. The in-

tuition behind this nding is related to product dierentiation and demand elasticity. If the

platform provides multiple quality options for buyers, it means that it provides more dier-

entiated products, which makes buyers more inelastic to price changes because the platform

can extract extra surplus from high-quality buyers without deterring buyer participation. As

the buyers' side becomes more inelastic, it creates an incentive for the platform to extract a

higher rent from the buyers' side and subsidize the sellers' side by lowering the price faced

by sellers.

By providing more quality choices, even if the quality dierentiation is small, the platform

is able to obtain more prots. Given that buyers are heterogeneous in their valuation of

product quality, there is an incentive for the platform to oer dierent levels of product

quality at dierent prices to extract more rents from buyers. Following quality dierentiation,

the platform needs to charge a lower price for low-quality buyers. However, it is able to earn

higher extra markup from the high-valuation group of buyers (who will buy a high-quality

product) if the quality gap is widened. Given that the number of high-quality buyers is

3

suciently large, the platform's ultimate prot is greater with quality dierentiation. Based

on the model predictions, we discuss some business implications concerning how quality

dierentiation in one side of the market helps the platform raise its prot.

Related literature There is a broad literature on the corresponding problem of a

monopolistic rm seeking to maximize prots by oering quality-dierentiated products in

one-sided standard markets. The seminal papers are Spence (1977), Mussa and Rosen (1978),

Maskin and Riley (1984), and M. Itoh (1983). We generalize this problem by varying the

number of sides served by the rm.

This paper is also related to the literature on pricing structure in markets with multi-sided

rms (e.g., Rochet and Tirole (2003), Armstrong (2006b), and Reisinger (2010)). Aside from

being broadly related to the literature on pricing in multi-sided markets, papers on skewed

pricing in multi-sided markets are closely related to our paper in terms of our theoretical

implications. Suarez and Cusumano (2008) discuss the platform's subsidy pricing strategy to

attract greater user adoption, although they do not set up an economic model to conrm this

strategy. Bolt and Tieman (2008), Schmalensee (2011), and Dou and Wu (2018) study skewed

pricing strategies in two-sided markets, i.e., the subsidy and money sides. However, those

papers do not consider forms of product dierentiation, such as the quality dierentiation

examined in our paper, as a means of skewing prices.

Regarding markets with multi-sided rms, few papers have focused on rms choosing the

quality of interaction on their platform. These papers study markets with negative network

externalities and do not endogenize the quality choice. Crampes and Haritchabalet (2009)

examine the choice of oering a pay ads regime and no pay ads packages. Peitz and Valletti

(2004) compare the advertising intensity when media operators oer free services and when

the subscription price is positive. Viecens (2006) is an exception because she studies a

setup with endogenous quality dierentiation on two-sided platforms. However, the quality

dierentiation in her model takes a dierent form from ours in that she focuses on the

quality provided by users on one side and not on the quality provided by the platform itself.

4

Therefore, her results do not provide any implications for the platform's dynamic pricing

structure, such as subsidizing one side at the expense of the other. Another quality-related

aspect explored in the context of two-sided markets is the case in which users care about the

quality of the other users with whom they interact. These papers are relevant for matching

markets such as dating sites. Jeon et al. (2016) examine this problem in a platform setting.

Renato and Pavan (2016) consider this problem in a matching setup. Hagiu (2012) studies

a model in which users value the average quality of other users. The setup in these papers,

however, is dierent from that in ours in that we focus on the quality of interaction and not

on the quality of users.

Our analyses are also related to the literature on product dierentiation in multi-sided

markets, in that quality dierentiation is one form of product dierentiation. Smet and

Cayseele (2010) focus on product dierentiation in platform markets, which still diers from

our paper in that they do not account for its consequences for optimal pricing strategies.

The literature has not focused on endogenizing the choice of network quality in multi-

sided markets with positive externalities. Therefore, our results on how multi-sided markets

are combined with quality dierentiation on a platform provide new insights into the related

business.

1.2 Model setup

We model the interaction between buyers and sellers. Economic value is created through

the interaction between these two sides. We consider the case of a single rm providing

a platform for interactions between buyers and sellers. For instance, Amazon acts as a

transaction-based platform for interactions between sellers and buyers. In addition to charg-

ing an access fee to use the platform, the rm can also control the quality of interaction.

Figure(A.1) displays the structure of the two-sided rm with quality dierentiation on the

buyers' side using Amazon as an example. The two types of access quality oered on the

buyers' side are Amazon Basic and PrimeAmazon Prime is the higher-quality access, as

5

it provides premium services including free two-day shipping.

Specically, we model a two-sided monopolist rm that price discriminates on the buyers'
side by oering two dierent types of access quality: qk âˆˆ {ql, qh}, where k denotes the
quality provision, either low or high.
In this model, the users' gain from the platform

comes through interaction between users on two sides: buyers and sellers. Both buyers and

sellers obtain utility from interacting with each other. The quality variable controls the gain

from each such interaction. We assume that both buyers and sellers are heterogeneous with

respect to the per-interaction or usage benet. The usage or per-interaction benets are
bb
i (qk) for the buyers' side (for individual buyer i) and bs
seller j), where the superscript b (s) denotes buyers (sellers). For simplicity, we assume

j for the sellers' side (for individual

that bb

i (qk) = BÎ±b

i qk, where B represents the basic benet for every buyer. This benet

is dependent on the quality of interaction; thus, the monopolist can control the benet by
choosing the quality of platform access qk on the buyers' side. The term Î±b
i denotes the
heterogeneity among buyers; it follows a distribution function F b with support on [0, 1] and

density f b. Additionally, each buyer pays a price for using the platform, which is denoted pb
k

as a usage or per-transaction fee. For a buyer i, the utility function is given by the following:

U b
ik = [bb
â‡” U b
ik = (BÎ±b

i (qk) âˆ’ pb
i qk âˆ’ pb

k]N s, where k âˆˆ {l, h}
k)N s.

(1.1)

(BÎ±b

i qk âˆ’ pb

The buyers' utility is the net benet from each interaction with the other side, i.e.,
k), multiplied by the number of interactions, which is denoted N s.1 From the
utility specication, it can be shown that the buyer with the highest benet is that with
Î±b
i = 1, i.e., BÎ±b
1We assume here that every buyer interacts with every seller and that every seller interacts
with every buyer. We can easily extend this to a model where the number of interactions
is a function of the total number of sellers, such as g(N s). The results are robust to such
extensions.

i = B.

6

We assume that the access fee pb

k is charged per-transaction for simplicity. Indeed, certain

kinds of platforms charge for premium service based on usage: for Uber, a basic service (Uber

X) is cheaper than a premium service (Uber Select) for the same trip. We view that the

per-usage fee for the basic service (low-quality) is zero (i.e., pb

l = 0 in this example) whereas

that for the premium service (high-quality) is the dierence between two riding costs (i.e.,

pb
h > 0). Nevertheless, it is worth noting that many platforms charge an one-time xed

access fee: for instance, Amazon Prime costs either nothing for Basic but $119 per year (or

$12.99 per month) for Prime service. Note that the qualitative results hold under the model

with xed fees, as shown in the Appendix(B). Such one-time xed fee structure can also be

considered as a variation of per-transaction fee by considering the xed fee as continuous:

if Amazon Prime members make hundred transactions per year on average, the average

per-transaction cost is $1.19 (i.e., pb

l = 0 and pb

h = $1.19).

The sellers' side is not aected by quality; therefore, there is no quality component. We

assume that bs

j represents seller j's heterogeneity with respect to the per-
usage benet, which is also distributed by a distribution function F s with support on [0,1]

j = SÎ±s

j, where Î±s

and density f s. For seller j, the utility function is given by the following:

U s
j = (bs
â‡” U s
j = (SÎ±s

j âˆ’ ps)N b
j âˆ’ ps)N b.

(1.2)

j âˆ’
The sellers' utility is the net benet from each interaction with the other side, i.e., SÎ±s
ps, multiplied by the number of interactions, which is denoted N b.2 All sellers are charged

ps per interaction, so the price discrimination is only on the buyers' sidefor example, an

individual seller on Amazon pays $0.99 for each sale. The total number of interactions is

N bN s.3

2In this model, we assume that the sellers' side is aected only by the total number
of buyers, not the type of buyer with which a seller interacts. Thus, the total number of
interactions for each seller is N b, and the total number of interactions by sellers is not aected
by the type of buyer with which they interact.

3Note that the qualitative results still hold under a generalized setup with any function

of N bN s.

7

Next, the cost of the two-sided monopoly rm depends on the quality provided. The total
cost of a transaction is given by c(ql) â‰¥ 0 for a transaction between a low-quality buyer and a
seller and c(qh) â‰¥ 0 for a transaction between a high-quality buyer and a seller. In Amazon
example, such cost dierentiation captures that two-day shipping for Prime members is

costlier than standard shipping for Basic members. We normalize the cost for sellers cs = 0.
The cost function is assumed to be increasing and convex in quality (c(cid:48)(q) > 0 , c(cid:48)(cid:48)(q) > 0).
We analyze the nontrivial case in which qh > ql.

The demand on the buyers' and sellers' sides is represented by Db and Ds, respectively.

In equilibrium, demand will be equal to the number of participants on each side, which

means Db

k = N b

k andDs = N s, where k âˆˆ {l, h}. Given the equilibrium demands, we turn

to the monopolistic platform's problem. The monopoly platform's problem can be written

as follows:

Âˆ If one type of quality is oered:

Î  = [pb + ps âˆ’ c(q)]DbDs.

max

(ps,pb,q)

Âˆ If two types of quality are oered:

max
,pb
h

(ps,pb
l

,ql,qh)

Î  = [pb

l + ps âˆ’ c(ql)]Db

l Ds + [pb

h + ps âˆ’ c(qh)]Db

hDs.

(1.3)

(1.4)

Note that if the monopolist serves only one side of the market, say the buyers' side, its

prot function is given by the following:

Î one-sided monopolist = [pb âˆ’ c(q)]Db,

max
(pb,q)

(1.5)

in the case of one quality. Throughout the paper, we make the following assumptions.

Assumption 1. The cost is increasing and convex in quality: c(cid:48)(q) > 0 , c(cid:48)(cid:48)(q) > 0.

This suggests that it becomes increasingly costly to provide higher-quality service, which

is a standard assumption.

8

Assumption 2. The distribution functions for buyers and sellers are increasing in type:
(f b)(cid:48) â‰¥ 0 and (f s)(cid:48) â‰¥ 0, which implies that f b(1) > 0 and the inverse hazard rate is non-

increasing in user type:

âˆ‚

â‰¤ 0, k = {b, s}.

1âˆ’F k(Î¸)
f k(Î¸)
âˆ‚Î¸

This implies that we consider the case with a positive mass of consumers. Additionally,

it guarantees that as users value the per-usage benet more, there are more platform users

than nonusers.

Assumption 3.

âˆ‚U b(Î±b
i ,pb,q)
âˆ‚pb

< 0,

âˆ‚U b(Î±b
i ,pb,q)
âˆ‚q

> 0,

Assumption 4. The single crossing property holds:

âˆ‚U s(Î±s
âˆ‚ps

i ,ps)

< 0.
âˆ‚2U b(Î±i,q(Î±i))

âˆ‚Î±iâˆ‚q

> 0.

This means that we can always distinguish high-type buyers from low-type buyers based

on their Î±i.

1.3 Model

1.3.1 Model with one quality case

As a benchmark, we begin by considering a model that omits the practice of quality dier-

entiation and then modify the model to allow for its practice in Section 1.3.2. The two-sided

monopolist oers a single quality of access in this case. Buyers have the following utility:

U b
i = (BÎ±b

i q âˆ’ pb)N s,

where q âˆˆ [0, 1] denotes the quality index. The seller has the following utility:

U s
j = (SÎ±s

j âˆ’ ps)N b.

(1.6)

(1.7)

We rst analyze the user side (buyers and sellers) to identify the equilibrium demand.

The equilibrium demand functions are derived from the participation constraint:

(cid:16)
(cid:16)

Db = P rob

Ds = P rob

i â‰¥ 0
U b

j â‰¥ 0
U s

(cid:17) â‡” Db = 1 âˆ’ F b
(cid:17) â‡” Ds = 1 âˆ’ F s

(cid:32)
(cid:18) ps

pb
Bq

(cid:33)
(cid:19)

.

S

.

(1.8)

9

The monopoly problem can be written as follows:

Î  = [pb + ps âˆ’ c(q)]DbDs,

max

(ps,pb,q)

(1.9)

where we have normalized the marginal cost on the seller's side to zero, i.e., cs = 0. We can

solve for the optimal solutions for the buyers' side and sellers' side as follows:

pb âˆ’ c(q) + ps

pb

=

1
Îµb

;

ps âˆ’ c(q) + pb

ps

=

1
Îµs .

(1.10)

Equation (1.10) is similar to the standard Lerner pricing formula, which states that the

markup on price is equal to the inverse of the price elasticity of demand. The price elasticity

of demand for buyers (or sellers) is represented by Îµb (or Îµs). The added term in the case of a

two-sided rm is the extra opportunity cost of increasing the price on the buyers' (or sellers')
side, which is the marginal loss on the sellers' (or buyers') side, equal to ps (or pb âˆ’ c(q)).

Finally, the optimal condition for quality is given as follows:

c(cid:48)(q) =

[pb âˆ’ c(q) + ps]

Db

âˆ‚Db
âˆ‚q

.

(1.11)

The optimal quality equates the marginal cost to the marginal revenue of increasing

quality. The marginal revenue is the change in buyers' demand,4 which is represented by
âˆ‚Db
âˆ‚q

times the per-transaction prot. From the conditions given above, the equilibrium

price-quality structure is given by Theorem 1.3.1.

Theorem 1.3.1. The equilibrium price and quality variables are given by the following equa-

tion:

qc(cid:48)(q)
Î½b

=

pb
Îµb

=

ps
Îµs ,

(1.12)

where Î½b is the quality elasticity of demand on the buyers' side, and Îµb or Îµs is the price

elasticity of buyer or seller side, respectively.

4In the optimal quality equation, the term for the number of sellers (Ds) is canceled out,

as it is present on both sides of the equation.

10

Equation (1.12) is similar to the equilibrium of two-sided market with quality decision by

the platform as in Rochet and Tirole (2003). However, our equilibrium condition still diers

from that in Rochet and Tirole (2003) insofar as we focus on a three-way trade-o which

includes the quality variable as well, instead of a trade o between prices on the two sides.

By placing a main focus on price variables only, Rochet and Tirole (2003) nd that price on

the buyers' side would be lower if sellers are more price elastic because the platform wants

to balance the sizes of two sides. Theorem 1.3.1 further says that the higher quality the

platform oers, the higher price it charges to buyers, while such upward pressure on prices

diminishes as buyers becomes less quality sensitive.

Given that many platforms consider their quality of service as another important decision

variable, our nding broadens the necessary considerations for the platform: it needs to set

optimal prices not only by considering the price structures in terms of the ratio of price

elasticities, as pointed out in Rochet and Tirole (2003), but also by taking quality elasticity

into account, as we additionally nd. For instance, if buyers on Amazon become more

elastic to quality increase, Amazon is able to increase the buyers' side demand substantially

by improving its quality at a minimal level. Accordingly, if Amazon changes its shipping

policy for Prime members to three-day shipping, instead of two-day shipping, and lowers the

fee charged to Prime members (i.e., pb is lower than $119 per year, which is currently set for

two-day shipping benets), this makes Amazon better o because more Basic members will

sign up for Prime members: the demand increasing eect arising from quality improvement

outweighs price decreasing eect due to large quality elasticity.

Allowing the platform to consider how high or low quality its service is oered, or how

quality tiers it provides are dierentiated, opens more doors for optimal business strategies,

insofar as quality choice gives another level of exibility in terms of prot maximization to

the platform. Indeed, as we will show, the platform's optimal pricing decisions depend on

its quality choice.

11

1.3.2 Model with two qualities case

We start by analyzing the users' side (buyers and sellers) to identify the equilibrium demand.

First, the buyers have two choices for accessing the platform. They can join the platform

through either low-quality access or high-quality access. Given the two types of quality, high

and low, the number of participants joining with low-quality access is determined by the

number of buyers who satisfy the following two conditions:

1. (IR constraint) The buyers' utility from low-quality access is greater than zero: P r(U b

l â‰¥

0).

2. (IC constraint) Buyers for whom the utility derived from low-quality access exceeds that

from high-quality access: P r(U b

l â‰¥ U b
h).

The two conditions jointly determine the proportion of low-type buyers.

(cid:18) pb
Db
l = Pr
l â‰¡ Db(pb

h âˆ’ pb
B(qh âˆ’ ql)
h, pb

l

(cid:19)

(cid:19)

(cid:18) pb

h âˆ’ pb
B(qh âˆ’ ql)

l

(cid:19)

,

(cid:18) pb

l
Bql

âˆ’ F b

â‰¥ Î±b

i â‰¥ pb
l
Bql

= F b

(1.13)

where Db

l , qh, ql). Similarly, the number of participants joining the high-quality

service is given by the number of buyers who satisfy the following two conditions:

1. (IR constraint) The buyers' utility from the high-quality good is greater than zero: P r(U b

h â‰¥

0).

2. (IC constraint) Buyers for whom the utility derived from the high-quality good exceeds

that from the low-quality good: P r(U b

h â‰¥ U b
l ).

The IR condition is satised when the IC constraint of the high type and IR constraint

of the low type holds.5 Thus, the proportion of high-type buyers is given by:

Db

h = Pr(U b

â‡” Db

h = Pr

(cid:18)

h â‰¥ U b
l )
h âˆ’ pb
i â‰¥ pb
Î±b
B(qh âˆ’ ql)

l

(cid:19)

= 1 âˆ’ F b

(cid:18) pb

h âˆ’ pb
B(qh âˆ’ ql)

l

(cid:19)

,

(1.14)

5We maintain the standard single crossing condition, which implies that higher types
have greater willingness to pay (WTP) for quality at any price or that consumers may be
ordered by their type.

12

where Db

h â‰¡ Db(pb

h, pb

l , qh, ql). Given the utility function for buyers, the total number of

buyers joining the platform is given by the following:

Db = Pr(U b
â‡” Db = Pr(BÎ±b

l â‰¥ 0) = Pr(bb
i ql â‰¥ pb

l ) = Pr(Î±b

l â‰¥ pb
l )
i â‰¥ pb
l
Bql

(cid:19)

,

(cid:18) pb

l
Bql

) = 1 âˆ’ F b

(1.15)

where Db â‰¡ D(pb

l , ql). Equation (1.15) shows us how the number of participants on the

buyers' side depends only on the price and quality of the low-quality good. Although there

are network externalities in the total utility derived from the platform or the gross transaction

utility, the per-unit transaction demand is not dependent on the participation rate on the

other side.6 This is because the participation constraint (IR constraint) for the high-quality

buyers is slack. This means that the participation of low-quality buyers guarantees the

participation of high-type buyers. In other words, the buyers on the margin of joining the

platform are low-quality buyers.

Next, the total number of sellers who join the platform is given by the following:

Ds = Pr(U s â‰¥ 0) = 1 âˆ’ F S

,

(1.16)

where Ds â‰¡ Ds(ps). Given the total number of buyers and sellers, the equilibrium level of

(cid:18) ps

(cid:19)

S

participation is the following:

Db = D(pb

Db

l = F b

(cid:19)
(cid:18) pb
(cid:18) pb
(cid:19)
(cid:18) pb
l
Bql
(cid:19)
l , qh, ql) = 1 âˆ’ F b

(cid:18) pb
(cid:19)
l , ql) = 1 âˆ’ F b
(cid:18)ps

h âˆ’ pb
B(qh âˆ’ ql)
h, pb

l
Bql
âˆ’ F b

,

l

.

S

Db

h = Db(pb
Ds = Ds(ps) = 1 âˆ’ F s

(cid:19)

h âˆ’ pb
B(qh âˆ’ ql)

l

Given quality dierentiation, the monopoly problem can be written as follows:

max
,pb
h

(ps,pb
l

,ql,qh)

Î  = [pb

l + ps âˆ’ c(ql)]Db

l Ds + [pb

h + ps âˆ’ c(qh)]Db

hDs.

6This setup has one restriction that we need to impose, which is that the proportion of

low-type buyers has to be nonnegative: Db

l â‰¥ 0.

13

(1.17)

(1.18)

The following is the breakdown of the equilibrium prices and quality for buyers and

sellers:7

1.3.2.1 Price of low-quality access on the buyer side

We rst examine the price of low-quality access on the buyer side as follows.

l âˆ’ c(ql) + ps
pb

pb
l

=

1
Îµb

, after using equilibrium value of pb
h,

(1.19)

where the price elasticity of demand for low-quality access is represented by Îµb. Note that

the equilibrium conditions for the price elasticity with respect to the lower price is denoted

as those for the full demand: that is why we have Îµb, not Îµb
l .

1.3.2.2 Price of high-quality access on the buyers' side

The price of high-quality access on the buyers' side can be obtained as follows:

pb
h = pb

l +

[c(qh) âˆ’ c(ql)] +
(cid:124)

(cid:123)(cid:122)

pb
h
Îµb
h

(cid:125)

additional cost plus extra market power

,

(1.20)

where the price elasticity of demand for high-quality access is represented by Îµb

h. The optimal

price for high-quality access is equal to the price for low-quality access and the additional
cost, i.e., c(qh) âˆ’ c(ql), plus an additional markup, i.e.,

.

pb
h
Îµb
h

1.3.2.3 Price for sellers

We now turn our attention to the price for sellers.
ps

l + ps âˆ’ c(ql) =
pb

Îµs âˆ’ pb

h
Îµb
h

Db
h
Db

.

(1.21)

Again, Equation (1.21) is similar to the standard Lerner pricing formula, which states

that the markup on price is equal to the inverse of the price elasticity of demand. The added

term in the case of a two-sided rm is the extra opportunity cost of increasing the price on

7The details are in the Appendix.

14

the sellers' side, which is the marginal loss on the buyers' side, and is equal to the average

per-interaction prot on the buyers' side.8

1.3.2.4 Low-quality service for buyers

Given prices, the monopolistic platform solves the prot maximization problem separately

for the low- and high-quality services for buyers. First, for the low-quality service, the rst

order condition can be derived as follows:

l + ps âˆ’ c(ql) =
pb

l c(cid:48)(ql) + Db
Db
Db

hc(cid:48)(qh)

,

ql
Î½b

(1.22)

where Î½b denotes the quality elasticity of demand for low-quality access. The cost of low

quality service equates the marginal cost to the marginal revenue of increasing quality.

1.3.2.5 High-quality service for buyers

Here, the rst order condition for the high-quality service for buyers is given by the following:

c(cid:48)(qh)

=

qh
Î½b
h

pb
h
Îµb
h

,

(1.23)

where Î½b
h denotes the quality elasticity of demand for high-quality access. The marginal cost
c(cid:48)(qh) should be equal to the marginal revenue, which is the product of increased high-quality

, and the extra markup generated from the increase in high-quality buyers,

Î½b
h
qh

buyers, i.e.,

i.e.,

pb
h
Îµb
h

.

1.4 Equilibrium results

We derive several important implications from the model. Before we examine the diverse

eects of quality dierentiation on the platform's strategies, we rst address that the mo-

nopolistic rm provides higher quality to buyers for each dollar that they pay when it serves

more sides of the market. As the monopolist opens more sides to serve, say from a one-sided

8Note that we have normalized the cost on the seller's side to zero, so cs = 0.

15

rm serving buyers only to a two-sided rm as a platform serving both buyers and sellers,

such openness increases the quality per dollar oered to buyers. Mathematically, the quality
per dollar is denoted as q

pb . This nding is summarized in Proposition 1.4.1.

Proposition 1.4.1. As a rm serves more sides of the market, it provides higher quality

per dollar oered to buyers:

One-sided rm â‰¤ q
pb

q
pb

Two-sided rm.

Intuitively, the monopolist serving multiple sides has more incentive to oer a better

quality price ratio to buyers because it now obtains more prot from the seller's side. If the

two-sided rm oers a better quality price menu to buyers, it attracts more buyers. When

it serves the sellers' side at the same time, more demand from the buyers' side makes sellers

on the other side earn more revenue. The platform can extract those additional revenues

on the sellers' side, which incentivizes the platform to oer a better quality price ratio to

buyers.

In other words, the openness of the platforms from serving one side of users only to serving

both sides increases the quality per dollar, which ultimately attracts more buyers than a

one-sided rm. Both traditional one-sided rm and two-sided platforms oer dierentiated

quality tiers for buyers. When those two types of rms engage in quality dierentiation on

buyers' side, we show that the platform is more willing to oer a better deal to low type

members by giving a high quality per dollar in the basic level than a one-sided rm. For

example, the average quality of services provided by taxi companies, as traditional one-sided

rms, is known to be lower than that provided by ride sharing platforms, such as Uber

and Lyft. As Liu et al. (2019) shows, taxi drivers are more likely to detour with non-local

customers, which results in longer travel time. Such empirical evidence supports our nding

that the platform's basic quality provision is better than a one-sided rm's. The reason

the platform provides a higher quality for basic service is because it is able to exploit such

quality improvement in one side to encourage more participation from the other side, thereby

raising its prots. As in the ride sharing platforms example, when such platforms provide

higher basic quality for riders, which attracts more riders, it ultimately gives a stronger

16

incentive for drivers to join the platforms, thereby permitting them to extract more rents

from drivers: the underlying incentives for the platforms to provide higher service provision

than a one-sided rm arise from this cross-subsidization motive. This nding is summarized

in Corollary 1.4.2.

Corollary 1.4.2. The platform in two- or multi-sided market is more likely to oer higher

quality of basic service or product than a one-sided rm.

Now, we will show how the quality provision decided by the platform aects its opti-

mal use of the interaction between two sides, such as cross-subsidization, in the following

propositions below. Returning to the multi-sided platform case with two qualities provision,

we rst nd that quality dierentiation reduces the other side's price level. The Appendix

contains a proof of this result.

Proposition 1.4.3. Quality dierentiation on the buyers' side decreases the price charged

on the sellers' side relative to the price charged by the platform that oers one quality.

The eect of oering low- and high-quality access to the buyers makes the buyers more

price inelastic because the platform can extract additional surplus from high-quality buyers

without deterring buyers' participation. Now, recall that for multi-sided rms, the optimal

pricing scheme is to subsidize the more elastic side of the market and extract rents from the

other, more inelastic side. As the buyers' side becomes more inelastic, it creates an incentive

for the platform to extract higher rent from the buyers' side and subsidize the sellers' side

by lowering the price charged to sellers. This can be explained by comparing the equilibrium

pricing structures of the two cases:

one-quality case:

two-quality case:

ps
Îµs =
ps
Îµs =

pb
Îµb
pb
l
Îµb

=

+

qc(cid:48)(ql)
Î½b
pb
h
Îµb
h

Db
h
Db

.

l c(cid:48)(ql) + Db
Db
Db

hc(cid:48)(qh)

=

ql
Î½b

hc(cid:48)(qh)
Db
Db

.

+

qh
Î½b
h

(1.24)

Similar to the single-quality case, the platform faces a trade-o of whether to charge a

higher price on the buyers' or sellers' side in cases with two quality types. After quality dif-

ferentiation, the trade-o features an additional markup benet on the buyers' side, namely,

17

the extra margin from high-quality buyers. Thus, the monopolist becomes more ecient in

extracting rent from the buyers' side. Given this, the platform has a higher incentive to

increase the size of seller demand by lowering the price on the sellers' side.

We also nd that quality dierentiation leads to greater prot for the platform, as in

Corollary 1.4.4. The detailed proof is in the Appendix.

Corollary 1.4.4. The platform strictly prefers to price discriminate by quality on the buyers'

side

Corollary 1.4.4 implies that a platform that provides only one type of quality is able

to obtain more prot if it slightly dierentiates product quality. Even a minor quality

improvement with a small price increase can increase the platform's prot as long as it

continues to provide dierentiated products, such as low- and high-quality products. Thus,

quality dierentiation permits the platform to earn more prot by implementing premium

service in addition to basic service, which not only expands the total buyer market size, but

also extracts more rents from relatively high-quality type of buyers.

As for the the platform's optimal quality provision, we nd that the high quality provision

is independent of the sellers' side equilibrium whereas the low quality provision is related

to the sellers' side equilibrium. By comparing Equations (1.22) and (1.23), we derive the

following proposition.

Proposition 1.4.5. The platform's optimal quality level for the high-quality service does not

depend on the sellers' side equilibrium, whereas that for low-quality service increases in the

marginal revenue from the sellers' side.

Proposition 1.4.5 states that the platform has an incentive to increase the quality level of

the basic service (low-quality) because of the positive network eect coming from the sellers'

side, whereas it does not consider the network eect when determining the optimal quality

level of the premium (high-quality) service. The intuition behind this nding relates to total

buyer demand. As in Equation (1.15), the participation constraint for high-quality buyers is

18

slack, which means that the eect of increasing the quality level of the premium service does

not increase the total buyer demand, although it increases the share of high-quality buyers.

The platform will be able to increase the total buyer demand, Db, only by attracting more

low-quality buyers, and one way of doing so is to increase the quality level of the basic (low-

quality) service. More total buyer demand arising from increasing ql boosts the marginal

revenue from the sellers' side due to the positive network eect on the platform. Thus, the

optimal condition for ql is a function of the marginal revenue on the sellers' side.

Moreover, we nd that the additional markup from quality dierentiation is increasing in

the quality gap. Thus, if the platform suciently dierentiates its product line with respect

to quality, it can earn a higher markup. Proposition 1.4.6 summarizes this nding.

Proposition 1.4.6. The platform's incentive to raise the fee charged to high-quality buyers

is increasing in the dierence in quality, as shown in

âˆ‚

(cid:17)

(cid:16)
pb
h/Îµb
h
âˆ‚(qhâˆ’ql) > 0.

Proposition 1.4.6 implies an interesting resultthe platform is more likely to charge a

higher fee to high-quality buyers if it either provides much better service (e.g., one-day rather

than two-day shipping for Amazon Prime members) or maintains the high-quality service at

the same level while reducing the quality of the basic service (e.g., increasing the minimum

order qualied for free shipping for Amazon Basic members). Section 1.5.2 discusses this

point in depth.

In addition, we examine the strategic relationships between the platform's choice variables

to derive further implications. We rst examine whether the platform's decision variables

are strategic complements or substitutes in the game of the one-quality case.

Proposition 1.4.7. The prices on the buyers' side (pb) and the sellers' side (ps) are strategic

substitutes. Whether the price variable (buyer or seller side) and quality variable are strategic

substitutes or complements is ambiguous.

From Proposition 1.4.7, reducing pb is the prot-maximizing response to increasing ps,

and vice versa. This means that the platform does not maximize its prots if it increases

19

both prices. Unlike this case, the strategic relationship between price and quality variables

is ambiguous. The price for the buyer and quality are strategic substitutes if
pb

Bq ) âˆ’ 1

Bq2 [pb + ps âˆ’ c(q)]f(cid:48)( pb

Bq )} + c(cid:48)(q) 1
Bq2{Dsf b( pb
the seller and quality are strategic substitutes if âˆ‚2Î 
If F b and F s both have a uniform distribution, âˆ‚2Î 
âˆ‚pbâˆ‚q
whereas âˆ‚2Î 

âˆ‚psâˆ‚q is negative if c(cid:48)(q) > S(S âˆ’ ps) pb

Bq f b( pb
âˆ‚psâˆ‚q = Ds pb

Bq ) < 0. Similarly, the price for
Bq2 f b( pb

S ) < 0.
is always positive (because f(cid:48) = 0),

Bq )âˆ’c(cid:48)(q) 1

S2 f s( ps

Bq2 . This parametric example suggests that
if the platform increases quality, it charges buyers a higher price but sellers a lower price

âˆ‚2Î 
âˆ‚pbâˆ‚q

=

(provided that the marginal cost of increasing quality is above a certain threshold).

Next, we derive results from the two-qualities case. For simplicity, we assume that F b

and F s both have a uniform distribution, which implies that the prot in the two-qualities

case is given as follows.

Î  =

1 âˆ’ ps
S

h âˆ’ q2
pb

(cid:18)

(cid:19)ï£±ï£²ï£³(cid:16)

h + ps(cid:17)(cid:34)

(cid:35)

(cid:32)

+ 1

+

(cid:33)(cid:16)

1 âˆ’ pb
l
Bql

l âˆ’ pb
pb
B (qh âˆ’ ql)

h

l + ps(cid:17)ï£¼ï£½ï£¾ .

l âˆ’ q2
pb

By the rst order conditions derived in 1.3.2, we nd the following.

(cid:1) + ps(cid:105)

(cid:104)

pb
h + qh

(cid:0)qh âˆ’ 2ql
2(cid:0)qh âˆ’ ql
(cid:1) 2
ï£¹ï£ºï£» .

+ 2ql

.

(1.25)

(1.26)

âˆ‚pb

âˆ‚2Î 
l âˆ‚pb
h
âˆ‚2Î 
âˆ‚pb
l âˆ‚ql
âˆ‚2Î 
âˆ‚pb
l âˆ‚ps
âˆ‚2Î 
âˆ‚pb
hâˆ‚ql

=

2(qh âˆ’ ql)

;

ql

ï£®ï£¯ï£°B +

qh

=

1
2

âˆ‚2Î 
âˆ‚pb
l âˆ‚qh
h âˆ’ q2
pb

(cid:16)
(cid:0)qh âˆ’ ql

= âˆ’ ql
h + ps(cid:17)
(cid:1)2

=

qh

2(qh âˆ’ ql)

âˆ’ 1;

= âˆ’ B
2

;

âˆ‚2Î 
hâˆ‚ps

âˆ‚pb

âˆ‚2Î 
hâˆ‚qh

âˆ‚pb
= âˆ’ 1
2

.

=

B
2

+ qh.

From Equation (1.26), we identify whether two choice variables are strategic substitutes

or complements for the platform. The following proposition summarizes the ndings.

Proposition 1.4.8. The price for high-quality access for buyers (pb

ment for the price for low-quality access for buyers (pb

h) is a strategic comple-
l ) and high-quality service (qh), whereas

20

it is a strategic substitute for the price for sellers (ps) and low-quality service (ql). Whether

the price for low-quality access for buyers is a strategic substitute or complement for quality

measures and the price for the seller is ambiguous.

Note that if the high-quality service is much better than the low-quality service,9 pb

l is
a strategic substitute for qh but a strategic complement for ps. This implies that if the

platform improves the high-quality service (qh increases), the prot-maximizing response is

to increase the price for high-quality access for buyers while decreasing that for low-quality

access. In addition, the best response to an increase in the price for high-quality access for

buyers is to decrease the price for sellers. However, if the platform increases the price for

low-quality access for buyers, the optimal response is to increase the price for sellers provided

that the quality dierence is large enough.

Finally, we also conduct comparative statistics to determine how changes in the exogenous

parameters aect the equilibrium outcomes. In particular, we are interested in how B, which

represents the basic benet from the quality dimension for every buyer, aects consumer

demand in the two qualities case. Proposition 1.4.9 summarizes the result.

Proposition 1.4.9. Consumer demand for the high-quality service always increases in the

basic benet from better quality (B). Whether consumer demand for low-quality service

increases in B is ambiguous.

In other words,

âˆ‚Db
âˆ‚B is always positive (where Db
h

âˆ‚Db
âˆ‚B is positive only if a certain condition is met (where Db
l
l
hâˆ’pb
pb
B(qhâˆ’ql)

âˆ‚Db
l
âˆ‚B is positive if

Specically,

hâˆ’pb
pb
qhâˆ’ql

l

f b

pb
l
ql

f b

pb
l
Bql

>

(cid:32)

(cid:33)

h is given by Equation (1.14)), whereas

(cid:32)

(cid:33)

is given by Equation (1.13)).

l

and negative otherwise.

That is, when the basic benet from the quality dimension increases, it can reduce the

buyers' demand for low-quality access if the price-quality ratio for the quality dierence

(between high- and low-quality) is greater than that for low-quality service. As in Equation

9We can guarantee that

âˆ‚2Î 
âˆ‚pb
âˆ‚qh
l

< 0 and âˆ‚2Î 

âˆ‚pb
l

âˆ‚ps > 0 if qh > 2ql.

21

(1.13), buyers in the middle range of willingness to pay (i.e., who are willing to pay for low-

quality service but not for higher priced high-quality service) demand low-quality service

access. As B increases, we observe two simultaneous outcomes: (1) more buyers who were

(cid:32)

(cid:33)

pb
l
ql

f b

pb
l
Bql

(cid:32)

(cid:33)

not in the market join the low-quality service (measured by

), and (2) more

buyers who used to use the low-quality service switch to high-quality service (measured by
hâˆ’pb
pb
qhâˆ’ql
buyers for the low-quality service. Per Propositions 1.4.7, 1.4.8, and 1.4.9, we can see that

). If the latter eect is larger than the rst, a greater B leads to fewer

f b

hâˆ’pb
pb
B(qhâˆ’ql)

l

l

considering quality dierentiation for the platform aects its strategic choices on the price

and demand structure. If the platform overlooks such dynamic relationship between quality

choice and optimal pricing, it could lose a potential opportunity for better business strategies

using three-way interaction among quality, buyers', and sellers' side.

1.5 Discussion

Quality dierentiation is one example of product dierentiation, which makes buyers'

demand less elastic. In other words, the platform can strategically use quality dierentiation

to maximize its prot. In Section 1.5, we discuss several business implications based on our

theoretical predictions.

1.5.1 Quality dierentiation and optimal pricing strategy

As in the model, if the platform provides dierent quality choices to buyers, it faces more

inelastic demand from them, which allows the platform to charge a lower price to users on

the other side, namely sellers. Specically, the platform can extract a higher margin from

buyers by providing multiple dierent qualities, which makes their demand inelastic. Given

more inelastic demand from buyers, the platform nds it optimal to increase the number of

sellers, as this increases the utility of buyers from interactions through the platform. The

platform can increase the number of sellers by decreasing the price on the sellers' side, which

is one way of subsidizing sellers.

22

There are several instances in which the platform might nd it protable to subsidize

sellers by exploiting buyers' inelastic demand (arising from quality dierentiation). One

such example is a ride sharing service that connects riders and drivers such as Uber. The

model ndings predict that if Uber provides two or multiple dierent quality of services, for

example a premium service with a higher usage fee and a basic service with a lower usage

fee, which makes consumers' demand inelastic (due to quality dierentiation), the platform

(Uber) is able to maximize its prot by charging a lower driver's pay rate (a price charged to

drivers). In particular, if there is competitive pressure in attracting drivers, the platform can

strategically exploit the quality dierentiation on the riders' side to subsidize the drivers'

side, which helps it overcome intense competition on the drivers' side. Suppose that the

competition for drivers is so intense, after Lyft and other competitors start operating, that

it is dicult for the company to have sucient number of drivers during the peak time. The

platform wants to attract more drivers because widespread availability of drivers is crucial

for expanding its business. Here, if the Uber charges much lower fees for drivers, more drivers

will be willing to work for the platform. Our theory predicts that the platform will oer

lower fees in equilibrium by dierentiating on product quality on the riders' side, which leads

to a higher driver participation rate.

1.5.2 Quality dierentiation and extra markup

The model predicts that the platform will earn more extra markup if it widens the quality

gap. This suggests that the platform can adopt either of two strategies: it can improve the

high-quality product while maintaining the low-quality product at the same level, or it can

reduce the quality of the basic product while maintaining the high-quality product at the

same level. Both strategies lead to more quality dierentiation, which results in more extra

markup being charged to the high-quality buyers per Proposition 1.4.6.

Most incumbent platforms can use their pre-existing resources to develop a much higher

quality product, thereby generating more quality dierentiation (higher qh). However, be-

23

cause entrants or small platforms lack substantial resources, they might not be able to make

large investments to produce better-quality products. What those platforms can do is to

provide a very basic quality product at a lower price (or zero price) and a slightly higher

quality product at a higher price (lower ql). In other words, by lowering the basic product

quality at an almost zero price (or free of charge), the platforms can enjoy more extra markup

even without having very high product quality. For example, there are two ways for Amazon

to widen the quality gap. It could make its high-quality service more attractive by providing

one-day shipping for Prime members. Alternatively, it could oer more dierentiated ser-

vices by maintaining the two-day shipping policy for Prime members while reducing some

of the benets for basic members, for example, increasing the order minimum to qualify for

free shipping from $25 to $30.

1.6 Conclusion

This paper analyzes a generalized version of quality dierentiation by a monopolist in

a multi-sided market. The main focus of this paper's analysis is the eect of buyers' side

quality dierentiation on the optimal pricing strategy for the platform. We rst showed

that quality dierentiation on the buyers' side will decrease the price charged to sellers.

To understand the intuition behind this result, recall that the pricing structure for multi-

sided rms depends on the relative elasticity of the two sides. Quality dierentiation on the

buyers' side can increase the surplus extracted from buyers, which means that the buyers'

side becomes relatively inelastic. The platform can exploit this inelastic demand among

buyers to extract more prot from that side while discounting the sellers' side by decreasing

the price charged to them. This nding suggests how a platform can subsidize the more

elastic side by introducing quality dierentiation on the other side.

We also found that quality dierentiation, which leads to a lower price on the sellers'

side, ultimately increases the platform's prot. Thus, the platform can strategically use

quality dierentiation to raise its prots. Another strategic variable that the platform can

24

use is the extent to which the two qualities dierthe platform can earn extra markup from

relatively high-valuation buyers by widening the quality gap. Given that the driving force of

this greater markup is the quality gap, rather than how high the quality of the better-quality

product should be, we derive relevant business implications, especially for small platforms or

entrants without substantial resources: if they are unable to make a higher quality product

due to their limited resources, they can lower their basic product quality, which would lead

to similar consequences in terms of the quality gap.

Overall, our ndings suggest one plausible business strategy for the platform: how quality

dierentiation implemented by the platform can be used as an optimal business strategy. It

would be interesting for future research to investigate how competition in the platform market

alters our results. In this regard, a model with an asymmetric setup in the competitive market

structure faced by the platform could determine the extent to which such asymmetry aects

the platform's optimal business strategy.

25

CHAPTER 2

DYNAMIC GAME WITH MULTIDIMENSIONAL TYPE: THE CASE OF

CARBON-CREDIT MARKETâ€ 

2.1

Introduction

One of the most popular market-based solutions for limiting greenhouse gas emissions

(GHG) is to use Cap-and-Trade schemes. Under the scheme, rms use emission credits to

pay for GHG emissions. The scheme's governing body begins by setting a cap on allowable

emissions. It then distributes or auctions o emissions allowances that total the cap. The

rms that do not have enough allowances to cover their emissions must either make reductions

or buy another rm's spare credits. Members with extra allowances can sell them or bank

them for future use.

A successful cap-and-trade scheme relies on a strict but feasible cap that decreases emis-

sions over time. If the cap is set too high, an excess of emissions will enter the atmosphere

and the scheme will have no eect on the environment. A high cap can also drive down the

value of allowances, causing losses in rms that have reduced their emissions and banked

credits. If the cap is set too low, allowances are scarce and overpriced. The previous liter-

ature has looked at the optimal rate of decrease in supply for the carbon credit market by

accommodating the eect on carbon credits being optimally valued.

In this paper, we extend this analysis and look at the optimal per period supply when

there is an uncertainty of future demand in the market. We nd that in the dynamic setting

introduction of demand uncertainty changes the optimal supply rate. In particular, we nd

that the expectation about future supply and demand changes the equilibrium price and

contributes to price uctuation in the market.

Several papers have shown evidence of price uncertainty in the carbon permits market.
â€ This work is joint with Thomas D.Jeitschko and is unpublished.

26

The demand of carbon permits is uncertain and uctuates due to multiple reasons such

as weather conditions, high correlation with electric power generation, nancial trading in

carbon permits. For example, in the EU emissions trading system (EU ETS) 1 a dry summer

in July 2005 in Southwestern Europe led to under-utilization of hydroelectric plants and a

lack of cooling water for nuclear power plants, which therefore increased the demand for

carbon permits. Thus, it is integral to accommodate the uncertainty in the demand in the

model so as to make it more realistic.

The results show that a decrease in future supply increases the current price. This is

because increasing the future supply makes it easier for the rm to buy the carbon credit

in the future which makes them less willing to pay a high price in the current period. On

the other hand, we nd that an increase in expected future demand leads to a decrease in

the current price. The intuition behind a negative eect of an increase in expected future

demand on the current price is that shift in future demand curve makes the rm demand

more in the current period as the carbon credits can be stored and used in future periods.

The outward shift of the demand curve consequently increases the equilibrium price. Using

these results the paper suggests that the future supply should be a function of expected

future demand. In particular, the government should decrease the supply every time the

expected future demand decreases and visa-versa. We show that such a policy can decrease

the price uctuation in the market.

The cap and trade scheme uses uniform price auction to distribute the carbon credits.

The allowance is auctioned over multiple periods. To study this market, we look at a dynamic

repeated uniform price auction setting with stochastic supply and demand. The dynamic

setting in the single good case can be of two kind. The rst keeps the set of buyers xed and

have their types change over a time as a function of allocations selected in earlier periods (for

example Athey and Segal (2007), Eso and Szentes (2007) and Akan et al. (2009). The second

is where a nite number of goods are sold to buyers arriving over time. This paper looks

1as noted by Thomson Rueters Point Carbon (2006)

27

at the second type of dynamic setting which we refer to as the changing buyer case2. The

focus here is on the sale of greenhouse gas emission, as used in the European Union, the US

(RGGI, NOx, California Cap and Trade), Switzerland and other countries. The auctioned

goods are close substitutes (assumed to be homogeneous) with the bidder having dierent

degrees of urgency to get the permits. This is modeled in the paper as bidders having a

dierent number of periods of active demand. To accommodate the dierent degrees of

urgency among the buyers, the model allows the buyer's type to dier in terms of their

demand duration. Thus, buyers have multi-dimensional private information, namely about

their valuation and demand duration. The buyers have single-unit demand 3

In this setting, we derive the equilibrium bid and show that the bid is truthful and ex-

post incentive-compatible. The optimal bidding strategy indicates that the buyers take their

future period payos into consideration when deciding the value of winning in the current

period. Thus, the outside option value is endogenously determined through the expected

payo of auctions in future periods. Apart from the carbon credit auction, this model can

be applied to any market where the bidders dier in terms of the value of the object as well

as the urgency for getting good. This includes markets such as the sale of spectrum (used

for a wireless network), auction of house or electricity markets with dierent delivery dates.

This paper is related to the literature on price and supply restriction imposed in the

cap and trade auction. The early work on this was done by Robert & Spence (1976) and

Weitzman (1978) . They considered price ceiling and oor under demand and supply un-

certainty in the static model. In the dynamic setting papers such as Philibert (2008) and

Burtraw et al (2010) look at price control with quantity regulation. Webster et al. (2010)

compare price ceiling to a policy that allocates allowance on emission targets. This paper

2The term changing buyer type is taken from Dynamic Mechanism design literature review

by Rakesh V. Vohra

3The objective of this paper is to focus on the question of auction periodicity, hence we
abstracting away from multi-unit demand. Another way of thinking of this is the simplica-
tion of each bidder having the homogenous number of units demands, thus the item is sold
is as a bundle of a specic amount.

28

contributes to this literature by looking at how instead of a xed supply rate or price ceiling,

we can implement a dynamic control for price uctuation by changing the supply rate with

the change in demand uncertainty and thereby aecting the equilibrium price.

This paper is also related to the literature on dynamic mechanism design with multidi-

mensional private information. It extends this literature to look at the optimal auction for

an auctioneer with an unknown number of buyers and sellers in each period. The literature

on dynamic mechanism design is vast(see the survey by Bergemann and Said for details).

The most relevant papers for current work are Pai and Vohra (2013), where they look at the

optimal auction for a single seller selling multiple units to stochastically arriving bidders.

These authors also allow for private information about the arrival time. The main dierence

in this paper is that we are looking at auctioneer's problem of ecient mechanism with

stochastically arriving sellers. Also Pai and Vohra paper assumes perfectly patient bidders

with a focus on optimal auction for maximizing seller revenue. Another related paper is

Mierendor, K., (2014):, where the author looks at a revenue-maximizing mechanism for a

seller selling a single good in a dynamic environment with buyers having multidimensional

private information. The main dierence with current work is that we are looking at an e-

cient mechanism in a dynamic environment with multiple and stochastically arriving sellers.

This paper is also related to the literature on Ecient Sequential Auction with Impatient

Buyers. The most relevant paper is Gershkov and Moldovanu (2010). They examine the

allocation of a set of durable goods to the dynamic buyer population. In their setting, ob-

jects are durable whereas in this paper objects are non-durable and the total supply in every

period is stochastic.

In the current model, we are analyzing sequential auctions with stochastic supply and

demand. The early work on stochastic auctions includes Mcafee, R. P., and McMillan, J.

(1987), Konrad, M.(2013), looks at an ecient dynamic allocation of a single object when

bidder arrival is stochastic. On the supply side, Thomas.J (1999) looks at stochastic supply

in a sequential auction.

29

The remainder of the paper is structured as follows : Section(2.2) and section(2.3) de-

scribe the generalized model setup and derive the equilibrium bid. Section(2.4) applies the

model to emission trade market and section(2.5) concludes.

2.2 Model setup

Consider a sequential auction in innite, discrete time period model, t  {âˆ’âˆž.., 0, ..âˆž}.

In each period multiple units of a homogeneous good are auctioned. In the case of a spectrum

market, we look at the trading auction where the secondary users bid to buy spectrum units,

sold by primary users. Buyers with single unit demand arrive over time. Buyer i has a

valuation vi, for one unit of the good, which is an i.i.d random draw from the distribution

F (v) on [v, Â¯v]. The demand for each buyer lasts for multiple periods. In particular, each

buyer has an arrival time ai and a demand duration ki. This means that the demand
lasts for all t  {ai, ....(ai + ki)}. Thus apart from heterogeneous valuation, a buyer also
dier in terms of their demand lifetime. The type of a buyer is a triplet consisting of his

valuation, arrival time and demand duration, x = (vi, ai, ki); and the type space is given as
X = [v, Â¯v] Ã— [a, Â¯a] Ã— [1, Â¯k]. Buyer's type is an i.i.d random draw from a commonly known
Fi = Fv Ã— Fa Ã— Fk over X. We assume that the three components of

distribution (cid:81)

i={v,a,k}

the type space, i.e. valuation, arrival time and demand duration are independent.

In our model, in period t, an agent of type xi = (vi, ai, ki) and makes a payment zt derives

the following instantaneous utility:

ï£±ï£´ï£´ï£²ï£´ï£´ï£³(vi âˆ’ zt),

0,

Ui,t(vi, ki) =

if he wins the auction

otherwise

The buyers arrival rate is stochastic; In any period t, nt new buyers arrive. The arrival rate,
nt, is an i.i.d random draw from the distribution F n on {1, ...., Â¯n}. The distribution of arrival
rate of buyers in each period is common knowledge, however the exact number of per period

new buyers is unknown. Apart from this uncertainty in the market leads to uncertainty of

the current bidders' likelihood of surviving in the future. To capture this, we dene Ï„ as

30

the probability of surviving in the next period for each active buyer. The total number of
potential buyers is given by N  IN such that N â‰¥ Â¯nÂ¯k.

Sellers, with single period supply, arrive over time. Seller j has a per unit value vj ,
which is assumed to be the same for all sellers and is normalized to zero, vj = 0 âˆ€ j  S4.

We allow the sellers to have more than one unit to sell but assume that each sale request is

sent to the auctioneer independently5.

Similar to the buyers, the sellers also arrive stochastically. However unlike the buyers,

sellers are active only for one period. The units auctioned by the sellers are identical, and

these units remain valid only for one period. Each unit is considered for sale independent

of their provider, as sellers oer homogeneous goods.

In any period t, mt new units are
available in the market. It is assumed that for each tth period, mt is an i.i.d random draw
from distribution F m on [m, Â¯m]. The total number of potential seller is given by M  IN
such that M â‰¥ Â¯m. The distribution of units available in each period and seller's valuation
is common knowledge however the exact per period supply is unknown ex ante. 6.

Information Structure: The distribution of buyer's type space, the distribution of avail-

able per period units, and seller's valuation for the object are assumed to be common knowl-

edge. The buyer has private knowledge about his own type, composed of his valuation and

demand lifetime. He does not have knowledge of exact number of other buyers and sellers,

their types and reports , or previous allocation decision by the auction. The arrival rate of

buyers and the number of items in each period are stochastic. Thus the exact per period

demand and supply is unknown for future periods.

The timeline of the game is as follows. In each period, t, the active buyers report a bid

for a single unit to the auctioneer. The strategy set of the buyers in period t compromises of

4This assumption has been made so that we can concentrate on the buyers side in this

paper

5Thus this is equivalent to each seller selling single unit item
6In this model we consider the case of multiple sellers. The role of a seller is limited upto
to reporting the available unit, and thereby accepting to sell at a non negative price as the
auctioneer knows seller's valuation.

31

their bid in period t, i.e. bt(vi, ri), where vi is the value and ri is the numebr of active periods
left at tth period. The active buyers comprises of new buyers arriving in the current period

and the existing buyers which are still active and have unfullled demand for the item. The

active sellers report the items available for sale. After receiving bids from the active buyers

and an estimate of number of items to be sold from the active sellers, the auctioneer holds

an auction to decide the number of items to be traded, denoted as St and the clearing price,

denoted as zt.

We now give a detail description of the auction mechanism. The auctioneer holds a series

of static uniform price auctions. Bidders simultaneously submit sealed bids for the item and

sellers report the units available for sale. The auctioneer calculates the total number of items

to be traded, St, by equating the demand and supply in the current period. If the number of

items traded in the market are equal to the available units i.e. St = mt, auctioneer accepts

the sale request of all the active sellers. On the other hand if the number of items traded

is less than available units i.e. St < mt, auctioneer accepts the sale request of randomly

selected St number of sellers. The buyers are ranked in ascending order of their bids and

St highest bids are accepted to allocate the item. Each winning bidder gets one unit of the

item and pays a price equal to the highest losing bid.

To formally determine the clearing price, we denote the order statistic for bid as b(l),

which represents the lth highest order statistic of the bidding values. Note that St highest

bids are selected in period t to trade. So the bid of a buyer is accepted for trade if bt(vi, ri) 
{b(1), b(2)............b(St)}. Every winning bidder pays the bid of the highest losing bidder, i.e.
zt = b(St+1). The price paid by all buyer in every period is equal to the (St + 1)th highest

order statistic of that period. All buyers who bid above zt win the item at price zt and each

accepted sale request yields a payment equal to zt for sellers.

We solve the repeated period problem using Subgame Perfect Bayesian Nash equilibrium

as the solution concept. In each period the buyers reports a bid for single unit of the item

32

to the auctioneer. The equilibrium denes a set of strategy and beliefs, such that given the

opponents' strategies, the expected payo of every buyer is maximized in each period.

Observation 2.2.1. Notice that, although the arrival time ai is private knowledge, the buyer

does not have any incentive to lie about the arrival period. This is due to the independence

of buyer's payo function and the arrival time ai. Thus the relevant private information of

buyer i is (vi, ki).

2.3 Symmetric equilibrium

A bidding strategy for a bidder i consists of a set of bid functions dened as bi =
{b1(vi, ri), b2(vi, ri), ....bâˆž(vi, ri)} , where bt(vi, ri) denotes the bid in the tth period given
that the bidder's value is vi and demand will last for ri periods after period t.7 Note that

the bidders arrive and leave at dierent time periods , thus the subscript on period tri, i.e
ri, denotes the number of active periods for bidder i in period tri.

As this is a dynamic setting, we start by dening the state variables. Let Ïƒt = {mt, nt}
denotes the auction "state" consisting of the number of bidder i at time period t, and it is

represented with a k-dimensional matrix n = [nÂ¯k, ..., n1] and m represents the number of

sale units available in that period. Here, nx denotes the number of bidders with x periods

of active demand. Let N (k, n) =

nx denote the set of active buyers in current period.

Â¯k(cid:80)

x=0

Let us now consider the maximizing objective function for the buyer which is the total

lifetime payo after realization of demand and supply , denoted by V (vi, ri|Ïƒ)

(cid:26)

(cid:18)

P r

b(vi, ri) > b

(mt)
j(cid:54)=i

E

vi âˆ’ b(mt)(vi, ri)

V (vi, ri|Ïƒt) â‰¡

(cid:18)

+ P r

b(vi, ri) > b

(mt)
j(cid:54)=i

(cid:90)

(cid:19)

(cid:12)(cid:12)(cid:12)(cid:12)Ïƒt

Ï„

Ïƒt+1

(cid:20)

(cid:19)

(cid:12)(cid:12)(cid:12)(cid:12)Ïƒt

V (vi, ri âˆ’ 1|Ïƒt+1)

(cid:12)(cid:12)(cid:12)(cid:12)b(vi, ri) > b

(mt)
j(cid:54)=i

(cid:21)

(2.1)

(cid:27)

The rst term represents the expected payo from auction in the rst period and the second

term is the future period payo integrated over possible arrival rate Ïƒt+1. The future payo

7All this assumes, of course, that he particular bidder has not already won an object so

is still active in period t

33

is equal to the total payo for value vi and ri âˆ’ 1 pd left. Now we dene the total payo
e-ante, i.e., before the realization of the state variable 'Ïƒt'

(cid:20)
vi âˆ’ b(mt)(vi, ri)

E

(cid:12)(cid:12)(cid:12)(cid:12)b(vi, ri) > b

(cid:21)

(mt)
j(cid:54)=i

(2.2)

(cid:90)

(cid:26)

W (vi, ri) â‰¡
(cid:18)

Ïƒtri

+ P r

b(vi, ri) > b

(mt)
j(cid:54)=i

Ï„

P r

b(vi, ri) > b

(cid:12)(cid:12)(cid:12)(cid:12)Ïƒtri
(cid:19)
(cid:27)

(mt)
j(cid:54)=i

(cid:19)
W (vi, ri âˆ’ 1)

(cid:18)
(cid:12)(cid:12)(cid:12)(cid:12)Ïƒtri +1
(cid:26)
(cid:18)

Using the total payo ex-ante, we can rewrite the ex-post per period payo in eqn(2.1) as

) â‰¡

(cid:20)
V (vi, ri|Ïƒtri
(mt)
j(cid:54)=i
b(vi, ri) > b
vi âˆ’ W (vi, ri âˆ’ 1) âˆ’ b(mt)(vi, ri)
E

P r

(cid:21)

(cid:27)
+ W (vi, ri âˆ’ 1)

(2.3)

(mt)
j(cid:54)=i

(cid:12)(cid:12)(cid:12)(cid:12)Ïƒtri
(cid:19)
(cid:12)(cid:12)(cid:12)(cid:12)b(vi, ri) > b

Pseudo type for each period:

Observe that we have the probability of winning in each period dened in terms of the bid,

in order to dene that in term of the bidder's type, we will dene a per period pseudo type

for each period t and bidder i as Î·i,t. Let Gt(.) be the distribution for Î·t,i. The pseudo type

is dened as follows :

(cid:82)

ï£±ï£´ï£´ï£´ï£²ï£´ï£´ï£´ï£³vi âˆ’ ki(cid:80)

0,

l=t+1

Î·i,t =

(cid:18)
Î·i,l âˆ’ E[(Î·
(Î·i,l|Ïƒl)

Ï„ lâˆ’tG

(ml)
l

Ïƒl

(cid:19)

(ml)
j,l

)j(cid:54)=i|Î·i,l > Î·

(ml)
j,l

]

,

if t{ai, ki}

otherwise

The pseudo type helps simplify the payo function as well as the bid. Following lemma

shows that relation between pseudo type and the payo function

Lemma 2.3.1. The equilibrium bid is increasing in pseudo type and the total payo in period

t can be rewritten as follows :

(cid:90)

(cid:18)
(cid:18)
(Î·i,t|Ïƒt)
Î·i,t âˆ’ E[(Î·
(Î·i,t|Ïƒt)

Î·i,t âˆ’ E[(Î·

(mt)
j,t

(mt)
Ï„ G
t

W (Î·i,t) =

Ïƒt
(mt)
V (Î·i,t) = G
t

(mt)
j,t

(cid:19)
)j(cid:54)=i|Î·i,t > Î·

(mt)
j,t

)j(cid:54)=i|Î·i,t > Î·

(mt)
j,t

(cid:19)

]

+ W (Î·i,t+1)

(2.4)

]

+ W (Î·i,t+1)

(2.5)

We can use this equation in order to determine equilibrium bid functions, as demonstrated

in the following result. We focus on symmetric Bayesian Nash equilibrium.

34

Theorem 2.3.2. If auction in every period is vickery auction then the equilibrium bidding

strategy in the dynamic auction game is given as:

bi(Î·i,t) = vi âˆ’ W (Î·i,t+1)

or

bi(Î·i,t) = Î·i,t

35

2.4 Applications and comparative statics : emission market

Here we look at the case of xed decreasing supply with an uncertainty of demand in

each period. In the emission auction markets, the goal is to reduce the supply of emission

credits, thereby reducing the total emission. This is usually done by deciding in advance the

decreasing rate of supply of emission credits. The general model, introduced in the previous

section, is modied in terms of supply rate to t the market better.

Instead of having

stochastic supply, for this section we have the supply decreasing at a constant rate.8 Let

the rate at which supply decreases be denoted by Î»  [0, 1]. This changes the state variable

Ïƒt = (Î»mt, nt). Additionally, the probability of active demand Ï„ will aect the future as well

as the current value for the object. This is because any change in demand for credit in the

future aects current demand as well. Everything else is the same. In this section, we look

at comparative statics in terms of how Î» and Ï„ . This will give insight into how the supply

rate and the demand uncertainty aect bidder behavior. The rst two propositions look at

how the supply rate, i.e. Î» aects the market variables. This is done to understand how the

supply rate can be strategically used to stabilize the price.

Proposition 2.4.1. The bidder's expected payo is increasing with increase in the future

supply rate.

d(V (Î·i,t|Ïƒt))

d(Î»)

â‰¥ 0

(2.6)

The above proposition shows that the expected payo in each period t is positively

correlated with the future periods' supply rate. The intuition behind the result is that as

the supply in the future increases, the chances of acquiring credits in the future increase,

thereby increasing the total expected payo from current and future periods. However, our

purpose is to understand how the supply rate can be used to control the price. In the next

8To focus on the eect of supply and demand dynamics we will be abstracting away from

the multi-unit demand nature of this market

36

proposition, we shift our focus to see how the supply rate aects the bid as well as the

equilibrium price.

Proposition 2.4.2. The bid and price are decreasing with increase in the future supply rate.

d(b(Î·i,t))

â‰¤ 0

d(Î»)
dE(Pt)
d(Î»)

â‰¤ 0

The price and the bid in the current period are negatively aected by an increase in the

supply rate. Thus, even though the total payo is increasing for the bidder, they will still

decrease the price in the current period. This is because increasing the supply rate increases

the payo in the future periods, which also increases the opportunity cost of winning in

the current period (tth period). Therefore, bid and thereby price is negatively correlated

to supply rate. We have now established the eect of the supply rate on market variables.

However, in order to see how the supply rate can stabilize the eect of demand uncertainty

on the price we need to look at the eect of demand uncertainty as well. We now look at

how do market variables react to demand uncertainty.

Proposition 2.4.3. The bidder's expected payo is decreasing in the uncertainty of demand.

d(V (Î·i,t|Ïƒt))
d(1 âˆ’ Ï„ )

â‰¤ 0

(2.7)

Recall that Ï„ represents the probability that bidders will have demand for the credits in
the next period.9 Thus, 1 âˆ’ Ï„ represents the probability of demand reduction in the next
period. For this study, we take 1 âˆ’ Ï„ as a proxy for expectations about future demand

uncertainty. The proposition(2.4.3) shows that the payo is negatively aected when the

demand uncertainty increases in the market.

9this is assuming bidder has active demand next period and is between his entry and

deadline period.

37

Proposition 2.4.4. The current bid and price increase with decrease in expected demand

uncertainty.

d(b(Î·i,t))
d(1 âˆ’ Ï„ )
dE(Pt)
d(1 âˆ’ Ï„ )

â‰¤ 0

â‰¤ 0

This shows that increase(decrease) in expected demand increases(decrease) the price.

Now that we have established the individual eects of supply rate and uncertainty on price,

we will look at how to utilize it to stabilize the price path. From propositions 2.4.2 and 2.4.4

we see that Î» and Ï„ have an opposite eect on the price. Thus, an optimal strategy will

negatively correlate the future supply rate and expected demand. Let the proposed supply

rate be dened as follows :

Î»new = f (Ï„ )
f(cid:48)(Ï„ ) â‰¥ 0

(2.8)

(2.9)

The new supply rate changes with the change in demand. The next proposition proves that

the change in price due to change in demand uncertainty would be less in case of the new

supply rate.

Proposition 2.4.5. If the demand in the future changes by x then, the change in price would

be lower in case of the new supply rate.

dE(Pt|Î»new)

d(Ï„ )

dE(Pt|Î»)

d(Ï„ )

<

2.4.1 Comparison with price control strategies

The literature provides a xed price ceiling and reserve price as a solution to control the price

uctuation. However, even after the price caps, there can be high variance in price within

the bounded prices (as seen in gure(C.1). In this paper, we instead present a more exible

strategy, where the supply rate would be linked to future uncertainties and thereby adjust

38

higher for high-level uncertainty and low for low-level uncertainty. Let's take an example of

the RGGI carbon credit market. Figure(C.2) shows the price path with major events during

the timeline. We can see that price uctuates with changes in the demand such as in 2106 we

see a big fall in price as the demand uncertainty increased as in 2016 supreme court halted

the Environmental Protection Agency's Clean Power Plan. In our model, the supply rate

would have temporarily adjusted to decrease the fall.

This work shows that linking the supply rate to changing market factors will stabilize

the price. Further, empirical analysis is required to understand the implementation of such

a policy.

39

2.5 Conclusion

This paper analyzes a dynamic auction setting with the stochastic arrival of bidders and

multidimensional bidder's type. We derive the BNE bid in the repeated game. We apply

this setting to the Cap and Trade scheme for the auction of carbon credits. The setup is

used to understand the uctuation of price in this market. Price uncertainty is a major

concern in the Cap and Trade market. The reason for that is that for rms to change to

more renewable energy they would need a more stable short term supply of carbon credits.

The paper identies two factors aecting price uctuation, namely the rate of supply and

uncertainty in future demand. One of them is not in control of the auctioneer (or the

government in this case), that is the future uncertainty in demand. However, the rate of

supply is decided by the government. The paper shows that the eect of the rate of supply

on price is opposite to the eect of uncertainty in future demand. Thus, one policy suggested

in this paper is that the government should correlate the supply rate with the uncertainty

in the market. Specically, they should decrease the future supply rate when the future

demand uncertainty in the market increases. The results show that this policy would lead

to a more stable price over time. The paper also analyzes the general model setup, which

can be applied to other markets as well. One key feature of the paper is that we look at the

multi-dimensional type of bidders. Thus, this model can t markets where the bidders dier

in more ways than just the value for the object. Future extensions of this paper can look

at extending this setting to multi-unit demand and looking at double auction setting which

can further generalize the auction design.

This work shows that linking the supply rate to changing market factors will stabilize

the price. Further, empirical analysis is required to understand the implementation of such

a policy.

40

CHAPTER 3

REVENUE-MAXIMIZING NUMBER OF ADS PER PAGE IN THE

PRESENCE OF MARKET EXTERNALITIESâ€ 

3.1

Introduction

Online advertising is a new but rapidly growing market. Back in 1998, when Google was

a small startup company, there were on average 10,000 search queries made per day; fast-

forward to 2019, Google recorded an average of 5.5 billion searches per day.1 Furthermore,

this rapid growth is accompanied by new and evolving ad features. One such example is the

pay per click feature, wherein the advertiser pays for the click on the ad link and not for

the ad display.2 In such a dynamic environment, even a feature as simple as the number

of ads per page can have signicant implications. This paper focuses on the implications

of the number of search ads per page on the search engine's ad revenue. Search ads are

paid search links that appear above generic3 search results on major search engines such as

Google, Yahoo! and Bing. Figure E.1 gives an example of the Yahoo! result page.

The motivation for looking at the number of ads per page is twofold. First, analyzing

this feature can help us derive the revenue-maximizing number of ads per page, which can

provide suggestions on how to improve the current practices. Second, the empirical literature

on online advertising does not address how the number of ads impacts the ad price and,

thereby, the search engine's revenue. This paper lls this gap by quantifying the revenue

impact of the number of ads per page.

â€ This work is unpublished.
1https://martechseries.com/mts-insights/guest-authors/googles-seo-strategy-is-

constantly-changing-four-ways-small-businesses-can-keep-up/

2This is true for most of the online advertising platforms such as Google, Yahoo! Face-

book, Amazon, and eBay.

3Generic search results are non-sponsored links.

41

The search engine faces a trade-o while deciding the number of ads per page. The

addition of one more ad leads to an increase in the quantity of ads sold; however, it also

leads to a decrease in the ad price. Thus, the revenue-maximizing number of ads is an

empirical question that is derived after quantifying the change in price due to a change in

the number of ads. Note that the price for each ad is decided through an auction mechanism.

The change in equilibrium price is broadly due to the auction design, as well as due to the

externality exerted from other ads on the page. The externality is dened as a side eect of

the presence of other competing ads on an advertiser's benet from the ad. I will rst talk

about the externality in the market and then give details of the auction design.

Let us look at an example to understand the consequences of externality in this market.

Consider Walmart as an advertiser that has an ad appearing in two consumer search queries.

In the rst case, the ad appears adjacent to one from Amazon (as seen in gure E.1), and

in the other, it appears as an exclusive ad. In this example, the consumer who sees only

Walmart's ad is more likely to buy from them, compared to the consumer who sees both

Amazon and Walmart's ad. The presence of Amazon's ad exerts an externality on Walmart's

benet from the ad, which translates into Walmart having a higher willingness to pay for the

exclusive display. Notice that the externality from the other ad is dependent on the number

as well as the quality of the ads, where the quality measures the consumers' preference for an

advertiser. For instance, an adjacent ad from a high-quality advertiser, say Amazon, exerts

a higher externality than an ad from a low-quality advertiser. Thus, I create an externality

index that captures the quality as well as the number of other ads on the page.

Although the externality estimates are interesting by themselves, this paper aims to

compute the revenue-maximizing number of ads for which we also need to accommodate other

components that aect the equilibrium price. The selection and pricing of the ads is done

through an auction referred to as Generalized Second Price (GSP) auction.4 Additionally,

the advertisers pay per click and not for the ad's display; thus, the advertiser submits a per

4A new auction is held for consumer search.

42

click bid. The winning ads are decided using the weighted bid, where the weight captures

an ad's click probability. Note that a single auction is used to allot all the ad positions on a

given page. The positions are assigned in order of the weighted bids, meaning an ad with a

higher weighted bid is allotted a higher position. The equilibrium condition shows that the

ad price depends on the advertiser's willingness to pay, the consumer's click probability, and

the information available to the advertiser.

To evaluate this empirically, I use a data set provided by the Yahoo! research lab. It covers

all ads5 displayed on Yahoo! search result page, over four months, for ve major categories:

laptop, TV cable, cruise travel, collectible coins, and car insurance. I have information about

the number of displays, the number of clicks, ad description, and the ad position. The data

also provides the bids for each ad. Additionally, I can measure the number of advertisers per

day and the number of ads per page.6 Notice that the data does not provide the quality of

an ad, I solve this limitation by estimating the quality of an ad in terms of the advertiser's

eect on consumer's click probability.

For estimation, I use a discrete choice method to model consumers' click decisions and a

partial identication method to bound the advertisers' willingness to pay. The estimation is

performed in three steps. In the rst step, I estimate the parameters that aect a consumer's

click decision using a weighted logit model.7 In the second step, I set up a hedonic regression

to estimate the eect of the externality on an advertiser's bid. The nal step estimates

bounds on the distribution of the advertiser's unobserved willingness to pay for which I

follow the Haile and Tamer (2003) methodology for partial identication of distribution.

The ndings show that externality has a negative and non-linear eect on the advertiser's

5The data excludes ads that appear for the search of brand names.
6the number of ads on the rst page is assumed to be seven ads unless observed less than
7 ad positions. This is a common assumption made for papers using this data set from yahoo
such s Agarwal and Mukhopadhyan (2016) Agarwal & Mukhopadhyay (2016)

7The reason for using a binary logit model instead of a multinomial logit is for two reasons.
First, the choice set of the displayed ads varies depending on the consumers' preferences.
Thus, the choice set is endogenous. Second, the choice set does not have a xed number of
choices.

43

willingness to pay. For a one-percent increase in externality leads to 0.34, 0.6, 1.3 and 0.14

percent decrease in the advertiser's willingness to pay for categories car insurance, laptop,

cable tv and coins respectively.8 The eect of including one more ad is dependent on the

quality of the additional ad as well the number of ads already on the page. In the laptop

category following the Walmart example earlier, this would imply that addition of a second

ad decreases willingness to pay by twenty ve percent if the new ad is from Amazon, but

only by two percent if it is from a local retailer.9

These estimated primitives and equilibrium price condition are used in the Monte Carlo

simulation to derive and measure gains from implementing the revenue-maximizing number of

ads. To derive the revenue-maximizing number I calculate the expected revenue for dierent

values of the number of ads per page and then select the one that has the highest expected

revenue. The results show that the revenue-maximizing number is 5 ads per page, on average.

For three out of ve categories the revenue-maximizing number of ads are dierent than the

currently display quantity of ads per page (i.e. seven ads).

Implementing the suggested

number of ads per page would lead to, on average, a 4.5 percent gain in revenue, which

translates into 5.2 billion dollars in revenue.10 One of the reason for a dierence in the

result in this paper and current practices is that the paper suggests optimizing number of

ads separately for each category, however the search engines currently optimize the number

of ads jointly for all categories.

These results provide evidence on how search engines can increase revenue by changing

the design of the ad space. Furthermore, these suggestions extend the recent changes in

the online advertising market; for example, Microsoft has introduced a service (RAIS) that

provides advertisers with an option of an exclusive ad display.

This paper contributes to several dierent strands of literature.11 It contributes to the

8Note that cruise did not show a negative or signicant eect of externality
9assuming local retailer is of low quality and amazon is of high quality
10calculated using google's advertising revenue in the second quarter of 2019 - see here for

details https://www.statista.com/statistics/266249/advertising-revenue-of-google/

11Please refer to the next section for a full literature review.

44

studies that look at the eect of externality in the online advertising market. To the best of

my knowledge, this is the rst paper that estimates the eect of externality on advertiser's

willingness to pay. Additionally, this paper also contributes to the studies on equilibrium

price in auction design, where recent papers have looked at solving the equilibrium price un-

der a more realistic assumption on the information available to the advertisers (for example,

Athey and Nekipelov (2010) look at entry uncertainty). In this paper, I extend this further

by looking at the incomplete information case and provide for equilibrium conditions that

can be estimated empirically.12 Lastly, this paper contributes to the relatively new literature

on ad display design; the closest related paper is Jerath and Sayedi (2015). They look at

introducing exclusive ad display options, whereas this paper looks at the more general case

of the revenue-maximizing number of ads, in which an exclusive ad is a special case.

3.2 Literature review

This research is related to a few dierent strands of literature. It contributes to the lit-

erature on externality in the online advertising market. The empirical studies on externality

have focused on the eect of externality on the consumer's decision to click on an ad, such

as Jeziorski and Segal (2015) and Narayanan and Kalyanam (2015).13 Despite the growth

of literature on online ad externality, little eort has been made to empirically estimate the

indirect eect of externality on an advertiser's behavior. This paper focuses on these previ-

ously unexplored issues: the eect of online ad externality on an advertiser's willingness to

pay for an ad and, consequently, on ad platform revenue.

This paper is also related to the work on estimating the unobserved advertisers' will-

12The closest paper to this analysis is Gomes (2014), which solves for the incomplete
information case in the GSP auction. Here, I do a nontrivial extension of their work by
solving the incomplete information case in the weighted auction. Furthermore, I contribute
to this literature by providing equilibrium bounds that can be estimated empirically.

13For instance, Jeziorski and Segal (2015) Jerath & Sayedi (2015) show that consumers
click on multiple ads and that the click probability is aected by the presence of other ads.
Narayanan and Kalyanam (2015)Narayanan & Kalyanam (2015) show that, for large rms,
higher ad positions lead to smaller click probability improvements.

45

ingness to pay using the equilibrium bid. In the theory literature, Edelman et al. (2007)

(referred to as EOS) and Varian (2007) were among the rst to derive the equilibrium bid.

Although online ad auctions have received great attention in the theoretical literature, em-

pirical research remains sparse. Borgers et al.

(2013) analyze Yahoo! data to estimate

position-dependent value, and Yang et al. (2013) structurally estimate EOS's model. Athey

and Nekipelov (2010) propose and estimate a structural model tailored to features of spon-

sored search auctions run by US search engines (such as Google or Microsoft).14 A key

contribution of this paper is that it looks at the equilibrium behavior under weaker infor-

mation assumptions. The empirical literature in sponsored search auctions has looked at

variants of full information, with few looking at uncertainty in the market. This paper

relaxes the full information assumption and examines the optimal bidding behavior under

incomplete information. Gomes and Sweeney (2014) solve for the incomplete information

case in a non-weighted Generalized Second Price (GSP) auction. This paper extends their

work by looking at the incomplete information case in weighted GSP auction. The extension

is nontrivial as the weight introduces a multidimensional type of the bidder. Additionally,

the equilibrium bid does not have a closed-form. Thus, the paper further contributes to

this literature by providing closed-form bounds on the equilibrium bids that give us partial

estimates for the advertiser's willingness to pay. To the best of my knowledge, this is the rst

paper that proposes how to estimate the willingness to pay under incomplete information.

This paper is also related to the literature exploring multi-ad display settings in sponsored

search ads. For instance, few papers look at giving the advertisers the option of bidding

for both multi-ad and exclusive-ad option, such as Jerath and Sayedi, (2015), Deng and

Pekec (2013), and Ghosh and Sayedi (2010). The change in the auction design makes the

advertisers strategically change their bid, which has led to advocacy for changing the auction

design to VickreyClarkeGroves (VCG) auction, for example, in Sayedi, Kinshuk, Baghaie

14Specically, they accommodate uncertainty in bidders' perceptions (due to randomness
in a bidder's quality score over time, as well as in the set of competitor bidding in the auction
at any time).

46

(2018). In this paper, the counterfactual suggestion is on showing a xed number of ads

that dier across the ad product category. The advantage of the suggested xed number of

ads is that it does not require a change of setup for the advertisers. Thus, it is easier to

execute. Another contribution of this paper is that, unlike the previous papers that look at

this question from a theoretical point of view, this paper estimates the market parameters

in the empirical section. Thus, the simulations provide a more realistic magnitude of gain

through the proposed new method.

Lastly, this paper is related to econometric theory papers on partial identication meth-

ods. The methodology in this paper closely follows a method rst proposed in Haile and

Tamer (2003). Their paper shows how to estimate bounds on the distribution of object value

in an English auction. I extend it and show how to apply the method in an online auction,

i.e., a Generalized Second-Price auction.

3.3 Market environment and theoretical Model

3.3.1 Overview of the search advertising market

In this section, I discuss the search ad market from the advertiser's point of view. Sponsored

search ads are paid search links that appear alongside search results (as shown in g(E.1)).

These ad links are purchased by advertisers in order to have their website appear higher in

the search results page. Multiple ad position slots are allotted for each search result page

through an auction. The ad display process has three stages, as shown in the timeline (in

g(E.2)).

Stage 3.3.1. Advertisers select their bid and keywords.

In the rst stage, the advertiser has to decide the bid per click as well as the ad-related

words (referred to as keywords). Following the earlier example, this means Walmart needs

to decide the bid as well as a set of keywords for an ad on a gaming laptop. The advertisers

can specify multiple keywords for an ad. For instance, in this example, Walmart species

47

keywords `gaming laptop' and `gaming laptop cheap'. The multiple keywords help in reaching

consumers with diverse search queries.

Stage 3.3.2. Auction is held, and winners are decided.

In the second stage, for every search query, Yahoo! collects all related ads, which is done

by matching the search query words to the ad's keywords. The matched ads enter an auction

to decide the winning ads that will be displayed to the consumers. Yahoo!, similar to other

search engines such as Google, uses a special auction for deciding on the winning ad called

the Generalized Second Price (GSP henceforth) auction.

Stage 3.3.3. Yahoo! sends feedback on ad's performance and ad price to the advertisers.

The third and nal stage compromises of a feedback report that goes to the advertiser

about the ad performance. gure E.3 shows a snapshot of an advertiser's account on Ya-

hoo!. The variables in the data set are similar to the feedback information available to the

advertiser. The advertiser gets detailed feedback on the ad, which is aggregated for each

specied keyword. Following the earlier example, this means for Walmart's ad on `cheap

gaming laptop', Walmart gets periodical information15 on the two keywords, namely `gam-

ing laptop' and `cheap gaming laptop'. For each keyword, they get the display and click

frequency in each ad position. Apart from the consumer response, they also get information

on the price paid per click for each winning position. For example, feedback for Walmart's

ad shows that the search queries that matched keyword `cheap gaming laptop' had their ad

display 100 times in the 1st position, and that translated into ve clicks, with 0.2 cents as

the price per click. The data set used in this paper contains similar information, where,

on the consumer side, it reports the display and click frequency for each position won by

keyword-ad observation,16 and on the advertisers' side, it reports the advertiser's bid and
15I assume that the advertiser uses the day as a given period to change/revise their bid.
Note that the information provided to the advertiser can be more detailed than the daily
aggregation that I assume here. However, previous papers have noted that the change in a
bid does not change much within a day; see Borger (2013)BÃ¶rgers et al. (2013).

16aggregated on a daily basis

48

winning positions.

3.3.2 Theoretical model

In this section, I present a model of the advertiser's equilibrium bid and the consumer's click

decision using features of weighted Generalized Second Price auction (GSP w) and discrete

choice model. The reason for modeling the consumer side is to get an estimate of relative

click rate of dierent positions and each advertiser's quality score.

In the section below,

I specify the click behavior using a discrete choice model. In the next section, I present a

model of advertiser's equilibrium bid, in which the advertisers choose the bid that maximizes

their prot from advertising.

3.3.3 Consumers side

Each consumer i enters the market with a unit demand for a product/service and conse-

quently starts the search by putting a query on an online search engine. Once the result

page displays all links related to the query, the consumer clicks on all relevant links and pur-

chases a good or service from one of the clicked links. In this section, I model the consumer's

click decision.

The online environment motivates several considerations. Firstly, the consumer antici-

pates the derived click benet by visible characteristics of the ad. Along with the visible ad

characteristics, the consumer also uses the belief that ads at a higher position are of higher

quality and relevance. This belief stems from the fact that the search engine's algorithm

assigns a higher position to ads with higher quality score, ceteris paribus. Thus, I also add

the position as a variable that predicts ad benet. Another consideration is that in an online

space, each click requires the consumer to spend a considerable amount of time on it, which

can be thought of as a search cost or time cost.

49

The expected utility of consumer i receives from clicking in ad j in market m is given as:

Ui,j,m = U (xj, kj, xi; Î·j, Î·i) + i,j,m

(3.1)

Where {Î·j, Î·i} are coecients that reect how intensely the cost and benet variables aect
the utility. The variables {xj, xkj
} capture the benet from a click on ad j at position kj;
this includes advertiser specic and position-specic xed eects, along with ad's popularity
measure17. Apart from these variables, {xi} are consumer specic variables that help capture
heterogeneous search cost; this includes variables such as how detailed is the search (captured

by the length of search query18). The term i,j is the idiosyncratic shock to the consumer's

benet from clicking on the link; it represents a part of the utility which is observed by

consumer i, but not by the researcher.

I assume i,j,m is independently and identically

distributed according to type 1 extreme value distribution. Additionally, if the consumer

does not click on the ad, she uses her time for an outside good, leading to a normalized
utility of Ui,0,m â‰¡ 0. Given the utility function in the above equation, I now dene the
equilibrium click behavior of the consumers.

Proposition 3.3.4. consumers in equilibrium may click on multiple ads per page.

Essentially in the equilibrium, consumers click on all ad links where the benet of a click

is more than the search cost.19 Let yâˆ—
equilibrium click decision for ad j , with yâˆ—

i,j denote the binary variable capturing consumer i's

i,j = 1 if consumer decides to click on the ad.

17measured as the proportion of times the ad appeared in the search result page relative

to total search queries in the category

18This is measured as the number of words in the matched keyword, since I do not have

the queries

19although most of the literature assume single click per page, this is more real-
istic situation in this market, as can be seen by a new move by Bing to give an
option of opening a new tab every time you click on a link. Here is the link to
the article: https://searchengineland.com/bing-is-testing-an-open-in-new-window-icon-in-
the-search-results-301922

50

Then the click decision can be written as follows:

ï£±ï£´ï£´ï£²ï£´ï£´ï£³1

0

yâˆ—
i,j =

if U (xj, kj, xi; Î·j, Î·i) + i,j,m > 0

otherwise

(3.2)

The above equation is used in the empirical section to estimate the probability of click

aggregated over all consumers. The estimation gives us the predicted probability of a click

for an ad j in position k, represented by ctrj,k. Assuming that the ad and position eects

are separable20 the click probability can be rewritten as:

ctrj,k = sjck

âˆ€ j J & k  {1, 2, ...K}

(3.3)

where

sj: The eect of advertisement j on probability of a click.

ck: The eect of ad position k on probability of a click.

The click probability in equation(3.3) is used in advertiser's maximizing problem as described

in the next section.

3.3.4 Advertising model

3.3.4.1 Model without externality

In this section, I rst specify the auction design. I then model the advertiser's maximization

problem and derive the equilibrium bid. In the end, I look at the case of added externality

to the per-click ad value.

Auction Setup

For each consumer query, a single auction is held to sell multiple ads on the result page. Let
there be K ads on sale and N potential advertisers denoted by j  J := {1, ...J}. Each

advertiser submits a single bid bj, which can be interpreted as per click payment.

20This is a similar assumption adopted by various papers in the literature for identication

of the quality of advertiser

51

The K positions are allotted through a weighted Generalized Second Price auction (re-

ferred to as GSP w henceforth). Each advertiser is assigned a quality score, sj, which mea-
sures the ad's impact on the consumer's click probability.21 The advertisers are then ranked

in terms of their weighted bids denoted as bw

j , i.e.,
j = bj Ã— sj
bw

The auction assigns position in decreasing order of weighted bids, essentially allotting

kth position to advertiser with kth highest weighted bid. For example, the top position goes

to advertiser with the highest weighted bid, second position goes to second highest and so

on. The price paid is equal to the bid of advertiser in the next slot weighted by their relative

score:22

Ad position k alloted to j if bw

j = b

[k]
w

Per click price is equal to pk =

b

[k+1]
w
s[k]

Here b

[k]
w is the kth highest weighted bid and s[k] denotes the score of advertiser with kth
highest weighted bid. Thus, the kth highest position goes to the advertiser with kth highest

weighted bid, and he pays the price of the bid below him weighted by their relative score,

which is equivalent to the weighted bid of the advertiser below him divided by his score.

Advertisers maximization problem

Each advertiser j has a per click ad value of vj. The advertiser's gain from winning position

k is given as

Ï€k(vj) = ctrk,j(vj âˆ’ pk)

(3.4)

21The score can also be thought of as the quality of the ad, as higher consumer relevance

is an indication that the consumer perceived it to be of a higher quality.

22I assume no reserve price for simplicity.

52

where vj denotes the per click value to the advertiser and pk is the per click price for position

j. The term ctrk,j denotes the probability of a consumer click for ad j in position k. The

probability of a click can be further broken down as

ctrk,j = sjck

(3.5)

here sj is the advertiser's aect on click probability and ck is the position aect on click

rate. These variables are estimated on the consumers side.23

Note that the advertiser is uncertain as to which position he wins. Thus, the expected

prot is given as the prot from each position times the probability of winning that position.
[k]âˆ’j,w

Before we write the expected prot, we dene the order statistic among bids.

let b

denoted the kth highest weighted bid among all other advertisers except j.

Advertiser j wins position k if the weighted bid bw
j

is less than (k âˆ’ 1)th weighted bid

and more than kth weighted bid, i.e. b

[k]âˆ’j,w â‰¤ bw

j â‰¤ b

[kâˆ’1]âˆ’j,w . As shown below:

K(cid:88)

k=1

Î (bj; vj, sj) =

P rob(b

[k]âˆ’i,w â‰¤ bw

j â‰¤ b

[kâˆ’1]âˆ’i,w ) âˆ— ctrk,j

3.4

Equilibrium bid analysis

(cid:20)

(cid:18)
vj âˆ’ E

(cid:12)(cid:12)(cid:12)(cid:12)bw

pk

j = b[k]

(cid:19)(cid:21)

(3.6)

Model Assumptions  This model looks at the incomplete information case. Each adver-

tiser does not know the bid or value of other advertisers but knows the primitive distributions;

namely the distribution of value Fv, the distribution of score Fs as well as the the distri-
bution of a variable referred as the weighted value, denoted as Ï‰j â‰¡ sj Ã— vj), Ï‰ âˆ¼ Fw(.).
The weighted value is the product of the per click value and the click rate of the advertiser.

Apart from this, the number of advertisers and ads per page are common knowledge. For the

23Note, that as I assume one ad per advertiser for each auction, I have used subscript j

interchangeably for advertiser as well as an ad.

53

estimate of the externality we also need the additional assumption that the average quality

of ads in each position is known to each advertiser24.

Given the setup, the equilibrium bid maximizes the following objective function:

K(cid:88)

k=1

K(cid:88)

k=1

(cid:20)
vj âˆ’ E

(cid:18) b

(cid:12)(cid:12)(cid:12)(cid:12)b

[k+1]
w
sj

(cid:19)(cid:21)

Ë†b

[k]
w = sj

(cid:18) b

(cid:12)(cid:12)(cid:12)(cid:12)b

[k+1]
w
sj

(cid:19)(cid:21)

Ë†b

[k]
w = sj

[kâˆ’1]âˆ’j,w )ctrj,k
(cid:20)

[kâˆ’1]âˆ’j,w )ck

b(vj, sj) = Arg max

Ë†b

P rob(b

[k]âˆ’j,w â‰¤ Ë†bsj â‰¤ b

Using ctrj,k = cksj we get,

â†’b(vj, sj) = Arg max
Ë†b

P rob(b

[k]âˆ’j,w â‰¤ Ë†bsj â‰¤ b

vj âˆ’ E

In order for the advertisers to solve for the equilibrium bid they need to know the distri-

bution of other bids, however we assume that they only know the distribution of the value

and do not have information about other bids (i.e.

incomplete information case). In the

standard auction this is solved by inverting bid and using the value distribution. However,

in this case the weighted bid bw(sj, vj)) is is multi-dimensional as it depends on the value

vj as well as the score sj.

Due to this limitation most of the papers in the literature have concentrated on non-

weighted auction in the incomplete information case. This paper over comes the problem of

bid by showing equivalence between the weighted GSP and another auction which I refer as

the modied GSP auction (GSPM henceforth). In the GSPM auction, instead of a per click
bid the advertisers bid Ëœb for sj clicks. The value for sj click then becomes wj, which recall

is the weighted value wj = sjvj. Thus, the bid in GSPM serves a similar purpose as the

weighted bid in GSP auction. The dierence being that in GSPM the bid can be written as

a single dimension function given as follows:

bGSP M
w

: (Ï‰j) â†’ R+, where Ï‰j = sj Ã— vj

bGSP M (Ï‰j). In the next lemma, I formally prove the equivalence between the weighted bid

in weighted GSp to the bid in GSPM.

24The assumption of average quality of the ad in each position is only needed for exter-
nality, the BNE equilibrium in incomplete information case does not need this assumption.

54

Lemma 3.4.1. The weighted GSP auction equilibrium bid function bGSP w

(vj, sj) â†’ R+ is

equivalent to the bid in GSPM auction divided by the quality score sj

bGSP M (Ï‰j)

sj

= b(vj, sj),

â†’bGSP M (Ï‰j) = b(vj, sj) âˆ— sj = bw

j (vj, sj)

where Ï‰j = sj Ã— vj

(3.7)

The lemma shows that the weighted bidding function is equivalent to a function that

is dependent only on one-dimensional advertiser's type Ï‰j. Additionally, it shows that at

equilibrium we can rewrite the weighted bid sjbj as a function of weighted value , i.e. bw(Ï‰j).

This simplication comes in handy for proof of bounds and equilibrium as now inverse of

bid function is one dimension.

Proposition 3.4.2. The unique symmetric Bayesian Nash equilibrium of the weighted GSP

auction is given by the following bidding strategy:

bâˆ—(Ï‰) = Ï‰ âˆ’ Î“(Ï‰) âˆ’

Mn(Ï‰, t)Ï†(t)dt

âˆ€Ï‰ âˆ¼ Fw(.)

(3.8)

(cid:90) Ï‰

âˆž(cid:88)

n=1

0

where

Î“(Ï‰) =

ck

ck

k=1

Nâˆ’2

kâˆ’1 (1 âˆ’ F (Ï‰))kâˆ’2(cid:82) Ï‰
K(cid:80)
K(cid:80)
K(cid:80)
K(cid:80)
(cid:90) Ï‰

0 F Nâˆ’k(x)dx
Nâˆ’2
kâˆ’1 (1 âˆ’ F (Ï‰))kâˆ’1F Nâˆ’kâˆ’1(Ï‰)
Nâˆ’2
kâˆ’1 (1 âˆ’ F (Ï‰))kâˆ’2F Nâˆ’kâˆ’1(t)
Nâˆ’2
kâˆ’1 (1 âˆ’ F (Ï‰))kâˆ’1F Nâˆ’kâˆ’1(Ï‰)

ck

ck

k=1

k=1

M1(Ï‰, t) =

k=1

Mn(Ï‰, t) =

M1(Ï‰, )Mnâˆ’1(Ï‰, )d

0

âˆ€n â‰¥ 2

The above proposition shows the equilibrium bid25. The equilibrium bid in equation

(3.8) has some issues regarding empirical identication. First, the structural elements can

25Once the lemma(3.4.1) is used to make the objective function depend on weighted value
only , the subsequent proof of the equilibrium is similar to BNE derived in Gomes & Sweeney
(2014)

55

not be identied using observed parameters in the data; specically, the equation uses the

unknown latent distribution F (.). Secondly, it does not have a closed-form solution. In order

to derive identication of equilibrium bid empirically, I derive bounds on the bid as shown

in the next section.

3.4.1 Bounds on exclusive ad value

As the equilibrium bid does not have a closed-form, I will instead use bounds on the equilib-

rium bid to derive bounds on the exclusive ad value. The next proposition bounds the bid in

the Generalized Second Price auction in between bid from two more well know auctions that

have a closed and easily tractable equilibrium bid. Specically, I use VickreyClarkeGroves

auction (VCG) auction and a Weighted Generalized First Price auction (GFP). The next

proposition bounds the bid shading in these three auction designs. Bid shading is the amount
by which the advertisers shade their bid below their ad value  (vj âˆ’ bj).

Proposition 3.4.3. The bid shading in GSP w auction can be bounded between the bid in

analog VCG and generalized rst price (GFP) auctions.26

(cid:125)
(vj âˆ’ bj)VCG â‰¤ (vj âˆ’ bj)GSP w

(cid:123)(cid:122)

(cid:124)

price less than bid

(cid:124)
(cid:125)
â‰¤ (vj âˆ’ bj)GF P w

(cid:123)(cid:122)

price equal to bid

âˆ€j×’

(3.9)

Shows that the bid shading is more than the truthful bidding in VCG; however, it is less

than that of the rst-price auction. This is because even though the GSP auction bid aects

price, the eect on price is still less than that of a rst-price auction. To understand this,

recall that in GSP, your price is equal to the bid below your bid, whereas, in the analog

rst-price auction, you pay your bid. Using the above, I can bound the value as follows:

Proposition 3.4.4. If the proposition(3.4.3) holds, advertiser's value can be bounded in

26proof of proposition (4) and (5) are given in appendix(G.1)

56

terms of observed variables27:

bj â‰¤ vj â‰¤ bj + Î¦(fw(.), ck, bj)

âˆ€j×’

(3.10)

Here Î¦(fw(.), ck, bj) is the bid shading amount, which shows that advertisers would bid

below their value. The term can be expanded in terms as shown below

Î¦(fw(.), ck, bj) =

(cid:0)Nâˆ’1(cid:1)
(cid:0)Kâˆ’1(cid:1)(cid:20)

K(cid:80)

k=1

ck

sj

f (Ï‰j )
(cid:48)
b
(Ï‰j )

(cid:0)Nâˆ’1(cid:1)
(cid:0)Kâˆ’1(cid:1) (1 âˆ’ F (Ï‰j))kâˆ’1(F (Ï‰j))Nâˆ’k

K(cid:80)

k=1

ck

(N âˆ’ k)(1 âˆ’ F (Ï‰j))kâˆ’1F (Ï‰j)Nâˆ’kâˆ’1 + (k âˆ’ 1)(1 âˆ’ F (Ï‰j))kâˆ’2F (Ï‰j)Nâˆ’k

(cid:21)

The numerator is the probability of winning any position in the auction and the denomi-

nator the dierentiation of that probability. Notice that the function Î¦(fw(.), ck, bj) has an

unobserved distribution fw i.e., distribution of weighted value. However, we can substitute

that with the bid distribution as the weighted bid is monotonic.

I will use the following

equality conditions :

Thus, the function

Î¦(gb(.), ck, bj) =

(cid:0)Nâˆ’1(cid:1)
(cid:0)Kâˆ’1(cid:1)(cid:20)

K(cid:80)

k=1

ck

sjg(bw)

G(bw) = F (Ï‰|N )
f (Ï‰|N )
(Ï‰)
b

g(bw) =

(cid:48)

(cid:0)Nâˆ’1(cid:1)
(cid:0)Kâˆ’1(cid:1) (1 âˆ’ G(bw))kâˆ’1(G(bw))Nâˆ’k

K(cid:80)

k=1

ck

(N âˆ’ k)(1 âˆ’ G(bw))kâˆ’1G(bw)Nâˆ’kâˆ’1 + (k âˆ’ 1)(1 âˆ’ G(bw))kâˆ’2G(bw)Nâˆ’k

(cid:21)

(3.11)

The lower bound is a well known observation in the auction literature that a rational

advertiser will not bid more than the value. The upper bound is the bid plus the bid shade

amount in case of the GFP auction.

27proof in appendix(G.1)

57

3.4.1.1 Model with externality

Now we look at the Advertiser's model with externality. The aim of this subsection is to

show how we can separate the value with and without externality.

The per-click value to the advertiser is compromised of two-part, (i) an advertiser-specic

ad value and (ii) a negative value from the number of other ads on the page  externality

component. To have a simple and tractable model we assume that the two-component are

separable. Altogether the per click value for buyer j is given as:

Vj = vj(EXTj)Î²1

(3.12)

where vj corresponds to the externality free part as analyzed in the previous section. The

second part captures the externality which depends on other ads EXTj is a function of

Kj, which denotes the set of ads present next to ad j. Note Î²1 captures the intensity

of externality's eect on ad value.

In the empirical section I well estimate Î²1 ( refer to

step(3.7.2) of the estimation section for more details).

Deriving externality eect on the bid: The goal of this step is to recover the

externality impact on the bid and in turn, on the advertiser's ad value. Using eqn(3.12),

I can derive the relation between externality and the advertisers' bid; the next proposition

formally states the result.

Proposition 3.4.5. The additive separability of externality and individual eect on valuation

leads to additive separable bid parameter (after controlling for various factors).28

(cid:124)

(cid:123)(cid:122)

(cid:125)

(cid:124)

(cid:123)(cid:122)

(cid:125)

(cid:124)

(cid:123)(cid:122)

(cid:125)

Log(b(vj, sj; Î±K,n, Extj)

) = Log( b(vj, sj; Î±K,n)

) +

Î²1Log(Extj)

(3.13)

observed bid

externality free bid

externality eect on bid

Where the bid b(vj, sj; Î±K,n, Extj) gives the bid in the presence of externality, which is

dependent on advertiser specic ad value (vj) and score (sj) as well as the auction parameters

Î±K,n, where K denotes the number of ads and n denotes the number of advertisers. The

28proof in the appendix(G.1)

58

proposition shows that the log of bid in the presence of externality is separable in externality

eect Î²1Log(Extj,Kj
the market Log(b(vj, sj; Î±K,n)).

) and the unobserved hypothetical bid if there was no externality in

The above proposition will help to estimate the externality in the empirical section,

where I estimate Î²1 that captures the intensity of externality's eect on ad value. Another

insight from this proposition is that once we know the Î²1Log(Extj) we can estimate the

externality-free bid, i.e. b(vj, sj; n). In the next section, I look at the equilibrium bid in case

of no externality. The next section helps derive the externality free ad value, i.e. vj, which

can also be thought of as the exclusive ad value.

3.5 Data

3.5.1 Data details

The data set is provided by Yahoo! as part of the Yahoo! Research Alliance Webscope pro-

gram.29 It is a four-month period of data covering search queries from January 2008 to April

2008. The sample covers all search ads in 5 categories, namely Laptop, TV Cable, Cruise,

Collectible Coins and Car Insurance.30 Each category is treated as a separate data set, and

the results are obtained separately for each of them. The advantage of data from multiple

industries is that after the estimation, we can compare the results across industries to see

whether the results are sensitive to industry characteristics. The data set has two parts. The

rst part has consumer side information, and the second part has advertiser bid and auction

outcome information. For this the analysis, I limit my sample to ads on the rst page of the

search result.31 This is mainly done because 90% of the clicks in the data are from rst page

ads, as can be seen by gure E. This nding is also consistent with the observed pattern in

29The data set I analyzed was part of the Advertising & Markets Data and, more specif-
ically A3. Yahoo! Search Marketing Advertiser Bid-Impression-Click data on competing
Keywords

30Search of specic brands names are removed from the data.
31A similar restriction was followed in Athey and Nekipelov (2010)Athey & Nekipelov

(2010), who use Bing data

59

the industry, which shows that the consumers do not go beyond the rst page.32 For more

information on data cleaning refer to appendix(J.0.1).

Consumers side data: On the consumer side, I have information about ad display and

consumers' click response for each position-advertiser-keyword combination. Continuing the

Walmart example, the data will report that Walmart's keyword `cheap gaming laptop' got

100 displays in the 1st position, and that translated into ve clicks. Table(F.1) gives the

list of variables used and table(F.3) gives the summary statistics. Notice that this is similar

to feedback given to an advertiser in gure E.3. Thus, this data is useful in analyzing how

advertisers decide their equilibrium prot-maximizing bid.

Apart from the clicks and display information, a few other important measures can be

obtained from the data. First, we can deduce the click rate of each ad,33 which is measured

as the ratio of the number of clicks over the number of displays. The summary statistic show

an average of 1% click rate, implying that about 1% of the ads get clicked. Additionally,

the keyword (matched words between ad and search) gives an approximation on the type of

search. Therefore, the number of words in the keyword referred to as keylength can be used

as an approximation for the length of the search query. Previous papers in the literature34

have noted that longer search queries are typically associated with a more focused search

intent and can thus be more valuable for the advertisers. The maximum number of words

is 10, with an average keylength of 3 words. Another variable used is the popularity of the

keyword, measured by the relative number of searches. This controls for the possible eect

of the popularity of the search.

Advertisers side: On the advertisers' side, the data is likewise aggregated on a day level.

32Various articles show that, apart from Yahoo!

consumers on Google and Bing so
not go beyond page 1: https://www.conversionguru.co.za/2017/05/29/90-people-dont-go-
past-page-1-google-search-results-searching/, https://www.theleverageway.com/blog/how-
far-down-the-search-engine-results-page-will-most-people-go/

33ad is dened as the set of keywords for which the advertiser had the same bid on a given

day

34for instance, Ramaboa, Kutlwano KKM, and Fish, Peter (2018) look at dierences in

consumers with dierent search length

60

For each ad,35 I have information on the bid for the ad, the number of times the ad won

an auction, the winning ad position and the total number of advertisers shown in a day.36

Table F.2 lists the variables and table F.4 provides the summary. The bid is measured in

terms of cents. To mask the actual amount, Yahoo! scaled all bids by an unknown amount.

I subtract the bid with the lowest value, Thus, the bid can be taken as the lower bound

on the actual bid. The average bid is 0.6 cents. I limit my analysis to the ads on the rst

page considering the top 7 ads.37 These variables help in identifying the auction outcome

for each advertiser. Through the data, I can measure how many times an advertiser had a

winning ad in the auction, which position he won and what was his corresponding bid.38

Apart from the given variables, I can also measure the ad specicity in terms of the number

of keywords specied for an ad. This gives me an approximation of whether the ad was

made for a broader search or a specic search. Additionally, I also use the popularity of the

keywords measure by the display frequency.

A note on grouping ads: As the data set has no information on consumer queries, it

is hard to know which ads enter the same auction. Nonetheless, the rule through which

Yahoo! decides which ads enter an auction can provide useful insights on how ads were

matched together. Recall that here, each consumer query is a separate auction. For each

query, Yahoo! pulls out relevant ads by matching the ads' keywords with the consumer query.

In eect, keywords related to each other enter the same auction. Thus, the paper creates

markets that are sub-groups of keywords that are related to each other. I assume the sub-

35I dene ad as the set of keywords-advertiser combination in a day for which advertiser

had the same bid

36I assume that the total number of potential ads is equal to the total number of ads that

won at least once in a day

37On average, seven ads were shown on the page. I assume the number of are seven ads

unless the number of positions observed was less than seven.

38The real identity of keywords and advertisers are kept condential by de-identifying the
data. This is done to avoid revealing any proprietary information. Also, all the bids are
scaled by an unknown amount in order to avoid revealing information about the total revenue
of the platform. Even though the data is de-identied, I can still track the same keywords
as the same de-identied number is used for all observations.

61

groups are created such that only the ones within a market compete with each other.39 I

assume that the same set of advertisers enter all auctions within a market; this way, the

markets can be treated as a proxy for auctions.

Dierences across ad product category: Table(F.5) shows mean values for dierent vari-

ables across categories.40 The relative popularity of these categories compared to the total

daily search can be seen in gure(E.5); laptop has a high popularity for consumer search

with value of 55% meaning it is more than half as popular as the peak popular search topic

measured as 100%. The car-insurance category stands out with a high bid level, with an

average 4.36 cents. It also has a relatively concentrated market with 20 advertisers per day.

The high bid level makes this an important market to analyze from Yahoo!'s perspective.

Apart from car insurance, the laptop market is also important to analyze from Yahoo!'s

perspective as it is the most popular search category with an average 540.7 search per day.

It also seems to have a high level of competition as there are, on average, 45 advertisers per

day. Lastly, the TV cable category has the highest high click rate and the second highest

per click bid making it yet another protable market for Yahoo!. Thus, the data provides

categories that have dierentiating characters and can, therefore, help us check how the

results vary across the category.

3.5.2 Characterizing features of the data

This section provides evidence on how the assumptions of the current model t the observed

data. I also look at the variation in the data that might be helpful to identify the parameters

of interest.

1. Winning bid statistics: A distinct feature of this auction is that each auction has

39I give more details on the process of making markets in the estimation section.
40Note that the data was masked, so the actual keywords were converted into random
I know the name ve dierent categories, which I match to the
numbers and alphabet.
masked categories using characteristics of the market. Please refer to the appendix(J.0.2)
for more details

62

more than one winner. Thus, unlike standard auction designs, where the researcher

can only use a single winning bid for each auction, I can use multiple winning bids for

each auction. This is especially useful in the last step of the analysis, where I have to

use bids to estimate the distribution of advertisers' willingness to pay. Figure(E) shows

the mean bids for dierent winning positions in the ve categories41. It seems that in

some categories, such as laptops, the bidders are willing to bid substantially higher to

get the rst position. However, in other categories, such as cable, advertisers are not

particularly willing to pay a higher price for a better rank. This dierence might be

due to the varying sensitivity of consumers to the position of the ad. Note that as the

auction mechanism decides the allocation of the ad position according to the weighted

bid, it is not necessary for a higher bid to get a higher position.42

2. Variation in attributes of the ad:

To better understand the determinant of how advertisers and consumers value each

other. I look at how consumers interact with the ad. One of the attributes of ads is the

position of the ad. In general, consumers tend to focus on higher positioned ads than

the ones below. To see whether I can disentangle the eect of an ad's characteristics

from the eect of an ad position on the consumer's behavior, I need variation in the

position allotted to an ad. I observe many ads in the data which are placed at dierent

positions on dierent days. Using this, in gure E.10, I plot the relative click of the

same ad in dierent positions. To compare this across ads, I measure the clicks for

each ad relative to the clicks the ad received in the rst position. The measure of clicks

used here is click-through rate, which is the probability of an ad getting a click. The

click-through rate is calculated as the ratio of clicks to impressions. In the estimation

section, this is derived from the consumers' choice model.

41To consistently graph all the categories in one graph, I have scaled the bid in car insurance

category to match the bid range of other categories (only for this graph)
42if there are two advertisers such that b1 > b2, advertiser 2 might get a higher position if
he has a higher weighted bid, i.e. bs âˆ— s2 > b1 âˆ— s2.

63

3. Externality: An important observation from the data, is the preliminary evidence

of externality. The externality is evident by examining the dependence of bid for a

given advertisement on the relevance of other ads displayed in the keyword category.

For example, gure E.11 shows the linear relation between the average bid and the

percentage of high-quality ads in the market.43 This shows how markets with high-

quality ads have a net negative eect on bids. This result gives preliminary evidence

that the advertiser chooses to bid less when other high-quality ads appear in the search

result page, indicating they take into account the eect of externality on consumer's

response to their ad.44

3.6

Identication

In this section, I provide intuition for the identication of the advertiser's externality-free

value distribution and the parameters for externality. I also characterize the machine learning

algorithm used for creating markets. Note that for the theoretical model I suppressed the

market subscript. However for next few sections, I add the subscript m, which represents a

market. Note, market is the ad-group and day combination.

3.6.1 Advertisers value distribution

The approach followed to derive upper and lower bounds on advertiser's value distribution

uses a combination of two standard method in the auction literature, as discussed in paper

(Guerre, Perrigne, Voung, 2000) and (Haile and Tamer 2003). The inequalities derived at

the equilibrium bid (Proposition (3.10) imply for each advertiser i, we have the following

43the the result is obtained regressing bid on the dummy variable for high-quality ads,

after controlling for position, category, popularity, keyword eects.

44The quality is measured as the ratio of clicks to impressions. In the model, it will be

formally derived from the consumer side.

64

bounds:

j,m â‰¤ vj,m â‰¤ Ë†bext
Ë†bext

j,m + Î¦(Ë†g(.|Ë†b), Ë†ck,m, Ë†bext

j,m, Ë†sj,m)

âˆ€ j, m

(3.14)

This gives an upper and lower bound for each advertiser in the sample. Thus, the distribution

of advertisers' valuation, Fv(.) can be recovered if I know (1) the estimates of reduced(i.e.
externality free) bids (i.e. Ë†bext

j,m); and (2) the bid shading amount, Î¦(Ë†g(.|Ë†b), Ë†ck,m, Ë†bext

j,m, Ë†sj,m).

These parameters are estimated in the initial steps of the estimation process and are then

used to derive the distribution bounds in the last step. For now lets assume we can estimate

them.

From the bounds on value in equation(3.14), I derive bounds on its distribution by using
stochastic dominance, which recall implies that if x â‰¤ y âˆ€ x y, then Fx() â‰¥ Fy(). Let Hb()
be the distribution of Ë†bext
j,m, Ë†sj,m).

j,m and HÏ†(.) be the distribution of Ë†bext

j,m + Î¦(Ë†g(.|Ë†b), Ë†ck,m, Ë†bext

Using stochastic dominance in equation(3.14) gives the following result:

HÏ†(.) â‰¤ Fv(.) â‰¤ Hb(.)

(3.15)

Consistent and asymptotically normal estimates of the pointwise upper and lower bounds

can be obtained by taking the idealized sample analogs of these endpoints. This is a standard
case of nonparametric estimation of a CDF using kernel estimation, which gives us Ë†HÏ†(.)
and Ë†Hb(.). Further details are given in the estimation section.

Now getting back to the estimates used above, i.e. (Ë†ck,m, Ë†sj,m, Ë†bext

j,m, Ë†g(.|b)). These are
discussed in the next two subsections. The rst two (Ë†ck,m, Ë†sj,m) are estimated using con-
j,m, Ë†g(.|b)) are estimated
sumer's click data in a discrete choice model. The other two (Ë†bext

advertisers bid data.

3.6.2 Consumers click behavior

I use the utility function of consumers in equation (3.2) to estimate the predicted probabilities

of a consumer clicking on the advertisement. The weighted logit model is used to separately

65

estimate the eect of ad characteristic (i.e. Ë†sj,m) as well as the position (i.e. Ë†ck,m). The data

identies the position and ad characteristic for each ad click by the consumer. I also observe

the impressions, which signify the number of times an ad was displayed per period. The

variables, impressions, and clicks together help identify the number of clicks and no-click

observations per day. Further detail is given in the method section.

3.6.3 Externality estimation

(cid:113)(cid:80)

Externality is dened as the quality weighted sum of number of ads per page, which can be
k(cid:54)=K Ë†s2âˆ’j,m, where Ë†s is the average quality of an ad in position k for
all ads except ad j.45 The quality estimate, Ë†s, is derived from the consumer side analysis46.

written as Extj,m =

The quality is denoted on 0 to 1 range with 1 being the highest quality. Fig(E.18) shows

the distribution of externality variable across dierent categories. I use variation in average

quality of ads across markets and periods to identify the externality co-ecient.47 The
estimates are then used to calculate the externality-free bid, bextâˆ’f ree
= bj,m,t(Extj,m)âˆ’Î²1.

j,m,t

There is a concern that externality might be endogenous. Even though the quality of rival's
ads does not directly impact the advertiser j(cid:48)s bid, there is possibly an indirect eect as

the relative quality aects the winning probability. Thus, the variable externality is treated

endogenous, and the externality in other markets are used as instruments. The instrument

variable is independent of any supply-side eect from the advertisers, and at the same time,

it correlates with the demand side i.e. consumer behavior. Thus, it is correlated to the

externality but is independent of any eect from the bid decision.48

45this term is weighted by dierent positions the advertiser gets in a day
46This is under the assumption that the quality of an ad can be captured by the ad
dependent aect on consumer's click probability. Note that this measure is widely used as
quality measure by search engines such as google and yahoo.

47The number of ads in most cases is 7 ads for rst page
48This is similar to the standard practice of using prices in other markets as an instrument

for price in IO literature.

66

3.6.4 Creating market using unsupervised machine learning

Each advertiser species a set of keywords that describe his ad. As mentioned earlier, Yahoo!

matches these keywords with consumer's query in order to pull out relevant ads. In eect,

keywords related to each other enter the same auction. Thus, the paper creates markets that

are sub-groups of ads such that only the ones within a market compete with each other. In

this section, I give details of how the markets are created.

Fig(E.14) shows the visualization of the distance between keywords for the ve main

categories. The distance measure used is cosine similarity. A key insight that will help

increase precision for estimating markets is to use the fact we know the main categories. We

will use them to see what is the best method to cluster the markets. Specically, we will

focus on two aspects that are needed for the algorithm, namely the attributes that capture

the similarity between ads and the distance matrix used.

The rst step of grouping keywords is to nd a way to calculate the similarity between

dierent keywords. A keyword is composed of multiple words. These words are used as

attributes that take positive value for keywords that contain the word. The similarity mea-

sure then assigns a positive score for each matched word and a negative score for each

non-matched word. Additionally, the score can be improved by putting more weight on a

less frequent word being matched, under the assumption that such words are more valuable

signal for grouping similar keywords than words that frequently appear across all keywords.

In order to do this, I use tf-idf (term frequency inverse document frequency)49 as a weighting

factor. The weight increases proportionally to the number of times a word was used for an

ad, and is oset by the overall frequency of the word, which helps to adjust for the fact that

some words appear more frequently in general.50. Apart from using words as attributes for

the ad, I also need to specify the distance metric used between keywords. There are several

49tf-idf is a numerical statistic that is intended to reect how important a word is to a

document in a collection or corpus.

50tfidf is one of the most popular term-weighting schemes today; 83% of text-based

recommender systems in digital libraries use tfid.

67

distance measures, such as euclidean distance and cosine similarity. I use dierent distance

measures to sort the keywords in the ve main categories and see which one works the best.

For details, see g(E.14), the graph plots the distance between keywords for various distance

measures51. It turns out cosine similarity measure works the best, coincidentally this is also

one the most commonly used distance measure to match text documents in the machine

learning literature. Thus, cosine similarity will be the selected choice.

The advantage of using unsupervised machine learning is that except the similarity mea-

sure and features(words in this case), no other variable needs to be specied. Furthermore,

we don't need a measure of actual markets to get a prediction. Remember that here we

are creating subgroups(or markets) within a base category, for example, one of the base

categories is Laptop. Thus the algorithm might have a tendency to group all of them in

one big group as `laptop' will be a common word for all of them. To solve this, we delete

the single word `laptop' as a matching feature. This means two keywords `Business Laptop,'

and `Student Laptop' will have zero similarity score as the only word common is `laptop',

which is disregarded in case of making markets. The rest of this section gives details of the

k-means algorithm.

After the keywords are processed and vectorized, the markets are dened using k-means

clustering. Given a set of keywords X = {x1, x2, ...xNm} and an exogenously determined
number of group, i.e. M , the algorithm assigns each keyword x  X to one of the m groups.

For each group has a centroid, which is one of the elements from X. A keyword xl is in
group m if and only if the similarity measure is the highest for other x(cid:48)s in m than to those
in other groups. Let Î»l,m denote the allocation variable such that:

ï£±ï£´ï£´ï£²ï£´ï£´ï£³1,

0, otherwise

Î»l,m =

if xl is in cluster m

(3.16)

Let Î¸m be the chosen centroid of group m. The cluster algorithm decides the allocations

(Î») and centroids(Î¸) by maximizing the mean squared distance between points within the

51compressed in two dimensional space

68

group, as shown below:

(cid:18)
1 âˆ’ m(cid:88)

m=1

min
Î¸,Î»

(cid:19)

Î»l,m

X.Î˜m

||X||||Î˜m||

(3.17)

The k-means algortihm used in this paper uses cosine distance to calculate similarity between

points52. Equation(3.17) is solved recursively. It is a two steps process which is repeated till

stable groups are reached. In the rst step it optimally selects Î»l,m for each keyword i given
Î¸ = {Î¸1, ...Î¸M} and in second step optimal Î¸ is picked given Î». The algorithm is given as:

(cid:18)
1 âˆ’ M(cid:88)

m=1

(cid:19)

Î»l,m

X.Î˜m

||X||||Î˜m||

Step:1 min

Î»

Nm(cid:88)

Step:2

Î»l,m(xl âˆ’ Î¸m) = 0

âˆ€i

âˆ€m

l=1

The steps are repeated until convergence of Î» and Î¸. The optimal number of clusters

(i.e. markets). The optimal number of clusters m is determined by repeating the algorithm

across a dierent number of clusters and then using the silhouette score, which provides an

average measure of how well each keyword matches with the allotted market, compared to

how it matches with other markets.

3.7 Econometric Method

In this section, I describe tmhe estimation method. The estimation steps are as follows:

Âˆ Step 1: Estimate consumers click probability.

Âˆ Step 2: Estimate the externality eect on advertisers bid.

Âˆ Step 3: Estimate lower and upper bounds on the advertisers exclusive ad value distri-

bution.

52There are several other distance measures that can be used in k-means algorithm

69

3.7.1 Step 1: Estimate consumer click probability

In this step, I model the consumers click choice. The main aim of this step is to derive

ad position and advertisers' eect on the click probabilities. The ad position eect on click

probability are used as a measure for click rate for each position.53 The advertisers' partial

eect on click probability is used to create the measure for ad quality. To estimate these two

eect I consider a click/no-click binary choice setting using a weighted logit model,54 where

the weights are on the frequency of clicks and no click. As shown in the theory section the

consumers i's click choice on ad j is given as follows:

ï£±ï£´ï£´ï£²ï£´ï£´ï£³1

0

yi,j =

consumer i clicks on ad j if Ui,j > 0
otherwise Ui,j â‰¤ 0

The above equation shows that the click decision is captured by the binary variable yi,j = 0, 1,

where y = 1 is consumer decides to click on the ad. It also shows that consumer click on the

ad whenever the utility from the click, represented by Ui,j is greater than zero. We can use

this variable to set up a logit model. The utility from a click can be further elaborated to

depend on observable as follows:

Ui,j,m = Ï†0 + Î¦1Xposition dummy + Î¦2Xadvertiser dummy + Î¦2Xcontrols + i,j,m

(3.18)

where the position dummy captures the ad position eect and advertiser dummy captures

the ad eect. Apart from this the control variables include consumer specic variables such

as search popularity measure55 as well as the keylength which captures how detailed is the

search. Lastly the term i,j is the idiosyncratic shock which is independently and identically

distributed according to type 1 extreme value distribution. The logit model gives us the

53click rate is the expected percentage of clicks received in each position
54similar set up is considered in Athey & Nekipelov (2010)
55measured as the proportion of times the keyword appeared in the search result page

relative to total search queries in the category

70

probability that consumer i chooses to click ad j in market m is:

Ë†P (yi,j,m = 1|x) =

exp(Î¦xi,j,m)

1 + exp(Î¦xi,j,m)

where yi,j,m is a binary variable, which equals to 1 if consumer j chooses to click on ad i in

market m, and xi,m denotes the set of variables considered in equation(3.18).

Parameters estimated in step 1:

Âˆ Click rate of position k in market m (Ë†ck,m): this is measured as the predicted proba-

bility of a click in position k in market m.

Âˆ Quality measure (Ë†sj,m): The quality measure used the predicted probability of a click

for advertiser j in market m, denoted by si,m. This measure is then scaled to be

between [0, 1] by dividing it by the highest value.56

Âˆ Externality measure: Once we have the quality measure, we can derive the externality

measure. Externality for ad j is equal to the weighted sum of the number of other

ads, where the weight is equal to the square of the average quality of other ads in each
position for market m. Thus, externality in market m for ad j is equal to Ë†Extj,m =
57, where Ë†s is the average quality of an ad in position k for all ads except

(cid:114) (cid:80)

Ë†s2âˆ’j,m

k(cid:54)=Kj

ad j in market m.

3.7.2 Step 2: Estimate the externality eect on the advertiser's bid

The goal of this step is to recover the externality impact on the bid. Proposition (3.4.5)

in the theoretical model provides the equation showing the relationship between externality

and equilibrium bid. Using this, I now analyze the relationship empirically. I use a hedonic

regression approach suggested by Haile et al. (2003) for english auction (this paper shows

56Note the top 0.01% of the values are considered the highest quality, thus are given value

1 in the 0-1 scale

57note that here the subscript kj is suppressed for simplicity, and the subscript j captures

the dependence on ad j

71

how to apply for GSP auction setting). It presents equilibrium bid (denoted as bj,m) as a

function of externality (denoted as Extm,j), the number of bidders, quality of the ad and a

vector of observable market characteristics. Thus, using the bid data, the underlying bid for

advertiser j in market m can be written as:

log(bm,j) = Î²0 + Î²1log(Extm,j) + ZmÎ²2 + Î²3Ë†sj + Î´N,K + um,j

(3.19)

Ë†Extm,j = estimated externality in the market

Zm = Market characteristics

Î´N,K = Dummy variables for number of advertisers, ads and ad-position

Ë†sj = estimated ad quality

In above equation, Î²1 measures the eect of externality on the advertiser's bid, interpreted

as percentage change in bid for 1 more additional ad on the page.

I control for market

characteristics such as popularity of the keywords of the ads, the keylength as well as the

specicity of the ad(measure whether the ad is made for a specic or broad category58).

Lastly, I control for number of advertisers and number of ads eect by using dummy variables

for each one of them.

As noted in the identication section, even though quality of rival's ads does not directly
impact the advertiser j(cid:48)s bid, there is possibly an indirect aect as the relative quality aects

the winning probability. The resulting correlation between externality and of competitor's

weighted bid induces a positive bias in the OLS estimate of Î²1. Therefore I estimate ex-

ternality via instrumental variables, focusing on correlation between externality in dierent

markets for a given period, which gives potential exogenous variation by capturing the de-

mand side variation but being independent of the supply side. My rst stage regression in

the 2SLS methods is:

Extm,j = Î³1Extm(cid:48),j + Î“2Zm + Î³2Ë†sj + Î´N,K + Âµj,m

(3.20)

Parameters estimated
58measure by the number of keywords specied for an ad

72

Âˆ Î²1: This measures the impact of externality on bid. As externality oncreases by 1%,

bid will decrease by Î²1 percent.

Âˆ Externality free bid - the externality-free bid is residual when the externality eect is

removed from the observed bid bextâˆ’f ree

.

j,m

Âˆ The distribution of externality free bid: I can also calculate the distribution of the

weighted externality free bid, as follows:

(cid:88)

(cid:88)

j

m

Ë†g(b) =

1
Î´g

1{Ë†b

[w]extâˆ’f ree
j,m

â‰¤ b}K

(cid:18)Ë†b

[w]extâˆ’f ree
j,m

(cid:19)

âˆ’ b

Î´b

(3.21)

3.7.3 Step 3: Estimate lower and upper bounds on advertiser's externality-free

value distribution

This step involves estimating the distribution of advertisers value, Fv(.). Note, that this

step uses a nonparametric estimation method since in this step the goal is to estimate a

distribution and not a parameter. Additionally, since the distribution is partially identied,

meaning that the only the upper and lower bound on the distribution is identied, I would

estimate a lower and upper distribution that bound Fv

This step used equation(3.10) from the theory section,as reproduced below:
Ë†bextâˆ’f ree

+ Î¦(Ë†g(.), Ë†ck, Ë†bextâˆ’f ree

â‰¤ vj â‰¤ Ë†bextâˆ’f ree

 {1, 2, ....nm,t}

âˆ€ j

, Ë†sj)

j;m,t

j;m,t

j

(3.22)

From the bounds on value in equation(3.14), I derive bounds on its distribution by using
stochastic dominance, which recall implies that if x â‰¤ y âˆ€ x y, then Fx() â‰¥ Fy(). Let Hb()
be the distribution of Ë†bext
j,m, Ë†sj,m).

j,m and HÏ†(.) be the distribution of Ë†bext

j,m + Î¦(Ë†g(.|Ë†b), Ë†ck,m, Ë†bext

Using stochastic dominance in equation(3.14) gives the following result:

I use kernel estimation to estimate the cdf, as shown below:

HÏ†(.) â‰¤ Fv(.) â‰¤ Hb(.)
(cid:88)
(cid:88)

1{Ë†bj,m â‰¤ h}K

(cid:18)Ë†bj,m âˆ’ h

Î´h

(cid:19)

(3.23)

(3.24)

Ë†Hb(h) =

1
Î´h

j

m

73

Similar equation is used for the upper bound, giving the nal estimate as:

Ë†HÏ† â‰¤ Fv â‰¤ Ë†Hb

(3.25)

3.8 Results

In this section, I discuss the results that are reported in gure(E.15-E.23) and tables

(F.6-F.8). The discussion on results is presented separately for each estimation step: step(1)

consumer side - click behavior step(2) Externality eect on advertiser's bid (3) Deriving of

Advertisers' parameters - advertiser's value for an ad and bid markdown.

3.8.1 Consumers side

On the consumer side, I derive consumer's click probability using a weighted logit model

with keylength and popularity of search as controls. The main parameters of interest are

the eect of position and ad on click probability. To capture the eect of each position and

ad I include a dummy variable for position and advertiser.59

Position eect on click probability: Table(F.8) shows results of average partial eect(APE).

Fig(E.15) provides the predicted click probability (referred to as CTR) for dierent ad posi-

tions. As expected, the results show that higher positions have a higher probability of a click.

Categories cable, car insurance, and coins see a higher probability of click with an average

click rate of around 3%. In graph(E.16), I plot the ratio of predicted click probabilities (or

CTR) for adjacent positions, i.e.

predicted clicks at position k
predicted clicks at position k+1. This ratio helps in under-
standing the proportional increase in a click when you switch to one position above your

current one. It shows that the eect of a switch to the position above you is heterogeneous

and dependent on which position you are switching to. For category Car Insurance, it seems

the biggest gain is in being in the top 5, as the jump to the 5th position has the highest

59I include a xed eect for the market as well

74

gain. For laptop top 4 positions have the highest gain, whereas for Cable and Cruise rst

position holds the most importance. This heterogeneity gain across position is important as

it impacts advertisers' bid behavior; an advertiser will be willing to pay more for a higher

position if there is a large enough increase in the click rate.

Ad Quality: Measured as the eect of the ad on click probability:- The ad's quality is

identied as the predicted click probability for dierent advertisers. This is obtained by using

the predicted click probability for each market and advertiser combination. To compare the

quality across advertisers I normalize it to a 0-1 range, with 1 being the highest quality

ad.60 As shown in graph(E.17), the quality estimates dier across product categories, where

some categories are more skewed towards low quality than others. Car insurance seems to

have the most skewed quality distribution with the mean of only 0.05, which means that the

average ad quality is 5% of the magnitude of highest quality ad. On the other hand, cable

and laptop seem to have relatively well distributed ad quality, with an average of 0.3 and

0.2 respectively.

Externality Index: As stated earlier externality variable is the quality weighted sum of the

number of other ads on the page. I dene the externality index to give it similar properties

to the HHI index. Thus, the externality index is calculated using weights equal to the square

of quality. This intuitively has an advantage over the linear index as in the current index

a single high-quality ad would have a higher eect than two average quality ads. Fig(E.18)

shows the distribution of externality variable across dierent categories.

3.8.2 Externality eect

Table(F.6) shows results of the impact of externality on advertiser's bid using Eqn(3.19).

The model is estimated to look at non-linear eects by looking at the log of externality

eect on the log of the bid. Fig(E.19) plots the estimated Î²1 co-ecient that captures the

percentage change in bid when externality is increased by 1% . The results show that 1%

60The top 0.01% are given a score of 1

75

increase in externality leads to a decrease of 0.39%, 0.59%, 1.37%, 0.08% in categories car

insurance, laptop, cable and coins respectively. The cruise does not show any signicant

aect of externality. The result in percentage term is dicult to interpret. Thus I will now

look at increase in externality in terms of addition of an ad next to an advertiser's own ad.

For example, including one more high quality ad when 7 ads are already present on the page

decreases bid by 0.9% percent for car insurance, 1.5% in laptop, 3.56% in cable, 0.2% in

coins. I nd evidence that the externality has a non-linear eect on bid as it depends on

the quality of the ad as well as the number of ads already present on the page. The next

two paragraphs elaborate on it further. In appendix(I.0.1) I look at alternative denition of

externality and other robust checks.

3.8.3 Non-linear eect of externality

Âˆ Eect of quality of the ad: The externality imposed on advertisers' willingness to pay

by the addition of one more ad on the page is inuenced by the quality of the additional

ad. Graph(E.20) shows the percentage decrease in the bid when the display goes from

an exclusive to two-ad display.61 The graph shows how the aect of the additional

ad depends on its quality.62 The graph is plotted for the laptop category. A similar

pattern is observed in all the other categories as well. Following our earlier example

this means, when an ad is included next to Walmart's ad, Walmart decreases their bid

by 25% if its a high quality ad from Amazon's, and only 1% if it is a low quality ad.

Âˆ Eect of the number of ads included on the page: The eect of externality is diminishing

in the number of ads added to the page. This means that the addition of the rst ad

has the highest negative eect. This eect decreases as more ads are included next to

an ad. This diminishing eect can be thought of as saturation of the market, where

61Note that the theoretical setup proves that the percentage decrease in advertisers will-
ingness to pay due externality is same as the percentage decrease in bid. Thus, here I can
talk about this interchangeably

62I assume the base externality without any ads on the page is one.

76

after a certain number, more ads by competitors do not impact the advertiser. The

graph(E.21) plots the decrease in advertisers' willingness to pay due to the addition of

an ad for laptop category. For example, the eect of the second ad decrease advertisers

willingness to pay by 25%, however, the eect of the seventh ad decrease the advertisers'

willingness to pay by only 2%. Thus, this shows that the eect of an additional ad is

more when there are fewer ads present on the page.

3.8.4 Advertiser willingness to pay for an ad and prot margin

In this section, I analyze the advertiser's benet from an ad. I deduct the externality eect

on the bid from stage 2 results and use the residual of step 2 as the externality-free bid for

the analysis in this section. This step derives the distribution for advertisers exclusive ad

value.

Advertiser's ad value: Using equation(3.22), I get the bounds on the advertiser's

maximum willingness to pay for an ad or ad value. The distribution bounds are estimated

for each category, as shown in the graph(E.23). The bounds are tight for three out of 5

categories, implying that inequality is sucient for inference. The graph(E.22) plots for all

product categories, the upper bound estimates for the cumulative distribution function(cdf)

of the ad value.

It seems that the ad value follows a log-normal distribution, with the

variance varying across categories.63

An interesting nding in stage 3 results is that the decision of whether to use GSP (Gen-

eralized Second-Price auction) or GFP(Generalized First Price auction) does not matter

much when the number of bidders is very high. This is usually the case with online markets.

This is important as the choice of the auction is discussed intensively in the academic liter-

ature as well as the industry. For example, Google switched to GSP from GFP auction, and

facebook uses VCG instead of GSP. In this paper, I show I can bound the GSP bid between

63The estimated distributions use a boundary condition, which is that the value have to

be less than the highest observed bid

77

the VCG bid and the GFP bid. The results show that the bounds are very close to each

other, implying the three auctions might give a similar bid. In this case, one might favor the

rst-price auction as you are paying your bid, thus increasing revenue. I further explore the

design of auction by looking at which one gives higher payo when we include a prior stage

of selecting the optimal number of ads.

3.9 Counterfactual analysis: revenue-maximizing number of ads

In this section, I examine a method to calculate the revenue-maximizing number of

ads.

I allow the search engine, Yahoo!, to vary the number of ads per page, with the

range being from one to seven ads per page.64 For each case of the number of ads page,

I derive the equilibrium bid and thus, the expected revenue for Yahoo!.65 The revenue-

maximizing number of ads is selected as the one that has the highest expected revenue. This

counterfactual is done separately for each product category so the selected number of ads

can vary across category. This helps us see whether it is a good strategy to set the number

of ads dierent for each category.

The steps below give details of the algorithm used to determine the revenue-maximizing

number of ads. For each simulation round, the following steps are executed:

1. Draw N independent values from the empirical distribution:

vj âˆ¼ Ë†HU ( Ë†Ï†)

2. Solve for equilibrium bid using the empirical estimates of average quality (Â¯s), average

click rate (Ë†ck) and externality co-ecient ( Ë†Î²1).

64The upper bound of the range is selected to be seven as the data showed seven ads
per page. Thus, this counterfactual is trying to determine whether seven was optimal or a
smaller number of ads was better.

65Note, as the equilibrium bid does not have a closed-form I use the upper bound on the bid,
which is given by the bid for generalized rst-price auction as shown in proposition(3.4.3).
Additionally these simulations are done assuming everyone has average quality, however this
can be extended to look at advertisers with varying score quality. The externality is assumed
to be one for base case of exclusive ad.

78

3. Calculate the revenue (i.e. TR) for each case of number of ads per page:

T R(K)

âˆ€K = {1, 2...7}

4. Pick the revenue maximizing number of ads (Kâˆ—)

Kâˆ— = Argmax

K

M ean[T R(K)]

These steps are repeated for 1000 iterations. At the end of the iterations, I calculate the

mean revenue from each quantity of ads per page. The revenue-maximizing number of ads

is the one that has the highest average revenue Kâˆ— = ArgmaxK

. The

(cid:80)1000

it=1 T Rit(K)

1000

percentage increase in revenue is calculated by taking the dierence in the revenue between

the selected optimal one and the current number of ads (i.e. seven ads).

3.9.1 Results

The results from the counterfactual simulation are presented in table(F.9). The table shows

that for three out of the ve categories, the number of ads suggested by this algorithm is less

than the current number of ads displayed (i.e. less than seven). I also calculate the potential

gain in revenue by comparing the revenue from the number of ads suggested by this method

to the currently used number of ads (i.e. seven ads). The results show that the largest gain

is for the cable TV category of about [15.6%, 22.7%] increase in revenue, with the selected

number being two-three ads per page. It seems that the categories that had a high externality

co-ecient had the highest gain. However, apart from the externality eect, the gains also

depend on the mean quality of the ads in the category. For example, laptop category, which

has a medium level of externality eect and a medium level of average quality, show a higher

gain of about [1.014%,2.985%] compared to the car insurance category that had medium

externality eect but low average quality and showed a gain of around [0.6942%, 1.698%].

The dierence in the revenue-maximizing number of ads across ad product categories

show that the features of the ad market might inuence the choice of the search engine.

79

Features that determine the bid as well as the externality index play a key role in determining

the number of ads per page. This paper suggests that the markets that have a high average

quality and more homogeneous products should show lesser number of ads per page compared

to other categories.

3.10 Conclusion

This research looks at the externality generated by the multi-ad display setting on search

engines such as Google, Yahoo!, and Bing. The externality in this market is dened as

the external cost on an advertiser's willingness to pay for an ad due to the presence of

competitors ads on the same page. In particular, I estimate the impact of the number of

ads per page on the ad price and then use the estimate to simulate the revenue-maximizing

number of ads.

I begin by developing a theoretical model to derive the equilibrium ad

price in the market. The equilibrium conditions show that the ad price is dependent on the

externality eect and the auction design The primary empirical contribution is twofold. The

rst contribution is the estimation of the impact of externality on the ad price. The ndings

show that the advertiser's willingness to pay decreases by an average of 18.48% when the

display changes from an exclusive to a two-ad display, where the average is across dierent

product categories.66 This decrease is more if a high-quality ad is displayed compared to a

low-quality one. Following the Walmart example from the introduction, this means that the

Walmart's willingness to pay decreases by 25% when the display changes from an exclusive

to a two-ad display, where the new ad is by Amazon. Furthermore, the decrease is only 2%

if the ad next to Walmart is by a low-quality advertiser rather than Amazon. Additionally,

the eect on advertisers' willingness to pay is more for the rst few ads added next to their

ad, implying that the eect of an additional ad becomes minimal when there are already ve

to seven ads on the page.

The second empirical contribution of this paper is that it estimates the advertiser's

66next to a high-quality ad

80

externality-free willingness to pay (also referred as exclusive ad value). Using the equi-

librium conditions, I can estimate bounds on the distribution of the exclusive ad value. This

result is essential as it helps to simulate how the advertiser's payment behavior changes with

change in the market environment. Unlike the consumer side for which the search engine

can do a randomized controlled trail to test the changes, a similar approach is tough on the

advertisers' side. This is because the advertisers' reaction to changing market factors such

as pricing mechanism is usually slower. Additionally, frequent changes in the environment

can make advertisers leave the ad platform due to increase in diculties. Thus, companies

such as Microsoft and Google often estimate the advertiser's unobserved parameters such

as ad value and then simulate their best response to changes in the environment. Thus,

the estimation of bounds on the distribution holds importance in this market and can be

used to simulate revenue implications of changes in the market. I nd that the estimated

distribution is close to the log-normal distribution.

These results are further used to evaluate the expected revenue for dierent quantities of

ads per page and derive the revenue-maximizing number of ads. The counterfactual analysis

shows that the revenue-maximizing number of ads will dier across ad product categories

according to the market concentration and the product dierentiation. I nd that three out

of the ve categories provided in the data show sub-optimal number of ads. Furthermore,

using the suggested number of ads leads to, on average, a 4.5 percent increase in revenue.,

which translates into a revenue gain of 5.2 billion dollars in revenue.67

These counterfactual results can be further combined with other design improvements to

increase the expected gain. For example, the restriction on the number of ads can be imple-

mented with an increase in the ad size. Additionally, this research has broader implications

as these results can be applied to any online advertising platform, which shows multiple ads

on the same ad space or to the same consumer. A few examples are Amazon and Facebook

67calculated using 2018 Google's ad revenue of 116.3 billion US dollars.

- see here for

details https://www.statista.com/statistics/266249/advertising-revenue-of-google/

81

that show multiple ads to the consumer.68

While this paper provides a possible method for determining the optimal number of ads

for the search engine, I believe the dierentiating features of the ad product category are the

key to further analyzing a more intricate revenue-maximizing number of ads per page.

68Note that the application is limited to the case of the adjacent ads being from the same

ad product category.

82

APPENDICES

83

APPENDIX A

Figures for Chapter 1

Figure A.1: Two-sided rm with quality dierentiation

84

APPENDIX B

Proofs for Chapter 1

Details of the equilibrium price and quality equations: The monopoly problem as in

Equation (1.4) can be rewritten as follows:

max
,pb
h

(ps,pb
l

,ql,qh)

Î  = [pb

l + ps âˆ’ c(ql)]DbDs + [(pb

h âˆ’ c(qh)) âˆ’ (pb

l âˆ’ c(ql))]Db

hDs.

(B.1)

Using Equation (B.1), the rst order conditions for choice variables can be simplied as

follows.

Âˆ Price for low-quality access on the buyer side: Equation (1.19) can be obtained

by the following steps.
(Db âˆ’ Db

h)Ds + [pb

l + ps âˆ’ c(ql)](Db)(cid:48)
pb
l
+ [(pb

l + ps âˆ’ c(ql)](Db)(cid:48)
pb
l

â‡” (Db

l ) + [pb
k)(cid:48)
where (Db
pb
k

means that

h âˆ’ c(qh)) âˆ’ (pb
k)(cid:48)
and (Db
pb
j

means that

h)(cid:48)
l âˆ’ c(ql))](Db
pb
l
âˆ‚Db
k
âˆ‚pb
j

, where j (cid:54)= k.

= 0,

âˆ‚Db
k
âˆ‚pb
k

Ds + [(pb

h âˆ’ c(qh)) âˆ’ (pb

h)(cid:48)
l âˆ’ c(ql))](Db
pb
l

Ds = 0.

Using equilibrium condition for pb

h from Equation (1.20),

Db

l + [pb

l + ps âˆ’ c(ql)](Db)(cid:48)
pb
l

+

ï£®ï£¯ï£¯ï£°âˆ’ Db

h

h)(cid:48)
(Db
pb
h

ï£¹ï£ºï£ºï£» (Db

h)(cid:48)
pb
l

= 0.

h)(cid:48)
Using (Db
pb
l

h)(cid:48)
= âˆ’(Db
pb
h

, Db

l + [pb

l + ps âˆ’ c(ql)](Db)(cid:48)
pb
l

+

â‡” Db

l + [pb

l + ps âˆ’ c(ql)](Db)(cid:48)
pb
l

+ Db

h = 0.

ï£®ï£¯ï£¯ï£° Db

h

h)(cid:48)
(âˆ’Db
pb
h

ï£¹ï£ºï£ºï£» (âˆ’Db

h)(cid:48)
pb
h

= 0.

â‡” [pb

l + ps âˆ’ c(ql)] =

=

pb
l
Îµb

.

Db
(Db)(cid:48)
pb
l

Âˆ Price for high-quality access on the buyers' side: Equation (1.20) can be ob-

tained by the following steps.

85

{[pb

h âˆ’ c(qh)] âˆ’ [pb
h âˆ’ c(qh)] âˆ’ [pb

â‡” {[pb

â‡” {[pb

h âˆ’ c(qh)] âˆ’ [pb

Ds + Db

hDs = 0.

h)(cid:48)
l âˆ’ c(ql)]}(Db
pb
h
h)(cid:48)
l âˆ’ c(ql)]}(Db
pb
h
Db
h
h)(cid:48)
(âˆ’Db
pb
h

l âˆ’ c(ql)]} =

+ Db

h = 0.

=

pb
h
Îµb
h

.

Âˆ Price for sellers: Equation (1.21) can be obtained by the following steps.

l âˆ’ c(ql)]}Db

ps + DbDs = 0.

l + ps âˆ’ c(ql)]Db + {[pb
{[pb
l + ps âˆ’ c(ql) =

â‡” pb

h âˆ’ c(qh)] âˆ’ [pb
âˆ’ {[pb

Ds
(âˆ’Ds)(cid:48)
ps
l + ps âˆ’ c(ql) =

h âˆ’ c(qh)] âˆ’ [pb

Using Equation (1.20), pb

.

h}(Ds)(cid:48)
l âˆ’ c(ql)]}Db
(cid:34)
(cid:35)
h
Db
Db
(cid:34)
h
Db
pb
h
Îµb
h

ps
Îµs âˆ’

pb
h
Îµb
h

âˆ’

,

(cid:35)

Db
h
Db

.

Ds
(âˆ’Ds)(cid:48)
ps
l + ps âˆ’ c(ql) =

which can be further simplied as pb

Âˆ Low-quality service for buyers: Equation (1.22) can be obtained by the following

steps.

Ds+

(Db âˆ’ Db

h)c(cid:48)(ql)Ds + [pb

h âˆ’ c(qh)) âˆ’ (pb
[(pb
â‡” (Db âˆ’ Db
+ {[pb

h âˆ’ c(qh)] âˆ’ [pb

h)c(cid:48)(ql) + [pb

h)(cid:48)
l âˆ’ c(ql))](Db
ql

l + ps âˆ’ c(ql)](Db)(cid:48)
ql
Ds = 0.
l + ps âˆ’ c(ql)](Db)(cid:48)
ql

h)(cid:48)
l âˆ’ c(ql)]}(Db
ql

= 0.

Using Eqn (1.20), âˆ’ Db

l c(cid:48)(ql) + [pb

l + ps âˆ’ c(ql)](Db)(cid:48)
ql

Using Eqn (1.23), Db

l c(cid:48)(ql) + [pb
h)(cid:48)
, âˆ’ (Db
= âˆ’(Db
qh

l + ps âˆ’ c(ql)](Db)(cid:48)
ql
l )c(cid:48)(ql) + [pb

h)(cid:48)
Using (Db
ql

ï£®ï£¯ï£¯ï£°âˆ’ Db

h

ï£¹ï£ºï£ºï£» (Db

h)(cid:48)
ql

= 0.

+

h)(cid:48)
(Db
pb
h
Db
l )(cid:48)
+ [c(cid:48)(qh)
](Db
h
h)(cid:48)
ql
(Db
qh
âˆ’ c(cid:48)(qh)Db
l + ps âˆ’ c(ql)](Db)(cid:48)
ql

= 0.

h = 0,

which is simplied as,

l + ps âˆ’ c(ql)] =
[pb

Db
(Db)(cid:48)
ql

l c(cid:48)(ql) + Db
Db
Db

hc(cid:48)(qh)

l c(cid:48)(ql) + Db
Db
Db

hc(cid:48)(qh)

,

=

ql
Î½b

86

k)(cid:48)
where (Db
qk

means that

âˆ‚Db
k
âˆ‚qk

k)(cid:48)
and (Db
qj

means that

âˆ‚Db
k
âˆ‚qj

, where j (cid:54)= k.

Âˆ High-quality service for buyers: Equation (1.23) can be obtained by the following

h âˆ’ c(qh)] âˆ’ [pb

steps.
{[pb
Using Eqn (1.20), c(cid:48)(qh)

h)(cid:48)
l âˆ’ c(ql)]}(Db
qh
Db
h
h)(cid:48)
(Db
qh

=

hc(cid:48)(qh) = 0.
, which can be simplied as, c(cid:48)(qh)

âˆ’ Db
pb
h
Îµb
h

=

qh
Î½b
h

pb
h
Îµb
h

.

Proof of Proposition 1.4.1. It shows the platform oers a better price-quality ratio in

the case of one quality oering. This basically implies

pb

q(cid:124)(cid:123)(cid:122)(cid:125)

Platform (two-sided)

â‰¤

pb

q(cid:124)(cid:123)(cid:122)(cid:125)

One-sided

.

(B.2)

We start by comparing the prot functions for two dierent cases. For simplicity, we nor-

malize cs to zero.

Î one-sided = [pb âˆ’ c(q)]Db â‰¡ Î¨(pb, q).
Î platform = [pb + ps âˆ’ c(q)]DbDs â‰¡ Î¨(pb, q)Ds + psDbDs.

(cid:16) Â¯pb, Â¯q
(cid:17)

The one-sided monopolist's prot maximization problem is given in Equation (1.5). Let

be the optimal solution for the one-sided monopoly problem and(cid:101)x =

Â¯x =
for the two-sided platform. Given that the monopolist optimal value for ps (cid:54)= 0,

(B.3)

(cid:19)

(cid:18)(cid:101)pb,(cid:101)q,(cid:101)ps

(B.5)

(cid:4)
(cid:4)

As ( Â¯pb, Â¯q) is the optimal solution Î one-sided; this implies that Î¨( Â¯pb, Â¯q) â‰¥ Î¨((cid:101)pb,(cid:101)q) â‰¥
Î¨((cid:101)pb,(cid:101)q)(cid:102)Ds, so Equation (B.4) holds only if

(B.4)

Î¨((cid:101)pb,(cid:101)q)(cid:102)Ds + (cid:101)ps(cid:102)Db(cid:102)Ds â‰¥ Î¨( Â¯pb, Â¯q).

(cid:102)Ds(cid:101)ps(Db)at(cid:101)x > (cid:102)Ds(cid:101)ps(Db)at Â¯x.
q(cid:124)(cid:123)(cid:122)(cid:125)
â‡” pb

q(cid:124)(cid:123)(cid:122)(cid:125)

pb

<

.

platform

one-sided

Hence proved.

Proof of Corollary 1.4.2. It is guaranteed by Proposition 1.4.1.

87

Proof of Proposition 1.4.3. We use proof by contradiction. Let us suppose that (ps)1Q â‰¤
(ps)2Q where the subscripts 1Q and 2Q denote the model with one quality and with two

qualities, respectively. This implies

As ps

Îµs is a decreasing function of ps, we derive the following:

(B.6)

(B.7)

(B.8)

(Ds)1Q â‰¥ (Ds)2Q.
(cid:19)
(cid:19)

(cid:18) ps

â‰¥

1Q

Îµs

.

2Q

(cid:18)ps
(cid:33)
(cid:32)

Îµs

pb
Îµb

â‰¥ pb
l
Îµb

+

Db
h
Db

pb
h
Îµb
h

.

1Q

Using Equation (1.24), we obtain the following:

Using rst order condition for pb in the model with one quality and those for pb

l and pb
h

in the model with two qualities, we obtain the following:

(cid:124)

(cid:123)(cid:122)

(cid:125)

(cid:124)

[pb + ps âˆ’ c(q)]

â‰¥ [pb

l + ps âˆ’ c(ql)] +

average prot for 1Q

Db
h
Db

{[pb

(cid:123)(cid:122)
h âˆ’ c(qh)] âˆ’ [pb

average prot for 2Q

Given Equation (B.9), the rm would provide two qualities only if (DsDb)

.

(cid:125)
l âˆ’ c(ql)]}
(cid:124) (cid:123)(cid:122) (cid:125)

1Q

(B.9)

< (DsDb)

(cid:124) (cid:123)(cid:122) (cid:125)

2Q

holds. This condition along with equation(B.6) imply:

By Equation (B.8), we have

(cid:19)

(cid:18)

pb
Îµb

< (Db)

2Q
pb

(cid:124)(cid:123)(cid:122)(cid:125)
q(cid:124)(cid:123)(cid:122)(cid:125)
(cid:33)

2Q

.

(Db)

1Q
pb
l

(cid:124)(cid:123)(cid:122)(cid:125)
ql(cid:124)(cid:123)(cid:122)(cid:125)
(cid:32)

1Q

â‰¥

>

pb
l
Îµb

1Q

l(cid:124)(cid:123)(cid:122)(cid:125)

pb

1Q

2Q

â‰¥ pb(cid:124)(cid:123)(cid:122)(cid:125)

2Q

(B.10)

(B.11)

(B.12)

, which yields

Furthermore from Equation (B.8), we also have

88

(cid:32)

(cid:32)

â‰¥

(cid:33)

1Q

qc(cid:48)(q)
Î½b

l c(cid:48)(ql) + Db
Db
Db

hc(cid:48)(qh)

ql
Î½b

(cid:33)

2Q

Now, from the equilibrium conditions for 1Q we have
qc(cid:48)(q)
pb
Î½b
Îµb
1 âˆ’ F (p/q)
f (p/q) 1
q
â†’ c(cid:48)(q)q = p

1 âˆ’ F (p/q)
f (p/q) p
q2

â†’ c(cid:48)(q)

=

=

Similarly using 1.12, for 2Q case we get

substituting eqn(B.14) and eqn(B.15) in eqn(B.13), we get

ql

(cid:32)

l c(cid:48)(ql) + Db
Db
Db
(cid:33)
(cid:33)

pb
l
Î½b

2Q

1 âˆ’ F (p/q)
f (p/q) p
q

p

â‰¥

1Q

(cid:19)
(cid:18) p
(cid:32)

Î½b

â†’

â‰¥

1Q

hc(cid:48)(qh)

= pb
l

ï£«ï£¬ï£¬ï£­pb

l

ï£¶ï£·ï£·ï£¸

2Q

1 âˆ’ F (pb

f (pb

l /ql)

l /ql)
pb
l
ql

(B.13)

(B.14)

(B.15)

using eqn(B.12), we get

pb
l

ql(cid:124)(cid:123)(cid:122)(cid:125)

1Q

.

q(cid:124)(cid:123)(cid:122)(cid:125)
â‰¤ pb

2Q

This contradicts Equation (B.11). Hence (ps)1Q > (ps)2Q

(cid:4)

Proof of Corollary 1.4.4. Let (Â¯pb, Â¯q, Â¯ps) be the prot maximization variable for the

single-quality case. We prove whether the platform wants to set a nonzero demand for high-

quality products at this price and quality level. The demand for high-quality products will

be zero if

h âˆ’ pb
pb
B(qh âˆ’ ql)

l

= 1. â‡” pb

h âˆ’ pb

l = B(qh âˆ’ ql).

(B.16)

89

We choose ((pb

whether the rst order condition on (pb

h)âˆ—, (qh)âˆ—) such that Equation (B.16) is satised and then determine
h, qh) shows that the platform will attempt to increase

demand for the high-quality product above zero. The rst order condition with respect to
pb
h at (Â¯pb, Â¯q, Â¯ps, (pb

.

as

h âˆ’ Â¯pb
pb
B(qh âˆ’ Â¯q)

= 1.

(B.17)

Î¦pb

h = Ds

{[(pb

h)âˆ— âˆ’ c(qâˆ—

h)âˆ—, (qh)âˆ—) is given as follows:
(cid:40)
h)(cid:48)
h)] âˆ’ [Â¯pb âˆ’ c(Â¯q)]}(Db
(cid:26)(cid:110)
(cid:111)
pb
h
= Ds(cid:110)(cid:8)B(qâˆ—
h)] âˆ’ [Â¯pb âˆ’ c(Â¯q)]

(cid:111)
h) âˆ’ c(Â¯q)](cid:9) (âˆ’f b(1))

h)âˆ— âˆ’ c(qâˆ—
h âˆ’ Â¯q) âˆ’ [c(qâˆ—

(âˆ’f b(1))

= Ds

[(pb

+ Db
h

(cid:27)

.

(cid:41)

â‰¤ 0

if f b(1) (cid:54)= 0 and c(cid:48)(Â¯q) < B.

In a similar manner, we can prove that at qâˆ—

h, the rst order condition is greater than zero.
h (cid:54)= 0. Therefore, we see that
Thus, the platform will decrease pb
the prot increases when oering two product qualities as long as f b(1) (cid:54)= 0 and c(cid:48)(Â¯q) < B.
c(cid:48)(Â¯q) < B is true, as c(cid:48)(Â¯q) < B implies that Â¯p
(cid:4)

h and increase qh such that Db

B Â¯q < 1, which holds for nonzero demand.

(1.23)

Proof of Proposition 1.4.5.

Proof of Proposition 1.4.6. The proof is in the paper

It is trivial to prove from checking Equation (1.22) and
(cid:4).
(cid:4)
(cid:4)
Proof of Proposition 1.4.7. The proof is in the paper.
Proof of Proposition 1.4.8. The proof is in the paper, as shown in Equation (1.26). (cid:4)
(cid:4)

Proof of Proposition 1.4.9. The proof is in the paper.

Proof of equivalence of this model with xed fee case:

Proof. Suppose that on one of the side the platform also charges a xed fee. For example in

case of Amazon they charge a xed fee, i.e. Prime membership on the buyers side. The key to

the equivalence is that adding xed fee doesn't change the model as long as there are no xed

benets for the users. This case ts the Amazon example as even though Amazon charges

xed fee, the buyers only benet through transactions and do not have a non-transaction

benet from Amazon. Fixed fee Case WLOG, let us assume buyers side is also charged

90

a xed fee Buyers have the following utility:

i = (BÎ±b
U b

i q âˆ’ pb)N s âˆ’ P b,

Here Pb is the xed fee charged to the buyers. The seller has the following utility:

j = (SÎ±s
U s

j âˆ’ ps)N b.

(B.18)

(B.19)

We rst analyze the user side (buyers and sellers) to identify the equilibrium demand.

The equilibrium demand functions are derived from the participation constraint:

(cid:16)
(cid:16)

Db = P rob

Ds = P rob

i â‰¥ 0
U b

j â‰¥ 0
U s

(cid:17) â‡” Db = 1 âˆ’ F b
(cid:17) â‡” Ds = 1 âˆ’ F s

ï£¶ï£·ï£¸ .

ï£«ï£¬ï£­pb + P b
(cid:19)
(cid:18) ps

Bq

N s

.

S

(B.20)

The monopoly problem can be written as follows:

Î  = [pb + ps âˆ’ c(q)]DbDs + P bDb,

(B.21)

max

(ps,pb,q)

Modied xed fee case: Now we modify the above case to show the equivalence. Let

new = pb + P b
pb
the model parameters are as follows:

N s be the per transaction fee on the buyers side and let xed fee be zero. Then

Buyers have the following utility:

U b
i = (BÎ±b

= (BÎ±b

i q âˆ’ pb)N s âˆ’ P b
i q âˆ’ pb âˆ’ P b

N s )N s = (BÎ±b

i q âˆ’ pb

new)N s

(B.22)

Thus, this shows the utility is the same in the case of (1) usage fee pb & xed fee - P b (2)

usage fee - pb

new

The seller has the following utility:

U s
j = (SÎ±s

j âˆ’ ps)N b.

(B.23)

91

We rst analyze the user side (buyers and sellers) to identify the equilibrium demand.

The equilibrium demand functions are derived from the participation constraint:

(cid:16)
(cid:16)

Db = P rob

Ds = P rob

i â‰¥ 0
U b

j â‰¥ 0
U s

(cid:17) â‡” Db = 1 âˆ’ F b
(cid:17) â‡” Ds = 1 âˆ’ F s

(cid:33)

(cid:32)
(cid:18) ps

pb
new
Bq

(cid:19)

.

S

The monopoly problem can be written as follows:

Î  = [pb + ps âˆ’ c(q)]DbDs + P bDb

max

(ps,pb,q)

= [pb + ps âˆ’ c(q) +

P b
Ds ]DbDs

= [pb

new + ps âˆ’ c(q)]DbDs

.

(B.24)

(B.25)

(B.26)

(B.27)

Thus, as shown above the prot function for the platform and the users utilities are the same
(cid:4)

in these two case.

92

APPENDIX C

Figures for Chapter 2

Figure C.1: Decrease in uctuation due to price ceiling and oor

Figure C.2: Price path for carbon credits for RGGI auction

93

APPENDIX D

Proofs for Chapter 2

Proof of Lemma(2.3.1) Recall the denition of Î· is as follows:

(cid:18)
Î·i,l âˆ’ E[(Î·
(Î·i,l|Ïƒl)

Ï„ lâˆ’tG

(mt)
l

Ïƒl

(mt)
j,l

)j(cid:54)=i|Î·i,l > Î·

(mt)
j,l

(cid:82)

ï£±ï£´ï£´ï£´ï£²ï£´ï£´ï£´ï£³vi âˆ’ ki(cid:80)

0,

l=t+1

Î·i,t =

(cid:19)

]

,

if t{ai, ki}

otherwise

Through recursive addition and subtraction, it is easy to see that the above is equivalent to

the following :

Replacing probability of bid with probability of pseudo type we can rewrite the payo

(cid:19)

]

(mt)
j,t

)j(cid:54)=i|Î·i,t > Î·
(cid:19)

functions as

(cid:90)

W (Î·i,t) =

(mt)
Ï„ G
t

Ïƒt
(mt)
t

vi âˆ’ E[(Î·

(mt)
j,t

+ (1 âˆ’ G

(cid:18)
(Î·i,t|Ïƒt)
(cid:18)
(Î·i,t|Ïƒt))W (Î·i,t+1)
vi âˆ’ E[(Î·
(Î·i,t|Ïƒt)
(mt)
V (Î·i,t) = G
t
(cid:18)
vi âˆ’ W (Î·i,t+1) âˆ’ E[(Î·
(Î·i,t|Ïƒt)
(cid:18)
(Î·i,t|Ïƒt)

vi âˆ’ W (Î·i,t+1) âˆ’ E[(Î·

)j(cid:54)=i|Î·i,t > Î·

(mt)
V (Î·i,t) = G
t

(mt)
Ï„ G
t

W (Î·i,t) =

(mt)
j,t

(cid:90)

Ïƒt

This can be rewritten as

(mt)
j,t

]

+ (1 âˆ’ G

(mt)
t

(Î·i,t|Ïƒt))W (Î·i,t+1)
(cid:19)

]

+ W (Î·i,t+1)

(D.1)

(mt)
j,t

(mt)
j,t

)j(cid:54)=i|Î·i,t > Î·
(cid:19)

(mt)
j,t

)j(cid:54)=i|Î·i,t > Î·

(mt)
j,t

]

+ W (Î·i,t+1)

(D.2)

Now, we rewrite the pseudo type in terms of the payo function. Using addition and sub-

94

traction and using the denition of Î· we can rewrite the denition of Î· as :

(cid:90)
Î·i,t =
vi âˆ’
ki(cid:88)

(cid:90)

Ïƒt+1

(mt)
Ï„ G
t

(cid:18) lâˆ’(t+1)(cid:89)

(cid:18)
(Î·i,t|Ïƒl)
(cid:90)

(1 âˆ’

l=t+2

Ïƒl

q=1

, if t â‰¥ ai or t â‰¤ ki

Ïƒq

vi âˆ’ E[(Î·

(mt)
j,t

)j(cid:54)=i|Î·i,t > Î·

(cid:19)
Ï„ lâˆ’tG

(ml)
l

mq
G
q

(Î·i,q))

(cid:19)
(cid:18)
vi âˆ’ E[(Î·
(Î·i,l|Ïƒl)

(mt)
j,t

+

]

(cid:19)

]

)j(cid:54)=i|Î·i,l > Î·

(ml)
j,l

(ml)
j,l

Note that the second term in the equation above is equal to W (Î·i,t+1). Thus we have

ï£±ï£´ï£´ï£²ï£´ï£´ï£³vi âˆ’ W (Î·i,t+1),

0,

Î·i,t =

if t â‰¥ ai or t â‰¤ ki

otherwise

Thus, equation(D.1) and (D.2) as

(cid:90)

(cid:18)
(cid:18)
Î·i,t âˆ’ E[(Î·
(Î·i,t|Ïƒt)
(Î·i,t|Ïƒt)

Î·i,t âˆ’ E[(Î·

(mt)
j,t

(mt)
Ï„ G
t

W (Î·i,t) =

Ïƒt
(mt)
V (Î·i,t) = G
t

(mt)
j,t

(cid:19)
)j(cid:54)=i|Î·i,t > Î·

(mt)
j,t

)j(cid:54)=i|Î·i,t > Î·

(mt)
j,t

(cid:19)

]

+ W (Î·i,t+1)

(D.3)

]

+ W (Î·i,t+1)

(D.4)

(cid:4)

Proof of theorem(3.8)

The symmetric Bayesian Nash equilibrium bid in period t maximizes the following payo of

bidder t:

V (Î·i,t) =

(cid:26)

(mt)
G
l

(cid:20)
(Î·i,t|Ïƒl)E

(cid:12)(cid:12)(cid:12)(cid:12)b(ni,t) > b

(mt)
j(cid:54)=i

ni,t âˆ’ b(mt)(ni,t)

(cid:21)

(cid:27)

+ W (Î·i,t+1)

Notice that W (Î·i,t) in the above expression is merely an additive constant. we will use this

and backward induction to solve for the equilibrium bidding function.

First from the structure of V (Î·i,t|Ïƒt) , it is clear that after the last active period ki, the
t = 0 âˆ€ t > ki. This is because the buyer
is only active till period ki and would earn a negative prot from winning if he is active after

buyer's equilibrium bid will be equal to zero, i.e. bi

95

the actual deadline. Thus we can rewrite the equilibrium bidding strategy as a set of nite
)}. Now we will show that in the last active round

bids given as bi = {b(Î·i,1), b(Î·i,2), ....b(Î·i,ki
of bidder i's lifetime i.e. kth

i period, bidder bids their pseudo valuation, so b(Î·i,ki
Note that in the last period pseudo type is equal to value of the bidder, i.e. Î·i,ki

) = Î·i,ki.
= vi

< vi, the current period discounted utility from winning is positive i.e (viâˆ’ztki

Âˆ If b(cid:48) < vi.
In cases where the price for the object is in-between b(vi, ki) and vi, i.e. b(vi, ki) <
ztki
0 but the buyer does not win. Thus this is not optimal
Âˆ If b(cid:48) > vi.
In cases where the price for the object is in-between b(vi, ki) and vi, i.e. b(vi, ki) >
ztki
0. Thus this is not optimal.

> vi, the current period discounted utility from winning is negative i.e (viâˆ’ztki

) >

) <

From above we get that any other bid than b(Î·i,ki
payo. Thus b(vi, ki) = Î·i,ki is an optimal bid in the last active period (kth
bidder i.

) = vi = Î·i,ki would decreases buyers
i period) for

Next we prove reporting bid equal to b(Î·i,t) is optimal in an arbitrary t during the active

demand period, i.e., ai < t ki , assuming it is optimal in all period after t. Recall that

equilibrium bid maximizes V (Î·i,t)

V (Î·i,t) =

(mt)
G
l

(cid:26)

(cid:20)
(Î·i,t|Ïƒl)E

(cid:12)(cid:12)(cid:12)(cid:12)b(ni,t) > b

(mt)
j(cid:54)=i

(cid:21)

(cid:27)

+ W (Î·i,t+1)

ni,t âˆ’ b(mt)(ni,t)

Here the rst term represents the expected current period discounted utility and the second

term represents the expected utility from future periods if he loses the current period auction

. Notice that the second term is independent of the bid in period tr. Thus, this is equivalent

to the bid maximizing the rst term.
Note that Î·i,t = vi âˆ’ W (Î·i,t+1) represents the adjusted value for bidder i in period t.We will
now show b(Î·i,t) = Î·i,t maximizes eqn(6). Consider any arbitrary b(cid:48) (cid:54)= b(Î·i,t).

96

Âˆ If b(cid:48) < b(Î·i,t)
In cases where the price for the object is in-between b(cid:48) and b(Î·i,t), i.e. b(cid:48) < zt < b(Î·i,t),
the current period discounted utility from winning is positive i.e vi âˆ’ W (Î·i,t+1) âˆ’ ztr
> 0 but the buyer does not win. Thus this is not optimal.
Âˆ If b(cid:48) > b(Î·i,t)
In cases where the price for the object is in-between b(cid:48) and b(Î·i,t), i.e. b(cid:48) > zt > b(Î·i,t),
the current period discounted utility from winning is negative i.e vi âˆ’ W (Î·i,t)âˆ’ zt < 0.
Thus this is not optimal.

Which gives the optimal bidding strategy as, b(Î·i,t) = vi âˆ’ W (Î·i,t+1).

(cid:4)

Proof for proposition(2.4.1) First we look at how the rate of future supply eects the

equilibrium payo of the bidder . The period t bid maximizes the following

+ (1 âˆ’ Gmt(Ë†Î·))W (Î·i,t+1|Î»)

(D.5)

b(Ë†Î·) Arg max

Ë†Î·

Gmt(Ë†Î·)E

vi âˆ’ b(mt)(Î·j,t)j(cid:54)=i

(cid:20)

(cid:12)(cid:12)(cid:12)(cid:12)b(Ë†Î·) > b(mt)(Î·j,t)
(cid:21)

Note that the bid is made after the supply and demand realization, so the supply rate Î» only

aects payo from future periods. Also, W (Î·i,t+1) is dependent on the supply and demand

distributions as well as Î». Usually we suppress this dependence for easier notation Here we

reintroduce its dependence on Î» as it is critical here. Using integral-form Envelop theorem

we get :

Î´(V (Î·i,t|Ïƒt))

Î´(Î»)

=

(cid:18)

1 âˆ’ G

(mt)
t

(cid:19) Î´(W (Î·i,t+1|Î»))

Î´(Î»)

[Î·i,t|Ïƒt]

(D.6)

Now we use equation(8) and backward induction to derive the derivative

Let us derive this in the last period of active demand. Recall that the equilibrium bid in last

period will be bi(Î·(vi, ri)) = Î·(vi, 1) = vi and W (Î·i(vi, 0) = 0, thus

Î´(W (Î·i(vi, 1)))

Î´(Î»)

â‰¥ 0

97

Now let us assume for any arbitrary r(cid:48) period of active demand left, i.e. assume
, we show that this hold in r(cid:48) + 1 too. Rewriting equation (2.4) for t(cid:48)
Î´(W (Î·i(vi, t(cid:48)

r + 1

r + 1)))

Î´(W (Î·i,t(cid:48) )

Î´(Î») â‰¥ 0

Î·i,t(cid:48)

r+1 âˆ’ E[(Î·m
j,t(cid:48)
Î´Î»

)j(cid:54)=i|Î·i,t(cid:48)

r+1 > Î·m
j,t(cid:48)

r+1

r+1

+ W (Î·i,t(cid:48)

r+1)

(cid:19)

]

(cid:19)

]

(cid:4)

(D.7)

(D.8)

(D.9)

(cid:4)

Î´(cid:82)

Î´(Î»)

Ïƒ

t(cid:48)
r+1

Î»m

Ï„ G

t(cid:48)
r+1(Î·i,t(cid:48)

r+1)

=
Using envelop theorem and t(cid:48)
(cid:90)

Î´G

Î»m

t(cid:48)
r+1(Î·i,t(cid:48)
Î´Î»

r+1)

=

Ï„

Ïƒ

t(cid:48)
r+1

(cid:18)

(cid:18)

Î»m

âˆ’ G

t(cid:48)
r+1(Î·i,t(cid:48)

r+1)

using the assumption

â‰¥ 0

Thus,

Î´(Î») â‰¥ 0 âˆ€t
Î´(W (Î·i,t)

r + 1 maximization equation we get

Î·i,t(cid:48)

r+1 âˆ’ E[(Î·m
j,t(cid:48)

r+1

r+1 > Î·1
j,t(cid:48)

r+1

)j(cid:54)=i|Î·i,t(cid:48)
(cid:19)

Î´E[(Î·m
j,t(cid:48)

r+1

)j(cid:54)=i|Î·i,t(cid:48)

r+1 > Î·m
j,t(cid:48)

r+1

Î´(W (Î·i,t(cid:48))

Î´(Î»)

Î´Î»

> 0, we get

]

+

Î´W (Î·i,t(cid:48)
Î´Î»

r+1)

Proof for proposition(2.4.2) rst we show bid is decreasing in supply rate:

Î´(b(Î·i,t)

Î´(Î»)

=

Î´Î»

Î´(vi âˆ’ W (Î·i,t+1|Î»))
= âˆ’Î´(W (Î·i,t+1|Î»))
â‰¤as a resutl of proposition(2.4.1)

Î´Î»

Notice that this also implies

(cid:18)
Î´(Î») â‰¤ 0 , thus we have
Î´(Î·i,t)

(cid:19)

E(Pt) = E

â†’ Î´E(Pt)

Î´(Î»)

=

Î·

(Î»âˆ—mt)
t
(Î»âˆ—mt)
Î´(E[Î·
t
Î´(Î» âˆ— mt)

]

mt +

Î´(E[Î·
t

(Î»âˆ—mt)
Î´(Î·t)

]

Î´(Î·t)
Î´(Î»)

â‰¤ 0

98

Proof for proposition(2.4.3) First we look at how the rate of future supply eects the

equilibrium payo of the bidder . The period t bid maximizes the following

(cid:20)

(cid:12)(cid:12)(cid:12)(cid:12)b(Ë†Î·) > b(mt)(Î·j,t)|Ï„

(cid:21)

b(Ë†Î·) Arg max

Ë†Î·

Gmt(Ë†Î·)E

Ï„ vi âˆ’ b(mt)(Î·j,t)j(cid:54)=i

+ (1 âˆ’ Gmt(Ë†Î·))W (Î·i,t+1|Ï„ )

(D.10)

Note that the bid is made after the supply and demand realization, so the supply rate Ï„ only

aects payo from future periods. Also, W (Î·i,t+1) is dependent on the supply and demand

distributions as well as Ï„ . Usually we suppress this dependence for easier notation Here we

reintroduce its dependence on Ï„ as it is critical here. Using integral-form Envelop theorem

we get :

Î´(V (Î·i,t|Ïƒt))

Î´(Ï„ )

= V (Î·i,t|Ïƒt) > 0

proof of proposition(2.4.4) rst we show bid is decreasing in uncertainty:

Î´(b(Î·i,t)

Î´(Ï„ )

=

Î´(Ï„ vi âˆ’ W (Î·i,t+1|Ï„ ))

Î´Ï„

vi âˆ’ Î´(W (Î·i,t+1|Ï„ ))

Î´Ï„

â‰¥ 0

Notice that this also implies

Î´(Ï„ ) â‰¥ 0 , thus we have
Î´(Î·i,t)
(cid:19)

(cid:18)

E(Pt) = E

(mt)
Î·
t
Î´(E[Î·
(mt)
t

â†’ Î´E(Pt)

Î´(Ï„ )

=

]

Î´(Î·t)
Î´(Ï„ )

â‰¥ 0

Î´(Î·t)

99

(D.11)

(cid:4)

(D.12)

(D.13)

(D.14)

(cid:4)

Proof of proposition(2.4.5)
dE(Pt|Î»new)

d(Ï„ )

Î´E(Pt|Î»new)

Î´(Î»new)

Î´(Î»new)

Î´Ï„

+

=

Î´E(Pt|Î»new)

Î´(Ï„ )

as rst term is positive we have
Î´E(Pt|Î»new)
Î´(Ï„ )
dE(Pt|Î»)

>

>

d(Ï„ )

(cid:4)

100

APPENDIX E

Figures for Chapter 3

Figure E.1: Sample of Yahoo! search result page

Figure E.2: Timeline

The gure shows the stages within each period. The advertiser decides the bid and keywords
at the start of the period, and the bid enters all auctions held in the period. A separate
auction is held for each search query. Keywords are words specied by the advertiser that
describe the ad and are used by Yahoo! to match it to the search queries.

101

Figure E.3: Snapshot of an advertiser's account on Yahoo!

The gure shows a sample snapshot of information available to advertisers. The rst one
shows aggregated data for each keyword. The second one shows the details for each keyword.
Click through rate refers to the ratio of the number of clicks times the number of displays.
Refer to this link for more information.

102

Figure E.4: Percentage of clicks across pages

The graph plots the percentage of the clicks received across dierent pages. This also shows
why the paper restricts the analysis to rst page. As we need to model consumers click
behavior limiting the ads to rst position captures most of the click decision. The later
pages will show less variation in the consumer's click behavior

103

Figure E.5: The popularity of the product categories for google in the past ve years

The graph plots the popularity of the categories for google in the past ve
years. The y-axis numbers represent search interest relative to the highest
point on the chart for the given region and time. A value of 100 is the
peak popularity for the term. A value of 50 means that the term is half
as popular. A score of 0 means there was not enough data for this term.
The average popularity score across last ve years for laptop, cable, car
insurance, cruise, coins are 55, 30, 20, 40, 32 respectively. Source: Google
Search Analytics click here for details

Figure E.6: No. of keywords specied by advertisers across days

The graph shows a scatter plot of the number of
keywords specied per advertiser across days. The
red line tracks the change in number of keywords
specied by the same advertiser. This variation
shows that the advertiser is using keywords to cap-
ture dierence in ad environment across searches,
this variation is used to see how the advertiser bids
dierently across keywords to accommodate the ef-
fect of other ads on the page.

104

Figure E.7: Average click rate and bid across ad positions

Click rate is the ratio of number of clicks over the
number of displays. The car insurance bid is scaled
by 1/10th of original value for descriptive analysis
only.

Figure E.8: Revenue per ad across bid

The graph shows the revenue per bid, where the
size of the circle shows the click rate of the ad
(larger circle signifying higher click rate). This
graph motivates the decision of the search engine
to weight the bid by the advertiser's eect on click
rate. Note: the revenue per ad is equal to price per
click multiplied by the quality score. As the price is
not given, this graph plots the bid instead of price,
which gives the upper bound on the revenue. The
results are for cruise category.

105

Figure E.9: Mean of bids at dierent winning positions

The bid is measured in cents. Note that as
the auction mechanism decides the allocation
of the ad position according to the weighted
bid, it is not necessary for a higher bid to get a
higher position. Due to the large dierence in
bids for car-insurance and other categories, I
have scaled the bid in car-insurance to 1/10th
of a cent.
(This is only done for this graph
and not in the data)

Figure E.10: Mean click probability for each ad across position

The plot shows the relative click of a single ad in
dierent positions. The click probability is mea-
sured relative to the clicks the ad received in the
rst position.

106

Figure E.11: Relation between the advertiser's bid and percentage of other high-quality ads
on the page

The plot graphs linear regression of bid on the pres-
ence of other high-quality ads on the page. The
plot shows preliminary evidence that the bid de-
creases with the increase in other high quality ads
on the page. The regression includes controls for
auction and consumer heterogeneity. The quality
is measured as click probability, which is calculated
as the ratio of click to impressions (note in ac-
tual estimation this is estimated on consumer side
model)

107

Figure E.12: Eye tracking Pattern

This gure plot the results of a eye tracking re-
search. It shows the eye movement of the consumer
on the page. The red region is the on scanned the
most followed by yellow and then blue. The pic-
ture reconrms that consumers do a top to bottom
scan. Thus, making ads on higher position more
valuable.

Figure E.13: Example of sub category within the laptop category

The plot shows an example of what a sub-category would look
within a major category such as laptop. I use a machine learning
clustering algorithm is to create such sub-groups.

108

Figure E.14: Keywords for Cosine and Euclidean distance

The plot shows keywords in the ve main categories. It shows
the relative performance of using dierent distance measure. the
one on the left uses Euclidean distance and the one on the right
uses Cosine distance. For the analysis cosine distance is used.

Figure E.15: The predicted click probability across positions

The plot shows the predicted click probability across
positions. The click probability is the predicted value
from the weighted logit model in step 1 of the estimation
method. The values are shown in percentage.

109

Figure E.16: The aect of choosing higher position on click probability

The plot shows the jump in click probability when
the ad moves one position up i.e. from position k
to k âˆ’ 1. for example, the value in position 1 is the
percentage increase in the click probability when
you move from 2nd position to 1st. The graph plots
the two highest jump for each category.

110

Figure E.17: Ad-Quality

The plot shows distribution of ad quality across the ve categories. The quality score is
measured as the advertiser's eect on click rate. It is estimated on the consumer side.

111

Figure E.18: Cumulative distribution of Externality Index

(a) Cable TV

(b) Cruise

(c) Car Insurance

(d) Coins

(e) Laptop

Each graph plots the predicted externality index for the ve ad categories. Ex-
ternality index is measured as a weighted sum of number of other ads where the
weight is a function of the quality score of the ad.

112

Figure E.19: Externality coecient across categories

In this graph, I plot the co-ecient from 2SLS log-log regression
of log of bid on log of externality. The coecient is interpreted
as the percentage increase in the bid when externality increases
by 1%.

Figure E.20: Eect of an additional ad on advertisers bid across dierent ad quality

Notes: In this graph, I plot the percentage decrease in a bid for
each additional ad. The x-axis shows the quality of the added ad.
The graph plots result in the laptop product category. Similar
results hold for others as well.

113

Figure E.21: Diminishing eect of an additional ad on advertisers bid

The graph plots results for laptop product category. Similar
result holds for others as well.

Figure E.22: Upper bound for the cumulative distribution function of the ad value

The plot shows the upper bound of the estimated distribution of
advertisers' ad value. The distribution of advertisers value for an
ad diers across product category. Thus, estiamting the distri-
bution empirically is critical for getting realistic counterfactual
results.

114

Figure E.23: Cumulative distribution of advertiser's ad value

(a) Cable TV

(b) Cruise

(c) Car Insurance

(d) Coins

(e) Laptop

Notes: This graph plots the upper and lower bound estimated for the ad-value
distribution. The x-axis plots the values and y-axis shows the corresponding cumu-
lative distribution at each point. Here ad value captures the advertisers' externality
free value that can be thought of as their value for an exclusive ad

115

Figure E.24: Gain in revenue from using the alternate method in appendix(H)

Figure E.25: Revenue-maximizing number of ads from alternate method suggested in ap-
pendix(H)

The plot shows the average number of ads that are
displayed in the new auction design that selects the
number of ads as well as the winning ads.

116

APPENDIX F

Tables for Chapter 3

Table F.1: List of variables available for consumers side data

Variable

Description

The day of the month.

Day
Advertiser ID The id for each advertiser.
Clicks

Ad Displays

Keyword

Keylength

Ad Position

The number of clicks received by a advertise-
ment.
The number of times an advertisement was
displayed.
The keyword gives
matched between search and the ad.
The number of words specied in the key-
word.
The winning position of the advertisement on
the search result page.

the specied words

Ad denition:
bid gives the advertisement

id-keywords combination for the same

117

Table F.2: List of variables available on advertisers side

Variable

Description

The day of the month.

Day
Advertiser ID The id for each advertiser.
Bid
Ad Displays

The bid used for an ad.
The number of times an advertisement was
displayed.
The keyword gives
matched between search and the ad.
The number of words specied in the key-
word.
The winning position of the advertisement on
the search result page.

the specied words

Keyword

Keylength

Ad Position

Ad denition:
bid gives the advertisement

id-keywords combination for the same

Table F.3: Summary statistics for consumers side data

Variable

Mean/Range Std. Dev/Max

Consumer's side variables:

Keywords (ad description & search common words)

3174 (count)

Keylength (no. of words in keyword)

CTR: Click Through Rate (click percentage)

# of Search per Day

Ad-position

3.06

.98%

1925.6

1-7

-

10(max)

9.86

996.3

-

Ad display frequency and number of ads are on per day level. Consumer side
data: Aggregated for each day-advertiser- keyword - ad position observation. Total
number of observations: 131524. Restricting data to ads on rst page

118

Table F.4: Summary statistics for advertisers side data

Variable

Mean/Range Std. Dev/Max

Advertiser's side variables:
Per Click Bid1
Number of ads per page

.632

6.95

Keywords (ad description & search common words)

3174 (count)

Keylength (no. of words in keyword)

Number of advertisers

Ad-position

3.06

26.4

1-7

2.0

.27

-

10(max)

10.04

-

[1] Bid measured in cents.
Advertiser side data:Aggregated for each day-ad position obser-
vation. Total number of observations: 21,599. Restricting data
to ads on rst page.

Table F.5: Features of dierent categories

Category

Click Rate

Bid Advertisers Searches/Day

Car Insurance

Laptop

TV Cable

Cruise

Collectible Coins

2.1%

1.6%

2.4%

1.1 %

1.7%

4.359

.233

.600

.371

.174

20

45

25

22

23

386.24

540.73

277.26

533.84

103.52

Table reports mean value of variables across the ve product
categories. The bid is measured in terms of cents

119

Table F.6: Externality co-ecient across categories

Car Ins

Laptop

Log Bid
Cable TV

Coins

Cruise

Log Externality

Quality

Ad Popularity

Keylength

Ad Specicity

-0.336
(0.133)

-1.9

(0.454)

-0.592
(0.211)

-0.618
(0.341)

0.0000725
(0.0000469)

0.000753
(0.00014)

0.0046342
(0.00142)

-0.064
(0.007)

0.006
(0.003)

-0.124
(0.026)

-1.365
(0.621)

1.5882
(0.4865)

0.0019
(0.0002)

0.00675
(0.0019)

-0.0836
(0.079)

1.071

(0.1967)

0.146

(0.0879)

-1.322
(0.0667)

0.0001

(0.00014)

-0.00004
(0.000019)

0.0103521
(0.0010219)

-0.076

(0.01145)

-0.0945421
(0.00537)

0.007206
(0.00043)

-0.0319
(0.00309)

No. of advertisers
Sub-category
No. of ads
N

X
X
X
729

X
X
X

X
X
X

X
X
X

X
X
X

6977

3219

3589

5041

Table F.7: Quality measure across product categories

Car-Insurance Laptop Cable Cruise Coins

Quality Score (0 âˆ’ 1):
Mean

0.05

0.21

0.30

0.27

0.13

(0.01)

(0.04)

(0.06)

(0.06)

(0.02)

Quantiles

25%

50%

75%

90%

0.00

0.01

0.06

0.13

0.05

0.14

0.29

0.51

0.11

0.21

0.46

0.66

0.06

0.21

0.42

0.64

0.05

0.10

0.16

0.26

The table summaries statistics of the predicted quality
score.

120

Table F.8: Estimated average partial eect of position on click probability

Variable

Laptop

Car-insurance Cable

Cruise

Coins

2nd Position -0.00110
(0.0002)

3rd Position

4th Position

5th Position

6th Position

7th Position

-0.00180
(0.0002)

-0.00235
(0.0002)

-0.00288
(0.0002)

-0.00301
(0.0002)

-0.00300
(0.0002)

-0.0116
(0.0010)

-0.0112
(0.0011)

-0.0202
(0.0012)

-0.0214
(0.0012)

-0.0247
(0.0012)

-0.0248
(0.0012)

-0.0228
(0.0015)

-0.0288
(0.0015)

-0.0343
(0.0015)

-0.0390
(0.0015)

-0.0407
(0.0015)

-0.0422
(0.0015)

-0.00367
(0.0003)

-0.00399
(0.0003)

-0.00431
(0.0003)

-0.00459
(0.0003)

-0.00455
(0.0003)

-0.00493
(0.0003)

-0.00831
(0.0015)

-0.0176
(0.0014)

-0.0193
(0.0015)

-0.0233
(0.0015)

-0.0258
(0.0015)

-0.0249
(0.0015)

The table shows the average partial eect on click probability relative to
1st position as calculated in the step 1 of consumer's click choice model.
Fixed eects for market and advertiser included.

121

Table F.9: Results for the revenue-maximizing number of ads per page

Predicted no. of ads per page
lower bound
upper bound

Predicted Change in prot

Externality eect Avg Quality

lower bound

upper bound

0.6942 %

1.698%

(0.694%, .694%)

(1.697%, 1.699%)

Medium

Low

1.014 %

2.985 %

Medium

Medium

(1.014%, 1.015%)

(2.985%, 2.987%)

15.575 %

22.7 %

(15.574%, 15.577%)

(22.697%, 22.704%)

0

0

0

0

High

Low

Nill

1000

High

Low

Medium

1000

Car Insurance

Laptop

TV Cable

Coins

Cruise

5

5

3

7

7

3

4

2

7

7

Simulation

1000

1000

1000

1000

Please refer to section(3.9) for more details. Note: The table shows the results from the counterfactual analysis of
deriving the revenue-maximizing number of ads. The results are calculated separately for each category and within each
category for each upper and lower bound of the ad-value distribution. The table also shows the percentage increase
in revenue from using the suggested number of ads as compared to the current number of ads, which is seven. The
externality estimate and quality variables are labeled low, medium and high in order to make it simpler to compare
categories. The top level among the ve categories is given the label high, the next two are given medium and the next
two are given low index.

122

Table F.10: Identifying dierent categories

Variable

Description

Clicks Bid No. of adv.

search

Car Insurance High price per click & highly con-
`Car Insur-

2.1%

4.359

20

386.24

1.6%

.233

2.4%

.600

1.1 % .371

1.7%

.174

45

25

22

23

540.73

277.26

533.84

103.52

across

Laptop

Cable

Cruise

Coins

centrated market :
ance'
Popular & high competition :
`Laptop'
Less popular & above average
price : `Cable'
Relatively popular & detailed
search : `Cruise'
Low value
`Coins'

variables:

Showing mean value for each category

Table F.11: Robust check: externality co-ecient across categories

Car Ins

Laptop

Cable TV

Coins

Cruise

Externality Index(1)

-.1714833
( .072278 )

-.2950749
(.1050882 )

-.6921709
( .319323 )

-.0419864
(.0399867 )

.0496461

( .0603489)

Externality Index(2)

-.3858009

-.6256772

-1.391457

-1.385585

.1321744

( .1372449 )

( .2365638 )

( .5827886 )

( 1.772531 )

( .0772414 )

No. of adv FE
Sub-category FE
No. of ads FE

X
X
X

X
X
X

X
X
X

X
X
X

X
X
X

The table shows the coecient of the externality index in step 2 of the estimation of 2SLS
regression of log of bid on externality

123

Table F.12: Robust check: dierent externality index denitions

Car Ins

Laptop

Cable TV

Coins

Cruise

top 10%

-.3362724

( .1330385 )

-1.541917
(.6485241 )

-.8776404
(.2510247 )

-.1130883

.1793007

( .0768405 )

( .1057224 )

above your quality

-.5183959
( .579788 )

-.2325425

(0.341)

-.09395

-1.385585

-2.85473

( .3748843 )

( 1.772531 )

(28.34121 )

No. of adv FE
Sub-category FE
No. of ads FE

X
X
X

X
X
X

X
X
X

X
X
X

X
X
X

The table shows the coecient of the externality index in step 2 of the estimation of 2SLS
regression of log of bid on externality

124

APPENDIX G

Proofs for Chapter 3

G.1 Theoretical Proofs

G.1.1 Proof for proposition(3.4.5)

K(cid:88)

k=1

(cid:20)

(cid:125)

(cid:20)

(cid:18)
vj âˆ’ E

(cid:12)(cid:12)(cid:12)(cid:12)bw

(cid:19)(cid:21)

The prot maximizing objective function for the case of no externality is given as below :

Î (b; vj, sj) = max

b

P rob(bw

j = b[k]) âˆ— ck,j

P (k)

j = b[k]

(G.1)

Let bâˆ— be the equilibrium bid of the no externality case, then I show that bâˆ—(EXTKj

)b1

is the equilibrium bid of externality case. Now in case of externality the ad value is

= vj(EXTKj

Vj,Kj
prot maximizing objective function in the presence of externality.

)b1 instead of vj. Substituting this in the equation above , I get the

(cid:20)

(cid:18)

(cid:19)(cid:21)

Î (b; vj, sj) = max

b

P rob(bw

j = b[k]) âˆ— ck,j

vj(EXTKj

)b1 âˆ’ E

P (k)

j = b[k]

(G.2)

K(cid:88)

k=1

Note that the above transformation does not aect the ranking of the ad values and thus

they also do not impact the ranking of the bid.1. Thus, This can be further solved to:

Î (b; vj, sj) = (EXTKj

)b1

max

b

P rob(bw

j = b[k]) âˆ— ck,j

P (k)

j = b[k]

Thus, the solution to the above objective function is bâˆ—âˆ—(EXTKj
bid in the presence of externality. Thus, the externality bid given by b(vj, sj; Î±K,n, ext) is

)b1, which is the equilibrium

equal to b(vj, sj; n, Î±K,n)

*(EXTKj

)b1. Taking log this implies the following:

K(cid:88)

k=1

(cid:12)(cid:12)(cid:12)(cid:12)bw
(cid:12)(cid:12)(cid:12)(cid:12)bw

(cid:20)

(cid:18)
vj âˆ’ E

(cid:19)(cid:21)(cid:21)

(cid:124)

(cid:123)(cid:122)

externality free bid

(cid:125)

(cid:124)

(cid:123)(cid:122)

Log(b(vj, sj; Î±K,n, ext)

) = Log(b(vj, sj; n, Î±K,n)

) + b1Log(Extj,Kj

(cid:124)

(cid:123)(cid:122)

(cid:125)

(cid:124)

(cid:123)(cid:122)

)

(cid:125)

observed bid

externality free bid

externality eect on bid

(G.3)

(cid:4)

1assuming monotonic bid

125

G.1.2

lemma(3.4.1) proof:

Recall that in the weighted GSP auction, the advertisers report per click bid. The equilibrium

bid bGSP for advertiser i is given as :

bGSP = Arg max

Ë†b

Î (Ë†b|vi, si) = max
Ë†b

(cid:20)

vi âˆ’

Î³ick

K(cid:88)

k=1

E

GSP,[k+1]
w

b

[k]

w = Ë†b âˆ— si

(cid:12)(cid:12)(cid:12)(cid:12)b

si

(cid:19)

(cid:21)

Ã—P rob(b

GSP,[k+1]
w

â‰¤ Ë†b âˆ— si â‰¤ b

GSP,[k+1]
w

)

(G.4)

Consider an alternative auction, which I refer GSP modied (GSPM). In this auction the

advertisers report bid for si number of clicks, where the number of clicks is equivalent to the
advertiser's quality score. The equilibrium bid bGSP M for advertiser i is given as :

(cid:18)

(cid:18)

(cid:12)(cid:12)(cid:12)(cid:12)b

(cid:19)

(cid:21)

E

GSP M,[k+1]
w

b

[k]
w = Ëœbw

bGSP M = Arg max
Ëœbw

Î (bw|vi, si) = max
Ëœbw

vi âˆ’

Î³ick

k=1

Ã—P rob(b

GSP,[k+1]
w

si
â‰¤ Ë†b â‰¤ b

GSP,[kâˆ’1]
w

)

(G.5)

Ëœb =

Ë†bw
si

, then we can rewrite the optimizing problem as

bGSP M = Arg max

Ëœb

Î (bw|vi, si) = max
Ëœb

vi âˆ’

Î³ick

(cid:18)

E

b

(cid:19)

[k]
w = Ëœb

(cid:21)

(cid:12)(cid:12)(cid:12)(cid:12)b

GSP M,[k+1]
w

si

Ã—P rob(b

GSP,[k+1]
w

â‰¤ Ë†b â‰¤ b

GSP,[kâˆ’1]
w

)

(G.6)

(cid:20)

(cid:20)

K(cid:88)

K(cid:88)

k=1

Now I will use the information that si is know to advertiser i and the auctioneer. Thus, the
above problem can be rewritten to maximize Ë‡b = Ëœb/si so that the optimal bid per click looks

like :

bGSP M = Arg max

Ë‡b

Î (Ë‡b|vi, si) = max
Ëœb

(cid:20)
vi âˆ’

Î³ick

K(cid:88)

k=1

(cid:18)

E

(cid:12)(cid:12)(cid:12)(cid:12)b

(cid:19)

(cid:21)

[k]

w = Ë‡b âˆ— si

GSP M,[k+1]
w

b

si

â‰¤ Ë‡b âˆ— si â‰¤ b

GSP M,[kâˆ’1]
w

)

(G.7)

Ã—P rob(b

GSP M,[k+1]
w

126

The optimization problem in equation(G.5) and (G.7) are equivalent, Thus, if all other
bcGSP M i

, then advertiser will also have bGSP

advertisers j (cid:54)= i have bGSP

bGSP M
j

=

=

i

.

si

j

sj

This shows that the GSPM equilibrium is one of the equilibrium for GSP, but since GSP
(cid:4)

has unique equilibrium, it implies the two auctions give the same equilibrium bid.

G.1.3 Proof of proposition (3.4.3)

Suppose bidder i bids Ëœb and everyone else is playing according to the equilibrium increasing
bidding strategy b(vj, sj). This equivalently means that bidder i has weighted bid Ëœbw and

everyone else bids the equilibrium weighted bid bw(wj) (refer to lemma(3.4.1) for more

details). Recall that Fw(.) is the distribution of the weighted value Ï‰ and Gw(.) is the

distribution of the equilibrium weighted bid bw. To solve for the equilibrium we rst elaborate

on the probability of getting a position. The probability of winning position k can be written

as :

(cid:0)N âˆ’ 1(cid:1)
(cid:0)K âˆ’ 1(cid:1)(1 âˆ’ G(Ëœbw))kâˆ’1(G(Ëœbw))Nâˆ’k

K(cid:88)

k=1

zk(Ëœbw) =

(G.8)

as the equilibrium weighted bid is an increasing function of weighted value wj, the above is

equivalent to

zk(bâˆ’1

w (Ëœbw)) =

K(cid:88)

k=1

(cid:0)N âˆ’ 1(cid:1)
(cid:0)K âˆ’ 1(cid:1)(1 âˆ’ F (bâˆ’1

w (Ëœbw)))kâˆ’1(F (bâˆ’1

w (Ëœbw)))Nâˆ’k

(G.9)

(G.10)

Consider an ecient equilibrium , then the prot function is given as :

ckzk(bâˆ’1

w (Ëœbw))

vj âˆ’ E

[k]
w = Ëœbw

(cid:19)(cid:21)

K(cid:88)

k=1

(cid:20)

(cid:18) b

(cid:12)(cid:12)(cid:12)(cid:12)b

[k+1]
w
si

Substituting value of zk(bâˆ’1

w (Ëœbw)) from equation(G.9) and then dierentiating we get the

above equation we get :

127

)

b

ck

Ã—

[k+1]
w
si

(cid:18)
vj âˆ’ E(

(cid:19)
(cid:0)N âˆ’ 1(cid:1)
(cid:0)K âˆ’ 1(cid:1)(N âˆ’ k)(1 âˆ’ F (bâˆ’1

(cid:20) K(cid:88)
(cid:0)N âˆ’ 1(cid:1)
(cid:0)K âˆ’ 1(cid:1)(k âˆ’ 1)(1 âˆ’ F (bâˆ’1
(cid:0)N âˆ’ 1(cid:1)
âˆ’ K(cid:88)
(cid:0)K âˆ’ 1(cid:1)(1 âˆ’ F (bâˆ’1

w (Ëœbw)))b

+ ck

k=1

ck

k=1

(G.11)

w (Ëœbw))kâˆ’1)F (bâˆ’1

w (Ëœbw)))kâˆ’2F (bâˆ’1

w (Ëœbw))Nâˆ’kf (bâˆ’1

(cid:48)

w (Ëœbw))b

w (Ëœbw))Nâˆ’kâˆ’1f (bâˆ’1
(cid:48)
(bâˆ’1
(cid:18)
w (Ëœbw))

w (Ëœbw))b

(bâˆ’1
(cid:21)
w (Ëœbw))
(cid:19)

(cid:48)

(bâˆ’1
w (Ëœbw))kâˆ’1(F (bâˆ’1

w (Ëœbw)))Nâˆ’k

d

[k+1]
E( b
w
si
d(b)

)

= 0

I focus on the symmetric equilibrium where bw() = Ëœbw() thus, the above can be rewritten as

(cid:18)
vj âˆ’ E(

(cid:19)(cid:20) K(cid:88)

(cid:0)N âˆ’ 1(cid:1)
(cid:0)K âˆ’ 1(cid:1)(N âˆ’ k)(1 âˆ’ F (Ï‰j)kâˆ’1)F (Ï‰j)Nâˆ’kâˆ’1 f (Ï‰j)

(cid:48)

b

(Ï‰j)

b

[k+1]
w
si

)

ck

k=1

+ ck

(cid:0)N âˆ’ 1(cid:1)
(cid:0)K âˆ’ 1(cid:1)(k âˆ’ 1)(1 âˆ’ F (Ï‰j)kâˆ’2)Nâˆ’kf (Ï‰j)b
(cid:0)N âˆ’ 1(cid:1)
âˆ’ K(cid:88)
(cid:0)K âˆ’ 1(cid:1)(1 âˆ’ F (Ï‰j))b

ck

(Ï‰j)kâˆ’1(F (Ï‰j))Nâˆ’k

(cid:48)

(Ï‰j)

(cid:48)

d

(cid:21)
(cid:18)

k=1

(cid:19)

)

[k+1]
E( b
w
si
d(b)

= 0

(G.13)

(G.12)

128

To further solve the last term lets rst open up the expected price:

(cid:19)

(cid:12)(cid:12)(cid:12)(cid:12)b

(cid:18) b

E

[k+1]
w
si
bk+1:N
= E(
w

si

b1:Nâˆ’k

w

= E(

(cid:90) Ëœbw

=

0

si
x
sj

(cid:124)

(cid:90) Ëœbw
(cid:18) b

0
[k+1]
w
si

(cid:12)(cid:12)(cid:12)(cid:12)b

=

Ëœbw
sj
â‡’ E

F Nâˆ’k(bâˆ’1(x))

(cid:19)

Ëœbw
sj

[k]
w = Ëœbw
|bk+1:N â‰¤ Ëœbw â‰¤ bkâˆ’1:N )
|b1:Nâˆ’k â‰¤ Ëœbw)
(N âˆ’ k)F Nâˆ’kâˆ’1(bâˆ’1(x))g(bâˆ’1(x))

F Nâˆ’k(bâˆ’1(Ëœbw))

(cid:123)(cid:122)

conditional distribution

(G.14)

dx

(cid:125)

Using integration by parts, we get :

(N âˆ’ k)F Nâˆ’kâˆ’1(bâˆ’1(x))f (bâˆ’1(x))

(cid:90) Ëœbw

0

dx âˆ’

F Nâˆ’k(bâˆ’1(x))
sjF Nâˆ’k(bâˆ’1(Ëœbw))

dx

[k]
w = Ëœbw

=

âˆ’ Î“(Ëœbw, f (.))

(G.15)

As can be expected the price in generalized second price auction is less than the bid and the

decrease in the bid is dened by Î“(Ë†b, sj, g(.)) which is equal to(cid:82) Ë†bsj

0

GNâˆ’k(x)
sj GNâˆ’k(b)

dx.substituting

129

the expected price in equation(G.13) we get:

(cid:18)

(cid:19) f (Ï‰j )

(cid:20) K(cid:88)

(cid:0)N âˆ’ 1(cid:1)
(cid:0)K âˆ’ 1(cid:1) (N âˆ’ k)(1 âˆ’ F (Ï‰j )kâˆ’1)F (Ï‰j )Nâˆ’kâˆ’1

vj âˆ’ E(

b

[k+1]
w
si

)

+ ck

âˆ’

â‡’

ck

ck

k=1

(cid:48)

b

k=1

(Ï‰j )

(cid:21)
(cid:18)Ëœbw

(cid:19)
âˆ’ Î“(Ëœbw, f (.)))

(cid:0)N âˆ’ 1(cid:1)
(cid:0)K âˆ’ 1(cid:1) (k âˆ’ 1)(1 âˆ’ F (Ï‰j ))kâˆ’2F (Ï‰j )Nâˆ’k
(cid:0)N âˆ’ 1(cid:1)
K(cid:88)
(cid:0)K âˆ’ 1(cid:1) (1 âˆ’ F (Ï‰j ))kâˆ’1(F (Ï‰j ))Nâˆ’k
(cid:0)N âˆ’ 1(cid:1)
(cid:18)
(cid:0)K âˆ’ 1(cid:1) (N âˆ’ k)(1 âˆ’ F (Ï‰j )kâˆ’1)F (Ï‰j )Nâˆ’kâˆ’1
vj âˆ’ E(
(cid:0)N âˆ’ 1(cid:1)
(cid:21)
(cid:0)K âˆ’ 1(cid:1) (k âˆ’ 1)(1 âˆ’ F (Ï‰j )kâˆ’2)F (Ï‰j )Nâˆ’k
(cid:0)N âˆ’ 1(cid:1)
(cid:18) 1
K(cid:88)
(cid:0)K âˆ’ 1(cid:1) (1 âˆ’ F (Ï‰j )kâˆ’1(F (Ï‰j ))Nâˆ’k

(cid:19) f (Ï‰j )

âˆ’ d(Î“(Ëœbw, f (.)))

(cid:20) K(cid:88)

[k+1]
b
w
si

(cid:19)

(Ï‰j )

k=1

d(b)

d(b)

d

sj

)

(cid:48)
b

= 0

= 0

ck

sj

+ ck

âˆ’

ck

k=1

â‡’ vj = E(

b

[k+1]
w
si

K(cid:80)

k=1

ck

sj

f (Ï‰j )
(cid:48)
(Ï‰j )
b

(cid:0)Nâˆ’1(cid:1)
(cid:0)Kâˆ’1(cid:1) (1 âˆ’ F (Ï‰j ))kâˆ’1(F (Ï‰j ))Nâˆ’k
ck
(N âˆ’ k)(1 âˆ’ F (Ï‰j ))kâˆ’1F (Ï‰j )Nâˆ’kâˆ’1 + (k âˆ’ 1)(1 âˆ’ F (Ï‰j ))kâˆ’2F (Ï‰j )Nâˆ’k

1 âˆ’ d(Î“(Ëœbw,f (.)))

(cid:18)

(cid:19)

d(b)

(cid:21)

)+

K(cid:80)
(cid:0)Nâˆ’1(cid:1)
(cid:0)Kâˆ’1(cid:1)(cid:20)

k=1

using 1 âˆ’ d(Î“(Ëœbw,f (.)))

d(b)

< 1 I get

â‰¤ E(

b

[k+1]
w
si

)+

K(cid:80)

k=1

ck

sj

f (Ï‰j )
(cid:48)
b
(Ï‰j )

K(cid:80)

(cid:0)Nâˆ’1(cid:1)
(cid:0)Kâˆ’1(cid:1) (1 âˆ’ F (Ï‰j ))kâˆ’1(F (Ï‰j ))Nâˆ’k

ck

k=1

(cid:21)
(N âˆ’ k)(1 âˆ’ F (Ï‰j )kâˆ’1)F (Ï‰j )Nâˆ’kâˆ’1 + (k âˆ’ 1)(1 âˆ’ F (Ï‰j )kâˆ’2F (Ï‰j )Nâˆ’k)

(cid:0)Nâˆ’1(cid:1)
(cid:0)Kâˆ’1(cid:1)(cid:20)

using the auction property that bid is always greater than the price, i.e. E(
â‡’ vj â‰¤ bw
sj

(cid:0)Nâˆ’1(cid:1)
(cid:0)Kâˆ’1(cid:1) (1 âˆ’ F (Ï‰j ))kâˆ’1(F (Ï‰j ))Nâˆ’k

K(cid:80)

ck

k=1

+

sj

f (Ï‰j )
(cid:48)
(Ï‰j )
b

K(cid:80)

k=1

ck

(cid:0)Nâˆ’1(cid:1)
(cid:0)Kâˆ’1(cid:1)(cid:20)

(N âˆ’ k)(1 âˆ’ F (Ï‰j )kâˆ’1F (Ï‰j )Nâˆ’kâˆ’1 + (k âˆ’ 1)(1 âˆ’ F (Ï‰j ))kâˆ’2F (Ï‰j )Nâˆ’k

b

[k+1]
w
si

) < bw

(G.16)

(cid:21)

(G.17)

130

Now to use this inequality in estimation part I will need to substitute the bid distribution

in place of latent distribution using the following equality conditions :

G(bw) = F (Ï‰|N )
f (Ï‰|N )
(Ï‰)
b

g(bw) =

(cid:48)

Thus, equation(G.17) can be written as :

vj â‰¤ bw
sj

+

K(cid:80)

k=1

ck

sj g(bw)

(cid:0)Nâˆ’1(cid:1)
(cid:0)Kâˆ’1(cid:1)(cid:20)

(cid:0)Nâˆ’1(cid:1)
(cid:0)Kâˆ’1(cid:1) (1 âˆ’ G(bw))kâˆ’1(G(bw))Nâˆ’k

K(cid:80)

k=1

ck

(N âˆ’ k)(1 âˆ’ G(bw))kâˆ’1G(bw)Nâˆ’kâˆ’1 + (k âˆ’ 1)(1 âˆ’ G(bw))kâˆ’2G(bw)Nâˆ’k

(cid:21)

Next I also show that the upper bound is equal to the equilibrium bid of the analog

generalized rst price auction. The generalized rst price auction (GFP) will have agents

pay their bid for dierent positions, since in this case auctioneer considers a score, the agents

pay their weighted bid. The maximizing prot function would be

bGF P (vj, sj) = Arg max
Ë†bsj

ckzk(Ë†bsj)

sjvj âˆ’ bjsj

(cid:20)

(cid:21)

K(cid:88)

k=1

the equilibrium bid will then be :

K(cid:88)

k=1

d(zk(Ë†bsj))

d(b)

ck

(cid:20)
sjvj âˆ’ bjsj

(cid:21)

âˆ’ K(cid:88)
K(cid:80)
K(cid:80)

k=1

k=1

k=1

ckzk(Ë†bsj) = 0

ckzk(Ë†bsj)

d(zk(Ë†bsj ))

d(b)

ck

â†’ bGF P = vj âˆ’

â†’ bGF P = vj âˆ’ G(bw)
âˆ†g(bw)

It can be shown that in case of one position , this equilibrium bid is equal to the equilibrium
(cid:4)

bid of rst price auction.

131

APPENDIX H

Revenue-maximizing number of ads: looking at exible number of ads

The empirical exercise has shown that the advertiser's account for the externality imposed

by other advertisers' presence on the ad space. The bids of the advertiser decrease with an

increase in the number of ads on the page. The natural question then is what is the optimal

number of advertisements on the ad space.

In this section, I propose an addition to the

auction mechanism, that can derive the optimal number of ads on the page. I propose a

new mechanism that adds a pre-auction stage where the optimal number of ads are decided.

following are the steps of involved in the mechanism:

Âˆ Bidder:

1. Bidders submit a 2-dimension bid. 1

2. The bid is composed of the bid for an exclusive ad option and a specied per-

centage decrease in the bid for each additional advertisement on the same page.

2

Âˆ auction mechanism

1. step 1: Optimal number of ads is decided

2. step 2: GSP auction is held

The proposed auction can derive the optimal number every time an auction is held; in other

words, every time the search is entered in Yahoo!. Using the estimates of the advertisers'

value and externality estimate, we can compare the old and new pricing mechanisms using

simulation. This has been left for future research.

1I am assuming the externality, i.e. the percentage decrease in the bid due to each ad-
ditional bid, is not known to the auctioneer. An alternative specication would be the
advertisers only report the bid, and the auctioneer applies the pre-specied percentage de-
crease.

2the percentage decrease is equal to the externality calculated in the step 1.

132

H.0.1 Simulation details

I now specify the steps involved in the simulation for deciding the revenue-maximizing num-

ber of ads. The steps are repeated for 100 simulation rounds, and the simulation is done

separately for each category.

1. Draw N independent values from the empirical distribution:

vj âˆ¼ Ë†HU ( Ë†Ï†)

2. Solve for equilibrium bid using:

Quality Â¯s), click rate (Ë†ck) and externality co-ecient (Î²1).

3. Pick the revenue-maximizing number of ads:

Nâˆ— = Argmax

N

T R(N )

4. Compare Yahoo!'s revenue in revenue-maximizing Nâˆ— and the average seven ads:

âˆ†(gain) = T R(Nâˆ—) âˆ’ T R(7)

H.0.2 Results

Âˆ The revenue-maximizing number of ads: The revenue-maximizing number of ads changes

with each category. The graph(E.25) plots the average number of ads allotted in the

new auction design, along with the condence interval. The product categories cable

TV and laptop are shown to have the highest decrease in the number of ads shown

compared to the current norm of showing an average of 7 ads.

133

Âˆ Revenue gain across categories: The new auction design leads to higher revenue for the

search engine. Graph(E.24) plots the average gain in revenue across categories. The

highest gain is observed in the cable TV, followed by the laptop. Notice that although

laptop and car insurance had similar co-ecient for the externality eect, the gains are

dierent. This is because the gain also depends on the quality of the ad. The average

quality is higher for laptop category than car insurance.

134

APPENDIX I

Robustness Check

I.0.1 Robust checks

Âˆ Denition of Externality index:

The externality index captures the eect of the quality as well as the number of other

ads on the page. In this paper the relation between the quality and the number of ads

is captured by using a weighted sum of number of ads where the structure is similar

to a euclidean distance. For convenience the index is reproduced below:

(cid:115)(cid:88)

k(cid:54)=K

Extj,m =

Ë†s2âˆ’j,m

The above denition is used as it captures the dierences over a single ad having high

quality compared to 4-5 ads having lower quality that sum upto the high quality of

single ad. Note that a linear sum over the quality other ads would not have been able

to capture such dierence. The square root is added to normalize the eect of squaring
the average quality (Ë†sâˆ’j,m). To check the sensitivity of the results to externality

index the step was rerun for alternative denition of the externality index as shown in

Table(F.11). The alternative denitions are as follows:

Externality index(1) = Extj,m = (cid:80)
Externality index(2) = Extj,m = (cid:80)

Ë†s2âˆ’j,m
Ë†sâˆ’j,m

k(cid:54)=K
k(cid:54)=K

Overall the table shows that the sign and signicance is not aected by the denition.

Âˆ Type of externality:

Another assumption made in the main analysis is that the externality is generated from

the presence of all the other ads. However the externality can be also be generated

from only the high quality ads, or instead only from the ads above you. Table(F.12)

presents results for externality when the eect is only from wither the top ten percent

135

of the ads or from ads with quality above the advertiser's quality. The results show that

externality is still negative and signicant. However, it might be more or less for certain

type of ads depending on the ad category. Another interesting observation is that in

cruise category the advertisers bid is negatively eected if only the ads with quality

above the advertisers ad is considered. This is insightful as for the general externality

that captures the eect of all the ads on the page, showed positive and insignicant

eect. This can be an indication that for cruise the advertisers are specically focusing

only on the competitors that have a higher or similar quality to theirs. Further analysis

on this topic can be done if additional information about the ads is provided such as

the similarity of product sold and price charged for the product. As the advertisers

names are masked such analysis can not be done with the current data

136

APPENDIX J

Data cleaning for Chapter 3

J.0.1 Data cleaning

The data set is provided by Yahoo! as part of the Yahoo! Research Alliance Webscope

program. It gives details about ve dierent categories, namely laptop, cable, coins, cruise,

and car insurance, over 123 days from January 2008 to April 2008. The data has information

about keywords, bid, clicks, ad position, and display frequency. The keywords in the data

set include one of the base category word 'coin,' 'laptop,' 'cable,' and 'car insurance.' Apart

from the base category word, the keywords also include one or more additional words. For

example, 'business laptop' and 'student laptop' are two keywords within the base category

laptop. The additional words provide a more targeted ad. For instance, an ad 'business

laptop' targets consumers that are specically looking for business usage; however, keyword

'laptop' captures a broader search for any laptop need of a consumer. The maximum number

of words in a keyword is 10.

Another key characteristic of this data is the keywords, and the advertiser's id is masked.

This means I can track the same advertiser across ads and time; however, the actual identity

of the advertiser is masked. More details on identifying the base categories are given in

the appendix(J.0.2). To do the analysis, I restrict the dataset to ads on the rst page and

consider only the rst seven ads. The total observations in the raw data set are 207982.

After restricting the data to the rst page, the data has 131524. The restriction to ads on

the rst page is important as the consumers click probability drops drastically as the ads go

beyond the rst page. As seen in g(E) 92% of the clicks are on rst page ads. Additionally,

the data is divided into two parts according to the aggregation needed for consumers or

advertisers analysis. On the consumer's side, the analysis focuses on the click decision on

the consumer. The data provider, Yahoo! Research Lab, aggregated the data for each day-

137

keyword-advertiser-position combination. Let us take an example of observation, for January

1st, 2008, the data reports that the keyword 'business laptop' specied by Walmart that was

displayed in the rst position got 100 displays and ve clicks. For the same keyword and

advertiser, i.e. Walmart ad with keyword 'business laptop,' I will have a dierent observation

for the ad displayed in the second position, which got 50 displays and two clicks. This means

that the data does not aggregate over dierent positions and reports results for each position

and keyword separately. This is an advantage for the consumer side, as the dierent keywords

can help in capturing the dierence in consumer search. Thus, the total observation on the

consumer side analysis is 131524.

On the other hand, for analysis on the advertiser's side, I need information on how the

advertiser maximized prot for each ad. This means here the relevant variation is the bid over

ads and not the keywords. Therefore, on the advertisers side, I aggregate the data on day-

ad-advertiser-position combination.1 This means that when deciding the bid, the advertiser

accommodates the expected clicks and price for each ad across dierent winning positions

over a day. An example of an observation is; on January 1st, 2008, the data reported that

an ad by Walmart was displayed in the rst position, got 200 displays, and ten clicks. The

results of the estimation without the aggregation on the consumer side are similar to the

one without the aggregation. However, as the model assumes that the advertiser maximizes

prot over an ad, the results from aggregation are more aligned with the advertiser's prot-

maximizing strategy. Note that the advertiser can always specify a dierent bid for each

keyword, and in this case, each keyword would be treated as a separate ad in this data set.

So the aggregation is only on the keywords for which the advertiser has specied the same

bid, meaning the bid was maximized over all the keyword that has the same bid per day.

The aggregated data gives 21,599 observations.

The estimation uses subgroups in dierent categories on the basis of similar keywords.

For more detail refer to section(3.6). On the consumer side, the analysis needs an estimation

1Recall the denition of an ad is the set of keywords for which the advertiser has specied

the same bid.

138

of click probability across a position for each market. However, there are some markets

that do not observe any clicks and therefore have to be dropped from the analysis. This

is less than 1% of the data. Apart from this, there are few advertisers that do not get

any click in the data. For these advertisers, I am not able to get the quality score as the

quality score is captured through the advertiser's eect on click probability. Therefore, I

assume the minimum observed quality score in the data as the imputed quality score for

these advertisers. The analysis is robust to assuming less than the minimum quality score,

as well. Note that dropping these observations would have introduced bias in the estimation

as we need to accommodate all the advertisers in the market. Additionally, there was some

market that did not have a consistent observation, meaning they had an observation for

lower ad position but not for higher ad position. These were treated as inconsistent with the

market rules as a lower ad position is only shown when there are ads at higher ad positions.

Therefore these were also dropped from the data.

J.0.2 Within product category variation

Recall that this data has varied product categories, namely `laptop', `cable', `cruise', `coin'

and `car insurance'. Although we know the dierent product categories, the keywords are

declassied and thus the product categories are also declassied.2 To overcome this we ana-

lyze the dierences in the deidentied category and match it to the closest possible category

among [ `laptop', `cable', `cruise', `coin' and `car insurance'] according to the observed fea-

tures. Table(F.10) gives a summary of how variables dier among categories. Additionally

the table(F.10) shows the corresponding mean value for all features for dierent categories.

Let us rst look at features of category 0. This category is characterized by above average

bid and small number of competitors relative to other categories, this is consistent with the

car insurance product category. They are known to be the industry with one of the highest

2the categories are identied through specicity of keyword

139

pay per click.3 This is due to the high prot margins in auto insurance industry(which

is result of it being a highly concentrated market). Other observations about this market

which makes it consistent with the car insurance is that there are no keywords with one

word, again this is consistence with car insurance since you have to at least type two words

`car' + `insurance'.

The next category that stands out is category 2, which is characterized by high number

of competition, high number of ads per search and high number of search queries per day.

Due to its high volume of consumer searches this is likely a consumer good, which makes it

closest to `Laptop' category in the data.4

Apart from this the other category that is easy to identify is category 4. Due to its low

value for search volume , bid and clicks, it is likely to be the less popular category in the data

i.e. `Coins'. Now lets try and identify the last two categories, these ones are very similar and

harder to identify. Thus it is rst important to analyze characteristic of the category left in

the data, which are `Cable' and `Cruise'. `Cruise' is a more popular search category and has

more detailed search that is higher keylength. By analyzing the data it seems category 3 ts

`Cruise' and category 2 ts `Cable'. The table below summarizes the ndings. Note although

these claims are just approximation, we will use them for the rest of the analyzes. Even if

there is some error in identifying the category , we can still use the features of the category

and interpret how and why the results might dier for categories with dierent features.

3refer to these articles for more information :

most-expensive-keywords-in-paid-search-by-cost-per-click-and-ad-spend/
http://www.automotivedigitalmarketing.com/photo/1970539:Photo:28810

- https://www.adgooroo.com/the-
and

4as that is the only consumer good category in the data

140

REFERENCES

141

REFERENCES

Agarwal, A., & Mukhopadhyay, T. (2016). The impact of competing ads on click performance

in sponsored search. Information Systems Research, 27 (3), 538557.

Aggarwal, G., Feldman, J., & Muthukrishnan, S.

equilibria of position-based auctions.
online algorithms (pp. 1528).

(2006). Bidding to the top: Vcg and
In International workshop on approximation and

Aggarwal, G., Goel, A., & Motwani, R. (2006). Truthful auctions for pricing search keywords.

In Proceedings of the 7th acm conference on electronic commerce (pp. 17).

An industry survey conducted by PwC. (2017). Iab internet advertising revenue report.

Armstrong, M. (2006). Competition in two-sided markets. The RAND Journal of Economics,

37 (3), 668691.

Ase, J., & Chade, H. (2008). An optimal auction with identity-dependent externalities.

The RAND Journal of Economics, 39 (3), 731746.

Athey, S., & Ellison, G. (2011). Position auctions with consumer search. The Quarterly

Journal of Economics, 126 (3), 12131270.

Athey, S., & Nekipelov, D.

(2010). A structural model of sponsored search advertising

auctions. In Sixth ad auctions workshop (Vol. 15).

Athey, S., & Segal, I.

(2013). An ecient dynamic mechanism. Econometrica, 81 (6),

24632485.

Bajari, P., Hong, H., & Ryan, S. P. (2010). Identication and estimation of a discrete game

of complete information. Econometrica, 78 (5), 15291568.

Bajari, P., Houghton, S., & Tadelis, S. (2004). Bidding for incomplete contracts.

Baye, M. R., Kovenock, D., & De Vries, C. G. (1996). The all-pay auction with complete

information. Economic Theory, 8 (2), 291305.

Bergemann, D., & Said, M. (2010). Dynamic auctions. Wiley Encyclopedia of Operations

Research and Management Science.

Bolt, W., & Tieman, A. F. (2008). Heavily skewed pricing in two-sided markets. International

Journal of Industrial Organization, 26 (5), 12501255.

BÃ¶rgers, T., Cox, I., Pesendorfer, M., & Petricek, V. (2013). Equilibrium bids in sponsored
search auctions: Theory and evidence. American economic Journal: microeconomics,
5 (4), 16387.

142

Brynjolfsson, E., Hui, X., & Liu, M. (2019). Does machine translation aect international
trade? evidence from a large digital platform. Management Science, 65 (12), 54495460.

Burtraw, D., Palmer, K., & Kahn, D. (2010). A symmetric safety valve. Energy Policy,

38 (9), 49214932.

Caillaud, B., & Jullien, B.

(2003). Chicken & egg: Competition among intermediation

service providers. RAND journal of Economics, 309328.

Constantin, F., Rao, M., Huang, C.-C., & Parkes, D. C.

(2011). On expressing value

externalities in position auctions. In Aaai.

Crampes, C., Haritchabalet, C., & Jullien, B. (2009). Advertising, competition and entry in

media industries. The Journal of Industrial Economics, 57 (1), 731.

Deb, R., & Pai, M. M. (2013).

Ironing in dynamic revenue management: posted prices
& biased auctions. In Proceedings of the twenty-fourth annual acm-siam symposium on
discrete algorithms (pp. 620631).

Deng, C., & Pekec, S. (2013). Optimal allocation of exclusivity contracts. Work .

Desai, P. S., Shin, W., & Staelin, R. (2014). The company that you keep: when to buy a

competitor's keyword. Marketing Science, 33 (4), 485508.

De Smet, D., & Van Cayseele, P. J. (2010). Product dierentiation on a platform: the

informative and persuasive role of advertising.

Dou, Y., & Wu, D. (2019). Platform competition under network eects: piggybacking and

optimal subsidization. Georgia Tech Scheller College of Business Research Paper (18-3).

Edelman, B., Ostrovsky, M., & Schwarz, M. (2007). Internet advertising and the generalized
second-price auction: Selling billions of dollars worth of keywords. American economic
review , 97 (1), 242259.

Fox, J. T. (2018). Estimating matching games with transfers. Quantitative Economics, 9 (1),

138.

Fox, J. T., & Bajari, P. (2013). Measuring the eciency of an fcc spectrum auction. American

Economic Journal: Microeconomics, 5 (1), 100146.

Gatti, N., & Rocco, M.

(2013). Which mechanism for sponsored search auctions with
externalities? In Proceedings of the 2013 international conference on autonomous agents
and multi-agent systems (pp. 635642).

Gershkov, A., & Moldovanu, B. (2009). Dynamic revenue maximization with heterogeneous
objects: A mechanism design approach. American economic Journal: microeconomics,
1 (2), 16898.

Gershkov, A., & Moldovanu, B. (2010). Ecient sequential assignment with incomplete

information. Games and Economic Behavior , 68 (1), 144154.

143

Ghosh, A., & Mahdian, M. (2008). Externalities in online advertising. In Proceedings of the

17th international conference on world wide web (pp. 161168).

Ghosh, A., & Sayedi, A. (2010). Expressive auctions for externalities in online advertising.

In Proceedings of the 19th international conference on world wide web (pp. 371380).

Giotis, I., & Karlin, A. R. (2008). On the equilibria and eciency of the gsp mechanism in
keyword auctions with externalities. In International workshop on internet and network
economics (pp. 629638).

Goldman, M., & Rao, J. (2014). Position auctions in practice.

Goldman, M., & Rao, J. (2016). Experiments as instruments: Heterogeneous position eects

in sponsored search auctions. EAI Endorsed Trans. Serious Games, 3 (11), e2.

Gomes, R., Immorlica, N., & Markakis, E. (2009). Externalities in keyword auctions: An
empirical and theoretical assessment. In International workshop on internet and network
economics (pp. 172183).

Gomes, R., & Pavan, A. (2016). Many-to-many matching and price discrimination. Theo-

retical Economics, 11 (3), 10051052.

Gomes, R., & Sweeney, K. (2014). Bayesnash equilibria of the generalized second-price

auction. Games and Economic Behavior , 86 , 421437.

Guerre, E., Perrigne, I., & Vuong, Q. (2000). Optimal nonparametric estimation of rst-price

auctions. Econometrica, 68 (3), 525574.

Hagiu, A. (2011). Quantity vs. quality: Exclusion by platforms with networks eects. Cite-

seer.

Haile, P. A., Hong, H., & Shum, M. (2003). Nonparametric tests for common values at

rst-price sealed-bid auctions (Tech. Rep.). National Bureau of Economic Research.

Haile, P. A., & Tamer, E. (2003). Inference with an incomplete model of english auctions.

Journal of Political Economy, 111 (1), 151.

Heish, Y., Shum, M., & Yang, S. (2014). To score or not to score? structural estimation of

sponsored search auctions (Tech. Rep.). mimeo.

HortaÃ§su, A., & McAdams, D.

(2018). Empirical work on auctions of multiple objects.

Journal of Economic Literature, 56 (1), 15784.

Hsieh, Y.-W., Shum, M., & Yang, S.

(2015). To score or not to score? estimates of a

sponsored search auction model.

Hummel, P., & McAfee, R. P. (2014). Position auctions with externalities and brand eects.

arXiv preprint arXiv:1409.4687 .

Itoh, M. (1983). Monopoly, product dierentiation and economic welfare. Journal of Eco-

nomic Theory, 31 (1), 88104.

144

Iyengar, G., & Kumar, A. (2006). Characterizing optimal keyword auctions.

In Second

workshop on sponsored search auctions.

Izmalkov, S., Khakimova, D., & Romanyuk, G. (2016). Position auctions with endogenous

supply.

Jehiel, P., & Moldovanu, B. (2005). Allocative and informational externalities in auctions

and related mechanisms.

Jeitschko, T. D.

(1999). Equilibrium price paths in sequential auctions with stochastic

supply. Economics Letters, 64 (1), 6772.

Jeon, D.-S., Kim, B.-C., & Menicucci, D. (2016). Second-degree price discrimination by a

two-sided monopoly platform.

Jerath, K., & Sayedi, A. (2015). Exclusive display in sponsored search advertising.

Jeziorski, P., & Moorthy, S. (2017). Advertiser prominence eects in search advertising.

Management Science, 64 (3), 13651383.

Jeziorski, P., & Segal, I. (2015). What makes them click: Empirical analysis of consumer
demand for search advertising. American Economic Journal: Microeconomics, 7 (3), 24
53.

Krasnokutskaya, E. (2004). Identication and estimation in highway procurement auctions

under unobserved auction heterogeneity.

Lahaie, S. (2006). An analysis of alternative slot auction designs for sponsored search. In

Proceedings of the 7th acm conference on electronic commerce (pp. 218227).

Lu, S., Zhu, Y., & Dukes, A. (n.d.). Position auctions with budget-constrained advertisers.

Lu, Z., & Riis, C. (2016). Bayes-nash equilibria in generalized second price auctions with

allocative externalities.

Maskin, E., & Riley, J. (1984). Monopoly with incomplete information. The RAND Journal

of Economics, 15 (2), 171196.

McAfee, R. P., & McMillan, J. (1987). Auctions with a stochastic number of bidders. Journal

of economic theory, 43 (1), 119.

Mierendor, K. (2013). The dynamic vickrey auction. Games and Economic Behavior , 82 ,

192204.

Mierendor, K. (2016). Optimal dynamic mechanism design with deadlines. Journal of

Economic Theory, 161 , 190222.

Milgrom, P. (2007). Simplied mechanisms with applications to sponsored search and package

bidding (Tech. Rep.). Working paper, Stanford University.

145

Mussa, M., & Rosen, S. (1978). Monopoly and product quality. Journal of Economic theory,

18 (2), 301317.

Muthukrishnan, S. (2009). Bidding on congurations in internet ad auctions. In International

computing and combinatorics conference (pp. 16).

Narayanan, S., & Kalyanam, K. (2015). Position eects in search advertising and their

moderators: A regression discontinuity approach. Marketing Science, 34 (3), 388407.

Ostrovsky, M., & Schwarz, M. (2011). Reserve prices in internet advertising auctions: A
eld experiment. In Proceedings of the 12th acm conference on electronic commerce (pp.
5960).

Pai, M. M., & Vohra, R. (2013). Optimal dynamic auctions and simple index rules. Mathe-

matics of Operations Research, 38 (4), 682697.

Parkes, D. C., & Singh, S. P. (2004). An mdp-based approach to online mechanism design.

In Advances in neural information processing systems (pp. 791798).

Pavan, A., Segal, I., & Toikka, J. (2008). Dynamic mechanism design: Revenue equivalence,

prot maximization and information disclosure preliminary and incomplete.

Peitz, M., & Valletti, T. M. (2008). Content and advertising in the media: Pay-tv versus

free-to-air. international Journal of industrial organization, 26 (4), 949965.

Qin, T., Chen, W., & Liu, T.-Y. (2015). Sponsored search auctions: Recent advances and
future directions. ACM Transactions on Intelligent Systems and Technology (TIST), 5 (4),
60.

Ramaboa, K. K., & Fish, P. (2018). Keyword length and matching options as indicators
of search intent in sponsored search. Information Processing & Management, 54 (2), 175
183.

Roberts, M. J.

(n.d.). Spence, michael (1976).". Euent Charges and Licenses Under

Uncertainty." Journal of Public Economics, 5 , 193208.

Rochet, J.-C., & Tirole, J. (2003). Platform competition in two-sided markets. Journal of

the european economic association, 1 (4), 9901029.

Rochet, J.-C., & Tirole, J. (2006). Two-sided markets: a progress report. The RAND journal

of economics, 37 (3), 645667.

Said, M. (2011). Sequential auctions with randomly arriving buyers. Games and Economic

Behavior , 73 (1), 236243.

Said, M. (2012). Auctions with dynamic populations: Eciency and revenue maximization.

Journal of Economic Theory, 147 (6), 24192438.

Satterthwaite, M., & Shneyerov, A. (2007). Dynamic matching, two-sided incomplete in-
formation, and participation costs: Existence and convergence to perfect competition.
Econometrica, 75 (1), 155200.

146

Schmalensee, R.

(2011). Why is platform pricing generally highly skewed? Review of

Network Economics, 10 (4).

Schurter, K. (2017). Identication and inference in rst-price auctions with collusion. work-

ing Paper, University of Chicago.

Seim, K. (2006). An empirical model of rm entry with endogenous product-type choices.

The RAND Journal of Economics, 37 (3), 619640.

Spence, A. M. (1975). Monopoly, quality, and regulation. The Bell Journal of Economics,

417429.

Spence, A. M. (1980). Multi-product quantity-dependent prices and protability constraints.

The Review of Economic Studies, 47 (5), 821841.

Suarez, F. F., & Cusumano, M. A.

(2009). The role of services in platform markets.

Platforms, markets and innovation, 7798.

Szalay, D. (2009). Regulating a multi-attribute/multi-type monopolist. University of Bonn,

unpublished working paper .

Varian, H. R. (2007). Position auctions. international Journal of industrial Organization,

25 (6), 11631178.

Varma, G. D.

(2002). Standard auctions with identity-dependent externalities. RAND

Journal of Economics, 689708.

Viecens, M. F. (2006). Two-sided platforms with endogenous quality dierentiation.

Watts, A. (2016). Two ways to auction o an uncertain good. Journal of Economics, 119 (1),

115.

Webster, M., Wing, I. S., & Jakobovits, L. (2010). Second-best instruments for near-term
climate policy: Intensity targets vs. the safety valve. Journal of Environmental Economics
and Management, 59 (3), 250259.

Weitzman, M. L. (1978). Optimal rewards for economic regulation. The American Economic

Review , 68 (4), 683691.

Yang, S., Lu, S., & Lu, X. (2013). Modeling competition and its impact on paid-search

advertising. Marketing Science, 33 (1), 134153.

Zhu, Y., & Wilbur, K. C. (2011). Hybrid advertising auctions. Marketing Science, 30 (2),

249273.

147