MULTIVARIATE FRACTIONAL RESPONSE MODELS IN A PANEL SETTING WITH AN
APPLICATION TO PORTFOLIO ALLOCATION
By
Michael Anthony Carlton

A DISSERTATION
Submitted to
Michigan State University
in partial fulfillment of the requirements
for the degree of
Economics – Doctor of Philosphy
2013

ABSTRACT
MULTIVARIATE FRACTIONAL RESPONSE MODELS IN A PANEL SETTING WITH AN
APPLICATION TO PORTFOLIO ALLOCATION
By
Michael Anthony Carlton
Several papers use subjective survival probabilities as a measure of mortality risk in studying
economic behavior. The first chapter “Wealth Holdings, Asset Allocation and Mortality: A Test
of the Information Content of Subjective Survival Probabilities” studies whether subjective
survival probability measures contain any additional information that can explain differential
wealth holdings and asset allocation among households. We find some evidence that survival
probabilities can explain differences in household wealth holding and allocation once we control
for other factors that affect decision-making. We also find that the estimated impact of subjective
survival is sensitive to the inclusion of reported survival probabilities of one.
Some fractional response variables, like the proportion of financial wealth allocated
across multiple assets, must satisfy an adding up restriction. In the second chapter “A Model for
Multivariate Fractional Responses with an Application to Asset Allocation”, we develop a twostep procedure where we estimate a model with multiple fractional response variables exploiting
the fact that these variables sum to one in each period and are correlated over time. The first step
entails estimation of the multivariate fractional responses using the multinomial quasi-likelihood
function which explicitly imposes the adding-up restriction and the second step uses the
Classical Minimum Distance estimator to account for serial correlation.
Many panel data estimators implicitly assume that we have a balanced panel at our
disposal. Unfortunately this is rarely the case and dropping observations is an unsatisfactory
solution to the problem. Estimation of fractional responses in a panel requires assumptions about

the distribution of the unobserved effect and its relationship with observables, which requires
special treatment in an unbalanced panel. In the third chapter, “Estimation of a Multivariate
Fractional Response Model with Unbalanced Panel Data”, we extend the approach in
Wooldridge (2010) to the case of multiple fractional responses and apply this to unbalanced
panel data on the allocation of financial wealth across several assets.

ւ

≺⊙ ⊙ ց
TABLE OF CONTENTS

LIST OF TABLES ......................................................................................................................... iv
LIST OF FIGURES ...................................................................................................................... vii
ABBREVIATIONS ..................................................................................................................... viii
CHAPTER 1
WEALTH HOLDINGS, ASSET ALLOCATION, AND MORTALITY: A TEST OF THE
INFORMATION CONTENT OF SUBJECTIVE SURVIVAL PROBABILITIES ......................1
1.1 Introduction ....................................................................................................................1
1.2 Theory and Literature Review .......................................................................................5
1.3 Model Specification, Descriptive Statistics and Estimation Strategy..........................10
1.4 Results and Discussion ................................................................................................17
1.5 Conclusion ...................................................................................................................27
CHAPTER 2
A MODEL FOR MULTIVARIATE FRACTIONAL RESPONSES WITH AN APPLICATION
TO ASSET ALLOCATION ..........................................................................................................30
2.1 Introduction ..................................................................................................................30
2.2 Fractional Response Variables .....................................................................................32
2.3 Single Equation Fractional Probit ................................................................................33
2.4 The Multinomial Quasi-Likelihood .............................................................................37
2.5 Classical Minimum Distance Estimation .....................................................................41
2.6 An Alternative Specification........................................................................................46
2.7 Results ..........................................................................................................................48
2.8 Conclusion ...................................................................................................................56
CHAPTER 3
ESTIMATION OF A MULTIVARIATE FRACTIONAL RESPONSE MODEL WITH
UNBALANCED PANEL DATA ..................................................................................................58
3.1 Introduction ..................................................................................................................58
3.2 Single Equation Fractional Probit in a Balanced Panel ...............................................60
3.3 Single Equation Fractional Probit in an Unbalanced Panel .........................................63
3.4 The Multinomial Quasi-Likelihood .............................................................................65
3.5 Results ..........................................................................................................................69
3.6 Conclusion ...................................................................................................................76
APPENDIX ....................................................................................................................................78
BIBLIOGRAPHY ........................................................................................................................118

iii

ւ

≺⊙ ⊙ ց
LIST OF TABLES

Table 1: Summary Statistics for Single Households......................................................................79
Table 2: Summary Statistics for Married Households ...................................................................80
Table 3: Average Wealth and Proportion of Financial Wealth Allocated by Pr(Live to 75),
Single Households .........................................................................................................................81
Table 4: Average Wealth and Proportion of Financial Wealth Allocated by Husband’s
Pr(Live to 75), Married Households ..............................................................................................82
Table 5: Average Wealth and Proportion of Financial Wealth Allocated by Wife’s
Pr(Live to 75), Married Households ..............................................................................................82
Table 6: Wealth Holding Estimation Results, Single Households ................................................83
Table 7: Average Partial Effects (Reported as Percentages) for Allocation of Financial
Wealth, Single Households, Estimated by Single Equation Fractional Probit ..............................84
Table 8: Wealth Holdings Estimation Results, Married Households ............................................86
Table 9: Average Partial Effects (Reported as Percentages) for Allocation of Financial
Wealth, Married Households, Estimated by Single Equation Fractional Probit............................88
Table 10: Sensitivity Analysis, Removing Permanent Income,
Single and Married Households, Wealth Holding and Asset Allocation.......................................91
Table 11: Sensitivity Analysis, Treating Pr(Live to 75) = 1 Differently,
Single Households, Wealth Holding and Asset Allocation ...........................................................92
Table 12: Sensitivity Analysis, Treating Pr(Live to 75) = 1 Differently,
Married Households, Wealth Holding and Asset Allocation.........................................................93
Table 13: Means and Medians of Relevant Variables ...................................................................94
Table 14: Proportion of Variation Between Households for Relevant Variables ..........................94
Table 15: Average Partial Effects from Pooled Linear Fixed Effects OLS Estimation
on Entire, Unbalanced Panel ..........................................................................................................95
Table 16: Average Partial Effects from Single Equation Pooled Bernoulli Quasi-Maximum
Likelihood Estimates for each Balanced Panel Subset, Averaged Across Balanced Panel
Subsets ...........................................................................................................................................96

iv

ւ

≺⊙ ⊙ ց

Table 17: Average Partial Effects from Pooled Multinomial Quasi-Maximum Likelihood
Estimates for each Balanced Panel Subset, Averaged Across Balanced Panel Subsets ................97
Table 18: Procedure 2: Classical Minimum Distance Estimation Applied to Multinomial
Quasi-Maximum Likelihood Estimates for each Year Within Balanced Panel Subsets ...............98
Table 19: Comparing Average Partial Effect Estimates from Pooled Single Equation
Bernoulli Quasi-Maximum Likelihood Estimates for each Balanced Panel Subset,
Averaged Across Balanced Panel Subsets to Procedure 2 Average Partial Effect Estimates .......99
Table 20: Comparing Average Partial Effects Estimates from Pooled Multinomial
Quasi-Maximum Likelihood Estimates for each Balanced Panel Subset, Averaged Across
Balanced Panel Subsets to Procedure 2 Average Partial Effect Estimates ....................................99
Table 21: Comparing Average Partial Effect Estimates from Pooled Linear Fixed Effects
OLS on Entire, Unbalanced Panel to Procedure 2 Average Partial Effect Estimates..................100
Table 22: Means and Medians of Relevant Variables .................................................................101
Table 23: Proportion of Variation Between Households for Relevant Variables ........................101
Table 24: Average Partial Effect Estimates from Pooled Linear Fixed Effects OLS
Estimation on Entire Unbalanced Panel ......................................................................................102
Table 25: Average Partial Effects from Single Equation Pooled Bernoulli Quasi-Maximum
Likelihood Estimates on Entire Unbalanced Panel,
Ignoring Unbalanced Nature of the Panel ....................................................................................103
Table 26: Average Partial Effects from Single Equation Pooled Bernoulli Quasi-Maximum
Likelihood Estimates for each Balanced Panel Subset,
Averaged Across Balanced Panel Subsets ...................................................................................104
Table 27: Average Partial Effects from Pooled Bernoulli Quasi-Maximum Likelihood
Estimates on Entire, Unbalanced Panel, Adjusting for Unbalanced Nature of the Panel ............105
Table 28: Average Partial Effects from Pooled Multinomial Quasi-Maximum Likelihood
Estimates for Entire, Unbalanced Panel, Ignoring Unbalanced Nature of the Panel ...................106
Table 29: Average Partial Effects from Pooled Multinomial Quasi-Maximum Likelihood
Estimates for each Balanced Panel Subset, Averaged Across Balanced Panel Subsets ..............107
Table 30: Average Partial Effects from Multinomial Quasi-Maximum Likelihood Estimates
on Entire, Unbalanced Panel, Adjusting for Unbalanced Nature of the Panel ............................108

v

ւ

≺⊙ ⊙ ց

Table 31: Average Partial Effects from Multinomial Quasi-Maximum Likelihood Estimates
using the Multinomial Logit Functional Form on the Entire, Unbalanced Panel,
Adjusting for the Unbalanced Nature of the Panel ......................................................................109

vi

ւ

≺⊙ ⊙ ց
LIST OF FIGURES

Figure 1: Average Structural Function for the Proportion of Financial Wealth Allocated
to Stocks Against Husband Word Recall .....................................................................................110
Figure 2: Average Structural Function for the Proportion of Financial Wealth Allocated
to Bonds Against Husband Word Recall .....................................................................................111
Figure 3: Average Structural Function for the Proportion of Financial Wealth Allocated
to CDs Against Husband Word Recall ........................................................................................112
Figure 4: Average Structural Function for the Proportion of Financial Wealth Allocated
to Checking Against Husband Word Recall ................................................................................113
Figure 5: Average Structural Function for the Proportion of Financial Wealth Allocated
to Stocks Against Wife Word Recall ...........................................................................................114
Figure 6: Average Structural Function for the Proportion of Financial Wealth Allocated
to Bonds Against Wife Word Recall ...........................................................................................115
Figure 7: Average Structural Function for the Proportion of Financial Wealth Allocated
to CDs Against Wife Word Recall...............................................................................................116
Figure 8: Average Structural Function for the Proportion of Financial Wealth Allocated
to Checking Against Wife Word Recall ......................................................................................117

vii

ւ

≺⊙ ⊙ ց
ABBREVIATIONS

AHEAD............................................................. Asset and Health Dynamics among the Oldest Old
ANOVA ........................................................................................................... Analysis of Variance
APE ................................................................................................................ Average Partial Effect
ASF ...................................................................................................... Average Structural Function
CAMS .............................................................................. Consumption and Activities Mail Survey
CDF ............................................................................................. Cumulative Distribution Function
CMD ................................................................................................... Classical Minimum Distance
HRS ..................................................................................................... Health and Retirement Study
IRA ................................................................................................... Individual Retirement Account
OLS .............................................................................................................. Ordinary Least Squares
P75 ................................................................................ Subjective Probability of Living to Age 75
Pr(Live to 75) ............................................................... Subjective Probability of Living to Age 75
QMLE ................................................................................. Quasi-Maximum Likelihood Estimates

viii

ւ

≺⊙ ⊙ ց
CHAPTER 1
WEALTH HOLDINGS, ASSET ALLOCATION, AND MORTALITY: A TEST OF THE
INFORMATION CONTENT OF SUBJECTIVE SURVIVAL PROBABILITIES

1.1 Introduction
Economic theory predicts that individuals save to finance their consumption in
retirement. An important factor in deciding the level and allocation of wealth to different assets
is the length of time the individual expects to live after retirement. The risk that an individual
faces if they do not properly account for their survival expectations is that they will outlive their
assets. Social Security and pensions can help to alleviate this risk by providing a fixed stream of
income in retirement, but if individuals are attempting to smooth consumption there is a potential
for a significant decrease in utility later in life from consuming too much early in life. Therefore,
we should see that forward-looking individuals adjust their current behavior to their individual
life expectancy.
It is straightforward to determine whether mortality has an impact on wealth holding and
asset allocation. With repeated observations on individuals, we can simply see if wealth holding
and asset allocation is different for individuals with longer actual lifetimes. The drawback to this
approach is that it is looking at actual lifetimes, not expected lifetimes. In any period, an
individual does not know their actual date of death but only has some idea of their probability to
reach a target age. One can use life-table probabilities as a proxy for individual survival, but
since these are averages over the entire population, they only vary by age, race, and gender. The
ideal measure that we would like to use is individual’s expected survival probabilities at the
point in time that they are making their decision to save and allocate wealth.
Our goal in this paper is to determine whether we can explain any of the heterogeneity in

1

ւ

≺⊙ ⊙ ց

wealth holdings and asset allocation using subjective survival probabilities elicited in the Health
and Retirement Study (HRS). The HRS collects detailed information on household asset
holdings as well as measures of subjective survival probabilities. Therefore, we have a source of
data that contains the information that we would need to determine the impact of subjective
survival probabilities on wealth holding and allocation behavior.
Several studies have analyzed the validity of these subjective survival probabilities in
terms of actual mortality risk. Hurd and McGarry (1995) find that these measures are internally
consistent, match closely to life table averages and covary with known correlates of actual
1

mortality . In a follow-up study, Hurd and McGarry (2002) find that these probabilities are also
good predictors of actual mortality experiences of the HRS respondents. Smith, Taylor and Sloan
(2001) support this finding but also point out that there is a large portion of the sample that report
very small changes in their survival probabilities over time.
Elder (2012) looks closely at the subjective survival probabilities in the HRS and finds
systematic differences as compared to life tables probabilities. He finds that the subjective
survival probabilities do not account for yearly increases in mortality rates and that individuals
do not update their survival probabilities as expected. He also finds that the life-table survival
probabilities are considerably better predictors of actual survival than the subjective survival
probabilities. Perhaps most concerning is that Elder (2012) provides compelling evidence that
subjective survival probabilities in the HRS contain significant noise; in fact so much random
noise that it may overwhelm any individual information that reflects actual heterogeneity.
While the above studies provide evidence that subjective survival probabilities match
aggregate life-table probabilities but systematically differ from how mortality rates actually vary
1

Hurd and McGarry (1995) do note that blacks report higher subjective survival probabilities
than their white counterparts. This is inconsistent with the life-table probabilities.
2

ւ

≺⊙ ⊙ ց

over the lifetime it is still important to determine whether they are useful in explaining
heterogeneity in economic behavior. What is important to note is that the findings in Elder
(2012) suggest that we are likely to see very little impact from subjective survival probabilities
simply because there is little signal in these measures.
Our empirical work focuses on estimating reduced-form relationships of wealth holdings
and asset allocation with subjective survival probabilities elicited from individuals. We attempt
to control for the heterogeneity in factors that economic theory tells us should affect the wealth
holding and allocation decision including basic demographics (age, education, gender, etc.),
income (both current and permanent), as well as proxies for an individual’s time preference,
cognition, and risk aversion. Since the HRS collects financial data at the household level, we
model the household’s wealth holding and the proportion of financial wealth allocated to stocks,
bonds, CDs, and checking/savings/money market accounts. We estimate different models for
single and married households since multi-person households have to take into account the
survival expectations of both members. To estimate our allocation equations we use the QuasiMaximum Likelihood approach proposed in Papke and Wooldridge (1996, 2008) which is
appropriate for estimating fractional response variables.
2

We find mixed evidence for the impact of subjective survival on wealth holdings . For
single households, we find that a one-percentage point difference in reported subjective survival
probability (a difference of 0.01) leads to about $266 more Net Worth and about $124 more
Financial Wealth holdings, both significant at the 5% level. For married households we find that
a one-percentage point difference in the husband’s subjective survival probability leads to

2

We find mixed evidence in terms of statistical significance of our estimates. Regardless of
statistical significance, all of our estimated impacts for subjective survival probabilities are very
small in magnitude.
3

ւ

≺⊙ ⊙ ց

increased Net Worth of about $241 and about $29 more Financial Wealth but both estimates are
statistically insignificant at standard levels. A one-percentage point difference for the wife’s
subjective survival probability is associated with about $425 more in Net Worth (statistically
significant at the 5% level) but, contrary to theory, leads households to hold about $22 less in
Financial Wealth, though this estimate is not statistically significant.
There appears to be no impact of survival probability on the proportion of financial
wealth allocated across assets for single households though we note that we see the correct sign
of the impact of survival probabilities on the proportion of wealth allocated to stocks and
checking. For married households, we find that a one-percentage point difference in the
husband’s subjective survival probability leads the household to allocate about 0.026% more to
stocks and a one-percentage point difference in the wife’s subjective survival probability leads
the household to allocate around 0.022% more financial wealth to stocks; these are significant at
the 5% and 10% levels. The average partial effects for the subjective survival probabilities in our
checking equation are negative for both husband and wife but are very small and statistically
insignificant at standard levels.
We find no evidence that our estimated impacts of subjective survival probabilities are
sensitive to the inclusion of a measure of permanent income. If we drop our measure of
permanent income, we find that the estimated impact of subjective survival increases slightly for
wealth holdings and asset allocation equations. We also show that our estimated impacts for
subjective survival on wealth holding and asset allocation look very different when we treat
reported survival probabilities of one differently. This is consistent with the findings from Elder
(2012) which suggest that there is significant measurement error in these reported survival
probabilities.

4

ւ

≺⊙ ⊙ ց
The outline of this paper is as follows. Section 2 briefly discusses the underlying theory

and reviews studies that use subjective survival probabilities to explain economic behavior.
Section 3 briefly explains the HRS, provides descriptive statistics and patterns in the data, and
details the analysis of the informational content of survival probability measures in terms of
wealth holding and allocation. Section 4 presents the results of our analysis and Section 5
concludes.
1.2 Theory and Literature Review
Economic theory indicates that forward-looking individuals with higher survival
probabilities will hold more wealth and allocate a larger portion of their wealth to the risky asset
than individuals with lower survival probabilities. This is because with time-separable utility the
introduction of non-zero survival probabilities effectively multiplies a time varying factor to the
individual’s constant discount rate. Bernheim, Skinner, and Weinberg (2001) perform a simple
simulation and demonstrate that individuals with lower discount rates hold more wealth than
individuals with higher discount rates. Cocco, Gomes, and Maenhout (2005) and Sahm (2007)
show that the underlying policy function that defines the optimal portfolio choice of the
household is a function of total wealth: the sum of discounted future income and current
financial wealth holdings. With uncertain lifetimes, the individual will discount future income by
incorporating the probability that they will earn that future income stream. They show that the
proportion of financial wealth allocated to the risky asset is positively related to how “certain”
their future income stream is. DeNardi et al. (2009) also document that higher survival
probabilities (even if they are small) will lead even the oldest and sickest individuals to spend
down their retirement wealth very slowly. DeNardi et al. (2009) also show that the impact of
lower survival probabilities leads individuals to decrease their wealth holdings.

5

ւ

≺⊙ ⊙ ց
The basic premise here is that individuals will smooth the marginal utility of

consumption over time by setting the marginal utility of current consumption to the marginal
utility of the discounted future consumption stream. As long as individuals discount the future by
including the probability of survival we will see that a person with higher survival probabilities
will hold more wealth and allocate a larger proportion of wealth to the risky asset.
Several studies attempt to link subjective survival probabilities to economic behavior and
these tend to focus on the areas of retirement, bequests, wealth holding, and consumption. Hurd
et al. (2002) analyze the decisions to retire and claim Social Security benefits early. Using
responses to subjective survival to age 85 and Social Security earnings data matched to the first
four waves of the HRS, they estimate their model on two different samples: those who are retired
prior to age 62 and those not retired by 62. They find that there is a small, statistically significant
increase in retirement and claiming of benefits but only for those individuals that report that their
probability of living until age 85 is zero.
Delevande et al. (2006) revisit the retirement and claiming issue with the hope of
obtaining estimates that are more accurate by instrumenting for measurement error in the
reported survival probabilities. They use the response to the question of survival to the age of 75
instead of 85 used by Hurd et al. (2002). As instruments for the subjective survival probability,
they use demographic information, mortality experience of parents, and an optimism index,
which is a predicted value generated from a factor analysis of the remaining subjective
probability questions. This optimism index reflects the correlation of the responses to all
probability expectation questions in the HRS and potentially represents the unobserved
heterogeneity in individual expectations. Notably this constructed optimism index has a large
and statistically significant impact on their instrumental variable for the subjective survival

6

ւ

≺⊙ ⊙ ց

probability. Similar to Hurd et al. (2002) they estimate the impact of survival probabilities on
claiming behavior for the group of respondents that are retired by age 62 and the impact of
survival probability on early retirement and early claiming for the sample of respondents that are
still working at age 62. Using the raw subjective survival probability measures they find no
statistically significant impact of survival on either claiming or retirement. When they instrument
for subjective survival they find that there is a statistically significant impact on early claiming
but not on retirement. Their results suggest that a five-percentage point increase in the predicted
survival probability will lead to a 1.9 percentage point decrease in the number of people that will
claim Social Security benefits early. Delevande et al. (2006) claim that using instrumental
variables to rid the subjective survival probability of measurement error is the reason that they
are able to find a statically and economically significant effect of subjective survival on claiming
behavior.
Gan, Gong, Hurd, and McFadden (2004) study the impact of subjective survival
probability measures on the bequest behavior of elderly households. They derive estimable
equations from a life-cycle model so that they can estimate structural parameters that can
describe the individual’s preferences for bequests. To account for focal points and to calculate a
survival curve they construct a measure of yearly mortality rates based on responses to subjective
survival questions in the Asset and Health Dynamics among the Oldest Old (AHEAD) study. In
addition, they estimate their equations using the life-table survival curves to compare the
predictive power of the subjective survival curves. Since they are estimating a life-cycle model,
they are able to simulate consumption and wealth trajectories and compare model predictions to
actual decisions within the AHEAD panel. They find that their predicted consumption and
wealth trajectories using the survival curves derived from the subjective survival probabilities

7

ւ

≺⊙ ⊙ ց

outperform the predicted values using life-table survival. They also note that their estimates
suggest that bequest motives of the older population represented in the AHEAD are very small
and that most bequests are involuntary or accidental.
Salm (2006) attempts to estimate the structural parameters from the Euler equation for
consumption derived from a simple life-cycle model. He uses data from single households that
completed the HRS interview in 2000 and 2002 and completed the Consumption and Activities
Mail Survey (CAMS) in 2001 and 2003. Using the subjective survival probabilities, he
constructs yearly survival rates following Gan et al. (2004). He interprets the inverse of his
estimated coefficient on the subjective survival probability as his estimated value for the risk
aversion parameter. To allow for precautionary savings he estimates the model including an
estimated variance of out-of-pocket medical expenses to proxy for consumption risk. Salm
(2006) finds that higher subjective survival probabilities lead to decreases in the growth rate of
consumption.
Perry (2005) estimates an empirical model derived from the Euler equation for
consumption. He constructs yearly survival rates from the subjective survival probability
responses in the HRS. Perry (2005) constructs his measure of consumption by looking at
differences between wealth holdings across periods. He finds that there is no statistically
significant relationship between the constructed subjective survival probabilities and his measure
of consumption. Perry (2005) points to substantial measurement error in his measure of
consumption as the culprit for the lack of statistical significance in his estimates.
Bloom et al. (2006) studies the impact of subjective survival probabilities on retirement
and wealth holdings of both single and married households. Bloom et al. (2006) take a sample of
individuals that were aged 50-70 in 1992 and estimate the relationship between their retirement

8

ւ

≺⊙ ⊙ ց

decisions and wealth holding and subjective survival probability to age 75. To correct for
potential measurement error they instrument subjective survival probabilities using mortality risk
factors and parental mortality experience. They find no statistically significant impact of
subjective survival on either retirement or wealth holding for single households. Looking at
married households, they find no statistically significant relationship for the retirement decision
of either the husband or wife. For married households they find that once they instrument for the
subjective survival probability they estimate a statistically significant impact of survival
probabilities of both spouses. Their estimates suggest that a ten percentage point change in the
husband’s survival probability leads to a $27,600 increase in wealth (significant at the 10%
level) and a ten percentage point increase in the wife’s survival probability leads to a $32,600
increase in wealth. Bloom et al. (2006) interpret these findings as evidence that households save
more in the face of higher expected lifetimes because there is no incentive to postponing
retirement.
DeNardi, French, and Jones (2009) look at the impact of survival uncertainty, medical
expenses, and health uncertainty on the wealth holdings of elderly individuals in the AHEAD.
They estimate a structural model using the Method of Simulated Moments and matching the
medians that were in the data to the medians estimated by the structural model. They did not use
the subjective survival probabilities that are collected in the AHEAD survey; instead, they
3

estimate future survival relying on the actual mortality experience in the panel . They find that
the structural model fits the data very well. This lends much more credibility to the simulations
that they perform to isolate the impact of differential mortality on wealth holdings. Since they
have modeled their survival probabilities as a function of gender, health status, and permanent
3

The model for survival probabilities includes the prior period’s health status, permanent
income, and gender.
9

ւ

≺⊙ ⊙ ց

income, they perform simulations that show the impact of changes within the components of
survival. It appears that the impact of health, gender, and permanent income all have similar
impacts on the wealth holdings of elderly households. The simulations clearly point to the fact
that the slow spend down of wealth in old age is due to uncertain lifetime. As long as there is a
possibility of outliving one’s assets there will be a significant precautionary savings motive.
With the exception of Hurd et al. (2002), Bloom et al. (2006), and Delevande et al.
(2006), all of the cited works estimate either an Euler equation or a structural model using either
the HRS or the AHEAD. Most of the studies that use the subjective survival probabilities either
transform them into yearly survival curves or use instrumental variable techniques to correct for
measurement error. Our work is probably closest to Bloom et al. (2006), although we take
serious the findings of the other studies that we have reviewed. We estimate models for wealth
holdings of both single and married households and find a statistically significant but small
impact on wealth holdings for single and married households. While there are a few empirical
studies of asset allocation in the HRS they look at health status and not survival probabilities
(Rosen and Wu (2004) and Berkowitz and Qiu (2006)). Lillard and Willis (2001) estimate a
model of asset allocation including a measure of the number of focal point responses across all
subjective probability questions, which they interpreted as a measure of cognition. They found
that fewer focal point responses correlate with increased allocation to the risky asset.
1.3 Model Specification, Descriptive Statistics, and Estimation Strategy
The goal of our empirical exercise is to determine whether subjective survival
probabilities help to explain the differences in wealth holding and allocation behavior between
households in the Health and Retirement Study. We estimate reduced form equations to
determine the relationship between household wealth holdings, allocation of wealth and survival

10

ւ

≺⊙ ⊙ ց

probabilities. In our model specification we include basic demographic characteristics: age,
education, working status, household size, gender, marital status, and current income. Theory
dictates that the household’s wealth holding and allocation decisions are a function of not only
their current resources but also what they expect their future resources will be. To this end, we
include a measure of permanent income in our model specifications. Differences in risk aversion
among households can affect the amount of wealth and the allocation of wealth across assets. To
proxy for risk aversion we include responses to questions regarding a household’s willingness to
accept different income gambles. Households can also differ in their discount rates. We include
responses to questions about the length of time that the individual considers for financial
planning as a proxy for discount rates. Previous research (Elder (2012), McArdle et al. (2009)
and Lillard and Willis (2001)) suggest that an individual’s cognition can also affect the outcome
of the wealth holding and asset allocation decisions; we include measures of word recall and
simple numerical calculations captured in the HRS as proxies for cognition.
We will estimate separate equations for single and married households. We present
equations for single households; married households will include the same set of regressors for
both the husband and wife. Let i represent the household and t represent the year of the survey.
To estimate the impact of survival probabilities on wealth holding we specify the following
equation:
Wit = β1 ⋅ Pr(Live to 75) + βXit + uit

(1.3.1)

where Wit is a measure of wealth, either Net Worth or Financial Wealth, Pr(Live to 75)
represents our measure of subjective survival probability from the HRS (see below for the
specific question used to solicit the survival probability), and Xit are control variables that we
detail below. We also estimate the relationship between survival probabilities and the allocation
11

ւ

≺⊙ ⊙ ց

of Financial Wealth across four types of assets; stocks, bonds, CDs, and checking/savings/money
market accounts:

yitg = Φ(θ1 ⋅ Pr(Live to 75) + θZit ) + ε it

(1.3.2)

where yitg represents the proportion of financial wealth allocated to asset g in year t by
household i . Here Zit represents control variables that will include the same variables we use in
the wealth holding equations with the addition of a measure of total wealth available to the
household at time t . Due to the fractional nature of the asset share equation in (1.3.2) we have
followed Papke and Wooldridge (1996, 2008) and specified that the conditional mean function is
nonlinear; Φ represents the standard normal cumulative distribution function.
We include controls for standard demographic variables that can affect behavior such as
2

3

age, age , age , household size, education, as well as controls for year effects. Theory tells us
that it is necessary to control for current income as well as the value of lifetime income. We
measure current income as the sum of all non-capital income of the household as reported in all
waves. As a measure of lifetime resources we follow Altonji and Doraszelski (2005) and
calculate a household’s permanent income. To do this we regress current household income on
2

3

4

age, age , age , age , a set of year dummies, household size, and indicators for marital status,
4

gender and education level. For each household we calculate the average of the residuals from
this regression and then compute the permanent income of the household as the sum of the
average residual and the predicted value of income for the education level of the individual and

4

For married households, we do not include indicators for gender or marital status.
12

ւ

≺⊙ ⊙ ց
5

assuming that the individual is at the average age in our sample. Since there is likely a nonlinear
impact of these income measures we also include squares of current and permanent income as
6

well as their cross product in our wealth holding equations .
Theory dictates that total wealth has an impact on the allocation of wealth across
available assets. Ideally, the measure of wealth that we would include in our model would
include pension wealth, Social Security wealth and the present value of all future income. For
our measure of wealth we use the log of Net Worth which is the sum of Financial Wealth (the
value of holdings in stocks, bonds, CDs and checking/savings/money market accounts) and NonFinancial Wealth (the value of holdings in IRA/Keogh accounts, housing, vehicles, other real
7

estate, and trusts) minus any debt associated with these asset holdings. Our hope is that the
combination of Net Worth and the estimated permanent income measure will act as a good proxy
for the total wealth measure that theory dictates will affect decision-making.
In addition to controlling for differences in household demographics, income, and wealth
there is also a need to control for risk aversion and discount rates of households. To proxy for
discount rates we will use responses to HRS question regarding the financial planning horizon of
8

the household . To control for risk aversion we will use questions meant to solicit aversion to
income risk and we will categorize individuals based on their responses to a series of unfolding

5

For single households we use calculate permanent income at age 58. For married households
we use 58 for the husband and 54 for the wife, both of these are the average ages in our sample.
6
In our asset allocation equations we actually use permanent log income, which we estimate in a
similar manner to permanent income with the exception that we regress the log of current income
on the demographic variables.
7
We use RAND imputations for missing values of income and asset holdings when the
respondent was unable to give exact values.
8
The choices available from the planning horizon question are: Next Few Months, Next Year,
Next Few Years, 5-10 Years and 10+ Years. We use Next Few Months as our base category.
13

ւ

≺⊙ ⊙ ց

questions regarding different income scenarios following Barsky, et al. (1997). This essentially
categorizes an individual into one of four groups: High Risk Aversion, Medium-High Risk
Aversion, Medium-Low Risk Aversion and Low Risk Aversion. We use High Risk Aversion as
our base group.
We include two additional controls to proxy for the cognitive ability of the individual.
The HRS collects several measures of individual cognition, but not all measures are available in
all waves of the survey. In every wave of the HRS, respondents are given a list of nouns and then
asked to repeat this list immediately and at the end of the cognition section. Our first measure of
cognitive ability is the proportion of words recalled at the end of the cognition section. In
addition, each individual performs a series of five simple numerical calculations. First, the
individual subtracts 7 from 100. The next question in the series asks the individual to subtract 7
from the answer to the previous calculation. The individual performs this calculation a total of
five times. As a second measure of cognition, we use the number of correct calculations, ranging
from 0 to 5. By no means do we think that these two measures will completely capture the
cognitive ability of the individual. Our hope is that accounting for education level and some
time-varying measure of cognitive ability that we can accurately proxy for the cognitive ability
of an individual.

9

We use data from the Health and Retirement Study (HRS). The HRS began in 1992 and
was nationally representative of all non-institutionalized individuals aged 51-61 in that year. In
1998 the HRS combined with the Asset and Health Dynamics of the Oldest Old (AHEAD) and
added several new cohorts to be nationally representative of the population of non9

McArdle, Smith, and Willis (2009) study the ability of cognitive measures in the HRS to
explain wealth holdings and allocation. They find that word recall is positively correlated with
allocation to the risky asset (stocks) and the amount of wealth held. Their analysis does not
account for survival probabilities.
14

ւ

≺⊙ ⊙ ց

institutionalized individuals aged 51 and older. In addition to initial respondents, spouses are
interviewed and followed in subsequent interviews. We use data for all cohorts from Waves 1-7
(1992-2004) of the HRS. In addition to basic demographic variables, the HRS collects detailed
information on wealth, allocation, income and its sources, health and measures of the probability
that future events occur. The main variable of interest in this analysis is the response to the
following question:
On a scale from 0 to 100, where 0 is no chance and 100 is absolutely certain, what are
the chances that you will live to age 75 or older?
We use responses to this question as a measure of subjective survival probabilities.
We construct our data set by household. For married households we combine spouses in
each wave so that our panel consists of household observations by year. To select individuals for
our sample we include observations that satisfy the following criteria:
(i)

respondent is younger than 65 at the time of the interview,

(ii)

respondent does not have missing values for subjective survival probability,

(iii)

responses are not from a proxy interview and

(iv)

the respondent is still living at the time of the interview.

These criteria are applied at the individual level so married households that do not have both
spouses are dropped. The decision to allocate wealth across assets is conditional on holding
positive financial wealth; therefore, for estimating allocation relationships we drop observations
where financial wealth is zero. Imposing the above criteria for married households leaves us with
6,608 unique households (18,603 observations) to estimate wealth holding equations (we call this
our Wealth Sample). If we drop those observations where households are holding zero financial
wealth we are left with 6,048 households (16,690 total observations) which we will call our

15

ւ

≺⊙ ⊙ ց

Allocation Sample. Applying the criteria to single households we are left with 5,022 households
(14,275 total observations) for our Wealth Sample and dropping observations with zero financial
wealth we are left with 4,053 households (10,573 total observations) in our Allocation Sample.
We calculate means and medians for the Wealth and Allocation Samples for both single
and married households. Table 1 displays these descriptive statistics for single households. We
can see that single households are predominately white females. A majority of individuals have
not completed college (a little over 80%). Most single households hold most of their net worth in
non-financial wealth. Between the Wealth and Allocation sample, we can see that individuals
that hold positive financial wealth tend to report higher survival probabilities, have slightly more
education, income, and wealth. On average, it appears that single households tend to hold higher
amounts of their financial wealth in checking, savings, and money market accounts.
Table 2 displays descriptive statistics for the Wealth and Allocation Samples for married
households. Compared to single households, married household members tend to have slightly
higher education. Married households also report holding a large portion of their net worth in
non-financial wealth. Married households earn more income on average and they have higher
wealth holdings relative to single households. Comparing the Wealth Sample to the Allocation
Sample, it appears that households with positive financial wealth are better educated, earn
slightly more income, and hold more wealth. In comparison to single households it appears that
married households allocate about 10% more of their financial wealth to stocks and ten percent
less to checking, savings, and money market accounts.
Table 3 displays the raw relationship between average wealth holdings and average asset
allocation and survival probabilities for single households. Tables 4 and 5 show the same
relationship for married households; Table 4 uses the husband’s reported survival probability and

16

ւ

≺⊙ ⊙ ց

Table 5 uses the wife’s reported survival probability. We can see that average wealth holdings
are higher for higher values of survival with the exception of those individuals that report a
survival probability of one. We can also see that the average proportion of financial wealth to
stocks is increasing with higher survival probabilities and the allocation to checking is
decreasing with higher survival probabilities. Since stocks and checking are the most risky and
least risky of the four financial assets this finding seems to fit well with what theory predicts.
The objective of this study is to determine whether this correlation still exists once we have
removed the impact of other factors that affect wealth holding and allocation behavior.
1.4 Results and Discussion
Using equation (1.3.1) we estimate models of wealth holding for single households;
results are presented in Table 6.

10

We see a positive impact of survival probabilities on both Net

Worth and Financial Wealth; a one-percentage point difference in survival probability is
associated with about $256 more Net Worth and $124 more Financial Wealth, both estimates are
significant at the 5% level. Education plays a key role in explaining differences in wealth
holdings among single households. Households that have a college degree have nearly $111,000
more in Net Worth (and nearly $51,000 more Financial Wealth) than individuals that have not
completed High School. There also appears to be a significant positive impact of a longer
financial planning horizon. Those households that plan for more than ten years into the future
have about $95,000 more in Net Worth and $39,000 more Financial Wealth than households that
only plan for the next few months.
We now turn to the results of estimating equation (1.3.2) using the Quasi-Maximum
10

Throughout the paper (including the tables) we refer to a one-percentage point change in the
subjective survival probability. This is equivalent to a 0.01 change in our measured variable. The
reported coefficients and average partial effects in the tables are already calculated for a 0.01
change in subjective survival probability.
17

ւ

≺⊙ ⊙ ց

Likelihood Estimation proposed in Papke and Wooldridge (1996, 2008) for fractional response
variables. Table 7 presents the estimated average partial effects from estimation of this fractional
probit on the proportion of Financial Wealth allocated to the four different assets: stocks, bonds,
CDs, and checking/savings/money market accounts.
There appears to be no impact of survival probabilities on the allocation of wealth for
single households. While the signs on the average partial effects of survival probability on stocks
and checking are what theory predicts (positive and negative, respectively) they are estimated to
be nearly zero and are all statistically insignificant at any standard level. We see that households
with a college degree allocate 11% more of their financial wealth to stocks and 10% less to
checking accounts than households that have not completed High School, both significant at the
5% level. Financial Planning Horizon also plays a role in asset allocation, we estimate that
households with a horizon of ten years or more will allocate 5% more financial wealth to stocks,
1% more to bonds, and 6% less to checking accounts than those households that have a horizon
of only a few months.
Table 8 contains the results from estimation of the wealth holding equation (1.3.1) for
married households. We see a positive impact of a wife’s subjective survival on Net Worth
holdings; a one-percentage point difference in a wife’s survival probability is associated with
about $425 more in Net Worth, this is significant at the 5% level. A one-percentage point
difference in the husband’s subjective survival is associated with about $241 more Net Worth,
but this estimate is not statistically significant. Looking at the impact of subjective survival on
Financial Wealth, we see that the estimated impact for a husband’s survival is very small and
insignificant, and that the estimate of the impact of a wife’s survival probability is actually
negative, though both estimates have very large standard errors. The estimated impact of

18

ւ

≺⊙ ⊙ ց

education on wealth holdings is not as clear-cut for married households as for single households.
We see that the husband’s education has no statistically significant relationship with Net Worth;
it appears that having less than high school education leads to more wealth holdings than if the
husband completed high school or some college. The wife’s education level does have the
expected impact on Net Worth. If the wife has a college degree, we estimate that household to
hold about $84,000 more in Net Worth than a similar household with a wife that did not
complete high school. Our estimates for education in the financial wealth regressions make a
little more sense. We find that a college degree for the husband leads to about $37,000 more in
financial wealth. If the wife also completed college, we estimate that the household’s financial
wealth will increase by $38,000. We also see that having the longest financial planning horizon
(10 + years) has a positive impact on wealth holdings. If both spouses have horizons of ten years
or more we see them holding about $110,000 more in Net Worth and about $52,000 more in
Financial Wealth.
Table 9 displays the results of the fractional probit estimation of equation (1.3.2) on the
allocation of Financial Wealth across the four asset types. We see that there is a statistically
significant impact of both husband and wife survival probabilities on the allocation to stocks and
CDs. A one-percentage point difference in the husband’s subjective survival leads households to
hold about 0.026% more of their financial wealth in stocks and about 0.013% less in CDs, these
estimates are significant at the 5% and 10% level respectively. A one-percentage point difference
in the wife’s survival probability leads the household to hold about 0.022% more financial
wealth in stocks and 0.019% less financial wealth in CDs, both these estimates are significant at
the 10% level. While the estimated average partial effects of husband and wife survival
probabilities are statistically insignificant for the allocation of wealth to checking we can see that

19

ւ

≺⊙ ⊙ ց

we estimate negative, though small impacts of higher survival.
Similar to our findings for single households we see that higher education levels of both
spouses lead to increased allocation to stocks and decreased allocation to checking. Not only is
the estimated impact of a college degree statistically significant for both spouses for all assets but
the magnitude is also large compared to the estimated impact of subjective survival to age 75.
We see that the financial planning horizon of the husband only is positively associated with the
allocation of wealth to stocks and negatively associated with the allocation to checking. This is
not the case for the wife’s planning horizon. For stock allocation, we see that longer planning
horizons have a negative impact relative to the shortest horizon of only a few months. Looking at
the results for the allocation of wealth to checking we can see that only the longest planning
horizon (10+ years) has a negative impact, though none of the estimated average partial effects
for the wife’s planning horizon are statistically significant.
One main reason that our results for single and married households differ is that married
households need to consider the impact of the characteristics of both household members.
Women live longer than men and therefore would be the more likely of the two spouses to have
to face the risk of outliving assets. While we find an impact of the wife’s increased survival
probabilities on Net Worth, it is very small and does not appear to carry over to the financial
wealth of the households. Looking at the allocation decision, we see that the impact of the wife’s
survival probabilities is smaller than the estimated impact of the husband’s. In addition, it
appears that only the financial planning horizon of the husband has any impact on allocation. We
see that the longest planning horizon (relative to the shortest horizon) for the husband increases
the proportion allocated to stocks by about 3% and decreases the proportion allocated to
checking by 4%. It appears that the characteristics of the wife play a small role in the decision of

20

ւ

≺⊙ ⊙ ց

how much to save and how to allocate wealth across assets. This could be consistent with the
idea that the household maximizes their joint utility, i.e. they only consider the lifetime of the
household when both spouses are alive. If this is the case, we would see that divorced or
widowed women would have less wealth at their disposal. From our analysis of single
households, we see that divorced women have less wealth but widowed individuals have greater
wealth. The asset allocation of divorced and widowed households in the analysis of singles
indicates that they allocate less to stocks and more to checking, but none of these estimates are
statistically significant. A more detailed analysis into the decision making of married households
and the way that they pool their characteristics to make wealth and allocation decisions is
definitely an avenue for future research.
We test the sensitivity of our results for subjective survival responses for wealth holding
and asset allocation for single and married households to dropping variables that include our
constructed measure of permanent income. We remove permanent income, its square, and the
cross product of permanent income and current income from the wealth holding equations and
we remove permanent log income from the asset equations.
For single households, we can see that removing permanent income from our wealth
holding equations leads to an increased impact of subjective survival probabilities. Without
controlling for permanent income, we see that a one-percentage point difference in subjective
survival leads single households to hold $339 more Net Worth and $156 more Financial Wealth.
Removing permanent log income from our asset allocation equations we see that the impact of
subjective survival does not change much. There is no statistically significant impact of
difference in subjective survival, though we do see that the estimated impacts of higher
subjective survival increased slightly for stocks and bonds and decreased slightly for CDs and

21

ւ

≺⊙ ⊙ ց

checking.
When we remove permanent income variables from the wealth holding equations for
married households, we estimate that a one-percentage point difference in husband’s subjective
survival probability leads the household to hold $356 more in Net Worth and $66 more in
Financial Wealth. A one-percentage point difference in the wife’s subjective survival probability
increases Net Worth by $522 and Financial Wealth by $12. While the estimated impact for
Financial Wealth remains statistically insignificant and economically small for both spouses we
see that the estimated impact of a one-percentage point difference in subjective survival
probability for either spouse is generating about $100 more Net Worth and that the estimate for a
husband’s subjective survival probability is now significant at the 10% level. Removing
permanent log income from the asset allocation equations has a similar effect as in single
households. The estimated impact of subjective survival probabilities slightly increases for
stocks and bonds and slightly decreases for CDs and checking. We see that our estimated impact
for a one-percentage point difference in the husband’s subjective survival probability leads to
0.027% more of the households wealth allocated to stocks and only 0.015% less wealth allocated
to checking. We see that the one-percentage point difference in the wife’s subjective survival
probability is now driving a larger impact on the allocation of wealth to stocks. Including
permanent log income we see that the impact was around 0.022% more in stocks (significant at
the 10% level), but dropping our permanent log income measure we see that this estimate
increases slightly to 0.023% and remains statistically significant at the 10% level. In addition,
including permanent log income we estimated a negative (statistically insignificant) impact of
increased subjective survival probabilities of both spouses on the proportion of wealth allocated
to checking. When we drop permanent log income, the estimated impacts become slightly more

22

ւ

≺⊙ ⊙ ց

negative.
Overall, excluding our measure of permanent income from our wealth holding and
allocation equations leads us to find nearly identical coefficients and average partial effects as
estimates from models that include permanent income. Removing permanent income, we see the
estimated impacts of subjective survival on wealth holdings and asset allocation move away
from zero. This suggests that the permanent income is likely weakly positively correlated to the
reported subjective survival probabilities. Controlling for a measure of expected lifetime
resources appears to have a negligible effect on the estimated impact of reported survival
probabilities. It is interesting though to note that the inclusion of a measure of permanent income
shrinks the estimated impacts for survival probabilities to zero.
Bunching at focal points has been a concern of the subjective survival probabilities in the
HRS since Hurd and McGarry (1995). The worry is that focal point responses may reflect either
difference in cognition (Lillard and Willis (2001)) or measurement error (Bloom et al. (2006)).
We provide evidence that the relationship between wealth holdings and asset allocation in the
raw data appears to support theoretical predictions; average wealth holding and allocation to the
risky asset increase as subjective survival probabilities increase, with the exception of
households that report survival probabilities of one. Bloom et al. (2006) points out that
households that report survival probabilities of one appear to have mortality rates similar to those
people responding with survival probabilities closer to 0.7 or 0.8.
To assess the impact of these focal points on our estimates we treat focal point responses
of one differently. First, we estimate our wealth holding and asset allocation equations dropping

23

ւ

≺⊙ ⊙ ց

observations where the respondent reports a survival probability of one.

11

Second, recognizing

that a focal point response may reflect some underlying time-constant heterogeneity of the
household, we estimate our models dropping the all observations for the household if any
respondent has ever reported a survival probability of one.
For single households, excluding observations where the respondent reported a survival
probability of one leads us to estimate a larger impact of subjective survival probabilities on
wealth holdings. Despite increases in the standard errors from reducing the number of
observations we can see that we are estimating that a one-percentage point difference in
subjective survival probability translates into $394 more Net Worth and $182 more Financial
12

Wealth . If we drop households that ever reported a survival probability of one our estimate of
the impact of a one-percentage point difference in subjective survival on Net Worth is $466 and
$143 on Financial Wealth.
More interesting is the impact that dropping focal point responses has on the estimated
impact of subjective survival probabilities for asset allocation. When we drop observations
where the respondent gives a survival probability of one we estimate that a one-percentage point
difference in subjective survival increases the allocation of financial wealth to stocks by 0.032 %
and decreases allocation to checking by 0.035%; both estimates are statistically significant at the
5% level despite the increases in standard error due to fewer observations. We see an even larger
impact of removing households that ever reported a survival probability of one: we estimate that
a one-percentage point difference in survival probabilities leads to 0.05% more financial wealth
11

For married households, we drop the observation if either member of the household reports a
survival probability of one.
12
Though not statistically significant, this is $13,837 more Net Worth and $5,877 more
Financial Wealth than estimates including observations with reported survival probabilities of
one.
24

ւ

≺⊙ ⊙ ց

allocated to stocks and 0.047% less allocated to checking; both estimates are significant at the
5% level.
Treating reported survival probabilities of one differently leads to significantly different
estimates of the impact of subjective survival on wealth holdings and asset allocation for married
households. Removing observations where either the husband or the wife reported a survival
probability of one leads us to estimate that a one-percentage point difference in the husband’s
survival probability results in only $57 more Net Worth and $5 less Financial Wealth (both
estimates are statistically insignificant). A one-percentage point difference in the subjective
survival probability of the wife leads to an estimated $698 (significant at the 5% level) difference
in Net Worth and $129 difference in Financial Wealth. Removing focal point responses causes
the estimate for the wife’s subjective survival to increase for wealth holding equations and the
estimate for the husband’s subjective survival to decrease for both wealth holding equations.
Removing households where either spouse ever reported a survival probability of one
leads us to estimate that a one-percentage point difference in the husband’s subjective survival
probability results in $171 less Net Worth and $14 more Financial Wealth. A one-percentage
point difference in the wife’s subjective survival probability leads to $497 more Net Worth and
$215 more Financial Wealth. Interestingly the estimated impacts of subjective survival
probabilities are statistically insignificant once we remove focal point households.
When we remove observations where either spouse reported a survival probability of one
we see that a one-percentage point difference in the husband’s survival probability results in
0.035% more financial wealth allocated to stocks (significant at the 5% level), 0.021% less
allocated to CDs (significant at the 5% level) and 0.017% less in checking (statistically
insignificant). A one-percentage point difference in the wife’s survival probability results in

25

ւ

≺⊙ ⊙ ց

0.032% more financial wealth allocated to stocks (significant at the 5% level), and 0.026% less
allocated to CDs (significant at the 10% level). When we drop households that ever report a
survival probability of one we estimate that a one-percentage point difference in the husband’s
subjective survival leads us to find a statistically significant (5% level) increase of 0.012% of
financial wealth to bonds. Perhaps more interesting is the fact that we now estimate that a onepercentage point difference in subjective survival for the wife leads the household to hold 0.06%
more financial wealth in stocks and 0.036% less in CDs, while still small in magnitude they are
considerably larger than our estimates that include all households.
It is apparent that the estimated impact of subjective survival probabilities on wealth
holding and asset allocation are sensitive to the inclusion of reported survival probabilities of
one. Our sensitivity analysis suggests that households that ever report a survival probability of
one act very differently than the rest of our sample. Removing all observations for households
that ever report a survival probability of one leads us to find that single households tend to
increase their Net Worth more than their Financial Wealth in response to an increase in longevity
risk. We also find that single households increase their allocation of financial wealth to stocks
and decrease their allocation to checking. For married households we find that once we remove
the focal point households the wife’s survival probabilities becomes more important in the
wealth holding and asset allocation decisions. While the estimates of the impact on wealth
13

holding of wife’s survival probabilities are not statistically significant at standard levels , they
do coincide well with the allocation equation findings that the wife’s survival leads to a
significant positive impact on stock allocation. In fact, despite the lack of statistical significance
the estimated impact of the wife’s subjective survival probability is not much different than the
13

The estimated impact for Net Worth is significant at the 14% level and for Financial Wealth at
the 12% level.
26

ւ

≺⊙ ⊙ ց

estimated impacts for the single household’s survival probability.
It appears that including households that ever report survival probabilities of one has a
significant impact on the estimated impacts of subjective survival on wealth holding and asset
allocation. Excluding these “focal point” households leads us to find estimated impacts that seem
reasonable from a theoretical perspective and are generally statistically significant, despite
increased standard errors due to smaller sample sizes. While there is still likely significant
measurement error present in the subjective probabilities, we have identified that at a minimum
the inclusion of households that report survival probabilities of one greatly affects estimates of
wealth holding and allocation equations.
1.5 Conclusion
Theory points to the need to include estimates of an individual’s subjective survival
probability in estimating models of economic decision-making. Several studies use the subjective
survival probabilities in the HRS to explain economic behavior with mixed results. We present
here a very simple analysis that attempts to explain differences in wealth holding and asset
allocation behavior using differences in subjective survival probabilities.
We find that there does appear to be a small, statistically significant impact of subjective
survival probabilities on wealth holding (Net Worth, particularly) and asset allocation (stocks, in
general). We find that our results for subjective survival probabilities are insensitive to the
inclusion of a measure of permanent income, though we do see that including a measure of
permanent income causes the estimated impact of subjective survival probabilities to move
towards zero. If we exclude our measure of permanent income, we find that our estimated impact
of subjective survival increases (but not by the total impact of permanent income) and the
estimated impact on asset allocation moves away from zero. We also find that inclusion of

27

ւ

≺⊙ ⊙ ց

households that ever report a survival probability of one has a significant detrimental impact on
the estimated impact of subjective survival for asset allocation equations for single households
and for wife’s in married households. It appears that longevity risk plays a statistically
significant role for single households, which contradicts the findings of Bloom et al. (2006).
By no means does our study attempt to contradict the hypothesis that there is
measurement error in these subjective survival probabilities. In fact, it appears that bunching
does represent a form of measurement error but it may be distinctly different (or additive) to the
measurement error that is most likely present in all the subjective survival responses. Elder
(2012) finds that life table probabilities are better predictors of in-sample mortality and that
subjective survival probabilities probably reflect more measurement error than they reflect
heterogeneity in mortality expectations of HRS respondents. These findings raise questions about
both the validity of these subjective survival probabilities as proxies for mortality risk and how
we can interpret the estimated relationships between these measures and decision-making. Our
results suggest that controlling for other factors that can affect a household’s wealth holding and
allocation decisions (such as permanent income) almost completely removes the correlation with
subjective survival that we see in the raw data. In addition, we find that excluding households
that ever report a survival probability of one significantly affects our findings. We are not sure
what these focal point responses mean in terms of mortality or if they represent heterogeneity in
cognition but it does suggest that we are probably not able to interpret these probabilities as
reflecting actual mortality risk.
Future avenues for research would be to try different specifications for married
households, particularly around different types of objective functions that combine attributes of
the spouses differently. In addition, including the cohort survival probability module as a

28

ւ

≺⊙ ⊙ ց

standard part of the HRS expectations survey would help to tease out measurement error and the
learning process that respondents go through when estimating their expectations. It might also be
beneficial to look into the impact of bequest intentions as they may play a large role for some
households compared to others.

29

ւ

≺⊙ ⊙ ց
CHAPTER 2

A MODEL FOR MULTIVARIATE FRACTIONAL RESPONSES WITH AN APPLICATION
TO ASSET ALLOCATION
2.1 Introduction
Many interesting economic variables are fractional in nature. By definition, some
fractional response variables are related, such as the budget share of goods in a demand system
or the allocation of wealth across available assets. Since these fractional responses are shares of a
total, they must satisfy an adding up restriction. Due to the limited nature of the fractional
responses we are often interested in estimating relationships using nonlinear conditional mean
functions. Analogous to the case where we estimate linear conditional means, the adding up
restriction imposes constraints on the marginal effects of the covariates in our model. Since the
marginal effects of covariates in a nonlinear conditional mean are themselves nonlinear, this
causes some difficulty in estimation as it amounts to imposing nonlinear constraints on our
coefficients. We get around this issue by imposing the adding up restriction using the
multinomial quasi-likelihood.
Sivakumar and Bhat (2002) and Mullahy (2010) show how this approach works when
using cross-sectional data. In fact, this approach extends very naturally to the panel data case,
even if we specify a time-invariant unobserved effect for each cross-sectional unit. We can
appeal to the Correlated Random Effects approach to estimation (Chamberlain (1980) and
Mundlak (1978)) and specify that the conditional mean of the unobserved effect is a parametric
function of the time-averages of the independent variables. As pointed out in Wooldridge (2010),
this approach is only appropriate when we have a balanced panel. With an unbalanced panel, at a
minimum, we violate the assumption that the unobserved effect has a constant variance across

30

ւ

≺⊙ ⊙ ց

cross-section units, since we have a different number of observations for each unit. We address
this concern by exploiting the independence of our cross-section units. To do this we estimate the
multinomial quasi-likelihood on balanced panel subsets of the full, unbalanced panel. We create
the balanced panels by combining households that are observed in the sample for the same
number of time periods. Since the households in different balanced panel subsets are independent
of each other we are able to take the weighted average of the average partial effects (and their
variances) estimated in each balanced panel subset to get the average partial effect for the entire
unbalanced panel.
This approach ignores the potential serial correlation present within the cross-sectional
unit. A natural next step is to correct for this serial correlation by using a weighting matrix that
accounts for serial correlation. To do this we use Classical Minimum Distance estimation.
Simply put, this method takes the estimated coefficients from the multinomial quasi-likelihood
estimation on each balanced panel subset of data and estimates a single coefficient for each
covariate that is a weighted average of all the balanced panel subset coefficients. The weighting
matrix that we construct contains the estimated variance-covariance matrices from the
multinomial quasi-likelihood estimated on each balanced panel subset and estimates of the serial
correlation within and across equations over time.
This paper takes the single equation fractional probit technique laid out in Papke and
Wooldridge (2008) and extends it to cover multiple fractional responses that must meet an
adding up restriction. In addition, we estimate an unbalanced panel using the Correlated Random
Effects framework by exploiting the independence of our cross-sectional units. Lastly, we also
show how we can apply Classical Minimum Distance to this procedure to potentially derive
more efficient estimates. Section 2 briefly covers some of the work done on fractional response

31

ւ

≺⊙ ⊙ ց

variables. Section 3 presents the single equation specification of our multiple fractional
responses. Section 4 combines the single equations together using the multinomial quasilikelihood. Section 5 lays out the Classical Minimum Distance estimator. Section 6 describes an
alternative estimator that may be more parsimonious. Section 7 describes the data that we use in
our analysis and presents results of estimation and section 8 concludes.
2.2 Fractional Response Variables
Gourieroux, Monfort, and Trognon (1984) summarizes the theory underlying the quasimaximum likelihood estimator. They show that if a random variable comes from the linear
exponential family and we correctly specify the conditional mean, then estimation by quasimaximum likelihood generates consistent, though potentially inefficient, estimates for the
impacts of our covariates. This finding holds under the general conditions that we require for MEstimation. Under the standard assumptions, the quasi-maximum likelihood estimators are
asymptotically normal and converge at the rate of N −1/ 2 .
Papke and Wooldridge (1996) estimate a single fractional response variable in the crosssection using the logit functional form for the conditional mean function. Papke and Wooldridge
(2008) estimate a single equation fractional response variable in the panel data context using the
probit functional form for the conditional mean. They insert the time-averages of their
coefficients in their conditional mean function, thus parametrically “removing” the unobserved
effect. They also explore the possibility of improving efficiency by constructing a “working”
correlation matrix that allows for some correction of the serial correlation within cross-sectional
units. This technique essentially follows the logic that even if we incorrectly specify the
correlation structure that we can gain some efficiency by not ignoring its presence.

32

ւ

≺⊙ ⊙ ց
Sivakumar and Bhat (2002), Mullahy (2010) and Koch (2010) study the multivariate

fractional response case in cross-sectional data. They follow a similar approach that we take in
this paper and exploit the robustness properties of the multinomial quasi-likelihood. They specify
their conditional means using the multinomial logit specification that imposes the adding up
restriction across equations. In this paper, we use the probit specification of the conditional mean
functions since it more easily accommodates the use of the Chamberlain-Mundlak device to
control for the presence of an unobserved effect.
2.3 Single Equation Fractional Probit
A natural starting point for our analysis is to ignore the relationship between our
fractional response variables and proceed as though we will estimate each equation separately.
We do this for two reasons; first our procedure essentially takes our single equation framework
and extends it to the multiple fractional response case. Second, the single equation framework
gives us a baseline set of estimates to compare to the results of the procedure that we present
here.
Assume that at each time period t , we observe household i allocate a proportion of their
financial wealth to each asset g . We assume that g = 1, 2,..., G exhausts all possible investment
options so that in each period the household invests all their financial wealth. This assumption
implies that

G

∑ g yitg = 1 for each i = 1, 2,..., N

and t = 1, 2,..., T . In addition to our response

vector, we observe a set of covariates xit and an unobserved effect cig . Our goal is to estimate a
model of the conditional mean of y given x to determine the impact of each covariate. By
construction, yitg is bounded between zero and one. Assuming that we use the same set of
covariates for each equation we can specify the conditional mean as:

33

ւ

≺⊙ ⊙ ց

E ( yitg | xit , cig ) = Φ(xit β g + cig ); i = 1,..., N ; t = 1,..., T ; and g = 1,..., G (2.3.1)
We point out that the above specification completely ignores the fact that these fractional
responses sum to one. It is simply acting as though we are treating each asset equation by itself.
Nevertheless, it is useful to study what we can derive from these single equation models since
our approach here essentially takes the specification in (2.3.1) for each equation and uses the
multinomial quasi-likelihood to impose the adding up constraint.
Our interest lies in estimating the average partial effects of the covariates, xit on the
proportion of financial wealth allocated to each asset. Following the results for a probit model
the direction of the partial effect is determined by the sign of β g . For a continuous variable xtk
(dropping the i subscript);
∂E ( ytg | xt , c g )
(2.3.2)
= β gk φ ( xt β g + c g )
∂xtk
To obtain the average partial effect we can average this over the distribution of cg and xt .
To consistently estimate the coefficients in this single equation framework we require
that the covariates are not correlated with the error term. We assume that xit is strictly
exogenous conditional on the unobserved effect,

E ( yitg | xi , cig ) = E ( yitg | xit , cig ).

(2.3.3)

Following Chamberlain (1980) and Mundlak (1978) we assume that
2
cig = ψ g + xi ξ g + aig ; where aig | xi ~ Normal (0, σ a ).

(2.3.4)

Using (2.3.4) we can rewrite (2.3.1) as:

E ( yitg | xi , aig ) = Φ(ψ g + xit β g + xi ξ g + aig ).
So the mean of yitg conditional on xi is:

34

(2.3.5)

ւ

≺⊙ ⊙ ց

ψ + x β + x ξ
g
it g
i g
E ( yitg | xi ) = E (Φ (ψ g + xit β g + xi ξ g + aig | xi ) = Φ 
 (1 + σ 2 )−1/ 2
ag



 (2.3.6)



or

(

)

E ( yitg | xi ) = E (Φ (ψ g + xit β g + xi ξ g + aig | xi )) = Φ ψ ga + xit β ga + xi ξ ga (2.3.7)
where we use the a subscript to denote that our estimated coefficients are scaled by
2
(1 + σ ag ) −1/ 2 . Woodridge (2002, Section 15.8.2) shows that the mixing properties of the

normal distribution lead to equation (2.3.7).
For identification of the scaled coefficients in (2.3.7) we require that there is no perfect
collinearity between the elements of xit and that there is enough variation in xit over time.
With the above parametric specification of the unobserved effect we can now write the Average
Structural Function (ASF) following Blundell and Powell (2003):

(

)

ASFg (xt ) = Ex  Φ ψ ga + xt β ga + xiξ ga 
i


(2.3.8)

A consistent estimator of ASFg ( xt ) is:

N
−1 Φ ψ + x β + x ξ
ˆ
ˆ
ˆ ga
ASF g (xt ) = N ∑
t ga i ga .
i =1

(

)

(2.3.9)

ˆ
ˆ
ˆ
where ψ ga , β ga , and ξ ga are consistent estimates of the scaled coefficients in equation (2.3.7).

For a continuous variable x j we can calculate the average partial effect by taking the derivative
of (2.3.9) and average over both i and t :

APE g ( x j ) =

ˆ
ˆ
ˆ
∂Φ(ψ ga + xit β ga + xi ξ ga )
∂x j

= NT −1

T N

ˆ
ˆ
∑ ∑ βˆga, jφ (ψˆ ga + xit β ga + xiξ ga )

t =1i =1

(2.3.10)
35

ւ

≺⊙ ⊙ ց

For a binary x j we can calculate the average partial effect by calculating the difference in the
Average Structural Function evaluated at zero and one.
The above presentation closely mirrors that of Papke and Wooldridge (2008) for the
single equation fractional probit model. We laid out the basic specification of the conditional
mean functions for each of our asset equations. We can use the individual conditional mean
functions and the previous results to estimate each equation assuming that there is no correlation
between equations. Estimation of the individual equation conditional means would entail
maximizing the Bernoulli quasi-likelihood function. While this works, it ignores the fact that
there is a definite relationship between the fractional responses, i.e. that they sum to one for each
household in each time period.
Analogous to the linear case, imposing the adding up restriction places a constraint on the
marginal effects of the covariates. The adding up restriction states that

G

∑ g =1 yitg = 1 . Taking

expectations of this conditional on xit and cig gives us the following restriction:

E ( yitg | xit , cig ) = 1.

(2.3.11)

Taking the derivative of (2.3.11) with respect to xk :

G ∂E ( y | x , c ) G
itg it ig
= ∑ β gk φ (xit β g + cig )
∑
∂xk
g =1
g =1
=

(2.3.12)

G

∑ β gk φ (ψ g + xit β g + xiξ g ) = 0.

g =1

Equation (2.3.12) shows that the adding up restriction implies that the sum of the marginal
effects of a specific covariate across equations is zero. Therefore, the effect of a change in a
covariate will lead to a reallocation of financial wealth across the available assets.
36

ւ

≺⊙ ⊙ ց
A very simple way to impose the adding up restriction in estimation is to exploit the

multinomial quasi-likelihood function.
2.4 The Multinomial Quasi-Likelihood
We use the multinomial quasi-likelihood function to allow for correlation across
equations and impose the restriction that our dependent variables must sum to one in each period
for each household. We assume our fractional responses ( yit1, yit 2 ,..., yitG ) are exhaustive and
mutually exclusive categories representing the population and that yitg ∈ [0,1] and

G

∑ g =1 yitg = 1 . Let xit

be a set of covariates that affect our quantity of interest, E ( yitg | xi ) .

Assume that

E ( yitg | xi ) = Φ(ψ o,t , g ,a + xit βo,t , g ,a + xi ξo,t , g ,a ) = Φ(xit , θo,t , g ,a )

(2.4.1)

For identification of the parameters the multinomial quasi-likelihood function requires that the
following constraint holds:
Φ (xit , θo,t ,G , a ) = E ( yitG | xi )
= 1 − Φ (xit , θo,t ,1, a ) − Φ (xit , θo,t ,2, a ) − ⋅⋅⋅ − Φ (xit , θo,t ,G −1, a ).

(2.4.2)

So that for any θt = {θt ,1,a , θt ,2,a ,..., θt ,G −1,a } ,

G

∑ Φ g (xit , θt ) = 1.

g =1

Assuming that the conditional mean functions are continuously differentiable in θt the adding up
restriction implies
∇θ ΦG (xit , θt ) = −∇θ Φ1(xit , θt ) − ∇θ Φ 2 (xit , θt ) − ⋅⋅⋅ − ∇θ ΦG −1(xit , θt ).
t
t
t
t

37

ւ

≺⊙ ⊙ ց
Assuming that we have a random draw i the multinomial quasi-likelihood equation for

each observation is:

ℓ it (θt ) = yit1 log[Φ1(xit , θt )] + yit 2 log[Φ 2 (xit , θt )] + ⋅⋅⋅ + yitG log[ΦG (xit , θt )]. (2.4.3)

14

The score function is

sit (θt ) =

yit1
yit 2
∇θ Φ1(xit , θt )′ +
∇θ Φ 2 (xit , θt )′ + ⋅⋅⋅
t
Φ1(xit , θt )
Φ 2 (xit , θt ) t

(2.4.4)

yitG
+
∇θ ΦG (xit , θt )′
ΦG (xit , θt ) t
Assuming the correct specification of the conditional mean
E (sit (θt ) | xi ) =
+

E ( yit1 | xi )
E ( yit 2 | xi )
∇θ Φ1(xit , θt )′ +
∇θ Φ 2 (xit , θt )′ + ⋅⋅⋅
Φ1(xit , θt ) t
Φ 2 (xit , θt ) t

E ( yitG | xi )
∇θ ΦG (xit , θt )′
ΦG (xit , θt ) t

= ∇θ Φ1(xit , θt )′ + ∇θ Φ 2 (xit , θt )′ + ⋅⋅⋅ + ∇θ ΦG (xit , θt )′
t
t
t
= 0.
Let Ht (xi , y i , θt ) be the Hessian, it can be shown that

− E[Ht (xit , yit , θo,t ) | xi ] =
+

∇θ Φ1(xit , θo,t )′∇θ Φ1(xit , θo,t ) ∇θ Φ 2 (xit , θo,t )′∇θ Φ 2 (xit , θo,t )
t
t
t
+ t
+ ⋅⋅⋅
Φ1(xit , θo,t )
Φ 2 (xit , θo,t )

∇θ ΦG (xit , θo,t )′∇θ ΦG (xit , θo,t )
t
t
ΦG (xit , θo,t )

and is consistently estimated by

14

We point out that using the Normal cumulative distribution function does not explicitly
restrict the predicted values of the “omitted” equation from falling outside of [0,1]. Using the
logarithm function in the quasi log-likelihood implicitly imposes this restriction. In our
application, we do not see this as a cause for concern as all predicted values of the “omitted”
equation fall within [0,1], though this may be a concern where there is a greater frequency of
zeros and ones in the response variable.
38

ւ

≺⊙ ⊙ ց

N
ˆ = N −1 ∇ Φ(x , θ )′W (x , θ )∇ Φ(x , θ )
At
∑ θt it ˆ t t it ˆ t θt it ˆ t
i =1
where

ˆ
 ∇θ Φ1(xit , θt ) 
t


ˆ
 ∇ Φ (x , θ ) 
θt 2 it t 
ˆ
∇θ Φ(xit , θt ) = 
t


⋮


 ∇ Φ (x , θ ) 
ˆ 
 θ G it t
 t

and

1

 Φ (x , θ )
ˆ
 1 it t


0
ˆ )=
Wt (xit , θt 

⋮



0







1

⋱
⋮
ˆ
.
Φ 2 (xit , θt )


⋱
⋱
0


1

…
0
ˆ 
ΦG (xit , θt ) 
0

⋯

0

Define

N
ˆ
ˆ
ˆ
Bt = N −1 ∑ sit (θt )sit (θt )′.
i =1
Then an estimator of the variance matrix that only relies on the correct specification of the
conditional means is given by

−1
N
 N

ˆ− ˆ ˆ−
ˆ
ˆ
ˆ
ˆ
ˆ
At 1Bt At 1 / N =  ∑ ∇θ Φ(xit , θt )′Wt (xit , θt )∇θ Φ(xit , θt )   ∑ sit (θt )sit (θt )′ 
t
t

 

 i =1
  i =1

N

ˆ
ˆ
ˆ
 ∑ ∇θ Φ(xit , θt )′Wt (xit , θt )∇θ Φ(xit , θt ) 
⋅
t
t


 i =1

39

−1
.

(2.4.5)

ւ

≺⊙ ⊙ ց
In the above presentation we laid out the basic approach to estimating the multinomial

quasi-likelihood for each time period in a balanced panel. With a few minor changes we can
rewrite the above equations to estimate a pooled version of the multinomial quasi-likelihood
where we do not allow the estimated parameter vector to vary by time period. We can write the
partial multinomial quasi-likelihood function as

ℓ i (θ) =

T

∑ yit1 log[Φ1(xit , θ)] + yit 2 log[Φ 2 (xit , θ)] + ⋅⋅⋅ + yitG log[ΦG (xit , θ)].

t =1

Then the score function is
yit1
yit 2


 Φ (x , θ ) ∇θ Φ1(xit , θ )′ + Φ (x , θ ) ∇θΦ 2 (xit , θ )′ + ⋅⋅⋅ 
T
T
2 it
 1 it

si (θ) = ∑ sit (θ) = ∑ 

yitG
t =1
t =1

⋅⋅⋅ +
∇θΦG (xit , θ )′


ΦG (xit , θ )


and the negative of the expected Hessian is

 ∇θ Φ1(xit , θo )′∇θ Φ1(xit , θo ) ∇θΦ 2 (xit , θo )′∇θ Φ 2 (xit , θo )

+
+ ⋅⋅⋅ 
T 
Φ1(xit , θo )
Φ 2 (xit , θo )


− E[ H ( xi , y i , θ o ) | xi ] = ∑ 
.
∇θ ΦG (xit , θo )′∇θ ΦG (xit , θo )
t =1

⋅⋅⋅ +


ΦG (xit , θo )



The consistent estimator of the Hessian is

N T
ˆ = NT −1
ˆ
ˆ
ˆ
A
∑ ∑ ∇θΦ(xit , θ)′W(xit , θ)∇θΦ(xit , θ)
i =1t =1
where

40

ւ

≺⊙ ⊙ ց
ˆ
 ∇θ Φ1(xit , θ) 


ˆ
 ∇ Φ ( x , θ) 
θ 2 it
ˆ

∇θΦ(xit , θ) = 


⋮


 ∇ Φ ( x , θ) 
ˆ
 θ G it 

and

1

 Φ (x , θ )
ˆ
 1 it


0
ˆ) = 
W(xit , θ 

⋮



0







1

⋱
⋮
ˆ)
.
Φ 2 (xit , θ


⋱
⋱
0


1

…
0
ˆ 
ΦG (xit , θ ) 
0

⋯

0

Define

N T
ˆ
ˆ
ˆ
B = NT −1 ∑ ∑ sit (θ)sit (θ)′.
i =1t =1
Then a consistent estimator of the variance matrix that only relies on the correct specification of
the conditional means is

−1 N T
N T



ˆ ˆˆ
ˆ
ˆ
ˆ
ˆ
ˆ
A −1BA −1 / N =  ∑ ∑ ∇θΦ(xit , θ )′W(xit , θ )∇θΦ(xit , θ )  ⋅  ∑ ∑ sit (θ )sit (θ )′  ⋅




 i =1t =1

 i =1t =1

−1
N T

ˆ
ˆ
ˆ
⋅  ∑ ∑ ∇θΦ(xit , θ)′W(xit , θ)∇θΦ(xit , θ)  .


 i =1t =1

2.5 Classical Minimum Distance Estimation
Classical Minimum Distance estimation (CMD) is similar to Generalized Method of
Moments and involves minimizing the Euclidean distance between a set of “reduced-form”

41

ւ

≺⊙ ⊙ ց

parameters and their structural counterparts. The basic exposition is as follows and can be found
in Wooldridge (2002, Section 14.6).
Suppose that we have a P × 1 vector of structural parameters θo related to πo a S ×1
(where S > P ) vector of reduced-form parameters such that:
πo = h(θo )

(2.5.1)

where h is a known, continuously differentiable function that maps the structural parameters to
the reduced-form parameters. To perform CMD estimation we first obtain estimates of the

ˆ
ˆ
reduced-form parameters, say π , and then find an estimator θ that minimizes the weighted
ˆ
ˆ
Euclidean distance between π and h(θ) . Analogous to GMM, we can use any weighting matrix

so long as it is positive semi-definite, such as an S × S identity matrix, but the optimal weighting
matrix is the one that makes the CMD estimator the minimum chi-square estimator.
The CMD estimator solves the following problem:
ˆ −1 ˆ
ˆ
min {π − h(θ)}′Ξ {π − h(θ)}
θ∈Θ

(2.5.2)

ˆ
ˆ
where plim N →∞ Ξ = Ξo . Assuming that π is a consistent estimator of πo and that
a
ˆ
ˆ
N ( π − πo ) ~ Normal(0, Ξo ) then Ξ−1 is the inverse of any consistent estimate of the
asymptotic variance of

ˆ
ˆ
N ( π − πo ) . The solution to the optimization problem in (2.5.2) is θ

ˆ
ˆ
which minimizes the weighted Euclidean distance between π and h(θ) it can be shown that

a
ˆ − θ ) ∼ Normal[0, H′ Ξ−1H ] where H = H (θ ) and H (θ) ≡ ∇ h(θ) the S × P
N (θ o
o o o
o
o
θ

ˆ
Jacobian of h(θ) . The appropriate estimator of Avar(θ)
ˆ
ˆ
ˆ
Avar(θ) ≡ ( H ′Ξ−1H ) −1 / N = ( H ′[Avar( π )]−1 H ) −1

42

(2.5.3)

ւ

≺⊙ ⊙ ց
If the mapping function h is linear then we can state the general form of the CMD

estimator as follows (Wooldridge, 2002, Problem 14.7):
ˆ
ˆ
ˆ ˆ
θ = ( H′Ξ−1H ) −1 H′Ξ−1π.

(2.5.4)

This implies that the optimal CMD estimator of the structural parameter vector is a weighted
average of the reduced form parameter vector where the weighting matrix is the estimated
asymptotic variance matrix of the reduced-form parameters.
The mapping function that we use in the CMD estimation step is indeed linear and
therefore we can use (2.5.4) to construct the estimates of the structural parameter vector. To
obtain our reduced-form parameter vector we maximize the multinomial quasi-likelihood for
each time period within each balanced panel subset. Using these estimates we can then construct
the weighting matrix that we will use in the CMD estimation. The weighting matrix will be a
block matrix where the estimated variance matrices from each time period are on the diagonal
and the off-diagonal block matrices contain estimates of the covariance within and across
equations over time.
Given the results in section 3 and the specifications of the conditional mean functions in
section 2 we define the diagonal elements of the weighting matrix as

−1
N
 N

−1B A −1 / N =  ∇ Φ(x , θ )′W (x , θ )∇ Φ(x , θ )   s (θ )s (θ )′ 
ˆ
At ˆ t ˆ t
∑ θt it ˆ t t it ˆ t θt it ˆ t   ∑ it ˆ t it ˆ t 

 i =1
  i =1

N

ˆ
ˆ
ˆ
⋅  ∑ ∇θ Φ(xit , θt )′Wt (xit , θt )∇θ Φ(xit , θt ) 
t
t


 i =1


−1
.

We construct the off-diagonal elements of the weighting matrix in a similar manner

43

(2.5.5)

ւ

≺⊙ ⊙ ց

−1
N
 N

ˆ− ˆ ˆ
ˆ
ˆ
ˆ
ˆ
ˆ
At 1Bts A −1 / N =  ∑ ∇θ Φ(xit , θt )′Wt (xit , θt )∇θ Φ(xit , θt )   ∑ sit (θt )sis (θ s )′ 
s
t
t

 

 i =1
  i =1

−1
N

ˆ
ˆ
ˆ
⋅  ∑ ∇θ Φ(xis , θ s )′Ws (xis , θ s )∇θ Φ(xis , θ s ) 
s
s


 i =1


(2.5.6)

for s ≠ t.
The weighting matrix that we use in CMD estimation is

 A −1B A −1 A −1B A −1
ˆ ˆ ˆ
ˆ ˆ ˆ
1 12 2
 1 11 1
 −1
ˆ ˆ ˆ
ˆ ˆ ˆ
 A B A −1 A −1B 22 A −1
2
2
ˆ =  2 21 1
Ω

⋮
⋮

 ˆ −1 ˆ ˆ −1 ˆ −1 ˆ
ˆ−
 A BT 1A
AT BT 2 A 2 1
 T
1

ˆ− ˆ ˆ−
A1 1B1T AT 1 


ˆ ˆ
ˆ−
⋯ A −1B 2T AT 1 
2
.

⋱
⋮

−1B A −1 
ˆ

⋯ AT ˆ TT ˆ T 
⋯

(2.5.7)

We calculate this weighting matrix for each balanced panel subset in our data. The mapping
function that we use is linear, it maps the T reduced-form coefficient vectors to the single
structural parameter vector for each balanced panel subset. Therefore, we can state that the CMD
ɶ
estimator β takes the following form

ɶ
ˆ
ˆ
β = ( H′Ω −1H ) −1 H ′Ω −1β

(2.5.8)

ˆ
ˆ
where Ω is defined above, β represents the T (G − 1) K × 1 ( K covariates, G − 1 equations, T

time periods in each balanced panel subset) vector of parameter estimates from the multinomial
quasi-likelihood maximization and H is a T (G − 1) K × (G − 1) K matrix that maps the reducedform parameter estimates to the structural parameter estimates:

44

ւ

≺⊙ ⊙ ց

 It =1
 (G −1) K

 It = 2
 (G −1) K
H=

⋮

 t =T
 I (G −1) K

where I













t= j
(G −1) K is a (G − 1) K × (G − 1) K identity matrix for each time period j .

After deriving the CMD estimates of the structural parameters in (2.5.8), we can calculate
ɶ
the average partial effects of our covariates as in Section 2 using β . Therefore, our proposed

procedure is
Procedure 1:
1. For each time period obtain the multinomial quasi-likelihood estimates of the
coefficients, the gradient vector, and the robust variance matrix.
2. Create the weighting matrix as in (2.5.7), and the matrix H
3. Obtain the MD coefficient estimates using (2.5.8) and the asymptotic variance matrix
of these estimated coefficients as in (2.5.3).
The above presentation necessarily assumes that we have a balanced panel; this is due to
the use of the Chamberlain-Mundlak device employed to remove the unobserved effect. If we
have an unbalanced panel then, at a minimum, our assumption of a common variance across
households, conditional on the number of time periods, is incorrect. Households in the panel for
fewer time periods will necessarily have a larger estimated variance for their time-averaged
observables. One solution to this problem is to drop observations so that we have a balanced
panel. While this works technically, it has the potential to cause problems that are even more

45

ւ

≺⊙ ⊙ ց

worrisome. Fortunately, there is a very simple way to allow us to use this procedure in the
context of an unbalanced panel.
Since we have independence within the cross-section, i.e., households are randomly
sampled and independent within each time period, we can simply subset our data based on the
number of times that we observe households and then average the estimated average partial
effects across these subsets. This entails creating balanced panels that contain households that are
observed for the same number of time periods. We estimate the multinomial quasi-likelihood on
each of these balanced panels by time period, followed by the minimum distance estimation to
account for serial correlation. We then calculate the average partial effects for each balanced
panel and average these across all the balanced panels.
We adapt Procedure 1 in the following way to allow for estimation on an unbalanced
panel:
Procedure 2:
1. Subset the panel into balanced panels which contain all households that have exactly j
observations, where j = 2,..., T .
2. For each time period in each balanced panel subset maximize the multinomial quasilikelihood obtaining the estimates of the coefficients, the gradient vector, and the robust
variance matrix.
3. Apply CMD estimation as detailed above to each subset of the data.
4. Calculate the average partial effects and their variances for each balanced panel subset
and then average these across all balanced panel subsets.
2.6 An Alternative Specification
The procedure presented above essentially takes single equation fractional probits and

46

ւ

≺⊙ ⊙ ց

combines them into a system of equations. While this is the logical first step in estimating
multiple fractional response variables, it assumes that there are G sources of heterogeneity. This
is a very general assumption and acts as though we have a different distribution for the
unobserved effect for each equation. We know that heterogeneity is at the household level, and
we want to control for these time-invariant unobserved characteristics. Suppose that what we
cannot measure is household risk aversion and that risk aversion is time-invariant but correlated
with our covariates. If we do not control for risk aversion then our estimated coefficients will be
inconsistent. Although risk aversion is constant over time and across assets, we anticipate that
the expected value of the impact of risk aversion will vary depending on the asset under
consideration. A very risk-averse household will allocate a smaller proportion of their wealth to
the risky asset and a larger proportion to less risky assets. Therefore, the expected value of the
unobserved effect should vary by equation. Our model allows for this in a very general way. One
concern is that the model that we propose is too general and leads to estimation of too many
parameters.
A potential solution is to specify a slightly different model where instead of estimating a
different parameter for each time average and intercept by equation we estimate a single
parameter vector for the time averages and then estimate parameters to allow the effect of the
time-averaged covariates to vary by equation. We can adapt the above model as follows:

E ( yitg | xit , ci ) = Φ(xit β g + δ g ci ).
Here we introduce an equation specific parameter δ g that allows the impact of the unobserved
effect to vary across equations. In addition, we make the assumption that there is a single source
of heterogeneity.

47

ւ

≺⊙ ⊙ ց
ci = Ψ + ξxi + ai
2
ai | xi ~ N (0, σ a )

Again, we assume strict exogeneity conditional on the unobserved effect:

E ( yitg | xi , ci ) = E ( yitg | xit , ci )
Plugging ci into E ( yitg | xit , ci ) gives us the mean conditional on observed and unobserved
factors:
E ( yitg | xit , ai ) = Φ (δ g Ψ + xit β g + δ g ξxi + δ g ai )
2 2
where δ g ai | xi ~ Normal (0, δ g σ a )
Therefore the conditional mean of yitg given xi is

δ Ψ + x β +δ ξx
g
it g
g i
E ( yitg | xi ) = Φ 
2 2

(1 + δ gσ a )1/ 2







This is nearly identical to equation (2.3.6) above, except that we have a different scale factor.
Regardless, we will still be able to derive the consistent estimates of the average partial effects.
We can estimate the above conditional means using the multinomial quasi-likelihood.
Since we cannot identify a parameter for each equation, we would need to normalize one say,

δG to one. We do not estimate this specification here, but we point out that in other applications
this can be considered since it is more parsimonious and may be more efficient.
2.7 Results
We use data from the Health and Retirement Study (HRS). The HRS began in 1992 and
was nationally representative of all non-institutionalized individuals aged 51-61 in that year. In
1998, the HRS combined with the Asset and Health Dynamics of the Oldest Old (AHEAD) and
added several new cohorts to be nationally representative of the population of non-

48

ւ

≺⊙ ⊙ ց

institutionalized individuals aged 51 and older. Both initial respondents and their spouses are
interviewed and followed in subsequent waves. We use data for all cohorts from Waves 1-7
(1992-2004) of the HRS. In addition to basic demographic variables, the HRS collects detailed
information on wealth holdings, allocation of wealth, income and its sources, health, and
measures of the probability that future events occur. We use subjective survival probability, age,
word recall, household income, and non-financial wealth as regressors in our model.
Our measure of subjective survival probabilities comes from responses to the following
question:
On a scale from 0 to 100, where 0 is no chance and 100 is absolutely certain, what are
the chances that you will live to age 75 or older?
Our measure of income is calculated as the sum of all non-capital income received by the
respondent and spouse during the year. To calculate wealth variables we use responses to
questions about the value of holdings within various financial assets. The assets that are
measured are stocks and/or mutual funds, bonds (corporate and government), Certificates of
Deposit (CDs) and checking, saving and/or money market accounts. We also have information
about other assets owned by the household including housing, vehicles, other real estate,
IRA/Keogh accounts, and trusts. We distinguish between two measures of wealth holdings. Nonfinancial wealth consists of the value of housing, real estate, IRA/Keogh accounts, vehicles, and
trust holdings, less any associated debt. Financial Wealth is the value of all holdings in
stocks/mutual funds, bonds, Certificates of Deposit and savings, checking and money market
accounts. HRS respondents are given a list of nouns and then asked to repeat this list
immediately and then again at the end of the cognition section. We use the proportion of words
recalled at the end of the section as a proxy for cognition.

49

ւ

≺⊙ ⊙ ց
We construct our data set by household. We combine spouses in each wave so that our

panel consists of household observations by year. To select individuals for our sample we drop
observations that satisfy the following criteria:
(i)

older than 65,

(ii)

missing values for subjective survival probability,

(iii)

is a proxy interview or

(iv)

is deceased.

We apply these criteria at the individual level so households that do not have both spouses are
dropped. The decision to allocate wealth across assets is conditional on holding positive financial
wealth; thus, we drop observations where financial wealth is zero. In addition, we drop
households that are only observed for one time period. Imposing the above criteria for married
households leaves us with 3,872 households (14,514 total observations).
In Table 13, we display means and medians of the relevant variables in our analysis. We
can see from Table 13 that the average allocation of wealth generally falls into two assets: stocks
and checking. There is some investment in CDs but very little in Bonds. There is some concern
regarding the time variation in the HRS in particular with subjective survival probability
measures.

15

Table 14 presents the proportion of the variation that is between households in our

dataset. We can see that the bulk of the variation in our variables is between households, but
there is some variation over time that we may be able to exploit.
As a first step, we estimate our allocation equations by linear fixed effects. In general
linear fixed effects coefficients should provide fairly good estimates of the average partial effects
and provide a reasonable baseline to compare other estimation techniques. The results of the
15

Smith et al. (2001) and Elder (2010) point to the lack of appropriate variation in the subjective
survival probabilities collected in the HRS.
50

ւ

≺⊙ ⊙ ց

linear fixed effects estimation are presented in Table 15.
We now turn to different methods of estimating the average partial effects of our
covariates using a nonlinear conditional mean specification. As a first pass, we know that we can
estimate each equation using a single equation nonlinear method. We follow Papke and
Wooldridge (2008) and use the procedure that they laid out for the estimation of a pooled
fractional probit. Since we have an unbalanced panel we cannot simply estimate the single
equation fractional probit on the entire panel. Instead we subset the unbalanced panel as
described above, maximize the Bernoulli quasi-likelihood for each balanced panel subset,
calculate the average partial effects and then average these across the balanced panel subsets.
The results from this estimation approach are presented in Table 16. With the exception of the
estimated average partial effects on Husband and Wife Word Recall, the estimated average
partial effects are similar to those estimated by linear fixed effects but we can see that estimating
using balanced panel subsets significantly increased our estimated standard errors so that none of
our covariates are statistically significant. The difference in the estimated average partial effects
for both Husband and Wife Word Recall may suggest that there is some nonlinearity in the
impact of this variable on the allocation of wealth
Next, we maximize the multinomial quasi-likelihood on each balanced panel subset,
calculate the average partial effects for each balanced panel subset, and then average these over
the balanced panel subsets. Table 17 contains the estimated average partial effects and the
standard errors from this approach. Overall, we can see that the estimated average partial effects
for all equations are very similar to those estimated by the single equation fractional probit
technique. Looking at the standard errors, we can see that they are nearly identical to the
fractional probit standard errors. It appears that imposing the adding up restriction does not affect

51

ւ

≺⊙ ⊙ ց

the estimates of either the average partial effects or the standard errors.
Table 18 presents the estimated average partial effects and standard errors from
Procedure 2 outlined above. Our main points of comparison here are Tables 15, 16 and 17, the
average partial effects estimated by linear fixed effects, maximizing the Bernoulli quasilikelihood and maximizing the multinomial quasi-likelihood on the balanced panel subsets.
Looking first at the estimated standard errors, we can see that they are smaller than those in both
Tables 16 and 17 but larger than those estimated by linear fixed effects. This suggests that
accounting for serial correlation within and between equations has led to more precise estimates
relative to the estimation that uses balanced panel subsets. In particular, the estimated standard
errors for the allocation of wealth to Bonds and CDs are almost half the size of the standard
errors estimated by maximizing the pooled multinomial quasi-likelihood.
Turning to the estimated average partial effects we can see that there are some differences
between the results in Table 18 compared to Tables 15, 16 and 17. Tables 19, 20 and 21 show the
p-values of testing the difference in estimated average partial effects across the different
estimation approaches. Tables 19 and 20 compare the estimated average partial effects of
Procedure 2 to the estimated partial effects from maximizing the Bernoulli quasi-likelihood for
each asset equation on balanced panel subsets and from maximizing the multinomial quasilikelihood on balanced panel subsets. Given the large standard errors from estimation on
balanced panel subsets with either the single equation or multiple equation approaches we only
see a statistically significant difference (at the 5% level) for the estimated average partial effect
of Log(Income) on the allocation of wealth to Bonds. Looking at Table 21, which compares
estimated average partial effects between Procedure 2 and linear fixed effects we can see that the
estimated average partial effect of Log(Income) on bonds is also statistically significantly

52

ւ

≺⊙ ⊙ ց

different than the linear fixed effect estimate. It is unclear whether we should expect a negative
correlation between income and bond holdings, since it is not certain what increasing income
levels mean for the riskiness of that income. Theoretically a more risky income stream would
lead to an increased incentive to allocate wealth to less risky investments, but this will depend
upon the correlation between the riskiness of the return from each asset and income. Since the
estimated average partial effects from linear fixed effects, maximizing the Bernoulli quasilikelihood for each equation and maximizing the multinomial quasi-likelihood are relatively
close to zero with large standard errors it seems plausible that the effect is negative but not
controlling for serial correlation leads us to estimate a very small impact from income.
Simply looking at the magnitude of the estimated average partial effects it appears that
the largest differences occur within the stock and checking equations. The estimated average
partial effects that are the most different are for Husband and Wife Age, Husband and Wife
Pr(Live to 75), and Husband and Wife Word Recall. While the impacts of these covariates have
changed for both Husband and Wife there are distinct differences between the two members of
the household. The average partial effect of a ten percentage point increase in survival
probability on the allocation of wealth to stocks increased nearly tenfold for the Husband, but
decreased twofold for the Wife. We can also see that the estimated average partial effect on
stocks from Husband Word Recall is about 0.003 percentage points greater in Table 18
compared to the linear fixed effects estimate in Table 15, but this is not the case when comparing
Wife Word Recall. Nevertheless, though the magnitudes of the estimated average partial effects
are larger from Procedure 2 we cannot reject the null hypothesis that they are statistically
equivalent to the estimates from maximizing the single equation Bernoulli quasi-likelihoods or
the multinomial quasi-likelihood.

53

ւ

≺⊙ ⊙ ց
Turning to Table 21, which compares the estimated average partial effects between linear

fixed effects and Procedure 2, the covariates with statistically significant different estimates (at
the 5% level) are Husband Age for stocks, Wife Pr(Live to 75) and Log(Income) for bonds and
Husband Age and Wife Age for checking. The differences in the estimated average partial
effects for Husband and Wife Age are not surprising given the fact that Procedure 2 is using the
between household variation from time averages within years of the balanced panel subsets to
estimate the impact of age. Perhaps more surprising is the fact that we now estimate a negative
average partial effect for Wife Pr(Live to 75) for bonds. Though this estimated average partial
effect from Procedure 2 is not statistically different from the linear fixed effect estimate it
suggests that the Wife may prefer less risky assets (and potentially more liquid in the case of
CDs and Checking) when facing an increased lifespan. While this is purely conjecture it may
suggest that there is an avenue for research into the different preferences of household members
against both the expected risk and return from assets and the liquidity of these assets.
There are several reasons that we would expect to see differences in the estimated
average partial effects from our Procedure 2. First, the two-step procedure maximizes the
multinomial quasi-likelihood for each time period within each balanced panel subset; this
technique exploits the variation in the differences from the time averages between households.
Second, Procedure 2 is weights the coefficients based on the serial correlation in the allocation of
wealth to each asset, the serial correlation in the covariates and the correlation of the covariates
across equations within each time period and also across time. Our procedure combines all of
these correlations into weights that are then applied to the coefficients from each time period
within the balanced panel subset. What is promising is that while we do see some statistically
different estimated average partial effects from Procedure 2 compared to maximizing single

54

ւ

≺⊙ ⊙ ց

equation Bernoulli quasi-likelihoods, maximizing multiple equation multinomial quasilikelihoods and single equation linear fixed effects the actual impact from these covariates all
point to relatively small effects on the allocation of wealth across these assets.
Overall it seems that our estimated impacts from the procedure we laid out here are
reasonably close to estimates from models that we know will generate consistent but inefficient
results. In addition, despite slightly different estimated average partial effects our proposed
procedure still estimates small impacts of these covariates on the allocation decision which is
consistent with the other approaches that we used.
One of the advantages of the method that we proposed here is that it allows the effect of
the covariates to affect our response variables in a non-linear way. Even though we have
estimated our multinomial QMLE on time periods within balanced panel subsets we can still
construct the Average Structural Function (ASF) for the entire sample:

N T
−1
ˆ
ˆ
ˆ
ASF (xt ) = N ∑ ∑ 1[Ti = r ]Φ Ψ r + xt β gr + xi ξ gr for r = 2,..., T .
i =1 r = 2

(

)

We plot ASF (xt ) for Husband Word Recall for the allocation of wealth to Stocks, Bonds, CDs,
and Checking in Figures 1 – 4 and for Wife Word Recall in Figures 5 – 8. As a point of
reference, we also plot the corresponding ASF from the linear fixed effects estimation, which are
straight lines because the coefficient estimates are constant over the entire range of the covariate.
From figure 1 we can see that there is some nonlinearity in the impact of Husband Word Recall
on stocks. It seems that the impact of remembering additional words increases slightly as more
words are recalled. Figure 2 displays the average structural function for bonds. There appears to
be a steep decline in the predicted proportion of wealth allocated to bonds as the number of
words recalled increases, but this impact dampens as more and more words are recalled. For the
55

ւ

≺⊙ ⊙ ց

allocation of wealth to CDs we see a relatively stable small impact of Husband Word Recall that
gets slightly larger as the number of words recalled increases. In figure 4 we can see that the
impact of Husband Word Recall on the proportion of wealth allocated to checking is initially
small, but increases as the number of words recalled increases.
Turning to Wife Word Recall we can see in Figure 5 that the average structural function
is fairly linear across the range of word recall. Figure 6 shows that there is some significant
nonlinearity in the impact of word recall on the allocation of wealth to bonds. It appears that
impact of recalling additional words increases once the wife has recalled greater than 50% of the
words given in the survey. Figure 7 shows that there is a slight dampening effect of recalling
additional words on the allocation of wealth to CDs, but that it sets in near the end of the range
of possible values. As we can see from Figure 8, there appears to be “diminishing returns” to the
impact of additional words recalled by the wife and it seems that this begins to set in slightly past
60% of the words recalled.
2.8 Conclusion
In this paper, we have presented a two-step estimation procedure for cases where we have
multiple fractional responses that must satisfy an adding up restriction. We leveraged the
properties of the multinomial quasi-likelihood and the normal cumulative distribution function to
account for the adding-up restriction and the restricted range of our multiple response variables.
In our application, we were able to exploit the random sampling of cross-sectional units to derive
consistent and more slightly more efficient estimates of the average partial effects of our
covariates on the allocation of financial wealth to various assets in an unbalanced panel while
still controlling for a time-invariant unobserved effect. Since the multinomial quasi-likelihood
function is a member of the linear exponential family we are able to achieve consistent results

56

ւ

≺⊙ ⊙ ց

under the assumption that we have correctly specified the conditional mean. Our second step
allows us to increase the efficiency of our estimates from using nonlinear methods on balanced
panel subsets of our data while still estimating average partial effects that are (for the most part)
not statistically different from these other approaches.
We also laid out an alternative estimation approach that makes a further assumption on
the impact of our time averages. This refinement is a clear next step along this line of research.
There is also much to be gained in applying the two-step procedure to other problems where we
have multivariate fractional responses that must satisfy an adding-up restriction. In particular, it
would be useful to see how well this estimation technique performs in balanced panels and also
in panels where there is more within variation in the covariates. It would also be fruitful to see
the impact that different assumptions about the serial correlation have on the estimates from the
CMD step. One assumption that is obvious and that it used in the Generalized Estimating
Equation literature is to assume that the correlation within units over time is exchangeable.
Papke and Wooldridge (2008) find that there is almost no difference between their estimates
when assuming an exchangeable correlation structure compared to a completely unstructured
correlation.

57

ւ

≺⊙ ⊙ ց
CHAPTER 3
ESTIMATION OF A MULTIVARIATE FRACTIONAL RESPONSE MODEL WITH
UNBALANCED PANEL DATA

3.1 Introduction
Estimation using panel datasets is more prevalent with the increase of available data on
repeated observations on micro-level units. With panel data we are careful to remove unobserved
effects by differencing the data or using fixed-effects. With a linear model this is straightforward
and allows for the unobserved effect to be correlated with observables and having a balanced
panel is not necessary. Once we begin to look at estimation of nonlinear models on panel data
this issue becomes a little more complicated.
Treating the unobserved effect as a parameter to estimate leads to inconsistent estimates,
referred to as the “incidental parameters problem”. In general it is assumed that the unobserved
effect is part of the conditional mean that we specify for our dependent variable. As this
unobserved effect is folded into a nonlinear function we are not able to remove it via differencing
or time-demeaning. We can avoid this problem either by ignoring it or using a random-effect
technique that assumes that the unobserved effect is not correlated with any observables. While
the random effects procedure provides a solution, the assumption that the unobserved effect is
uncorrelated with the independent variables can lead to inconsistent estimates if violated.
Another approach to controlling for the unobserved effect in panel data is through
Correlated Random Effects models. The approach makes parametric assumptions about the way
that the time-invariant effect is related to other independent variables in the conditional mean
model. The usual set up assumes that there is a linear relationship between the unobserved effect

58

ւ

≺⊙ ⊙ ց

and the time-averages of the regressors. This Chamberlain-Mundlak device is relatively simple
to implement assuming that we have a balanced panel.
In the context of an unbalanced panel care needs to be taken when employing the
Chamberlain-Mundlak device to control for unobserved effects. The standard assumption is that
the unobserved effect ci is distributed Normal (ψ + ξ xi , σ c ) . This assumption is not accurate if
some cross-section units have fewer observations than others since the estimated variance of the
unobserved effect will be larger for units that are observed fewer times. The most obvious
approach to remedy this problem is to simply choose the largest balanced panel, essentially
dropping cross-section units that we do not observe for a set number of periods. This approach is
dangerous for two reasons. First, reducing the number of observations decreases the accuracy of
our estimates. Second, even if we assume that selection is uncorrelated with the idiosyncratic
errors there is a potential for selection to be correlated with the observables and unobserved
heterogeneity which will lead to inconsistent estimates of their effects on our dependent variable.
Wooldridge (2010) presents a modification of the Correlated Random Effects approach that
allows selection to be correlated with the covariates and the unobserved effect. This modification
allows us to apply the Correlated Random Effects approach to be applied to unbalanced panel
data.
Wooldridge (2010) presents an example using a standard probit (or fractional probit)
model and suggests that we can extend this modification to multivariate fractional responses.
Sivakumar and Bhat (2002) and Mullahy (2010) use the multinomial quasi-likelihood to estimate
models of multivariate fractional responses that have an adding up restriction and estimate their
models using cross-sectional data. Extending the models to balanced panel data is relatively
straightforward as we can simply estimate pooled models on repeated cross-section observations.

59

ւ

≺⊙ ⊙ ց

Of course, we would want to make sure that we use clustered standard errors to make inference
robust. In a balanced panel we could use the Correlated Random Effects approach and control for
the unobserved heterogeneity by making parametric assumptions of the moments of its
distribution.
In this paper we combine the approach for multivariate fractional responses and the
modifications to the Correlated Random Effects approach to estimate models on multiple
fractional responses on cross-section units that are observed over time, but not for the same
number of time periods. We do this by combining the conditional mean specifications of each of
our fractional responses and estimate using the multinomial quasi-likelihood function which
explicitly imposes the adding up restriction of the multivariate fractional responses. We assume
that selection is uncorrelated with the idiosyncratic error but allow it to be correlated with the
unobserved effect and the covariates in our conditional mean.
The outline of the paper is as follows. Section 2 presents the single equation fractional
probit model discussed in Papke and Wooldridge (2008), section 3 extends the single equation
fractional probit to allow for selection following Wooldridge (2010), section 4 presents the
multinomial quasi-likelihood and details how we will estimate the multivariate fractional
response models on an unbalanced panel simultaneously, section 5 presents results from
estimation of this model comparing it to results from other estimation methods and section 6
concludes.
3.2. Single Equation Fractional Probit Models in a Balanced Panel
We begin by laying down the framework for estimating a model of portfolio allocation on
a balanced panel. We proceed in this manner since it makes obvious the modifications that we

60

ւ

≺⊙ ⊙ ց

need to make for the unbalanced panel case and demonstrates the impact that these modifications
will have on the estimation of the average partial effects.
Assume that at each time period t , we observe household i allocate a proportion of their
financial wealth to each asset g . We assume that g = 1, 2,..., G exhausts all possible investment
options so that in each period the household invests all their financial wealth. This assumption
implies that

G

∑ g yitg = 1 for each i = 1, 2,..., N

and t = 1, 2,..., T . In addition to our response

vector, we observe a set of covariates xit and an unobserved effect cig . Our goal is to estimate a
model of the conditional mean of y given x to determine the impact of each covariate. By
construction, yitg is bounded between zero and one. Assuming that we use the same set of
covariates for each equation we can specify the conditional mean as:

E ( yitg | xit , cig ) = Φ(xit β g + cig ); i = 1,..., N ; t = 1,..., T ; and g = 1,..., G (3.2.1)
We point out that the above specification completely ignores the fact that these fractional
responses sum to one. It is simply acting as though we are treating each asset equation by itself.
Nevertheless, it is useful to study what we can derive from these single equation models since
our approach here essentially takes the specification in (3.2.1) for each equation and uses the
multinomial quasi-likelihood to impose the adding up constraint.
Our interest lies in estimating the average partial effects of the covariates, xit on the
proportion of financial wealth allocated to each asset. Following the results for a probit model
the direction of the partial effect is determined by the sign of β g . For a continuous variable xtk
(dropping the i subscript);
∂E ( ytg | xt , c g )
= β gk φ (xt β g + cg )
∂xtk
61

(3.2.2)

ւ

≺⊙ ⊙ ց

To obtain the average partial effect we can average this over the distribution of cg and and xt .
To consistently estimate the coefficients in this single equation framework we require
that the covariates are not correlated with the error term. We assume that xit is strictly
exogenous conditional on the unobserved effect,

E ( yitg | xi , cig ) = E ( yitg | xit , cig ).

(3.2.3)

Following Chamberlain (1980) and Mundlak (1978) we assume that
2
cig = ψ g + xi ξ g + aig ; where aig | xi ~ Normal (0, σ a ).

(3.2.4)

Using (3.2.4) we can rewrite (3.2.1) as:

E ( yitg | xi , aig ) = Φ(ψ g + xit β g + xi ξ g + aig ).

(3.2.5)

So the mean of yitg conditional on xi is:

ψ + x β + x ξ
g
it g
i g
E ( yitg | xi ) = E (Φ (ψ g + xit β g + xi ξ g + aig | xi ) = Φ 
 (1 + σ 2 )−1/ 2
ag



 (3.2.6)



or

(

)

E ( yitg | xi ) = E (Φ (ψ g + xit β g + xi ξ g + aig | xi )) = Φ ψ ga + xit β ga + xi ξ ga (3.2.7)
where we use the a subscript to denote that our estimated coefficients are scaled by
2
(1 + σ ag ) −1/ 2 . Woodridge (2002, Section 15.8.2) shows that the mixing properties of the

normal distribution lead to equation (3.2.7).
For identification of the scaled coefficients in (3.2.7) we require that there is no perfect
collinearity between the elements of xit and that there is enough variation in xit over time.
With the above parametric specification of the unobserved effect we can now write the Average
Structural Function (ASF) following Blundell and Powell (2003):
62

ւ

≺⊙ ⊙ ց

(

)

ASFg (xt ) = Ex  Φ ψ ga + xt β ga + xiξ ga 
i


(3.2.8)

A consistent estimator of ASFg ( xt ) is:

N
ˆ
ˆ
ˆ
ASF g (xt ) = N −1 ∑ Φ ψ ga + xt β ga + xi ξ ga .
i =1

(

)

(3.2.9)

ˆ
ˆ
ˆ
where ψ ga , β ga , and ξ ga are consistent estimates of the scaled coefficients in equation (3.2.7).

For a continuous variable, x j we can calculate the average partial effect by taking the derivative
of (3.2.9) and average over both i and t :

APE g ( x j ) =

ˆ
ˆ
ˆ
∂Φ(ψ ga + xit β ga + xi ξ ga )
∂x j

T N
ˆ
ˆ
ˆ
ˆ
= NT −1 ∑ ∑ β ga, jφ (ψ ga + xit β ga + xi ξ ga )
t =1i =1
(3.2.10)

For a binary x j we can calculate the average partial effect by calculating the difference in the
Average Structural Function evaluated at zero and one.
3.3 Single Equation Fractional Probit Models in an Unbalanced Panel
Once we are dealing with an unbalanced panel we need to make explicit the fact that each
cross section unit in our sample is not observed for the full span of time. Let Ti represent the
number of periods that we observe each cross sectional unit. We also introduce a selection
indicator to index whether an observation is used in estimation. Let sit be an indicator that takes
on a value of one if the observation is used in estimation and zero otherwise. To get consistent
estimates of the average partial effects we need to make explicit the assumption that selection is
not correlated with the idiosyncratic error term. Therefore, in addition to the usual strict
exogeneity assumption we assume that selection is conditionally ignorable. We write this
assumption as follows:
63

ւ

≺⊙ ⊙ ց

E ( yitg | xi , cig , si ) = E ( yitg | xit , cig ) = Φ(xit β g + cig )

(3.3.1)

Now we need to specify a model for the distribution of the unobserved effect. In the typical case
this amounts to specifying that the expected value of the unobserved effect varies linearly with
the time averages of the observed covariates.
In an unbalanced panel, we need to specify that the mean and the variance of the
unobserved effect vary by Ti since intuitively the estimated moments will vary simply because
we are using a different number of observations to estimate the parameters of the distribution.
Following Wooldridge (2010), we define wi to represent functions of the covariates and
selection {( sit , sit xit : t = 1,..., T } . A straightforward extension of the standard ChamberlainMundlak device would be to allow E (cig | wi ) to vary by the number of time periods that we
16

observe our cross-section units :

E (cig | wi ) =

T

∑ ψ gr ⋅1[Ti = r ] + xiξ g

(3.3.2)

r =1

We allow the variance of cig to vary across the unbalanced panel through the following
specification:
 T −1

Var (cig | w i ) = exp  τ + ∑ 1[Ti = r ] ⋅ ω gr 


r =1



(3.3.3)

The specification in (3.3.3) allows the variance of the unobserved effect to vary with the number
of time periods that we observe a cross-section unit in our panel.

16

Wooldridge (2010) actually specifies that the conditional mean of the unobserved effect
includes the interaction between the indicator for number of time periods in sample and the time
averages of the covariates. We only include the indicator variables here due to non-convergence
of our likelihood functions in estimation.
64

ւ

≺⊙ ⊙ ց
Given equations (3.3.2) and (3.3.3) and assuming cig conditional on wi is distributed

normal we can rewrite the conditional mean for a given asset as

E ( yitg | xit , wi , sit = 1) =


 x β + T ψ ⋅1[T = r ] + x ξ 
it g ∑ r = 2 gr
i
i g 
Φ

1/ 2 
 1 + exp τ + ∑T −11[T = r ] ⋅ ω 


gr 
r =1 i







{

(3.3.4)

}

We can reparameterize equation (3.3.4) so that the denominator is unity if we have a balanced
panel:
E ( yitg | xit , w i , sit = 1) =



 x β + T ψ ⋅1[T = r ] + x ξ 
it g ∑ r =1 gr
i
i g
Φ
1/ 2 

T

 exp  ∑

 r = 21[Ti = r ] ⋅ ω gr 





(3.3.5)

Given the specification in (3.3.5) we follow Blundell and Powell (2003) and construct the
Average Structural Function as:

 
  x β + T ψ ⋅1[T = r ] + x ξ
t g ∑ r =1 gr
i
i g
ASFg (xt ) = Ex Φ 
i 
1/ 2
  exp  ∑T 1[Ti = r ] ⋅ ω gr 
 r =2

 












(3.3.6)

A consistent estimate of this is


T ˆ
 ˆ
ˆ
−1 Φ  xt β g + ∑ r =1ψ gr ⋅1[Ti = r ] + xi ξ g
ASF g (xt ) = N ∑
1/ 2


i =1  exp  T 1[T = r ] ⋅ ω 
ˆ gr 
∑ r =2 i



N

65








(3.3.7)

ւ

≺⊙ ⊙ ց

ˆ
ˆ
ˆ
ˆ
where β g , ψ gr , ξ g and ω gr are consistent estimates of the coefficients in (3.3.5). For a

continuous variable xk we can calculate the average partial effect by taking the derivative of
(3.3.7):





ˆ
ˆ
N  xt β g + ∑T ψ gr ⋅1[Ti = r ] + xi ξ g  
ˆ
r =1
ˆ 

β gk  N −1 ∑ φ 
1/ 2  



i =1  exp  T 1[T = r ] ⋅ ω 
ˆ gr 

 ∑ r =2 i







(3.3.8)

where φ [ i] represents the standard normal probability distribution function. Once again we can
ˆ
see that β gk determines the direction of the partial effect.

3.4 The Multinomial Quasi-Likelihood: Multivariate Fractional Responses
We now describe the multinomial quasi-likelihood function and detail the how estimation
proceeds given the specifications for the conditional mean and variance. Since an observation
can only have a log-likelihood value if it is included in estimation the following exposition
necessarily assumes that the likelihood is only defined for observations where sit = 1 .
The multinomial quasi-likelihood function for a random draw i in a specific time period
t is given by:
ℓ it (θ) = yi1t log [ m(xit , w i , θ1) ] + yi 2t log [ m(xit , wi , θ2 )] + ⋯
⋯ + yiGt log [ m(xit , wi , θG ) ]

Where

66

(3.4.1)

ւ

≺⊙ ⊙ ց


 x β + T ψ ⋅1[T = r ] + x ξ
it g ∑ r =1 gr
i
i g
m(xit , wi , θ g ) = Φ 
1/ 2


 exp  ∑T 1[Ti = r ] ⋅ ω gr 

 r =2





 . 17




For identification of the parameters the multinomial quasi-likelihood requires that the following
constraint hold:
m(xit , w i , θG ) = 1 − m(xit , w i , θ1) − m(xit , w i , θ 2 ) − ⋅⋅⋅ − m(xit , w i , θG −1) (3.4.2)
Since we will be pooling observations the partial quasi-likelihood function is

ℓ i (θ) =

T

∑ yi1t log [ m(xit , wi , θ1)] + yi 2t log [ m(xit , wi , θ2 )] + ⋯ + yiGt log [ m(xit , wi , θG )]

t =1

(3.4.3)
The score function is then
∇ m(xit , w i , θ 2 )′


∇ m(xit , w i , θ1)′
+ yit 2 ⋅ θ
+ ⋯
 yit1 ⋅ θ
T
T 
m(xit , w i , θ1)
m(xit , w i , θ 2 )

si (θ) = ∑ sit (θ ) = ∑ 
 (3.4.4)
∇ m(xit , w i , θG )′


t =1
t =1
θ
⋯ + yitG ⋅

m(xit , w i , θG )


Define Hi (xi , y i , θ) as the Hessian then

17

We note that using the Normal cumulative distribution function does not explicitly restrict the
predicted value of the “omitted” equation to fall in the unit interval, though it is implicitly
imposed by the logarithm function. In our application, the predicted values of the omitted
equation remain in the unit interval. We also estimate an approximate multinomial logit form of
the system which does explicitly impose the adding-up restriction of the equations, the results are
nearly identical.
67

ւ

≺⊙ ⊙ ց

 ∇ m(x , w , θ )′ ∇ m(x , w , θ )

θ
it i 1 θ
it i 1 + ⋯ 

m(xit , w i , θ1)




T  ∇ m(x , w , θ )′∇ m(x , w , θ )
it i 2 θ
it i 2 + ⋯ 
− E[Hi (xi , y i , θ | xi )] = ∑  θ

m(xit , w i , θ 2 )


t =1


⋯ + ∇θ m(xit , w i , θG )′∇θ m(xit , w i , θG ) 


m(xit , w i , θG )



(3.4.5)

this is consistently estimated by:

A = ( NT )−1

N T

ˆ
ˆ
ˆ
∑ ∑ µ(xit , wi , θ)′Wit (xit , wi , θ)µ(xit , wi , θ)

(3.4.6)

ˆ
 ∇θ m(xit , w i , θ1) 


ˆ
 ∇ m( x , w , θ ) 
θ
it i 2 
ˆ
µ(xit , w i , θ) = 


⋮


 ∇ m( x , w , θ ) 
ˆ
it i G 
 θ

(3.4.7)

i =1t =1
where

and

1

 m(x , w , θ )
ˆ
it i 1



0
ˆ) = 
Wit (xit , wi , θ 

⋮



0


ˆ
Define B =





1

⋯
0
ˆ )
 . (3.4.8)
m(xit , wi , θ2


⋮
⋱
⋮


1

0
⋯
ˆ 
m(xit , wi , θG ) 
0

N T

⋯

0

ˆ
ˆ
∑ ∑ sit (θ )sit (θ)′ , then we can construct the robust sandwich form of the

i =1t =1

asymptotic variance-covariance matrix as

68

ւ

≺⊙ ⊙ ց

N T

ˆ ˆˆ
ˆ
ˆ
ˆ
A −1BA −1 / N =  ∑ ∑ µ(xit , wi , θ)′Wit (xit , wi , θ)µ(xit , wi , θ) i


 i =1t =1

N T

ˆ
ˆ
∑ ∑ sit (θ )sit (θ)′i

(3.4.9)

i =1t =1

−1
N T

ˆ
ˆ
ˆ
 ∑ ∑ µ (xit , wi , θ)′Wit (xit , wi , θ)µ(xit , wi , θ) 


 i =1t =1

3.5 Results
We use data from the Health and Retirement Study (HRS). The HRS began in 1992 and
was nationally representative of all non-institutionalized individuals aged 51-61 in that year. In
1998, the HRS combined with the Asset and Health Dynamics of the Oldest Old (AHEAD) and
added several new cohorts to be nationally representative of the population of noninstitutionalized individuals aged 51 and older. Both initial respondents and their spouses are
interviewed and followed in subsequent waves. We use data for all cohorts from Waves 1-7
(1992-2004) of the HRS. In addition to basic demographic variables, the HRS collects detailed
information on wealth holdings, allocation of wealth, income and its sources, health, and
measures of the probability that future events occur. We use subjective survival probability, age,
word recall, household income, and non-financial wealth as regressors in our model.
Our measure of subjective survival probabilities comes from responses to the following
question:
On a scale from 0 to 100, where 0 is no chance and 100 is absolutely certain, what are
the chances that you will live to age 75 or older?
Our measure of income is calculated as the sum of all non-capital income received by the
respondent and spouse during the year. To calculate wealth variables we use responses to

69

ւ

≺⊙ ⊙ ց

questions about the value of holdings within various financial assets. The assets that are
measured are stocks and/or mutual funds, bonds (corporate and government), CDs, and checking,
saving and/or money market accounts. We also have information about other assets owned by
the household including housing, vehicles, other real estate, IRA/Keogh accounts, and trusts. We
distinguish between two measures of wealth holdings. Non-financial wealth consists of the value
of housing, real estate, IRA/Keogh accounts, vehicles, and trust holdings, less any associated
debt. Financial wealth is the value of all holdings in stocks/mutual funds, bonds, CDs, and
checking, savings, and money market accounts. HRS respondents are given a list of nouns and
then asked to repeat this list immediately and then again at the end of the cognition section. We
use the proportion of words recalled at the end of the section as a proxy for cognition.
We construct our data set by household. We combine spouses in each wave so that our
panel consists of household observations by year. To select individuals for our sample we drop
observations that satisfy the following criteria:
(i)

older than 65,

(ii)

missing values for subjective survival probability,

(iii)

is a proxy interview or

(iv)

is deceased.

We apply these criteria at the individual level so households that do not have both spouses are
dropped. The decision to allocate wealth across assets is conditional on holding positive financial
wealth; thus, we drop observations where financial wealth is zero. In addition, we drop
households that are only observed for one time period. Imposing the above criteria for married
households leaves us with 3,872 households (14,514 total observations).
In Table 22, we display means and medians of the relevant variables in our analysis. We

70

ւ

≺⊙ ⊙ ց

can see from Table 22 that the average allocation of wealth generally falls into two assets: stocks
and checking. There is some investment in CDs but very little in Bonds. There is some concern
regarding the time variation in the HRS in particular with subjective survival probability
measures.

18

Table 23 presents the proportion of the variation that is between households in our

dataset. We can see that the bulk of the variation in our variables is between households, but
there is some variation over time that we may be able to exploit.
As a first step we estimate our allocation equations by linear fixed effects. In general
linear fixed effects coefficients should provide fairly good estimates of the average partial effects
therefore they should provide us with a reasonable baseline to compare other estimation
techniques. The results of the linear fixed effects estimation are presented in Table 24.
We now turn to different methods of estimating the average partial effects of our
covariates using nonlinear estimation. As a first pass we know that we can estimate each
equation using a single equation nonlinear method. We follow Papke and Wooldridge (2008) and
use the procedure that they laid out for the estimation of a pooled fractional probit. In this
approach we are ignoring the unbalanced nature of the panel and simply maximizing the
Bernoulli quasi-likelihood function for each asset equation over the full dataset. The results from
this approach are presented in Table 25. While we have controlled for an unobserved effect in
this estimation we have not accounted for the fact that we have an unbalanced panel, in fact we
have simply assumed that each household is observed the same number of times.
Table 26 presents the results of maximizing the Bernoulli quasi-likelihood for each
balanced panel subset of the full, unbalanced panel and then averaging the estimated average
partial effects (and their variances) across the balanced panel subsets. We can see that the
18

Smith et al. (2001) and Elder (2010) point to the lack of appropriate variation in the subjective
survival probabilities collected in the HRS.
71

ւ

≺⊙ ⊙ ց

estimated average partial effects are nearly identical to the estimates provided by completely
ignoring the unbalanced nature of the data. The difference in standard errors between these two
approaches is significant though. The increase in the estimated standard errors arises from the
fact that there are considerably fewer observations within each balanced panel subset and this
naturally leads to larger variances. While this approach assumes that there is a difference in the
estimated moments of the unobserved effect depending on the number of times that a household
is viewed in the panel it does not provide a way to test whether the number of times that we
observe a household has a statistically significant impact on the estimates.
Our next step is to estimate each asset equation on the full, unbalanced panel correcting
for the unbalanced nature of the panel by allowing the mean and the variance of the unobserved
effect to vary with the number of time periods that we observe a household. We maximize the
Bernoulli quasi-likelihood and include indicator variables for the number of time periods that a
household is in the sample in the specification of the conditional mean and the conditional
variance of the unobserved effect. This is what Wooldridge (2010) proposes; except that we have
made the assumption that there is a constant slope on the time averages of the covariates in the
model. The results of this estimation approach are displayed in Table 27.
There are some differences in the estimated average partial effects in Table 27 for some
covariates when we compare them to those estimated by the other methods in Tables 24, 25 and
26. This appears to be most prevalent in the estimated average partial effects for the covariates in
the bond equation. For the most part, the larger average partial effects are not that worrisome
since the estimated standard errors are large enough so that they are statistically insignificant at
standard levels. The only estimated average partial effect which stands out as statistically
different is Wife Pr(Live to 75). While the overall impact on allocation is still small – a ten

72

ւ

≺⊙ ⊙ ց

percentage point increase in Wife Pr(Live to 75) leads to a 0.005 percentage point increase in the
allocation of financial wealth to bonds - it is still five times larger than the estimated average
partial effect from the other single equation estimates. Perhaps most striking is that in the other
single equation estimation approaches we very nearly have average partial effects that sum to
zero across the four equations, but this does not appear to be the case in Table 27. In fact it
appears that none of the marginal effects of any covariate sum to zero across equations.
We also test the joint significance of the indicator variables for the total number of time
periods that a household is observed in both the mean and the variance equation. From our test of
joint significance we can see that the allocation of wealth to CDs is the only equation where we
see that we obtain a test statistic of 52.16 which is significant at the 5% level, but this is only for
the mean equation. This implies that there is little evidence of variation in the moments of the
unobserved effects at the single equation level, with the exception of the mean of ci for the
allocation of wealth to CDs.
Our next step is to estimate the allocation equations by maximizing the multinomial
quasi-likelihood function. We first estimate the multinomial quasi-likelihood function on the
entire unbalanced panel ignoring the fact that we observe households for a different number of
time periods. The results of this approach are shown in Table 28. We can see that our estimated
average partial effects are very similar to those that we estimated using linear fixed effects and
maximizing the Bernoulli quasi-likelihood for each equation. It appears that the biggest
difference in estimated average partial effects is for Log (Income) which is larger in the single
equation approaches than it is for the multinomial approach. This is not surprising as the
multinomial approach leads us to estimated coefficients that maximize a likelihood that is
restricted to values that satisfy the adding up constraint.

73

ւ

≺⊙ ⊙ ց
As a next step we acknowledge the unbalanced nature of our panel by maximizing the

pooled multinomial quasi-likelihood for each balanced panel subset and then average the
estimated average partial effects and their variances across the balanced panel subsets. Table 29
displays the results of this estimation approach. Once again it is clear that our standard errors
have increased significantly since the sample sizes of the balanced panel subsets are smaller than
the full panel. Nevertheless, there appears to be little difference in the estimated average partial
effects compared to the estimates in Tables 24, 25 and 26. In fact, see that the estimated impact
of Log (Income) are closer to those estimated using the single equation approaches.
Table 30 presents the results of maximizing the multinomial quasi-likelihood on the full,
unbalanced panel and specifying that the moments of the unobserved effect to vary based on the
number of times that we observe households in the panel. The estimated average partial effects
are nearly identical to those estimated by linear fixed effects. Unlike the estimates from the
maximization of the Bernoulli quasi-likelihood in table 27 the estimated average partial effects
for each covariate sum to zero (with some rounding error) in table 30. This is due to the fact that
the adding up restriction is naturally imposed by the specification of the conditional mean
function and therefore restricts the maximization routine to focus on coefficients that satisfy the
restriction. Looking at the joint significance test for the indicator variables for the total number
of times that a household is observed we can see that there are some slight differences from the
similar test presented in Table 27. First, since we are “excluding” the checking equation in the
multinomial approach we do not have coefficients to test so these are missing. Second, it appears
that both the conditional mean and conditional variance of the unobserved effect for the
allocation of wealth to bonds differs across the number of time periods that we observe
households. Similar to the estimates in table 27 we see that the conditional mean of the

74

ւ

≺⊙ ⊙ ց

unobserved effect in the allocation of wealth to CDs does differ depending on the total number of
time periods that a household is in sample. The difference in the bond allocation equation is most
likely due to the imposition of the adding up restriction. Once we force the impact of these
covariates to cancel out across the four assets it restricts the search for coefficients to those that
satisfy the adding up constraint. Since the allocation of wealth to bonds is so small (in fact for a
majority of households it is zero more often than not) and the quasi-likelihood function is flatter
than for other assets, a single equation approach is likely to find many coefficients that maximize
the flat likelihood but that do not maximize the multinomial quasi-likelihood.
As another point of comparison, we include Table 31 which shows the estimated average
partial effects from estimating equation (3.3.5) but replacing the Normal Cumulative Distribution
Function with the multinomial logit functional form. We should note that the below estimation
equation is an approximation, the actual derivation of the appropriate specification of the
conditional mean and variance function when using the multinomial logit functional form is
more complicated. We perform this additional estimation since using the Normal Cumulative
Distribution Function does not explicitly impose the restriction that the predicted value of the
omitted equation must be in the unit interval; instead we are relying on the logarithm to impose
this constraint. Specifically we are estimate the following equation:

E ( yitg | xit , w i , sit = 1) =


 x β + T ψ ⋅1[T = r ] + x ξ 
it g ∑ r =1 gr
i
i g
exp 
1/ 2 

T

 exp  ∑

 r = 21[Ti = r ] ⋅ ω gr 




.


 x β + T ψ ⋅1[T = r ] + x ξ 
G −1
it g ∑ r =1 gr
i
i g
1 + ∑ exp 
1/ 2 

T
g =1
1[Ti = r ] ⋅ ω gr 
 exp  ∑

 r =2





75

(3.5.1)

ւ

≺⊙ ⊙ ց

We can see from Table 31 that the estimated average partial effects are nearly identical to those
reported in Table 30, though the estimated standard errors for all equations are smaller than those
estimated using the multinomial fractional probit approach. An interesting difference between
Table 30 and 31 arises in the χ 2 test for ψ gr and ω gr for the Bond equation. In table 30 we
can see that there the covariates are in general not statistically significant, but the indicator
variables for the length of time in sample are significant in both the conditional mean and
variance equation. Using the multinomial logit functional form we see that the these indicator
variables are no longer statistically significant in either the conditional mean or variance
equation, but that now the estimated partial effects are more precisely estimated. The fact that the
estimated average partial effects are nearly identical across all estimation approaches suggests
that we can leverage the flexibility (and potential efficiency) of non-linear estimation techniques
on fractional response variables in an unbalanced panel.
Overall our estimated average partial effects from the procedure laid out in this paper are
nearly identical to those estimated by methods that we know will give us consistent estimates,
but it appears that estimating a nonlinear model that appropriately imposes the adding up
restriction and also accounts for the unbalanced nature of our panel has led to a happy
compromise between ignoring this information and over controlling for it.
3.6 Conclusion
One of the major criticisms of estimating nonlinear models on panel data with a
correlated random effects approach is that it cannot handle unbalanced panels. Wooldridge
(2010) offers a simple solution to this problem that is parsimonious and easily estimable by
standard software. We have extended this approach to multiple fractional response variables and
have exploited the fact that we have an adding up restriction that allows us to use the
76

ւ

≺⊙ ⊙ ց

multinomial quasi-likelihood function. Since the multinomial quasi-likelihood is in the linear
exponential family we can be confident that we have consistent estimates of the average partial
effects of our covariates.
We have also compared our results from our procedure to other estimation methods that
will give us consistent estimates of the average partial effects. We have found that the estimation
procedure we laid out here lies somewhere in between ignoring the unbalanced nature of the
panel and more drastic corrections for it such as estimation of standard correlated random effect
models on balanced panel subsets. Since we have explicitly modeled that the conditional mean
and variance of our unobserved effect varies with the number of time periods we observe a crosssectional unit we are able to use the full data set to estimate our average partial effects. This has
allowed us to maintain consistency while increasing the precision of our estimated average
partial effects.
Possible avenues for future research would be to attempt to more parsimoniously specify
that the unobserved effect will differ across equations. Here we have allowed for each covariate
in the conditional mean and variance of the unobserved effect to be different for each equation. It
might be fruitful to instead specify that the unobserved effect is the same across each equation
but instead there is an overall scale factor for each equation’s conditional mean and variance
function. With the appropriate normalization we could then estimate fewer coefficients leading
to possible more efficiently estimated average partial effects.

77

APPENDIX

78

Table 1. Summary Statistics for Single Households
Wealth Sample
Allocation Sample
Mean
Median
Mean
Median
Age
58
58
58
59
White
0.62
0.71
Black
0.27
0.21
Hispanic
0.09
0.06
Other
0.03
0.03
Less than High School
0.28
0.19
High School
0.34
0.35
Some College
0.20
0.23
College
0.08
0.10
Post Grad
0.10
0.13
Male
0.29
0.29
Divorced/Separated
0.56
0.56
Widowed
0.30
0.30
Never Married
0.14
0.14
Working
0.57
0.63
Retired
0.18
0.20
Unemployed
0.04
0.03
Disabled
0.18
0.12
Household Size
1.81
1
1.72
1
Word Recall
0.63
0.71
0.67
0.71
Series 7 Correct
2.34
2
2.66
3
High Risk Aversion
0.62
0.63
0.12
0.13
Medium-High Risk Aversion
Medium-Low Risk Aversion
0.10
0.10
Low Risk Aversion
0.14
0.13
Plan for Next Few Months
0.25
0.21
Plan for Next Year
0.11
0.11
Plan for Next Few Years
0.27
0.28
Plan for 5-10 years
0.26
0.29
Plan for 10+ years
0.10
0.10
Pr(Live to 75)
0.63
0.70
0.65
0.75
Net Worth
$131,430 $38,530 $168,903 $64,186
Financial Wealth
$35,029 $1,485 $47,295 $4,733
Current Income
$22,764 $15,300 $27,155 $19,407
Log(Current Income)
9.55
9.64
9.77
9.87
Permanent Income
$21,405 $15,303 $25,585 $18,983
Permanent Log Income
9.44
9.50
9.65
9.71
Stocks
0.17
Bonds
0.02
CDs
0.08
Checking
0.74
79

Table 1 (cont’d).
Observations
14,275
10,573
Households
5,022
4,053
Note: Reported Wealth and Income measures are in 1992 dollars.
Table 2. Summary Statistics for Married Households
Wealth Sample
Allocation Sample
Mean
Median
Mean
Median
Husband Wife Husband Wife Husband Wife Husband Wife
Age
58
54
58
55
58
55
58
55
White
0.80
0.81
0.84
0.85
Black
0.10
0.10
0.08
0.08
Other
0.02
0.02
0.02
0.02
Hispanic
0.07
0.07
0.05
0.05
Less than
0.21
0.19
0.17
0.15
High School
High School
0.32
0.39
0.33
0.40
Some College
0.21
0.23
0.22
0.24
College
0.12
0.10
0.13
0.10
Post Grad
0.14
0.09
0.15
0.10
High Risk
0.60
0.63
0.60
0.63
Aversion
Medium-High
0.13
0.15
0.14
0.15
Risk Aversion
Medium-Low
0.10
0.10
0.10
0.10
Risk Aversion
Low Risk
0.14
0.10
0.14
0.10
Aversion
Plan for Next
0.13
0.15
0.12
0.13
Few Months
Plan for Next
0.10
0.11
0.09
0.10
Year
Plan for Next
0.29
0.31
0.30
0.31
Few Years
Plan for 5-10
0.35
0.32
0.36
0.33
Years
Plan for 10+
0.12
0.11
0.12
0.11
Years
Word Recall
0.58
0.65
0.59
0.71
0.60
0.66
0.71
0.71
Pr(Live to 75)
0.64
0.67
0.70
0.75
0.65
0.68
0.70
0.75
Working
0.68
0.59
0.69
0.61
Retired
0.25
0.11
0.26
0.11
Unemployed
0.02
0.02
0.02
0.02
Disabled
0.09
0.06
0.07
0.05
Series 7
2.44
2.26
3
2
2.54
2.36
3
2
Correct
80

Table 2 (cont’d).
Current
Income
Log(Current
Income)
Household
Size
Permanent
Income
Permanent
Log Income
Net Worth
Financial
Wealth
Stocks
Bonds
CDs
Checking
Observations
Households

$56,086

$43,750

$59,317

$46,498

10.58

10.69

10.68

10.75

2.16

2

2.16

2

$56,914

$46,670

$59,988

$49,218

10.64

10.72

10.73

10.77

$304,024

$140,111

$330,922

$159,045

$65,593

$10,887

$73,111

$15,200

0.25
0.02
0.09
0.63
18,603
6,608

16,690
6,048

Note: Reported Wealth and Income measures are in 1992 dollars.

Table 3. Average Wealth and Proportion of Financial Wealth Allocated by Pr(Live to 75),
Single Households
Fraction
Financial
Stocks Bonds CDs Checking
Net Worth
of
Wealth
Sample
$41,804
$7,063
0.08
0.01
0.06
0.86
0.0594
0 ≤ P75 < 0.1
$70,412
$14,898
0.13
0.01
0.07
0.79
0.0282
0.1 ≤ P75 < 0.2
$107,678
$24,816
0.16
0.01
0.08
0.75
0.0355
0.2 ≤ P75 < 0.3
$103,123
$17,245
0.13
0.02
0.09
0.76
0.0210
0.3 ≤ P75 < 0.4
$129,572
$35,292
0.14
0.03
0.08
0.75
0.0217
0.4 ≤ P75 < 0.5
$116,788
$28,681
0.15
0.01
0.09
0.75
0.2487
0.5 ≤ P75 < 0.6
$176,562
$57,327
0.19
0.02
0.08
0.70
0.0824
0.6 ≤ P75 < 0.7
$177,824
$44,265
0.21
0.02
0.10
0.68
0.0760
0.7 ≤ P75 < 0.8
$185,234
$51,814
0.21
0.02
0.09
0.68
0.2026
0.8 ≤ P75 < 0.9
$347,801
$168,255
0.24
0.02
0.07
0.67
0.0162
0.9 ≤ P75 < 1
$111,057
$26,632
0.15
0.01
0.01
0.77
0.2085
P75 = 1
Note: Wealth figures are in 1992 dollars.

81

Table 4. Average Wealth and Proportion of Financial Wealth Allocated by Husband’s
Pr(Live to 75), Married Households
Fraction
Net
Financial
Stocks Bonds CDs Checking
of
Worth
Wealth
Sample
$150,509 $30,026
0.17
0.01
0.09
0.73
0 ≤ P75 < 0.1
0.0524
0.18
0.01
0.07
0.73
0.1 ≤ P75 < 0.2 $195,464 $31,120
0.0255
0.21
0.02
0.1
0.67
0.2 ≤ P75 < 0.3 $226,670 $48,087
0.0367
0.21
0.02
0.11
0.67
0.3 ≤ P75 < 0.4 $200,316 $41,090
0.0259
0.22
0.02
0.09
0.68
0.4 ≤ P75 < 0.5 $238,363 $45,413
0.0241
0.22
0.02
0.1
0.66
0.5 ≤ P75 < 0.6 $273,417 $57,287
0.2490
0.3
0.03
0.09
0.58
0.6 ≤ P75 < 0.7 $336,502 $80,045
0.1067
0.31
0.03
0.09
0.57
0.7 ≤ P75 < 0.8 $439,344 $116,920
0.0795
$364,742 $76,434
0.3
0.03
0.09
0.59
0.8 ≤ P75 < 0.9
0.2026
$404,967 $91,239
0.39
0.03
0.05
0.53
0.9 ≤ P75 < 1
0.0143
$306,326 $61,103
0.23
0.02
0.09
0.65
P75 = 1
0.1833
Note: Wealth figures are in 1992 dollars.

Table 5. Average Wealth and Proportion of Financial Wealth Allocated by Wife’s Pr(Live to
75), Married Households
Fraction
Net
Financial
Stocks Bonds
CDs
Checking
of
Worth
Wealth
Sample
$118,553 $18,608
0.14
0.01
0.08
0.77
0 ≤ P75 < 0.1
0.0395
0.18
0.01
0.1
0.71
0.1 ≤ P75 < 0.2 $180,629 $48,247
0.0174
0.16
0.01
0.1
0.74
0.2 ≤ P75 < 0.3 $177,559 $33,642
0.0280
0.22
0.01
0.11
0.66
0.3 ≤ P75 < 0.4 $183,728 $44,373
0.0176
0.19
0.02
0.12
0.68
0.4 ≤ P75 < 0.5 $159,264 $28,859
0.0200
$264,519 $58,833
0.22
0.02
0.1
0.66
0.5 ≤ P75 < 0.6
0.2395
0.26
0.02
0.1
0.62
0.6 ≤ P75 < 0.7 $307,643 $68,165
0.0954
0.31
0.03
0.08
0.58
0.7 ≤ P75 < 0.8 $434,637 $97,380
0.0841
0.3
0.03
0.09
0.59
0.8 ≤ P75 < 0.9 $370,027 $83,681
0.2436
$450,742 $115,980
0.35
0.03
0.07
0.55
0.9 ≤ P75 < 1
0.0205
$310,064 $57,644
0.25
0.02
0.09
0.64
P75 = 1
0.1944
Note: Wealth figures are in 1992 dollars.

82

Table 6. Wealth Holding Estimation Results, Single Households
Financial
Net Worth
Wealth
-51,213
-14,279
Age
(29,894)*
(13,684)
1,153
350
2
Age
(590)*
(276)
-8
-3
3
Age
(4)**
(2)
-45,779
-5,396
Black
(17,874)**
(6,116)
19,166
17,318
White
(18,174)
(6,211)**
-23,964
-625
Hispanic
(18,216)
(6,337)
17,680
8,859
High School
(8,268)**
(2,833)**
58,107
23,101
Some College
(12,235)**
(4,975)**
111,375
50,635
College
(23,695)**
(10,958)**
165,583
79,858
Post Grad
(28,868)**
(13,114)**
25,972
12,850
Male
(10,429)**
(4,692)**
-2,113
-3,394
Divorced/Separated
(10,033)
(4,784)
46,520
9,966
Widowed
(11,615)**
(5,101)*
248
7,630
Working
(13,335)
(8,387)
36,706
16,949
Retired
(10,668)**
(5,008)**
3,935
-5,174
Unemployed
(21,743)
(6,628)
-4,772
-2,668
Household Size
(1,845)**
(769)**
26,900
9,479
Word Recall
(12,737)**
(4,822)**
4,466
2,328
Series 7 Correct
(1,924)**
(856)**
6,102
-1,879
Medium-High Risk Aversion
(12,438)
(3,570)
2,723
2,850
Medium-Low Risk Aversion
(15,650)
(8,821)
83

Table 6 (cont’d).
29,291
9,374
(13,970)**
(5,480)*
6,775
860
Plan for Next Year
(10,052)
(4,315)
26,510
9,814
Plan for Next Few Years
(6,049)**
(2,362)**
50,030
22,221
Plan for 5-10 Years
(9,069)**
(4,045)**
94,892
38,520
Plan for 10+ Years
(15,926)**
(8,780)**
256
124
Pr(Live to 75)
(95)**
(44)**
-25,613
-23,819
Current Income
(9,518)**
(6,887)**
330
228
2
Current Income
(153)**
(97)**
31,176
11,922
Permanent Income
(7,194)**
(3,949)**
267
108
2
Permanent Income
(62)**
(35)**
Current Income*Permanent
-74
19
Income
(301)
(185)
Observations
14,275
Households
5,022
Note: Robust standard errors reported in parentheses. ** p<0.05 and
* p<0.1. A full set of year dummies is included in all models.
Reported coefficients for Pr(Live to 75) are for a one-percentage
point difference (0.01).
Low Risk Aversion

Table 7. Average Partial Effects (Reported as Percentages) for Allocation of
Financial Wealth , Single Households, Estimated by Single Equation
Fractional Probit
Stocks
Bonds
CDs
Checking
-0.117
0.002
0.117
-0.027
Age
(0.092)
(0.022)
(0.067)*
(0.105)
-2.054
-0.167
0.989
0.787
Black
(2.616)
(0.688)
(2.086)
(3.103)
3.830
0.360
1.555
-5.819
White
(2.387)
(0.594)
(1.781)
(2.819)**
0.676
0.168
-0.390
-0.421
Hispanic
(3.269)
(0.907)
(2.187)
(3.635)
4.729
0.830
0.806
-4.637
High School
(1.511)** (0.462)*
(0.994)
(1.563)**
84

Table 7 (cont’d).
Some College
College
Post Grad
Male
Divorced/Separated
Widowed
Working
Retired
Unemployed
Household Size
Word Recall
Series 7 Correct
Medium-High Risk Aversion
Medium-Low Risk Aversion
Low Risk Aversion
Plan for Next Year
Plan for Next Few Years
Plan for 5-10 Years
Plan for 10+ Years
Pr(Live to 75)
Log(Current Income)
Permanent Log Income

9.291
(1.711)**
11.068
(2.203)**
10.005
(2.164)**
0.257
(0.961)
-0.820
(1.257)
-0.349
(1.373)
0.642
(1.131)
1.915
(1.201)
-1.392
(2.081)
-0.783
(0.400)*
1.630
(1.800)
0.116
(0.284)
-1.369
(1.113)
1.450
(1.269)
2.607
(1.114)**
0.291
(1.464)
4.222
(1.171)**
5.163
(1.203)**
5.750
(1.499)**
0.0133
(0.0136)
0.539
(0.440)
1.199
(0.756)
85

-0.038
(0.399)
0.911
(0.639)
1.019
(0.595)*
0.005
(0.231)
0.227
(0.285)
0.405
(0.326)
-0.086
(0.296)
0.710
(0.377)*
1.511
(1.021)
-0.199
(0.127)
-0.255
(0.441)
0.065
(0.090)
-0.125
(0.275)
0.193
(0.330)
0.372
(0.341)
0.699
(0.539)
0.459
(0.363)
0.317
(0.353)
1.134
(0.544)**
-0.0046
(0.0035)
-0.010
(0.139)
0.240
(0.214)

-0.609
(1.077)
0.396
(1.330)
-0.705
(1.251)
-1.227
(0.656)*
-0.451
(0.926)
-0.970
(0.986)
0.742
(0.848)
0.768
(0.932)
2.082
(1.738)
-0.686
(0.295)**
-2.310
(1.273)*
-0.189
(0.214)
-0.012
(0.802)
-1.427
(0.806)*
-3.175
(0.739)**
1.885
(1.133)**
1.920
(0.860)
1.005
(0.829)
-0.973
(0.980)
-0.0037
(0.0093)
0.067
(0.283)
-1.145
(0.541)**

-6.789
(1.740)**
-10.372
(2.192)**
-9.451
(2.167)**
0.866
(1.089)
1.319
(1.420)
1.263
(1.537)
-0.601
(1.314)
-2.906
(1.378)**
-1.538
(2.533)
1.617
(0.451)**
1.139
(2.053)
-0.033
(0.328)
1.499
(1.265)
-0.069
(1.431)
-0.146
(1.277)
-2.390
(1.650)
-5.943
(1.303)**
-6.049
(1.307)**
-6.096
(1.619)**
-0.0037
(0.0154)
-0.620
(0.508)
-0.391
(0.873)

Table 7 (cont’d)
0.693
1.900
-6.884
(0.129)** (0.165)** (0.329)**
Observations
10,573
Households
4,053
Note: Average partial effects reported. Robust standard errors are in
parentheses. ** p<0.05 and * p<0.1. A full set of year dummies is included in
all models. Reported Average Partial Effects for Pr(Live to 75) are for a onepercentage point difference (0.01).
Log(Net Worth)

5.031
(0.319)**

Table 8. Wealth Holding Estimation Reults, Married Households
Net Worth
Financial Wealth
Husband
Wife
Husband
Wife
-10,009
-4,413
-81,003
-122,988
Age
(14,993)
(37,969)** (54,066)** (14,784)
1,953
2,480
262
118
2
Age
(738)**
(1,071)**
(293)
(323)
-14
-16
-2
-1
3
Age
(5)**
(7)**
(2)
(2)
-98,615
18,022
-32,990
-6,577
Black
(38,207)**
(33,643)
(18,756)*
(18,957)
6,275
32,419
322
-13,212
White
(36,598)
(32,772)
(17,389)
(19,586)
-51,752
51,415
-6,556
-16,710
Hispanic
(38,673)
(37,185)
(16,973)
(19,363)
-11,679
21,975
-3,068
5,762
High School
(14,385)
(13,271)*
(3,803)
(3,783)
-1,573
14,839
-27,431
55,406
Some College
(20,164)
(23,091)**
(5,098)
(5,294)**
54,901
84,406
37,505
38,063
College
(39,712)
(36,931)** (12,501)** (15,272)**
8,167
67,598
30,609
41,897
Post Grad
(36,482)
(38,935)* (10,834)** (13,844)**
-18,926
-2,707
4,303
-300
Medium-High Risk Aversion
(19,420)
(17,566)
(7,331)
(6,772)
-48,768
23,058
-6,619
3,323
Medium-Low Risk Aversion
(18,796)**
(21,725)
(5,551)
(7,363)
-7,599
46,826
-1,033
-240
Low Risk Aversion
(18,355)
(27,255)*
(6,077)
(6,792)
-35,539
-12,209
-6,624
-785
Plan for Next Year
(21,740)
(16,122)
(9,458)
(5,215)
-21,460
20,647
-7,968
9,455
Plan for Next Few Years
(20,062)
(15,212)
(7,436)
(5,234)*

86

Table 8 (cont’d)
-25,427
14,505
-8,661
7,219
(21,058)
(17,162)
(7,321)
(4,579)
49,614
59,879
24,764
27,512
Plan for 10+ Years
(27,072)* (23,574)** (10,742)** (8,563)**
-14,951
-15,527
-13,304
4,178
Word Recall
(37,053)
(50,025)
(12,258)
(9,782)
14,161
821
4,867
2,327
Series 7 Correct
(4,998)**
(5,921)
(1,646)**
(1,644)
-11,193
-4,319
Household Size
(5,887)*
(2,196)**
-43,525
-98,558
-13,889
-31,661
Working
(13,399)** (17,784)** (5,442)** (6,289)**
-64,020
-80,664
-19,344
-18,303
Unemployed
(20,379)** (22,084)** (6,320)** (8,064)**
15,281
19,732
11,103
3,034
Retired
(15,312)
(24,190)
(5,732)*
(9,127)
241
425
28
-22
Pr(Live to 75)
(203)
(193)**
(62)
(65)
11,292
-2,335
Current Income
(5,504)**
(1,303)*
21
132
2
Current Income
(138)
(20)**
46,914
13,071
Permanent Income
(6,340)**
(1,764)**
-281
-55
2
Permanent Income
(117)**
(19)**
Current Income*Permanent
83
-145
Income
(381)
(57)**
Observations
18,603
Households
6,608
Note: Robust standard errors reported in parentheses. ** p<0.05 and * p<0.1. A full set
of year dummies are included in all models. Reported coefficients for Pr(Live to 75) are
for a one-percentage point difference (0.01).
Plan for 5-10 Years

87

Table 9. Average Partial Effects (Reported as Percentages) for Allocation of Financial Wealth, Married
Households, Estimated by Single Equation Fractional Probit
Stocks
Bonds
CDs
Checking
Husband
Wife
Husband
Wife
Husband
Wife
Husband
Wife
0.041
0.029
0.128
0.077
-0.012
-0.273
-0.175
0.167
Age
(0.022) (0.055)**
(0.047)
(0.098) (0.085)**
(0.091)* (0.079)** (0.027)
Black

-2.420
(4.658)

-5.679
(4.267)

-0.299
(0.769)

1.241
(1.126)

-1.809
(2.718)

2.880
(3.423)

4.746
(4.736)

1.714
(4.734)

White

-0.713
(3.121)

-0.592
(2.884)

-0.065
(0.881)

0.688
(0.636)

1.372
(2.121)

-0.726
(1.936)

-0.762
(3.292)

0.572
(3.028)

Hispanic

-3.732
(3.390)

-6.245
(3.064)**

-0.751
(0.803)

0.179
(1.055)

0.337
(2.607)

-2.656
(1.765)

3.419
(3.761)

8.234
(3.366)**

High School

4.215
(1.307)**

5.521
(1.364)**

0.663
(0.456)

0.989
(0.488)**

0.994
(0.804)

2.955
(0.855)**

-4.253
-8.104
(1.358)** (1.365)**

Some College

8.674
(1.502)**

5.378
(1.539)**

0.957
(0.518)*

0.869
(0.559)

0.016
(0.860)

2.447
(1.017)**

-7.612
-7.538
(1.544)** (1.578)**

College

8.929
(1.718)**

1.854
1.939
-1.858
8.163
(1.955)** (0.667)** (0.785)** (0.899)**

2.275
(1.276)*

-7.693
-11.581
(1.749)** (1.983)**

Post Grad

6.747
(1.745)**

1.863
1.742
7.056
(2.012)** (0.687)** (0.761)**

-0.282
(0.998)

0.837
(1.185)

-6.994
-9.431
(1.845)** (2.096)**

MediumHigh Risk
Aversion
Medium-Low
Risk
Aversion
Low Risk
Aversion

1.471
(0.974)

-0.764
(0.911)

0.306
(0.276)

0.170
(0.268)

0.470
(0.611)

0.108
(0.577)

-2.204
(1.061)**

0.373
(0.994)

1.683
(1.144)

-0.637
(1.053)

-0.218
(0.315)

-0.084
(0.308)

-0.788
(0.676)

-1.254
(0.646)*

-0.568
(1.291)

1.769
(1.194)

2.788
(1.039)**

-0.053
(1.144)

0.161
(0.288)

0.018
(0.326)

-1.622
(0.604)**

-0.418
(0.686)

-1.599
(1.120)

0.230
(1.257)

88

Table 9
(cont’d).
Plan for Next
Year

0.270
(1.494)

-1.711
(1.340)

-0.326
(0.386)

-0.285
(0.383)

0.421
(0.931)

0.095
(0.870)

-0.053
(1.568)

1.885
(1.462)

Plan for Next
Few Years

2.076
(1.189)*

-1.620
(1.107)

-0.159
(0.345)

0.033
(0.359)

1.204
(0.749)

-0.004
(0.681)

-2.892
(1.256)**

1.661
(1.180)

Plan for 5-10
Years

1.996
(1.157)*

-0.153
(1.124)

-0.124
(0.337)

-0.084
(0.351)

0.292
(0.716)

0.090
(0.700)

-1.986
(1.229)

0.224
(1.202)

Plan for 10+
Years

3.167
(1.434)**

-0.772
(1.297)

-0.448
(0.342)

0.184
(0.415)

1.164
(0.913)

0.967
(0.883)

-4.121
(1.506)**

-0.460
(1.434)

Word Recall

4.500
(1.753)**

1.829
(1.703)

-0.404
(0.542)

0.393
(0.579)

-0.006
(1.178)

-0.319
(1.139)

-3.745
(1.942)*

-1.550
(1.899)

Series 7
Correct

0.624
(0.319)*

0.374
(0.278)

-0.020
(0.101)

-0.081
(0.091)

-0.035
(0.195)

0.242
(0.188)

-0.674
(0.334)**

-0.559
(0.302)*

Household
Size

-0.827
(0.426)*

-0.031
(0.114)

0.395
(0.279)

0.499
(0.462)

Working

0.247
(1.024)

0.218
(0.801)

-0.101
(0.289)

-0.194
(0.228)

0.195
(0.664)

0.636
(0.512)

0.104
(1.139)

-0.101
(0.871)

Unemployed

0.833
(2.332)

0.647
(2.253)

-0.742
(0.511)

0.229
(0.827)

-1.642
(1.352)

0.213
(1.466)

1.748
(2.478)

-0.980
(2.428)

Retired

2.846
(1.052)**

0.984
(1.090)

0.315
(0.311)

-0.044
(0.294)

1.188
(0.665)*

-0.141
(0.651)

-3.879
(1.145)**

-0.726
(1.178)

0.0031
(0.0036)

0.0036
(0.0040)

-0.0130
-0.0189
(0.0075)* (0.0081)**

-0.0146
(0.0131)

-0.0037
(0.0139)

Pr(Live to 75)

0.0262
0.0224
(0.0121)** (0.0131)*

Log(Current
Income)

1.055
(0.492)**

0.123
(0.156)

0.300
(0.300)

-1.531
(0.549)**

Permanent
Log Income

2.300
(0.813)**

0.050
(0.222)

-1.209
(0.458)**

-1.359
(0.885)

89

Table 9
(cont’d).
Log(Net
6.670
0.968
1.256
-8.115
Worth)
(0.389)**
(0.112)**
(0.159)**
(0.400)**
Observations
16,690
Households
6,048
Note: Average partial effects reported. Robust standard errors are in parentheses. ** p<0.05 and * p<0.1. A full
set of year dummies included in all models. Reported Average Partial Effect for Pr(Live to 75) is for a onepercentage point difference (0.01).

90

Table 10. Sensitivity Analysis, Removing Permanent Income, Single and
Married Households, Wealth Holding and Asset Allocation
Single Households
Wealth Holdings
Pr(Live to 75)

Net Worth
339
(100)**

Financial Wealth
156
(50)**

Asset Allocation
Pr(Live to 75)

Stocks
0.0142
(0.0136)

Bonds
-0.0044
(0.0035)

CDs
-0.0046
(0.0093)

Checking
-0.0040
(0.0154)

Married Households
Wealth Holdings
Husband Pr(Live to 75)
Wife Pr(Live to75)

Net Worth
356
(208)*
522
(200)**

Financial Wealth
66
(63)
12
(66)

Asset Allocation
Stocks
Bonds
CDs
Checking
0.0274
0.0032
-0.0135
-0.0152
Husband Pr(Live to 75)
(0.0121)** (0.0036) (0.0075)*
(0.0131)
0.0230
0.0036
-0.0193
-0.0040
Wife Pr(Live to75)
(0.0131)* (0.0040) (0.0081)** (0.0139)
Note: We remove all variables that include our measure of Permanent
Income. ** p<0.05, * p<0.10. Average Partial Effects (reported as
percentages are shown for Stocks, Bonds, CDs, and Checking. Reported
coefficients and Average Partial Effects for Pr(Live to 75) are for a onepercentage point difference (0.01).

91

Table 11. Sensitivity Analysis, Treating Pr(Live to75)=1 differently, Single Households,
Wealth Holding and Asset Allocation
Treatment 1: Removing
Treatment 2: Removing
Observations
Households
Net Worth
394
466
Pr(Live to 75)
(124)**
(141)**
Financial Wealth
182
143
Pr(Live to 75)
(54)**
(43)**
Observations
11,176
8,022
Households
4,461
3,241
Stocks
Pr(Live to 75)

0.0320
(0.0174)*

0.0501
(0.0209)**

-0.0022
(0.0046)

0.0016
(0.0059)

0.0043
(0.0117)

-0.0019
(0.0139)

Bonds
Pr(Live to 75)
CDs
Pr(Live to 75)
Checking
-0.0351
-0.0473
(0.0195)*
(0.0231)**
Observations
8,369
6,117
Households
3,566
2,588
Note: In Treatment 1 we drop observations where the household reports a survival
probability of one. In Treatment 2 we drop households that ever reported a survival
probability of one. ** p<0.05, * p<0.10. Average Partial Effects (reported as
percentages) are shown for Stocks, Bonds, CDs, and Checking. Reported coefficients
and Average Partial Effects for Pr(Live to 75) are for a one-percentage point difference
(0.01).
Pr(Live to 75)

92

Table 12. Sensitivity Analysis, Treating Pr(Live to 75)=1 differently, Married
Households, Wealth Holding and Asset Allocation
Treatment 1: Removing
Treatment 2: Removing
Observations
Households
Net Worth
56
-171
Husband Pr(Live to 75)
(253)
(314)
698
497
Wife Pr(Live to 75)
(253)**
(337)
Financial Wealth
-5
13
Husband Pr(Live to 75)
(81)
(98)
129
215
Wife Pr(Live to 75)
(97)
(141)
Observations
12,436
7,610
Households
5,272
3,252
Stocks
Husband Pr(Live to 75)
Wife Pr(Live to 75)

0.0359
(0.0161)**
0.0326
(0.0170)*

0.0288
(0.0203)
0.0602
(0.0208)**

0.0053
(0.0047)
0.0012
(0.0053)

0.0121
(0.0061)**
0.0055
(0.0067)

-0.0212
(0.0101)**
-0.0262
(0.0106)**

-0.0249
(0.0126)**
-0.0363
(0.0132)**

Bonds
Husband Pr(Live to 75)
Wife Pr(Live to 75)
CDs
Husband Pr(Live to 75)
Wife Pr(Live to 75)
Checking
-0.0002
-0.0123
(0.0002)
(0.0220)
-0.0039
-0.0253
Wife Pr(Live to 75)
(0.0180)
(0.0225)
Observations
11,183
6,753
Households
4,812
2,912
Note: In Treatment 1 we drop observations where any member of the household
reports a survival probability of one. In Treatment 2 we drop households where any
member ever reported a survival probability of one. ** p<0.05, * p<0.10. Average
Partial Effects (reported as percentages) are shown for Stocks, Bonds, CDs, and
Checking. Reported coefficients and Average Partial Effects for Pr(Live to 75) are for
a one-percentage point difference (0.01).
Husband Pr(Live to 75)

93

Table 13. Means and Medians of Relevant Variables
Mean
Median
Husband Wife Husband Wife
Pr(Live to 75)
0.65
0.69
0.70
0.75
Age
58
55
59
55
Word Recall
0.61
0.67
0.71
0.71
Income
$59,578
$47,000
Non-Financial Wealth
$268,579
$131,590
Allocation to Stocks
0.26
Allocation to Bonds
0.02
Allocation to CDs
0.09
Allocation to Checking
0.62
Observations
14,514
Households
3,872
Note: All dollar values are stated in 1992 dollars. In the right
panel all households with only one time observation are dropped

Table 14. Proportion of Variation Between
Households for Relevant Variables
Husband
Wife
Independent Variables
Pr(Live to 75)
0.66
0.65
Age
0.59
0.76
Word Recall
0.43
0.40
Income
0.65
Non-Financial Wealth
0.82
Dependent Variables
Allocation to Stocks
0.61
Allocation to Bonds
0.42
Allocation to CDs
0.52
Allocation to Checking
0.61
Observations
14,514
Households
3,872
Note: Results obtained by running an ANOVA
using the Household as the categorical variable.

94

Table 15. Average Partial Effects from Pooled Linear Fixed Effects OLS
Estimation on Entire Unbalanced Panel
Stocks
Bonds
CDs
Checking
-0.001
0.004
-0.008
0.005
Husband Age
(0.007)
(0.003)
(0.005)
(0.008)
0.001
-0.000
-0.001
0.000
Husband Pr(Live to 75)
(0.001)
(0.001)
(0.001)
(0.002)
0.006
-0.000
0.002
-0.008
Husband Word Recall
(0.002)**
(0.001)
(0.001)
(0.002)**
-0.004
-0.003
0.009
-0.002
Wife Age
(0.007)
(0.003)
(0.005)*
(0.008)
-0.003
0.001
0.000
0.002
Wife Pr(Live to 75)
(0.001)** (0.001)**
(0.001)
(0.002)
0.002
0.001
-0.002
-0.001
Wife Word Recall
(0.001)
(0.001)
(0.001)*
(0.002)
0.011
0.001
0.004
-0.017
Log(Income)
(0.005)**
(0.002)
(0.003)
(0.005)**
Log(Non-Financial
-0.007
0.001
0.001
0.005
Wealth)
(0.004)*
(0.002)
(0.003)
(0.005)
Observations
14,514
Households
3,872
Note: Standard errors in parentheses are robust to heteroskedasticity and
serial correlation. ** p-value < 0.05, * p-value < 0.10. Reported average
partial effect for Husband and Wife Pr(Live to 75) and Husband and Wife
Word Recall represent a 10 percentage point change.

95

Table 16. Average Partial Effects from Single Equation Pooled
Bernoulli Quasi-Maximum Likelihood Estimates for Each Balanced
Panel Subset, Averaged Across Balanced Panel Subsets
Stocks
Bonds
CDs
Checking
-0.001
0.004
-0.007
0.004
Husband Age
(0.019) (0.006) (0.012)
(0.020)
0.001
-0.000
-0.001
0.001
Husband Pr(Live to 75)
(0.004) (0.001) (0.002)
(0.004)
0.004
0.000
0.002
-0.006
Husband Word Recall
(0.004) (0.002) (0.003)
(0.005)
-0.003
-0.002
0.009
-0.005
Wife Age
(0.019) (0.006) (0.012)
(0.020)
-0.003
0.001
0.000
0.002
Wife Pr(Live to 75)
(0.004) (0.001) (0.002)
(0.004)
0.001
0.001
-0.002
0.000
Wife Word Recall
(0.004) (0.001) (0.003)
(0.005)
0.012
0.002
0.003
-0.016
Log(Income)
(0.012) (0.004) (0.007)
(0.014)
Log(Non-Financial
-0.006
0.002
0.001
0.005
Wealth)
(0.012) (0.004) (0.007)
(0.013)
Observations
14,514
Households
3,872
Note: Standard errors in parentheses are robust to heteroskedasticity and
serial correlation. ** p-value < 0.05, * p-value < 0.10. Reported average
partial effect for Husband and Wife Pr(Live to 75) and Husband and
Wife Word Recall represent a 10 percentage point change.

96

Table 17. Average Partial Effects from Pooled Multinomial QuasiMaximum Likelihood Estimates for Each Balanced Panel Subset,
Averaged Across Balanced panel Subsets
Stocks
Bonds
CDs
Checking
-0.001
0.004
-0.006
0.003
Husband Age
(0.019) (0.006) (0.012)
(0.020)
0.000
-0.001
-0.001
0.001
Husband Pr(Live to 75)
(0.004) (0.001) (0.002)
(0.004)
0.004
-0.000
0.002
-0.006
Husband Word Recall
(0.004) (0.002) (0.003)
(0.004)
-0.003
-0.002
0.009
-0.004
Wife Age
(0.019) (0.006) (0.012)
(0.020)
-0.003
0.001
0.000
0.002
Wife Pr(Live to 75)
(0.004) (0.001) (0.002)
(0.004)
0.000
0.001
-0.002
0.001
Wife Word Recall
(0.004) (0.001) (0.003)
(0.004)
0.009
0.001
0.003
-0.013
Log(Income)
(0.012) (0.004) (0.007)
(0.012)
Log(Non-Financial
-0.006
0.001
0.000
0.005
Wealth)
(0.012) (0.004) (0.007)
(0.013)
Observations
14,514
Households
3,872
Note: Standard errors in parentheses are robust to heteroskedasticity
and serial correlation. ** p-value < 0.05, * p-value < 0.10. Reported
average partial effect for Husband and Wife Pr(Live to 75) and
Husband and Wife Word Recall represent a 10 percentage point
change.

97

Table 18. Procedure 2: Classical Minimum Distance applied to Multinomial
Quasi-Maximum Likelihood Estimates for Each Year Within Balanced Panel
Subsets
Stocks
Bonds
CDs
Checking
0.027
0.004
-0.005
-0.026
Husband Age
(0.013)**
(0.003)
(0.007) (0.013)**
0.005
-0.000
0.000
-0.005
Husband Pr(Live to 75)
(0.002)*
(0.001)
(0.002)
(0.003)*
0.009
-0.000
-0.000
-0.008
Husband Word Recall
(0.003)**
(0.001)
(0.002) (0.003)**
-0.026
-0.003
0.003
0.026
Wife Age
(0.013)**
(0.003)
(0.007) (0.013)**
-0.007
-0.001
0.003
0.005
Wife Pr(Live to 75)
(0.003)**
(0.001)
(0.002)
(0.003)*
-0.001
0.000
-0.001
0.002
Wife Word Recall
(0.003)
(0.001)
(0.002)
(0.003)
0.023
-0.007
-0.003
-0.013
Log(Income)
(0.008)** (0.002)** (0.005)
(0.008)
Log(Non-Financial
0.003
0.000
-0.004
0.001
Wealth)
(0.008)
(0.002)
(0.005)
(0.009)
Observations
14,514
Households
3,872
Note: Standard errors in parentheses are robust to heteroskedasticity and
serial correlation. ** p-value < 0.05, * p-value < 0.10. Reported average
partial effects for Husband and Wife Pr(Live to 75) and Husband and Wife
Word Recall represent a 10 percentage point change.

98

Table 19. Comparing Average Partial Effect Estimates from Pooled
Single Equation Bernoulli Quasi-Maximum Likelihood Estimates for
Each Balanced Panel Subset, Averaged Across Balanced Panel Subsets
to Procedure 2 Average Partial Effect Estimates
Stocks
Bonds
CDs
Checking
Husband Age
0.137
0.972
0.913
0.153
Husband Pr(Live to 75)
0.267
0.939
0.602
0.182
Husband Word Recall
0.237
0.762
0.332
0.679
Wife Age
0.228
0.952
0.588
0.117
Wife Pr(Live to 75)
0.348
0.141
0.324
0.393
Wife Word Recall
0.698
0.743
0.890
0.759
Log(Income)
0.344
0.023** 0.370
0.805
Log(Non-Financial Wealth) 0.466
0.637
0.534
0.782
Note: p-values are robust to heteroskedasticity and serial correlation.
** p-value < 0.05, * p-value < 0.10. Standard errors used to compute
the test statistic are from Table 4: Weighted Average of the standard
errors for Average Partial Effects estimated by Single Equation
Bernoulli Quasi-Maximum Likelihood for each balanced panel subset.

Table 20. Comparing Average Partial Effect Estimates from Pooled
Multinomial Quasi-Maximum Likelihood Estimates for Each Balanced Panel
Subset, Averaged Across Balanced Panel Subsets to Procedure 2 Estimated
Average Partial Effects
Bonds
CDs
Checking
Stocks
Husband Age
0.133
0.955
0.935
0.145
Husband Pr(Live to 75)
0.229
0.922
0.519
0.129
Husband Word Recall
0.219
0.815
0.348
0.641
Wife Age
0.219
0.902
0.592
0.122
Wife Pr(Live to 75)
0.342
0.139
0.310
0.428
Wife Word Recall
0.735
0.793
0.918
0.759
Log(Income)
0.241
0.026** 0.410
0.989
Log(Non-Financial Wealth)
0.435
0.758
0.576
0.752
Note: p-values are robust to heteroskedasticity and serial correlation. ** pvalue < 0.05, * p-value < 0.10. Standard errors used to compute the test
statistic are from Table 5: Weighted Average of the standard errors for
Average Partial Effects estimated by Multinomial Quasi-Maximum
Likelihood for each balanced panel subset.

99

Table 21. Comparing Average Partial Effect Estimates from Pooled Linear
Fixed Effects OLS on Entire Unbalanced Panel to Procedure 2 Estimated
Average Partial Effect Estimates
Stocks
Bonds
CDs
Checking
Husband Age
0.027**
0.985
0.708
0.019**
Husband Pr(Live to 75)
0.116
0.705
0.377
0.053*
Husband Word Recall
0.311
0.776
0.209
0.934
Wife Age
0.079*
0.921
0.391
0.030**
Wife Pr(Live to 75)
0.150
0.005**
0.136
0.224
Wife Word Recall
0.253
0.729
0.632
0.372
Log(Income)
0.132
0.000**
0.110
0.610
Log(Non-Financial Wealth)
0.223
0.552
0.328
0.688
Note: p-values are robust to heteroskedasticity and serial correlation. ** pvalue < 0.05, * p-value < 0.10. Standard errors used to compute the test
statistic are from Table 6: Standard errors for Average Partial Effects
estimated by Procedure 2 (Classical Minimum Distance applied to
Multinomial Quasi-Maximum Likelihood estimates for each year within each
balanced panel subset).

100

Table 22. Means and Medians of Relevant Variables
Mean
Median
Husband Wife Husband Wife
Pr(Live to 75)
0.65
0.69
0.70
0.75
Age
58
55
59
55
Word Recall
0.61
0.67
0.71
0.71
Income
$59,578
$47,000
Non-Financial Wealth
$268,579
$131,590
Allocation to Stocks
0.26
Allocation to Bonds
0.02
Allocation to CDs
0.09
Allocation to Checking
0.62
Observations
14,514
Households
3,872
Note: All dollar values are stated in 1992 dollars. In the right
panel all households with only one time observation are dropped

Table 23. Proportion of Variation Between
Households for Relevant Variables
Wife
Husband
Independent
Variables
Pr(Live to 75)
0.66
0.65
Age
0.59
0.76
Word Recall
0.43
0.40
Income
0.65
Non-Financial Wealth
0.82
Dependent
Variables
Allocation to Stocks
0.61
Allocation to Bonds
0.42
Allocation to CDs
0.52
Allocation to Checking
0.61
Observations
14,514
Households
3,872
Note: Results obtained by running an ANOVA
using the Household as the categorical variable.

101

Table 24. Average Partial Effects from Pooled Linear Fixed Effects OLS
estimation on entire unbalanced panel
Stocks
Bonds
CDs
Checking
-0.001
0.004
-0.008
0.005
Husband Age
(0.007)
(0.003)
(0.005)
(0.008)
0.001
-0.000
-0.001
0.000
Husband Pr(Live to 75)
(0.001)
(0.001)
(0.001)
(0.002)
0.006
-0.000
0.002
-0.008
Husband Word Recall
(0.002)**
(0.001)
(0.001) (0.002)**
-0.004
-0.003
0.009
-0.002
Wife Age
(0.007)
(0.003)
(0.005)*
(0.008)
-0.003
0.001
0.000
0.002
Wife Pr(Live to 75)
(0.001)** (0.001)**
(0.001)
(0.002)
0.002
0.001
-0.002
-0.001
Wife Word Recall
(0.001)
(0.001)
(0.001)*
(0.002)
0.011
0.001
0.004
-0.017
Log(Income)
(0.005)**
(0.002)
(0.003) (0.005)**
-0.007
0.001
0.001
0.005
Log(Non-Financial Wealth)
(0.004)*
(0.002)
(0.003)
(0.005)
Observations
14,514
Households
3,872
Note: Standard errors in parentheses are robust to heteroskedasticity and serial
correlation. ** p-value < 0.05, * p-value < 0.10. Reported average partial effect
for Husband and Wife Pr(Live to 75) and Husband and Wife Word Recall
represent a 10 percentage point change.

102

Table 25. Average Partial Effects from Single Equation Pooled Bernoulli QuasiMaximum Likelihood Estimates on Entire Unabalanced Panel, Ignoring
Unbalanced Nature of the Panel
Stocks
Bonds
CDs
Checking
-0.001
0.004
-0.008
0.005
Husband Age
(0.007)
(0.003)
(0.005)
(0.008)
0.001
-0.000
-0.001
0.000
Husband Pr(Live to 75)
(0.001)
(0.001)
(0.001)
(0.002)
0.006
-0.000
0.002
-0.008
Husband Word Recall
(0.002)**
(0.001)
(0.001)* (0.002)**
-0.004
-0.002
0.008
-0.002
Wife Age
(0.007)
(0.003)
(0.005)
(0.008)
-0.003
0.001
0.000
0.002
Wife Pr(Live to 75)
(0.002)** (0.001)**
(0.001)
(0.002)
0.002
0.001
-0.002
-0.001
Wife Word Recall
(0.002)
(0.001)
(0.001)
(0.002)
0.011
0.001
0.004
-0.016
Log(Income)
(0.005)**
(0.002)
(0.003)
(0.005)**
-0.007
0.002
0.001
0.005
Log(Non-Financial Wealth)
(0.005)
(0.002)
(0.003)
(0.005)
Observations
14,514
Households
3,872
Note: Standard errors in parentheses are robust to heteroskedasticity and serial
correlation. ** p-value < 0.05, * p-value < 0.10. Reported average partial effect for
Husband and Wife Pr(Live to 75) and Husband and Wife Word Recall represent a
10 percentage point change.

103

Table 26. Average Partial Effects from Single Equation Pooled Bernoulli
Quasi-Maximum Likelihood Estimates for Each Balanced Panel Subset,
Averaged Across Balanced Panel Subsets
Stocks
Bonds
CDs
Checking
-0.001
0.004
-0.007
0.004
Husband Age
(0.019) (0.006) (0.012)
(0.020)
0.001
-0.000
-0.001
0.001
Husband Pr(Live to 75)
(0.004) (0.001) (0.002)
(0.004)
0.004
0.000
0.002
-0.006
Husband Word Recall
(0.004) (0.002) (0.003)
(0.005)
-0.003
-0.002
0.009
-0.005
Wife Age
(0.019) (0.006) (0.012)
(0.020)
-0.003
0.001
0.000
0.002
Wife Pr(Live to 75)
(0.004) (0.001) (0.002)
(0.004)
0.001
0.001
-0.002
0.000
Wife Word Recall
(0.004) (0.001) (0.003)
(0.005)
0.012
0.002
0.003
-0.016
Log(Income)
(0.012) (0.004) (0.007)
(0.014)
-0.006
0.002
0.001
0.005
Log(Non-Financial Wealth)
(0.012) (0.004) (0.007)
(0.013)
Observations
14,514
Households
3,872
Note: Standard errors in parentheses are robust to heteroskedasticity and
serial correlation. ** p-value < 0.05, * p-value < 0.10. Reported average
partial effect for Husband and Wife Pr(Live to 75) and Husband and Wife
Word Recall represent a 10 percentage point change.

104

Table 27. Average Partial Effects from Pooled Bernoulli Quasi-Maximum
Likelihood Estimates on Entire Unbalanced Panel Adjusting for Unbalanced
Nature of the Panel
Stocks
Bonds
CDs
Checking
0.000
0.015
-0.007
0.005
Husband Age
(0.007)
(0.011)
(0.005)
(0.008)
0.001
-0.000
-0.001
0.000
Husband Pr(Live to 75)
(0.002)
(0.002)
(0.001)
(0.002)
0.006
-0.001
0.001
-0.008
Husband Word Recall
(0.002)**
(0.002)
(0.001)
(0.002)**
-0.004
-0.010
0.006
-0.002
Wife Age
(0.007)
(0.011)
(0.005)
(0.008)
-0.004
0.005
0.0001
0.002
Wife Pr(Live to 75)
(0.002)** (0.002)** (0.001)
(0.002)
0.002
0.003
-0.001
-0.001
Wife Word Recall
(0.002)
(0.002)
(0.001)
(0.002)
0.011
0.008
0.004
-0.016
Log(Income)
(0.005)**
(0.007)
(0.003)
(0.005)**
-0.007
0.007
-0.001
0.005
Log(Non-Financial Wealth)
(0.005)
(0.008)
(0.003)
(0.005)

χ 2 Test Statistic for ψ gr
p-value

1.34
0.931

6.75
0.240

52.16
0.000**

3.80
0.579

1.45
4.74
5.26
2.83
χ 2 Test Statistic for ω gr
0.919
0.448
0.385
0.726
p-value
Observations
14,514
Households
3,872
Note: Standard errors in parentheses are robust to heteroskedasticity and serial
correlation. ** p-value < 0.05, * p-value < 0.10. Reported average partial effect
for Husband and Wife Pr(Live to 75) and Husband and Wife Word Recall
represent a 10 percentage point change.

105

Table 28. Average Partial Effects from Pooled Multinomial Quasi-Maximum
Likelihood Estimates for Entire Unbalanced Panel, Ignoring Unbalanced Nature
of the Panel
Stocks
Bonds
CDs
Checking
-0.001
0.004
-0.008
0.005
Husband Age
(0.007)
(0.002)
(0.005)* (0.008)
0.001
-0.000
-0.001
0.001
Husband Pr(Live to 75)
(0.001)
(0.001)
(0.001)
(0.002)
0.006
-0.000
0.002
-0.007
Husband Word Recall
(0.002)**
(0.001)
(0.001)* (0.002)**
-0.004
-0.003
-0.002
0.009
Wife Age
(0.007)
(0.002)
(0.005)* (0.008)
-0.004
0.001
0.000
0.002
Wife Pr(Live to 75)
(0.002)** (0.001)** (0.001)
(0.002)
0.002
0.001
-0.002
-0.001
Wife Word Recall
(0.002)
(0.001)
(0.001)* (0.002)
0.008
0.000
0.002
-0.009
Log(Income)
(0.005)*
(0.002)
(0.003) (0.005)**
-0.008
0.001
0.000
0.007
Log(Non-Financial Wealth)
(0.005)*
(0.002)
(0.003)
(0.005)
Observations
14,514
Households
3,872
Note: Standard errors in parentheses are robust to heteroskedasticity and serial
correlation. ** p-value < 0.05, * p-value < 0.10. Reported average partial effect
for Husband and Wife Pr(Live to 75) and Husband and Wife Word Recall
represent a 10 percentage point change.

106

Table 29. Average Partial Effects from Pooled Multinomial QuasiMaximum Likelihood Estimates for Each Balanced Panel Subset,
Averaged Across Balanced Panel Subsets
Stocks
Bonds
CDs
Checking
-0.001
0.004
-0.006
0.003
Husband Age
(0.019) (0.006) (0.012)
(0.020)
0.000
-0.001
-0.001
0.001
Husband Pr(Live to 75)
(0.004) (0.001) (0.002)
(0.004)
0.004
-0.000
0.002
-0.006
Husband Word Recall
(0.004) (0.002) (0.003)
(0.004)
-0.003
-0.002
0.009
-0.004
Wife Age
(0.019) (0.006) (0.012)
(0.020)
-0.003
0.001
0.000
0.002
Wife Pr(Live to 75)
(0.004) (0.001) (0.002)
(0.004)
0.000
0.001
-0.002
0.001
Wife Word Recall
(0.004) (0.001) (0.003)
(0.004)
0.009
0.001
0.003
-0.013
Log(Income)
(0.012) (0.004) (0.007)
(0.012)
-0.006
0.001
0.000
0.005
Log(Non-Financial Wealth)
(0.012) (0.004) (0.007)
(0.013)
Observations
14,514
Households
3,872
Note: Standard errors in parentheses are robust to heteroskedasticity and
serial correlation. ** p-value < 0.05, * p-value < 0.10. Reported average
partial effect for Husband and Wife Pr(Live to 75) and Husband and Wife
Word Recall represent a 10 percentage point change.

107

Table 30. Average Partial Effects from Multinomial Quasi-Maximum
Likelihood Estimates on Entire Unbalanced Panel, Adjusting for the
Unbalanced Nature of the Panel
Stocks
Bonds
CDs
Checking
-0.001
0.003
-0.006
0.004
Husband Age
(0.007)
(0.003) (0.005)
(0.007)
0.001
-0.000
-0.001
0.001
Husband Pr(Live to 75)
(0.001)
(0.001) (0.001)
(0.002)
0.006
-0.000
0.001
-0.007
Husband Word Recall
(0.002)** (0.001) (0.001) (0.002)**
-0.002
-0.002
0.006
-0.002
Wife Age
(0.007)
(0.003) (0.005)
(0.008)
0.001
0.001
0.002
-0.004
Wife Pr(Live to 75)
(0.002)
(0.002)** (0.001)* (0.001)
0.001
0.001
-0.001
-0.001
Wife Word Recall
(0.002)
(0.001) (0.001)
(0.002)
0.008
0.001
0.003
-0.012
Log(Income)
(0.005)* (0.002) (0.003) (0.005)**
-0.007
0.001
-0.001
0.007
Log(Non-Financial Wealth)
(0.005)
(0.002) (0.003)
(0.005)
2 Test Statistic for ψ
2.13
17.40
69.65
χ
gr
0.830
0.004** 0.000**
p-value
0.80
11.78
6.35
χ 2 Test Statistic for ω gr
0.977
0.038** 0.274
p-value
Observations
14,514
Households
3,872
Note: Standard errors in parentheses are robust to heteroskedasticity and serial
correlation. ** p-value < 0.05, * p-value < 0.10. Reported average partial
effect for Husband and Wife Pr(Live to 75) and Husband and Wife Word
Recall represent a 10 percentage point change.

108

Table 31. Average Partial Effects from Multinomial Quasi-Maximum Likelihood
Estimates Using the Multinomial Logit Functional Form on Entire Unbalanced
Panel, Adjusting for Unbalanced Nature of the Panel
Stocks
Bonds
CDs
Checking
-0.000
0.003
-0.008
0.005
Husband Age
(0.000)
(0.000)**
(0.006)
(0.007)
0.001
-0.000
-0.001
0.000
Husband Pr(Live to75)
(0.000)**
(0.000)**
(0.001)**
(0.001)
0.006
-0.000
0.001
-0.007
Husband Word Recall
(0.000)**
(0.000)**
(0.001)*
(0.001)**
-0.004
-0.002
0.008
-0.001
Wife Age
(0.000)**
(0.000)**
(0.008)
(0.008)
-0.004
0.001
0.001
0.002
Wife Pr(Live to 75)
(0.000)**
(0.000)**
(0.001)
(0.001)**
0.002
0.001
-0.001
-0.001
Wife Word Recall
(0.000)**
(0.000)**
(0.001)**
(0.001)*
0.011
0.002
0.003
-0.016
Log(Income)
(0.000)**
(0.000)**
(0.007)
(0.009)*
-0.008
0.002
-0.000
0.007
Log(Non-Financial Wealth)
(0.000)**
(0.000)**
(0.006)
(0.008)

χ 2 Test Statistic for ψ gr
p-value

2.42
0.789

6.27
0.281

13.91
0.016**

-

1.52
6.75
4.86
χ 2 Test Statistic for ω gr
0.911
0.240
0.434
p-value
Observations
14,514
Households
3,872
Note: Standard errors in parentheses are robust to heteroskedasticity and serial
correlation. ** p-value < 0.05, * p-value < 0.10. Reported average partial effects for
Husband and Wife Pr(Live to 75) and Husband and Wife Word Recall represent a
10 percentage point change.

109

Figure 1. Average Structural Function for the Proportion of Financial Wealth Allocated to
Stocks Against Husband Word Recall

Linear Fixed Effects

0.34

Classical Minimum Distance on Multinomial QMLE for
each year within balanced panel subsets

0.32
0.3
0.28
0.26
0.24
0.22
0

0.1

0.2

0.3

0.4

0.5

0.6

Husband Word Recall

110

0.7

0.8

0.9

1

Figure 2. Average Structural Function for the Proportion of Financial Wealth Allocated to
Bonds Against Husband Word Recall

0.036
0.034
0.032
Linear Fixed Effects
0.03
Classical Minimum Distance on Multinomial
QMLE for each year within balanced panel
subsets

0.028
0.026
0.024
0.022
0

0.1

0.2

0.3

0.4
0.5
0.6
0.7
Husband Word Recall

111

0.8

0.9

1

Figure 3. Average Structural Function for the Proportion of Financial Wealth Allocated to
CDs Against Husband Word Recall

0.13

0.12
Linear Fixed Effects
0.11
Classical Minimum Distance on Multinomial QMLE
for each year within balanced panel subsets
0.1

0.09

0.08
0

0.1

0.2

0.3

0.4
0.5
0.6
0.7
Husband Word Recall

112

0.8

0.9

1

Figure 4. Average Structural Function for the Proportion of Financial Wealth Allocated to
Checking Against Husband Word Recall

0.7

Linear Fixed Effects
Classical Minimum Distance on Multinomial
QMLE for each year within balanced panel
subsets

0.65

0.6

0.55

0.5
0

0.1

0.2

0.3

0.4
0.5
0.6
0.7
Husband Word Recall

113

0.8

0.9

1

Figure 5. Average Structural Function for the Proportion of Financial Wealth Allocated to
Stocks Against Wife Word Recall

0.32

0.3
Linear Fixed Effects
0.28

Classical Minimum Distance on
Multinomial QMLE for each year within
balanced panel subsets

0.26

0.24
0

0.1

0.2

0.3

0.4
0.5
0.6
Wife Word Recall

114

0.7

0.8

0.9

1

Figure 6. Average Structural Function for the Proportion of Financial Wealth Allocated to
Bonds Against Wife Word Recall

0.036

0.032
Linear Fixed Effects
0.028
Classical Minimum Distance on Multinomial
QMLE for each year within balanced panel
subsets

0.024

0.02
0

0.1

0.2

0.3

0.4
0.5
0.6
Wife Word Recall

115

0.7

0.8

0.9

1

Figure 7. Average Structural Function for Proportion of Financial Wealth Allocated to
CDs Against Wife Word Recall

0.125
Linear Fixed Effects
0.115
Classical Minimum Distance
on Multinomial QMLE for
each year within balanced
panel subsets

0.105

0.095

0.085
0

0.1

0.2

0.3

0.4
0.5
0.6
Wife Word Recall

116

0.7

0.8

0.9

1

Figure 8. Average Structural Function for Proportion of Financial Wealth Allocated to
Checking Against Wife Word Recall

0.64
0.62
Linear Fixed Effects

0.6
0.58

Classical Minimum Distance on Multinomial
QMLE for each year within balanced panel
subsets

0.56
0.54
0.52
0

0.1

0.2

0.3

0.4
0.5
0.6
Wife Word Recall

117

0.7

0.8

0.9

1

BIBLIOGRAPHY

118

BIBLIOGRAPHY

Altonji, J.G. and Doraszelski (2005), “The Role of Permanent Income and Demographics in
Black/White Differences in Wealth,” Journal of Human Resources, 40 (1), pp. 1-30.
Barsky, R.B., F.T. Juster, M.S. Kimball and M.D. Shapiro (1997), “Preference Parameters and
Behavioral Heterogeneity: An Experiemental Approach in the Health and Retirement
Study,” The Quarterly Journal of Economics, 112 (2), pp. 537-579.
Berkowitz, M.K. and J. Qui (2006), “A Further Look at Household Portfolio Choice and Health
Status,” Journal of Banking and Finance, 30, pp. 1201-1217.
Bernheim, B.D., J. Skinner, and S. Weinberg (2001), “What Accounts for the Variation in
Retirement Wealth Among U.S. Households?,” American Economic Review, 91, pp. 832857.
Bloom, D.E., D. Canning, M. Moore, and Y. Song (2006), “The Effect of Subjective Survival
Probabilities on Retirement and Wealth in the United States,” Working Paper 12688,
National Bureau of Economic Research.
Blundell, R. and J.L. Powell (2003), “Endogeneity in Nonparametric and Semiparametric
Regression Models, with Richard Blundell,” in Advances in Economics and
Econometrics: Theory and Applications, Eighth World Congress, Volume 2, eds. M.
Dewatripoint, L.P. Hanses, and S.J. Turnovsky, Cambridge: Cambridge University
Press, pp. 312-357.
Chamberlain, G. (1980), “Analysis of Variance with Qualitative Data,” Review of Econometric
Studies, 47, pp. 225-238.
Chamberlain, G. (1982), “Multivariate Regression Models for Panel Data,” Journal of
Econometrics, 18, pp. 5-46.
Chamberlain, G. (1984), “Panel Data,” in Handbook of Econometrics, Volume 2, ed. Z. Griliches
and M.D. Intriligator, Amsterdam: North Holland, pp. 1247-1318.
Cocco, J., F. Gomes, and P. Maenhout (2005), “Consumption and Portfolio Choice over the Life
Cycle,” The Review of Financial Studies, 18 (2), pp. 491-553.
Delevande, A., M. Perry, and R. Willis (2006), “Probabilistic Thinking and Social Security
Claiming,” Working Paper 2006-129, Michigan Retirement Research Center.
DeNardi, M., E. French, and J.B. Jones (2009), “Life Expectancy and Old Age Savings,”
American Economic Review, 99 (2), pp. 110-115.

119

Elder, T. (2012), “The Predictive Validity of Subjective Mortality Expectations: Evidence from
the Health and Retirement Study,” Demography, forthcoming.
Gan, L., G. Gong, M. Hurd, and D. McFadden (2004), “Subjective Mortality Risk and
Bequests,” Working Paper 10789, National Bureau of Economic Research.
Gourieroux, C., A. Monfort, and A. Trognon (1984), “Pseudo Maximum Likelihood Methods:
Theory,” Econometrica, 52, pp. 681-700.
Hurd, M. and K. McGarry (1995), “Evaluation of the Subjective Probabilities of Survival in the
Health and Retirement Study,” Journal of Human Resources, 30, pp. S268-S292.
Hurd, M. and K. McGarry (2002), “The Predictive Validity of Subjective Probabilities of
Survival,” Economic Journal, 112, pp. 966-998.
Hurd, M., J. Smith, and J. Zissimopoulous (2002), “The Effects of Subjective Survival on
Retirement and Social Security Claiming,” Working Paper 9140, National Bureau of
Economic Research.
Kédzi, G. and R. Willis (2003), “Who Becomes a Stockholder? Expectations, Subjective
Uncertainty, and Asset Allocation,” Working Paper 2003-039, Michigan Retirement
Research Center.
Koch, S.F. (2010), “Fractional Multinomial Response Models with an Application to
Expenditure Shares,” Working Paper 2010-21, University of Pretoria.
Lillard, L. and R. Willis (2001), “Cognition and Wealth: The Importance of Probabilistiv
Thinking,” Working Paper 2001-007, Michigan Retirement Research Center.
McArdle, J., J. Smith, and R. Willis (2009), “Cognition and Economic Outcomes in the Health
and Retirement Survey,” Working Paper 15266, National Bureau of Economic Research.
Mullahy, J. (2010), “Multivariate Fractional Regression Estimation of Econometric Share
Models,” Working Paper 16354, National Bureau of Economic Research.
Mundlak, Y. (1978), “On the Pooling of Time Series and Cross Section Data,” Econometrica,
58, pp. 809-837.
Papke, L. and J. M. Wooldridge (1996), “Econometric Methods for Fractional Response
Variables with an Application to 401(k) Plan Participation Rates,” Journal of Applied
Econometrics, 11, pp. 619-632.
Papke, L. and J. M. Wooldridge (2008), “Panel Data Methods for Fractional Response Variables
with an Application to Test Pass Rates,” Journal of Econometrics, 145, pp.121-133.

120

Perry, M. (2005), “Estimating Life-Cycle Effects of Subjective Survival Probabilities in the
Health and Retirement Study,” Working Paper 2005-103, Michigan Retirement Research
Center.
Rosen, H. S. and S. Wu (2004), “Portfolio Choice and Health Status,” Journal of Financial
Economics, 72, pp.457-484.
Sahm, C. (2007), “Risk Tolerance and Asset Allocation,” Dissertation, University of Michigan.
Salm, M. (2006), “Can Subjective Mortality Expectations and Stated Preferences Explain
Varying Consumption and Savings Behaviors among the Elderly?,” IZA DP No. 2467,
IZA Discussion Paper Series.
Sivakumar, A. and C. Bhat (2002), “Fractional Split-Distribution Model for Statewide
Commodity-Flow Analysis,” Transportation Research Record, 1790, Paper No. 2142.
Smith, V. K., D. Smith, D.H. Taylor, Jr., and F. Sloan (2001), “Longevity Expectations and
Death: Can People Predict Their Own Demise?,” American Economic Review, 91, pp.
1126-1134.
Wooldridge, J. M. (2002), Econometric Analysis of Cross Section and Panel Data, MIT Press,
Cambridge, MA.
Wooldridge, J. M. (2006), “Cluster-Sample Methods in Applied Econometrics: An Extended
Analysis,” Unpublished Manuscript, Michigan State University.
Wooldridge, J. M. (2010), “Correlated Random Effects Models with Unbalanced Panels,”
Unpublished Manuscript, Michigan State University.
Yaari,M. (1965), “Uncertain Lifetime, Life Insurance and the Theory of the Consumer,” Review
of Economic Studies, 32, pp. 137-150.

121