MODEL SELECTION AND PREDICTION FOR LONG MEMORY MODELS IN ECONOMIC
TIME SERIES
by
Chaleampong Kongcharoen

A DISSERTATION
Submitted to
Michigan State University
in partial fulﬁllment of the requirements
for the degree of
DOCTOR OF PHILOSOPHY
ECONOMICS
2011

ABSTRACT
MODEL SELECTION AND PREDICTION FOR LONG MEMORY MODELS IN ECONOMIC
TIME SERIES
by
Chaleampong Kongcharoen
This dissertation considers methodological issues involving both prediction and model selection
in the application of long memory processes to economic time series data. We also have a section
which discusses some practical aspects of the implementation of the model selection methodology to real GNP, monthly CPI inﬂation series, interest rates and realized volatility from currency
markets.
Chapter 1 is the Introduction and sets the scene, established notation for the approaches to
be used later. Chapter 2 examines different approaches to estimate the optimal, minimum mean
square errors (MSE) multi step predictor to univariate fractionally autoregressive moving average
(ARFIMA) models. We consider two competing approaches of (i) using the MLE of the ARFIMA
parameters inserted into the optimal predictor, and (ii) using an alternative predictor using estimates
arising from the Local Whittle Two Step Estimator (LW T SE) are applied. In the latter approach, an
initial semi parametric estimate of the long memory parameter is used to feasibly fractionally ﬁlter
the series to obtain an estimate of the short memory series, which is then used to estimate the short
memory ARMA parameters. This latter approach seems very relevant given that the most popular
methodology in current econometric literature is to obtain an initial semi parametric estimate of
a long memory parameter. We found that the predictor based on MLE is generally superior in
MSE sense to the predictor based on the LW T SE estimation. In general the “optimal” bandwidth
LW T SE brings about worse predictions than the LW T SE based on the conventional one of using
an agnostic bandwidth of the square root of the sample size.
Chapter 3 is concerned with the issue of model selection and considers the choice of lag length
of the p and q parameters in an ARFIMA(p, d, q) model. Perhaps surprisingly, there appears to be a

major gap in the previous literature, which has only considered this problem for the very restrictive
case of the ARFIMA(p, d, 0) model. We show that when estimation is performed by either MLE or
QMLE the AIC is inconsistent and show that the widely used BIC and HQIC are consistent model
selection procedures. We accordingly extend the theory to be applied to the ARFIMA(p, d, q)
model. The implementation of the technique involves selection of the maximum orders of the short
memory process that the investigator is willing to consider. In our simulation study we restrict the
maximum values of p and q to be 8, which involves 81 different ARFIMA models to be estimated
by MLE and then assessed from a model selection perspective. In our detailed simulation we
consider data generating processes up to orders of an ARFIMA(4, d, 1) model. The BIC works
well for some low order models; but has a mixed performance on high order models with p = 4
and is quite dependent on the part of the true parameter space. We also show how the theory can
be extended for seasonal ARFIMA models.
Chapter 3 also considers the two step procedure, which in this context amounts to the application of the LW T SE where an initial estimate of the long memory is obtained by Local Whittle
and is then used to fractionally ﬁlter the series. The short memory ARMA(p, q) model is then
estimated by the minimization of the conditional sum of squares (CSS), which has been shown by
Robinson (2006) and Granger and Newbold (1996) among others to be a valid procedure. Our
simulations show that this procedure can also work quite well, but generally not as well as the
previously discussed direct estimation of ARFIMA by MLE.
However, the above procedure arguably suffers from the “generated regressor” problem and
following recent work on factor models in VARs, it seems possibly desirable to also include a
small sample adjustment to the BIC which reﬂects the sub optimal rate of convergence of the Local
Whittle estimate to the true value of the long memory parameter. We also report some simulations
to assess the effectiveness of this procedure. There are some cases when this appears to be a useful
procedure.
Chapter 4 provides a detailed set of applications of the model selection with MLE and LW T SE
to real economic and ﬁnancial time series.

ACKNOWLEDGMENTS

I would like to gratefully thank Professor Richard T. Baillie for his guidance, understanding, support and patience. His mentorship was paramount in completing this dissertation. I would like
to thank Professor Christine Amsler, Professor Robert J. Myers, and Professor Peter Schmidt for
their guidance and advice. I also would like to thank Professor Hira L. Koul and Professor Emma
Iglesias for helping me at the beginning of this dissertation. I also want to thank Ruth Mendel for
helping me prepare for the dissertation defense.
The ﬁnancial support from both Faculty of Economics, Thammasat University, and Thammasat University for pursuing graduate study is gratefully acknowledged.
Finally, and most importantly, I would like to deeply thank my wife Prae and my daughter
Prim for their support, encouragement, patience and love. I thank my parents, patents-in-law,
sister, and sister-in-law for their support and encouragement.

iv

TABLE OF CONTENTS

LIST OF TABLES . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

vi

LIST OF FIGURES . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

ix

CHAPTER 1 INTRODUCTION
Appendix 1A: Figures . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

1
7

CHAPTER 2 PREDICTION FROM ARFIMA MODELS
2.1 Introduction . . . . . . . . . . . . . . . . . . . . . .
2.2 Prediction from the ARFIMA Model . . . . . . . . .
2.3 Estimation of ARFIMA Models . . . . . . . . . . . .
2.4 Practical Issues and Simulation . . . . . . . . . . . .
2.5 Conclusion . . . . . . . . . . . . . . . . . . . . . .
Appendix 2A: The Prediction Weight Formula . . . .
Appendix 2B: Tables . . . . . . . . . . . . . . . . .
CHAPTER 3
3.1
3.2
3.3
3.4
3.5
3.6

INFORMATION CRITERIA METHODS
ARFIMA MODELS
Introduction . . . . . . . . . . . . . . . . . . . . . .
Information Criteria for Model Selection . . . . . . .
Simulation Study . . . . . . . . . . . . . . . . . . .
Modiﬁed Information Criteria for LW T SE . . . . . .
Inﬂation Example . . . . . . . . . . . . . . . . . . .
Conclusion . . . . . . . . . . . . . . . . . . . . . .
Appendix 3A: Tables . . . . . . . . . . . . . . . . .

.
.
.
.
.
.
.

.
.
.
.
.
.
.

.
.
.
.
.
.
.

.
.
.
.
.
.
.

.
.
.
.
.
.
.

.
.
.
.
.
.
.

.
.
.
.
.
.
.

.
.
.
.
.
.
.

.
.
.
.
.
.
.

.
.
.
.
.
.
.

.
.
.
.
.
.
.

.
.
.
.
.
.
.

.
.
.
.
.
.
.

.
.
.
.
.
.
.

.
.
.
.
.
.
.

.
.
.
.
.
.
.

9
9
10
12
15
18
20
22

.
.
.
.
.
.
.

30
30
31
35
37
39
40
43

.
.
.
.
.
.
.

70
70
70
71
72
74
75
77

FOR SELECTION OF
.
.
.
.
.
.
.

.
.
.
.
.
.
.

.
.
.
.
.
.
.

.
.
.
.
.
.
.

.
.
.
.
.
.
.

.
.
.
.
.
.
.

.
.
.
.
.
.
.

.
.
.
.
.
.
.

.
.
.
.
.
.
.

.
.
.
.
.
.
.

.
.
.
.
.
.
.

.
.
.
.
.
.
.

.
.
.
.
.
.
.

.
.
.
.
.
.
.

.
.
.
.
.
.
.

METHODS FOR SELECTION OF ARFIMA MODELS: APPLICATIONS
Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Application to Real Output . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Application to Inﬂation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Application to Realized Volatility of Exchange Rate . . . . . . . . . . . . . . . .
Application to Interest Rates . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Appendix 4A: Tables . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

CHAPTER 4
4.1
4.2
4.3
4.4
4.5
4.6

CHAPTER 5

CONCLUSION

85

BIBLIOGRAPHY

87

v

LIST OF TABLES

2.1

Biases of Different Predictors, ARFIMA(1, d, 0) with d = 0.4, ϕ = 0.8, and T = 200 . 22

2.2

Biases of Different Predictors, ARFIMA(1, d, 0) with d = 0.4, ϕ = 0.9, and T = 200 . 22

2.3

Biases of Different Predictors, ARFIMA(1, d, 0) with d = 0.4, ϕ = 0.8, and T = 1, 000

23

2.4

Biases of Different Predictors, ARFIMA(1, d, 0) with d = 0.4, ϕ = 0.9, and T = 1, 000

23

2.5

MSEs of Different Predictors, ARFIMA(1, d, 0) with d = 0.4, ϕ = 0.8, and T = 200 . . 24

2.6

MSEs of Different Predictors, ARFIMA(1, d, 0) with d = 0.4, ϕ = 0.9, and T = 200 . . 24

2.7

MSEs of Different Predictors, ARFIMA(1, d, 0) with d = 0.4, ϕ = 0.8, and T = 1, 000

25

2.8

MSEs of Different Predictors, ARFIMA(1, d, 0) with d = 0.4, ϕ = 0.9, and T = 1, 000

25

2.9

MSEs of Different Predictors, ARFIMA(0, d, 1) with d = 0.4, θ = 0.8, and T = 200 . . 26

2.10 MSEs of Different Predictors, ARFIMA(2, d, 1) with d = 0.4, ϕ1 = −0.5, ϕ2 = −0.9,
θ = 0.6, and T = 200 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26
2.11 Biases of Different Predictors, ARFIMA(2, d, 0) process with ϕ1 = 0.08, ϕ2 = 0.9025,
d = 0.2, and T = 400 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 27
2.12 Biases of Different Predictors, ARFIMA(2, d, 0) process with ϕ1 = 0.08, ϕ2 = 0.9025,
d = 0.4, and T = 400 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 27
2.13 Biases of Different Predictors, ARFIMA(2, d, 0) process with ϕ1 = 0.08, ϕ2 = 0.9025,
d = 0.49, and T = 400 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 28
2.14 MSEs of Different Predictors, ARFIMA(2, d, 0) process with ϕ1 = 0.08, ϕ2 = 0.9025,
d = 0.2 and T = 400 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 28
2.15 MSEs of Different Predictors, ARFIMA(2, d, 0) process with ϕ1 = 0.08, ϕ2 = 0.9025,
d = 0.4 and T = 400 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29
2.16 MSEs of Different Predictors, ARFIMA(2, d, 0) process with ϕ1 = 0.08, ϕ2 = 0.9025,
d = 0.49 and T = 400 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29
3.1

Percentage of Model Selected, DGP ARFIMA(1, d, 0), ϕ1 = 0.8, d = 0.2, T = 400 . . 43

vi

3.2

Percentage of Model Selected, DGP ARFIMA(1, d, 0), ϕ1 = 0.8, d = 0.4, T = 400 . . 44

3.3

Percentage of Model Selected, DGP ARFIMA(1, d, 0), ϕ1 = 0.8, d = 0.8, T = 400 . . 45

3.4

Percentage of Model Selected, DGP ARFIMA(1, d, 0), ϕ1 = 0.9, d = 0.4, T = 400 . . 46

3.5

Percentage of Model Selected, DGP ARFIMA(0, d, 1), θ1 = 0.9, d = 0.4, T = 400 . . 47

3.6

Percentage of Model Selected, DGP ARFIMA(1, d, 1), ϕ1 = 0.5, θ1 = −0.7, d = 0.4,
T = 400 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 48

3.7

Percentage of Model Selected, DGP ARFIMA(2, d, 0), ϕ1 = 0.25, ϕ2 = 0.375, d = 0.4,
T = 400 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 49

3.8

Percentage of Model Selected, DGP ARFIMA(2, d, 0), ϕ1 = 0.25, ϕ2 = 0.375, d = 0.4,
T = 1, 000 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 50

3.9

Percentage of Model Selected, DGP ARFIMA(2, d, 0), ϕ1 = 1, ϕ2 = −0.5, d = 0.4,
T = 400 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 51

3.10 Percentage of Model Selected, DGP ARFIMA(2, d, 1), ϕ1 = 0.25, ϕ2 = 0.375 θ1 =
1/3, d = 0.4, T = 400 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 52
3.11 Percentage of Model Selected, DGP ARFIMA(2, d, 1), ϕ1 = 1, ϕ2 = −0.5, θ1 = 1/3,
d = 0.4, T = 400 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 53
3.12 Percentage of Model Selected, DGP ARFIMA(2, d, 1), ϕ1 = 1.4, ϕ2 = −0.6 θ1 = −0.8,
d = 0.4, T = 400 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 54
3.13 Percentage of Model Selected, DGP ARFIMA(2, d, 1), ϕ1 = −0.5, ϕ2 = −0.9, θ1 =
−0.6, d = 0.4, T = 400 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 55
3.14 Percentage of Model Selected, DGP ARFIMA(3, d, 0), ϕ1 = −13/20, ϕ2 = 3/5, ϕ3 =
27/80, d = 0.4, T = 400 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 56
3.15 Percentage of Model Selected, DGP ARFIMA(3, d, 0), ϕ1 = −0.56163, ϕ2 = 0.57986,
ϕ3 = −0.92705, d = 0.4, T = 400 . . . . . . . . . . . . . . . . . . . . . . . . . . . . 57
3.16 Percentage of Model Selected, DGP ARFIMA(4, d, 0), ϕ1 = 5/4, ϕ2 = −3/8, ϕ3 =
−1/4, ϕ4 = 3/16, d = 0.4, T = 400 . . . . . . . . . . . . . . . . . . . . . . . . . . . 58
3.17 Percentage of Model Selected, DGP ARFIMA(4, d, 0), ϕ1 = −9/20, ϕ2 = 73/100,
ϕ3 = 87/400, ϕ4 = 27/400, d = 0.4, T = 400 . . . . . . . . . . . . . . . . . . . . . . 59
3.18 Percentage of Model Selected, DGP ARFIMA(4, d, 0), ϕ1 = 1.350, ϕ2 = −0.7, ϕ3 =
0.4, ϕ4 = −0.31, d = 0.4, T = 400 . . . . . . . . . . . . . . . . . . . . . . . . . . . . 60

3.19 Percentage of Model Selected, DGP ARFIMA(4, d, 0), ϕ1 = 1.350, ϕ2 = −0.7, ϕ3 =
0.4, ϕ4 = −0.31, d = 0.3, T = 400 . . . . . . . . . . . . . . . . . . . . . . . . . . . . 61
3.20 Percentage of Model Selected, DGP ARFIMA(4, d, 1), ϕ1 = 5/4, ϕ2 = −3/8, ϕ3 =
−1/4, ϕ4 = 3/16, θ1 = 1/3,d = 0.4, T = 400 . . . . . . . . . . . . . . . . . . . . . . 62
3.21 Modiﬁed BIC and HQ for LW T SE, Percentage of Model Selected, DGP
ARFIMA(2, d, 0) with ϕ1 = 1,ϕ2 = −0.5, d = 0.4 and T = 400 . . . . . . . . . . . . . 63
3.22 Modiﬁed BIC and HQ for LW T SE, Percentage of Model Selected, DGP
ARFIMA(2, d, 0) with ϕ1 = 1,ϕ2 = −0.5, d = 0.4 and T = 1, 000 . . . . . . . . . . . . 64
3.23 Modiﬁed BIC and HQ for LW T SE, Percentage of Model Selected, DGP
ARFIMA(1, d, 0) with ϕ1 = 0.8, d = 0.4 and T = 400 . . . . . . . . . . . . . . . . . . 65
3.24 Modiﬁed BIC and HQ for LW T SE, Percentage of Model Selected, DGP
ARFIMA(4, d, 0), ϕ1 = 5/4, ϕ2 = −3/8, ϕ3 = −1/4, ϕ4 = 3/16, d = 0.4, and T = 1, 000 66
3.25 Modiﬁed BIC and HQ for LW T SE, Percentage of Model Selected, DGP
ARFIMA(4, d, 0), ϕ1 = 1.350, ϕ2 = −0.7, ϕ3 = 0.4, ϕ4 = −0.31, d = 0.4, and T = 1, 000 67
3.26 Estimated ARFIMA Models for U.S. Inﬂation . . . . . . . . . . . . . . . . . . . . . . 68
3.27 Estimated ARFIMA Models for U.S. Inﬂation . . . . . . . . . . . . . . . . . . . . . . 69
4.1

Model selection for U.S. Real Output . . . . . . . . . . . . . . . . . . . . . . . . . . . 77

4.2

Estimated ARFIMA Models for U.S. Real Output . . . . . . . . . . . . . . . . . . . . 78

4.3

Model Selection for G-7 Monthly Inﬂation . . . . . . . . . . . . . . . . . . . . . . . . 79

4.4

Estimated ARFIMA Models for G-7 Monthly Inﬂation . . . . . . . . . . . . . . . . . 80

4.5

Model Selection for Realized Volatility of Exchange Rates . . . . . . . . . . . . . . . 81

4.6

Estimated ARFIMA Models for Realized Volatility of Exchange Rates . . . . . . . . . 81

4.7

Model Selection for Interest Rates . . . . . . . . . . . . . . . . . . . . . . . . . . . . 82

4.8

Model Selection for Interest Rate Spreads . . . . . . . . . . . . . . . . . . . . . . . . 82

4.9

Estimated ARFIMA Models for U.S. Interest Rates . . . . . . . . . . . . . . . . . . . 83

4.10 Estimated ARFIMA models for U.S. Interest Rate Spreads . . . . . . . . . . . . . . . 84

LIST OF FIGURES

1.1

1.2

Inﬁnite AR Representations, Inﬁnite MA Representations, and Theoretical MSE of Predictors, ARFIMA(1, d, 0) process with d = 0.4 and ϕ = 0.8 . . . . . . . . . . . . . . .

7

Inﬁnite AR Representations, Inﬁnite MA Representations, and Theoretical MSE of Predictors, ARFIMA(1, d, 0) process with d = 0.4 and ϕ = 0.9 . . . . . . . . . . . . . . .

8

ix

Chapter 1

INTRODUCTION

There is a burgeoning literature on the theoretical and statistical aspects of long memory models
in economic and ﬁnancial time series; where the series possess hyperbolically decaying autocorrelations. The work of Granger (1980), Granger and Joyeux (1980) and Hosking (1981) has been
particularly inﬂuential in this regard. These models have proved extremely relevant for many time
series in ﬁnance and macroeconomics; see Baillie (1996) and Henry and Zaffaroni (2003) for a
survey of some of the literature.
A time series is said to have long memory, (or long-range dependence or strong dependence),
if its autocorrelations decay to zero at very slow hyperbolic rates. For a more thorough and detailed
deﬁnitions of long memory see Baillie (1996) and Beran (1994). The univariate long memory,
fractionally autoregressive moving average (ARFIMA) models have been introduced by Granger
and Joyeux (1980) and Hosking (1981). A fractionally integrated or long memory process generates hyperbolic rates of decay in its autocorrelation function and impulse response weights. A
univariate time series process, yt , which is said to be fractionally integrated of order d, or I(d), if
(1 − L)d yt = ut ,

t = 1, ...T

(1.1)

where L is the lag operator and ut is a short memory, I(0) process. The d parameter represents the
degree of “long memory”, or persistence in the series. If −0.5 < d < 0.5, the process is stationary
and invertible. If 0.5 < d < 1, the process has an inﬁnite variance but it is mean reverting. If the
short memory component is represented as a stationary and invertible ARMA(p, q) process, then
equation (1.1) becomes the well-known ARFIMA(p, d, q) model,

ϕ (L)(1 − L)d yt = θ (L)εt

(1.2)

where L is the lag operator, and E(εt ) = 0, E(εt2 ) = σ 2 , E(εt εs ) = 0, s ̸= t. Also, the lag polynoq

p

mial operators are deﬁned as ϕ (L) = 1 − ∑ j=1 ϕ j L j and θ (L) = 1 + ∑ j=1 θ j L j . For a stationary
1

long memory ARFIMA(p, d, q) process with −0.5 < d < 0.5, the ARFIMA model can be represented by the AR(∞) and MA(∞) representations of

π (L)yt = εt ,
where

π (L) = 1 −

(1.3)

∞

∑ π j L j = ϕ (L)(1 − L)d θ (L)−1

j=1

and,
yt = ψ (L)εt ,
where

ψ (L) = 1 +

(1.4)

∞

∑ ψ j L j = θ (L)ϕ (L)−1(1 − L)−d .

j=1

It is also assumed that all the roots of ϕ (L) and θ (L) lie outside the unit circle. For large lags
k, these coefﬁcients decay at the very slow hyperbolic rates of ψk ∼ c1 kd−1 and similarly the
inﬁnite autoregressive representation coefﬁcients decay at the rate of c2 k−d−1 and autocorrelation
coefﬁcients at the rate of c3 k2d−1 , where c1 , c2 and c3 are constants. The hyperbolic decay that is
generated by such a process is known as the ‘Hurst effect’, after Hurst (1956), who ﬁrst discovered
the phenomenon in hydrological time series data. The full impact of these different rates of decay
can be visually seen in Figures 1.1 and 1.2, where the inﬁnite AR weights decline far more quickly
that the Wold decomposition, or inﬁnite MA weights. This has relevance for the construction of
predictions from these models, as compared with Impulse Response Weights (IRW s).
Much applied work, such as Barkoulas and Baum (2006) for monetary indices, and Doornik
and Ooms (2004) for US and UK inﬂation, have stressed the usefulness of ARFIMA models for
forecasting macroeconomic and ﬁnancial time series. Because of the possibility of misspeciﬁcation of ARFIMA models, there is a burgeoning literature on estimating the single long memory
parameter using semi parametric estimators (SPE), such as the GPH’s local periodogram estimator, proposed by Geweke and Porter-Hudak (1983) and the Local Whittle (LW ) estimation,
proposed by Kunsch (1987) and Robinson (1995).

2

The standard methodology for constructing dynamic multi step ahead predictions from any
dynamic model is to ﬁrst derive the theoretical minimum MSE predictor with known parameter
values. The feasible minimum MSE predictor is then obtained by substituting estimated parameters for the theoretical unknown parameters. Of course, this approach assumes a known parametric
model, whereas it may be desirable to use some neural net or non parametric approach; see Diebold
and Mariano (1995). In our chapter 2 we continue this approach with ARFIMA models where the
obvious method of using MLEs of the parameters are then substituted into the minimum MSE
predictor with known theoretical values.
However, in recent years the dominant paradigm in estimation of long memory models is
to use the semi parametric estimators of the long memory parameter e.g. the Local Whittle of
Robinson (1995). It is not quite clear how to use this in ﬁnding the optimum predictor since the
SPE, or the Local Whittle estimator, only gives us information on one parameter, and a typical
model may also involve many short run parameters, such as ARMA and nonlinear terms such as
Exponential Smooth Transition Autoregressive (ESTAR) or Logistic Smooth Transition Autoregressive (LSTAR). Therefore we use the recent suggestion of Baillie and Kapetanios (2007) and
Baillie and Kapetanios (2008) to use a Local Whittle Two Step Estimator, or LW T SE. Their
suggestions have been in terms of ﬁrst testing for nonlinearity within a long memory process and
second to estimate a long memory non linear autoregression with smooth transition regimes; or
FI − NLAR − ESTAR model. Both approaches compare the one step MLE approach to testing and
to the LW T SE. Chapter 2 of this dissertation extends this type of investigation to the prediction
situation and we compare the MLE and the LW T SE used for substituting into the theoretical minimum MSE predictor. We consider various representations for the minimum mean square errors
(MSE) predictor with known parameters. We then conduct a detailed simulation study when the
true parameters are replaced by estimates that are either based on MLE or LW T SE. The predictors
are compared in terms of their bias and MSE for different prediction horizons. In general the predictors based on MLE are found to be sometimes only slightly superior to the predictors based on
the LW T SE method. Since LW T SE estimates depend on the bandwidth selection, the “optimal”

3

bandwidth LW T SE is also considered. In general the “optimal” bandwidth LW T SE brings about
worse predictions than the LW T SE based on the conventional one of using an agnostic bandwidth
of the square root of the sample size. Although the MLE based predictor is generally better, the
LW T SE nevertheless is better than expected given its rather poor performance on estimating IRW s
and short memory parameters as documented by Baillie and Kapetanios (2009). We also present
some results on prediction where the MLE is based on a mis-speciﬁed models. A summary of the
results then concludes chapter 2.
Chapter 3 is concerned with a completely different problem; namely the total model speciﬁcation of an ARFIMA model. There are now many techniques available for determining if a
univariate time series has long memory; and in many cases the initial use of SPEs like GPH or
Local Whittle may conform the presence of long memory. While the basic long memory ARFIMA
model is widely used in time series analysis, the problem of selection of the ARMA orders of the
model appears to have been unresolved. In Chapter 3, we show the Bayesian (BIC) and HannanQuinn (HQ) information criteria are consistent model selection procedures for the class of long
memory ARFIMA models when the models are estimated by Quasi-MLE (QMLE). Very surprisingly, it seems that this very basic problem has never been resolved in time series analysis before.
Our end result is that the BIC is consistent and is an easily implemented procedure for model selection of an ARFIMA(p, d, q) model. The implementation of the technique involves selection of
the maximum orders of the short memory process that the investigator is willing to consider. In our
simulation study we restrict the maximum values of p and q to be 8, which involves 81 different
ARFIMA models to be estimated by MLE and then assessed from a model selection perspective. In
our detailed simulation we consider data generating processes up to orders of an ARFIMA(4, d, 1)
model. The BIC works well for some low order models; but has a mixed performance on high
order models with p = 4 and is quite dependent on the part of the true evidence indicates that the
BIC based on QMLE works very well for low order models but can be rather mixed for high order
models.
The initial focus of this chapter is thus on comparing the IC where all the models are estimated

4

by MLE. We also consider a modiﬁed approach where the AIC, BIC and HQIC are all based on
the LW T SE. Although this seems on face value a rather strange method, it does provide a role for
the popular LW T SE. In many of the designs the use of the LW T SE does surprisingly well. One
interesting aspect of using the LW T SE for the purpose of ICs is that we are in the domain of a
generated regressor problem and we investigate the possibility of introducing an adjustment that
takes into account the fact that the IC should be appropriately modiﬁed. Some simulation evidence
suggests that this may provide an improvement over the traditional IC with LW T SE.
Moving ahead, chapter 4 is concerned with some illustrations of the previous methodology.
In particular, we show how the model selection methods can be used to select ARFIMA models
for the very seasonal monthly US CPI inﬂation series. The applications of model selection criteria
are compared with MLE and LW T SE are considered in Chapter 4. We examine U.S. real output,
G-7 inﬂation, U.S. interest rates and spreads, and daily volatility of exchange rate. Our empirical
results conﬁrm long memory features found in literature. For model selection issue, the BIC with
MLE estimators, generally, selects the parsimonious model. The model selection and degree of
persistence from MLE mostly differ from ones from LW estimator.

5

APPENDIX

6

Appendix 1A: Figures
Figure 1.1: Inﬁnite AR Representations, Inﬁnite MA Representations, and Theoretical MSE of
Predictors, ARFIMA(1, d, 0) process with d = 0.4 and ϕ = 0.8

Key: Author’s calculation. Dash line represents inﬁnite AR representation coefﬁcients. Solid line
represents inﬁnite MA representation coefﬁcients. Dot line represents theoretical MSE (right axis).

7

Figure 1.2: Inﬁnite AR Representations, Inﬁnite MA Representations, and Theoretical MSE of
Predictors, ARFIMA(1, d, 0) process with d = 0.4 and ϕ = 0.9

Key: See Figure 1.1.

8

Chapter 2

PREDICTION FROM ARFIMA MODELS

2.1 Introduction
A currently important research issue concerns the estimation of the long memory parameter in long
memory, ARFIMA models. The method of semi parametric estimation (SPE) of the long memory
parameter has been popular in the theoretical literature with the Local Whittle (LW ) estimator
emerging as the most widely recommended; in particular see Robinson (1995), Dalla, Giraitis,
and Hidalgo (2005) and others. Also, Phillips (2007), Phillips and Shimotsu (2004) and many
others have provided results on further extensions and reﬁnements. The excessive attention to just
one parameter in the model building, or research program of an applied econometrician has been
questioned by Baillie and Kapetanios (2008), Baillie and Kapetanios (2009) and others. They
suggest that an empirical investigator is typically interested in a wider range of issues that will
generally include (i) constructing a model involving both short and long memory components, (ii)
estimating Impulse Response Weights (IRW s) and (iii) making predictions over short and long
horizons. In particular Baillie and Kapetanios (2009) show that the SPE Local Whittle (LW )
estimator of the long memory parameter in a univariate time series is shown to perform poorly
in the presence of persistent short memory component. These problems with the LW estimator
extend to the estimation of short run parameters and Impulse Response Weights (IRW s) and the
difﬁculties remain even when optimal bandwidths, or local polynomial Whittle methods are used.
Furthermore, Baillie and Kapetanios (2009) have suggested an alternative time domain SPE of
estimating a high order autoregressive approximation, which turns out to have desirable properties
for estimating IRW s with the method also being relevant for certain non-stationary processes.
This chapter considers the equally important practical issue of the impact of estimation strategy on the properties of predictions from a long memory process. Hence this chapter brings to9

gether two previously unconnected areas of the literature; namely the choice of using MLE or LW
estimation procedures and the impact on prediction.
The plan of the rest of this chapter is as follows; the next section brieﬂy reviews the form
of predictors from the ARFIMA, which is the most basic long memory model. Section 2.3 reviews the SPE estimation methodology, which is based on using the LW as a ﬁrst step and then
deriving the estimate of the short memory parameter(s) as a second step. Hence the complete estimator is referred to as the Local Whittle Two Step Estimator, or LW T SE. These estimates are
then used to replace the unknown parameters in the theoretical minimum MSE predictor. Section
2.4 discusses the practical issues involved in constructing predictors from a sample and reports the
results of a simulation study. The results indicate the difﬁculties in choosing the bandwidth for
the LW estimator. Several different LW estimators are considered based on bandwidth considerations. The predictors based on “optimal” bandwidths generally perform poorly in the simulation
designs. Their performance deteriorates with the degree of persistence of the short memory AR
components. While the MLE is expected to produce superior forecasts in terms of MSE, the extent
of the improvement is surprising.
Some additional results of particular interest concern the predictors when the estimation of
the model is mis-speciﬁed. This situation should conceptually be ideal for the LW T SE since it is
designed to be robust to ignorance of short run dynamics. However, the predictor based on MLE
from a mis-speciﬁed model is found to substantially outperform the predictor from the LW T SE for
all lead times up to 40 periods ahead. Further details are given in section 2.4. The chapter ends
with a short conclusions section.

2.2 Prediction from the ARFIMA Model
Several previous articles have examined issues concerning prediction from ARFIMA models. The
s step ahead predictor formed at forecast origin t is denoted by yt,s , and the prediction error and

10

MSE with known parameters are given by
et,s = yt+s − yt,s =

s−1

∑ ψ j εt+s− j

j=0

and
Var(et,s ) = σ 2

s−1

∑ ψ2
j

j=0

respectively. Some important results on the long run properties of predictors from ARFIMA models are due to Diebold and Lindner (1996), who showed that for I(d) where −1/2 < d < 1/2,
( )
( )
2
then Var en,s = O(1) and lims→∞ Var et,s = σε ∑∞ ψ 2 = Var(yt ). Moreover, for the non
j=0 j
stationary case, where 1/2 < d < 3/2, then the Var(et,s ) = O(s2d−1 ), which is undeﬁned as the
forecast horizon gets large. Hence prediction from these models tends to be dominated at high forecast horizons by the hyperbolic decay that is evident in the theoretical optimal forecasts. At low
forecast horizons the impact of short memory components adds to the complexity of the forecasts.
The practical calculation of the optimum minimum MSE predictor is not as straightforward
as for ARMA models. The ﬁrst proposed method was to use a truncated version of the inﬁnite
autoregressive representation; see Granger and Joyeux (1980) and Geweke and Porter-Hudak
(1983). Also, Peiris (1987) and Peiris and Perera (1988) discussed computationally convenient
ways for calculating predictions from the autoregressive representation. Crato and Ray (1996)
have found that AIC is biased towards selecting low order AR models. However, Poskitt (2007)
has analyzed information criterion to determine the lag of autoregressive model.
An alternative methodology is to use the state-space representation, of Chan and Palma
(1998); although this can be computationally quite complicated. A further alternative predictor
is the partial moving average representation used in the context of regression models with ARMA
disturbances by Baillie (1980) and Yamamoto (1981). Then ARFIMA model can be expressed as
the partial moving average expansion
s−1

yt+s =

∑ ψ j εn+s− j + {1 − µ (L)π (L)}L−syt ,

j=0

11

(2.1)

where µ (L) = ∑s−1 ψ j L j . Equation (2.1) can also be written as
j=0
s−1

j=0

yt+s =

∞

j=0

∑ ψ j εt+s− j + ∑ τs j yt− j

(2.2)

where τs j is the weight placed on an observation j periods ago when forming the s step ahead
predictor. The prediction weights can be shown to be
{

τs j =

s−1

}

∑ πk+ j ψs−k+ j

+ πs+ j

(2.3)

k=1

where π j and ψ j are deﬁned in (1.3) and (1.4) respectively. A brief derivation of the above results
is given in the Appendix 2A. The s step ahead predictor at time t is then given by
∞

yt,s =

∑ τs j yt− j .

(2.4)

j=0

We now consider the issues of parameter estimation before returning to the prediction problem
in practice when parameters are unknown and have to be replaced by their estimated values.

2.3 Estimation of ARFIMA Models
Given the substantial recent econometric literature on semi-parametric estimation of the long memory parameter, see, e.g., Phillips and Shimotsu (2006) and Phillips (2007), this study also considers an alternative two step estimation procedure. There is now a wide variety of semi-parametric
methods for the estimation of d and this study uses Local Whittle (LW ), which is proposed by Kunsch (1987) and Robinson (1995), and known to have desirable properties; see Robinson (1995),
Dalla, Giraitis, and Hidalgo (2005) and Phillips and Shimotsu (2006). The LW estimate of d is
obtained by minimizing the objective function
[
]
1 m 2d
2d m
ω j I(ω j ) −
ln
∑
∑ ln(ω j )
m j=1
m j=1

(2.5)

iω t 2
1
with respect to d, where I(ω j ) is the periodogram given by I(ω j ) = 2π T ∑T yt e j . An imj=1

portant issue concerns the choice of bandwidth, m. A relatively agnostic and often chosen value is

12

to choose m = T 0.5 . Methods for determining the optimal bandwidth in the sense of MSE reduction have been proposed by Henry and Robinson (1996), Delgado and Robinson (1996), Henry
(2001), Andrews and Guggenberger (2003), Andrews and Sun (2004), Phillips and Shimotsu
(2006) and the references therein. From Henry (2001), the optimal bandwidth, for −0.5 < d < 0.5,
is

(
mLW =

)
3 4/5 ∗ d −2/5 4/5
T
ν +
4π
12

(2.6)

′′
where ν ∗ = f ∗ (0)/2 f ∗ (0) and f is the spectral density function of the long memory process.

This study uses simulated data which allows ν ∗ to be analytically calculated. In particular, for the
ARFIMA(1, d, 0) model, it can be shown that ν ∗ = ϕ1 (1 − ϕ1 )−2 . For a known value of d, the
series can be fractionally ﬁltered to obtain
t−p

ut = yt −

∑ πl (d)yt−l ,

l=1

where (1 − L)d yt = yt − ∑∞ πl (d)yt−l . Given the LW estimate of d denoted as dˆLW , then the
l=1
Feasible Fractionally Filtered (FFF) series is
t−p

ut = yt −
ˆ

ˆ
∑ πl (dˆLW )yt−l .

(2.7)

l=1

The ARMA(p, q) parameters of the original ARFIMA(p, d, q) process in equation (1.2) are then
estimated by minimizing the conditional sum of squares. Further discussion of this technique and
its properties can be found in Robinson (2006) and Baillie and Kapetanios (2009).
While there is a large literature on semi-parametric estimation of d, there is relatively little consideration of subsequent estimation of the remaining parameters in the model. The Local
Whittle Two Step Estimator (LW T SE), approach considered in this study is to fractionally ﬁlter
the observed series yt using the inﬁnite AR representation of yt , given by
t−p

ut = yt −

∑ πl (d)yt−l ,

(2.8)

l=1

where d is replaced by its semi-parametric estimate. This generates residuals denoted by ut . The
ˆ
next stage is to estimate an ARMA model for the ut series, by minimizing the CSS. The small samˆ
ple properties of such an estimation strategy with the correct speciﬁcation, as an alternative to the
13

one-step MLE, are considered in Baillie and Kapetanios (2009) which is concerned with a Monte
Carlo study. Baillie and Kapetanios (2009) also provide consistency and a rate of convergence of
second-step parameters in the LW T SE as
(
)
ˆ
β dˆLW − β0 (d0 ) = O p (m−1/2 )
(
)
ˆ
where β = (ϕ1 , ..., ϕ p , θ1 , ..., θq )′ , β0 (d0 ) denote the true parameter values, and β dˆLW be the
ˆ
LW T SE of β based on the fractionally ﬁltered series ut .
The more direct approach to estimating the ARFIMA(p, d, q) process is to assume the innovations to be NID(0, σ 2 ), then the Gaussian likelihood is numerically maximized with respect to
the complete vector of parameters ϑ . Under these conditions, Fox and Taqqu (1986) have shown
the asymptotic distribution of the MLE to be
(
)
ˆ
T 1/2 ϑ − ϑ 0 → N{0, A(ϑ 0 )−1 }
where ϑ 0 denotes the true value of the vector of parameters, and where A(ϑ 0 ) is the information
matrix. Hosoya (1997) has shown, under the condition that non-Gaussian innovations satisfy mild
mixing and moment conditions, the asymptotic distribution of the MLE to be
(
)
ˆ
T 1/2 ϑ − ϑ 0 → N{0, A(ϑ 0 )−1 B(ϑ 0 )A(ϑ 0 )−1 }
where B(ϑ 0 ) is the outer product gradient, evaluated at the true parameter values ϑ 0 . Recently,
Baillie and Kapetanios (2008) have extended this analysis to include the estimation of long memory models with nonlinear autoregressive structures. The computation of such models is achieved
by the minimization of the conditional sum of squares (CSS).
It should be noted that the approximate time domain MLE approach has been successfully
applied in many extensions involving far more complicated models which include a long memory
parameter. This includes for example models with long memory and nonlinearity in the conditional
mean; see van Dijk, Frances, and Paap (2002) and Baillie and Kapetanios (2008); long memory
GARCH models of Baillie, Bollerslev, and Mikkelsen (1996), long memory stochastic volatility
models of Breidt, Crato, and de Lima (1998), long memory models with GARCH innovations,
multivariate long memory models, see Jensen (2009) and others.
14

2.4 Practical Issues and Simulation
Assuming the correct speciﬁcation of the model, the practical predictors are formed by inserting
ˆ
parameter estimates into the minimum MSE predictor to obtain τs j . Hence the feasible predictor
with estimated parameters is

T −1

yt,s =
ˆ

ˆ
∑ τs j yt− j .

(2.9)

j=0

The purpose of this simulation is to assess importance of using LW T SE with agnostic bandwidth
choice, as compared to LW T SE with optimal bandwidth, as compared to AR(P) with P = ln(T )2 ,
as compared with MLE.
A natural approach for prediction with long memory is to use high order autoregressive
schemes. These methods can be based on an OLS projection on a lagged information set which is
repeated for each different forecast horizon; see Ray (1993), Crato and Ray (1996), Bhansali and
Kokoszka (2002), Doornik and Ooms (2004) and Barkoulas and Baum (2006). These studies
have produced rather mixed results for forecasting from long memory processes. For example,
Ray (1993) has noted the sensitivity of predictions to the order of approximation of the AR model
and also the sensitivity to the forecast horizon. Previous studies by Baillie and Kapetanios (2008)
and Baillie and Kapetanios (2009) have described the properties of MLE and the LW T SE for these
models.
The following simulation was designed to evaluate the relative performance of predictors
based on the LW T SE with agnostic bandwidth choice, the LW T SE with optimal bandwidth, the
AR(P), and compared with the benchmark MLE. A selection of the simulations that were performed are reported in the text and the reported results are indicative of the general ﬁndings.
Realizations of an ARFIMA(1, d, 0) process were generated for sample sizes of T = 200, and
T = 1, 000. The simulation designs are for values of the AR coefﬁcient given by ϕ = 0.8, ϕ = 0.90,

ϕ = 0.95 and ϕ = 0.99; and for d = 0.2 and d = 0.4. In each simulation design, 10, 000 replications were generated with predictions generated up to s = 1, 2, 3, ....40 periods ahead from equation
(2.9). The bias and MSE of the different predictions were computed for each predictor based on

15

the different estimation methods, and also for each one of the 40 period forecast horizons. These
results are compared to the true model with known parameter values plugged into predictor formula. This study does not use the theoretical asymptotic results on prediction MSE which have
been derived for ARMA models by Baillie (1980) and Yamamoto (1981), and which have been
extended to ARFIMA models by Katayama (2008).
In this chapter we deal with the analogous issue concerning prediction. Tables 2.1 and 2.2
report the biases from the various methods for designs with a sample size of T = 200, while tables
2.3 and 2.4 report analogous quantities for T = 1, 000. As can be seen, the forecast biases are
extremely reasonable for virtually all methods and prediction horizons.
Tables 2.5 and 2.6 present the prediction MSE for the sample size of T = 200. The ﬁrst column
gives the prediction lead time and the second column the theoretical prediction MSE derived form
knowledge of the true parameters in each design; see dot lines in Figures 1.1 and 1.2 for examples.
The MSE quantity was also computed from the predictor with the true theoretical values inserted
into the optimal prediction formula and reported in the third column of Tables 2.5 through 2.8. As
expected, these MSEs were extremely close to the true theoretical values. The rest of the table
gives the simulated MSEs from the MLE, AR(P) and LW based on the m = T 0.5 bandwidth and
also the LW based on the optimal bandwidth. In general the MLE compares well with the SPE
versions and the optimal bandwidth version of the LW is generally inferior to the simple LW . For
example, in Table 2.6, at 10 periods ahead, the MSE of MLE and LW with m = T 0.5 are 21.8
and 23.1, respectively. The performance of the optimal bandwidth LW deteriorates with increasing
persistence of the AR parameter. The prediction MSEs of the LW T SE markedly differ from those
of the true model for d = 0.4 and especially when ϕ = 0.9. In general the predictor based on
the MLE performs very well in the sense that its value is close to the MSE of the predictor with
known parameter values. The predictor based on the LW T SE is generally inferior with higher MSE
compared to that of the predictor based on the MLE. The predictor based on the AR(P) performs
poorly. For example, in Table 2.6, MSE at 40 period forecast from AR(P) is 42 percent higher
than theoretical MSE comparing to 2 percent from MLE. In general the relative performance of

16

the predictor based on the LW T SE deteriorates as the forecast lead time increases. Analogous
results for the larger sample size of T = 1, 000 are to be found in tables 2.7 and 2.8. In general the
performance of the predictors improves as the observations increases. We also consider the case of
ARFIMA(0, d, 1) process and ARFIMA(2, d, 1) process. The results in Tables 2.9 and 2.10 imply
that the performance of predictors from MLE are generally slightly better than LW T SE.
The above results are all predicated on the assumption that the order of the short memory component, i.e. an AR(1) process is known and that the unknown quantities are the parameter values.
This assumption is applied to both the LW T SE and the MLE and in this sense treats both methods
equally. In practice the order of say an ARMA(p, q) process for ut would have to be determined
before the parameter estimation method is implemented. In order to increase the realism of the
results a limited simulation was conducted where the order p of an autoregressive process was determined by Bayesian Information Criteria (BIC) for an ARFIMA(p, d, 0) model when using MLE
and also on the FFF series ut when using the LW T SE approach. Some interesting results were for
ˆ
when the short run dynamics of the ARFIMA model are mis-speciﬁed. This is a situation where
the MLE would be expected to potentially perform very poorly, and the LW to be superior since
it is robust to arbitrary short run dynamics. One simulation was based on the ARFIMA(2, d, 0)
process
(1 − L)d (1 − ϕ1 L − ϕ2 L2 )yt = εt
it can be shown after some algebra that

υ∗ =

(ϕ1 ϕ2 − ϕ1 − 4ϕ2 )
(1 − ϕ1 − ϕ2 )2

while the process
(1 − L)d (1 − 0.0800L − 0.9025L2 )yt = εt
has an autoregressive component with complex roots and which has a damping factor of
|−ϕ2 |1/2 = 0.98, which generates very slow decay it its autocorrelation function and impulse response weights with a sinusoidal pattern superimposed. In this exercise, we estimate
ARFIMA(1, d, 0) model for both the MLE and LW T SE. The results are presented in Tables 2.11
17

to 2.16. Again, the MLE, although based an a mis-speciﬁed model performs more satisfactorily
than the LW T SE for either agnostic or optimal bandwidth. The AR(P) performs very poorly comparing to the MLE and LW T SE. The results of prediction based on SPE are disappointing in
ARFIMA(1, d, 0) with high autoregressive coefﬁcients and misspeciﬁed cases. These suggest the
need for further work to be done on the prediction issue in SPE.

2.5 Conclusion
This chapter considers the effects on multi step predication of using semi parametric Local Whittle
estimators compared to MLE for long memory ARFIMA models. We consider various representations for the minimum MSE predictor with known parameters. We then conduct a detailed
simulation study for when true parameters are replaced by estimates. The predictor based on MLE
is found to be superior in MSE sense to the predictor based on two step Local Whittle estimation.
The “optimal” bandwidth Local Whittle estimator gives rise to worse predictions that the Local
Whittle using an agnostic bandwidth of the square root of the sample size. Although the Local
Whittle is theoretically robust to short run dynamics, it nevertheless gives rise to inferior predictions to a predictor using MLE from a mis-speciﬁed model. This provides doubt on the usefulness
of the Local Whittle estimator for the issue of prediction in long memory ARFIMA models, and
suggests further work is required on how to form short and medium term predictions based on semi
parametric estimates of the long memory parameter.

18

APPENDICES

19

Appendix 2A: The prediction weight formula
Any stationary time series yt has a Wold decomposition of the form yt = ∑∞ ψ j εt− j . When
j=0
considering the the formation of the optimum minimmum MSE predictor, it is convenient to use
the partial moving average expansion:
yt+s = µ (L)εt+s + ∂ (L)yt ,

(2.10)

where µ (L) = ∑s−1 ψ j L j .
j=0
Let consider the case of yt is the ARFIMA(p, d, q) model where ϕ (L)(1 − L)d yt = θ (L)εt . It
follows that
yt+s = µ (L)εt+s + ∂ (L)yt
= µ (L)θ (L)−1 ϕ (L)(1 − L)d yt+s + ∂ (L)yt
= {µ (L)θ (L)−1 ϕ (L)(1 − L)d + ∂ (L)Ls }yt+s ,
where

{

∂ (L) =
Or by deﬁning

φ ∗ (L) =

θ (L) − µ (L)ϕ (L)(1 − L)d
θ (L)

}
L−s .

}
{
d L−s ,
θ (L) − µ (L)ϕ (L)(1 − L)

the partial MA representation of the ARFIMA(p, d, q) model is
yt+s = µ (L)εt+s +
or

φ ∗ (L)
yt ,
θ (L)

s−1

∞

s−1

j=0

yt+s =

j=0

(2.11)

j=0

∑ ψ j εt+s− j + ∑ τs j yt− j = ∑ ψ j εt+s− j + τs(L)yt ,

(2.12)

where τs (L) = ∑∞ τs j L j . Note that τs j is the weight given to the (t − j) observation in the
j=0
calculation of the s step forecast.
Further, we consider this formula with the AR(1) model, yt = ϕ yt−1 + εt . It follows that

ψ j = ϕ j.

]
[
ϕ ∗ (L) = 1 − (1 + ϕ L + ϕ 2 L2 + ... + ϕ s−1 Ls−1 )(1 − ϕ L) L−s = ϕ s .
20

Hence yt+s = ∑s−1 ϕ j εt+s− j + ϕ s yt .
j=0
Next, we derive the explicit formula for the prediction weight.

Let consider the

ARFIMA(0, d, 0) model, (1 − L)d yt = εt . We can rewrite ARFIMA(0, d, 0) in AR(∞) and MA(∞)
representations as

∞

∑ π j L j ≡ π (L),

(1 − L)d = 1 −

j=1

and
(1 − L)−d = 1 +

∞

∑ ψ j L j ≡ ψ (L).

j=1

Since θ (L) = 1 and ϕ (L) = 1, then

τs

(L) = φ ∗ (L) =

{
}
d L−s = {1 − µ (L)π (L)} L−s
1 − µ (L)(1 − L)

(2.13)

By replacing π (L) and ψ (L) in equation (2.13), we can show that, in case of s = 1,

τ1, j = π1+ j

for

j = 0, 1, 2, ...,

(2.14)

in case of s = 2,

τ2, j = π1+ j ψ1+ j + π2+ j

for

j = 0, 1, 2, ...,

(2.15)

in case of s = 3,

τ3, j = π1+ j ψ2+ j + π2+ j ψ1+ j + π3+ j

for

j = 0, 1, 2

(2.16)

and so on. By the induction from equations (2.14) through (2.16), the prediction weights are
{
}

τs, j =

s−1

∑ πk+ j ψs−k+

+ πs+ j

(2.17)

k=1

for s = 1, 2, 3, ... and j = 0, 1, 2, ... . With the same argument, the formula in equation (2.17) is the
prediction weights for any ARFIMA(p, d, q) models.

21

Appendix 2B: Tables
Table 2.1: Biases of Different Predictors, ARFIMA(1, d, 0) with d = 0.4, ϕ = 0.8, and T = 200
Forecast
Horizon(s)
1
2
3
4
5
10
20
30
40

True
Parameter
value
-0.012
-0.009
-0.006
0.000
0.008
-0.019
-0.025
-0.028
-0.025

MLE

AR(P)

LW
m = T 0.5

LW
optimal m

-0.013
-0.010
-0.008
-0.002
0.007
-0.020
-0.025
-0.026
-0.024

-0.014
-0.008
-0.004
0.003
0.021
-0.012
-0.022
-0.018
-0.002

-0.011
-0.008
-0.005
0.001
0.010
-0.017
-0.024
-0.026
-0.024

-0.009
-0.007
-0.006
-0.001
0.007
-0.025
-0.035
-0.038
-0.036

Key: Number of replications is 10,000 and P = ln(T )2 ≈ 28 in the AR(P).
Table 2.2: Biases of Different Predictors, ARFIMA(1, d, 0) with d = 0.4, ϕ = 0.9, and T = 200
Forecast
Horizon(s)
1
2
3
4
5
10
20
30
40

True
Parameter
value
-0.008
0.003
0.013
0.031
0.032
0.054
0.045
0.043
0.004

MLE

AR(P)

LW
m = T 0.5

LW
optimal m

-0.008
0.002
0.012
0.029
0.030
0.047
0.029
0.023
-0.017

-0.005
0.009
0.016
0.038
0.044
0.089
0.071
0.068
0.021

-0.008
0.003
0.012
0.029
0.030
0.048
0.034
0.029
-0.012

-0.012
-0.004
0.003
0.016
0.013
0.009
-0.061
-0.156
-0.336

Key: See Table 2.1.

22

Table 2.3: Biases of Different Predictors, ARFIMA(1, d, 0) with d = 0.4, ϕ = 0.8, and T = 1, 000
Forecast
Horizon(s)
1
2
3
4
5
10
20
30
40

True
Parameter
value
-0.007
-0.005
-0.004
-0.004
-0.008
0.009
0.042
0.014
0.037

MLE

AR(P)

LW
m = T 0.5

LW
optimal m

-0.008
-0.006
-0.005
-0.006
-0.010
0.007
0.042
0.015
0.039

-0.007
-0.004
-0.006
-0.006
-0.012
0.008
0.036
0.000
0.023

-0.007
-0.005
-0.004
-0.005
-0.009
0.008
0.041
0.014
0.038

-0.007
-0.005
-0.003
-0.003
-0.007
0.011
0.044
0.017
0.040

Key: Number of replications is 10,000 and P = ln(T )2 ≈ 47 in the AR(P).

Table 2.4: Biases of Different Predictors, ARFIMA(1, d, 0) with d = 0.4, ϕ = 0.9, and T = 1, 000
Forecast
Horizon(s)
1
2
3
4
5
10
20
30
40

True
Parameter
value
-0.012
-0.015
-0.022
-0.006
-0.006
0.023
0.019
0.011
0.028

MLE

AR(P)

LW
m = T 0.5

LW
optimal m

-0.012
-0.016
-0.024
-0.008
-0.009
0.020
0.019
0.013
0.032

-0.011
-0.018
-0.024
-0.009
-0.008
0.031
0.037
0.031
0.045

-0.012
-0.015
-0.023
-0.007
-0.008
0.018
0.011
0.000
0.017

-0.011
-0.015
-0.023
-0.008
-0.010
0.012
0.002
-0.007
0.012

Key: See Table 2.3.

23

Table 2.5: MSEs of Different Predictors, ARFIMA(1, d, 0) with d = 0.4, ϕ = 0.8, and T = 200
Forecast
Horizon(s)

Theoretical
MSE

1
2
3
4
5
10
20
30
40

1.000
2.440
3.978
5.456
6.809
11.502
15.252
16.754
17.629

True
Parameter
value
1.008
2.452
4.027
5.575
6.954
11.611
15.356
17.076
18.189

MLE

AR(P)

LW
m = T 0.5

LW
optimal m

1.021
2.498
4.112
5.706
7.141
11.966
15.998
17.783
18.823

1.197
2.966
4.972
6.942
8.777
15.202
20.775
22.946
24.285

1.022
2.507
4.139
5.77
7.255
12.388
16.955
18.993
20.221

1.06
2.568
4.202
5.836
7.32
12.488
17.248
19.562
20.865

Key: See Table 2.1.

Table 2.6: MSEs of Different Predictors, ARFIMA(1, d, 0) with d = 0.4, ϕ = 0.9, and T = 200
Forecast
Horizon(s)

Theoretical
MSE

1
2
3
4
5
10
20
30
40

1.000
2.690
4.793
7.130
9.584
21.488
37.790
46.400
51.312

True
Parameter
value
0.995
2.687
4.801
7.116
9.466
20.982
37.072
46.141
50.366

MLE

AR(P)

LW
m = T 0.5

LW
optimal m

1.014
2.753
4.928
7.332
9.776
21.780
38.651
47.957
52.263

1.207
3.292
5.979
9.026
12.229
28.480
53.797
70.359
78.776

1.020
2.786
5.036
7.539
10.135
23.113
43.774
57.243
65.143

2.340
3.671
5.935
8.535
11.317
29.166
125.699
546.540
1973.282

Key: See Table 2.1.

24

Table 2.7: MSEs of Different Predictors, ARFIMA(1, d, 0) with d = 0.4, ϕ = 0.8, and T = 1, 000
Forecast
Horizon(s)

Theoretical
MSE

1
2
3
4
5
10
20
30
40

1.000
2.440
3.978
5.456
6.809
11.502
15.252
16.754
17.629

True
Parameter
value
0.997
2.436
3.989
5.518
6.899
11.556
15.365
16.724
17.663

MLE

AR(P)

LW
m = T 0.5

LW
optimal m

0.999
2.439
3.994
5.528
6.915
11.630
15.517
16.916
17.838

1.042
2.565
4.207
5.791
7.247
12.374
16.628
18.186
19.207

0.999
2.447
4.016
5.558
6.954
11.724
15.622
17.112
18.055

1.000
2.448
4.018
5.564
6.959
11.735
15.588
17.057
18.017

Key: See Table 2.3.

Table 2.8: MSEs of Different Predictors, ARFIMA(1, d, 0) with d = 0.4, ϕ = 0.9, and T = 1, 000
Forecast
Horizon(s)

Theoretical
MSE

1
2
3
4
5
10
20
30
40

1.000
2.690
4.793
7.130
9.584
21.488
37.790
46.400
51.312

True
Parameter
value
1.000
2.696
4.793
7.129
9.620
21.969
38.099
47.299
51.908

MLE

AR(P)

LW
m = T 0.5

LW
optimal m

1.004
2.706
4.819
7.175
9.680
22.114
38.282
47.565
52.241

1.054
2.838
5.087
7.603
10.314
23.566
41.372
51.854
57.663

1.015
2.747
4.897
7.324
9.899
22.821
40.101
50.257
55.751

1.022
2.748
4.887
7.279
9.828
22.545
39.466
49.067
54.084

Key: See Table 2.3.

25

Table 2.9: MSEs of Different Predictors, ARFIMA(0, d, 1) with d = 0.4, θ = 0.8, and T = 200
Forecast
Horizon(s)

Theoretical
MSE

1
2
3
4
5
10
20
30
40

1.000
2.440
2.800
3.001
3.138
3.494
3.781
3.927
4.023

TRUE
Parameter
Value
1.044
2.463
2.837
3.198
3.027
3.484
3.992
3.714
4.208

MLE

AR(P)

LW
m = T 0.5

1.060
2.498
2.861
3.215
3.038
3.506
4.028
3.733
4.214

1.213
2.822
3.370
3.788
3.638
4.102
4.597
4.355
5.047

1.131
2.749
3.116
3.476
3.276
3.667
4.174
3.888
4.342

Key: See Table 2.3.

Table 2.10: MSEs of Different Predictors, ARFIMA(2, d, 1) with d = 0.4, ϕ1 = −0.5, ϕ2 = −0.9,
θ = 0.6, and T = 200
Forecast
Horizon(s)

Theoretical
MSE

1
2
3
4
5
10
20
30
40

1.000
1.250
1.647
1.713
2.295
3.142
4.056
4.370
4.487

TRUE
Parameter
Value
0.975
1.258
1.621
1.702
2.256
3.173
4.030
4.442
4.566

Key: See Table 2.3.

26

MLE

AR(P)

LW
m = T 0.5

1.003
1.281
1.653
1.734
2.323
3.278
4.180
4.556
4.653

1.159
1.507
1.923
2.027
2.742
3.734
4.768
5.130
5.195

1.062
1.370
1.670
1.759
2.419
3.304
4.202
4.558
4.670

Table 2.11: Biases of Different Predictors, ARFIMA(2, d, 0) process with ϕ1 = 0.08,
ϕ2 = 0.9025, d = 0.2, and T = 400
Forecast
Horizon(s)
1
2
3
4
5
10
20
30
40

MLE

AR(P)

-0.009
-0.015
-0.01
0.001
-0.002
0.024
0.011
-0.02
-0.019

-0.009
-0.015
-0.01
-0.003
-0.003
0.011
0.002
-0.022
-0.011

LW
m = T 0.5
-0.011
-0.016
-0.013
-0.001
-0.006
0.019
0.002
-0.031
-0.033

Key: Number of replications is 10,000 and P = ln(T )2 ≈ 35 in the AR(P). The MLE and LW T SE
are estimated as ARFIMA(1, d, 0).

Table 2.12: Biases of Different Predictors, ARFIMA(2, d, 0) process with ϕ1 = 0.08,
ϕ2 = 0.9025, d = 0.4, and T = 400
Forecast
Horizon(s)
1
2
3
4
5
10
20
30
40

MLE

AR(P)

0.006
0.013
0.006
0.01
0.01
0.014
-0.007
-0.047
-0.032

0.003
0.013
0.003
0.007
0.009
0.017
0.004
-0.028
-0.011

Key: See Table 2.11.

27

LW
m = T 0.5
0.004
0.013
0.004
0.009
0.009
0.014
-0.007
-0.046
-0.031

Table 2.13: Biases of Different Predictors, ARFIMA(2, d, 0) process with ϕ1 = 0.08,
ϕ2 = 0.9025, d = 0.49, and T = 400
Forecast
Horizon(s)
1
2
3
4
5
10
20
30
40

MLE

AR(P)

0.005
-0.001
0.011
0.007
0.008
-0.002
-0.066
-0.068
-0.076

0.000
0.002
-0.003
0.002
-0.011
-0.026
-0.104
-0.116
-0.115

LW
m = T 0.5
0.002
0.000
0.008
0.008
0.005
-0.005
-0.077
-0.087
-0.102

Key: See Table 2.11.

Table 2.14: MSEs of Different Predictors, ARFIMA(2, d, 0) process with ϕ1 = 0.08, ϕ2 = 0.9025,
d = 0.2 and T = 400
Forecast
Horizon(s)
1
2
3
4
5
10
20
30
40

MLE

AR(P)

1.034
1.086
2.278
2.409
3.563
6.933
15.671
25.269
36.194

1.124
1.203
2.496
2.655
3.947
7.851
18.39
30.38
43.855

Key: See Table 2.11.

28

LW
m = T 0.5
1.09
1.15
2.4
2.555
3.755
7.215
15.803
24.699
34.329

Table 2.15: MSEs of Different Predictors, ARFIMA(2, d, 0) process with ϕ1 = 0.08, ϕ2 = 0.9025,
d = 0.4 and T = 400
Forecast
Horizon(s)
1
2
3
4
5
10
20
30
40

MLE

AR(P)

1.195
1.275
3.041
3.438
5.637
14.138
43.603
84.881
133.739

1.109
1.392
3.109
3.797
6.050
16.211
52.669
106.780
174.338

LW
m = T 0.5
1.180
1.442
3.267
3.963
6.272
15.832
46.090
85.967
131.450

Key: See Table 2.11.

Table 2.16: MSEs of Different Predictors, ARFIMA(2, d, 0) process with ϕ1 = 0.08, ϕ2 = 0.9025,
d = 0.49 and T = 400
Forecast
Horizon(s)
1
2
3
4
5
10
20
30
40

MLE

AR(P)

1.515
1.502
3.873
4.434
7.467
21.238
71.020
146.790
240.306

1.117
1.500
3.435
4.419
7.178
22.723
82.612
181.008
315.058

Key: See Table 2.11.

29

LW
m = T 0.5
1.350
1.812
4.199
5.382
8.520
24.537
77.451
152.963
242.127

Chapter 3

INFORMATION CRITERIA METHODS FOR SELECTION OF ARFIMA MODELS

3.1 Introduction
As previously stated, the ARFIMA model has become the standard basic model for representing
discrete time long memory processes. There are now many methods available for detecting the
presence of long memory, but surprisingly the issue of model selection in the ARFIMA(p, d, q)
model has not been satisfactorily resolved. Hence once it has been determined that the long memory parameter d is non zero, it then remains to determine the appropriate orders of the short memory ARMA structure; namely p and q.
It is ﬁrst worth recollecting some results on the estimation of ARFIMA models that were summarized in the introduction. However, successful implementation of QMLE for ARFIMA models
requires the correct speciﬁcation of the short memory component of the conditional mean. This
caveat can be avoided by the use of one of the many Semi Parametric Estimation (SPE) procedures,
where an estimate of the long memory parameter is obtained which does not require the speciﬁcation of the short memory process. Following the work of Geweke and Porter-Hudak (1983), many
such SPEs have now been proposed, with versions of the Local Whittle (LW ) proving particularly
popular; see Phillips and Shimotsu (2006), Phillips (2007), etc. While this method appears attractive in terms of avoiding the speciﬁcation of the short memory components of the process, it
does not however lead to a simple way of either specifying or estimating all the parameters of the
ARFIMA model.
This chapter proposes a solution to the above problem of model selection of ARFIMA models
by utilizing some standard Information Criteria, (IC). We show that model selection based on the
BIC is consistent in the sense of Sin and White (1996). The alternate strategy of using HQ is
also consistent, while the AIC is inconsistent. Our proposed method requires initial speciﬁcation
30

of the maximum lags pmax and qmax of the ARFIMA(p, d, q) process which are to be entertained
by the investigator. The method then uses MLE or QMLE to estimate the possibly misspeciﬁed
(pmax + 1)(qmax + 1) models, and uses the BIC for model selection. We report the results of a
detailed simulation to assess the proposed methodology; and we ﬁnd that the proposed MLE − IC
procedure generally works well in terms of the selection of the most appropriate model. The
method appears particularly effective for situations where the true data generating process is of
relatively low order with p and q less than or equal to two or three. For completeness we also
investigate the application of the IC based on a fractionally ﬁltered series where an SPE such as
LW is used at an initial stage. This method works quite well for some designs, but is generally
inferior to the MLE based procedure on the original series. Hence we propose a more limited role
for SPE, since once the presence of long memory or fractional integration has been conﬁrmed by
LW , or one of its many variants, the obvious strategy is to then revert immediately to using QMLE
with our suggested model selection strategy. We also suggest an adjustment of BIC which reﬂects
the sub optimal rate of convergence of parameters in second step of LW T SE.
We conclude that our proposed approach for selection of ARFIMA models works well for low
order models where p and q are less than or equal to 2; but there is mixed evidence for higher order
models. There are also detailed examples of the methodology applied to monthly CPI inﬂation
series.

3.2 Information Criteria for Model Selection
The successful implementation of a modeling strategy for using ARFIMA(p, d, q) models requires
an adequate model selection approach for the orders of p and q. Previous articles by Beran,
Bhansali, and Ocker (1998) and Crato and Ray (1996) have discussed this issue and have addressed less general formulations of the problem. There is a long literature on the implementation
of information criteria on weakly dependent processes; for example Shibata (1976) and Hannan
and Quinn (1979) have studied the AR(p) model and Hannan (1980) has investigated the stationary ARMA(p, q) model. The Bayesian (BIC) of Schwarz (1978) and the Hannan-Quinn (HQ)
31

Hannan and Quinn (1979) information criteria consistently estimate the lag order of linear models
with weakly dependent data, while the Akaike Information Criterion (AIC) is known to be inconsistent. Further, Sin and White (1996) and Kapetanios (2001) have shown that these properties
extend to nonlinear models for weakly dependent processes. Hidalgo (2002) has shown that similar results are valid for regressions involving long memory regressors. Recently Poskitt (2007)
has extended the optimality results of Shibata (1980) to stationary long memory processes.
Since none of the above results are directly relevant for the model considered in this study,
it is necessary to provide some new theory for lag order selection of p and q using information
criteria. In the following the conditional sum of squares objective function is denoted as
T

Q(y1 , ..., yT ; Θ p,q ) =

∑

εt (Θ p,q )2 ,

t=max(p,q)

where Θ p,q = (ϕ1 , ..., ϕ p , θ1 , ..., θq , d)′ and εt = εt (Θ p,q ) is deﬁned as εt = θ (L)−1 ϕ (L)(1 − L)d yt .
Then, lag order selection, via the use information criteria, is carried out by maximizing
ICT (p, q) = −Q(y1 , ..., yT ; Θ p,q ) − cT (p, q)

(3.1)

over 0 ≤ p ≤ pmax and 0 ≤ q ≤ qmax , where cT (p, q) is a penalty term depending on p, q and T .
The following theorem provides a result on the properties of the estimated lag orders.
Theorem 3.1. Consider the ARFIMA(p, d, q) process

ϕ (L)(1 − L)d yt = θ (L)εt

(3.2)

where ϕ (L) and θ (L) are polynomials in the lag operator of orders p and q respectively, with
all their roots lying outside the unit circle. Then for some 0 ≤ p0 ≤ pmax and 0 ≤ q0 ≤ qmax ,
without loss of generality, it is assumed that ϕi ̸= 0, for i = 1, .., p0 and θi ̸= 0, for i = 1, .., q0 . The
estimates of the lag orders p0 and q0 , are denoted by p and q and are determined by minimizing
ˆ
ˆ
(3.1) over 0 ≤ p ≤ pmax and 0 ≤ q ≤ qmax . It is assumed that if p < p0 or q< q0 then there
[
]
exists a unique Θ∗ that minimizes E εt (Θ p,q )2 . Then, if cT (p, q) = o(T ) for all p, q, and
p,q
(
)
cT (p, q) − cT (p0 , q0 ) → ∞ when p > p0 and q > q0 , then limT →∞ Pr ( p, q) = (p0 , q0 ) = 1
ˆ ˆ
32

The conditions on cT (p, q) given in Theorem 3.1 are standard for information criteria consistency for weakly dependent data. They imply that the Bayesian and Hannan-Quinn information
criteria provide consistent estimates of p0 and q0 whereas the Akaike information criterion is not
consistent.

Proof of Theorem 3.1
To prove the Theorem 3.1 it is necessary to use the theoretical framework of Sin and White (1996),
and to show the two properties
({
lim Pr

T →∞

and that

({
lim Pr

T →∞

p < p0

} {
})
0
∪ q<q
=0

(3.3)

p > p0

} {
})
0
∩ q>q
=0

(3.4)

Together, (3.3) and (3.4) imply Theorem 3.1. It is ﬁrst necessary to show that for all p and q,
ˆ
there exist some Θ∗ such that Θ p,q − Θ∗ = O p (T −1/2 ). The two separate cases of (i) p ≥
p,q
p,q
ˆ
p0 and q ≥ q0 and (ii) p < p0 or q < q0 are distinguished. For the ﬁrst case, Θ p,q − Θ p,q =
O p (T −1/2 ) where Θ p,q = (ϕ1 , ..., ϕ p0 , 0, ..., 0, θ1 , ..., θq0 , 0, ..., 0, d)′ , follows by Theorems 2.1 and
2.2 of Hosoya (1997).

[
]
For the second case, note the assumption that E εt (Θ p,q )2 is uniquely minimized at some

Θ∗ . Then, on taking this assumption with the treatment of Hosoya (1997), who does not assume
p,q
ˆ
that the true model belongs to the class of models being estimated, it follows that Θ p,q − Θ∗ =
p,q
O p (T −1/2 ), which uses identical arguments to those in the proof of Corollary 1 of Baillie and
Kapetanios (2008).
It is now possible to start by showing (3.3). On deﬁning Q∗ (Θ p,q ) = −Q(y1 , ..., yT ; Θ p,q )
T
and on taking a two term mean value expansion of Q∗ (Θ∗ ) around Θ p,q and noting that
p,q
T

33

ˆ
∇Q∗ (Θ p,q ) = 0, gives
T
(
)
ˆ
ˆ
ˆ
Q∗ (Θ∗ ) = Q∗ (Θ p,q ) + ∇Q∗ (Θ p,q )′ Θ p,q − Θ∗
p,q
p,q
T
T
T
)′
)
(
(
∗ (Θ ) Θ
ˆ p,q − Θ∗
¯ p,q ˆ p,q − Θ∗
+0.5 Θ
p,q ∇QT
p,q
(
)′
(
)
ˆ
ˆ
¯
ˆ
= Q∗ (Θ p,q ) + 0.5 Θ p,q − Θ∗
∇2 Q∗ (Θ p,q ) Θ p,q − Θ∗
p,q
p,q
T

T

¯
By the quadratic nature of Q∗ (Θ p,q ), it follows that ∇2 Q∗ (Θ p,q ) = O p (T ) and therefore that
T
T
ˆ
Q∗ (Θ p,q ) − Q∗ (Θ∗ ) = O p (1)
p,q
T
T
(
)
ˆ
Q∗ (Θ p,q ) = Q∗ (Θ∗ ) + o p (T ) = E Q∗ (Θ∗ ) + o p (T )
p,q
p,q
T
T
T

Then,

(3.5)

(3.6)

where the second equality follows by combining (2.8) of Robinson (2006) and Lemma 3.3 of
Hosoya (1997). Then, (3.3) holds if
(
)
lim Pr ICT (p, q) < ICT (p0 , q0 ) = 1

T →∞

(3.7)

when either p < p0 or q < q0 . However, by (3.6) and the fact that cT (p, q) = o(T ), ICT (p, q) <
ICT (p0 , q0 ) with probability approaching one, if
(

)

(

)

T −1 E Q∗ (Θ p0 ,q0 ) − T −1 E Q∗ (Θ∗ ) > 0
p,q
T
T
But this follows immediately by the identiﬁability condition that ϕi ̸= 0, i = 1, .., p0 and θi ̸= 0,
i = 1, .., q0 . Note that the weaker condition ϕ p0 ̸= 0, and θq0 ̸= 0, may not be sufﬁcient as under
this condition the various sub models may not be strictly nested requiring more stringent conditions
for the penalty term. The next stage is to show (3.4). By the fact that Q∗ (Θ∗ ) = Q∗ (Θ∗ 0 0 ) for
p,q
T
T
p ,q
all pmax ≥ p ≥ p0 and qmax ≥ q ≥ q0 , and using (3.5), it follows that, for all pmax ≥ p ≥ p0 and
qmax ≥ q ≥ q0 ,
ˆ
ˆ
ICT (p0 , q0 ) − ICT (p, q) = Q∗ (Θ p0 ,q0 ) − Q∗ (Θ p,q ) + cT (p, q) − cT (p0 , q0 )
T
T
= cT (p, q) − cT (p0 , q0 ) + O p (1)
But, since cT (p, q) − cT (p0 , q0 ) → ∞ when p > p0 and q > q0 , (3.4) follows immediately.
34

3.3 Simulation Study
This section reports the results from a detailed Monte Carlo study to investigate the performance
of the various information criteria and estimation procedures. The model selection criteria we consider are Akaike Information Criterion (AIC), Baynesian Information Criterion (BIC), and HannanQuinn Information Criterion (HQ). ICs are deﬁned as
ˆ
ln(σ 2 ) + kc(T )/T,
where c(T ) = 2 in the Akaike case, c(T ) = ln(T ) in the BIC case, c(T ) = 2α ln(ln(T )), α > 1 in
ˆ
the HQ case, and k = 2 + p + q is the number of parameters, σ 2 is the MLE estimator of the error
variance. The selected model is the one that has the minimum IC.
Tables 3.1 through 3.4 report the performance of the various MLE − IC applied to realizations
from an ARFIMA(1, d, 0) data generating process, and with a sample size of T = 400. Table 3.1
is for ϕ = 0.8 and d = 0.2; table 3.2 is for ϕ = 0.8 and d = 0.4; and table 3.3 is for ϕ = 0.8
and d = 0.8. It can be seen that the BIC and to a slightly lesser extent the HQ perform very well
for all the designs; and interestingly the third design with a long memory parameter in the non
stationary region has a very similar performance over the replications. In fact the BIC selects the
appropriate model on 89%, 91% and 88% of the occasions respectively. However, as the short
memory persistence increases with ϕ = 0.9 and d = 0.4 in table 3.4. then the BIC does deteriorate
and only selects the correct model on 59% occasions. The HQ is slightly worse and has success
rates of between 69% and 73%; while the inconsistent AIC is only successful 8% to 12% of the
time.
The results of applying the corresponding SPE −CSS − IC method to the fractionally ﬁltered
series are also given in these tables. The performance of these corresponding IC are generally
slightly worse than for the MLE − IC and with a similar ranking in performance of the different
ICs. Hence there does not appear to be any reason for making use of the SPE and LW statistics for
model selection once long memory has been detected.
Table 3.5 reports the results for the ARFIMA(0, d, 1) model and the performance of the
35

MLE − BIC is excellent with an 80% success rate of selecting the correct model. interestingly, the
inclusion of the MA short memory parameterization signiﬁcantly deteriorates the performance of
the SPE − BIC to only a 49% success rate. Table 3.6 describes the results for the ARFIMA(1, d, 1)
process, with ϕ = 0.5, θ = −0.7 and d = 0.4. The MLE − BIC has a correct selection rate of 63%.
The properties of the IC in higher order AR models are explored in Tables 3.7 through 3.8
which report simulation results for the IC when the data generating process is the ARFIMA(2, d, 0)
process of the form
(1 − 0.25L − 0.375L2 )(1 − L)0.4 yt = εt
where the short memory component has real roots of (4/3) and −2 and d = 0.4. The BIC is still the
best performing IC statistic and successfully selects the correct speciﬁcation 78% of the time for a
sample size of T = 400 and 93% of the time for a sample size of T = 1, 000. The SPE −CSS −IC is
surprisingly slightly superior to the MLE − IC in these cases; which is an exception to the general
pattern of the simulation results.
A further ARFIMA(2, d, 0) process of the form
(1 − 1.00L + 0.50L2 )(1 − L)0.4 yt = εt
where the short memory component has complex conjugate roots of (1 + i) and (1 − i) is investigated in Table 3.9 and for a sample size of T = 400 the BIC only selects the correct speciﬁcation
on 49% of the time. In this case the SPE −CSS − IC is substantially inferior with a success rate of
only 38%.
Tables 3.10 through 3.13 report results where the true model is an ARFIMA(2, d, 1); and it
can be seen that the performance of the BIC is very poor in some cases such as the process in table
3.10 where it is affected by strong ﬁrst and second order AR coefﬁcients of AR representation.
ARFIMA(2, d, 1) in Table 3.10 can be represented by [1 − (7/12)L1 − (29/72)L2 + (25/216)L3 +
...](1 − L)d yt = εt . So, high proportion of selecting ARFIMA(1, d, 0) or ARFIMA(2, d, 0) can
be expected. [Pukkila, Koreisha, and Kallinen (1990)] Hence, BIC tends to select the simpler
ARFIMA(1, d, 0) model 41% of the time. Another reason for selecting lower order model is near

36

cancellation of the AR and MA root. The success rates of the BIC are moderate for Tables 3.11 and
3.12 with 49% and 60%. However, the BIC selects the correct ARFIMA(2, d, 1) model on 92% of
the occasions in table 3.13.
The performance of the BIC is mixed for higher order AR short memory structures. The
ARFIMA(3, d, 0) model is analyzed in tables 3.14 and 3.15, and the ARFIMA(4, d, 0) model results
are reported in tables 3.16 through 3.19. The success rate of the BIC is very mixed and very much
depends on the parameter space of the data generating process. The model selection seems to work
well in the case of a generating process with strong coefﬁcient at high order, such as ϕ3 = −0.92705
in case of table 3.15 and ϕ4 = −0.31 in case of tables 3.18 and 3.19. In table 3.18 it is excellent
with the BIC having 77% success rate and 83% in table 3.19. The ARFIMA(4, d, 1) model is
examined in table 3.20.

3.4 Modiﬁed Information Criteria for LW T SE
The previous results on applying the SPE − BIC and the SPE − HQIC to ARFIMA models has
made the standard assumption that all the parameters are estimated simultaneously in the estimation before the IC are calculated. However, the implementation of the LW T SE requires initial
estimation of d by a m1/2 consistent estimator and the short memory parameters to be based on the
second step procedure. Groen and Kapetanios (2010) have considered with problem in the issue
of estimating regressions with unknown factors, as in Bai and Ng (2002) and have conjectured the
possibility of modiﬁcation of the IC. In our problem the LW T SE has a rate of convergence of the
short memory parameters maybe slower than MLE and depends on the ﬁrst step SPE estimation
which is related to the convergence of dˆ to d. This is very similar to the two step regression type
problem in other areas of econometrics; see Baillie and Kapetanios (2009). One possibility is to
modify the information criteria as
ˆ
Modiﬁed IC = T ln(σ 2 ) + kc(T ) + kc(T )T δ

37

where, δ = 0.5α and the rate of convergence of the estimate of d is related to the bandwidth used
in the ﬁrst step estimation, so in this case is m = T α , which is derived by Wright (1995) and where
m0.5 = T 0.5α , and where generally 0 < α < 0.8. Hence when α = 0.5, then δ = 0.25. We have
accordingly done some initial simulations with this modiﬁed SPE − BIC and have reported them
in tables 3.21 through 3.25.
To motivate this, consider the ARFIMA(p, d, q) model ϕ (L)(1 − L)d yt = θ (L)εt where ϕ (L)
and θ (L) are polynomials in the lag operator of orders p and q respectively, with all their roots lying
outside the unit circle, there are (1 + p + q) parameters in the vector β ′ = (d, ϕ1 , , ..., ϕ p , θ1 , ..., θq ).
When the coefﬁcient vector β has superscript 0 then it refers to a model using the true set of parameters. Hats on β indicate estimated parameters in a model with estimated short memory parameters
and estimated d. Tildes on β indicate estimated parameters in a model with estimates short memory parameters, but the true d parameter. The penalty term for the generic set of variables is
0
denoted by CT and the penalty term for the true set of variables by CT . Finally, let e = (e1 , ..., eT )′ .

The feasible information criterion takes the following form
IC(β ,CT ) =

T
ln{σ 2 } +CT
2

We also introduce the infeasible criterion given by
IC(β ,CT ) =

T
ln{σ 2 } +CT
2

We decompose IC(β ,CT ) as follows



IC(β 1 ,CT ) = IC(β 2 ,CT ) +

σ2

1



T 

ln
2
2
σ2

Then, consistency requires that
ˆ
ˆ0 0
lim P{IC(β ,CT ) − IC(β ,CT ) < 0} = 0

T →∞

Then following the analysis of Groen and Kapetanios (2010) this becomes
[ ]
(
)
T
σ2
ln
= O p T 1−δ = o p (T )
2
σ2
38

(3.8)

(3.9)

0
as long as T → ∞. Since CT −CT = o(T ) and from Wright (1995),

(
)
1 T
′
(ut − ut )yt = O p T −δ , δ < 0.5
∑
T t=1

(3.10)

0
as long as CT,N −CT,N → −∞. But, (3.9) implies this is not enough for (3.8) to hold.

We consider ARFIMA(2, d, 0) process of the form (1 − 1.00L + 0.50L2 )(1 − L)0.4 yt = εt ,
because the performance of LW T SE is substantially worse than MLE. The results are shown in
Tables 3.21 and 3.22. In Table 3.22, the modiﬁed BIC selected the correct model on 87% which
is 15% higher than standard BIC. The result for ARFIMA(1, d, 0) with ϕ = 0.8 and d = 0.4 is
presented in Table 3.23. And we ﬁnd that modiﬁed information criteria improve model selection
for LW T SE.

3.5 Inﬂation Example
An example of the above methodology is provided by the US CPI inﬂation series from January
1947 through July 2010, making T = 762 observations. The properties of inﬂation have been the
subject of much research in the past with some articles claiming to ﬁnd evidence for long memory
and others examining the possibility of regime switching or some other type of non-linearity. The
balance of evidence would seem to suggest a combination of long memory and nonlinearity to be
the optimal model for explaining the inﬂation series. The purpose of the present study is mainly to
provide an example. First, we consider CPI inﬂation calculated from seasonally unadjusted CPI.
The application of the LW gave estimates of the long memory parameter of dLW = 0.451 from a
bandwidth of m = T 0.50 ; and dLW = 0.574 from m = T 0.60 ; and dLW = 0.392 from m = T 0.75 ; and
dLW = 0.380 from m = T 0.80 . Since the series is likely to be seasonal the initial speciﬁcation of the
maximum lags was pmax = 15 and qmax = 4. While this choice was deliberately conservative for
the MA structure, the allowance made for seasonality still gave a total of 80 models to be estimated.
The application of the BIC suggests that the most appropriate model is the ARFIMA(0, d, 0), while
the BIC grid of values below is relatively ﬂat in the neighborhood of (p, q) = (0, 0), (p, q) = (1, 0)

39

and (p, q) = (0, 1). Estimation of an ARFIMA(0, d, 0) model realized
(1 − L)0.332(0.046) {1000∆ ln(CPIt ) − 3.101(1.434)} = εt .
However, examination of standard diagnostics such as the autocorrelations of the residuals indicated the presence of substantial autocorrelations at the seasonal lags of 11, 12 and 13. After some experimentation the most appropriate model from the BIC was the ARFIMA(0, d, 0) −
SARMA(1, 0), see Table 3.26:
(1 − L)d (1 − ΦL12 ){1000∆ ln(CPIt ) − µ } = εt .
This model could not have been selected form the previous analysis since its choice depended
on the implementation of eleven zero restrictions on the AR lag polynomial of the standard
ARFIMA(p, d, q) model of the form φ1 = φ2 = .... = φ11 = 0. Hence a more appropriate application of the BIC methodology would be to search over the set of ARFIMA(p, d, q) − SARMA(P, Q)
models.
For seasonally adjusted series, see Table 3.27, the BIC for the ARFIMA(0, d, 0) is less
than the BIC from the seasonal ARFIMA(0, d, 0) − AR(12) model and the coefﬁcient of the seasonal AR parameter is statistically insigniﬁcant. Hence an investigator would select the regular
ARFIMA(0, d, 0) model. The situation is reversed with the CPI series from seasonally unadjusted
data, where the coefﬁcient of the seasonal AR parameter is statistically signiﬁcantly different from
zero. Hence an investigator would select the seasonal ARFIMA(0, d, 0) − AR(12) model.

3.6 Conclusion
This chapter has examined estimation and lag order selection for fractionally integrated ARFIMA
models. We show the Bayesian (BIC) of and Hannan-Quinn (HQ) information criteria are consistent model selection procedures for the class of long memory ARFIMA models when the models
are estimated by QMLE. We compare and contrast this approach with a method that initially uses
a semi-parametric estimate of the long memory parameter. Simulation evidence indicates that the
40

BIC based on QMLE works very well for low order models but can have mixed properties for high
order models. The methodology is illustrated with analysis of series of inﬂation.

41

APPENDIX

42

Appendix 3A: Tables
Table 3.1: Percentage of Model Selected, DGP ARFIMA(1, d, 0), ϕ1 = 0.8, d = 0.2, T = 400
p
0
0
0
0
0
0
0
0
0
1
1
1
1
1
1
1
1
1
2
2
2
2
2
2
2
2
2
3
3
3
3
3
3
3
3
3
4
4
4
4
4

q
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4

AIC
0
0
0
0
0
0
0
0
0
15
1
0
0
0
0
0
0
0
2
1
0
0
0
0
0
0
0
1
0
3
1
0
0
0
0
0
0
0
2
2
1

QMLE
BIC HQ
1
0
1
1
0
0
0
0
0
0
0
0
0
0
0
0
0
0
89
72
1
3
0
0
0
0
0
0
0
0
0
0
0
0
0
0
3
5
0
1
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
1
0
0
0
2
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
1
0
1
0
0

AIC
0
0
0
0
0
0
0
0
0
13
1
0
0
0
0
0
0
0
2
1
0
0
0
0
0
0
0
1
0
4
0
0
0
0
0
1
0
0
1
1
1

LW T SE
BIC
0
1
0
0
0
0
0
0
0
89
2
0
0
0
0
0
0
0
5
1
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0

HQ
0
0
0
0
0
0
0
0
0
70
3
1
0
0
0
0
0
0
7
1
1
0
0
0
0
0
0
1
0
3
1
0
0
0
0
0
0
0
0
1
0

p
4
4
4
4
5
5
5
5
5
5
5
5
5
6
6
6
6
6
6
6
6
6
7
7
7
7
7
7
7
7
7
8
8
8
8
8
8
8
8
8

q
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8

AIC
1
0
1
1
0
0
1
1
4
2
2
1
1
0
0
1
1
3
4
2
3
2
0
0
1
0
2
2
6
4
3
0
0
1
0
1
2
5
7
7

QMLE
BIC HQ
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
1
0
1
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
1
0
1
0
0
0
1
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
1
0
1
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
1
1
1
1
1

AIC
1
1
1
1
0
0
1
1
3
3
2
1
1
0
0
0
1
3
3
3
3
2
0
0
1
0
2
3
6
5
5
0
0
0
0
1
2
5
8
6

LW T SE
BIC
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
1
0

HQ
0
0
0
0
0
0
1
0
1
0
0
0
0
0
0
0
0
0
1
0
0
1
0
0
0
0
0
0
1
0
0
0
0
0
0
0
0
0
1
1

Key: Number of replications is 1,000 and numbers in bold show the percent of correct selections.

43

Table 3.2: Percentage of Model Selected, DGP ARFIMA(1, d, 0), ϕ1 = 0.8, d = 0.4, T = 400
p
0
0
0
0
0
0
0
0
0
1
1
1
1
1
1
1
1
1
2
2
2
2
2
2
2
2
2
3
3
3
3
3
3
3
3
3
4
4
4
4
4

q
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4

AIC
0
0
0
0
0
0
0
0
0
12
1
1
0
0
0
0
0
0
2
1
0
0
0
1
0
0
0
1
0
3
1
0
1
1
0
1
0
0
1
1
1

QMLE
BIC HQ
1
0
1
1
1
1
0
0
0
0
0
0
0
0
0
0
0
0
91
73
2
3
0
1
0
0
0
0
0
0
0
0
0
0
0
0
3
6
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
1
0
0
0
2
0
1
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
1
0
1
0
0

LW T SE
AIC BIC
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
8
90
2
3
0
0
0
0
0
0
0
0
0
0
0
0
0
0
1
4
1
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
1
0
0
0
2
0
2
0
0
0
0
0
0
0
1
0
0
0
0
0
0
0
2
0
3
0
1
0

HQ
0
0
0
0
0
0
0
0
0
66
6
0
0
0
0
0
0
0
7
1
0
0
0
0
0
0
0
1
0
2
1
0
0
0
0
0
0
0
1
1
0

p
4
4
4
4
5
5
5
5
5
5
5
5
5
6
6
6
6
6
6
6
6
6
7
7
7
7
7
7
7
7
7
8
8
8
8
8
8
8
8
8

Key: See Table 3.1.

44

q
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8

AIC
1
1
1
1
0
0
0
0
5
2
2
1
2
0
0
0
1
2
4
2
2
3
0
0
0
0
1
3
6
4
4
0
0
0
0
1
2
7
7
6

QMLE
BIC HQ
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
1
0
1
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
1
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
1
0
1
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
1
0
1

LW T SE
AIC BIC
1
0
1
0
1
0
1
0
0
0
0
0
1
0
1
0
5
0
3
0
3
0
2
0
2
0
0
0
0
0
0
0
0
0
3
0
4
0
3
0
4
0
2
0
0
0
0
0
0
0
0
0
1
0
2
0
7
0
4
0
5
0
0
0
0
0
0
0
0
0
2
0
1
0
5
0
7
1
4
0

HQ
0
0
0
0
0
0
0
1
2
1
1
0
0
0
0
0
0
1
1
0
0
1
0
0
0
0
0
0
1
0
1
0
0
0
0
0
0
0
1
0

Table 3.3: Percentage of Model Selected, DGP ARFIMA(1, d, 0), ϕ1 = 0.8, d = 0.8, T = 400
p
0
0
0
0
0
0
0
0
0
1
1
1
1
1
1
1
1
1
2
2
2
2
2
2
2
2
2
3
3
3
3
3
3
3
3
3
4
4
4
4
4

q
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4

AIC
0
0
0
0
0
0
0
0
0
12
1
0
0
0
0
0
0
0
1
0
0
0
0
0
0
0
0
1
0
2
1
0
0
1
0
0
0
0
1
1
1

QMLE
BIC HQ
0
0
1
1
0
0
0
1
0
0
0
0
0
0
0
0
0
0
88
69
2
5
0
0
0
0
0
0
0
0
0
0
0
0
0
0
4
5
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
1
0
0
0
2
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
1
0
1
0
0

LW T SE
AIC BIC
0
0
0
1
0
0
0
0
0
0
0
0
0
0
0
0
0
0
11
88
1
2
0
0
0
0
0
0
0
0
0
0
0
0
0
0
2
4
1
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
2
0
1
0
0
0
0
0
1
0
0
0
0
0
0
0
0
0
1
0
2
0
1
0

HQ
0
1
0
1
0
0
0
0
0
66
4
0
0
0
0
0
0
0
7
2
0
0
0
0
0
0
0
1
0
3
1
0
0
0
0
0
1
0
1
1
1

p
4
4
4
4
5
5
5
5
5
5
5
5
5
6
6
6
6
6
6
6
6
6
7
7
7
7
7
7
7
7
7
8
8
8
8
8
8
8
8
8

Key: See Table 3.1.

45

q
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8

AIC
1
1
1
1
0
0
1
1
4
2
2
2
2
0
0
1
1
2
4
3
3
2
0
0
1
1
2
3
6
5
3
0
0
1
0
1
2
5
4
7

QMLE
BIC HQ
0
0
0
0
0
0
0
0
0
0
0
0
0
1
0
0
0
1
0
0
0
1
0
0
0
0
0
0
0
0
0
0
0
0
0
1
0
1
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
1
0
1
1
1
0
0
0
0
0
0
0
0
0
0
0
1
0
0
0
1
1
1

LW T SE
AIC BIC
1
0
0
0
1
0
1
0
0
0
0
0
1
0
0
0
3
0
3
0
2
0
2
0
1
0
0
0
0
0
1
0
1
0
3
0
4
0
2
0
3
0
1
0
0
0
0
0
1
0
1
0
2
0
3
0
7
0
5
0
4
1
0
0
0
0
0
0
0
0
1
0
2
0
5
0
8
0
5
0

HQ
0
0
0
0
0
0
1
0
2
1
1
0
0
0
0
0
0
1
1
0
0
0
0
0
0
0
0
1
1
0
1
0
0
0
0
0
0
1
1
0

Table 3.4: Percentage of Model Selected, DGP ARFIMA(1, d, 0), ϕ1 = 0.9, d = 0.4, T = 400
p
0
0
0
0
0
0
0
0
0
1
1
1
1
1
1
1
1
1
2
2
2
2
2
2
2
2
2
3
3
3
3
3
3
3
3
3
4
4
4
4
4

q
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4

AIC
0
0
0
0
0
0
0
0
0
7
4
0
0
0
0
0
0
0
3
0
0
0
0
0
0
0
0
1
0
2
0
0
1
0
0
0
0
0
2
2
1

QMLE
BIC
3
5
0
0
0
0
0
0
0
59
13
0
0
0
0
0
0
0
15
0
0
0
0
0
0
0
0
0
0
1
0
0
0
0
0
0
0
0
0
0
0

HQ
1
2
1
1
0
0
0
0
0
43
14
0
0
0
0
0
0
0
15
1
0
0
0
0
0
0
0
3
0
2
0
0
0
0
0
0
0
0
1
1
0

LW T SE
AIC BIC
0
0
0
1
0
0
0
0
0
0
0
0
0
0
0
0
0
0
6
68
5
16
1
0
0
0
0
0
0
0
0
0
0
0
0
0
4
10
1
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
1
0
0
0
1
0
1
0
1
0
0
0
0
0
0
0
0
0
0
0
0
0
1
0
1
0
1
0

HQ
0
1
0
0
0
0
0
0
0
43
21
2
0
0
0
0
0
0
13
1
0
0
0
0
0
0
0
2
0
2
1
0
0
0
0
0
0
0
1
1
0

p
4
4
4
4
5
5
5
5
5
5
5
5
5
6
6
6
6
6
6
6
6
6
7
7
7
7
7
7
7
7
7
8
8
8
8
8
8
8
8
8

Key: See Table 3.1.

46

q
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8

AIC
1
1
1
1
0
0
1
1
3
3
1
1
1
0
0
1
0
2
3
2
2
3
0
0
1
1
2
3
8
5
4
0
0
1
0
1
2
4
8
6

QMLE
BIC HQ
0
0
0
0
0
0
0
0
0
0
0
0
0
1
0
1
0
1
0
1
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
1
0
1
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
1
0
0
0
1
0
0
1
1
0
0
0
0
0
0
0
0
0
0
0
1
0
1
1
1
1
1

LW T SE
AIC BIC
1
0
1
0
1
0
1
0
0
0
0
0
1
0
1
0
3
0
3
0
2
0
2
0
1
0
0
0
0
0
1
0
1
0
3
0
3
0
3
0
2
0
2
0
0
0
0
0
0
0
0
0
2
0
2
0
6
0
5
0
5
0
0
0
0
0
0
0
0
0
1
0
2
0
6
0
5
0
8
0

HQ
0
1
0
0
0
0
0
0
1
1
0
0
0
0
0
0
0
1
1
0
0
0
0
0
0
0
0
0
1
0
0
0
0
0
0
0
0
1
1
1

Table 3.5: Percentage of Model Selected, DGP ARFIMA(0, d, 1), θ1 = 0.9, d = 0.4, T = 400
p
0
0
0
0
0
0
0
0
0
1
1
1
1
1
1
1
1
1
2
2
2
2
2
2
2
2
2
3
3
3
3
3
3
3
3
3
4
4
4
4
4

q
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4

AIC
0
12
2
0
0
0
0
0
0
3
1
0
0
0
0
0
0
0
1
1
0
2
1
1
1
1
0
0
0
1
1
1
1
0
0
0
0
0
1
1
1

QMLE
BIC HQ
1
0
79
62
2
3
0
0
0
0
0
0
0
0
0
0
0
0
13
11
1
2
0
1
0
0
0
0
0
0
0
0
0
0
0
0
1
4
0
1
0
1
0
2
0
0
0
0
0
0
0
0
0
0
0
1
0
0
0
1
0
1
0
1
0
0
0
0
0
0
0
0
0
0
0
0
0
1
0
0
0
1

LW T SE
AIC BIC
0
2
5
49
1
6
1
0
0
0
0
0
0
0
0
0
0
0
2
26
1
2
1
5
0
0
0
0
0
0
0
0
0
0
0
0
2
5
1
3
0
0
1
0
1
0
1
0
0
0
0
0
0
0
1
1
0
0
1
0
1
0
1
0
1
0
1
0
1
0
0
0
0
0
0
0
0
0
1
0
2
0

HQ
0
34
6
1
0
0
0
0
0
17
3
8
0
0
0
0
0
0
6
3
1
1
1
0
0
0
0
2
0
1
1
1
0
0
0
0
0
0
1
1
1

p
4
4
4
4
5
5
5
5
5
5
5
5
5
6
6
6
6
6
6
6
6
6
7
7
7
7
7
7
7
7
7
8
8
8
8
8
8
8
8
8

Key: See Table 3.1.

47

q
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8

AIC
3
1
2
2
0
0
1
1
1
3
3
2
1
0
0
1
0
1
2
3
5
4
0
0
0
0
1
2
3
5
5
0
0
0
1
0
1
3
5
5

QMLE
BIC HQ
0
1
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
1
0
0
0
0
0
0
0
1
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
1
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0

LW T SE
AIC BIC
2
0
2
0
1
0
1
0
0
0
0
0
1
0
1
0
2
0
2
0
3
0
3
0
2
0
0
0
0
0
0
0
0
0
2
0
3
0
2
0
4
0
4
0
0
0
0
0
0
0
0
0
1
0
3
0
5
0
6
0
7
0
0
0
0
0
0
0
1
0
1
0
1
0
4
0
5
0
5
0

HQ
1
0
0
0
0
0
0
1
1
1
0
0
0
0
0
0
0
0
1
0
0
0
0
0
0
0
0
1
1
1
1
0
0
0
0
0
0
0
0
1

Table 3.6: Percentage of Model Selected, DGP ARFIMA(1, d, 1), ϕ1 = 0.5, θ1 = −0.7, d = 0.4,
T = 400
p
0
0
0
0
0
0
0
0
0
1
1
1
1
1
1
1
1
1
2
2
2
2
2
2
2
2
2
3
3
3
3
3
3
3
3
3
4
4
4
4
4

q
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4

AIC
0
0
1
1
0
0
0
0
0
0
9
1
0
0
0
0
0
0
0
1
0
0
1
1
1
0
0
0
0
0
2
1
2
2
1
1
0
0
0
1
1

QMLE
BIC HQ
0
0
5
1
8
6
1
2
0
1
0
0
0
0
0
0
0
0
0
0
63
49
0
1
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
2
0
0
0
0
0
2
0
1
0
0
0
0
0
0
0
0
0
0
0
0
0
3
0
1
0
1
1
1
0
0
1
1
0
0
0
0
0
0
0
1
0
0

AIC
0
0
0
1
0
0
0
0
0
0
10
1
1
0
0
0
0
0
0
1
0
0
1
1
0
1
0
0
0
0
2
1
1
1
1
1
0
0
0
1
2

LW T SE
BIC
0
3
8
1
0
0
0
0
0
0
67
0
0
0
0
0
0
0
0
0
1
0
0
0
0
0
0
0
0
0
0
0
0
0
1
0
0
0
0
0
0

HQ
0
1
6
2
1
1
0
0
0
0
53
1
1
0
0
0
0
0
0
2
1
1
1
0
0
0
0
0
0
0
2
1
0
1
1
1
1
0
0
1
0

p
4
4
4
4
5
5
5
5
5
5
5
5
5
6
6
6
6
6
6
6
6
6
7
7
7
7
7
7
7
7
7
8
8
8
8
8
8
8
8
8

Key: See Table 3.1.

48

q
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8

AIC
1
2
4
3
0
0
0
0
2
3
3
4
4
0
0
0
0
1
3
3
4
5
0
0
0
0
1
1
2
6
4
0
0
0
0
1
1
2
5
4

QMLE
BIC HQ
0
1
1
2
3
3
2
2
0
0
0
0
0
0
0
0
0
0
0
1
1
2
2
2
2
3
0
0
0
0
0
0
0
0
0
0
0
0
0
0
2
2
2
3
0
0
0
0
0
0
0
0
0
0
0
0
0
0
1
1
1
2
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
1
1

AIC
2
2
4
4
0
0
0
1
1
3
4
4
4
0
0
0
1
0
3
3
3
5
0
0
0
0
0
2
2
4
4
0
0
0
1
1
2
3
5
6

LW T SE
BIC
1
1
2
2
0
0
0
0
0
0
1
2
2
0
0
0
0
0
0
0
1
2
0
0
0
0
0
0
0
0
2
0
0
0
0
0
1
0
0
1

HQ
1
1
2
2
0
0
0
0
0
1
2
3
3
0
0
0
0
0
1
1
1
3
0
0
0
0
0
1
0
1
2
0
0
0
0
0
1
0
1
1

Table 3.7: Percentage of Model Selected, DGP ARFIMA(2, d, 0), ϕ1 = 0.25, ϕ2 = 0.375, d = 0.4,
T = 400
p
0
0
0
0
0
0
0
0
0
1
1
1
1
1
1
1
1
1
2
2
2
2
2
2
2
2
2
3
3
3
3
3
3
3
3
3
4
4
4
4
4

q
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4

AIC
0
0
2
0
0
0
0
0
0
0
0
1
0
0
0
0
0
0
10
1
0
0
0
0
0
0
0
1
0
0
0
1
1
0
0
1
0
0
2
1
1

QMLE
BIC HQ
1
0
0
0
11
9
0
0
0
1
0
0
0
0
0
0
0
0
2
0
1
1
1
3
0
0
0
0
0
0
0
0
0
0
0
0
80
64
0
3
0
0
0
0
0
1
0
0
0
0
0
0
0
0
2
3
0
0
0
0
0
0
0
1
0
0
0
0
0
0
0
0
0
0
0
0
0
2
0
1
0
1

AIC
0
0
1
0
1
0
0
0
0
0
0
2
0
0
0
0
0
0
10
1
0
0
0
0
1
0
0
0
0
0
0
1
1
0
1
1
0
0
2
1
1

LW T SE
BIC
0
0
8
0
0
0
0
0
0
1
0
2
0
0
0
0
0
0
83
1
0
0
0
0
0
0
0
2
0
0
0
0
0
0
0
0
0
0
0
0
0

HQ
0
0
6
0
0
1
0
0
0
0
0
5
0
0
0
0
0
0
66
1
1
0
0
0
0
0
0
2
1
0
0
1
0
0
0
0
0
0
2
1
0

p
4
4
4
4
5
5
5
5
5
5
5
5
5
6
6
6
6
6
6
6
6
6
7
7
7
7
7
7
7
7
7
8
8
8
8
8
8
8
8
8

Key: See Table 3.1.

49

q
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8

AIC
1
1
1
2
0
0
1
2
1
1
3
2
2
0
0
1
1
4
4
3
2
3
0
0
1
1
3
3
3
4
5
0
0
1
1
2
4
5
4
7

QMLE
BIC HQ
0
0
0
0
0
0
0
1
0
0
0
0
0
1
0
1
0
0
0
0
0
1
0
0
0
0
0
0
0
0
0
0
0
0
0
1
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
1
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
1
0
0
0
1

AIC
1
2
1
1
0
0
1
2
1
1
2
2
1
0
0
1
1
5
4
3
1
3
0
0
0
1
2
3
3
3
7
0
0
1
0
2
3
5
6
5

LW T SE
BIC
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0

HQ
0
1
0
0
0
0
0
1
0
0
1
0
0
0
0
0
1
1
1
0
0
0
0
0
0
0
1
0
0
0
1
0
0
0
0
0
0
0
0
1

Table 3.8: Percentage of Model Selected, DGP ARFIMA(2, d, 0), ϕ1 = 0.25, ϕ2 = 0.375, d = 0.4,
T = 1, 000
p
0
0
0
0
0
0
0
0
0
1
1
1
1
1
1
1
1
1
2
2
2
2
2
2
2
2
2
3
3
3
3
3
3
3
3
3
4
4
4
4
4

q
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4

AIC
0
0
2
0
1
1
0
0
0
0
0
1
0
1
0
0
0
0
31
7
1
0
0
0
0
0
0
6
1
0
0
0
0
0
1
0
1
0
1
1
0

QMLE
BIC HQ
0
0
0
0
5
4
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
1
2
0
0
0
0
0
0
0
0
0
0
0
0
93
80
1
4
0
0
0
0
0
0
0
0
0
0
0
0
0
0
1
5
0
1
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
1
0
0
0
0

AIC
0
0
0
0
1
0
0
0
0
0
0
1
1
0
0
0
0
0
25
8
1
0
0
0
0
0
0
5
0
0
0
0
0
0
0
1
2
0
1
1
1

LW T SE
BIC
0
0
2
0
0
0
0
0
0
0
0
1
0
0
0
0
0
0
94
1
0
0
0
0
0
0
0
2
0
0
0
0
0
0
0
0
0
0
0
0
0

HQ
0
0
1
0
0
0
0
0
0
0
0
2
1
0
0
0
0
0
79
6
0
0
0
0
0
0
0
5
0
0
0
0
0
0
0
0
1
0
1
0
0

p
4
4
4
4
5
5
5
5
5
5
5
5
5
6
6
6
6
6
6
6
6
6
7
7
7
7
7
7
7
7
7
8
8
8
8
8
8
8
8
8

Key: See Table 3.1.

50

q
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8

AIC
1
0
1
0
0
0
0
2
1
1
1
1
1
0
0
0
0
1
2
1
1
3
0
0
1
0
2
3
2
2
2
0
0
0
1
1
2
3
4
3

QMLE
BIC HQ
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0

AIC
1
1
0
1
1
0
1
1
1
1
1
1
2
0
0
1
1
2
1
1
2
2
0
0
0
1
1
3
2
2
4
0
0
0
1
2
1
5
5
3

LW T SE
BIC
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0

HQ
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0

Table 3.9: Percentage of Model Selected, DGP ARFIMA(2, d, 0), ϕ1 = 1, ϕ2 = −0.5, d = 0.4,
T = 400
p
0
0
0
0
0
0
0
0
0
1
1
1
1
1
1
1
1
1
2
2
2
2
2
2
2
2
2
3
3
3
3
3
3
3
3
3
4
4
4
4
4

q
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4

AIC
0
0
0
0
0
0
0
0
1
0
0
0
0
0
0
0
0
0
6
1
0
0
0
1
1
0
0
0
0
0
0
0
2
3
2
2
0
0
2
0
0

QMLE
BIC HQ
0
0
0
0
0
0
1
1
0
0
0
0
0
0
0
0
1
1
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
49
37
1
1
0
0
0
0
0
0
0
1
0
0
0
0
0
0
1
1
0
0
0
0
0
0
0
0
1
2
2
3
2
2
1
1
0
1
0
0
0
1
0
0
0
0

AIC
0
0
0
0
0
0
1
0
1
0
0
0
0
0
0
0
0
0
4
1
1
0
0
1
1
1
0
0
0
0
0
0
2
2
2
1
0
0
1
0
1

LW T SE
BIC
0
0
0
1
0
0
1
0
1
0
0
1
0
0
0
0
0
0
38
5
1
0
0
0
1
0
0
4
1
0
0
0
2
2
2
1
0
0
0
0
0

HQ
0
0
0
1
0
1
1
0
1
0
0
0
0
0
0
0
0
0
26
5
1
0
0
0
1
0
0
4
1
0
0
0
2
2
2
1
1
0
1
0
0

p
4
4
4
4
5
5
5
5
5
5
5
5
5
6
6
6
6
6
6
6
6
6
7
7
7
7
7
7
7
7
7
8
8
8
8
8
8
8
8
8

Key: See Table 3.1.

51

q
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8

AIC
3
5
5
4
0
0
1
1
0
2
4
5
3
0
0
1
1
2
2
3
4
4
0
0
0
0
2
2
2
4
4
0
0
0
0
1
2
4
4
4

QMLE
BIC HQ
2
2
5
5
4
5
2
3
0
0
0
0
0
1
0
0
0
0
1
1
3
3
5
5
2
2
0
0
0
0
0
0
0
0
0
1
0
0
2
2
3
3
2
3
0
0
0
0
0
0
0
0
0
0
0
0
0
1
3
3
2
3
0
0
0
0
0
0
0
0
0
0
0
0
0
0
1
1
3
3

AIC
3
5
5
3
0
0
1
1
1
2
4
5
3
0
0
0
1
2
2
2
4
4
0
0
0
0
1
3
2
4
5
0
0
0
0
1
2
3
4
4

LW T SE
BIC
3
4
5
2
0
0
0
0
0
1
3
4
2
0
0
0
0
0
0
2
3
3
0
0
0
0
0
0
1
2
3
0
0
0
0
0
0
0
1
1

HQ
3
5
5
2
0
0
1
1
0
1
4
4
2
0
0
0
0
1
0
2
3
3
0
0
0
0
0
0
1
2
3
0
0
0
0
0
0
0
1
2

Table 3.10: Percentage of Model Selected, DGP ARFIMA(2, d, 1), ϕ1 = 0.25, ϕ2 = 0.375
θ1 = 1/3, d = 0.4, T = 400
p
0
0
0
0
0
0
0
0
0
1
1
1
1
1
1
1
1
1
2
2
2
2
2
2
2
2
2
3
3
3
3
3
3
3
3
3
4
4
4
4
4

q
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4

AIC
0
0
1
0
0
0
0
0
0
1
1
2
0
0
0
0
0
0
3
2
0
0
1
1
0
1
0
1
0
1
0
1
0
0
1
0
0
0
1
1
1

QMLE
BIC HQ
15
2
0
0
9
9
0
1
0
1
0
0
0
0
0
0
0
0
41
19
0
3
1
4
0
0
0
0
0
0
0
0
0
0
0
0
28
31
3
6
0
0
0
0
0
1
0
0
0
0
0
0
0
0
1
5
0
0
0
2
0
1
0
1
0
0
0
0
0
0
0
0
0
0
0
0
0
1
0
1
0
1

AIC
0
0
1
0
0
0
0
0
0
1
2
1
0
0
0
0
0
0
5
2
0
0
0
0
1
1
0
1
0
1
1
1
0
0
0
1
0
0
1
2
1

LW T SE
BIC
1
0
7
0
0
0
0
0
0
28
7
2
0
0
0
0
0
0
48
3
0
0
0
0
0
0
0
2
0
0
0
0
0
0
0
0
0
0
0
0
0

HQ
0
0
6
1
1
0
0
0
0
10
8
4
0
0
0
0
0
0
38
7
0
0
0
0
0
0
0
3
0
2
1
1
0
0
0
0
0
0
2
1
1

p
4
4
4
4
5
5
5
5
5
5
5
5
5
6
6
6
6
6
6
6
6
6
7
7
7
7
7
7
7
7
7
8
8
8
8
8
8
8
8
8

Key: See Table 3.1.

52

q
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8

AIC
1
1
2
2
0
0
2
1
3
2
3
2
2
0
0
1
1
3
4
3
2
4
0
0
1
0
3
2
5
5
4
0
0
1
1
2
2
5
5
8

QMLE
BIC HQ
0
0
0
0
0
1
0
0
0
0
0
0
0
1
0
1
0
1
0
1
0
0
0
0
0
1
0
0
0
0
0
0
0
0
0
1
0
1
0
0
0
0
0
1
0
0
0
0
0
0
0
0
0
0
0
0
0
1
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0

AIC
0
1
1
1
0
0
1
2
3
2
3
2
2
0
0
1
1
3
4
3
2
3
0
0
1
0
3
3
4
4
5
0
0
0
0
2
2
5
6
7

LW T SE
BIC
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0

HQ
0
0
0
0
0
0
1
1
1
1
1
0
0
0
0
0
0
1
0
0
0
0
0
0
0
0
0
0
1
0
0
0
0
0
0
0
0
1
0
1

Table 3.11: Percentage of Model Selected, DGP ARFIMA(2, d, 1), ϕ1 = 1, ϕ2 = −0.5, θ1 = 1/3,
d = 0.4, T = 400
p
0
0
0
0
0
0
0
0
0
1
1
1
1
1
1
1
1
1
2
2
2
2
2
2
2
2
2
3
3
3
3
3
3
3
3
3
4
4
4
4
4

q
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4

AIC
0
0
0
0
1
0
0
0
0
0
0
0
0
0
0
0
0
0
2
11
1
1
1
1
1
1
1
6
2
0
0
0
1
1
0
0
2
0
1
1
1

QMLE
BIC
0
0
0
3
1
0
0
0
0
0
0
1
0
0
0
0
0
0
28
49
1
0
0
0
0
0
0
16
1
0
0
0
0
0
0
0
0
0
0
0
0

HQ
0
0
0
2
2
1
0
0
0
0
0
0
0
0
0
0
0
0
12
45
2
1
1
0
1
0
0
15
2
0
0
0
0
0
0
0
3
0
0
1
0

AIC
0
0
0
1
1
0
1
0
0
0
0
0
0
0
0
0
0
0
1
9
2
1
0
1
1
0
1
7
2
0
0
0
1
1
1
0
2
0
2
1
1

LW T SE
BIC
0
0
0
1
0
0
0
0
0
0
0
1
0
0
0
0
0
0
16
45
2
0
0
0
0
0
0
31
1
0
0
0
0
0
0
0
1
0
0
0
0

HQ
0
0
0
1
1
1
0
0
0
0
0
0
0
0
0
0
0
0
7
38
4
1
0
1
0
0
0
25
2
1
0
0
0
0
0
0
3
1
1
1
0

p
4
4
4
4
5
5
5
5
5
5
5
5
5
6
6
6
6
6
6
6
6
6
7
7
7
7
7
7
7
7
7
8
8
8
8
8
8
8
8
8

Key: See Table 3.1.

53

q
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8

AIC
1
1
2
1
1
0
1
1
1
1
1
2
2
1
0
1
1
1
3
2
2
3
0
0
1
1
2
4
3
4
2
0
0
1
1
1
3
4
6
7

QMLE
BIC HQ
0
0
0
0
0
0
0
0
0
0
0
0
0
1
0
1
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
1
0
1
0
0
0
0
0
0
0
0
0
0
0
1
0
0
0
1
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
1
0
0
0
0

AIC
1
1
1
2
1
0
1
1
1
1
2
1
2
1
0
1
1
1
3
3
2
2
0
0
1
1
2
4
3
3
3
0
0
0
1
1
3
4
6
6

LW T SE
BIC
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0

HQ
0
0
0
0
1
0
1
0
0
0
0
0
0
0
0
1
0
0
0
0
0
0
0
0
0
0
1
1
0
0
0
0
0
0
0
0
0
0
0
0

Table 3.12: Percentage of Model Selected, DGP ARFIMA(2, d, 1), ϕ1 = 1.4, ϕ2 = −0.6
θ1 = −0.8, d = 0.4, T = 400
p
0
0
0
0
0
0
0
0
0
1
1
1
1
1
1
1
1
1
2
2
2
2
2
2
2
2
2
3
3
3
3
3
3
3
3
3
4
4
4
4
4

q
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4

AIC
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
2
2
1
0
1
0
0
0
0
0
0
1
1
1
0
0
0
1
0
0
0
4
1

QMLE
BIC HQ
0
0
0
0
0
0
0
0
0
1
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
21
9
60
47
2
3
0
0
0
1
0
1
0
0
0
0
0
0
5
5
0
0
0
3
0
0
0
1
0
0
0
0
0
1
0
0
2
1
0
0
0
0
0
3
0
0

AIC
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
1
3
0
1
0
0
0
0
0
0
1
1
0
0
0
0
2
1
0
0
1
5
0

LW T SE
BIC
0
0
0
1
0
0
0
0
0
0
0
0
0
0
0
0
0
0
19
58
2
0
0
0
0
0
0
3
2
0
0
0
0
0
0
1
1
0
1
0
0

HQ
0
0
0
1
0
0
0
0
0
0
0
0
0
0
0
0
0
0
8
46
1
0
0
0
0
0
0
2
1
1
1
0
0
1
0
1
1
0
2
2
1

p
4
4
4
4
5
5
5
5
5
5
5
5
5
6
6
6
6
6
6
6
6
6
7
7
7
7
7
7
7
7
7
8
8
8
8
8
8
8
8
8

q
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8

AIC
1
1
2
1
0
0
0
2
0
2
1
4
3
0
0
1
0
3
13
5
6
3
0
0
0
1
0
2
4
4
2
0
0
0
0
2
0
5
9
8

QMLE
BIC HQ
0
1
0
1
0
0
1
1
0
0
0
0
0
0
1
2
0
0
0
1
0
0
3
3
2
2
0
1
0
0
0
0
0
0
0
0
0
4
0
0
2
3
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
1
1
1
0
0
0
0
0
0
0
0
0
0
0
0
0
2
0
0
0
1

AIC
2
1
4
4
0
0
0
1
4
0
0
3
2
0
0
1
2
0
4
4
6
4
0
0
0
1
1
3
4
4
4
0
0
0
0
0
3
4
11
7

LW T SE
BIC
0
1
1
1
0
0
0
0
0
0
0
2
2
0
0
0
0
0
0
0
2
1
0
0
0
0
0
0
0
0
1
0
0
0
0
0
0
0
0
1

HQ
1
1
2
2
0
0
0
1
2
0
0
2
2
1
0
0
1
0
3
1
3
2
0
0
0
1
0
1
1
1
2
0
0
0
0
0
0
0
0
1

Key: Number of replications is 100 and numbers in bold show the percent of correct selections.

54

Table 3.13: Percentage of Model Selected, DGP ARFIMA(2, d, 1), ϕ1 = −0.5, ϕ2 = −0.9,
θ1 = −0.6, d = 0.4, T = 400
p
0
0
0
0
0
0
0
0
0
1
1
1
1
1
1
1
1
1
2
2
2
2
2
2
2
2
2
3
3
3
3
3
3
3
3
3
4
4
4
4
4

q
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4

AIC
0
0
0
0
0
0
0
1
0
0
0
1
0
0
0
0
0
0
0
24
1
2
0
0
0
0
0
0
4
1
0
0
0
1
0
0
0
2
2
1
1

QMLE
BIC HQ
0
0
0
0
0
0
0
0
0
0
0
0
0
0
1
1
0
0
0
0
0
0
1
1
0
0
0
0
0
0
0
0
0
0
0
0
0
0
92
83
1
3
0
0
0
0
0
0
0
0
0
0
0
0
0
0
4
6
0
0
0
0
0
0
0
0
0
0
0
0
0
0
1
2
0
0
0
0
0
1
0
0

AIC
0
0
1
0
0
0
0
3
0
0
0
0
0
0
0
0
0
1
0
12
2
3
0
0
0
0
0
0
4
5
0
0
0
1
0
0
0
2
1
1
0

LW T SE
BIC
0
0
1
0
0
0
0
2
0
0
0
0
0
0
0
0
0
1
0
65
10
0
0
0
0
0
0
0
11
7
0
0
0
0
0
0
0
2
0
0
0

HQ
0
0
1
0
0
0
0
3
0
0
0
0
0
0
0
0
0
1
0
48
11
1
1
0
0
0
0
0
7
7
0
0
0
0
0
0
0
4
0
1
0

p
4
4
4
4
5
5
5
5
5
5
5
5
5
6
6
6
6
6
6
6
6
6
7
7
7
7
7
7
7
7
7
8
8
8
8
8
8
8
8
8

Key: See Table 3.12.

55

q
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8

AIC
1
0
0
1
1
1
0
0
1
0
1
0
0
0
0
0
2
5
1
1
1
1
3
0
1
0
3
4
3
3
2
1
1
2
0
3
5
2
4
5

QMLE
BIC HQ
0
1
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
1
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
1
0
0
0
0
0
0

AIC
0
1
1
0
2
1
0
1
5
1
2
0
2
1
0
0
0
1
3
2
1
0
0
0
0
0
3
2
7
6
1
1
0
1
1
2
1
4
3
8

LW T SE
BIC
0
0
0
0
1
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0

HQ
0
1
0
0
4
0
0
2
2
0
0
0
0
2
0
0
0
0
0
0
0
0
0
0
0
0
0
1
1
1
0
0
0
0
0
0
0
0
0
1

Table 3.14: Percentage of Model Selected, DGP ARFIMA(3, d, 0), ϕ1 = −13/20, ϕ2 = 3/5,
ϕ3 = 27/80, d = 0.4, T = 400
p
0
0
0
0
0
0
0
0
0
1
1
1
1
1
1
1
1
1
2
2
2
2
2
2
2
2
2
3
3
3
3
3
3
3
3
3
4
4
4
4
4

q
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4

AIC
0
10
0
0
0
0
0
0
0
0
16
0
0
0
0
0
0
0
0
13
0
0
0
0
0
0
0
0
16
0
0
0
0
0
0
0
1
7
0
0
0

QMLE
BIC
0
10
0
0
0
0
0
0
0
0
17
1
0
0
0
0
0
0
0
13
0
0
0
0
0
0
0
5
16
0
0
0
0
0
0
0
0
7
0
0
0

HQ
0
10
0
0
0
0
0
0
0
0
17
1
0
0
0
0
0
0
0
13
0
0
0
0
0
0
0
4
16
0
0
0
0
0
0
0
1
7
0
0
0

AIC
0
10
0
0
0
0
0
0
0
0
15
0
0
0
0
1
0
0
0
10
0
0
0
0
0
0
0
3
13
0
0
0
0
0
0
0
0
4
0
0
0

LW T SE
BIC
0
10
0
0
0
0
0
0
0
0
16
1
0
0
0
0
0
0
0
10
1
0
0
0
0
0
0
12
13
0
0
0
0
0
0
0
1
4
0
0
0

HQ
0
10
0
0
0
0
0
0
0
0
15
1
0
0
0
1
0
0
0
10
1
0
0
0
0
0
0
10
13
0
0
0
0
0
0
0
1
4
0
0
0

p
4
4
4
4
5
5
5
5
5
5
5
5
5
6
6
6
6
6
6
6
6
6
7
7
7
7
7
7
7
7
7
8
8
8
8
8
8
8
8
8

Key: See Table 3.12.

56

q
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8

AIC
0
0
0
0
0
16
1
0
0
0
0
0
0
0
7
0
1
0
0
0
0
0
0
5
0
0
0
0
0
0
0
0
4
0
1
0
0
0
0
2

QMLE
BIC HQ
0
0
0
0
0
0
0
0
0
0
16
16
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
7
7
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
4
4
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
4
4
0
0
0
0
0
0
0
0
0
0
0
0
0
0

AIC
0
0
0
1
0
16
0
0
0
2
0
0
0
0
4
0
1
0
0
0
3
0
0
7
0
0
0
0
0
0
0
0
6
0
0
1
0
0
0
3

LW T SE
BIC
0
0
0
0
0
15
0
0
0
0
0
0
0
0
4
0
0
0
0
0
0
0
0
7
0
0
0
0
0
0
0
0
6
0
0
0
0
0
0
0

HQ
0
0
0
0
0
16
1
0
0
0
0
0
0
0
4
0
0
0
0
0
0
0
0
7
0
0
0
0
0
0
0
0
6
0
0
0
0
0
0
0

Table 3.15: Percentage of Model Selected, DGP ARFIMA(3, d, 0), ϕ1 = −0.56163,
ϕ2 = 0.57986, ϕ3 = −0.92705, d = 0.4, T = 400
p
0
0
0
0
0
0
0
0
0
1
1
1
1
1
1
1
1
1
2
2
2
2
2
2
2
2
2
3
3
3
3
3
3
3
3
3
4
4
4
4
4

q
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4

AIC
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
15
1
0
0
0
0
0
12
3
11
4
0
1
0
0
0
13
0
1
0
0

QMLE
BIC
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
19
0
28
0
0
0
0
0
0
38
1
1
0
0
0
0
0
0
12
0
0
0
0

HQ
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
7
0
20
1
0
0
0
0
0
39
1
10
1
0
0
0
0
0
15
1
0
0
0

AIC
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
2
1
0
0
0
0
0
0
0
3
17
2
0
0
0
0
0
0
6
3
0
0
0

LW T SE
BIC
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
7
1
0
0
0
0
0
0
0
40
42
0
0
0
0
0
0
0
10
0
0
0
0

HQ
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
5
1
1
1
0
0
0
0
0
16
45
4
0
0
0
0
0
0
13
2
0
0
0

p
4
4
4
4
5
5
5
5
5
5
5
5
5
6
6
6
6
6
6
6
6
6
7
7
7
7
7
7
7
7
7
8
8
8
8
8
8
8
8
8

Key: See Table 3.12.

57

q
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8

AIC
0
0
0
0
3
0
4
1
0
2
1
0
3
1
0
1
1
0
2
0
1
0
0
0
2
0
3
2
2
1
1
0
0
1
0
0
3
0
2
2

QMLE
BIC HQ
0
0
0
0
0
0
0
0
0
2
0
0
0
0
0
0
0
0
0
0
1
1
0
0
0
0
0
0
0
0
0
1
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
1
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0

AIC
0
0
0
2
4
0
2
4
2
0
0
1
0
0
0
3
1
2
0
1
0
0
0
0
0
0
3
6
4
2
3
0
0
0
0
4
5
6
9
2

LW T SE
BIC
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0

HQ
1
0
0
0
4
1
0
0
0
1
0
0
0
0
0
1
0
1
0
0
0
0
0
0
0
0
2
0
1
0
0
0
0
0
0
0
0
0
0
0

Table 3.16: Percentage of Model Selected, DGP ARFIMA(4, d, 0), ϕ1 = 5/4, ϕ2 = −3/8,
ϕ3 = −1/4, ϕ4 = 3/16, d = 0.4, T = 400
p
0
0
0
0
0
0
0
0
0
1
1
1
1
1
1
1
1
1
2
2
2
2
2
2
2
2
2
3
3
3
3
3
3
3
3
3
4
4
4
4
4

q
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4

AIC
0
0
0
1
1
0
1
1
1
0
0
1
1
0
0
0
0
0
0
0
2
0
0
1
1
1
0
1
0
0
0
1
1
3
4
3
2
0
0
0
1

QMLE
BIC HQ
0
0
0
0
0
0
4
2
3
4
0
0
0
1
1
1
1
1
0
0
0
0
14
8
1
2
0
0
0
0
0
0
0
0
0
0
12
4
0
0
2
5
0
0
0
0
0
0
0
1
0
0
0
0
10
6
0
1
0
1
0
0
0
1
0
1
2
3
3
3
2
2
8
10
0
0
0
0
0
0
0
0

AIC
0
0
0
0
1
0
1
1
1
0
0
1
1
0
0
0
0
0
0
0
1
0
0
0
1
1
0
0
0
0
0
1
1
3
3
2
4
0
1
1
1

LW T SE
BIC
0
0
0
3
3
0
0
1
1
0
0
15
1
0
0
0
0
0
12
0
3
0
0
0
1
1
0
7
1
0
0
0
1
2
3
2
10
0
0
0
0

HQ
0
0
0
1
3
1
1
1
1
0
0
8
2
0
0
0
0
0
4
0
5
0
0
1
1
1
0
4
2
1
0
1
1
2
3
2
13
0
0
1
0

p
4
4
4
4
5
5
5
5
5
5
5
5
5
6
6
6
6
6
6
6
6
6
7
7
7
7
7
7
7
7
7
8
8
8
8
8
8
8
8
8

Key: See Table 3.1.

58

q
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8

AIC
2
4
6
4
0
0
1
1
0
1
4
7
5
0
0
1
0
1
1
3
5
4
0
0
0
0
1
1
2
3
4
0
0
0
0
1
1
2
2
3

QMLE
BIC HQ
1
1
3
4
5
5
3
3
0
1
0
0
0
0
0
0
0
0
0
0
3
4
5
5
3
4
0
0
0
0
0
0
0
0
0
0
0
0
1
1
3
4
3
4
0
0
0
0
0
0
0
0
0
0
0
0
0
0
1
1
2
3
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
1
1

AIC
2
5
4
4
0
0
1
1
1
1
4
5
5
0
0
1
1
1
1
2
4
5
0
0
0
0
1
1
2
4
5
0
0
0
0
2
1
3
3
5

LW T SE
BIC
1
4
4
3
0
0
0
0
0
0
4
4
3
0
0
0
0
0
0
0
2
3
0
0
0
0
0
0
0
1
2
0
0
0
0
0
0
0
0
2

HQ
1
5
4
3
0
0
0
0
0
0
4
4
4
0
0
0
0
0
0
1
2
4
0
0
0
0
0
0
0
1
3
0
0
0
0
1
0
0
0
2

Table 3.17: Percentage of Model Selected, DGP ARFIMA(4, d, 0), ϕ1 = −9/20, ϕ2 = 73/100,
ϕ3 = 87/400, ϕ4 = 27/400, d = 0.4, T = 400
p
0
0
0
0
0
0
0
0
0
1
1
1
1
1
1
1
1
1
2
2
2
2
2
2
2
2
2
3
3
3
3
3
3
3
3
3
4
4
4
4
4

q
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4

AIC
0
9
0
0
0
0
0
0
0
0
16
1
0
0
0
0
0
0
1
5
0
0
0
0
0
0
0
0
14
1
0
0
0
1
0
0
0
5
0
0
0

QMLE
BIC HQ
0
0
9
9
0
0
0
0
0
0
0
0
0
0
0
0
0
0
3
0
18
17
8
6
0
0
0
0
0
0
0
0
0
0
0
0
3
1
5
5
1
1
0
0
0
0
0
0
0
0
0
0
0
0
8
5
14
14
0
2
0
0
0
0
0
0
0
0
0
0
0
0
2
4
4
5
0
0
0
0
0
0

AIC
0
10
0
0
0
0
0
0
0
0
8
0
0
0
0
0
0
0
0
9
0
1
0
0
0
0
0
1
19
1
0
0
0
0
0
0
2
11
0
0
0

LW T SE
BIC
0
10
0
0
0
0
0
0
0
1
8
2
0
1
0
0
0
0
0
9
0
0
0
0
0
0
0
19
17
0
0
0
0
0
0
0
4
11
0
0
0

HQ
0
10
0
0
0
0
0
0
0
0
8
2
0
0
0
0
0
0
0
9
1
1
0
0
0
0
0
11
18
0
0
0
0
0
0
0
6
11
0
0
0

p
4
4
4
4
5
5
5
5
5
5
5
5
5
6
6
6
6
6
6
6
6
6
7
7
7
7
7
7
7
7
7
8
8
8
8
8
8
8
8
8

Key: See Table 3.12.

59

q
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8

AIC
0
0
0
0
0
15
0
1
0
1
1
0
0
0
1
2
0
0
0
0
1
2
0
7
0
0
0
3
0
2
1
0
1
1
0
0
2
3
2
1

QMLE
BIC HQ
0
0
0
0
0
0
0
0
0
0
15
15
0
0
0
1
0
0
0
0
0
0
0
0
0
0
0
0
1
1
0
1
0
0
0
0
0
0
0
0
0
0
0
0
0
0
6
6
0
0
0
0
0
0
1
1
0
0
0
0
0
0
0
0
1
1
0
1
0
0
0
0
0
1
0
2
0
0
1
1

AIC
0
0
0
0
0
9
0
1
0
0
1
0
0
0
5
1
2
0
0
0
0
2
0
4
0
0
2
1
1
0
1
0
0
0
0
0
4
2
0
2

LW T SE
BIC
0
0
0
0
0
9
0
0
0
0
0
0
0
0
5
0
0
0
0
0
0
0
0
4
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0

HQ
0
0
0
0
0
9
0
1
0
0
0
0
0
0
5
2
1
0
0
0
0
0
0
4
0
0
1
0
0
0
0
0
0
0
0
0
0
0
0
0

Table 3.18: Percentage of Model Selected, DGP ARFIMA(4, d, 0), ϕ1 = 1.350, ϕ2 = −0.7,
ϕ3 = 0.4, ϕ4 = −0.31, d = 0.4, T = 400
p
0
0
0
0
0
0
0
0
0
1
1
1
1
1
1
1
1
1
2
2
2
2
2
2
2
2
2
3
3
3
3
3
3
3
3
3
4
4
4
4
4

q
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4

AIC
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
1
0
2
1
0
1
0
1
0
0
0
2
1
0
0
0
3
17
4
4
0
0

QMLE
BIC HQ
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
3
1
3
1
1
2
1
3
0
0
0
0
0
1
0
0
1
1
0
0
0
0
0
0
0
2
0
0
0
0
0
0
0
1
3
3
77
56
1
5
1
2
0
1
0
0

AIC
0
0
0
2
0
0
0
0
1
0
0
0
0
0
0
0
0
0
0
0
0
3
2
0
0
0
1
0
0
0
1
0
0
1
1
2
16
2
2
0
0

LW T SE
BIC
0
0
0
2
0
0
0
0
1
0
0
0
0
1
0
0
0
0
2
1
1
1
1
0
0
0
1
0
0
0
1
0
0
0
0
2
69
4
0
0
0

HQ
0
0
0
2
0
0
0
0
1
0
0
0
0
1
0
0
0
0
1
1
1
6
2
0
0
0
1
0
0
0
1
0
0
1
0
2
47
4
0
0
0

p
4
4
4
4
5
5
5
5
5
5
5
5
5
6
6
6
6
6
6
6
6
6
7
7
7
7
7
7
7
7
7
8
8
8
8
8
8
8
8
8

Key: See Table 3.12.

60

q
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8

AIC
0
1
2
2
2
0
0
0
1
0
2
1
3
0
0
4
1
2
2
1
2
2
0
0
1
1
1
3
3
1
7
0
0
0
1
1
2
5
7
2

QMLE
BIC HQ
0
0
0
1
1
1
1
1
2
4
0
0
0
0
0
0
0
1
0
0
0
0
0
1
2
3
0
0
0
1
0
2
0
0
0
0
0
0
0
0
0
1
1
1
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
2
2
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
2
0
0

AIC
1
1
2
0
1
1
0
0
4
1
1
1
4
4
0
3
0
0
1
0
1
1
0
0
0
4
3
2
5
2
6
0
0
0
0
3
3
4
4
3

LW T SE
BIC
0
0
1
0
3
2
0
0
1
1
0
0
1
2
0
0
0
0
1
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
1

HQ
0
1
1
0
3
1
0
0
3
1
0
0
3
5
0
2
0
0
1
0
0
1
0
0
0
1
0
0
1
0
3
0
0
0
0
1
0
0
0
1

Table 3.19: Percentage of Model Selected, DGP ARFIMA(4, d, 0), ϕ1 = 1.350, ϕ2 = −0.7,
ϕ3 = 0.4, ϕ4 = −0.31, d = 0.3, T = 400
p
0
0
0
0
0
0
0
0
0
1
1
1
1
1
1
1
1
1
2
2
2
2
2
2
2
2
2
3
3
3
3
3
3
3
3
3
4
4
4
4
4

q
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4

AIC
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
4
0
0
0
0
0
0
0
0
1
1
0
1
0
0
22
1
1
0
0

QMLE
BIC HQ
0
0
0
0
0
0
0
0
0
0
0
0
0
0
1
1
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
4
0
1
1
2
1
1
5
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
2
0
0
0
0
0
1
0
0
0
0
83
66
1
3
0
1
0
0
0
0

AIC
0
0
1
0
0
1
0
0
1
0
0
0
0
0
0
0
0
0
0
0
0
2
1
0
0
0
1
0
0
0
1
0
0
1
0
0
15
1
1
1
0

LW T SE
BIC
0
0
1
0
0
1
0
0
1
0
0
0
0
0
0
0
0
0
4
2
1
1
1
0
0
0
1
0
0
0
0
0
0
0
0
0
68
2
0
0
0

HQ
0
0
1
0
0
1
0
0
1
0
0
0
0
0
0
0
0
1
1
0
1
4
3
1
0
0
1
0
0
0
1
0
0
1
0
0
50
3
1
0
0

p
4
4
4
4
5
5
5
5
5
5
5
5
5
6
6
6
6
6
6
6
6
6
7
7
7
7
7
7
7
7
7
8
8
8
8
8
8
8
8
8

Key: See Table 3.12.

61

q
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8

AIC
1
1
2
2
3
1
1
0
2
5
0
4
2
0
0
2
1
5
1
1
1
4
0
0
3
3
2
2
2
1
4
0
0
2
0
1
2
6
0
2

QMLE
BIC HQ
0
0
0
1
1
1
1
2
2
3
0
2
0
1
0
0
0
0
0
0
0
0
0
0
1
1
0
1
0
0
0
1
0
0
0
2
0
0
0
0
0
0
1
1
0
0
0
0
0
1
1
2
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0

AIC
1
0
2
1
2
4
0
1
1
3
0
2
1
1
0
2
1
4
1
0
5
1
0
0
0
3
2
1
4
3
5
0
0
1
1
2
5
5
3
5

LW T SE
BIC
0
0
0
0
9
1
0
0
0
0
0
1
1
2
0
0
0
0
0
0
1
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
1
1
0

HQ
1
0
1
1
7
3
0
0
1
2
0
1
1
2
0
0
1
0
0
0
1
0
0
0
0
1
0
0
0
0
0
1
0
0
0
0
2
1
1
1

Table 3.20: Percentage of Model Selected, DGP ARFIMA(4, d, 1), ϕ1 = 5/4, ϕ2 = −3/8,
ϕ3 = −1/4, ϕ4 = 3/16, θ1 = 1/3,d = 0.4, T = 400
p
0
0
0
0
0
0
0
0
0
1
1
1
1
1
1
1
1
1
2
2
2
2
2
2
2
2
2
3
3
3
3
3
3
3
3
3
4
4
4
4
4

q
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4

AIC
0
0
0
0
1
0
1
1
3
0
0
0
0
0
0
0
1
1
1
0
1
0
0
1
3
2
1
1
0
0
0
0
3
5
7
4
2
0
0
0
0

QMLE
BIC HQ
0
0
0
0
0
0
0
0
1
2
0
0
1
1
1
1
3
3
0
0
0
0
1
1
1
1
0
0
0
0
0
0
1
1
1
1
22
9
0
0
1
2
0
0
0
0
1
1
2
3
2
2
1
1
6
5
0
0
0
0
0
0
0
0
2
2
5
5
6
6
3
4
1
4
0
0
0
0
0
0
0
0

AIC
0
0
0
0
1
0
1
2
1
0
0
0
1
0
0
0
1
1
1
0
1
1
0
1
2
2
2
1
0
1
0
0
2
5
5
3
2
0
0
0
0

LW T SE
BIC
0
0
0
0
1
0
1
2
2
0
0
2
2
0
0
0
1
1
11
0
2
0
0
1
2
1
1
13
0
0
0
0
2
4
4
3
6
0
0
0
0

HQ
0
0
0
0
2
1
1
2
1
0
0
1
2
0
0
0
1
1
5
0
4
0
0
1
2
1
1
8
0
0
0
0
3
5
5
3
6
0
0
0
0

p
4
4
4
4
5
5
5
5
5
5
5
5
5
6
6
6
6
6
6
6
6
6
7
7
7
7
7
7
7
7
7
8
8
8
8
8
8
8
8
8

Key: See Table 3.1.

62

q
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8

AIC
2
5
6
5
0
0
0
0
0
0
3
6
5
0
0
0
0
1
1
2
4
5
0
0
0
0
1
1
1
1
3
0
0
0
0
0
1
1
2
2

QMLE
BIC HQ
2
2
5
5
5
6
4
4
0
0
0
0
0
0
0
0
0
0
0
0
3
3
5
5
4
4
0
0
0
0
0
0
0
0
0
0
0
0
1
1
3
4
4
4
0
0
0
0
0
0
0
0
0
0
0
0
0
0
1
1
2
2
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
1
1

AIC
2
4
6
5
0
0
0
1
0
2
3
5
6
0
0
1
0
1
1
1
3
5
0
0
0
0
1
1
1
2
3
0
0
0
0
1
1
2
2
3

LW T SE
BIC
2
4
6
4
0
0
0
0
0
1
3
4
4
0
0
0
0
0
0
1
2
3
0
0
0
0
0
0
0
1
2
0
0
0
0
0
0
0
0
1

HQ
2
4
6
5
0
0
0
0
0
1
3
5
5
0
0
1
0
0
0
1
2
4
0
0
0
0
0
0
0
2
2
0
0
0
0
0
0
0
0
1

Table 3.21: Modiﬁed BIC and HQ for LW T SE, Percentage of Model Selected, DGP
ARFIMA(2, d, 0) with ϕ1 = 1,ϕ2 = −0.5, d = 0.4 and T = 400
p
0
0
0
0
0
0
0
0
0
1
1
1
1
1
1
1
1
1
2
2
2
2
2
2
2
2
2
3
3
3
3
3
3
3
3
3
4
4
4
4
4

q
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4

Std.
BIC
0
0
2
2
0
0
0
0
0
0
0
1
0
0
0
0
0
0
72
4
0
0
0
0
0
0
1
5
0
0
0
0
0
0
3
0
0
0
0
0
0

Std.
HQ
0
0
0
1
0
0
0
0
0
0
0
1
0
0
0
0
0
0
47
4
1
0
0
1
1
0
1
6
2
0
0
0
0
1
3
1
0
0
4
0
2

Mod.
BIC
0
0
2
0
0
0
0
0
0
0
2
1
0
0
0
0
0
0
87
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0

Mod.
HQ
0
0
2
0
0
0
0
0
0
0
2
1
0
0
0
0
0
0
82
3
0
0
0
0
0
0
0
0
0
0
0
0
0
0
1
0
0
0
0
0
0

p
4
4
4
4
5
5
5
5
5
5
5
5
5
6
6
6
6
6
6
6
6
6
7
7
7
7
7
7
7
7
7
8
8
8
8
8
8
8
8
8

q
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8

Std.
BIC
0
0
0
0
0
0
1
0
0
0
0
0
0
0
0
0
1
0
1
0
1
2
0
0
0
0
1
0
0
1
0
0
0
0
0
0
0
1
0
1

Std.
HQ
0
0
0
0
0
1
1
0
0
0
0
0
0
0
0
0
1
2
3
0
2
2
0
0
0
0
1
1
0
1
1
0
0
0
0
0
3
3
0
2

Mod.
BIC
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
1
0
1
0
1
2
0
0
0
0
1
0
0
0
0
0
0
0
0
0
0
1
0
1

Mod.
HQ
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
1
0
1
0
1
2
0
0
0
0
1
0
0
1
0
0
0
0
0
0
0
1
0
1

Key: Number of replication is 100. Std. and Mod. denote standard and modiﬁed.

63

Table 3.22: Modiﬁed BIC and HQ for LW T SE, Percentage of Model Selected, DGP
ARFIMA(2, d, 0) with ϕ1 = 1,ϕ2 = −0.5, d = 0.4 and T = 1, 000
p
0
0
0
0
0
0
0
0
0
1
1
1
1
1
1
1
1
1
2
2
2
2
2
2
2
2
2
3
3
3
3
3
3
3
3
3
4
4
4
4
4

q
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4

Std.
BIC
0
2
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
64
10
1
0
0
0
0
0
0
4
3
0
0
0
0
0
0
1
0
0
0
0
0

Std.
HQ
0
2
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
54
12
1
0
2
0
0
0
0
3
4
0
0
0
0
0
0
1
0
0
0
0
0

Mod.
BIC
0
2
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
92
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
1
0
0
0
0
0

Mod.
HQ
0
2
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
89
1
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
1
0
0
0
0
0

Key: See Table 3.21

64

p
4
4
4
4
5
5
5
5
5
5
5
5
5
6
6
6
6
6
6
6
6
6
7
7
7
7
7
7
7
7
7
8
8
8
8
8
8
8
8
8

q
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8

Std.
BIC
0
0
1
1
0
0
0
0
0
0
0
0
0
0
1
0
0
0
0
1
1
0
0
0
0
0
0
2
0
2
0
0
0
0
1
1
1
1
0
2

Std.
HQ
0
0
1
1
0
0
0
1
0
0
0
0
0
0
1
1
0
0
1
1
1
0
0
0
0
0
0
3
0
2
0
0
0
1
1
1
1
1
0
3

Mod.
BIC
0
0
1
0
0
0
0
0
0
0
0
0
0
0
1
0
0
0
0
0
0
0
0
0
0
0
0
1
0
1
0
0
0
0
0
0
1
0
0
0

Mod.
HQ
0
0
1
0
0
0
0
0
0
0
0
0
0
0
1
0
0
0
0
1
0
0
0
0
0
0
0
2
0
1
0
0
0
0
0
0
1
0
0
0

Table 3.23: Modiﬁed BIC and HQ for LW T SE, Percentage of Model Selected, DGP
ARFIMA(1, d, 0) with ϕ1 = 0.8, d = 0.4 and T = 400
p
0
0
0
0
0
0
0
0
0
1
1
1
1
1
1
1
1
1
2
2
2
2
2
2
2
2
2
3
3
3
3
3
3
3
3
3
4
4
4
4
4

q
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4

Std.
BIC
0
2
0
0
0
0
0
0
0
66
2
0
0
0
0
0
0
0
4
0
0
0
0
0
0
0
1
0
0
0
0
0
0
0
0
1
0
0
1
0
0

Std.
HQ
0
1
1
0
0
0
0
0
0
38
5
0
0
0
0
0
1
0
5
3
0
0
0
0
0
0
1
0
1
4
0
0
0
0
0
1
0
0
1
0
0

Mod.
BIC
1
2
0
0
0
0
0
0
0
89
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
1
0
0
0
0
0
0
0
0
0
0
0
1
0
0

Mod.
HQ
0
2
0
0
0
0
0
0
0
81
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
1
0
0
0
0
0
0
0
0
1
0
0
1
0
0

Key: See Table 3.21

65

p
4
4
4
4
5
5
5
5
5
5
5
5
5
6
6
6
6
6
6
6
6
6
7
7
7
7
7
7
7
7
7
8
8
8
8
8
8
8
8
8

q
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8

Std.
BIC
0
0
0
0
0
0
0
0
0
0
0
1
0
0
0
0
0
0
0
1
3
2
0
0
0
0
0
1
1
3
2
0
0
0
0
0
3
1
3
2

Std.
HQ
0
0
0
0
0
0
2
0
0
1
0
2
2
0
0
1
0
1
0
3
3
3
0
0
0
0
0
1
4
3
2
0
0
0
0
0
3
2
3
2

Mod.
BIC
0
0
0
0
0
0
0
0
0
0
0
1
0
0
0
0
0
0
0
0
2
1
0
0
0
0
0
0
0
1
0
0
0
0
0
0
0
0
1
0

Mod.
HQ
0
0
0
0
0
0
0
0
0
0
0
1
0
0
0
0
0
0
0
0
3
1
0
0
0
0
0
1
0
1
2
0
0
0
0
0
2
1
2
0

Table 3.24: Modiﬁed BIC and HQ for LW T SE, Percentage of Model Selected, DGP
ARFIMA(4, d, 0), ϕ1 = 5/4, ϕ2 = −3/8, ϕ3 = −1/4, ϕ4 = 3/16, d = 0.4, and T = 1, 000
p
0
0
0
0
0
0
0
0
0
1
1
1
1
1
1
1
1
1
2
2
2
2
2
2
2
2
2
3
3
3
3
3
3
3
3
3
4
4
4
4
4

q
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4

Std.
SIC
0
6
0
0
4
0
0
0
0
0
3
3
0
0
0
1
0
0
2
1
9
0
0
0
2
0
0
4
1
0
0
0
0
0
0
2
25
2
0
0
0

Std.
HQ
0
6
0
0
4
0
1
0
0
0
3
0
1
0
0
1
0
0
0
0
8
0
0
0
2
0
0
0
1
0
0
0
0
0
0
2
22
3
2
0
0

Mod.
SIC
0
7
0
0
0
0
0
0
0
0
3
1
0
0
0
0
0
0
59
1
0
0
0
0
2
0
0
1
0
0
0
0
0
0
0
2
0
2
0
0
0

Mod.
HQ
0
7
0
2
0
0
0
0
0
0
3
13
0
0
0
0
0
0
35
1
1
0
0
0
2
0
0
4
0
0
0
0
0
0
0
2
2
2
0
0
0

Key: See Table 3.21

66

p
4
4
4
4
5
5
5
5
5
5
5
5
5
6
6
6
6
6
6
6
6
6
7
7
7
7
7
7
7
7
7
8
8
8
8
8
8
8
8
8

q
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8

Std.
SIC
0
0
0
1
1
2
0
0
0
0
0
0
0
0
4
0
0
0
0
0
2
0
0
4
0
0
0
0
1
0
1
0
13
0
1
0
0
0
0
5

Std.
HQ
0
0
0
1
0
3
0
1
0
0
0
0
0
0
4
1
1
0
0
1
2
0
0
5
2
0
1
0
1
0
1
0
13
0
1
1
0
0
0
5

Mod.
SIC
0
0
0
0
0
2
0
0
0
0
0
0
0
0
3
0
0
0
0
0
0
0
0
4
0
0
0
0
0
0
0
0
11
0
1
0
0
0
0
1

Mod.
HQ
0
0
0
0
0
2
0
0
0
0
0
0
0
0
3
0
0
0
0
0
1
0
0
4
0
0
0
0
0
0
0
0
13
0
1
0
0
0
0
2

Table 3.25: Modiﬁed BIC and HQ for LW T SE, Percentage of Model Selected, DGP
ARFIMA(4, d, 0), ϕ1 = 1.350, ϕ2 = −0.7, ϕ3 = 0.4, ϕ4 = −0.31, d = 0.4, and T = 1, 000
p
0
0
0
0
0
0
0
0
0
1
1
1
1
1
1
1
1
1
2
2
2
2
2
2
2
2
2
3
3
3
3
3
3
3
3
3
4
4
4
4
4

q
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4

Std.
SIC
0
39
0
0
0
0
0
0
0
0
8
0
0
0
0
0
0
0
0
3
0
0
0
0
0
0
0
0
1
0
0
0
0
0
0
0
8
1
0
0
0

Std.
HQ
0
39
0
0
0
0
0
0
0
0
8
0
0
0
0
0
0
0
0
3
0
0
0
0
0
0
0
0
1
0
0
0
0
0
0
0
6
2
0
0
0

Mod.
SIC
0
40
0
0
0
0
0
0
0
0
8
0
0
0
0
0
0
0
4
3
0
0
0
0
0
0
0
0
1
0
0
0
0
0
0
0
6
1
0
0
0

Mod.
HQ
0
40
0
0
0
0
0
0
0
0
8
0
0
0
0
0
0
0
0
3
0
0
0
0
0
0
0
0
1
0
0
0
0
0
0
0
9
1
0
0
0

Key: See Table 3.21

67

p
4
4
4
4
5
5
5
5
5
5
5
5
5
6
6
6
6
6
6
6
6
6
7
7
7
7
7
7
7
7
7
8
8
8
8
8
8
8
8
8

q
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8
0
1
2
3
4
5
6
7
8

Std.
SIC
0
0
0
0
0
8
0
0
0
0
0
0
0
0
12
0
0
0
0
0
0
0
0
7
0
0
0
0
0
0
0
0
12
0
0
0
1
0
0
0

Std.
HQ
0
0
0
0
0
9
0
0
0
0
0
0
0
0
12
0
0
0
0
0
0
0
0
7
0
0
0
0
0
0
0
0
12
0
0
0
1
0
0
0

Mod.
SIC
0
0
0
0
0
8
0
0
0
0
0
0
0
0
12
0
0
0
0
0
0
0
0
6
0
0
0
0
0
0
0
0
11
0
0
0
0
0
0
0

Mod.
HQ
0
0
0
0
0
8
0
0
0
0
0
0
0
0
12
0
0
0
0
0
0
0
0
7
0
0
0
0
0
0
0
0
11
0
0
0
0
0
0
0

Table 3.26: Estimated ARFIMA Models for U.S. Inﬂation
(1 − ϕ L)(1 − L)d (1 − ΦL12 )(1000 × ∆ ln(CPIt ) − µt ) = (1 − θ L)εt
ARFIMA(0,d,0)

ϕ

-2003.73
3.101
(1.434)
0.332
(0.046)
-

ARFIMA(0,d,0)SARMA(1,0)
-1999.18
3.219
(1.482)
0.316
(0.047)
-

θ

-

-

ARFIMA(1,d,0)SARMA(1,0)
-1999.07
3.168
(1.371)
0.301
(0.057)
0.026
(0.072)
-

ϕ

-

σ2

11.26
(1.003)
-0.11
7.04
30.66
452.30
2.424
2.430
2.426

-0.111
(0.053)
11.127
(1.023)
-0.17
7.43
22.84
439.44
2.415
2.427
2.419

-0.111
(0.052)
11.123
(1.021)
-0.15
7.42
22.09
437.77
2.417
2.435
2.424

Log L
µ
d

m3
m4
Q(20)
Q2 (20)
AIC
BIC
HQIC

ARFIMA(0,d,1)SARMA(1,0)
-1999.06
3.164
(1.363)
0.3
(0.057)
-0.028
(0.074)
-0.111
(0.052)
11.123
(1.021)
-0.15
7.42
22.04
437.60
2.417
2.435
2.424

Key: Seasonally unadjusted inﬂation. Standard errors are reported in parentheses. Log L is the
maximized value of the log likelihood. Q(20) and Q2 (20) are the Ljung-Box Postmanteau
statistics calculated from the ﬁrst 20 residual and squared residual autocorrelations respectively.
m3 and m4 are the sample skewness and kurtosis of the standardized residuals respectively.

68

Table 3.27: Estimated ARFIMA Models for U.S. Inﬂation
(1 − ϕ L)(1 − L)d (1 − ΦL12 )(1000 × ∆ ln(CPIt ) − µt ) = (1 − θ L)εt
ARFIMA(0,d,0)

ϕ

-1862.79
3.635
(1.982)
0.395
(0.055)
-

ARFIMA(0,d,0)SARMA(1,0)
-1859.89
3.857
(2.202)
0.391
(0.058)
-

θ

-

-

ARFIMA(1,d,0)SARMA(1,0)
-1858.55
3.485
(1.676)
0.333
(0.067)
0.094
-0.096
-

ϕ

-

σ2

7.778
-0.782
0.00
8.71
69.73
326.09
2.054
2.060
2.056

-0.088
(0.053)
7.72
-0.783
-0.02
8.83
65.36
341.57
2.049
2.061
2.054

-0.094
(0.054)
7.692
-0.782
0.09
8.88
56.67
326.40
2.048
2.066
2.055

Log L
µ
d

m3
m4
Q(20)
Q2 (20)
AIC
BIC
HQIC

Key: Seasonally adjusted inﬂation. See Table 3.26.

69

ARFIMA(0,d,1)SARMA(1,0)
-1858.12
3.427
(1.567)
0.325
(0.064)
-0.114
(0.100)
-0.096
(0.054)
7.684
-0.781
0.11
8.88
54.21
321.00
2.047
2.065
2.054

Chapter 4

METHODS FOR SELECTION OF ARFIMA MODELS: APPLICATIONS

4.1 Introduction
This chapter considers applications of the model selection procedure discussed in the previous
chapter. We ﬁnd that Information Criteria (IC) are useful in ARFIMA model speciﬁcation. Several
economic and ﬁnance time series are investigated, such as U.S. real output, G-7 monthly inﬂation,
interest rate and realized volatility of exchange rate. We consider both Maximum Likelihood
Estimation (MLE) and Local Whittle Two-Step Estimation (LW T SE) procedure. We ﬁnd that the
selected model and estimation results vary with the IC and estimation procedure. The MLE − BIC,
which is our suggested criterion, generally selects parsimonious models.

4.2 Application to Real Output
Whether real Gross Domestic Product (GDP) is difference stationary or trend stationary is long
debate in macroeconomic research. Nelson and Plosser (1982) had argued that the presence of
unit root of GNP cannot be rejected. Campbell and Mankiw (1987) found strong persistence in
output shock. In contrast, Clark (1987) and Cochrane (1988) found small persistence on the Gross
National Product (GNP) series. An application of the long memory model contributes to this debate
by avoiding “knife-edge” unit root test distinction. The GPH estimator was applied by Diebold and
Rudebusch (1989) to the U.S. real GNP and Industrial Production (IP) index. They found evidence
of long memory in both measures of real output. Sowell (1992) used the ARFIMA model to test
whether the series should be modeled by trend-stationary model or difference-stationary model.
He estimated the ARFIMA model of ﬁrst differenced GNP and found that GNP is consistent with
both models. Quarterly GDP and the monthly IP index of G-7 countries were studied by Cheung

70

and Lai (1992). They found that the U.S. and the U.K. GDP are fractional integrated. Using long
spans of real GNP and GNP per capita ranging from 1869 to 2001, Mayoral (2006) rejected unitroot and trend stationary hypothesis. She also found that the fractional integration (FI) model and
structural break (SB) model are more favorable than the linear ARIMA model. Using her proposed
technique for distinguishing between FI and SB, the FI model is more preferable than the SB
model and the persistence parameter is non-stationary but mean reversion.
The data considered in this section include quarterly data on real U.S. GDP ranging from the
ﬁrst quarter of 1947 to the second quarter of 2010 and monthly data on the U.S. IP index ranging
from January 1947 to September 2010. Both series were seasonally adjusted and collected from
the Federal Reserve at St. Louis.1 The long memory models were estimated on the ﬁrst differenced
series. The maximum lags were set at pmax = 8 and qmax = 8.
The model selection results from MLE and LW T SE are presented in Table 4.1.
ARFIMA(1, d, 0) and ARFIMA(0, d, 0) were selected by MLE − BIC for the GDP and IP index,
respectively. Table 4.2 presents the estimation results. For the GDP series, the MLE estimate of
d is -0.341. The LW estimates vary from -0.084 to 0.283. For the IP, we obtained a value of
d = 0.182. The long memory behaviors are consistent with existing literature.

4.3 Application to Inﬂation
The persistence of inﬂationary shocks is a central debate in macroeconomic literature. An understanding of the dynamics of inﬂation rates is crucial for central bankers in controlling the inﬂation.
In case of countries with high persistence of inﬂation, the inﬂationary shock will take a long time to
dissipate. Hassler and Wolters (1995) and Baillie, Chung, and Tieslau (1996) found that inﬂation
rates in major industrial countries are fractional integrated. The evidences of long memory were
extended to other industrial countries and emerging countries by Baum, Barkoulas, and Caglayan
(1999). The changing in persistence in U.S. inﬂation was considered by Kumar and Okimoto
1 http://research.stlouisfed.org/fred2

71

(2007). They found a decline of persistence in the last two decades. Hassler and Meller (2010)
provide a formal test for a break in the long memory parameter.
In this section, we focus on the inﬂation of G-7 countries, i.e. the U.S., Canada, the U.K,
France, Germany, Japan, and Italy. The monthly inﬂation series are calculated from Consumer
Price Index (CPI) obtained from the OECD statistical website.2 The data ranged from January
1995 to August 2010. The CPI series are seasonally unadjusted. Monthly inﬂation rates were
calculated by 1000∆(CPIt ).
In model identiﬁcation, the maximum lags were set at pmax = 15 and qmax = 4. The results
from the model selection procedure are shown in Table 4.3. The results from estimating ARFIMA
models are given in Table 4.4. For the U.S., an ARFIMA(0, d, 0) model was found to be favorable
among 80 speciﬁcations, with the estimate of d being 0.365 and a robust asymptotic standard error
of 0.048. The LW estimation gave estimates of the long memory parameter of dLW = 0.665 from
a bandwidth of m = T 0.5 ; and dLW = 0.685 from m = T 0.6 ; and dLW = 0.368 from m = T 0.7 ; and
dLW = 0.361 from m = T 0.8 . For other G-7 countries, the long memory estimates from the MLE
range from 0.284 for the U.K. to 0.951 for Italy.

4.4 Application to Realized Volatility of Exchange Rate
Modeling the asset price return is crucial for ﬁnancial economists and risk managers in risk management, portfolio management and other ﬁnancial activities. In general, the compounded return
of asset price can be expected to be stationary or to have little serial correlation. On the contrary, it
is widely accepted that asset price volatility is more predictable. Ding, Granger, and Engle (1993)
found long memory feature of absolute return and power transformation of absolute return in the
stock market. The evidence of long memory behavior was found in either conditional variance
model or squared returns. Baillie, Bollerslev, and Mikkelsen (1996) applied the Fractional Generalized Autoregressive Conditional Heteroskedasticity (FIGARCH) model to the exchange rate and
2 http://stats.oecd.org

72

Bollerslev and Mikkelsen (1996) applied the exponential FIGARCH model to the stock price.3
The long memory stochastic volatility (LMSV ) model was introduced by Breidt, Crato, and de
Lima (1998) and Harvey (1998). They found the evidence of long memory stochastic volatility in
the exchange rate and stock price. The modiﬁed LMSV model using LW estimator was considered
by Hurvich and Ray (2003). Because of the availability of high frequency data in the ﬁnancial
market, the “realized volatility”, which is the sum of squared of intraday returns, was introduced by
Andersen, Bollerslev, Diebold, and Labys (2001). They argued that their new volatility measure is
observable and able to implement to simple modeling techniques. Andersen, Bollerslev, Diebold,
and Labys (2001) found evidence of long memory behavior in DM/$ and Yen/$ realized volatility.
The ARFIMA models of realized volatility were considered in Andersen, Bollerslev, Diebold, and
Labys (2003) and Morana and Beltratti (2004).
The data in this section is the realized exchange rate volatility of Andersen, Bollerslev,
Diebold, and Labys (2003) for the Deutsche Mark (DM)/US Dollar and Japanese Yen/US Dollar
exchange rate. The daily realized volatility processes have been obtained by summing squared of
30-minute returns over 48 observations. The data cover the period from December 1, 1986 through
June 30, 1999, a total of 3,045 observations.4
The log of realized volatility is estimated by MLE and LW T SE with pmax = 8 and qmax = 8.
Table 4.5 presents the selected models for realized volatility series. MLE − BIC selects the most
parsimonious model, i.e. ARFIMA(0, d, 0) for both DM/$ and Yen/$. The estimates are presented
in Table 4.6. The log of realized volatilities of DM/$ and Yen/$ have long memory characteristic
with long memory parameters equal to 0.382 and 0.414, respectively. The LW estimates gave
similar long memory parameters to MLE.
3 Caporin (2003) studied the identiﬁcation issue of FIGARCH models.
4 I am grateful to Prof. Francis X. Diebold for making daily series available in his website.

73

4.5 Application to Interest Rates
The study of persistence of interest rates is a crucial topic in macroeconomic and ﬁnance literature.
In conducting monetary policy, monetary authorities change short-term interest rates that will move
long-term interest rates and, eventually, affect the real economy. Earlier works on the long memory
property of interest rates were studied by Shea (1991) and Backus and Zin (1993). Using the
GPH estimator, Shea (1991) found non-stationary long memory in the interest rate term structure.
Backus and Zin (1993) applied ARFIMA model to 3-month coupon rate and found evidence
of long memory. Caporale and Gil-Alana (2010) implemented a semi parametric test for U.S.
interest rate levels and spreads. They found that spreads are nonstationary but mean-reverting.
Long memory properties were also found in other forms of interest rates, such as, U.S. real interest
rates Tsay (2000); corporate bond yields and spreads McCarthy, Pantalone, and Li (2009).
The interest rates considered in this study are the U.S. Treasury constant maturity rates with
monthly data spanning from January 1959 to September 2010. We apply the ARFIMA models to
different maturities, which are 3 and 6 months, and 1, 5, 7 and 10 years. Data were obtained from
the Board of Governors of the Federal Reserve System.5
The ARFIMA models of interest rates and interest rates spread are estimated with pmax = 15
and qmax = 4. Model selection results are presented in Tables 4.7 and 4.8. The parsimonious
models, e.g. ARFIMA(0, d, 1) and ARFIMA(2, d, 3), were chosen by MLE − BIC except for the
3-month maturity rate. The estimates of the ARFIMA models for interest rate levels are shown
in Table 4.9. The d parameter of 3-month maturity equals 0.408. The long memory parameters
of the other maturities fall into the non-stationary region and ranges between 0.834 and 0.905.
The estimates of interest spread between short- and long-term interest rates are shown in Table
4.10. The long memory parameter of the interest rate spread between the 10-year Treasury rate
and the 3-month Treasury rate (Y10-M3) is 0.805. The interest rate spread between the 10-year
Treasury rate and the 6-month Treasury rate (Y10-M6) is also high persistence with d parameter
in the non-stationary region. The results suggest a long memory feature in interest spreads.
5 http://www.federalreserve.gov/releases/h15/data.htm

74

4.6 Conclusion
This section examines the applications of ARFIMA model selections on a set of macroeconomic
and ﬁnancial time series. We ﬁnd that the MLE − BIC generally selects relatively more parsimonious models than MLE − AIC and MLE − HQ. We ﬁnd the evidence of long memory properties
in every series we considered. We also ﬁnd that the degree of persistence of MLE and LW T SE are
typically different.
Some of the estimated models appear to have substantial ARCH effects. It is unclear if BIC
remains valid in this situation and this is a topic to possibly be investigated in the future.

75

APPENDIX

76

Appendix 4A: Tables
Table 4.1: Model selection for U.S. Real Output
Estimation Selection
procedure Criteria
QMLE
AIC
QMLE
BIC
QMLE
HQ
LW T SE
AIC
LW T SE
BIC
LW T SE
HQ
Number of observations

Selected ARFIMA(p, d, q) model
100[ln(GDPt ) − ln(GDPt−1 )] 100[ln(IPt ) − ln(IPt−1 )]
(7,d,6)
(4,d,3)
(1,d,0)
(0,d,0)
(1,d,0)
(4,d,3)
(8,d,7)
(5,d,5)
(8,d,7)
(1,d,1)
(8,d,7)
(1,d,1)
254
765

Key: GDP denotes Real Quarterly U.S. GDP and ranges from ﬁrst quarter of 1947 to second
quarter of 2010. IP denotes Index of Industrial Product and ranges from January 1947 to
September 2010. Both variables are ﬁrst differenced and taken log.

77

Table 4.2: Estimated ARFIMA Models for U.S. Real Output

Model
Log L
ˆ
µ
dˆ
ˆ
ϕ1
ˆ
σ2
m3
m4
Q(20)
Q2 (20)
AIC
BIC
HQ
LW (m = T 0.5 )
LW (m = T 0.6 )
LW (m = T 0.7 )
LW (m = T 0.8 )

100[ln(GDPt ) − ln(GDPt−1 )]
(1,d,0)
-337.283
0.816
(0.027)
-0.341
(0.183)
0.703
(0.173)
0.842
(0.102)
0.26
4.68
23.60
34.09
-0.156
-0.128
-0.145
-0.084
-0.058
0.208
0.283

100[ln(IPt ) − ln(IPt−1 )]
(0,d,0)
-967.83
0.257
(0.099)
0.182
(0.045)
0.738
(0.053)
0.18
4.96
37.80
78.71
-0.302
-0.296
-0.299
0.066
0.115
0.24
0.261

Key: Standard errors are reported in parentheses. Log L is the maximized value of the log
likelihood. Q(20) and Q2 (20) are the Ljung-Box Postmanteau statistics calculated from the ﬁrst
20 residual and squared residual autocorrelations respectively. m3 and m4 are the sample
skewness and kurtosis of the standardized residuals respectively. m is the choice of bandwidth.

78

Table 4.3: Model Selection for G-7 Monthly Inﬂation
Estimation
procedure
QMLE
QMLE
QMLE
LW T SE
LW T SE
LW T SE

Selection
Criteria
AIC
BIC
HQIC
AIC
BIC
HQIC

Canada
(4,d,2)
(4,d,2)
(4,d,2)
(9,d,1)
(9,d,1)
(9,d,1)

Order (p,d,q) of selected ARFIMA model
France Germany Italy
Japan U.K.
(14,d,3) (1,d,3) (10,d,4) (1,d,2) (2,d,1)
(3,d,3) (1,d,3) (1,d,1) (1,d,2) (2,d,1)
(12,d,0) (1,d,3) (10,d,4) (1,d,2) (2,d,1)
(15,d,3) (15,d,4) (10,d,4) (4,d,2,) (9,d,1)
(3,d,3) (5,d,4) (1,d,1) (4,d,2,) (9,d,1)
(12,d,0) (15,d,4) (10,d,4) (4,d,2,) (9,d,1)

Key: Number of observations is 668.

79

U.S.
(8,d,4)
(0,d,0)
(8,d,4)
(10,d,4)
(5,d,4)
(5,d,4)

Table 4.4: Estimated ARFIMA Models for G-7 Monthly Inﬂation

Model
Log L
ˆ
µ
dˆ
ˆ
ϕ1
ˆ
ϕ2
ˆ
ϕ3
ˆ
ϕ4
ˆ
θ1
ˆ
θ2
ˆ
θ3
ˆ
σ2
m3
m4
Q(20)
Q2 (20)
AIC
BIC
HQIC
LW (T 0.5 )
LW (T 0.6 )
LW (T 0.7 )
LW (T 0.8 )

Canada
(4,d,2)
-1841
2.419
(0.953)
0.365
(0.043)
-0.355
(0.064)
-1.054
(0.058)
-0.330
(0.063)
-0.056
(0.054)
-0.043
(0.017)
-0.972
(0.019)
14.625
(1.184)
0.55
5.37
48.83
24.94
2.704
2.751
2.722
0.856
0.692
0.386
0.368

France
(3,d,3)
-1739
1.855
(3.005)
0.564
(0.211)
-1.234
(0.227)
0.051
(0.396)
0.517
(0.257)
-

Germany
(1,d,3)
-1754
1.736
(1.397)
0.585
(0.101)
0.388
(0.128)
-

Italy
(1,d,1)
-1711
0.734
(1.774)
0.951
(0.257)
0.390
(0.178)
-

Japan
(1,d,2)
-2191
1.377
(2.214)
0.568
(0.085)
-0.163
(0.122)
-

-

-

-

-

-1.065
(0.268)
0.381
(0.495)
0.695
(0.312)
10.790
(1.259)
0.91
10.08
110.87
330.37
2.400
2.447
2.418
0.707
0.525
0.488
0.414

0.846
(0.117)
-0.019
(0.082)
0.023
(0.053)
11.295
(0.842)
0.62
4.71
119.42
56.51
2.439
2.473
2.452
0.791
0.608
0.259
0.230

Key: See Table 4.2.

80

U.S.
(0,d,0)
-1647
2.329
(0.881)
0.365
(0.048)
-

-

U.K.
(2,d,1)
-2052
3.885
(1.188)
0.284
(0.042)
-0.975
(0.051)
0.026
(0.051)
-

-

-

-

0.885
(0.071)
-

-0.995
(0.004)
-

-

-

0.334
(0.104)
0.339
(0.070)
-

-

-

9.917
(1.123)
0.14
9.54
57.17
275.14
2.303
2.324
2.311
0.924
0.649
0.504
0.453

41.775
(3.322)
0.54
5.22
142.99
187.16
3.744
3.771
3.755
0.593
0.648
0.344
0.201

27.592
(2.927)
1.35
8.50
259.27
83.17
3.330
3.357
3.340
0.722
0.656
0.417
0.283

8.174
(0.729)
-0.25
6.30
53.15
122.042
2.104
2.111
2.107
0.665
0.685
0.368
0.361

-

-

Table 4.5: Model Selection for Realized Volatility of Exchange Rates
Estimation
procedure
QMLE
QMLE
QMLE
LW T SE
LW T SE
LW T SE

Selection
Criteria
AIC
BIC
HQIC
AIC
BIC
HQIC

Selected ARFIMA(p, d, q) model
DM/$
JP Yen/$
(8,d,8)
(4,d,8)
(0,d,0)
(0,d,0)
(1,d,1)
(0,d,0)
(7,d,6)
(4,d,1)
(1,d,1)
(1,d,1)
(1,d,1)
(1,d,1)

Key: Number of observations is 3,045.

Table 4.6: Estimated ARFIMA Models for Realized Volatility of Exchange Rates

Model
Log L
ˆ
µ
dˆ
ˆ
σ2
m3
m4
Q(20)
Q2 (20)
AIC
BIC
HQIC
LW (m = T 0.5 )
LW (m = T 0.6 )
LW (m = T 0.7 )
LW (m = T 0.8 )

DM/$
(0,d,0)
-2727.495
-1.124
(0.156)
0.382
(0.013)
0.351
(0.009)
0.76
4.87
33.87
33.85
-1.046
-1.044
-1.045
0.555
0.589
0.500
0.476

Key: See Table 4.2.

81

JP/$
(0,d,0)
-2931.457
-1.271
(0.206)
0.414
(0.014)
0.402
(0.010)
0.70
4.47
15.56
76.26
-0.911
-0.910
-0.911
0.553
0.504
0.500
0.482

Table 4.7: Model Selection for Interest Rates
Estimation
procedure
QMLE
QMLE
QMLE
LW T SE
LW T SE
LW T SE

Selection
Criteria
AIC
BIC
HQ
AIC
BIC
HQ

3 months
(13,d,4)
(9,d,2)
(11,d,4)
(13,d,4)
(9,d,3)
(11,d,4)

Selected ARFIMA(p, d, q) model
6 months 1 year
5 years 7 years
(11,d,4) (12,d,4) (14,d,4) (12,d,4)
(2,d,3)
(2,d,3)
(0,d,1)
(0,d,1)
(11,d,4)
(9,d,2)
(6,d,4)
(0,d,1)
(14,d,4) (13,d,4) (14,d,4) (10,d,4)
(2,d,3)
(2,d,3)
(0,d,1)
(0,d,1)
(9,d,3)
(9,d,2)
(5,d,2) (10,d,4)

10 years
(10,d,4)
(0,d,1)
(0,d,1)
(9,d,4)
(0,d,2)
(0,d,2)

Key: Number of observations is 621.

Table 4.8: Model Selection for Interest Rate Spreads
Estimation
procedure
QMLE
QMLE
QMLE
LW T SE
LW T SE
LW T SE

Selection
Criteria
AIC
BIC
HQ
AIC
BIC
HQ

Selected ARFIMA(p, d, q) model
Y10-M3
Y10-M6
(13,d,4)
(10,d,4)
(0,d,1)
(0,d,1)
(14,d,0)
(5,d,2)
(12,d,4)
(3,d,4)
(1,d,2)
(3,d,4)
(12,d,4)
(3,d,4)

Key: Number of observations is 621. Y10-M3 is an interest rate spread between 10-year Treasury
rate and 3-month Treasury rate. Y10-M6 is an interest rate spread between 10-year Treasury rate
and 6-month Treasury rate.

82

Table 4.9: Estimated ARFIMA Models for U.S. Interest Rates
Model
Log L
ˆ
µ
dˆ
ˆ
ϕ1
ˆ
ϕ2
ˆ
ϕ3
ˆ
ϕ4
ˆ
ϕ5
ˆ
ϕ6
ˆ
ϕ7
ˆ
ϕ8
ˆ
ϕ9
ˆ
θ1
ˆ
θ2
ˆ
θ3
ˆ
σ2
m3
m4
Q(20)
Q2 (20)
AIC
BIC
HQIC
LW (T 0.5 )
LW (T 0.6 )
LW (T 0.7 )
LW (T 0.8 )

3 months
(9,d,2)
-296.58
2.817
(0.324)
0.408
(0.104)
-0.425
(0.097)
0.340
(0.102)
0.457
(0.070)
-0.115
(0.055)
0.257
(0.047)
-0.051
(0.050)
-0.146
(0.046)
0.061
(0.042)
0.353
(0.039)
-1.467
(0.060)
-0.807
(0.068)
0.152
(0.009)
-0.87
16.46
26.28
768.33
-1.844
-1.758
-1.811
0.794
1.003
0.884
0.958

6 months
(2,d,3)
-276.643
3.042
(0.313)
0.857
(0.040)
-0.892
(0.062)
-0.717
(0.053)
-

1 year
(2,d,3)
-285.458
3.365
(0.298)
0.834
(0.038)
-0.845
(0.052)
-0.695
(0.052)
-

5 years
(0,d,1)
-216.614
3.927
(0.301)
0.905
(0.038)
-

7 years
(0,d,1)
-144.877
4.047
(0.265)
0.905
(0.037)
-

10 years
(0,d,1)
-60.528
4.058
(0.238)
0.905
(0.036)
-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-

-1.502
(0.058)
-1.445
(0.051)
-0.580
(0.040)
0.143
(0.008)
-1.04
13.54
67.17
747.74
-1.928
-1.885
-1.911
0.820
1.002
0.895
0.971

-1.538
(0.046)
-1.493
(0.045)
-0.659
(0.035)
0.147
(0.008)
-0.58
10.94
51.01
877.71
-1.899
-1.856
-1.883
0.860
0.981
0.898
0.965

-0.531
(0.041)
-

-0.552
(0.040)
-

-0.509
(0.043)
-

-

-

-

0.118
(0.007)
-0.22
9.38
53.81
576.42
-2.134
-2.120
-2.128
0.938
0.959
0.920
0.982

0.093
(0.005)
0.03
7.57
43.22
502.16
-2.365
-2.351
-2.359
0.974
0.954
0.935
0.999

0.071
(0.004)
-0.02
7.93
40.07
416.31
-2.636
-2.622
-2.631
1.010
0.965
0.979
1.030

Key: See Table 4.2.
83

Table 4.10: Estimated ARFIMA models for U.S. Interest Rate Spreads

Model
Log L
ˆ
µ
dˆ
ˆ
θ1
ˆ
σ2
m3
m4
Q(20)
Q2 (20)
AIC
BIC
HQIC
LW (T 0.5 )
LW (T 0.6 )
LW (T 0.7 )
LW (T 0.8 )

Y10-M3
(0,d,1)
-221.182
1.186
(0.316)
0.805
(0.041)
-0.508
(0.046)
0.119
(0.007)
0.32
13.71
65.60
504.97
-2.119
-2.105
-2.114
0.415
0.690
0.848
0.884

Key: See Table 4.2.

84

Y10-M6
(0,d,1)
-116.9
0.972
(0.272)
0.856
(0.042)
-0.455
(0.047)
0.085
(0.005)
0.51
13.50
60.15
382.70
-2.455
-2.441
-2.449
0.421
0.751
0.888
0.926

Chapter 5

CONCLUSION

This dissertation considers two methodological issues in long memory modeling, multi-step forecasting and model selections. The comparisons between the parametric model, i.e. ARFIMA , and
the semi parametric model, e.g. LW , in term of forecasting and model selection are investigated.
Applications of model selection to real output, inﬂation rates, volatility, and interest rates are also
considered.
We consider the effects on multi step prediction of using LW T SE compared to MLE for
ARFIMA models in Chapter 2. From detailed simulations, we found that the predictor based on
MLE is generally superior in term of MSE sense to the predictor based on LW T SE. Even in case
of misspeciﬁcation, the MLE still performs better than the LW T SE.
Chapter 3 considers model selection of the ARFIMA model. The Bayesian (BIC) and HannanQuinn (HQ) information criteria are consistent model selection procedures for the ARFIMA model
estimated by QMLE. Simulation results show that the BIC based on QMLE works very well for
low order models. We also suggest the modiﬁed information criteria for LW T SE.
In Chapter 4, we apply the model selection with MLE and LW to real output, inﬂation rates,
volatility, and interest rates series. We found that the BIC with MLE mostly select the parsimonious
model. The long memory parameters from the selected model substantiate long memory behavior
in these series.

85

BIBLIOGRAPHY

86

BIBLIOGRAPHY

Andersen, T.G., T. Bollerslev, F.X. Diebold, and P. Labys. (2001) “The Distribution of realized
exchange rate volatility,” Journal of the American Statistical Association, 96, 42 – 55.
Andersen, T.G., T. Bollerslev, F.X. Diebold, and P. Labys. (2003) “Modeling and forecasting
realized volatility,” Econometrica, 71, 579 – 625.
Andrews, D.W.K. and P. Guggenberger, (2003) “A bias-reduced log-periodogram regression
estimator for the long-memory parameter,” Econometrica, 71, 675 – 712.
Andrews, D.W.K. and Y. Sun, (2004) “Adaptive local polynomial Whittle estimation of long-range
dependence,” Econometrica, 72, 569 – 614.
Backus, D.K. and S.E. Zin, (1993) “Long-memory inﬂation uncertainty: Evidence from the term
structure of interest rates,” Journal of Money, Credit and Banking, 25, 681 – 700.
Bai, J. and S. Ng, (2002) “Determining the number of factors in approximate factor models,”
Econometrica, 70, 191 – 221.
Baillie, R.T., (1980) “Predictions from ARMAX models,” Journal of Econometrics, 12, 365 – 374.
Baillie, R. T. (1996) “Long memory processes and fractional integration in econometrics,” Journal
of Econometrics, 73, 5 – 59.
Baillie, R.T., T. Bollerslev, and H.O. Mikkelsen, (1996) “Fractionally integrated generalized
autoregressive conditional heteroskedasticity,” Journal of Econometrics, 74, 3 – 30.
Baillie, R.T., C.F. Chung, and M.A. Tieslau, (1996) “Analysing inﬂation by the fractionally
integrated ARFIMA-GARCH model,” Journal of Applied Econometrics, 11, 23 – 40.
Baillie, R.T., and G. Kapetanios, (2007) “Testing for neglected nonlinearity in long- memory
models,” Journal of Business and Economic Statistics, 25, 447 – 461.
Baillie, R.T., and G. Kapetanios, (2008) “Nonlinear models for strongly dependent processes with
ﬁnancial applications,” Journal of Econometrics, 147, 60 – 71.
Baillie, R.T. and G. Kapetanios (2009) “Estimation and Inference for Impulse Response Weights
From Strongly Persistent Processes”, MSU working paper.

87

Barkoulas, J. and C.F. Baum, (2006) “Long-memory forecasting of US monetary indices,” Journal
of Forecasting, 25, 291 – 302.
Baum, C.F., J.T. Barkoulas, and M. Caglayan, (1999) “Persistence in international inﬂation rates,”
Southern Economic Journal, 65, 900 –901.
Beran, J., (1994) Statistics for Long Memory Processes, (Chapman and Hall, New York, NY).
Beran, J., R.J. Bhansali, and D. Ocker, (1998) “On uniﬁed model selection for stationary and
nonstationary short-and long-memory autoregressive processes,” Biometrika, 85, 921 – 934.
Bhansali, R.J. and P.S. Kokoszka, (2002) “Computational of the forecast coefﬁcients for multistep
prediction of long-range dependent time series,” International Journal of Forecasting, 18, 181 –
206.
Bollerslev, T. and H.O. Mikkelsen, (1996) “Modeling and pricing long memory in stock market
volatility,” Journal of Econometrics, 73, 151 – 184.
Breidt, F.J., N. Crato, and P. de Lima, (1998) “The detection and estimation of long memory in
stochastic volatility,” Journal of Econometrics, 83, 325 – 348.
Campbell, J.Y. and N.G. Mankiw, (1987) “Are output ﬂuctuations transitory?,” Quarterly Journal
of Economics, 102, 857 – 880.
Caporale, G.M. and L.A. Gil-Alana. 2010. “Fractional cointegration in US term spreads,”
Discussion Papers of DIW Berlin.
Caporin, M., (2003) “Identiﬁcation of long memory in GARCH models,” Statistical Methods &
Applications, 12, 133 – 151.
Chan, N.H. and W. Palma, (1998) “State space modeling of long memory processes,” Annals of
Statistics, 26, 719 – 740.
Cheung, Y.-W. and K.S. Lai, (1992) “International evidence on output persistence from postwar
data,” Economics Letters, 38, 435 – 441.
Clark P.K., (1987) “The cyclical component of U.S. economic activity,” Quarterly Journal of
Economics, 102, 797 – 814.

88

Cochrane, J.H. (1988) “How big is the random walk in GNP?,” Journal of Political Economy, 96,
893 – 920.
Crato, N. and B. K. Ray. (1996) “Model selection and forecasting for long-range dependent
processes.” Journal of Forecasting, 15, 107–125.
Dalla, V., L. Giraitis, and J. Hidalgo, (2005) “Consistent estimation of the long memory parameter
for nonlinear time series,” Journal of Time Series Analysis, 27, 211 – 251.
Delgado, M.A. and P.M. Robinson, (1996) “Optimal spectral bandwidth for long memory,”
Statistica Sinica, 6, 97 – 112.
Diebold, F.X. and P. Lindner, (1996) “Fractional integration and interval prediction,” Economics
Letters, 50, 305 – 313.
Diebold F.X. and R.S. Mariano, (1995) “Comparing predictive accuracy,” Journal of Business &
Economic Statistics, 13, 134 – 144.
Diebold F.X. and G.D. Rudebusch, (1989) “Long memory and persistence in aggregate output,”
Journal of Monetary Economics, 24, 189 – 209.
Ding, Z., C.W.J. Granger, and R.F. Engle, (1993) “A long memory property of stock market
returns and a new model,” Journal of Empirical Finance, 1, 83 – 106.
Doornik, J.A. and M. Ooms, (2004) “Inference and forecasting for ARFIMA models with an
application to US and UK inﬂation, Studies in Nonlinear Dynamics and Econometrics, 8, Article
14.
Fox, R. and M.S. Taqqu, (1986) “Large-sample properties of parameter estimates for strongly
dependent stationary Gaussian time series,” The Annals of Statistics, 14, 517 – 532.
Geweke, J. and S. Porter-Hudak, (1983) “The Estimation and Application of Long Memory Time
Series Models,” Journal of Time Series Analysis, 4, 221 – 238.
Granger, C.W.J., (1980) “Long memory relationships and the aggregation of dynamic models,”
Journal of Econometrics, 14, 227 – 238.
Granger, C.W.J. and R. Joyeux, (1980) “An introduction to long memory time series models and
fractional differencing.” Journal of Time Series Analysis, 1, 15 – 39.

89

Granger, C.W.J. and P. Newbold, (1996) Forecating Economic Time Series, second edition,
(Academic Press, New York).
Groen, J.J.J. and G. Kapetanios, (2010) “Model selection criteria for factor-augmented regression,”
Queen Mary University of London working paper.
Hannan, E.J., (1980) “The estimation of the order of an ARMA process,” Annals of Statistics , 81,
1071 – 1081.
Hannan, E.J. and B.G. Quinn, (1979) “The determination of the order of an autoregression,”
Journal of the Royal Statistical Society (Series B), 41, 190 – 195.
Harvey, A.C., (1998) “Long memory in stochastic volatility,” In Knight, J. and S. Satchell (eds.),
Forecasting Volatility in Financial Markets, (Butterworth-Heinemann, London, U.K.).
Hassler, U. and J. Wolters, (1995) “Long memory in inﬂation rates: International evidence,”
Journal of Business & Economic Statistics, 13, 37 – 45.
Hassler, U. and B. Meller, (2010) “Detecting Changes in U.S. inﬂation persistence: Evidence
from a new test under long memory,” Working paper, (Goethe-Universität Frankfurt, Frankfurt,
Germany).
Henry, M., (2001) “Robust automatic bandwidth for long memory,” Journal of Time Series
Analysis, 22, 293 – 316.
Henry, M. and P.M. Robinson, (1996) “Bandwidth choice in Gaussian semiparametric estimation
of long range dependence,” In: Robinson, P. and M. Rosenblatt, (eds.) Athens conference on
applied probability and time series analysis. Volume II : time series analysis in memory of E.J.
Hannan. Lecture notes in statistics 115, 220 – 232, (Springer-Verlag, Berlin).
Henry, M. and P. Zaffazoni, (2003) “The long-range dependence paradigm for macroeconomics
and ﬁnance,” In: Doukhan P., G. Oppenheim, and M. S. Taqqu. (eds.) Long-Range Dependence,
417 – 438, (Birkhauser, New York, NY).
Hidalgo, J., (2002) “Consistent order selection with strongly dependent data and its application to
efﬁcient estimation,” Journal of Econometrics, 110, 213–239.
Hosking, J.R.M., (1981) “Fractional differencing,” Biometrika, 68, 165 – 176.

90

Hosoya, Y., (1997) “A limit theory for long range dependence and statistical inference on related
models,” Annals of Statistics, 25, 105 – 137.
Hurst, H.E., (1956) “Methods of using long term storage in reservoirs,” Procedings of the Institute
of Civil Engineers, 1, 519 – 543.
Hurvich, C.M. and B.K. Ray, (2003) “The local whittle estimator of long-memory stochastic
volatility,” Journal of Financial Econometrics, 1, 445 – 470.
Jensen, M., (2009) “The long run Fisher effect: can it be tested?,” Journal of Money, Credit and
Banking, 41, 221–243.
Kapetanios, G., (2001) “Model selection in threshold models,” Journal of Time Series Analaysis,
22, 733 – 754.
Katayama, N., (2008) “Asymptotic prediction mean squared error for long-memory processes
with estimated parameters,” Journal of Forecasting, 27, 690-720.
Kunsch, H.R., (1987) “Statistical aspects of self-similar processes,” In: Yu, P and V.V. Sazanov
(eds.)Proceedings of the First World Congress of the Bernoulli Society, 67 –74, (VNU Science
Press, Utrecht).
Kumar, M.S. and T. Okimoto, (2007) “Dynamics of persistence in international inﬂation rates,”
Journal of Money, Credit and Banking, 39, 1457 – 1479.
Mayoral, L., (2006) “Further evidence on the Statistical Properties of Real GNP,” Oxford Bulletin
of Economics and Statistics, 68, 901 – 920.
McCarthy, J., C. Pantalone, and H.C. Li. (2009) “Investigating Long Memory in Yield Spreads,”
The Journal of Fixed Income, 19, 73–81.
Morana, C. and A. Beltratti, (2004) “Structural change and long-range dependence in volatility of
exchange rates: either, neither or both?,” Journal of Empirical Finance, 11, 629 – 658.
Nelson, C.R. and C.R. Plosser, (1982) “Trends and random walks in macroeconmic time series:
Some evidence and implications,” Journal of monetary economics, 10, 139 – 162.
Peiris, M.S., (1987) “A note on the predictors of differenced sequences,” The Australian Journal
of Statistics, 29, 42 – 48.

91

Peiris, M.S., and B.J.C. Perera, (1988) “On prediction with fractionally differenced ARMA
Models,” Journal of Time Series Analysis, 9, 215 – 220.
Phillips, P.C.B., (2007) “Unit root periodogram,” Journal of Econometrics, 138, 104 – 124.
Phillips, P.C.B. and K. Shimotsu, (2004) “Local Whittle estimation in nonstationary and unit root
cases,” Annals of Statistics, 32, 656 – 692.
Phillips, P.C.B. and K. Shimotsu, (2006) “Local Whittle estimation of fractional integration and
some of its variants,” Journal of Econometrics, 130, 209 – 233.
Poskitt, D.S., (2007) “Autoregressive approximation in nonstandard situations: the fractionally integrated and non-invertible cases,” Annals of the Institute of Statistical Mathematics, 59, 697 – 725.
Pukkila, T., S. Koreihsa, and A. Kallinen, (1990) “The identiﬁcation of ARMA models,”
Biometrika, 77, 537 – 548.
Ray, B.K., (1993) “Modelling long-memory processes for optimal long-range prediction,” Journal
of Time Series Analysis, 14, 511 – 523.
Robinson, P.M., (1995) “Gaussian semiparametric estimation of long range dependence,” The
Annals of Statistics, 23, 1630 – 1661.
Robinson, P.M., (2006) “Conditional-sum-of-squares estimation of models for stationary time
series with long memory,” IMS Lecture Notes-Monograph Series Time Series and Related Topics,
52, 130 – 137.
Schwarz, G., (1978) “Estimation the dimension of a model,” Annals of Statistics, 6, 461–464.
Shea, G.S., (1991) “Uncertainty and implied variance bounds in long-memory models of the
interest rate term,” Empirical Economics, 6, 287 – 312.
Shibata, R., (1976) “Selection of the order of an autoregressive model by Akaike’s information
criterion,” Biometrika, 63, 117 – 126.
Shibata, R., (1980) “Asymptotically efﬁcient selection of the order of the model for estimating
parameters of a linear process,” Annals of Statistics, 8, 147 – 164.
Sin, C.Y. and H. White, (1996) “Information criteria for selecting possibly misspeciﬁed parametric
models,” Journal of Econometrics, 71, 207–225.

92

Sowell, F.B., (1992b) “Modeling long-run behavior with the fractional ARIMA model,” Journal
of Monetary Economics, 29, 277 – 302.
Tsay, W.-J., (2000) “Long memory story of the real interest rate,” Economics Letters, 67, 325 –
330.
van Dijk, D., P.H. Frances, and R. Paap, (2002) “A nonlinear long memory model with application
to US unemployment,” Journal of Econometrics, 110, 135–165.
Wright J.H., (1995) “Stochastic orders of magnitude associated with two-stage estimators of
fractional ARIMA systems,” Journal of Time Series Analysis, 16, 119 – 126.
Yamamoto, T., (1981) “Predictions of multivariate autoregressive-moving average models,”
Biometrika, 68, 485 – 492.

93