2: Markov Models for Count Time Series (4/5)

Search in book...
Toggle Font Controls
Create new playlist

Name your new playlist

Playlist description (optional)
Sign In

Email address

Password

Forgot Password?

or

Continue with Facebook

Continue with Google
Sign Up

Full Name

Email address

Confirm Email Address

Password

or

Continue with Facebook

Continue with Google

44 Handbook of Discrete-Valued Time Series

For count time series, q-dependence for some positive integer q is generally not expected

based on context, as there may be no reason for independence to occur with longer lags.

Mixed Markov/q-dependent models with small p, q are better models to consider if the

dependence is not as simple as Markov dependence.

The copula version with p = q = 1 is dened below (it does extend to p, q ≥ 1by

combining the above constructions for Markov order p and q-dependence). Let C

and K

be two different bivariate copulas. Dene

= C

−1

= K

−1

= F

−1

2|1

(

s−1

), U

2|1

(

t−1

), Y

where {

} is a sequence of innovation random variables and {W

} is a sequence of unob-

served U(0, 1) random variables.

2.6 Statistical Inference and Model Comparisons

For NB margins, numerical maximum likelihood is possible for all of the Markov models in

the preceding three sections. For GP margins, all are possible except the ones based on the

generalized thinning operators denoted as I2 and I3. With numerical maximum likelihood

for Markov models, the asymptotic likelihood inference theory is similar to that for inde-

pendent observations, using the results in Billingsley (1961). The CLS method can estimate

the mean of a stationary distribution but not additional parameters such as overdispersion.

For applications to count time series with small counts, generally low-order Markov

models are adequate. The q-dependent models which are the analogies of Gaussian MA(q)

are usually less appealing within the context of count data. The analogue of Gaussian

ARMA(1,1) has been used in the literature on count time series, but there is less experience

with such models. It would be desirable to have the ARMA(1,1) equivalent of the above

models in the three categories. The joint likelihood of Y

, ..., Y

(time series of length n)

involves either a high-dimensional sum or integral so that maximum likelihood estimation

is not feasible. However for the ARMA(1,1) analogue in the CCID class or copula approach,

the joint pmf of three consecutive observations is, respectively, a triple sum, and a triple

sum plus an integral. Therefore, likelihood inference could proceed with composite like-

lihood based on sum of log-likelihoods of subsets of three consecutive observations, as in

Davis and Yau (2011) and Ng et al. (2011). Because of space constraints, the data analysis

below will only involve Markov models.

In the remainder of this section, the three classes of Markov count time series models are

tted and discussed for one data set, and some results are briey summarized for another

data set that had previously been analyzed.

For some count time data sets with low counts, serial dependence and explanation of

trends from covariates, we consider monthly downloads to specialized software. A source

for such series is http://econpapers.repec.org/software/. Such data are con-

sidered here because the next month’s total download is plausibly based on the current

month’s total download through a generalized thinning operator. A similar data set with

daily downloads for the program TeXpert is used in Weiß (2008).

The specic series consist of the monthly downloads of Signalpak (a package for

octave, which is a Matlab clone) from September 2000 to March 2013. See Figure 2.1.

45 Markov Models for Count Time Series

1.0

0.6

2002 2006 2010

ACF

Signalpak downloads

0.2

−0.2

0.0 0.5 1.0 1.5

Year Lag

FIGURE 2.1

Time series plot of monthly download of signalpak; and the autocorrelation function.

This series looks close to stationary. For longer periods of time, there is no reason to expect

the series to be stationary because software can go through periods or local trends of more

or less demand. We also consider a covariate for forecasting that is a surrogate for the

popularity of the octave software; the surrogate is the total monthly abstract views of

octave codes that are at the website referred to earlier. A summary of model ts is given in

Table 2.1.

The use of the covariate marginally improves the log-likelihood and root mean square

prediction error. Otherwise the Markov order 1 models t about the same, and Markov

2 models did not add explanatory power. As might be expected, the generalized thinning

operators t a little better than binomial thinning since those operators are more intuitive

for the context of these data. For thinning operators I2 and I3,the γ parameter was set

to be the largest possible so that they lead to more conditional heteroscedasticity than the

model with binomial thinning. Experience with I2 and I3 is that for short time series, if γ

is estimated, it is usually at one of the boundaries.

Table 2.1 lists maximum likelihood estimates and corresponding standard errors for one

of the better tting models without/with the covariate. The estimates of the univariate

parameters are similar for the different models as well as the strength of serial dependence

at lag 1. Note that the addition of the covariate decreases the estimation of overdispersion

and lag 1 serial dependence.

For another count time series data set, we also comment on comparisons of ts of

models for the data set in Zhu and Joe (2006) on monthly number of claims of short-

term disability benets, made by workers in the logging industry with cut injuries, to

the B.C. Workers’ Compensation Board for the period of 10 years from 1985 to 1994.

This data set of claims is one of few data sets in the literature where “survivor” inter-

pretion of binomial thinning is plausible. There is clear seasonality from the time series

plot and autocorrelation function plot, so we use the covariates (sin(2πt/12), cos(2πt/12)).

This led to root mean square prediction errors (RMSPE) that were from 2.78 to 2.86 with

no seasonal covariates and 2.64 to 2.73 with seasonal covariates. The Markov model with

a Gaussian copula tted a little better than those based on thinning operators in terms of



46 Handbook of Discrete-Valued Time Series

TABLE 2.1

The Covariate Is One Hundredth of the Number of Downloads of Octave Programs at the Website

in the Preceding Month

regr. Dependence Order −Loglik rmspe

No covariate

NB Gaussian copula 1 424.49 4.17

GP Gaussian copula 1 424.36 4.17

NB Gaussian copula 2 422.72 4.13

NB Frank copula 1 428.65 4.24

NB Gumbel copula 1 427.35 4.24

NB re.Gumbel copula 1 424.07 4.15

NB conv-closed oper. 1 425.24 4.18

NB binomial thinning 1 426.49 4.18

NB I2 thinning 1 425.36 4.18

NB I3 thinning 1 425.83 4.18

One covariate

NB1 Gaussian copula 1 416.07 4.02

NB2 Gaussian copula 1 416.03 4.02

GP1 Gaussian copula 1 415.99 4.02

NB1 Gaussian copula 2 415.79 4.01

NB2 Gaussian copula 2 415.76 4.02

NB1 Frank copula 1 416.98 4.05

NB2 Frank copula 1 416.90 4.05

NB1 Gumbel copula 1 416.00 4.01

NB2 Gumbel copula 1 416.25 4.01

NB1 re.Gumbel copula 1 417.14 4.02

NB2 re.Gumbel copula 1 417.04 4.02

NB1 conv-closed oper. 1 415.99 4.02

NB binomial thinning 1 416.10 4.02

NB I2 thinning 1 415.63 4.02

NB I3 thinning 1 415.86 4.02

Model (Markov order 1) MLE SE

NB with Gaussian copula and no covariates β

= 2.40 0.05

0.89 0.27

ρˆ = 0.42 0.08

NB1 with Gaussian copula β

= 1.93 0.11

= 0.09 0.02

0.48 0.18

ρˆ = 0.22 0.09

Note: For the models with covariates based on the I2 and I3 thinning operators, the formulation of Section 2.3.5

is used. The root mean square prediction error (rmspe) is dened as {(n − p)

−1

− E

t−1

t=1+p

t−1

, ..., Y

t−p

= y

t−p

)}

1/2

,where n is the length of the series and p is the Markov order and E

means the

conditional expectation with parameter equal to the maximum likelihood estimate (MLE) of the model. In

the bottom, MLEs and corresponding standard errors are given for one of the better tting models without

and with the covariate.

47 Markov Models for Count Time Series

log-likelihood, but the Frank copula model with NB2 regression margins was the best in

terms of log-likelihood and RMSPE.

Similar to Table 2.1, there is a bit more variation among copula model ts than those

based on thinning operators (the latter models tend to be close to the Gaussian copula

model). An explanation is that different copula families have a wide variety of shapes of

conditional expectation and variance functions.

Finally, we indicate an advantage of count time series models with known univariate

marginal distributions. In this case, univariate and conditional probabilities can be easily

obtained and also predictive intervals with and without the previous observation(s). A pre-

dictive interval without the previous time point is a regression inference and a predictive

interval with the previous time point is an inference that combines regression and forecast-

ing. The rst type of predictive interval would not be easy to do with other classes of count

time series models that are based on conditional specications.

Acknowledgments

This work was supported by an NSERC Discovery grant.

References

Al-Osh, M. A. and Alzaid, A. A. (1987). First-order integer-valued autoregressive INAR(1) process.

Journal of Time Series Analysis, 8:261–275.

Alzaid, A. A. and Al-Osh, M. (1990). An integer-valued pth-order autoregressive structure INAR(p)

process. Journal of Applied Probability, 27(2):314–324.

Alzaid, A. A. and Al-Osh, M. A. (1993). Some autoregressive moving average processes with

generalized Poisson marginal distributions. Annals of the Institute of Statistical Mathematics,

45:223–232.

Beare, B. K. (2010). Copulas and temporal dependence. Econometrica, 78(1):395–410.

Biller, B. (2009). Copula-based multivariate input models for stochastic simulation. Operations

Research, 57(4):878–892.

Biller, B. and Nelson, B. L. (2005). Fitting time-series input processes for simulation. Operations

Research, 53(3):549–559.

Billingsley, P. (1961). Statistical Inference for Markov Processes. University of Chicago Press, Chicago, IL.

Cameron, A. C. and Trivedi, P. K. (1998). Regression Analysis of Count Data. Cambridge University

Press, Cambridge, U.K.

Davies, R. B. (1973). Numerical inversion of a characteristic function. Biometrika, 60:415–417.

Davis, R. A., Dunsmuir, W. T. M., and Wang, Y. (2000). On autocorrelation in a Poisson regression

model. Biometrika, 87:491–505.

Davis, R. A. and Yau, C. Y. (2011). Comments on pairwise likelihood in time series models. Statistica

Sinica, 21(1):255–277.

Du, J.-G. and Li, Y. (1991). The integer-valued autoregressive (INAR(p)) model. Journal of Time Series

Analysis, 12(2):129–142.

Escarela, G., Mena, R. H., and Castillo-Morales, A. (2006). A exible class of parametric transition

regression models based on copulas: Application to poliomyelitis incidence. Statistical Methods

in Medical Research, 15:593–609.

48 Handbook of Discrete-Valued Time Series

Fokianos, K. (2011). Some recent progress in count time series. Statistics, 45(1):49–58.

Fokianos, K. (2012). Count time series models. In Rao, T. S., Rao, S. S., and Rao, C. R., eds., Time Series

Analysis: Methods and Applications, vol. 30, Chapter 12, pp. 315–347. Elsevier B.V., Amsterdam,

the Netherlands.

Gauthier, G. and Latour, A. (1994). Convergence forte des estimateurs des paramètres d’un processus

GENAR(p). Annales des Sciences Mathematique du Québec, 18:49–71.

Hua, L. and Joe, H. (2013). Strength of tail dependence based on conditional tail expectation. Journal

of Multivariate Analysis, 123:143–159.

Joe, H. (1996). Time series models with univariate margins in the convolution-closed innitely

divisible class. Journal of Applied Probability, 33(3):664–677.

Joe, H. (1997). Multivariate Models and Dependence Concepts. Chapman & Hall, London, U.K.

Jørgensen, B. and Song, P. X. K. (1998). Stationary time series models with exponential dispersion

model margins. Journal of Applied Probability, 35(1):78–92.

Jung, R. C. and Tremayne, A. R. (2011). Convolution-closed models for count time series with

applications. Journal of Time Series Analysis, 32(3):268–280.

Latour, A. (1997). The multivariate GINAR(p) process. Advances in Applied Probability, 29(1):

228–248.

Latour, A. (1998). Existence and stochastic structure of a non-negative integer-valued autoregressive

process. Journal of Time Series Analysis, 19:439–455.

Lawless, J. F. (1987). Negative binomial and mixed Poisson regression. Canadian Journal of Statistics,

15:209–225.

Lawrance, A. J. and Lewis, P. A. W. (1980). The exponential autoregressive-moving average

earma(p, q) process. Journal of the Royal Statistical Society B, 42:150–161.

McKenzie, E. (1985). Some simple-models for discrete variate time-series. Water Resources Bulletin,

21(4):645–650.

McKenzie, E. (1986). Autoregressive moving-average processes with negative-binomial and geomet-

ric marginal distributions. Advances in Applied Probability, 18(3):679–705.

McKenzie, E. (1988). Some ARMA models for dependent sequences of Poisson counts. Advances in

Applied Probability, 20:822–835.

McKenzie, E. (2003). Discrete variate time series. In Shanbag, D. N. and Rao, C. R., eds., Stochastic Pro-

cesses: Modelling and Simulation, Handbook of Statistics, vol. 21, pp. 573–606. Elsevier, Amsterdam,

the Netherlands.

McNeil, A. J., Frey, R., and Embrechts, P. (2005). Quantitative Risk Management. Princeton University

Press, Princeton, NJ.

Ng, C. T., Joe, H., Karlis, D., and Liu, J. (2011). Composite likelihood for time series models with a

latent autoregressive process. Statistica Sinica, 21(1):279–305.

Panagiotelis, A., Czado, C., and Joe, H. (2012). Pair copula constructions for multivariate discrete

data. Journal of the American Statistical Association, 107:1063–1072.

Steutel, F. W. and Van Harn, K. (1979). Discrete analogs of self-decomposability and stability. Annals

of Probability, 7(5):893–899.

Van Harn, K. and Steutel, F. W. (1993). Stability equations for processes with stationary indepen-

dent increments using branching-processes and Poisson mixtures. Stochastic Processes and Their

Applications, 45(2):209–230.

Weiß, C. H. (2008). Thinning operations for modeling time series of counts—A survey. ASTA-Advances

in Statistical Analysis, 92(3):319–341.

Zheng, H., Basawa, I. V., and Datta, S. (2006). Inference for pth-order random coefcient integer-

valued autoregressive processes.

Journal of Time Series Analysis,

27(3):411–440.

Zheng, H., Basawa, I. V., and Datta, S. (2007). First-order random coefcient integer-valued autore-

gressive processes. Journal of Statistical Planning and Inference, 137(1):212–229.

Zhu, R. and Joe, H. (2003). A new type of discrete self-decomposability and its application to

continuous-time Markov processes for modeling count data time series. Stochastic Models,

19(2):235–254.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.

Table of Contents for 2: Markov Models for Count Time Series (4/5)

Create new playlist

Sign In

Sign Up

Table of Contents for
2: Markov Models for Count Time Series (4/5)