19: Models for Multivariate Count Time Series (3/4)

Search in book...
Toggle Font Controls
Create new playlist

Name your new playlist

Playlist description (optional)
Sign In

Email address

Password

Forgot Password?

or

Continue with Facebook

Continue with Google
Sign Up

Full Name

Email address

Confirm Email Address

Password

or

Continue with Facebook

Continue with Google





417 Models for Multivariate Count Time Series

Application of composite likelihood approach implies the usage, for example, of bivari-

ate marginal log-likelihood functions over all pairs instead of the usage of the multivariate

likelihood. As an illustration, consider a trivariate probability function P(x, y, z; θ), where

θ is a vector of parameters to estimate. The log-likelihood to be maximized is of the form

(θ) =

i=1

log P(x

, y

, z

; θ) while the composite log-likelihood is



(θ) = [log P(x

, y

; θ) + log P(x

, z

; θ) + log P(y

, z

; θ)] ,

i=1

that is, we replace the trivariate distribution by the product of the bivariate ones. The price

to be paid is some loss of efciency which can be very large for some models. On the other

hand, the optimization problem is usually easier. Simulations for the diagonal trivariate

MINAR(1) model have shown small efciency loss (Pedeli and Karlis, 2013a). Therefore, the

composite likelihood method makes feasible the application of some multivariate models

for time series and this is worth further exploration. Alternatively, one may employ an

expectation-maximization (EM) algorithm making use of the latent structure imposed by

the convolution (Pedeli and Karlis, 2013a).

Finally, Bayesian estimation of the BINAR model with bivariate Poisson innovations is

described in Sofronas (2012).

19.3.4 Other Models in This Category

Ristic et al. (2012) developed a simple bivariate integer-valued time series model with posi-

tively correlated geometric marginals based on the negative binomial thinning mechanism.

Bulla et al. (2011) described a model based on another operator called the signed binomial

operator allowing tting integer-valued data in Z. Recently, Scotto et al. (2014) derived a

model for correlated binomial data and Nastic et al. (2014) discussed a model with Poisson

marginals with same means.

Quoreshi (2006, 2008) described properties of a bivariate moving-average model

(BINMA). The BINMA(q

, q

) model takes the form

= u

+ a

◦ u

1,t−1

+···+a

1,t−q

= u

+ a

◦ u

2,t−1

+···+a

2,t−q

where u

are innovation terms following some positive discrete distribution. Estimation

using conditional least squares, feasible least squares, and generalized method of moments

is described. Cross-correlation between the series is implied by assuming dependent inno-

vations terms. Extensions to the multivariate case are described in Quoreshi (2008). Similar

to the MINAR model, the multivariate vector integer MA model can allow for mixing

between the series. In both the bivariate and the multivariate models, no parametric

assumptions for the innovations were given. A BINMA model has been studied in Brännäs

and Nordström (2000).













418 Handbook of Discrete-Valued Time Series

19.4 Parameter-Driven Models

Parameter-driven models have also been considered for count time series. The main idea is

that the serial correlation imposed is due to some correlated latent processes on the param-

eter space. They offer useful properties since the serial correlation of the latent process

drives the serial correlation properties for the count model. On the other hand, estimation

is usually much harder. We describe such multivariate models in the following.

19.4.1 Latent Factor Model

Jung et al. (2011) presented a factor model which in fact belongs to the family of parameter-

driven models. The model was applied to the number of trades of ve stocks belonging to

two different sectors within a 5 min interval for a period of 61 days. The model assumed

that the number of trades y

for the ith stock at time t follows a Poisson distribution with

mean θ

, while

log θ

= µ

+ γλ

+ δ

+ φ

where µ

is a mean specic to the ith stock, which perhaps may relate to some covariates

specic to the stock as well, λ

is a latent common market factor, τ

= (τ

, τ

)



is a latent

vector of industry-specic factors and ω

= (ω

, ..., ω

) are latent stock-specic factors.

The model assumes an AR(1) specication for the latent factors, namely,

| λ

t−1

∼ N

+ ν

t−1

, σ

| τ

s,t−1

∼ N

+ ν

s,t−1

, σ

| ω

i,t−1

∼ N

+ ν

i,t−1

, σ

Clearly, cross-correlation enters the model by assuming common factors, while serial cor-

relation from the latent AR processes. The likelihood is complicated, and the authors

developed an efcient importance sampling algorithm to evaluate the log-likelihood and

apply the simulated likelihood approach.

19.4.2 State Space Model

Jorgensen et al. (1999) proposed a multivariate Poisson state space model with a common

factor that can be analyzed by a standard Kalman lter. The model assumes that the multi-

ple counts y

, i = 1, ..., d at time t follow conditionally independent Poisson distributions,

namely,

∼ Poisson(α

)





 

419 Models for Multivariate Count Time Series

with α

= exp(x



), where x

is a vector of time-varying covariates including a constant

and further

|θ

t−1

∼ Gamma

t−1

where Gamma(a, b

) is a gamma distribution with mean a and coefcient of variation b.

Kalman ltering is used for the latent process.

Finally, Lee et al. (2005) proposed a model starting from a bivariate zero-inated Poisson

regression with covariates, adding random effects that are correlated in time.

19.5 Observation-Driven Models

In observation-driven models, serial correlation comes from the fact that current obser-

vations relate directly to the previous ones. The Poisson autoregression model dened in

Fokianos et al. (2009) constitutes an important member of this class for univariate series.

The model has a feedback mechanism and is dened as

t−1

∼ Poisson(λ

), λ

= d + aλ

t−1

+ bY

t−1

for t ≥ 1, where the parameters d, a, b are assumed to be positive. In addition, assume

that Y

and λ

are xed and F

t−1

is the information up to time t − 1. The model is called

INGARCH, but the name perhaps is very ambitious. Properties of the model can be seen in

Fokianos et al. (2009). The model has found a lot of work after this (Trostheim, 2012; Davis

and Liu, 2015). The Poisson INGARCH is incapable of modeling negative serial depen-

dence in the observations which is, however, possible by the self-excited threshold Poisson

autoregression model (see Wang et al., 2014). Extensions to higher-order INGARCH(p,q)

(see Weiß, 2009), nonlinear relationships, and other distributional assumptions have been

also proposed in the literature. This type of model offers a much richer autocorrelation

structure than models based on thinning, like long memory properties, for example. Their

estimation is more computational demanding, the same is true for deriving their properties.

Extension to higher dimensions has also been proposed. Liu (2012) proposed a bivariate

Poisson integer-valued GARCH (BINGARCH) model. This model is capable of modeling

the time dependence between two time series of counts. Consider two time series Y

and

. We assume that

= (Y

, Y

)|F

t−1

∼ BP2(λ

, λ

, φ),

where BP2(λ

, λ

, φ) denotes a bivariate Poisson distribution with marginal means λ

and λ

, respectively, and covariance equal to φ. This is a reparameterized version of

the distribution in (19.1). Furthermore, we assume for the general BIV. INGARCH(m,q)

model that

= δ + A

t−i

+ B

t−j

i=1 j=1

420 Handbook of Discrete-Valued Time Series

= (λ

, λ

)



, δ > 0 is 2-vector and A

and B

are 2 × 2 matrices with nonnegative entries.

Conditions to ensure the positivity of λ are given.

For example, the BIV.INGARCH(1,1) model takes the form

= δ + Aλ

t−1

+ BY

t−1

or equivalently











 



 

1,t−1

2,t−1

Stability properties for this model are not simple. Liu (2012) used the iterated random

functions approach that allows us to derive the stability properties under a contracting

constraint on the coefcient matrices. Inference procedures are also presented and applied

to real data in the area of trafc accident analysis.

Heinen and Rengifo (2007) developed a model called the Autoregressive Conditional

double Poisson model for a d-dimensional vector of counts y

. The model relates to the one

given earlier. Their model was based on the double Poisson distribution dened in Efron

(1986):

| F

t−1

∼ DP(λ

, φ

), i = 1, ..., d,

where λ

is the mean of the double Poisson distribution of the ith count at time t and φ

an overdispersion parameter. For φ

= 1, we get the Poisson distribution. Then the mean

vector λ

= (λ

, ..., λ

) is dened as a VARMA process

= ω + Aµ

t−1

+ BY

t−1

where A and B are appropriate matrices. Stationarity is ensured as long as the eigenvalues

of (I − A − B) lie within the unit circle. The VARMA order specication can be modied.

The cross-correlation between the counts is imposed via a Gaussian copula. The model

uses a trick to avoid problems when working with discrete-valued data and copulas, by

using a continued extension argument, that is, by adding some noise to the counts to make

them continuous and working with the continuous versions. The latter adjustment may

create some noise around the model. A recent paper (Nikoloulopoulos, 2013a) discusses

the problems with such a method.

Bien et al. (2011) used copulas to create a multivariate time series model dened on Z

and not only on positive integers. They modeled a bivariate time series of bid and ask

quote changes sampled at a high frequency. The marginal models used were assumed to

follow a dynamic integer count hurdle (ICH) process, tied together with a copula, which

was constructed by properly taking into account the discreteness of the data.

Finally, Held et al. (2005) described a model with multivariate counts where at time t

counts of the other series at time t − 1 enter as covariates for the mean of each series. Also,

Brandt and Sandler (2012) used the model of Chib and Winkelmann (2001) and made it

dynamic by adding autoregressive terms in the mean of the mixing distribution.

421 Models for Multivariate Count Time Series

19.6 More Models

The earlier mentioned list of models does not limit other families of models to be derived.

Such an example is the creation of hidden markov models (HMMs) for multivariate count

time series. In the univariate case, Poisson HMMs have been used to model integer-valued

time series data. Extensions to the multivariate case are possible but multivariate discrete

distributions are needed to model the state distribution. In Orfanogiannaki and Karlis

(2013), multivariate Poisson distributions based on copulas have been considered for this

purpose.

Another approach to create multivariate time series models for counts could be through

the discretization of standard continuous models. Consider, for example, the vector autore-

gressive model based on a bivariate normal distribution. Discretizing the output can lead

to the desired time series. However, such a discretization is not unique and perhaps prob-

lems may occur while estimating the parameters, as multivariate integrals need to be

calculated.

Joe (1996) described an approach to create time series models based on additively closed

families. Working with bivariate distributions with this property, as, for example, the

bivariate Poisson distribution, one can derive such a time series model. Such models

share common elements with models based on thinning operations; see Joe (1996) for a

methodology to create an appropriate operator.

19.7 Discussion

In this chapter, we have pulled together the existing literature on multivariate integer-

valued time series modeling. Table 19.1 summarizes the models. An obstacle for such

models is the lack of, or at least the lack of familiarity with, multivariate discrete distri-

butions which are basic tools for their construction. Given greater availability of such basic

tools, we expect that more models will become available in the near future. Also, ideas for

tackling estimation problems like the composite likelihood approach can help a lot to this

direction.

Such models can have also some other interesting potential. For example, taking the

difference of the two time series from a bivariate model, we end up with a time series

TABLE 19.1

Models for multivariate count time series

Type of Model References

Models based on thinning Franke and Rao (1995); Latour (1997); Pedeli and Karlis (2011); Pedeli and Karlis

(2013a,c); Pedeli and Karlis (2013b); Karlis and Pedeli (2013); Boudreault and

Charpentier (2011); Quoreshi (2006, 2008); Ristic et al. (2012); Brännäs and

Nordström (2000); Bulla et al. (2011)

Observation-driven models Liu (2012); Heinen and Rengifo (2007); Bien et al. (2011); Held et al. (2005); Brandt

and Sandler (2012)

Parameter-driven models Jung et al. (2011); Jorgensen et al. (1999); Lee et al. (2005)

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.

Table of Contents for 19: Models for Multivariate Count Time Series (3/4)

Create new playlist

Sign In

Sign Up

Table of Contents for
19: Models for Multivariate Count Time Series (3/4)