250 Handbook of Discrete-Valued Time Series
where r_t = γα_{t-1} and p_t = γβ_{t-1}/(γβ_{t-1} + 1). The one-step-ahead forecast is obtained as

\[
E(Y_t \mid D_{t-1}, γ) = \frac{α_{t-1}}{β_{t-1}}.
\]
An interesting property of the model is the long-run behavior of its one-step-ahead forecasts. As t gets large, using β_t = γβ_{t-1} + 1, we can show that β_t approaches 1/(1 - γ) and we obtain

\[
E(Y_t \mid D_{t-1}, γ) = (1 - γ)Y_{t-1} + (1 - γ)γY_{t-2} + \cdots + (1 - γ)γ^{t-1}α_0,
\]

which is an exponentially weighted average of the observed counts.
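The recursion and its long-run behavior can be checked numerically. The sketch below assumes the filtering updates α_t = γα_{t-1} + Y_t and β_t = γβ_{t-1} + 1 from the conjugate analysis; the simulated data and hyperparameter values are purely hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)
gamma, alpha, beta = 0.8, 1.0, 1.0   # discount factor and Gamma(alpha_0, beta_0) prior
y = rng.poisson(5.0, size=200)       # illustrative count series

for yt in y:
    alpha = gamma * alpha + yt       # alpha_t = gamma * alpha_{t-1} + Y_t
    beta = gamma * beta + 1.0        # beta_t  = gamma * beta_{t-1} + 1

# One-step-ahead forecast E(Y_{t+1} | D_t, gamma) = alpha_t / beta_t.
# For large t, beta_t -> 1/(1 - gamma), so the forecast approaches the
# exponentially weighted average (1 - gamma) * sum_k gamma^k * Y_{t-k}.
weights = (1 - gamma) * gamma ** np.arange(len(y))
ewma = np.sum(weights * y[::-1])
print(alpha / beta, ewma)
```

For a discount factor of 0.8, the weight on an observation 10 steps back is already below 0.03, which makes the exponential-smoothing interpretation concrete.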
Although an analytic expression is not available for the k-step-ahead predictive density, the k-step-ahead predictive means can be easily obtained. Using a standard conditional expectation argument, one can obtain E(Y_{t+k} | D_t, γ) as

\[
E(Y_{t+k} \mid D_t, γ) = E_{λ_{t+k} \mid D_t, γ}\left[ E(Y_{t+k} \mid λ_{t+k}, D_t) \right] = E(λ_{t+k} \mid D_t, γ). \tag{11.19}
\]
Furthermore, using the state equation (11.4), we have

\[
E(λ_{t+k} \mid D_t, γ) = E(λ_t \mid D_t, γ) \prod_{n=t+1}^{t+k} \frac{E(ε_n \mid D_t)}{γ} = E(λ_t \mid D_t, γ) = \frac{α_t}{β_t}, \tag{11.20}
\]

where E(ε_n | D_t) = γ for any n. Therefore, combining (11.19) and (11.20), we obtain the k-step-ahead forecasts given data up to time t as

\[
E(Y_{t+k} \mid D_t, γ) = E(λ_{t+k} \mid D_t, γ) = \frac{α_t}{β_t}. \tag{11.21}
\]
Note that in the case of the model with covariates, it can be shown that

\[
E(Y_{t+k} \mid D_t, z_{t+k}, ψ, γ) = E(λ_{t+k} \mid D_t, z_{t+k}, ψ, γ) = \frac{α_t}{β_t}\, e^{ψ' z_{t+k}}. \tag{11.22}
\]
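In code, (11.21) and (11.22) amount to a single ratio, optionally scaled by the covariate term. All values below are hypothetical, chosen only to show the computation.

```python
import numpy as np

# Hypothetical filtered quantities at time t and future covariates
alpha_t, beta_t = 12.5, 4.0
psi = np.array([0.3, -0.1])       # regression coefficients (assumed known)
z_future = np.array([1.0, 2.0])   # covariate vector z_{t+k}

# (11.21): without covariates, the forecast is flat in the horizon k
forecast = alpha_t / beta_t

# (11.22): with covariates, the same ratio is scaled by exp(psi' z_{t+k})
forecast_cov = forecast * np.exp(psi @ z_future)
print(forecast, forecast_cov)
```

Note that without covariates the forecast does not depend on k at all: the martingale-like property of the state equation keeps the predictive mean at the current filtered level α_t/β_t.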
If we treat the discount factor γ as a random variable, we lose the analytical tractability of the model described earlier. However, selecting a prior distribution for γ can be handled fairly easily. Given D_t, the likelihood function of γ is given by

\[
L(γ; D_t) = \prod_{i=1}^{t} p(Y_i \mid D_{i-1}, γ), \tag{11.23}
\]
where p(Y_i | D_{i-1}, γ) is negative binomial as in (11.18). The posterior distribution of γ can then be obtained as

\[
p(γ \mid D_t) \propto \prod_{i=1}^{t} p(Y_i \mid D_{i-1}, γ)\, p(γ). \tag{11.24}
\]

251 Bayesian Modeling of Time Series of Counts with Business Applications
For some priors p(γ) in (11.24), the posterior distribution will not be available in closed form. However, we can always sample from the posterior using an MCMC method such as the Metropolis–Hastings algorithm. Alternatively, a discrete uniform prior for γ can be a reasonable choice.
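A minimal sketch of the discrete-prior route: place a uniform grid prior on γ, accumulate the one-step negative binomial predictive log-likelihoods as in (11.23), and normalize as in (11.24). The filtering updates for α_t and β_t and the simulated data are assumptions for illustration only.

```python
import numpy as np
from scipy.stats import nbinom

rng = np.random.default_rng(1)
y = rng.poisson(5.0, size=100)            # illustrative counts
grid = np.linspace(0.05, 0.95, 19)        # discrete uniform prior on gamma
log_like = np.zeros_like(grid)

for g, gamma in enumerate(grid):
    alpha, beta = 1.0, 1.0
    for yt in y:
        # One-step predictive as in (11.18): negative binomial with
        # r_t = gamma*alpha_{t-1}, p_t = gamma*beta_{t-1}/(gamma*beta_{t-1} + 1)
        r = gamma * alpha
        p = gamma * beta / (gamma * beta + 1.0)
        log_like[g] += nbinom.logpmf(yt, r, p)
        alpha = gamma * alpha + yt
        beta = gamma * beta + 1.0

post = np.exp(log_like - log_like.max())  # flat prior cancels in (11.24)
post /= post.sum()
print(grid[np.argmax(post)])
```

Because the grid is finite, the posterior is exact up to the grid resolution and no MCMC is needed for γ alone.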
11.3 Markov Chain Monte Carlo (MCMC) Estimation of the Model
Since the conditional distributions introduced previously all depend on the parameter vectors ψ and γ, we need to discuss how to obtain the joint posterior density of ψ and γ. This density cannot be obtained in closed form; therefore, we use MCMC methods to generate the required samples. Our objective in this section is to obtain the joint posterior distribution of the model parameters given observed counts up to time t, that is, p(θ_1, ..., θ_t, ψ, γ | D_t). We use a Gibbs sampler to generate samples from the full conditionals p(θ_1, ..., θ_t | ψ, γ, D_t) and p(ψ, γ | θ_1, ..., θ_t, D_t), neither of which is available as a standard density.
For notational convenience, we define ω = {ψ, γ}. The conditional posterior distribution of ω given the latent rates (θ_1, ..., θ_t) is

\[
p(ω \mid θ_1, \ldots, θ_t, D_t) \propto \prod_{i=1}^{t} \exp\left(-θ_i e^{ψ' z_i}\right) \frac{\left(θ_i e^{ψ' z_i}\right)^{Y_i}}{Y_i!}\; p(ω), \tag{11.25}
\]
where p(ω) is the joint prior for ψ and γ. Regardless of the prior selection for ω, (11.25) will not be a standard density. We use an MCMC algorithm such as the Metropolis–Hastings to generate samples from p(ω | θ_1, ..., θ_t, D_t). In our numerical examples, we assume flat but proper priors for ψ and γ with ψ_i ~ Normal(0, 1000) for all i and γ ~ Uniform(0, 1). Following Chib and Greenberg (1995), the steps in the Metropolis–Hastings algorithm can be summarized as follows:
1. Assume the starting point ω^{(0)} at j = 0.
   Repeat for j > 0:
2. Generate ω* from q(ω* | ω^{(j)}) and u from U(0, 1).
3. If u ≤ f(ω^{(j)}, ω*), then set ω^{(j+1)} = ω*; else set ω^{(j+1)} = ω^{(j)}, and set j = j + 1,
where

\[
f(ω^{(j)}, ω^*) = \min\left\{1,\; \frac{π(ω^*)\, q(ω^{(j)} \mid ω^*)}{π(ω^{(j)})\, q(ω^* \mid ω^{(j)})}\right\}. \tag{11.26}
\]
In (11.26), q(·|·) is the multivariate normal proposal density and π(·) is given by (11.25), which is the density we need to generate samples from. If we repeat this a large number of times, we can obtain samples from p(ω | θ_1, ..., θ_t, D_t).
Generation of samples from the full conditional distribution p(θ_1, ..., θ_t | D_t, ω) using the FFBS algorithm, as described in Fruhwirth-Schnatter (1994), requires the smoothing distribution of the θ_t's, which enables retrospective analysis. In other words, given that we have observed the count data D_t at time t, we will be interested in the distribution of (θ_{t-k} | D_t, ω) for all k ≥ 1.
We can write

\[
p(θ_{t-k} \mid D_t, ω) = \int p(θ_{t-k} \mid θ_{t-k+1}, D_t, ω)\, p(θ_{t-k+1} \mid D_t, ω)\, dθ_{t-k+1}, \tag{11.27}
\]
where p(θ_{t-k} | θ_{t-k+1}, D_t, ω) is obtained via Bayes' rule as

\[
p(θ_{t-k} \mid θ_{t-k+1}, D_t, ω) = \frac{p(θ_{t-k} \mid θ_{t-k+1}, D_{t-k}, ω)\, p(Y^{(t,k)} \mid θ_{t-k}, θ_{t-k+1}, D_{t-k}, ω)}{p(Y^{(t,k)} \mid θ_{t-k+1}, D_{t-k}, ω)} = p(θ_{t-k} \mid θ_{t-k+1}, D_{t-k}, ω),
\]

where Y^{(t,k)} = {Y_{t-k+1}, ..., Y_t}. Given θ_{t-k+1}, Y^{(t,k)} is independent of θ_{t-k}; in other words, p(Y^{(t,k)} | θ_{t-k}, θ_{t-k+1}, D_{t-k}, ω) = p(Y^{(t,k)} | θ_{t-k+1}, D_{t-k}, ω). Thus, (11.27) reduces to
\[
p(θ_{t-k} \mid D_t, ω) = \int p(θ_{t-k} \mid θ_{t-k+1}, D_{t-k}, ω)\, p(θ_{t-k+1} \mid D_t, ω)\, dθ_{t-k+1}. \tag{11.28}
\]
Although we cannot obtain (11.28) analytically, we can use Monte Carlo methods to draw samples from p(θ_{t-k} | D_t, ω). Due to the Markovian nature of the state parameters, we can rewrite p(θ_1, ..., θ_t | D_t, ω) as
\[
p(θ_t \mid D_t, ω)\, p(θ_{t-1} \mid θ_t, D_{t-1}, ω) \cdots p(θ_1 \mid θ_2, D_1, ω). \tag{11.29}
\]
We note that p(θ_t | D_t, ω) is available from (11.9), and p(θ_{t-1} | θ_t, D_{t-1}, ω) for any t as

\[
p(θ_{t-1} \mid θ_t, D_{t-1}, ω) \propto p(θ_t \mid θ_{t-1}, D_{t-1}, ω)\, p(θ_{t-1} \mid D_{t-1}, ω), \tag{11.30}
\]
where the rst term is available from (11.4) and the second term from (11.6). It is straight-
forward to show that
θ
t1
|θ
t
, D
t1
, ω
ShGamma[(1 γ)α
t1
, β
t1
; (γθ
t
, )],
which is a shifted gamma density dened over γθ
t
< θ
t1
< .
Therefore, given (11.29) and the posterior samples generated from the full conditional of ω, we can obtain a sample from p(θ_1, ..., θ_t | ω, z_t, D_t) by sequentially simulating the individual latent rates as follows:

1. Assume the starting points θ_1^{(0)}, ..., θ_t^{(0)} at j = 0.
Repeat for j > 0,
2. Using the generated ω^{(j)}, sample θ_t^{(j)} from (θ_t | ω^{(j)}, D_t).
3. Using the generated ω^{(j)}, for each n = t - 1, ..., 1, generate θ_n^{(j)} from (θ_n | θ_{n+1}^{(j)}, ω^{(j)}, D_n), where θ_{n+1}^{(j)} is the value generated in the previous step.
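These steps can be sketched for a covariate-free version of the model with ω held fixed. The forward recursions α_n = γα_{n-1} + Y_n and β_n = γβ_{n-1} + 1 and the shifted gamma backward draw follow the filtering and smoothing results above; the data and hyperparameters are hypothetical.

```python
import numpy as np

def backward_sample(y, gamma, alpha0=1.0, beta0=1.0, seed=0):
    """One backward pass for (theta_1, ..., theta_t) given fixed omega."""
    rng = np.random.default_rng(seed)
    t = len(y)
    alpha = np.empty(t + 1); beta = np.empty(t + 1)
    alpha[0], beta[0] = alpha0, beta0
    for n in range(1, t + 1):                 # forward filtering
        alpha[n] = gamma * alpha[n - 1] + y[n - 1]
        beta[n] = gamma * beta[n - 1] + 1.0
    theta = np.empty(t)
    theta[t - 1] = rng.gamma(alpha[t], 1.0 / beta[t])   # theta_t | D_t, as in (11.9)
    for n in range(t - 1, 0, -1):             # backward sampling, n = t-1, ..., 1
        # ShGamma[(1-gamma)*alpha_n, beta_n; (gamma*theta_{n+1}, inf)]:
        # a gamma increment added to the shift gamma*theta_{n+1}
        theta[n - 1] = gamma * theta[n] + rng.gamma((1 - gamma) * alpha[n], 1.0 / beta[n])
    return theta

y = np.random.default_rng(2).poisson(5.0, size=50)
theta_draw = backward_sample(y, gamma=0.8)
print(theta_draw[:3])
```

Each call produces one joint draw of the latent rate path; inside the Gibbs sampler this pass would alternate with an update of ω.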
If we repeat this a large number of times, we can obtain samples from the full conditional of the latent rates. Consequently, we can obtain samples from the joint density of the model parameters by iteratively sampling from the full conditionals, p(ω | θ_1, ..., θ_t, D_t) and p(θ_1, ..., θ_t | ω, D_t), via the Gibbs sampler. Once we have the posterior samples from p(θ_1, ..., θ_t, ω | D_t), we can also obtain the posterior samples of the λ_t's in a straightforward manner using the identity λ_t = θ_t e^{ψ' z_t}.
11.4 Multivariate Extension
It is possible to consider several extensions of the basic model to analyze multivariate count time series. For instance, the observations of interest can be the number of occurrences of an event during day t of year j. Another possibility is to consider the analysis of J different Poisson time series. For instance, for a given year, the weekly spending habits of J different households, which can exhibit dependence, can be modeled using such a structure. Several extensions have been proposed by Aktekin and Soyer (2011), where multiplicative Poisson rates for (11.3) are considered. An alternative approach for modeling multivariate time series of counts is described by Ravishanker, Venkatesan, and Hu (2015; Chapter 20 in this volume).
In what follows, we present a model for J Poisson time series that are assumed to be affected by the same environment. We assume that

\[
Y_{jt} \sim \mathrm{Pois}(λ_{jt}), \quad \text{for } j = 1, \ldots, J, \tag{11.31}
\]
where λ_{jt} = λ_j θ_t, λ_j is the arrival rate specific to the jth series, and θ_t is the common term modulating λ_j. For example, in the case where Y_{jt} is the number of grocery store trips of household j at time t, λ_j is the household-specific rate and we can think of θ_t as the effect of a common economic environment that the households are exposed to at time t. Values of θ_t > 1 represent a more favorable economic environment than usual, implying higher shopping rates.
This is analogous to the concept of an accelerated environment for operating conditions of components used by Lindley and Singpurwalla (1986) in life testing. Our case can be considered as a dynamic version of their setup, since we have the Markovian evolution of the θ_t's as

\[
θ_t = \frac{θ_{t-1}}{γ}\, ε_t, \tag{11.32}
\]

where, as earlier, ε_t | D_{t-1}, λ_1, ..., λ_J ~ Beta[γα_{t-1}, (1 - γ)α_{t-1}] with α_{t-1} > 0, 0 < γ < 1, and D_{t-1} = {D_{t-2}, Y_{1(t-1)}, ..., Y_{J(t-1)}}. Furthermore, we assume that
\[
λ_j \sim \mathrm{Gamma}(a_j, b_j), \quad \text{for } j = 1, \ldots, J, \tag{11.33}
\]
and, a priori, the λ_j's are independent of each other as well as of θ_0. Given the θ_t's and λ_j's, the Y_{jt}'s are conditionally independent. In other words, all J series are affected by the same common environment and, conditional on the environment, they will be independent.
At time 0, we assume that θ_0 | D_0 ~ Gamma(α_0, β_0), and by induction we can show that

\[
θ_{t-1} \mid D_{t-1}, λ_1, \ldots, λ_J \sim \mathrm{Gamma}(α_{t-1}, β_{t-1}), \tag{11.34}
\]

and

\[
θ_t \mid D_{t-1}, λ_1, \ldots, λ_J \sim \mathrm{Gamma}(γα_{t-1}, γβ_{t-1}). \tag{11.35}
\]
In addition, the filtering density at time t can be obtained as

\[
θ_t \mid D_t, λ_1, \ldots, λ_J \sim \mathrm{Gamma}(α_t, β_t), \tag{11.36}
\]

where α_t = γα_{t-1} + Y_{1t} + ... + Y_{Jt} and β_t = γβ_{t-1} + λ_1 + ... + λ_J. Consequently, the marginal distribution of Y_{jt} for any j can be obtained as
\[
p(Y_{jt} \mid λ_j, D_{t-1}) = \binom{γα_{t-1} + Y_{jt} - 1}{Y_{jt}} \left(\frac{λ_j}{γβ_{t-1} + λ_j}\right)^{Y_{jt}} \left(1 - \frac{λ_j}{γβ_{t-1} + λ_j}\right)^{γα_{t-1}}, \tag{11.37}
\]
which is a negative binomial model as earlier. The multivariate distribution of (Y_{1t}, ..., Y_{Jt}) can be obtained as
\[
p(Y_{1t}, \ldots, Y_{Jt} \mid λ_1, \ldots, λ_J, D_{t-1}) = \frac{Γ\!\left(γα_{t-1} + \sum_j Y_{jt}\right) \prod_j λ_j^{Y_{jt}}}{Γ(γα_{t-1}) \prod_j Γ(Y_{jt} + 1) \left(γβ_{t-1} + \sum_j λ_j\right)^{\sum_j Y_{jt}}} \times \left(\frac{γβ_{t-1}}{γβ_{t-1} + \sum_j λ_j}\right)^{γα_{t-1}}, \tag{11.38}
\]
which is a dynamic multivariate distribution of negative binomial type. The bivariate distribution p(Y_{it}, Y_{jt} | λ_i, λ_j, D_{t-1}) can be obtained as
\[
p(Y_{it}, Y_{jt} \mid λ_i, λ_j, D_{t-1}) = \frac{Γ(γα_{t-1} + Y_{it} + Y_{jt})}{Γ(γα_{t-1})\, Γ(Y_{it} + 1)\, Γ(Y_{jt} + 1)} \left(\frac{γβ_{t-1}}{λ_i + λ_j + γβ_{t-1}}\right)^{γα_{t-1}} \left(\frac{λ_i}{λ_i + λ_j + γβ_{t-1}}\right)^{Y_{it}} \times \left(\frac{λ_j}{λ_i + λ_j + γβ_{t-1}}\right)^{Y_{jt}}, \tag{11.39}
\]

which is a bivariate negative binomial distribution for integer values of γα_{t-1}. This distribution is the dynamic version of the negative binomial distribution proposed by Arbous and Kerrich (1951) for modeling accident numbers.
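A quick way to see the dependence induced by (11.39) is through its mixture representation: a common Gamma(γα_{t-1}, γβ_{t-1}) environment mixed with conditionally independent Poisson counts. All parameter values below are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(4)
gamma_, alpha, beta = 0.8, 10.0, 5.0
lam_i, lam_j = 3.0, 5.0

# Common environment, then conditionally independent Poisson counts
theta = rng.gamma(gamma_ * alpha, 1.0 / (gamma_ * beta), size=20000)
y_i = rng.poisson(lam_i * theta)
y_j = rng.poisson(lam_j * theta)

# The shared environment makes the two counts positively correlated.
corr = np.corrcoef(y_i, y_j)[0, 1]
print(corr)
```

The correlation is driven entirely by the variance of θ: as γβ_{t-1} grows and the environment becomes better known, the counts approach independence.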