3: Generalized Linear Autoregressive Moving Average Models (3/6)

Search in book...
Toggle Font Controls
Create new playlist

Name your new playlist

Playlist description (optional)
Sign In

Email address

Password

Forgot Password?

or

Continue with Facebook

Continue with Google
Sign Up

Full Name

Email address

Confirm Email Address

Password

or

Continue with Facebook

Continue with Google

61 Generalized Linear Autoregressive Moving Average Models

these two individuals as reported earlier. The GLARMA model autoregressive coefcients

also substantially differed (φ

= 0.687 ± 0.079 and φ

= 0.894 ± 0.024). This also holds

when all 32 individual responses are modeled in this way.

We have also applied GLARMA modeling with the glarma package on binary and

binomial time series to study responsiveness to musical features in the complete panel

of 32 listeners. A clear conclusion from this extended analysis is that there is a strong

need to consider modeling of individual responses in order to address questions such

as: “Are there differences between musical expertise groups?”. Also clear is the need to

accommodate differing amounts of serial dependence for each series in the ensemble,

something that even current longitudinal data analysis for mixed models does not read-

ily allow. In the next section we explain two approaches to allowing for variation between

individual time series.

3.4 GLARMA Models for Multiple Independent Time Series

3.4.1 Examples

The musicology example of Section 3.3 is an example where ensembles of individual inde-

pendent responses need to be modeled together. In a public health policy setting, Bernat

et al. (2004) assessed, using a pooled cross-sectional random effects analysis, the impact of

lowering the legal allowable blood alcohol concentration (BAC) in motor vehicle drivers

from 0.10 to 0.08 on monthly counts of single vehicle night time fatalities in 17 U.S. states.

In that study, serial dependence was detected in some series but could not be modeled with

software and methods available at that time. The purpose of the remainder of this chapter

is to present methods that overcome these gaps.

Two approaches are developed here. The rst is referred to as the xed effects plus

GLARMA specication. Here, regression effects and serial dependence parameters may be

constrained across series to test various hypotheses of interest primarily about regression

effects, but also, about variation in serial dependence between series. For long longitu-

dinal data each series can be estimated and modeled individually using the GLARMA

models previoulsy discussed and combined likelihoods for constrained parameterizations

constructed on which to base inference. Here, the length of the individual time series allows

considerable exibility in serial dependence structures between individual series. In tradi-

tional longitudinal data analysis where the number of repeated measures is low, it may not

be possible to allow for individuality of this type.

The second approach is based on a random effect specication of the regression compo-

nent while allowing individual series to have different serial dependence structures and

strengths. This is not possible for traditional longitudinal data and the methods proposed

here is a substantial extension of existing methodology, which is enabled by the length of

the individual series.

Before dening models that combine xed or random effects with GLARMA serial

dependence, we mention some other recent examples along these lines. Xu et al. (2007)

present a parameter-driven specication for serial dependence with random effects on

covariates. Their approach is limited by the requirement that only autoregressive serial

dependence is covered and all series must share the same structure and parameter val-

ues. Additionally, the method has not been demonstrated, neither in simulation nor in









 

 

62 Handbook of Discrete-Valued Time Series

application, on series longer than 20 time points. Zhang et al. (2012) specify an exponential

decay autocorrelation on a latent process and use marginal estimation methods for their

combined serial dependence and random effects specication. They also assume that the

structure and strength must be the same for all series. These approaches, which force the

same dependence structure on all series, may be a legacy from the modeling of traditional

longitudinal data in which the individual series are very short and for which the ability to

detect different serial dependence properties between series is limited. It is our view that

for long longitudinal data, this restriction is articial for two reasons. First, in all exam-

ples we have encountered, there is strong evidence that the serial dependence structure

and strength varies between series; some series will have strong serial dependence, while

others will have none at all. One explanation for this is that the covariates used for the

ensemble for xed and random effect terms may not be inclusive of covariates that are

unavailable and impact individual series—in such circumstances serial dependence may

be stronger simply because it acts as a proxy for unobserved covariates. The second reason

relates to the fact that the use of random effects on regression variables should logically

extend to their use for serial dependence parameters. The models and methods presented

here provide considerably more exibility than those proposed by Xu et al. (2007) and

Zhang et al. (2012). However, being based on GLARMA models, which require regular

time spacing of observed outcomes, they cannot handle irregular spacings. For this irreg-

ularly spaces observations, parameter-driven models for the individual series would be

more appropriate.

It is our view that, more frequently, increasingly long longitudinal data will become

available and methods such as those proposed here will be become needed. For exam-

ple, studies with panels of subjects equipped with automatic data loggers measuring their

physical condition and activity levels are now quite feasible.

3.4.2 General Model for Multiple Independent GLARMA Models with Random Effects

Let Y

be the observation at time t = 1, ..., n

on the jth series of counts, where j = 1, ..., J

and let x

j,t

be the covariates for the jth series. We also let r

denote d random effect covariates

that apply to all series through coefcients represented as vectors of normally distributed

random effects U

∼ i.i.d N(0, (λ)), where  is a d × d covariance matrix determined by

parameters λ.

In addition to assuming that the series

are independent across the J cases, for each j,

we also assume that, given the state process W

j,t

, Y

are independent with exponential

family distribution (3.1), where

j,t

= x

j,t

(j)

+ r

+ Z

(3.15)

is the linear state process for the jth case series. Typically, some of the covariates appear in

both x

j,t

and r

in which case corresponding components of the β

(j)

will not depend on j

and some or all of the covariates can be the same across all series.

Serial dependence is modeled with the Z

assumed to satisfy (3.6) with degrees

(j)

, q

(j)

= φ

(j)

j,t−l

+ e

j,t−l

(j)

j,t−l

. (3.16)

l=1 l=1





 





63 Generalized Linear Autoregressive Moving Average Models

Let τ

(j)

, ..., φ

(

)

, θ

(j)

, ..., θ

(

)

. We distinguish three special cases for the general

state equation (3.15).

1. Generalized linear mixed model: In this specication, the serial dependence term Z

absent, giving the standard generalized linear mixed model for longitudinal data

discussed in Diggle et al. (2002) and Fitzmaurice et al. (2012)

j,t

= x

(j)

+ r

. (3.17)

The only source of within series correlation is in the random effects component.

Various packages (such as SAS and R) are available for model tting.

2. Fixed effects multiple GLARMA model: Random effects are not included but there is

a serial dependence in the form of an observation-driven model:

j,t

= x

(j)

+ Z

, (3.18)

where Z

is given by (3.16).

3. Random effects multiple GLARMA model: Both random effects and serial depen-

dence are present. This is the general specication for W

given by (3.15), with

specied by the GLARMA model (3.16).

Model Class 1 was used in Bernat et al. (2004). A previous example of Model Class 2 type

of modeling was considered in Dunsmuir et al. (2004), who investigated the commonal-

ity of regression impacts on three series of daily asthma presentation counts. Extension to

datasets with substantially more time series and more complex hypotheses on the xed

effects parameters required considerable development of the previous software. The ran-

dom effects multiple GLARMA model is developed in detail in Dunsmuir et al. (2014).

The BAC dataset will be used to illustrate tting of both models (3.18) and (3.15).

3.5 Fixed Effects Multiple GLARMA Model

3.5.1 Maximum Likelihood Estimation

In this section, we develop maximum likelihood estimation for the model with individual

series state equations specied as in (3.18). Let the log-likelihood for the jth series be as

in (3.7) and denoted by l

(j)

where θ

(j)T

, τ

(j)T

. The log-likelihood across all

series is

l(θ) =



(θ

(j)

) (3.19)

j=1

where β



(1)T

, ..., β

(J)T



and τ



(1)T

, ..., τ

(J)T



.Let θ =



, τ



denote all

parameters.





 





 

64 Handbook of Discrete-Valued Time Series

A primary focus is on testing if the parameterization across series can be simplied

so that regression coefcients or serial dependence parameters can be constrained to be

the same. We consider only linear constraints of the form θ = Aψ where A has fewer

columns than rows and ψ denotes the lower-dimensional vector of parameters in the con-

strained model. Typically ψ

, ψ

since we will generally not be interested in

relating the regression coefcients to the serial dependence parameters. In that case A will

be block diagonal appropriately partitioned. Denote the log-likelihood with respect to the

constrained parameters as l(ψ) = l(Aθ).

Maximization of (3.19) with respect to the constrained parameters can be over a high-

dimensional parameter space. Initial estimates of

are obtained by maximizing (3.19)

without constraints which is the same as maximizing all individual likelihoods separately

and combining the resulting l

(j)

. These unconstrained estimates are used to ini-

tialize the constrained parameters via

(0)

= (A

−1

. Next, using the appropriate

components of

(0)

= A

(0)

, each component log-likelihood l

(

)

and its derivatives

are calculated using the standard GLARMA software. Finally, these are combined to get

the overall l

(0)

. Derivatives with respect to ψ can be obtained using the identities

∂l(ψ)/∂ψ = A

∂l(Aψ)/∂θ and ∂

l(ψ)/∂ψ∂ψ

= A

∂

l(Aψ)/∂θ∂θ

A. This procedure

is then iterated to convergence using the Newton–Raphson or Fisher scoring algorithm.

Fisher scoring was found to be more stable in the initial stages. Once the derivative

∂l(ψ)/∂ψ settles down, the iterative search for the optimum can be switched to the

Newton–Raphson updates, which typically gives speedier convergence.

Similar to the single series case, the asymptotic properties of the MLEs

have not been

established for the general model previously described. Asymptotic results for longitudi-

nal data typically let the number of cases J tend to innity and the lengths of individual

series n

are typically held xed. For this scenario, asymptotic theory is typically straight-

forward since it relies on large numbers of independent trajectories. For our applications,

J is often bounded and we perceive of all n

as tending to innity in which case asymp-

totic results rely on those for individual time series which, as previously noted, are rather

underdeveloped. We assume, however, that asymptotic results hold and perform infer-

ence in the usual way. For example, the matrices of second derivatives

), computed in

the course of the Newton–Raphson or Fisher scoring maximization procedures, are used to

estimate standard errors for individual parameters. Also, to test the null hypothesis of com-

mon regression effects, we use the likelihood ratio statistic G

=−2

) − l(

)

, where

is the estimate obtained under the null hypothesis and

the estimate obtained under

the alternative. Degrees of freedom, for the chi-squared approximate reference distribu-

tion for G

, are calculated in the usual way. These ideas were rst illustrated in Dunsmuir

et al. (2004) for three series of daily asthma counts which are assessed for common seasonal

patterns, day of the week, weather, and pollution effects.

3.5.2 Application of Multiple Fixed Effects GLARMA to Road Deaths

Bernat et al. (2004) assessed the impact of lowering the legal allowable BAC in motor vehicle

drivers from 0.10 to 0.08 on monthly counts of single vehicle night time fatalities in 17 U.S.

states for which at least 12 months of post intervention data were available. The study

design selected 72 consecutive months of data with 36 months prior to the decrease in

65 Generalized Linear Autoregressive Moving Average Models

allowable BAC and up to 36 months after for each of the 17 states used in the analysis.

The mixed effects model presented in Bernat et al. (2004) assumed that the observed counts

of single vehicle night time deaths, Y

j,t

in month t for state j, had, conditional on the random

effects U

, a Poisson distribution with log mean given by

= β

+ β

(t) + β

2,j

(t) + β

3,j,t

+ β

4,j,t

+ O

j,t

+ r

, (3.20)

where I

(t) is the indicator variable for the change in BAC from 0.1 to 0.08 coded as 0 for

t ≤ 36 and 1 for t > 36 for all states in the study, and I

2,j

(t) the indicator variable taking the

value 0 for t < T

and 1 for t ≥ T

, where T

is the month in which an administrative license

revocation law was enacted. This potential confounder was enacted in seven of the states

during the period of data used there. The other two regression variables are x

3,j,t

, the num-

ber of Friday and Saturday nights, and a control series x

4,j,t

, the log of other motor vehicle

deaths (adjusted for population and seasonal factors), in month t for state j. An offset term,

j,t

, was used to adjust for unique state population trends and seasonal factors. Finally,

∼ N(0, G) is the multivariate normal distribution with covariance matrix G for the ran-

dom effects on the selected covariates in r

. Further details concerning the rationale and

denition of the model components mentioned earlier can be found in Bernat et al. (2004).

Because of the differences between the time periods over which the data were collected

from each of the 17 states and because of the control series used for each state, the assump-

tion that the 17 series are independent was considered reasonable. We also conrmed

this assumption using cross-correlation analysis of Pearson residuals from individual

GLARMA model ts to the 17 series. Bernat et al. (2004) discussed the likely impact that any

serial dependence might have on their key conclusion that there is a statistically signicant

lowering of overall average single vehicle night time fatalities associated with the lower-

ing of the legal BAC level but suitable software was not available at that time to assess this

statistically.

We now illustrate the xed effects plus GLARMA model dened in (3.18) on these data.

The rst step was to use the glarma package to t the model (3.18) to each of the indi-

vidual series using regression and offset terms as specied in (3.20). Examination of the

PIT residual plots for each series suggested overtting relative to the Poisson distribution.

We identied that the main contributor to overtting was the seasonal offset term used in

(3.20); this had been determined using standard seasonal adjustment methods available in

the PROC X-11 package in SAS for each series separately and based on only n = 72 months

of data. As an alternative, we used parametric harmonic seasonal terms to all 17 response

series with the other regressors and the population offset only. In almost all cases, these

seasonal harmonics were not signicant suggesting that use of the control series (without

seasonal adjustment) and lag 12 autoregressive terms in the GLARMA model are sufcient

for modeling seasonality. In view of this, we performed our reanalysis of these data using

the regression specication in (3.20), but dropping the seasonal adjustment in both the off-

set O

j,t

and logOMVD (so that the control series is now log other motor vehicle deaths per

100,000 population without seasonal adjustment), together with a GLARMA process for

serial dependence. We compared various combinations of lags for the autoregressive and

moving average terms in the GLARMA model to allow for both serial dependence at low

orders and at seasonal lags. Based on AIC the overall best specication was an autoregres-

sion of order p = 12 with zero coefcients at lags 1 through 11. Hence we initially adopted

this seasonal model for all series.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.

Table of Contents for 3: Generalized Linear Autoregressive Moving Average Models (3/6)

Create new playlist

Sign In

Sign Up

Table of Contents for
3: Generalized Linear Autoregressive Moving Average Models (3/6)