17: Autologistic Regression Models for Spatio-Temporal Binary Data (3/4)

Search in book...
Toggle Font Controls
Create new playlist

Name your new playlist

Playlist description (optional)
Sign In

Email address

Password

Forgot Password?

or

Continue with Facebook

Continue with Google
Sign Up

Full Name

Email address

Confirm Email Address

Password

or

Continue with Facebook

Continue with Google









 





 



377 Autologistic Regression Models for Spatio-Temporal Binary Data

17.3.1 Model with Centered Parameterization

For site i = 1, ..., n and time t,letπ

i,t

denote the probability of Y

i,t

= 1 under spatio-temporal

independence. That is,

exp



k,i,t

i,t





1 + exp



k=0

k,i,t

Let y

∗

i,t

= y

i,t

− π

i,t

denote a centered response at site i and time t, centering around π

i,t

. For

pairwise-only dependence, Wang and Zheng (2013) dened a centered spatio-temporal

autologistic regression model via the following full conditional distributions:

p(y

i,t



: (i



, t



) = (i, t)) = p(y

i,t



: (i



, t



) ∈ N

i,t

)

∗ ∗ ∗

exp





k,i,t

i,t



j∈N

p+1

i,t

j,t

+ θ

p+2

i,t

i,t−1

+ y

i,t+



)

. (17.20)

1 + exp



k=0

k,i,t



j∈N

p+1

∗

j,t

+ θ

p+2

∗

i,t−1

+ y

∗

i,t+1

)

By the Hammersley–Clifford theorem and its corollary, the joint likelihood function of

, ..., Y

T−1

conditioned on Y

and Y

L(Y; θ) = p(y

, ..., y

T−1

, y

; θ)





 



T−1 n

n T



 

 

∗ −1 ∗ ∗ ∗ ∗ ∗

 

= c

(θ) exp



k,i,t

i,t

p+1

i,t

j,t

p+2

i,t

i,t−1



t=2

i=1 k=0 i=1 j∈N

t=2

i=1

(17.21)

where c

∗

(θ) is the normalizing constant. When the temporal autocorrelation coefcient

is zero (i.e., θ

p+2

= 0), the model reduces to a spatio-only autologistic regression model

(Caragea and Kaiser, 2009; Hughes et al., 2011).

Thus, the conditional expectation of Y

i,t

given its neighbors is

E(Y

i,t



= y



: (i



, t



) ∈ N

i,t

)

∗ ∗ ∗

exp



k,i,t



j∈N

p+1

j,t

+ θ

p+2

i,t−1

+ y

i,t+1

)

1 + exp



k,i,t



j∈N

p+1

∗

+ θ

p+2

∗

,t−1

+ y

∗

,t+1

)

which we denote as π

∗

i,t

. Suppose that the spatial autoregressive coefcient θ

p+1

and the

temporal autoregressive coefcient θ

p+2

are positive. Then, π

∗

i,t

> π

i,t

when

p+1

j,t

+ θ

p+2

i,t−1

+ y

i,t+1

)>θ

p+1

j,t

+ θ

p+2

(π

i,t−1

+ π

i,t+1

j∈N

where

j∈N

j,t

and π

i,t−1

+ π

i,t+1

are the expected numbers of nonzero spatial and tem-

poral neighbors under the independence model, respectively. Specically, if θ

p+2

= 0, then



 



378 Handbook of Discrete-Valued Time Series

∗

i,t

> π

i,t

only when the observed number of nonzero spatial neighbors is greater than the

expected number of nonzero spatial neighbors under independence. That is,

j,t



j∈N

j,t

.If θ

p+1

= 0, then π

∗

> π

i,t

only when the observed number of nonzero tempo-

ral neighbors is greater than the expected number of nonzero temporal neighbors under

independence. That is, y

i,t−1

+ y

i,t+1

> π

i,t−1

+ π

i,t+1

. Thus, the interpretation of θ

p+1

and

p+2

as local dependence parameters is more sensible. Further, the simulation study in

Wang (2013) showed that the marginal expectation of Y

i,t

under the centered parameter-

ization remains constant over moderate levels of spatial and temporal dependence (i.e.,

E(Y

i,t

k,i,t

, k = 1, ..., p) ≈ π

i,t

). The interpretation of regression coefcients as effects of

covariates is more sensible as well.

17.3.2 Statistical Inference

For the model with centered parameterization, its statistical inference has been devel-

oped based on expectation–maximization pseudo-likelihood, Monte Carlo expectation–

maximization likelihood, and Bayesian inference (Wang and Zheng, 2013).

17.3.2.1 Expectation–Maximization Pseudo-Likelihood Estimator

To obtain the maximum pseudo-likelihood estimates of the model parameters, the combi-

nation of an expectation–maximization (EM) algorithm and a Newton–Raphson algorithm,

called the expectation–maximization pseudo-likelihood estimator (EMPLE), is considered.

Specically, update π

i,t

, the expectation of Y

i,t

under the independent model, at the E step

and then at the M step, update

by maximizing



exp



k=0

k,i,t

i,t

j∈N

p+1

i,t

∗

(

l−1)

+ θ

p+2

i,t

∗

(

t−

l−

+ y

∗

(

l−

)

i,t

1 + exp





k,i,t



j∈N

p+1

∗

(

l−1)

+ θ

p+2

∗

(

t−

l−

+ y

∗

(

l−

)



where y

∗

(l)

is the centered response at the lth iteration. The M step can be carried out

by a Newton–Raphson algorithm using the standard logistic regression and the E and

M steps are repeated until convergence. A parametric bootstrap can be used to compute

the standard error of the EMPLE. For the starting value θ

at the start of the algorithm, dif-

ferent starting points can impact how long it takes to convergence. The maximum MPLE

from the uncentered autologistic regression model is a natural choice.

17.3.2.2 Monte Carlo Expectation–Maximization Likelihood Estimator

Let z

∗

= (



∗

...,



∗

 

∗



∗

. We consider a

i,t

0,i,t

i,t

p,i,t

i,t

i,t j∈N

i,t j,t

i,t

i,t i,t−1

)



rescaled version of the likelihood function

∗

(ψ)





exp(θ



∗

)



−1

∗

(ψ)L(Y; θ) =

∗

(θ)

exp(θ



∗

) =

∗

exp(θ



∗

exp(ψ



)

379 Autologistic Regression Models for Spatio-Temporal Binary Data

where ψ is a reference parameter and z

∗

is z

∗

with centers evaluated at ψ.MonteCarlo

expectation–maximization likelihood (MCEML) estimator can be used by combining an

EM algorithm and a Newton–Raphson algorithm. Specically, rst choose a reference

parameter vector ψ and generate M Monte Carlo samples of Y from the likelihood func-

tion evaluated at ψ. Then for the lth iteration, at the E step, we update π

(

−1)

and set

∗

(

l−1)

= y

i,t

− π

(

−1)

. At the M step, we maximize the rescaled version of the likelihood

function









−1

∗(m) ∗(m)

exp(θ



∗

l−1

) M

−1



exp θ



l−1

− ψ



m=1

∗

∗(l−1) ∗(m) ∗(m)

∗

where z

l−1

is z with centered responses y

i,t

and z

l−1

and z

are z evaluated

at the mth Monte Carlo sample of Y generated at the beginning of the algorithm with

centers computed at θ

(l−1)

and ψ, respectively. The M step can be carried out using a

Newton–Raphson algorithm. We compute the observed Fisher information matrix and

obtain the standard errors of the MCEMLE as a by-product of the MCEML estimation.

17.3.2.3 Bayesian Inference

We consider an MH algorithm to generate Monte Carlo samples of θ from the posterior

distribution p(θ|y) (Zheng and Zhu, 2008), where the likelihood ratio in α(θ

∗

|θ) in the

acceptance probability is approximated as

∗

p(y

, ..., y

T−1

, y

, θ

∗

)

exp(θ

∗



∗

)

∗

(θ)

∗

(ψ)

p(y

, ..., y

T−1

, y

, θ)

exp(θ



∗

)

∗

(θ

∗

)

∗

(ψ)

∗



∗(m)

− ψ



∗(m)

exp(θ

∗



∗

)

m=1

exp(θ



z z )

≈

∗



∗(m)

exp(θ



)

m=1

exp(θ

∗



∗

− ψ



)

where z

∗(m)

, z

∗(m)

,and z

∗(m)

are z

∗

evaluated at the mth Monte Carlo sample of Y with

∗

centers computed based on θ, θ

∗

,and ψ, respectively.

17.4 Data Example

For illustration, we consider the outbreak of southern pine beetle (SPB) in North Carolina.

The data consist of indicators of outbreak or not (0 = no outbreak; 1 = outbreak) in the

100 counties of North Carolina from 1960 to 1996. Figure 17.1 gives a time series of the

county-level outbreak maps. The average precipitation in the fall (in cm) will be the covari-

ate and is mapped in Figure 17.2. We use the data from 1960 to 1991 for model tting and

set aside the data from 1992 to 1996 for model validation. Two counties are considered to

be neighbors if the corresponding county seats are within 30 miles of each other.

380

1960 1968 1976 1984 1992

1961 1969 1977 1985 1993

1962

1963

1964

1965

1966

1967

1970

1971

1972

1973

1974

1975

1978

1979

1980

1981

1982

1983

1986

1987

1988

1989

1990

1991

1994

1995

1996

FIGURE 17.1

Maps of southern pine beetle outbreak from 1960 to 1996 in the counties of North Carolina. A county is lled black if there was an outbreak and is unlled otherwise.

Handbook of Discrete-Valued Time Series

381 Autologistic Regression Models for Spatio-Temporal Binary Data

2.5 − 3

3 − 3.5

3.5 − 4

4 − 4.5

4.5 − 5

5 − 5.5

5.5 − 6

6 − 6.5

6.5 − 7

FIGURE 17.2

Map of mean fall precipitation in the counties of North Carolina.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.

Table of Contents for 17: Autologistic Regression Models for Spatio-Temporal Binary Data (3/4)

Create new playlist

Sign In

Sign Up

Table of Contents for
17: Autologistic Regression Models for Spatio-Temporal Binary Data (3/4)