382 Handbook of Discrete-Valued Time Series
TABLE 17.1
Comparison of Model Parameter Estimation for the Spatio-Temporal Autologistic Model with
Uncentered Parameterization Using Maximum Pseudo-likelihood (MPL), Monte Carlo Maximum
Likelihood (MCML), and Bayesian Inference
MPL MCML Bayesian
Parameters Estimate SE Estimate SE Mean SD
Intercept θ
0
Slope θ
1
Spatial θ
2
Temporal θ
3
5.16
0.25
1.45
1.71
0.66
0.18
0.14
0.24
2.71
0.14
0.91
1.02
0.20
0.05
0.05
0.11
2.71
0.14
0.91
1.03
0.20
0.05
0.06
0.13
TABLE 17.2
Comparison of Model Parameter Estimation for the Spatio-Temporal Autologistic Model with
Centered Parameterization Using Expectation–Maximization Pseudo-likelihood (EMPL), Monte
Carlo Expectation–Maximization Likelihood (MCEML), and Bayesian Inference
EMPL MCEML Bayesian
Parameters Estimate SE Estimate SE Mean SD
Intercept θ
0
Slope θ
1
Spatial θ
2
Temporal θ
2
4.96
0.21
1.47
1.75
0.32
0.08
0.13
0.18
2.40
0.13
0.95
0.89
0.16
0.05
0.05
0.07
2.86
0.13
0.95
0.89
0.29
0.08
0.06
0.10
Tables 17.1 and 17.2 give the model parameter estimates and standard errors from tting
the spatio-temporal autologistic regression models with uncentered and centered parame-
terization, respectively (see Wang and Zheng, 2013; Zheng and Zhu, 2008). The parameter
estimates and the corresponding standard errors for both models using all of the three
inference approaches, MPLE, MCMLE, and Bayesian inference, are quite close. One pos-
sible reason for this is that for this data set, the inuence of the center is small relative to
the strength of spatio-temporal dependence. The average of the centers π
i,t
evaluated at
the MCEMLE is only 0.05, and thus, the spatio-temporal autoregressive terms dominate
the outbreak probabilities.
For comparison among various statistical inference approaches, the results suggest that
the inference for the model parameters using the posterior distribution matches well with
MCML, but the inference from pseudo-likelihood is different from both Bayesian inference
and MCML for both uncentered and centered parameterization models. In addition, esti-
mation based on pseudo-likelihood results in higher variance than the other approaches.
In terms of computing time, Bayesian inference is more time consuming compared with
the other two approaches. Further, we predict the SPB outbreak from 1992 to 2001 in
North Carolina (Table 17.3). The responses at the end time point (here, y
i,2002
) are gen-
1991
erated from independent Bernoulli trials with probability of outbreak
t=1960
y
i,t
/31 for
i = 1, ..., 100. The prediction performances based on models with uncentered and cen-
tered parameterization are comparable. Overall, our recommendation is to use a model
with uncentered parameterization if prediction is of primary interest, since the two
383 Autologistic Regression Models for Spatio-Temporal Binary Data
TABLE 17.3
Comparison of the Prediction Performance between Models with Centered and Uncentered
Parameterization
Centered Uncentered
Year EMPL MCEML Bayesian MPL MCML Bayesian
1992 0.65 0.18 0.14 0.66 0.09 0.09
1993 0.72 0.19 0.12 0.65 0.13 0.13
1994 0.70 0.20 0.14 0.74 0.06 0.08
1995 0.63 0.23 0.13 0.68 0.13 0.14
1996 0.62 0.24 0.09 0.61 0.17 0.16
Note: For centered parameterization, the prediction is based on statistical inference obtained using expectation–
maximization pseudo-likelihood (EMPL), Monte Carlo expectation–maximization likelihood (MCEML),
and Bayesian inference. For uncentered parameterization, the prediction is based on statistical infer-
ence obtained using maximum pseudo-likelihood (MPL), Monte Carlo maximum likelihood (MCML), and
Bayesian inference. Reported are the prediction error rates for each year in 1992–1996.
parameterizations provide comparable performance in prediction but the centered param-
eterization is computationally more intensive. If the focus is on the interpretation of the
regression coefcients, however, the centered parameterization is recommended.
17.5 Discussion
In this chapter, we have reviewed spatio-temporal autologistic regression models for
spatio-temporal binary data. Alternatively, a generalized linear mixed model (GLMM)
framework can be adopted for modeling such spatial data (Diggle and Ribeiro, 2007;
Holan and Wikle [2015; Chapter 15 in this volume]). The response variable is modeled by a
distribution in the exponential family and is related to covariates and spatial random effects
in a link function. Thus, GLMM is exible, as it is suitable for both Gaussian responses
and non-Gaussian responses such as binomial and Poisson random variables. Statistical
inference can be carried out using Bayesian hierarchical modeling, which is exible as more
complex structures can be readily placed on the model parameters. With suitable reduction
of dimensionality for the spatio-temporal random effects, computation is generally feasible.
In particular, faster computational algorithms are emerging such as integrated nested
Laplace approximations (INLA) (Rue et al., 2009). Although likelihood-based approaches
aresuitable, it is sometimes a challenge to attain a full specication of the likelihood function,
due to a lack of sufcient information and complex interactions among responses. In this
case, an estimating equation approach may be attractive. For spatial binary data, Lin et al.
(2008) developed a central limit theorem for a random eld under various L
p
metrics and
derived the consistency and asymptotic normality of quasi-likelihood estimators. Lin (2010)
further developed a generalized estimating equation (GEE) method for spatio-temporal
binary data, but only a single binary covariate was considered and the spatio-temporal
dependence is limited to be separable. Moreover, variable selection methods for identifying
the suitable set of covariates are of interest. For example, Fu et al. (2013) developed adaptive
Lasso for the selection of covariates in an autologistic regression model and extension to
384 Handbook of Discrete-Valued Time Series
spatio-temporal autologistic regression model is discussed. Further research on this and
other related topics will be worthwhile.
References
Bartlett, M. S. (1971). Physical nearest-neighbour models and non-linear time-series. Journal of Applied
Probability, 8(2):222–232.
Bartlett, M. S. (1972). Physical nearest-neighbour models and non-linear time-series. II Further
discussion of approximate solutions and exact equations. Journal of Applied Probability, 9:76–86.
Berthelsen, K. K. and Møller, J. (2003). Likelihood and nonparametric Bayesian MCMC inference for
spatial point processes based on perfect simulation and path sampling. Scandinavian Journal of
Statistics, 30:549–564.
Besag, J. (1972). Nearest-neighbour systems and the auto-logistic model for binary data. Journal of the
Royal Statistical Society Series B, 34:75–83.
Besag, J. (1974). Spatial interaction and the statistical analysis of lattice systems. Journal of the Royal
Statistical Society Series B, 36:192–225.
Besag, J. (1975). Statistical analysis of non-lattice data. The Statistician, 24:179–195.
Caragea, P. C. and Kaiser, M. S. (2009). Autologistic models with interpretable parameters. Journal of
Agricultural, Biological, and Environmental Statistics, 14:281–300.
Cressie, N. (1993). Statistics for Spatial Data, Revised Edition. Wiley, New York.
Diggle, P. J. and Ribeiro, P. J. (2007). Model-Based Geostatistics. Springer, New York.
Friel, N., Pettitt, A. N., Reeves, R., and Wit, E. (2009). Bayesian inference in hidden Markov random
elds for binary data dened on large lattices. Journal of Computational and Graphical Statistics,
18(2):243–261.
Fu, R., Thurman, A., Steen-Adams, M., and Zhu, J. (2013). On estimation and selection of autolo-
gistic regression models via penalized pseudolikelihood. Journal of Agricultural, Biological, and
Environmental Statistics, 18:429–449.
Gelman, A., Carlin, J. B., Stern, H., and Rubin, D. (2003). Bayesian Data Analysis. Chapman & Hall,
Boca Raton, FL.
Geyer, C. J. (1994). On the convergence of Monte Carlo maximum likelihood calculations. Journal of
the Royal Society of Statistics Series B, 56:261–274.
Geyer, C. J. and Thompson, E. A. (1992). Constrained Monte Carlo maximum likelihood for
dependent data (with discussion). JournaloftheRoyalStatisticalSocietySeriesB, 54:657–699.
Gu, M. G. and Zhu, H. T. (2001). Maximum likelihood estimation for spatial models by Markov chain
Monte Carlo stochastic approximation. Journal of the Royal Statistical Society Series B, 63:339–355.
Gumpertz, M. L., Graham, J. M., and Ristaino, J. B. (1997). Autologistic models of spatial pattern of
phytophthora epidemic in bell pepper: Effects of soil variables on disease presence. Journal of
Agricultural, Biological, and Environmental Statistics, 2:131–156.
Guyon, X. (1995). Random Fields on a Network: Modeling, Statistics, and Applications. Springer,
New York.
Holan, S. H. and Wikle, C. K. (2015). Hierarchical dynamic generalized linear mixed models for
discrete-valued spatio-temporal data. In Davis, R., Holan, S., Lund, R., and Ravishanker, N.,
eds., Handbook of Discrete-Valued Time Series, pp. 327–348. Chapman & Hall, Boca Raton, FL.
Huang, F. and Ogata, Y. (2002). Generalized pseudo-likelihood estimates for Markov random elds
on lattice. Annals of Institutes of Statistical Mathematics, 54:1–18.
Huffer, F. W. and Wu, H. (1998). Markov chain Monte Carlo for autologistic regression models with
application to the distribution of plant speicies. Biometrics, 54:509–524.
Hughes, J. P., Haran, M., and Caragea, P. C. (2011). Autologistic models for binary data on a lattice.
Environmetrics, 22:857–871.
385 Autologistic Regression Models for Spatio-Temporal Binary Data
Ising, E. (1924). Beitrag zur theorie des ferro- und paramagnetismus. PhD thesis, Mathematish-
Naturewissenschaftliche Fakultät der Universität Hamburg.
Ising, E. (1925). Beitrag zur theorie des ferromagnetismus. Zeitschrift für Physik A Hadrons and Nuclei,
31:253–258.
Kaiser, M. S. and Caregea, P. C. (2009). Exploring dependence with data on spatial lattice. Biometrics,
65:857–865.
Kaiser, M. S., Caragea, P. C., and Furukawa, K. (2012). Centered parameterizations and depen-
dence limitations in Markov random eld models. Journal of Statistical Planning and Inference,
142:1855–1863.
Kaiser, M. S. and Cressie, N. (1997). Modeling Poisson variables with positive spatial dependence.
Statistics and Probability Letters, 35:423–432.
Lin, P.-S. (2010). Estimating equations for separable spatial-temporal binary data. Environmental and
Ecological Statistics, 17:543–557.
Lin, P.-S., Lee, H.-Y., and Clayton, M. (2008). Estimating equations for spatially correlated data in
multi-dimensional space. Biometrika, 95:847–858.
Møller, J. (1999). Perfect simulation of conditionally specied models. Journal of the Royal Statistical
Society, Series B, 61:251–264.
Møller, J., Pettitt, A. N., Reeves, R. W., and Berthelsen, K. K. (2006). An efcient Markov chain Monte
Carlo method for distributions with intractable normalising constants. Biometrika, 93:451–458.
Pickard, D. K. (1976). Asymptotic inference for an Ising lattice. Journal of Applied Probability,
13:486–497.
Pickard, D. K. (1977). Asymptotic inference for an Ising lattice II. Advances in Applied Probability,
9:476–501.
Propp, J. G. and Wilson, D. B. (1996). Exact sampling with coupled Markov chains and applications
to statistical mechanics. Random Structures and Algorithms, 9:223–252.
Rue, H., Martino, S., and Chopin, N. (2009). Exact sampling with coupled Markov chains and
applications to statistical mechanics. JournaloftheRoyalStatisicalSocietySeriesB, 71:319–392.
Wang, Z. (2013). Analysis of binary data via spatial-temporal autologistic regression models. PhD
thesis, Department of Statistics, University of Kentucky, Lexington, KY.
Wang, Z. and Zheng, Y. (2013). Analysis of binary data via a centered spatial-temporal autologistic
regression model. Environmental and Ecological Statistics, 20:37–57.
Wu, H. and Huffer, F. W. (1997). Modeling the distribution of plant species using the autologistic
regression model. Environmental and Ecological Statistics, 4:31–48.
Zheng, Y. and Zhu, J. (2008). Markov chain Monte Carlo for a spatial-temporal autologistic regression
model. Journal of Computational and Graphical Statistics, 17:123–137.
Zhu, J., Huang, H.-C., and Wu, J.-P. (2005). Modeling spatial-temporal binary data using Markov
random elds. Journal of Agricultural, Biological, and Environmental Statistics, 10:212–225.
Zhu, J., Zheng, Y., Carroll, A., and Aukema, B. H. (2008). Autologistic regression analysis of spatial-
temporal binary data via Monte Carlo maximum likelihood. Journal of Agricultural, Biological,
and Environmental Statistics, 13:84–98.
..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset