342
Handbook of Discrete-Valued Time Series
60
50
60°N
40
30
45°N
165°W
90°W
20
150°W
135°W
120°W
105°W
10
0
(b)
40
35
30
60°N
25
20
45°N
15
165°W
90°W
150°W
105°W
10
135°W
120°W
5
(c)
FIGURE 15.2 (Continued)
Observed and (one-step-ahead forecasts) predicted counts for 2010 (using the median of the posterior predictive
distribution) and their median absolute deviation (MAD) for the Poisson model (Section 15.5.3) using 8 EOFs.
(b) Plot of
Y
ˆ
2010
, (c) MAD of in-sample prediction.
15.6 Conclusion
Modeling discrete-valued spatio-temporal data is often a challenging endeavor due to
model complexity and high dimensionality. That is, specifying realistic dependence struc-
ture for a real-world observed spatio-temporal process can be extremely difcult due to
inherent nonlinearities in time coupled with potential nonstationary behaviors in time
Hierarchical Dynamic Generalized Linear Mixed Models 343
1
1
1
1.5
1.0
2
2
0.5
2
3
3
3
1.0
0.5
0.0
4
4
4
5
5
5
−0.5
0.5
0.0
6
6
6
−1.0
7
7
7
0.0
−0.5
8
8
8
1 2 3
4 5 6 7 8
1 2 3
4 5 6 7 8
1 2 3
4 5 6 7 8
(a)
(b) (c)
1
1
1
0.5
1.0
2.0
2
2
2
0.0
3
0.5
3
3
1.5
−0.5
4
4
4
−1.0
0.0
5
5
1.0
5
−1.5
6
6
6
−0.5
0.5
7
−1.0
8
7
−2.0
7
8
8
1 2 3
4 5 6 7 8
1 2 3
4 5 6 7 8
1 2 3 4 5 6 7 8
(d) (e) (f)
1
1
1
1.0
0.5
2.5
2
2
2
0.0
3
2.0
3
3
0.5
−0.5
4
4
4
1.5
−1.0
5
0.0
5
5
−1.5
6
7
6
6
1.0
−0.5
7
−2.0
7
0.5
8
8
8
1 2 3 4 5 6 7 8
1 2 3
4 5 6 7 8
1 2 3 4 5 6 7 8
(g) (h) (i)
FIGURE 15.3
Image plots for element-wise posterior mean and 95% credible intervals of redistribution matrices M
i
, i = 1, 2, 3
for the Poisson model (Section 15.5.3) using 8 EOFs. Note that lower CI and upper CI denote the 2.5% and 97.5%
quantiles of the posterior distributions, respectively. (a) M
1
: Lower CI, (b) M
1
: Posterior mean, (c) M
1
: Upper CI,
(d) M
2
: Lower CI, (e) M
2
: Posterior mean, (f) M
2
: Upper CI, (g) M
3
: Lower CI, (h) M
3
: Posterior mean, and (i) M
3
:
Upper CI.
and/or space. Instead, utilizing a process-based approach and taking advantage of the
Bayesian hierarchical paradigm often prove remarkably advantageous.
Although, in principle, there are many approaches to modeling discrete-valued spatio-
temporal data, we presented a process-driven Bayesian hierarchical perspective. The key
to this paradigm is that the joint probability model for the data, process, and parameters
can be conditionally specied through linked model components. In particular, this for-
mulation facilitates the specication of complicated marginal dependence through a more
344 Handbook of Discrete-Valued Time Series
scientically motivated representation of the conditional mean, which can be modeled at
the process (or parameter) stage in the model hierarchy.
We have described BHMs for modeling discrete-valued spatio-temporal data. For
those less familiar with the hierarchical paradigm, we have provided a broad overview
(Section 15.2). To facilitate model development, we have outlined several discrete-valued
data model distributions, placing an emphasis on count-valued data model distribu-
tions (Section 15.3). Lastly, we presented an expansive framework for modeling dynamics
(Section 15.4). Combined, Sections 15.2 through 15.4 provide an extremely rich and exible
framework for modeling discrete-valued spatio-temporal data.
To illustrate the exibility and utility of this approach, we provided an application
to forecasting one-year-ahead migratory bird settling patterns across the north-central
United States and Canada. In this example, we showcased the ability to specify compli-
cated nonlinear dependence structures using a three-regime TVAR model based on the
PDSI, a climate-related covariate. In this context, efcient dimension reduction is facilitated
through EOFs. Further, the effectiveness of our model was evaluated through in-sample
and out-of-sample prediction. Finally, as a by-product of the scientically motivated model
specication, we considered the notion of site philopatry through the Markov transition
matrices.
There are several open areas of research for discrete-valued spatio-temporal modeling.
One natural direction is to extend the models described here to the multivariate context
(Fahrmeir et al., 1994). Doing so requires several modeling choices that would need to be
investigated for each given application. For example, for a given application, would speci-
cation of a multivariate data model be superior to a conditional specication, as presented
in Section 15.2? In our illustration, we specied the evolution of the latent process through
a TVAR model, which is nonlinear. Nevertheless, this only constitutes one specication.
Other specications to accommodate nonlinearity and/or nonstationarity are also open
areas of research.
Acknowledgments
This research was partially supported by the U.S. National Science Foundation (NSF) and
the U.S. Census Bureau under NSF grant SES-1132031, funded through the NSF-Census
Research Network (NCRN) program and through NSF grant DMS-1049093. The authors
would like to thank Guohui Wu for his assistance with various aspects concerning the data
analysis, including preparation of the gures.
References
Albert, J. H. and Chib, S. (1993). Bayesian analysis of binary and polychotomous response data.
Journal of the American Statistical Association, 88(422):669–679.
Arab, A., Holan, S. H., Wikle, C. K., and Wildhaber, M. L. (2012). Semiparametric bivariate
zero-inated Poisson models with application to studies of abundance for multiple species.
Environmetrics, 23(2):183–196.
Banerjee, S., Gelfand, A. E., and Carlin, B. P. (2003). Hierarchical Modeling and Analysis for Spatial Data.
CRC Press.
345 Hierarchical Dynamic Generalized Linear Mixed Models
Berliner, L. M. (1996). Hierarchical Bayesian time-series models. Fundamental Theories of Physics,
79:15–22.
Besag, J. (1974). Spatial interaction and the statistical analysis of lattice systems. Journal of the Royal
Statistical Society. Series B (Methodological), 36:192–236.
Brooks, S. and Gelman, A. (1998). General methods for monitoring convergence of iterative simula-
tions. Journal of Computational and Graphical Statistics, 7:434–455.
Carlin, B. P. and Louis, T. A. (2011). Bayesian Methods for Data Analysis. Boca Raton, FL: CRC Press.
Carlin, B. P., Polson, N. G., and Stoffer, D. S. (1992). A Monte Carlo approach to nonnormal and
nonlinear state-space modeling. Journal of the American Statistical Association, 87(418):493–500.
Congdon, P. (2007). Bayesian Statistical Modelling, vol. 704. Hoboken, NJ: John Wiley & Sons.
Cressie, N., Calder, C. A., Clark, J. S., Hoef, J. M. V., and Wikle, C. K. (2009). Accounting for uncer-
tainty in ecological analysis: The strengths and limitations of hierarchical statistical modeling.
Ecological Applications, 19(3):553–570.
Cressie, N. and Wikle, C. K. (2011). Statistics for Spatio-Temporal Data. Hoboken, NJ: John Wiley and
Sons.
Demidenko, E. (2013). Mixed Models: Theory and Applications with R. Hoboken, NJ: John Wiley and
Sons.
Fahrmeir, L. (1992). Posterior mode estimation by extended Kalman ltering for multivari-
ate dynamic generalized linear models. Journalofthe AmericanStatisticalAssociation,
87(418):501–509.
Fahrmeir, L. and Kaufmann, H. (1991). On Kalman ltering, posterior mode estimation and sher
scoring in dynamic exponential family regression. Metrika, 38(1):37–60.
Fahrmeir, L., Tutz, G., and Hennevogl, W. (1994). Multivariate Statistical Modelling Based on Generalized
Linear Models.NewYork:SpringerNewYork.
Gamerman, D. (1998). Markov chain Monte Carlo for dynamic generalised linear models. Biometrika,
85(1):215–227.
Gamerman, D., Abanto-Valle, C. A., Silva, R. S., and Martins, T. G. (2015). Dynamic Bayesian models
for discrete-valued time series. In R. A. Davis, S. H. Holan, R. Lund and N. Ravishanker, eds.,
Handbook of Discrete-Valued Time Series, pp. 165–186. Chapman & Hall, Boca Raton, FL.
Gamerman, D., Salazar, E. and Reis, E. A. (2007). Dynamic Gaussian process priors with applica-
tions to the analysis of space-time data. In Bayesian Statistics, number 8 (eds Bernardo, J. M.,
Bayarri M., Berger J., Dawid A., Heckerman D., Smith A. and West M.). New York: Oxford
University Press, pp. 149–74.
Gelfand, A. E., Banerjee, S., and Gamerman, D. (2005). Spatial process modelling for univariate and
multivariate dynamic spatial data. Environmetrics, 16(5):465–479.
Gelman, A., Carlin, J. B., Stern, H. S., Dunson, D. B., Vehtari, A., and Rubin, D. B. (2013). Bayesian
Data Analysis, 3rd edn. CRC press.
George, E. I. and McCulloch, R. E. (1993). Variable selection via Gibbs sampling. Journal of the American
Statistical Association, 88(423):881–889.
George, E. I. and McCulloch, R. E. (1997). Approaches for Bayesian variable selection. Statistica Sinica,
7(2):339–373.
Gladish, D. W. and Wikle, C. K. (2014). Physically motivated scale interaction parameteriza-
tion in reduced rank quadratic nonlinear dynamic spatio-temporal models. Environmetrics,
25(4):230–244.
Greene, W. (2008). Functional forms for the negative binomial model for count data. Economics Letters,
99(3):585–590.
Hooten, M. B. and Wikle, C. K. (2008). A hierarchical Bayesian non-linear spatio-temporal model for
the spread of invasive species with application to the Eurasian collared-dove. Environmental and
Ecological Statistics, 15(1):59–70.
Hooten, M. B. and Wikle, C. K. (2010). Statistical agent-based models for discrete spatio-temporal
systems. Journal of the American Statistical Association, 105(489):236–248.
Kitagawa, G. (1987). Non-Gaussian state–space modeling of nonstationary time series. Journal of the
American statistical association, 82(400):1032–1041.
346 Handbook of Discrete-Valued Time Series
Kottas, A., Duan, J. A., and Gelfand, A. E. (2008). Modeling disease incidence data with spatial and
spatio temporal dirichlet process mixtures. Biometrical Journal, 50(1):29–42.
Lopes, H. F., Gamerman, D., and Salazar, E. (2011). Generalized spatial dynamic factor models.
Computational Statistics and Data Analysis, 55(3):1319–1330.
McCulloch, C. E., Searle, S. R., and Neuhaus, J. M. (2001). Generalized Linear, and Mixed Models.
New York: John Wiley and Sons.
Minka, T., Shmueli, G., Kadane, J., Borle, S., and Boatwright, P. (2003). Computing with the COM-
Poisson distribution. Pittsburgh, PA: Department of Statistics, Carnegie Mellon University.
Ridout, M. S. and Besbeas, P. (2004). An empirical model for underdispersed count data. Statistical
Modelling, 4(1):77–89.
Royle, J. A. and Berliner, L. M. (1999). A hierarchical approach to multivariate spatial modeling and
prediction. Journal of Agricultural, Biological, and Environmental Statistics, 4(1):29–56.
Royle, J. A. and Dorazio, R. M. (2008). Hierarchical Modeling and Inference in Ecology: The Analysis of
Data from Populations, Metapopulations and Communities. Burlington, MA: Academic Press.
Schrödle, B. and Held, L. (2011). Spatio-temporal disease mapping using INLA. Environmetrics,
22(6):725–734.
Shmueli, G., Minka, T., Kadane, J., Borle, S., and Boatwright, P. (2005). A useful distribution for tting
discrete data: Revival of the Conway-Maxwell Poisson distribution. Journal of the Royal Statistical
Society: Series C (Applied Statistics), 54(1):127–142.
Verbeke, G. and Molenberghs, G. (2009). Linear Mixed Models for Longitudinal Data.NewYork:
Springer.
Waller, L., Carlin, B., Xia, H., and Gelfand, A. (1997). Hierarchical spatio-temporal mapping of disease
rates. Journal of the American Statistical Association, 92:607–617.
Wang, J. and Holan, S. H. (2012). Bayesian multi-regime smooth transition regression with ordered
categorical variables. Computational Statistics and Data Analysis, 56(2):4165–4179.
Wang, J. C., Holan, S. H., Nandram, B., Barboza, W., Toto, C., and Anderson, E. (2012). A
Bayesian approach to estimating agricultural yield based on multiple repeated surveys. Journal
of Agricultural, Biological, and Environmental Statistics, 17(1):84–106.
West, M., Harrison, P. J., and Migon, H. S. (1985). Dynamic generalized linear models and Bayesian
forecasting. Journal of the American Statistical Association, 80(389):73–83.
Wikle, C. K. (2003a). Hierarchical Bayesian models for predicting the spread of ecological processes.
Ecology, 84(6):1382–1394.
Wikle, C. K. (2010). Low rank representations as models for spatial processes, Chapter 8. In Handbook
of Spatial Statistics. (eds Gelfand A. E., Fuentes M., Guttorp P. and Diggle P.). Boca Raton, FL:
Chapman and Hall/CRC, pp. 107–18.
Wikle, C. K. and Anderson, C. (2003). Climatological analysis of tornado report counts using a
hierarchical Bayesian spatio-temporal model. Journal of Geophysical Research, 108(D24):9005.
Wikle, C. K. and Berliner, L. M. (2005). Combining information across spatial scales. Technometrics,
47:80–91.
Wikle, C. K. and Cressie, N. (1999). A dimension-reduced approach to space-time Kalman ltering.
Biometrika, 86(4):815.
Wikle, C. K. and Holan, S. H. (2011). Polynomial nonlinear spatio-temporal integro-difference
equation models. Journal of Time Series Analysis, 32(4):339–350.
Wikle, C. K. and Hooten, M. B. (2010). A general science-based framework for dynamical spatio-
temporal models. Test, 19(3):417–451.
Wikle, C. K. and Hooten, M. B. (2015). Hierarchical agent-based spatio-temporal dynamic models for
discrete-valued data. In R. A. Davis, S. H. Holan, R. Lund and N. Ravishanker, eds., Handbook
of Discrete-Valued Time Series, pp. 349–366. Chapman & Hall, Boca Raton, FL.
Wikle, C. K. Milliff, R., Nychka, D., and Berliner, L. M. (2001). Spatio-temporal hierarchical Bayesian
modeling: Tropical ocean surface winds. Journal of the American Statistical Association, 96(454):
382–397.
Wikle, C. K. (2002). A kernel-based spectral model for non-Gaussian spatio-temporal processes.
Statistical Modelling, 2(4):299–314.
..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset