135 State Space Models for Count Time Series
Recently, Koopman et al. (2015) have presented an approach called numerically accel-
erated importance sampling (NAIS) for the nonlinear non-Gaussian state space models.
The NAIS method combines fast numerical integration techniques with the Kalman l-
ter smoothing methods proposed by Shephard and Pitt (1997) and Durbin and Koopman
(1997). They demonstrate signicant computational speed improvements over standard
EIS implementations as well as improved accuracy. Additionally, by using new con-
trol variables substantial improvement in the variability of likelihood estimates can be
achieved. A key component of the NAIS method is construction of the importance sam-
pling density by numerical integration using Gauss–Hermite quadrature in contrast to
using simulated trajectories.
6.2.6 Composite Likelihood
As already noted, the lack of a tractably computable form for the likelihood is one of the
drawbacks in using SSMs for count time series. Estimation procedures described in previ-
ous sections essentially resort to simulation-based procedures for computing the likelihood
and then maximizing it. With any simulation procedure, one cannot be certain that the sim-
ulated likelihood provides a good facsimile of the actual likelihood or even the likelihood
in proximity of the maximum likelihood estimator. An alternative estimation procedure,
which has grown recently in popularity, is based on the composite likelihood. The idea is
that perhaps one does not need to compute the entire likelihood but only the likelihoods
of more manageable subcollections of the data, which are then combined. For example,
the likelihood given in (6.5) requires an n-fold integration, which is generally impractical
even for moderate sample sizes. However, a similar integral based on only 2 or 3 dimen-
sions can be computed numerically rather accurately. The objective then is to replace a large
dimensional integral with many small and manageable integrations.
The special issue of Statistical Sinica (Volume 21 (2011)) provides an excellent survey on
the use of composite likelihood methods in statistics. In the development below, we will
follow the treatment of Davis and Yau (2011) with emphasis on Example 3.4. Ng et al. (2011),
also in the special issue, consider composite likelihood for related time series models.
While we will focus this treatment on the pairwise likelihood for count time series, exten-
sions to combining weighted likelihoods based on more general subsets of the data are
relatively straightforward. The downside in our application is that numerically, it may not
be practical to compute density functions of more than just a few observations.
Suppose Y
1
, ..., Y
n
are observations from a time series for which we denote
p(y
i
1
, y
i
2
, ..., y
i
k
; θ) as the likelihood for a parameter θ based on the k-tuples of distinct
observations y
i
1
, y
i
2
, ..., y
i
k
. For ease of notation, we have suppressed the dependence
of p(·; θ) on k and the vector i
1
, i
2
, ..., i
k
. For k xed, the composite likelihood based on
consecutive k-tuples is given by
n−k
CPL
k
(θ; Y
(n)
) = log p(Y
t
, Y
t+1
, ..., Y
t+k−1
; θ). (6.29)
t=1
Normally, one does not take k very large due to the difculty in computing the
k-dimensional joint densities. The composite likelihood estimator
θ
ˆ
is found by maxi-
mizing CPL
k
. Under identiability, regularity, and suitable mixing conditions,
θ
ˆ
is con-
sistent and asymptotically normal. This was established for linear time series models in