292 Handbook of Discrete-Valued Time Series
f
Y
(ω; β) = β
f
Y
(ω)β = β
f
Y
re
(ω)β, where f
Y
re
(ω) denotes the real part of f
Y
(ω).The
optimality criterion can thus be expressed as
λ(ω) = sup
β
f
Y
re
(ω)β
(13.4)
β
Vβ
β
where V is the variance–covariance matrix of Y
t
. The resulting scaling β(ω) is called the
optimal scaling.
The Y
t
process is a multivariate point process, and any particular component of Y
t
is
the individual point process for the corresponding state (e.g., the rst component of Y
t
indicates whether or not the process is in state c
1
at time t). For any xed t, Y
t
represents
a single observation from a simple multinomial sampling scheme. It readily follows that
V = D − pp
, where p = (p
1
, ..., p
k+1
)
,and D is the (k + 1) × (k + 1) diagonal matrix
D = diag{p
1
, ..., p
k+1
}. Since, by assumption, p
j
> 0for j = 1, 2, ..., k + 1, it follows that
rank(V) = k with the null space of V being spanned by 1
k+1
. For any (k + 1) × k full rank
matrix Q whose columns are linearly independent of 1
k+1
, Q
VQ is a k ×k positive denite
symmetric matrix.
With the matrix Q as previously dened, and for −1/2 < ω ≤ 1/2, dene λ(ω) to be the
largest eigenvalue of the determinantal equation
|Q
f
Y
re
(ω)Q − λQ
VQ|=0,
and let b(ω) ∈ R
k
be any corresponding eigenvector, that is,
Q
f
Y
re
(ω)Qb(ω) = λ(ω)Q
VQb(ω).
The eigenvalue λ(ω) ≥ 0 does not depend on the choice of Q. Although the eigenvector
b(ω) depends on the particular choice of Q, the equivalence class of scalings associated
with β(ω) = Qb(ω) does not depend on Q. A convenient choice of Q is Q =[I
k
| 0 ]
, where
I
k
is the k × k identity matrix and 0 is the k × 1 vector of zeros. For this choice, Q
f
Y
re
(ω)Q
and Q
VQ are the upper k ×k blocks of f
Y
re
(ω) and V, respectively. This choice corresponds
to setting the last component of β(ω) to zero.
The value λ(ω) itself has a useful interpretation; specically, λ(ω)dω represents the
largest proportion of the total power that can be attributed to the frequencies ωdω for any
particular scaled process X
t
(β), with the maximum being achieved by the scaling β(ω).
This result is demonstrated in Figure 13.3. Because of its central role, λ(ω) was dened to
be the spectral envelope of a stationary categorical time series.
The name spectral envelope is appropriate since λ(ω) envelopes the standardized spec-
trum of any scaled process. That is, given any β normalized so that X
t
(β) has total power
one, f (ω; β) ≤ λ(ω) with equality if and only if β is proportional to β(ω).
Although the law of the process X
t
(β) for any one-to-one scaling β completely deter-
mines the law of the categorical process X
t
, information is lost when one restricts attention
to the spectrum of X
t
(β). Less information is lost when one considers the spectrum of Y
t
.
Dealing directly with the spectral density f
Y
(ω) itself is somewhat cumbersome since it is a
function into the set of complex Hermitian matrices. Alternatively, one can view the spec-
tral envelope as an easily understood, parsimonious tool for exploring the periodic nature
of a categorical time series with a minimal loss of information.