12.2.1 Solution to the Finite-Horizon Discrete-Time Mixed H∞2/H∞∞ Nonlinear Filtering Problem

Search in book...
Toggle Font Controls
Create new playlist

Name your new playlist

Playlist description (optional)
Sign In

Email address

Password

Forgot Password?

or

Continue with Facebook

Continue with Google
Sign Up

Full Name

Email address

Confirm Email Address

Password

or

Continue with Facebook

Continue with Google

We similarly consider the following class of estimators:

$\sum^{d a f} : {\begin{cases} {\hat{x}}_{k + 1} = f (x_{k}) + L ({\hat{x}}_{k}, k) [y_{k} - h_{2} (x_{k})], \hat{x} (k_{0}) = {\hat{x}}^{0} \\ \hat{z} = h_{1} ({\hat{x}}_{k}) \end{cases}$

(12.46)

where ${\hat{x}}_{k} \in X$ is the estimated state, $L (., .) \in ℳ^{n \times m} (X \times Z)$ is the error-gain matrix which is smooth and has to be determined, and $\hat{z} \in ℜ^{s}$ is the estimated output of the filter. We can now define the estimation error or penalty variable, $\overset{⌣}{z}$ , which has to be controlled as:

${\overset{⌣}{z}}_{k} : = z_{k} - \hat{z} = h_{1} ({\hat{x}}_{k}) - h_{1} ({\hat{x}}_{k}) .$

Then, we combine the plant (12.42) and estimator (12.46) dynamics to obtain the following augmented system:

$\begin{array}{l} {\overset{⌣}{x}}_{k + 1} = \overset{⌣}{f} ({\overset{⌣}{x}}_{k}) + \overset{⌣}{g} ({\overset{⌣}{x}}_{k}) w_{k,} \overset{⌣}{x} (k_{0}) = {(x^{0^{T}} {\hat{x}}^{0^{T}})}^{T} \\ {\overset{⌣}{z}}_{k} = h_{1} ({\overset{⌣}{x}}_{k}) \end{array}},$

(12.47)

where

$\begin{array}{l} {\overset{⌣}{x}}_{k} = (\begin{array}{l} x_{k} \\ x_{k} \end{array}), \overset{⌣}{f} (\overset{⌣}{x}) = (\begin{array}{l} f (x_{k}) \\ f ({\hat{x}}_{k}) + L ({\hat{x}}_{k}, k) (h_{2} (x_{k}) - h_{2} ({\hat{x}}_{k})) \end{array}), \\ \overset{⌣}{g} (\overset{⌣}{x}) = (\begin{array}{l} g_{1} (x_{k}) \\ L ({\hat{x}}_{k,} k) k_{21} (x_{k}) \end{array}), \overset{⌣}{h} ({\overset{⌣}{x}}_{k}) = h_{1} (x_{k}) - h_{1} ({\hat{x}}_{k}) . \end{array}$

The problem is then similarly formulated as a two-player nonzero-sum differential game with the following cost functionals:

$J_{1} (L, w) = \frac{1}{2} \sum_{k = k_{0}}^{K} {γ^{2} {‖ w_{k} ‖}^{2} - {‖ {\overset{⌣}{z}}_{k} ‖}^{2}},$

(12.48)

$J_{2} (L, w) = \frac{1}{2} \sum_{k_{0}}^{K} {‖ {\overset{⌣}{z}}_{k} ‖}^{2},$

(12.49)

where w := {w_k}. The first functional is associated with the $H_{\infty}$ -constraint criterion, while the second functional is related to the output energy of the system or $H_{2}$ -criterion. It is seen that, by making J₁ ≥ 0, the $H_{\infty}$ constraint ${‖ ℱ_{k} o \sum^{d a} ‖}_{ℋ_{\infty}} \leq γ$ is satisfied. Then, similarly, a Nash-equilibrium solution to the above game is said to exist if we can find a pair (L^⋆ , w^⋆) such that

$J_{1} (L^{⋆}, w^{⋆}) \leq J_{1} (L^{⋆}, w) \forall w \in W,$

(12.50)

$J_{2} (L^{⋆}, w^{⋆}) \leq J_{2} (L_{k}, w^{⋆}) \forall w \in ℳ^{n \times m} .$

(12.51)

Sufficient conditions for the solvability of the above game are well known (Chapter 3, also [59]), and are given in the following theorem.

Theorem 12.2.1 For the two-person discrete-time nonzero-sum game (12.48)-(12.49), (12.47), under memoryless perfect information structure, there exists a feedback Nash-equilibrium solution if, and only if, there exist 2(K − k₀) functions $Y, V : N \subset X \times Z \to ℜ, N \subset X$ such that the following coupled recursive equations (discrete-time Hamilton-Jacobi-Isaacs equations (DHJIE)) are satisfied:

$\begin{array}{l} Y (\overset{⌣}{x}, k) = \inf_{w \in W} {\frac{1}{2} [γ^{2} ‖ w_{k} ‖ - {‖ {\overset{⌣}{z}}_{k} (\overset{⌣}{x}) ‖}^{2}] + Y ({\overset{⌣}{x}}_{k + 1,} k + 1)}, \\ Y (\overset{⌣}{x}, K + 1) = 0, k = k_{0}, …, K, \forall \overset{⌣}{x} \in N \times N \end{array}$

(12.52)

$\begin{array}{l} V (\overset{⌣}{x}, k) = \min_{L \in ℳ^{n \times m}} {\frac{1}{2} {‖ {\overset{⌣}{z}}_{k} (\overset{⌣}{x}) ‖}^{2} + V ({\overset{⌣}{x}}_{k + 1,} k + 1)}, \\ V (\overset{⌣}{x}, K + 1) = 0, k = k_{0, …,} k, \forall \overset{⌣}{x} \in N \times N \end{array}$

(12.53)

where $\overset{⌣}{x} = {\overset{⌣}{x}}_{k}, L = L (x_{k}, k), w : = {w_{k}} .$

Thus, we can apply the above theorem to derive sufficient conditions for the solvability of the D M H 2 H I N L F P. To do that, we define the Hamiltonian functions $H_{i} : (X \times X) \times W \times ℳ^{n \times m} \times ℜ \to ℜ, i = 1, 2$ associated with the cost functionals (12.48), (12.49) respectively:

$H_{1} (\overset{⌣}{x}, w_{k} L, Y) = Y (\overset{⌣}{f} (\overset{⌣}{x}) + \overset{⌣}{g} (\overset{⌣}{x}) w_{k}, K + 1) - Y (\overset{⌣}{x}, k) + \frac{1}{2} γ^{2} {‖ w_{k} ‖}^{2} - \frac{1}{2} {‖ {\overset{⌣}{z}}_{k} ‖}^{2},$

(12.54)

$H_{2} (\overset{⌣}{x}, w_{k} L, V) = V (\overset{⌣}{f} (\overset{⌣}{x}) + \overset{⌣}{g} (\overset{⌣}{x}) w_{k}, K + 1) - V (\overset{⌣}{x}, k) + \frac{1}{2} {‖ {\overset{⌣}{z}}_{k} ‖}^{2}$

(12.55)

for some smooth functions $Y, V : X \to ℜ, Y < 0, V > 0$ where the adjoint variables corresponding to the cost functionals (12.48), (12.49) are set as p₁ = Y, p₂ = V respectively.

The following theorem then presents sufficient conditions for the solvability of the D M H 2 H I N L F P on a finite-horizon.

Theorem 12.2.2 Consider the nonlinear system (12.42) and the D MH 2 H I N L F P for it. Suppose the function h₁ is one-to-one (or injective) and the plant Σ^da is locally asymptotically-stable about the equilibrium-point x = 0. Further, suppose there exists a pair of C² negative and positive-definite functions $Y, V : N \times N \times Z \to ℜ$ respectively, locally defined in a neighborhood $N \times N \times X \times X$ of the origin $\overset{⌣}{x} = 0,$ and a matrix function $L : N \times Z \to ℳ^{n \times m}$ satisfying the following pair of coupled HJIEs:

$Y (\overset{⌣}{x}, k) = Y ({\overset{⌣}{f}}^{⋆} (\overset{⌣}{x}) + {\overset{⌣}{g}}^{⋆} (\overset{⌣}{x}) w_{k}^{⋆} (\overset{⌣}{x}), k + 1) + \frac{1}{2} γ^{2} {‖ w_{k}^{⋆} (\overset{⌣}{x}) ‖}^{2} - \frac{1}{2} {‖ {\overset{⌣}{z}}_{k} (\overset{⌣}{x}) ‖}^{2}, Y (\overset{⌣}{x}, K + 1) = 0,$

(12.56)

$\begin{array}{l} V (\overset{⌣}{x}, k) = V ({\overset{⌣}{f}}^{⋆} (\overset{⌣}{x}) + {\overset{⌣}{g}}^{⋆} (\overset{⌣}{x}) w_{k}^{⋆} (\overset{⌣}{x}), k + 1) + \frac{1}{2} {‖ {\overset{⌣}{z}}_{k} (\overset{⌣}{x}) ‖}^{2}, V (\overset{⌣}{x}, K + 1) = 0, \\ k = k_{0}, ….. K . \end{array}$

(12.57)

k = k₀,…, K, together with the side-conditions

$w_{k}^{⋆} = - \frac{1}{γ^{2}} {\overset{⌣}{g}}^{T} (\overset{⌣}{x}) {\frac{\partial^{T} Y (λ,k+1)}{\partial λ} |}_{λ = \overset{⌣}{f} (\overset{⌣}{x}) + \overset{⌣}{g} (\overset{⌣}{x}) w_{k}^{⋆}},$

(12.58)

$L^{⋆} = \arg \min_{L} {H_{2} (\overset{⌣}{x}, w_{k}^{⋆}, L, V)},$

(12.59)

${\frac{\partial^{2} H_{1}}{\partial w^{2}} (\overset{⌣}{x}, w_{k}, L^{⋆}, Y) |}_{\overset{⌣}{x} = 0} > 0,$

(12.60)

${\frac{\partial^{2} H_{2}}{\partial L^{2}} (\overset{⌣}{x}, w^{⋆}_{k}, L, V) |}_{\overset{⌣}{x} = 0} > 0,$

(12.61)

where

${\overset{⌣}{f}}^{⋆} (\overset{⌣}{x}) = \overset{⌣}{f} (\overset{⌣}{x}) |_{L = L^{⋆}}, {\overset{⌣}{g}}^{⋆} (\overset{⌣}{x}) = \overset{⌣}{g} (\overset{⌣}{x}) |_{L = L^{⋆}} .$

Then:

(i) there exists locally a Nash-equilibrium solution $(w^{⋆}, L^{⋆})$ for the game (12.48), (12.49), (12.42) locally in N

(ii) the augmented system (12.47) is locally dissipative with respect to the supply-rate $s (w_{k}, {\overset{⌣}{z}}_{k}) = \frac{1}{2} (γ^{2} {‖ w_{k} ‖}^{2} - {‖ {\overset{⌣}{z}}_{k} ‖}^{2})$ and hence has ℓ₂-gain from w to $\overset{⌣}{z}$ less or equal to γ;

(iii) the optimal costs or performance objectives of the game are $J_{1}^{^{⋆}} (L^{⋆}, w^{⋆}) = Y ({\overset{⌣}{x}}^{0}, k_{0})$ and $J_{2}^{^{⋆}} (L^{⋆}, w^{⋆}) = V ({\overset{⌣}{x}}^{0}, k_{0})$ ;

(iv) the filter Σ^daf with the gain matrix $L ({\overset{⌣}{x}}_{k}, k)$ satisfying (12.59) solves the finite-horizon D M H 2 H I N L F P for the system locally in N.

Proof: Assume there exist definite solutions Y, V to the DHJIEs (12.56)-(12.57), and (i) consider the Hamiltonian function H₁(., ., ., .). Then applying the necessary condition for the worst-case noise, we have

${\frac{\partial^{T} H_{1}}{\partial w} |}_{w = w_{k}^{⋆}} = {\overset{⌣}{g}}^{T} (\overset{⌣}{x}) {\frac{\partial^{T} Y (λ, k + 1)}{\partial λ} |}_{λ = \overset{⌣}{f} (\overset{⌣}{x}) + \overset{⌣}{g} (\overset{⌣}{x}) w_{k}^{⋆}} + γ^{2} w_{k}^{⋆} = 0,$

to get

$w_{k}^{⋆} : = - \frac{1}{γ^{2}} {\overset{⌣}{g}}^{T} (\overset{⌣}{x}) {\frac{\partial^{T} Y (λ, k + 1)}{\partial λ} |}_{λ = \overset{⌣}{f} (\overset{⌣}{x}) + \overset{⌣}{g} (\overset{⌣}{x}) w_{k}^{⋆}} : = α_{0} (\overset{⌣}{x}, w_{k}^{⋆}) .$

(12.62)

Thus, w^⋆ is expressed implicitly. Moreover, since

$\frac{\partial^{2} H_{1}}{\partial w^{2}} = {\overset{⌣}{g}}^{T} (\overset{⌣}{x}) {\frac{\partial^{2} Y (λ, k + 1)}{\partial λ^{2}} |}_{λ = \overset{⌣}{f} (\overset{⌣}{x}) + \overset{⌣}{g} (\overset{⌣}{x}) w_{k}^{⋆}} {\overset{⌣}{g}}^{} (\overset{⌣}{x}) + γ^{2} I$

is nonsingular about ( $\overset{⌣}{x}$ , w) = (0, 0), equation (12.62) has a unique solution α₁( $\overset{⌣}{x}$ ), α₁(0) = 0 in the neighborhood N₀ × W₀ of (x, w) = (0, 0) by the Implicit-function Theorem [234].

Now, substitute w^⋆ in the expression for H₂(., ., ., .) (12.55), to get

$H_{2} (\overset{⌣}{x}, w_{k}^{⋆}, L, V) = V (\overset{⌣}{f} (\overset{⌣}{x}) + \overset{⌣}{g} (\overset{⌣}{x}) w_{k}^{⋆} (\overset{⌣}{x}), k + 1) - V (\overset{⌣}{x}, k) + \frac{1}{2} {‖ {\overset{⌣}{z}}_{k} (\overset{⌣}{x}) ‖}^{2},$

and let

$L^{⋆} = \arg \min_{L} {H_{2} (\overset{⌣}{x}, w_{k}^{⋆}, L, V)} .$

Then by Taylor’s theorem, we can expand H₂(., w^⋆ , ., .) about L^⋆ [267] as

$\begin{array}{l} H_{2} (\overset{⌣}{x}, w^{⋆}, L, Y) = H_{2} (\overset{⌣}{x}, w^{⋆}, L^{⋆}, Y_{\overset{⌣}{x}}^{T}) + \\ \frac{1}{2} T r {[I_{n} \otimes {(L - L^{⋆})}^{T}] \frac{\partial^{2} H_{2}}{\partial L^{2}} (w^{⋆}, L) [I_{m} \otimes {(L - L^{⋆})}^{T}]} + \\ O ({‖ L - L^{⋆} ‖}^{3}) . \end{array}$

Therefore, taking L^⋆ as in (12.59) and if the condition (12.61) holds, then H₂(., ., w^⋆ , .) is minimized, and the Nash-equilibrium condition

$H_{2} (w^{⋆}, L^{⋆}) \leq H_{2} (w^{⋆}, L) \forall L \in ℳ^{n \times m}, k = k_{0}, …, K$

is satisfied. Moreover, substituting $(w^{⋆}, L^{⋆})$ in (12.53) gives the DHJIE (12.57).

Now substitute L^⋆ as given by (12.59) in the expression for H₁(., ., ., .) and expand it in Taylor’s-series about w^⋆ to obtain

$\begin{array}{l} H_{1} (\overset{⌣}{x}, w_{k}, L^{⋆}, Y) = Y ({\overset{⌣}{f}}^{⋆} (\overset{⌣}{x}) + {\overset{⌣}{g}}^{⋆} (\overset{⌣}{x}) w, k + 1) - Y (x, k) + \frac{1}{2} γ^{2} | | w_{k} | |^{2} - \frac{1}{2} | | {\overset{⌣}{z}}_{k} | |^{2} \\ = H_{1} (\overset{⌣}{x}, w_{k}^{⋆}, L^{⋆}, Y) + \frac{1}{2} {(w_{k} - w_{k}^{⋆})}^{T} \frac{\partial^{2} H_{2}}{\partial w_{k}^{2}} (w_{k}, L^{⋆}), (w_{k} - w_{k}^{⋆}) + \\ O ({‖ w_{k} - w_{k}^{⋆} ‖}^{3}) . \end{array}$

Further, substituting w = w^⋆ as given by (12.62) in the above, and if the condition (12.60) is satisfied, we see that the second Nash-equilibrium condition

$H_{1} (w^{⋆}, L^{⋆}) \leq H_{1} (w, L^{⋆}), \forall w \in W$

is also satisfied. Therefore, the pair $(w^{⋆}, L^{⋆})$ constitute a Nash-equilibrium solution to the two-player nonzero-sum dynamic game. Moreover, substituting $(w^{⋆}, L^{⋆})$ in (12.52) gives the DHJIE (12.56).

(ii) The Nash-equilibrium condition

$H_{1} (\overset{⌣}{x}, w, L^{⋆}, Y) \geq H_{1} (\overset{⌣}{x}, w^{⋆}, L^{⋆}, Y) = 0 \forall \overset{⌣}{x} \in U, \forall w \in W$

implies

$\begin{array}{l} Y (\overset{⌣}{x}, k) - Y ({\overset{⌣}{x}}_{k + 1}, k + 1) \leq \frac{1}{2} γ^{2} {‖ w_{k} ‖}^{2} - \frac{1}{2} {‖ {\overset{⌣}{z}}_{k} ‖}^{2}, \forall \overset{⌣}{x} \in U, \forall w \in W \\ \Leftrightarrow \overset{⌣}{Y} ({\overset{⌣}{x}}_{k + 1}, k + 1) - \overset{⌣}{Y} ({\overset{⌣}{x}}_{k}, k) \leq \frac{1}{2} γ^{2} {‖ w_{k} ‖}^{2} - \frac{1}{2} {‖ {\overset{⌣}{z}}_{k} ‖}^{2}, \forall \overset{⌣}{x} \in U, \forall w \in W \end{array}$

(12.63)

for some positive-definite function $\overset{⌣}{Y} = - Y > 0$ . Summing now the above inequality from k = k₀ to k = K we get the dissipation-inequality

$\overset{⌣}{Y} ({\overset{⌣}{x}}_{k + 1}, k + 1) - \overset{⌣}{Y} (x_{k_{0}}, k_{0}) \leq \sum_{k = k_{0}}^{K} \frac{1}{2} γ^{2} {‖ w_{k} ‖}^{2} - \frac{1}{2} {‖ {\overset{⌣}{z}}_{k} ‖}^{2} .$

(12.64)

Thus, from Chapter 3, the system has ℓ₂-gain from w to $\overset{⌣}{z}$ less or equal to γ.

(iii) Consider the cost functional J₁(L, w) first, and rewrite it as

$\begin{array}{l} J_{1} (L, w) + Y ({\overset{⌣}{x}}_{k + 1}, k + 1) - Y (\overset{⌣}{x} (k_{0}), k_{0}) \leq \sum_{k = k_{0}}^{K} {\frac{1}{2} γ^{2} {‖ w_{k} ‖}^{2} - \frac{1}{2} {‖ {\overset{⌣}{z}}_{k} ‖}^{2} + \\ Y ({\overset{⌣}{x}}_{k + 1}, k + 1) - Y ({\overset{⌣}{x}}_{k}, k)} \\ = \sum_{k = k_{0}}^{K} H_{1} (\overset{⌣}{x}, w_{k}, L_{k}, Y) . \end{array}$

Substituting (L^⋆, w^⋆) in the above equation and using the DHJIE (12.56) gives H₁( $\overset{⌣}{x}$ , w^⋆ , L^⋆ , Y) = 0 and the result follows. Similarly, consider the cost functional J₂(L, w) and rewrite it as

$\begin{array}{l} J_{2} (L, w) + V ({\overset{⌣}{x}}_{k + 1}, k + 1) - V ({\overset{⌣}{x}}_{k_{0}}, k_{0}) = \sum_{k = k_{0}}^{K} {\frac{1}{2} {‖ {\overset{⌣}{z}}_{k} ‖}^{2} + V ({\overset{⌣}{x}}_{k + 1}, k + 1) - V ({\overset{⌣}{x}}_{k}, k)} \\ = \sum_{k = k_{0}}^{K} H_{2} (\overset{⌣}{x}, w_{k}, L_{k}, V) . \end{array}$

Since V ( $\overset{⌣}{x}$ , K + 1) = 0, substituting (L^⋆ , w^⋆) in the above and using the DHJIE (12.57) the result similarly follows.

(iv) Notice that the inequality (12.63) implies that with w_k ≡ 0,

$\overset{⌣}{Y} ({\overset{⌣}{x}}_{k + 1}, k + 1) - \overset{⌣}{Y} ({\overset{⌣}{x}}_{k}, k) \leq - \frac{1}{2} {‖ {\overset{⌣}{z}}_{k} ‖}^{2}, \forall \overset{⌣}{x} \in ϒ,$

(12.65)

and since $\overset{⌣}{Y}$ is positive-definite, by Lyapunov’s theorem, the augmented system is locally stable. Finally, combining (i)-(iii), (iv) follows. □

12.2.2 Solution to the Infinite-Horizon Discrete-Time Mixed $H_{2}$ / $H_{∞}$ Nonlinear Filtering Problem

In this subsection, we discuss the infinite-horizon filtering problem, in which case we let K → ∞. Moreover, in this case, we seek a time-invariant gain $\hat{L} (\hat{x})$ for the filter, and consequently time-independent functions $Y, V : \tilde{N} \times \tilde{N} \to ℜ$ locally defined in a neighborhood $\tilde{N} \times \tilde{N} \subset X \times X$ of $(x, \hat{x}) = (0, 0)$ , such that the following steady-state DHJIEs:

$Y ({\overset{⌣}{f}}^{⋆} (\overset{⌣}{x}) + {\overset{⌣}{g}}^{⋆} (\overset{⌣}{x}) {\tilde{w}}^{⋆} (\overset{⌣}{x}) - Y (\overset{⌣}{x}) + \frac{1}{2} γ^{2} {‖ {\tilde{w}}^{⋆}_{k} (\overset{⌣}{x} ‖}^{2} - \frac{1}{2} {‖ \overset{⌣}{z} (\overset{⌣}{x} ‖}^{2} = 0 Y (0) = 0,$

(12.66)

$V ({\overset{⌣}{f}}^{⋆} (\overset{⌣}{x}) + {\overset{⌣}{g}}^{⋆} (\overset{⌣}{x}) {\tilde{w}}^{⋆} - V (\overset{⌣}{x}) + \frac{1}{2} {‖ \overset{⌣}{z} (\overset{⌣}{x} ‖}^{2} = 0 V (0) = 0,$

(12.67)

are satisfied together with the side-conditions:

${\tilde{w}}^{⋆} = - \frac{1}{γ^{2}} {\overset{⌣}{g}}^{T} (\overset{⌣}{x}) {\frac{\partial^{T} Y (λ)}{\partial λ} |}_{λ = \overset{⌣}{f} (\overset{⌣}{x}) + \overset{⌣}{g} (\overset{⌣}{x}) {\tilde{w}}_{}^{⋆}} : = α_{2} (\overset{⌣}{x}, {\tilde{w}}^{⋆}),$

(12.68)

$L^{⋆} (\hat{x}) = \arg \min_{\tilde{L}} {{\tilde{H}}_{2} (\overset{⌣}{x}, w^{⋆}, \tilde{L}, V)},$

(12.69)

${\frac{\partial^{2} {\tilde{H}}_{1}}{\partial w^{2}} (\overset{⌣}{x}, w, {\tilde{L}}^{⋆}, Y) |}_{\overset{⌣}{x} = 0} > 0,$

(12.70)

${\frac{\partial^{2} {\tilde{H}}_{2}}{\partial {\tilde{L}}_{}^{2}} (\overset{⌣}{x}, {\tilde{w}}^{⋆}, \tilde{L}, V) |}_{\overset{⌣}{x} = 0} > 0,$

(12.71)

where ${\tilde{w}}^{⋆}, {\tilde{L}}^{⋆}$ are the asymptotic values of w^⋆,L^⋆

$\begin{array}{l} {\overset{⌣}{f}}^{⋆} (\overset{⌣}{x}) = {\overset{⌣}{f} (\overset{⌣}{x}) |}_{\tilde{L} = {\tilde{L}}^{⋆}}, {\overset{⌣}{g}}^{⋆} (\overset{⌣}{x}) = {\overset{⌣}{g} (\overset{⌣}{x}) |}_{\tilde{L} = {\tilde{L}}^{⋆}}, \\ {\tilde{H}}_{1} (\overset{⌣}{x}, w_{k}, \tilde{L}, Y) = Y (\overset{⌣}{f} (\overset{⌣}{x}) + \overset{⌣}{g} (\overset{⌣}{x}) w_{k}) - Y (\overset{⌣}{x}) + \frac{1}{2} γ^{2} {‖ w_{k} ‖}^{2} - \frac{1}{2} {‖ {\overset{⌣}{z}}_{k} ‖}^{2}, \\ {\tilde{H}}_{2} (\overset{⌣}{x}, w_{k}, \tilde{L}, V) = V (\overset{⌣}{f} (\overset{⌣}{x}) + \overset{⌣}{g} (\overset{⌣}{x}) w_{k}) - V (\overset{⌣}{x}) + \frac{1}{2} {‖ {\overset{⌣}{z}}_{k} ‖}^{2} . \end{array}$

Again here, since the estimation is carried over an infinite-horizon, it is necessary to ensure that the augmented system (12.47) is stable with w = 0. However, in this case, we can relax the requirement of asymptotic-stability for the original system (12.42) with a milder requirement of detectability which we define next.

Definition 12.2.2 The pair {f, h} is said to be locally zero-state detectable if there exists a neighborhood O of x = 0 such that, if x_k is a trajectory of x_k+1 = f(x_k) satisfying x(k₀) ∈ O, then h(x_k) is defined for all k ≥ k₀ and h(x_k) = 0, for all k ≥ k_s, implies lim_k→∞ x_k = 0. Moreover {f, h} is zero-state detectable if O = X.

The “admissibility” of the discrete-time filter is similarly defined as follows.

Definition 12.2.3 A filter F is admissible if it is asymptotically (or internally) stable for any given initial condition x(k₀) of the plant Σ^da, and with w ≡ 0

$\lim_{k \to \infty} {\overset{⌣}{z}}_{k} = 0.$

The following proposition can now be proven along the same lines as Theorem 12.2.2.

Proposition 12.2.1 Consider the nonlinear system (12.42) and the infinite-horizon D M H 2 H I N L F P for it. Suppose the function h₁ is one-to-one (or injective) and the plant Σ^da is zero-state detectable. Suppose further, there exists a pair of C² negative and positive-definite functions $Y, V : \tilde{N} \times \tilde{N} \to ℜ,$ respectively, locally defined in a neighborhood $\tilde{N} \times \tilde{N} \subset X \times X$ of the origin $\overset{⌣}{x}$ = 0, and a matrix function $\tilde{L} : \tilde{N} \to ℳ^{n \times m}$ satisfying the pair of coupled DHJIEs (12.66), (12.67) together with (12.68)-(12.71). Then:

(i) there exists locally a Nash-equilibrium solution $({\tilde{w}}^{⋆}, {\tilde{L}}^{⋆})$ for the game;

(ii) the augmented system (12.47) is dissipative with respect to the supply-rate $s (w_{k}, {\overset{⌣}{z}}_{k}) = \frac{1}{2} (γ^{2} {‖ w_{k} ‖}^{2} - {‖ {\overset{⌣}{z}}_{k} ‖}^{2})$ and hence has ℓ₂-gain from w to $\overset{⌣}{z}$ less or equal to γ;

(iii) the optimal costs or performance objectives of the game are $J_{1}^{^{⋆}} ({\tilde{L}}^{⋆}, {\tilde{w}}^{⋆}) = Y ({\overset{⌣}{x}}^{0})$ and $J_{2}^{^{⋆}} ({\tilde{L}}^{⋆}, {\tilde{w}}^{⋆}) = V ({\overset{⌣}{x}}^{0})$

(iv) the filter Σ^daf with the gain matrix $L (\hat{x}) = {\tilde{L}}^{⋆} (\hat{x})$ satisfying (12.69) solves the infinite horizon D M H 2 H I N L F P locally in $\tilde{N}$ .

Proof: Since the proof of items (i)-(iii) is similar to that of Theorem 12.2.2, we prove only item (iv).

(iv) Using similar manipulations as in the proof of Theorem 12.2.2, it can be shown that a similar inequality as (12.63) also holds. This implies that with w_k ≡ 0,

$\overset{⌣}{Y} ({\overset{⌣}{x}}_{k + 1}) - \overset{⌣}{Y} ({\overset{⌣}{x}}_{k}) \leq - \frac{1}{2} | | {\overset{⌣}{z}}_{k} | |^{2}, \forall \overset{⌣}{x} \in \tilde{N}$

(12.72)

and since $\overset{⌣}{Y}$ is positive-definite, by Lyapunov’s theorem, the augmented system is locally stable. Furthermore, for any trajectory of the system ${\overset{⌣}{x}}_{k}$ such that $\overset{⌣}{Y} ({\overset{⌣}{x}}_{k + 1}) - \overset{⌣}{Y} ({\overset{⌣}{x}}_{k})$ = 0 for all k ≥ k_c > k₀, it implies that z_k ≡ 0. This in turn implies h₁(x_k) = h₁( ${\hat{x}}_{k}$ ), and x_k = ${\hat{x}}_{k}$ ∀k ≥ k_c since h₁ is injective. This further implies that h₂(x_k) = h₂( ${\hat{x}}_{k}$ ) ∀k ≥ k_c and it is a trajectory of the free system:

${\overset{⌣}{x}}_{k + 1} = (\begin{array}{l} f (x_{k}) \\ f ({\hat{x}}_{k}) \end{array}) .$

By zero-state of the detectability of {f, h₁}, we have lim_k→∞ x_k = 0, and we have internal stability of the augmented system with lim_k→∞ z_k = 0. Hence, Σ^daf is admissible. Finally, combining (i)-(iii), (iv) follows. □

12.2.3 Approximate and Explicit Solution to the Infinite-Horizon Discrete-Time Mixed $H_{2}$ / $H_{∞}$ Nonlinear Filtering Problem

In this subsection, we discuss how the D M H 2 H I N L F P can be solved approximately to obtain explicit solutions [126]. We consider the infinite-horizon problem for this purpose, but the approach can also be used for the finite-horizon problem. For simplicity, we make the following assumption on the system matrices.

Assumption 12.2.1 The system matrices are such that

$\begin{array}{l} k_{21} (x) g_{1}^{T} (x) = 0, \\ k_{21} (x) k_{21}^{T} (x) = I . \end{array}$

Consider now the infinite-horizon Hamiltonian functions

$\begin{array}{l} {\tilde{H}}_{1} (\overset{⌣}{x}, w, \hat{L}, \tilde{Y}) = \tilde{Y} (\overset{⌣}{f} (\overset{⌣}{x}) + \overset{⌣}{g} (\overset{⌣}{x}) w) - \tilde{Y} (\overset{⌣}{x}) + \frac{1}{2} γ^{2} {‖ w ‖}^{2} - \frac{1}{2} {‖ \overset{⌣}{z} ‖}^{2}, \\ {\tilde{H}}_{2} (\overset{⌣}{x}, w, \hat{L}, \tilde{V}) = \tilde{V} (\overset{⌣}{f} (\overset{⌣}{x}) + \overset{⌣}{g} (\overset{⌣}{x}) w) - \tilde{V} (\overset{⌣}{x}) + \frac{1}{2} {‖ \overset{⌣}{z} ‖}^{2} . \end{array}$

for some negative and positive-definite functions $\tilde{Y}, \tilde{V} : \hat{N} \times \hat{N} \to ℜ, \hat{N} \subset X$ a neighborhood of x = 0, and where $\overset{⌣}{x} = {\overset{⌣}{x}}_{k}, w = w_{k}, z = z_{k}$ . Expanding them in Taylor series¹ about $\hat{f} (\overset{⌣}{x})$ up to first-order:

$\begin{array}{l} {\hat{H}}_{1} (\overset{⌣}{x}, w, \hat{L}, \tilde{Y}) = {\tilde{Y} (\hat{f} (\overset{⌣}{x})) + {\tilde{Y}}_{x} (\hat{f} (\overset{⌣}{x})) g_{1} (x) w) + {\tilde{Y}}_{\hat{x}} (\hat{f} (\overset{⌣}{x})) [\hat{L} (\hat{x}) (h_{2} (x) - h_{2} (\hat{x}) + k_{21} (x) w)] \\ + O ({‖ \tilde{υ} ‖}^{2}} - \tilde{Y} (\overset{⌣}{x}) + \frac{1}{2} γ^{2} {‖ w ‖}^{2} - \frac{1}{2} {‖ \overset{⌣}{z} ‖}^{2}, \forall \overset{⌣}{x} \in \hat{N} \times \hat{N}, w \in W \end{array}$

(12.73)

$\begin{array}{l} {\hat{H}}_{2} (\overset{⌣}{x}, w, \hat{L}, \tilde{V}) = {\tilde{V} (\hat{f} (\overset{⌣}{x})) + {\tilde{V}}_{x} (\hat{f} (\overset{⌣}{x})) g_{1} (x) w) + {\tilde{V}}_{\hat{x}} (\hat{f} (\overset{⌣}{x})) [\hat{L} (\hat{x}) (h_{2} (x) - h_{2} (\hat{x}) + k_{21} (x) w)] \\ + O ({‖ \tilde{υ} ‖}^{2}} - \tilde{V} (\overset{⌣}{x}) + \frac{1}{2} {‖ \overset{⌣}{z} ‖}^{2}, \forall \overset{⌣}{x} \in \hat{N} \times \hat{N}, w \in W \end{array}$

(12.74)

where ${\tilde{Y}}_{x}, {\tilde{V}}_{x}$ are the row-vectors of the partial-derivatives of $\tilde{Y}$ and $\tilde{V}$ respectively,

$\tilde{υ} = (\begin{array}{l} g_{1} (x) w \\ \hat{L} (\hat{x}) [h_{2} (x) - h_{2} (\hat{x}) + k_{21} (x) w] \end{array})$

and

$\lim_{\tilde{υ} \to \infty} \frac{O ({‖ \tilde{υ} ‖}^{2})}{{‖ \tilde{υ} ‖}^{2}} = 0.$

Then, applying the necessary conditions for the worst-case noise, we get

$\frac{\partial H_{1}}{\partial w} = 0 \Rightarrow {\hat{w}}^{⋆} : = - \frac{1}{γ^{2}} [g_{1}^{T} (x) {\tilde{Y}}_{x}^{T} (\hat{f} (\overset{⌣}{x})) + k_{21}^{T} (x) {\hat{L}}^{T} (\hat{x}) {\tilde{Y}}_{\hat{x}}^{T} (\hat{f} (\overset{⌣}{x}))] .$

(12.75)

Now substitute ${\hat{w}}^{⋆}$ in (12.74) to obtain

$\begin{array}{l} {\hat{H}}_{2} (\overset{⌣}{x}, {\hat{w}}^{⋆}, \hat{L}, V) \approx V (\hat{f} (\overset{⌣}{x})) - \tilde{V} (\overset{⌣}{x}) - \frac{1}{γ^{2}} {\tilde{V}}_{x} (\hat{f} (\overset{⌣}{x})) g_{1} (x) g_{1}^{T} (x) {\tilde{Y}}_{x}^{T} (\hat{f} (\overset{⌣}{x})) + \\ {\tilde{V}}_{\hat{x}} (\hat{f} (\overset{⌣}{x})) \hat{L} (\hat{x}) [h_{2} (x) - h_{2} (\hat{x}) - \frac{1}{γ^{2}} {\tilde{V}}_{x} (\hat{f} (\overset{⌣}{x})) \hat{L} (\hat{x}) {\hat{L}}^{T} (\hat{x}) {\tilde{Y}}_{x}^{T} (\hat{f} (\overset{⌣}{x})) + \\ \frac{1}{2} {‖ z ‖}^{2} . \end{array}$

Then, completing the squares for ${\hat{L}}^{}$ in the above expression for ${\hat{H}}_{2}$ (., ., ., .), we have

$\begin{array}{l} {\hat{H}}_{2} (\overset{⌣}{x}, {\hat{w}}^{⋆}, \hat{L}, \tilde{V}) \approx \tilde{V} (\hat{f} (\overset{⌣}{x})) - \tilde{V} (\overset{⌣}{x}) - \frac{1}{γ^{2}} {\tilde{V}}_{x} (\hat{f} (\overset{⌣}{x})) g_{1} (x) g_{1}^{T} (x) {\tilde{Y}}_{x}^{T} (\hat{f} (\overset{⌣}{x})) + \frac{1}{2} | | z | |^{2} + \\ \frac{1}{{2γ}^{2}} {‖ {\hat{L}}^{T} (\hat{x}) {\tilde{V}}_{\hat{x}}^{T} (\hat{f} (\overset{⌣}{x})) + γ^{2} (h_{2} (x) - h_{2} (\hat{x})) ‖}^{2} - \frac{γ^{2}}{2} {‖ h_{2} (x) - h_{2} (\hat{x}) ‖}^{2} + \\ \frac{1}{{2γ}^{2}} {‖ {\hat{L}}^{T} (\hat{x}) {\tilde{Y}}_{\hat{x}}^{T} (\hat{f} (\overset{⌣}{x})) ‖}^{2} - \frac{1}{{2γ}^{2}} {‖ {\hat{L}}^{T} (\hat{x}) {\tilde{V}}_{\hat{x}}^{T} (\hat{f} (\overset{⌣}{x})) + {\hat{L}}^{T} (\hat{x}) {\tilde{Y}}_{x}^{T} (\hat{f} (\overset{⌣}{x})) ‖}^{2} . \end{array}$

Therefore, taking ${\hat{L}}^{⋆}$ as

${\tilde{V}}_{\hat{x}} (\hat{f} (\overset{⌣}{x})) {\hat{L}}^{⋆} (\hat{x}) = - γ^{2} {(h_{2} (x) - h_{2} (\hat{x}))}^{T}, x, \hat{x} \in \hat{N}$

(12.76)

minimizes ${\hat{H}}_{2}$ (., ., ., .) and renders the Nash-equilibrium condition

${\hat{H}}_{2} ({\hat{w}}^{⋆}, {\hat{L}}^{⋆}) \leq {\hat{H}}_{2} ({\hat{w}}^{⋆}, \hat{L}) \forall \hat{L} \in ℳ^{n \times m}$

satisfied.

Substitute now ${\hat{L}}^{⋆}$ as given by (12.76) in the expression for H₁(., ., ., .) and complete the squares in w to obtain:

$\begin{array}{l} {\hat{H}}_{1} (\overset{⌣}{x}, \hat{w}, {\hat{L}}^{⋆}, \tilde{Y}) = \tilde{Y} (\hat{f} (\overset{⌣}{x})) - \tilde{Y} (\overset{⌣}{x}) - \frac{1}{γ^{2}} {\tilde{Y}}_{\hat{x}} (\hat{f} (\overset{⌣}{x})) \hat{L} (\hat{x}) {\hat{L}}^{T} (\hat{x}) {\tilde{V}}_{\hat{x}}^{T} (\hat{f} (\overset{⌣}{x}) - \\ \frac{1}{{2γ}^{2}} {\tilde{Y}}_{x} (\hat{f} (\overset{⌣}{x}) g_{1} (x) g_{1}^{T} (x) {\tilde{Y}}_{x}^{T} (\hat{f} (\overset{⌣}{x})) - \frac{1}{{2γ}^{2}} {\tilde{Y}}_{\hat{x}} (\hat{f} (\overset{⌣}{x})) \hat{L} (\hat{x}) {\hat{L}}^{T} (\hat{x}) {\tilde{Y}}_{x}^{T} (\hat{f} (\overset{⌣}{x})) \\ - \frac{1}{2} {‖ z ‖}^{2} + \frac{γ^{2}}{2} {‖ w + \frac{1}{γ^{2}} g_{1}^{T} (x) {\tilde{Y}}_{x}^{T} (\hat{f} (\overset{⌣}{x})) + \frac{1}{γ^{2}} k_{21}^{T} (x) {\hat{L}}^{T} (\hat{x}) {\tilde{Y}}_{\hat{x}}^{T} (\hat{f} (\overset{⌣}{x})) ‖}^{2} . \end{array}$

Similarly, substituting w = ${\hat{w}}^{⋆}$ as given by (12.75), we see that, the second Nash-equilibrium condition

${\hat{H}}_{1} ({\hat{w}}^{⋆}, {\hat{L}}^{⋆}) \leq H_{1} (w, {\hat{L}}^{⋆}), \forall w \in W$

is also satisfied. Thus, the pair $({\hat{w}}^{⋆}, {\hat{L}}^{⋆})$ constitutes a Nash-equilibrium solution to the two-player nonzero-sum dynamic game corresponding to the Hamiltonians ${\hat{H}}_{1} (., ., ., .) and {\hat{H}}_{2} (., ., .,)$ . With this analysis, we have the following important theorem.

Theorem 12.2.3 Consider the nonlinear system (12.42) and the infinite-horizon D M H 2 H I N L F P for it. Suppose the function h₁ is one-to-one (or injective) and the plant Σ^da is zero-state detectable. Suppose further, there exists a pair of C¹ negative and positive-definite functions $\tilde{Y}, \tilde{V} : \hat{N} \times \hat{N} \to ℜ$ respectively, locally defined in a neighborhood $\hat{N} \times \hat{N} \subset X \times X$ of the origin $\overset{⌣}{x} = 0$ , and a matrix function $\hat{L} : \hat{N} \to ℳ^{n \times m}$ satisfying the pair of coupled DHJIEs:

$\begin{array}{l} \tilde{Y} (\hat{f} (\overset{⌣}{x})) - \tilde{Y} (\overset{⌣}{x}) - \frac{1}{{2γ}^{2}} {\tilde{Y}}_{x} (\hat{f} (\overset{⌣}{x}) g_{1} (x) g_{1}^{T} (x) {\tilde{Y}}_{x}^{T} (\hat{f} (\overset{⌣}{x})) - \frac{1}{{2γ}^{2}} {\tilde{Y}}_{\hat{x}} (\hat{f} (\overset{⌣}{x})) \hat{L} (\hat{x}) {\hat{L}}^{T} (\hat{x}) {\tilde{Y}}_{x}^{T} (\hat{f} (\overset{⌣}{x})) - \\ \frac{1}{γ^{2}} {\tilde{Y}}_{\hat{x}} (\hat{f} (\overset{⌣}{x})) \hat{L} (\hat{x}) {\hat{L}}^{T} (\hat{x}) {\tilde{V}}_{\hat{x}}^{T} (\hat{f} (\overset{⌣}{x})) - \\ \frac{1}{2} {(h_{1} (x) - h_{1} (\hat{x}))}^{T} (h_{1} (x) - h_{1} (\hat{x})) = 0, \tilde{Y} (0) = 0, \end{array}$

(12.77)

$\begin{array}{l} \tilde{V} (\hat{f} (\overset{⌣}{x})) - \tilde{V} (\overset{⌣}{x}) - \frac{1}{γ^{2}} {\tilde{V}}_{x} (\hat{f} (\overset{⌣}{x}) g_{1} (x) g_{1}^{T} (x) {\tilde{Y}}_{x}^{T} (\hat{f} (\overset{⌣}{x})) - \frac{1}{γ^{2}} {\tilde{V}}_{\hat{x}} (\hat{f} (\overset{⌣}{x})) \hat{L} (\hat{x}) {\hat{L}}^{T} (\hat{x}) {\tilde{Y}}_{x}^{T} (\hat{f} (\overset{⌣}{x}) - \\ γ^{2} {(h_{2} (x) - h_{2} (\hat{x}))}^{T} (h_{2} (x) - h_{2} (\hat{x})) + \\ \frac{1}{2} {(h_{1} (x) - h_{1} (\hat{x}))}^{T} (h_{1} (x) - h_{1} (\hat{x})) = 0, \tilde{V} (0) = 0, \end{array}$

(12.78)

together with the coupling condition (12.76). Then:

(i) there exists locally in $\hat{N}$ a Nash-equilibrium solution $({\hat{w}}^{⋆}, {\hat{L}}^{⋆})$ for the dynamic game corresponding to (12.48), (12.49), (12.47);

(ii) the augmented system (12.47) is locally dissipative with respect to the supply rate $s (w, \overset{⌣}{z}) = \frac{1}{2} (γ^{2} {‖ w ‖}^{2} - {‖ \overset{⌣}{z} ‖}^{2}) i n \hat{N},$ and hence has ℓ₂-gain from w to $\overset{⌣}{z}$ less or equal to γ;

(iii) the optimal costs or performance objectives of the game are approximately $J_{1}^{⋆} ({\hat{L}}^{⋆}, {\hat{w}}^{⋆}) = \tilde{Y} ({\overset{⌣}{x}}_{0}) a n d J_{2}^{⋆} ({\hat{L}}^{⋆}, {\hat{w}}^{⋆}) = \tilde{V} ({\overset{⌣}{x}}_{0});$

(iv) the filter Σ^daf with the gain-matrix $\hat{L} (\hat{x}) = {\hat{L}}^{⋆} (\hat{x})$ satisfying (12.76) solves the infinitehorizon D M H 2 H I N L F P for the system locally in $\hat{N}$

Proof: Part (i) has already been shown above. To complete it, we substitute $({\hat{L}}^{⋆}, {\hat{w}}^{⋆})$ in the DHJIEs (12.66), (12.67) with ${\hat{H}}_{1} (., ., ., .) and {\hat{H}}_{2} (., ., .,)$ replacing H₁(.,.,.,.),H₂(.,.,.,.) respectively, to get the DHJIEs (12.77), (12.78) respectively.

(ii) Consider the time-variation of $\tilde{Y}$ along a trajectory of the system (12.47) with $\hat{L} = {\hat{L}}^{⋆} :$ :

$\begin{array}{l} \tilde{Y} ({\overset{⌣}{x}}_{k + 1}) = \tilde{Y} ({\overset{⌣}{f}}^{⋆} (x) + {\overset{⌣}{g}}^{⋆} (x) w) \forall \overset{⌣}{x} \in \hat{N}, \forall w \in W \\ \approx \tilde{Y} (\hat{f} (\overset{⌣}{x})) + {\tilde{Y}}_{x} (\hat{f} (\overset{⌣}{x}) g_{1} (x) w + {\tilde{Y}}_{\hat{x}} (\hat{f} (\overset{⌣}{x})) [{\hat{L}}^{⋆} (\hat{x}) (h_{2} (x) - h_{2} (\hat{x}) + k_{21} (x) w)] \\ = \tilde{Y} (\hat{f} (\overset{⌣}{x})) - \frac{1}{{2γ}^{2}} {\tilde{Y}}_{x} (\hat{f} (\overset{⌣}{x}) g_{1} (x) g_{1}^{T} (x) {\tilde{Y}}_{x}^{T} (\hat{f} (\overset{⌣}{x})) - \\ \frac{1}{γ^{2}} {\tilde{Y}}_{\hat{x}} (\hat{f} (\overset{⌣}{x})) {\hat{L}}^{⋆} (\hat{x}) {\hat{L}}^{⋆}^{T} (\hat{x}) {\tilde{V}}_{\hat{x}}^{T} (\hat{f} (\overset{⌣}{x})) - \frac{1}{{2γ}^{2}} {\tilde{Y}}_{\hat{x}} (\hat{f} (\overset{⌣}{x})) {\hat{L}}^{⋆} (\hat{x}) {\hat{L}}^{⋆}^{T} (\hat{x}) {\tilde{Y}}_{\hat{x}}^{T} (\hat{f} (\overset{⌣}{x})) + \\ \frac{γ^{2}}{2} {‖ w + \frac{1}{γ^{2}} g_{1}^{T} (x) {\tilde{Y}}_{x}^{T} (\hat{f} (\overset{⌣}{x})) + \frac{1}{γ^{2}} k_{21}^{T} (x) {\hat{L}}^{⋆}^{T} (\hat{x}) {\tilde{Y}}_{\hat{x}}^{T} (\hat{f} (\overset{⌣}{x})) ‖}^{2} - \frac{γ^{2}}{2} {‖ w ‖}^{2} \\ = \tilde{Y} (\overset{⌣}{x}) + \frac{1}{2} {‖ \overset{⌣}{z} ‖}^{2} - \frac{γ^{2}}{2} {‖ w ‖}^{2} + \frac{γ^{2}}{2} ‖ w + \frac{1}{γ^{2}} g_{1}^{T} (x) {\tilde{Y}}_{x}^{T} (\hat{f} (\overset{⌣}{x})) + \\ {\frac{1}{γ^{2}} k_{21}^{T} (x) {\hat{L}}^{⋆}^{T} (\hat{x}) {\tilde{Y}}_{\hat{x}}^{T} (\hat{f} (\overset{⌣}{x})) ‖}^{2} \\ \geq \tilde{Y} (\overset{⌣}{x}) + \frac{1}{2} {‖ \overset{⌣}{z} ‖}^{2} - \frac{γ^{2}}{2} {‖ w ‖}^{2} \forall \overset{⌣}{x} \in \hat{N}, \forall w \in W \end{array}$

where use has been made of the first-order Taylor-approximation, equation (12.76), and the DHJIE (12.77) in the above manipulations. The last inequality further implies that

$\tilde{Y} ({\overset{⌣}{x}}_{k + 1}) - \tilde{Y} (\overset{⌣}{x}) \leq \frac{γ^{2}}{2} {‖ w ‖}^{2} - \frac{1}{2} {‖ \overset{⌣}{z} ‖}^{2} \forall \overset{⌣}{x} \in \hat{N}, \forall w \in W$

for some $\tilde{Y} = - \tilde{Y} > 0$ , which is the infinitesimal dissipation-inequality [180]. Therefore, the system has ℓ₂-gain ≤ γ. The proof of asymptotic-stability can now be pursued along the same lines as in Proposition 12.2.1.

The proofs of items (iii)-(iv) are similar to those in Theorem 12.2.2. □

Remark 12.2.2 The benefits of the Theorem 12.2.3 can be summarized as follows. First and foremost is the benefit of the explicit solutions for computational purposes. Secondly, the approximation is reasonably accurate, as it captures a great deal of the dynamics of the system. Thirdly, it greatly simplifies the solution as it does away with extra sufficient conditions (see e.g., the conditions (12.60), (12.61) in Theorem 12.2.1). Fourthly, it opens the way also to develop an iterative procedure for solving the coupled DHJIEs.

Remark 12.2.3 In view of the coupling condition (12.76), the DHJIE can be represented as

$\begin{array}{l} \tilde{V} (\hat{f} (\overset{⌣}{x})) - \tilde{V} (\overset{⌣}{x}) - \frac{1}{γ^{2}} {\tilde{V}}_{x} (\hat{f} (\overset{⌣}{x}) g_{1} (x) g_{1}^{T} (x) {\tilde{Y}}_{x}^{T} (\hat{f} (\overset{⌣}{x})) - \frac{1}{γ^{2}} {\tilde{V}}_{\hat{x}} (\hat{f} (\overset{⌣}{x})) \hat{L} (\hat{x}) {\hat{L}}^{T} (\hat{x}) {\tilde{V}}_{x}^{T} (\hat{f} (\overset{⌣}{x})) - \\ \frac{1}{γ^{2}} {\tilde{V}}_{\hat{x}} (\hat{f} (\overset{⌣}{x})) \hat{L} (\hat{x}) {\hat{L}}^{T} (\hat{x}) {\tilde{Y}}_{\hat{x}}^{T} (\hat{f} (\overset{⌣}{x})) + \\ \frac{1}{2} {(h_{1} (x) - h_{1} (\hat{x}))}^{T} (h_{1} (x) - h_{1} (\hat{x})) = 0, \tilde{V} (0) = 0. \end{array}$

(12.79)

The result of the theorem can similarly be specialized to the linear-time-invariant (LTI) system:

$\sum^{d l} : {\begin{cases} {\dot{x}}_{k + 1} = A x_{k} + G_{1} w_{k}, x (k_{0}) = x^{0} \\ {\overset{⌣}{z}}_{k} = C_{1} (x_{k} - {\hat{x}}_{k}) \\ y_{k} = c_{2} x_{k} + D_{21} w_{k}, \end{cases}$

(12.80)

where all the variables have their previous meanings, and $F \in ℜ^{n \times n}, G_{1} \in ℜ^{n \times n}, C_{1} \in ℜ^{s \times n}, C_{2} \in ℜ^{m \times n} and D_{21} \in ℜ^{m \times r}$ are constant real matrices. We have the following corollary to Theorem 12.2.3.

Corollary 12.2.1 Consider the LTI system Σ^dl defined by (12.80) and the D M H 2 H I N L F P for it. Suppose C₁ is full column rank and A is Hurwitz. Suppose further, there exist a negative and a positive-definite real-symmetric solutions ${\hat{P}}_{1}, {\hat{P}}_{2}$ (respectively) to the coupled discrete-algebraic-Riccati equations (DAREs):

$A^{T} {\hat{P}}_{1} A - {\hat{P}}_{1} - \frac{1}{{2γ}^{2}} A^{T} {\hat{P}}_{1} G_{1} G_{1}^{T} {\hat{P}}_{1} A - \frac{1}{{2γ}^{2}} A^{T} {\hat{P}}_{1} \hat{L} {\hat{L}}^{T} {\hat{P}}_{1} A - \frac{1}{γ^{2}} A^{T} {\hat{P}}_{1} \hat{L} {\hat{L}}^{T} {\hat{P}}_{2} A - C_{1}^{T} C_{1} = 0$

(12.81)

$A^{T} {\hat{P}}_{2} A - {\hat{P}}_{2} - \frac{1}{γ^{2}} A^{T} {\hat{P}}_{2} G_{1} G_{1}^{T} {\hat{P}}_{2} A - \frac{1}{γ^{2}} A^{T} {\hat{P}}_{2} \hat{L} {\hat{L}}^{T} {\hat{P}}_{1} A - \frac{1}{γ^{2}} A^{T} {\hat{P}}_{2} \hat{L} {\hat{L}}^{T} {\hat{P}}_{2} A + C_{1}^{T} C_{1} = 0$

(12.82)

together with the coupling condition:

$A^{T} {\hat{P}}_{2} \hat{L} = - γ^{2} C_{2}^{T} .$

(12.83)

Then:

(i) there exists a Nash-equilibrium solution $({\hat{w}}_{l}^{⋆}, {\hat{L}}^{⋆})$ for the game given by

$\begin{array}{l} {\hat{w}}^{⋆} = - \frac{1}{γ^{2}} (G_{1}^{T} + D_{21}^{T} {\hat{L}}^{⋆}) {\hat{P}}_{1} A (x - \hat{x}), \\ {(x - \hat{x})}^{T} A^{T} {\hat{P}}_{2} {\hat{L}}^{⋆} = - γ^{2} {(x - \hat{x})}^{T} C_{2}^{T}; \end{array}$

(ii) the augmented system

$\sum^{d l f} : {\begin{cases} {\overset{⌣}{x}}_{k + 1} = [\begin{matrix} A & 0 \\ {\hat{L}}^{⋆} C_{2} & A - {\hat{L}}^{⋆} C_{2} \end{matrix}] {\overset{⌣}{x}}_{k} + [\begin{array}{l} G_{1} \\ {\hat{L}}^{⋆} D_{21} \end{array}] w, \overset{⌣}{x} (k_{0}) = [\begin{array}{l} x^{0} \\ {\hat{x}}^{0} \end{array}] \\ {\overset{⌣}{z}}_{k} = [C_{1} - C_{1}] {\overset{⌣}{x}}_{k} : = \overset{⌣}{C} {\overset{⌣}{x}}_{k} \end{cases}$

has $H_{∞}$ -norm from w to $\overset{⌣}{z}$ less than or equal to γ;

(iii) the optimal costs or performance objectives of the game are approximately

$J_{1}^{⋆} ({\hat{L}}^{⋆}, {\hat{w}}_{k}^{⋆}) = \frac{1}{2} {(x^{0} - {\hat{x}}^{0})}^{T} {\hat{P}}_{1} (x^{0} - {\hat{x}}^{0}) and J_{2}^{⋆} ({\hat{L}}^{⋆}, {\hat{w}}_{k}^{⋆}) = \frac{1}{2} {(x^{0} - {\hat{x}}^{0})}^{T} {\hat{P}}_{2} (x^{0} - {\hat{x}}^{0});$

the filter $ℱ$ defined by

$\sum_{l d f} : {\hat{x}}_{k + 1} = A {\hat{x}}_{k} + \hat{L} (y - C_{2} {\hat{x}}_{k}), \hat{x} (k_{0}) = {\hat{x}}^{0}$

with the gain matrix $\hat{L} = {\hat{L}}^{⋆}$ satisfying (12.83) solves the infinite-horizon D M H 2 H I N L F P for the discrete-time linear system.

Proof: Take:

$\begin{array}{l} \tilde{Y} (\overset{⌣}{x}) = \frac{1}{2} {(x - \hat{x})}^{T} {\hat{P}}_{1} (x - \hat{x}), {\hat{P}}_{1} = {\hat{P}}_{1}^{T} < 0, \\ \tilde{V} (\overset{⌣}{x}) = \frac{1}{2} {(x - \hat{x})}^{T} {\hat{P}}_{2} (x - \hat{x}), {\hat{P}}_{2} = {\hat{P}}_{2}^{T} > 0, \end{array}$

and apply the result of the theorem. □

12.2.4 Discrete-Time Certainty-Equivalent Filters (CEFs)

Again, it should be observed as in the continuous-time case Sections 12.2.1, 12.2.2 and 12.2.3, the filter gains (12.59), (12.69), (12.76) may also depend on the original state, x, of the system which is to be estimated. Therefore in this section, we develop the discrete-time counterparts of the results of Section 12.1.3.

Definition 12.2.4 For the nonlinear system (12.42), we say that it is locally zero-input observable, if for all states $x_{k}, x_{k'} \in U \subset X$ and input w(.) ≡ 0,

$y (\bar{k}, x_{k}, w) = y (\bar{k}, x_{k^{'}}, w) \Rightarrow x_{k} = x_{k^{'}}$

where y(., x,w) is the output of the system with the initial condition x(k₀) = x. Moreover, the system is said to be zero-input observable if it is locally observable at each $x_{k} \in X o r U = X$ .

We similarly consider the following class of certainty-equivalent filters:

${\sum^{˜}}^{a f} : {\begin{cases} {\hat{x}}_{k + 1} = f ({\hat{x}}_{k}) + g_{1} ({\hat{x}}_{k}) w_{k}^{⋆} + \tilde{L} ({\hat{x}}_{k}, y_{k}) [y - h_{2} ({\hat{x}}_{k}) - k_{12} ({\hat{x}}_{k}) {\tilde{w}}_{k}^{⋆}]; \\ \hat{x} (k_{0}) = {\hat{x}}^{0} \\ {\hat{z}}_{k} = h_{2} ({\hat{x}}_{k}) \\ {\tilde{z}}_{k} = y_{k} - h_{2} ({\hat{x}}_{k}), \end{cases}$

(12.84)

where $\tilde{L} (., .) \in ℳ^{n \times m}$ is the gain of the filter, ${\tilde{w}}^{⋆}$ is the estimated worst-case system noise (hence the name certainty-equivalent filter) and $\tilde{z}$ is the new penalty variable. Then, if we consider the infinite-horizon mixed $H_{2}$ / $H_{∞}$ dynamic game problem with the cost functionals (12.48), (12.49) and the above filter, we can similarly define the associated corresponding approximate Hamiltonians (as in Section 12.2.3) ${\tilde{H}}_{i} : X \times W \times Y \times ℳ^{n \times m} \times ℜ \to ℜ, i = 1, 2$ as

$\begin{array}{l} {\hat{K}}_{1} (\hat{x}, w, y, \tilde{L}, \tilde{Y}) = \tilde{Y} (\tilde{f} (\hat{x}), y) - \tilde{Y} (\hat{x}, y_{k - 1}) + {\tilde{Y}}_{\hat{x}} (\hat{x}, y) [f (\hat{x}) + g_{1} (\hat{x}) w + \\ \tilde{L} (\hat{x}, y) (y - h_{2} (\hat{x}) - k_{21} (\hat{x}) w] + \frac{1}{2} γ^{2} | | w | |^{2} - \frac{1}{2} | | \tilde{z} | |^{2} \\ {\hat{K}}_{2} (\hat{x}, w, y, \tilde{L}, \tilde{V}) = \tilde{V} (\tilde{f} (\hat{x}), y) - \tilde{V} (\hat{x}, y_{k - 1}) + {\tilde{V}}_{\hat{x}} (\hat{x}, y) [f (\hat{x}) + g_{1} (\hat{x}) w + \\ \tilde{L} (\hat{x}, y) (y - h_{2} (\hat{x}) - k_{21} (\hat{x}) w] + \frac{1}{2} | | \tilde{z} | |^{2} \end{array}$

for some smooth functions $\tilde{V}, \tilde{Y} : X \times Y \to ℜ, where \hat{x} = {\hat{x}}_{k}, w = w_{k}, y = y_{k}, \tilde{z} = {\tilde{z}}_{k}$ , and the adjoint variables are set as ${\tilde{p}}_{1} = \tilde{Y}, {\tilde{p}}_{2} = \tilde{V}$ . Then

$\begin{array}{l} {\frac{\partial {\hat{K}}_{1}}{\partial w} |}_{w = {\tilde{w}}^{⋆}} = {[g_{1} (\hat{x}) - \tilde{L} (\hat{x}, y) k_{21} (\hat{x})]}^{T} {\tilde{Y}}_{x}^{T} (\hat{f} (\hat{x}), y) + γ^{2} w = 0 \\ \Rightarrow {\tilde{w}}^{⋆} = - \frac{1}{γ^{2}} {[g_{1} (\hat{x}) - \tilde{L} (\hat{x}, y) k_{21} (\hat{x})]}^{T} {\tilde{Y}}_{x}^{T} (\hat{f} (\hat{x}), y) . \end{array}$

Consequently, repeating the steps as in Section 12.2.3 and Theorem 12.2.3, we arrive at the following result.

Theorem 12.2.4 Consider the nonlinear system (12.42) and the D M H 2 H I N L F P for it. Suppose the plant Σ^da is locally asymptotically-stable about the equilibrium point x = 0 and zero-input observable. Suppose further, there exists a pair of C¹ (with respect to the first argument) negative and positive-definite functions $\tilde{Y}, \tilde{V} : \tilde{N} \times ϒ \to ℜ$ respectively, locally defined in a neighborhood $\tilde{N} \times ϒ \subset X \times Y$ of the origin $(\hat{x}, y) = (0, 0)$ , and a matrix function $\tilde{L} : \tilde{N} \times ϒ \to ℳ^{n \times m}$ satisfying the following pair of coupled DHJIEs:

$\begin{array}{l} \tilde{Y} (\hat{f} (\hat{x}), y) - \tilde{Y} (\hat{x}, y_{k - 1}) - \frac{1}{2 γ^{2}} {\tilde{Y}}_{\hat{x}} (\hat{f} (\hat{x}), y) g_{1} (\hat{x}) g_{1}^{T} (\hat{x}) {\tilde{Y}}_{\hat{x}}^{T} (\hat{f} (\hat{x}), y) - \\ \frac{1}{2 γ^{2}} {\tilde{Y}}_{x} (\hat{x}, y) - \tilde{L} (\hat{x}, y) {\tilde{L}}^{T} (\hat{x}, y) Y_{\hat{x}}^{T} (\hat{x}, y) - \frac{1}{γ^{2}} {\tilde{Y}}_{\hat{x}} (\hat{f} (\hat{x}), y) {\tilde{L}}^{T} (\hat{x}, y) {\tilde{V}}_{\hat{x}}^{T} (\hat{f} (x), y) - \\ \frac{1}{2} {(y - h_{2} (\hat{x}))}^{T} (y - h_{2} (\hat{x})) = 0, \tilde{Y} (0, 0) = 0, \end{array}$

(12.85)

$\begin{array}{l} {\tilde{V}}_{x} (\hat{f} (\hat{x}), y) - \tilde{V} (\hat{x}, y_{k - 1}) - \frac{1}{2 γ^{2}} {\tilde{V}}_{\hat{x}} (\hat{f} (\hat{x}), y) g_{1} (\hat{x}) g_{1}^{T} (\hat{x}) {\tilde{Y}}_{\hat{x}}^{T} (\hat{f} (x), y) - \\ \frac{1}{γ^{2}} {\tilde{V}}_{\hat{x}} (\hat{f} (\hat{x}), y) \tilde{L} (\hat{x}, y) {\tilde{L}}^{T} (\hat{x}, y) {\tilde{Y}}_{\hat{x}}^{T} (\hat{f} (\hat{x}), y) - \frac{1}{γ^{2}} {\tilde{V}}_{\hat{x}} (\hat{f} (\hat{x}), y) \tilde{L} (\hat{x}, y) {\tilde{L}}^{T} (\hat{x}, y) V_{\hat{x}}^{T} (\hat{f} (\hat{x}), y) + \\ \frac{1}{2} {(y - h_{2} (\hat{x}))}^{T} (y - h_{2} (\hat{x})) = 0, \tilde{V} (0, 0) = 0 \end{array}$

(12.86)

FIGURE 12.4
Discrete-Time $H_{2}$ / $H_{∞}$ -Filter Performance with Unknown Initial Condition and ℓ₂-Bounded Disturbance; Reprinted from Int. J. of Robust and Nonlinear Control, RNC1643 (published online August) © 2010, “Discrete-time Mixed $H_{2}$ / $H_{∞}$ Nonlinear Filtering,” by M. D. S. Aliyu and E. K. Boukas, with permission from Wiley Blackwell.

$\hat{x} \in \tilde{N}, y \in ϒ$ , together with the coupling condition

${\tilde{V}}_{\hat{x}} (\hat{f} (\hat{x}), y) \tilde{L} (\hat{x}, y) = - γ^{2} {(y - h_{2} (\hat{x}))}^{T}, \hat{x} \in \tilde{N} y \in ϒ .$

(12.87)

Then the filter ${\tilde{Σ}}^{a f}$ with the gain matrix $\tilde{L} (\hat{x}, y)$ satisfying (12.87) solves the infinite-horizon D M H 2 H I N L F P for the system locally in $\tilde{N}$ .

Proof: Follows the same lines as Theorem 12.2.3. □

Remark 12.2.4 Comparing the HJIEs (12.85)-(12.86) with (12.77)-(12.78) reveals that they are similar, but we have gained by reducing the dimensionality of the PDEs, and more importantly, the filter gain does not depend on x any longer.

12.3 Example

We consider a simple example to illustrate the result of the previous section.

Example 12.3.1 We consider the following scalar system

$\begin{array}{l} x_{k + 1} = x_{k}^{\frac{1}{5}} + x_{k}^{\frac{1}{3}} \\ y_{k} = x_{k} + w_{k} \end{array}$

where $w_{k} = e^{- 0.3 k} \sin (0.25 π k)$ is an ℓ₂ -bounded disturbance.

Approximate solutions of the coupled DHJIEs (12.85) and (12.86) can be calculated using an iterative approach. With γ= 1, and g₁ (x) = 0, we can rewrite the coupled DHJIEs as j = 0, 1,…. Then, with the initial filter gain l⁰ = 1, initial guess for solutions as ${\tilde{Y}}^{0} (\hat{x}, y) = - \frac{1}{2} ({\hat{x}}^{2} + y^{2}), {\tilde{V}}^{0} (\hat{x}, y) = \frac{1}{2} ({\hat{x}}^{2} + y^{2})$ respectively, we perform one iteration of the above recursive equations (12.88), (12.89) to get

${\tilde{Y}}^{j + 1} (\hat{x}, y) ≜ {\tilde{Y}}^{j} (\hat{x}, y_{k - 1}) + \frac{1}{2} {(y - x)}^{2} = {\tilde{Y}}^{j} (\hat{f}, y) - \frac{1}{2} {\tilde{Y}}_{\hat{x}}^{j^{2}} (\hat{f}, y) l^{^{j^{2}}} - {\tilde{Y}}_{\hat{x}}^{j} (\hat{f}, y) {\tilde{V}}_{\hat{x}}^{j} (\hat{f}, y) l^{^{j^{2}}},$

(12.88)

${\tilde{V}}^{j + 1} (\hat{x}, y) ≜ {\tilde{V}}^{j} (\hat{x}, y_{k - 1}) - \frac{1}{2} {(y - x)}^{2} = {\tilde{V}}^{j} (\hat{f}, y) - \frac{1}{2} {\tilde{V}}_{\hat{x}}^{j^{2}} (\hat{f}, y) l^{^{j^{2}}} - {\tilde{Y}}_{\hat{x}}^{j} (\hat{f}, y) {\tilde{V}}_{\hat{x}}^{j} (\hat{f}, y) l^{^{j^{2}}},$

(12.89)

FIGURE 12.5
Extended-Kalman-Filter Performance with Unknown Initial Condition and ℓ₂-Bounded Disturbance; Reprinted from Int. J. of Robust and Nonlinear Control, RNC1643 (published online August) © 2010, “Discrete-time mixed $H_{2}$ / $H_{∞}$ nonlinear filtering,” by M. D. S. Aliyu and E. K. Boukas, with permission from Wiley Blackwell.

$\begin{array}{l} \tilde{Y} = - \frac{1}{2} [({\hat{x}}^{1 / 5} + {\hat{x}}^{1 / 3}) + y^{2}] + \frac{1}{2} {({\hat{x}}^{1 / 5} + {\hat{x}}^{1 / 3})}^{2}, \\ {\tilde{V}}^{1} = \frac{1}{2} [{({\hat{x}}^{1 / 5} + {\hat{x}}^{1 / 3})}^{2} + y^{2}] . \end{array}$

Therefore,

$\begin{array}{l} {\tilde{Y}}^{1} (\hat{x}, y_{k - 1}) = - \frac{1}{2} {(y - x)}^{2} - \frac{1}{2} y^{2}, \\ \tilde{V} (\hat{x}, y_{k - 1}) = \frac{1}{2} {(y - x)}^{2} + \frac{1}{2} [{({\hat{x}}^{1 / 5} + {\hat{x}}^{1 / 3})}^{2} + y^{2}] . \end{array}$

We can use the above approximate solution ${\tilde{V}}^{1} (\hat{x}, y_{k - 1})$ to the DHJIE to estimate the states of the system, since the gain of the filter depends only on ${\tilde{V}}_{\hat{x}}^{1} (\hat{x}, y)$ . Consequently, we can compute the filter gain as

$l (x_{k}, y_{k}) \approx - \frac{(y_{k} - x_{k})}{y_{k} - x_{k}^{\frac{1}{2}} - x_{k}^{\frac{1}{3}}} .$

(12.90)

The result of simulation with this filter is then shown in Figure 12.4. We also compare this result with that of an extended-Kalman filter for the same system shown in Figure 12.5. It can clearly be seen that, the mixed $H_{2}$ / $H_{\infty}$ filter performance is superior to that of the EKF.

12.4 Notes and Bibliography

This chapter is entirely based on the authors’ contributions [18, 19, 21]. The reader is referred to these references for more details.

1 A second-order Taylor series approximation would be more accurate, but the first-order method gives a solution that is very close to the continuous-time case.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.

Table of Contents for 12.2.1 Solution to the Finite-Horizon Discrete-Time Mixed H∞2/H∞∞ Nonlinear Filtering Problem

Create new playlist

Sign In

Sign Up

Table of Contents for
12.2.1 Solution to the Finite-Horizon Discrete-Time Mixed H∞2/H∞∞ Nonlinear Filtering Problem