11.2 Discrete-Time Mixed H∞2/H∞∞ Nonlinear Control

Search in book...
Toggle Font Controls
Create new playlist

Name your new playlist

Playlist description (optional)
Sign In

Email address

Password

Forgot Password?

or

Continue with Facebook

Continue with Google
Sign Up

Full Name

Email address

Confirm Email Address

Password

or

Continue with Facebook

Continue with Google

In this section, we discuss the state-feedback mixed H2 $H_{2}$ /H∞ $H_{∞}$ -control problem for discrete-time nonlinear systems. We begin with an affine discrete-time state-space model defined on X⊂Rn $X \subset ℜ^{n}$ in coordinates (x₁,…,x_n) :

∑da : ⎧⎩⎨⎪⎪x˙k+1 = f(xk)+g1(xk)wk+g2(xk)uk; x(k0) =x0 zk = h1(xk)+k12(xk)uk yk = xk , $\sum^{d a} : {\begin{cases} {\dot{x}}_{k + 1} = f (x_{k}) + g_{1} (x_{k}) w_{k} + g_{2} (x_{k}) u_{k}; x (k_{0}) = x^{0} \\ z_{k} = h_{1} (x_{k}) + k_{12} (x_{k}) u_{k} \\ y_{k} = x_{k}, \end{cases}$

(11.29)

where all the variables and system matrices have their previous meanings and dimensions respectively. We assume similarly that the system has a unique equilibrium at x = 0, and is such that f(0) = 0 and h₁0) = 0. For simplicity, we similarly also assume the following hold for the system matrices.

Assumption 11.2.1 The system matrices are such that

hT1(x)k12(x) = 0,kT12(x)k12(x) = I. } $\begin{array}{l} h_{1}^{T} (x) k_{12} (x) = 0, \\ k_{12}^{T} (x) k_{12} (x) = I . \end{array}}$

(11.30)

Again, as in the continuous-time case, the standard problem is to design a static state-feedback controller, K^da, such that the ℌ₂-norm of the closed-loop system which is defined as

∥∥Kda ο ∑da∥∥ℓ2 ≜ sup0≠w0∈S∥z∥P∥w0∥S′ , ${‖ K^{d a} ο \sum^{d a} ‖}_{ℓ_{2}} ≜ \sup_{0 \neq w_{0} \in S} \frac{‖ z ‖ P}{‖ w_{0} ‖ S^{'}},$

and the H∞ $H_{\infty}$ -norm of the system defined by

∥∥Kda ο ∑da∥∥ℓ∞ ≜ sup0≠w1∈P′∥z∥2∥w1∥2 , ${‖ K^{d a} ο \sum^{d a} ‖}_{ℓ_{\infty}} ≜ \sup_{0 \neq w_{1} \in P^{'}} \frac{{‖ z ‖}_{2}}{{‖ w_{1} ‖}_{2}},$

are minimized over a time horizon [k₀, K] ⊂ Z, where

P′ ≜ {w: w∈ℓ∞, Rww(k),Sww(jω) exist for all k and all ω resp., ∥w∥P′ <∞} $\begin{array}{l} P^{'} ≜ {w : w \in ℓ_{\infty}, R_{w w} (k), S_{w w} (j ω) exist for all k and all ω resp ., \\ ‖ w ‖ P^{'} < \infty} \end{array}$

S′ ≜ {w: w∈ℓ∞, Rww(k),Sww(jω) exist for all k and all ω resp., ∥Sww(jω)∥∞ < ∞} $\begin{array}{l} S^{'} ≜ {w : w \in ℓ_{\infty}, R_{w w} (k), S_{w w} (j ω) exist for all k and all ω resp ., \\ {‖ S_{w w} (j ω) ‖}_{\infty} < \infty} \end{array}$

∥z∥2P′ ≜ limk→∞ 12K∑k=−KK∥zk∥2∥w0∥2S′ = ∥Sw0w0(jw)∥∞ $\begin{array}{l} {‖ z ‖}_{P^{'}}^{2} ≜ \lim_{k \to \infty} \frac{1}{2 K} \sum_{k = - K}^{K} {‖ z_{k} ‖}^{2} \\ {‖ w_{0} ‖}_{S^{'}}^{2} = {‖ S_{w_{0} w_{0}} (j w) ‖}_{\infty} \end{array}$

and Rww,Sww(jω)) $R_{w w}, S_{w w} (j ω))$ are the autocorrelation and power spectral density matrices of w [152]. Notice also that ∥(.)∥P′ and ∥(.)∥S′ ${‖ (.) ‖}_{P^{'}} and {‖ (.) ‖}_{S^{'}}$ are seminorms. The spaces P′ and S′ $P^{'} and S^{'}$ are the discrete-time spaces of bounded-power and bounded-spectral signals respectively.

However, we do not solve the above standard problem in this section. Instead, we solve an associated suboptimal problem in which w ∈ ℓ₂[k₀, ∞) is a single disturbance, and the objective is to minimize the output energy ∥z∥ℓ2 ${‖ z ‖}_{ℓ_{2}}$ subject to the constraint that ∥∥∑da∥∥ℓ∞ ≤ γ⋆ ${‖ \sum^{d a} ‖}_{ℓ_{\infty}} \leq γ^{⋆}$ for some number γ⋆ > 0 $γ^{⋆} > 0$ . The problem is more formally defined as follows.

Definition 11.2.1 (Discrete-Time State-Feedback Mixed H₂ / H∞ $H_{∞}$ Nonlinear Control Problem (DSFBMH2HINLCP)).

(A) Finite-Horizon Problem (K < ∞): Find (if possible!) a time-varying static state-feedback control law of the form:

u=α˜2d(k,xk) , α˜2d(k,0)=0, k∈Z $u = {\tilde{α}}_{2 d} (k, x_{k}), {\tilde{α}}_{2 d} (k, 0) = 0, k \in Z$

such that:

(a) the closed-loop system

∑clda : {xk+1 = f(xk)+g1(xk)wk+g2(xk)α˜2d(xk) zk = h1(xk)+k21(xk)α˜2d(xk) $\sum^{c l d a} : {\begin{cases} x_{k + 1} = f (x_{k}) + g_{1} (x_{k}) w_{k} + g_{2} (x_{k}) {\tilde{α}}_{2 d} (x_{k}) \\ z_{k} = h_{1} (x_{k}) + k_{21} (x_{k}) {\tilde{α}}_{2 d} (x_{k}) \end{cases}$

(11.31)

is stable with w = 0 and has locally finite ℓ₂-gain from w to z less or equal to γ^⋆, starting from x⁰ = 0, for all k ∈ [k₀, K] and for a given number γ^⋆ > 0.

(b) the output energy ∥z∥ℓ2 ${‖ z ‖}_{ℓ_{2}}$ of the system is minimized for all disturbances w∈W⊂ℓ2[k0,∞) $w \in W \subset ℓ_{2} [k_{0}, \infty)$ .

(B) Infinite-Horizon Problem (K → ∞ ): In addition to the items (a) and (b) above, it is also required that

(c) the closed-loop system Σ^clda defined above with w ≡ 0 is locally asymptotically-stable about the equilibrium-point x = 0.

The problem is similarly formulated as a two-player nonzero-sum differential game with two cost functionals:

minu∈U , w∈W J1d(u,w) = 12∑k=k0K(γ2 ∥wk∥2− ∥zk∥2) $\min_{u \in U, w \in W} J_{1 d} (u, w) = \frac{1}{2} \sum_{k = k_{0}}^{K} (γ^{2} {‖ w_{k} ‖}^{2} - {‖ z_{k} ‖}^{2})$

(11.32)

minu∈U , w∈W J2d(u,w) = 12∑k=k0K∥zk∥2. $\min_{u \in U, w \in W} J_{2 d} (u, w) = \frac{1}{2} \sum_{k = k_{0}}^{K} {‖ z_{k} ‖}^{2} .$

(11.33)

Again, sufficient conditions for the solvability of the above dynamic game (11.32), (11.33), (11.29), and the existence of Nash-equilibrium strategies are given by the following pair of discrete-time Hamilton-Jacobi-Isaac’s difference equations (DHJIEs):

$\begin{array}{l} W (x, k) = \inf_{w} {W (f (x) + g_{1} (x) w + g_{2} (x) u^{⋆}, k + 1) + \frac{1}{2} (γ^{2} {‖ w_{k} ‖}^{2} - {‖ z_{k}^{⋆} ‖}^{2})}, \\ W (x, K + 1) = 0, \end{array}$

(11.34)

$\begin{array}{l} U (x, k) = \inf_{u} {U (f (x) + g_{1} (x) w^{⋆} + g_{2} (x) u, k + 1) + \frac{1}{2} {‖ z_{k}^{⋆} ‖}^{2}}, \\ U (x, K + 1) = 0, \end{array}$

(11.35)

for some smooth negative and positive-definite functions $W, U : X \times Z \to ℜ$ respectively, and where $z_{k}^{⋆} = h_{1} (x) + k_{12} (x) u^{⋆} (x)$ .

To solve the problem, we define the Hamiltonian functions $H_{i} : X \times W \times U \to ℜ \to ℜ$ , i = 1,2 corresponding to the cost functionals (11.32), (11.32) respectively and the system equations (11.29):

$\begin{array}{l} H_{1} (x, w, u, W) = W (f (x) + g_{1} (x) w + g_{2} (x) u, k + 1) - W (x) + \\ \frac{1}{2} (γ^{2} {‖ w ‖}^{2} - ‖ z_{k} ‖), \end{array}$

(11.36)

$\begin{array}{l} H_{2} (x, w, u, U) = U (f (x) + g_{1} (x) w + g_{2} (x) u, k + 1) - U (x) + \\ \frac{1}{2} {‖ z_{k} ‖}^{2} . \end{array}$

(11.37)

Similarly, as in Chapter 8, let

$\begin{array}{l} \partial^{2} H (x) ≜ [\begin{matrix} \frac{\partial^{2} H_{2}}{\partial u^{2}} & \frac{\partial^{2} H_{2}}{\partial w \partial u} \\ \frac{\partial^{2} H_{1}}{\partial u \partial w} & \frac{\partial^{2} H_{1}}{\partial w^{2}} \end{matrix}] (x) ≜ [\begin{matrix} r_{11} (x) & r_{12} (x) \\ r_{21} (x) & r_{22} (x) \end{matrix}] \\ F^{⋆} (x) ≜ f (x) + g_{1} (x) w^{⋆} (x) + g_{2} (x) u^{⋆} (x), \end{array}$

(11.38)

and therefore

$\begin{array}{l} r_{11} (x) = g_{2}^{T} (x) {\frac{\partial^{2} U}{\partial λ^{2}} |}_{λ = F^{⋆} (x)} g_{2} (x) + I \\ r_{12} (x) = g_{2}^{T} (x) {\frac{\partial^{2} U}{\partial λ^{2}} |}_{λ = F^{⋆} (x)} g_{1} (x) \\ r_{21} (x) = g_{1}^{T} (x) {\frac{\partial^{2} W}{\partial λ^{2}} |}_{λ = F^{⋆} (x)} g_{2} (x) \\ r_{22} (x) = γ^{2} I + g_{1}^{T} (x) {\frac{\partial^{2} E W}{\partial λ^{2}} |}_{λ = F^{⋆} (x)} g_{1} (x) . \end{array}}$

(11.39)

The following theorem then presents sufficient conditions for the solvability of the finite-horizon problem.

Theorem 11.2.1 Consider the discrete-time nonlinear system (11.29), and the finite-horizon DSFBMH2HINLCP with cost functionals (11.32), (11.33). Suppose there exists a pair of negative and positive-definite C² (with respect to the first argument)-functions $W, U : M \times Z \to ℜ$ locally defined in a neighborhood M of the origin x = 0, such that W(0,k) = 0 and U(0,k) = 0, and satisfying the coupled DHJIEs:

$\begin{array}{l} W (x, k) = W (f (x) + g_{1} (x) w^{⋆} + g_{2} (x) u^{⋆}, k + 1) + \frac{1}{2} (γ^{2} {‖ w ‖}^{2} - {‖ z_{k}^{⋆} ‖}^{2}), \\ W (x, K + 1) = 0, \end{array}$

(11.40)

$\begin{array}{l} U (x, k) = U (f (x) + g_{1} (x) w^{⋆} + g_{2} (x) u^{⋆}, k + 1) + \frac{1}{2} {‖ z_{k}^{⋆} ‖}^{2}, \\ U (x, K + 1) = 0, \end{array}$

(11.41)

together with the conditions

$r_{22} (0) > 0, \det [r_{11} (0) - r_{22}^{- 1} (0) r_{21} (0)] \neq 0.$

(11.42)

Then the state-feedback controls defined implicitly by

$w^{⋆} = - g_{1}^{T} (x) \frac{1}{γ^{2}} {\frac{\partial W}{\partial λ} |}_{λ= f (x) + g_{1} (x) w^{⋆} + g_{2} (x) u^{⋆}}$

(11.43)

$u^{⋆} = - g_{2}^{T} (x) \frac{1}{γ^{2}} {\frac{\partial U}{\partial λ} |}_{λ= f (x) + g_{1} (x) w^{⋆} + g_{2} (x) u^{⋆}}$

(11.44)

solve the finite-horizon DSFBMH2HINLCP for the system. Moreover, the optimal costs are given by

$J_{1 d}^{⋆} (u^{⋆}, w^{⋆}) = W (k_{0}, x_{0}),$

(11.45)

$J_{2 d}^{⋆} (u^{⋆}, w^{⋆}) = U (k_{0}, x_{0}) .$

(11.46)

Proof: We prove item (a) in Definition 11.2.1 first. Assume there exist solutions W < 0, U > 0 of the DHJIEs (11.40), (11.41), and consider the Hamiltonian functions H₁(.,.,.), H₂(.,.,.). Applying the necessary conditions for optimality

$\frac{\partial H_{1}}{\partial w} (x, u, w) = 0, \frac{\partial H_{2}}{\partial u} (x, u, w) = 0,$

and solving these for w^⋆, u^* respectively, we get the Nash-equilibrium strategies (11.43), (11.44). Moreover, if the conditions (11.42) are satisfied, then the matrix

$\begin{array}{l} {\partial^{2} H |}_{w = 0, u = 0} (0) = [\begin{matrix} r_{11} (0) & r_{12} (0) \\ r_{21} (0) & r_{22} (0) \end{matrix}] = \\ [\begin{matrix} I & r_{12} (0) r_{22}^{- 1} (0) \\ 0 & I \end{matrix}] [\begin{matrix} r_{11} (0) - r_{12} (0) r_{22}^{- 1} (0) r_{21} (0) & 0 \\ 0 & r_{22} (0) \end{matrix}] [\begin{matrix} I & 0 \\ r_{22}^{- 1} (0) r_{21} (0) & I \end{matrix}] \end{array}$

is nonsingular. Therefore, by the Implicit-function Theorem, there exist open neighborhoods X₁ of x = 0, W₁ of w = 0 and U₁ of u = 0, such that the equations (11.43), (11.44) have unique solutions.

Now suppose, (u^*, w^⋆ ) have been obtained from (11.43), (11.44), then subsituting in the DHJIEs (11.34), (11.35) yield the DHJIEs (11.40), (11.41). Moreover, by Taylor-series expansion, we can write

$\begin{array}{l} H_{1} (x, w, u^{⋆} (x), W) = H_{1} (x, w^{⋆} (x) . u^{⋆} (x), W) + \frac{1}{2} {(w - w^{⋆} (x))}^{T} [r_{22} (x) + \\ O (‖ w - w^{⋆} (x) ‖)] (w - w^{⋆} (x)) . \end{array}$

In addition, since r₂₂(0) > 0 implies r₂₂(x) > 0 for all x in a neighborhood X₂ of x = 0 by the Inverse-function Theorem [234], it then follows from above that there exists also a neighborhood W₂ of w = 0 such that

$\begin{array}{l} H_{1} (x, w, u^{⋆} (x), W) \geq H_{1} (x, w^{⋆} (x), u^{⋆} (x), W) = 0 \forall x \in X_{2}, \forall w \in W_{2}, \\ \Leftrightarrow W (f (x) + g_{1} (x) w + g_{2} (x) u^{⋆} (x), k + 1) - W (x, k) + \frac{1}{2} (γ^{2} {‖ w ‖}^{2} - {‖ u^{⋆} (x) ‖}^{2} - {‖ h_{1} ‖}^{2}) \geq 0 \\ \forall x \in X_{2}, \forall w \in W_{2} . \end{array}$

Setting now w = 0, we have

$\tilde{W} (f (x) + g_{2} (x) u^{⋆} (x), k + 1) - \tilde{W} (x, k) \leq - \frac{1}{2} {‖ u^{⋆} (x) ‖}^{2} - \frac{1}{2} {‖ h_{1} (x) ‖}^{2} \leq 0$

for some function $\tilde{W}$ = − W > 0. Hence, the closed-loop system is Lyapunov-stable. To prove item (b), consider the Hamiltonian function H₂ (., w^⋆, ., .) and expand it in Taylor’sseries:

$H_{2} (x, w^{⋆}, u, U) = H_{2} (x, w^{⋆}, u^{⋆}, U) + \frac{1}{2} {(u - u^{⋆})}^{T} [r_{11} (x) + O (‖ u - u^{⋆} ‖)] (u - u^{⋆}) .$

Since

$r_{11} (0) = I + g_{2}^{T} (0) \frac{\partial^{2} W}{\partial λ^{2}} (0) g_{2} (0) \geq I,$

again there exists a neighborhood ${\tilde{X}}_{2}$ of x = 0 such that r₁₁(x) > 0 by the Inverse-function Theorem. Therefore,

$H_{2} (x, w^{⋆}, u, U) \geq H_{2} (x, w^{⋆}, u^{⋆}, U) = 0 \forall u \in U$

and the H₂-cost is minimized.

Finally, we determine the optimal costs of the strategies. For this, consider the cost functional J_1d(u^*, w^⋆ ) and write it as

$\begin{array}{l} J_{1 d} (u, w) + W (x_{k + 1}, K + 1) - W (x_{k_{0}}, k_{0}) = \sum_{k = k_{0}}^{K} {\frac{1}{2} (γ^{2} {‖ w_{k}^{⋆} ‖}^{2} - {‖ z_{k}^{⋆} ‖}^{2}) + \\ W (x_{k + 1}, K + 1) - W (x_{k}, k)} \\ = \sum_{k = k_{0}}^{K} H_{1} (x, w^{⋆}, u^{⋆}, W) = 0. \end{array}$

Since $W (x_{k + 1}, K + 1) = 0$ , we have the result. Similarly, for J₂(w^⋆, u^* ), we have

$\begin{array}{l} J_{2 d} (u, w) + U (x_{k + 1}, K + 1) - U (x_{k_{0}}, k_{0}) = \sum_{k = k_{0}}^{K} {\frac{1}{2} {‖ z_{k}^{⋆} ‖}^{2} + U (x_{k + 1}, K + 1) - U (x_{k}, k)} \\ = \sum_{k = k_{0}}^{K} H_{2} (x, w^{⋆}, u^{⋆}, U) = 0. \end{array}$

and since U(x_K+1, K + 1) = 0, the result also follows. □

The above result can be specialized to the linear discrete-time system

$\sum^{d l} : {\begin{cases} {\dot{x}}_{k + 1} = A x_{1} + B_{1} w_{k} + B_{2} u_{k} \\ z_{k} = C_{1} x_{k} + D_{12} w_{k} \\ y_{k} = x_{k} \end{cases}$

(11.47)

where all the variables and matrices have their previous meanings and dimensions. Then we have the following corollary to Theorem 11.2.1.

Corollary 11.2.1 Consider the linear system Σ^dl under the Assumption 11.1.2. Suppose there exist P_1,k < 0 and P_2,k > 0 symmetric solutions of the cross-coupled discrete-Riccati difference-equations (DRDEs):

$\begin{array}{l} P_{1, k} = A^{T} {P_{1, k} - 2 P_{1, k} B_{1} B_{γ, k}^{- 1} Γ_{1, k} - 2 P_{1, k} B_{2} Λ_{k}^{- 1} B_{2}^{T} P_{2, k} + \\ 2 Γ_{1, k}^{T} B_{γ, k}^{- T} B_{1} P_{1} B_{2} Λ_{k}^{- 1} B_{2} P_{2, k} Γ_{2, k} + Γ_{1, k}^{T} B_{γ, k}^{- 1} B_{1} P_{1} B_{1} B_{γ, k}^{- 1} Γ_{1, k} + \\ Γ_{2, k}^{T} P_{2} B_{2} Λ_{k}^{- T} B_{2} P_{1, k} B_{2} Λ_{k}^{- 1} B_{2}^{T} P_{2, k} Γ_{2, k}^{T} + γ^{2} Γ_{1, k}^{T} B_{γ, k}^{- T} B_{γ, k}^{- 1} Γ_{1, k}^{T} - \\ Γ_{2, k}^{T} P_{2, k} B_{2} Λ_{k}^{- T} Λ_{k}^{- 1} B_{2}^{T} P_{2} Γ_{2, k}} A - C_{1}^{T} C_{1}, P_{1, Κ} = 0 \end{array}$

(11.48)

$\begin{array}{l} P_{2, k} = A^{T} {P_{2, k} - 2 P_{2, k} B_{1} B_{γ, k}^{- 1} Γ_{1, k} - 2 P_{2, k} B_{2} Λ_{k}^{- 1} B_{2}^{T} P_{2, k} Γ_{2, k} + \\ 2 Γ_{1, k}^{T} B_{γ, k}^{- T} B_{1} P_{2, k} B_{2} Λ_{k}^{- 1} B_{2, k}^{T} P_{2, k} Γ_{2, k} + \\ Γ_{2, k}^{T} P_{2, k} B_{2} Λ_{k}^{- T} Λ_{k}^{- 1} B_{2}^{T} P_{2, k} Γ_{2, k}} A + C_{1}^{T} C_{1}, P_{2, Κ} = 0 \end{array}$

(11.49)

$B_{γ,k} : = [γ^{2} I - B_{2} Λ_{k}^{- 1} B_{2}^{T} P_{2, k} B_{1} + B_{1}^{T} P_{1, k} B_{1}] > 0$

(11.50)

for all k in [k₀, K]. Then, the Nash-equilibrium strategies uniquely specified by

$w_{l, k}^{⋆} = - B_{γ,k}^{- 1} Γ_{1, k} A x_{k}, k \in [k_{0}, K]$

(11.51)

$u_{l, k}^{⋆} = - Λ_{k}^{- 1} B_{2}^{T} P_{2, k} Γ_{2, k} A x_{k}, k \in [k_{0}, K]$

(11.52)

where

$\begin{array}{l} Λ_{k} : = (I + B_{2}^{T} P_{2, k} B_{2}), \forall k, \\ Γ_{1, k} : = [B_{1}^{T} P_{1, k} - B_{2} Λ_{k}^{- 1} B_{2}^{T} P_{2, k}] \forall k, \\ Γ_{2, k} : = [I - B_{1} B_{γ,k}^{- 1} (B_{1}^{T} P_{1, k} - B_{2} Λ_{k}^{- 1} B_{2}^{T} P_{2, k})] \forall k, \end{array}$

solve the finite-horizon DSFBMH2HINLCP for the system. Moreover, the optimal costs for the game are given by

$J_{1, l} (u^{⋆}, w^{⋆}) = \frac{1}{2} x_{k_{0}}^{T} P_{1, k_{0}} x_{k_{0}},$

(11.53)

$J_{2, l} (u^{⋆}, w^{⋆}) = \frac{1}{2} x_{k_{0}}^{T} P_{2, k_{0}} x_{k_{0}} .$

(11.54)

Proof: Assume the solutions to the coupled HJIEs are of the form,

$W (x_{k}, k) = \frac{1}{2} x_{k}^{T} P_{1, k} x_{k}, P_{1, k} < 0, k = 1, …, K,$

(11.55)

$U (x_{k}, k) = \frac{1}{2} x_{k}^{T} P_{2, k} x_{k}, P_{2, k} > 0, k = 1, …, K .$

(11.56)

Then, the Hamiltonians H₁(., ., ., .), H₂(., ., ., .) are given by

$\begin{array}{l} H_{1, l} (x, w, u, W) = \frac{1}{2} {(A x + B_{1} w + B_{2} u)}^{T} P_{1, k} (A x + B_{1} w + B_{2} u) - \\ \frac{1}{2} x^{T} p_{1, k} x + \frac{1}{2} γ^{2} {‖ w ‖}^{2} - \frac{1}{2} {‖ z ‖}^{2}, \end{array}$

$\begin{array}{l} H_{2, l} (x, w, u, U) = \frac{1}{2} {(A x + B_{1} w + B_{2} u)}^{T} P_{2, k} (A x + B_{1} w + B_{2} u) - \\ \frac{1}{2} x^{T} p_{2, k} x + \frac{1}{2} {‖ z ‖}^{2} . \end{array}$

Applying the necessary conditions for optimality, we get

$\frac{\partial H_{1, l}}{\partial w} = B_{1}^{T} P_{1, k} (A x + B_{1} w + B_{2} u) + γ^{2} w = 0,$

(11.57)

$\frac{\partial H_{2, l}}{\partial u} = B_{2}^{T} P_{2, k} (A x + B_{1} w + B_{2} u) + u = 0.$

(11.58)

Solving the last equation for u we have

$u = - {(I + B_{2}^{T} P_{2, k} B_{2})}^{- 1} B_{2}^{T} P_{2, k} (A x + B_{1} w),$

which upon substitution in the first equation gives

$\begin{array}{l} B_{1}^{T} P_{1, k} {A x + B_{1} w - B_{2} {(I + B_{2}^{T} P_{2, k} B_{2})}^{- 1} B_{2}^{T} P_{2, k} (A x + B_{1} w)} + γ^{2} w = 0 \\ \Leftrightarrow w_{l, k}^{⋆} = - B_{γ,k}^{- 1} [B_{1}^{T} P_{1, k} - B_{2} Λ_{k}^{- 1} B_{2}^{T} P_{2, k}] A x_{k} \\ = - B_{γ,k}^{- 1} Γ_{1, k} A x_{k} k \in [k_{0}, K], \end{array}$

(11.59)

if and only if

$B_{γ,k} : = [γ^{2} I - B_{2} Λ_{k}^{- 1} B_{2}^{T} P_{2, k} B_{1} + B_{1}^{T} P_{1, k} B_{1}] > 0 \forall k,$

where

$\begin{array}{l} Λ_{k} : = (I + B_{2}^{T} P_{2, k} B_{2}), \forall k, \\ Γ_{1, k} : = [B_{1}^{T} P_{1, k} - B_{2} Λ_{k}^{- 1} B_{2}^{T} P_{2, k}] \forall k . \end{array}$

Notice that Λ_k is nonsingular for all k since P_2,k is positive-definite. Now, substitute w^⋆ in the expression for u to get

$\begin{array}{l} u_{l, k}^{⋆} = Λ_{k}^{- 1} B_{2}^{T} P_{2, k} [I - B_{1} B_{γ,k}^{- 1} (B_{1}^{T} P_{1, k} - B_{2} Λ_{k}^{- 1} B_{2}^{T} P_{2, k})] A x_{k}, k \in [k_{0}, K] \\ = - Λ_{k}^{- 1} B_{2}^{T} P_{2, k} Γ_{2, k} A x_{k} \end{array}$

(11.60)

where

$Γ_{2, k} : = [I - B_{1} B_{γ,k}^{- 1} (B_{1}^{T} P_{1, k} - B_{2} Λ_{k}^{- 1} B_{2}^{T} P_{2, k})] .$

Finally, substituting $(u_{l, k}^{⋆}, w_{l, k}^{⋆})$ in the DHJIEs (11.40), (11.41), we get the DRDEs (11.48), (11.49). The optimal costs are also obtained by substitution in (11.45), (11.46). □

Remark 11.2.1 Note, in the above Corollary 11.2.1 for the solution of the linear discrete-time problem, it is better to consider strictly positive-definite solutions of the DRDEs (11.48), (11.49) because the condition B_γ,k > 0 must be respected for all k.

11.2.1 The Infinite-Horizon Problem

In this subsection, we consider similarly the infinite-horizon DSFBMH2HINLCP for the affine discrete-time nonlinear system Σ^da. We let K → ∞, and seek time-invariant functions and feedback gains that solve the DSFBMH2HINLCP. Again we require that the closed-loop system be locally asymptotically-stable, and for this, we need the following definition of detectability for the discrete-time system Σ^da.

Definition 11.2.2 The pair {f,h} is said to be locally zero-state detectable if there exists a neighborhood $\tilde{O}$ of x = 0 such that, if x_k is a trajectory of x_k+1 = f(x_k) satisfying $x (k_{0}) \in \tilde{O}$ , then h(x_k) is defined for all k ≥ k₀, and h(x_k) = 0 for all k ≥ k_s, implies $\lim_{k \to \infty} x_{k} = 0$ . Moreover {f,h} is said to be zero-state detectable if $\tilde{O} = X$ .

Theorem 11.2.2 Consider the nonlinear system Σ^da defined by (11.29) and the infinite-horizon DSFBMH2HINLCP with cost functionals (11.32), (11.8). Suppose

(H1) the pair {f, h₁ } is zero-state detectable;

(H2) there exists a pair of negative and positive-definite C²-functions $\tilde{W}, \tilde{U} : \tilde{M} \times Z \to ℜ$ locally defined in a neighborhood $\tilde{M}$ of the origin x = 0, such that $\tilde{W} (0) = 0 and \tilde{U} (0) = 0$ , and satisfying the coupled DHJIEs:

$\tilde{W} (f (x) + g_{1} (x) w^{⋆} + g_{2} (x) u^{⋆}) - \tilde{W} (x) + \frac{1}{2} (γ^{2} {‖ w ‖}^{2} - {‖ z_{k}^{⋆} ‖}^{2}) = 0, W (0) = 0,$

(11.61)

$\tilde{U} (f (x) + g_{1} (x) w^{⋆} + g_{2} (x) u^{⋆}) - \tilde{U} (x) + \frac{1}{2} {‖ z_{k}^{⋆} ‖}^{2} = 0, U (0) = 0.$

(11.62)

together with the conditions

$r_{22} (0) > 0, \det [r_{11} (0) - r_{22}^{- 1} (0) r_{21} (0)] \neq 0.$

(11.63)

Then, the state-feedback controls defined implicitly by

$w^{⋆} = - g_{1}^{T} (x) \frac{1}{γ^{2}} {\frac{\partial \tilde{W}}{\partial λ} |}_{λ= f (x) + g_{1} (x) w^{⋆} + g_{2} (x) u^{⋆}}$

(11.64)

$u^{⋆} = - g_{2}^{T} (x) {\frac{\partial \tilde{U}}{\partial λ} |}_{λ= f (x) + g_{1} (x) w^{⋆} + g_{2} (x) u^{⋆}}$

(11.65)

solve the infinite-horizon DSFBMH2HINLCP for the system. Moreover, the optimal costs are given by

$J_{1 d}^{⋆} (u^{⋆}, w^{⋆}) = \tilde{W} (x_{0}),$

(11.66)

$J_{2 d}^{⋆} (u^{⋆}, w^{⋆}) = \tilde{U} (x_{0}) .$

(11.67)

Proof: We only prove item (c) in the definition, since the proofs of items (a) and (b) are exactly similar to the finite-horizon problem. Accordingly, using similar manipulations as in the proof of item (a) of Theorem 11.2.1, it can be shown that with w ≡ 0,

$\tilde{W} (f (x) + g_{2} (x) u^{⋆}) - \tilde{W} (x) = - \frac{1}{2} {‖ z ‖}^{2} .$

Therefore, the closed-loop system is Lyapunov-stable. Further, the condition $\tilde{W} (f (x) + g_{2} (x) u^{⋆} (x)) \equiv \tilde{W} (x) \forall k \geq k_{c}$ , for some k_c ≥ k₀, implies that u^* ≡ 0, h₁(x) ≡ 0 ∀ k ≥ k_c. By hypothesis (H1), this implies lim_t→∞ x_k = 0, and by LaSalle’s invariance-principle, we conclude asymptotic-stability. □

The above theorem can again be specialized to the linear system Σ^dl in the following corollary.

Corollary 11.2.2 Consider the discrete linear system Σ^dl under the Assumption 11.1.2. Suppose there exist ${\bar{P}}_{1} < 0 a n d {\bar{P}}_{2} > 0$ symmetric solutions of the cross-coupled discrete- algebraic Riccati equations (DAREs):

$\begin{array}{l} {\bar{P}}_{1} = A^{T} {{\bar{P}}_{1} - 2 {\bar{P}}_{1} B_{1} B_{γ}^{- 1} Γ_{1} - 2 {\bar{P}}_{1} B_{2} Λ^{- 1} B_{2}^{T} {\bar{P}}_{2} + 2 Γ_{1}^{T} B_{γ}^{- T} B_{1} {\bar{P}}_{1} B_{2} Λ^{- 1} B_{2} {\bar{P}}_{2} Γ_{2} + \\ Γ_{1}^{T} B_{γ}^{- 1} B_{1} {\bar{P}}_{1} B_{1} B_{γ}^{- 1} Γ_{1} + Γ_{2}^{T} {\bar{P}}_{2} B_{2} Λ^{- T} B_{2} {\bar{P}}_{1} B_{2} Λ^{- 1} B_{2}^{T} {\bar{P}}_{2} Γ_{2} + γ^{2} Γ_{1}^{T} B_{γ}^{- T} B_{γ}^{- 1} Γ_{1}^{T} - \\ Γ_{2}^{T} {\bar{P}}_{2} B_{2} Λ^{- T} Λ^{- 1} B_{2}^{T} {\bar{P}}_{2} Γ_{2}} A - C_{1}^{T} C_{1}, \end{array}$

(11.68)

$\begin{array}{l} {\bar{P}}_{2} = A^{T} {{\bar{P}}_{2} - 2 {\bar{P}}_{2} B_{1} B_{γ}^{- 1} Γ_{1} - 2 {\bar{P}}_{2} B_{2} Λ^{- 1} B_{2}^{T} {\bar{P}}_{2} Γ_{2} + \\ 2 Γ_{1}^{T} B_{γ}^{- T} B_{1} {\bar{P}}_{2} B_{2} Λ_{k}^{- 1} B_{2}^{T} {\bar{P}}_{2} Γ_{2} + Γ_{2}^{T} {\bar{P}}_{2} B_{2} Λ^{- 1} Λ^{- 1} B_{2}^{T} {\bar{P}}_{2} Γ_{2}} A + C_{1}^{T} C_{1} . \end{array}$

(11.69)

$B_{γ} : = [γ^{2} I - B_{2} Λ^{- 1} B_{2}^{T} P_{2} B_{1} + B_{1}^{T} P_{1} B_{1}] > 0$

(11.70)

Then the Nash-equilibrium strategies uniquely specified by

$w_{l, k}^{⋆} = - B_{γ}^{- 1} Γ_{1} A x_{k},$

(11.71)

$u_{l, k}^{⋆} = - Λ^{- 1} B_{2}^{T} {\bar{P}}_{2} Γ_{2} A x_{k},$

(11.72)

where

$\begin{array}{l} Λ : = (I + B_{2}^{T} {\bar{P}}_{2} B_{2}), \\ Γ_{1} : = [B_{1}^{T} {\bar{P}}_{1} - B_{2} Λ^{- 1} B_{2}^{T} P_{2}], \\ Γ_{2} : = [I - B_{1} B_{γ}^{- 1} (B_{1}^{T} P_{1} - B_{2} Λ^{- 1} B_{2}^{T} {\bar{P}}_{2})], \end{array}$

solve the infinite-horizon DSF BMH2HINLCP for the system. Moreover, the optimal costs for the game are given by

$J_{1, l} (u^{⋆}, w^{⋆}) = \frac{1}{2} x_{k_{0}}^{T} {\bar{P}}_{1} x_{k_{0}},$

(11.73)

$J_{2, l} (u^{⋆}, w^{⋆}) = \frac{1}{2} x_{k_{0}}^{T} {\bar{P}}_{2} x_{k_{0}} .$

(11.74)

Proof: Take

$\begin{array}{l} Y (x_{k}) = \frac{1}{2} x_{k}^{T} {\bar{P}}_{1} x_{k}, {\bar{P}}_{1} < 0 \\ V (x) = \frac{1}{2} x_{k}^{T} {\bar{P}}_{2} x_{k}, {\bar{P}}_{2} > 0 \end{array}$

and apply the results of the theorem. □

11.3 Extension to a General Class of Discrete-Time Nonlinear Systems

In this subsection, we similarly extend the results of the previous subsection to a more general class of nonlinear discrete-time systems which is not necessarily affine. We consider the following state-space model defined on $X \subset ℜ^{n}$ in local coordinates (x₁,…, x_n)

$\sum : {\begin{cases} {\dot{x}}_{k + 1} = \tilde{F} (x_{k}, w_{k}, u_{k}), x (t_{0}) = x_{0} \\ z_{k} = \tilde{Z} (x_{k}, u_{k}) \\ y_{k} = x_{k}, \end{cases}$

(11.75)

where all the variables have their previous meanings, while $\tilde{F} : X \times W \times U \to X, \tilde{Z} : X \times U \to ℜ^{s}$ are smooth functions of their arguments. In addition, we assume that $\tilde{F} (0, 0, 0) = 0 and \tilde{Z} (0,0)=0$ . Furthermore, define similarly the Hamiltonian functions corresponding to the cost functionals (11.32), (11.33), ${\tilde{K}}_{i} : X \times W \times U \times ℜ \to ℜ, i = 1, 2$ respectively:

${\tilde{K}}_{1} (x, w, u, \tilde{W}) = \tilde{W} (\tilde{F} (x, w, u)) - \tilde{W} (x) + \frac{1}{2} γ^{2} {‖ w ‖}^{2} - {‖ \tilde{z} (x, u) ‖}^{2},$

${\tilde{K}}_{2} (x, w, u, \bar{U}) = \tilde{U} (\tilde{F} (x, w, u)) - \tilde{U} (x) + \frac{1}{2} {‖ \tilde{z} (x, u) ‖}^{2},$

for some smooth functions $\tilde{W}, \tilde{U} : X \to ℜ$ . In addition, define also

$\partial^{2} \tilde{K} (x) ≜ [\begin{matrix} \frac{\partial^{2} {\tilde{K}}_{2}}{\partial u^{2}} & \frac{\partial^{2} {\tilde{K}}_{2}}{\partial w \partial u} \\ \frac{\partial^{2} {\tilde{K}}_{1}}{\partial u \partial w} & \frac{\partial^{2} {\tilde{K}}_{1}}{\partial w^{2}} \end{matrix}] (x) = [\begin{matrix} s_{11} (x) & s_{12} (x) \\ s_{21} (x) & S_{22} (x) \end{matrix}],$

where

$s_{11} (0) = {[{(\frac{\partial \tilde{F}}{\partial u})}^{T} \frac{\partial^{2} \tilde{U}}{\partial λ^{2}} (0) \frac{\partial \tilde{F}}{\partial u} + {(\frac{\partial \tilde{Z}}{\partial u})}^{T} \frac{\partial \tilde{Z}}{\partial u}]}_{x = 0, w = 0, u = 0},$

$s_{12} (0) = {[{(\frac{\partial \tilde{F}}{\partial u})}^{T} \frac{\partial^{2} \tilde{U}}{\partial λ^{2}} (0) \frac{\partial \tilde{F}}{\partial w}]}_{x = 0, w = 0, u = 0},$

$s_{21} (0) = {[{(\frac{\partial \tilde{F}}{\partial w})}^{T} \frac{\partial^{2} \tilde{W}}{\partial λ^{2}} (0) \frac{\partial \tilde{F}}{\partial u}]}_{x = 0, w = 0, u = 0},$

$s_{22} (0) = {[{(\frac{\partial \tilde{F}}{\partial w})}^{T} \frac{\partial^{2} \tilde{W}}{\partial λ^{2}} (0) \frac{\partial \tilde{F}}{\partial w} + γ^{2} I]}_{x = 0, w = 0, u = 0} .$

We then make the following assumption.

Assumption 11.3.1 For the Hamiltonian functions, ${\tilde{K}}_{1}, {\tilde{K}}_{2}$ , we assume

$s_{22} (0) > 0, \det [s_{11} (0) - s_{12} (0) s_{22}^{- 1} s_{21} (0)] \neq 0.$

Under the above assumption, the Hessian matrix $\partial^{2} \tilde{K} (0)$ is nonsingular, and therefore by the Implicit-function Theorem, there exists an open neighborhood M₀ of x = 0 such that the equations

$\begin{array}{l} \frac{\partial {\tilde{K}}_{1}}{\partial w} (x, {\tilde{w}}^{⋆} (x), {\tilde{u}}^{⋆} (x)) = 0, \\ \frac{\partial {\tilde{K}}_{2}}{\partial u} (x, {\tilde{w}}^{⋆} (x), {\tilde{u}}^{⋆} (x)) = 0 \end{array}$

have unique solutions ${\tilde{u}}^{⋆} (x), {\tilde{w}}^{⋆} (x), with {\tilde{u}}^{⋆} (0) = 0, {\tilde{w}}^{⋆} (0) = 0$ . Moreover, the pair $({\tilde{u}}^{⋆}, {\tilde{w}}^{⋆})$ constitutes a Nash-equilibrium solution to the dynamic game (11.32), (11.33), (11.75). The following theorem then summarizes the solution to the infinite-horizon problem for the general class of discrete-time nonlinear systems (11.75).

Theorem 11.3.1 Consider the discrete-time nonlinear system (11.75) and the DSFBMH2HINLCP for this system. Suppose Assumption 11.3.1 holds, and also the following:

(Ad1) the pair { $\tilde{F} (x, 0, 0), \tilde{Z} (x,0)$ } is zero-state detectable;

(Ad2) there exists a pair of C² locally negative and positive-definite functions $\tilde{W}, \tilde{U} : \tilde{M} \to ℜ$ respectively, defined in a neighborhood $\tilde{M}$ of x = 0, vanishing at x = 0 and satisfying the pair of coupled DHJIEs:

$\tilde{W} (\tilde{F} (x, {\tilde{w}}^{⋆} (x), {\tilde{u}}^{⋆} (x))) - \tilde{W} (x) + \frac{1}{2} γ^{2} {‖ {\tilde{w}}^{⋆} (x) ‖}^{2} - \frac{1}{2} {‖ \tilde{Z} (x, {\tilde{u}}^{⋆} (x)) ‖}^{2} = 0, \tilde{W} (0) = 0,$

$\tilde{U} (\tilde{F} (x, {\tilde{w}}^{⋆} (x), {\tilde{u}}^{⋆} (x))) - \tilde{U} (x) + \frac{1}{2} {‖ \tilde{Z} (x, {\tilde{u}}^{⋆} (x)) ‖}^{2} = 0, \tilde{U} (0) = 0;$

(A3) the pair ${\tilde{F} (x, {\tilde{w}}^{⋆} (x), 0), \tilde{Z} (x, 0)}$ is locally zero-state detectable.

Then the state-feedback controls ( ${\tilde{u}}^{⋆} (x), {\tilde{w}}^{⋆} (x)$ ) solve the dynamic game problem and the DSFBMH2HINLCP for the system (11.75). Moreover, the optimal costs of the policies are given by

$\begin{array}{l} {\tilde{J}}_{1 d}^{⋆} ({\tilde{w}}^{⋆}, {\tilde{u}}^{⋆}) = \tilde{W} (x_{0}), \\ {\tilde{J}}_{2 d}^{⋆} ({\tilde{w}}^{⋆}, {\tilde{u}}^{⋆}) = \tilde{U} (x_{0}) . \end{array}$

Proof: The proof can be pursued along the same lines as the previous results. □

11.4 Notes and Bibliography

This chapter is mainly based on the paper by Lin [180]. The approach adopted throughout the chapter was originally inspired by the paper by Limebeer et al. [179] for linear systems. The chapter mainly extended the results of the paper to the nonlinear case. But in addition, the discrete-time problem has also been developed. Finally, application of the results to tracking control for Robot manipulators can be found in [80].

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.

Table of Contents for 11.2 Discrete-Time Mixed H∞2/H∞∞ Nonlinear Control

Create new playlist

Sign In

Sign Up

Table of Contents for
11.2 Discrete-Time Mixed H∞2/H∞∞ Nonlinear Control