7.3 Extensions to a General Class of Discrete-Time Nonlinear Systems


In this section, we extend the results of the previous sections to a more general class of discrete-time nonlinear systems that may not necessarily be affine in u and w. We consider the general class of systems described by the state-space equations on $X \subset ℜ^{n}$ containing the origin {0}:

$\Sigma^{d} : \begin{cases} x_{k + 1} = F (x_{k}, w_{k}, u_{k}); \quad x (k_{0}) = x^{0} \\ z_{k} = Z (x_{k}, w_{k}, u_{k}) \\ y_{k} = Y (x_{k}, w_{k}) \end{cases}$

(7.64)

where all the variables have their usual meanings, and $F : X \times W \times U \to X$, $F (0, 0, 0) = 0$, $Z : X \times W \times U \to ℜ^{s}$, $Z (0, 0, 0) = 0$, $Y : X \times W \to ℜ^{m}$. We begin with the full-information and state-feedback problems.

7.3.1 Full-Information $H_{\infty}$-Control for a General Class of Discrete-Time Nonlinear Systems

To solve the full-information and state-feedback problems for the system Σ^d (7.64), we consider the Hamiltonian function:

$\tilde{H} (x, u, w) = \tilde{V} (F (x, w, u)) - \tilde{V} (x) + \frac{1}{2} (‖ Z (x, u, w) ‖^{2} - γ^{2} ‖ w ‖^{2})$

(7.65)

for some positive-definite function $\tilde{V} : X \to ℜ_{+}$, and let

$\frac{\partial^{2} \tilde{H}}{\partial (u, w)^{2}} (0, 0, 0) = [\begin{matrix} \frac{\partial^{2} \tilde{H}}{\partial u^{2}} & \frac{\partial^{2} \tilde{H}}{\partial w \partial u} \\ \frac{\partial^{2} \tilde{H}}{\partial u \partial w} & \frac{\partial^{2} \tilde{H}}{\partial w^{2}} \end{matrix}] (0, 0, 0) ≜ [\begin{matrix} h_{u u} (0) & h_{u w} (0) \\ h_{w u} (0) & h_{w w} (0) \end{matrix}]$

where

$\begin{array}{l} h_{u u} (0) = \left[ \left( \frac{\partial F}{\partial u} \right)^{T} \frac{\partial^{2} \tilde{V}}{\partial λ^{2}} (0) \left( \frac{\partial F}{\partial u} \right) + \left( \frac{\partial Z}{\partial u} \right)^{T} \left( \frac{\partial Z}{\partial u} \right) \right] \Big|_{x = 0, u = 0, w = 0} \\ h_{w w} (0) = \left[ \left( \frac{\partial F}{\partial w} \right)^{T} \frac{\partial^{2} \tilde{V}}{\partial λ^{2}} (0) \left( \frac{\partial F}{\partial w} \right) + \left( \frac{\partial Z}{\partial w} \right)^{T} \left( \frac{\partial Z}{\partial w} \right) - γ^{2} I \right] \Big|_{x = 0, u = 0, w = 0} \\ h_{u w} (0) = \left[ \left( \frac{\partial F}{\partial u} \right)^{T} \frac{\partial^{2} \tilde{V}}{\partial λ^{2}} (0) \left( \frac{\partial F}{\partial w} \right) + \left( \frac{\partial Z}{\partial u} \right)^{T} \left( \frac{\partial Z}{\partial w} \right) \right] \Big|_{x = 0, u = 0, w = 0} \end{array}$

Suppose now that the following assumption holds:

(GN1)

$h_{u u} (0) > 0, \quad h_{w w} (0) - h_{w u} (0) h_{u u}^{- 1} (0) h_{u w} (0) < 0$

Then by the Implicit-function Theorem, the above assumption implies that there exist unique smooth functions $\tilde{u}^{⋆} (x), \tilde{w}^{⋆} (x)$ with $\tilde{u}^{⋆} (0) = 0$ and $\tilde{w}^{⋆} (0) = 0$ defined in a neighborhood $\bar{X}$ of x = 0 and satisfying the functional equations

$\begin{array}{l} 0 = \frac{\partial \tilde{H}}{\partial u} (x, \tilde{u}^{⋆} (x), \tilde{w}^{⋆} (x)) \\ \quad = \left( \frac{\partial \tilde{V}}{\partial F} \frac{\partial F}{\partial u} + Z^{T} \frac{\partial Z}{\partial u} \right) \Big|_{u = \tilde{u}^{⋆} (x), w = \tilde{w}^{⋆} (x)} \end{array}$

(7.66)

$\begin{array}{l} 0 = \frac{\partial \tilde{H}}{\partial w} (x, \tilde{u}^{⋆} (x), \tilde{w}^{⋆} (x)) \\ \quad = \left( \frac{\partial \tilde{V}}{\partial F} \frac{\partial F}{\partial w} + Z^{T} \frac{\partial Z}{\partial w} - γ^{2} w^{T} \right) \Big|_{u = \tilde{u}^{⋆} (x), w = \tilde{w}^{⋆} (x)} . \end{array}$

(7.67)
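Hypothesis (GN1) is a pair of matrix-definiteness conditions, so it can be checked numerically once the Jacobians of F and Z at the origin and the Hessian of $\tilde{V}$ at 0 are available. The following is a minimal sketch under that assumption; the function names `gn1_blocks` and `gn1_holds` and the scalar data are hypothetical and chosen only for illustration.

```python
import numpy as np

def gn1_blocks(F_u, F_w, Z_u, Z_w, V_hess, gamma):
    """Form h_uu(0), h_uw(0), h_ww(0) from Jacobians at the origin.

    F_u, F_w, Z_u, Z_w are the Jacobians of F and Z with respect to u and w
    at (0, 0, 0); V_hess is the Hessian of V~ at 0. All inputs are assumed data.
    """
    h_uu = F_u.T @ V_hess @ F_u + Z_u.T @ Z_u
    h_ww = F_w.T @ V_hess @ F_w + Z_w.T @ Z_w - gamma**2 * np.eye(F_w.shape[1])
    h_uw = F_u.T @ V_hess @ F_w + Z_u.T @ Z_w
    return h_uu, h_uw, h_ww

def gn1_holds(h_uu, h_uw, h_ww):
    """Return True if h_uu > 0 and h_ww - h_wu h_uu^{-1} h_uw < 0 (h_wu = h_uw^T)."""
    if np.min(np.linalg.eigvalsh(h_uu)) <= 0.0:
        return False
    schur = h_ww - h_uw.T @ np.linalg.solve(h_uu, h_uw)
    schur = 0.5 * (schur + schur.T)  # symmetrize against round-off
    return bool(np.max(np.linalg.eigvalsh(schur)) < 0.0)

# Hypothetical scalar example: n = p = r = s = 1.
one, zero = np.array([[1.0]]), np.array([[0.0]])
ok_large_gamma = gn1_holds(*gn1_blocks(one, one, one, zero, one, gamma=2.0))
ok_small_gamma = gn1_holds(*gn1_blocks(one, one, one, zero, one, gamma=0.5))
```

In this toy case the condition holds for the larger attenuation level and fails for the smaller one, which is the expected qualitative behaviour as γ decreases.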

Similarly, let

$\begin{array}{l} h_{u u} (x) ≜ \frac{\partial^{2} \tilde{H}}{\partial u^{2}} (x, \tilde{u}^{⋆} (x), \tilde{w}^{⋆} (x)) \\ h_{w w} (x) ≜ \frac{\partial^{2} \tilde{H}}{\partial w^{2}} (x, \tilde{u}^{⋆} (x), \tilde{w}^{⋆} (x)) \\ h_{u w} (x) ≜ \frac{\partial^{2} \tilde{H}}{\partial w \partial u} (x, \tilde{u}^{⋆} (x), \tilde{w}^{⋆} (x)) = h_{w u}^{T} (x) \end{array}$

be associated with the optimal solutions $\tilde{u}^{⋆} (x), \tilde{w}^{⋆} (x)$, and let the corresponding DHJIE associated with (7.64) be denoted by

(GN2)

$\tilde{V} (F (x, \tilde{w}^{⋆} (x), \tilde{u}^{⋆} (x))) - \tilde{V} (x) + \frac{1}{2} (‖ Z (x, \tilde{u}^{⋆} (x), \tilde{w}^{⋆} (x)) ‖^{2} - γ^{2} ‖ \tilde{w}^{⋆} (x) ‖^{2}) = 0, \quad \tilde{V} (0) = 0.$

(7.68)

Then we have the following result for the solution of the full-information problem.

Theorem 7.3.1 Consider the discrete-time nonlinear system (7.64), and suppose there exists a C² positive-definite function $\tilde{V} : X_{1} \subset X \to ℜ_{+}$ locally defined in a neighborhood X₁ of x = 0 satisfying the hypotheses (GN1), (GN2). In addition, suppose the following assumption is also satisfied by the system:

(GN3) Any bounded trajectory of the free system

$x_{k + 1} = F (x_{k}, 0, u_{k}),$

under the constraint

$Z (x_{k}, 0, u_{k}) = 0$

for all $k \in Z_{+}$, is such that $\lim_{k \to \infty} x_{k} = 0$.

Then there exists a static full-information feedback control

$u_{k} = \tilde{u}^{⋆} (x_{k}) - h_{u u}^{- 1} (x_{k}) h_{u w} (x_{k}) (w_{k} - \tilde{w}^{⋆} (x_{k}))$

which solves the DFIFBNLHICP for the system.

Proof: The proof can be pursued along similar lines as Theorem 7.1.2. □

The above result can also be easily specialized to the state-feedback case as follows.

Theorem 7.3.2 Consider the discrete-time nonlinear system (7.64), and suppose there exists a C² positive-definite function $\tilde{V} : X_{2} \subset X \to ℜ_{+}$ locally defined in a neighborhood X₂ of x = 0 satisfying the hypothesis

(GN1s)

$h_{u u} (0) > 0, \quad h_{w w} (0) < 0$

and hypotheses (GN2), (GN3) above. Then, the static state-feedback control

$u_{k} = \tilde{u}^{⋆} (x_{k})$

solves the DSFBNLHICP for the system.

Proof: The theorem can be proven along similar lines as Theorem 7.1.3. □

Moreover, the parametrization of all static state-feedback controllers can also be given in the following theorem.

Theorem 7.3.3 Consider the discrete-time nonlinear system (7.64), and suppose the following hypothesis holds

(GN2s) there exists a C² positive-definite function $\tilde{V} : X_{3} \subset X \to ℜ_{+}$ locally defined in a neighborhood X₃ of x = 0 satisfying the DHJIE

$\begin{array}{l} \tilde{V} (F (x, \tilde{w}^{⋆} (x), \tilde{u}^{⋆} (x))) - \tilde{V} (x) + \frac{1}{2} (‖ Z (x, \tilde{u}^{⋆} (x), \tilde{w}^{⋆} (x)) ‖^{2} - γ^{2} ‖ \tilde{w}^{⋆} (x) ‖^{2}) \\ \quad = - ψ^{T} (x) [h_{u u} (x) - h_{u w} (x) h_{w w}^{- 1} (x) h_{w u} (x)] ψ (x), \quad \tilde{V} (0) = 0 \end{array}$

(7.69)

for some arbitrary smooth function ψ : X₃ → ℜ^p, ψ(0) = 0,

as well as the hypotheses (GN3) and (GN1s), with (GN1s) evaluated for this $\tilde{V}$. Then, the family of controllers

$K_{S F g} = \{ u_{k} \mid u_{k} = \tilde{u}^{⋆} (x_{k}) + ψ (x_{k}) \}$

(7.70)

is a parametrization of all static state-feedback controllers that solve the DSFBNLHICP for the system.

7.3.2 Output Measurement-Feedback $H_{\infty}$-Control for a General Class of Discrete-Time Nonlinear Systems

In this subsection, we discuss briefly the output measurement-feedback problem for the general class of nonlinear systems (7.64). Theorem 7.2.1 can easily be generalized to this class of systems. As in the previous case, we can postulate the existence of a dynamic compensator of the form:

$\tilde{\Sigma}_{d y n o b s}^{d c} : \begin{cases} ξ_{k + 1} = F (ξ_{k}, \tilde{w}^{⋆} (ξ_{k}), \tilde{u}^{⋆} (ξ_{k})) + \tilde{G} (ξ_{k}) (y_{k} - Y (ξ_{k}, \tilde{w}^{⋆} (ξ_{k}))) \\ u_{k} = \tilde{u}^{⋆} (ξ_{k}) \end{cases}$

(7.71)

where $\tilde{G} (.)$ is the output-injection gain matrix, and $\tilde{w}^{⋆} (.), \tilde{u}^{⋆} (.)$ are the solutions to equations (7.66), (7.67). Let the closed-loop system (7.64) with the controller (7.71) be represented as

$\begin{array}{l} x_{k + 1}^{e} = F^{e} (x_{k}^{e}, w_{k}) \\ z_{k} = Z^{e} (x_{k}^{e}, w_{k}) \end{array}$

where $x^{e} = {[\begin{matrix} x^{T} & ξ^{T} \end{matrix}]}^{T},$

$F^{e} (x^{e}, w) = [\begin{array}{l} F (x, w, \tilde{u}^{⋆} (ξ)) \\ F (ξ, \tilde{w}^{⋆} (ξ), \tilde{u}^{⋆} (ξ)) + \tilde{G} (ξ) (Y (x, w) - Y (ξ, \tilde{w}^{⋆} (ξ))) \end{array}],$

$Z^{e} (x^{e}, w) = Z (x, w, {\tilde{u}}^{⋆} (ξ)) .$
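One update of the compensator (7.71) is a single function evaluation once the maps are known. The sketch below, assuming the maps F, Y, the gain $\tilde{G}$ and the functions $\tilde{u}^{⋆}$, $\tilde{w}^{⋆}$ are supplied as Python callables, shows the structure of that update; the function name `compensator_step` and the scalar toy maps are hypothetical placeholders, not quantities from the text.

```python
import numpy as np

def compensator_step(xi, y, F, Y, G_tilde, u_star, w_star):
    """One step of the observer-based compensator (7.71).

    xi : current compensator state, y : current measurement.
    Returns (next compensator state, control to apply now).
    """
    u = u_star(xi)                      # certainty-equivalent control
    w = w_star(xi)                      # worst-case disturbance estimate
    innovation = y - Y(xi, w)           # output-prediction error
    xi_next = F(xi, w, u) + G_tilde(xi) @ innovation
    return xi_next, u

# Hypothetical scalar maps, used only to exercise the update.
F = lambda x, w, u: 0.5 * x + w + u
Y = lambda x, w: x + w
G_tilde = lambda xi: np.array([[0.2]])
u_star = lambda xi: -0.3 * xi
w_star = lambda xi: 0.1 * xi

xi_next, u_now = compensator_step(np.array([1.0]), np.array([2.0]),
                                  F, Y, G_tilde, u_star, w_star)
```

The design question addressed by Theorem 7.3.4 is the choice of the gain $\tilde{G}$; the code only fixes the data flow once that gain is given.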

Then the following result is a direct extension of Theorem 7.2.1.

Theorem 7.3.4 Consider the discrete-time nonlinear system (7.64) and assume the following:

(i) Assumption (GN3) holds and $\mathrm{rank} \left\{ \frac{\partial Z}{\partial u} (0, 0, 0) \right\} = p$.

(ii) There exists a C² positive-definite function $\tilde{V} : X \to ℜ_{+}$ locally defined in a neighborhood X of x = 0 satisfying Assumption (GN2).

(iii) There exists an output-injection gain matrix $\tilde{G} (.)$ and a C² real-valued function W : X₄ × X₄ locally defined in a neighborhood X₄ × X of (x, ξ) = (0, 0), X₄ ∩ X₄ ≠ ∅, with W (0, 0) = 0, W (x, ξ) > 0 ∀x ≠ ξ and satisfying

(GNM1)

$F^{e^{T}} (0, 0, 0) \frac{\partial^{2} W}{\partial x^{e^{2}}} (0, 0) F^{e} (0, 0, 0) + h_{w w} (0) < 0;$

(GNM2)

$\begin{array}{l} W (F^{e} (x^{e}, \tilde{α}_{1} (x^{e}))) - W (x^{e}) + \tilde{V} (F (x, \tilde{α}_{1} (x^{e}), \tilde{u}^{⋆} (ξ))) - \tilde{V} (x) + \\ \frac{1}{2} (‖ Z (x, \tilde{α}_{1} (x^{e}), \tilde{u}^{⋆} (ξ)) ‖^{2} - γ^{2} ‖ \tilde{α}_{1} (x^{e}) ‖^{2}) = 0, \end{array}$

where $w = \tilde{α}_{1} (x^{e})$, with $\tilde{α}_{1} (0) = 0$, is a locally unique solution of the equation

$\begin{array}{l} \frac{\partial W}{\partial β} \Big|_{β = F^{e} (x^{e}, w)} \frac{\partial F^{e}}{\partial w} (x^{e}, w) + \frac{\partial \tilde{V}}{\partial λ} \Big|_{λ = F (x, w, \tilde{u}^{⋆} (ξ))} \frac{\partial F}{\partial w} (x, w, \tilde{u}^{⋆} (ξ)) + \\ Z^{T} (x, w, \tilde{u}^{⋆} (ξ)) \frac{\partial Z}{\partial w} (x, w, \tilde{u}^{⋆} (ξ)) - γ^{2} w^{T} = 0 \end{array}$

(GNM3) The discrete-time nonlinear system

$ξ_{k + 1} = F (ξ_{k}, \tilde{w}^{⋆} (ξ_{k}), 0) - \tilde{G} (ξ_{k}) Y (ξ_{k}, \tilde{w}^{⋆} (ξ_{k}))$

is locally asymptotically-stable at ξ = 0.

Then, the DMFBNLHICP for the system (7.64) is solvable with the compensator (7.71).

7.4 Approximate Approach to the Discrete-Time Nonlinear $H_{\infty}$-Control Problem

In this section, we discuss alternative approaches to the discrete-time nonlinear $H_{\infty}$-control problem for affine systems. As seen in Sections 7.1 and 7.2, the control laws derived there are given only implicitly, in terms of solutions to certain pairs of algebraic equations, which makes the design computationally intensive. Therefore, we discuss alternative approaches which, although approximate, can yield explicit solutions to the problem. We begin with the state-feedback problem.

7.4.1 An Approximate Approach to the Discrete-Time State-Feedback Problem

We consider again the nonlinear system (7.1), and assume the following.

Assumption 7.4.1

$\mathrm{rank} \{ k_{12} (x) \} = p .$

Reconsider now the Hamiltonian function (7.11) associated with the problem:

$\begin{array}{l} H_{2} (w, u) = V (f (x) + g_{1} (x) w + g_{2} (x) u) - V (x) + \frac{1}{2} ({‖ h_{1} (x) + k_{11} (x) w + k_{12} (x) u ‖}^{2} - \\ γ^{2} {‖ w ‖}^{2}) \end{array}$

(7.72)

for some smooth positive-definite function V : $X$ → ℜ₊. Suppose there exists a smooth real-valued function $\bar{u} (x) \in ℜ^{p}, \bar{u} (0) = 0$ such that the HJI-inequality

$H_{2} (w, \bar{u} (x)) < 0$

(7.73)

is satisfied for all x ∈ $X$ and w ∈ $W$ . Then it is clear from the foregoing that the control law

$u = \bar{u} (x)$

solves the DSFBNLHICP for the system Σ^da globally. The bottleneck however, is in getting an explicit form for the function ū(x). This problem stems from the first term in the HJI-inequality (7.73), i.e.,

$V (f (x) + g_{1} (x) w + g_{2} (x) u),$

which is a composition of functions and is not necessarily quadratic in u, as in the continuous-time case. Thus, suppose we replace this term by an approximation which is “quadratic” in (w, u), and nonlinear in x, i.e.,

$V (f (x) + υ) = V (f (x)) + V_{x} (f (x)) υ + \frac{1}{2} υ^{T} V_{x x} (f (x)) υ + R_{m} (x, υ)$

for some vector $υ \in X$, where $R_{m}$ is a remainder term such that

$\lim_{υ \to 0} \frac{R_{m} (x, υ)}{{‖ υ ‖}^{2}} = 0$
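The remainder property above can be observed numerically. The following is a minimal sketch for a scalar state, assuming a hypothetical smooth function V(x) = x²/2 + x⁴/4 and a hypothetical value f(x) = 0.3; neither is taken from the text. It evaluates the ratio |R_m(x, υ)| / ‖υ‖² for shrinking υ.

```python
# Hypothetical scalar storage function and its first two derivatives.
def V(x):
    return 0.5 * x**2 + 0.25 * x**4

def V_x(x):
    return x + x**3

def V_xx(x):
    return 1.0 + 3.0 * x**2

def remainder(fx, v):
    """R_m: what is left of V(fx + v) after the second-order Taylor terms."""
    return V(fx + v) - V(fx) - V_x(fx) * v - 0.5 * V_xx(fx) * v**2

fx = 0.3                      # assumed value of f(x) at some fixed x
steps = [1e-1, 1e-2, 1e-3]    # shrinking perturbations v
ratios = [abs(remainder(fx, v)) / v**2 for v in steps]
```

For this choice the remainder is cubic in υ to leading order, so the ratio shrinks roughly in proportion to υ, which is what justifies dropping R_m near the origin.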

Then, we can seek a saddle-point for the new Hamiltonian function

$\begin{array}{l} \hat{H}_{2} (w, u) = V (f (x)) + V_{x} (f (x)) (g_{1} (x) w + g_{2} (x) u) + \\ \frac{1}{2} (g_{1} (x) w + g_{2} (x) u)^{T} V_{x x} (f (x)) (g_{1} (x) w + g_{2} (x) u) - V (x) + \\ \frac{1}{2} ‖ h_{1} (x) + k_{11} (x) w + k_{12} (x) u ‖^{2} - \frac{1}{2} γ^{2} ‖ w ‖^{2} \end{array}$

(7.74)

by neglecting the higher-order term $R_{m} (x, g_{1} (x) w + g_{2} (x) u)$. Since $\hat{H}_{2} (w, u)$ is quadratic in (w, u), it can be represented as

$\hat{H}_{2} (w, u) = V (f (x)) - V (x) + \frac{1}{2} h_{1}^{T} (x) h_{1} (x) + \hat{S} (x) [\begin{array}{l} w \\ u \end{array}] + \frac{1}{2} [\begin{array}{l} w \\ u \end{array}]^{T} \hat{R} (x) [\begin{array}{l} w \\ u \end{array}],$

where

$\hat{S} (x) = h_{1}^{T} (x) [k_{11} (x) k_{12} (x)] + V_{x} (f (x)) [g_{1} (x) g_{2} (x)]$

and

$\hat{R} (x) = [\begin{matrix} k_{11}^{T} (x) k_{11} (x) - γ^{2} I & k_{11}^{T} (x) k_{12} (x) \\ k_{12}^{T} (x) k_{11} (x) & k_{12}^{T} (x) k_{12} (x) \end{matrix}] + [\begin{matrix} g_{1}^{T} (x) \\ g_{2}^{T} (x) \end{matrix}] V_{x x} (f (x)) [\begin{matrix} g_{1} (x) & g_{2} (x) \end{matrix}]$

From this, it is easy to determine conditions for the existence of a unique saddle-point and explicit formulas for the coordinates of this point. It can immediately be determined that, if $\hat{R} (x)$ is nonsingular, then

$\hat{H}_{2} (w, u) = \hat{H}_{2} (\hat{w}^{⋆} (x), \hat{u}^{⋆} (x)) + \frac{1}{2} [\begin{array}{l} w - \hat{w}^{⋆} (x) \\ u - \hat{u}^{⋆} (x) \end{array}]^{T} \hat{R} (x) [\begin{array}{l} w - \hat{w}^{⋆} (x) \\ u - \hat{u}^{⋆} (x) \end{array}]$

(7.75)

where

$[\begin{array}{l} {\hat{w}}^{⋆} (x) \\ {\hat{u}}^{⋆} (x) \end{array}] = - {\hat{R}}^{- 1} (x) {\hat{S}}^{T} (x) .$

(7.76)

One condition that guarantees that $\hat{R} (x)$ is nonsingular (given Assumption 7.4.1) is that the submatrix

$\hat{R}_{11} (x) := k_{11}^{T} (x) k_{11} (x) - γ^{2} I + g_{1}^{T} (x) V_{x x} (f (x)) g_{1} (x) < 0 \quad \forall x.$

If the above condition is satisfied for some γ > 0, then $\hat{H}_{2} (w, u)$ has a saddle-point at $(\hat{w}^{⋆} (x), \hat{u}^{⋆} (x))$, and

${\hat{H}}_{2} (\hat{w}^{⋆} (x), \hat{u}^{⋆} (x)) = V (f (x)) - V (x) - \frac{1}{2} \hat{S} (x) {\hat{R}}^{- 1} (x) {\hat{S}}^{T} (x) + \frac{1}{2} h_{1}^{T} (x) h_{1} (x) .$

(7.77)
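Formula (7.76) makes the approximate saddle-point explicit: at each state x it is the solution of one linear system. The following is a minimal sketch of that computation, assuming the system maps and the derivatives of V are available as Python callables; the function name `saddle_point` and the scalar example data are hypothetical and serve only as an illustration.

```python
import numpy as np

def saddle_point(x, f, g1, g2, h1, k11, k12, V_x, V_xx, gamma):
    """Evaluate (w_hat*, u_hat*) = -R_hat^{-1} S_hat^T at the state x, as in (7.76)."""
    fx = f(x)
    G = np.hstack([g1(x), g2(x)])        # [g1 g2]
    K = np.hstack([k11(x), k12(x)])      # [k11 k12]
    r = g1(x).shape[1]                   # dimension of the disturbance w
    S = h1(x) @ K + V_x(fx) @ G          # S_hat(x), stored as a 1-D array
    R = K.T @ K + G.T @ V_xx(fx) @ G     # R_hat(x) before the gamma term
    R[:r, :r] -= gamma**2 * np.eye(r)    # subtract gamma^2 I in the (1,1) block
    sol = -np.linalg.solve(R, S)         # R_hat is symmetric
    return sol[:r], sol[r:], R

# Hypothetical scalar example with V(x) = x^2 / 2 (so V_x(y) = y, V_xx = 1).
f = lambda x: 0.5 * x
g1 = lambda x: np.array([[1.0]])
g2 = lambda x: np.array([[1.0]])
h1 = lambda x: np.array([x[0], 0.0])
k11 = lambda x: np.array([[0.0], [0.0]])
k12 = lambda x: np.array([[0.0], [1.0]])
V_x = lambda y: 1.0 * y
V_xx = lambda y: np.array([[1.0]])

w_hat, u_hat, R_hat = saddle_point(np.array([1.0]), f, g1, g2, h1,
                                   k11, k12, V_x, V_xx, gamma=2.0)
```

Here the (1,1) block of `R_hat` is negative, so the stationary point is a maximum in w and a minimum in u. Whether the chosen V also makes (7.77) negative is a separate question that this sketch does not address.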

The above development can now be summarized in the following lemma.

Lemma 7.4.1 Consider the discrete-time nonlinear system (7.1), and suppose there exists a smooth positive-definite function $V : X_{0} \subset X \to ℜ_{+}$, V (0) = 0, and a positive number δ > 0 such that

(i)

$\hat{H}_{2} (\hat{w}^{⋆} (x), \hat{u}^{⋆} (x)) < 0 \quad \forall \, 0 \neq x \in X_{0}$

(7.78)

(ii)

$\hat{H}_{2} (\hat{w}^{⋆} (x), \hat{u}^{⋆} (x)) < - \frac{1}{2} δ (‖ \hat{w}^{⋆} (x) ‖^{2} + ‖ \hat{u}^{⋆} (x) ‖^{2}) \quad \forall x \in X_{0},$

(iii)

${\hat{R}}_{11} (x) < 0 \forall x \in X_{0} .$

Then, there exists a neighborhood X × W of (x, w) = (0, 0) in $X \times W$ such that V satisfies the HJI-inequality (7.73) with $\bar{u} = \hat{u}^{⋆} (x)$, $\hat{u}^{⋆} (0) = 0$.

Proof: By construction, $\hat{H}_{2}$ satisfies

$\hat{H}_{2} (w, \hat{u}^{⋆} (x)) = \hat{H}_{2} (\hat{w}^{⋆} (x), \hat{u}^{⋆} (x)) + \frac{1}{2} (w - \hat{w}^{⋆} (x))^{T} \hat{R}_{11} (x) (w - \hat{w}^{⋆} (x)) .$

Since $\hat{R}_{11} (0)$ is negative-definite by hypothesis (iii), there exists a neighborhood X₁ of x = 0 and a positive number c > 0 such that

$(w - \hat{w}^{⋆} (x))^{T} \hat{R}_{11} (x) (w - \hat{w}^{⋆} (x)) \leq - c ‖ w - \hat{w}^{⋆} (x) ‖^{2} \quad \forall x \in X_{1}, \forall w .$

Now let μ = min{δ, c}. Then

$(w - \hat{w}^{⋆} (x))^{T} \hat{R}_{11} (x) (w - \hat{w}^{⋆} (x)) \leq - μ ‖ w - \hat{w}^{⋆} (x) ‖^{2} \quad \forall x \in X_{1} .$

Moreover, by hypothesis (ii)

${\hat{H}}_{2} ({\hat{w}}^{⋆} (x), {\hat{u}}^{⋆} (x)) \leq - \frac{μ}{2} ({‖ {\hat{w}}^{⋆} (x) ‖}^{2} + {‖ {\hat{u}}^{⋆} (x) ‖}^{2}) \forall x \in X_{0} .$

Thus, by the triangle inequality,

$\begin{array}{l} \hat{H}_{2} (w, \hat{u}^{⋆} (x)) \leq - \frac{μ}{2} (‖ \hat{w}^{⋆} (x) ‖^{2} + \frac{1}{2} ‖ \hat{u}^{⋆} (x) ‖^{2} + \frac{1}{2} ‖ w - \hat{w}^{⋆} (x) ‖^{2}) \\ \quad \leq - \frac{μ}{2} (‖ w ‖^{2} + ‖ \hat{u}^{⋆} (x) ‖^{2}) \end{array}$

(7.79)

for all x ∈ X₂, where X₂ = X₀ ∩ X₁. Notice however that the Hamiltonians H₂(x, w, u) and Ĥ₂(x, w, u) defined by (7.72) and (7.74) respectively, are related by

$H_{2} (x, w, u) = {\hat{H}}_{2} (x, w, u) + R_{m} (x, g_{1} (x) w + g_{2} (x) u) .$

(7.80)

By the result in Section 8.14.3 of reference [94], for all κ > 0, there exist neighborhoods X₃ of x = 0, W₁ of w = 0 and U₁ of u = 0 such that

$| R_{m} (x, g_{1} (x) w + g_{2} (x) u) | \leq κ ({‖ w ‖}^{2} + {‖ u ‖}^{2}) \forall (x, w, u) \in X_{3} \times W_{1} \times U_{1} .$

(7.81)

Finally, combining (7.79), (7.80) and (7.81), one obtains an estimate for $H_{2} (w, \hat{u}^{⋆} (x))$, i.e.,

$H_{2} (w, \hat{u}^{⋆} (x)) \leq - \frac{μ}{2} (1 - \frac{κ}{μ}) (‖ w ‖^{2} + ‖ \hat{u}^{⋆} (x) ‖^{2}) .$

Choosing κ < μ and X ⊆ X₃, W₁ ⊆ W, the result follows. □

From the above lemma, one can conclude the following.

Theorem 7.4.1 Consider the discrete-time nonlinear system (7.1), and assume all the hypotheses (i), (ii), (iii) of Lemma 7.4.1 hold. Then the closed-loop system

$\Sigma^{d a} : \begin{cases} x_{k + 1} = f (x_{k}) + g_{1} (x_{k}) w_{k} + g_{2} (x_{k}) \bar{u} (x_{k}); \quad x_{0} = 0 \\ z_{k} = h_{1} (x_{k}) + k_{11} (x_{k}) w_{k} + k_{12} (x_{k}) \bar{u} (x_{k}) \end{cases}$

(7.82)

with $\bar{u} (x) = \hat{u}^{⋆} (x)$ has a locally asymptotically-stable equilibrium-point at x = 0, and for every K ∈ Z₊, there exists a number ε > 0 such that the response of the system from the initial state x₀ = 0 satisfies

$\sum_{k = 0}^{K} {‖ z_{k} ‖}^{2} \leq γ^{2} \sum_{k = 0}^{K} {‖ w_{k} ‖}^{2}$

for every sequence w = (w₀,…, w_K) such that ‖w_k‖ < ε.

Proof: Since f(.) and $\hat{u}^{⋆}$(.) are smooth and vanish at x = 0, it is easily seen that for every K > 0, there exists a number ε > 0 such that the response of the closed-loop system to any input sequence w = (w₀,…, w_K) from the initial state x₀ = 0 is such that x_k ∈ X for all k ≤ K + 1 as long as ‖w_k‖ < ε for all k ≤ K. Without any loss of generality, we may assume that ε is such that w_k ∈ W. In this case, using Lemma 7.4.1, we can deduce that the dissipation-inequality

$V (x_{k + 1}) - V (x_{k}) + \frac{1}{2} (z_{k}^{T} z_{k} - γ^{2} w_{k}^{T} w_{k}) \leq 0 \quad \forall k \leq K$

holds. The result now follows from Chapter 3 and a Lyapunov argument. □

Remark 7.4.1 Again, in the case of the linear system Σ^dl (7.17), the result of Theorem 7.4.1 reduces to well-known necessary and sufficient conditions for the existence of a solution to the linear DSFBNLHICP [89]. Indeed, setting B := [B₁ B₂] and D := [D₁₁ D₁₂], a quadratic function $V (x) = \frac{1}{2} x^{T} P x$ with P = P^T > 0 satisfies the hypotheses of the theorem if and only if

$\begin{array}{l} A^{T} P A - P + C_{1}^{T} C_{1} - F_{p}^{T} (R + B^{T} P B) F_{p} < 0 \\ D_{11}^{T} D_{11} - γ^{2} I + B_{1}^{T} P B_{1} < 0 \end{array}$

where $F_{p} = - (R + B^{T} P B)^{- 1} (B^{T} P A + D^{T} C_{1})$. In this case,

$[\begin{matrix} {\hat{w}}^{*} \\ {\hat{u}}^{*} \end{matrix}] = F_{p} x .$
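The two matrix inequalities of Remark 7.4.1 can be tested numerically for a candidate P. The sketch below is only illustrative: the excerpt does not define R, so the code assumes R = DᵀD with γ²I subtracted from its (1,1) block, which is what $\hat{R}(x)$ reduces to in the linear case once the BᵀPB term is separated out. The function name `linear_conditions_hold` and the scalar data are hypothetical.

```python
import numpy as np

def linear_conditions_hold(A, B1, B2, C1, D11, D12, P, gamma):
    """Check the two inequalities of the linear case for a given P = P^T > 0."""
    B = np.hstack([B1, B2]).astype(float)
    D = np.hstack([D11, D12]).astype(float)
    r = B1.shape[1]
    R = D.T @ D                            # ASSUMED definition of R (see lead-in)
    R[:r, :r] -= gamma**2 * np.eye(r)
    M = R + B.T @ P @ B
    Fp = -np.linalg.solve(M, B.T @ P @ A + D.T @ C1)
    lhs1 = A.T @ P @ A - P + C1.T @ C1 - Fp.T @ M @ Fp
    lhs2 = D11.T @ D11 - gamma**2 * np.eye(r) + B1.T @ P @ B1

    def neg_def(X):
        return np.max(np.linalg.eigvalsh(0.5 * (X + X.T))) < 0.0

    return bool(neg_def(lhs1) and neg_def(lhs2)), Fp

# Hypothetical scalar plant: x+ = 0.5 x + w + u, z = (x, u).
A = np.array([[0.5]]); B1 = np.array([[1.0]]); B2 = np.array([[1.0]])
C1 = np.array([[1.0], [0.0]])
D11 = np.array([[0.0], [0.0]]); D12 = np.array([[0.0], [1.0]])

ok_p2, Fp2 = linear_conditions_hold(A, B1, B2, C1, D11, D12,
                                    np.array([[2.0]]), gamma=2.0)
ok_p1, _ = linear_conditions_hold(A, B1, B2, C1, D11, D12,
                                  np.array([[1.0]]), gamma=2.0)
```

For this toy plant the candidate P = 2 passes both tests while P = 1 fails the first one, which illustrates that the conditions constrain P and are not satisfied by an arbitrary positive-definite choice.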

In the next subsection, we consider an approximate approach to the measurement-feedback problem.

7.4.2 An Approximate Approach to the Discrete-Time Output Measurement-Feedback Problem

In this section, we discuss an alternative approximate approach to the discrete-time measurement-feedback problem for affine systems. In this regard, assume similarly a dynamic observer-based controller of the form

$\bar{\Sigma}_{d y n o b s}^{d a c} : \begin{cases} θ_{k + 1} = f (θ_{k}) + g_{1} (θ_{k}) \hat{w}^{⋆} (θ_{k}) + g_{2} (θ_{k}) \hat{u}^{⋆} (θ_{k}) + \bar{G} (θ_{k}) [y_{k} - h_{2} (θ_{k}) - k_{21} (θ_{k}) \hat{w}^{⋆} (θ_{k})] \\ u_{k} = \hat{u}^{⋆} (θ_{k}) \end{cases}$

(7.83)

where θ ∈ X is the controller state vector, while $\hat{u}^{⋆}$(.), $\hat{w}^{⋆}$(.) are respectively the optimal state-feedback control and worst-case disturbance given by (7.76), and $\bar{G}$ is the output-injection gain matrix which is to be determined. Accordingly, the corresponding closed-loop system (7.1), (7.83) can be represented by

$\begin{cases} x_{k + 1}^{#} = f^{#} (x_{k}^{#}) + g^{#} (x_{k}^{#}) w_{k} \\ z_{k}^{#} = h^{#} (x_{k}^{#}) + k^{#} (x_{k}^{#}) w_{k} \end{cases}$

(7.84)

where x^# = [x^T θ^T ]^T ,

$f^{#} (x^{#}) = [\begin{array}{l} f (x) + g_{2} (x) \hat{u}^{⋆} (θ) \\ f (θ) + g_{1} (θ) \hat{w}^{⋆} (θ) + g_{2} (θ) \hat{u}^{⋆} (θ) + \bar{G} (θ) (h_{2} (x) - h_{2} (θ) - k_{21} (θ) \hat{w}^{⋆} (θ)) \end{array}],$

$g^{#} (x^{#}) = [\begin{array}{l} g_{1} (x) \\ \bar{G} (θ) k_{21} (x) \end{array}],$

and

$h^{#} (x^{#}) = h_{1} (x) + k_{12} (x) {\hat{u}}^{⋆} (θ), k^{#} (x^{#}) = k_{11} (x) .$

The objective is to find sufficient conditions under which the above closed-loop system (7.84) is locally (globally) asymptotically-stable and the estimate θ → x as k → ∞. This can be achieved by first rendering the closed-loop system dissipative, and then using some suitable additional conditions to conclude asymptotic-stability.

Thus, we look for a suitable positive-definite function $Ψ_{1} : X \times X \to ℜ_{+}$, such that the dissipation-inequality

$Ψ_{1} (x_{k + 1}^{#}) - Ψ_{1} (x_{k}^{#}) + \frac{1}{2} (‖ z_{k}^{#} ‖^{2} - γ^{2} ‖ w_{k} ‖^{2}) \leq 0$

(7.85)

is satisfied along the trajectories of the system (7.84). To achieve this, we proceed as in the continuous-time case, Chapter 5, and assume the existence of a C² function $W : X \times X \to ℜ$ such that $W (x^{#}) \geq 0$ for all $x^{#} \neq 0$ and $W (x^{#}) > 0$ for all $x \neq θ$. Further, set

$H_{2}^{#} (w) = W (f^{#} (x^{#}) + g^{#} (x^{#}) w) - W (x^{#}) + H_{2} (w, \hat{u}^{⋆} (θ)) - \hat{H}_{2} (\hat{w}^{⋆} (x), \hat{u}^{⋆} (x)),$

(7.86)

and recall that

$H_{2} (w, u) = {\hat{H}}_{2} (w, u) + R_{m} (x, g_{1} (x) w + g_{2} (x) u);$

so that

$H_{2} (w, \hat{u}^{⋆} (θ)) = \hat{H}_{2} (w, \hat{u}^{⋆} (θ)) + R_{m} (x, g_{1} (x) w + g_{2} (x) \hat{u}^{⋆} (θ)) .$

Moreover, by definition

$\begin{array}{l} H_{2} (w, \hat{u}^{⋆} (θ)) = V (f (x) + g_{1} (x) w + g_{2} (x) \hat{u}^{⋆} (θ)) - V (x) + \frac{1}{2} ‖ h_{1} (x) + \\ k_{11} (x) w + k_{12} (x) \hat{u}^{⋆} (θ) ‖^{2} - \frac{1}{2} γ^{2} ‖ w ‖^{2} . \end{array}$

(7.87)

Therefore, substituting (7.87) in (7.86) and rearranging, we get

$\begin{array}{l} H_{2}^{#} (w) + {\hat{H}}_{2} ({\hat{u}}^{⋆} (x), {\hat{w}}^{⋆} (x)) = W (f^{#} (x^{#}) + g^{#} (x^{#}) w) - W (x^{#}) + \\ V (f (x) + g_{1} (x) w + g_{2} (x) {\hat{u}}^{⋆} (θ)) - V (x) + \frac{1}{2} | | h^{#} (x^{#}) + k^{#} (x^{#}) w | |^{2} - \frac{1}{2} γ^{2} | | w | |^{2} \end{array}$

(7.88)

From the above identity (7.88), it is clear that, if the right-hand-side is nonpositive, then the positive-definite function

$Ψ_{1} (x^{#}) = W (x^{#}) + V (x)$

will indeed satisfy the dissipation-inequality (7.85) along the trajectories of the closed-loop system (7.84). Moreover, if we assume the hypotheses of Theorem 7.4.1 (respectively Lemma 7.4.1) hold, then the term $\hat{H}_{2} (\hat{w}^{⋆} (x), \hat{u}^{⋆} (x))$ is nonpositive. Therefore, it remains to require that $H_{2}^{#} (w)$ also be nonpositive for all w.

One way to achieve the above objective, is to impose the condition that

$\max_{w} H_{2}^{#} (w) < 0.$

However, finding a closed-form expression for $w^{⋆ ⋆} = \arg \max_{w} \{ H_{2}^{#} (w) \}$ is in general not possible, as observed in the previous section. Thus, we again resort to an approximate but practical approach. Accordingly, we can replace the term $W (f^{#} (x^{#}) + υ^{#})$ in $H_{2}^{#} (.)$ by its second-order Taylor approximation:

$W (f^{#} (x^{#}) + υ^{#}) = W (f^{#} (x^{#})) + W_{x #} (f^{#} (x^{#})) υ^{#} + \frac{1}{2} υ^{#^{T}} W_{x^{#} x^{#}} (f^{#} (x^{#})) υ^{#} + R_{m}^{#} (x^{#}, υ^{#})$

for any $υ^{#} \in X \times X$, where $R_{m}^{#} (x^{#}, υ^{#})$ is the remainder term. The last term in (7.86), recalling equation (7.75), can be represented as

$\begin{array}{l} H_{2} (w, {\hat{u}}^{⋆} (θ)) - {\hat{H}}_{2} ({\hat{w}}^{⋆} (x), {\hat{u}}^{⋆} (x)) = \frac{1}{2} {[\begin{array}{l} w - {\hat{w}}^{⋆} (x) \\ {\hat{u}}^{⋆} (θ) - {\hat{u}}^{⋆} (x) \end{array}]}^{T} \hat{R} (x) [\begin{array}{l} w - {\hat{w}}^{⋆} (x) \\ {\hat{u}}^{⋆} (θ) - {\hat{u}}^{⋆} (x) \end{array}] \\ + R_{m} (x, g_{1} (x) w + g_{2} (x) {\hat{u}}^{⋆} (θ)) . \end{array}$

Similarly, observe that R_m(x, g₁(x)w + g₂(x)û^⋆ (θ)) can be expanded as a function of $x^{#}$ with respect to w as

$R_{m} (x, g_{1} (x) w + g_{2} (x) {\hat{u}}^{⋆} (θ)) = R_{m 0} (x^{#}) + R_{m 1} (x^{#}) w + w^{T} R_{m 2} (x^{#}) w + R_{m 3} (x^{#}, w) .$

Thus, we can now approximate $H_{2}^{#} (w)$ with the function

$\begin{array}{l} \bar{H}_{2}^{#} (w) = W (f^{#} (x^{#})) - W (x^{#}) + R_{m 0} (x^{#}) + (R_{m 1} (x^{#}) + W_{x^{#}} (f^{#} (x^{#})) g^{#} (x^{#})) w + \\ w^{T} (R_{m 2} (x^{#}) + \frac{1}{2} g^{#^{T}} (x^{#}) W_{x^{#} x^{#}} (f^{#} (x^{#})) g^{#} (x^{#})) w + \\ \frac{1}{2} [\begin{array}{l} w - \hat{w}^{⋆} (x) \\ \hat{u}^{⋆} (θ) - \hat{u}^{⋆} (x) \end{array}]^{T} \hat{R} (x) [\begin{array}{l} w - \hat{w}^{⋆} (x) \\ \hat{u}^{⋆} (θ) - \hat{u}^{⋆} (x) \end{array}] . \end{array}$

(7.89)

Moreover, we can now determine an estimate $\hat{w}^{⋆ ⋆}$ of $w^{⋆ ⋆}$ from the above expression (7.89) for $\bar{H}_{2}^{#} (w)$ by taking derivatives with respect to w and solving the linear equation

$\frac{\partial {\bar{H}}_{2}^{#}}{\partial w} ({\hat{w}}^{⋆ ⋆}) = 0 .$

It can be shown that, if the matrix

$\bar{R} (x^{#}) = \frac{1}{2} g^{#} {(x^{#})}^{T} W_{x^{#} x^{#}} (f^{#} (x^{#})) g^{#} (x^{#}) + {\hat{R}}_{11} (x) + R_{m 2} (x^{#})$

is nonsingular and negative-definite, then $\hat{w}^{⋆ ⋆}$ is unique, and is a maximum for $\bar{H}_{2}^{#} (w)$. The design procedure outlined above can now be summarized in the following theorem.

Theorem 7.4.2 Consider the nonlinear discrete-time system (7.1) and suppose the following hold:

(i) there exists a smooth positive-definite function V defined on a neighborhood X₀ ⊂ $X$ of x = 0, satisfying the hypotheses of Lemma 7.4.1.

(ii) there exists a smooth positive-semidefinite function $W (x^{#})$, defined on a neighborhood Ξ of $x^{#} = 0$ in $X \times X$, such that $W (x^{#}) > 0$ for all $x \neq θ$, and satisfying

$\bar{H}_{2}^{#} (\hat{w}^{⋆ ⋆} (x^{#})) < 0$

for all $0 \neq x^{#} \in Ξ$. Moreover, there exists a number δ > 0 such that

$\begin{array}{l} \bar{H}_{2}^{#} (\hat{w}^{⋆ ⋆} (x^{#})) < - δ ‖ \hat{w}^{⋆ ⋆} (x^{#}) ‖^{2} \\ \bar{R} (x^{#}) < 0 \end{array}$

for all $x^{#} \in Ξ$.

Then, the controller $\bar{\Sigma}_{d y n o b s}^{d a c}$ given by (7.83) locally asymptotically stabilizes the closed-loop system (7.84), and for every K ∈ Z₊, there is a number ε > 0 such that the response from the initial state (x₀, θ₀) = (0, 0) satisfies

$\sum_{k = 0}^{K} z_{k}^{T} z_{k} \leq γ^{2} \sum_{k = 0}^{K} w_{k}^{T} w_{k}$

for every sequence w = (w₀,…, w_K) such that ‖w_k‖² < ε.

7.5 Notes and Bibliography

This chapter is entirely based on the papers by Lin and Byrnes [182]-[185]. In particular, the discussion on controller parameterization is from [184]. The results for stable plants can also be found in [51]. Similarly, the results for sampled-data systems have not been discussed here, but can be found in [124, 213, 255].

The alternative and approximate approach for solving the discrete-time problems is mainly from Reference [126], and approximate approaches for solving the DHJIE can also be found in [125]. An information approach to the discrete-time problem can be found in [150], and connections to risk-sensitive control in [151, 150].
