5 State-Feedback Nonlinear H∞-Control for Continuous-Time Systems

Search in book...
Toggle Font Controls
Create new playlist

Name your new playlist

Playlist description (optional)
Sign In

Email address

Password

Forgot Password?

or

Continue with Facebook

Continue with Google
Sign Up

Full Name

Email address

Confirm Email Address

Password

or

Continue with Facebook

Continue with Google

In this chapter, we discuss the nonlinear H∞ $H_{\infty}$ sub-optimal control problem for continuous-time affine nonlinear systems using state-feedback. This problem arises when the states of the system are available, or can be measured directly and used for feedback. We derive sufficient conditions for the solvability of the problem, and we discuss the results for both time-invariant (or autonomous) systems and time-varying (or nonautonomous) systems, as well as systems with a delay in the state. We also give a parametrization of all full-information stabilizing controllers for each system. Moreover, understanding the state-feedback problem will facilitate the understanding of the dynamic measurement-feedback problem which is discussed in the subsequent chapter.

The problem of robust control in the presence of modelling errors and/or parameter variations is also discussed. Sufficient conditions for the solvability of this problem are given, and a class of controllers is presented.

5.1 State-Feedback H∞ $H_{\infty}$ -Control for Affine Nonlinear Systems

The set-up for this configuration is shown in Figure 5.1, where the plant is represented by an affine causal state-space system defined on a smooth n-dimensional manifold χ ⊆ ℜⁿ in local coordinates x = (x₁,…, x_n):

Σa:⎧⎩⎨⎪⎪x˙=f(x)+g1(x)w+g2(x)u; x(t0)=x0y =xz=h1(x)+k12(x)u $Σ^{a} : {\begin{cases} \dot{x} = f (x) + g_{1} (x) w + g_{2} (x) u; x (t_{0}) = x_{0} \\ y = x \\ z = h_{1} (x) + k_{12} (x) u \end{cases}$

(5.1)

where x ∈ X $X$ is the state vector, u ∈ U $U$ ⊆ ℜ^p is the p-dimensional control input, which belongs to the set of admissible controls U,w∈W $U, w \in W$ is the disturbance signal, which belongs to the set W⊂L2([t0,∞),Rr) $W \subset ℒ_{2} ([t_{0}, \infty), ℜ^{r})$ of admissible disturbances, the output y ∈ ℜⁿ is the states-vector of the system which is measured directly, and z ∈ ℜ^s is the output to be controlled. The functions f: X →V∞(X) ,g1: X →Mn×r(X ), g2: X →Mn×p(X ),h1:X →Rs, and k12 : X →Mp×m(X ) $f : X \to V^{\infty} (X), g_{1} : X \to ℳ^{n \times r} (X), g_{2} : X \to ℳ^{n \times p} (X), h_{1} : X \to ℜ^{s}, and k_{12} : X \to ℳ^{p \times m} (X)$ are assumed to be real C^∞-functions of x. Furthermore, we assume without loss of generality that x = 0 is a unique equilibrium point of the system with u = 0, w = 0, and is such that f(0) = 0, h₁(0) = 0. We also assume that the system is well defined, i.e., for any initial state x(t₀) ∈ X $X$ and any admissible input u(t) ∈ U $U$ , there exists a unique solution x(t, t₀, x₀, u) to (5.1) on [t₀, ∞) which continuously depends on the initial conditions, or the system satisfies the local existence and uniqueness theorem for ordinary differential-equations [157].

Again Figure 5.1 also shows that for this configuration, the states of the plant are accessible and can be directly measured for the purpose of feedback control. We begin with the definition of smooth-stabilizability and also recall the definition of L2 $ℒ_{2}$ -gain of the system Σ^a.

FIGURE 5.1
Feedback Configuration for State-Feedback Nonlinear H∞ $H_{\infty}$ -Control

Definition 5.1.1 (Smooth-stabilizability). The nonlinear system Σ^a (or simply [f, g₂]) is locally smoothly-stabilizable if there exists a C⁰-function F : U $U$ ⊂ X $X$ → ℜ^p, F (0) = 0, such that x˙ $\dot{x}$ = f(x)+g₂(x)F (x) is locally asymptotically stable. The system is smoothly-stabilizable if U = X $X$ .

Definition 5.1.2 The nonlinear system Σ^a is said to have locally L2 $ℒ_{2}$ -gain from w to z in U ⊂ X $X$ , less than or equal to γ, if for any x₀ ∈ U $U$ and fixed u, the response z of the system corresponding to any w ∈ W $W$ satisfies:

∫Tt0∥z(t)∥2dt≤ γ2∫Tt0∥w(t)∥2dt+β(x0), ∀T>t0, $\int_{t_{0}}^{T} {‖ z (t) ‖}^{2} d t \leq γ^{2} \int_{t_{0}}^{T} {‖ w (t) ‖}^{2} d t + β (x_{0}), \forall T > t_{0},$

for some bounded C⁰ function β : U → ℜ such that β(0) = 0. The system has L2 $ℒ_{2}$ -gain ≤ γ if the above inequality is satisfied for all x ∈ X $X$ , or U =X $X$ .

Since we are interested in designing smooth feedback laws for the system to make it asymptotically or internally stable, the requirement of smooth-stabilizability for the system will obviously be necessary for the solvability of the problem before anything else. The suboptimal state-feedback nonlinear H∞ $H_{\infty}$ -control or local disturbance-attenuation problem with internal stability, can then be formally defined as follows.

Definition 5.1.3 (State-Feedback Nonlinear H∞ $H_{\infty}$ (Suboptimal)-Control Problem (SFBNLHICP)). The state-feedback ℌ_∞ suboptimal control or local disturbance-attenuation problem with internal stability for the system Σ^a, is to find a static state-feedback control function of the form

u=α(x,t), α:R+×N→RP, N⊂X $u = α (x, t), α : ℜ_{+} \times N \to ℜ^{P}, N \subset X$

(5.2)

for some smooth function α depending on x and possibly t only, such that the closed-loop system:

∑aclp:{x˙=f(x)+g1(x)w+g2(x)α(x,t); x(t0)=x0z=h1(x)+k12(x)α(x,t) $\sum_{c l p}^{a} : {\begin{cases} \dot{x} = f (x) + g_{1} (x) w + g_{2} (x) α (x, t); x (t_{0}) = x_{0} \\ z = h_{1} (x) + k_{12} (x) α (x, t) \end{cases}$

(5.3)

has, for all initial conditions x(t₀) ∈ N, locally L2 $ℒ_{2}$ -gain from the disturbance signal w to the output z less than or equal to some prescribed number γ^{⋆ $⋆$} > 0 with internal stability, or equivalently, the closed-loop system achieves local disturbance-attenuation less than or equal to γ^{⋆ $⋆$} with internal stability.

Internal stability of the system in the above definition means that all internal signals in the system, or trajectories, are bounded, which is also equivalent to local asymptotic-stability of the closed-loop system with w = 0 in this case.

Remark 5.1.1 The optimal problem in the above definition is to find the minimum γ^{⋆ $⋆$} > 0 for which the L2 $ℒ_{2}$ -gain is minimized. This problem is however more difficult to solve.

One way to measure the L2 $ℒ_{2}$ -gain (or with an abuse of the terminology, H∞ $H_{\infty}$ -norm) of the system (5.1), is to excite it with a periodic input w_T ∈ W, where W ⊂ W $W$ is the subspace of periodic continuous-time functions (e.g., a sinusoidal signal), and to measure the steady-state output response z_ss(.) corresponding to the steady-state state response x_ss(.). Then the L2 $ℒ_{2}$ -gain can be calculated as

∥∥∥Σa∥∥∥H∞=supw∈W∥zss∥T∥w∥T $‖ Σ^{a} ‖ H_{\infty} = \sup_{w \in W} \frac{‖ z_{s s} ‖ T}{‖ w ‖ T}$

where

∥w∥T=1T(∫t0+Tt0∥w(s)∥2ds)12, ∥zss∥T=1T(∫t0+Tt0∥w(s)∥2ds)12. $‖ w ‖ T = \frac{1}{T} {(\int_{t_{0}}^{t_{0} + T} {‖ w (s) ‖}^{2} d s)}^{\frac{1}{2}}, ‖ z_{s s} ‖ T = \frac{1}{T} {(\int_{t_{0}}^{t_{0} + T} {‖ w (s) ‖}^{2} d s)}^{\frac{1}{2}} .$

Returning now to the SFBNLHICP, to derive sufficient conditions for the solvability of this problem, we apply the theory of differential games developed in Chapter 2. It is fairly clear that the problem of choosing a control function u^{⋆ $⋆$}(.) such that the L2 $ℒ_{2}$ -gain of the closed-loop system from w to z is less than or equal to γ > 0, can be formulated as a two-player zero-sum differential game with u the minimizing player’s decision, w the maximizing player’s decision, and the objective functional:

minu∈U maxw∈W J(u,w)=12∫Tt0[∥z(t)∥2−γ2∥w(t)∥2]dt, $\min_{u \in U} \max_{w \in W} J (u, w) = \frac{1}{2} \int_{t_{0}}^{T} [{‖ z (t) ‖}^{2} - γ^{2} {‖ w (t) ‖}^{2}] d t,$

(5.4)

subject to the dynamical equations (5.1) over a finite time-horizon T > t₀.

At this point, we separate the problem into two subproblems; namely, (i) achieving local disturbance-attenuation and (ii) achieving local asymptotic-stability. To solve the first problem, we allow w to vary over all possible disturbances including the worst-case disturbance, and search for a feedback control function u:X×R→U $u : X \times ℜ \to U$ depending on the current state information, that minimizes the objective functional J(., .) and renders it nonpositive for all w starting from x₀ = 0. By so doing, we have the following result.

Proposition 5.1.1 Suppose for γ = γ^{⋆ $⋆$} there exists a locally defined feedback-control function u^{⋆ $⋆$} : N × ℜ → ℜ^p, 0 ∈ N ⊂ X $X$ , which is possibly time-varying, and renders J(., .) nonpositive for the worst possible disturbance w^{⋆ $⋆$} ∈ W $W$ (and hence for all w ∈ W $W$ ) for all T > 0. Then the closed-loop system has locally L2 $ℒ_{2}$ -gain ≤ γ^{⋆ $⋆$}.

Proof:

J(u⋆,w⋆)≤0⇒∥z∥L2[t0,T]≤γ⋆∥w∥L2[t0,T] ∀T>0. □ $J (u^{⋆}, w^{⋆}) \leq 0 \Rightarrow ‖ z ‖ L_{2} [t_{0}, T] \leq γ^{⋆} ‖ w ‖ L_{2} [t_{0}, T] \forall T > 0. □$

To derive the sufficient conditions for the solvability of the first sub-problem, we define the value-function for the game V : X $X$ × [0, T ] → ℜ as

V(t,x)=infu supw12∫Tt[∥z(τ)∥2−γ∥w(τ)∥2]dτ $V (t, x) = \inf_{u} \sup_{w} \frac{1}{2} \int_{t}^{T} [{‖ z (τ) ‖}^{2} - γ {‖ w (τ) ‖}^{2}] d τ$

and apply Theorem 2.4.2 from Chapter 2. Consequently, we have the following theorem.

Theorem 5.1.1 Consider the SFBNLHICP problem as a two-player zero-sum differential game with the cost functional (5.4). A pair of strategies (u^{⋆ $⋆$} (x, t), w^{⋆ $⋆$} (x, t)) provides, under feedback information structure, a saddle-point solution to the game such that

J(u⋆,w)≤J(u⋆,w⋆)≤J(u,w⋆), $J (u^{⋆}, w) \leq J (u^{⋆}, w^{⋆}) \leq J (u, w^{⋆}),$

if the value-function V is C¹ and satisfies the HJI-PDE (HJIE):

−Vt(x,t) = minu supw{Vx(x,t)[f(x)+g1(x)w+g2(x)u]+12(∥z∥2− γ2∥w∥2)} = supw minu{Vx(x,t)[f(x)+g1(x)w+g2(x)u]+12(∥z∥2− 12γ2∥w∥2)} =Vx(x,t)[f(x)+g1(x)w⋆(x,t)+g2(x)u⋆(x,t)]+12∥h1(x)+k12(x)u⋆(x,t)∥2− 12γ2∥w⋆(x,t)∥2; V(x,T)=0. $\begin{array}{l} - V_{t} (x, t) = \min_{u} \sup_{w} {V_{x} (x, t) [f (x) + g 1 (x) w + g 2 (x) u] + \frac{1}{2} ({‖ z ‖}^{2} - γ^{2} {‖ w ‖}^{2})} \\ = \sup_{w} \min_{u} {V_{x} (x, t) [f (x) + g 1 (x) w + g 2 (x) u] + \frac{1}{2} ({‖ z ‖}^{2} - \frac{1}{2} γ^{2} {‖ w ‖}^{2})} \\ = V_{x} (x, t) [f (x) + g 1 (x) w^{⋆} (x, t) + g 2 (x) u^{⋆} (x, t)] + \frac{1}{2} {‖ h_{1} (x) + k_{12} (x) u^{⋆} (x, t) ‖}^{2} - \\ \frac{1}{2} γ^{2} {‖ w^{⋆} (x, t) ‖}^{2}; V (x, T) = 0. \end{array}$

(5.5)

Next, to find the pair of feedback strategy (u^{⋆ $⋆$}, w^{⋆ $⋆$} ) that satisfies Isaac’s equation (5.5), we form the Hamiltonian function H:T⋆X×U×W→R $H : T^{⋆} X \times U \times W \to ℜ$ for the problem:

H(x,p,u,w)=pT(f(x)+g1(x)w+g2(x)u)+12∥h1(x)+k12(x)u∥2−12γ2∥w∥2, $H (x, p, u, w) = p^{T} (f (x) + g_{1} (x) w + g_{2} (x) u) + \frac{1}{2} {‖ h_{1} (x) + k_{12} (x) u ‖}^{2} - \frac{1}{2} γ^{2} {‖ w ‖}^{2},$

(5.6)

and search for a unique saddle-point (u^{⋆ $⋆$}, w^{⋆ $⋆$} ) such that

H(x,p,u⋆,w)≤H(x,p,u⋆,w⋆)≤H(x,p,u,w⋆) $H (x, p, u^{⋆}, w) \leq H (x, p, u^{⋆}, w^{⋆}) \leq H (x, p, u, w^{⋆})$

(5.7)

for each (u, w) and each (x, p), where p is the adjoint variable.

Since the function H(., ., ., .) is C² in both u and w, the above problem can be solved by applying the necessary conditions for an unconstrained optimization problem. However, the only problem that might arise is if the coefficient matrix of u is singular. This more general problem will be discussed in Chapter 9. But in the meantime to overcome this problem, we need the following assumption.

Assumption 5.1.1 The matrix

R(x)=kT12(x)k12(x) $R (x) = k_{12}^{T} (x) k_{12} (x)$

is nonsingular for all x ∈ X $X$ .

Under the above assumption, the necessary conditions for optimality for u and w provided by the minimum (maximum) principle [175] are

∂H∂u(u⋆,w)=0, ∂H∂w(u,w⋆)=0 $\frac{\partial H}{\partial u} (u^{⋆}, w) = 0, \frac{\partial H}{\partial w} (u, w^{⋆}) = 0$

for all (u, w). Application of these conditions gives

u⋆(x,p) = −R−1(x)(gT2(x)p+kT12(x)h1(x)), $u^{⋆} (x, p) = - R^{- 1} (x) (g_{2}^{T} (x) p + k_{12}^{T} (x) h_{1} (x)),$

(5.8)

w⋆(x,p) = 1γ2gT1(x)p. $w^{⋆} (x, p) = \frac{1}{γ^{2}} g_{1}^{T} (x) p .$

(5.9)

Moreover, since by assumption R(.) is nonsingular and therefore positive-definite, and γ > 0, the above equilibrium-point is clearly an optimizer of J(u, w). Further, we can write

H(x,p,u,w)=H⋆(x,p)+12∥u−u⋆∥2R(x)−12γ2∥w−w⋆∥2, $H (x, p, u, w) = H^{⋆} (x, p) + \frac{1}{2} {‖ u - u^{⋆} ‖}_{R (x)}^{2} - \frac{1}{2} γ^{2} {‖ w - w^{⋆} ‖}^{2},$

(5.10)

where

H⋆(x,p)=H(x,p,u⋆(x,p),w⋆(x,p) ) $H^{⋆} (x, p) = H (x, p, u^{⋆} (x, p), w^{⋆} (x, p))$

and the notation ‖a‖ _Q stands for a^TQa for any a ∈ ℜⁿ, Q ∈ ℜ^n×n. Substituting u^{⋆ $⋆$} and w^{⋆ $⋆$} in turns in (5.10) show that the saddle-point conditions (5.7) are satisfied.

Now assume that there exists a C¹ positive-semidefinite solution V : X $X$ → ℜ to Isaac’s equation (5.5) which is defined in a neighborhood N of the origin, that vanishes at x = 0 and is time-invariant (this assumption is plausible since H(., ., ., .) is time-invariant). Then the feedbacks (u^{⋆ $⋆$}, w^{⋆ $⋆$}) necessarily exist, and choosing

p=VTx(x) $p = V_{x}^{T} (x)$

in (5.10) yields the identity:

H(x,VTx(x),w,u) = Vx(x)(f(x)+g1(x)w+g2(x)u)+12∥h1(x)+k12(x)u∥2−12γ2∥w∥2 =H⋆(x,VTx(x))+12∥u−u⋆∥2R(x)+12γ2∥w−w⋆∥2. $\begin{array}{l} H (x, V_{x}^{T} (x), w, u) = V_{x} (x) (f (x) + g_{1} (x) w + g_{2} (x) u) + \frac{1}{2} {‖ h_{1} (x) + k_{12} (x) u ‖}^{2} - \frac{1}{2} γ^{2} {‖ w ‖}^{2} \\ = H^{⋆} (x, V_{x}^{T} (x)) + \frac{1}{2} {‖ u - u^{⋆} ‖}_{R (x)}^{2} + \frac{1}{2} γ^{2} {‖ w - w^{⋆} ‖}^{2} . \end{array}$

Finally, notice that for u = u^{⋆ $⋆$} and w = w^{⋆ $⋆$}, the above identity yields

H(x,VTx(x),w⋆,u⋆)= H⋆(x,VTx(x)) $H (x, V_{x}^{T} (x), w^{⋆}, u^{⋆}) = H^{⋆} (x, V_{x}^{T} (x))$

which is exactly the right-hand-side of (5.5), and for this equation to be satisfied, V(.) must be such that

H⋆(x,VTx(x))=0 $H^{⋆} (x, V_{x}^{T} (x)) = 0$

(5.11)

The above condition (5.11) is the time-invariant HJIE for the disturbance-attenuation problem. Integration of (5.11) along the trajectories of the closed-loop system with α(x) = u⋆(x,VTx(x)) $α (x) = u^{⋆} (x, V_{x}^{T} (x))$ (independent of t!) starting from t = t₀ and x(t₀) = x₀, to t = T > t₀ and x(T ) yields

V(x(T))−V(x0)≤12∫Tt0(γ2∥w∥2−∥z∥2)dt≥0 ∀w∈W. $V (x (T)) - V (x_{0}) \leq \frac{1}{2} \int_{t_{0}}^{T} (γ^{2} {‖ w ‖}^{2} - {‖ z ‖}^{2}) d t \geq 0 \forall w \in W .$

This means J(u^{⋆ $⋆$}, w) is nonpositive for all w ∈ W $W$ , and consequently implies the L2 $ℒ_{2}$ -gain of the system is less than or equal to γ. This also solves part (i) of the state-feedback suboptimal H∞ $H_{\infty}$ -control problem. Before we consider part (ii) of the problem, we make the following simplifying assumption.

Assumption 5.1.2 The output vector h₁(.) and weighting matrix k₁₂(.) are such that

kT12(x)k12(x)=I $k_{12}^{T} (x) k_{12} (x) = I$

and

hT1(x)k12(x)=0 $h_{1}^{T} (x) k_{12} (x) = 0$

for all x ∈ X $X$ . Equivalently, we shall henceforth write z=[h1(x)u] $z = [\begin{matrix} h_{1} (x) \\ u \end{matrix}]$ under this assumption.

Remark 5.1.2 The above assumption implies that there are no cross-product terms in the performance or cost-functional (5.4), and the weighting on the control is unity.

Under the above assumption 5.1.2, the HJIE (5.11) becomes

V(x)(x)f(x)+12V(x)(x)[1γ2g1(x)gT1(x)−g2(x)gT1(x)]VTx(x)+12hT1(x)h1(x)=0, V(0)=0, $V_{(x)} (x) f (x) + \frac{1}{2} V_{(x)} (x) [\frac{1}{γ^{2}} g_{1} (x) g_{1}^{T} (x) - g_{2} (x) g_{1}^{T} (x)] V_{x}^{T} (x) + \frac{1}{2} h_{1}^{T} (x) h_{1} (x) = 0, V (0) = 0,$

(5.12)

and the feedbacks (5.8), (5.9) become

u⋆(x) = −gT2(x)VTx(x) $u^{⋆} (x) = - g_{2}^{T} (x) V_{x}^{T} (x)$

(5.13)

w⋆(x) = 1γ2gT1(x)VTx(x). $w^{⋆} (x) = \frac{1}{γ^{2}} g_{1}^{T} (x) V_{x}^{T} (x) .$

(5.14)

Thus, the above condition (5.12) together with the associated feedbacks (5.13), (5.14) provide a sufficient condition for the solvability of the state-feedback suboptimal H∞ $H_{\infty}$ problem on the infinite-time horizon when T → ∞.

On the other hand, let us consider the finite-horizon problem as defined by the cost functional (5.4) with T < ∞. Assuming there exists a time-varying positive-semidefinite C¹ solution V : X $X$ × ℜ → ℜ to the HJIE (5.5) such that

p=VTx(x,t), $p = V_{x}^{T} (x, t),$

then substituting in (5.8), (5.9) and the HJIE (5.5) under the Assumption 5.1.2, we have

u⋆(x,t) = −gT2(x)VTx(x,t) $u^{⋆} (x, t) = - g_{2}^{T} (x) V_{x}^{T} (x, t)$

(5.15)

w⋆(x,t) = 1γ2gT1(x)VTx(x,t), $w^{⋆} (x, t) = \frac{1}{γ^{2}} g_{1}^{T} (x) V_{x}^{T} (x, t),$

(5.16)

where V satisfies the HJIE

Vt(x,t)+Vx(x,t)f(x)+12Vx(x,t)[1γ2g1(x)gT1(x)−g2(x)gT2(x)]VTx(x,t)+ 12hT(x)h1(x)=0, V(x,T)=0. $\begin{array}{l} V_{t} (x, t) + V_{x} (x, t) f (x) + \frac{1}{2} V_{x} (x, t) [\frac{1}{γ^{2}} g_{1} (x) g_{1}^{T} (x) - g_{2} (x) g_{2}^{T} (x)] V_{x}^{T} (x, t) + \\ \frac{1}{2} h^{T} (x) h_{1} (x) = 0, V (x, T) = 0. \end{array}$

(5.17)

Therefore, the above HJIE (5.17) gives a sufficient condition for the solvability of the finite-horizon suboptimal H∞ $H_{\infty}$ control problem and the associated feedbacks.

Let us consider an example at this point.

Example 5.1.1 Consider the nonlinear system with the associated penalty function

x˙1 = x2x˙2 = −x1−12x32+x2w+uz = [x2u]. $\begin{array}{l} {\dot{x}}_{1} = x_{2} \\ {\dot{x}}_{2} = - x_{1} - \frac{1}{2} x_{2}^{3} + x_{2} w + u \\ z = [\begin{array}{l} x_{2} \\ u \end{array}] . \end{array}$

The HJIE (5.12) corresponding to this system and penalty function is given by

x2Vx1−x1Vx2−12x32Vx2+12x22(x22−γ2)γ2+12x22=0. $x_{2} V_{x_{1}} - x_{1} V_{x_{2}} - \frac{1}{2} x_{2}^{3} V_{x_{2}} + \frac{1}{2} \frac{x_{2}^{2} (x_{2}^{2} - γ^{2})}{γ^{2}} + \frac{1}{2} x_{2}^{2} = 0.$

Let γ = 1 and choose

Vx1=x1, Vx2 = x2. $V_{x_{1}} = x_{1}, V_{x_{2}} = x_{2} .$

Then we see that the HJIE is solved with V(x)=12(x21+x22) $V (x) = \frac{1}{2} (x_{1}^{2} + x_{2}^{2})$ which is positive-definite. The associated feedbacks are given by

u⋆=−x2, w⋆=x22. $u^{⋆} = - x_{2}, w^{⋆} = x_{2}^{2} .$

It is also interesting to notice that the above solution V to the HJIE (5.12) is also a Lyapunov-function candidate for the free system: x˙1=x2,x˙2=−x1−12x32. ${\dot{x}}_{1} = x_{2}, {\dot{x}}_{2} = - x_{1} - \frac{1}{2} x_{2}^{3} .$

Next, we consider the problem of asymptotic-stability for the closed-loop system (5.3), which is part (ii) of the problem. For this, let

α(x)=u⋆(x)=−gT2(x)VTx(x), $α (x) = u^{⋆} (x) = - g_{2}^{T} (x) V_{x}^{T} (x),$

where V(.) is a smooth positive-semidefinite solution of the HJIE (5.12). Then differentiating V along the trajectories of the closed-loop system with w = 0 and using (5.12), we get

V˙(x) =Vx(x)(f(x)−g2(x)gT2(x)Vx(x)) =−12∥u⋆∥ 2−12γ2∥w⋆∥2−12hT(x)h1(x)≤0, $\begin{array}{l} \dot{V} (x) = V_{x} (x) (f (x) - g_{2} (x) g_{2}^{T} (x) V_{x} (x)) \\ = - \frac{1}{2} ‖ u^{⋆} ‖^{2} - \frac{1}{2} γ^{2} {‖ w^{⋆} ‖}^{2} - \frac{1}{2} h^{T} (x) h_{1} (x) \leq 0, \end{array}$

where use has been made of the HJIE (5.12). Therefore, V˙ $\dot{V}$ is nonincreasing along trajectories of the closed-loop system, and hence the system is stable in the sense of Lyapunov. To prove local asymptotic-stability however, an additional assumption on the system will be necessary.

Definition 5.1.4 The nonlinear system Σ^a is said to be locally zero-state detectable if there exists a neighborhood U ⊂X $X$ of x = 0 such that, for all x(t₀) ∈ U $U$ , if z(t) ≡ 0, u(t) ≡ 0 for all t ≥ t₀, it implies that limt→∞x(t,t0,x0,u) = 0 $\lim_{t \to \infty} x (t, t_{0}, x_{0}, u) = 0$ . It is zero-state detectable if U = X $X$ .

Thus, if we assume the system Σ^a to be locally zero-state detectable, then it is seen that for any trajectory of the system x(t) ∈ U $U$ such that V˙(x(t)) ≡0 $\dot{V} (x (t)) \equiv 0$ for all t ≥ t_s for some t_s ≥ t₀, it is necessary that u(t) ≡ 0 and z(t) ≡ 0 for all t ≥ t_s. This by zero-state detectability implies that limt→∞x(t)=0. $\lim_{t \to \infty} x (t) = 0.$ Finally, since x = 0 is the only equilibrium-point of the system in U $U$ , by LaSalle’s invariance-principle, we can conclude local asymptotic-stability.

The above result is summarized as the solution to the state-feedback H∞ $H_{\infty}$ sub-optimal control problem (SFBNLHICP) in the next theorem after the following definition.

Definition 5.1.5 A nonnegative function V : X $X$ → ℜ is proper if the level set V⁻¹([0, a]) = {x ∈ X $X$ |0 ≤ V (x) ≤ a} is compact for each a > 0.

Theorem 5.1.2 Consider the nonlinear system Σ^a and the SFBNLHICP for the system. Assume the system is smoothly-stabilizable and locally zero-state detectable in N ⊂ X $X$ . Suppose also there exists a smooth positive-semidefinite solution to the HJIE (5.12) in N. Then the control law

u⋆=α(x)=−gT2(x)VTx(x), x∈N $u^{⋆} = α (x) = - g_{2}^{T} (x) V_{x}^{T} (x), x \in N$

(5.18)

solves the SFBNLHICP locally in N. If in addition Σ^a is globally zero-state detectable and V is proper, then u^{⋆ $⋆$} solves the problem globally.

Proof: The first part of the theorem has already been proven in the above developments. For the second part regarding global asymptotic-stability, note that, if V is proper, then V is a global solution of the HJIE (5.11), and the result follows by application of LaSalle’s invariance-principle from Chapter 1 (see also the References [157, 268]). □

The existence of a C² solution to the HJIE (5.12) is related to the existence of an invariant-manifold for the corresponding Hamiltonian system:

XH⋆γ:⎧⎩⎨⎪⎪dxdt = ∂H⋆γ(x,p)∂pdpdt = −∂H⋆γ(x,p)∂x, $X_{H_{γ}^{⋆}} : {\begin{cases} \frac{d x}{d t} = \frac{\partial H_{γ}^{⋆} (x, p)}{\partial p} \\ \frac{d p}{d t} = - \frac{\partial H_{γ}^{⋆} (x, p)}{\partial x}, \end{cases}$

(5.19)

where

H⋆γ(x,p)=pTf(x)+12pT[1γ2g1(x)gT1(x)−g2(x)gT2(x)]P+12hT1(x)h1(x). $H_{γ}^{⋆} (x, p) = p^{T} f (x) + \frac{1}{2} p^{T} [\frac{1}{γ^{2}} g_{1} (x) g_{1}^{T} (x) - g_{2} (x) g_{2}^{T} (x)] P + \frac{1}{2} h_{1}^{T} (x) h_{1} (x) .$

It can be seen then that, if V is a C² solution of the Isaacs equation, then differentiating H^{⋆ $⋆$} (x, p) in (5.11) with respect to x we get

(∂H⋆γ∂x)p=VTx+(∂H⋆γ∂p)p=VTx∂VTx∂x=0, ${(\frac{\partial H_{γ}^{⋆}}{\partial x})}_{p = V_{x}^{T}} + {(\frac{\partial H_{γ}^{⋆}}{\partial p})}_{p = V_{x}^{T}} \frac{\partial V_{x}^{T}}{\partial x} = 0,$

and since the Hessian matrix

∂VTx∂x $\frac{\partial V_{x}^{T}}{\partial x}$

is symmetric, it implies that the submanifold

M={(x,p):p=VTx(x)} $M= {(x, p) : p = V_{x}^{T} (x)}$

(5.20)

is invariant under the flow of the Hamiltonian vector-field XH⋆γ $X_{H_{γ}^{⋆}}$ , i.e.,

(∂H⋆γ∂x)p=VTx=−(∂H⋆γ∂p)p=VTx∂VTx∂x. ${(\frac{\partial H_{γ}^{⋆}}{\partial x})}_{p = V_{x}^{T}} = - {(\frac{\partial H_{γ}^{⋆}}{\partial p})}_{p = V_{x}^{T}} \frac{\partial V_{x}^{T}}{\partial x} .$

The above developments have considered the SFBNLHICP from a differential games perspective. In the next section, we consider the same problem from a dissipative point of view.

Remark 5.1.3 With p=VTx(x) $p = V_{x}^{T} (x)$ , for some smooth solution V ≥ 0 of the HJIE (5.12), the disturbance w⋆=1γ2g1(x)VTx(x), x∈X $w^{⋆} = \frac{1}{γ^{2}} g_{1} (x) V_{x}^{T} (x), x \in X$ is referred to as the worst-case disturbance affecting the system. Hence the title “worst-case” design for H∞ $H_{\infty}$ -control design.

Let us now specialize the results of Theorem 5.1.2 to the linear system

Σl:⎧⎩⎨⎪⎪x˙ = Fx+G1w+G2u; x(0) =x0z = [H1(x) u] , $Σ^{l} : {\begin{cases} \dot{x} = F x + G_{1} w + G_{2} u; x (0) = x_{0} \\ z = [\begin{array}{l} H_{1} (x) \\ u \end{array}], \end{cases}$

(5.21)

where F ∈ ℜ^n×n, G₁ ∈ ℜ^n×r, G₂ ∈ ℜ^n×p, and H₁ ∈ ℜ^m×n are constant matrices. Also, let the transfer function w ↦ z be T_zw, and assume x(0) = 0. Then the H∞ $H_{\infty}$ -norm of the system from w to z is defined by

∥∥∥Tzw∥∥∥∞≜ sup0≠w∈L2[0,∞)∥z∥2∥w∥2. $‖ T_{z w} ‖_{\infty} ≜ \sup_{0 \neq w \in L_{2} [0, \infty)} \frac{‖ z ‖_{2}}{‖ w ‖_{2}} .$

We then have the following corollary to the theorem.

Corollary 5.1.1 Consider the linear system (5.21) and the SFBNLHICP for it. Assume (F, G₂) is stabilizable and (H₁, F) is detectable. Further, suppose for some γ > 0, there exists a symmetric positive-semidefinite solution P ≥ 0 to the algebraic-Riccati equation (ARE):

FTP+PF+P[1γ2G1GT1−G2GT2]P+HT1H1=0. $F^{T} P + P F + P [\frac{1}{γ^{2}} G_{1} G_{1}^{T} - G_{2} G_{2}^{T}] P + H_{1}^{T} H_{1} = 0.$

(5.22)

Then the control law

u=−GT2Px $u = - G_{2}^{T} P_{x}$

solves the SFBNLHICP for the system Σ^l, i.e., renders its H∞ $H_{\infty}$ -norm less than or equal to a prescribed number γ > 0 and (F−G2GT2P) $(F - G_{2} G_{2}^{T} P)$ is asymptotically-stable or Hurwitz.

Remark 5.1.4 Note that the assumptions (F, G₂) stabilizable and (H₁, F ) detectable in the above corollary actually guarantee the existence of a symmetric solution P ≥ 0 to the Riccati equation (5.22) [292]. Moreover, any solution P = P^T ≥ 0 of (5.22) is stabilizing.

Remark 5.1.5 Again, the assumption (H₁, F) detectable in the corollary can be replaced by the linear equivalent of the zero-state detectability assumption for the nonlinear case, which is

rank(A−jωIHG2I)=n+m ∀ω∈R. $r a n k (\begin{matrix} A - j ω I & G_{2} \\ H & I \end{matrix}) = n + m \forall ω \in ℜ .$

This condition also means that the system does not have a stable unobservable mode on the jω-axis.

The converse of Corollary 5.1.1 also holds, and is stated in the following theorem which is also known as the Bounded-real lemma [160].

Theorem 5.1.3 Assume (H₁, F) is detectable and let γ > 0. Then there exists a linear feedback-control

u=K x $u = K x$

such that the closed-loop system (5.21) with this feedback is asymptotically-stable and has L2 $ℒ_{2}$ -gain ≤ γ if, and only if, there exists a solution P ≥ 0 to (5.22). In addition, if P = P^T ≥ 0 is such that

σ(F−G2GT2P+1γ2G1GT1P)⊂C−, $σ (F - G_{2} G_{2}^{T} P + \frac{1}{γ^{2}} G_{1} G_{1}^{T} P) \subset C^{-},$

where σ(.) denotes the spectrum of (.), then ∥Tzw∥∞<γ ${‖ T_{z w} ‖}_{\infty} < γ$ .

5.1.1 Dissipative Analysis

In this section, we reconsider the SFBNLHICP for the affine nonlinear system (5.1) from a dissipative system’s perspective developed in Chapter 3 (see also [131, 223]). In this respect, the first part of the problem (subproblem (i)) can be regarded as that of finding a static state-feedback control function u = α(x) such that the closed-loop system (5.3) is rendered dissipative with respect to the supply-rate

s(w(t),z(t))=12(γ2∥w(t)∥2−∥z(t)∥2) $s (w (t), z (t)) = \frac{1}{2} (γ^{2} {‖ w (t) ‖}^{2} - {‖ z (t) ‖}^{2})$

and a suitable storage-function. For this purpose, we first recall the following definition from Chapter 3.

Definition 5.1.6 The nonlinear system (5.1) is locally dissipative with respect to the supply-rate s(w,z)=12(γ2∥w∥2−∥z∥2) $s (w, z) = \frac{1}{2} (γ^{2} {‖ w ‖}^{2} - {‖ z ‖}^{2})$ , if there exists a storage-function V : N ⊂ X $X$ → ℜ₊ such that for any initial state x(t₀) = x₀ ∈ N, the inequality

V(x1)−V(x0)≤∫t1t012(γ2∥w(t)∥2−∥z(t)∥2)dt $V (x_{1}) - V (x_{0}) \leq \int_{t_{0}}^{t_{1}} \frac{1}{2} (γ^{2} {‖ w (t) ‖}^{2} - {‖ z (t) ‖}^{2}) d t$

(5.23)

is satisfied for all w ∈ L2 $ℒ_{2}$ [t₀, ∞), where x₁ = x(t₁, t₀, x₀, u).

Remark 5.1.6 Rewriting the above dissipation-inequality (5.23) as (since V ≥ 0)

12∫t1t0∥z(t)∥2dt≤12∫t1t0γ2∥w(t)∥2dt+V(x0) $\frac{1}{2} \int_{t_{0}}^{t_{1}} {‖ z (t) ‖}^{2} d t \leq \frac{1}{2} \int_{t_{0}}^{t_{1}} γ^{2} {‖ w (t) ‖}^{2} d t + V (x_{0})$

and allowing t₁ → ∞, it immediately follows that dissipativity of the system with respect to the supply-rate s(w, z) implies finite L2 $ℒ_{2}$ -gain ≤ γ for the system.

We can now state the following proposition.

Proposition 5.1.2 Consider the nonlinear system (5.3) and the the SFBNLHICP using static state-feedback control. Suppose for some γ > 0, there exists a smooth solution V ≥ 0 to the HJIE (5.12) or the HJI-inequality:

Vx(x)f(x)+12Vx(x)[1γ2g1(x)gT1(x)−g2(x)gT2(x)]VTx(x)+12hT1(x)h1(x)≤0, V(0)=0, $V_{x} (x) f (x) + \frac{1}{2} V_{x} (x) [\frac{1}{γ^{2}} g_{1} (x) g_{1}^{T} (x) - g_{2} (x) g_{2}^{T} (x)] V_{x}^{T} (x) + \frac{1}{2} h_{1}^{T} (x) h_{1} (x) \leq 0, V (0) = 0,$

(5.24)

in N ⊂X $X$ . Then, the control function (5.18) solves the problem for the system in N.

Proof: The equivalence of the solvability of the HJIE (5.12) and the inequality (5.24) has been shown in Chapter 3. For the local disturbance-attenuation property, rewrite the HJ-inequality as

V˙(x) =Vx(x)[f(x)+g1(x)w−g2(x)gT2(x)VTx(x)], x∈N ≤ −12∥h∥2−γ22∥∥w−1γ2gT1(x)VTx(x)∥∥2+12γ2∥w∥2−12∥u⋆∥2. $\begin{array}{l} \dot{V} (x) = V_{x} (x) [f (x) + g_{1} (x) w - g_{2} (x) g_{2}^{T} (x) V_{x}^{T} (x)], x \in N \\ \leq - \frac{1}{2} {‖ h ‖}^{2} - \frac{γ^{2}}{2} {‖ w - \frac{1}{γ^{2}} g_{1}^{T} (x) V_{x}^{T} (x) ‖}^{2} + \frac{1}{2} γ^{2} {‖ w ‖}^{2} - \frac{1}{2} {‖ u^{⋆} ‖}^{2} . \end{array}$

(5.25)

Integrating now the above inequality from t = t₀ to t = t₁ > t₀, and starting from x(t₀), we get

V(x(t1))−V(x(t0))≤∫t1t012(γ2∥w∥2−∥z⋆∥2)dt, x(t0),x(t1)∈N, $V (x (t_{1})) - V (x (t_{0})) \leq \int_{t_{0}}^{t_{1}} \frac{1}{2} (γ^{2} {‖ w ‖}^{2} - {‖ z^{⋆} ‖}^{2}) d t, x (t_{0}), x (t_{1}) \in N,$

where z⋆=[h1(x) u⋆] $z^{⋆} = [\begin{array}{l} h_{1} (x) \\ u^{⋆} \end{array}]$ . Hence, the system is locally dissipative with respect to the supply-rate s(w, z), and consequently by Remark 5.1.6 has the local disturbance-attenuation property. □

Remark 5.1.7 Note that the inequality (5.25) is obtained whether the HJIE is used or the HJI-inequality is used.

To prove asymptotic-stability for the closed-loop system, part (ii) of the problem, we have the following theorem.

Theorem 5.1.4 Consider the nonlinear system (5.3) and the SFBNLHICP. Suppose the system is smoothly-stabilizable, zero-state detectable, and the assumptions of Proposition 5.1.1 hold for the system. Then the control law (5.18) renders the closed-loop system (5.3) locally asymptotically-stable in N with w = 0 and therefore solves the SFBNLHICP for the system locally in N. If in addition the solution V ≥ 0 of the HJIE (or inequality) is proper, then the system is globally asymptotically-stable with w = 0, and the problem is solved globally.

Proof: Substituting w = 0 in the inequality (5.25), it implies that V˙(t) ≤ 0 $\dot{V} (t) \leq 0$ and the system is stable. Further, if the system is zero-state detectable, then for any trajectory of the system such that V˙(x(t)) ≡ 0 $\dot{V} (x (t)) \equiv 0$ , for all t ≥ t_s for some t_s ≥ t₀, it implies that z(t) ≡ 0, u^* (t) ≡ 0, for all t ≥ t_s, which in turn implies that lim_t→∞ x(t) = 0. The result now follows by application of Lasalle’s invariance-principle. For the global asymptotic-stability of the system, we note that if V is proper, then V is a global solution of the HJI-inequality (5.24), and the result follows by applying the same arguments as above. □

We consider another example.

Example 5.1.2 Consider the nonlinear system defined on the half-space N12={x=x1 > 12x2} $N_{\frac{1}{2}} = {x = x_{1} > \frac{1}{2} x_{2}}$

x˙1= −14x21−x222x1−x2+wx˙2 = x2+w+uz =[x1 x2 u]T. $\begin{array}{l} {\dot{x}}_{1} = \frac{- \frac{1}{4} x_{1}^{2} - x_{2}^{2}}{2 x_{1} - x_{2}} + w \\ {\dot{x}}_{2} = x_{2} + w + u \\ z = {[x_{1} x_{2} u]}^{T} . \end{array}$

The HJI-inequality (5.24) corresponding to this system for γ = 2–√ $γ = \sqrt{2}$ is given by

( −14x21−x222x1−x2)Vx1+(x2)Vx2+14V2x1+12Vx1Vx2−14V2x2+12(x21+x21)≤0. $(\frac{- \frac{1}{4} x_{1}^{2} - x_{2}^{2}}{2 x_{1} - x_{2}}) V_{x_{1}} + (x_{2}) V_{x_{2}} + \frac{1}{4} V_{x_{1}}^{2} + \frac{1}{2} V_{x_{1}} V_{x_{2}} - \frac{1}{4} V_{x_{2}}^{2} + \frac{1}{2} (x_{1}^{2} + x_{1}^{2}) \leq 0.$

Then, it can be checked that the positive-definite function

V(x)=12x21+12(x1−x2)2 $V (x) = \frac{1}{2} x_{1}^{2} + \frac{1}{2} {(x_{1} - x_{2})}^{2}$

globally solves the above HJI-inequality in N with γ = 2–√ $γ = \sqrt{2}$ . Moreover, since the system is zero-state detectable, then the control law

u=x1−x2 $u = x_{1} - x_{2}$

asymptotically stabilizes the system over N12 $N_{\frac{1}{2}}$ .

Next, we investigate the relationship between the solvability of the SFBNLHICP for the nonlinear system Σ^a and its linearization about x = 0:

Σ¯¯¯l : ⎧⎩⎨⎪⎪x¯˙ = F x¯+G1w¯¯¯+G2u¯; x¯(0)=x¯0z¯ = [H1x¯ u¯] ${\bar{Σ}}^{l} : {\begin{cases} \dot{\bar{x}} = F \bar{x} + G_{1} \bar{w} + G_{2} \bar{u}; \bar{x} (0) = {\bar{x}}_{0} \\ \bar{z} = [\begin{array}{l} H_{1} \bar{x} \\ \bar{u} \end{array}] \end{cases}$

(5.26)

where $F = \frac{\partial f}{\begin{array}{l} \partial x \end{array}} (0) ϵ ℜ^{n}^{\times n}, G_{1} = g_{1} (0) ϵ ℜ^{n}^{\times r}, G_{2} = g_{2} (0) ϵ ℜ^{n}^{\times p}, H_{1} = \frac{\partial f}{\begin{array}{l} \partial x \end{array}} (0), a n d ū ϵ ℜ^{p}, \bar{x} ϵ ℜ^{n}, \bar{w} ϵ ℜ^{r}$ . A number of interesting results relating the $ℒ_{2}$ -gain of the linearized system Σ^l and that of the system ${\bar{Σ}}^{a}$ can be concluded [264]. We summarize here one of these results.

Theorem 5.1.5 Consider the linearized system ${\bar{Σ}}^{l}$ , and assume the pair (H₁, F ) is detectable [292]. Suppose there exists a state-feedback $\bar{u} = K \bar{x}$ for some p × n matrix K, such that the closed-loop system is asymptotically-stable and has $ℒ_{2}$ -gain from $\bar{w} t o \bar{z}$ less than γ > 0. Then, there exists a neighborhood $O$ of x = 0 and a smooth positivesemidefinite function $V : O \to ℜ$ that solves the HJIE (5.12). Furthermore, the control law $u^{⋆} = - g_{2} (x) V_{x}^{T} (x)$ renders the $ℒ_{2}$ -gain of the closed-loop system (5.3) less than or equal to $γ i n O$ .

We defer a full study of the solvability and algorithms for solving the HJIE (5.12) which are crucial to the solvability of the SFBNLHICP, to a later chapter. However, it is sufficient to observe that, based on the results of Theorems 5.1.3 and 5.1.5, it follows that the existence of a stabilizing solution to the ARE (5.22) guarantees the local existence of a positive-semidefinite solution to the HJIE (5.12). Thus, any necessary condition for the existence of a symmetric solution P ≥ 0 to the ARE (5.22) becomes also necessary for the local existence of solutions to (5.11). In particular, the stabilizability of (F, G₂) is necessary, and together with the detectability of (H₁, F ) are sufficient. Further, it is well known from linear systems theory and the theory of Riccati equations [292, 68] that the existence of a stabilizing solution P = P ^T to the ARE (5.22) implies that the two subspaces

$X_{-} ({\bar{H}}_{γ}^{⋆}) and I m [\begin{array}{l} 0 \\ I \end{array}]$

are complementary and the Hamiltonian matrix

${\bar{H}}_{γ}^{⋆} = [\begin{matrix} F & (\frac{1}{γ^{2}} G_{1} G_{1}^{T} - G_{2} G_{2}^{T}) \\ - H^{T} H & - F^{T} \end{matrix}]$

does not have imaginary eigenvalues, where $X_{-} ({\bar{H}}_{γ}^{⋆})$ is the stable eigenspace of ${\bar{H}}_{γ}^{⋆}$ . Translated to the nonlinear case, this requires that the stable invariant-manifold M⁻ of the Hamiltonian vector-field $X_{H_{γ}^{⋆}}$ through $(x, V_{x}^{T} (x)) = (0, 0)$ (which is of the form (5.20)) to be n-dimensional and tangent to $X_{-} ({\bar{H}}_{γ}^{⋆}) : = s p a n [\begin{array}{l} I \\ p \end{array}] a t (x, V_{x}^{T}) = (0, 0)$ , and the matrix ${\bar{H}}_{γ}^{⋆}$ corresponding to the linearization of $H_{γ}^{⋆}$ does not have purely imaginary eigenvalues. The latter condition is referred to as being hyperbolic and this situation will be regarded as the noncritical case. Thus, the detectability of (H₁, F ) excludes the condition that ${\bar{H}}_{γ}^{⋆}$ has imaginary eigenvalues, but this is not necessary. Indeed, the HJIE (5.12) can also have smooth solutions in the critical case in which the Hamiltonian matrix ${\bar{H}}_{γ}^{⋆}$ is nonhyperbolic. In this case, the manifold M is not entirely the stable-manifold, but will contain a nontrivial center-stable manifold.

Proof: (of Theorem 5.1.5): By Theorem 5.1.3 there exists a solution P ≥ 0 to (5.22). It follows that the stable invariant manifold M⁻ is tangent to $X_{-} ({\bar{H}}_{γ}^{⋆})$ at (x, p) = (0, 0). Hence, locally about x = 0, there exists a smooth solution V⁻ to the HJIE (5.12) satisfying $\frac{\partial^{2} V^{-}}{\partial x^{2}} (0) = P$ . In addition, since $F - G_{2} G^{T}_{2} P$ is asymptotically-stable, the vector-field $f - g_{2} g^{T}_{2} \frac{\partial V^{-}}{\partial x}$ is asymptotically-stable. Rewriting the HJIE (5.12) as

$\begin{array}{l} V_{x}^{-} (x) (f (x) - g_{2} (x) g_{2}^{T} (x) V_{x}^{- T} (x)) + \frac{1}{2} V_{x}^{-} (x) [\frac{1}{γ^{2}} g_{1} (x) g_{1}^{T} (x) + g_{2} (x) g_{2}^{T} (x)] V_{x}^{- T} (x) + \\ \frac{1}{2} h_{1}^{T} (x) h_{1} (x) = 0, \end{array}$

it implies by the Bounded-real lemma (Chapter 3) that locally about x = 0, V ⁻ ≥ 0 and the closed-loop system has $ℒ_{2}$ -gain ≤ γ for all w ∈ $W$ such that x(t) remains in $O$ . □

In the next section, we discuss controller parametrization.

FIGURE 5.2
Controller Parametrization for FI-State-Feedback Nonlinear $H_{\infty}$ -Control

5.1.2 Controller Parametrization

In this subsection, we discuss the state-feedback $H_{\infty}$ controller parametrization problem which deals with the problem of specifying a set (or all the sets) of possible state-feedback controllers that solves the SFBNLHICP for the system (5.1) locally.

The basis for the controller parametrization we discuss is the Youla (or Q)-parametrization for all stabilizing controllers for the linear problem [92, 195, 292] which has been extended to the nonlinear case [188, 215, 214]. Although the original Youla-parametrization uses coprime factorization, the modified version presented in [92] does not use coprime-factorization. The structure of the prametrization is shown in Figure 5.2. Its advantage is that it is given in terms of a free parameter which belongs to a linear space, and the closed-loop map is affine in this free parameter. Thus, this gives an additional degree-of-freedom to further optimize the closed-loop maps in order to achieve other design objectives.

Now, assuming Σ^a is smoothly-stabilizable and the disturbance signal w ∈ $ℒ_{2}$ [0, ∞) is fully measurable, also referred to as the full-information (FI) structure, then the following proposition gives a parametrization of a family of full-information controllers that solves the SFBNLHICP for Σ^a.

Proposition 5.1.3 Assume the nonlinear system Σ^a is smoothly stabilizable and zero-state detectable. Suppose further, the disturbance signal is measurable and there exists a smooth (local) solution V ≥ 0 to the HJIE (5.12) or inequality (5.24) such that the SFBNLHICP is (locally) solvable. Let $ℱ G$ denote the set of finite-gain (in the $ℒ_{2}$ sense) asymptotically-stable (with zero input and disturbances) input-affine nonlinear plants, i.e.,

$ℱ G ≜ {Σ^{a} | Σ^{a} (u = 0, w = 0) i s a s y m p t o t i c a l l y - s t a b l e a n d h a s ℒ_{2} - g a i n \leq γ} .$

Then, the set

$K_{F I} = {u | u = u^{⋆} + Q (w - w^{⋆}), Q \in ℱ G, Q : i n p u t s \mapsto o u t p u t s}$

(5.27)

is a paremetrization of all FI-state-feedback controllers that solves (locally) the SFBNLHICP for the system Σ^a.

Proof: Apply u ∈ K_FI to the system Σ^a resulting in the closed-loop system:

$\sum_{u^{⋆}}^{a} (Q) : {\begin{cases} \dot{x} = f (x) + g_{1} (x) w + g_{2} (x) (u^{⋆} + Q (w - w^{⋆})); x (0) = x_{0} \\ z = [\begin{array}{l} h_{1} (x) \\ u \end{array}] . \end{cases}$

(5.28)

If Q = 0, then the result follows from Theorem 5.1.2 or 5.1.4. So assume Q ≠ 0, and since $Q \in ℱ G, r ≜ Q (w - w^{⋆}) \in ℒ_{2} [0, \infty)$ . Let V ≥ 0 be a (local) solution of (5.12) or (5.24) in N for some γ > 0. Then, differentiating this along a trajectory of the closed-loop system, completing the squares and using (5.12) or (5.24), we have

$\begin{array}{l} \frac{d}{d t} V = V_{x} [f + g_{1} w - g_{2} g_{2}^{T} V_{x}^{T} + g_{2} r] \\ = V_{x} f + \frac{1}{2} V_{x} [\frac{1}{γ^{2}} g_{1} g_{1}^{T} - g_{2} g_{2}^{T}] V_{x}^{T} + \frac{1}{2} {‖ h_{1} ‖}^{2} - \frac{1}{2} {‖ h_{1} ‖}^{2} - \\ \frac{γ^{2}}{2} {‖ w - \frac{1}{γ^{2}} g_{1}^{T} V_{x}^{T} ‖}^{2} + \frac{1}{2} γ^{2} {‖ w ‖}^{2} - \frac{1}{2} {‖ r - g_{2}^{T} V_{x}^{T} (x) ‖}^{2} + \frac{1}{2} {‖ r ‖}^{2} \\ \leq \frac{1}{2} γ^{2} {‖ w ‖}^{2} - \frac{1}{2} {‖ h_{1} ‖}^{2} - \frac{1}{2} {‖ u ‖}^{2} + \frac{1}{2} {‖ r ‖}^{2} - \frac{γ^{2}}{2} {‖ w - \frac{1}{γ^{2}} g_{1}^{T} V_{x}^{T} ‖}^{2} . \end{array}$

(5.29)

Now, integrating the above inequality (5.29) from t = t₀ to t = t₁ > t₀, starting from x(t₀) and using the fact that

$\int_{t_{0}}^{t_{1}} {‖ r ‖}^{2} d t \leq γ^{2} {\int_{t_{0}}^{t_{1}} ‖ w - w^{⋆} ‖}^{2} d t \forall t_{1} \geq t_{0}, \forall w \in W,$

we get

$V (x (t_{1})) - V (x (t_{0})) \leq \int_{t_{0}}^{t_{1}} \frac{1}{2} (γ^{2} {‖ w ‖}^{2} - {‖ z ‖}^{2}) d t, \forall x (t_{0}), x (t_{1}) \in N .$

(5.30)

This implies that the closed-loop system (5.28) has $ℒ_{2}$ -gain ≤ γ from w to z. Finally, the part dealing with local asymptotic-stability can be proven as in Theorems 5.1.2, 5.1.4. □

Remark 5.1.8 Notice that the set $ℱ G$ can also be defined as the set of all smooth inputaffine plants Q : r ↦ v with the realization

$Σ_{Q} : \leq {\begin{cases} \dot{ξ} = a (ξ) + b (ξ) r \\ υ = c (ξ) \end{cases}$

(5.31)

where $ξ \in X, a : X \to V^{\infty} (X), b : X \to ℳ^{n \times p}, c : X \to ℜ^{m}$ are smooth functions, with a(0) = 0, c(0) = 0, and such that there exists a positive-definite function $φ : X \to ℜ_{+}$ satisfying the bounded-real condition:

$φ_{ξ} (ξ) a (ξ) + \frac{1}{2 γ^{2}} φ_{ξ} (ξ) b (ξ) b^{T} (ξ) φ_{ξ}^{T} (ξ) + \frac{1}{2} c^{T} (ξ) c (ξ) = 0.$

5.2 State-Feedback Nonlinear $H_{\infty}$ Tracking Control

In this section, we consider the traditional state-feedback tracking, model-following or servomechanism problem. This involves the tracking of a given reference signal which may be any one of the classes of reference signals usually encountered in control systems, such as steps, ramps, parabolic or sinusoidal signals. The objective is to keep the error between the system output y and the reference signal arbitrarily small. Thus, the problem can be treated in the general framework discussed in the previous section with the penalty variable z representing the tracking error. However, a more elaborate design scheme may be necessary in order to keep the error as desired above.

The system is represented by the model (5.1) with the penalty variable

$z = [\begin{array}{l} h_{1} (x) \\ u \end{array}],$

(5.32)

while the signal to be tracked is generated as the output y_m of a reference model defined by

$Σ_{m} : {\begin{cases} {\dot{x}}_{m} = f_{m} (x_{m}), x_{m} (t_{0}) = x_{m 0} \\ y_{m} = h_{m} (x_{m}) \end{cases}$

(5.33)

x_m ∈ ℜ ^l, f_m : ℜ^l → V ^∞(ℜ^l), h_m : ℜ^l → ℜ ^m and we assume that this system is completely observable [212]. The problem can then be defined as follows.

Definition 5.2.1 (State-Feedback Nonlinear $H_{\infty}$ (Suboptimal) Tracking Control Problem (SFBNLHITCP)). Find if possible, a static state-feedback control function of the form

$u = α_{t r k} (x, x_{m}), α_{t r k} : N_{O} \times N_{m} \to ℜ^{p}$

(5.34)

N_o ⊂ $X$ , N_m ⊂ ℜ^l, for some smooth function α_trk, such that the closed-loop system (5.1), (5.34), (5.33) has, for all initial conditions starting in N_o × N_m neighborhood of (0, 0), locally $ℒ_{2}$ -gain from the disturbance signal w to the output z less than or equal to some prescribed number γ^$⋆$ > 0 and the tracking error satisfies lim_t→∞{y − y_m} = 0.

To solve the above problem, we follow a two-step procedure:

Step 1: Find a feedforward-control law $u_{⋆} = u_{⋆} (x, x_{m})$ so that the equilibrium point x = 0 of the closed-loop system

$\dot{x} = f (x) + g_{2} (x) u_{⋆} (x, 0)$

(5.35)

is exponentially stable, and there exists a neighborhood U = N_o × N_m of (0, 0) such that for all initial conditions (x₀, x_m0) ∈ U the trajectories (x(t), x_m(t)) of

${\begin{cases} \dot{x} = f (x) + g 2 (x) u_{⋆} (x, x_{m}) \\ {\dot{x}}_{m} = f_{m} (x_{m}) \end{cases}$

(5.36)

satisfy

$\lim_{t \to \infty} {h_{1} (θ (x_{m} (t))) - h_{m} (x_{m} (t))} = 0.$

To solve this step, we seek for an invariant-manifold

$M_{θ} = {x | x = θ (x_{m})}$

and a control law $u_{⋆} = α_{f} (x, x_{m})$ such that the submanifold M_θ is invariant under the closed-loop dynamics (5.36) and h₁(θ(x_m(t))) − h_m(x_m(t)) ≡ 0. Fortunately, there is a wealth of literature on how to solve this problem [143]. Under some suitable assumptions, the following equations give necessary and sufficient conditions for the solvability of this problem:

$\frac{\partial θ}{\partial x_{m}} (x_{m}) (f_{m} (x_{m}) = f (θ (x_{m})) + g_{2} (θ (x_{m}) {\bar{u}}_{⋆} (x_{m})$

(5.37)

$h_{1} (θ (x_{m} (t)) - h_{m} (x_{m} (t)) = 0,$

(5.38)

where ${\bar{u}}_{⋆} (x_{m}) = α_{f} (θ (x_{m}), x_{m})$ .

The next step is to design an auxiliary feedback control v so as to drive the system onto the above submanifold and to achieve disturbance-attenuation as well as asymptotic tracking. To formulate this step, we consider the combined system

${\begin{cases} \dot{x} = f (x) + g 1 (x) w + g 2 (x) u \\ {\dot{x}}_{m} = f_{m} (x_{m}), \end{cases}$

(5.39)

and introduce the following change of variables

$ξ = x - θ (x_{m})$

$υ = u - {\bar{u}}_{⋆} (x_{m}) .$

Then

$\dot{ξ} = F (ξ, x_{m}) + G_{1} (ξ, x_{m}) w + G_{2} (ξ, x_{m}) υ$

${\dot{x}}_{m} = f_{m} (x_{m})$

where

$\begin{array}{l} F (ξ, x_{m}) = f (ξ + θ (x_{m})) - \frac{\partial θ}{\partial x_{m}} (x_{m}) f_{m} (x_{m}) + g_{2} (ξ + θ (x_{m})) {\bar{u}}_{⋆} (x_{m}) \\ G_{1} (ξ, x_{m}) = g_{1} (ξ + θ (x_{m})) \\ G_{2} (ξ, x_{m}) = g_{2} (ξ + θ (x_{m})) . \end{array}$

Similarly, we redefine the tracking error and the new penalty variable as

$\tilde{z} = [\begin{array}{l} h_{1} (ξ + θ (x_{m})) - h_{m} (x_{m}) \\ υ \end{array}] .$

Step 2: Find an auxiliary feedback control $υ_{⋆} = υ_{⋆} (ξ, x_{m})$ so that along any trajectory (ξ(t), x_m(t)) of the closed-loop system (5.40), the $ℒ_{2}$ -gain condition

$\begin{array}{l} \int_{t_{0}}^{T} ‖ \tilde{z} (t) ‖^{2} d t \leq γ^{2} \int_{t_{0}}^{T} ‖ w (t) ‖^{2} d t + κ (ξ (t_{0}), x_{m 0}) \\ \Leftrightarrow \int_{t_{0}}^{T} {‖ h_{1} (ξ + θ (x_{m})) - h_{m} (x_{m}) ‖^{2} + ‖ υ ‖^{2}} d t \leq γ^{2} \int_{t_{0}}^{T} ‖ w (t) ‖^{2} d t + κ (ξ (t_{0}), x_{m 0}) \end{array}$

is satisfied for some function κ, for all w ∈ $W$ , for all T < ∞ and all initial conditions (ξ(t₀), x_m0) in a neighborhood ${\bar{N}}_{o}$ × N_m of the origin (0, 0). Moreover, if ξ(t₀) = 0 and w(t) ≡ 0, then we may set $υ_{⋆} (t) \equiv 0$ to achieve perfect tracking.

Clearly, the above problem is now a standard state-feedback $H_{\infty}$ -control problem, and the techniques discussed in the previous sections can be employed to solve it. The following theorem then summarizes the solution to the SFBNLHITCP.

Theorem 5.2.1 Consider the nonlinear system (5.1) and the SFBNLHITCP for this system. Suppose the control law $u_{⋆} = u_{⋆} (x, x_{m})$ and invariant-manifold M_θ can be found that solve Step 1 of the solution to the tracking problem. Suppose in addition, there exists a smooth solution $Ψ : {\bar{N}}_{o} \times N_{m} \to ℜ, Ψ (ξ, x_{m}) \geq 0$ to the HJI-inequality

$\begin{array}{l} Ψ_{ξ} (ξ, x_{m}) F (ξ, x_{m}) + Ψ_{x_{m}} (ξ, x_{m}) f_{m} (x_{m}) + \\ \frac{1}{2} Ψ_{ξ} (ξ, x_{m}) [\frac{1}{γ^{2}} G_{1} (ξ, x_{m}) G_{1}^{T} (ξ, x_{m}) - G_{2} (ξ, x_{m}) G_{2}^{T} (ξ, x_{m})] Ψ_{ξ}^{T} (ξ, x_{m}) + \\ \frac{1}{2} {‖ h_{1} (ξ + x_{m}) - h_{m} (x_{m}) ‖}^{2} \leq 0, x \in {\bar{N}}_{o}, ξ \in N_{m}, Ψ (0, 0) = 0. \end{array}$

(5.40)

Then the SFBNLHITCP is locally solvable with the control laws u = ${\bar{u}}_{⋆}$ and

$υ_{⋆} = - G_{2}^{T} (ξ, x_{m}) Ψ_{ξ}^{T} (ξ, x_{m}) .$

Moreover, if Ψ is proper with respect to ξ (i.e., if Ψ(ξ, x_m) → ∞ when $‖ ξ ‖$ → ∞) and the system is zero-state detectable, then lim_{t →∞} ξ(t) = 0 also for all initial conditions $(ξ (t_{0}), x_{m 0}) \in {\bar{N}}_{0} \times N_{m}$ .

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.

Table of Contents for 5 State-Feedback Nonlinear H∞-Control for Continuous-Time Systems

Create new playlist

Sign In

Sign Up

Table of Contents for
5 State-Feedback Nonlinear H∞-Control for Continuous-Time Systems