Output-Feedback Nonlinear H∞-Control for Continuous-Time Systems
In this chapter, we discuss the nonlinear H∞ sub-optimal control problem for continuous-time nonlinear systems using output-feedback. This problem arises when the states of the system are not available for feedback, and so have to be estimated in some way and then used for feedback, or the output of the system itself is used for feedback. The estimator is basically an observer that satisfies an H∞ requirement. In the former case, the observer uses the measured output of the system to estimate the states, and the whole arrangement is referred to as an observer-based controller or more generally a dynamic controller. The set-up is depicted in Figure 6.1 in this case. For the most part, this chapter will be devoted to this problem. On the other hand, the latter problem is referred to as a static output-feedback controller and will be discussed in the last section of the chapter.
We derive sufficient conditions for the solvability of the above problem for time-invariant (or autonomous) affine nonlinear systems, as well as for a general class of systems. The parametrization of stabilizing controllers is also discussed. The problem of robust control in the presence of modelling errors and/or parameter variations is also considered, as well as the reliable control of sytems in the event of sensor and/or actuator failure.
6.1 Output Measurement-Feedback H∞-Control for Affine Nonlinear Systems
We begin in this section with the output-measurement feedback problem in which it is desired to synthesize a dynamic observer-based controller using the output measurements. To this effect, we consider first an affine causal state-space system defined on a smooth n-dimensional manifold X⊆ℜnin local coordinates x = (x1,…, xn):
Σa :{˙x = f(x)+g1(x)w+g2(x)u; x(t0)=x0z = h1(x)+k11(x)w+k12(x)uy = h2(x)+k21(x)w |
(6.1) |
where x∈X is the state vector, u∈U⊆ℜp is the p-dimensional control input, which belongs to the set of admissible controls U, w ∈W is the disturbance signal, which belongs to the set W⊂L2([t0,∞)ℜr) of admissible disturbances, the output y∈Y⊂ℜm is the measured output of the system, and z∈ℜs is the output to be controlled. The functions f:X→V∞(X), g1: X→Mn×r(X), g2: X→Mn×p(X), h1:X→ℜs, h2:X→ℜm, and k11: X→Ms×r, k12: X→Ms×p(X), k21:X→Mm×r(X) are real C∞ functions of x. Furthermore, we assume without any loss of generality that the system (6.1) has a unique equilibrium-point at x = 0 such that f(0) = 0, h1(0) = h2(0) = 0, and for simplicity we have the following assumptions which are the nonlinear versions of the standing assumptions in [92]:
Assumption 6.1.1 The system matrices are such that
k11(x)=0hT1(x)k12(x)=0kT12(x)k12(x)=Ik21(x)gT1(x)=0k21(x)kT21(x)=I.}. |
(6.2) |
Remark 6.1.1 The first of these assumptions corresponds to the case in which there is no direct feed-through between w and z; while the second and third ones mean that there are no cross-product terms and the control weighting matrix is identity in the norm function for z respectively. Lastly, the fourth and fifth ones are dual to the second and third ones.
We begin with the following definition.
Definition 6.1.1 (Measurement-Feedback (Sub-Optimal) H∞-Control Problem (MFBNLHICP)). Find (if possible!) a dynamic feedback-controller of the form:
(6.3) |
where ξ∈Ξ⊂X a neighborhood of the origin, and η:Ξ × ℜm→V∞(Ξ), η(0,0)=0, θ:Ξ × ℜm→ℜp, θ(0,0)=0, are some smooth functions, which processes the measured variable y of the plant (6.1) and generates the appropriate control action u, such that the closed-loop system (6.1), (6.3) has locally L2-gain from the disturbance signal w to the output z less than or equal to some prescribed number γ⋆ > 0 with internal stability, or equivalently, the closed-loop system achieves local disturbance-attenuation less than or equal to γ⋆ with internal-stability.
Since the state information is not available for measurement in contrast with the previous chapter, the simplest thing to do is to estimate the states with an observer, and then use this estimate for feedback. From previous experience on the classical theory of state observation, the “estimator” usually comprises of an exact copy of the dynamics of x, corrected by a term proportional to the error between the actual output y of the plant (6.1) and an estimated output ŷ. Such a system can be described by the equations:
(6.4) |
where G(.) is the output-injection gain matrix which is to be determined. A state-feedback control law can then be imposed on the system based on this estimated state (and the results from Chapter 5) as:
(6.5) |
where V solves the HJIE (5.12). The combination of (6.4) and (6.5) will then represent our postulated dynamic-controller (6.3). With the above feedback, the estimator (6.4) becomes
(6.6) |
Besides the fact that we still have to determine an appropriate value for G(.), we also require an explicit knowledge of the disturbance w. Since the knowledge of w is generally not available, it is reasonable to replace its actual value by the “worst-case” value, which is given (from Chapter 5) by
w⋆: =α1(x)=1γ2gT1(x)VTx(x)
for some γ > 0. Substituting this expression in (6.6) and replacing x by ξ, results in the following certainty-equivalence worst-case estimator
(6.7) |
Next, we return to the problem of selecting an appropriate gain matrix G(.), which is another of our design parameters, to achieve the objectives stated in Definition 6.1.1. It will be shown in the following that, if some appropriate sufficient conditions are fulfilled, then the matrix G(.) can be chosen to achieve this.
We begin with the closed-loop system describing the dynamics of the plant (6.1) and the dynamic-feedback controller (6.7), (6.5) which has the form
∑acclp:{˙x=f(x)+g1(x)w+g2(x)α2(ξ); x(t0)=x0˙ξ = ˜f(ξ)+g2(ξ)+G(ξ)(h2(x)+k21(x)w−˜h2(ξ))z = h1(x)+k12(x)α2(ξ) |
(6.8) |
where
˜f(ξ) = f(ξ)+g1(ξ)α1(ξ)˜h2(ξ)=h2(ξ)+k21(ξ)α1(ξ).
If we can choose G(.) such that the above closed-loop system (6.8) is dissipative with a C1 storage-function, with respect to the supply-rate s(w,z)=12(γ2‖w‖2−‖z‖2) and is locally asymptotically-stable about (x, ξ) = (0, 0), then we would have solved the MFBNLHICP for (6.1).
To proceed, let us further combine the x and ξ dynamics in the dynamics xe as follows.
(6.9) |
where
xe = (xξ), fe(xe) = (˜f(x)+g2(x)α2(ξ)˜f(x)+g2(x)α2(ξ)+G(ξ)(h2(x)−˜h2(ξ))) , ge(xe) = (g1(x)G(ξ)κ21(x)) ,
and let
he(xe)=α2(x)−α2(ξ).
Apply now the change of input variable r = w − α1(x) in (6.9) and set the output to v obtaining the following equivalent system:
(6.10) |
The problem would then be solved if we can choose G(.) such that the above system (6.10) is dissipative with respect to the supply-rate s(r,v)=12(γ2‖r‖2−‖v‖2) and internally-stable. A necessary and sufficient condition for this to happen is if the following bounded-real condition [139] is satisfied:
Wxe(xe)fe(xe)+12γ2Wxe(xe)ge(xe)geT(xe)WTxe(xe)+12heT(xe)he(xe)≤0 |
(6.11) |
for some C1 nonnegative function W:N×N→ℜ
locally defined in a neighborhood N⊂X of xe = 0, and which vanishes at xe = 0. The above inequality further implies that
(6.12) |
which means, W is a storage-function for the equivalent system with respect to the supply-rate s(r,v)=12(γ2‖r‖2−‖v‖2). Now assuming that W exists, then it can be seen that the closed-loop system (6.8) is locally dissipative, with the storage-function
Ψ(xe) = V(x)+W(xe),
with respect to the supply-rate s(w,z)=12(γ2‖w‖2−‖z‖2),, where V (.) solves the HJIE(5.12). Indeed,
dΨ(xe(t))dt+12(‖z‖2−γ2‖w‖2)=Ψxe(fe(xe)+ge(xe)(w−α1(x)))+12(‖z‖2−γ2‖w‖2) =Vx(x)(f(x)+g1(x)w+g2(x)α2(ξ))+ Wxe(fe(xe)+ge(xe)r)+12‖h1(x)+k12(x)α2(ξ)‖2 −12γ2‖w‖2 ≤12{‖α2(ξ)−α2(x)‖2−γ2‖w−α1(x)‖2+γ2‖r‖2−‖v‖2} ≤ 0,
which proves dissipativity. The next step is to show that the closed-loop system (6.9) has a locally asymptotically-stable equilibrium-point at (x, ξ) = (0, 0). For this, we need additional assumptions on the system (6.1).
Assumption 6.1.2 Any bounded trajectory x(t) of the system
˙x(t)=f(x(t))+g2(x(t))u(t)
satisfying
h1(x(t))+k12(x(t))u(t))=0
for all t≥ts, is such that limt→∞x(t)=0.
Assumption 6.1.3 The equilibrium-point ξ = 0 of the system
(6.13) |
is locally asymptotically-stable.
Then from above, with w = 0,
dΨ(xe(t))dt=Ψxe(fe(xe)+ge(xe)(−α1(x))) ≤ −12‖h1(x(t))+k12(x(t))α2(ξ(t))‖2 |
(6.14) |
along any trajectory (x(.), ξ(.)) of the closed-loop system. This proves that the equilibrium-point (x(.), ξ(.)) = (0, 0) is stable. To prove asymptotic-stability, observe that any trajectory xe(t) such that
dΨ(xe(t))dt≡0, ∀t≥ts,
is necessarily a trajectory of
˙x=f(x)+g2(x)α2(ξ)
such that x(t) is bounded and
h1(x(t))+k12(x(t))α2(ξ(t))≡0
for all t≥ts. By Assumption 6.1.2, this implies limt→∞ x(t) = 0. Moreover, since k12(x) has full column rank, the above also implies that limt→∞ α2(ξ(t)) = 0. Thus, the ω-limit set of such a trajectory is a subset of
Ω0={(x,ξ)|x=0, α2(ξ)=0}.
Any initial condition on this limit-set yields a trajectory
x(t)≡0
for all t≥ts, which corresponds to the trajectory ξ(t) of
˙ξ=˜f(ξ)−G(ξ)˜h2(ξ).
By Assumption 6.1.3, this implies limt→∞ ξ(t) = 0, and local asymptotic-stability follows from the invariance-principle.
The above result can now be summarized in the following lemma.
Theorem 6.1.1 Consider the nonlinear system (6.1) and suppose Assumptions 6.1.1, 6.1.2 hold. Suppose also there exists a C1 positive-(semi)definite function V: N→ ℜ+ locally defined in a neighborhood N of x = 0, vanishing at x = 0 and satisfying the HJIE (5.12). Further, suppose there exists also a C1 real-valued function W: N1×N1→ℜ locally defined in a neighborhood N1 × N1 of xe = 0, vanishing at xe = 0 and is such that W (xe) > 0 for all x ≠ ξ, together with an n × m smooth matrix function G(ξ): N2→Mn×m, locally defined in a neighborhood N2 of ξ = 0, such that
(i) |
the HJIE (6.11) holds; |
(ii) |
Assumption 6.1.3 holds; |
(iii) |
N ∩ N1 ∩ N2 = ∅. |
Then the MFBNLHICP for the system (6.1) is locally solvable.
A modified result to the above Theorem 6.1.1 as given in [141, 147] can also be proven along the same lines as the above. We first have the following definition.
Definition 6.1.2 Suppose f(0) = 0, h(0) = 0, then the pair {f, h} is locally zero-state detectable, if there exists a neighborhood O of x = 0 such that, if x(t) is a trajectory of the free-system ˙x = f(x) with x(t0) ∈ O, then h(x(t)) is defined for all t ≥ 0 and h(x(t)) ≡ 0 for all t ≥ t0, implies limt→∞ x(t) = 0
Theorem 6.1.2 Consider the nonlinear system (6.1) and assume that k21(x)kT21(x)=I in (6.2). Assume also the following:
(i) The pair {f, h1} is locally zero-state detectable.
(ii) There exists a smooth positive-(semi)definite function V locally defined in a neighborhood N of the origin x = 0 with V (0) = 0 and satisfying the HJIE (5.12).
(iii) There exists an n × m smooth matrix function G(ξ) such that the equilibrium-point ξ = 0 of the system
(6.15) |
is locally asymptotically-stable.
(iv) There exists a smooth positive-semidefinite function W (x, ξ) locally defined in a neighborhood N1×N1⊂X×X of the origin such that W (0, ξ) > 0 for each ξ ≠ 0, and satisfying the HJIE :
[Wx(x,ξ) Wξ(x,ξ)]fe(x,ξ)+ 12γ2[Wx(x,ξ)Wξ(x,ξ)][g1(x)gT1(x)00G(ξ)GT(ξ)][WTx(x,ξ)WTξ(x,ξ)]+ 12hTe(x,ξ)he(x,ξ)=0, W(0,0)=0, (x,ξ)∈N1×N1. |
(6.16) |
Then, the MFBNLHICP for the system is solved by the output-feedback controller:
(6.17) |
where fe(x, ξ), he(x, ξ), are given by:
fe(x,ξ)=( f(x)+g1(x)α1(x)+g2(x)α2(ξ)f(ξ)+g1(ξ)α1(ξ)+g2(ξ)+G(ξ)(h2(x)−h2(ξ)))he(x,ξ)=α2(ξ)−α2(x).
Example 6.1.1 We specialize the result of Theorem 6.1.2 to linear systems and compare it with some standard results obtained in other references, e.g., [92] (see also [101, 292]). Consider the linear time-invariant system (LTI):
(6.18) |
for constant matrices F∈ℜn×n, G1∈ℜn×r, G2∈ℜn×p, H1∈ℜs×n, H2∈ℜm×n K12∈ℜs×p and K21∈ℜm×r satisfying Assumption 6.1.1. Then, we have the following result.
Proposition 6.1.1 Consider the linear system (6.18) and suppose the following hold:
(a) the pair (F, G1) is stabilizable;
(b) the pair (F, H1) is detectable;
(c) there exist positive-definite symmetric solutions X, Y of the AREs:
(6.19) |
(6.20) |
(d) ρ(XY)<γ2
Then, the hypotheses (i)-(iv) in Theorem 6.1.2 also hold, with
G = ZHT2V(x) = 12xTXxW(x,ξ) = 12γ2(x−ξ)Z−1(x−ξ),
where
Z =Y (I−1γ2XY)−1
Proof: (i) is identical to (b). To show (ii) holds, we note that, if X is a solution of the ARE (6.19), then the positive-definite function V(x)=12xTX x is a solution of the HJIE (6.16). To show that (iii) holds, observe that, if X is a solution of ARE (6.19) and Y a solution of ARE (6.20), then the matrix
Z =Y (I−1γ2XY)−1
is a solution of the ARE [292]:
(6.21) |
where F1=1γ2GT1X and F2=GT2X. Moreover, Z is symmetric and by hypothesis (d) Z is positive-definite. Hence we can take V (ξ) = ξT Z ξ as a Lyapunov-function for the system
(6.22) |
which is the linear equivalent of (6.15). Using the ARE (6.21), it can be shown that
˙L = −(‖GTξ‖2+‖GT1ξ‖+1γ2‖F2Zξ‖2),
which implies that the equilibrium-point ξ = 0 for the system (6.22) is stable. Furthermore, the condition ˙L(ξ(t))=0 implies G1ξ(t) = 0 and G ξ(t) = 0, which results in
.ξ(t)=FTξ(t).
Finally, since (A, G1) is stabilizable, together with the fact that G1ξ(t) = 0, and the standard invariance-principle, we conclude that limt→∞ ξ(t) = 0 or asymptotic-stability of the system (6.22). Thus, (iii) also holds.
Lastly, it remains to show that (iv) also holds. Choosing
W(x,ξ)=12γ2(x−ξ)TZ−1(x−ξ)
and substituting in the HJIE (6.16), we get four AREs which are all identical to the ARE (6.21), and the result follows. □
Remark 6.1.2 Based on the result of the above proposition, it follows that the linear dynamic-controller
˙ξ = (F+G1F1+G2F2−GH2)ξ+Gyu =F2ξ
with F1=1γ2GT1X, F2=−GT2X and G=ZHT2 solves the linear MFBNLHICP for the system (6.18)
The preceding results, Theorems 6.1.1, 6.1.2, establish sufficient conditions for the solvability of the output measurement-feedback problem. However, they are not satisfactory in that, firstly, they do not give any hint on how to select the output-injection gain-matrix G(.) such that the HJIE (6.11) or (6.16) is satisfied and the closed-loop system (6.13) is locally asymptotically-stable. Secondly, the HJIEs (6.11), (6.16) have twice as many independent variables as the HJIE (5.12). Thus, a final step in the above design procedure would be to partially address these concerns.
Accordingly, an alternative set of sufficient conditions can be provided which involve an additional HJI-inequality having the same number of independent variables as the dimension of the plant and not involving the gain matrix G(.). For this, we begin with the following lemma.
Lemma 6.1.1 Suppose V is a C3 solution of the HJIE (5.12) and Q: O→ℜ is a C3 positive-definite function locally defined in a neighborhood of x = 0, vanishing at x = 0 and satisfying
S(x)<0
where
S(x)=Qx(x)(˜f(x)−G(x)˜h2(x))+12γ2Qx(x)(g1(x)−G(x)k21(x))(g1(x)−G(x)k21(x))TQTx(x)+12αT2(x)α2(x),
for each x ≠ 0 and such that the Hessian matrix ∂2s∂x2 is nonsingular at x = 0. Then the function
W(xe)=Q(x−ξ)
satisfies the conditions (i), (ii) of Theorem 6.1.1.
Proof: Let W(xe)=Q(x−ξ) and let
ϒ(x,ξ)=Wxe(xe)fe(xe)+12γ2Wxe(xe)ge(xe)geT(xe)WTxe((xe)+12heT(xe)he(xe).
Thus, to show that W(.) as defined above satisfies condition (i) of Theorem 6.1.1, is to show that ϒ is nonpositive. For this, set
e=x−ξ
and let
Π(e,ξ)=[ϒ(x,ξ)]x=ξ+e .
It can then be shown by simple calculations that
Π(0,ξ)=0,∂∏(e,ξ)∂e|e=0 =0,
which means Π(x, ξ) can be expressed in the form
Π(e,ξ)=eTΛ(e,ξ)e
for some C0 matrix function Λ(., .). Moreover, it can also be verified that
Λ(0,0)=∂2S(x)∂x2|x=0<0
by assumption. Thus, Π(e, ξ) is nonpositive in the neighborhood of (x, ξ) = (0, 0) and (i) follows.
To establish (ii), note that by hypothesis S(x) < 0 implies
Qx(x)(˜f(x)−G(x)˜h2(x))<0.
Therefore, Q(x) > 0 is a Lyapunov-function for the system (6.13), and the equilibrium-point ξ = 0 of this system is locally asymptotically-stable. Hence the result. □
As a consequence of the above lemma, we can now present alternative sufficient conditions for the solvability of the MFBNLHICP. We begin with the following assumption.
Assumption 6.1.4 The matrix
R1(x)=kT21(x)k21(x)
is nonsingular (and positive-definite) for each x.
Theorem 6.1.3 Consider the nonlinear system (6.1) and suppose Assumptions 6.1.1, 6.1.2 and 6.1.4 hold. Suppose also the following hold:
(i) there exists a C3 positive-definite function V, locally defined in a neighborhood N of x = 0 and satisfying the HJIE (5.12);
(ii) there exists a C3 positive-definite function Q: N1⊂X→ℜ+ locally defined in a neighborhood N1 of x = 0, vanishing at x = 0, and satisfying the HJ-inequality
(6.23) |
together with the coupling condition
(6.24) |
for some n × m smooth C2 matrix function L(x), where
f#(x)=˜f(x)−g1(x)kT21(x)R−11(x)˜h2(x), g#1(x)=g1(x)[I−kT12(x)R−11(x)k21(x)], H#(x)=(αT2(x)α2(x)−γ2˜h(x)R−11(x)˜h2(x)),
and such that the Hessian matrix of the right-hand-side of (6.23) is nonsingular for all x ∈ N1.
Then the MFBNLHICP for the system is solvable with the controller (6.7), (6.5) if G(.) is selected as
(6.25) |
Proof: (Sketch, see [139] for details). By standard completion of squares arguments, it can be shown that the function S(x) satisfies the inequality
S(x) ≥Qx(x)(˜f(x)−g1(x)kT21(x)R−11(x)˜h2(x))+12[αT2(x)α2(x)−γ2˜hT2(x)˜h2(x)]+ 12γ2Q(x)g1(x)[I−KT21(x)R−11(x)k21(x)]gT1(x)QTx(x),
and equality holds if and only if
Qx(x)G(x)=(γ2˜h2(x)+Qxg1(x)k21(x))R−11(x).
Therefore, in order to make S(x) < 0, it is sufficient for the new HJ-inequality (6.23) to hold for each x and G(x) to be chosen as in (6.25). In this event, the matrix G(x) exists if and only if Q(x) satisfies (6.24). Finally, application of the results of Theorem 6.1.1, Lemma 6.1.1 yield the result. □
Example 6.1.2 We consider the case of the linear system (6.18) in which case the simplifying assumptions (6.2) reduce to:
KT12K12=I, HT1K12=0, K21GT1=0, KT21K21=R1>0.
The existence of a C3 positive-definite function Q satisfying the strict HJ-inequality (6.23) is equivalent to the existence of a symmetric positive-definite matrix Z satisfying the bounded-real inequality
(6.26) |
where
F#=˜F−G1KT21R−11˜H2, G#1=G1(I−KT21R−11K21), H#=FT2F2−γ2˜HT2R−11˜H2
and
˜F=F+G1F1, ˜H2=H2+K21F1 F1=1γ2GT1P, F2=−GT2P
with P a symmetric positive-definite solution of the ARE (5.22). Moreover, if Z satisfies the bounded-real inequality (6.26), then the output-injection gain G is given by
G=(γ2Z−1˜HT2+G1KT21)R−11,
with L=12Z−1˜HT2.
Remark 6.1.3 From the discussion in [264] regarding the necessary conditions for the existence of smooth solutions to the HJE (inequalities) characterizing the solution to the state-feedback nonlinear H∞-control problem, it is reasonable to expect that, if the linear MFBNLHICP corresponding to the linearization of the system (6.1) at x = 0 is solvable, then the nonlinear problem should be locally solvable. As a consequence, the linear approximation of the plant (6.1) at x = 0 must satisfy any necessary conditions for the existence of a stabilizing linear controller which solves the linear MFBNLHICP.
Remark 6.1.4 Notice that the controller derived in Theorem 6.1.3 involves the solution of two uncoupled HJ-inequalities and a coupling-condition. This is quite in agreement with the linear results as derived in [92], [101], [292].
In the next subsection, we discuss the problem of controller parametrization.
6.1.1 Controller Parameterization
In this subsection, we discuss the parametrization of a set of stabilizing controllers that locally solves the MFBNLHICP. As seen in the case of the state-feedback problem, this set is a linear set, parametrized by a free parameter system, Q, which varies over the set of all finite-gain asymptotically-stable input-affine systems. However, unlike the linear case, the closed-loop map is not affine in this free parameter.
Following the results in [92] for the linear case, the nonlinear case has also been discussed extensively in references [188, 190, 215]. While the References [188, 190] employ direct state-space tools, the Reference [215] employs coprime factorizations. In this subsection we give a state-space characterization.
The problem at hand is the following. Suppose a controller Σc (herein-after referred to as the “central controller”) of the form (6.3) solves the MFBNLHICP for the nonlinear system (6.1). Find the set of all controllers (or a subset), that solves the MFBNLHICP for the system.
As in the state-feedback case, this problem can be solved by an affine parametrization of the central controller with a system Q∈FG having the realization
(6.27) |
where q∈X and a:X→V∞X, b:X→Mn×p, c:X→ℜm are smooth functions. Then the family of controllers
(6.28) |
where ΣQ varies over all Q∈FG and ŷ is the estimated output, also solves the MFBNLHICP for the system (6.1). The structure of this parametrization is also shown on Figure 6.2, and we have the following result.
Proposition 6.1.2 Assume Σa is locally smoothly-stabilizable and locally zero-state detectable. Suppose there exists a controller of the form Σc that locally solves the MFBNLHICP for the system, then the family of controllers ΣcQ given by (6.28), where Q∈FG is a parametrization of the set of all input-affine controllers that locally solves the problem for Σa.
Proof: We defer the proof of this proposition to the next theorem, where we give an explicit form for the controller ΣcQ.
Theorem 6.1.4 Suppose all the hypotheses (i)-(iv) of Theorem 6.1.2 hold, and k21(x)kT21(x)=I. In addition, assume the following hypothesis also holds:
(v) There exists an input-affine system ∑Q as defined above, whose equilibrium-point q = 0 is locally asymptotically-stable, and is such that there exists a smooth positive-definite function U : N4 → ℜ+, locally defined in a neighborhood of q = 0 in X satisfying the HJE:
(6.29) |
Then the family of controllers
∑cQ:{ ˙ξ = fb(ξ)+G(y−h2(ξ))+Γ(ξ)υ ˙q =a(q)+b(q)r υ =c(q) r =y−ˆy u =α2(ξ)+υ |
(6.30) |
where
fb(ξ)=f(ξ)+g1(ξ)α1(ξ)+g2(ξ)α2(ξ), ˆy=˜h2(ξ)=h2(ξ)+k21(ξ)α1(ξ)
solves the MFBNLHICP locally.
Proof: First observe that the controller (6.30) is exactly the controller given in (6.28) with explicit state-space realization for Σc. Thus, proving the result of this theorem also proves the result of Proposition 6.1.2.
Accordingly, we divide the proof into several steps:
(a) Consider the Hamiltonian function defined in Chapter 5:
H(x,w,u)=Vx(x)(f(x)+g1(x)w+g2(x)u)+12∥hT1∥2+12∥u∥2−12γ2∥w∥2
which is quadratic in (w, u). Observe that
(∂H∂u)w=α1(x) =0, (∂H∂u)u=α2(x)=0.
Therefore for every (w, u) we have
Vx(x)(f(x)+g1(x)w+g2(x)u) = 12[‖u−α2(x)‖2−γ2‖w−α1(x)‖2− ‖u‖2+γ2‖w‖2−‖h1‖2].
(b) Consider now the closed-loop system:
˙x =f(x)+g1(x)w+g2(x)(α2(ξ)+υ) ˙ξ =fb(ξ)+G(ξ)(y−h2(ξ))+Γ(ξ)υ} |
(6.31) |
and the augmented pseudo-Hamiltonian function
Hb(x,ξ,υ,w,λ,μ) = λT[f(x)+g1(x)w+g2(x)(α2(ξ)+υ)]+ μT[fb(ξ)+G(ξ)(y−h2(ξ))+Γ(ξ)υ]+ 12∥α2(ξ)+υ−α2(x)∥2−12γ2∥w−α1(x)∥2.
By settings (˜v,˜w) such that
(∂Hb∂υ)υ=˜υ=0, (∂Hb∂w)w=˜w=0,
then for every (v, w)we have
Hb(x,ξ,υ,w)=Hb⋆(x,ξ,υ,w)+12‖υ−˜υ‖2−12γ2‖w−˜w‖2,
where
Hb⋆(z,ξ,υ,w) = Hb(x,ξ,˜υ,˜w,WTx,WTξ) ≤ 0.
Furthermore, along any trajectory of the closed-loop system (6.31),
˙W = Wx˙x+Wξ˙ξ = Hb⋆(x,ξ,υ,w)+12‖υ−˜υ‖2−12γ2‖w−˜w‖2− 12‖α2(ξ)+υ−α2(x)‖2−12γ2‖w−α1(x)‖2.
(c) Similarly, consider now the system (6.27). By hypothesis (v), we have
Uq(q)(a(q)+b(q)r) = 12[−γ2‖r−r⋆‖2−‖υ‖2+γ2‖r‖2]
where r*=1γ2bT(q)UTq(q).
(d) Then, consider the Lyapunov-function candidate
Ω(x,ξ,q)=V(x)+W(x,ξ)+U(q),
which is locally positive-definite by construction. Differentiating Ω along the trajectories of the closed-loop system (6.31), (6.27), we get
˙Ω = Vx(x)˙x+Wx(x,ξ)˙x+Wξ(x,ξ)˙ξ+Uq(q)˙q = Hb⋆(x,ξ,υ,w)+12{[‖u−α2(x)‖2−γ2‖w−α1(x)‖2−‖u‖2+ γ2‖w‖2−‖h1(x)‖2+[‖υ−˜υ‖2−γ2‖w−˜w‖2−‖u−α2(x)‖2+ γ2 ‖w−α1(x)‖2]+[−γ2‖r−r⋆‖2−‖υ‖2+γ2‖r‖2]}.
Notice that
‖υ−˜υ‖2≤ γ2‖r−r⋆‖2
since ‖Q‖H∞≤γ. Therefore
(6.32) |
Integrating now the above inequality from t = t0 to t = T, we have
Ω(x(T),T)−Ω(x(t0),t0) ≤ 12 ∫Tt0[γ2‖r‖2+‖w‖2−‖υ‖2−‖z‖2]dt.
Which implies that the closed-loop system has L2-gain ≤ γ from [rw] as [vz] as desrired.
(e) To show closed-loop stability, set w = 0, r = 0, in (6.32) to get
˙Ω≤ -12(‖z‖2+‖υ‖2).
This proves that the equilibrium-point (x, ξ, q) = (0, 0, 0) is stable. Further, the condition that ˙Ω=0 implies that w = 0, r = 0, z = 0, v = 0 ⇒ u = 0, h1(x) = 0, α2(ξ) = 0.Thus, any trajectory of the system (xo(t), ξo(t), qo(t)) satisfying the above conditions, is necessarily a trajectory of the system
˙x = f(x)˙ξ = f(ξ)+g1(ξ)α1(ξ)+G(ξ)(h2(x)−h2(ξ))˙q = a(q).
Therefore, by Assumption (i) and the fact that ˙q = a(q) is locally asymptotically-stable about q = 0, we have limt→∞(x(t), q(t)) = 0. Similarly, by Assumption (iii) and a well known stability property of cascade systems, we also have limt→∞ ξ(t) = 0. Finally, by LaSalle’s invariance-principle we conclude that the closed-loop system is locally asymptotically-stable. □
6.2 Output Measurement-Feedback Nonlinear H∞ Tracking Control
In this section, we discuss the output measurement-feedback tracking problem which was discussed in Section 5.2 for the state-feedback problem. The objective is to design an output-feedback controller for the system (6.1) to track a reference signal which is generated as the output ym of a reference model defined by
6.33 |
xm∈ℜl, fm:ℜl→ℜl. We assume also for simplicity that this system is completely observable (see References [268, 212]). The problem can then be defined as follows.
Definition 6.2.1 (Measurement-Feedback Nonlinear H∞ (Suboptimal) Tracking Control Problem (MFBNLHITCP)). Find (if possible!) a dynamic output-feedback controller of the form
{˙xc = ˙ηc(x,xm), xc(t0) = xc0u= αdyntrk(xc,xm), αdyntrk:N1טNm→ℜp |
6.34 |
xc⊂ℜn, Nc⊂ X, ˜Nm⊂ℜl, for some smooth function αdyntrk, such that the closed-loop system (6.1), (6.34), (6.33) is exponentially stable about (x, xc, xm) = (0, 0, 0), and has for all initial conditions starting in N1טNm neighborhood of (0, 0), locally L2-gain from the disturbance signal w to the output z less than or equal to some prescribed number γ⋆ > 0, as well as the tracking error satisfies limt→∞{y − ym} = 0.
To solve the above problem, we consider the system (6.1) with
z=[h1(x) u]
and follow as in Section 5.2 a two-step design procedure.
Step 1:_ We seek a feedforward dynamic-controller of the form
6.35 |
where xc∈ℜn, so that the equilibrium point (x, xc) = (0, 0) of the closed-loop system with w = 0,
˙x = f(x)+g2(x)c(xc,xm) ˙xc = a(xc)+b(xc)h2(x)
is exponentially stable and there exists a neighborhood ˜U⊂X×ℜn×ℜl of (0, 0, 0) such that, for all initial conditions (x0,xc0,xm0)∈˜U, the solution (x(t), xc(t), xm(t)) of
{˙x = f(x)+g2(x)c(xc,xm)˙xc =a(xc)+b(xc)h2(x)˙xm = fm(xm)
satisfies
limt→∞ h1(θ(x(t)))−hm(xm(t))=0.
To solve this step, we similarly seek an invariant-manifold
˜Mθ,σ={x|x=θ(xm),xc=σ(xm)}
which is invariant under the closed-loop dynamics (6.36) and is such that the error e = h1(x(t)) − hm(xm(t)) vanishes identically on this submanifold. Again, this requires that the following necessary conditions are satisfied by the control law u=ˉu⋆(xm):
∂θ∂xm(xm)fm(xm)=f(θ(xm))+g2(θ(xm)ˉu⋆(xm)) ∂σ∂xm(xm)f(xm)=a(σ(xm))+b(σ(xm))y⋆(xm) h1(θ(x(t))) −hm(xm(t))=0,
where ˉu⋆(xm)=c(σ(xm),xm), y⋆(xm)=h2(θ(xm)).
Now, assuming the submanifold ˜Mθ,σ is found and the control law ˉu⋆(xm) has been designed to maintain the system on this submanifold, the next step is to design a feedback control v so as to drive the system onto the above submanifold and to achieve disturbance-attenuation and asymptotic-tracking. To formulate this step, we reconsider the combined system with the disturbances
{˙x = f(x)+g1(x)w+g2(x)u ˙x = a(xc)+b(xc)h2(x)+b(xc)k21(x)w˙xm = fm(xm)y = h2(x)+k21(x)w, |
(6.36) |
and introduce the following change of variables
ξ1 = x−θ(xm)ξ2 = xc−σ(xm)υ =u−˜u⋆(xm)
Then
˙ξ1=˜F1(ξ1,ξ2,xm) +˜G 11(ξ1,xm) w+˜G12(ξ1,xm)υ˙ξ2=˜F(ξ1,ξ2,xm)+˜G21(ξ2,xm)w˙xm =fm(xm)y=[h2(ξ1xm)+k21(ξ1,xm)w hm(xm)]
where
˜F1(ξ1,xm) =f(ξ1+θ(xm))−∂θ∂xm(xm)fm(xm)+g2(ξ1+θ(xm))ˉu⋆(xm)˜F2(ξ2,xm) =a(ξ2+σ(xm))+b(ξ2+σ(xm))h2(ξ1+θ(xm))−∂σ∂xm(xm)fm(xm)˜G11(ξ1,xm) = g1(ξ1+θ(xm))˜G12(ξ1,xm) =g2(ξ1+θ(xm))˜G21(ξ2,xm)=b(ξ2+σ(xm))k21(ξ1+θ(xm)).
Similarly, we redefine the tracking error and the new penalty variable as
˜z = [h1(ξ+θ(xm))−hm(xm) υ].
Step 2:_ Find a dynamic feedback compensator of the form (6.35) with an auxiliary output v*=v*(ξ,xm) so that the closed-loop system (6.35), (6.36) is exponentially stable and along any trajectory (ξ(t), xm(t)) of the closed-loop system, the L2-gain condition
T∫0||˜z(t)||2≤γ2T∫0||w(t)||2dt+˜β(ξ(t0),xm0)⇔T∫0{||h1(ξ+θ(xm))−hm(xm)||2+||υ||2}≤γ2T∫0||w(t)||2dt+β(ξ(t0),xm0)
is satisfied for some function ˜β, for all w∈W, for all T < ∞ and all initial conditions (ξ1(t0), ξ2(t0), xm0) in a sufficiently small neighborhood of the origin (0, 0).
The above problem is now a standard measurement-feedback H∞ control problem, and can be solved using the techniques developed in the previous section.
6.3 Robust Output Measurement-Feedback Nonlinear H∞-Control
In this section, we consider the robust output measurement-feedback nonlinear H∞-control problem for the affine nonlinear system (6.1) in the presence of unmodelled dynamics and/or parameter variations. This problem has been considered in many references [34, 208, 223, 265, 284], and the set-up is shown in Figure 6.3. The approach presented here is based on [208]. For this purpose, the system is represented by the model:
∑a△: {x = f(x)+Δf(x,θ,t)+g1(x)w+[g2(x)+Δg2(x,θ,t)]u; x(0)=x0 z = h1(x)=k12(x)u y = [h2(x)+Δh2(x,θ,t)]+k21(x)w |
(6.37) |
where all the variables and functions have their previous meanings and in addition X→V∞(X), Δg2:X→Mn×p(X), and Δh2:X→ℜm are unknown functions which belong to the set Ξ1 of admissible uncertainties, and θ∈Θ⊂ℜm are the system parameters which may vary over time within the set Θ.
Definition 6.3.1 (Robust Output Measurement-Feedback Nonlinear “∞-Control Problem (RMFBNLHICP)). Find (if possible!) a dynamic controller of the form (6.3) such that the closed-loop system (6.37), (6.3) has locally L2-gain from the disturbance signal w to the output z less than or equal to some prescribed number γ⋆ > 0 with internal stability, for all admissible uncertainties Δf, Δg2, Δh2, ∈ Ξ1 and all parameter variations in Θ.
We begin by first characterizing the sets of admissible uncertainties of the system Ξ1 and Θ1 as
Assumption 6.3.1 The sets of admissible uncertainties of the system are characterized as for some known matrices H1(.), F (., .), E1(.), E2(.), H3(.), E3 of appropriate dimensions.
Ξ1 ={Δf,Δg2,Δh2|Δf(x,θ,t)=H1(x)F(x,θ,t)E1(x), Δg2(x,θ,t)=g2(x)F(x,θ,t)E2(x),Δh2=H3(x)F(x,θ,t)E3(x),||F(x,θ,t)||2≤1, ∀x∈X,θ∈Θ,t∈ℜ},Θ1 = {θ|0≤θu,||θ||≤κ<∞},
for some known matrices H1(.), F (., .), E1(.), E2(.), H3(.), E3 of appropriate dimensions.
Then, to solve the RMFBNLHICP, we clearly only have to modify the solution to the nominal MFBNLHICP given in Theorems 6.1.1, 6.1.2, 6.1.3, and in particular, modify the HJIEs (inequalities) (6.39), (6.23), (5.12) to account for the uncertainties in Δf, Δg2 and Δh2.
The HJI-inequality (5.24) has already been modified in the form of the HJI-inequality (5.46) to obtain the solution to the RSFBNLHICP for all admissible uncertainties Δf, Δg2 in Ξ. Therefore, it only remains to modify the HJI-inequality (6.23), or equation (6.16) to account for the uncertainties Δf, Δg2, and Δh2 in Ξ1, Θ. If we choose to modify HJIE (6.16) to accomodate the uncertainties, then the result can be stated as a corollary to Theorem 6.1.3 as follows.
Corollary 6.3.1 Consider the nonlinear system (6.37) and assume that k21(x)kT21(x)=I in (6.2). Suppose also the following hold:
(i) The pair{f, h1} is locally detectable.
(ii) There exists a smooth positive-definite function ˜V locally defined about the origin with ˜V(0)=0 and satisfying the HJIE (5.46).
(iii) There exists an n × m matrix GΔ: X→Mn×m such that the equilibrium ξ = 0 of the system
(6.38) |
is locally asymptotically-stable.
(iv) There exists a smooth positive-semidefinite function ˜W(x,ξ) locally defined in a neighborhood ˜N1טN1 ⊂X×X of the origin (x, ξ) = (0, 0) such that ˜W(0,ξ)>0 for each ξ ≠ 0, and satisfying the HJIE
[˜Wx(x,ξ) ˜Wξ(x,ξ)]fe(x,ξ)+12γ2[˜Wx(x,ξ)˜Wξ(x,ξ)][gT1gT1(x)+HT1(x)H1(x)00G(ξ)GT(ξ)+0HT1(ξ)H1(ξ)]×[˜WTx(x,ξ)˜WTξ(x,ξ)]+12(hTe(x,ξ)he(x,ξ)+ET1(x)E1(x)+ET3(ξ)E3(ξ))=0,˜W(0,0)=0(x,ξ)∈˜N1טN1, |
(6.39) |
where fe(x, ξ), he(x, ξ) are as defined previously.
Then, the R MFBNLHICP for the system is solved by the output-feedback controller:
˜Σc2 : {˙ξ = f(ξ)+H1(ξ)E1(ξ)+g1(ξ)α1(ξ)+g2(ξ)α2(ξ)+GΔ(ξ)[y− h2(ξ)−H3(ξ)E3(ξ)]u= α2(ξ), |
(6.40) |
Proof: The proof can be pursued along the same lines as Theorem 6.1.3. □
In the next section we discuss another aspect of robust control known as reliable control.
6.3.1 Reliable Robust Output-Feedback Nonlinear H∞-Control
In this subsection, we discuss reliable control which is another aspect of robust control. The aim however in this case is to maintain control and stability in the event of a set of actuator or sensor failures. Failure detection is also another aspect of reliable control.
For the purpose of elucidating the scheme, let us represent the system on X⊂ℜn in the form
Σar : {˙x = f(x)+g1(x)w0+Σpj=1g2j(x)uj; x(0)=x0,yi = h2i(x)+wi, i=1,…,m,z = [h1(x) u1 ⋮ up], |
(6.41) |
where all the variables have their usual meanings and dimensions, and in addition w = [wT0wT1…wTm]T ∈ ℜs is the overall disturbance input vector,
g2(x) = [g21(x)g22(x)…g2p(x)], h2(x)=[h21(x)h22(x)…h2m(x)]T.
Without any loss of generality, we can also assume that f(0) = 0, h1(0) = 0 and h2i(0) = 0, i = 1, …, m. The problem is the following.
Definition 6.3.2 (Reliable Measurement-Feedback Nonlinear “∞-Control Problem (RLMFBNLHICP)). Suppose Ƶa ⊂ {1, …, p} and Ƶs ⊂ {1, …, m} are the subsets of actuators and sensors respectively that are prone to failure. Find (if possible!) a dynamic controller of the form:
(6.42) |
where ζ∈ℜV, such that the closed-loop system denoted by Σar ◦Σrc is locally asymptotically-stable and has locally L2-gain ≤ γ for all initial conditions in N⊂X and for all actuator and sensor failures in Za and Zs respectively.
To solve the above problem, let ia∈Za, i′a∈ˉZa, js∈Zs, j′s∈ˉZs denote the indices of the elements in Za, ˉZa, Zs and ˉZs respectively, where the “bar” denotes “set complement.” Accordingly, introduce the following decomposition
g2(x) = g2ia(x)⊕g2i′a(x) u= uia⊕ui′ah2(x)=h2is(x)⊕h2i′s(x) y= yis⊕yi′s w= [w1…wm]T=wis⊕wi′sb(x) = [b1(x)b2(x)…bm(x)]=bis(x)⊕bi′s(x)
where
g2ia(x)=[δia(1)g21(x) δia(2)g22(x) …δia(p)g2p(x)]uia(x) = [δia(1)u1(x) δia(2)u2(x) …δia(p)up(x)]Th2is(x) =[δis(1)h21(x) δis(2)h22(x) …δis(m)h2m(x)]Tyis(x)=[δis(1)y1 δis(2)y2 … δis(m)ym]Twis(x)=[δis(1)w1 δis(2)w2 … δis(m)wm]Tbis(x)=[δis(1)b1(x) δis(2)b2 (x) …δis(m)bm(x)]T
and
δia(i)={1 if i ∈ Za 0 if i ∉ Za δis(j) ={1 if j ∈ Zs 0 if j ∉ Zs.
Applying the controller Σrc to the system when actuator and/or sensor failures corresponding to the indices ia∈Za and is∈Zs occur, results in the closed-loop system Σar(Zs) ∘ Σrs(Za) :
{˙x = f(x)+ g1(x)w0 + g2ia′(x)ci′a(ζ); x(0) = x0 ˙ζ = a(ζ) + bi′s(ζ)yi′s = a(ζ) + bi′s(ζ)h2is′(x) + bi′s(ζ)wi′szi′a = [h1(x)ci′a(ζ)] . |
(6.43) |
The aim now is to find the controller parameters a(.), b(.) and c(.), such that in the event of any actuator or sensor failure for any ia∈Za and is∈Zs respectively, the resulting closed-loop system above still has locally L2-gain ≤ γ from wi's to zi'a and is locally asymptotically-stable. In this regard, define the Hamiltonian functions in the local coordinates (x, p) on T∗X by
Hs(x,p) = pTf(x)+12pT[1γ2g1(x)gT1(x)−g2ˉZa(x)gT2ˉZa(x)]p+ 12hT1(x)h1(x)+12γ2hT2Z(x)h2Zs(x),H0(x,p)=pTf(x)+12γ2pTg1(x)gT1(x)p+12pTg2Za(x)gT2Za(x)p+ 12hT1(x)h1(x)−12γ2hT2˜Zs(x)h2ˉZs(x).
Then, the following theorem gives a sufficient condition for the solvability of the reliable control problem.
Theorem 6.3.1 Consider the nonlinear system (6.41) and assume the following hold:
(i) The pair {f, h1} is locally zero-state detectable.
(ii) There exists a smooth C2 function ψ ≥ 0, ψ(0) = 0 and a C3 positive-definite function ˜V locally defined in a neighborhood N0 of x = 0 with ˜V(0)=0 and satisfying the HJIE:
(6.44) |
(iii) There exists a C3 positive-definite function ˜U locally defined in a neighborhood N1 of x = 0 with ˜U(0)=0 and satisfying the HJI-inequality:
(6.45) |
and such that N0∩N1≠0, H0(x,˜UTx)+ψ(x) has nonsingular Hessian matrix at x = 0.
(iv) ˜U(x)−˜V(x) is positive definite, and there exists an n × m matrix function L(.) that satisfies the equation
(6.46) |
Then, the controller Σrc solves the RLMFBNLHICP for the system Σar with
a(ζ)=f(ζ)+1γ2g1(ζ)gT1(ζ)˜VTx(ζ)−g2˜Za(ζ)gT2˜Za(ζ)˜VTx(ζ)−L(ζ)h2(ζ)b(ζ)=L(ζ)c(ζ)=−gT2(ζ)˜VTx(ζ).
Proof: The proof is lengthy, but can be found in Reference [285]. □
Remark 6.3.1 Theorem 6.3.1 provides a sufficient condition for the solvability of reliable controller design problem for the case of the primary contingency problem, where the set of sensors and actuators that are susceptible to failure is known a priori. Nevertheless by enlarging the sets Za, Zs to {1, …, p} and {1, …, m} respectively, the scheme can be extended to include all actuators and sensors.
We consider now an example.
Example 6.3.1 [285]. Consider the following second-order system.
(6.47) |
(6.48) |
(6.49) |
The index sets are given by Za={2}, Zs={0}, and the system is locally zero-state detectable. With γ = 0.81 and
ψ(x)=12x21+8γ2γ2−x21x62,
approximate solutions to the HJI-inequalities (6.44), (6.45) can be obtained as
(6.50) |
(6.51) |
respectively. Finally, equation (6.46) is solved to get
L(x)=[1.0132 0.8961]T.
The controller can then be realized by computing the values of a(x), b(x) and c(x) accordingly.