12

Mixed2/ Nonlinear Filtering

The H nonlinear filter has been discussed in Chapter 8, and its advantages over the Kalman-filter have been mentioned. In this chapter, we discuss the mixed H2/H-criterion approach for estimating the states of an affine nonlinear system in the spirit of Reference [179]. Many authors have considered mixed H2/H-filtering techniques for linear systems [162, 257], [269]-[282], which enjoy the advantages of both Kalman-filtering and H-filtering. In particular, the paper [257] considers a differential game approach to the problem, which is attractive and transparent. In this chapter, we present counterpart results for nonlinear systems using a combination of the differential game approach with a dissipative system’s perspective. We discuss both the continuous-time and discrete-time problems.

12.1  Continuous-Time Mixed H2/H Nonlinear Filtering

The general set-up for the mixed H2/H-filtering problem is shown Figure 12.1, where the plant is represented by an affine nonlinear system Σa, while F is the filter. The filter processes the measurement output y from the plant which is also corrupted by the noise signal w, and generates an estimate ˆz of the desired variable z. The noise signal w entering the plant is comprised of two components,

Image

FIGURE 12.1
Set-Up for Mixed H2/H Nonlinear Filtering

w=(w0w1), with w0S a bounded-spectral signal (e.g., a white Gaussian noise signal); and w1 ∈ P is a bounded-power signal or L2-signal. Thus, the induced-norm (or gain) from the input w0 to z (the output error) is the L2-norm of the interconnected system F ◦ Σa, i.e.,

FaL2sup0w0Szpw0S,

(12.1)

while the induced norm from w1 to z is the L-norm of Pa, i.e.,

FaLsup0w0Pz2w12.

(12.2)

The objective is to synthesize a filter, F, for estimating the state x(t) or a function of it, z = h1(x), from observations of y(τ) up to time t over a time horizon [t0, T ], i.e.,

Yt{y(τ):τt},t[t0,T],

such that the above pair of norms (12.1), (12.2) are minimized, while at the same time achieving asymptotic zero estimation error with w ≡ 0. In this context, the above norms will be interpreted as the H2 and H-norms of the interconnected system.

However, in this chapter, we do not solve the above problem, instead we solve an associated H2/H-filtering problem in which there is a single exogenous input wL2[t0, T ] and an associated H2-cost which represents the output energy of the system in z. More specifically, we seek to synthesize a filter, F, for estimating the state x(t) or a function of it, z = h1(x), from the observations Yt such that the L2-gain from the input w to the penalty function z (to be defined later) as well as the output energy defined by zH2 are minimized, and at the same time achieving asymptotic zero estimation error with w ≡ 0.

Accordingly, the plant is represented by an affine nonlinear causal state-space system defined on a manifold Xn with zero control input:

a:{˙x=f(x)+g1(x)w;x(t0)=x0z=h1(x)y=h2(x)+k21(x)w,

(12.3)

where xX is the state vector, wWL2([t0,),r) is an unknown disturbance (or noise) signal, which belongs to W, the set of admissible noise/disturbance signals, yYm is the measured output (or observation) of the system, and belongs to Y, the set of admissible outputs, zs is the output of the system that is to be estimated.

The functions f:XV(X),g1:XMn×r(X),h1:Xs,h2:Xm and k21:XMn×r(X) are real C-functions of x. Furthermore, we assume without any loss of generality that the system (12.3) has a unique equilibrium-point at x = 0 such that f(0) = 0, h1(0) = 0, h2(0) = 0. We also assume that there exists a unique solution x(t, t0, x0, w), ∀t ∈ ℜ for the system for all initial conditions x0 and all wW.

Again, since it is difficult to minimize exactly the H-norm, in practice we settle for a suboptimal problem, which is to minimize zH2 while rendering FaHγ. For nonlinear systems, this H-constraint is interpreted as the L2-gain constraint and is defined as

Tt0z(τ)2dtγ2Tt0w(τ)2dτ,T>0.

(12.4)

More formally, we define the local nonlinear (suboptimal) mixed H2/H-filtering or state estimation problem as follows.

Definition 12.1.1 (Mixed H2/H (Suboptimal) Nonlinear Filtering Problem (MH2HINLFP)). Given the plant Σa and a number γ > 0, find a filter F:YX such that

ˆx(t)=(Yt)

and the output energy zL2 is minimized while the constraint (12.4) is satisfied for all γγ, for all wW and for all initial conditions x(t0) ∈ O. In addition with w ≡ 0, limt→∞ z(t) = 0.

Moreover, if the above conditions are satisfied for all x(t0) ∈ X, we say that F solves the MH2HINLFP globally.

Remark 12.1.1 The problem defined above is the finite-horizon filtering problem. We have the infinite-horizon problem if we let T →∞.

12.1.1  Solution to the Finite-Horizon Mixed H2/H Nonlinear Filtering Problem

To solve the MH2HINLFP, we similarly consider the following Kalman-Luenberger filter structure

af:{˙ˆx=f(ˆx)+L(ˆx,t)[yh2(ˆx)],ˆx(t0)=ˆx0ˆz=h1(ˆx)

(12.5)

where ˆxX is the estimated state, L(., .) ∈ Mn×m(X ×) is the error-gain matrix which is smooth and has to be determined, and ˆz ∈ ℜs is the estimated output of the filter. We can now define the estimation error or penalty variable, z, which has to be controlled as:

z=h1(x)h1(ˆx).

Then we combine the plant (12.3) and estimator (12.5) dynamics to obtain the following augmented system:

˙x=f(x)+g(x)w,x(t0)=(xT0ˆxT0)Tz=h(x)},

(12.6)

where

x=(xx),f(x)=(f(x)f(ˆx)+L(ˆx,t)(h2(x)h2(ˆx))),g(x)=(g1(x)L(ˆx,t)k21(x)),h(x)=h1(x)h1(ˆx).

The problem can then be formulated as a two-player nonzero-sum differential game (from Chapter 3, see also [59]) with two cost functionals:

J1(L,w)=12Tt0[γ2w(τ)2z(τ)2]dτ,

(12.7)

J2(L,w)=12Tt0z(τ)2dτ.

(12.8)

Here, the first functional is associated with the H-constraint criterion, while the second functional is related to the output energy of the system or H2-criterion. It can easily be seen that, by making J1 ≥ 0 then the H-constraint ‖F ◦ PaHγ is satisfied. A Nash-equilibrium solution [59] to the above game is said to exist if we can find a pair of strategies (L, w) such that

J1(L,w)J1(L,w)wW,

(12.9)

J2(L,w)J2(L,w)LMn×m.

(12.10)

To arrive at a solution to this problem, we form the Hamiltonian function Hi : T (X × XW × Mn×m → ℜ, i = 1, 2, associated with the two cost functionals:

H1(x,w,L,YTx)=Yx(x,t)(f(x)+g(x)w)+12(γ2w2z2),

(12.11)

H2(x,w,L,VTx)=Vx(x,t)(f(x)+g(x)w)+12z2

(12.12)

with the adjoint variables p1=YTx,p2=VTx respectively, for some smooth functions Y, V : X × X × ℜ → ℜ and where Yx,Vx are the row-vectors of first partial-derivatives of the functions with respect to (wrt) x respectively. The following theorem from Chapter 3 then gives sufficient conditions for the existence of a Nash-equilibrium solution to the above game.

Theorem 12.1.1 Consider the two-player nonzero-sum dynamic game (12.7)-(12.8),(12.6) of fixed duration [t0, T ] under closed-loop memoryless perfect state information pattern. A pair of strategies (w , L ) provides a feedback Nash-equilibrium solution to the game, if there exists a pair of C1-functions (in both arguments) Y, V: X × X × ℜ → ℜ, satisfying the pair of HJIEs:

Yt(x,t)=infwWH1(x,w,L,Yx),Y(x,T)=0,

(12.13)

Vt(x,t)=minLMn×mH2(x,w,L,Vx),V(x,T)=0.

(12.14)

Based on the above theorem, it is now easy to find the above Nash-equilibrium pair for our game. The following theorem gives sufficient conditions for the solvability of the MH2HINLFP. For simplicity we make the following assumption on the plant.

Assumption 12.1.1 The system matrices are such that

k21(x)gT1(x)=0,k21(x)kT21(x)=I.

Remark 12.1.2 The first of the above assumptions means that the measurement noise and the system noise are independent.

Theorem 12.1.2 Consider the nonlinear system (12.3) and the MH2HINLFP for it. Suppose the function h1 is one-to-one (or injective) and the plant Σa is locally asymptotically-stable about the equilibrium point x = 0. Further, suppose there exists a pair of C1 (with respect to both arguments) negative and positive-definite functions Y, V : N × N × ℜ → ℜ respectively, locally defined in a neighborhood N × NX × X of the origin x = 0, and a matrix function L : N × ℜ → Mn×m satisfying the following pair of coupled HJIEs:

Yt(x,t)+Yx(x,t)f(x)+Yˆx(x,t)f(ˆx)12γ2Yx(x)g1(x)gT1(x)YTx(x)12γ2Yˆx(x)L(ˆx,t)LT(ˆx,t)YTˆx(x)1γ2Yˆx(x)L(ˆx,t)LT(ˆx,t)VTˆx(x)12(h1(x)h1(ˆx))T(h1(x)h1(ˆx))=0,Y(x)=0

(12.15)

Vt(x,t)+Vx(x,t)f(x)+Vˆx(x,t)f(ˆx)1γ2Vx(x)g1(x)gT1(x)YTx(x)1γ2Vˆx(x)L(ˆx,t)LT(ˆx,t)YTˆxγ2(h2(x)h2(ˆx))T(h2(x)h2(ˆx))+12(h1(x)h1(ˆx))T(h1(x)h1(ˆx))=0,V(x,T)=0,

(12.16)

together with the coupling condition

Vˆx(x,t)L(ˆx,t)=γ2(h2(x)h2(ˆx))T,x,ˆxN.

(12.17)

Then:

(i)  there exists locally a Nash-equilibrium solution (w, L) for the game (12.7), (12.8), (12.6);

(ii)  the augmented system (12.6) is dissipative with respect to the supply-rate s(w,z)=12(γ2w2z2) and hence has finite L2-gain from w to z less or equal to γ;

(iii)  the optimal costs or performance objectives of the game are J1(L,w)=Y(x0,t0)andJ2(L,w)=V(x0,t0);

(iv)  the filter Σaf with the gain matrix L(ˆx,t) satisfying (12.17) solves the infinite-horizon MH2HINLF P for the system locally in N.

Proof: Assume there exist definite solutions Y, V to the HJIEs (12.15)-(12.16), and (i) consider the Hamiltonian function H1(., ., ., .) first:

H1(x,w,L,YTx)=Yxf(x)+Yˆxf(ˆx)+YˆxL(ˆx,t)(h2(x)h2(ˆx))+Yxg1(x)w+YˆxL(ˆx,t)k21(x)w12z2+12γ2w2

where some of the arguments have been suppressed for brevity. Since it is quadratic and convex in w, we can apply the necessary condition for optimality, i.e.,

H1w|w=w=0,

to get

w:=1γ2(gT1(x)YTx(x,t)+kT21(x)LT(ˆx,t)YˆxT(x,t)).

(12.18)

Substituting now w in the expression for H2(., ., ., .) (12.12), we get

H2(x,w,L,VTx)=Vxf(x)+Vˆxf(ˆx)+VˆxL(ˆx,t)(h2(x)h2(ˆx))1γ2Vxg1(x)gT1(x)YTx1γ2VˆxL(ˆx,t)LT(ˆx,t)YTˆx+12(h1(x)h1(ˆx))T(h1(x)h1(ˆx)).

Then completing the squares for L in the above expression for H2(., ., ., .), we get

H2(x,w,L,VTx)=Vxf(x)+Vˆxf(ˆx)1γ2Vxg1(x)gT1(x)YTx+12(h1(x)h1(ˆx))T(h1(x)h1(ˆx))+12γ2LT(ˆx,t)VTˆx+γ2(h2(x)h2(ˆx))2γ22h2(x)h2(ˆx)212γ2LT(ˆx,t)VTˆx+LT(ˆx,t)YTˆx2+12γ2LT(ˆx,t)YTˆx2.

Thus, choosing L according to (12.17) minimizes H2(., ., ., .) and renders the Nash-equilibrium condition

H2(w,L)H2(w,L)LMn×m

satisfied. Moreover, substituting (w, L ) in (12.14) gives the HJIE (12.16).

Next, substitute L as given by (12.17) in the expression for H1(., ., ., .) and complete the squares to obtain:

H1(x,w,L,YTx)=Yxf(x)+Yˆxf(ˆx)1γ2YˆxL(ˆx,t)LT(ˆx,t)VTˆx12γ2Yxg1(x)gT1(x)YTx12γ2YˆxL(ˆx)LT(ˆx)YTˆx+γ22w+1γ2gT1(x)YTx+1γ2kT21(x)LT(ˆx,t)YTˆx212z2.

Substituting now w = w as given by (12.18), we see that the second Nash-equilibrium condition

H1(w,L)H1(w,L),wW

is also satisfied. Therefore, the pair (w, L ) constitute a Nash-equilibrium solution to the two-player nonzero-sum dynamic game. Moreover, substituting (w, L ) in (12.13) gives the HJIE (12.15).

(ii) Consider the HJIE (12.15) and rewrite as:

{Yt(x,t)+Yxf(x)+Yˆxf(ˆx)+YˆxL(ˆx,t)(h2(x)h2(ˆx)+Yxg1(x)w+YˆxL(ˆx,t)k21(x)w12z2+12γ2w2}12γ2w2Yxg1(x)wYˆxL(ˆx,t)k21(x)w12γ2Yxg1(x)gT1(x)YTx12γ2YˆxL(ˆx,t)LT(ˆx,t)YTˆx=0{Yt(x,t)+Yˆx[f(x)+g(x)w]+12γ2w212z2}12γ2ww2=0{Yt(x,t)+Yx[f(x)+g(x)w]}12γ2w212z2

(12.19)

for some function Y=Y>0. Integrating now the last expression above from t = t0 and x(t0) to t = T and x(T), we get the dissipation-inequality:

Y(x(T),T)Y(x(t0),t0)12Tt0{γ2w(t)2z(t)2}dt,

(12.20)

and hence the result.

(iii) Consider the cost functional J1(L, w) first, and rewrite it as

J1(L,w)+Y(x(T),T)Y(x(t0),t0)=Tt0{12γ2w(t)212z(t)2+˙Y(x,t)}dt=Tt0H1(x,w,L,YTx)dt=Tt0{γ22ww2+YˆxL(ˆx,t)(h2(x)h2(ˆx))+1γ2YˆxL(ˆx,t)LT(ˆx,t)VTˆx}dt,

where use has been made of the HJIE (12.15) in the above manipulations. Substitute now (L, w) as given by (12.17), (12.18) respectively to get the result.

Similarly, consider the cost functional J2(L, w) and rewrite it as

J2(L,w)V(x(t0),t0)=Tt0{12z(t)2+˙V(x,t)}dt=Tt0H2(x,w,L,YTx)dt.

Then substituting (L, w) as given by (12.17), (12.18) respectively, and using the HJIE (12.16) the result also follows.

Finally, (iv) follows from (i)-(iii). □

Remark 12.1.3 Notice that by virtue of (12.17), the HJIE (12.16) can also be represented in the following form:

Vt(x,t)+Vx(x,t)f(x)+Vˆx(x,t)f(ˆx)1γ2Vx(x,t)g1(x)gT1(x)YTx(x,t)1γ2Vˆx(x,t)L(ˆx,t)LT(ˆx,t)YTˆx(ˆx,t)1γVˆx(x,t)L(ˆx,t)LT(ˆx,t)VTˆx(x,t)+12(h1(x)h1(ˆx))T(h1(x)h1(ˆx))=0,V(x,T)=0.

(12.21)

The above result (Theorem 12.1.2) can be specialized to the linear-time-invariant (LTI) system:

l:{˙x=Ax+G1w,x(t0)=x0z=C1(xˆx)y=C2x+D21w,

(12.22)

where all the variables have their previous meanings, and An×n,G1n×r,C1s×n,C2m×n and D21m×r are real constant matrices. We have the following corollary to Theorem 12.1.2.

Corollary 12.1.1 Consider the LTI system Σl defined by (12.22) and the MH2HINLFP for it. Suppose C1 is full-column rank and A is Hurwitz. Suppose further, there exist a negative and a positive-definite real-symmetric solutions P1(t), P2(t), t ∈ [t0, T ] (respectively) to the coupled Riccati ordinary differential-equations (ODEs):

˙P1(t)=ATP1(t)+P1(t)A1γ2[P1(t)P2(t)][L(t)LT(t)+G1GT1L(t)LT(t)L(t)LT(t)0][P1(t)p2(t)]CT1C1,P1(T)=0,

(12.23)

˙P2(t)=ATP2(t)+P2(t)A1γ2[P1(t)+P2(t)][0L(t)LT(t)+G1GT1L(t)LT(t)+G1GT12L(t)LT(t)][P1(t)P2(t)]+CT1C1,P2(T)=0,

(12.24)

together with the coupling condition

P2(t)L(t)=γ2CT2,

(12.25)

for some n × m matrix function L(t) defined for all t ∈ [t0, T ]. Then:

(i)  there exists a Nash-equilibrium solution (w, L) for the game given by

w:=1γ2(GT1+DT21(t))P1(t)(xˆx),(xˆx)TP2(t)L=γ2(xˆx)TCT2;

(ii)  the augmented system

fl:{˙x=[A0L(t)C2ALC2]x+[G1L(t)D21]w,x(t0)=[x0ˆx0]z=[C1C1]x:=Cx

has H-norm from w to z less than equal to γ

(iii)  the optimal costs or performance objectives of the game are J1(L,w)=12(x0ˆx0)TP1(t0)(x0ˆx0)andJ2(L,w)=12(x0ˆx0)TP2(t0)(x0ˆx0);

(iv)  the filter Σfl with the gain matrix L(t) = L (t) satisfying (12.25) solves the finite-horizon MH2HINLFP for the system.

Proof: Take

Y(x,t)=12(xˆx)TP1(t)(xˆx),P1=PT1<0,V(x,t)=12(xˆx)TP2(t)(xˆx),P2=PT2>0,

and apply the result of the theorem. □

12.1.2  Solution to the Infinite-Horizon Mixed H2/H Nonlinear Filtering

In this section, we discuss the infinite-horizon filtering problem, in which case we let T → ∞. In this case, we seek a time-independent gain matrix ˆL(ˆx) and functions Y,V:ˆN׈NX×X such that the HJIEs:

Yx(x)f(x)+Yˆxf(ˆx)12γ2Yx(x)g1(x)gT1(x)YTx(x)12γ2Yˆx(x)ˆL(ˆx)ˆLT(ˆx)YTˆx(x)1γ2Yˆx(x)ˆL(ˆx)ˆLT(ˆx)VTˆx(x)12(h1(x)h1(ˆx))T(h1(x)h1(ˆx))=0,Y(0)=0

(12.26)

Vx(x)f(x)+Vˆxf(x)1γ2Vx(x)g1(x)gT1(x)YTx(x)1γ2Vˆx(x)ˆL(ˆx)ˆLT(ˆx)YTˆx(x)γ2(h2(x)h2(ˆx))T(h2(x)h2(ˆx))+12(h1(x)h1(ˆx))T(h1(x)h1(ˆx))=0,V(0)=0

(12.27)

are satisfied together with the coupling condition:

Vˆx(x)ˆL(ˆx)=γ2(h2(x)h2(ˆx))T,x,ˆxˆN,

(12.28)

where ˆL is the asymptotic value of L. It is also required in this case that the augmented system (12.6) is stable. Moreover, in this case, we can relax the requirement of asymptotic-stability for the original system (12.3) with a milder requirement of detectability which we define next.

Definition 12.1.2 The pair {f, h} is said to be locally zero-state detectable if there exists a neighborhood O of x = 0 such that, if x(t) is a trajectory of ˙x(t)=f(x) satisfying x(t0)O, then h(x(t)) is defined for all t ≥ t0 and h(x(t)) 0 for all t ≥ ts, for some ts ≥ t0, implies limt→∞ x(t) = 0. Moreover, the system is zero-state detectable if O=X.

Remark 12.1.4 From the above definition, it can be inferred that, if h1 is injective, then {f,h1} zero-state detectable ⇒ {f,h } zero-state detectable and conversely.

It is also desirable for a filter to be stable or admissible. The “admissibility” of a filter is defined as follows.

Definition 12.1.3 A filter F is admissible if it is asymptotically (or internally) stable for any given initial condition x(t0) of the plant Σa, and with w ≡ 0

limtz(t)=0.

The following proposition can now be proven along the same lines as Theorem 12.1.2.

Proposition 12.1.1 Consider the nonlinear system (12.3) and the infinite-horizon MH2HINLFP for it. Suppose the function h1 is one-to-one (or injective) and the plant Σa is zero-state detectable. Further, suppose there exists a pair of C1 negative and positive-definite functions Y,V:ˆN׈N respectively, locally defined in a neighborhood ˆN׈NX×X of the origin x=0, and a matrix function ˆL:ˆNMn×m satisfying the pair of coupled HJIEs (12.26), (12.27) together with (12.28). Then:

(i)  there exists locally a Nash-equilibrium solution (ˆw,ˆL) for the game;

(ii)  the augmented system (12.6) is dissipative with respect to the supply-rate s(w,z)=12(γ2w2z2) and hence has L2-gain from w to z less or equal to γ;

(iii)  the optimal costs or performance objectives of the game are J1(ˆL,ˆw)=Y(x0)andJ2(ˆL,ˆw)=V(x0);

(iv)  the filter Σaf with the gain matrix ˆL(ˆx)=ˆL(ˆx) satisfying (12.28) is admissible and solves the infinite-horizon MH2HINLFP locally in ˆN.

Proof: Since the proof of items (i)-(iii) is the same as in Theorem 12.1.2, we only prove (iv) here. Using similar manipulations as in the proof of Theorem 12.1.2, we get an inequality similar to (12.19). This inequality implies that with w = 0,

˙Y12z2.

(12.29)

Therefore, by Lyapunov’s theorem, the augmented system is locally stable. Furthermore, for any trajectory of the system x(t) such that˙Y(x)0for all ttc for some tc ≥ t0, it implies that z0tc. This in turn implies h1(x)=h1(ˆx), and x(t)=ˆx(t)ttc by the injectivity of h1. This further implies that h2(x)=h2(ˆx)ttc and it is a trajectory of the free system:

˙x=(f(x)f(ˆx)).

By the zero-state detectability of {f, h1}, we have limtx(t)=0, and asymptotic-stability follows by LaSalle’s invariance-principle. On the other hand, if we have strict inequality, ˙Y<12z2, asymptotic-stability follows immediately from Lyapunov’s theorem, and limt→∞ z(t) = 0. Therefore, Σaf is admissible. Combining now with items (i)-(iii), the conclusion follows. □

Similarly, for the linear system Σl (12.22), we have the following corollary.

Corollary 12.1.2 Consider the LTI system Σl defined by (12.22) and the MH2HINLFP for it. Suppose C1 is full column rank and (A, C1) is detectable. Suppose further, there exist a negative and a positive-definite real-symmetric solutions P1, P2, (respectively) to the coupled algebraic-Riccati equations (AREs):

ATP1+P1A1γ2[P1P2][ˆLˆLT+G1GT1ˆLˆLTˆLˆLT0][P1P2]CT1C1=0

(12.30)

ATP2+P2A1γ2[P1P2][0ˆLˆLT+G1GT1ˆLˆLT+G1GT12ˆLˆLT][P1P2]+CT1C1=0,

(12.31)

together with the coupling-condition

P2ˆL=γ2CT2.

(12.32)

Then:

(i)  there exists a Nash-equilibrium solution (w, L ) for the game;

(ii)  the augmented system Σlf has H-norm from w to z less than equal to γ;

(iii)  the optimal costs or performance objectives of the game are J1(ˆL,ˆw)=12(x0ˆx0)TP1(x0ˆx0)andJ2(ˆL,ˆw)=12(x0ˆx0)TP2(x0ˆx0);

(iv)  the filter Σlf with gain-matrix ˆL=ˆLn×m satisfying (12.32) is admissible, and solves the infinite-horizon MH2HINLFP for the linear system.

We consider a simple example.

Example 12.1.1 We consider a simple example of the following scalar system:

˙x=x3,x(0)=x0,z=xy=x+w.

We consider the infinite-horizon problem and the associated HJIEs. It can be seen that the system satisfies all the assumptions of Theorem 12.1.2 and Proposition 12.1.1. Then, substituting in the HJIEs (12.21), (12.26), and coupling condition (12.28), we get

x3yxˆx3yˆx12γ2l2y2ˆx1γ2l2υˆxyˆx12(xˆx)2=0,x3υxˆx3υˆx1γ2l2υˆxyˆxγ2(xˆx)2+12(xˆx)2=0,υˆxl+γ2(xˆx)=0.

Looking at the above system of PDEs, we see that there are 5 variables: vx, υˆx, yx, υˆx, l and only 3 equations. Therefore, we make the following simplifying assumption. Let

υx=υˆx.

Then, the above equations reduce to

x3yx+ˆx3yˆx+12γ2l2y2ˆx1γ2l2υxyˆx+12(xˆx)2=0,

(12.33)

x3υx+ˆx3υx1γ2l2υxyˆx+γ2(xˆx)212(xˆx)2=0,

(12.34)

υxlγ2(xˆx)=0.

(12.35)

Image

FIGURE 12.2
Nonlinear H2/H-Filter Performance with Unknown Initial Condition; Reprinted from Int. J. of Robust and Nonlinear Control, vol. 19, no. 4, pp. 394-417, © 2009, “Mixed H2/H nonlinear filtering,” by M. D.S. Aliyu and E. K. Boukas, with permission from Wiley Blackwell.

Subtract now equations (12.33) and (12.34) to get

(x3yx+ˆx3yˆx)+(ˆx3x3)υx+12γ2l2y2ˆx+(1γ2)(xˆx)2=0,

(12.36)

υxlγ2(xˆx)=0.

(12.37)

Now let

υx=(xˆx)υˆx=(xˆx),υ(x,ˆx)=12(xˆx)2,andl2/=γ2.

Then, if we let γ = 1, the equation governing yx (12.36) becomes

y2ˆx+2(x3yx+ˆx3yˆx)+2(ˆx3x3)(xˆx)=0

and

yx=(x+ˆx),yˆx=(x+ˆx),

approximately solves the above PDE (locally!). This corresponds to the solution

y(x,ˆx)=12(x+ˆx)2.

Figure 12.2 shows the filter performance with unknown initial condition and with the measurement noise w(t) = 0.5w0 + 0.01 sin(t), where w0 is zero-mean Gaussian-white with unit variance.

12.1.3  Certainty-Equivalent Filters (CEFs)

It should be observed from the previous two sections, 12.1.1, 12.1.2, that the filter gains (12.17), (12.28) may depend on the original state of the system which is to be estimated, except for the linear case where the gains are constants. As discussed in Chapter 8, this will make the filters practically impossible to construct. Therefore, the filter equation and gains must be modified to overcome this difficulty.

Furthermore, we observe that the number of variables in the HJIEs (12.15)-(12.27) is twice the dimension of the plant. This makes them more difficult to solve considering their notoriety, and makes the scheme less attractive. Therefore, based on these observations, we consider again in this section certainty-equivalent filters which are practically implementable, and in which the governing equations are of lower-order. We begin with the following definition.

Definition 12.1.4 For the nonlinear system (12.3), we say that it is locally zero-input observable if for all states x1, x2UX and input w(.) 0

y(.,x1,w)y(.,x2,w)x1=x2

where y(., xi, w), i = 1, 2 is the output of the system with the initial condition x(t0) = xi. Moreover, the system is said to be zero-input observable, if it is locally observable at each x0 ∈ X or U=X.

Consider now as in Chapter 8, the following class of filters:

˜af:{˙ˆx=f(ˆx)+g1(ˆx)w+˜L(ˆx,y)[yh2(ˆx)k21(ˆx)˜w],ˆx(t0)ˆx0ˆz=h2(ˆx)ˆz=yh2(ˆx),

(12.38)

where ˜L(.,.)Mn×m is the filter gain matrix, ˜w is the estimated worst-case system noise (hence the name certainty-equivalent filter) and ˜z is the new penalty variable. Then, consider the infinite-horizon mixed H2/H dynamic game problem with the cost functionals (12.7), (12.8), and with the above filter. We can define the corresponding Hamiltonians ˜Hi:TX×W×T×Y×Mn×m,i=1,2 as

˜H1(ˆx,w,y,˜L,˜YTˆx,˜YTy)=˜Yˆx(ˆx,y)[f(ˆx)+g1(ˆx)w+˜L(ˆx,y)(yh2(ˆx)k21(ˆx)w)]+˜YY(ˆx,y)˙y+12γ2w212˜z2,˜H2(ˆx,w,y,˜L,˜VTˆx,˜VTy)=˜Vˆx(ˆx,y)[f(ˆx)+g1(ˆx)w+˜L(ˆx,y)(yh2(ˆx)k21(ˆx)w)]+˜VY(ˆx,y)˙y+12˜z2,

for some smooth functions ˜V,˜Y:X×Y, and where the adjoint variables are set as ˜p1=˜YTˆx,˜p2=˜VTˆx. Further, we have

˜H1w|w=˜w=0˜w=1γ2[g1(ˆx)˜L(ˆx,y)k21(ˆx)]T˜YTˆx(ˆx,y),

and repeating similar derivation as in the previous section, we can arrive at the following result.

Theorem 12.1.3 Consider the nonlinear system (12.3) and the MH2HINLFP for it. Suppose the plant Σa is locally asymptotically-stable about the equilibrium point x = 0 and zero-input observable. Further, suppose there exists a pair of C1 (with respect to both arguments) negative and positive-definite functions ˜Y,˜V:˜N×ϒ respectively, locally defined in a neighborhood ˜N×ϒX×Y of the origin (ˆx,y)=(0,0), and a matrix function ˜L:˜N×ϒMn×m satisfying the following pair of coupled HJIEs:

˜Yˆx(ˆx,y)f(ˆx)+˜Yy(ˆx,y)˙y12γ2˜Yˆx(ˆx,y)g1(ˆx)gT1(ˆx)˜YTˆx(ˆx,y)12γ2˜Yˆx(ˆx,y)˜L(ˆx,y)˜LT(ˆx,y)˜YTˆx(ˆx,y)1γ2˜Yˆx(ˆx,y)˜L(ˆx,y)˜LT(ˆx,y)˜VTˆx(ˆx,y)12(yh2(ˆx))(yh2(ˆx))=0,˜Y(0,0)=0,ˆx˜N,yϒ

(12.39)

˜Vˆx(ˆx,y)f(ˆx)+˜Vy(ˆx,y)˙y1γ2˜Vˆx(ˆx,y)g1(ˆx)gT1(ˆx)˜YTˆx(ˆx,y)1γ2˜Vˆx(ˆx,y)˜L(ˆx,y)˜LT(ˆx,y)˜YTˆx(ˆx,y)1γ2˜Vˆx(ˆx,y)˜L(ˆx,y)˜LT(ˆx,y)˜VTˆx(ˆx,y)+12(yh2(ˆx)T(yh2(ˆx))=0,˜V(0,0)=0,ˆx˜N,yϒ

(12.40)

together with the coupling condition

˜Vˆx(ˆx,y)˜L(ˆx,y)=γ2(yh2(ˆx))T,ˆx˜N,yϒ.

(12.41)

Then, the filter ˜Σaf with the gain matrix ˜L(ˆx,y) satisfying (12.41) solves the infinite-horizon MH2HINLFP for the system locally in ˜N.

Proof: (Sketch) Using similar manipulations as in Theorem 12.1.2 and Prop 12.1.1, it can be shown that the existence of a solution to the coupled HJIEs (12.39), (12.40) implies the existence of a solution to the dissipation-inequality

˜Y(ˆx,y)˜Y(ˆx0,y0)12t0(γ2w2˜z2)dt

for some smooth function ˜Y=˜Y>0. Again using similar arguments as in Theorem 12.1.2 and Prop 12.1.1, the result follows. In particular, with w ≡ 0 and ˙˜Y(ˆx,y)0˜z0, which in turn implies xˆx by the zero-input observability of the system. □

Remark 12.1.5 Comparing the HJIEs (12.39)-(12.40) with (12.26)-(12.27) we see that the dimensionality of the former is half. This is indeed significant. Moreover, the filter gain corresponding to the new HJIEs (12.41) does not depend on x.

12.2  Discrete-Time Mixed H2/H Nonlinear Filtering

The set-up for this case is the same as the continuous-time case shown in Figure 12.1 with the difference that the variables and measurements are discrete, and is shown on Figure 12.3. Therefore, the plant is similarly described by an affine causal discrete-time nonlinear state-space system with zero input defined on an n-dimensional space X ⊆ ℜn in coordinates x = (x1, …, xn):

da:{xk+1=f(xk)+g1(xk)wk;x(k0)=x0zk=h1(xk)yk=h2(xk)+k21(xk)wk

(12.42)

where xX is the state vector, wW is the disturbance or noise signal, which belongs to the set Wl2([k0,)r) of admissible disturbances or noise signals, the output ym is the measured output of the system, while zs is the output to be estimated. The functions f:XX,g1:XMn×r(X),h1:Xs,h2:Xm, and k12:XMs×p(X),k21:XMm×r(X) are real C functions of x. Furthermore, we assume without any loss of generality that the system (12.42) has a unique equilibrium-point at x = 0 such that f(0) = 0, h1(0) = h2(0) = 0.

Image

FIGURE 12.3
Set-Up for Discrete-Time Mixed H2/H Nonlinear Filtering

The objective is again to synthesize a filter, Fk, for estimating the state xk (or more generally a function of it zk = h1(xk)) from observations of yi up to time k and over a time horizon [k0, K], i.e., from

Yk{yi:ik},k[k0,K]

such that the gains (or induced norms) from the inputs w0 := {w0,k}, w1 := {w1,k}, to the error or penalty variable z defined respectively as the l2-norm and l-norm of the interconnected system Fk Σda respectively, i.e.,

Fkdal2sup0w0szpw0s

Figure 12.43)

and

Fkdalsup0w1Pz2w12

Figure 12.44)

are minimized, where

P{w:wl,Rww(k),Sww(jω)exist for all kand all ω resp.,wP<}S{w:wl,Rww(k),Sww(jω)exist for all kand all ω resp.,Sww(jω)<}

z2PlimK12KKk=Kz2wo2S=Sw0w0(jω),w02S=Sw0w0(jω),

and Rww, Sww() are the autocorrelation and power spectral-density matrices of w [152]. In addition, if the plant is stable, we replace the induced ℓ-norms above by their equivalent ℋ-subspace norms.

The above problem is the standard discrete-time mixed H2/H-filtering problem. However, as pointed out in the previous section, due to the difficulty of solving the above problem, we do not solve it either in this section. Instead, we solve the associated problem involving a single noise/disturbance signal wW2[k0,), and minimize the output energy of the system, z2, while rendering Fk  dalγ for a given number γ>0, for all wW and for all initial conditions x0OX. In addition, we also require that with wk 0, the estimation error converges to zero, i.e., limkzk=0 = 0.

Again, for the discrete-time nonlinear system (12.42), the above H constraint is interpreted as the 2-gain constraint and is represented as

Kk=k0zk2γ2Kk=k0wk2,K>k0Z

(12.45)

for all wW, for all initial states x0OX.

The discrete-time mixed-H2/H filtering or estimation problem can then be defined formally as follows.

Definition 12.2.1 (Discrete-Time Mixed H2/H (Suboptimal) Nonlinear Filtering Problem (DMH2HINLFP)). Given the plant Σda and a number γ>0, find a filter k:YX such that

ˆxk+1=k(Yk)

and the output energy z2 is minimized, while the constraint (12.45) is satisfied for all γγ, for all wW and all x0 ∈ O. In addition, with wk 0, we have limkzk=0.

Moreover, if the above conditions are satisfied for all x0 ∈ X, we say that Fk solves the D M H 2 H I N L F P globally.

Remark 12.2.1 The problem defined above is the finite-horizon filtering problem. We have the infinite-horizon problem if we let K → ∞.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset