13

Solving the Hamilton-Jacobi Equation

In this chapter, we discuss some approaches for solving the Hamilton-Jacobi-equations (HJE) associated with the optimal control problems for affine nonlinear systems discussed in this book. This has been the biggest bottle-neck in the practical application of nonlinear H-control theory. There is no systematic numerical approach for solving the HJIEs. Various attempts have however been made in this direction in the past three decades. Starting with the work of Lukes [193], Glad [112], who proposed a polynomial approximation approach, Van der Schaft [264] applied the approach to the HJIEs and developed a recursive approach which was also refined by Isidori [145]. Since then, many other authors have proposed similar approaches for solving the HJEs [63, 80, 125, 260, 286]. However, all the contributions so far in the literature are local.

The main draw-backs with the above approaches for solving the HJIE are that, (i) they are not closed-form, and convergence of the sequence of solutions to a closed-form solution cannot be guaranteed; (ii) there are no efficient methods for checking the positivedefiniteness of the solution; (iii) they are sensitive to uncertainties and pertubations in the system; and (iv) also sensitive to the initial condition. Thus, the global asymptotic stability of the closed-loop system cannot be guaranteed.

Therefore, more refined solutions that will guarantee global asymptotic-stability are required if the theory of nonlinear H-control is to yield any fruits. Thus, with this in mind, more recently, Isidori and Lin [145] have shown that starting from a solution of an algebraic-Riccati-equation (ARE) related to the linear H problem, if one is free to choose a state-dependent weight of control input, it is possible to construct a global solution to the HJIE for a class of nonlinear systems in strict-feedback form. A parallel approach using a backstepping procedure and inverse-optimality has also been proposed in [96], and in [167]-[169] for a class of strict-feedback systems.

In this chapter, we shall review some major approaches for solving the HJE, and present one algorithm that may yield global solutions. The chapter is organized as follows. In Section 13.1, we review some popular polynomial and Taylor-series approximation methods for solving the HJEs. Then in Section 13.2, we discuss a factorization approach which may yield exact and global solutions. The extension of this approach to Hamiltonian mechanical systems is then discussed in Section 13.3. Examples are given throughout to illustrate the usefulness of the various methods.

13.1  Review of Some Approaches for Solving the HJBE/HJIE

In this section, we review some major approaches for solving the HJBE and HJIE that have been proposed in the literature. In [193], a recursive procedure for the HJBE for a class of nonlinear systems is proposed. This was further refined and generalized for affine nonlinear systems by Glad [112]. To summarize the method briefly, we consider the following affine nonlinear system

Σa:x˙(t)=a(x)+b(x)u

(13.1)

under the quadratic cost functional:

min0[l(x)+12uTRu]dt,

(13.2)

where a : ℜn →ℜn, b : ℜn →ℜn×k, l : ℜn →ℜ, l ≥ 0, a,b,lC, 0 < R ∈ℜk×k. Then, the HJBE corresponding to the above optimal control problem is given by

υx(x)a(x)12υx(x)b(x)R1bT(x)υxT(x)+l(x)=0,υ(0)=0,

(13.3)

for some smooth C2(ℜn) positive-semidefinite function v : ℜn →ℜ, and the optimal control is given by

u=k(x)=R1bT(x)υxT(x).

Further, suppose there exists a positive-semidefinite solution to the algebraic-Riccati-equation (ARE) corresponding to the linearization of the system (13.1), then it can be shown that there exists a real analytic solution v to the HJBE (13.3) in a neighborhood O of the origin [193]. Therefore, if we write the linearization of the system and the cost function as

l(x)=12xTQx+lh(x)a(x)=Ax+ah(x)b(x)=B+bh(x)υ(x)=12xTPx+υh(x)

where Q=lx(0),A=ax(0),B=b(0),P=vx(0)andlh,ah,bh,vh contain higher-order terms. Substituting the above expressions in the HJBE (13.3), it splits into two parts:

ATP+PAPBR1BTP+Q=0

(13.4)

υhx(x)Acx+υx(x)ah(x)12υhx(x)BR1BTυhxT(x)12υx(x)βh(x)υxT(x)+lh(x)=0

(13.5)

where

Ac=ABR1BTP,βh(x)=b(x)R1bT(x)BR1BT

and βh contains terms of degree 1 and higher. Equation (13.4) is the ARE of linear-quadratic control for the linearized system. Thus, if the system is stabilizable and detectable, there exists a unique positive-semidefinite solution to this equation. Hence, it represents the first-order approximation to the solution of the HJBE (13.3). Now letting the superscript (m) denote the mth-order terms, (13.5) can be written as

υhxm+1(x)Acx=[υx(x)ah(x)12υhx(x)BR1BTυhxT(x)12υx(x)βh(x)υxT(x)]m+lhm(x).

(13.6)

The RHS contains only mth, (m − 1) − th,…-order terms of v. Therefore, equation (13.5) defines a linear system of equations for the (m + 1)th order coefficients with the RHS containing previously computed terms. Thus, vm+1 can be computed recursively from (13.6). It can also be shown that the system is nonsingular as soon as Ac is a stable matrix. This is satisfied vacuously if P is a stabilizing solution to the ARE (13.4).

We consider an example.

Example 13.1.1 Consider the second-order system

x˙1=x12+x2

(13.7)

x˙2=x2+u

(13.8)

with the cost functional

J=012(x12+x22+u2)dt.

The solution of the ARE (13.4) gives the quadratic (or second-order) approximation to the solution of the HJBE:

V[2](x)=x12+x1x2+12x22.

Further, the higher-order terms are computed recursively up to fourth-order to obtain

VV[2]+V[3]+V[4]=x12+x1x2+12x221.593x132x12x20.889x1x220.148x23+1.998x14+2.778x13x2+1.441x12x22+0.329x1x23+0.0288x24.

The above algorithm has been refined by Van der Schaft [264] for the HJIE (6.16) corresponding to the state-feedback H-control problem:

Vx(x)f(x)+Vx(x)[1γ2g1(x)g1T(x)g2(x)g2T(x)]VxT(x)+12h1T(x)h1(x)=0,V(0)=0,

(13.9)

for some smooth C1-function V : M →ℜ, M ⊂ℜn. Suppose there exists a solution P ≥ 0 to the ARE

FTP+PF+P[1γ2G1G1TG2G2T]P+H1TH1=0

(13.10)

corresponding to the linearization of the HJIE (13.9), where F=fx(0),G1=g1(0),G2=g2(0),H1=h1x(0). Let V be a solution to the HJIE, and if we write

V(x)=12xTPx+Vh(x)f(x)=Fx+fh(x)12[1γ2g1(x)g1T(x)g2(x)g2T(x)]=12[1γ2G1G1TG2G2T]+Rh(x)12h1T(x)h1(x)=12xTH1TH1x+θh(x)

where Vh, fh, Rh and θh contain higher-order terms. Then, similarly the HJIE (13.9) splits into two parts, (i) the ARE (13.10); and (ii) the higher-order equation

Vh(x)xFclx=V(x)xfh(x)+12Vh(x)x[1γ2G1G1TG2G2T]TVh(x)x(x)+12V(x)xRh(x)TV(x)x+θh(x)

(13.11)

where

FclFG2G2TP+1γ2G1G1TP.

We can rewrite (13.11) as

Vhm(x)xFclx=Hm(x),

(13.12)

where Hm(x) denotes the m-th order terms on the RHS, and thus if P ≥ 0 is a stabilizing solution of the ARE (13.10), then the above Lyapunov-equation (13.12) can be integrated for Vhm to get,

Vhm(x)=0Hm(eFcltx)dt.

Therefore, Vhm is determined by Hm. Consequently, since Hm depends only on V(m1),V(m2),,V2=12xTPx,Vhm can be computed recursively using (13.12) and starting from V 2, to obtain V similarly as

VV[2]+V[3]+V[4]+

The above approximate approach for solving the HJBE or HJIE has a short-coming; namely, there is no guarantee that the sequence of solutions will converge to a smooth positive-semidefinite solution. Moreover, in general, the resulting solution obtained cannot be guaranteed to achieve closed-loop asymptotic-stability or global 2-gain ≤ γ since the functions fh, Rh, θh are not exactly known, rather they are finite approximations from a Taylor-series expansion. The procedure can also be computationally intensive especially for the HJIE as various values of γ have to be tried. The above procedure has been refined in [145] for the measurement-feedback case, and variants of the algorithm have also been proposed in other references [63, 154, 155, 286, 260].

13.1.1  13.1.1 Solving the HJIE/HJBE Using Polynomial Expansion and Basis Functions

Two variants of the above algorithms that use basis function approximation are given in [63] using the Galerkin approximation and [260] using a series solution. In [260] the following series expansion for V is used

V(x)=V[2](x)+V[3](x)+

(13.13)

where V [k] is a homogeneous-function of order k in n-scalar variables x1, x2,…, xn, i.e., it is a linear combination of

Nkn(n+k1k)

terms of the form x1i1x2i2xnin, where ij is a nonnegative integer for j = 1,…, n and i1 + i2 +. .. + in = k. The vector whose components consist of these terms is denoted by x[k]; for example,

x[1]=[x1x2],x[2]=[x12x1x2x22],….

To summarize the approach, rewrite HJIE (13.9) as

Vx(x)f(x)+12Vx(x)G(x)R1GT(x)VxT(x)+Q(x)=0

(13.14)

where

G(x)=[g1(x)g2(x)],R=[γ2001],Q(x)=12xTCTCx

and let v=[wTuT]T,v=[wTuT]T=[lTkT]. Then we can expand f, G, ν as homogeneous functions:

f(x)=f[1](x)+f[2](x)+G(x)=G[0](x)+G[1](x)+v(x)=v[1](x)+v[2](x)+

where

v[k]=R1j=0k1(G[j])T(Vx[k+1j])T,

(13.15)

f[1](x)=Fx,G[0]=[G1G2],G1=g1(0),G2=g2(0). Again rewrite HJIE (13.14) as

Vx(x)f(x)+12v(x)Rv(x)+Q(x)=0,

(13.16)

v(x)+R1GT(x)VxT(x)=0.

(13.17)

Substituting now the above expansions in the HJIE (13.16), (13.17) and equating terms of order m ≥ 2 to zero, we get

k=0m2Vx[mk]f[k+1]+12k=1m1v[mk]Rv[k]+Q[m]=0.

(13.18)

It can be checked that, for m = 2, the above equation simplifies to

Vx[2]f[1]+12v[1]Rv[1]+Q[2]=0,

where

f[1](x)=Ax,v[1](x)=R1BTVx[2]T(x),Q[2](x)=12xTCTCx,B=[G1G2].

Substituting in the above equation, we obtain

Vx[2](x)Ax+12Vx[2](x)BR1BTVx[2](x)+12xTCTCx=0.

But this is the ARE corresponding to the linearization of the system, hence

V[2](x)=xTPx,

where P = P T > 0 solves the ARE (13.10) with A = A − BR−1BT P Hurwitz. Moreover,

v[1](x)=BTPx,k[1](x)=G1TPx.

Consider now the case m ≥ 3, and rewrite (13.18) as

k=0m2Vx[mk]f[k+1]+12v[m1]TRv[1]+12k=2m2v[mk]TRv[k]=0.

(13.19)

Then, using

v[m1]T=k=0m2Vx[mk]G[k]R1,

equation (13.19) can be written as

Vx[m]f¯[1]=12k=1m2Vx[mk]f¯[k+1]12k=1m2(v[mk])TRv[k]

(13.20)

where

f¯(x)f(x)+G(x)v[1](x).

Equation (13.20) can now be solved for any V [m], m ≥ 3. If we assume every V [m] is of the form V [m] = Cmx[m], where Cm1×Nmn is a row-vector of unknown coefficients, then substituting this in (13.20), we get a system of Nmn linear equations in the unknown entries of Cm. It can be shown that if the eigenvalues of A = A − BBT P are nonresonant,1 then the system of linear equations has a unique solution for all m ≥ 3. Moreover, since A is stable, the approximation V is analytic and V [m] converges finitely.

To summarize the procedure, if we start with V[2](x)=12xTPxandv[2](x)=R1BTPx, equations (13.20), (13.15) could be used to compute recursively the sequence of terms

V[3](x),v[2](x),V[4](x),v[3](x),

which converges point-wise to V and ν. We consider an example of the nonlinear benchmark problem [71] to illustrate the approach.

Example 13.1.2 [260]. The system involves a cart of mass M which is constrained to translate along a straight horizontal line. The cart is connected to an inertially fixed point via a linear spring as shown in Figure 13.1. A pendulum of mass m and inertia I which rotates about a vertical line passing through the cart mass-center is also mounted. The dynamic equations for the system after suitable linearization are described by

ξ¨+ξ=ε(θ˙2sinθθ¨cosθ)+wθ¨=εξ¨cosθ+u}

(13.21)

where ξ is the displacement of the cart, u is the normalized input torque, w is the normalized spring force which serves as a disturbance and θ is the angular position of the proof body. The coupling between the translational and the rotational motions is governed by the parameter ε which is defined by

ε=me(I+me2)(M+m),0ε1,

where e is the eccentricity of the pendulum, and ε = 0 if and only if e = 0. In this event, the dynamics reduces to

ξ¨+ξ=wθ¨=u}

(13.22)

and is not stabilizable, since it is completely decoupled. However, if we let x:=[x1x2x3x4]T=[ξξ˙θθ˙]T, then the former can be represented in state-space as

x˙=[x2x+εx42sinx31ε2cos2x3x4εcosx3(x1εx42sinx3)1ε2cos2x3]+[011ε2cos2x30εcosx31ε2cos2x3]w+[0εcosx31ε2cos2x3011ε2cos2x3]u,

(13.23)

where1ε2cos2x30forallx3andε<1.

Image

FIGURE 13.1
Nonlinear Benchmark Control Problem; Originally adopted from Int. J. of Robust and Nonlinear Control, vol. 8, pp. 307-310 © 1998,“A benchmark problem for nonlinear control design,” by R. T. Bupp et al., with permission from Wiley Blackwell.

We first consider the design of a simple linear controller that locally asymptotically stabilizes the system with w = 0. Consider the linear control law

u=k1θk2θ˙,k1>0,k2>0

(13.24)

and the Lyapunov-function candidate

V(x)=12xTP(θ)x=12ξ˙2+12θ˙2+εξ˙θ˙cosθ+12ξ2+12k1θ2

where

P(θ)=[1000010εcosθ00k100εcosθ01].

The eigenvalues of P(θ) are {1, k1, 1±ε cos θ}, and since 0 ≤ ε < 1, P(θ) is positive-definite. Moreover, along trajectories of the the closed-loop system (13.23), (13.24),

V˙=k2θ˙20.

Hence, the closed-loop system is stable. In addition, using LaSalle’s invariance-principle, local asymptotic-stability can be concluded.

Design of Nonlinear Controller

Next, we apply the procedure developed above to design a nonlinear H-controller by solving the HJIE. For this, we note the system specifications and constraints:

|ξ|1.282,ε=0.2,|u|1.411,

and we choose

C=diag{1,0.1,0.1,0.1}.

We first expand f(x) and G(x) in the HJIE (13.14) using the basis functions x[1], x[2], x[3] ,… as

f(x)[x22524x1+524x42x3+25576x1x32+524x1124x42x365576x1x32+]

G(x)[0252425576x32+0524+65576x32+0524+65576x32+0252425576x1x32+].

Then the linearized plant about x = 0 is given by the following matrices

A=fx(0)=[010025240000001524000],B1=G1(0)=[025240524],B2=G2(0)=[052402524].

We can now solve the ARE (13.10) with H1 = C. It can be shown that for all γ > γ = 5.5, the Riccati equation has a positive-definite solution. Choosing γ = 6, we get the solution

P=[19.62832.83080.73802.88762.830815.54920.84391.99150.73800.84390.33300.49672.88761.99150.49671.4415]>0

yielding

A=[01.0000001.61340.21400.09360.27770001.0002.74081.12220.36031.1422]

with eigenvalues {−0.0415±i1.0156, −0.4227±i0.3684}. The linear feedbacks resulting from the above P are also given by

v[1](x)=[l[1]k[1]]T=[0.0652x1+0.4384x2+0.0215x3+0.0493x42.4182x1+1.1650x20.3416x31.0867x4].

Higher-Order Terms

Next, we compute the higher-order terms in the expansion. Since A is Hurwitz, its eigenvalues are nonresonant, and so the system of linear equations for the coefficient matrix in (13.20) has a unique solution and the series converges. Moreover, for this example,

f[2k](x)=0,G[2k1](x)=0,k=1,2,

Thus, V[3](x)=0,andv[2](x)=0. The first non-zero higher-order terms are

v[3](x)=R1[BTVx[4]T(x)+G[2]T(x)],Vx[4](x)Ax=Vx[2](x)[f[3](x)+G[2]v[1](x)].

The above system of equations can now be solved to yield

V[4]=162.117x14+91.1375x240.9243x4x32x10.4143x4x32x20.1911x4x33+59.6069x42x1266.0818x42x2x1+42.1915x42x228.7947x42x3x12+0.6489x44151.8854x4x13+235.0386x4x2x12193.9184x4x22x1+96.2534x4x23+43.523x4x3x1245.9440x4x3x2x1258.0269x2x13+330.6741x22x12186.0128x32x149.3835x3x13+92.8910x3x2x1270.7044x3x22x10.22068x33x2+37.1118x4x3x22+9.1538x42x3x20.2863x42x329.6367x43x1+8.0508x43x2+0.7288x43x3+46.1405x3x23+9.4792x32x126.2154x32x2x1+8.4205x32x220.4965x33x10.0156x34

and

v[3](x)[l[3]k[3]]T=[0.1261x42x30.0022x4x320.8724x42x1+1.1509x42x2+0.1089x43+1.0208x4x3x2+1.8952x3x22+0.2323x32x2+3.0554x4x125.2286x4x2x1+3.9335x4x220.6138x4x3x11.9128x3x2x10.0018x33+8.8880x2x127.5123x22x1+1.2179x3x123.2935x13+4.9956x230.0928x1x320.1853x42x3+0.0929x4x32+8.1739x42x13.7896x42x20.5132x431.8036x4x3x24.9101x3x22+0.3018x32x237.6101x4x12+28.4355x4x2x113.8702x4x22+4.3753x4x3x1+9.1991x3x2x1+0.0043x3353.5255x2x12+42.8702x22x112.9923x3x12+52.2292x1312.1508x23+0.02812x1x32].

Remark 13.1.1 The above computational results give an approximate solution to the HJIE and the control law up to order three. Some difficulties that may be encountered include how to check the positive-definiteness of the solutions in general. Locally around x = 0 however, the positive-definiteness of P is reassuring, but does not guarantee global positive-definiteness and a subsequent global asymptotic-stabilizing controller. The computational burden of the algorithm also limits its attractiveness.

In the next section we discuss exact methods for solving the HJIE which may yield global solutions.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset