3.4 Optimal Velocity Profile Estimation for a Known Path

Search in book...
Toggle Font Controls
Create new playlist

Name your new playlist

Playlist description (optional)
Sign In

Email address

Password

Forgot Password?

or

Continue with Facebook

Continue with Google
Sign Up

Full Name

Email address

Confirm Email Address

Password

or

Continue with Facebook

Continue with Google

3.3.8 Model-Based Predictive Control

Model-based predictive control (MPC) approaches are advanced methods that can be applied to various areas as well as to mobile robotics were reference trajectory is known beforehand. The use of predictive control approaches in mobile robotics usually relies on a linearized (or also nonlinear) kinematic model or dynamic model to predict future system states. A number of successful implementations to wheeled robotics have been reported such as generalized predictive control in [18], predictive control with Smith predictor to handle the system’s time delay in [19], MPC based on a linear, time-varying system description in [20], nonlinear predictive controller with a multilayer neural network as a system model in [21], and many others. In most approaches the control law solutions are solved by an optimization of a cost function. Other approaches obtain control law as an analytical solution, which is computationally effective and can be easily used in fast, real-time implementations [22].

This section deals with a differentially driven mobile robot and trajectory-tracking control. Reference trajectory needs to be a smooth twice-differentiable function of time. For prediction, a linearized error dynamics model obtained around the reference trajectory is used. The controller minimizes the difference between the robot future tracking error and the reference error with defined desired dynamics.

Model-based control strategies combine a feedforward solution and a feedback action in input vector u, given as follows:

$\begin{array}{l} u = u_{ff} + u_{fb} = [\begin{array}{l} v_{ref} cos e_{φ} + v_{fb} \\ ω_{ref} + ω_{fb} \end{array}] \end{array}$ $\begin{array}{l} u = u_{ff} + u_{fb} = [\begin{array}{l} v_{ref} cos e_{φ} + v_{fb} \\ ω_{ref} + ω_{fb} \end{array}] \end{array}$

si237_e (3.85)

where the feedforward input vector $u_{ff} = {[v_{ref} cos e_{φ} ω_{ref}]}^{T}$ $u_{ff} = {[v_{ref} cos e_{φ} ω_{ref}]}^{T}$ is calculated from the reference trajectory using relations (3.22), (3.23). The feedback input vector is u_fb = [v_fbω_fb]^T, which is the output of the MPC controller.

The control problem is given by the linear tracking error dynamics model (3.39), which in shorter notation reads

$\begin{array}{l} \dot{e} = A_{c} (t) e + B_{c} u_{fb} \end{array}$ $\begin{array}{l} \dot{e} = A_{c} (t) e + B_{c} u_{fb} \end{array}$

(3.86)

where A_c(t) and B_c are matrices of the continuous state-space model and e is the tracking error defined in local robot coordinates, defined by transformation (3.35) and shown in Fig. 3.21.

Discrete MPC

The MPC approach presented in [22] is summarized. It is designed for discrete time; therefore, model (3.86) needs to be written in discrete-time form:

$\begin{array}{l} \dot{e} (k + 1) = A (k) e (k) + B u_{fb} (k) \end{array}$ $\begin{array}{l} \dot{e} (k + 1) = A (k) e (k) + B u_{fb} (k) \end{array}$

(3.87)

where $A (k) \in R^{n} \times R^{n}$ $A (k) \in R^{n} \times R^{n}$ , n is the number of the state variables and $B \in R^{n} \times R^{m}$ $B \in R^{n} \times R^{m}$ , and m is the number of input variables. The discrete matrices A(k) and B can be obtained as follows:

$\begin{array}{l} \begin{matrix} A (k) & = I + A_{c} (t) T_{s} \\ B & = B_{c} T_{s} \end{matrix} \end{array}$ $\begin{array}{l} \begin{matrix} A (k) & = I + A_{c} (t) T_{s} \\ B & = B_{c} T_{s} \end{matrix} \end{array}$

si243_e (3.88)

which is sufficient approximation for a short sampling time T_s.

The main idea of MPC is to compute optimal control actions that minimize the objective function given in prediction horizon interval h. The objective function is a quadratic cost function:

$\begin{array}{l} J (u_{fb}, k) = \sum_{i = 1}^{h} ϵ^{T} (k, i) Q ϵ (k, i) + u_{fb}^{T} (k + i - 1) R u_{fb} (k + i - 1) \end{array}$ $\begin{array}{l} J (u_{fb}, k) = \sum_{i = 1}^{h} ϵ^{T} (k, i) Q ϵ (k, i) + u_{fb}^{T} (k + i - 1) R u_{fb} (k + i - 1) \end{array}$

si244_e (3.89)

consisting of a future reference tracking error e_r(k + i), predicted tracking error e(k + i|k), difference between the two errors ϵ(k, i) =e_r(k + i) −e(k + i|k), and future actions u_fb(k + i − 1), where i denotes the ith step-ahead prediction (i = 1, …, h). Q and R are the weighting matrices.

For prediction of the state e(k + i|k) the error model (3.87) is applied as follows:

$\begin{array}{l} \begin{matrix} e (k + 1 | k) & = A (k) e (k) + B u_{fb} (k) \\ e (k + 2 | k) & = A (k + 1) e (k + 1 | k) + B u_{fb} (k + 1) \\ ⋮ \\ e (k + i | k) & = A (k + i - 1) e (k + i - 1 | k) + B u_{fb} (k + i - 1) \\ ⋮ \\ e (k + h | k) & = A (k + h - 1) e (k + h - 1 | k) + B u_{fb} (k + h - 1) \end{matrix} \end{array}$ $\begin{array}{l} \begin{matrix} e (k + 1 | k) & = A (k) e (k) + B u_{fb} (k) \\ e (k + 2 | k) & = A (k + 1) e (k + 1 | k) + B u_{fb} (k + 1) \\ ⋮ \\ e (k + i | k) & = A (k + i - 1) e (k + i - 1 | k) + B u_{fb} (k + i - 1) \\ ⋮ \\ e (k + h | k) & = A (k + h - 1) e (k + h - 1 | k) + B u_{fb} (k + h - 1) \end{matrix} \end{array}$

si245_e (3.90)

Predictions e(k + i|k) in Eq. (3.90) are rearranged so that they are dependent on current error e(k), current and future inputs u_fb(k + i − 1), and matrices A(k + i − 1) and B. The model-output prediction at the time instant h can then be written as

$\begin{array}{l} \begin{matrix} e (k + h | k) & = Π_{j = 1}^{h - 1} A (k + j) e (k) + \\ + \sum_{i = 1}^{h} (Π_{j = i}^{h - 1} A (k + j)) B u_{fb} (k + i - 1) + B u_{fb} (k + h - 1) \end{matrix} \end{array}$ $\begin{array}{l} \begin{matrix} e (k + h | k) & = Π_{j = 1}^{h - 1} A (k + j) e (k) + \\ + \sum_{i = 1}^{h} (Π_{j = i}^{h - 1} A (k + j)) B u_{fb} (k + i - 1) + B u_{fb} (k + h - 1) \end{matrix} \end{array}$

si246_e (3.91)

The future reference error (e_r(k + i)) defines how the tracking error should decrease when the robot is not on the trajectory. Let us define the future reference error to decrease exponentially from the current tracking error e(k), as follows:

$\begin{array}{l} e_{r} (k + i) = A_{r}^{i} e (k) \end{array}$ $\begin{array}{l} e_{r} (k + i) = A_{r}^{i} e (k) \end{array}$

(3.92)

for $i = 1, \dots, h$ $i = 1, \dots, h$ . The dynamics of the reference error is defined by the reference model matrix A_r.

According to Eqs. (3.90), (3.91) the robot-tracking prediction-error vector is defined as follows:

$\begin{array}{l} E^{*} (k) = {[e {(k + 1 | k)}^{T} e {(k + 2 | k)}^{T} \dots e {(k + h | k)}^{T}]}^{T} \end{array}$ $\begin{array}{l} E^{*} (k) = {[e {(k + 1 | k)}^{T} e {(k + 2 | k)}^{T} \dots e {(k + h | k)}^{T}]}^{T} \end{array}$

where E* is given for the whole interval of the observation (h) where the control vector is

$\begin{array}{l} U_{fb} (k) = {[u_{fb}^{T} (k) u_{fb}^{T} (k + 1) \dots u_{fb}^{T} (k + h - 1)]}^{T} \end{array}$ $\begin{array}{l} U_{fb} (k) = {[u_{fb}^{T} (k) u_{fb}^{T} (k + 1) \dots u_{fb}^{T} (k + h - 1)]}^{T} \end{array}$

(3.93)

and

$\begin{array}{l} Λ (k, i) = Π_{j = i}^{h - 1} A (k + j) \end{array}$ $\begin{array}{l} Λ (k, i) = Π_{j = i}^{h - 1} A (k + j) \end{array}$

The robot-tracking prediction-error vector can be written in compact form:

$\begin{array}{l} E^{*} (k) = F (k) e (k) + G (k) U_{fb} (k) \end{array}$ $\begin{array}{l} E^{*} (k) = F (k) e (k) + G (k) U_{fb} (k) \end{array}$

(3.94)

where

$\begin{array}{l} F (k) = {[A (k) A (k + 1) A (k) \dots Λ (k, 0)]}^{T} \end{array}$ $\begin{array}{l} F (k) = {[A (k) A (k + 1) A (k) \dots Λ (k, 0)]}^{T} \end{array}$

(3.95)

and

$\begin{array}{l} G (k) = [\begin{array}{l} B & 0 & \dots & 0 \\ A (k + 1) B & B & \dots & ⋮ \\ ⋮ & ⋮ & ⋱ & ⋮ \\ Λ (k, 1) B & Λ (k, 2) B & \dots & B \end{array}] \end{array}$ $\begin{array}{l} G (k) = [\begin{array}{l} B & 0 & \dots & 0 \\ A (k + 1) B & B & \dots & ⋮ \\ ⋮ & ⋮ & ⋱ & ⋮ \\ Λ (k, 1) B & Λ (k, 2) B & \dots & B \end{array}] \end{array}$

si254_e (3.96)

where dimensions of F(k) and G(k) are (n ⋅ h × n) and (n ⋅ h × m ⋅ h), respectively.

The robot reference-tracking-error vector is

$\begin{array}{l} E_{r}^{*} (k) = {[e_{r} {(k + 1)}^{T} e_{r} {(k + 2)}^{T} \dots e_{r} {(k + h)}^{T}]}^{T} \end{array}$ $\begin{array}{l} E_{r}^{*} (k) = {[e_{r} {(k + 1)}^{T} e_{r} {(k + 2)}^{T} \dots e_{r} {(k + h)}^{T}]}^{T} \end{array}$

which in compact form is computed as

$\begin{array}{l} E_{r}^{*} (k) = F_{r} e (k) \end{array}$ $\begin{array}{l} E_{r}^{*} (k) = F_{r} e (k) \end{array}$

(3.97)

where

$\begin{array}{l} F_{r} = {[A_{r} A_{r} A_{r}^{2} \dots A_{r} A_{r}^{h}]}^{T} \end{array}$ $\begin{array}{l} F_{r} = {[A_{r} A_{r} A_{r}^{2} \dots A_{r} A_{r}^{h}]}^{T} \end{array}$

si257_e (3.98)

where F_r is the (n ⋅ h × n) matrix.

The optimal control vector (3.93) is obtained by optimization of Eq. (3.89), which can be done numerically or analytically. The analytical solution is derived in the following.

The objective function (3.89) defined in matrix form reads.

$\begin{array}{l} J (U_{fb}) = {(E_{r}^{*} - E^{*})}^{T} \bar{Q} (E_{r}^{*} - E^{*}) + U_{fb}^{T} \bar{R} U_{fb} \end{array}$ $\begin{array}{l} J (U_{fb}) = {(E_{r}^{*} - E^{*})}^{T} \bar{Q} (E_{r}^{*} - E^{*}) + U_{fb}^{T} \bar{R} U_{fb} \end{array}$

(3.99)

The minimum of Eq. (3.99) is expressed as

$\begin{array}{l} \frac{\partial J}{\partial U_{fb}} = - 2 \bar{Q} G^{T} E_{r}^{*} + 2 G^{T} \bar{Q} E^{*} + 2 \bar{R} U_{fb} = 0 \end{array}$ $\begin{array}{l} \frac{\partial J}{\partial U_{fb}} = - 2 \bar{Q} G^{T} E_{r}^{*} + 2 G^{T} \bar{Q} E^{*} + 2 \bar{R} U_{fb} = 0 \end{array}$

si259_e (3.100)

and the solution for the optimal control vector is obtained as

$\begin{array}{l} U_{fb} (k) = {(G^{T} \bar{Q} G + \bar{R})}^{- 1} G^{T} \bar{Q} (F_{r} - F) e (k) \end{array}$ $\begin{array}{l} U_{fb} (k) = {(G^{T} \bar{Q} G + \bar{R})}^{- 1} G^{T} \bar{Q} (F_{r} - F) e (k) \end{array}$

(3.101)

where the weighting matrices are as follows:

$\begin{array}{l} \bar{Q} = [\begin{array}{l} Q & 0 & \dots & 0 \\ 0 & Q & \dots & 0 \\ ⋮ & ⋮ & ⋱ & ⋮ \\ 0 & 0 & \dots & Q \end{array}] \end{array}$ $\begin{array}{l} \bar{Q} = [\begin{array}{l} Q & 0 & \dots & 0 \\ 0 & Q & \dots & 0 \\ ⋮ & ⋮ & ⋱ & ⋮ \\ 0 & 0 & \dots & Q \end{array}] \end{array}$

si261_e (3.102)

and

$\begin{array}{l} \bar{R} = [\begin{array}{l} R & 0 & \dots & 0 \\ 0 & R & \dots & 0 \\ ⋮ & ⋮ & ⋱ & ⋮ \\ 0 & 0 & \dots & R \end{array}] \end{array}$ $\begin{array}{l} \bar{R} = [\begin{array}{l} R & 0 & \dots & 0 \\ 0 & R & \dots & 0 \\ ⋮ & ⋮ & ⋱ & ⋮ \\ 0 & 0 & \dots & R \end{array}] \end{array}$

si262_e (3.103)

Solution (3.101) contains control vectors u_fb^T(k + i − 1) for the whole interval of prediction (i = 1, …, h). To apply feedback control action in the current time instant k only the first vector u_fb^T(k) (first m rows of U_fb(k)) is applied to the robot. The solution is obtained analytically; therefore, it enables fast real-time implementations that may not be possible if numeric optimization of Eq. (3.89) is used.

Example 3.13

Implement model predictive control given in Eq. (3.101) for trajectory tracking of a differential-drive robot. Reference trajectory and robot are defined in Example 3.9.

The choice for prediction horizon is h = 4, the reference model matrix is A_r = I_3×3 ⋅ 0.65, and the weighting matrices are

$\begin{array}{l} Q = [\begin{array}{l} 4 & 0 & 0 \\ 0 & 40 & 0 \\ 0 & 0 & 0.1 \end{array}], R = I_{2 \times 2} \cdot 1 0^{- 3} \end{array}$ $\begin{array}{l} Q = [\begin{array}{l} 4 & 0 & 0 \\ 0 & 40 & 0 \\ 0 & 0 & 0.1 \end{array}], R = I_{2 \times 2} \cdot 1 0^{- 3} \end{array}$

si263_e

Solution

Using MPC compute the feedback part of control signal u_fb(k) and apply it to the robot together with the feedforward part u_ff(k). In the following Matlab script (Listing 3.12), possible solution and trajectory tracking results are given. Obtained simulation results are shown in Figs. 3.30 and 3.31.

f03-30-9780128042045 — Fig. 3.30 Obtained control tracking results using explicit MPC (reference is marked as dashed).

f03-31-9780128042045 — Fig. 3.31 Control action calculated using explicit MPC.

Listing 3.12

1 Ts = 0.033; % Sampling time

2 t = 0: Ts : 3 0 ; % Simulation time

3 q = [ 1 . 1 ; 0 . 8 ; 0 ] ; % Initial robot pose

5 % Reference

6 freq = 2*pi/30;

7 xRef = 1.1 + 0.7*sin( freq *t ) ; yRef = 0.9 + 0.7*sin(2* freq *t ) ;

8 dxRef = freq *0.7*cos( freq *t ) ; dyRef = 2* freq *0.7*cos(2* freq *t ) ;

9 ddxRef =− freq ˆ2*0.7* sin( freq *t ) ; ddyRef =−4* freq ˆ2*0.7* sin (2* freq *t ) ;

10 qRef = [ xRef ; yRef ; atan2( dyRef , dxRef ) ] ; % Reference trajectory

11 vRef = sqrt( dxRef.ˆ2+dyRef .ˆ2) ;

12 wRef = ( dxRef .* ddyRef− dyRef .* ddxRef ) . / ( dxRef .ˆ2+dyRef .ˆ2) ;

13 uRef = [ vRef ; wRef ] ; % Reference inputs

15 for k = 1:length( t )−4

16 e = [ cos ( q (3) ) , sin(q (3) ) , 0; . . .

17 − sin ( q (3) ) , cos(q (3) ) , 0; . . .

18 0 , 0 , 1]*( qRef ( : , k) − q ) ; % Error vector

19 e (3) = wrapToPi( e (3) ) ; % Correct angle

21 A0 = [1 , Ts*uRef (2 ,k) , 0;− Ts * uRef (2 , k) , 1 , Ts*uRef (1 ,k) ; 0 ,0 ,1];

22 A1 = [1 , Ts*uRef (2 ,k+1) , 0;− Ts * uRef (2 , k+1) , 1 , Ts*uRef (1 ,k+1) ; 0 ,0 ,1];

23 A2 = [1 , Ts*uRef (2 ,k+2) , 0;− Ts * uRef (2 , k+2) , 1 , Ts*uRef (1 ,k+2) ; 0 ,0 ,1];

24 A3 = [1 , Ts*uRef (2 ,k+3) , 0;− Ts * uRef (2 , k+3) , 1 , Ts*uRef (1 ,k+3) ; 0 ,0 ,1];

25 A4 = [1 , Ts*uRef (2 ,k+4) , 0;− Ts * uRef (2 , k+4) , 1 , Ts*uRef (1 ,k+4) ; 0 ,0 ,1];

26 B = [ Ts , 0; 0 , 0; 0 , Ts ] ;

27 C = eye(3) ;

29 Z = zeros(3 ,2) ;

30 Hm = [C *A0*B, Z, Z, Z; . . .

31 C *A0*A1*B, C *A0*B, Z, Z; . . .

32 C *A0*A1*A2*B, C *A0*A1*B, C *A0*B, Z; . . .

33 C *A0*A1*A2*A3*B, C *A0*A1*A2*B, C *A0*A1*B, C *A0* B] ;

34 Fm = [C *A0*A1, C *A0*A1*A2, C *A0*A1*A2*A3, C *A0*A1*A2*A3*A4 ] . ’ ;

36 ar = 0.65;

37 Ar = eye(3) *ar ; % Reference error dynamics

38 H = 0;

39 Fr = [ Arˆ(H +1) , Arˆ(H +2) , Arˆ(H +3) , Arˆ(H +4) ] . ’ ;

41 % Weight matrices

42 Qt = diag( repmat ( [ 1 ; 40; 0 . 1 ] , 4 , 1) ) ;

43 Rt = diag( repmat ( [ 0 . 0 0 1 ; 0.001] , 4 , 1) ) ;

45 % Optimal control calculation

46 KKgpc = ( Hm. ’*Qt* Hm + Rt) ∖( Hm. ’*Qt *(− Fm ) ) ;

47 KK = KKgpc ( 1 : 2 , : ) ; % Take current control gains

49 v = − KK * e ;

50 uF = [ uRef (1 ,k)*cos( e (3) ) ; uRef (2 ,k) ] ;

51 u = v + uF;

53 vMAX = 1; wMAX = 15; % Max velocities

54 if abs(u(1) ) > vMAX , u(1) = sign(u(1) )* vMAX; end

55 if abs(u(2) ) > wMAX , u(2) = sign(u(2) )* wMAX; end

57 % Robot motion simulation

58 dq = [ u(1) *cos(q (3) ) ; u(1) *sin(q (3) ) ; u(2) ] ;

59 noise = 0.00; % Set to experiment with noise (e.g. 0.001)

60 q = q + Ts*dq + randn(3 ,1) * noise ; % Euler integration

61 q (3) = wrapToPi(q (3) ) ; % Map orientation angle to [− pi , pi]

62 end

3.3.9 Particle Swarm Optimization-Based Control

Control of a mobile robot can also be defined as an optimization problem where the best solution among all possible solutions in a search space needs to be found in each control-loop sample time. The obtained control does not have an explicit structure, meaning that the control law cannot be given as a mapping function from system states to control actions. An optimal solution that minimizes some objective function is instead found by an iterative optimization algorithm like Newton methods, gradient descent methods, or stochastic methods such as genetic algorithms, particle swarm optimization (PSO), and the like.

If the objective function is not convex in the minimization problem then most optimization algorithms can be trapped in a local minimum, which is not the optimal solution. Using stochastic optimization the likelihood of local minimum, the solution is usually smaller because the search pattern is randomized to some extent.

PSO is inspired by social behavior of small animals such as bird flocks or fish schools [23, 24]. PSO uses a group (swarm) of particles where each particle presents its own solution hypothesis. Each particle i is characterized by parameter vector x_i defining its position in the parameter space and increment vector v_i defining velocity in the parameter space. During optimization the population of all candidate solutions are updated according to the objective function that defines a measure of quality. Each particle keeps track of its parameters and remembers the best parameters pBest_i achieved so far together with the associated objective function J_i = f(pBest_i). During optimization also the best parameter vector obtained so far for the entire swarm gBest is stored. In local version of PSO for each particle the best parameter vector gBest_i is tracked for some neighborhood of particles (e.g., k nearest particles, ring topology, and the like [25]). Local versions of PSO are less likely to be trapped in a local minimum.

Next, a basic (global) version of PSO is explained. In each control-loop iteration particles update mimic cognitive and social behavior by means of the following rules:

$\begin{array}{l} \begin{matrix} v_{i} \leftarrow ω v_{i} + c_{1} {rand}_{(0, 1)} ({pBest}_{i} - x_{i}) + c_{2} {rand}_{(0, 1)} (gBest - x_{i}) \\ x_{i} \leftarrow x_{i} + v_{i} \end{matrix} \end{array}$ $\begin{array}{l} \begin{matrix} v_{i} \leftarrow ω v_{i} + c_{1} {rand}_{(0, 1)} ({pBest}_{i} - x_{i}) + c_{2} {rand}_{(0, 1)} (gBest - x_{i}) \\ x_{i} \leftarrow x_{i} + v_{i} \end{matrix} \end{array}$

si264_e (3.104)

where ω is the inertia factor, c₁ is the self-cognitive constant, and c₂ is the social constant. In addition rand_{(0, 1)} is a vector of uniformly distributed values in the range (0, 1). The dimensions of vectors in Eq. (3.104) equal the dimension of the parameter search space. The parameters ω, c₁, and c₂ are positive tuning parameters. Basic PSO code is given in Algorithm 1.

Example 3.14

Write the controller using the PSO for the robot and trajectory defined in Example 3.9. PSO is applied in each sampling time t = kT_s to find the best controls (translational and angular velocity) that would drive the robot as close as possible to the current reference position x_ref(t), y_ref(t). Next, the robot pose prediction is computed considering robot kinematics and the particles’ proposed solution (control action). Optimal control minimizes the cost function consisting of the tracking error $e (t) = {[e_{x} (t), e_{y} (t), e_{φ} (t)]}^{T}$ $e (t) = {[e_{x} (t), e_{y} (t), e_{φ} (t)]}^{T}$ and the feedback control effort $u_{fb} = {[v_{fb}, ω_{fb}]}^{T}$ $u_{fb} = {[v_{fb}, ω_{fb}]}^{T}$ . This follows the equation J(t) = e(t)^TQe(t) +u_fb^TRu_fb, where Q and R are diagonal weighting matrices used to tune the controller behavior.

Solution

The control law includes feedforward action and feedback action similarly as in Example 3.9. Feedforward action is obtained from a known trajectory while the feedback action is computed using PSO. The tracking error is expressed in robot local coordinates (see Fig. 3.21) because this makes the optimization more efficient due to the closed-loop system being decoupled better. The error in the local x coordinate can simply be compensated by translational velocity, and errors in y and φ can be compensated by the angular velocity. However, this is not the case when using a global tracking error due to nonlinear rotation transformation in Eq. (3.35).

In Listing 3.13 a Matlab script of a possible solution is given. Obtained simulation results are shown in Figs. 3.32 and 3.33. Similar control results as in Example 3.13 are obtained. However, the computational effort is much higher than in Example 3.13. Results of Example 3.14 are not deterministic because PSO is using randomly distributed particles in optimization to find the best control in each sampling time. Therefore, resulting trajectories of the mobile robot can vary to some extent, especially the initial transition phase, until the robot reaches the reference. However, the obtained results in the majority of simulation runs are at least comparable to the results of other presented deterministic controllers (e.g., from Example 3.9).

f03-32-9780128042045 — Fig. 3.32 Obtained control tracking results using particle swarm optimization approach (reference is marked as dashed).

f03-33-9780128042045 — Fig. 3.33 Control action calculated using the particle swarm optimization approach.

Listing 3.13

1 Ts = 0.033; % Sampling time

2 t = 0: Ts : 3 0 ; % Simulation time

3 q = [ 1 . 1 ; 0 . 8 ; 0 ] ; % Initial robot pose

5 % Reference

6 freq = 2*pi/30;

7 xRef = 1.1 + 0.7*sin( freq *t ) ; yRef = 0.9 + 0.7*sin(2* freq *t ) ;

8 dxRef = freq *0.7*cos( freq *t ) ; dyRef = 2* freq *0.7*cos(2* freq *t ) ;

9 ddxRef =− freq ˆ2*0.7* sin( freq *t ) ; ddyRef =−4* freq ˆ2*0.7* sin (2* freq *t ) ;

10 qRef = [ xRef ; yRef ; atan2( dyRef , dxRef ) ] ; % Reference trajectory

11 vRef = sqrt( dxRef.ˆ2+dyRef .ˆ2) ;

12 wRef = ( dxRef .* ddyRef− dyRef .* ddxRef ) . / ( dxRef .ˆ2+dyRef .ˆ2) ;

13 uRef = [ vRef ; wRef ] ; % Reference inputs

15 vMax = 1; wMax = 15; % Velocity constraints

17 % Swarm Initial i z a t i o n

18 i t e r a t i o n s = 20; % Number of i t e r a t i o n s

19 omega = 0.5*0.5; % Inertia

20 c1 = 0.5*1; % Correction factor

21 c2 = 0.5*1; % Global correction factor

22 N = 25; % Swarm s i z e

23 swarm = zeros([2 ,N, 4 ] ) ;

24 uBest = [ 0 ; 0 ] ;

26 for k = 1:length( t )−1

27 % Initial swarm position

28 swarm ( : , : , 1 ) = repmat ( uBest , 1 , N) + diag ( [ 0 . 1 ; 3]) *randn(2 ,N) ;

29 swarm ( : , : , 2 ) = 0; % Initial p a r t i c l e velocity

30 swarm (1 ,: ,4) = 1000; % Best value so far

32 for i t e r = 1: i t e r a t i o n s % PSO i t e r a t i v e l y find best solution

33 % Evaluate p a r t i c l e s parameters

34 for i = 1:N

35 % Compute new predicted pose of the robot using i − th p a r t i c l e

36 % parameters (input velocities ) and compare obtained predicted

37 % pose to the reference pose.

38 vwi = swarm ( : , i ,1) ;

39 ui = vwi + uRef ( : , k) ; % Feedback and feedforward

40 qk = q ; % Current robot pose

41 % Predict robot pose using p a r t i c l e parameters ( velocities )

42 qk = qk + Ts *[cos(qk (3) ) , 0; sin(qk (3) ) , 0; 0 , 1]* ui ;

43 qk (3) = wrapToPi(qk (3) ) ; % Correct angle range

44 e = [cos(qk (3) ) , sin(qk (3) ) , 0; . . .

45 − sin ( qk (3) ) , cos(qk (3) ) , 0; . . .

46 0 , 0 , 1]*( qRef ( : , k+1) − qk ) ; % Error

47 e (3) = wrapToPi( e (3) ) ; % Correct angle range

48 Qt = diag ( [ 4 ; 80; 0 . 1 ] ) ; Rt = diag ( [ 1 ; 1]*0.0001) ; % Weights

49 J = e . ’*Qt*e + vwi . ’* Rt*vwi ; % Cost function

50 if J < swarm(1 , i ,4) % if new parameter i s better , update:

51 swarm ( : , i ,3) = swarm ( : , i ,1) ; % param values (v and w )

52 swarm(1 , i ,4) = J ; % and best c r i t e r i a value.

53 end

54 end

55 [˜ , gBest ] = min(swarm (1 ,: ,4) ) ; % Global best p a r t i c l e parameters

57 % Updating parameters by velocity vectors

58 a = omega*swarm ( : , : , 2 ) + . . .

59 c1*rand(2 ,N) .*( swarm ( : , : , 3 ) − swarm ( : , : , 1 ) ) + . . .

60 c2*rand(2 ,N) .*( repmat (swarm ( : , gBest ,3) , 1 , N) − swarm ( : , : , 1 ) ) ;

61 % Max param increment , acceleration: aMax =3 ==> 3*Ts=0.1

62 a (1 , a ( 1 , : ) >0.1) = 0 . 1 ; a (1 , a ( 1 , : ) <−0.1) = −0.1;

63 % Max param increment , angular acceleration: aMax =60 ==> 60*Ts=2

64 a (2 , a ( 1 , : ) >2) = 2; a (2 , a ( 1 , : ) <−2) = −2;

66 v = swarm ( : , : , 1 ) + a ; % Update velocity

67 % Limit velocity to preserve curvature . . .

68 [m, i i ] = max ( [ v ( 1 , : ) /vMax; v ( 2 , : ) / wMax; ones (1 ,N) ] ) ;

69 i = i i ==1; v(1 , i ) = sign(v(1 , i ) )*vMax;

70 v(2 , i ) = v(2 , i ) ./m ( i ) ;

71 i = i i ==2; v(2 , i ) = sign(v(2 , i ) )* wMax;

72 v(1 , i ) = v(1 , i ) ./m ( i ) ;

74 swarm ( : , : , 2 ) = a ; % Updated p a r t i c l e velocities (acceleration)

75 swarm ( : , : , 1 ) = v ; % Updated p a r t i c l e positions ( velocities )

76 end

78 % Take the best p a r t i c l e to get robot command velocities

79 uBest = swarm ( : , gBest ,1) ;

80 u = uBest + uRef ( : , k) ; % Feedback and feedforward

82 % Velocity constraints

83 if abs(u(1) ) > vMax, u(1) = sign(u(1) )*vMax; end

84 if abs(u(2) ) > wMax, u(2) = sign(u(2) )* wMax; end

86 % Robot motion simulation

87 dq = [ u(1) *cos(q (3) ) ; u(1) *sin(q (3) ) ; u(2) ] ;

88 noise = 0.00; % Set to experiment with noise (e.g. 0.001)

89 q = q + Ts*dq + randn(3 ,1) * noise ; % Euler integration

90 q (3) = wrapToPi(q (3) ) ; % Map orientation angle to [− pi , pi]

91 end

Algorithm 1

Particle swarm optimization

u03-01-9780128042045 u03-02-9780128042045

PSO is a generic algorithm that can be applied to numerous applications as long as the computational effort and time required to find the solution meet the real time requirements of the system being controlled.

MPC With PSO

Solving optimization of objective function in MPC can also be done by PSO, where the best parameters (control actions) are searched for the prediction interval (t, t + hT_S) where h is horizon. If a system has m = 2 control inputs then the optimization needs to find m ⋅ h optimal parameters, which may be computationally intense and therefore problematic for systems with short control-loop sampling times. However, there are some possibilities to lower the computational time as follows. One possibility is to assume constant controls in the prediction horizon. If the control action does not change significantly in a relatively short horizon time, then constant controls can be assumed in the horizon interval. This means that only m parameters instead of m ⋅ h need to be optimized. Another possibility is to lower the required iterations in optimization for each control-loop sampling time. This can be done by initializing the particles around the optimal solution from the previous time sample. Doing so, this would hopefully require less iterations for the particles to converge.

A possible solution of implementing MPC on PSO is given in Example 3.15.

Example 3.15

Extend Example 3.14 to a model-based predictive controller with prediction horizon h = 3.

Solution

The presented solution supposes that the control action (feedback part) in the prediction horizon interval is constant u_fb(t + (i − 1)T_S) =u_fb. Feedforward action (u_ff) is obtained from known trajectory, while the feedback action is computed using PSO. From the current robot pose the h-step-ahead prediction is made using the robot kinematic model and control actions u(t + (i − 1)T_S) =u_fb +u_ff(t + (i − 1)T_S) (i = 1, …, h) are obtained as follows:

$\begin{array}{l} \hat{q} (t + i T_{S}) = f (\hat{q} (t + (i - 1) T_{S}), u (t + (i - 1) T_{S})) \end{array}$ $\begin{array}{l} \hat{q} (t + i T_{S}) = f (\hat{q} (t + (i - 1) T_{S}), u (t + (i - 1) T_{S})) \end{array}$

where the initial state equals $\hat{q} (t) = q (t)$ $\hat{q} (t) = q (t)$ and i = 1, …, h. The control action is obtained by optimizing the cost function for the horizon period

$\begin{array}{l} J (t + h T_{S}) = \sum_{i = 1}^{h} e {(t + i T_{S})}^{T} Q e (t + i T_{S}) + u_{fb}^{T} R u_{fb} \end{array}$ $\begin{array}{l} J (t + h T_{S}) = \sum_{i = 1}^{h} e {(t + i T_{S})}^{T} Q e (t + i T_{S}) + u_{fb}^{T} R u_{fb} \end{array}$

si270_e

where e(⋅) is the tracking error in local coordinates. The optimal control action is calculated using PSO as is given in the Matlab script in Listing 3.14. Simulation results are shown in Figs. 3.34 and 3.35. A smaller tracking error and a better initial transition compared to the results in Example 3.14 are obtained.

Listing 3.14

1 Ts = 0.033; % Sampling time

2 t = 0: Ts : 3 0 ; % Simulation time

3 q = [ 1 . 1 ; 0 . 8 ; 0 ] ; % Initial robot pose

5 % Reference

6 freq = 2*pi/30;

7 xRef = 1.1 + 0.7*sin( freq *t ) ; yRef = 0.9 + 0.7*sin(2* freq *t ) ;

8 dxRef = freq *0.7*cos( freq *t ) ; dyRef = 2* freq *0.7*cos(2* freq *t ) ;

9 ddxRef =− freq ˆ2*0.7* sin( freq *t ) ; ddyRef =−4* freq ˆ2*0.7* sin (2* freq *t ) ;

10 qRef = [ xRef ; yRef ; atan2( dyRef , dxRef ) ] ; % Reference trajectory

11 vRef = sqrt( dxRef.ˆ2+dyRef .ˆ2) ;

12 wRef = ( dxRef .* ddyRef− dyRef .* ddxRef ) . / ( dxRef .ˆ2+dyRef .ˆ2) ;

13 uRef = [ vRef ; wRef ] ; % Reference inputs

15 vMax = 1; wMax = 15; % Valocity constraints

17 % Swarm Initial i z a t i o n

18 i t e r a t i o n s = 20; % Number of i t e r a t i o n s

19 omega = 0.5*0.5; % Inertia

20 c1 = 0.5*1; % Correction factor

21 c2 = 0.5*1; % Global correction factor

22 N = 25; % Swarm s i z e

23 swarm = zeros([2 ,N, 4 ] ) ;

24 uBest = [ 0 ; 0 ] ;

26 H = 3; % Prediction horizon

28 for k = 1:length( t )− H

29 % Initial swarm position

30 swarm ( : , : , 1 ) = repmat ( uBest , 1 , N) + diag ( [ 0 . 1 ; 3]) *randn(2 ,N) ;

31 swarm ( : , : , 2 ) = 0; % Initial p a r t i c l e velocity

32 swarm (1 ,: ,4) = 1000; % Best value so far

34 for i t e r = 1: i t e r a t i o n s % PSO i t e r a t i v e l y find best solution

35 % Evaluate p a r t i c l e s parameters

36 for i = 1:N

37 % Compute new predicted pose of the robot using i − th p a r t i c l e

38 % parameters (input velocities ) and compare obtained predicted

39 % pose to the reference pose.

40 vwi = swarm ( : , i ,1) ;

41 ui = vwi + uRef ( : , k) ; % Feedback and feedforward

42 qk = q ; % Current robot pose

43 % Predict robot pose using p a r t i c l e parameters ( velocities )

44 J = 0;

45 for h = 1:H

46 qk = qk + Ts *[cos(qk (3) ) , 0; sin(qk (3) ) , 0; 0 , 1]* ui ;

47 qk (3) = wrapToPi(qk (3) ) ; % Correct angle range

48 e = [cos(qk (3) ) , sin(qk (3) ) , 0; . . .

49 − sin ( qk (3) ) , cos(qk (3) ) , 0; . . .

50 0 , 0 , 1]*( qRef ( : , k + h) − qk ) ; % Error

51 e (3) = wrapToPi( e (3) ) ; % Correct angle range

52 Qt = diag ( [ 4 ; 80; 0 . 1 ] ) ; Rt = diag ( [ 1 ; 1]*0.0001) ; % Weights

53 J = J + e . ’*Qt*e + vwi . ’* Rt*vwi ; % Cost function

54 end

55 if J < swarm(1 , i ,4) % if new parameter i s better , update:

56 swarm ( : , i ,3) = swarm ( : , i ,1) ; % param values (v and w )

57 swarm(1 , i ,4) = J ; % and best c r i t i r i a value.

58 end

59 end

60 [˜ , gBest ] = min(swarm (1 ,: ,4) ) ; % Global best p a r t i c l e parameters

62 % Updating parameters by velocity vectors

63 a = omega*swarm ( : , : , 2 ) + . . .

64 c1*rand(2 ,N) .*( swarm ( : , : , 3 ) − swarm ( : , : , 1 ) ) + . . .

65 c2*rand(2 ,N) .*( repmat (swarm ( : , gBest ,3) , 1 , N) − swarm ( : , : , 1 ) ) ;

66 % Max param increment , acceleration: aMax =3 ==> 3*Ts=0.1

67 a (1 , a ( 1 , : ) >0.1) = 0 . 1 ; a (1 , a ( 1 , : ) <−0.1) = −0.1;

68 % Max param increment , angula acceleration: aMax =60 ==> 60*Ts=2

69 a (2 , a ( 1 , : ) >2) = 2; a (2 , a ( 1 , : ) <−2) = −2;

71 v = swarm ( : , : , 1 ) + a ; % Update velocity

72 % Limit velocity to preserve curvature . . .

73 [m, i i ] = max ( [ v ( 1 , : ) /vMax; v ( 2 , : ) / wMax; ones (1 ,N) ] ) ;

74 i = i i ==1; v(1 , i ) = sign(v(1 , i ) )*vMax;

75 v(2 , i ) = v(2 , i ) ./m ( i ) ;

76 i = i i ==2; v(2 , i ) = sign(v(2 , i ) )* wMax;

77 v(1 , i ) = v(1 , i ) ./m ( i ) ;

79 swarm ( : , : , 2 ) = a ; % Updated p a r t i c l e velocities (acceleration)

80 swarm ( : , : , 1 ) = v ; % Updated p a r t i c l e positions ( velocities )

81 end

83 % Take the best p a r t i c l e to get robot command velocities

84 uBest = swarm ( : , gBest ,1) ;

85 u = uBest + uRef ( : , k) ; % Feedback and feedforward

87 % Velocity constraints

88 if abs(u(1) ) > vMax, u(1) = sign(u(1) )*vMax; end

89 if abs(u(2) ) > wMax, u(2) = sign(u(2) )* wMax; end

91 % Robot motion simulation

92 dq = [ u(1) *cos(q (3) ) ; u(1) *sin(q (3) ) ; u(2) ] ;

93 noise = 0.00; % Set to experiment with noise (e.g. 0.001)

94 q = q + Ts*dq + randn(3 ,1) * noise ; % Euler integration

95 q (3) = wrapToPi(q (3) ) ; % Map orientation angle to [− pi , pi]

96 end

f03-34-9780128042045 — Fig. 3.34 Obtained control tracking results using MPC on PSO (reference is marked as dashed).

f03-35-9780128042045 — Fig. 3.35 Control action calculated using MPC on PSO.

3.3.10 Visual Servoing Approaches for Solving the Control Problems in Mobile Robotics

This part will present the use of visual servoing (VS) approaches for solving the control problems in mobile robotics. The main emphasis is on development of a mobile robotic system that is able to autonomously complete the desired task based only on visual information provided by a camera mounted on the mobile system.

In the field of VS the estimated signals obtained from images that are used in order to determine the control error are known as features. Features can be points, lines, circles, area of an object, angles between lines, or something else. The control error of a general VS control scheme can be written as a difference between the desired feature vector x_ref(t) and current feature vector x(y(t), ζ) [26]:

$\begin{array}{l} e (t) = x_{ref} (t) - x (y (t), ζ) \end{array}$ $\begin{array}{l} e (t) = x_{ref} (t) - x (y (t), ζ) \end{array}$

(3.105)

The K-element feature state vector $x (y (t), ζ) \in R^{K}$ $x (y (t), ζ) \in R^{K}$ in Eq. (3.105) is obtained based on image-based measurements y(t) (e.g., position, area, or shape of patterns in image) and some additional time-invariant knowledge about the system ζ (e.g., intrinsic camera parameters, known properties of the observed scene, system constraints). One of the crucial challenges in designing a visual control is the appropriate definition of feature vector, since this vector is used to determine control error. If the feature vector is not defined appropriately more than one situation may exist in which the control error is minimal—the problem of local minimums. Some features may only be appropriate for some classes of motions (e.g., only translations) and may be completely inappropriate for some other class of motions (e.g., translation with rotation). If the features are not selected carefully, in some situations undesired, unnecessary, or even unexpected control motions can occur—a problem also known as camera retreat. During visual servoing some tracked features may leave the camera field of view that may disable completion of the VS task. Therefore the visual control algorithm should prevent the features from leaving the camera’s field of view, use alternative features if some features are inevitably lost, or estimate locations of features that are temporarily outside of the field of view.

According to the definition of the feature vector x, VS control approaches can be classified into three main categories [27]: position-based visual servoing (PBVS), image-based visual servoing (IBVS), and hybrid VS. In the case of PBVS the control error is defined as the error between the desired and current robot pose in the 3D task space. The camera in the PBVS control scheme is used in order to estimate the 3D positions of objects and robot in the environment from the image, which is a projection of the environment. Therefore all the control approaches that have already been presented in the rest of this chapter can be used directly in this class of control approaches. Normally with the PBVS approach optimum control actions can be achieved; however, the approach requires accurate camera calibration or else the true control error cannot be eliminated. On the other hand, the control error in the IBVS control scheme is defined directly in the 2D image space (e.g., as a difference between two image points). Since the control error is defined directly in image, IBVS is less susceptible to accuracy of camera calibration. However, IBVS normally achieves less optimum trajectories than can be achieved with PBVS approach, and the IBVS approach is more susceptible to the camera retreat problem if the image features are not chosen appropriately. IBVS approaches are especially interesting, since they enable in-image task definition and the control approach known as teach-by-showing. HVS approaches try to combine good properties of PBVS and IBVS control schemes. Some control schemes only switch between PBVS and IBVS controllers according to some switching criteria that selects the most appropriate control approach based on the current system states [28]. In some other hybrid approaches the control error is composed of features from 2D image space and features from 3D task space [29, 30]. Visual control of nonholonomic systems is considered a difficult problem [31–34]. Visual control schemes are sometimes used in combination with some additional sensors [35] that may provide additional data or ease image processing.

In [26, 36] the authors have presented the general control design approach that can be used regardless of the class of VS scheme. Let us consider the case where the reference feature state vector in Eq. (3.105) is time invariant (x_ref(t) =x_ref = const.). Commonly a velocity controller is used. The velocity control vector can be written as a combination of a translational velocity vector v(t) and an angular velocity vector ω(t) in a combined vector $u^{T} (t) = [v^{T} (t), ω^{T} (t)] \in R^{M}$ $u^{T} (t) = [v^{T} (t), ω^{T} (t)] \in R^{M}$ . In general the velocity vector and angular velocity vector of an object in a 3D space both consist of three velocities, that is, v^T(t) = [v_x(t), v_y(t), v_z(t)] and ω^T(t) = [ω_x(t), ω_y(t), ω_z(t)]. However, in the case of wheeled mobile robotics these two control vectors normally have some zero elements, since the mobile robot in normal driving conditions is assumed to move on a plane, and no tumbling or lifting is expected. The relation between the velocity control vector u^T(t) and the rate of feature states ${\dot{s}}^{T} (t)$ ${\dot{s}}^{T} (t)$ can be written as follows:

$\begin{array}{l} \dot{x} (t) = L (t) u (t) \end{array}$ $\begin{array}{l} \dot{x} (t) = L (t) u (t) \end{array}$

(3.106)

where the matrix $L (t) \in R^{K} \times R^{M}$ $L (t) \in R^{K} \times R^{M}$ is known as the interaction matrix. From relations (3.105), (3.106) the rate of error can be determined:

$\begin{array}{l} \dot{e} (t) = - L (t) u (t) \end{array}$ $\begin{array}{l} \dot{e} (t) = - L (t) u (t) \end{array}$

(3.107)

The control action u(t) should minimize the error e. If exponential decay of error e is desired in the form $\dot{e} (t) = - g e (t)$ $\dot{e} (t) = - g e (t)$ , g > 0, the following control law can be derived:

$\begin{array}{l} u (t) = g L^{†} (t) e (t) \end{array}$ $\begin{array}{l} u (t) = g L^{†} (t) e (t) \end{array}$

(3.108)

where L^†(t) is the Moore-Penrose pseudoinverse of the interaction matrix L(t). If the matrix L(t) is of full rank, its pseudoinverse can be computed as L^†(t) = (L^T(t)L(t))⁻¹L^T(t). If the interaction matrix is of square shape (K = M) and it is not singular ( $det (L (t)) \neq 0$ $det (L (t)) \neq 0$ ), the calculation of pseudoinverse in Eq. (3.108) simplifies to the ordinary matrix inverse L⁻¹(t).

In practice only an approximate value of the true interaction matrix can be obtained. Therefore in control scheme (3.108) only the interaction matrix estimate $\hat{L} (t)$ $\hat{L} (t)$ or estimate of the interaction matrix inverse $\hat{L^{†}} (t)$ $\hat{L^{†}} (t)$ can be used:

$\begin{array}{l} u (t) = g \hat{L^{†}} (t) e (t) \end{array}$ $\begin{array}{l} u (t) = g \hat{L^{†}} (t) e (t) \end{array}$

(3.109)

The interaction matrix inverse estimate $\hat{L^{†}} (t)$ $\hat{L^{†}} (t)$ can be determined from linearization of the system around the current system state ( $\hat{L^{†}} (t) = {\hat{L}}_{t}^{†} (t)$ $\hat{L^{†}} (t) = {\hat{L}}_{t}^{†} (t)$ ) or around the desired (reference) system state ( $\hat{L^{†}} (t) = {\hat{L}}_{ref}^{†} (t)$ $\hat{L^{†}} (t) = {\hat{L}}_{ref}^{†} (t)$ ). Sometimes a combination in the form $\hat{L^{†}} (t) = \frac{1}{2} {({\hat{L}}_{ref} (t) + {\hat{L}}_{t} (t))}^{†}$ $\hat{L^{†}} (t) = \frac{1}{2} {({\hat{L}}_{ref} (t) + {\hat{L}}_{t} (t))}^{†}$ is used [36].

If the control (3.109) is inserted into Eq. (3.107), the differential equation of the closed-loop system is obtained:

$\begin{array}{l} \dot{e} (t) = - g L (t) \hat{L^{†}} (t) e (t) \end{array}$ $\begin{array}{l} \dot{e} (t) = - g L (t) \hat{L^{†}} (t) e (t) \end{array}$

(3.110)

A Lyapunov function $V (t) = \frac{1}{2} e^{T} (t) e (t)$ $V (t) = \frac{1}{2} e^{T} (t) e (t)$ can be used to check stability of the closed-loop system (3.110). The derivative of the Lyapunov function is

$\begin{array}{l} \dot{V} (t) = - g e^{T} (t) L (t) \hat{L^{†}} (t) e (t) \end{array}$ $\begin{array}{l} \dot{V} (t) = - g e^{T} (t) L (t) \hat{L^{†}} (t) e (t) \end{array}$

(3.111)

A sufficient condition for global stability of the system (3.110) is satisfied if the matrix $L (t) \hat{L^{†}} (t)$ $L (t) \hat{L^{†}} (t)$ is semipositive definite. In the case the number of observed features K equals to the number of control inputs M (i.e., K = M) and the matrices L(t) and $\hat{L^{†}} (t)$ $\hat{L^{†}} (t)$ both have full rank; the closed-loop system (3.110) is stable if the estimate of the matrix $\hat{L^{†}} (t)$ $\hat{L^{†}} (t)$ is not too coarse [26]. However, it should be noted that in the case of IBVS it is not simple to select appropriate features in the image space that eliminate the desired control problem in the task space.

Example 3.16

A camera, C, is mounted on the wheeled mobile robot R with differential drive at $t_{C}^{R} = {[0, 0, 0.5]}^{T}$ $t_{C}^{R} = {[0, 0, 0.5]}^{T}$ with orientation $R_{C}^{R} = R_{x} (90 degree)$ $R_{C}^{R} = R_{x} (90 degree)$ R_y(−90 degree)R_x(45 degree). The intrinsic camera parameters are α_xf = α_yf = 300, no skewness (γ = 0), and the image center is in the middle of the image with a dimension of 1024 × 768. Design an IBVS control that drives the mobile robot from initial pose [x(0), y(0), φ(0)] = [1 m, 0, 100 degree] to the goal position x_ref = 4 m and y_ref = 4 m. For the sake of simplicity assume that the rotation center of the mobile robot is visible in the image.

Solution

The camera is observing the scene in front of the mobile robot. Since the camera is mounted without any roll around the optical axis center, a decoupled control can be designed directly based on image error between the observed goal position and image of the robot’s center of rotation. A possible implementation of the solution in Matlab is shown in Listing 3.15. The obtained path of the mobile robot is shown in Fig. 3.36 and the path of the observed goal in the image is shown in Fig. 3.37. The control signals are shown in Fig. 3.38. The results confirm the applicability of the IBVS control scheme. The robot reaches the goal pose, since the observed feature (goal) is visible during the entire time of VS (Fig. 3.37).

f03-36-9780128042045 — Fig. 3.36 Obtained path with IBVS control from Example 3.16.

f03-37-9780128042045 — Fig. 3.37 Path of the observed feature (goal) in the image space from Example 3.16.

f03-38-9780128042045 — Fig. 3.38 Control signals of IBVS control from Example 3.16.

Listing 3.15

1 Ts = 0.03; % Sampling time

2 t = 0: Ts : 1 5 ; % Simulation1 time

3 r = 0 . 5 ; % Distance parameter for the intermediate point

4 dTol = 1; % Tolerance distance (to the intermediate point) for switch

5 qRef = [ 4 ; 4; 0 ] ; % Reference pose

6 q = [ 1 ; 0; 100/180*pi ] ; % Initial pose

8 % Camera

9 alphaF = 300; % alpha*f , in px/m

10 s = [1024; 768]; % screen size , in px

11 c = s /2; % image centre , in px

12 S = [ alphaF , 0 , c (1) ; 0 , alphaF , c (2) ; 0 , 0 , 1 ] ; % Internal camera model

13 RL2C = rotX (pi/2)*rotY(− pi /2) * rotX (pi/4) ; tL2C = [ 0 ; 0 ; 0 . 5 ] ; % Camera mounting

14 % Camera simulation

15 p0P = S*RL2C. ’ * ( [ 0 ; 0; 0]− tL2C ) ; p0P = p0P/p0P(3) ;

16 RW2L = rotZ(− q (3) ) ; tW2L = [ q (1:2) ; 0 ] ;

17 pP = S*RL2C. ’*(RW2L. ’ * ( [ qRef (1:2) ; 0]− tW2L ) − tL2C ) ; pP = pP / pP(3) ;

19 u = [ 0 ; 0 ] ;

20 for k = 1:length( t )

21 if pP(1)<0 || pP(2)<0 || pP(1)>s (1) || pP(2)>s (2) % I n v i s i b l e feature

22 u = [ 0 ; 0 ] ; % Lost tracked feature

23 else

24 D = sqrt(sum((pP(1:2)− p0P ( 1 : 2 ) ) .ˆ2) ) ;

25 if D < dTol % Stop when close to the goal

26 u = [ 0 ; 0 ] ;

27 else

28 u = [0 , 0.002; 0.005 , 0]*(p0P (1:2) − pP ( 1 : 2 ) ) ;

29 end

30 end

32 % Robot motion simulation

33 dq = [ u(1) *cos(q (3) ) ; u(1) *sin(q (3) ) ; u(2) ] ;

34 noise = 0.00; % Set to experiment with noise (e.g. 0.001)

35 q = q + Ts*dq + randn(3 ,1) * noise ; % Euler integration

36 q (3) = wrapToPi(q (3) ) ; % Map orientation angle to [− pi , pi]

38 % Camera simulation

39 RW2L = rotZ (− q (3) ) ; tW2L = [ q (1:2) ; 0 ] ;

40 pP = S*RL2C. ’*(RW2L. ’ * ( [ qRef (1:2) ; 0]− tW2L ) − tL2C ) ; pP = pP / pP(3) ;

42 end

3.4 Optimal Velocity Profile Estimation for a Known Path

Wheeled mobile robots often need to drive on an existing predefined path (e.g., road or corridor) that is defined spatially by some schedule parameter u as x_p(u), y_p(u), u ∈ [u_SP, u_EP]. To drive the robot on this path a desired velocity profile needs to be planned. A velocity profile is important if the robot needs to arrive from some start point SP to some end point EP in the shortest time and to consider robot capabilities and the like. To drive the robot on such path its position therefore becomes dependent on time x(t), y(t), with defined velocity profile v(t), ω(t), as shown next.

Suppose there is the special case u = t; the reference path becomes a trajectory with an implicitly defined velocity profile. The robot therefore needs to drive on the trajectory with the reference velocities v(t) = v_ref(t) and ω(t) = ω_ref(t) calculated from the reference trajectory using relations (3.22), (3.23).

To plan the desired velocity profile for the spatially given path, one needs to find the schedule u = u(t). The planned velocity profile must be consistent with the robot constraints (maximum velocity and maximum acceleration that motors can produce and which assures safe driving without longitudinal and lateral slip of the wheels). Reference velocities are similar to those in Eqs. (3.22), (3.23) are expressed as follows:

$\begin{array}{l} v (t) = & \sqrt{x_{p}^{'} {(u (t))}^{2} + y_{p}^{'} {(u (t))}^{2}} \dot{u} (t) = v_{p} (u) \dot{u} (t) \end{array}$ $\begin{array}{l} v (t) = & \sqrt{x_{p}^{'} {(u (t))}^{2} + y_{p}^{'} {(u (t))}^{2}} \dot{u} (t) = v_{p} (u) \dot{u} (t) \end{array}$

(3.112)

$\begin{array}{l} ω (t) = & \frac{x_{p}^{'} (u (t)) y_{p}^{''} (u (t)) - y_{p}^{'} (u (t)) x_{p}^{''} (u (t))}{x_{p}^{'} {(u (t))}^{2} + y_{p}^{'} {(u (t))}^{2}} \dot{u} (t) = ω_{p} (u) \dot{u} (t) \end{array}$ $\begin{array}{l} ω (t) = & \frac{x_{p}^{'} (u (t)) y_{p}^{''} (u (t)) - y_{p}^{'} (u (t)) x_{p}^{''} (u (t))}{x_{p}^{'} {(u (t))}^{2} + y_{p}^{'} {(u (t))}^{2}} \dot{u} (t) = ω_{p} (u) \dot{u} (t) \end{array}$

si297_e (3.113)

and curvature is

$\begin{array}{l} κ (t) = \frac{x_{p}^{'} (u (t)) y_{p}^{''} (u (t)) - y_{p}^{'} (u (t)) x_{p}^{''} (u (t))}{{(x_{p}^{'} {(u (t))}^{2} + y_{p}^{'} {(u (t))}^{2})}^{3 / 2}} = κ_{p} (u) \end{array}$ $\begin{array}{l} κ (t) = \frac{x_{p}^{'} (u (t)) y_{p}^{''} (u (t)) - y_{p}^{'} (u (t)) x_{p}^{''} (u (t))}{{(x_{p}^{'} {(u (t))}^{2} + y_{p}^{'} {(u (t))}^{2})}^{3 / 2}} = κ_{p} (u) \end{array}$

si298_e (3.114)

where primes denote derivatives with respect to u, while dots denote derivatives with respect to t. Time derivatives of the path therefore consider the schedule u(t) as follows: $\frac{d x (t)}{d t} = \frac{d x_{p}}{d u} \frac{d u}{d t} = x_{p}^{'} \dot{u} (t)$ $\frac{d x (t)}{d t} = \frac{d x_{p}}{d u} \frac{d u}{d t} = x_{p}^{'} \dot{u} (t)$ , $\frac{d y (t)}{d t} = \frac{d y_{p}}{d u} \frac{d u}{d t} = y_{p}^{'} \dot{u} (t)$ $\frac{d y (t)}{d t} = \frac{d y_{p}}{d u} \frac{d u}{d t} = y_{p}^{'} \dot{u} (t)$ .

The main idea of velocity profile planning is adopted from [37] and is as follows. For a given path a velocity profile is determined considering pure rolling conditions, meaning that command velocities are equal to the real robot velocities (no slipping of the wheels). This is archived by limitation of the overall allowable acceleration,

$\begin{array}{l} a = \sqrt{a_{t}^{2} + a_{r}^{2}} \end{array}$ $\begin{array}{l} a = \sqrt{a_{t}^{2} + a_{r}^{2}} \end{array}$

(3.115)

where

$\begin{array}{l} a_{t} = \frac{d v}{d t}, a_{r} = v ω = v^{2} κ \end{array}$ $\begin{array}{l} a_{t} = \frac{d v}{d t}, a_{r} = v ω = v^{2} κ \end{array}$

si302_e (3.116)

are the tangential and radial acceleration, respectively. Maximum acceleration that prevents sliding is defined by friction force F_fric:

$\begin{array}{l} a_{MAX} = \frac{F_{fric}}{m} = \frac{m g c_{fric}}{m} = g c_{fric} \end{array}$ $\begin{array}{l} a_{MAX} = \frac{F_{fric}}{m} = \frac{m g c_{fric}}{m} = g c_{fric} \end{array}$

si303_e (3.117)

where m is the vehicle mass, g is the gravity acceleration, and c_fric is the friction coefficient. Due to vehicle construction, maximal tangential acceleration a_MAXt and radial acceleration a_MAXr are usually different and can be estimated by experiment. Therefore the overall acceleration should be somewhere inside the ellipse

$\begin{array}{l} \frac{a_{t}^{2}}{a_{MAX t}^{2}} + \frac{a_{r}^{2}}{a_{MAX r}^{2}} \leq 1 \end{array}$ $\begin{array}{l} \frac{a_{t}^{2}}{a_{MAX t}^{2}} + \frac{a_{r}^{2}}{a_{MAX r}^{2}} \leq 1 \end{array}$

si304_e (3.118)

or always on the boundary in the case of time-optimal planning. In path turns the robot needs to drive slower due to higher radial acceleration. Therefore to estimate the schedule u(t), first the turning points (TPs) of the path (where the absolute value of the curvature is locally maximal) are located where u is in the interval [u_SP, u_EP]. Positions of TPs are denoted u = u_TPi, where i = 1, …, n_TP and n_TP is the number of TPs. In TPs translational velocity is locally the lowest, tangential acceleration is supposed to be 0, and radial acceleration is maximal. Tangential velocity in TPs can then be calculated considering Eq. (3.116)

$\begin{array}{l} v_{p} (u_{TP i}) = \sqrt{\frac{a_{MAX r}}{| κ (u_{TP i}) |}} \end{array}$ $\begin{array}{l} v_{p} (u_{TP i}) = \sqrt{\frac{a_{MAX r}}{| κ (u_{TP i}) |}} \end{array}$

si305_e (3.119)

Before and after the TP, the robot can move faster, because the curvature is lower than in the TP. Therefore the robot must decelerate before each TP (u < u_TPi) and accelerate after the TP (u > u_TPi) according to the acceleration constraint (3.118).

From Eq. (3.112) it follows that in each fixed point of the path v(t) and v_p(u) are proportional with proportional factor $\dot{u} (t)$ $\dot{u} (t)$ defined in time. A feasible time-minimum velocity profile is therefore defined by the derivative of the schedule $\dot{u} (t)$ $\dot{u} (t)$ , where the minmax solution is computed as described next (minimizing maximum velocity profile candidates to comply with acceleration constrains). The radial and the tangential acceleration are expressed from Eq. (3.116) considering Eq. (3.112) as follows:

$\begin{array}{l} a_{r} (t) = & (x_{p}^{'} {(u)}^{2} + y_{p}^{'} {(u)}^{2}) κ_{p} (u) {\dot{u}}^{2} (t) = v_{p}^{2} (u) κ_{p} (u) {\dot{u}}^{2} (t) \end{array}$ $\begin{array}{l} a_{r} (t) = & (x_{p}^{'} {(u)}^{2} + y_{p}^{'} {(u)}^{2}) κ_{p} (u) {\dot{u}}^{2} (t) = v_{p}^{2} (u) κ_{p} (u) {\dot{u}}^{2} (t) \end{array}$

(3.120)

$\begin{array}{l} a_{t} (t) = & \frac{x_{p}^{'} (u) x_{p}^{''} (u) + y_{p}^{'} (u) y_{p}^{''} (u)}{\sqrt{x_{p}^{'} {(u)}^{2} + y_{p}^{'} {(u)}^{2}}} {\dot{u}}^{2} (t) + \sqrt{x_{p}^{'} {(u)}^{2} + y_{p}^{'} {(u)}^{2}} ü^{2} (t) \\ = & \frac{d v_{p} (u)}{d u} {\dot{u}}^{2} (t) + v_{p} (u) ü (t) \end{array}$ $\begin{array}{l} a_{t} (t) = & \frac{x_{p}^{'} (u) x_{p}^{''} (u) + y_{p}^{'} (u) y_{p}^{''} (u)}{\sqrt{x_{p}^{'} {(u)}^{2} + y_{p}^{'} {(u)}^{2}}} {\dot{u}}^{2} (t) + \sqrt{x_{p}^{'} {(u)}^{2} + y_{p}^{'} {(u)}^{2}} ü^{2} (t) \\ = & \frac{d v_{p} (u)}{d u} {\dot{u}}^{2} (t) + v_{p} (u) ü (t) \end{array}$

si309_e (3.121)

where dependence on time for u(t) is omitted due to shorter notation. Taking the boundary case of Eqs. (3.118), (3.120), Eq. (3.121) results in an optimal schedule differential equation

$\begin{array}{l} ü = \pm a_{MAX t} \sqrt{\frac{1}{x_{p}^{' 2} + y_{p}^{' 2}} - \frac{(x_{p}^{' 2} + y_{p}^{' 2}) κ_{p}^{2} {\dot{u}}^{4}}{a_{MAX r}^{2}}} - \frac{x_{p}^{'} x_{p}^{''} + y_{p}^{'} y_{p}^{''}}{x_{p}^{' 2} + y_{p}^{' 2}} {\dot{u}}^{2} \end{array}$ $\begin{array}{l} ü = \pm a_{MAX t} \sqrt{\frac{1}{x_{p}^{' 2} + y_{p}^{' 2}} - \frac{(x_{p}^{' 2} + y_{p}^{' 2}) κ_{p}^{2} {\dot{u}}^{4}}{a_{MAX r}^{2}}} - \frac{x_{p}^{'} x_{p}^{''} + y_{p}^{'} y_{p}^{''}}{x_{p}^{' 2} + y_{p}^{' 2}} {\dot{u}}^{2} \end{array}$

si310_e (3.122)

A solution of a differential equation is found by means of explicit simulation integrating from the TPs forward and backward in time. When accelerating a positive sign is used, and a negative sign is used when decelerating. The initial condition u(t) and $\dot{u}$ $\dot{u}$ are defined by the position of TPs u_TPi and from Eq. (3.120), knowing that the radial acceleration is maximal allowable in the TPs:

$\begin{array}{l} \dot{u} |_{TP i} = \sqrt{\frac{a_{MAX r}}{(x_{p}^{'} {(u_{TP i})}^{2} + y_{p}^{'} {(u_{TP i})}^{2}) κ_{p} (u_{TP i})}} \end{array}$ $\begin{array}{l} \dot{u} |_{TP i} = \sqrt{\frac{a_{MAX r}}{(x_{p}^{'} {(u_{TP i})}^{2} + y_{p}^{'} {(u_{TP i})}^{2}) κ_{p} (u_{TP i})}} \end{array}$

si312_e (3.123)

Only a positive initial condition (3.123) is shown because u(t) has to be a strictly increasing function. The differential equations are solved until the violation of the acceleration constraint is found or u leaves the interval [u_SP, u_EP].

The violation usually appears when moving (accelerating) from the current TPi toward the next TP (i − 1 or i + 1), and the translational velocity becomes much higher than allowed by the trajectory. The solution of the differential equation (3.122) therefore consists of the segments of $\dot{u}$ $\dot{u}$ around each turning point:

$\begin{array}{l} {\dot{u}}_{l} = {\dot{u}}_{l} (u), u \in [{\underline{u}}_{l}, {\bar{u}}_{l}], l = 1, \dots, n_{TP} \end{array}$ $\begin{array}{l} {\dot{u}}_{l} = {\dot{u}}_{l} (u), u \in [{\underline{u}}_{l}, {\bar{u}}_{l}], l = 1, \dots, n_{TP} \end{array}$

(3.124)

where ${\dot{u}}_{l} = {\dot{u}}_{l} (u (t))$ ${\dot{u}}_{l} = {\dot{u}}_{l} (u (t))$ is the derivative of the schedule depending on u, and ${\underline{u}}_{l}$ ${\underline{u}}_{l}$ , ${\bar{u}}_{l}$ ${\bar{u}}_{l}$ are the lth segment borders. Here the segments in Eq. (3.124) are given as a function of u although the simulation of Eq. (3.122) is done with respect to time. This is because time offset (time needed to arrive in TP) is not yet known; what is known at this point is the relative time interval corresponding to each segment solution ${\dot{u}}_{l}$ ${\dot{u}}_{l}$ .

The solution of the time-minimum velocity profile obtained by accelerating on the slipping edge throughout the trajectory is possible if the union of the intervals covers the whole interval of interest $[u_{SP}, u_{EP}] \subseteq ⋃_{l = 1}^{n_{TP}} [{\underline{u}}_{l}, {\bar{u}}_{l}]$ $[u_{SP}, u_{EP}] \subseteq ⋃_{l = 1}^{n_{TP}} [{\underline{u}}_{l}, {\bar{u}}_{l}]$ . Actual $\dot{u}$ $\dot{u}$ is found by minimizing the individual ones:

$\begin{array}{l} \dot{u} = min_{1 \leq l \leq n_{TP}} {\dot{u}}_{l} (u) \end{array}$ $\begin{array}{l} \dot{u} = min_{1 \leq l \leq n_{TP}} {\dot{u}}_{l} (u) \end{array}$

(3.125)

Absolute time corresponding to the schedule u(t) is obtained from inverting $\dot{u} (u (t)) = \frac{d u}{d t}$ $\dot{u} (u (t)) = \frac{d u}{d t}$ to obtain $d t = {\dot{u}}^{- 1} (u) d u$ $d t = {\dot{u}}^{- 1} (u) d u$ and its integration:

$\begin{array}{l} t = \int_{u_{SP}}^{u_{EP}} \frac{d u}{\dot{u} (u)} = t (u) \end{array}$ $\begin{array}{l} t = \int_{u_{SP}}^{u_{EP}} \frac{d u}{\dot{u} (u)} = t (u) \end{array}$

si324_e (3.126)

where $\dot{u} (u)$ $\dot{u} (u)$ must not become 0. In our case the time-optimal solution is searched; therefore, the velocity (and also $\dot{u} (u)$ $\dot{u} (u)$ ) is always higher than 0. $\dot{u} (u) = 0$ $\dot{u} (u) = 0$ would imply that the velocity is zero, meaning the system would stop, and that cannot lead to the time-optimal solution.

Also, initial (in SP) and terminal (in EP) velocity requirements can be considered as follows. Treat SP and EP as normal TPs where the initial velocities v_SP, v_p(u_SP), v_EP, v_p(u_EP) are known so the initial condition for ${\dot{u}}_{SP} = \frac{v_{SP}}{v_{p} (u_{SP})}$ ${\dot{u}}_{SP} = \frac{v_{SP}}{v_{p} (u_{SP})}$ , ${\dot{u}}_{EP} = \frac{v_{EP}}{v_{p} (u_{EP})}$ ${\dot{u}}_{EP} = \frac{v_{EP}}{v_{p} (u_{EP})}$ can be computed. If these initial conditions for SP or EP are larger than the solution for $\dot{u}$ $\dot{u}$ (obtained from TPs), then the solution does not exist because it would be impossible to make it through the first turning point, even if the robot brakes maximally. If the solution for given SP and EP velocities exist then also include segments of $\dot{u}$ $\dot{u}$ computed for SP and EP to Eqs. (3.124), (3.125).

From the computed schedule u(t), $\dot{u} (t)$ $\dot{u} (t)$ , the reference trajectory is given by $x (t) = x_{p} (u (t))$ $x (t) = x_{p} (u (t))$ and $y (t) = y_{p} (u (t))$ $y (t) = y_{p} (u (t))$ , and the reference velocity is given by Eqs. (3.22), (3.23).

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.

Table of Contents for 3.4 Optimal Velocity Profile Estimation for a Known Path

Create new playlist

Sign In

Sign Up

3.3.8 Model-Based Predictive Control

Discrete MPC

3.3.9 Particle Swarm Optimization-Based Control

MPC With PSO

3.3.10 Visual Servoing Approaches for Solving the Control Problems in Mobile Robotics

3.4 Optimal Velocity Profile Estimation for a Known Path

Table of Contents for
3.4 Optimal Velocity Profile Estimation for a Known Path