Professional Documents
Culture Documents
Question 1: Consider the optimal control problem defined over a fixed time horizon 0 t T :
Z T
min J[u] = M (x(T )) +
L(x(t), u(t))dt
0
subject to the system dynamic constraints x = f (x(t), u(t), x(0) = x0 . Under appropriate
assumptions, a necessary condition for a minimising solution u = u? (t) is:
H
H
H
M (T )
= 0; x =
= f (x, u), x(0) = x0 ; p =
, p(T ) =
u
p
x
x
where H = H(x, p, u) = L(x, u) + pT (t)f (x, u) denotes the Hamiltonean of the system and p(t)
the co-state vector of the system (Lagrange multipliers). Consider now the LQR problem over
a fixed time horizon:
Z
1 T
1 T T
min J[u] = x (T )Sx(T ) +
(x Qx + uT Ru)dt
2
2 0
where S = S T 0, Q = QT 0 and R = RT > 0. Starting from the stated optimality
conditions for the general problem, show that the LQR problem reduces to the solution of the
following boundary-value problem:
!
! !
x
A BR1 B T
x
=
T
p
Q
A
p
subject to the boundary conditions x(0) = x0 and p(T ) = Sx(T ). By using the substitution
p(t) = P (t)x(t) (with P (T ) = S), show that the boundary value problem reduces to the solution
of the differential Riccati equation P (t) = AT P (t) + P (t)A P (t)BR1 B T P (t) + Q.
Question 2: Explain how the stabilising solution of the algebraic Riccati equation:
AT P + P A P BR1 B T P + C T C = 0
can be obtained from the eigenvalue-eigenvector decomposition of the associated Hamiltonean
matrix:
!
A
BR1 B T
H=
C T C
AT
Indicate clearly under what conditions the stabilising solution may fail to exist.
Question 3: (i) Consider the scalar differential Riccati equation:
p(t)
= ap2 (t) + bp(t) + c
1 of 3
where a, b and c are constants. Using the transformation p(t) = a1 u(t)
show that this can be
u(t)
transformed to a linear second-order differential equation.
u(t)
u(t)
= (t)u(t)
dt
Question 4: The system x(t)
3
0
= ax(t) + u(t) where m > 0. (i) Find the optimal state feedback.
In your calculations you need to determine which is the stabilising solution of the ARE (from
two possible solutions); (ii) Verify your conclusion by calculating the optimal closed-loop Amatrix; (iii) Explain how the response of the optimal closed-loop system varies with parameter
m. In your answer concentrate on the speed with which the state x(t) decays to zero starting
from some arbitrary initial condition x(0) = x0 (e.g. the time constant of the response) and on
the peak value maxt0 |u(t)| of the optimal control signal u(t).
Question 7: Find the optimal control of the system x 1 (t) = x2 (t), x 2 (t) = x1 (t) + u(t) which
minimises the performance index:
Z
1
1 2
(x1 (t) + u2 (t))dt
J[u] =
2 0
8
2 of 3
A=
0 1
0 0
, B=
0
1
; and C =
1 0
!
!
!
x 1
0 1
x1
=
x 2
0 0
x2
with performance index (which must be minimised):
Z
1 2
(x1 + x22 + u2 )dt
J[u] =
2 0
Express the performance index in the standard form. By solving the corresponding Riccati
equation obtain the optimal state-feedback control law. Hence derive the resulting closed-loop
system in state-space form and verify that it is asymptotically stable.
Question 10
Consider the scalar system x = x + u + v, y = x + w where v and w are mutually uncorrelated
white noise signals with variances E(v 2 ) = E(w2 ) = 1. Obtain the optimal steady-state Kalman
filter of this system and the corresponding Kalman gain L. Derive also the corresponding
dynamics of the estimation error signal e(t) = x(t)
x(t) and show that in the absence of process
and mesument noise (v = w = 0) this converges to zero for any initial error e(0) = x(0) x(0).
3 of 3