Professional Documents
Culture Documents
a b c
The Italian Institute of Technology , Via Morego, 30, 16163 Genova , Italy
Published online: 05 Apr 2012.
To cite this article: D.H. Owens (2012) Multivariable norm optimal and parameter optimal iterative learning control: a
unified formulation, International Journal of Control, 85:8, 1010-1025, DOI: 10.1080/00207179.2012.673136
To link to this article: http://dx.doi.org/10.1080/00207179.2012.673136
1. Introduction
Iterative learning control (ILC) originated with the
work of Arimoto, Kawamura, and Miyazaki (1984) is
concerned with the performance of systems that
operate in a repetitive manner and includes examples
such as robot arm manipulators and chemical batch
processes, where the task is to follow some specified
output trajectory r in a fixed specified time interval
with high precision. Motivated by human learning,
ILC uses information from previous executions of the
task in an attempt to improve performance from
repetition to repetition in the sense that the tracking
error (between the output and a specified reference
trajectory) is asymptotically reduced to zero (see, e.g.
the review papers Bristow, Tharayil, and Alleyne
(2006), Ahn, Chen, and Moore (2007) and Owens
and Hatonen (2005) and the applications described in
those review papers and papers such as Daley, Owens,
and Hatonen (2007), Ratcliffe, Lewin, Rogers,
Hatonen, and Owens (2006) and Kirchhoff, Schmidt,
Lichtenberg, and Werner (2008). Note that repetitions
are often called trials, passes or iterations in the
literature. Similarly, the time interval is sometimes
called the trial length or pass length.
*Email: D.H.Owens@sheffield.ac.uk
ISSN 00207179 print/ISSN 13665820 online
2012 Taylor & Francis
http://dx.doi.org/10.1080/00207179.2012.673136
http://www.tandfonline.com
1011
1012
D.H. Owens
x0 x0
limk!1 kuk u k 0
1013
eTk k , eTk k
1, . . . , eTk NT
2R
mN1k
CAN2 B
Alternatively, if k* 1,
2
CAk 1 B
0
0
6
k
k
1
6 CA B CA
B
0
6
6
CAk 1 B CAk B CAk 1 B
G6
6
6
..
..
..
6
.
.
.
4
CAN1 B CAN2 B
..
.
0
..
.
CAk
3
7
7
7
7
7
7
7
7
5
1
zk Gz CAk zI A1 B D0
B
7
1014
D.H. Owens
11
13
14
eTk GKek
wk1 kGKek k2
15
w 1 0, w 2 0
w 1 w 2 4 0
16
1015
17
20
23
p
(where kek denotes the Euclidean norm eT e of e).
Solving for k1ek then gives the unique POILC
control update law
18
19
22
1
T
2 1
T
uk1 uk I kek k2 W1
k1 G QG kek k Wk1 G Qek
24
This equation applies for all choices of weight
sequences {Wk1}k0. However, with the special
choice of Wk1 Rkekk2, this formula is precisely
similar to that obtained from NOILC. This proves the
following main theorem.
Theorem 1: The MIMO POILC algorithm defined by
the update law
uk1 uk k1 ek ,
ek r yk , k 0
25
1016
D.H. Owens
26
27
Proof: First, by substitution, observe that this expression is a solution of the stationary point condition for
the objective function. Uniqueness follows from the
strict convexity of the objective function and the linear
constraints.
Note that:
(1) For practical purposes, this theorem indicates
that any MIMO NOILC problem is a special
case of an MIMO POILC approach obtained
by a specific choice of weight matrices and
control law parameterisation. In this sense,
there is nothing special about NOILC other
than its potentially useful realisation in terms of
Riccati state feedback and off-line feedforward
computations. If state feedback is not acceptable, then NOILC is just an example of
POILC but one that may not be appropriate
for implementation.
(2) It is interesting to note that the fact that Wk1
is necessarily quadratic in the Euclidean norm
kekk provides some justification of the empirically suggested choice of scalar weight wk1
w 1 w 2 kek k2 proposed by Owens and Feng
(2003) for SISO POILC.
In the next section these results are used to motivate
and construct a wide range of POILC algorithms
capable of approximating a given NOILC algorithm.
opening up and motivating the possibility of generalising known SISO POILC concepts to the MIMO
POILC case by algorithm constructions that approximate a given MIMO NOILC problem. More precisely, consider the construction of a general approach
to approximating a given MIMO NOILC problem
(with given Q and R weight matrices) by an MIMO
POILC problem defined by Q, R and a chosen control
law parameterisation. This is done by exploiting the
structure of the POILC problem that reproduces
NOILC, but making possible a considerable reduction
in the (typically very large) number m(N 1 k*)2 of
free parameters. The formulation is linked to but
extends other approaches to POILC such as Owens
et al. (2008) and Owens and Feng (2003) to a new
general approach to the MIMO multi-parameter case.
but write
k1 k1
29
30
p
X
j j
31
j1
34
36
37
38
39
1017
R1GTQ(I GR1GTQ)1ek.
k0
40
1018
k1
D.H. Owens
equations as follows:
1
ek
ek
ek
T
T
WM
M
QM
kek k
kek k
kek k
ek
, k 0:
41
Q
kek k
The approach taken here, however, to ameliorate the problem in what follows, is to replace
Wk1 by
Wk1 ! W1 W2 kek k2
42
In particular,
(1) If E1 0 then limk!1 kekk 0 and the tracking
error converges to zero asymptotically.
(2) If E1 0 and w 4 0, then limk!1 kekk 0 and
the parameter vector sequence {k }k1 satisfies
P
1
2
k1 kk k 5 1 i.e. limk!1
P k 0.2
(3) If E1 4 0 and w 0, then 1
k1 kk k 5 1 (i.e.
limk!1 k 0) and all cluster points of the error
vector sequence {ek}k0 are non-zero and lie in
the limit set
S1 fe: eT QMe 0g:
where
W1 W1 T wI,
W2 W
43
Wk1 2 kek k2 W
44
48
r d0 2 RG
49
50
8k 0
45
47
Tk1 MT
ek
ek
Q
kek k
kek k
52
k1
X
j1
46
k0
k!1
Tk1 W2 k1
k1
X
kj k2
j1
which
always true that
P1 T proves that it is P1
2
W
5
1.
Also
if
j j
j1 j
k1 kk1 k 5 1
w 4 0 so that, in this case, limj!1j 0.
To prove (3), note that E1 4 0 and the topological
equivalence of the two norms kek and kekQ, implies the
k1
X
Tj Wj j
j1
P1
k1
X
Tj Wj
j1
53
1019
which
gives
eTQM(e) eTQGF(e)
T
1 T
e QGR G Qe and hence the limit set S1
{e:
GTQe 0} Q1ker(GT) Q1R(G)?
where the orthogonal complement ? is taken
with respect to the natural Euclidean inner
product hx, yi xTy. It follows that S1 \
R(G) {0} so that convergence of the error to
zero is guaranteed provided that r d0 2 R(G).
More generally, we have the following theorem.
Theorem 5: S1 always contains the vector subspace
Q1ker(GT) Q1R(G)?.
Proof: The proof follows simply from the characterisation of S1. More precisely, if e 2 S1, then
eTQGF(e) 0, which is true for all e (independent of
the parameterisation F(e)) satisfying GTQe 0, i.e.
e 2 Q1ker(GT) Q1R(G)?.
Theorem 6: For the above MIMO POILC algorithm,
(1) A sufficient condition for convergence of the
error to zero error whenever r d0 2 R(G) is that
there exists a non-zero vector P
[ 1, . . . , p]T
j
such that the symmetric part of m
is
j1 j QG
sign definite on R(G).
(2) A sufficient condition for this is that
R0GTQ 2 span{( j)}1jm for some (N 1
k*) (N 1 k*) matrix R0 with positive
definite symmetric part.
Proof: From the definition of S1,Pif e 2 S1 \ R(G) is
j
arbitrary, then eT QFe eT m
j1 j QG e 0
which, under the conditions of the theorem, implies
that e 0 and hence S1 \ R(G) {0}. Hence limk!1
ek 0. Part two follows easily as the existence of the j
is guaranteed by the inclusion condition.
6. Examples of parameterisations
Using the results of the previous sections, the choice of
parameterisation is a design decision aimed at achieving desired ILC performance. The original SISO
POILC algorithm suggested by Owens and Feng is
the simplest such choice and has been well studied. The
more complex SISO inverse-model-based algorithm
(Harte et al. 2006) and gradient algorithm (Owens et al.
2008) have also been well studied with results on
convergence and comparative robustness conditions
rigorously derived. In the following subsections, a
number of alternative new, multi-parameter parameterisations are explored to indicate the richness of
1020
D.H. Owens
p
X
j
k1
j ek
54
j1
: :
7
7
7
7
7
7K
7
7
7
7
5
:
:
:
0
p
INp
55
1021
0
0
0
2
e p
31
1
e
B
6 2 7C
B
7C
6 e
B
7C
6
B
6 : 7C
B
7C
6
B
7C
6
Bwhere e Ke 6 : 7C:
B
7C
6
B
6 : 7C
B
7C
6
B
6 p1 7C
@
5A
4 e
p
e
57
u kj
j j
k1
ek ,
1 j p,
31
u 1
B
6 2 7C
B
7C
6 u
B
7C
6
B
6 : 7C
B
7C
6
B
7C
6
Bwhere u 6 : 7C
B
7C
6
B
6 : 7C
B
7C
6
B
6 p1 7C
@
5A
4 u
58
u p
The vector k1 is computed from the approximating
POILC problem with weighting matrix
W Wij ,
Wij tri T R j
59
60
1022
D.H. Owens
p
X
1 i, j p
65
Gj
61
j1
m
X
j
k1
j ,
j R1 GTj Q, 1 j p
j1
62
There are infinity of such partitions, all of which
generate a p-dimensional search space containing the
descent direction R1GTQek for kek2Q at e ek obtained
j
1, 1 j p. The best choice
by the choice of k1
of p and the Gj for a particular application is an open
question. For example, the partition of G could be
identified with a decomposition of the transfer function
matrix zk Gz by writing
zk Gz
p
X
Gj z
63
j1
zk Gz
n
X
j1
j
1 z1 pj
64
6
7
GTj 6
.. 7
6 ..
7
4.
.5
I 0 0 0
2
0
DTj
6 T T
6 Bj Cj
DTj
6
6 T T T
B A C
6
6 j j j
6
..
6
.
4
0
BTj CTj
2
3
7 0 0 Im
0 7
76 0 0 0 7
76
7
0 76 .
.. 7
76 .
7
. 5
4 .
.. 7
7
. 5 I 0 0
m
DTj
66
7. Numerical examples
To illustrate the potential of alternative parameterisations and motivate the future exploration of new
parameterisations to meet a variety of design criteria,
this section considers the application of the proposed
parallel gradient partition to SISO systems based on
transfer function decomposition. The transfer function
decomposition chosen in all cases is that of partial
fraction expansion of zk Gz.
Example 1: Consider the simple example of a
second-order SISO system
Gs
2
s2 3s 2
67
1023
101
NOILC
Partition
Gradient
NOILC
Partition
Gradient
100
lg(||ek||)
lg(||ek||)
100
101
101
10
20
40
60
80
102
100
20
40
60
80
100
Trials
5
4
3
2
1
uref(t)
Trials
0
1
2
3
4
5
0
3
Time(s)
Example 2: Now consider the example of the nonminimum-phase electromechanical experimental rig
described in Cai, Freeman, Lewin, and Rogers
(2006). The transfer function is
Gs
165:954 s
s4 21:5s3 170:28s2 368:52s 663:82
68
1024
D.H. Owens
100.7
100.6
lg(||ek||)
lg(||ek||)
NOILC
Partition
Gradient
NOILC
Partition
Gradient
100.5
10
100.4
0
20
40
60
80
100
0
Trials
20
40
60
80
100
Trials
5
4
3
2
uref(t)
1
0
1
2
3
4
5
0
3
Time(s)
8. Conclusions
This article has demonstrated the very strong
connection between the paradigms of NOILC
Acknowledgements
Thanks are due to Dr Bing Chu, currently with the School
of Electronics and Computer Science, University of
Southampton, for his help with the computational results.
References
Ahn, H.-S., Chen, Y., and Moore, K.L. (2007), Iterative
Learning Control: Brief Survey and Categorization, IEEE
Transactions on Systems, Man and Cybernetics, Part C, 37,
11091121.
Amann, N., Owens, D.H., and Rogers, E. (1996a), Iterative
Learning Control using Optimal Feedback and
Feedforward Actions, International Journal of Control,
65, 277293.
Amann, N., Owens, D.H., and Rogers, E. (1996b), Iterative
Learning Control for Discrete-time Systems with
1025