Professional Documents
Culture Documents
I. INTRODUCTIONAND
PROBLEM STATEMENT
Different choices of design parameters offer new possibilities for improvement of transient performance and reduction
of control effort. Particularly important are the nonlinear
damping terms, which, when compared to update law normalizations, lead to faster and better-damped transients without
an increase in control effort.
Problem statement: The control objective is to asymptotically track a reference signal yr(t) with the output y of the
olant
11. STATEESTIMATION
xn = -uoy
Y = x1
+ bou
SYSTEMS
739
111. AN INTRODUCTORYEXAMPLE
+ ( k - U ) Y + b~
y = eTx
(2.2)
1, ,I:[
[
1,
-kl
k=
Ao=
-ICn
an-1
a=
i
a0
o.-*o
kn
Our recursive design procedure employs integrator backstepping in the observer [8] combined with tuning functions
[lo]. In that sense, it removes the overparameterization employed in [9], where the results of [7] and [8] were combined
to design adaptive output-feedback controllers for nonlinear
systems. For readers unfamiliar with these references, the
procedure is introduced by designing an adaptive controller
for the plant y(s) = (l/s(s - a ) ) u ( s ) ,represented as
XI = 2 2
[io],
bm
6=
b=
X,
+ a21
(3.1)
=U
[(p-i)xl].
(2.3)
is measured. For simplicity
where a is unknown, and y =
21
we let y,=const.
The observer filters (2.6) and the corresponding [ and v
variables are implemented as
6 = Aorl+ ezy,
(1 = A o ~ ,
1 2 = - A i v (3.2)
05i 5n -1
Abe, = en-;,
-kl
1
v = A,
i= A ~ A ezu,
=
(3.3)
Are, = -k,
(2.4)
so that the polynomial functions A ( . ) and B(.) from (1.1)
From (2.8), using ii as an estimate of a, the unmeasured state
satisfy
2 2 is expressed as
A(Ao)en = a - k
[+
By filtering U
B(Ao)en = b.
and y with two n-dimensional filters
6= Aov + eny
i= AoX + enu
,I.
(2.5)
(2.6)
B(Ao)X - A(Ao)v.
(2.7)
i=O
i=O
= -A;
ti = Abv
vi = AbX
in
+
+
+
= AoJn
[;= Ao& e n - ; y ,
i~;= Aov; en-+
+ a1 +
J2,2
+ 2 + iiw + ( a - ii)w,
w = G, 2
in the absence of
z2
+Y
(3.6)
It is important to point out that these expressions are implemented as algebraic identities, along with filter equations
(2.6).
In the adaptive controller design the unknown parameters
ao, . . . ,an-1 and bo, . . . ,bm, appearing in the state estimation
equation (2.Q will be replaced by their estimates.
740
=U
- IC2211 - &I
(3.9)
Now the actual control U is available and for it we design the
control law
+ +
+ +
(4.1)
z1 = 5 2 - an-ly - yr.
(3.10)
where U, is yet to be determined. The nonlinear damping
term -d2(dal/dy)2z2 was introduced in [8] to counteract
the destablizing effect of (dal/dy)cz. To choose U, and the
update law for iL we consider the Lyapunov function
v
2
= VI
=
1
-2;
where
t(2)
1
+ 21 + -TP,E
d2
+ "(2)b+ 2
(4.2)
= [Sn-1,2,..-,t0,2],
"(2) = ["m,2,~..,"0,21.
(4.3)
-22"
+ -22" +
1
-(a
27
- &)Z
+-
j2)p p o c .
Its derivative for the system (3.8), (3.9) with the control (3.10)
is
21 = &2l,- t ( 2 ) U
+ "(a)$
- an-1y - yr
2.
(4.4)
Observe that the last two terms in I& are indefinite. To cancel
them, we select the actual update law
22
= "m, 2 - Q1
(4.8)
= bmza
+ b m a l + In, + GTe - yr + 2.
2
(4.9)
msnC. et al.:
741
where
= -CIZ~
i1
O ( )= f
- d l z l + bmzz
+ bm{al+
+ pTe}+
P ( C I ~ I +d l z l +
Cn, 2
- &)
(4.10)
.,
+ 34 + . + 2.
-E;
(4.18)
=~
+ b m z ~+ bm(al+ @CP)
+ bm(p - 5 ) ~
+ZT(O - 8) + 2 - dlzl. (4.12)
-.:
1
2
1
+ -(e
2
- 8)Tr-1(e
um,z
- 4)
1
+ -eTPot
(4.13)
dl
where PO is the positive definite solution of PoAo + ATP0 =
+ u ( p - 5)'
27
-I. To design
a1
+ b m ~ 1 +~ 2bmzl(al+ 5
Vi = -C~Z:
~ )
+ Ibml(P-2i)y(YSgn(bm)cpzl
+ (e - 8 ) T r - - 1 ( m l - e )
- dlz:
+~
-2%
(4.14)
dl
If U,,, were our actual control, we would have z2 s 0 and
we would eliminate p - p, 8 - 8 and b, from (4.14) with
the choices
1 - ~' e T2 .
= - $ ~ p = - C [ C ~ Z ~ dlzl
+ In,
2 - Gv + ZTb] (4.15)
2i = Y sgn (bm)pz1
and 0 =
(4.16)
T ~ where
,
7-1
=mz1.
(4.17)
(4.20)
+ bmzz + GT(@- 8)
+ b m d p - $1 + 2
i z= z3 + a2 + p, i = - C I Z ~ - dlzl
(4.21)
1
1
v, = v, + -22"
+ --TPO.
2
dz
(4.22)
742
The derivative of
V2
% L -c1z:
+ z z [ b m z l + 23 + a2 + ~2
acr,
- 22-2
ay
- -ET
d2
-O().
(4.23)
dl
we rewrite (4.23) as
e)
- Z2--2
ay
- --ET
dz
0 , . . . , O I T ) z 2 - -41
1
- -O()
dl
(4.28)
where pi encompasses all the known terms except for Vm, i + 1 .
We now treat w,,i+l as a virtual control, that is, introducing
the new variable zi+l =
- ai, we use a; to stabilize
the ( X I , . ,zi,Ij)-system
21
=-
~ 1 ~-1d l z l
+ bmzp + ZT(6 - e)
+ b m q ( p - ?j) + 2
= lZzl
aa1
- r--wz2
aY
+ Ien+lzl(wm,
- a1)
(4.25)
would yield
(4.29)
However, since z3 $ 0, we do not use e^ = 7 2 as an update law.
Instead, we retain 7 2 in (4.25) as our second tuning function
and a2 in (4.26) as our second stabilizing function. Upon the
(4.30)
where
743
k 1satisfies
i-1
e - 1
+ Z i Z i - 1 + (e - s ) T r - l ( T i - l
I
-&z?
- 8)
j=1
(4.35)
we would rewrite
(4.31)
as
i-1
c < - c c j z ; + zi
j=1
-xi~i~-
e)
-ccjz;
1
- --ET
di
I-1
+ zi
=-ccjz;
j=1
di
aai-1
C-n()
j=ldj
+ zi
j=1
- --ET
-z
i-
i-1
- --ET
di
1
C-n().
i-l
1
C-n()
4
i-l
j=l
(4.32)
j=ldj
(4.36)
I44
For
Zi+l
zp-1+
'U.
dffp-le
+ pp -
T
I
e=
Tp
(4.42)
dY
(4.38)
2
5j 5p
- 1 (4.43)
- ap-l as
dffp-l
T
e - -d f f p - l
+pp - -
dY
dY
2 - dffp-l
88
(4.39)
v, = vp-l+ -zp
1 2 + --ET&
1
2
- dp
+-d f f p - 1 W T 8
(a,)
dffp-1
as
zp - zp-l - pp
P-1
dffp-l
a8
dY
7p
-c z k
k=2
r7
dY
dffk-1
(4.45)
which yields2
dP
Vp=
-k{ +
cjz;
dj(zj%
+&
+ (.};,
~ 2 ) ~
j=1
(4.46)
j=1
+ p - 8)Tr-1(o
- e).
1
(4.40)
e 0.
745
bmZ2 + zT(e
- 8)
= bmzz wT(8 - 8) - e;+,v,,
+
= bmz2 + wT(O - 6 ) - ( b ,
= Smz2+ w T ( e - 8) + (b,
z(O - 8)
- Sm)(z~ ~ 1 )
- Sm)?iV
(a,)
+ +
-CI -
-Sm
dl
bm
-CZ
-d2
-1
(%)
- ( T Z ~ W -c3
+ u23w
-d3
(9
u24w
1 + ( ~ 3 4 ~
...
...
QpW
...
U3pW
V I 2
-Q4W
- 1 - (T34w
1
-UzpW
-c3pw
+ u p - 1 , pw
-1 - U , - ~ , ~ W -cp- dp(
*)
746
= 2 2 - an-1y
x 2 =23
Xp-1
1
x
- an-2y
- am+lY
(5.2)
i Ab<
Y =21
+ bbY
where z = 1x1, 5 2 ,
Cb E U Pbb
, E RP. The
,x p ,
eigenvalues of the m x m matrix Ab are the zeros of the
polynomial B ( s ) , which is Hurwitz by Assumption 1.1. Although the transformation bringing (2.1) into (5.2) depends on
the unknown plant parameters, this is not important because
for our stability analysis we only need to know that such a
transformation exists. Our task is to investigate the stability
of the trajectories along which the tracking error is zero. The
zero dynamics along these trajectories are governed by
&(o) = c(0).
(5.3)
( = Ab( 4-b b Z 1 ,
t(0) = 0.
(=
(5.4)
(5.8)
+,.
= Ao9,. + %Yr,
r)T(O)
= v(0)
(5.5)
6 = Aoij + e , z l ,
i j ( 0 ) = 0.
(5.9)
(5.6)
(5.10)
f:(fz; + -&TPoE
1
) + -(P
j=1
(5.7)
X , - si-'
z-
+k
+ +
l ~ ~ * -* ~ ki-1
K(s)
A(s)
B(S)Y'
(5.11)
+
U,,
- -el I; - &I;
2
IT
j=2
f:
j=1
= Xm+i
T
+ gm,
i
lIi<7h
2 1.
SYSTEMS
141
7i = AOV+ e3y,
52
= A&,
E3
=-Ah
(6.4)
+ +
+ +
+ +
(6.9)
t-cc
(5.15)
VI. A DESIGN
EXAMPLE
(6.10)
(6.11)
where a = 3 is considered to be unknown. The relative-degreethree design contains all the features of the general design
procedure. The control objective is to asymptotically track the
output of the reference model
To derive the adaptive controller resulting from our nonlinear design, the plant (6.1) is first rewritten in the state-space
form (2.1)
$1
=22
x2
= x3
x3
=U
+ ax1
y = 21.
(6.12)
Step 3:
23
= U3 - a2
(6.13)
(6.14)
748
Plant
1 Parameter update
1+aw
- ~ 2 - d 2 ( 9 )
-1 - ow
-c3 - d ~ ( % ) ~
(6.16)
73
where 0 = y ( d a l / d h ) ( d a z / d y ) . Note again the skewsymmetry of the off-diagonal entries and the stabilizing role
of the diagonal entries.
The block diagram in Fig. 1 shows that the overall structure
of the new adaptive system has the familiar form of the input
and output filters feeding into an estimatorkontroller block.
The fundamental difference, is however, that this block is
now a nonlinear controller. Whereas in traditional schemes
this block would be a certainty-equivalence linear controller,
the new three-step procedure produces the control law (6.15)
in which both parameter estimates and filter signals enter
nonlinearly.
S2
$=
s3
+ klS2 + kzs + k3 Y ( S )
(7.3)
52
+ els + 821
+ m1s + m2 y +
8os2
e3s
+ e4
s2+m1s+m2
(7.1)
(7.4)
K R S T I ~er
, al.: ADPATIVE CONTROLLERS FOR LINEAR SYSTEMS
?tacking error y
149
- g,
'h&ing error y
Parameter estimate -6
or7
- y,
Control U
Control U
;3\J" ip,
0
10
do) = 1
Y(0) = 0
Fig. 2. Simulation results for the indirect adaptive linear scheme with
y = 500 for y(0) = 0 (left) and y(0) = 1 (right). The nonzero initial
condition is misinterpreted as a parameter error by the parameter estimator,
leading to the deteriorationof both transient performance and control effort.
Adjustment of the indirect linear scheme : For a fair comparison, our first task was to adjust the design parameters of the
indirect scheme to achieve the best transient performance with
a prescribed control effort. The trade-off between transient
performance and control effort was examined for various initial
conditions. To reduce the transients due to the mismatch of
initial conditions, the initial condition of the reference model
output was set in all tests to be equal to the initial value of
the plant output. In spite of numerous attempts, no conclusive
guidelines were found for initialization of filters in the indirect
linear scheme. The simulation results shown in Figs. 2 4 for
y(0) = 0 and y(0) = 1are representative. The available design
constants were the adaptation gain y and the coefficients of the
observer polynomial s3 IC1 s2 lczs k3 and of the controller
polynomial s2 m l s m2. After several attempts, all the
roots of the observer polynomial were placed at s = -2 with
kl = 6, IC2 = 12, k3 = 8, while the roots of the controller
polynomial were placed in a Butterworth configuration of
radius 3 with ml = 4.2426, m2 = 9. These were judged
to yield the best trade-off between transient performance and
control effort for different initial conditions. A final choice to
be made was that of the adaptation gain. Figs. 2-4 present
simulations results for three different values of that gain:
y = 500 (Fig. 2), y = 1000 (Fig. 3), and y = 2000
(Fig. 4). In each of these figures we show two simulation
runs, one with y(0) = q ( 0 ) = 0 (left) and one with
y(0) = ~ 1 ( 0 =
) 1 (right), and for each run we show the
tracking error y - yr (top), the parameter estimate ii (middle),
and the control effort U (bottom). From these three figures, it is
evident that the effect of the adaptation gain on the transient
performance and the control effort is very different for the
two sets of initial conditions. For y(0) = T ~ ( O =
) 0, both the
performance and the control effort improve with increasing
adaptation gain, while for y(0) = q ( 0 ) = 1 they both
+
+
+ +
-I I
o - -
-4 R
-- 0
Control U
750
?tacking error y - y,
Parameter estimate -8
I
0
Control U
Tracking error y - y.
Xndrect linear
Id
Nonlinear
Parameter estimate -B
Control U
~l!!.!k
-l!c
-lW
-2000
Indirect linear
-2w0
Nonlinear
Fig. 7. The lower effective adaptation gain of the nonlinear scheme makes
its parameter estimator less erratic when y(0) = 1. The benefits are evident
not only in the transient performance (Fig. 5), but also in the control effort.
KRSTIC,
et ai.: ADPATIVE
75 1
-1-
-L
-L
-MI;,
*I 4
0
Parameter estimate -ii
Parmeter estimate -i
+ -+
0
+
0
f-,
controlu~lp,
-1sw
-1oW
0
dl = d2 = ds = 0.001
0
dl
= dz = ds = 0.2
0
dl
= di = d3 = 0.6
initialized
non-initialized
= 1.
Filter initialization : In contrast to the indirect scheme, the @e Lyapunov function which are not initialized to zero are
new nonlinear scheme provides clear guidelines for filter and 8 - 8, @ - p and E, that is, only those which are not known
reference model initialization, which follow from the design or not measured.
objective of driving the z-variables to zero. According to (6.7),
VHI. CONCLUDING REMARKS
(6.10), and (6.13), the initial values of z-variables are set
to zero by choosing q ( 0 ) = y(O), wz(0) = al(O),~ ( 0 =
)
The results of this paper show that recently developed
az(0).In general, it is always possible to set zl(0) = zz(0) = tools for adaptive nonlinear control [7]-[ll] can be used to
... = z p ( 0 ) = 0 by either one of the two methods:
design adaptive controllers for linear systems, which promise
1) Setting z ( 0 ) = 0 by initializing the X-jilter. Choos- to outperform existing schemes. A particularly significant new
ing y T ( 0 ) = y(O), we set ~ ( 0 ) = 0. Now, we property is the possibility to improve transient performance
perform the initializatipn using (5.14). For given without an increase in control effort. This is achieved by nonlinear damping, which attenuates the effect of initial parameter
d o ) ,Y T ( O ) , YT(O),
V(0L Xl(O), . . . 1 &TI + i-1
(0) we set z;(O) = 0 by choosing X,+i(O)
= errors so that fast parameter convergence is not required for
-gz,i[X1(0),.-.
,Xm+i--1(O)IT+~i-1It=O,
for i = good transient performance. The proof of stability is direct
and reveals that the states of the adaptive system converge
2 , . .* ,p.
2 ) Setting z ( 0 ) = 0 by initializing the reference model. to a manifold whose dimension is the smallest possible. The
Again, by choosing y T ( 0 ) = y(O), we set q ( 0 ) = 0. robustness of the new class of adaptive controllers is a topic
Now, examining the expressions for ai, we note that of current research.
(aal/a&)= *
= ( ~ a , - ~ / ~ y $ - ) ) = fi and zi=
REFERENCES
(i--2)
w m , i+ai-l =gi(y,yT, . . . , yr
,$, e , ~ ~
, ) + k ? - [l]) K.
. J. Astrom and B. Wittenmark, Adaptive Control. New York
It is reasonable to take $(O) $ 0, because it reflects
Addison-Wesley, 1989.
[2] K. S. Narendra and A. M. Annaswamy, Stable Adaptive Systems.
the fact that the high-frequencyAgainb, is finite. Now,
Englewood Cliffs, NJ: Prentice-Hall, 1989.
for a given Y(O), Y T ( O ) ,
fl(O), V ( O ) , A@), we set
[3] S. S. Sastry and M. Bodson, Adaptive Control: Stability, Convergence
and Robustness. Englewood Cliffs, NJ: Prentice-Hall, 1989.
z;(O) = 0 by choosing y?-)(O) = - ( l / $ ( O ) ) & l t = ~ ,
[4] G. Kreisselmeier, Adaptive observers with exponential rate of converfori = 2 , . - . , p .
gence, IEEE Trans. Automat. Contr., vol. 22, pp. 2-8, 1977.
[5] G. C. Goodwin and D.Q. Mayne, A parameter estimation perspective
One simple way of setting z ( 0 ) = 0, which fits both cases
of continuous time model reference adaptive control,Automaticu, vol.
a) and b) above, is to set y,(O) = y(0) and all remaining filter
23, pp. 57-70, 1987.
[6] R. H. Middleton, Indirectcontinuous time adaptive control,Automatinitial conditions to zero: y?)(O) = 0, i = 1 , . . . , p-1, X(0) =
ica, vol. 23, pp. 793-795, 1987.
q(0) = 0.
[7] I. Kanellakopoulos, P. V. Kokotovic, and A. S. Morse, Systematic
In all tests, filter initialization was found to significantly
design of adaptive controllers for feedback linearizable systems, IEEE
Truns. Automat. Contr., vol. 36, pp. 1241-1253, 1991.
improve both the transisent performance and the control effort.
[8] -,A
toolkit for nonlinear feedback design, Sysr. Contr. Lett., vol.
A typical example is Fig. 9, where y,(O) = y(O), &(O) =
18, pp. 83-92, 1992.
yJ0) = 0, X(0) = ~ ( 0 =
) 0 have set z ( 0 ) = 0.
[9] I. Kanellakopoulos,P. V. Kokotovic, and A. S. Morse, Adaptiveoutputfeedback control of a class of nonlinear system, in P m . 30th IEEE
An explanation for the improvement of performances due
Con$ Dec. Contr., Brighton, U K , 1991, pp. 1082-1087.
to initialization is that by setting z l ( 0 ) = z z ( 0 ) = ... = [lo] M.
KrstiC, I. Kanellakopoulos, and P. V. Kokotovic, Adaptivenonlinear
z p ( 0 ) = 0, we reduce the initial value of the Lyapunov
control without overparametrization,Syst. Contr. Lett., vol. 19, pp.
177-185, 1992.
function (5.7), that is, we reduce the initial deviation from
[ l l ] R. Marino and P. Tomei, Global adaptive output-feedback control of
the globally attractive manifold M. Recall from (5.4)-(5.6)
nonlinearsystems, Part I: Linear parametrization,IEEE Trans.Automat.
that ij(0) = 0 and i(0) = 0. Thus, the only variables in
Contr., vol. 38, pp. 17-32, 1993.
a m flm
752
Miroslav Krstif (S92) was born in Pirot, Yugoslavia in 1964. He received the B.S. degree in
electrical engineering from the University of Belgrade, Yugoslavia, in 1989, and the M.S. degree
in electrical and computer engineering from the
University of California, Santa Barbara, CA, in
1992.
From 1989 to 1991 he was an Assistant at the
University of Belgrade, Department of Mining. In
1991-92 he was a Teaching Assistant, and since
1992 he has been a Research Assistant at University
of California, Santa Barbara, where he is currently working towards the Ph.D.
degree. His research interests include adaptive and nonlinear control.
He received the 1993 IEEE CDC Best Student Paper Award for the paper
Adaptive Nonlinear Control with Nonlinear Swapping coauthored with
Professor P. V. KokotoviC. In 1993 he was also a recipient of the UCSB
Engineering Fellowship.