You are on page 1of 5

Presented at The American Control Conference, San Fransisco, CA.

June 1993

Adaptive Control of a Flexible-Link Robotic Manipulator


with Unknown Payload Dynamics
Lawrence J. Alder Stephen M. Rock y
Stanford University Aerospace Robotics Laboratory
Stanford, California 94305

Abstract practical to out t the payload with sensors that measure


the internal state of the payload. The sensory input for
For many applications, exible-link robot arms may controlling the robot-payload system will be based on the
handle payloads that are not simple rigid bodies. In space robot system sensors only1 .
applications, the RMS (remote manipulator system) will The control approach that is developed and demon-
be manipulating satellites that may contain fuel or have strated experimentally in this paper is based on exten-
exible appendages. High-performance control schemes sions to the self-tuning regulator solution of adaptive con-
for exible-link manipulators require end-point feedback. trol. The approach is unique in that (1) it exploits the
If the payload dynamics are not accounted for in the con- closed-loop dynamic characteristics of the class of sys-
trol design, degraded performance or instability are pos- tems being investigated in order to solve the persistent
sible [1]. This paper presents technology that extends excitation problem and (2) it incorporates the rst exper-
the concept of end-point control to handle payloads with imental, on-line demonstration of a new eigenvalue iden-
unknown internal dynamics. ti cation technique. The result is a feasible, real-time
High-performance control is merged with an innova- control that potentially can be implemented in future
tive identi cation algorithm in a self-tuning regulator ap- space robotic systems.
proach. The identi cation of the payload is done using
recently developed subspace- tting techniques. These
techniques allow real-time determination of the order of 2 Experimental Apparatus
the payload dynamics. Sucient excitation problems are
addressed by performing the identi cation closed loop. Figure 1 shows a schematic diagram of the hardware.
Experimental results are presented. The experiment represents a large spaced-based manip-
ulator holding a payload that has internal vibrating dy-
1 Introduction namics. The robot arm is a exible beam which moves
in the horizontal plane. At one end of the beam, there
is a motor, and on the other end there is the payload
Space-based robots such as the shuttle Remote Ma- which is a pad that oats on an airbearing. The pad
nipulator System (RMS) and the proposed Space Sta- and air bearing prevent out of plane vibrations. Floating
tion Remote Manipulator System (SSRMS) have been the pad on a smooth granite table simulates the zero-g
and will be essential elements of future space explo- environment of space in the horizontal plane. The pad
ration. These manipulators are long, light-weight, and has a pendulum on it that represents the dynamics of the
are required to manipulate massive payloads. As a payload. The pendulum can be locked in place resulting
consequence, structural exibility in the links is signif- in a rigid-body payload, or it be left free to oscillate pro-
icant. Further, the payloads manipulated by these large, ducing a dynamic payload. Additionally, the pendulum
exible-link robots may themselves include unknown in- length can be manually adjusted; hence, the natural fre-
ternal dynamics: for example sloshing fuel or exible ap- quency of the pendulum is an unknown property of the
pendages (e.g., vibrating solar panels on a small satel- payload.
lite). The sensed signals available for feedback are the hub
The goal of this research is to develop control tech- angle, the inertial position of the payload, and inertial
niques that provide precise high-bandwidth end-point orientation of the payload. The pendulum angle is also
control of exible-link manipulators, while simultane- sensed but is not used for feedback or identi cation, since
ously damping any internal oscillations of the payload. it would not be available on deployed systems. The pen-
The internal dynamics of the payload will not be known dulum has a time constant of 30 seconds corresponding
apriori. Furthermore, it is assumed that it will not be to a natural damping ratio of less than 0.3%: the open-
 Ph.D., Dept of Aeronautics & Astronautics.
loop response of the pendulum is shown in Figure 2. See
y Assoc. Prof., Dept. of Aeronautics & Astronautics. 1
This includes vision sensing.
references [2] and [3] for a more detailed description of Control Parameter
the hardware. Redesign Identification

Granite Table Payload


Command
Y
Controller Process
U
Air Bearing
Flex Arm

Figure 3: Self-Tuning Regulator

Hub Motor This paper focuses on the identi cation aspects of the
self-tuning regulator. It is assumed that once the plant
has been identi ed correctly an appropriate controller
can be designed.
Figure 1: Experimental System
Identi cation Approach
In the general case, nothing about the payload is known
50 Pendulum Angle
apriori; however, for this research the structure of the
payload model is assumed known. Speci cally, in the
experiment the payload can either be a rigid body, or
degrees

0
it can include a single pendulum of unknown length. A
consequence of this is that the unknown properties of
-50 the payload can be constructed from the exact open-loop
0 5 10 15
seconds
20 25 30
eigenvalues of the system (payload plus arm). Thus, the
payload dynamics can be determined from identi cation
of eigenvalues only.
Figure 2: Pendulum Open-Loop Response To pursue this solution, an algorithm was developed
that is capable of identifying lightly damped eigenvalues
of a system (all of the open-loop eigenvalues of the ex-
3 Adaptive Algorithm perimental system are lightly damped). The algorithm is
described in Appendix A and does not require knowledge
of the system order.
Schmitz [4] surveyed control design approaches appli- A shortcoming of this approach to identifying the pay-
cable to high-order, lightly-damped structures typical of load is that all of the open-loop system modes must be
the class of systems dealt with here. Two conclusions identi ed, which requires all of the modes to be excited.
are drawn from that work. First, the use of end-point If all of the modes are not identi ed, then it may not be
feedback is critical for high-performance end-point con- possible to distinguish a exible mode of the arm from
trol of a exible arm. The second conclusion is that linear that of the payload oscillations. Since the payload may be
quadratic gaussian (LQG) synthesis tools are e ective in delicate, injecting excitation to guarantee that all modes
generating control systems that exploit end-point feed- of the arm are excited is unacceptable. To solve this
back. problem, a nominal controller is used, and the identi ca-
Extending the techniques of Schmitz to account for tion is performed closed loop. The task of the nominal
dynamic payloads, it is possible to accurately position controller is to damp the structural exibility of the arm
the payload while simultaneously damping internal vi- so that the arm dynamics are not mistaken for the lightly
brations of the payload [3]. The results show that high- damped oscillatory dynamics of the payload.
performance control is possible, but only if an accurate For the experimental system, a nominal controller is
model of the total system is available. designed using LQG synthesis techniques assuming that
In this paper, an adaptive control algorithm will be the payload is a rigid body. Figure 4 shows how the
presented that merges the LQG controllers with a iden- nominal controller performs when the payload is dy-
ti cation procedure that is able to provide the model - namic. The nominal controller yields a set of well-
delity that is required. The approach presented is based damped closed-loop system modes that have eigenval-
on extensions to the self-tuning-regulator (STR) class of ues that are relatively insensitive to the presence of pay-
adaptive control. The concept of a self-tuning regulator load dynamics. Further, if the payload is dynamic, a
is presented in Figure 3. It demonstrates the combina- lightly damped or slightly unstable mode appears that
tion of two functions. First, an identi cation algorithm is has an eigenvalue that is highly sensitive to the length
used to update the equations of motion of the plant. Sec- of the pendulum. Since the eigenvalue of the lightly
ond, a control design algorithm (either gain calculation damped mode appears in a region of the s-plane that
or scheduling) is used to modify the control law. is remote from the eigenvalues of the nominal set of con-
trolled modes, the existence of payload dynamics can be the pendulum is locked or it is not excited; in either case,
detected by determining the presence or absence of modal it does not need to be controlled. If the pendulum be-
content of the closed-loop system in a \critical region" of comes excited, the pole will be detected in the critical
the s-plane only. When a lightly damped or slightly un- region of the s-plane.
stable closed-loop mode is detected, it must be associated
with payload dynamics. Thus, only natural excitation is
required to detect and identify the payload dynamics. U Y
CONTROLLER
PROCESS
NOMINAL
60
x.
x.
50
Eigenvalue ID

40
x.
imag(s)

30 .x
X
x. X Due to Pendulum
2.5 Hz
20 X
2.3 Hz
..
. s-plane
. .
10 x x .. . . . ..
1.2 Hz

.. . .. x x...
x.. Figure 5: Closed-Loop Identi cation
0
-20 -15 -10 -5 0 5 The gure shows the eigenvalue identi cation
real(s) routine reporting the closed-loop eigenvalues
of the system. Performing the identi cation
Figure 4: Critical Region of S -Plane closed-loop allows the poles that are sensitive to
the payload dynamics to be isolated. A mode
This gure shows the locus of closed-loop roots in the critical region (dotted trapezoid) signals
of the system versus the open-loop frequency the presence of payload dynamics.
of the payload. The controller is designed as-
suming the payload is a rigid body. The x's are
the location of the closed-loop roots when the Once a pole is located in the critical region, its ex-
payload is rigid. The lines show how the closed- act location can be used to identify the unknown pa-
loop roots move as the open-loop frequency of rameter (in this case, the length of the pendulum). For
the payload is varied from 2.5 to 1.2 Hz. The the experimental system under nominal control, the lo-
open-loop frequency of the payload is the fre- cation of the lightly damped mode is directly related to
quency of the pendulum when the payload is the length of the pendulum. The relation between the
detached from the arm and oating on the air- critical eigenvalue and the pendulum length can be plot-
bearing. The dotted region in the s-plane is the ted (see Figure 4) and inverted graphically. Thus, once
critical region. Once a pole is detected in the the location of the pole is determined, the identi cation
critical region, the pendulum is known to exist, of the pendulum length is accomplished using a straight-
and its frequency can be determined. forward table look-up. Then, an appropriate controller is
designed based on the identi cation of the payload. The
The identi cation procedure that is applied to the ex- new controller is then swapped into the control loop and
perimental system is summarized in Figure 5. The pro- damps the oscillations of the payload.
cedure is split into detection and identi cation phases. Since no identi cation of the pendulum length is per-
The detection is accomplished by evaluating the modal formed until a pole is detected in the critical region, the
content (e.g., the eigenvalues) of the system while un- payload is only identi ed when sucient excitation ex-
der normal operation (closed-loop). The controller that ists naturally. There is no need to provide identi cation-
is running during the detection phase is the nominal speci c excitation.
controller|no identi cation-speci c inputs are required.
When evaluating the modal content, all of the closed-loop The performance of the adaptive logic depends heav-
eigenvalues might be found, but only the lightly damped ily on the ability of an identi cation algorithm to provide
poles within the critical region of the s-plane are of in- accurate estimates of the eigenvalues of the system. The
terest (see Figure 4). If a pole appears within the critical approach for accomplishing this utilizes a linear identi-
region, the payload is assumed to have internal dynamics. cation algorithm called ESPRIT which is described in
If a pole is not detected in the critical region, then either Appendix A.
4 Experimental Results 40
Payload Position
Actual
Desired
The result of two experimental tests are presented: the 20

disturbance response test, and the drop and slew test.

cm
0

The payload natural frequency is di erent for each test. Control Swap

The disturbance response test, shown in Figure 6,


-20

demonstrates the response of the adaptive algorithm to


-40
0 2 4 6 8 10 12 14 16 18 20

large unmodelled disturbances for both rigid and dy- seconds

namic payloads. The pendulum is initially locked hang- 50


Pendulum Angle

ing straight down. An impulsive disturbance is applied


to the tip of the manipulator, and the controller rejects

degrees
the disturbance. As expected, no adaptation takes place.
0 Pend.

This is in contrast to other adaptive algorithms that re-


Drop
Control Swap
spond unfavorably to unmodelled disturbances. The pen- -50

dulum is then released; however, since the pendulum is


0 2 4 6 8 10 12 14 16 18 20
seconds
not oscillating, there is no need to identify or control it.
A second impulsive disturbance is then applied to the tip
of the manipulator. This disturbance excites the pendu- Figure 7: Drop and Slew
lum enabling the adaptive algorithm to identify the pay-
load dynamics (natural frequency), select the proper con-
troller, swap it in, and then damp the pendulum while si-
multaneously regulating the payload position. Note that
neither the identi cation nor control algorithm is able the class of systems presented here, the identi cation is
to sense the pendulum angle directly even though it is reduced to determining the eigenvalues of the system un-
plotted. der nominal control. An eigenvalue identi cation rou-
The drop and slew test, shown in Figure 7, demon- tine based on ESPRIT is used to identify accurately any
strates adaptation during a slew. It starts with the pen- lightly damped modes that are indicative of the payload
dulum mechanically held at a 45 degree initial condition dynamics. Externally supplied excitation is not necessary
so that the payload is initially a rigid body. For the for payload identi cation: the approach does not identify
rst two seconds, the controller is regulating the pay- the payload dynamics unless they are excited naturally.
load. The pendulum is then dropped just as the robot Experiments show the response of the system in dis-
is commanded to slew the payload to a new position on turbance rejection and slew. In all cases, the adapta-
the table. The adaptive algorithm detects and controls tion only occurs when there is natural excitation of the
the pendulum oscillations while simultaneously tracking payload dynamics. If the pendulum is not excited, the
the slew command. controller does not need to account for it. Once the pen-
dulum is excited, the algorithm rst detects it and then
Payload Position controls it. This adaptive approach has proven successful
20
Disturbance 1 Disturbance 2 in experimentally demonstrating precise high-bandwidth
10
end-point control while simultaneously damping internal
0
oscillations of a payload with unknown internal dynam-
cm

-10 Control Swap


ics.
-20

A Eigenvalue Identi cation


-30
0 2 4 6 8 10 12 14 16 18 20
seconds

Pendulum Angle
40
Distrubance 1 Disturbance 2
20
This appendix describes the subspace- tting technique
called ESPRIT. Reference [5] presents the algorithm in
degrees

0
Pend. Pend. the context of direction of arrival determination for sig-
nal processing. The algorithm is presented here in the
-20 LOCKED UNLOCKED Control Swap

-40
0 2 4 6 8 10 12 14 16 18 20 context of linear system identi cation.
seconds
The subspace- tting technique is inherently digital.
Suppose that the system is linear time-invariant and is
Figure 6: Disturbance Response described in state-space form by
xk+1 = Axk + Buk (1)
5 Conclusions yk = Cxk + Duk (2)
where xk is the state of the linear system at time k, uk is
In this research, a new approach to identi cation is the input to the linear system, and yk is the observed
developed that utilizes a nominal controller in order to output (xk 2 IRn1 , uk 2 IRq1 , and yk 2 IRm1 ).
isolate the dynamic e ects related to the uncertain pa- Since the states are not measured, the system matrices
rameters from the other known system dynamics. For are identi able only to within an arbitrary non-singular
state (similarity) transformation. The identi ability of shift invariant property. Compute the singular value de-
the system matrices is also limited by the excitation of composition of YU? :
the system from the inputs or initial conditions.
The algorithm requires a window of data. The length YU? = PQT : (10)
of the data, N, must be selected. A sliding window length Let Ps be the matrix containing the rst n left singular
size M  N/2 must also be chosen. vectors of YU? . It can be shown from Equation 9 and 10
The rst step in the procedure is to take the input and that there exists a full rank n  n matrix T such that
output data and form the Hankel matrices
2 y y2  yN ,M 3 Ps = ,T : (11)
1 +1
Expanding the rows of this equation yields
Y = 4    y3
6 y 2 

yN ,M+2

75 (3)
2 P 3 2 C 3 2 C 3
yM yM +1  yN
C A2
0
2 u u2  uN ,M 3 6 P 7 6 CA2 7 6 7
T = 66 A.

6 1 7 6 CA 7 6 C 7
6 u
1
u3  uN ,M
+1
75 : (4) Ps = 66 P.2 7
7 =6 7 7 :
U = 4 4 .. 5 64 ... 7
5 4 .. 7
5
2 +2
   
uM uM +1  uN PM , 1 CAM , 1
C A M , 1

(12)
Thus,  that is the state-transition
Hence, there is a matrix A
matrix of the system in some coordinate frame such that
Y = ,X + HU (5) the second row of Ps can be found by multiplying the
where rst row by A  ; etc. Ps is , in some new coordinate
system.
To nd A , form
, = [ C (CA)    (CAM , ) ] (6)
T T T 1 T

X = [2x x    xN ,M ]
1 2 +1
3
(7) Ps  Ps with last row deleted (13)
D
1
0  0 Ps  Ps with first row deleted (14)
66 CB D  0 7 2

H = 64 CAB CB    0 775 : (8) and observe that


   
CAM , B CAM , B    D
2 3 Ps A  Ps :
1 2 (15)
The approximation is due to the fact Ps and Ps are
Thus Y 2 IRmM  N ,M and U 2 IRqM  N ,M ; , 2
( +1) ( +1)  is computed by solving the
created from noisy data. A
1 2

IRmM n ; X 2 IRn N ,M , and H 2 IRmM Mq . Note


( +1)
least squares problem or more accurately the total least
that U and Y are the only matrices that are formed from squares problem (see [5]) that is posed by Equation 15.
measured data. From A  the eigenvalues of the system can be found.
The second step of the procedure is to nd a matrix
U? such that UU? = 0. The matrix U? can be found? References
from the singular value decomposition of U. Once U [1] Daniel M. Rovner and Robert H. Cannon, Jr. Ex-
has been calculated, YU? can be formed from Equa- periments toward on-line identi cation and control
tion 5. Thus, of a very exible one-link manipulator. Interna-
tional Journal of Robotics Research, 6(4):3{19, Win-
YU? = ,XU? : (9) ter 1987.
Note that the column space of YU is contained in
? [2] L. J. Alder and S. M. Rock. Control of a exi-
the column space of ,. The matrix , has n columns. ble robotic manipulator with unknown payload dy-
Thus, performing a singular value decomposition of YU? namics: Initial experiments. In Proceedings of the
should yield n or less non-zero singular values. However, ASME Winter Annual Meeting, Atlanta GA, Decem-
since the data has noise, the last mM , n singular values ber 1991.
will not be exactly zero but will be signi cantly smaller [3] Lawrence Alder. Control of a Flexible-Link Robotic
than the rst n. The large drop in the singular values Arm Manipulating An Unknown Dynamic Payload.
of YU? determines the number of excited poles of the PhD thesis, Stanford University, Stanford, CA 94305,
system, n. February 1993.
Once the order of the system has been determined, the [4] Robert H. Cannon, Jr. and Eric Schmitz. Initial ex-
next step is to nd the A matrix that is of the proper periments on the end-point control of a exible one-
order that best ts the data. To nd A, note that the ob- link robot. The International Journal of Robotics Re-
servability matrix , possess a shift invariant structure. search, 3(3):62{75, Fall 1984.
This means that the second row of , can be found by
multiplying the rst row by A, and the third row can [5] R. H. Roy. ESPRIT { Estimation of Signal Pa-
be found by multiplying the second row by A, and so rameters via Rotational Invariance Techniques. PhD
on. However, , is unknown. Fortunately, , can be com- thesis, Stanford University, Stanford, CA., August
puted in a di erent coordinate system maintaining the 1987.

You might also like