A Method of Trajectory Optimization by Fast-Time Repetitive Computations

NASA TECHNICAL NOTE
F_-
N A S A TN D-3404
d -
I
W
t-'
t-'
s. gi
I
A METHOD OF TRAJECTORY OPTIMIZATION BY FAST-TIME REPETITIVE COMPUTATIONS

by Rodney C. Wingrove,James S. Ruby, and D. Francis Ames Keseurch Center Moffett Field, Cali$
'I
1'.
: .
.I
.,
.y
NATIONAL AERONAUTICS A N D SPACE ADMINISTRATION
WASHINGTON,
05;. .:;;
0;.
,A.P.R./~ i s a s
1'
- 9
$!
TECH LIBRARY KAFB, NM
IIlil11 1lulI 11111I l 11 0130190
NASA TN D-3404
A METHOD O F TRAJECTORY OPTIMIZATION BY FAST-TIME REPETITIVE COMPUTATIONS By Rodney C. Wingrove, J a m e s S. Raby, and D. F r a n c i s C r a n e Ame s Research Center Moffett F i e l d , Calif.
NATIONAL AERONAUTICS AND SPACE ADMINISTRATION

For sale
by the Clearinghouse for Federal Scientific and Technical Information Springfield, Virginia 22151 Price $0.30
A METHOD OF TRAJECTORY OPTIMIZATION BY FAST-TIME
RFPETITIVE COMmTTATIONS By Rodney C . Wingrove, James S. Raby, and D. F r a n c i s Crane Ames Research Center S U M M A R Y This r e p o r t p r e s e n t s a p e r t u r b a t i o n method of computing o p t i m t r a j e c t o r i e s wherein c o n t r o l impulse response f u n c t i o n s are determined by fast-time r e p e t i t i v e computations of t h e s t a t e e q u a t i o n s . This method does not r e q u i r e t h e s o l u t i o n of t h e a u x i l i a r y set of a d j o i n t e q u a t i o n s u s e d w i t h o t h e r p e r t u r b a t i o n methods. The mechanization of t h i s computing method on a h y b r i d computer i s d i s cussed and an a p p l i c a t i o n t o t h e s t e e p e s t descent o p t i m i z a t i o n of r e e n t r y t r a j e c t o r i e s i s p r e s e n t e d . I n t h i s example, t h e v e h i c l e i s t o a r r i v e a t a d e s i r e d range, t h e h e a t i n p u t t o t h e v e h i c l e i s t o b e minimized, and t h e con trol i s t o remain w i t h i n s p e c i f i e d c o n s t r a i n t s . Gptii-mm t r a j e c t o r i e s f o r t h i s example could b e o b t a i n e d i n about 8 minutes of computing time.
INTRODUCTION
It i s important f o r space v e h i c l e t r a j e c t o r i e s t o b e n e a r optimum i n t h e sense t h a t some q u a n t i t y i s e i t h e r maximized or minimized. For example, i n r e e n t r y t h e t r a i e c t o r y t o d e s i r e d t e r m i n a l c o n d i t i o n s i s n e a r optimum when t h e t o t a l aerodynamic h e a t i n g i s a minimum. This r e p o r t w i l l consider a method f o r f i n d i n g t h e Lime h i s t J r i e s of n o n l i n e a r c o n t r o l s t h a t correspond t o optimum t r a . i e c t o r i e s . S e v e r a l p e r t u r b a t i o n methods, such as t h e c a l c u l u s of v a r i a t b n s , a p p l i c a t i o n s of Ihe maximum p r i n c i p l e , and d i r e c t s t e e p e s t descent, have been considered f o r s o l v i n g t h i s c o n t r o l o p t i m i z a t i o n problem. Reference 1 c o n t a i n s a good review of t h e s e v a r i o u s methods and r e f e r e n c e 2 g i v e s s e v e r a l analog and d i g i t a l computing techniques f o r implementing them.
I n p r i n c i p l e , each of t h e s e techniques should g i v e s a t i s f a c t o r y r e s u l t s , b u t it has been found t h a t f o r many t r a j e c t o r y problems t h e computer mechani z a t i o n s are cumbersome and r e q u i r e programs t h a t a r e d i f f i c u l t f o r e n g i n e e r s t o formulate. The computing method t o be r e p o r t e d h e r e i n w a s i n v e s t i g a t e d i n an attempt t o a l l e v i a t e t h e s e d i f f i c u l t i e s and t o provide a more d i r e c t way of computing o p t i m i z a t i o n s o l u t i o n s .
I n previous o p t i m i z a t i o n s t u d i e s u s i n g p e r t u r b a t i o n techniques t h e computations have involved t h e dynamic s o l u t i o n of two s e t s of e q u a t i o n s : (1) n o n l i n e a r s t a t e e q u a t i o n s and (2) l i n e a r a d j o i n t e q u a t i o n s . The method t o b e r e p o r t e d h e r e i n d i f f e r s i n t h a t o n l y t h e s o l u t i o n of t h e n o n l i n e a r s t a t e
e q u a t i o n s i s used. The response of given f u n c t i o n s ( e . g . , t e r m i n a l e r r o r o r q u a n t i t y t o b e optimized) t o a c o n t r o l impulse i s determined along t h e t r a j e c t o r y by fast-time r e p e t i t i v e computations r a t h e r t h a n by a s o l u t i o n of t h e a d j o i n t e q u a t i o n s . Since a u x i l i a r y a d j o i n t e q u a t i o n s are n o t needed, t h e i n v e s t i g a t o r should u n d e r s t a n d t h e o p t i m i z a t i o n p r o c e s s more e a s i l y ; a l s o t h e computer program should be s i m p l e r . However, t h i s new a l t e r n a t e computing method does r e q u i r e many s o l u t i o n s o f t h e s t a t e e q u a t i o n s . This t a s k of com p u t i n g a l a r g e number of dynamic s o l u t i o n s i s i d e a l l y s u i t e d t o high-speed r e p e t i t i v e h y b r i d computation as w i l l b e considered h e r e i n . This r e p o r t w i l l p r e s e n t one a p p l i c a t i o n o f t h i s computing technique; t h a t of t r a j e c t s r y o p t i m i z a t i o n u s i n g t h e s t e e p e s t d e s c e n t method ( r e f s . 3 and 4 ) . The mechanization of t h i s method on a h y b r i d computer w i l l be d i s c u s s e d and r e s u l t s w i l l be p r e s e n t e d t o i l l u s t r a t e t h e u s e of t h i s procedure i n t h e o p t i m i z a t i o n of r e e n t r y t r a j e c t o r i e s . For t h e i n t e r e s t e d r e a d e r appendix A i l l u s t r a t e s t h e r e l a t i o n s h i p of t h e impulse response f u n c t i o n s computed i n t h i s r e p o r t t o t h e s o l u t i o n s o b t a i n e d w i t h t h e a d j o i n t e q u a t i o n s and t o t h e maximum p r i n c i p l e of o p t i m i z a t i o n . This appendix a l s o p r o v i d e s a background f o r under s t a n d i n g t h e s t e e p e s t descent o p t i m i z a t i o n e q u a t i o n s .
NOTATION The f o l l o w i n g n o t a t i o n i s u s e d i n t h e body of t h e t e x t .
Additional
symbols a r e d e s c r i b e d as t h e y are i n t r o d u c e d i n t h e appendixes.

L D
n c o n t r o l value of l i f t - d r a g r a t i o number of s t o r a g e p o i n t s i n c o n t r o l t i m e h i s t o r y
t tf
to
t ime
f i n a l t inie i n i t i a l time
At
U
t i m e increment of c o n t r o l impulse
1 function ccritr :
AU
c o n t m l impulse c o s t f u n c t i o n a t f i n a l time change i n c o s t f u n c t i o n a t f i n a l time due t o c o n t r a 1 impulses a t time t s t a t e v a l u e a t f i n a l time
&(t)
d e s i r e d s t a t e value a t f i n a l t i m e
A\I'(t)
change i n s t a t e value a t f i n a l t i m e due t o c o n t r o l impulses a t t i m e
T h e method of s t e e p e s t d e s c e n t ( r e f s . 3 and 4) i s an i t e r a t i v e procedure t h a t has been used for optimizing t r a j e c t o r i e s . The p r o c e s s commences w i t h any nonoptimal t r a j e c t o r y from which a s l i g h t l y improved one i s derived. The improved t r a j e c t o r y i s t h e n u s e d as a new nominal t r a j e c t o r y , and t h e procedure i s r e p e a t e d u n t i l t h e optimum or n e a r l y optimum t r a j e c t o r y i s found.
General O u t l i n e The i t e r a t i o n i s as f o l l o w s : (1) E s t i m a t e a reasonable program t h a t n e a r l y s a t i s f i e s t h e t e r m i n a l c o n d i t i o n s for s p e c i f i e d i n i t i a l c o n d i t i o n s ; ( 2 ) determine impulse response f u n c t i o n s t h a t d e s c r i b e t h e e f f e c t s of s m a l l changes i n t h e c o n t r o l on t h e t e r m i n a l s t a t e and on t h e c o s t ( t h e q u a n t i t y t o b e minimized). These impulse response f u n c t i o n s , combined w i t h s t e e p e s t descent computations, i n d i c a t e t h e b e s t p J s s i b l e way of making s m a l l changes t o t h e c o n t r o l t o decrease t h e c o s t and s t i l l a r r i v e a t t h e end p o i n t ; (3) add t h i s change i n c o n t r o l t o t h e previous nominal c s n t r o l program. The r e s u l t i s a new t r a j e c t o r y .with a decreased c o s t ; ( 4 ) r e p e a t t h i s p r o c e s s u n t i l t h e r e e x i s t s o n l y a v e r y s m a l l change i n c o s t f o r each new t r a j e c t o r y , i n d i c a t i n g t h a t t h e c o n t r o l i s v e r y near a l o c a l optimum. A l i m i t value of t h e c o n t r o l may b e reached b e f o r e t h e c o s t i s completely minimized. I n t h i s c a s e , t h e p r o c e s s i s continued u n t i l t h e c o n s t r a i n t ( c o n t r o l l i m i t ) i s reached, s i n c e no f u r t h e r o p t i m i z a t i o n i s p o s s i b l e . The p r o p e r t i e s of s t e e p e s t descent o p t i m i z a t i o n have been documented i n many p r e v i a u s s t u d i e s ( e . g . , r e f s . 5 - 7 ) . Although t h i s meth.;d has been regarded as t h e most p r a c t i c a l i n many a p p l i c a t i o n s , t h e r e i s no guarantee t h a t it y i e l d s t h e a b s o l u t e optimum. That i s , f o r some i n i t i a l choices of t h e nominal t r a j e c t o r y , t h e f i n a l optimized t r a j e c t o r y may r e p r e s e n t o n l y a l o c a l optimum p a t h . Also, i n some a p p l i c a t i o n s , where t h e c o s t f u n c t i o n may be r e l a t i v e l y i n s e n s i t i v e t o c o n t r o l v a r i a t i o n s , a l a r g e number of i t e r a t i o n s may be necessary t o approach t h e optimum s o l u t i o n . Computation of Impulse RespJnse Functions To i l l u s t r a t e t h e computation of t h e impulse response f u n c t i o n s l e t t h e q u a n t i t y t o be minimized be noted as cp, t h e c s s t e v a l u a t e d a t t h e f i n a l t i m e . L e t t h e s t a t e variable a t t h e f i n a l t i m e be noted as \I' and l e t t h e d e s i r e d end-point value f o r t h i s b e denoted \I'd. F i g u r e 1 i l l u s t r a t e s t h e manner i n which t h e i n f l u e n c e of s m a l l c o n t r o l changes on cp and \I' are c a l c u l a t e d i n t h i s r e p o r t . The e q u a t i o n s of motion are f i r s t s o l v e d w i t h a c o n t r o l change, a p o s i t i v e c o n t r o l impulse a t t i m e t ,
superimposed upon t h e nominal c o n t r o l . During t h e next s o l u t i o n of t h e motion e q u a t i o n s , a n e g a t i v e c o n t r o l impulse o f t h e same magnitude i s i n s e r t e d a t t i m e t . The change i n c o s t , 4, and t h e change i n t e r m i n a l state, A@, are d e r i v e d f r o m t h e s e t w o s o l u t i o n s . I n a s i m i l a r manner t h e impulse response f u n c t i o n s can be p r o g r e s s i v e l y determined at s u c c e s s i v e times along t h e t r a j e c t o r y , and t h e technique by which t h e y a r e determined i s t h e most important f e a t u r e o f t h i s computing method. The computation of t h e f u l l h i s t o r y o f @ ( t ) and A $ ( t ) f o r t h e same c o n t r o l impulse a t d i f f e r e n t t i m e s along t h e t r a j e c t o r y i s termed one " i t e r a t i o n " s i n c e it corresponds to t h e p r e v i o u s o p t i m i z a t i o n s t u d i e s where one i t e r a t i o n w i t h t h e a d j o i n t e q u a t i o n s w a s u s e d t o compute e s s e n t i a l l y t h e s e same f u n c t i o n s along t h e t r a j e c t o r y . Appendix A shows t h e r e l a t i o n s h i p o f t h i s experimental d e t e r m i n a t i o n t o t h e s t a n d a r d d e t e r m i n a t i o n u s i n g a d j o i n t equa t i o n s . These e x p e r i m e n t a l impulse response f u n c t i o n s a r e shown t o be d i r e c t l y r e l a t e d t o t h e well-known "Green's f u n c t i o n s . I'
S t e e p e s t Descent Optimization Equations The impulse response f u n c t i o n s a r e u s e d i n t h e s t e e p e s t d e s c e n t technique t o modify t h e c o n t r o l toward t h e optimum i n t h e f o l l o w i n g manner (see appen dix A).
The g a i n s Q and Kq a r e c o n s t a n t s f o r each i t e r a t i o n . The g a i n % weights t h e impulse response f u n c t i o n f o r t h e c o s t ; i t s s i g n i s n e g a t i v e i n o r d e r t o decrease t h e c o s t . The magnitude o f % i s determined e x p e r i m e n t a l l y f o r each problem. Too l a r g e a g a i n may cause i n s t a b i l i t y i n t h e convergence procedure, while t o o s m a l l a g a i n may extend t h e time o f convergence. The g a i n K q must be c a l c u l a t e d f o r each i t e r a t i o n s o t h a t t h e t e r m Kq Aq(t) w i l l account f o r t e r m i n a l displacement due t o t h e optimizing term, Kcp & p ( t ) , and any t e r m i n a l displacement e r r o r f r o m t h e p r e v i o u s i t e r a t i o n w i l l be c o r r e c t e d . The f o r m u l a t i o n f o r c a l c u l a t i n g K,J i s as f o l l o w s :
S m a l l changes, SJr, i n t h e t e r m i n a l s t a t e , $, due t o s m a l l changes, 6 u ( t ) , i n c o n t r o l can b e approx.imated by:
64f= 2 A : At
J
%
6 u (t ) A J r (t )d t
~
where A u i s t h e h e i g h t of each c o n t r o l impulse and A t i s t h e time i n t e r v a l of each c o n t r o l impulse. S u b s t i t u t i n g Q ( t ) + K,,, A $ ( t ) f r o m (1) f o r 6u, w e have :
6$ = 2 nu 1 A t Jt'
t0
(3)
Solving f o r
and I.etting -&$ = $d
$ (prevj.ous t e r m i n a l e r r o r ) we o b t a i n :
to J W S t e e p e s t descent optimization t e r m
<
to J v Terminal e r r o r correction t e r m The a c t u a l cal
This g i v e s t h e g e n e r a l form of t h e steepest descent e q u a t i o n s . c u l a t i o n s are next considered i n more d e t a i l .
HYBRID COMPUTER MECHANImTION

The mechanization of t h e o p t i m i z a t i o n procedure on a high-speed r e p e t i t i v e analog computer i s p r e s e n t e d i n f i g u r e 2. Figure 2 ( a ) i s t h e flow diagram and f i g u r e 2 ( b ) i l l u s t r a t e s t h e l o g i c u s e d i n a u t o m a t i c a l l y r e g u l a t i n g t h e problem. The mechanization c o n s i s t s of an analog computer program f o r s o l v i n g t h e t r a j e c t o r y e q u a t i o n s ; l o g i c r e q u i r e d t o c o o r d i n a t e t h e procedure; and a s e r i a l memory s t o r a g e u n i t f o r s t o r i n g t h e nominal c o n t r o l program. The s e r i a l memoryunit i s c o n t i n u o u s l y d r i v e n by counter p u l s e s (Logic no. 1 ) . The output of t h e s e r i a l memory i s t h e nominal c o n t r o l t i m e h i s t o r y w i t h n p o i n t s t h a t i s u s e d along w i t h t h e a p p r o p r i a t e c o n t r o l impulse, t o s o l v e t h e t r a j e c t o r y e q u a t i o n s . These equations are s t a r t e d a t t h e s p e c i f i e d i n i t i a l c o n d i t i o n s w i t h Logic no. 2 and stopped when t h e t r a j e c t o r y reaches t h e s p e c i f i e d end p o i n t w i t h Logic no. 3. The f i n a l v a l u e s of t h e c o s t quan t i t y , cp, and s t a t e q u a n t i t y , $', are s t o r e d a t t h e end of each r u n a s i n d i c a t e d by Logic n38. 4 and 5. The p z s i t i v e o r negative c o n t r o l impulse i s added t o t h e nominal c o n t r o l i n p u t w i t h Logic nos. 6 and 7 , r e s p e c t i v e l y . Logic no. 8 i n s e r t s t h e modifying c o n t r o l (Kcp @ + K ,, Aq) i n t o t h e s e r i a l memory. This p r s c e d u r e r u n s i n e s s e n t i a l l y a continuous manner; t h a t i s , one p o i n t o u t o f t h e n p o i n t s i n t h e nominal c o n t r o l h i s t o r y i s updated a f t e r each two r e p e t i t i v e computations, and a f t e r 2n r e p e t i t i v e computations (one i t e r a t i o n ) , every p o i n t i n s t o r a g e has been modified and t h e p r o c e s s i s r e p e a t e d . For each i t e r a t i o n t h e g a i n s Kcp and Kq are h e l d as c o n s t a n t s . A s p r e v i o u s l y men t i o n e d , t h e value of Kcp determines t h e r e l a t i v e speed and s t a b i l i t y of t h e convergence a n t 3 t h e optimum. The corresponding value of Kq t o b e u s e d w i t h each new i t e r a t i o n i s c a l c u l a t e d b y e q u a t i o n ( 4 ) as a f u n c t i o n of t h e t e r m i n a l e r r o r from each p r e v i o u s i t e r a t i o n ($d - $) and as a f u n c t i o n of t h e following two i n t e g r a t e d q u a n t i t i e s from each p r e v i o u s i t e r a t i o n :
and
The values f o r e q u a t i o n s ( 5 ) and (6) were computed as i n t e g r a l s a v e r t h e t i m e p e r i o d from t = to t o t = t f . The t i m e to w a s r e p r e s e n t e d b y a l o g i c s i g n a l a t t h e f i r s t r e p e t i t i v e computation i n an i t e r a t i o n c y c l e and t h e t i m e tf w a s r e p r e s e n t e d b y a l o g i c s i g n a l at t h e l a s t computation i n a m iteration c y c l e . It should b e n o t e d t h a t during t h o s e p u t s of t h e t r a j e c t o r y when t h e c o n t r o l w a s a t a c o n s t r a i n t l i m i t , no f u r t h e r o p t i m i z a t i o n w a s p o s s i b l e , and t h e i n t e g r a t i o n of e q u a t i o n s ( 5 ) and ( 6 ) w a s t h e r e f o r e n o t c a r r i e d o u t during t h o s e times. This type of computer mechanization w i l l b e i l l u s t r a t e d i n more d e t a i l f o r t h e f o l l o w i n g example problem.
APPLICATION TO FEEITTRY TRAJECTORY OPTIMIZATION
Statement of t h e Problem The problem t o b e i l l u s t r a t e d i n t h i s s e c t i o n i s t h a t of determining t h e time h i s t o r y of t h e v a r i a t i o n of l i f t - d r a g r a t i o ( c o n t r o l L/D) t h a t must b e flown f o r a v e h i c l e r e t u r n i n g i n t o t h e e a r t h ' s atmosphere s o t h a t : The t o t a l h e a t i n g load t o t h e v e h i c l e i s minimized. The v e h i c l e arrives a t a d e s i r e d d e s t i n a t i o n . The c o n t r o l remains w i t h i n s p e c i f i e d c o n s t r a i n t s .
Mechanization The equations of motion, p r e s e n t e d i n appendix.B, were f o r a p o i n t mass i n p l a n a r motion 3ver a s p h e r i c a l n o n r o t a t i n g e a r t h . The v e h i c l e c h a r a c t e r i s t i c s and f l i g h t c o n d i t i o n s were t h o s e f o r a manned capsule r e t u r n i n g from e a r t h orbit. I n i t i a l c o n d i t i o n s were: Altitude
Velocity
F l i g h t - p a t h angle
Range t o d e s t i n a t i o n
F i n a l stopping c o n d i t i o n s were: Altitude
76.3 lan 7.63 km/s 1 . 8 ' 1609 Ism
(250,000 f t ) (25,000 f p s )
(1000 m i )
30.48
~nn
(ioo,ooo f t )
C o n t r o l l i m i t s were:
o <L<
0 . 5
The main hardware elements u s e d i n t h e h y b r i d computer mechanization were: Hardware elements Analog computer P a r a l l e l d i g i t a l logic units Track and s t o r e a m p l i f i e r s D i g i t a l d e l a y l i n e memories1 ( w i t h D/A and A/D c o n v e r t e r s ) Program t a s k S o l u t i o n of t r a j e c t o r y e q u a t i o n s Logic c o n t r o l of program S t o r a g e o f end-point v a l u e s S t o r a g e of c o n t r o l time h i s t o r y
The 64-word d i g i t a l s e r i a l memory u n i t (1.3 b i t s p e r word) w a s accessed w i t h t h e f a s t e s t allowable c o u n t e r rate (0.002 s e c ) . A complete 64-word c y c l e w a s t h e n a v a i l a b l e e v e r y 0.128 second. To a l l o w a complete s o l u t i o n o f t h e t r a j e c t o r y e q u a t i o n s w i t h i n 0.128 second, t h e analog computer w a s time s c a l e d a t 3750 t o 1. Results
A s e r i e s of computer runs f o r t h i s problem i s i l l u s t r a t e d i n f i g u r e 3. F i g u r e 3 ( a ) p r e s e n t s t h e d e t a i l s of each r e p e t i t i v e t r a j e c t o r y computation and f i g u r e 3 ( b ) p r e s e n t s t h e d e t a i l s Jf t h e o v e r a l l convergence onto t h e optimum nominal c o n t r o l . F i g u r e 3 ( a ) shows j u s t a p o r t i o n o f i t e r a t i o n no. 0 as p r e sented i n f i g u r e 3 ( b ) .
I n t h e upper t r a c e of f i g u r e 3 ( a ) t h e c o n t r o l impulses a r e superimposed upon t h e i n i t i a l nominal c o n t r o l . Each c o n t r o l impulse had a magnitude o f L/D = 20.25 and a time increment of one c l o c k p u l s e (0.002 s e e ) . This c o n t r o l impulse w a s chosen because it gave v a r i a t i o n i n t h e f i n a l range and h e a t l o a d on t h e o r d e r o f f 5 p e r c e n t . The range and i n t e g r a t e d h e a t l o a d along each of t h e r e p e t i t i v e t r a j e c t o r i e s a r e p r e s e n t e d along w i t h t h e f i n a l v a l u e s as t h e y a r e s t o r e d w i t h t r a c k and s t o r e a m p l i f i e r s . The d i f f e r e n c e between t h e s e s t o r e d q u a n t i t i e s f o r each two p a i r s o f subsequent runs i s A@ r e p r e s e n t i n g t h e range impulse response f u n c t i o n s and &p representing the heat load impulse response f u n c t i o n s . I n f i g u r e 3 ( b ) t h e f i r s t 10 i t e r a t i o n s ( e a c h i t e r a t i o n c o n s i s t s of 128 r e p e t i t i v e computations) of t h e converging o p t i m i z a t i o n procedure are i l l u s t r a t e d along w i t h t h e f i n a l i t e r a t i o n . During t h e convergence procedure t h e range i s seen t o vary s l i g h t l y about t h e d e s i r e d value o f 1609 km (1000 m i l e s ) . The h e a t l o a d i s shown t o b e reduced about 10 p e r c e n t during t h e f i r s t t e n i t e r a t i o n s and diminished t o about 12 p e r c e n t from t h e o r i g i n a l w i t h t h e f i n a l (optimum) c o n t r o l v a r i a t i o n .
A s e r i e s of t r a c k and s t o r e a m p l i f i e r s could a l s o have been u s e d for this stxage.
The modifying c o n t r o l shown i n t h e f i g u r e i s t h e sum Q nCp + Kq A $ . For t h i s s e r i e s of runs a c o n s t a n t v a l u e of E;lp = - 2 . 5 ~ 1 0 - ~ [ u n i tof s (L/D)/(J/m2)] w a s found t o , a l l o w a f a i r l y r a p i d convergence while maintaining program s t a b i l i t y . The v a l u e of K$ w a s c a l c u l a t e d by e q u a t i o n ' ( 4 ) t o b e t h a t value f o r each i t e r a t i o n such a s ' t o a l l o w convergence i n t h e s t e e p e s t descent manner. I n t h e lower t r a c e of f i g u r e 3 ( b ) t h e nominal c o n t r o l i s recorded as it i s r e a d o u t of s e r i a l memory every 128 +1 counter p u l s e s ( w i t h Logic no. 8 ) . This g i v e s a convenient time h i s t o r y t o show t h e manner i n which t h e c o n t r o l has been modified during each i t e r a t i o n . The c o n t r o l i s seen t o be l i m i t e d w i t h i n 0 < L/D < 0.5. This w a s achieved simply by l i m i t i n g t h e output of t h e s e r i a l memory t o w i t h i n t h e s e v a l u e s .
As can be seen, t h e optimum c o n t r o l v a r i a t i o n f o r t h i s case i s a bangbang c o n t r o l . With t h e s t e e p e s t descent method, it i s u s u a l l y found t h a t n e a r optimum c o n t r o l can be achieved i n t h e f i r s t few i t e r a t i o n s , b u t t h a t t o "square up t h e corners'' and achieve f u l l optimum c o n t r o l a number of f u r t h e r i t e r a t i o n s (on t h e o r d e r of 20 t o 50) are r e q u i r e d .
Convergence and S t a b i l i t y Considerations One of t h e important a s p e c t s of any o p t i m i z a t i o n scheme i s t h e a b i l i t y t o converge, w i t h i n a reasonable time, onto t h e optimum s o l u t i o n . For t h e p a r t i c u l a r method i n th'is r e p o r t it has been p o i n t e d o u t t h a t t h i s convergence p r i m a r i l y depends upon choosing t h e p r o p e r v a l u e of t h e g a i n $. I n t h e example l e s s than 2 . 5 ~ 1 0 - ~ problem, it w a s found t h a t using any value of [ u n i t s of (L/D) /( J / m 2 ) ] r e s u l t e d i n smooth convergence; however, t h e convergence time (which w a s p r o p o r t i o n a l t o 1 / % ) became long. For i n i t i a l values of g r e a t e r t h a n twice t h e aforementioned value t h e convergence became u n s t a b l e , t h a t i s , t h e modifying 6 c o n t r o l became s o l a r g e as t o change d r a s t i c a l l y t h e s t a t e v a r i a b l e from t h e i r nominal f i n a l v a l u e s .
KCP
It w a s found t h a t , as t h e optimum c o n t r o l w a s approached ( a f t e r about 10 i t e r a t i o n s ) , t h e value of % could be i n c r e a s e d and convergence s t a b i l i t y maintained, because i n t h e s e examples t h e c o n t r o l approached bang-bang and o n l y s m a l l changes were p o s s i b l e n e a r t h e s a t u r a t i o n limits. The value of K$ i n t h e s e c a s e s could be i n c r e a s e d t o about 10 times t h e aforementioned value, b u t i n c r e a s i n g it much f a r t h e r (without analog v o l t a g e s c a l i n g changes) would a l l o w extraneous computer n o i s e t o b e magnified t o a p o i n t where it caused n o t a b l e random f l u c t u a t i o n s i n t h e computations.
For a reasonable value of g a i n , such as t h a t used f o r t h e example problem, t h e time t o converge t o a n e a r optimum s o l u t i o n ( 1 1 i t e r a t i o n s ) w a s about 3 minutes, and t o a f u l l optimum s o l u t i o n (30 i t e r a t i o n s ) , about 8 minutes. F u r t h e r changes i n t h e s e convergence times, of course, depend upon s e v e r a l f a c t o r s . For i n s t a n c e , t h e convergence time i n t h i s computing setup w a s i n p r o p o r t i o n t o n2, where n i s t h e number of p o i n t s d e s c r i b i n g t h e c o n t r o l time h i s t o r y (64 p o i n t s f o r t h e case c i t e d ) . Also t h e allowable s o l u t i o n r a t e s of t h e computer elements d i r e c t l y a f f e c t t h e convergence time. The continuing development and use of high-speed computing elements w i l l c e r t a i n l y r e s u l t i n convergence times s m a l l e r t h a n t h e time c i t e d .
The r e s u l t s o b t a i n e d b y t h i s computing method appear s a t i s f a c t o r y f o r e n g i n e e r i n g purposes; however, t h e u s u a l disadvantages o f analog computation are i n h e r e n t w i t h t h i s method. These disadvantages are p r i m a r i l y concerned w i t h t h e extraneous n o i s e i n t h e computations and t h e a b s o l u t e accuracy ( o n l y t o w i t h i n about 1 p e r c e n t ) of analog computer.
CONCLUDING
REMARKS
This r e p o r t has p r e s e n t e d a p e r t u r b a t i o n method of computing optimum t r a j e c t o r i e s . The technique u s e s fast-time r e p e t i t i v e computations i n d e t e r mining c o n t r o l impulse response f u n c t i o n s and r e q u i r e s o n l y t h e dynamic s o l u t i o n o f t h e s t a t e e q u a t i o n s ; whereas o t h e r p e r t u r b a t i o n computing t e c h n i q u e s have r e q u i r e d t h e s o l u t i o n of a d d i t i o n a l a d j o i n t e q u a t i o n s . A h y b r i d computer w a s used i n applying t h e method t o t h e s t e e p e s t descent o p t i m i z a t i o n of r e e n t r y t r a j e c t o r i e s . Mechanizing t h e computer f o r t h i s type of problem w a s r e l a t i v e l y simple, and near Q p t i m m t r a j e c t o r i e s could be o b t a i n e d i n about 3 minutes of computing t i m e and full optimum t r a j e c t o r i e s i n about 8 minutes. The advantage o f t h e technique o u t l i n e d here over a l t e r n a t i v e techniques i s t h a t t h e i n v e s t i g a t o r need n o t be familiar w i t h or u s e an a u x i l i a r y s e t of l i n e a r a d j o i n t e q u a t i o n s f o r t h e o p t i m i z a t i o n . This technique &ses, however, r e q u i r e a l a r g e number of dynamic s o l u t i o n s of t h e s t a t e e q u a t i o n s , b u t t h i s computing t a s k appears p r a c t i c a l w i t h t h e high-speed r e p e t i t i v e computation procedure p r e s e n t e d i n t h i s r e p o r t . Ames Research Center N a t i o n a l Aeronautics and Space Administration Moffett F i e l d , C a l i f . , J a n . 24, 1966
APPENDIX A
RELATIONSHIP O F
EXPERIMENTAL IMPULSE RESPONSE FUNCTIONS
TO ADJOIN" SOLUTIONS AND THE MAXIMUM PRINCIPLJT

A d j o i n t S o l u t i o n of Impulse Response F u n c t i o m
L e t the s t a t e e q u a t i o n s b e n o t e d as
where x ( t ) i s a v e c t o r of s t a t e v a r i a b l e s , u ( t ) i s t h e c o n t r o l v a r i a b l e and i s a v e c t o r of known f u n c t i o n s of x ( t ) and u ( t )
The a u x i l i a r y a d j o i n t e q u a t i o n s can b e noted as
where A i s a v e c t o r of i n f l u e n c e f u n c t i o n s and (af/ax)T i s t h e t r a n s p o s e of t h e matrix d e s c r i b i n g l i n e a r motions about t h e nominal p a t h of x ( t ) and
u(t)
It i s w e l l known t h a t any s m a l l change i n t h e c o n t r o l q u a n t i t y along t h e nominal p a t h w i l l determine a change 6(p i n any q u a n t i t y f i n a l t i m e as follows:
6u(t) cp a t t h e
+ r e p r e s e n t s a s o l u t i o n of t h e a d j o i n t e q u a t i o n s w i t h t h e boundary where & c o n d i t i o n s a t t h e f i n a l t i m e of
The q u a n t i t y
A ( a f / d u ) w i t h i n t h e i n t e g r a l i s kn3wn a s Green's f u n c t i 3 n .
1 0
Experimental Determination of Z q u l s e Response Functions
The method used i n t h e t e s t .of t h i s r e p o r t f o r f i n d i n g a change p e r t u r b the c o n t r o l e x p e r i m e n t a l l y i n t h e f o l l o w i n g manner:
6 c q
is to
ti
L J !
t0
,Nominal control
. . . .=
tf
With two s e q u e n t i a l dynamic s o l u t i o n s of the s t a t e e q u a t i o n s u s i n g f i r s t a p o s i t i v e c o n t r o l i m p u l s e and t h e n a n e g a t i v e c o n t r o l impulse, t h e f o l l o w i n g i s avai l a b l e :
From e q u a t i o n (A3) we can write t h e change 26q = Ap, f o r a s m a l l c o n t r o l , 6u = nu, a c t i n g over a s m a l l t i m e i n t e r v a l A t , as f o l l o w s :

2ep =
=
( . p T af z ) 2 nu
At
This then r e p r e s e n t s t h e correspondence between t h e impulse response f u n c t i o n s c a l c u l a t e d i n t h e text and t h o s e s o l v e d by t h e a d j o i n t s o l u t i o n . Greens f u n c t i o n e v a l u a t e d a t any t i m e , t , along t h e t r a j e c t o r y can b e noted as
and
R e l a t i o n s h i p t o t h e Maximum P r i n c i p l e
The maximum p r i n c i p l e ( r e f . 8) s t a t e s that a n e c e s s a r y c o n d i t i o n f o r a minimum (maximum) of t h e c o s t f u n c t i o n i s t h a t t h e Hamiltonian be maximized (minimized) w i t h r e s p e c t t o t h e c o n t r o l at a l l t i m e s . The Hamiltonian can be w r i t t e n as
H = A f
(A91
11
where t h e t r a n s v e r s a l i t y c o n d i t i o n must b e s a t i s f i e d a t t h e f i n a l time,
and i s a Lagrange m u l t i p l i e r c o n s t a n t chDsen s3 t h a t t h e t e r m i n a l con s t r a i n t i s s a t i s f i e d . The boundary c o n d i t i o n s on t h e a d j o i n t e q u a t i o n s a t t h e f i n a l time are :
Now t o determine i f H i s minimized w i t h r e s p e c t t o t h e c o n t r o l we can t a k e t h e p a r t i a l d e r i v a t i v e of H w i t h r e s p e c t t o u:
Or, n o t i n g t h e correspondence between e q u a t i o n s (Ah) and (All), we can write
T where &,(af/au) i s Green's f u n c t i o n f o r t h e c o s t and functionTfor t h e terminal constraint.
A,(af/au)
i s Green's
R e c a l l i n g t h e correspondence between t h e a d j o i n t s o l u t i o n for Green's f u n c t i o n and t h a t determined e x p e r i m e n t a l l y , we have t h e f o l l o w i n g :
T h i s , then, r e p r e s e n t s t h e r e l a t i o n s h i p between t h e e x p e r i m e n t a l l y det emnine d impulse r e sponse f u n c t i o n s and t h e Hamiltonian. It i s i n t e r e s t i n g t h a t t h e maximum p r i n c i p l e can be a p p l i e d through t h i s r e l a t i o n s h i p without any need f o r s o l v i n g t h e a d j o i n t e q u a t i o n s . S t e e p e s t Descent Equatlons
>
a
The g r e a t e s t change, @, i n obtained (ref.
cp
f o r a given value of
t0
6u2(t)dt
is
3) when
12
where Kcp and % are c o n s t a n t s . T h i s i s t h e s t e e p e s t descent ( o r a s c e n t ) direction t o the mini" (or maximum) cp. When t h e r e are no s t a t e o r c o n t r o l c o n s t r a i n t s , t h e s t e e p e s t descent procedure converges toward t h e n e c e s s a r y c o n d i t i o n s f o r an optimum s o l u t i o n as p r e v i o u s l y noted
Q(t> 4- -q
where i n t h e s t e e p e s t descent e q u a t i o n s , with the terminal constraint s a t i s f i e d .
a+>=
K q / %
( A16 1
on t h e o p t i m s o l u t i o n
= 7,
APPENDIX B
REENTRY T W C T O R Y EQUATIONS
The f o l l o w i n g e q u a t i o n s were programmed on t h e analog computer for t h e example c o n s i d e r e d i n this r e p o r t . These s i m p l i f i e d e q u a t i o n s were d e r i v e d f o r f l i g h t w i t h i n t h e atmosphere and t h e primary assumptions i n c l u d e : a s p h e r i c a l n o n r o t a t i n g e a r t h , s m a l l f l i g h t - p a t h a n g l e s , and a c o n s t a n t g r a v i t y term. The d e r i v a t i o n of t h e s e s i m p l i f ied e q u a t i o n s and t h e i r a p p l i c a b i l i t y have been c o n s i d e r e d i n a number of r e p o r t s . See f o r i n s t a n c e r e f e r e n c e 9. The e q u a t i o n s are
$ =
st'
t0
V dt
where
L D
h
c o n t r o l value of l i f t - d r a g r a t i o altitude, m h o r i z o n t a l v e l o c i t y , m/s f i n a l range, m
cp
P
t o t a l h e a t i n p u t , J/m2
atmosphere d e n s i t y , 1.225 e radius from earth center, -h/7160
%/m3
r
g
6.43~10~ m
9.8 m/s2
l o c a l gravitational acceleration, drag l o a d i n g , 0.004 mz/kg
(?)
14
REFERENCES
1 . Leitmann, George, ed.:
Optimization Techniques.
Academic Press, 1962.
2. Balakrishnan, A. V. ; and Neustadt, Lucien W., eds. : Computing Methods in Optimization Problems. Academic Press, 1964.
I.
3. Bryson, Arthur E. ; and Denham, Walter F. : A Steepest-Ascent Method for Solving Optimum Programming Problems. Raytheon Rep. BR 1303, 1961. Also J. Appl. Mech., vol. 29, no. 2, June 1962, pp. 247-257.
4 . Kelley, H. J. : Gradient Theory of Optimal Flight Paths. ARS J., 30, no. 10, Oct. 1960, pp. 947-954.
5. Bryson, A. E.; Denham, W. F.; Carroll, F. J.; and
Mikami,
vol.
tion of Lift or Drag Programs That Minimize Reentry Heating. space Sei., vol. 29, no. 4, April 1962, pp. 420-430.
K.: Determina J. Aero
6. Blanton, H. Elmore, ed. : Three-DimensionalTrajectory Optimization Study. Pt. 1 - Optimum Programming Formulation. NASA CR-57030, 1964.
(Supersedes Aero. Sys. Div. Rep. ASD-TDR-62-29?and Raytheon Rep. Br-1759-1).
7. Hague, D. S. : Three-Degree-of -Freedom Problem Optimization Formulation. FDL-TDR-64-1,pt. 1, V O ~ . 3, Oct. 1964.

8. Pontryagin, L. S., et al.: The Mathematical Theory of Optimal Processes.
Wiley and Sons, 1962.
9. Chapman, Dean R.: An Approximate Analytical Method for Studying Entry

Into Planetary Atmospheres. NASA TR R-11, 1959.
C , o n t r o I im pu Ise
Control, U
71
State va Iue
cos
t
t0
1 1 1 1 1 1
tf to Trajectory t i me 1 1 1 1 1 1 1 1 Computer t i m e
tf
1 1 1 1 1 1
F i g u r e 1,- I l l u s t r a t i v e time h i s t o r y of r e p e t i t i v e analog computations.
I.
c.
91 (Counter)
Analog
f 6
impulse O
I
L
I I
-Control impulse
3+2 (Start)
I I I
+
I----I
#3
(Stop)
I
I I
----Dotted
lines represenl logic signals shown in figure 2 ( b )
(a) Flow diagram.

Figure 2,- Mechanization of the optimization procedure.
#2
-"
L
(Stop)
#3
Solution time
J
I
#A
(itoie
(Store)
#5
#7
(- Control )
I
I
impulse)
#8
(Modify)
I
( b ) Problem l o g i c .
Figure 2.
Concluded.
rc
.5 Control L .25 -
n
'
/Control
impulse Nominal control
il
~ y , I y
0-
Final range,$, IO3 km
2 IW-
Final heat Ioad,4, 20
-i
0.1 Second of computer time
( a ) D e t a i l s of r e p e t i t i v e t r a j e c t o r y computations.
Figure 3.- Recorded time h i s t o r i e s for r e e n t r y t r a j e c t o r y optimization; minimum h e a t with t e r m i n a l range constraint
? Iu
Iu
P
Final range, IO3 km
+
IO
Final heat load, $ 107 Joule
m2
0-
I O Seconds of computer time
2-
Modifying -.I 6 control O .I -
%- 1 1
( b ) Details of o v e r a l l convergence procedure.
Figure
3.- Concluded,
The aeronautical and space activities of the United States shall be conducted so as to contribute . . . to the expansion of human knowl edge of phenomena in the atmosphere and space. The Administration shall provide f o r the widest practicable and appropriate dissemination of information concerning its actiuities and the results thereof.
-NATIONAL AERONAUTICS AND SPACE ACT OF 1958
\ i
NASA SCIENTIFIC A N D TECHNICAL PUBLICATIONS

TECHNICAL REPORTS: Scientific and technical information considered important, complete, and a lasting contribution to existing knowledge. TECHNICAL NOTES: Information less broad in scope but nevertheless existing knowledge. of importance as a contribution TECHNICAL MEMORANDUMS: Information receiving limited distri bution because of preliminary data, security classification, or other reasons. CONTRACTOR REPORTS: Technical information generated in con nection with a NASA contract or grant and released under NASA auspices. TECHNICAL TRANSLATIONS: Information published in a foreign language considered to merit NASA distribution in English. TECHNICAL REPRINTS: Information derived from NASA activities and initially published in the form of journal articles. SPECIAL PUBLICATIONS: Information derived from or of value to NASA activities but not necessarily reporting the results .of individual NASA-programmed scientific efforts. Publications include conference proceedings, monographs, data compilations, handbooks, sourcebooks, and special bibliographies.
Details on the availability o f these publications may be obtained from:
SCIENTIFIC AND TECHNICAL INFORMATION DIVISION
N AT1 0 NA L AERO N AUT1 CS A N D SPACE A D M I N ISTRATI 0 N

Washington,
D.C. PO546

A Method of Trajectory Optimization by Fast-Time Repetitive Computations

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

A Method of Trajectory Optimization by Fast-Time Repetitive Computations

Uploaded by

Copyright:

Available Formats

NASA TECHNICAL NOTE

A METHOD OF TRAJECTORY OPTIMIZATION BY FAST-TIME REPETITIVE COMPUTATIONS

NATIONAL AERONAUTICS A N D SPACE ADMINISTRATION

TECH LIBRARY KAFB, NM

IIlil11 1lulI 11111I l 11 0130190

NATIONAL AERONAUTICS AND SPACE ADMINISTRATION

A METHOD OF TRAJECTORY OPTIMIZATION BY FAST-TIME

NOTATION The f o l l o w i n g n o t a t i o n i s u s e d i n t h e body of t h e t e x t .

symbols a r e d e s c r i b e d as t h e y are i n t r o d u c e d i n t h e appendixes.

c o n t m l impulse c o s t f u n c t i o n a t f i n a l time change i n c o s t f u n c t i o n a t f i n a l time due t o c o n t r a 1 impulses a t time t s t a t e v a l u e a t f i n a l time

change i n s t a t e value a t f i n a l t i m e due t o c o n t r o l impulses a t t i m e

and I.etting -&$ = $d

to J v Terminal e r r o r correction t e r m The a c t u a l cal

This g i v e s t h e g e n e r a l form of t h e steepest descent e q u a t i o n s . c u l a t i o n s are next considered i n more d e t a i l .

HYBRID COMPUTER MECHANImTION

APPLICATION TO FEEITTRY TRAJECTORY OPTIMIZATION

76.3 lan 7.63 km/s 1 . 8 ' 1609 Ism

EXPERIMENTAL IMPULSE RESPONSE FUNCTIONS

TO ADJOIN" SOLUTIONS AND THE MAXIMUM PRINCIPLJT

where x ( t ) i s a v e c t o r of s t a t e v a r i a b l e s , u ( t ) i s t h e c o n t r o l v a r i a b l e and i s a v e c t o r of known f u n c t i o n s of x ( t ) and u ( t )

The a u x i l i a r y a d j o i n t e q u a t i o n s can b e noted as

where A i s a v e c t o r of i n f l u e n c e f u n c t i o n s and (af/ax)T i s t h e t r a n s p o s e of t h e matrix d e s c r i b i n g l i n e a r motions about t h e nominal p a t h of x ( t ) and

+ r e p r e s e n t s a s o l u t i o n of t h e a d j o i n t e q u a t i o n s w i t h t h e boundary where & c o n d i t i o n s a t t h e f i n a l t i m e of

Experimental Determination of Z q u l s e Response Functions

The method used i n t h e t e s t .of t h i s r e p o r t f o r f i n d i n g a change p e r t u r b the c o n t r o l e x p e r i m e n t a l l y i n t h e f o l l o w i n g manner:

With two s e q u e n t i a l dynamic s o l u t i o n s of the s t a t e e q u a t i o n s u s i n g f i r s t a p o s i t i v e c o n t r o l i m p u l s e and t h e n a n e g a t i v e c o n t r o l impulse, t h e f o l l o w i n g i s avai l a b l e :

From e q u a t i o n (A3) we can write t h e change 26q = Ap, f o r a s m a l l c o n t r o l , 6u = nu, a c t i n g over a s m a l l t i m e i n t e r v a l A t , as f o l l o w s :

where t h e t r a n s v e r s a l i t y c o n d i t i o n must b e s a t i s f i e d a t t h e f i n a l time,

and i s a Lagrange m u l t i p l i e r c o n s t a n t chDsen s3 t h a t t h e t e r m i n a l con s t r a i n t i s s a t i s f i e d . The boundary c o n d i t i o n s on t h e a d j o i n t e q u a t i o n s a t t h e f i n a l time are :

Now t o determine i f H i s minimized w i t h r e s p e c t t o t h e c o n t r o l we can t a k e t h e p a r t i a l d e r i v a t i v e of H w i t h r e s p e c t t o u:

Or, n o t i n g t h e correspondence between e q u a t i o n s (Ah) and (All), we can write

T where &,(af/au) i s Green's f u n c t i o n f o r t h e c o s t and functionTfor t h e terminal constraint.

R e c a l l i n g t h e correspondence between t h e a d j o i n t s o l u t i o n for Green's f u n c t i o n and t h a t determined e x p e r i m e n t a l l y , we have t h e f o l l o w i n g :

The g r e a t e s t change, @, i n obtained (ref.

c o n t r o l value of l i f t - d r a g r a t i o altitude, m h o r i z o n t a l v e l o c i t y , m/s f i n a l range, m

l o c a l gravitational acceleration, drag l o a d i n g , 0.004 mz/kg

1 . Leitmann, George, ed.:

Academic Press, 1962.

K.: Determina J. Aero

7. Hague, D. S. : Three-Degree-of -Freedom Problem Optimization Formulation. FDL-TDR-64-1,pt. 1, V O ~ . 3, Oct. 1964.

9. Chapman, Dean R.: An Approximate Analytical Method for Studying Entry

F i g u r e 1,- I l l u s t r a t i v e time h i s t o r y of r e p e t i t i v e analog computations.

lines represenl logic signals shown in figure 2 ( b )

(a) Flow diagram.

impulse Nominal control

Final range,$, IO3 km

Final heat Ioad,4, 20

Final range, IO3 km

Final heat load, $ 107 Joule

I O Seconds of computer time

Modifying -.I 6 control O .I -

( b ) Details of o v e r a l l convergence procedure.

NASA SCIENTIFIC A N D TECHNICAL PUBLICATIONS

Details on the availability o f these publications may be obtained from:

SCIENTIFIC AND TECHNICAL INFORMATION DIVISION

N AT1 0 NA L AERO N AUT1 CS A N D SPACE A D M I N ISTRATI 0 N

You might also like