Professional Documents
Culture Documents
F_-
N A S A TN D-3404
d -
I
W
t-'
t-'
s. gi
I
1'.
: .
.I
.,
.y
WASHINGTON,
05;. .:;;
0;.
,A.P.R./~ i s a s
1'
- 9
$!
NASA TN D-3404
A METHOD O F TRAJECTORY OPTIMIZATION BY FAST-TIME REPETITIVE COMPUTATIONS By Rodney C. Wingrove, J a m e s S. Raby, and D. F r a n c i s C r a n e Ame s Research Center Moffett F i e l d , Calif.
by the Clearinghouse for Federal Scientific and Technical Information Springfield, Virginia 22151 Price $0.30
RFPETITIVE COMmTTATIONS By Rodney C . Wingrove, James S. Raby, and D. F r a n c i s Crane Ames Research Center S U M M A R Y This r e p o r t p r e s e n t s a p e r t u r b a t i o n method of computing o p t i m t r a j e c t o r i e s wherein c o n t r o l impulse response f u n c t i o n s are determined by fast-time r e p e t i t i v e computations of t h e s t a t e e q u a t i o n s . This method does not r e q u i r e t h e s o l u t i o n of t h e a u x i l i a r y set of a d j o i n t e q u a t i o n s u s e d w i t h o t h e r p e r t u r b a t i o n methods. The mechanization of t h i s computing method on a h y b r i d computer i s d i s cussed and an a p p l i c a t i o n t o t h e s t e e p e s t descent o p t i m i z a t i o n of r e e n t r y t r a j e c t o r i e s i s p r e s e n t e d . I n t h i s example, t h e v e h i c l e i s t o a r r i v e a t a d e s i r e d range, t h e h e a t i n p u t t o t h e v e h i c l e i s t o b e minimized, and t h e con trol i s t o remain w i t h i n s p e c i f i e d c o n s t r a i n t s . Gptii-mm t r a j e c t o r i e s f o r t h i s example could b e o b t a i n e d i n about 8 minutes of computing time.
INTRODUCTION
It i s important f o r space v e h i c l e t r a j e c t o r i e s t o b e n e a r optimum i n t h e sense t h a t some q u a n t i t y i s e i t h e r maximized or minimized. For example, i n r e e n t r y t h e t r a i e c t o r y t o d e s i r e d t e r m i n a l c o n d i t i o n s i s n e a r optimum when t h e t o t a l aerodynamic h e a t i n g i s a minimum. This r e p o r t w i l l consider a method f o r f i n d i n g t h e Lime h i s t J r i e s of n o n l i n e a r c o n t r o l s t h a t correspond t o optimum t r a . i e c t o r i e s . S e v e r a l p e r t u r b a t i o n methods, such as t h e c a l c u l u s of v a r i a t b n s , a p p l i c a t i o n s of Ihe maximum p r i n c i p l e , and d i r e c t s t e e p e s t descent, have been considered f o r s o l v i n g t h i s c o n t r o l o p t i m i z a t i o n problem. Reference 1 c o n t a i n s a good review of t h e s e v a r i o u s methods and r e f e r e n c e 2 g i v e s s e v e r a l analog and d i g i t a l computing techniques f o r implementing them.
I n p r i n c i p l e , each of t h e s e techniques should g i v e s a t i s f a c t o r y r e s u l t s , b u t it has been found t h a t f o r many t r a j e c t o r y problems t h e computer mechani z a t i o n s are cumbersome and r e q u i r e programs t h a t a r e d i f f i c u l t f o r e n g i n e e r s t o formulate. The computing method t o be r e p o r t e d h e r e i n w a s i n v e s t i g a t e d i n an attempt t o a l l e v i a t e t h e s e d i f f i c u l t i e s and t o provide a more d i r e c t way of computing o p t i m i z a t i o n s o l u t i o n s .
I n previous o p t i m i z a t i o n s t u d i e s u s i n g p e r t u r b a t i o n techniques t h e computations have involved t h e dynamic s o l u t i o n of two s e t s of e q u a t i o n s : (1) n o n l i n e a r s t a t e e q u a t i o n s and (2) l i n e a r a d j o i n t e q u a t i o n s . The method t o b e r e p o r t e d h e r e i n d i f f e r s i n t h a t o n l y t h e s o l u t i o n of t h e n o n l i n e a r s t a t e
e q u a t i o n s i s used. The response of given f u n c t i o n s ( e . g . , t e r m i n a l e r r o r o r q u a n t i t y t o b e optimized) t o a c o n t r o l impulse i s determined along t h e t r a j e c t o r y by fast-time r e p e t i t i v e computations r a t h e r t h a n by a s o l u t i o n of t h e a d j o i n t e q u a t i o n s . Since a u x i l i a r y a d j o i n t e q u a t i o n s are n o t needed, t h e i n v e s t i g a t o r should u n d e r s t a n d t h e o p t i m i z a t i o n p r o c e s s more e a s i l y ; a l s o t h e computer program should be s i m p l e r . However, t h i s new a l t e r n a t e computing method does r e q u i r e many s o l u t i o n s o f t h e s t a t e e q u a t i o n s . This t a s k of com p u t i n g a l a r g e number of dynamic s o l u t i o n s i s i d e a l l y s u i t e d t o high-speed r e p e t i t i v e h y b r i d computation as w i l l b e considered h e r e i n . This r e p o r t w i l l p r e s e n t one a p p l i c a t i o n o f t h i s computing technique; t h a t of t r a j e c t s r y o p t i m i z a t i o n u s i n g t h e s t e e p e s t d e s c e n t method ( r e f s . 3 and 4 ) . The mechanization of t h i s method on a h y b r i d computer w i l l be d i s c u s s e d and r e s u l t s w i l l be p r e s e n t e d t o i l l u s t r a t e t h e u s e of t h i s procedure i n t h e o p t i m i z a t i o n of r e e n t r y t r a j e c t o r i e s . For t h e i n t e r e s t e d r e a d e r appendix A i l l u s t r a t e s t h e r e l a t i o n s h i p of t h e impulse response f u n c t i o n s computed i n t h i s r e p o r t t o t h e s o l u t i o n s o b t a i n e d w i t h t h e a d j o i n t e q u a t i o n s and t o t h e maximum p r i n c i p l e of o p t i m i z a t i o n . This appendix a l s o p r o v i d e s a background f o r under s t a n d i n g t h e s t e e p e s t descent o p t i m i z a t i o n e q u a t i o n s .
Additional
t tf
to
t ime
f i n a l t inie i n i t i a l time
At
U
t i m e increment of c o n t r o l impulse
1 function ccritr :
AU
&(t)
d e s i r e d s t a t e value a t f i n a l t i m e
A\I'(t)
T h e method of s t e e p e s t d e s c e n t ( r e f s . 3 and 4) i s an i t e r a t i v e procedure t h a t has been used for optimizing t r a j e c t o r i e s . The p r o c e s s commences w i t h any nonoptimal t r a j e c t o r y from which a s l i g h t l y improved one i s derived. The improved t r a j e c t o r y i s t h e n u s e d as a new nominal t r a j e c t o r y , and t h e procedure i s r e p e a t e d u n t i l t h e optimum or n e a r l y optimum t r a j e c t o r y i s found.
General O u t l i n e The i t e r a t i o n i s as f o l l o w s : (1) E s t i m a t e a reasonable program t h a t n e a r l y s a t i s f i e s t h e t e r m i n a l c o n d i t i o n s for s p e c i f i e d i n i t i a l c o n d i t i o n s ; ( 2 ) determine impulse response f u n c t i o n s t h a t d e s c r i b e t h e e f f e c t s of s m a l l changes i n t h e c o n t r o l on t h e t e r m i n a l s t a t e and on t h e c o s t ( t h e q u a n t i t y t o b e minimized). These impulse response f u n c t i o n s , combined w i t h s t e e p e s t descent computations, i n d i c a t e t h e b e s t p J s s i b l e way of making s m a l l changes t o t h e c o n t r o l t o decrease t h e c o s t and s t i l l a r r i v e a t t h e end p o i n t ; (3) add t h i s change i n c o n t r o l t o t h e previous nominal c s n t r o l program. The r e s u l t i s a new t r a j e c t o r y .with a decreased c o s t ; ( 4 ) r e p e a t t h i s p r o c e s s u n t i l t h e r e e x i s t s o n l y a v e r y s m a l l change i n c o s t f o r each new t r a j e c t o r y , i n d i c a t i n g t h a t t h e c o n t r o l i s v e r y near a l o c a l optimum. A l i m i t value of t h e c o n t r o l may b e reached b e f o r e t h e c o s t i s completely minimized. I n t h i s c a s e , t h e p r o c e s s i s continued u n t i l t h e c o n s t r a i n t ( c o n t r o l l i m i t ) i s reached, s i n c e no f u r t h e r o p t i m i z a t i o n i s p o s s i b l e . The p r o p e r t i e s of s t e e p e s t descent o p t i m i z a t i o n have been documented i n many p r e v i a u s s t u d i e s ( e . g . , r e f s . 5 - 7 ) . Although t h i s meth.;d has been regarded as t h e most p r a c t i c a l i n many a p p l i c a t i o n s , t h e r e i s no guarantee t h a t it y i e l d s t h e a b s o l u t e optimum. That i s , f o r some i n i t i a l choices of t h e nominal t r a j e c t o r y , t h e f i n a l optimized t r a j e c t o r y may r e p r e s e n t o n l y a l o c a l optimum p a t h . Also, i n some a p p l i c a t i o n s , where t h e c o s t f u n c t i o n may be r e l a t i v e l y i n s e n s i t i v e t o c o n t r o l v a r i a t i o n s , a l a r g e number of i t e r a t i o n s may be necessary t o approach t h e optimum s o l u t i o n . Computation of Impulse RespJnse Functions To i l l u s t r a t e t h e computation of t h e impulse response f u n c t i o n s l e t t h e q u a n t i t y t o be minimized be noted as cp, t h e c s s t e v a l u a t e d a t t h e f i n a l t i m e . L e t t h e s t a t e variable a t t h e f i n a l t i m e be noted as \I' and l e t t h e d e s i r e d end-point value f o r t h i s b e denoted \I'd. F i g u r e 1 i l l u s t r a t e s t h e manner i n which t h e i n f l u e n c e of s m a l l c o n t r o l changes on cp and \I' are c a l c u l a t e d i n t h i s r e p o r t . The e q u a t i o n s of motion are f i r s t s o l v e d w i t h a c o n t r o l change, a p o s i t i v e c o n t r o l impulse a t t i m e t ,
superimposed upon t h e nominal c o n t r o l . During t h e next s o l u t i o n of t h e motion e q u a t i o n s , a n e g a t i v e c o n t r o l impulse o f t h e same magnitude i s i n s e r t e d a t t i m e t . The change i n c o s t , 4, and t h e change i n t e r m i n a l state, A@, are d e r i v e d f r o m t h e s e t w o s o l u t i o n s . I n a s i m i l a r manner t h e impulse response f u n c t i o n s can be p r o g r e s s i v e l y determined at s u c c e s s i v e times along t h e t r a j e c t o r y , and t h e technique by which t h e y a r e determined i s t h e most important f e a t u r e o f t h i s computing method. The computation of t h e f u l l h i s t o r y o f @ ( t ) and A $ ( t ) f o r t h e same c o n t r o l impulse a t d i f f e r e n t t i m e s along t h e t r a j e c t o r y i s termed one " i t e r a t i o n " s i n c e it corresponds to t h e p r e v i o u s o p t i m i z a t i o n s t u d i e s where one i t e r a t i o n w i t h t h e a d j o i n t e q u a t i o n s w a s u s e d t o compute e s s e n t i a l l y t h e s e same f u n c t i o n s along t h e t r a j e c t o r y . Appendix A shows t h e r e l a t i o n s h i p o f t h i s experimental d e t e r m i n a t i o n t o t h e s t a n d a r d d e t e r m i n a t i o n u s i n g a d j o i n t equa t i o n s . These e x p e r i m e n t a l impulse response f u n c t i o n s a r e shown t o be d i r e c t l y r e l a t e d t o t h e well-known "Green's f u n c t i o n s . I'
S t e e p e s t Descent Optimization Equations The impulse response f u n c t i o n s a r e u s e d i n t h e s t e e p e s t d e s c e n t technique t o modify t h e c o n t r o l toward t h e optimum i n t h e f o l l o w i n g manner (see appen dix A).
The g a i n s Q and Kq a r e c o n s t a n t s f o r each i t e r a t i o n . The g a i n % weights t h e impulse response f u n c t i o n f o r t h e c o s t ; i t s s i g n i s n e g a t i v e i n o r d e r t o decrease t h e c o s t . The magnitude o f % i s determined e x p e r i m e n t a l l y f o r each problem. Too l a r g e a g a i n may cause i n s t a b i l i t y i n t h e convergence procedure, while t o o s m a l l a g a i n may extend t h e time o f convergence. The g a i n K q must be c a l c u l a t e d f o r each i t e r a t i o n s o t h a t t h e t e r m Kq Aq(t) w i l l account f o r t e r m i n a l displacement due t o t h e optimizing term, Kcp & p ( t ) , and any t e r m i n a l displacement e r r o r f r o m t h e p r e v i o u s i t e r a t i o n w i l l be c o r r e c t e d . The f o r m u l a t i o n f o r c a l c u l a t i n g K,J i s as f o l l o w s :
S m a l l changes, SJr, i n t h e t e r m i n a l s t a t e , $, due t o s m a l l changes, 6 u ( t ) , i n c o n t r o l can b e approx.imated by:
64f= 2 A : At
J
%
6 u (t ) A J r (t )d t
~
where A u i s t h e h e i g h t of each c o n t r o l impulse and A t i s t h e time i n t e r v a l of each c o n t r o l impulse. S u b s t i t u t i n g Q ( t ) + K,,, A $ ( t ) f r o m (1) f o r 6u, w e have :
6$ = 2 nu 1 A t Jt'
t0
(3)
Solving f o r
$ (prevj.ous t e r m i n a l e r r o r ) we o b t a i n :
to J W S t e e p e s t descent optimization t e r m
<
and
The values f o r e q u a t i o n s ( 5 ) and (6) were computed as i n t e g r a l s a v e r t h e t i m e p e r i o d from t = to t o t = t f . The t i m e to w a s r e p r e s e n t e d b y a l o g i c s i g n a l a t t h e f i r s t r e p e t i t i v e computation i n an i t e r a t i o n c y c l e and t h e t i m e tf w a s r e p r e s e n t e d b y a l o g i c s i g n a l at t h e l a s t computation i n a m iteration c y c l e . It should b e n o t e d t h a t during t h o s e p u t s of t h e t r a j e c t o r y when t h e c o n t r o l w a s a t a c o n s t r a i n t l i m i t , no f u r t h e r o p t i m i z a t i o n w a s p o s s i b l e , and t h e i n t e g r a t i o n of e q u a t i o n s ( 5 ) and ( 6 ) w a s t h e r e f o r e n o t c a r r i e d o u t during t h o s e times. This type of computer mechanization w i l l b e i l l u s t r a t e d i n more d e t a i l f o r t h e f o l l o w i n g example problem.
Statement of t h e Problem The problem t o b e i l l u s t r a t e d i n t h i s s e c t i o n i s t h a t of determining t h e time h i s t o r y of t h e v a r i a t i o n of l i f t - d r a g r a t i o ( c o n t r o l L/D) t h a t must b e flown f o r a v e h i c l e r e t u r n i n g i n t o t h e e a r t h ' s atmosphere s o t h a t : The t o t a l h e a t i n g load t o t h e v e h i c l e i s minimized. The v e h i c l e arrives a t a d e s i r e d d e s t i n a t i o n . The c o n t r o l remains w i t h i n s p e c i f i e d c o n s t r a i n t s .
Mechanization The equations of motion, p r e s e n t e d i n appendix.B, were f o r a p o i n t mass i n p l a n a r motion 3ver a s p h e r i c a l n o n r o t a t i n g e a r t h . The v e h i c l e c h a r a c t e r i s t i c s and f l i g h t c o n d i t i o n s were t h o s e f o r a manned capsule r e t u r n i n g from e a r t h orbit. I n i t i a l c o n d i t i o n s were: Altitude
Velocity
F l i g h t - p a t h angle
Range t o d e s t i n a t i o n
F i n a l stopping c o n d i t i o n s were: Altitude
(250,000 f t ) (25,000 f p s )
(1000 m i )
30.48
~nn
(ioo,ooo f t )
C o n t r o l l i m i t s were:
o <L<
0 . 5
The main hardware elements u s e d i n t h e h y b r i d computer mechanization were: Hardware elements Analog computer P a r a l l e l d i g i t a l logic units Track and s t o r e a m p l i f i e r s D i g i t a l d e l a y l i n e memories1 ( w i t h D/A and A/D c o n v e r t e r s ) Program t a s k S o l u t i o n of t r a j e c t o r y e q u a t i o n s Logic c o n t r o l of program S t o r a g e o f end-point v a l u e s S t o r a g e of c o n t r o l time h i s t o r y
The 64-word d i g i t a l s e r i a l memory u n i t (1.3 b i t s p e r word) w a s accessed w i t h t h e f a s t e s t allowable c o u n t e r rate (0.002 s e c ) . A complete 64-word c y c l e w a s t h e n a v a i l a b l e e v e r y 0.128 second. To a l l o w a complete s o l u t i o n o f t h e t r a j e c t o r y e q u a t i o n s w i t h i n 0.128 second, t h e analog computer w a s time s c a l e d a t 3750 t o 1. Results
A s e r i e s of computer runs f o r t h i s problem i s i l l u s t r a t e d i n f i g u r e 3. F i g u r e 3 ( a ) p r e s e n t s t h e d e t a i l s of each r e p e t i t i v e t r a j e c t o r y computation and f i g u r e 3 ( b ) p r e s e n t s t h e d e t a i l s Jf t h e o v e r a l l convergence onto t h e optimum nominal c o n t r o l . F i g u r e 3 ( a ) shows j u s t a p o r t i o n o f i t e r a t i o n no. 0 as p r e sented i n f i g u r e 3 ( b ) .
I n t h e upper t r a c e of f i g u r e 3 ( a ) t h e c o n t r o l impulses a r e superimposed upon t h e i n i t i a l nominal c o n t r o l . Each c o n t r o l impulse had a magnitude o f L/D = 20.25 and a time increment of one c l o c k p u l s e (0.002 s e e ) . This c o n t r o l impulse w a s chosen because it gave v a r i a t i o n i n t h e f i n a l range and h e a t l o a d on t h e o r d e r o f f 5 p e r c e n t . The range and i n t e g r a t e d h e a t l o a d along each of t h e r e p e t i t i v e t r a j e c t o r i e s a r e p r e s e n t e d along w i t h t h e f i n a l v a l u e s as t h e y a r e s t o r e d w i t h t r a c k and s t o r e a m p l i f i e r s . The d i f f e r e n c e between t h e s e s t o r e d q u a n t i t i e s f o r each two p a i r s o f subsequent runs i s A@ r e p r e s e n t i n g t h e range impulse response f u n c t i o n s and &p representing the heat load impulse response f u n c t i o n s . I n f i g u r e 3 ( b ) t h e f i r s t 10 i t e r a t i o n s ( e a c h i t e r a t i o n c o n s i s t s of 128 r e p e t i t i v e computations) of t h e converging o p t i m i z a t i o n procedure are i l l u s t r a t e d along w i t h t h e f i n a l i t e r a t i o n . During t h e convergence procedure t h e range i s seen t o vary s l i g h t l y about t h e d e s i r e d value o f 1609 km (1000 m i l e s ) . The h e a t l o a d i s shown t o b e reduced about 10 p e r c e n t during t h e f i r s t t e n i t e r a t i o n s and diminished t o about 12 p e r c e n t from t h e o r i g i n a l w i t h t h e f i n a l (optimum) c o n t r o l v a r i a t i o n .
A s e r i e s of t r a c k and s t o r e a m p l i f i e r s could a l s o have been u s e d for this stxage.
The modifying c o n t r o l shown i n t h e f i g u r e i s t h e sum Q nCp + Kq A $ . For t h i s s e r i e s of runs a c o n s t a n t v a l u e of E;lp = - 2 . 5 ~ 1 0 - ~ [ u n i tof s (L/D)/(J/m2)] w a s found t o , a l l o w a f a i r l y r a p i d convergence while maintaining program s t a b i l i t y . The v a l u e of K$ w a s c a l c u l a t e d by e q u a t i o n ' ( 4 ) t o b e t h a t value f o r each i t e r a t i o n such a s ' t o a l l o w convergence i n t h e s t e e p e s t descent manner. I n t h e lower t r a c e of f i g u r e 3 ( b ) t h e nominal c o n t r o l i s recorded as it i s r e a d o u t of s e r i a l memory every 128 +1 counter p u l s e s ( w i t h Logic no. 8 ) . This g i v e s a convenient time h i s t o r y t o show t h e manner i n which t h e c o n t r o l has been modified during each i t e r a t i o n . The c o n t r o l i s seen t o be l i m i t e d w i t h i n 0 < L/D < 0.5. This w a s achieved simply by l i m i t i n g t h e output of t h e s e r i a l memory t o w i t h i n t h e s e v a l u e s .
As can be seen, t h e optimum c o n t r o l v a r i a t i o n f o r t h i s case i s a bangbang c o n t r o l . With t h e s t e e p e s t descent method, it i s u s u a l l y found t h a t n e a r optimum c o n t r o l can be achieved i n t h e f i r s t few i t e r a t i o n s , b u t t h a t t o "square up t h e corners'' and achieve f u l l optimum c o n t r o l a number of f u r t h e r i t e r a t i o n s (on t h e o r d e r of 20 t o 50) are r e q u i r e d .
Convergence and S t a b i l i t y Considerations One of t h e important a s p e c t s of any o p t i m i z a t i o n scheme i s t h e a b i l i t y t o converge, w i t h i n a reasonable time, onto t h e optimum s o l u t i o n . For t h e p a r t i c u l a r method i n th'is r e p o r t it has been p o i n t e d o u t t h a t t h i s convergence p r i m a r i l y depends upon choosing t h e p r o p e r v a l u e of t h e g a i n $. I n t h e example l e s s than 2 . 5 ~ 1 0 - ~ problem, it w a s found t h a t using any value of [ u n i t s of (L/D) /( J / m 2 ) ] r e s u l t e d i n smooth convergence; however, t h e convergence time (which w a s p r o p o r t i o n a l t o 1 / % ) became long. For i n i t i a l values of g r e a t e r t h a n twice t h e aforementioned value t h e convergence became u n s t a b l e , t h a t i s , t h e modifying 6 c o n t r o l became s o l a r g e as t o change d r a s t i c a l l y t h e s t a t e v a r i a b l e from t h e i r nominal f i n a l v a l u e s .
KCP
It w a s found t h a t , as t h e optimum c o n t r o l w a s approached ( a f t e r about 10 i t e r a t i o n s ) , t h e value of % could be i n c r e a s e d and convergence s t a b i l i t y maintained, because i n t h e s e examples t h e c o n t r o l approached bang-bang and o n l y s m a l l changes were p o s s i b l e n e a r t h e s a t u r a t i o n limits. The value of K$ i n t h e s e c a s e s could be i n c r e a s e d t o about 10 times t h e aforementioned value, b u t i n c r e a s i n g it much f a r t h e r (without analog v o l t a g e s c a l i n g changes) would a l l o w extraneous computer n o i s e t o b e magnified t o a p o i n t where it caused n o t a b l e random f l u c t u a t i o n s i n t h e computations.
For a reasonable value of g a i n , such as t h a t used f o r t h e example problem, t h e time t o converge t o a n e a r optimum s o l u t i o n ( 1 1 i t e r a t i o n s ) w a s about 3 minutes, and t o a f u l l optimum s o l u t i o n (30 i t e r a t i o n s ) , about 8 minutes. F u r t h e r changes i n t h e s e convergence times, of course, depend upon s e v e r a l f a c t o r s . For i n s t a n c e , t h e convergence time i n t h i s computing setup w a s i n p r o p o r t i o n t o n2, where n i s t h e number of p o i n t s d e s c r i b i n g t h e c o n t r o l time h i s t o r y (64 p o i n t s f o r t h e case c i t e d ) . Also t h e allowable s o l u t i o n r a t e s of t h e computer elements d i r e c t l y a f f e c t t h e convergence time. The continuing development and use of high-speed computing elements w i l l c e r t a i n l y r e s u l t i n convergence times s m a l l e r t h a n t h e time c i t e d .
The r e s u l t s o b t a i n e d b y t h i s computing method appear s a t i s f a c t o r y f o r e n g i n e e r i n g purposes; however, t h e u s u a l disadvantages o f analog computation are i n h e r e n t w i t h t h i s method. These disadvantages are p r i m a r i l y concerned w i t h t h e extraneous n o i s e i n t h e computations and t h e a b s o l u t e accuracy ( o n l y t o w i t h i n about 1 p e r c e n t ) of analog computer.
CONCLUDING
REMARKS
This r e p o r t has p r e s e n t e d a p e r t u r b a t i o n method of computing optimum t r a j e c t o r i e s . The technique u s e s fast-time r e p e t i t i v e computations i n d e t e r mining c o n t r o l impulse response f u n c t i o n s and r e q u i r e s o n l y t h e dynamic s o l u t i o n o f t h e s t a t e e q u a t i o n s ; whereas o t h e r p e r t u r b a t i o n computing t e c h n i q u e s have r e q u i r e d t h e s o l u t i o n of a d d i t i o n a l a d j o i n t e q u a t i o n s . A h y b r i d computer w a s used i n applying t h e method t o t h e s t e e p e s t descent o p t i m i z a t i o n of r e e n t r y t r a j e c t o r i e s . Mechanizing t h e computer f o r t h i s type of problem w a s r e l a t i v e l y simple, and near Q p t i m m t r a j e c t o r i e s could be o b t a i n e d i n about 3 minutes of computing t i m e and full optimum t r a j e c t o r i e s i n about 8 minutes. The advantage o f t h e technique o u t l i n e d here over a l t e r n a t i v e techniques i s t h a t t h e i n v e s t i g a t o r need n o t be familiar w i t h or u s e an a u x i l i a r y s e t of l i n e a r a d j o i n t e q u a t i o n s f o r t h e o p t i m i z a t i o n . This technique &ses, however, r e q u i r e a l a r g e number of dynamic s o l u t i o n s of t h e s t a t e e q u a t i o n s , b u t t h i s computing t a s k appears p r a c t i c a l w i t h t h e high-speed r e p e t i t i v e computation procedure p r e s e n t e d i n t h i s r e p o r t . Ames Research Center N a t i o n a l Aeronautics and Space Administration Moffett F i e l d , C a l i f . , J a n . 24, 1966
APPENDIX A
RELATIONSHIP O F
L e t the s t a t e e q u a t i o n s b e n o t e d as
u(t)
It i s w e l l known t h a t any s m a l l change i n t h e c o n t r o l q u a n t i t y along t h e nominal p a t h w i l l determine a change 6(p i n any q u a n t i t y f i n a l t i m e as follows:
6u(t) cp a t t h e
The q u a n t i t y
A ( a f / d u ) w i t h i n t h e i n t e g r a l i s kn3wn a s Green's f u n c t i 3 n .
1 0
6 c q
is to
ti
L J !
t0
,Nominal control
. . . .=
tf
( . p T af z ) 2 nu
At
This then r e p r e s e n t s t h e correspondence between t h e impulse response f u n c t i o n s c a l c u l a t e d i n t h e text and t h o s e s o l v e d by t h e a d j o i n t s o l u t i o n . Greens f u n c t i o n e v a l u a t e d a t any t i m e , t , along t h e t r a j e c t o r y can b e noted as
and
R e l a t i o n s h i p t o t h e Maximum P r i n c i p l e
The maximum p r i n c i p l e ( r e f . 8) s t a t e s that a n e c e s s a r y c o n d i t i o n f o r a minimum (maximum) of t h e c o s t f u n c t i o n i s t h a t t h e Hamiltonian be maximized (minimized) w i t h r e s p e c t t o t h e c o n t r o l at a l l t i m e s . The Hamiltonian can be w r i t t e n as
H = A f
(A91
11
A,(af/au)
i s Green's
T h i s , then, r e p r e s e n t s t h e r e l a t i o n s h i p between t h e e x p e r i m e n t a l l y det emnine d impulse r e sponse f u n c t i o n s and t h e Hamiltonian. It i s i n t e r e s t i n g t h a t t h e maximum p r i n c i p l e can be a p p l i e d through t h i s r e l a t i o n s h i p without any need f o r s o l v i n g t h e a d j o i n t e q u a t i o n s . S t e e p e s t Descent Equatlons
>
a
cp
f o r a given value of
t0
6u2(t)dt
is
3) when
12
where Kcp and % are c o n s t a n t s . T h i s i s t h e s t e e p e s t descent ( o r a s c e n t ) direction t o the mini" (or maximum) cp. When t h e r e are no s t a t e o r c o n t r o l c o n s t r a i n t s , t h e s t e e p e s t descent procedure converges toward t h e n e c e s s a r y c o n d i t i o n s f o r an optimum s o l u t i o n as p r e v i o u s l y noted
Q(t> 4- -q
where i n t h e s t e e p e s t descent e q u a t i o n s , with the terminal constraint s a t i s f i e d .
a+>=
K q / %
( A16 1
on t h e o p t i m s o l u t i o n
= 7,
APPENDIX B
REENTRY T W C T O R Y EQUATIONS
The f o l l o w i n g e q u a t i o n s were programmed on t h e analog computer for t h e example c o n s i d e r e d i n this r e p o r t . These s i m p l i f i e d e q u a t i o n s were d e r i v e d f o r f l i g h t w i t h i n t h e atmosphere and t h e primary assumptions i n c l u d e : a s p h e r i c a l n o n r o t a t i n g e a r t h , s m a l l f l i g h t - p a t h a n g l e s , and a c o n s t a n t g r a v i t y term. The d e r i v a t i o n of t h e s e s i m p l i f ied e q u a t i o n s and t h e i r a p p l i c a b i l i t y have been c o n s i d e r e d i n a number of r e p o r t s . See f o r i n s t a n c e r e f e r e n c e 9. The e q u a t i o n s are
$ =
st'
t0
V dt
where
L D
h
cp
P
t o t a l h e a t i n p u t , J/m2
atmosphere d e n s i t y , 1.225 e radius from earth center, -h/7160
%/m3
r
g
6.43~10~ m
9.8 m/s2
(?)
14
REFERENCES
Optimization Techniques.
2. Balakrishnan, A. V. ; and Neustadt, Lucien W., eds. : Computing Methods in Optimization Problems. Academic Press, 1964.
I.
3. Bryson, Arthur E. ; and Denham, Walter F. : A Steepest-Ascent Method for Solving Optimum Programming Problems. Raytheon Rep. BR 1303, 1961. Also J. Appl. Mech., vol. 29, no. 2, June 1962, pp. 247-257.
4 . Kelley, H. J. : Gradient Theory of Optimal Flight Paths. ARS J., 30, no. 10, Oct. 1960, pp. 947-954.
5. Bryson, A. E.; Denham, W. F.; Carroll, F. J.; and
Mikami,
vol.
tion of Lift or Drag Programs That Minimize Reentry Heating. space Sei., vol. 29, no. 4, April 1962, pp. 420-430.
6. Blanton, H. Elmore, ed. : Three-DimensionalTrajectory Optimization Study. Pt. 1 - Optimum Programming Formulation. NASA CR-57030, 1964.
(Supersedes Aero. Sys. Div. Rep. ASD-TDR-62-29?and Raytheon Rep. Br-1759-1).
C , o n t r o I im pu Ise
Control, U
71
State va Iue
cos
t
t0
1 1 1 1 1 1
tf to Trajectory t i me 1 1 1 1 1 1 1 1 Computer t i m e
tf
1 1 1 1 1 1
I.
c.
91 (Counter)
Analog
f 6
impulse O
I
L
I I
-Control impulse
3+2 (Start)
I I I
+
I----I
#3
(Stop)
I
I I
----Dotted
#2
-"
L
(Stop)
#3
Solution time
J
I
#A
(itoie
(Store)
#5
#7
(- Control )
I
I
impulse)
#8
(Modify)
I
( b ) Problem l o g i c .
Figure 2.
Concluded.
rc
.5 Control L .25 -
n
'
/Control
il
~ y , I y
0-
2 IW-
-i
0.1 Second of computer time
( a ) D e t a i l s of r e p e t i t i v e t r a j e c t o r y computations.
Figure 3.- Recorded time h i s t o r i e s for r e e n t r y t r a j e c t o r y optimization; minimum h e a t with t e r m i n a l range constraint
? Iu
Iu
P
+
IO
m2
0-
2-
%- 1 1
Figure
3.- Concluded,
The aeronautical and space activities of the United States shall be conducted so as to contribute . . . to the expansion of human knowl edge of phenomena in the atmosphere and space. The Administration shall provide f o r the widest practicable and appropriate dissemination of information concerning its actiuities and the results thereof.
-NATIONAL AERONAUTICS AND SPACE ACT OF 1958
\ i
D.C. PO546