Professional Documents
Culture Documents
JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide range of content in a trusted
digital archive. We use information technology and tools to increase productivity and facilitate new forms of scholarship. For more information about
JSTOR, please contact support@jstor.org.
Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at
http://about.jstor.org/terms
Mathematical Association of America is collaborating with JSTOR to digitize, preserve and extend access
to Mathematics Magazine
This content downloaded from 200.145.3.73 on Fri, 19 May 2017 17:44:25 UTC
All use subject to http://about.jstor.org/terms
AN ELEMENTARY INTRODUCTION TO THE
CALCULUS OF VARIATIONS
Magnus B. Hestenes
This content downloaded from 200.145.3.73 on Fri, 19 May 2017 17:44:25 UTC
All use subject to http://about.jstor.org/terms
250 MATHEMATICS MAGAZINE (May-June
2
1 C
Fig. 2.1
Here and elsewhere y' denotes the slope dy/dx of the curve at the point
(x,y). Thus, the arc of shortest length is one which minimizes I(C).
If we ask for the arc joining 1 and 2, which when revolved about the
x-axis generates the surface of revolution of minimum area, then this
arc must minimize the integral
This integral when multiplied by 21T represents the area of the surface
of revolution generated by C.
Let us now consider a path of least time joining two given -points 1
and 2 subject only to gravitational force. To this end it is convenient
to choose the y-axis to be the vertical axis directed downward as
indicated in Figure 2:2. Let us suppose therefore that we have a bead
P sliding along C from 1 to 2 with an initial speed v, at 1
This content downloaded from 200.145.3.73 on Fri, 19 May 2017 17:44:25 UTC
All use subject to http://about.jstor.org/terms
1950) INTRODUCTION TO THE CALCULUS OF VARIATIONS 251
, mg
Ir Fig. 2:2
y
ds/dt, we have the well known relations
dv dy
m t = mg sin O = mg d or v dv g dy.
-2 = 2gy + b.
Since y = Yl, v = v1 at t = 0
v 2 2g(y - yo), o =
2g
to = ) - - dx.
This content downloaded from 200.145.3.73 on Fri, 19 May 2017 17:44:25 UTC
All use subject to http://about.jstor.org/terms
252 MATHEMATICS MAGAZINE (May-June
(2:6) d =
(2:7) d (J Y = f P
f -IF P f 1=-
X -ax' fy ay y' ay
Let us apply this result to the integral (2:4) in which
f = ~ 2
fi = Y,(l +Y )S .)
Here r2
fy = r yr(1+y ) , f, = y y (+y) I
This content downloaded from 200.145.3.73 on Fri, 19 May 2017 17:44:25 UTC
All use subject to http://about.jstor.org/terms
1950) INTRODUCTION TO THE CALCULUS OF VARIATIONS 253
y = b(l +Y,2)2r
Letting y' = tanG we obtain
I 1
Consequently dy b
dx = I -= (sec 9)r do.
y r
An integration yields the following:
b I 1
(2:10) x = a + bj (sec t)r dt, y = b(sec 9)r,
r
The equations (2:12) and (2:13) are obtained from (2:10) by setting
r =- 1, r = - respectively and integrating. It is clear that through
This content downloaded from 200.145.3.73 on Fri, 19 May 2017 17:44:25 UTC
All use subject to http://about.jstor.org/terms
254 MATHEMATICS MAGAZINE (May-June
every pair of points in the upper half plane there is but one circular
arc having its center on the x-axis. Consequently, in the case r = - 1
there is a unique extremal joining a pair of points in the upper half
plane. This result remains true whenever r<0 but it is more difficult
to prove and we shall not do so here. If we accept the fact that
minimizing arcs exist when r< 0, then the extremals must be minimizing
since they are the only ones that satisfy the first necessary condition.
The situation is quite different when r> O. A complete statement
of the result can be made as follows.
If r> O, the extremals (2:10) are concave upward and the line
x=a is a line of symmetry. Given a pair of points 1 and 2 in the upper
half plane there may exist two, one or no extremals joining them. If two
extremals exist, then one is minimizing and the other is not. If but one
extremal exists, it fails to be minimizing. If no extremal joins 1
and 2, there is no minimizing arc above the x-axis. If r= J the
extremals (2:10) are parabolas
(2:14) y = b[l + [x - 2
This represents the totality of extremals through the point (1,0). This
family has the parabola
x2
This content downloaded from 200.145.3.73 on Fri, 19 May 2017 17:44:25 UTC
All use subject to http://about.jstor.org/terms
1950) INTRODUCTION TO THE CALCULUS OF VARIATIONS 255
Fig. 2:3
I = f:af(x,y,y')dx
This content downloaded from 200.145.3.73 on Fri, 19 May 2017 17:44:25 UTC
All use subject to http://about.jstor.org/terms
256 MATHEMATICS MAGAZINE (May-June
The expression 2
I1(n = {fy q+ fy 8}dx
d_
dx y y +77dxfY
we have Xz
1I(7) dx(f - fy )dx+fyJx
d
fy - x v
as was to be proved. The equation (2:7) follows from the identity
d (f-dx
Y f7) -fx + Y'[-xf
y ~~~dxY f Y d - f I = 0
The ftmction 7(x) used in this derivation is called a variation and
is frequently denoted by Sy. If we denote by SI the first variation of
I, then the formula for the first variation suggests the formalism
X2 X2 X2
SI =Sf f dx = f Sf dx = I {f7Sy + f>1Sy'}dx
x1 x 1 x1
This content downloaded from 200.145.3.73 on Fri, 19 May 2017 17:44:25 UTC
All use subject to http://about.jstor.org/terms
1950) INTRODUCTION TO THE CALCULUS OF VARIATIONS 257
Incidentally, since I(b) _ 1(0) we not only have I'(O) = 0 but also
I"(0) t 0, where I"(b) is the second derivative of 1(b). Computing IN(O)
and-calling the result 12(77) we obtain
in a class of arcs
d dx
(f y'
fY)
f ,)fO fx
.fy( + fy
f y y d i
which can be verified by carrying out the indicated differentiations.
As before, a maximizing or a. minimizing arc must satisfy Euler's
equations (4:1) and (4: 2).
In three-dimensional space with coordinates (x,y,z) the integral
I(C) is of the form X2
I(C) = ff(x*yz*y'z')dx
Xi
This content downloaded from 200.145.3.73 on Fri, 19 May 2017 17:44:25 UTC
All use subject to http://about.jstor.org/terms
258 MATHEMATICS MAGAZINE (May-June
-f I =-
dx f = fy' fz fz
(4:3)
d (f _y fy - z'fz) =fx'
One of the important applications of the results described above lies
in the field of mechanics. Let U(x,y,z) be a potential function associat-
ed with a conservative field of force. Then the equations
I = f(T-U)dt
L d L Ld L -L
dt x x dt y Y dt Z, Z
This content downloaded from 200.145.3.73 on Fri, 19 May 2017 17:44:25 UTC
All use subject to http://about.jstor.org/terms
1950) INTRODUCTION TO THE CALCULUS OF VARIATIONS 259
T = C (;2 + r2 2), U= _C
r r
d L
dt r r
d Le=L
ddt Lb L
and
T + U = h = constant.
Lb = m r2 2 constant or r2 b'
Inasmuch as r dr we see that
2 dr 2 2 4 ~~3 2 mb2__
a2 d + r2] h r + c r2, a
k I
1 - e cos (0-y)
defines a parametric arc provided x(t) and y(t) are continuous and have
piecewise continuous derivatives xi and y' with x'2 + Y,2 / 0. Th
integral to be minimized is of the form
This content downloaded from 200.145.3.73 on Fri, 19 May 2017 17:44:25 UTC
All use subject to http://about.jstor.org/terms
260 MATHEMATICS MAGAZINE (May -June
t2 t2
I(C) = f f(y,' )dt = f f(y * ,y,yy, ,y)dt,
t1 ~~~~tl
where
(5:2) dt =y fy (i=1,.,n).
Jt2 2 -2
J(C) = f +f7 T7 dt = constant.
This content downloaded from 200.145.3.73 on Fri, 19 May 2017 17:44:25 UTC
All use subject to http://about.jstor.org/terms
1950) INTRODUCTION TO THE CALCULUS OF VARIATIONS 261
has the property that its first variation 8(I-XJ)- vanishes along C.
It follows that C must satisfy the Euler equations
dF -F 4F=F.
dt X x= dt Y y
Consequently
) d ty A~xx d (X Ky x'
x + y = 1
- xx =y, =.Y'I = xI
Multiplying the first of these by y'/X and the second by x'/, we obtain
by addition the relation
The solution to our problem therefore must have constant curvature 1/A
and hence must be a circle of radius A, as was to be proved.
The problem just described is a special case of a large class of
problems, conmonly called isoperimetric problems. These problems consist
of maximizing or minimizing an integral I in a class of arcs along
which certain integrals J1, ** ,Jp take on fixed values. These problems
may be in parametric or in non parametric form.
Perhaps the simplest isoperimetric problem is. that of minimizing
an integral
This content downloaded from 200.145.3.73 on Fri, 19 May 2017 17:44:25 UTC
All use subject to http://about.jstor.org/terms
262 MATHEMATICS MAGAZINE (May-June
I(C) = fxdx
xl
C: y(x) (xl_x_X2)
which join two fixed points 1 and 2 in the xy-plane which satisfies a
single isoperimetric condition
rx
(6:2) d F. F
where
F(x,y,y') = f(x,y,y') - Ag(x,y,y')
From this fact one should not conclude that C minimizes I-AJ. This
can perhaps best be illustrated by the following example. It is a known
fact that a hanging chain assumes a position so that its center of
gravity is as low as possible. Thus, a perfectly flexible chain of con-
stant density and of prescribed length can be used to define the
solution (with the y-axis as the vertical axis) of the following minimum
problem: To find among all arcs C in the xy-plane joining two fixed
points 1 and 2 and having a given length
J(C) = (1 +yi2)dx = X
This content downloaded from 200.145.3.73 on Fri, 19 May 2017 17:44:25 UTC
All use subject to http://about.jstor.org/terms
1950) INTRODUCTION TO THE CALCULUS OF VARIATIONS 263
I(C) = fdx
xl
y
2
Fig. 6.1
= X2 2 2
I - AJ=J (y-A)(l+y' ) dx
(X2
I(C) = fJ .- y,y' ) dx
xi
in a class of arcs
C: y(x) (x <x<x2)
joining two fixed points 1 and 2. If now we permit the point 2 to be
an arbitrary point on a prescribed curve
D: x = X(u), y = Y(u)
as indicated in the figure, (next page), one encounters two new con-
cepts, namely, transversality and focal points. In order to describe
these concepts let us first recall certain facts concerning the problem
of finding the shortest line joining a fixed point 1 to a point 2 on D.
If C is such a line segment then it must have two properties:
This content downloaded from 200.145.3.73 on Fri, 19 May 2017 17:44:25 UTC
All use subject to http://about.jstor.org/terms
264 MATHEMATICS MAGAZINE (May-June
y XU., Yu
czui
-~~~ - ~x
Fig. 7:1
(7:1) (f - y' fy I) xu + fy I Yu = ?
must hold, where 1, y' and Xu, Yu denote the vectors tangen
respectively, as indicated in Figure 7:1.
X + y' Yu = 0.
Consequently transversality becomes orthogonality in this case.
In order to establish (7:1) let
y(x,u) xi A: <X(u)
be a one parameter family of arcs joining the point 1 to the point
(X(u), Y(u)) on D and containing C for u = uo. Along this fa
integral I has the value
X(u)
This content downloaded from 200.145.3.73 on Fri, 19 May 2017 17:44:25 UTC
All use subject to http://about.jstor.org/terms
1950) INTRODUCTION TO THE CALCULUS OF VARIATIONS 265
d =
dx Y fy
along C. Using this fact we obtain
X2 X2
E D
Fig. 7:2
This content downloaded from 200.145.3.73 on Fri, 19 May 2017 17:44:25 UTC
All use subject to http://about.jstor.org/terms
266 MATHEMATICS MAGAZINE (May-June
(X2(X
I(C) = f xyy')dx, J(C) = f (x,y,y') dx)
X1 X1
I(C)
J(C)
in the class of arcs
y(x) (X1 xX < X2)
I _J81-.18J
J J~~2
(8:1) J (CO )
we see that
81 - X8J = 3 j2(f_Ag) dx = 0
X1
along CO for all such variations Sy. Consequently CO must satisfy the
Euler equations
d
dx (y J-Ay . f-A
dx
fYfYy#
d
-g)
In order to illustrate this result consider the problem of minimizing
the ratio
J 2 dx
f dx
This content downloaded from 200.145.3.73 on Fri, 19 May 2017 17:44:25 UTC
All use subject to http://about.jstor.org/terms
1950) INTRODUCTION TO THE CALCULUS OF VARIATIONS 267
Here f = y,2 and g = y2. Consequently equation (8:2) takes the form
y" + Ay = 0.
y= s in 3 x
x2
{X2 E- 2
y = A + b cos h xb
The line y = A is the directrix of this catenary and hence cannot be the
y-coordinate of the centroid, as was required. The equations (8:1) and
(8:2) for this problem accordingly have no solution,
This content downloaded from 200.145.3.73 on Fri, 19 May 2017 17:44:25 UTC
All use subject to http://about.jstor.org/terms