Professional Documents
Culture Documents
Nonholonomic Systems
Hernan Cendra
Departamento de Matematica
Universidad Nacional del Sur, Av. Alem 1254
8000 Bahia Blanca, Argentina
uscendra@criba.edu.ar
Jerrold E. Marsden
Control and Dynamical Systems 107-81
California Institute of Technology
Pasadena, CA 91125
marsden@cds.caltech.edu
Tudor S. Ratiu
Departement de Mathematiques
Abstract
This paper surveys selected recent progress in geometric mechanics, focussing on
Lagrangian reduction and gives some new applications to nonholonomic systems, that
is, mechanical systems with constraints typied by rolling without slipping.
Reduction theory for mechanical systems with symmetry has its roots in the classical works in mechanics of Euler, Jacobi, Lagrange, Hamilton, Routh, Poincare and
others. The modern vision of mechanics includes, besides the traditional mechanics
of particles and rigid bodies, eld theories such as electromagnetism, uid mechanics,
plasma physics, solid mechanics as well as quantum mechanics, and relativistic theories,
including gravity.
Symmetries in mechanics ranges from obvious translational and rotational symmetries to less obvious particle relabeling symmetries in uids and plasmas, to subtle
symmetries underlying integrable systems. Reduction theory concerns the removal of
symmetries and utilizing their associated conservation laws. Reduction theory has
been extremely useful in a wide variety of areas, from a deeper understanding of many
physical theories, including new variational and Poisson structures, stability theory,
integrable systems, as well as geometric phases. Much eort has gone into the development of the symplectic and Poisson view of reduction theory, but recently the
Lagrangian view, emphasizing the reduction of variational principles has also matured.
CONTENTS
While there has been much activity in the geometry of nonholonomic systems, the
task of providing an intrinsic geometric formulation of the reduction theory for nonholonomic systems from the point of view of Lagrangian reduction has been somewhat
incomplete. One of the purposes of this paper is to nish this task. In particular, we
show how to write the reduced Lagrange dAlembert equations, and in particular, its
vertical part, the momentum equation, intrinsically using covariant derivatives. The
resulting equations are called the Lagrange-dAlembert-Poincar
e equations.
Contents
1 An Introduction to Reduction Theory
15
28
5 The Snakeboard
31
36
The Purpose of this Paper. This paper outlines some features of general reduction
theory as well as the geometry of nonholonomic mechanical systems. In addition to this
survey nature, there are some new results. Our previous work on the geometric theory of
Lagrangian reduction provides a convenient context that is herein generalized to nonholonomic systems with symmetry. This provides an intrinsic geometric setting for many of the
results that were previously understood primarily in coordinates. This solidication and
extension of the basic theory should have several interesting consequences, some of which
are spelled out in the nal section of the paper. Two important references for this work
are Cendra, Marsden and Ratiu [2000], hereafter denoted CMR and Bloch, Krishnaprasad,
Marsden and Murray [1996], hereafter denoted BKMM.
A Brief History of Reduction Theory. Reduction theory has its origins in the classical
work of Euler, Lagrange, Hamilton, Jacobi, Routh and Poincare and is one of the fundamental tools in the study of mechanical systems with symmetry. Several classical cases
of reduction (using conservation laws and symmetry to create smaller dimensional phase
spaces), such as Rouths elimination of cyclic variables and Jacobis elimination of the node,
were developed by these founding masters. The ways in which reduction theory has been
generalized and applied since that time has been rather impressive.
Routh (circa 1850) pioneered reduction theory for Abelian groups, that is, for Lagrangians with cyclic variables. By 1890, Lie had discovered many of the basic structures
in symplectic and Poisson geometry and their link with symmetry. Meanwhile, Poincare
in 1901 discovered the generalization of the Euler equations for rigid body mechanics and
uids to general Lie algebras. From then to about 1965, the subject lay, to some extent,
dormant, in a state of gestation.
The modern era of reduction theory began with the fundamental papers of Arnold [1966]
and Smale [1970]. Arnold focussed on systems on Lie algebras and their duals, as in the
works of Lie and Poincare, while Smale focussed on the Abelian case giving, in eect, a
modern version of Routh reduction.
The description of many physical systems such as rigid bodies and uids requires noncanonical Poisson brackets and constrained variational principles of the sort implicitly studied by Lie and Poincare. It is quite astounding how much rediscovery and duplication of
the basic work of Routh, Lie and Poincare occurred in the rst two thirds of the twentieth
century.
A basic noncanonical Poisson bracket on g , the dual of a Lie algebra g, is called, following
Marsden and Weinstein [1983], the Lie-Poisson bracket. These structures were known to
Lie around 1890, although Lie seemingly did not recognize their importance in mechanics.
The symplectic leaves in these structures, namely the coadjoint orbit symplectic structures,
although implicit in Lies work, were discovered by Kirillov, Kostant, and Souriau in the
1960s. At rst it may seem remarkable that the dual of any Lie algebra carries, in a natural
way, a Poisson structure. However, this structure is nothing but the natural quotient Poisson
structure induced from the canonical bracket structure on T G by the quotient operation
g = (T G)/G. This fact is explained in, for example, Marsden and Ratiu [1999].
To synthesize the Lie algebra reduction methods of Arnold [1966] with the techniques of
Smale [1970] on the reduction of cotangent bundles by Abelian groups, the work of Marsden and Weinstein [1974] developed reduction theory in the general context of symplectic
manifolds and equivariant momentum maps; related results, but with a dierent motivation
and construction (not stressing equivariance properties of momentum maps) were found by
Meyer [1973].
The construction is now standard: let (P, ) be a symplectic manifold and suppose that
a Lie group G acts freely and properly on P by symplectic maps. The free and proper
assumption is to avoid singularities in the reduction procedure as is discussed later. Assume
that this action has an equivariant momentum map J : P g . Then the symplectic
reduced space J1 ()/G = P is a symplectic manifold in a natural way; the induced
symplectic form is determined uniquely by = i where : J1 () P
is the projection and i : J1 () P is the inclusion. If the momentum map is not
equivariant, Souriau [1970] discovered how to centrally extend the group (or algebra) to
make it equivariant. Using either this construction or a modication of the original method,
one can carry out nonequivariant reduction as well.
The paper of Marsden and Weinstein [1974] showed that coadjoint orbits are, in a natural
way, symplectic reduced spaces. In the reduction construction, one chooses P = T G, with
G acting by (say left) translation, the corresponding space P is identied with the coadjoint
orbit O through together with its coadjoint orbit symplectic structure. Likewise, the LiePoisson bracket on g is inherited from the canonical Poisson structure on T G by Poisson
reduction, that is, by simply identifying g with the quotient (T G)/G. It is not clear who
rst explicitly observed this, but it is implicit in many works such as Lie [1890], Kirillov
[1962, 1976], Guillemin and Sternberg [1980], and Marsden and Weinstein [1982, 1983], and
is explicit in the papers Marsden, Weinstein, Ratiu, Schmid and Spencer [1982] and Holmes
and Marsden [1983].
Kazhdan, Kostant and Sternberg [1978] showed that P is symplectically dieomorphic
to an orbit reduced space P
= J 1 (O )/G and from this it follows that P are the symplectic leaves in P/G. This paper was also one of the rst to notice deep links between reduction
and integrable systems, a subject continued by by many authors, but the group theoretic
explanation of the integrability of the Kowalewski top in work of [1989] is particularly
noteworthy.
The way in which the Poisson structure on P is related to that on P/G was claried in
a generalization of Poisson reduction due to Marsden and Ratiu [1986], a technique that has
also proven useful in integrable systems (see, e.g., Pedroni [1995] and Vanhaecke [1996]).
Stability Theory. Reduction theory for mechanical systems with symmetry has proven
to be a powerful tool enabling advances in stability theory (from the Arnold method to
the energy-momentum method of Simo, Lewis and Marsden [1991]) as well as in bifurcation
theory of mechanical systems, geometric phases via reconstruction the inverse of reduction
as well as uses in control theory from stabilization results to a deeper understanding of
locomotion. For a general introduction to some of these ideas and for further references, see
Marsden and Ratiu [1999].
Tangent and Cotangent Bundle Reduction. The simplest case of cotangent bundle
reduction is reduction at zero in which case one chooses P = T Q and then the reduced
space at = 0 is given by P0 = T (Q/G), the latter with the canonical symplectic form.
Another basic case is when G is Abelian. Here, (T Q)
= T (Q/G) but the latter has a
symplectic structure modied by magnetic terms; that is, by the curvature of a mechanical
connection.
The Abelian version of cotangent bundle reduction was developed by Smale [1970] and
Satzer [1977] and was generalized to the nonabelian case in Abraham and Marsden [1978].
Kummer [1981] interpreted Abraham and Marsdens results in terms of a connection, now
called the mechanical connection. The geometry of this situation was used to great eect
in, for example, Guichardet [1984], Iwai [1987, 1990], and Montgomery [1984, 1990, 1991].
Routh reduction may be viewed as the Lagrangian analogue of cotangent bundle reduction.
Tangent and cotangent bundle reduction evolved into what we now term as the bundle
picture or the gauge theory of mechanics. This point of view was rst developed by
Montgomery, Marsden and Ratiu [1984] and Montgomery [1984, 1986]. That work was
motivated and inuenced by the work of Sternberg [1977] and Weinstein [1978] on a YangMills construction that is in turn motivated by Wongs equations, that is, the equations
for a particle moving in a Yang-Mills eld. The main result of the bundle picture gives a
structure to the quotient spaces (T Q)/G and (T Q)/G when G acts by the cotangent and
tangent lifted actions. This point of view of the bundle picture will be central to the main
body of this paper.
Lagrangian versus Hamiltonian Reduction. In symplectic and Poisson reduction one
focusses on how to pass the symplectic two form and the Poisson bracket structure as well
as any associated Hamiltonian dynamics to a quotient space for the action of a symmetry
group. In Lagrangian reduction theory, which proceeds in a logically independent way, one
emphasizes how the variational structure passes to a quotient space. Of course, the two
methodologies are related by the Legendre transform, although not always in a straightforward way. We shall begin our brief summary of Lagrangian reduction theory with two of
the most important classical cases, namely Routh reduction and Euler-Poincare reduction.
Routh Reduction. Routh reduction for Lagrangian systems is classically associated with
systems having cyclic variables (this is almost synonymous with having an Abelian symmetry
group); modern accounts can be found in Arnold, Kozlov and Neishtadt [1988] and in
Marsden and Ratiu [1999], 8.9. A key feature of Routh reduction is that when one drops
the Euler-Lagrange equations to the quotient space associated with the symmetry, and when
the momentum map is constrained to a specied value (i.e., when the cyclic variables and
their velocities are eliminated using the given value of the momentum), then the resulting
equations are in Euler-Lagrange form not with respect to the Lagrangian itself, but with
respect to the Routhian. In his classical work, Routh [1877] applied these ideas to stability
theory, a precursor to the energy-momentum method for stability mentioned above. Of
course, Rouths stability method is still widely used in mechanics.
Euler-Poincar
e Reduction. Besides Routh reduction, another fundamental case of Lagrangian reduction is that of EulerPoincare reduction, which occurs for the case in which
the conguration space is a Lie group G. One thinks of this case as primarily intended for
systems governed by Euler equations, such as those of a rigid body or a uid. This case
has its origins in the work of Lagrange [1788] and Poincare [1901a]. Both of them clearly
had some idea of the reduction process. Poincare realized that both the equations of uid
mechanics and the free rigid body and heavy top equations could all be described in Lie
algebraic terms in a beautiful way.
The classical Euler-Poincare equations are as follows. Let a be coordinates for the Lie
a
be the associated structure constants. Let L : T G R
algebra g of a Lie group G and let Cbd
be a given left invariant Lagrangian and let l : g = (T G)/G R be the corresponding
reduced Lagrangian. Then the Euler-Poincar
e equations for a curve (t) g are
d l
l a d
= a Cdb
,
b
dt
where a summation is implied over repeated indices. These equations are equivalent to the
Euler-Lagrange equations for L for a curve g(t) G, where g(t)1 g(t)
resulting equations inherited on this space are the LagrangePoincare equations (or the
reduced Euler-Lagrange equations).
The Lagrange-Poincar
e equations in coordinates are given as follows. Let a be
a
coordinates for the Lie algebra g of G and let Cbd
be the associated structure constants. For
a
given local coordinates x in Q/G, let A (x) be the local coordinate expression of a chosen
a
the associated curvature, whose
principal connection on the bundle Q Q/G and let B,
l
denition we review later. Let pa denote the momentum conjugate to a ; that is, pa =
a.
Then
a d
dpb
a
Cdb
Ad x
= pa Cdb
dt
l
d l
l a
a d b
= a B
x + Cdb
A ,
x
dt x
where a summation is implied over repeated indices. We shall review the intrinsic geometry
associated with these equations later. An interesting point is that if one is willing to work
, then
only with the quotient space T Q/G rather than its global realization as T (Q/G) g
a connection is not needed and the local form of the equations is given by the Hamel
equations (see Hamel [1904]), namely the equations that are formally obtained by taking
a trivial connection:
dpb
a d
=0
pa Cdb
dt
l
d l
= 0,
x
dt x
Methods of Lagrangian reduction have proven very useful in a number of areas, as with
Hamiltonian reduction. In particular, these methods are useful in optimal control problems;
for example, Koon and Marsden [1997a] extended the falling cat theorem of Montgomery
[1990] to the case of nonholonomic systems.
Lagrange-Routh Reduction. If one imposes the constraint of constancy of the momentum map on the Lagrangian side, then Lagrange-Poincare reduction becomes LagrangeRouth reduction. It is the nonabelian analogue of the classical Routh procedure mentioned
above. This theory in the nonabelian case began with Marsden and Scheurle [1993a]. Deeper
insight into the constraints one might impose on the variations was obtained by Jalnapurkar
and Marsden [2000]. Putting this theory into an intrinsic bundle context, relaxing the constraints needed on the variations and obtaining an intrinsic version of the Lagrange-Routh
equations is the subject of Marsden, Ratiu and Scheurle [2000].
Semidirect Product Reduction. In the simplest case of a semidirect product, one has
a Lie group G that acts on a vector space V (and hence on its dual V ) and then one forms
the semidirect product S = G V , generalizing the semidirect product structure of the
Euclidean group SE(3) = SO(3) R3 .
Consider the isotropy group Ga0 for some a0 V . The semidirect product reduction
theorem states that each of the symplectic reduced spaces for the action of Ga0 on T G
is symplectically dieomorphic to a coadjoint orbit in (g V ) , the dual of the Lie algebra
of the semi-direct product. This semidirect product theory was developed in a series of
papers: Guillemin and Sternberg [1980], Ratiu [1980a, 1981, 1982a], and Marsden, Ratiu
and Weinstein [1984a, b].
This construction is used in applications where one has advected quantities (such as
the direction of gravity in the heavy top, density in compressible ow and the magnetic eld
in MHD). This theory has many other interesting applications, such as to the the dynamics
of underwater vehicles (Leonard and Marsden [1997]). This semidirect product theory was
a direct precursor to the development of symplectic reduction by stages described below.
Lagrangian analogues of the semidirect product theory, focussing on variational structures, were developed in Holm, Marsden and Ratiu [1998b] with applications to many uid
mechanical problems of interest. The point of view was to extend the EulerPoincare theory to the case of systems such as the heavy top and compressible ows in which there are
advected parameters. This methodology was applied to the interesting (and also degenerate) case of the Maxwell-Vlasov equations by Cendra, Holm, Hoyle and Marsden [1998] and
Cendra, Holm, Marsden and Ratiu [1998] showed how it ts into the general framework of
Lagrangian reduction.
Reduction by Stages and Group Extensions. The semidirect product reduction theorem can be viewed using reduction by stages: one reduces T S by the action of the semidirect product group S = G V in two stages, rst by the action of V at a point a0 and
then by the action of Ga0 . Semidirect product reduction by stages for actions of semidirect
products on general symplectic manifolds was developed and applied to underwater vehicle
dynamics in Leonard and Marsden [1997]. Motivated partly by semidirect product reduction, Marsden, Misiolek, Perlmutter and Ratiu [1998, 2000] gave a signicant generalization
of semidirect product theory in which one has a group M with a normal subgroup N M
(so M is a group extension of N ) and M acts on a symplectic manifold P . One wants to
reduce P in two stages, rst by N and then by M/N . On the Poisson level this is easy:
P/M
= (P/N )/(M/N ) but on the symplectic level it is quite subtle.
Cotangent bundle reduction by stages is especially interesting for group extensions. An
example of such a group, besides semidirect products, is the Bott-Virasoro group, where
the Gelfand-Fuchs cocycle may be interpreted as the curvature of a mechanical connection.
The work of CMR briey described above, contains a Lagrangian analogue of reduction for
group extensions and reduction by stages.
Singular Reduction. Singular reduction starts with the observation of Smale [1970] that
z P is a regular point of J i z has no continuous isotropy. Motivated by this, Arms,
Marsden and Moncrief [1981] showed that the level sets J1 (0) of an equivariant momentum
map J have quadratic singularities at points with continuous symmetry. While such a result
is easy for compact group actions on nite dimensional manifolds, the main examples of
Arms, Marsden and Moncrief [1981] were, in fact, innite dimensionalboth the phase
space and the group. Otto [1987] has shown that if G is a compact Lie group, J1 (0)/G
is an orbifold. The singular structure of J1 () is closely related to important convexity
properties of the momentum map that have been studied by Guillemin, Sternberg, Atiyah,
Kirwan, and many others starting in the early 1980s. We shall not review that literature
here.
The detailed singular stratied structure of J1 (0)/G for compact Lie groups acting on
nite dimensional manifolds was developed in Sjamaar and Lerman [1991] and extended for
proper Lie group actions to J1 (O )/G by Bates and Lerman [1997], if O is locally closed
in g . Ortega [1998] and Ortega and Ratiu [2001] redid the entire singular reduction theory
for proper Lie group actions starting with the point reduced spaces J1 ()/G and also
connected it to the more algebraic approach to reduction theory of Arms, Cushman and
Gotay [1991]. Specic examples of singular reduction and further references may be found
in Cushman and Bates [1997]. This theory is still under development.
The Method of Invariants. The method of invariants seeks to parameterize quotient
spaces by group invariant functions. It has a rich history going back to Hilberts invariant
theory begun in the late 1800s. It has been of great use in bifurcation with symmetry
(see Golubitsky, Stewart and Schaeer [1988] for instance). In mechanics, the method was
developed by Kummer, Cushman, Rod and coworkers in the 1980s. We will not attempt
to give a literature survey here, other than to refer to the papers of Kummer [1990], Kirk,
Marsden and Silber [1996], Alber, Luther, Marsden and Robbins [1998] and the book of
Cushman and Bates [1997] for more details and references.
z
q
P
y
x, y Q
L(q, q)
dt = 0,
a
10
and Bryant and Griths [1983]. While such a truly variational theory is relevant and
interesting for optimal control problems, it is not relevant for the dynamical equations
of nonholonomic mechanics for the sort of examples with rolling constraints considered
above. This has been recognized in the classical literature such as Whittaker [1907], and
the Engineering literature for a long time; see, for example, Rosenberg [1977] and references
therein.
One has to be careful reading the literature since this can cause confusion. However, this
issue of whether or not the equations of nonholonomic mechanics are variational or not was
discussed extensively and put to rest already by Korteweg [1899]. These distinctions are
also clearly discussed in Bloch and Crouch [1999]. We also refer to Lewis and Murray [1995]
for an experimental verication of the Lagrange-dAlembert principle, to Jalnapurkar [1994]
for its derivation from F = ma for systems such as rolling rigid bodies, and to Martinez,
Cortes and Leon [2000] for some comments on the Poisson nature of vakanomic systems.
Thus, anyone who doubts the validity of the Lagrange-dAlembert principle should take up
the matter with both the experimental evidence and with Mr. I. Newton.
The Roller Racer. Another example is the roller racer it is a wheeled vehicle with
two segments connected by a rotational joint, as in Figure 2.2.
y
(x, y)
11
d L
L
dt r
r
= Aa
d L
L
a
a
dt s
s
.
(2.2)
(2.3)
gives a complete description of the equations of motion of the system; this procedure may
be viewed as one way of eliminating the Lagrange multipliers. Using this notation, one nds
that = a a , where
a =
d L
L
a.
dt s a
s
(2.4)
12
is the coordinate expression of the constrained Lagrangian dened by Lc = L|D and where
b
B
=
b
Ab
Ab
Ab
a
a A
+
A
A
.
r
r
sa
sa
(2.5)
Lc
d Lc
Lc
+ Aa a
dt r
r
s
r =
L
d b (q,
r).
s b
This form of the equations isolates the eects of the constraints, and shows, in particular,
that in the case where the constraints are integrable (i.e., d a = 0), the equations of
motion are obtained by substituting the constraints into the Lagrangian and then setting
the variation of Lc to zero. However in the non-integrable case the constraints generate
extra (curvature) terms, which must be taken into account.
Some Geometry. The above coordinate results can be put into an interesting and useful
intrinsic geometric framework. The intrinsically given information is the distribution and
the Lagrangian. Assume temporarily that there is a bundle structure Q,R : Q R for our
space Q, where R is the base manifold and Q,R is a submersion and the kernel of Tq Q,R
at any point q Q is called the vertical space Vq . One can always do this locally. An
Ehresmann connection A is a vertical valued one form on Q such that
1. Aq : Tq Q Vq is a linear map and
2. A is a projection: A(vq ) = vq for all vq Vq .
Hence, Tq Q = Vq Hq where Hq = kerAq is the horizontal space at q, sometimes
denoted horq . Thus, an Ehresmann connection gives us a way to split the tangent space to
Q at each point into a horizontal and vertical part.
If the Ehresmann connection is chosen in such a way that the given constraint distribution
D is the horizontal space of the connection; that is, Hq = Dq , then in the bundle coordinates
q i = (r , sa ), the map Q,R is just projection onto the factor r and the connection A can
be represented locally by a vector valued dierential form a :
A = a
,
sa
13
shows the sense in which the curvature measures the failure of the constraint distribution
to be integrable.
In coordinates, one can evaluate the curvature B of the connection A by the following
formula:
B(X, Y ) = d a (hor X, hor Y )
,
sa
Lc = FL, B(q,
q) ,
in which q is a horizontal variation (i.e., it takes values in the horizontal space) and B is
the curvature regarded as a vertical valued two form, in addition to the constraint equations
A(q) q = 0. Here , denotes the pairing between a vector and a dual vector and
Lc
d Lc
a Lc
Lc = r ,
.
r
dt r
sa
When there is a symmetry group G present, there is a natural bundle one can work
with and put a connection on, namely the bundle Q Q/G. In the generality of the
preceding discussion, one can get away with just the distribution itself and can introduce
the corresponding Ehresmann connection locally. In fact, the bundle structure Q R is
really a red herring. The notion of curvature as a Tq Q/Dq valued form makes good sense
and is given locally by the same expressions as above. However, keeping in mind that we
eventually want to deal with symmetries and in that case there is a natural bundle, the
Ehresmann assumption is nevertheless a reasonable bridge to the more interesting case with
symmetries.
Nonholonomic Reduced Equations. Reduction procedures were applied to the Lagrange dAlembert principle in BKMM. The general form of the resulting reduced equations,
which we will also call the Lagrange-dAlembert-Poincar
e equations is
g 1 g
= A(r)r + B(r)p,
p = r T (r)r + r T (r)p + pT (r)p
M (r)
r = C(r, r)
+ N (r, r,
p) +
The rst equation describes the motion in the group variables as the ow of a left-invariant
vector eld on G determined by the internal shape r, the internal velocity r,
as well as
the nonholonomic momentum p, which is, roughly speaking, the component of momentum in the symmetry directions compatible with the constraints. The second equation is the
momentum equation. Notice that the momentum equation has terms that are quadratic
in r,
linear in r and p and quadratic in p. The coecients (r) dene a connection and
this term is called the transport part of the momentum equation. The curvature of this
connection plays an important role in stability theory. The third equation describes the
motion in the shape variables r. The term M (r) is the mass matrix of the system, C is
the Coriolis term and represent internal control forces. This framework has proven to
be useful for controllability, gait selection, and locomotion of systems like the snakeboard.
One of the goals of the present paper is to give the intrinsic geometry of the LagrangedAlembert-Poincare equations, so we will return to this topic later.
14
15
satised by the momentum associated with the symmetry is useful for understanding the
dynamics of the problem and how group motion can be generated. The nonconservation
of momentum occurs even with no forces applied (besides the forces of constraint) and is
consistent with the conservation of energy for these systems. In fact, nonconservation is
crucial to the generation of movement in a control-theoretic context.
Nonholonomic Reduction. As we have indicated already, one of the important tools of
geometric mechanics is reduction theory (either Lagrangian or Hamiltonian), which provides
a well-developed method for dealing with dynamic constraints. In this theory the dynamic
constraints and the symmetry group are used to lower the dimension of the system by
constructing an associated reduced system. Koiller [1992] was one of the rst papers in
which a systematic theory of nonholonomic reduction was begun. A Hamiltonian version of
this theory was developed in Bates and Sniatycki [1993] while a Lagrangian version of this
theory was given in BKMM. Links between these theories and further developments were
given in Koon and Marsden [1997b, c, 1998]. The Lagrangian point of view is a convenient
context for applications to control theory.
Other Work on the Geometry of Nonholonomic Systems. There have been many
works on the geometry of nonholonomic systems and reduction theory from many points of
view. We cannot survey them in a comprehensive way here, but we would like to mention the
following works: Chaplygin [1897a, 1897b, 1903, 1911, 1949, 1954], Cartan [1928], Neimark
and Fufaev [1972], Rosenberg [1977], Weber [1986], Vershik and Gershkovich [1994], Koiller
[1992], Bloch and Crouch [1992], Yang, Krishnaprasad and Dayawansa [1993], van der Schaft
and Maschke [1994], Marle [1995, 1998], Udwadia and Kalaba [1996], Cantrijn, de Le
on,
Marrero and Martin de Diego [1998], and Cantrijn, Cortes, de Le
on and Martin de Diego
[2000].
Applications of Reduction Theory. Reduction theory is important for many reasons. For example, it provides a context for understanding the theory of geometric (or
Hannay-Berry) phases (see the papers Krishnaprasad [1989], Marsden, Montgomery and
Ratiu [1990], Bloch, Krishnaprasad, Marsden and S
anchez de Alvarez [1992] and references
therein). This is especially important for understanding locomotion generation in the nonholonomic context. An additional useful object that was introduced in BKMM in this regard
is the nonholonomic connection, a nonholonomic analog of the mechanical connection. We
have already mentioned the applications to the nonholonomic energy-momentum method.
Lagrangian reduction theory has also had a signicant impact on optimal control theory.
See Vershik and Gershkovich [1994], Bloch and Crouch [1994a], Montgomery [1990, 1993],
Koon and Marsden [1997a], and references therein. For a general overview of some of the
applications and for further references to locomotion and controllability issues, see Marsden
and Ostrowski [1998], Marsden [1999], and Bloch and Crouch [1998, 1999].
Goal of the Remainder of this Paper. In this paper we study geometric aspects of
basic fundamental results in nonholonomic mechanics, with an emphasis on the role of
symmetry. In particular, we provide a geometric formulation of the Lagrange-dAlembertPoincare equations.
The Lagrange-dAlembert Principle. As in the introductory sections, let Q be a manifold and let D be a distribution on Q, that is, D is a subbundle of T Q, and let L : T Q R
16
q(t, )
=0
t1
L(q, q)dt
= 0.
t0
L(q, q)dt
:=
L (q(t, ), q(t,
)) dt.
=0 t0
t0
Using the Lagrange-dAlembert Principle, equations of motion can be derived and conveniently written using an Ehresmann connection adapted to the given system in such a
way that D coincides with the horizontal distribution of the Ehresmann connection, as was
described in the introductory sections.
Next, we shall describe the Euler-Lagrange operator and Euler-Lagrange equations as we
did in CMR. We shall begin by summarizing the denition of higher order tangent bundles
and connection-like structures dened on them.
The kth Order Tangent Bundle. First, we shall recall the denition of the kth-order
(k)
(k)
tangent bundle Q : T (k) Q Q. For q Q, elements of Tq Q are equivalence classes
of curves in Q, namely, two given curves qi (t), i = 1, 2, such that q1 (t1 ) = q2 (t2 ) = q are
(l)
(l)
equivalent, by denition, if and only if in any local chart we have q1 (t1 ) = q2 (t2 ), for
(l)
l = 1, 2, . . . , k, where q denotes the derivative of order l. The equivalence class of the
(k)
curve q(t) at q = q(t) will be denoted [q]q . The projection
(k)
(k)
(k)
Q : T (k) Q Q is given by Q [q]q
= q.
It is clear that T (0) Q = Q, T (1) Q = T Q, and that, for l < k, there is a well dened ber
bundle structure
(l,k)
(l,k)
(k)
(l)
Q : T (k) Q T (l) Q given by Q
[q]q
= [q]q .
The bundles T (k) Q for k > 1 are not vector bundles, except for k = 1. The bundle T (2) Q
and is called the second order bundle (see, for example, Marsden,
is often denoted Q,
Patrick and Shkoller [1998], Marsden and Ratiu [1999] and references therein).
It is worth noting here that in eld theory one uses the bundle R Q R, whose
sections are curves in Q (the elds of classical mechanics). Then the k-jet bundle of this
trivial bundle over the time axis may be identied with the bundle R T (k) Q R Q,
where the rst component of this map is the identity. From the point of view of jet bundles
associated to maps between two manifolds, T (k) Q coincides with the ber bundle J0k (R, Q)
formed by k-jets of curves from R to Q (based at 0 R), as dened, for example in Bourbaki
[1983].
17
(k)
(k)
T (k) f : T (k) M T (k) N given by T (k) f [q]q
= [f q]f (q) .
In particular, a group action : G Q Q can be naturally lifted to a group action
(k)
(k)
(k) : G T (k) Q T (k) Q given by g(k) [q]q
= [g q](g,q .
(k)
(k)
[q]q
(k)
(k)
= (k) g, [q]q
= g[q]q .
Moreover, if : Q Q/G is a principal bundle with structure group G, then, with the
lifted action, T (k) Q is also a principal bundle with structure group G. Let M N be the
Cartesian product of the manifolds M and N . Then, for any (m,
n
) M N there is a
(k)
(k)
(k)
natural identication T(m,
M Tn
N , which induces an identication
n) ((M N ) Tm
T (k) (M N ) T (k) M T (k) N .
Let : Q Q/G be a principal bundle and consider the lifted principal bundle structure
T (k) Q with structure group G. The quotient T (k) Q/G can be easily shown to be a ber
bundle over the base Q/G. The bundle T (2) Q/G is the one that interests us most in
this paper, because the Lagrange-Poincare operator of a reduced Lagrangian is dened
on
(k)
(k)
(2)
(k)
T Q/G. The class of the element [q]q in the quotient T Q/G will be denoted [q]q
.
G
Moreover, it can be easily shown that this bundle map induces a well dened bundle map
(k)
(k)
T (k) Q/G T (k) (Q/G) given by
[q]q
T (k) [q]q .
G
Now assume that a principal connection A is given on the principal bundle Q Q/G.
(k)
Let [x]x T (k) (Q/G) and also q Q such that (
q ) = [
q ]G = x
be given. Let x(t) be any
(k)
curve belonging to the class of [x]x . Then there is a unique horizontal lift xhq of x(t). We
(k)
We must also remark that T (k) G carries a natural Lie group structure.1 If [g]g , and
(k)
(k)
classes of curves g and h in G, we dene the product [g]g [h]h as being the class
of the curve gh. The Lie algebra Te T (k) G of T (k) G can be naturally
the point gh
(k)
[h]h are
(k)
[gh]gh at
identied, as a vector space, with (k + 1)g, which, therefore, carries a unique Lie algebra
structure such that this identication becomes a Lie algebra isomorphism. There is also a
(k)
natural identication of Te G with kg.
Also, for k = 1, 2, . . . , T (k) Q is a principal bundle with structure group T (k) G in a natural
(k)
(k)
(k)
way. More precisely, if [g]g T (k) G is the class of a curve g in G and [q]q Tq Q is the
(k)
(k)
(k)
(k)
class of a curve q in Q we let [g]g [q]q Tgq Q denote the class [gq]gq of the curve gq at
(k)
the point gq. In particular, if kg and [q]q T (k) Q are given, there is a well dened
(k)
element [q]q T (k) Q.
1 Recall that T (1) G = T G is the semidirect product G g and its Lie algebra is 2g which as a vector
space is g g and it carries the Lie algebra structure of the semidirect product g g where the second factor
is regarded as the representation space of the adjoint action. We will not need, or study, the Lie group
structure of T (k) G in this paper, although this would be interesting to do.
18
, (t)] + (t)
.
Dt
G
(3.1)
G.
Dt
Using (3.1), the second covariant derivative of [q(t), v(t)]G , is given by
D2 [q(t), v(t)]G
= [q(t), [v(t), v(t)]
+ v(t)]G .
Dt2
More generally, for each k = 1, 2, . . . , we can nd, by induction, a curve vk (t) in g, having
an expression that involves v(t) and the derivatives v (l) (t), l = 1, 2, . . . , k 1, such that
Dk1 [q(t), v(t)]G
= [q(t), vk (t)]G .
Dtk1
More precisely, we have v1 = v
we obtain
v2 (t) = v(t),
+ v(t),
etc. In addition, we shall write, by denition, v0 (t) = 0. Using the fact that v(t) =
A (q(t), q(t)),
(k)
Ak [q]q
= vk (t)
gives a well dened map Ak : T (k) Q g. Therefore
there
(k)
(k)
(k)
Ak g[q]q
= Adg Ak [q]q ,
using induction, the denition of Ak , and taking into account the formulas
Adg v k =
and Adg [v, vk ] = [Adg v, Adg vk ] .
d
Adg vk
dt
19
(k)
g(t) = e and Ak [q]q
= .
In fact, since we obviously have gq (t) = g(t) it is enough to nd g(t) such that the derivatives
g (l) (t), l = 1, 2, . . . , k, satisfy the appropriate conditions as explained above. We shall call
this unique element q and the set of all such elements will be called kg
q := g
q g
q
(k times). Moreover, it is not dicult to see that the restriction Ak : kg
q kg is a
dieomorphism, and therefore it naturally denes a unique vector space structure on kg
q
such that the restriction of Ak becomes a linear isomorphism, given by Ak ( q) = , for all
kg.
(k)
By construction we see that there is a natural identication between kg
q and Tq (G
q ).
Let us dene, for each k = 1, 2, . . . , the vector bundle k
g as being the Whitney sum of
. Dene a map
k copies of g
(k)
(k)
T (k) Q k
g by [q]q q, Ak [q]q
,
G
(k)
(l)
q, Ak [q]q
= kl=1 q, Al [q]q
.
G
Using the preceding, it can be easily deduced that, given any curve q(t) in Q such that
q(t) = q, we have, at t = t,
D(l1) [q(t), v(t)]G
(k)
q, Ak [q]q
= kl=1
.
G
Dt(l1)
t=t
The above is the essence of the proof of the following:
Proposition 3.2. The map Ak : T (k) Q/G T (k) (Q/G) Q/G k
g dened by
(k)
(k)
(k)
Ak [q]q
= T (k) G (Q) [q]q Q/G q, Ak [q]q
G
(k)
(k),h
1
A
[x]
= [x]x,q .
[
q
,
]
G
Q/G
x
20
be the action of L dened for curves q(t) in Q. Let q(t, ) be a deformation of a curve
q(t), let q(t) be the corresponding variation and assume that q(ti ) = 0 for i = 0, 1. Then
there is a unique bundle map EL(L) : T (2) Q T Q such that, for any deformation q(t, ),
keeping the endpoints xed we have
t1
dS(L)(q) q =
EL(L)(q, q,
q) q,
t0
where, as usual,
d
dS(L)(q) q =
S(L) (q(t, ))
d
=0
with
q(t, )
q(t) =
.
=0
dened by
LD(L) = iD EL(L) iD .
(2)
t0
where, as usual,
dS(L)(q) q =
t0
d
S(L) (q(t, ))
,
d
=0
for all curves q(t) and variations q(t) such that (q(t), q(t))
21
The proof of this theorem is straightforward. It consists in applying the usual integration
by parts argument as in the proof of Theorem 3.3 (see for instance CMR), for the case of
curves q(t) and variations q(t) that satisfy the constraint D. We shall therefore omit the
details.
Nonholonomic Systems with Symmetry. Now we shall assume that : Q Q/G
is a principal bundle
with structure
group G and we denote by V the vertical distribution,
that is, Vq = Tq 1 ([q]G ) , for each q Q, which is obviously an integrable distribution
whose integral manifolds are the group orbits 1 ([q]G ).
Following BKMM, let us consider the following condition:
(A1) Dimension Assumption. For each q Q, we have Tq Q = Dq + Vq .
It is easy to see that, under assumption (A1), the dimension of the space Sq = Dq Vq
does not depend on q Q, and moreover, the collection of spaces Sq , q Q, is a subbundle
of D, of V, and of T Q.
Now let us now consider the following condition
(A2) Invariance of D. The distribution D is G-invariant.
It follows from (A1) and (A2) that S is a G-invariant distribution.
The Nonholonomic Connection. It is known and easy to prove that there is always a
G-invariant metric on Q. See, for example, CMR, 6.3.
In many important physical examples there is a natural way of choosing an invariant
metric, representing, for instance, the inertia tensor of the system, see BKMM. Let us
assume condition (A1). We can then dene the principal connection form A : T Q g such
that the horizontal distribution HorA T Q satises the condition that, for each q, the space
HorA Tq Q coincides with the orthogonal complement Hq of the space Sq in Dq .
This connection is called the nonholonomic connection. For each q Q, let us
denote Uq the orthogonal complement of Sq in Vq . Then it is easy to see that U is a smooth
distribution and we have the Whitney sum decomposition
T Q = H S U.
We obviously have
D =HS
and V = S U.
Under the additional assumption (A2), all three distributions H, S, and U are Ginvariant, so we can write
T Q/G = H/G S/G U/G.
The Geometry of the Reduced Bundles. Recall from CMR (see also the paragraph
Connection-like Structures on Higher Order Tangent Bundles) that there is a
vector bundle isomorphism
,
A : T Q/G T (Q/G) g
is the adjoint bundle of the principal bundle Q, dened as follows
where g
A [q, q]
G = T (q, q)
[q, A(q, q)]
G.
22
.
and A (V/G) = g
of g
by
Dene the subbundles s and u
s = A (S/G) and u
= A (U/G),
= s u
.
respectively. Clearly, we have, g
Necessary and Sucient Condition for the Constraints to be Holonomic. We
remark that our conventions for the curvature B of the principal connection A are given
by Cartans structure equation dA(u, v) = B(u, v) + [A(u), A(v)]. Recall that we dene the
on the base Q/G by B([q]
G )(X, Y ) = [q, B(X h (q), Y h (q))]G , where X h
valued 2-form B
g
h
and Y are the horizontal lifts to Q of the vector elds X and Y on Q/G. Also recall that
A the ane connection naturally induced on the vector bundle g
by the
we denoted by
principal connection A, dened in the paragraph Connection-like Structures on Higher
i , i = 1, 2, be given invariant vector elds on Q. Let
Order
Tangent
Bundles. Let X
A [Xi ]G = Xi i , i = 1, 2. A calculation given in CMR, shows that one has the following
formula for the Lie bracket on sections of Lagrange-Poincare bundles:
A
A
[X1 1 , X2 2 ] = [X1 , X2 ]
X1 2 X2 1 B(X1 , X2 ) + [1 , 2 ],
where, by denition,
[X1 1 , X2 2 ] = A
i]
i, X
[X
.
G
A 2 s;
(ii)
X1
1 , X2 ) s.
(iii) B(X
Lagrange-Poincar
e Operators. Now we shall recall the denition and basic properties
of the Lagrange-Poincare operators introduced in CMR; we refer to this paper for the proofs
of the statements below.
Theorem 3.7. Let L : T Q R be an invariant Lagrangian on the principal bundle Q.
Choose a principal connection A on Q and identify the bundles T Q/G and T (Q/G) g
using the isomorphism A and also the bundles T (2) Q/G and T (2) (Q/G) Q/G 2
g using the
isomorphism A2 . Thus an element [q, q]
G of T Q/G can be written, equivalently, as an
. Let l : T (Q/G) g
R be the reduced Lagrangian. Then
element (x, x,
v) of T (Q/G) g
there is a unique bundle map
(t) + [
v (t), (t)] + B(x(t),
x(t)),
Dt
and x(t) = T (q(t)), satisfy
A v(t) =
with (t) = [q(t), (t)]G
EL(L)(q(t), q(t),
23
l
d
(x0 , x 0 , v0 ) v =
l(x0 , x 0 , v0 + s
v ).
v
ds s=0
To dene the derivative l/x, one needs to chose a connection on the manifold
. For any
Q/G, as we will explain next. Let (x0 , x 0 , v0 ) be a given element of T (Q/G) g
given curve x(s) on Q/G, let (x(s), v(s)) be the horizontal lift of x(s) with respect to the
A on g
such that (x(0), v(0)) = (x0 , v0 ) and let (x(s), u(s)) be the horizontal lift
connection
of x(s) with respect to the connection such that (x(0), u(0)) = (x0 , x 0 ). (Notice that in
general, (x(s), u(s)) is not the tangent vector (x(s), x(s))
is dened by
C l
d
=
l (x(s), u(s), v(s)) .
(x0 , x 0 , v0 ) (x(0), x(0))
x
ds s=0
We shall often write
C l
x
l
x ,
if no confusion is possible.
, induces a corresponding
The covariant derivative on a given vector bundle, for instance g
. More precisely, let (t) be a curve
covariant derivative on the dual bundle, in our case g
. We dene the covariant derivative of (t) in such a way that for any curve v(t) on g
in g
such that both (t) and v(t) project on the same curve x(t) on Q/G, we have
d
D(t)
D
v (t)
(t), v(t) =
, v(t) + (t),
.
dt
Dt
Dt
24
Likewise we can dene the covariant derivative in the vector bundle T (Q/G). Then we
.
obtain a covariant derivative on the vector bundle T (Q/G) g
D l
It is in the sense of this denition that terms like Dt x in the second equation (which
D l
denes the horizontal Lagrange-Poincare operator) and Dt
v
in the rst equation (which
denes the vertical Lagrange-Poincare operator) of the following theorem should be interpreted. In this case D/Dt means the covariant derivative in the bundle T (Q/G). In the
.
rst equation D/Dt is the covariant derivative in the bundle g
Theorem 3.10. Under the hypothesis of Theorem 3.7 we
The vertical Lagrange-Poincar
e operator is given
D l
Ver(LP)(l) =
(x, x,
v) + adv
Dt
v
or simply,
Ver(LP)(l) =
D l
l
(x, x,
v) + adv
(x, x,
v)
Dt
v
Hor(LP)(l) x =
(x, x,
v)
(x, x,
v) x
(x, x,
v)B(x)(
x,
x)
x
Dt x
v
or simply,
Hor(LP)(l) =
C l
D l
l
(x, x,
v)
(x, x,
v)
(x, x,
v)B(x)(
x,
).
x
Dt x
t0
L(q, q)dt
=0
t0
v(t)) dt
t0
l (x(t), x(t),
v(t)) dt = 0,
t0
25
for variations x A v of the curve x(t) v(t), where A v has the form
A v =
+ [
v , ] + B(x,
x),
Dt
v
and the horizontal Lagrange-Poincar
e equations, corresponding to horizontal
variations, hold:
C l
D l
l
.
(x, x,
v)
(x, x,
v) =
(x, x,
v), ix B(x)
x
Dt x
v
In statement (ii), can be always written = [q, ]G , and the condition (ti ) = 0 for
i = 0, 1, is equivalent to the condition (ti ) = 0 for i = 0, 1. Also, if x(t) = [q]G and
v = [q, v]G where v = A (q, q),
then variations x A v such that
A v =
D[q, ]G
+ [
v , ]
+ [q, [v, ]]G
Dt
Dt
x)
with x(ti ) = 0 for i = 0, 1, correspond exactly to horizontal variations
that A v = B(x,
q of the curve q such that q(ti ) = 0.
The Lagrange-dAlembert-Poincar
e Operator. Next we shall see how the previous
results can be generalized for systems with nonholonomic constraints.
The next lemma can be easily proven using the results on higher order tangent bundles
and connection-like structures dened on them summarized before Theorem 3.3.
Lemma 3.12. Assume (A1) and (A2). Then
(2)
A2 TD Q = T (2) (Q/G) Q/G (2
g)s ,
where, by denition, (2
g)s = s Ds. In the above direct sum, the ber of the vector bundle
Ds at the base point x
= [
q ]G , is the vector space Dsx consisting of all vectors of the form
D (t)
at t = 0 is x
, where (t) is a curve in s such that the base point of (t)
.
Dt
t=0
T (Q/G) s
is : T (Q/G) g
26
dened by
LDP(l) = is LP(l) is
(2)
The Lagrange-dAlembert-Poincar
e equation is the equation LDP(L) = 0.
Now we shall state one of our main results.
Theorem 3.14. Assume (A1) and (A2) and let L : T Q R be a G-invariant Lagrangian.
Choose a principal connection A on Q and identify the bundles D/G and T (Q/G) s using
(2)
the isomorphism A and also the bundles TD Q/G and T (2) (Q/G) Q/G (2
g)s using the isomorphism A2 . Thus, in particular, an element [q, q]
G of D/G can be written, equivalently,
R be the reduced Lagrangian.
as an element (x, x,
v) of T (Q/G) s. Let l : T (Q/G) g
Then the Lagrange-dAlembert-Poincare operator
LDP(l) : TD (Q/G) Q/G (2
g)s T (Q/G) s ,
(2)
satises the condition that, for any curve q satisfying the constraints, which means that
(q(t), q(t))
Dq(t) , for all t, and any variation q of q vanishing at the endpoints, and also
satisfying the constraints, that is, q(t) Dq(t) , for all t, the corresponding reduced curve
v), where v = [q, A(q, q)]
G , and covariant variation x A v, where
[q, q]
G = (x, x,
A v(t) =
(t) + [
v (t), (t)] + B(x(t),
x(t)),
Dt
with (t) = [q(t), (t)]G Sq(t) , for all t, and x(t) = T (q(t)), we have
EL(L)(q(t), q(t),
given the decomposition of its range space as a direct sum, naturally decomposes as
LDP(l) = Hor(LDP)(l) Ver(LDP)(l),
which denes the horizontal Lagrange-dAlembert-Poincar
e operator and the vertical Lagrange-dAlembert-Poincar
e operator. The horizontal Lagrange-dAlembertPoincar
e equation and vertical Lagrange-dAlembert-Poincar
e equation are the
equations Hor(LDP)(l) = 0 and Ver(LDP)(l) = 0.
We will omit the proof of the following theorem, which is entirely similar to the proof of
Theorem 3.10, keeping track of the conditions imposed by the constraints.
27
Theorem 3.16. Under the hypothesis of Theorem 3.14 we have the following:
The vertical Lagrange-dAlembert-Poincar
e operator is given by
D l
l
Ver(LDP)(l) =
(x, x,
v) + adv
(x, x,
v) ,
Dt
v
v
where v s and s, or, simply,
D l
l
Ver(LDP)(l) =
(x, x,
v) + adv
(x, x,
v) .
Dt
v
s
The horizontal Lagrange-dAlembert-Poincar
e operator is given by
C
l
D l
l
Hor(LDP)(l) x =
(x, x,
v)
(x, x,
v) x
(x, x,
v)B(x)(
x,
x),
x
Dt x
v
where v s, or simply,
Hor(LDP)(l) =
C l
D l
l
(x, x,
v)
(x, x,
v)
(x, x,
v)B(x)(
x,
.)
x
Dt x
Summary. The next theorem is the main result of this section and it summarizes the
previous results. It contains Theorem 3.11 as the particular case in which D = T Q.
Theorem 3.17. Let q(t) be a curve in Q such that (q(t), q(t))
L(q, q)dt
=0
t0
for variations q of q(t) such that q(ti ) = 0, for i = 0, 1, and q(t) Dq(t) for all t.
(ii) The reduced Lagrange-dAlembert principle holds: The curve x(t) v(t) satises
t1
l (x(t), x(t),
v(t)) dt = 0,
t0
for variations x A v of the curve x(t) v(t), where A v has the form
A v =
+ [
v , ] + B(x,
x),
Dt
with boundary conditions x(ti ) = 0 and (ti ) = 0, for i = 0, 1, and where (t) sx(t) .
(iii) The following vertical Lagrange-dAlembert-Poincar
e equations, corresponding
to vertical variations, hold:
D l
l
(x, x,
v) = adv
(x, x,
v)
Dt
v
v
and the horizontal Lagrange-dAlembert-Poincar
e equations, corresponding to
horizontal variations, hold:
C l
D l
l
.
(x, x,
v)
(x, x,
v) =
(x, x,
v), ix B(x)
x
Dt x
28
D[q, ]G
D
+ [
v , ]
+ [q, [v, ]]G
Dt
Dt
with (ti ) = 0 (or, equivalently, (ti ) = 0) for i = 0, 1, and (t) sx(t) correspond exactly
to vertical variations q of the curve q such that q(ti ) = 0 for i = 0, 1, and q(t) Sq(t) ,
x)
with x(ti ) = 0 for i = 0, 1, correspond
while variations x A v such that A v = B(x,
exactly to horizontal variations q of the curve q such that q(ti ) = 0.
l
(x, x,
v).
29
(4.2)
Lemma 2.3.4 of CMR gives the general formula for calculating the covariant derivative of a
, namely
given curve [q(t), (t)]G in g
D[q(t), (t)]G
.
(4.3)
= q(t), [A (q(t), q(t))
, (t)] + (t)
Dt
G
.
b (x(t)) = [x(t), e, eb (x(t))]G (x(t), eb (x(t))) in g
We apply this formula to the curve e
Note that the tangent vector to the curve q(t) (x(t), e) is (q(t), q(t))
(x(t), e, x(t),
0)
and hence A (q(t), q(t))
Ae (x(t)) x.
Using equation (4.3) we obtain
D
b = [x, e, [Ae (x) x,
eb ] + e b ]G (x, [Ae (x) x,
eb ] + e b )
e
Dt
(4.4)
From equations (4.1), (4.2), and (4.4) we obtain the momentum equation
dpb
= p, [v, eb ] + e b .
dt
(4.5)
(4.6)
(4.7)
a
Let Cbd
be the structure constants of the Lie algebra g. For the given local coordinates x on
X, let Aa (x) be the coecients of Ae (x), that is, by denition, (Ae (x) x)
a ea = Aa (x)x ea .
Then equation (4.7) becomes
a d
dpb
a
v Cdb
Ad x ,
= pa Cdb
dt
(4.8)
a d
dpb (x, x,
v)
a
= pa (x, x,
v) Cdb
v Cdb
Ad (x)x .
dt
(4.9)
This equation coincides with equation (5.3.3) of BKMM as well as with equation (3.2) in
Koon and Marsden [1997a]. Since v = Ae (x) x + v, we also obtain
dpb (x, x,
Ae (x) x + v)
a d
= pa (x, x,
Ae (x) x + v)Cdb
v .
dt
(4.10)
30
Observe that one can calculate the momentum pb by taking the derivative of l with respect
to either vb or v b . In Bloch, Krishnaprasad, Marsden and Murray [1996], the variable vb is
called b and is interpreted as the locked body angular velocity. This variable is intrinsic,
given the choice of a connection, whereas v b depends on the local trivialization. In the
special case in which the connection is trivial, i.e., Ae = 0, v = v, so we get the Poincare
equation, which is one of the equations in Hamels equations.
In the case of nonholonomic systems, v must belong to S(x, e), which is equivalent to v
s. Recall that we are assuming (A1) and, therefore, dim(S(x, e)) =: s is constant. Choose
x -basis {
the g
eb (x) | b = 1, ..., dim(G)}, for each x, in such a way that, {
eb (x) | b = 1, ..., s}
generates sx . Equation (4.5) becomes thus
a
dpb (x, x,
v)
eb
a c
a
(4.11)
v) Ccb
v Cdb
Ad (x)x +
x ,
= pa (x, x,
dt
x
where (eb /x )a is the a-component of eb /x . Note that a, c = 1, ..., dim(G), whereas
b, d = 1, ..., s. The nonholonomic momentum equation (4.4.2) of BKMM, coincides with
equation (4.5), or, equivalently, with equation (4.11), for the indices b = 1, ..., s. Observe
that this is also the local expression of the vertical Lagrange-dAlembert-Poincare equation.
If the second term of the right hand side of equation (4.11) is calculated in coordinates
explicitly, one obtains equation (7.2.2) of BKMM.
The Local Horizontal Lagrange-Poincar
e Equation. To calculate the horizontal
C
Lagrange-Poincare equation we shall rst determine xl (x, x,
v) x. By denition, we have
C l
d
,
(x, x,
v) x =
l (x + x, x,
w())
x
d
=0
x+x for each , w(0)
where w()
= v, and
w()
dw()
= x + x, e, [Ae (x + x) x, w()] +
.
D
d
Therefore we must have
dw()
d
D w()
= 0. If
(4.12)
C l
l
l
(x, x,
v) x =
(x, x,
v) x +
(x, x,
v) [Ae (x) x, v] .
x
x
v
(4.13)
As we did with the vertical Lagrange-Poincare operator, it is convenient to rewrite this
equation explicitly in coordinates and we easily obtain
a
l
d l
l
a d b
(x, x,
v)
(x, x,
v) =
(x, x,
v) B
(x, e)x + Cdb
v A (x)
x
dt x
va
(4.14)
5 The Snakeboard
31
a
We remark that in these papers the convention for the sign of the curvature B
is the
opposite to the one used in this paper.
For nonholonomic systems, v must belong to S(x, e), which is equivalent to v s. Recall
x -basis
that we are assuming (A1) and, therefore, dim(S(x, e)) =: s is constant. Choose the g
{
eb (x) | b = 1, ..., dim(G)}, for each x, in such a way that, {
eb (x) | b = 1, ..., s} generates sx .
The corresponding horizontal Lagrange-dAlembert-Poincare equation is simply equation
(4.14) with restriction on the indices a = 1, ..., dim(G), b, d = 1, ..., s.
=
v A ,
B x + Cdb
a
x
dt x
v
where, as usual, a summation is implied over repeated indices. In these equations the indices
are a, d = 1, ..., dim(G), b, c = 1, ...., s, = 1, ..., dim(Q/G).
We refer to BKMM for other coordinate representations of these equations.
The Snakeboard
We now describe the snakeboard, following BKMM in which further references can be found.
See also Koon and Marsden [1997c] for further information. Our purpose is to use it to
illustrate the formalism we have developed so far.
The snakeboard is a modied version of a skateboard in which the front and back pairs
of wheels are independently actuated; see Figure 5.1. The degree of freedom , while
simultaneously moving the wheels with the proper phase relationship enables the rider to
generate forward motion.
x, y
5 The Snakeboard
32
The momentum for the snakeboard is closely related to the angular momentum of the
system about the point P shown in Figure 5.2. See Marsden and Ostrowski [1998] for further
discussion and references.
Figure 5.2: The angular momentum about the point P plays an important role in the analysis of the
snakeboard.
Other examples with a similar structure for the momentum equation are the roller racer
(see Tsikiris [1995], Zenkov, Bloch and Marsden [1998]) and the bicycle (see the papers of
Getz and Marsden [1995] and Koon and Marsden [1997c]).
We model the snakeboard as a rigid body (the board) with two sets of independently
actuated wheels, one on each end of the board. The human rider is modeled as a momentum
wheel which sits in the middle of the board and is allowed to spin about the vertical axis.
Spinning the momentum wheel causes a counter-torque to be exerted on the board. The
conguration of the board is given by the position and orientation of the board in the plane,
the angle of the momentum wheel, and the angles of the back and front wheels. Thus the
conguration-space is Q = S 1 S 1 S 1 SE(2), and we shall consider it as a principal bundle
with structure group SE(2) acting on the left. We let (, x, y) represent the orientation and
the position of the center of the board, the angle of the momentum wheel relative to the
board, and 1 and 2 the angles of the back and front wheels, also relative to the board.
We take the distance between the center of the board and the wheels as r.
The Lagrangian for the snakeboard consists only of kinetic energy terms. We take the
simplest possible model for the various mass distributions and write the Lagrangian as
2 + 1 J1 ( + 1 )2 + 1 J2 ( + 2 )2 ,
L(q, q)
= 12 m(x 2 + y 2 ) + 12 J 2 + 12 J0 ( + )
2
2
where m is the total mass of the board, J is the inertia of the board, J0 is the inertia of the
rotor, and Ji , i = 1, 2, is the inertia corresponding to i . The Lagrangian is independent of
the conguration of the board and hence it is invariant to all possible group actions.
The rolling of the front and rear wheels of the snakeboard is modeled by using nonholonomic constraints which allow the wheels to spin about the vertical axis and roll in
the direction that they are pointing. The wheels are not allowed to slide in the sideways
direction. This gives constraint one-forms
1 (q) = sin( + 1 )dx + cos( + 1 )dy r cos 1 d,
2 (q) = sin( + 2 )dx + cos( + 2 )dy + r cos 2 d.
(5.1)
5 The Snakeboard
33
Dq = span
,
+b
+c
,
(5.2)
,
,a
1 2 x
y
(5.3)
(5.4)
(note that this is not a left-invariant basis). The intersection between the tangent space to
the group orbit and the ber of the constraint distribution is thus given by
Dq Tq (Orb(q)) = span c
+a
+b
.
(5.5)
x
y
We construct the momentum by choosing a section of D T Orb(q) regarded as a bundle
over Q. Since Dq Tq Orb(q) is one-dimensional, we choose the section to be
a
+b
+c ,
x
y
(5.6)
5 The Snakeboard
34
Third, a typical element of Q = X (S 1 C) will be written (ei , ei1 , ei2 , ei , z) and a
ei1 i 1 , ei2 i 2 , ei , z, ei i,
z),
or, sometimes,
typical element of T Q as (ei , ei1 , ei2 , ei i,
i i1
i2
i
for simplicity, (e i, e i1 , e i2 , e i, z).
Using the de Moivre formulas, the numbers
a, b, and c dened above can be conveniently written as
1 i(1 2 )
i
i(1 +2 )
i(1 2 )
a + bi = re
(5.7)
e
+
+e
e
2
and
c=
1 i(1 2 )
ei(1 2 ) .
e
2i
(5.8)
H = span
,
,
1 2
is the horizontal space of the connection A. The connection A is given by right translation
to the neutral element on T S 1 C, that is,
ei1 i 1 , ei2 i 2 , ei , z, ei i,
z)
iz
+ z).
A(ei , ei1 , ei2 , ei i,
= (i,
Recall that, for a trivial bundle Q = X G, where the group acts on the left, the map
A for the trivial connection A is always given by left translation on the group factor, that
is, A (x, x,
g, g)
= (x, x,
v), where v = g 1 g.
In our case we have
ei1 i 1 , ei2 i 2 )
(x, x)
= (ei , ei1 , ei2 , ei i,
ei z).
and v = (i,
The vector bundle isomorphism A is given by
ei1 i 1 , ei2 i 2 , ei , z, ei i,
z]
A ([ei , ei1 , ei2 , ei i,
G)
i i1 i2 i i1
i2
i
= (e , e , e , e i, e i1 , e i2 , i, e z).
generated by the
Now we shall identify the bundle s. It is the line subbundle of g
, call it (ei , ei1 , ei2 , f), which is the image under A of the section that
section of g
generates the bundle S/SE(2) dened above.
Since the left translation of the tangent vector
(ei , z, ei ic, a+ib) to the neutral element is ic, ei (a + ib) , we obtain the following section
that generates s,
(ei , ei1 , ei2 , f) = ei , ei1 , ei2 , ic, ei (a + ib) .
A typical element of T X s is therefore written as
ei1 i 1 , ei2 i 2 , f),
(ei , ei1 , ei2 , ei i,
5 The Snakeboard
35
1
2
On the other hand, since the connection A is trivial, its curvature is 0, therefore, the
horizontal Lagrange-dAlembert-Poincare equation gives the conserved quantities
l
= J0 ( + );
l
= J1 ( + 1 );
1
l
= J2 ( + 2 ).
2
(5.9)
Since the Lagrangian is the kinetic energy in this example and there is no potential energy,
the Lagrangian itself is conserved, then the previous equations immediately imply that
1 2 1
J() + m|z|
2
2
2
(5.10)
= J ;
l
= mx;
l
= my,
= J ;
dt
d l
= m
x;
dt x
d l
= m
y.
dt y
36
(5.11)
x,
Since (,
y)
satises the constraints, we have, for some real valued function (t),
= c;
x = a;
y = b.
+ a;
x
= a
+ b.
y = b
Replacing these in the vertical Lagrange-dAlembert-Poincare equation (5.11) and rearranging one obtains the equation
1 d
2
(Jc
+ ma2 + mb2 ) + (Jc2 + ma2 + mb2 ) = 0.
2 dt
Solving this equation for one gets
1
= K Jc2 + m(a2 + b2 ) 2 ,
(5.12)
where K is a constant. Using equations (5.7) and (5.8) one can easily see that does not
depend on . We remark that equation (5.12) does not give more information than the one
contained in the conservation of energy equation, namely, equation (5.10). In fact, the same
expression for can be easily obtained by replacing = c, x = a, and y = b in equation
(5.10) and solving for .
Now we can combine equation (5.12), equation = c, and equations (5.9) to obtain
1
= cK(Jc2 + ma2 + mb2 ) 2 + C
1
1 = cK(Jc2 + ma2 + mb2 ) 2 + C1
12
+ C2 ,
(5.13)
(5.14)
(5.15)
where C, C1 , and C2 are constants. Due to the very special structure of this system of
three dierential equations it is possible to reduce it to a single equation. In fact, it can
be easily deduced from these equations that 1 = 1 t + E1 and 2 = 2 t + E2 ,
where i , Ei are constants for i = 1, 2. Then we can replace these expressions for 1 and
2 in the equation (5.13) to obtain the equation = F (, t), where the function F can be
calculated; it is a simple elementary function. A more detailed study of this equation and
its application to concrete questions about the motion of the snakeboard, as well as a study
of the Hamiltonian counterpart, will be the purpose of future work.
Systems with Ane Constraints In the previous sections we studied systems with
nonholonomic constraints D, where D is a vector subbundle of T Q. Now assume that P is
a given section of T Q, that is, a vector eld. Then we have an ane subbundle P + D, that
is, at each point q Q, P (q) + Dq is an ane subspace of Tq Q.
The previous geometric theory of systems with nonholonomic constraints given by a
vector subbundle D can be generalized for the more general case of systems with ane
constraints. The basic idea is that one should replace the condition q D by the condition
q P + D, while the condition q D stays.
37
P (q(t)) + D(q(t)) for all t and, besides, for any deformation q(t, ) of q(t) such that the
corresponding variation
q(t, )
q(t) =
=0
satises q(t) Dq(t) for all t, the following condition holds
t1
L(q, q)dt
= 0.
t0
38
Geometric Phases. The development of the theory of geometric phases in the Lagrangian
context is natural to develop. As we have mentioned in the introduction, geometric phases
are central to many important questions in mechanics, such as locomotion generation. The
paper of Marsden, Ratiu and Scheurle [2000] gives results for geometric phases in the context of Routh reduction for holonomic systems. As Koon and Marsden [1997b] indicates,
geometric phases should be feasible and interesting in the nonholonomic context as well and
that the Lagrangian formalism is natural for doing this. In fact, the Lagrangian setting
provides natural connections and also a natural setting for averaging which is one of the
basic ingredients in geometric phases. The results of the present paper or future work on
Lagrange-dAlembert-Routh reduction should be useful in this regard.
Variational Integrators and Discrete Reduction. As Weinstein [1996] points out,
there is a more general context for Lagrangian mechanics that also includes discrete mechanics in the sense of Veselov [1988, 1991]. It would certainly be interesting to develop
a discrete mechanics for the Lagrangian bundle picture both in the holonomic case (as in
CMR) and in the nonholonomic case, as in the present paper.
One of the interesting developments in symplectic integration algorithms has been the
progress made in variational integrators. These integration algorithms are based on direct
discretizations of Hamiltons principle following some of the ideas of Veselov [1988]. See,
for example, Wendlandt and Marsden [1997], Marsden, Patrick and Shkoller [1998], and,
for the inclusion of damping and forcing, Kane, Marsden, Ortiz and West [2000]. There
is a discrete reduction theory for this point ov view of discrete mechanics as well that is
still under development. See Marsden, Pekarsky and Shkoller [1999], Bobenko, Lorbeer and
Suris [1998], Bobenko and Suris [1999a, 1999b] and Jalnapurkar, Leok, Marsden and West
[2000]. Of course it would be of interest to develop a discrete version of the nonholonomic
theory and to implement this numerically.
Innite Dimensional Examples. In this paper we dealt with Lagrange-dAlembert reduction theory in the context of nite dimensional manifolds. Of course, the theory formally
applies to many interesting innite dimensional examples, such as rolling elastic bodies. In
the innite dimensional context, many of the expressions that appear here as pure partial
derivatives must be written in the notation of functional derivatives (see Marsden and Ratiu
[1999] for some of the basic examples, an explanation of the functional derivative notation
and additional references to the literature).
Multisymplectic Context. Another area of much current interest is that of multisymplectic geometry. The history of this subject is very complex; modern accounts are, for
example, Marsden and Shkoller [1999] and Marsden, Patrick and Shkoller [1998]. This
theory has both a Lagrangian and a Hamiltonian view and it has allowed, for example, a
development of the Moser-Veselov theory to the context of PDEs. Reduction theory in this
context is in its infancy (see, for example, Marsden, Montgomery, Morrison and Thompson
[1986] and Castrill
on L
opez, Ratiu and Shkoller [2000]). Obviously, it would be of interest
to develop such a theory for nonholonomic systems.
Singular Lagrange-dAlembert Reduction. We mentioned the importance of singular
reduction and some of the current literature in the introduction. Almost all of the theory of
singular reduction is conned to the general symplectic category, with little attention paid
to the tangent and cotangent bundle structure, as well as to the Lagrangian side, apart
from that in Lewis [1992]. Explicit examples, even the spherical pendulum (see Lerman,
Montgomery and Sjamaar [1993]) show that this cotangent bundle structure together with
REFERENCES
39
References
Abraham, R. and J. E. Marsden [1978], Foundations of Mechanics, Addison-Wesley, Second edition.
Abraham, R., J. E. Marsden and T. S. Ratiu [1988], Manifolds, Tensor Analysis and Applications, Applied
Mathematical Sciences, 75, Springer-Verlag, New York, Second edition.
Alber, M. S., G. G. Luther, J. E. Marsden and J. M. Robbins [1998], Geometric phases, reduction and
Lie-Poisson structure for the resonant three-wave interaction, Physica D, 123, 271290.
Alber, M. S., G. G. Luther, J. E. Marsden and J. W. Robbins [1998], Geometry and control of three-wave
interactions, Fields Inst. Commun., 24, 5580.
Arms, J. M., R. H. Cushman and M. Gotay [1991], A universal reduction procedure for Hamiltonian group
actions, in The Geometry of Hamiltonian systems, T. Ratiu, ed., MSRI Series, 22, 3352, Springer-Verlag.
Arms, J. M., J. E. Marsden and V. Moncrief [1981], Symmetry and bifurcations of momentum mappings,
Comm. Math. Phys., 78, 455478.
Arms, J. M., J. E. Marsden and V. Moncrief [1982], The structure of the space solutions of Einsteins
equations: II Several Killings elds and the EinsteinYangMills equations, Ann. of Phys., 144, 81106.
Arnold, V. I. [1966], Sur la geom
etrie dierentielle des groupes de Lie de dimenson innie et ses applications
a lhydrodynamique des uidsparfaits, Ann. Inst. Fourier, Grenoble, 16, 319361.
`
Arnold, V. I., V. V. Kozlov and A. I. Neishtadt [1988], Mathematical aspects of classical and celestial
mechanics, in Dynamical Systems III , V. I. Arnold, ed., Springer-Verlag.
Arnold, V. I. [1989], Mathematical Methods of Classical Mechanics, Graduate Texts in Math., 60, SpringerVerlag, Second edition.
Bates, L. [1998], Examples of singular nonholonomic reduction, Rep. Math. Phys., 42, 231247.
Bates, L. and E. Lerman [1997], Proper group actions and symplectic stratied spaces, Pacic J. Math.,
181, 201229.
Bates, L. and J. Sniatycki [1993], Nonholonomic reduction, Reports on Math. Phys., 32, 99115.
Blaom, A.D. [2000], Reconstruction phases via Poisson reduction, Di. Geom. and Appl., 12, 231252.
REFERENCES
40
Bloch, A. M. and P. Crouch [1992], On the dynamics and control of nonholonomic systems on Riemannian
Manifolds, in Proceedings of NOLCOS 92 , 368372.
Bloch, A. M. and P. Crouch [1994a], Nonholonomic and vakonomic control systems on Riemannian manifolds,
Fields Inst. Commun., 1, 2552.
Bloch, A. M. and P. E. Crouch [1994], Reduction of EulerLagrange problems for constrained variational
problems and relation with optimal control problems, Proc. CDC , 33, 25842590.
Bloch, A. M. and P. E. Crouch [1998], Newtons law and integrability of nonholonomic systems, SIAM J.
Control Optim., 6, 20202039
Bloch, A. M. and P. E. Crouch [1999], Optimal control, optimization, and analytical mechanics, in Mathematical Control Theory, 268321, J. Ballieul, Ed., Springer, New York.
Bloch, A. M., P. S. Krishnaprasad, J. E. Marsden and R. Murray [1996], Nonholonomic mechanical systems
with symmetry, Arch. Rational Mech. Anal., 136, 2199.
Bloch, A. M., P. S. Krishnaprasad, J. E. Marsden and T. S. Ratiu [1994], Dissipation induced instabilities,
Ann. Inst. H. Poincar
e, Analyse Nonline
eaire, 11, 3790.
Bloch, A. M., P. S. Krishnaprasad, J. E. Marsden and T. S. Ratiu [1996], The EulerPoincare equations
and double bracket dissipation, Comm. Math. Phys., 175, 142.
Bloch, A. M., P. S. Krishnaprasad, J. E. Marsden and G. S
anchez de Alvarez [1992], Stabilization of rigid
body dynamics by internal and external torques, Automatica, 28, 745756.
Bloch, A. M., N. Leonard and J. E. Marsden [1998], Matching and stabilization by the method of controlled
Lagrangians, Proc. CDC , 37, 14461451.
Bloch, A. M., N. Leonard and J. E. Marsden [1999], Controlled Lagrangians and the stabilization of mechanical systems I: The First Matching Theorem, IEEE Trans. Automat. Control; (to appear).
Bloch, A. M., M. Reyhanoglu and H. McClamroch [1992], Control and stabilization of nonholonomic systems,
IEEE Trans. Aut. Control, 37, 17461757.
Bobenko, A. I., B. Lorbeer and Yu. B. Suris [1998], Integrable discretizations of the Euler top, J. Math.
Phys., 39, 66686683.
Bobenko, A. I., A. G. Reyman and M. A. Semenov-Tian-Shansky [1989], The Kowalewski Top 99 years
later: A Lax pair, generalizations and explicit solutions, Comm. Math. Phys., 122, 321354.
Bobenko, A. I. and Y. B. Suris [1999a], Discrete time Lagrangian mechanics on Lie groups, with an application to the Lagrange top, Commun. Math. Phys., 204, 147188.
Bobenko, A. I. and Y. B. Suris [1999b], Discrete Lagrangian reduction, discrete Euler-Poincar equations,
and semidirect products, Lett. Math. Phys., 49, 7993.
Bondi, H. [1986], The rigid body dynamics of unidirectional spin, Proc. Roy. Soc. Lon., 405, 265274.
Bourbaki, N. [1983], Vari
et
es dierentielles et analytiqes, Fascicule de resultats, Diusion C. C. L. S., Paris.
Brockett, R. W. and L. Dai [1992], Nonholonomic kinematics and the role of elliptic functions in constructive
controllability, in Nonholonomic Motion Planning, Z. Li and J. F. Canny, eds., 122, Kluwer.
Bretherton, F. P. [1970], A note on Hamiltons principle for perfect uids, J. Fluid Mech., 44, 1931.
Bryant, R. and P. Griths [1983], Reduction for constrained variational problems and 2 /2 ds, Am. J. of
Math., 108, 525570.
Cannas da Silva, A. and Weinstein, A. [1999], Geometric Models for Noncommutative Alebras, Berkeley
Mathematics Lecture Notes, 10, Amer. Math. Soc.
Cantrijn, F., M. de Le
on and D. Martin de Diego [1999], On almost-Poisson structures in nonholonomic
mechanics, Nonlinearity, 12, 721737.
Cantrijn, F., Cortes, J., de Le
on, M. and Martin de Diego, D. [2000], On the geometry of generalized
Chaplygin systems; (preprint).
REFERENCES
41
Cantrijn, F., de Le
on, M., Marrero, J.C. and Martin de Diego, D. [1998], Reduction of nonholonoimc
mechanical systems with symmetries, Rep. Math. Phys., 42, 2545.
Cardin, F. and M. Favretti [1996], On nonholonomic and vakonomic dynamics of mechanical systems with
nonintegrable constraints, J. Geom. and Phys., 18, 295325.
Cartan, E. [1928], Sur la representation geom
etrique des syst`
emes mat`eriels non holonomes, in Atti. Cong.
Int. Matem., 4, 253261.
Castrill
on L
opez, M., Ratiu, T. S. and Shkoller, S. [2000], Reduction in principal ber bundles: Covariant
Euler-Poincar
e equations, Proc. Amer. Math. Soc., 128, 2155-2164 (see http://www.ams.org/proc/).
Cendra, H., D. D. Holm, M. J. W. Hoyle and J. E. Marsden [1998], The MaxwellVlasov equations in
EulerPoincar
e form, J. Math. Phys., 39, 31383157.
Cendra, H., D. D. Holm, J. E. Marsden and T. S. Ratiu [1998], Lagrangian Reduction, the EulerPoincare
equations and semidirect products, Amer. Math. Soc. Transl., 186, 125.
Cendra, H., A. Ibort and J. E. Marsden [1987], Variational principal ber bundles: a geometric theory of
Clebsch potentials and Lin constraints, J. Geom. Phys., 4, 183206.
Cendra, H. and J. E. Marsden [1987], Lin constraints, Clebsch potentials and variational principles, Physica
D, 27, 6389.
Cendra, H., J. E. Marsden and T. S. Ratiu [2000], Lagrangian reduction by stages, Mem. Amer. Math. Soc.;
(to appear). See http://www.cds.caltech.edu/~marsden/.
Chaplygin, S. A. [1897a], On the motion of a heavy body of revolution on a horizontal plane, in Physics
Section of the Imperial Society of Friends of Physics, Anthropology and Ethnographics, Moscow, 9, 1016;
Reproduced in Chaplygin (1954), pp. 413425.
Chaplygin, S. A. [1897b], On some feasible generalization of the theorem of area, with an application to the
problem of rolling spheres, Mat. Sbornik , XX, 132; Reproduced in Chaplygin (1954), pp. 434454.
Chaplygin, S. A. [1903], On a rolling sphere on a horizontal plane, Mat. Sbornik , XXIV, 139168; Reproduced in Chaplygin (1949), pp. 7299, and Chaplygin (1954), pp. 455471.
Chaplygin, S. A. [1911], On the theory of the motion of nonholonomic systems. Theorem on the reducing
factor, Mat. Sbornik , XXVIII, 303314; Reproduced in Chaplygin (1949), pp. 2838 and Chaplygin
(1954), pp. 426433.
Chaplygin, S. A. [1949], Analysis of the Dynamics of Nonholonomic Systems, Classical Natural Sciences,
Moscow.
Chaplygin, S. A. [1954], Selected Works on Mechanics and Mathematics, State Publ. House, TechnicalTheoretical Literature, Moscow.
Chetayev, N. G. [1941], On the equations of Poincare, J. Appl. Math. Mech., 5, 253262.
Chetayev, N. G. [1961], The Stability of Motion, Pergamon.
Chetayev, N. G. [1989], Theoretical Mechanics, Springer-Verlag.
Courant, T. [1990], Dirac manifolds, Trans. Amer. Math. Soc., 319, 631661.
Crabtree, H. [1909], Spinning Tops and Gyroscopic Motion, Chelsea.
Cushman, R. and L. Bates [1997], Global Aspects of Classical Integrable Systems, Birkh
auser, Boston.
Cushman, R., J. Hermans and D. Kemppainen [1995], The rolling disc, in Nonlinear dynamical systems and
chaos (Groningen, 1995), Progr. Nonlinear Dierential Equations Appl., 19, 2160, Birkh
auser, Basel.
REFERENCES
42
Ge, Z. and J. E. Marsden [1988], LiePoisson integrators and LiePoisson HamiltonJacobi theory, Phys.
Lett. A, 133, 134139.
Getz, N. H. and J. E. Marsden [1994], Symmetry and dynamics of the rolling disk, CPAM Preprint, 630.
Getz, N. H. and J. E. Marsden [1995], Control for an autonomous bicycle, in International Conference on
Robotics and Automation, IEEE .
Golubitsky, M., J. E. Marsden, I. Stewart and M. Dellnitz [1995], The constrained Liapunov Schmidt
procedure and periodic orbits, Fields Inst. Commun., 4, 81127.
Golubitsky, M. and D. Schaeer [1985], Singularities and Groups in Bifurcation Theory. Vol. 1, Applied
Mathematical Sciences, 69, Springer-Verlag.
Golubitsky, M. and I. Stewart [1987], Generic bifurcation of Hamiltonian systems with symmetry, Physica
D, 24, 391405.
Golubitsky, M., I. Stewart and D. Schaeer [1988], Singularities and Groups in Bifurcation Theory. Vol. 2,
Applied Mathematical Sciences, 69, Springer-Verlag.
Guichardet, A. [1984], On rotation and vibration motions of molecules, Ann. Inst. H. Poincar
e , 40, 329342.
Guillemin, V., E. Lerman and S. Sternberg [1996], Symplectic Fibrations and Multiplicity Diagrams, Cambridge University Press.
Guillemin, V. and E. Prato [1990], Heckman, Kostant and Steinberg formulas for symplectic manifolds, Adv.
in Math., 82, 160179.
Guillemin, V. and S. Sternberg [1978], On the equations of motions of a classic particle in a YangMills
eld and the principle of general covariance, Hadronic J., 1, 132.
Guillemin, V. and S. Sternberg [1980], The moment map and collective motion, Ann. of Phys., 1278,
220253.
Guillemin, V. and S. Sternberg [1982], Convexity properties of the moment map, Invent. Math., 67, 491513;
also 77, pp. 533546.
Guillemin, V. and S. Sternberg [1984], Symplectic Techniques in Physics, Cambridge University Press.
Hamel, G. [1904], Die LagrangeEulerschen Gleichungen der Mechanik, Z. f
ur Mathematik u. Physik , 50,
157.
Hermans, J. [1995], A symmetric sphere rolling on a surface, Nonlinearity, 8, 123.
Hernandez, A. [2000], Singular Reduction and Blowing Up, PhD Thesis, Caltech.
Holm, D. D. and B. A. Kupershmidt [1983], Poisson brackets and Clebsch representations for magnetohydrodynamics, multiuid plasmas and elasticity, Physica D, 6, 347363.
Holm, D. D. and B. A. Kupershmidt [1983], Poisson structures and superconductors, Lett. A, 93, 177181.
Holmes, P. J. and J. E. Marsden [1983], Horseshoes and Arnold diusion for Hamiltonian systems on Lie
groups, Indiana Univ. Math. J., 32, 273310.
Holm, D. D., J. E. Marsden and T. S. Ratiu [1986a], The Hamiltonian structure of continuum mechanics
in material, spatial and convective representations, in S
eminaire de math
ematiques sup
erieures, 100,
11122, Les Presses de LUniv. de Montreal.
Holm, D. D., J. E. Marsden and T. S. Ratiu [1986b], Nonlinear stability of the KelvinStuart cats eyes
ow, SIAM, Lects. in Appl. Math., 23, 171186.
Holm, D. D., J. E. Marsden and T. S. Ratiu [1998], EulerPoincare models of ideal uids with nonlinear
dispersion, Phys. Rev. Lett., 349, 41734177.
Holm, D. D., J. E. Marsden and T. S. Ratiu [1998], The EulerPoincare equations and semidirect products
with applications to continuum theories, Adv. in Math., 137, 18.
REFERENCES
43
Holm, D. D., J. E. Marsden and T. Ratiu [1999], The EulerPoincare equations in geophysical uid dynamics,
in Proceedings of the Isaac Newton Institute Programme on the Mathematics of Atmospheric and Ocean
Dynamics, Cambridge University Press; (to appear).
Holm, D. D., J. E. Marsden, T. S. Ratiu and A. Weinstein [1985], Nonlinear stability of uid and plasma
equilibria, Phys. Rep., 123, 16.
Ibort, A., M. De Leon, J. C. Marrero and D. Martin De Diego [1999], Dirac brackets in constrained dynamics,
Fortschr. Phys., 30, n 8, 459492.
Iwai, T. [1982], The symmetry group of the harmonic oscillator and its reduction, J. Math. Phys., 23,
10881092.
Iwai, T. [1985], On reduction of two degrees of freedom Hamiltonian systems by an S 1 action and SO(1, 2)
as a dynamical group, J. Math. Phys., 26, 885893.
Iwai, T. [1987], A geometric setting for classical molecular dynamics, Ann. Inst. Henri Poincar
e, Phys. Th.,
47, 199219.
Iwai, T. [1990], On the Guichardet/Berry connection, Phys. Lett. A, 149, 341344.
Jalnapurkar, S. M., M. Leok, J. E. Marsden and M. West [2000], Discrete Routh reduction; (preprint).
Jalnapurkar, S.M. [1994], Modeling of Constrained Systems; http://www.cds.caltech.edu/~smj/.
Jalnapurkar, S. M. and J. E. Marsden [1999], Stabilization of Relative Equilibria II, Regul. Chaotic Dyn.,
3, 161179.
Jalnapurkar, S. M. and J. E. Marsden [2000], Reduction of Hamiltons variational principle, Dynam. Stability
Systems; (to appear).
Jurdjevic, V. [1993], The geometry of the plate-ball problem, Arch. Rational Mech. Anal., 124, 305328.
Kazhdan, D., B. Kostant and S. Sternberg [1978], Hamiltonian group actions and dynamical systems of
Calogero type, Comm. Pure Appl. Math., 31, 481508.
Kane, C, J. E. Marsden, M. Ortiz and M. West [2000], Variational Integrators and the Newmark Algorithm
for Conservative and Dissipative Mechanical Systems, Int. J. Num. Math. Eng., (to appear).
Karapetyan, A. V. [1994], On the specic character of the application of Rouths theory to systems with
dierential constraints, J. Appl. Math. Mech., 58, 387392 and J. Appl. Math. Mech. 51 (1987), 431436.
Karapetyan, A. V. and V. V. Rumyantsev [1990], Stability of conservative and dissipative systems, in Applied
Mechanics: Soviet Reviews, G. K. Mikhailov and V. Z. Parton, eds., 1, Hemisphere, NY.
Kelly, S. D. and R. M. Murray [1995], Geometric phases and robotic locomotion, Journal of Robotic Systems;
(to appear).
Kirillov, A. A. [1962], Unitary representations of nilpotent Lie groups, Russian Math. Surveys, 17, 53104.
Kirillov, A. A. [1976], Elements of the Theory of Representations, Grundlehren der math. Wiss., 220,
Springer-Verlag.
Kirillov, A. A. [1976], Local Lie Algebras, Russian Math. Surveys, 31, 5575.
Kirk, V., J. E. Marsden and M. Silber [1996], Branches of stable three-tori using Hamiltonian methods in
Hopf bifurcation on a rhombic lattice, Dyn. and Stab. of Systems, 11, 267302.
Koiller, J. [1992], Reduction of some classical nonholonomic systems with symmetry, Arch. Rational Mech.
Anal., 118, 113148.
Kozlov, V. V. and N. N. Kolesnikov [1978], On theorems of dynamics, PMM , 42, 2833.
Koon, W. S. and J. E. Marsden [1997a], Optimal control for holonomic and nonholonomic mechanical
systems with symmetry and Lagrangian reduction, SIAM J. Control and Optim., 35, 901929.
Koon, W. S. and J. E. Marsden [1997b], The geometric structure of nonholonomic mechanics, Proc. CDC ,
36, 48564862.
REFERENCES
44
Koon, W. S. and J. E. Marsden [1997c], The Hamiltonian and Lagrangian approaches to the dynamics of
nonholonomic systems, Rep. Math. Phys., 40, 2162.
Koon, W. S. and J. E. Marsden [1998], The Poisson reduction of nonholonomic mechanical systems, Reports
on Math. Phys., 42, 101134.
Kobayashi, S. and K. Nomizu [1963], Foundations of Dierential Geometry, Wiley.
Korteweg, D. [1899], Ueber eine ziemlich verbreitete unrichtige Behandlungsweise eines Problemes der rollenden Bewegung und insbesondere u
ber kleine rollende Schwingungen um eine Gleichgewichtslage, Nieuw
Archiefvoor Wiskunde., 4, 130155.
Kouranbaeva, S. [1999], The CamassaHolm equation as a geodesic ow on the dieomorphism group, J.
Math. Phys., 40, 857868.
Krishnaprasad, P. S. [1989], Eulerian many-body problems, Contemp. Math., 97, 187208.
Krishnaprasad, P. S. [1990], Geometric phases and optimal reconguration for multibody systems, Proc.
Am. Control Conf., 24402444.
Kummer, M. [1981], On the construction of the reduced phase space of a Hamiltonian system with symmetry,
Indiana Univ. Math. J., 30, 281291.
Kummer, M. [1990], On resonant classical Hamiltonians with n frequencies, J. Di. Eqns., 83, 220243.
Kupershmidt, B. A. and T. Ratiu [1983], Canonical maps between semidirect products with applications to
elasticity and superuids, Comm. Math. Phys., 90, 235250.
Lagrange, J. L. [1788], M
ecanique Analytique, Chez la Veuve Desaint.
Le, H. and D. G. Kendall [1993], The Riemannian structure of Euclidean shape spaces: a novel envoronment
for statistics, Ann. of Statistics, 21, 12251271.
Leonard, N. E. and J. E. Marsden [1997], Stability and drift of underwater vehicle dynamics: mechanical
systems with rigid motion symmetry, Physica D, 105, 130162.
Lerman, E., R. Montgomery and R. Sjamaar [1993], Examples of singular reduction, in Symplectic Geometry,
London Math. Soc. Lecture Note Ser., 192, 127155, Cambridge Univ. Press, Cambridge.
Leonard, N. E. [1997], Stability of a bottom-heavy underwater vehicle, Automatica J. IFAC , 33, 331346.
Lewis, A. D. The geometry of the Gibbs-Appell equations and Gauss principle of least constraint. Rep.
Math. Phys. 38, 1128.
Lewis, A. D. [2000a] Towards F = ma in a general setting for Lagrangian mechanics (To appear in Annales
Henri Poincar
e).
Lewis, A. D. and R. M. Murray [1995], Variational principles for constrained systems: theory and experiment,
Internat. J. Non-Linear Mech., 30, 793815
Lewis, D. [1992], Lagrangian block diagonalization, Dyn. Di. Eqns, 4, 142.
Lewis, D. [1992], Bifurcation of liquid drops, Nonlinearity, 6, 491522.
Lie, S. [1890], Theorie der Transformationsgruppen, Zweiter Abschnitt, Teubner, Leipzig.
Libermann, P. and C. M. Marle [1987], Symplectic Geometry and Analytical Mechanics, Kluwer Academic
Publishers.
Littlejohn, R. and M. Reinch [1997], Gauge elds in the separation of rotations and internal motions in the
n-body problem, Rev. Mod. Phys., 69, 213275.
Marle, C.-M. [1995], Reduction of constrained mechanical systems and stability of relative equilibria, Comm.
Math. Phys., 174, 295318.
Marle, C.-M. [1998], Various approaches to conservative and nonconservative nonholonomic systems, Rep.
Math. Phys., 42, 211229
REFERENCES
45
Marsden, J. E. [1992], Lectures on Mechanics, London Math. Soc. Lecture Note Ser., 174, Cambridge
University Press.
Marsden, J. E. [1999], Park City Lectures on Mechanics, Dynamics and Symmetry, in Symplectic Geometry
and Topology, Y. Eliashberg and L. Traynor, eds., IAS/Park City Math. Ser., 7, 335430, Amer. Math.
Soc., Providence, RI.
Marsden, J., G. Misiolek, M. Perlmutter and T. S. Ratiu [1998], Symplectic reduction for semidirect products
and central extensions, Di. Geom. and its Appl., 9, 173212.
Marsden, J. E., G. Misiolek, M. Perlmutter and T. S. Ratiu [2000], Reduction by stages and group extensions,
Preprint.
Marsden, J. E., R. Montgomery, P. J. Morrison and W. B. Thompson [1986], Covariant Poisson brackets
for classical elds, Annals of Physics, 169, 2948.
Marsden, J. E., R. Montgomery and T. S. Ratiu [1990], Reduction, Symmetry and Phases in Mechanics,
Memoirs Amer. Math. Soc., 436.
Marsden, J. E. and J. Ostrowski [1998], Symmetries in motion: Geometric foundations of motion control,
Nonlinear Sci. Today; http://link.springer-ny.com.
Marsden, J. E., G. W. Patrick and W. F. Shadwick (Eds.) [1996], Integration Algorithms and Classical
Mechanics, Fields Inst. Commun., 10, Am. Math. Soc.
Marsden, J. E., G. W. Patrick and S. Shkoller [1998], Mulltisymplectic Geometry, Variational Integrators
and Nonlinear PDEs, Comm. Math. Phys., 199, 351395.
Marsden, J. E., S. Pekarsky and S. Shkoller [1999], Discrete EulerPoincare and LiePoisson equations,
Nonlinearity, 12, 16471662.
Marsden, J. E. and M. Perlmutter [2000], The Orbit Bundle Picture of Cotangent Bundle Reduction, C. R.
Math. Rep. Acad. Sci. Canada, 22, 3354.
Marsden, J. E. and T. S. Ratiu [1986], Reduction of Poisson Manifolds, Lett. in Math. Phys., 11, 161170.
Marsden, J. E. and T. S. Ratiu [1999], Introduction to Mechanics and Symmetry, Texts in Applied Mathematics, 17, Springer-Verlag, 1994; Second Edition, 1999.
Marsden, J. E., T. S. Ratiu and J. Scheurle [2000], Reduction theory and the Lagrange-Routh equations, J.
Math. Phys., 41, 33793429.
Marsden, J. E., T. Ratiu and S. Shkoller [1999], The geometry and analysis of the averaged Euler equations
and a new dieomorphism group, Geom. Funct. Anal., (to appear).
Marsden, J. E., T. S. Ratiu and A. Weinstein [1984], Semi-direct products and reduction in mechanics,
Trans. Amer. Math. Soc., 281, 147177.
Marsden, J. E., T. S. Ratiu and A. Weinstein [1984], Reduction and Hamiltonian structures on duals of
semidirect product Lie Algebras, Contemp. Math., Am. Math. Soc., 28, 55100.
Marsden, J. E. and J. Scheurle [1993a], Lagrangian reduction and the double spherical pendulum, ZAMP ,
44, 1743.
Marsden, J. E. and J. Scheurle [1993b], The reduced EulerLagrange equations, Fields Inst. Commun., 1,
139164.
Marsden, J. E. and S. Shkoller [1999], Multisymplectic geometry, covariant Hamiltonians and water waves,
Math. Proc. Camb. Phil. Soc., 125, 553575.
Marsden, J. E. and A. Weinstein [1974], Reduction of symplectic manifolds with symmetry, Rep. Math.
Phys., 5, 121130.
Marsden, J. E. and A. Weinstein [1982], The Hamiltonian structure of the MaxwellVlasov equations,
Physica D, 4, 394406.
Marsden, J. E. and A. Weinstein [1983], Coadjoint orbits, vortices and Clebsch variables for incompressible
uids, Physica D, 7, 305323.
REFERENCES
46
Marsden, J. E., A. Weinstein, T. S. Ratiu, R. Schmid and R. G. Spencer [1982], Hamiltonian systems
with symmetry, coadjoint orbits and plasma physics, in Proc. IUTAM-IS1MM Symposium on Modern
Developments in Analytical Mechanics (Torino, 1982), 117, 289340, Atti della Acad. della Sc. di Torino.
Martin, J. L. [1959], Generalized classical dynamics and the classical analogue of a Fermi oscillation, Proc.
Roy. Soc. A, 251, 536.
Martinez, S., J. Cortes and M. de Le
on [2000], The geometrical theory of constraints applied to the dynamics
of vakonomic mechanical systems: the vakonomic bracket, J. Math. Phys., 41, 20902120.
Meyer, K. R. [1973], Symmetries and integrals in mechanics, in Dynamical Systems, M. Peixoto, ed., 259
273, Academic Press.
Montgomery, R., J. E. Marsden and T. S. Ratiu [1984], Gauged LiePoisson structures, Contemp. Math.,
Amer. Math. Soc., 28, 101114.
Montgomery, R. [1984], Canonical formulations of a particle in a YangMills eld, Lett. Math. Phys., 8,
5967.
Montgomery, R. [1986], The Bundle Picture in Mechanics, Ph.D. Thesis, University of California Berkeley.
Montgomery, R. [1988], The connection whose holonomy is the classical adiabatic angles of Hannay and
Berry and its generalization to the non-integrable case, Comm. Math. Phys., 120, 269294.
Montgomery, R. [1990], Isoholonomic problems and some applications, Comm. Math Phys., 128, 565592.
Montgomery, R. [1991], Optimal control of deformable bodies and its relation to gauge theory, in The
Geometry of Hamiltonian Systems, T. Ratiu, ed., Springer-Verlag.
Montgomery, R. [1993], Gauge theory of the falling cat, Fields Inst. Commun., 1, 193218.
Moser, J. and A. P. Veselov [1991], Discrete versions of some classical integrable systems and factorization
of matrix polynomials, Comm. Math. Phys., 139, 217243.
Murray, R. M. and S. S. Sastry [1993], Nonholonomic motion planning: steering using sinusoids, IEEE
Trans. on Automatic Control, 38, 700716.
Nambu, Y. [1973], Generalized Hamiltonian dynamics, Phys. Rev. D, 7, 24052412.
Neimark, Ju. I. and N. A. Fufaev [1972], Dynamics of Nonholonomic Systems, Translations of Mathematical
Monographs, 33, Amer. Math. Soc., Providence, RI.
Newcomb, W. A. [1962], Lagrangian and Hamiltonian methods in magnetohydrodynamics, Nuc. Fusion,
Suppl. part 2, 451463.
OReilly, O. M. [1996], The dynamics of rolling disks and sliding disks, Nonlinear Dynamics, 10, 287305.
Ortega, J.-P [1998], Symmetry, Reduction, and Stability in Hamiltonian Systems, Thesis, UC Santa Cruz.
Ortega, J.-P. and T. S. Ratiu [1997], Persistence and smoothness of critical relative elements in Hamiltonian
systems with symmetry, C. R. Acad. Sci. Paris S
er. I Math., 325, 11071111.
Ortega, J.-P. and T. S. Ratiu [2001], Hamiltonian Singular Reduction, Progress in Math., Birkh
auser, (to
appear).
Ostrowski, J. [1998], Reduced equations for nonholonomic mechanical systems with dissipative forces, Rep.
Math. Phys., 42, 185209
Ostrowski, J., J. W. Burdick, A. D. Lewis and R. M. Murray [1995], The mechanics of undulatory locomotion:
The mixed kinematic and dynamic case, in IEEE Intern. Conf. on Robotics and Automation, 19451951.
Ostrowski, J., J. P. Desai and V. Kumar [1996], Optimal gait selection for nonholonomic locomotion systems,
in IEEE Conf. on Robotics and Automation; available from http://www.cis.upenn.edu/~jpo/papers.
html.
Otto, M. [1987], A reduction scheme for phase spaces with almost K
ahler symmetry. Regularity results for
momentum level sets, J. Geom. Phys., 4, 101118.
Pauli, W. [1953], On the Hamiltonian structure of non-local eld theories, Il Nuovo Cimento, X, 648667.
REFERENCES
47
REFERENCES
48
van der Schaft, A. J. and B. M. Maschke [1994], On the Hamiltonian formulation of nonholonomic mechanical
systems, Rep. on Math. Phys., 34, 225233.
Vershik, A. M. and Faddeev [1981], Lagrangian mechanics in invariant form, Sel. Math. Sov., 1, 339350.
Vershik, A. M. and V. Ya Gershkovich [1994], Non-holonomic Riemannian manifolds, in Dynamical Systems
7 , Encyclopaedia of Mathematics, 16, Springer.
Veselov, A. P. [1988], Integrable discrete-time systems and dierence operators, Funct. An. and Appl., 22,
8394.
Veselov, A. P. [1991], Integrable Lagrangian correspondences and the factorization of matrix polynomials,
Funct. An. and Appl., 25, 112123.