Professional Documents
Culture Documents
c
J
o
u
r
n
a
l
o
f
M
a
t
h
e
m
a
t
i
c
s
2
0
0
0
V
o
l
.
1
9
2
,
N
o
.
2
Pacic
Journal of
Mathematics
Volume 192 No. 2 February 2000
PACIFIC JOURNAL OF MATHEMATICS
http://www.pjmath.org
Founded in 1951 by
E. F. Beckenbach (19061982) F. Wolf (19041989)
EDITORS
Vyjayanthi Chari
Department of Mathematics
University of California
Riverside, CA 92521-0135
chari@math.ucr.edu
Robert Finn
Department of Mathematics
Stanford University
Stanford, CA 94305-2125
nn@math.stanford.edu
Kefeng Liu
Department of Mathematics
University of California
Los Angeles, CA 90095-1555
liu@math.ucla.edu
V. S. Varadarajan (Managing Editor)
Department of Mathematics
University of California
Los Angeles, CA 90095-1555
pacic@math.ucla.edu
Darren Long
Department of Mathematics
University of California
Santa Barbara, CA 93106-3080
long@math.ucsb.edu
Jiang-Hua Lu
Department of Mathematics
The University of Hong Kong
Pokfulam Rd., Hong Kong
jhlu@maths.hku.hk
Sorin Popa
Department of Mathematics
University of California
Los Angeles, CA 90095-1555
popa@math.ucla.edu
Sorin Popa
Department of Mathematics
University of California
Los Angeles, CA 90095-1555
popa@math.ucla.edu
Jie Qing
Department of Mathematics
University of California
Santa Cruz, CA 95064
qing@cats.ucsc.edu
Jonathan Rogawski
Department of Mathematics
University of California
Los Angeles, CA 90095-1555
jonr@math.ucla.edu
PRODUCTION
pacic@math.berkeley.edu
Paulo Ney de Souza, Production Manager Silvio Levy, Senior Production Editor Nicholas Jackson, Production Editor
SUPPORTING INSTITUTIONS
ACADEMIA SINICA, TAIPEI
CALIFORNIA INST. OF TECHNOLOGY
CHINESE UNIV. OF HONG KONG
INST. DE MATEMTICA PURA E APLICADA
KEIO UNIVERSITY
MATH. SCIENCES RESEARCH INSTITUTE
NEW MEXICO STATE UNIV.
OREGON STATE UNIV.
PEKING UNIVERSITY
STANFORD UNIVERSITY
UNIVERSIDAD DE LOS ANDES
UNIV. OF ARIZONA
UNIV. OF BRITISH COLUMBIA
UNIV. OF CALIFORNIA, BERKELEY
UNIV. OF CALIFORNIA, DAVIS
UNIV. OF CALIFORNIA, IRVINE
UNIV. OF CALIFORNIA, LOS ANGELES
UNIV. OF CALIFORNIA, RIVERSIDE
UNIV. OF CALIFORNIA, SAN DIEGO
UNIV. OF CALIF., SANTA BARBARA
UNIV. OF CALIF., SANTA CRUZ
UNIV. OF HAWAII
UNIV. OF MONTANA
UNIV. OF NEVADA, RENO
UNIV. OF OREGON
UNIV. OF SOUTHERN CALIFORNIA
UNIV. OF UTAH
UNIV. OF WASHINGTON
WASHINGTON STATE UNIVERSITY
These supporting institutions contribute to the cost of publication of this Journal, but they are not owners or publishers and have no respon-
sibility for its contents or policies.
See inside back cover or www.pjmath.org for submission instructions.
Regular subscription rate for 2006: $425.00 a year (10 issues). Special rate: $212.50 a year to individual members of supporting institutions.
Subscriptions, requests for back issues from the last three years and changes of subscribers address should be sent to Pacic Journal of
Mathematics, P.O. Box 4163, Berkeley, CA 94704-0163, U.S.A. Prior back issues are obtainable from Periodicals Service Company, 11
Main Street, Germantown, NY 12526-5635. The Pacic Journal of Mathematics is indexed by Mathematical Reviews, Zentralblatt MATH,
PASCAL CNRS Index, Referativnyi Zhurnal, Current Mathematical Publications and the Science Citation Index.
The Pacic Journal of Mathematics (ISSN 0030-8730) at the University of California, c/o Department of Mathematics, 969 Evans Hall,
Berkeley, CA 94720-3840 is published monthly except July and August. Periodical rate postage paid at Berkeley, CA 94704, and additional
mailing ofces. POSTMASTER: send address changes to Pacic Journal of Mathematics, P.O. Box 4163, Berkeley, CA 94704-0163.
PUBLISHED BY PACIFIC JOURNAL OF MATHEMATICS
at the University of California, Berkeley 94720-3840
A NON-PROFIT CORPORATION
Typeset in L
A
T
E
X
Copyright 2006 by Pacic Journal of Mathematics
PACIFIC JOURNAL OF MATHEMATICS
Vol. 192, No. 2, 2000
CHARACTERIZATION OF THE HOMOGENEOUS
POLYNOMIALS P FOR WHICH (P +Q)(D) ADMITS
A CONTINUOUS LINEAR RIGHT INVERSE
FOR ALL LOWER ORDER PERTURBATIONS Q
R
-
functions on R
n
into itself. It is shown that P has this prop-
erty if and only if P is of principal type and real up to a
complex constant and has no elliptic factor.
1. Introduction.
The problem of L. Schwartz to characterize those linear partial dierential
operators P(D) with constant coecients that admit a continuous linear
right inverse on C
() or T
(), so does P
m
(D).
The converse implication fails in general, as the example (
x
)
2
(
y
)
2
+
z
shows. Since the condition PL(, log) for V (P
m
) is easier to check than
for V (P), one would like to know additional conditions on P
m
which imply
that for some or all lower degree perturbations Q the operator (P
m
+Q)(D)
admits a right inverse on C
(R
n
)
and/or T
(R
n
) for each Q C[z
1
, . . . , z
n
] of degree less than m,
(2) grad P
m
(x) ,= 0 for each x R
n
0, P
m
is real up to a complex
constant, and each irreducible factor of P
m
has a non-trivial real zero.
In particular, each operator P(D) of principal type admits a right inverse on
C
(R
n
) and T
(R
n
) whenever its principal part P
m
is real and no irreducible
factor of P
m
is elliptic. Note that these operators P(D) admit fundamental
solutions with large lacunas, as the results of Meise, Taylor, and Vogt [8], [9]
imply (see 4.8). Also, Theorem 1.1 proves nally what had been suggested
by many examples (see [12], Example 4.9, [13], Lemma 4), namely that the
existence of real non-zero singular points in V (P
m
) implies the existence of
a perturbation Q of degree less than m for which (P
m
+ Q)(D) does not
admit a right inverse on C
(R
n
).
The proof of Theorem 1.1 in one direction is a modication of the result
of Meise, Taylor, and Vogt [12] mentioned above. For the other direction
we use the concept of quasihomogeneity of polynomials. We show that this
notion together with [12], Lemma 4.7, provides a systematic method to nd
necessary conditions for V (P) to satisfy PL(R
n
, log) which can be checked
easily and directly on the given polynomial P.
2. Preliminaries.
In this section we introduce some of the denitions that are used in this
paper. First we recall the denition of a weight function from [1], then
we introduce conditions of Phragmen-Lindelof type for algebraic varieties
according to Meise, Taylor, and Vogt [9], [11], [12] and we explain the
signicance of these conditions.
Throughout the paper, [[ will denote the euclidean norm and B
(z) =
w C
n
: [w z[ < an open ball in that norm. Zero is not a natural
number.
Denition 2.1. Let : [0, [ ] 0, [ be continuous and increasing and
assume that it has the following properties:
() (2t) = O((t)) ()
_
1
(t)
t
2
dt <
() log t = O((t)), as t tends to innity () x (e
x
) is convex.
By abuse of notation, : z ([z[), z C
n
, will be called a weight function.
Throughout this paper we assume that (0) 1. It is easy to check that
this can be assumed without loss of generality.
PERTURBATION OF PARTIAL DIFFERENTIAL OPERATORS 203
Note that each weight function satises (z) = o([z[). Moreover, each
weight function is plurisubharmonic in C
n
in view of 2.1().
Denition 2.2. Let V be an algebraic variety of pure dimension k in C
n
and an open subset of V . A function u : [, [ will be called
plurisubharmonic if it is locally bounded above, plurisubharmonic in the
usual sense on
reg
, the set of all regular points of V in , and satises
u(z) = limsup
reg
,z
u()
at the singular points of V in . By PSH() we denote the set of all
plurisubharmonic functions on .
Denition 2.3. Let V C
n
be an algebraic variety and let be a weight
function. Then V satises the condition PL(R
n
, ) if the following holds:
There exists A 1 such that for each > 1 there exists B > 0 such that
each u PSH(V ) satisfying () and () also satises (), where:
() u(z) [Imz[ + O((z)), z V ,
() u(z) [Imz[, z V ,
() u(z) A[Imz[ + B(z), z V .
2.4. Phragmen-Lindelof conditions and continuous linear right
inverses. To explain the signicance of the condition PL(R
n
, ), let P(z) =
||m
a
||m
a
i
||
||
f
x
(R
n
), this was obtained earlier in [9]
and that Palamodov [15] proved that a dierential complex of C
-functions
over R
n
splits if and only if the associated varieties satisfy PL(R
n
, log).
From Meise, Taylor, and Vogt [12], 4.7, we recall the following lemma
which for many examples was the only tool to show that they do not satisfy
PL(R
n
, ) for some weight function .
Lemma 2.5. Let V be an algebraic variety in C
n
that satises PL(R
n
, )
with constants A > 0 and B
d,=m
a
, z C
n
,
where d, ) =
n
j=1
d
j
j
and where not all a
) V (P) with
1
, R,
R
n1
, and
,= 0,
(3) the polynomial P(,
(
1
) and
that
:= inf
||=
[P(
1
+ ,
2
, . . . ,
n
)[ > 0.
PERTURBATION OF PARTIAL DIFFERENTIAL OPERATORS 205
By a compactness argument there exists
0
> 0 so that whenever [z
k
k
[
0
for 2 k n and [[ = we have
[P(
1
+ , z
2
, . . . , z
n
)[ /2. (3.1)
Next x R 1 and let
s() :=
1
R
m
(P + Q)(R
d
1
(
1
+ ), R
d
2
2
, . . . , R
d
n
n
), C.
By hypothesis, we have Q =
m1
k=0
Q
k
where Q
k
is zero or d-quasihomogen-
eous of degree k. Since P is d-quasihomogeneous of degree m, it follows
that
s() P(
1
+ ,
2
, . . . ,
n
) =
m1
k=0
1
R
mk
Q
k
(
1
+ ,
2
, . . . ,
n
).
Hence there exists R
0
> 1 such that for R R
0
[s() P(
1
+ ,
2
, . . . ,
n
)[ /4 if [[ = .
Because of this and (3.1), Rouches theorem implies that for each R R
0
there exists (R) C satisfying [(R)[ < and s((R)) = 0. Hence
z(R) := (R
d
1
(
1
+ (R)), R
d
2
2
, . . . , R
d
n
n
)
belongs to V (P + Q). By (2) we have
[Imz(R)[ = R
d
1
[Im(
1
+ (R))[.
Since [(R)[ <
1
4
[Im
1
[, we have
3
4
R
d
1
[Im
1
[ [Imz(R)[
5
4
R
d
1
[Im
1
[. (3.2)
Now assume that V (P + Q) satises PL(R
n
, ) with constants A > 0 and
B
R
d
j
d
1
1
for 2 j n.
Then x R
R
1
and dene
1,R
: C
n
C by
1,R
(z) := z
1
/R
d
1
. Next
note that for each z C
n
with [z z(R)[ t[Imz(R)[ 2tR
d
1
[Im
1
[ its
coordinates z
1
, . . . , z
n
satisfy
z
1
R
d
1
z
1
R
d
1
1,R
(z(R))
+[(R)[ 3t[Im
1
[,
z
j
R
d
j
2t[Im
1
[
R
d
j
d
1
0
, 2 j n.
(3.3)
Note further that
K := w C
n
: [w
1
1
[ 3t[Im
1
[, [w
j
j
[
0
, 2 j n
is compact and hence
max
0km1
sup
wK
[Q
k
(w)[ < .
Therefore we can choose R
1
R
1
so large that
m1
k=0
1
R
mk
Q
k
(w)
/4 for each R R
1
and w K. (3.4)
Next x R R
1
and assume that z C
n
satises the inequalities in (3.3).
Then the d-quasihomogeneity properties of P and Q imply
1
R
m
(P + Q)(z) = P
_
z
1
R
d
1
, . . . ,
z
n
R
d
n
_
+
m1
k=0
1
R
mk
Q
k
_
z
1
R
d
1
, . . . ,
z
n
R
d
n
_
.
By (3.1) and (3.3) this implies
1
R
m
(P + Q)(z)
/4 if
z
1
R
d
1
= .
This shows that
1,R
(
V
z(R)
) C C : [
1
[ = .
Since
1,R
is continuous and satises [
1,R
(z(R))
1
[ = [(R)[ < and
since
V
z(R)
is connected, it follows that
z
1
R
d
1
<
1
4
[Im
1
[ for each z
V
z(R)
.
PERTURBATION OF PARTIAL DIFFERENTIAL OPERATORS 207
Hence we have for each z
V
z(R)
[Imz[ [Imz
1
[ [ImR
d
1
1,R
(z(R))[ [ImR
d
1
1,R
(z(R)) Imz
1
[
3
4
R
d
1
[Im
1
[
1
4
R
d
1
[Im
1
[ =
R
d
1
2
[Im
1
[
2
5
[Imz(R)[.
This shows that our claim holds with M =
5
2
.
Remark. Note that the application of Meise, Taylor, and Vogt [12], Lemma
4.7, stated in Lemma 2.5, requires a good understanding of the given vari-
ety V in order to nd the points z
0
V at which one can use this lemma.
Lemma 3.2 and also Lemma 3.6 below show that there is a systematic way
to nd these points in V (P) if P has a non-trivial d-quasihomogeneous prin-
cipal part with certain other properties. Therefore these lemmas are much
easier to use than Lemma 2.5. We demonstrate this in the following exam-
ples.
Examples 3.3.
(a) Let P C[z
1
, z
2
, z
3
] be dened as
P(z
1
, z
2
, z
3
) := z
2
1
z
3
+ z
1
z
2
2
+ z
2
z
3
.
If V (P) satises PL(R
3
, ) for some weight function then t
1
3
=
O((t)) as t tends to innity. This is an immediate consequence of
Lemma 3.2 and the following facts:
(1) P is (1, 2, 3)-quasihomogeneous of degree 5
(2) (
1
2
(1 + i
3), 1, 1) V (P)
(3) P(, 1, 1) =
2
+ + 1.
(b) Let P C[z
1
, z
2
, z
3
] be dened as
P(z
1
, z
2
, z
3
) := z
2
1
z
3
+ z
1
z
2
2
+ z
2
3
.
If V (P) satises PL(R
3
, ) for some weight function then t
1
2
=
O((t)) as t tends to innity. This follows from Lemma 3.2 and the
following facts:
(1) P is (2, 3, 4)-quasihomogeneous of degree 8
(2) (i, 0, 1) V (P)
(3) P(, 0, 1) =
2
+ 1.
(c) Let P C[z
1
, z
2
, z
3
] be dened as
P(z
1
, z
2
, z
3
) := z
2
1
z
2
z
2
3
.
If V (P) satises PL(R
3
, ) for some weight function then t
1
2
=
O((t)) as t tends to innity. This follows immediately from Lemma
3.2 and the following facts:
(1) P is (1, 2, 2)-quasihomogeneous of degree 4
(2) (i, 1, 1) V (P)
208 R.W. BRAUN, R. MEISE, AND B.A. TAYLOR
(3) P(, 1, 1) = (
2
+ 1).
To indicate that Lemma 3.2 can also be used to disprove conditions of
Phragmen-Lindelof type for homogeneous polynomials which have not been
considered so far, we next recall the condition introduced by Hormander
[3] to characterize the dierential operators P(D) that are surjective on the
space /() of all real-analytic functions on a convex open set R
n
, n 2.
We restrict our attention here to the case = R
n
.
Denition 3.4. Let P
m
C[z
1
, . . . , z
n
] be homogeneous of degree m.
(a) The variety V (P
m
) satises the condition HPL(R
n
) if there exists A
1 such that each u PSH(V ) satisfying () and () also satises (),
where
() u(z) [z[, z V (P
m
),
() u(z) 0, z V (P
m
) R
n
,
() u(z) A[Imz[, z V (P
m
).
(b) The variety V (P
m
) satises HPL(R
n
, loc) at V (P
m
) R
n
if there
exist A 0 and 0 < r
2
< r
1
such that each function u which is
plurisubharmonic on V (P
m
) B
r
1
() and satises () and () also
satises (), where
() 0 u 1 on V (P
m
) B
r
1
(),
() u(z) 0, z V (P
m
) R
n
B
r
1
(),
() u(z) A[Imz[, z V (P
m
) B
r
2
().
Remark. For P C[z
1
, . . . , z
n
] let P
m
denote the principal part of P.
Hormander has shown in [3] that the operator P(D): /(R
n
) /(R
n
) is
surjective if and only if V (P
m
) satises HPL(R
n
). The latter holds if and
only if V (P
m
) satises HPL(R
n
, loc) at each V (P
m
) R
n
, [[ = 1.
Example 3.5. Let P C[z
1
, . . . , z
4
] be dened as
P(z
1
, . . . , z
4
) := z
2
1
z
4
z
2
2
z
3
.
Then V (P) fails PL(R
4
, ) for each weight function and V (P) fails
HPL(R
4
). In particular V (P) fails HPL(R
4
, loc) at some V (P) R
n
,
[[ = 1.
To show this, note rst that P is homogeneous. By Meise, Taylor, and
Vogt [12], Theorem 4.1 and Corollary 2.9, this implies that V (P) satises
PL(R
4
, log) if and only if V (P) satises PL(R
4
, ) for each weight function
. Next note that:
(1) P is (2, 3, 4, 6)-homogeneous of degree 10
(2) (i, 1, 1, 1) V (P)
(3) P(, 1, 1, 1) =
2
+ 1.
Therefore Lemma 3.2 implies that V (P) fails PL(R
4
, t
1/3
). Hence it also
fails PL(R
4
, log). Since P is irreducible and not elliptic, it follows from [12],
Corollary 3.14, that V (P) does not satisfy HPL(R
4
). Since V (P) satises the
PERTURBATION OF PARTIAL DIFFERENTIAL OPERATORS 209
dimension condition, dimV (P) R
n
= n1, Theorem 3.13(4) of [12] shows
that V (P) fails HPL(R
4
, loc) at some V (P) R
n
, [[ = 1. Inspection
of the proof of Lemma 3.2 shows that = lim
R
z(R)/[z(R)[. So in the
present example, = (0, 0, 0, 1).
For our application we also need the following variant of Lemma 3.2,
which for k = 1 is weaker than that lemma.
Lemma 3.6. Let P C[z
1
, . . . , z
n
] be d-quasihomogeneous of degree m and
let Q C[z
1
, . . . , z
n
] be the sum of d-quasihomogeneous polynomials of de-
grees less than m. Assume that for some k, 1 k < n, the following
conditions are fullled:
(1) d
1
= = d
k
< d
j
for j > k,
(2) there exists = (
) C
k
R
nk
satisfying P() = 0 and
,= 0,
(3) if P(z
) = 0 then Imz
,= 0.
If V (P + Q) satises PL(R
n
, ) for some weight function and D =
maxd
j
:
j
,= 0, then satises t
d
1
/D
= O((t)) as t tends to innity.
Proof. From (2) and (3) it follows that the polynomial z
P(z
) is not
constant. Since the hypotheses are invariant under a real linear change of
coordinates in the z
[ so that
:= inf
||=r
[P(
1
+ ,
2
, . . . ,
n
)[ > 0.
For each 1 we get from (3) that P does not vanish on the compact set
L() := x R
n
: [x
j
[ , 1 j k, x
j
=
j
, k + 1 j n.
Hence there exists = () > 0 such that for B
(0) := z C
n
: [z[
we have
() := inf[P(z)[ : z L() + B
()
(0) > 0.
Next note that by hypothesis we have Q =
m1
k=0
Q
k
, where Q
k
is either
zero or d-quasihomogeneous of degree k. Then x R 1 and consider the
polynomial
s() :=
1
R
m
(P + Q)(R
d
1
(
1
+ ), R
d
2
2
, . . . , R
d
n
n
).
Because of our assumptions on d-quasihomogeneity, we have
s() P(
1
+ ,
2
, . . . ,
n
) =
m1
k=0
1
R
mk
Q
k
(
1
+ ,
2
, . . . ,
n
).
210 R.W. BRAUN, R. MEISE, AND B.A. TAYLOR
From this and a standard compactness argument it follows that there exists
R
0
1 such that for each R R
0
,
sup
||=r
[s() P(
1
+ ,
2
, . . . ,
n
)[ /2.
Since P() = 0, our choice of shows that we can apply Rouches theorem
to get the existence of a zero (R) of s satisfying
[(R)[ < r
1
4
[Im
[.
Now assume that V (P + Q) satises PL(R
n
, ) for some weight function
with constants A 1 and B
2
, . . . , R
d
n
n
).
By V
R
we denote the set V (P + Q) B
t|Imz(R)|
(z(R)). We claim that the
following holds:
() There exist R
2
R
0
and > 0 such that for R R
2
[Imz[ [Imz(R)[ for each z V
R
.
To prove () note that the choice of (R) and d
1
= = d
k
imply z(R)
V (P + Q) and [Imz(R)[ = R
d
1
[Im(
1
+ (R),
2
, . . . ,
k
)[. By the estimate
for (R) this shows
3
4
R
d
1
[Im
[ [Imz(R)[
5
4
R
d
1
[Im
[.
For z V
R
this implies
z
1
R
d
1
t[Imz(R)[
R
d
1
+[(R)[ 2t[Im
[,
z
j
R
d
j
2tR
d
1
d
j
[Im
[, 2 j n.
Because of this and (1) we can choose 1 and R
1
R
0
so that
z
j
R
d
j
for 1 j k and
z
j
R
d
j
() for k + 1 j n
whenever R R
1
and z V
R
. Next note that
0 = (P + Q)(z)
= R
m
P
_
z
1
R
d
1
, . . . ,
z
n
R
d
n
_
+ R
m
m1
k=0
1
R
mk
Q
k
_
z
1
R
d
1
, . . . ,
z
n
R
d
n
_
implies
P
_
z
1
R
d
1
, . . . ,
z
n
R
d
n
_
m1
k=0
1
R
mk
Q
k
_
z
1
R
d
1
, . . . ,
z
n
R
d
n
_
C
R
PERTURBATION OF PARTIAL DIFFERENTIAL OPERATORS 211
for some constant C 1 and all z V
R
, R R
1
. Hence we can choose
R
2
R
1
, so that
P
_
z
1
R
d
1
, . . . ,
z
n
R
d
n
_
< () whenever R R
2
, z V
R
.
By the denition of (), this implies
R
d
1
[Imz
[ =
Im
_
z
1
R
d
1
, . . . ,
z
n
R
d
n
_
()
and consequently, for :=
4
5
()
|Im
|
:
[Imz[ [Imz
[ ()R
d
1
=
5
4
R
d
1
[Im
[ [Imz(R)[,
which proves ().
From () and Meise, Taylor, and Vogt [12], Lemma 4.7, it follows that
there exists B > 0 such that
[Imz(R)[ B(z(R)) for R R
2
.
It is again no restriction to assume
n
,= 0 and D = d
n
. From this it
follows as in the proof of Lemma 3.2 that R
d
1
/D
= O((R)) as R tends to
innity.
As an application, we give a short proof of a result of Meise and Taylor
[7], 2.1.
Corollary 3.7. Let P C[z
1
, . . . , z
n
] be of degree m and assume that its
principal part P
m
is real. Let q C[t] have degree k < m and non-real
leading coecient. Set Q(z, t) = P(z) + q(t) and let be a weight function
with (t) = o(t
k/m
). Then V (Q) does not satisfy PL(R
n+1
, ).
Proof. We apply Lemma 3.6 in n + 1 variables with d
1
= = d
n
= k <
m = d
n+1
. Let b C R denote the leading coecient of q. The d-
quasihomogeneous principal part of Q is P
m
(z) + bt
k
. Choose
C
n
with
P
m
(
) = b and set
, C
, R
and
[
||
[ <
.
Then V (P) satises PL(R
n
, ).
The proof of Lemma 4.1 is quite analogous to that of [12], Lemma 5.2.
Therefore, we will only sketch its main steps: Since V (P
m
) has PL(R
n
, log)
by hypothesis, it follows from Meise, Taylor, and Vogt [12], Theorem 3.13,
and [10], Theorem 5.1, that V (P) satises the condition (RPL) of [10], 2.2.
Hence there exists A
0
1 such that for each > 1, there exists B
> 0 such
that each u PSH(V (P)) satisfying
u(z) [z[ + o([z[) and u(z) [Imz[, z V (P) (4.1)
also satises
u(z) A
0
[z[ + B
, z V (P). (4.2)
This a priori estimate and a compactness argument imply that it suces to
prove the desired Phragmen-Lindelof estimate for each V (P
m
)R
n
, [[ =
1, in the intersection of V (P) with some small cone centered around (for
the precise argument we refer to the proof of Meise and Taylor [7], 4.5).
Using appropriate coordinates in such cones, these estimates are derived
from (4.2) similarly as in the proof of [12], Lemma 5.2.
To state our main result, we recall the following denition from Horman-
der [5], 10.4.11.
Denition 4.2. P C[z
1
, . . . , z
n
] is said to be of principal type if its prin-
cipal part P
m
satises
n
j=1
P
m
z
j
(x)
2
,= 0 for each x R
n
0.
Note that by Eulers rule x, grad P
m
(x)) = mP
m
(x), so P is of principal
type if and only if
grad P
m
(x) ,= 0 for each x R
n
0 satisfying P
m
(x) = 0.
Theorem 4.3. Let n 2 and let P
m
C[z
1
, . . . , z
n
] be homogeneous of
degree m 2. Then the following conditions are equivalent:
(1) For each Q C[z
1
, . . . , z
n
] with deg Q < m, the variety V (P
m
+ Q)
satises PL(R
n
, log),
(2) P
m
is of principal type, P
m
is real up to a complex constant, and each
irreducible factor q of P
m
has a real zero ,= 0.
Proof. (1) (2): By hypothesis, V (P
m
) has PL(R
n
, log). Hence it follows
from Meise, Taylor, and Vogt [12], Theorem 3.13, that dimV (q)R
n
= n1
PERTURBATION OF PARTIAL DIFFERENTIAL OPERATORS 213
for each irreducible factor q of P
m
. Thus, the third condition in (2) is
fullled.
To prove that P
m
is of principal type, note rst that by Meise, Taylor, and
Vogt [13], Lemma 2, there exists C 0 so that P
m
R[z
1
, . . . , z
n
].
Hence the second condition of (2) is fullled and it is no restriction to assume
that P
m
has real coecients. To prove that grad P
m
does not vanish on
V (P
m
) (R
n
0) we argue by contradiction and assume that there exists
V (P
m
) (R
n
0) satisfying grad P
m
() = 0. After a real linear change
of variables, we may assume = e
n
= (0, . . . , 0, 1). Then we apply Taylors
formula at to get
P
m
(z
, 1) = P
m
( + (z
, 0)) =
m
k=
q
k
(z
), (4.3)
where q
k
C[z
1
, . . . , z
n1
] is zero or homogeneous of degree k and where
q
,= 0. Then 2 m since P
m
and grad P
m
vanish at . By the
homogeneity of P
m
it follows from 4.3 that for z
n
,= 0 we have
P
m
(z
, z
n
) = z
m
n
P
m
_
z
z
n
, 1
_
= z
m
n
m
k=
q
k
_
z
z
n
_
=
m
k=
z
mk
n
q
k
(z
).
By continuity, this holds also when z
n
= 0. Now let
P(z) := P
m
(z) + iz
m1
n
=
m
k=
z
mk
n
q
k
(z
) + iz
m1
n
and d := (1, . . . , 1, ). Then the monomial z
m1
n
has d-degree (m1)
and the polynomials z
mk
n
q
k
(z
(z
) + iz
m1
n
.
To show that q satises the hypotheses of Lemma 3.6, we note that q
,= 0
implies the existence of
C
n1
satisfying q
) = i. Then := (
, 1)
satises
q() = q
) + i = 0.
Hence the conditions (1) and (2) of Lemma 3.6 are fullled. To show that
also condition 3.6(3) holds, assume that for some z
C
n1
we have
0 = q(z
, 1) = q
(z
) + i.
Since q
/ R
n1
, which proves condition
3.6(3). Hence we can apply Lemma 3.6 to conclude that V (P) does not
satisfy PL(R
n
, log) in contradiction to the hypothesis (1).
(2) (1): Since P
m
is real up to a complex factor, it is no restriction to
assume that P
m
has real coecients. By Meise and Taylor [7], Lemma 4.6,
the hypothesis implies that P
m
is a product of distinct, irreducible factors
214 R.W. BRAUN, R. MEISE, AND B.A. TAYLOR
with real coecients, each of which is of principal type. This implies that
P
m
is locally hyperbolic at every real characteristic in the sense of Denition
6.4 of Hormander [3]. Hence it follows from [3], Theorem 6.5, that V (P
m
)
satises HPL(R
n
). By hypothesis, no irreducible component of V (P
m
) is
elliptic. Hence V (P
m
) satises PL(R
n
, log) by Meise, Taylor, and Vogt [12],
Corollary 3.14. Next x Q C[z
1
, . . . , z
n
] with deg Q < m, Q ,= 0 and
choose C > 0 such that [Q(z)[ C(1+[z[
m1
), z C
n
. By the homogeneity
of P
m
, the function
x
n
j=1
P
m
x
j
(x)
with
[Q(z)[ A
max
=0
[P
()
m
(z)[, z .
Now x z (V (P
m
+ Q) V (P
m
)) and note that by Hormander [4],
Lemma 4.1.1, (which holds also for C
n
) there exists D > 0 such that
dist(, V (P
m
))
=0
P
()
m
()
P
m
()
1
||
D, C
n
V (P
m
).
This and P
m
(z) = Q(z) imply
1
A
max
=0
P
()
m
(z)
Q(z)
= max
=0
P
()
m
(z)
P
m
(z)
max
=0
_
D
dist(z, V (P
m
))
_
||
and hence the existence of E > 0 such that (by continuity)
dist(z, V (P
m
)) E, z V (P
m
+ Q).
From this we get (1) by Lemma 4.1.
Remark 4.4. Note that Theorem 4.3 and its Corollary 4.7 below extend
Corollary 5.8 of Meise, Taylor, and Vogt [12]. Moreover, Theorem 4.3 shows
that the characterizing condition is in fact weaker than the sucient condi-
tion given there, since P
m
can be of principal type, while V (P
m
) has complex
singularities. To see this, consider P
4
(x, y, z) := (x
2
+y
2
z
2
)(x
2
+z
2
y
2
/4)
and note that (i
_
3/5, 2
_
2/5, 1) : C is a singular line for V (P
4
).
Remark 4.5. From Meise and Taylor [7], 4.8 and 3.4, it follows that each
real homogeneous polynomial P
m
of principal type for which each irreducible
PERTURBATION OF PARTIAL DIFFERENTIAL OPERATORS 215
factor has a non-trivial real zero is also stable under certain real perturba-
tions introducing an extra variable. More precisely, the variety
z C
n+1
: P
m
(z
1
, . . . , z
n
) = z
n+1
satises PL(R
n+1
, log), provided that P
m
is of real principal type and has
no elliptic factor.
In the proof of Theorem 4.3 we used complex polynomials to show that
V (P
m
+Q) fails PL(R
n
, log) if V (P
m
) satises PL(R
n
, log) while P
m
is not of
principal type. In some cases, such as P
2
(x, y, z) = x
2
y
2
, this is the only
possible choice (see Meise, Taylor, and Vogt [12], Example 4.9). However, in
other cases, real perturbations can also have the same eect, as the following
example shows.
Example 4.6. Let P(x, y, z):=x
2
z+yz
2
+yz. The principal part P
3
(x, y, z)
= x
2
z+yz
2
= (x
2
+yz)z is hyperbolic with respect to N = (0, 1, 1). Hence
V (P
3
) satises PL(R
3
, log) by Meise, Taylor, and Vogt [9], 3.6, and 4.5 in
connection with [12], 2.12. Obviously, P
3
is not of principal type. By
Example 3.3(a), V (P) does not satisfy PL(R
3
, ) whenever (t) = o(t
1/3
).
Corollary 4.7. Let P
m
C[z
1
, . . . , z
n
] be homogeneous of degree m 2
and of principal type. Then the following conditions are equivalent:
(1) V (P
m
) satises PL(R
n
, log),
(2) P
m
is real up to a complex constant and each irreducible factor q of
P
m
has a real zero ,= 0,
(3) there exist k N, Q C[z
1
, . . . , z
n
] with deg Q < km, and a weight
function so that V (P
k
m
+ Q) satises PL(R
n
, ),
(4) for each k N and each Q C[z
1
, . . . , z
n
] with deg Q = l < km we
have:
(a) if l k(m1), then V (P
k
m
+ Q) satises PL(R
n
, log),
(b) if l > k(m 1), then V (P
k
m
+ Q) satises PL(R
n
, 1 + t
) for =
1 +
l
k
m.
Proof. (1) (2): This follows from Meise, Taylor, and Vogt [13], Lemma
2, and [12], Corollary 3.14.
(2) (3): Since P
m
is of principal type, (2) implies that condition 4.3(2)
is fullled. Hence V (P
m
) satises PL(R
n
, log) by Theorem 4.3. Thus (3)
holds for k = 1 and Q = 0.
(3) (1): If V (P
k
m
+Q) satises PL(R
n
, ) then V (P
k
m
) = V (P
m
) satises
PL(R
n
, log) by Meise, Taylor, and Vogt [12], Theorem 4.1.
(4) (3): This holds trivially.
(2) (4): Since P
m
is of principal type, there exists > 0 such that
[grad P
m
(z)[ > 0 for z C
n
, [z[ = 1 and [Imz[ .
216 R.W. BRAUN, R. MEISE, AND B.A. TAYLOR
Therefore standard arguments using homogeneity and compactness imply
the existence of > 0 such that
max
0<||k
[(P
k
m
)
()
(z)[ [z[
k(m1)
for z C
n
, [Imz[ [z[, [z[ 1. (4.4)
Now x Q C[z
1
, . . . , z
n
] satisfying deg Q k(m 1). Then there exists
D 1 such that
[Q(z)[ D max
||>0
(P
k
m
)
()
(z)
for z C
n
, [z[ 1, [Imz[ [z[. (4.5)
Now x V (P
m
) R
n
, [[ = 1, and let
(, , 1) :=
_
z C
n
:
z
[z[
, [z[ 1
_
.
For z (, , 1) we have [Imz[ [z[ since is real. Now x V (P
k
m
+
Q) (, , 1) satisfying P
m
() ,= 0. Then P
k
m
() = Q() and (4.5) imply
the existence of M 1 such that
1
M
||>0
(P
k
m
)
()
()
Q()
1/||
=
||>0
(P
k
m
)
()
()
P
k
m
()
1/||
.
Since by Hormander [4], Lemma 4.1.1, there exists C 1 such that
dist(, V (P
k
m
))
||>0
(P
k
m
)
()
()
P
k
m
()
1/||
C
we conclude that
dist(, V (P
k
m
)) CM CM log(2 +[[).
Since V (P
m
) R
n
, [[ = 1, was chosen arbitrarily, it follows from
Lemma 4.1 that V (P
k
m
+ Q) satises PL(R
n
, log) in this case.
If k(m 1) < l = deg Q < km then 0 < := 1 +
l
k
m < 1 and (4.4)
implies the existence of > 0 such that
max
0<||k
(P
k
m
)
()
(z)
[z[
||
[z[
k(m1)k
= [z[
l
if [z[ 1 and [Imz[ [z[.
Hence there exists D 1 such that
[Q(z)[ D max
||>0
(P
k
m
)
()
(z)
(1 +[z[
)
||
for z C
n
, [Imz[ [z[, [z[ 1.
From this it follows as above that for each V (P
k
m
) R
n
, [[ = 1, there
exists C
(1 +[[
), (, , 1).
PERTURBATION OF PARTIAL DIFFERENTIAL OPERATORS 217
By Lemma 4.1, this implies that V (P
k
m
+ Q) satises PL(R
n
, ) for (t) =
1 + t
, as asserted.
Theorem 4.3 in connection with Meise, Taylor, and Vogt [8], Theor`eme,
also implies the following result on the existence of fundamental solutions
with large lacunas.
Corollary 4.8. Let P C[z
1
, . . . , z
n
] be of principal type and of degree
m 2. If the principal part P
m
of P is real up to a complex constant and
if each irreducible factor q of P
m
has a real zero ,= 0 then the following
holds:
For each r > 0 there exists R > 0 such that for each R
n
, [[ > R, there
exists a fundamental solution E
(R
n
) of P(D) satisfying Supp E
x R
n
: [x [ r.
References
[1] R.W. Braun, R. Meise and B.A. Taylor, Ultradierentiable functions and Fourier
analysis, Res. Math., 17 (1990), 207-237.
[2] S.G. Gindikin and L.R. Volevich, Mixed Problem for Partial Dierential Equations
with Quasihomogeneous Principal Part, Amer. Math. Soc., 1996 (translation from the
Russian).
[3] L. Hormander, On the existence of real analytic solutions of partial dierential equa-
tions with constant coecients, Invent. Math., 21 (1973), 151-183.
[4] , Linear Partial Dierential Operators, Springer 1969.
[5] , The Analysis of Linear Partial Dierential Operators, Vol. II, Springer 1983.
[6] Y. Laurent, Theorie de la Deuxi`eme Microlocalisation dans le Domaine Complexe,
Birkhauser, 1985.
[7] R. Meise and B.A. Taylor, Phragmen-Lindelof conditions for graph varieties, Res.
Math., to appear.
[8] R. Meise, B.A. Taylor and D. Vogt, Characterisation des operateurs lineaires aux
derivees partielles avec coecients constants sur E(R
n
) admettant un inverse `a droite
qui est lineaire et continu, C. R. Acad. Sci. Paris, 307 (1988), 239-242.
[9] , Characterization of the linear partial operators with constant coecients that
admit a continuous linear right inverse, Ann. Inst. Fourier (Grenoble), 40 (1990),
619-655.
[10] , Extremal plurisubharmonic functions of linear growth on algebraic varieties,
Math. Z., 219 (1995), 515-537.
[11] , Continuous linear right inverses for partial dierential operators on non-
quasianalytic classes and on ultradistributions, Math. Nachr., 180 (1996), 213-242.
[12] , Phragmen-Lindelof principles on algebraic varieties, J. Amer. Math. Soc.,
11 (1998), 1-39.
[13] , Continuous linear right inverses for partial dierential operators of order 2
and fundamental solutions in half spaces, Manuscr. Math., 90 (1996), 449-464.
218 R.W. BRAUN, R. MEISE, AND B.A. TAYLOR
[14] R. Nevanlinna, Eindeutige analytische Funktionen, Springer, 1974.
[15] V.P. Palamodov, A criterion for splitness of dierential complexes with constant co-
ecients, in Geometrical and Algebraical Aspects in Several Complex Variables,
C.A. Berenstein and D.C. Struppa (Eds.), EditEL, (1991), 265-291.
Received June 4, 1998. The third author gratefully acknowledges support of the A. v. Hum-
boldt Stiftung for visits at Mathematisches Institut der Heinrich-Heine-Universitat D us-
seldorf, where this research was carried out.
Mathematisches Institut
Heinrich-Heine-Universit
at
Universit
atsstrae 1
40225 D
usseldorf
Germany
E-mail address: braun@cs.uni-duesseldorf.de
Mathematisches Institut
Heinrich-Heine-Universit
at
Universit
atsstrae 1
40225 D
usseldorf
Germany
E-mail address: meise@cs.uni-duesseldorf.de
Department of Mathematics
University of Michigan
Ann Arbor, MI 48109
E-mail address: taylor@math.lsa.umich.edu
PACIFIC JOURNAL OF MATHEMATICS
Vol. 192, No. 2, 2000
CUBIC MODULAR EQUATIONS AND NEW
RAMANUJAN-TYPE SERIES FOR 1/
Heng Huat Chan and Wen-Chin Liaw
Dedicated to our advisor, Professor Bruce C. Berndt on his 60
th
birthday
In this paper, we derive new Ramanujan-type series for 1/
which belong to Ramanujans theory of elliptic functions to alter-
native base 3 developed recently by B.C. Berndt, S. Bhargava,
and F.G. Garvan.
1. Introduction.
Let (a)
0
= 1 and, for a positive integer m,
(a)
m
:= a(a + 1)(a + 2) (a + m1),
and
2
F
1
(a, b; c; z) :=
m=0
(a)
m
(b)
m
(c)
m
z
m
m!
, |z| < 1.
In his famous paper Modular equations and approximations to [10],
S. Ramanujan oered 17 beautiful series representations for 1/. He then
remarked that two of these series
27
4
=
m=0
(2 + 15m)
(
1
2
)
m
(
1
3
)
m
(
2
3
)
m
(m!)
3
_
2
27
_
m
(1.1)
and
15
3
2
=
m=0
(4 + 33m)
(
1
2
)
m
(
1
3
)
m
(
2
3
)
m
(m!)
3
_
4
125
_
m
(1.2)
belong to the theory of q
2
, where
q
2
= exp
_
3
2
F
1
(
1
3
,
2
3
; 1; 1 k
2
)
2
F
1
(
1
3
,
2
3
; 1; k
2
)
_
.
Ramanujan did not elaborate on his theory of q
2
, neither did he provide
details for his proofs of (1.1) and (1.2).
219
220 HENG HUAT CHAN AND WEN-CHIN LIAW
Ramanujans formulas (1.1) and (1.2) were rst proved by J.M. Borwein
and P.B. Borwein in 1987. Motivated by their study of Ramanujans se-
ries for 1/ associated with the classical theory of elliptic functions, they
established the following result:
Theorem 1.1 ([3, p. 186]). Let
K(x) :=
2
F
1
(
1
3
,
2
3
; 1; x), and
K(x) :=
dK(x)
dx
.
For n Q
+
, dene the cubic singular modulus to be the unique number
n
satisfying
K(1
n
)
K(
n
)
=
n. (1.3)
Set
(n) =
3
3
8
(K(
n
))
2
n
_
3
2
n
(1
n
)
K(
n
)
K(
n
)
n
_
, (1.4)
a
n
:=
8
3
9
_
(n)
n
n
_
, (1.5)
and
b
n
:=
2
3n
3
_
1 H
n
, (1.6)
where
H
n
:= 4
n
(1
n
). (1.7)
Then
1
m=0
(a
n
+ b
n
m)
(
1
2
)
m
(
1
3
)
m
(
2
3
)
m
(m!)
3
H
m
n
. (1.8)
Remark. We state this theorem with a dierent denition of (n) than that
given in [3]. We have avoided using elliptic integrals of the second kind and
Legendres relation.
The Borweins theorem indicates that for each positive rational number
n, we can easily derive a series for 1/ belonging to the theory of q
2
if the
values of
n
and (n) (the rest of the constants can be computed from these)
are known. The computation of these constants for any given n, however,
is far from trivial.
NEW RAMANUJAN-TYPE SERIES FOR 1/ 221
The Borweins method of evaluating
n
involves solving a quartic equa-
tion. More precisely, they show that when n is an odd positive integer,
n
is the smaller of the two real solutions of the equation
(9 8
n
)
3
64
3
n
(1
n
)
=
(4G
24
3n
1)
3
27G
24
3n
, (1.9)
where G
n
is the classical Ramanujan-Weber class invariant dened by
G
n
:= 2
1/4
e
n/24
m=1
(1 + e
n(2m1)
).
Using known values for G
3n
, they derive
n
for n = 3 and 5 from (1.9). For
example, from (see [1, p. 190])
G
12
15
= 8
_
5 + 1
2
_
4
,
they deduce that
5
=
1
2
11
5
50
.
When n is an even positive integer, the corresponding formula between
n
and g
3n
is
(9 8
n
)
3
64
3
n
(1
n
)
=
(4g
24
3n
+ 1)
3
27g
24
3n
, (1.10)
where g
n
is the other Ramanujan-Weber class invariant dened by
g
n
:= 2
1/4
e
n/24
m=1
(1 e
n(2m1)
).
Using (1.10) and known values of g
3n
, they compute
n
for n = 2, 4, and 6.
Together with the values of (n) for n = 2, 3, 4, 5, and 6 [3, p. 190, Problem
20], they obtained ve series for 1/. Ramanujans series (1.1) and (1.2)
then correspond to n = 4 and 5, respectively. At the end of [3, Chapter 5,
Section 5], the Borweins remark that their explanation of Ramanujans series
(1.1) and (1.2) is a bit disappointing as they only have well-concealed
analogues of the original theory for K.
In a recent paper, B. C. Berndt, S. Bhargava, and F. G. Garvan [2]
succeeded in developing Ramanujans corresponding theories mentioned
in [10]. One of these theories is Ramanujans theory of q
2
and its discovery
has motivated us to revisit Ramanujans series (1.1) and (1.2). This theory
is now known as Ramanujans theory of elliptic functions to alternative base
3 or Ramanujans elliptic functions in the theory of signature 3.
In this article, we derive some new formulas from the theory of q
2
m=0
(a + bm)
(
1
2
)
m
(
1
3
)
m
(
2
3
)
m
(m!)
3
_
73 40
3
2
1/3
23
2
(4 + 5
3)
_
3m
,
(1.11)
where
a := 1028358
3 795.
Each term in this series gives approximately 10 decimal places of .
In Section 2, we recall some important results proved in [2] and establish
new formulas satised by (n) which lead to a new formula for a
n
. In
Section 3, we describe our strategy for computing a
n
. In Section 4, we
indicate that if 3n is an Euler convenient number, then
n
, as well as other
related cubic singular moduli, can be computed explicitly via Kroneckers
Limit Formula. These values are used to derive the constants a
n
, b
n
, and
H
n
listed in our nal section.
2. Ramanujans elliptic functions in the theory of signature 3
(Ramanujans theory of q
2
).
Dene
a(q) :=
m,n=
q
m
2
+mn+n
2
and
c(q) :=
m,n=
q
(m+1/3)
2
+(m+1/3)(n+1/3)+(n+1/3)
2
.
Theorem 2.1. If
q = exp
_
3
K(1 )
K()
_
, (2.1)
then
=
c
3
(q)
a
3
(q)
. (2.2)
Theorem 2.2 (Borweins Inversion Formula). We have
a(q) = K
_
c
3
(q)
a
3
(q)
_
= K(), (2.3)
where K() is dened in Theorem 1.1.
NEW RAMANUJAN-TYPE SERIES FOR 1/ 223
Theorem 2.1 and Theorem 2.2 are important results in Ramanujans the-
ory of elliptic functions in the signature 3 which can be found in [2] as
Lemma 2.9 and Lemma 2.6, respectively.
Let be given as in (2.2). Then it is known that (see [2, (4.4)] and [5,
(4.7)])
q
d
dq
= K
2
()(1 ). (2.4)
The modulus is said to have degree n over the modulus when there is a
relation
K(1 )
K()
= n
K(1 )
K()
. (2.5)
Hence, when q satises (2.1),
q
n
= exp
_
3
K(1 )
K()
_
,
and applying (2.4) with q and replaced by q
n
and , respectively, we
deduce that
q
d
dq
= nK
2
()(1 ). (2.6)
Combining (2.6) and (2.4), we arrive at:
Theorem 2.3. If has degree n over , then
m
2
(, )
d
d
= n
(1 )
(1 )
, (2.7)
where
m(, ) :=
K()
K()
. (2.8)
We call the quantity m(, ) the multiplier of degree n in the theory of
signature 3. We are now ready to derive new formulas satised by (n).
Theorem 2.4. Let (r) be dened as in (1.4). Then
_
1
r
_
=
r (r)
r
.
Proof. Set
=
K(1 )
K()
.
Then
d
d
K() +
K() =
K(1 ).
224 HENG HUAT CHAN AND WEN-CHIN LIAW
From (2.1) and (2.4), we deduce that
d
d
=
2
3
K
2
()(1 ).
Hence,
K(1 ) =
3
2
1
K()(1 )
K(). (2.9)
Next, note that from (1.3)
1/r
= 1
r
. (2.10)
Therefore, by (1.4) and (2.9) with =
r,
_
1
r
_
=
3
3
8
K
2
(1
r
)
_
1
r
_
3
r
(1
r
)
K(1
r
)
2K(1
r
)
(1
r
)
_
=
3
3
8
1
K
2
(1
r
)
3
3
4rK
2
(
r
)
+
3
r
(1
r
)
K(
r
)
2
rK(
r
)
+
1
r
r
r
=
r (r)
r
.
:= m(
r
,
n
2
r
) and m
:=
dm
d
(
r
,
n
2
r
).
Then
(n
2
r) = m
2
_
(r)
r
_
3
2
m
1
r
(1
r
) m
n
n
2
r
m
2
__
. (2.11)
Proof. Suppose has degree n over . Then from (2.8), we deduce that
m
dK()
d
+ K()
dm
d
=
dK()
d
. (2.12)
Using (2.7), we may rewrite (2.12) as
n(1 )
K()
dK()
d
=
m
2
(1 )
K()
dK()
d
m(1 )
dm
d
. (2.13)
Next, suppose =
r
. Then =
n
2
r
, and by (1.4), (2.8), and (2.13),
(n
2
r) =
3
3
8K
2
(
n
2
r
)
n
r
_
3
n
2
r
(1
n
2
r
)
2K(
n
2
r
)
K(
n
2
r
)
n
2
r
_
=
3
3m
2
8K
2
(
r
)
r
_
3m
2
r
(1
r
)
2K(
r
)
K(
r
)
3
2
m
r
(1
r
) m
n
n
2
r
_
NEW RAMANUJAN-TYPE SERIES FOR 1/ 225
= m
2
_
(r)
r
_
3
2
m
1
r
(1
r
) m
n
n
2
r
m
2
__
.
_
1
n
_
_
1
n
_
1
n
3
n
(1
n
)
2
n
dm
d
(1
n
,
n
)
n
_
_
= (n) + 2
n
n +
3
n
(1
n
)
2
dm
d
(1
n
,
n
).
Hence, we have:
Theorem 2.6.
(n) =
n
n
+
3
n
(1
n
)
4
dm
d
(1
n
,
n
).
Corollary 2.7. With a
n
and H
n
dened in Theorem 1.1, we have
a
n
=
H
n
2
3
dm
d
(1
n
,
n
).
Theorems 2.4, 2.5, and 2.6 are the respective cubic analogues of [3, (5.1.5),
Theorem 5.2, and (5.2.5)].
3. Computations of a
n
.
It is clear from Corollary 2.7 that in order to compute a
n
it suces to
compute
n
and dm/d, where m is the multiplier of degree n. We will
discuss the computation of the latter in this section. Suppose there is a
relation between and , where has degree n over . Then we can
determine d/d by implicitly dierentiating the relation with respect to
. Substituting d/d into (2.7), we conclude that m can be expressed in
terms of and . This implies that dm/d is a function of and .
A relation between and induced by (2.5) (i.e., when has degree n
over ) is known as a modular equation of degree n in the theory of signature
3. (We sometimes call these cubic modular equations.) Our discussion in
the previous paragraph indicates that our computations of dm/d depend
on the existence of such modular equations.
The rst few modular equations in the theory of signature 3 are given by
Ramanujan in his notebooks. One of these is the following modular equation
of degree 2:
()
1/3
+{(1 )(1 )}
1/3
= 1. (3.1)
Proofs of Ramanujans modular equations in the theory of signature 3 are
now available in [2] and [6].
226 HENG HUAT CHAN AND WEN-CHIN LIAW
Recently, we showed that [6] if p is a prime, then there is a relation
between x := ()
s/6
and y := {(1 )(1 )}
s/6
, when (p + 1)/3 = N/s
and gcd(N, s) = 1. Moreover, we proved that the degree of the polynomial
satised by x and y is N. This proves the existence of modular equations
of prime degrees and we conclude that when m is a multiplier of degree p,
dm
d
= F
p
(, ), (3.2)
where F
p
is a certain function in and . If we know the value of
p
, then
the value of a
p
follows by substituting = 1
p
and =
p
into (3.2) and
simplifying. Our simplication is done with the help of maple v.
When n is not a prime, except for modular equations of degrees 4 and 9, it
is dicult to derive a modular equation of degree n. However, deriving such
a modular equation is unnecessary. We illustrate our point with n = pq. If
has degree q over then from a cubic modular equation of degree q and
(2.7), we can write
m
q
=
K()
K()
= G
q
(, ), (3.3)
where m
q
is the multiplier of degree q and G
q
is a certain function of and
. Similarly, we can deduce that if has degree p over , then from a cubic
modular equation of degree p, we may write
m
p
=
K()
K()
= G
p
(, ), (3.4)
where m
p
is the multiplier of degree p and G
p
is a certain function of and
. It follows that has degree pq over and
m
pq
(, ) =
K()
K()
=
K()
K()
K()
K()
= m
q
(, ) m
p
(, ).
Hence, dierentiating with respect to and substituting =
1/(pq)
, we
have
dm
pq
d
(1
pq
,
pq
) = m
p
(
q/p
,
pq
)
dm
q
d
(1
pq
,
q/p
)
+ m
q
(1
pq
,
q/p
)
d
d
dm
p
d
(
q/p
,
pq
).
This allows us to compute a
pq
provided we know modular equations of de-
grees p and q and the singular moduli
pq
and
q/p
.
When n is a squarefree product of more than 2 primes, say n = p
1
p
2
p
l
,
then the above idea can be extended with the computation of a
n
reduced to
that of nding modular equations of degrees p
1
, p
2
, . . . , p
l1
, and p
l
, and
constants
n/(p
2
i
1
p
2
i
s
)
, where 1 s l 1 and 1 i
j
l.
NEW RAMANUJAN-TYPE SERIES FOR 1/ 227
4. Eulers convenient numbers, Kroneckers Limit Formula,
and cubic singular moduli.
An Euler convenient number is a number c satisfying the following criterion:
Let l > 1 be an odd number relatively prime to c which is properly
represented by x
2
+cy
2
. If the equation l = x
2
+cy
2
has only one
solution with x, y 0, then l is a prime number.
Euler was interested in these numbers because they helped him to generate
large primes. The above criterion, however, is not very useful for nding
these numbers.
Let d be squarefree, K = Q(
d), C
K
denote the class group of K and
C
2
K
be the subgroup of squares in C
K
. A genus group G
K
is dened as the
quotient group C
K
/C
2
K
. Gauss observed that G
K
C
K
if and only if d
is a convenient number. (Some convenient numbers are not squarefree but
Gauss criterion is also true for class groups of orders in K.) Using this new
criterion, Gauss determined 65 Euler convenient numbers [8], [7, p. 60]. We
reproduce here those cs (= 3) which are squarefree and divisible by 3.
h(4c) :=
C
Q(
4c)
_
_
d
1
N(p)
_
, if N(p) d
1
,
_
d
2
N(p)
_
, if N(p) | d
1
,
(4.1)
where N(p) is the norm of the ideal p and
_
_
denotes the Kronecker symbol.
If [a] is an ideal class in C
K
and a =
p
, then we dene
([a]) =
([p])
p
.
Theorem 4.1. Let be a genus character arising from the decomposition
D
K
= d
1,
d
2,
. Let h
i,
be the class number of the eld Q(
_
d
i,
), w
2,
be
the number of roots of unity in Q(
_
d
2,
), and
n=1
(1 e
2inz
), Imz > 0,
and
=
2
1
, Im > 0, where a = [
1
,
2
].
Then
2h
1,
h
2,
/w
2,
[a]C
K
F([a])
([a])
. (4.2)
Theorem 4.1 follows from Kroneckers First Limit Formula [11, p. 72,
Theorem 6]. In [9], K.G. Ramanathan applied Theorem 4.1 to compute
products of the form
t
n
=
1
5
5
_
_
(
1+
n/5
2
)
(
1+
5n
2
)
_
_
6
when 5n is a convenient number. These products are then used to deduce
special values of the Rogers-Ramanujan continued fraction. In the same
article, he dened [9, Eq. (51)]
n
=
1
3
3
_
(
_
n/3)
(
3n)
_
6
, (4.3)
NEW RAMANUJAN-TYPE SERIES FOR 1/ 229
and remarked that
n
can be evaluated when 3n is one of the convenient
numbers listed in Table 1 (15 is missing from his list). Ramanathans result
can be stated as follows:
Theorem 4.2 ([9, Theorem 4]). Let c be a convenient number listed in Ta-
ble 1 and let K = Q(
c/3
=
([t])=1
3e
,
where the exponents are given by
e
=
2wh
1,
h
2,
w
2,
h
,
with h being the class number of K and w the number of roots of unity in
K.
It turns out that Ramanathans
n
is related to
n
, namely [5, (2.7)],
1
n
=
2
n
+ 1. (4.4)
Hence, from Theorem 4.2, (4.1), and (4.4), we can determine
n
explicitly.
Using the same technique as given in the proof of Theorem 4.2, one can
compute
n/(p
2
i
1
p
2
i
s
)
, 1 s l 1 and 1 i
j
l, which will be needed in
the evaluations of a
n
.
We conclude this section with a list of singular moduli which will be
needed in the evaluations of a
n
, b
n
, and H
n
with n = c/3.
n Cubic singular moduli
2
2
=
1
2
2
4
5
5
=
1
2
11
5
50
Table 2. Cubic singular moduli for h(12n) = 2.
230 HENG HUAT CHAN AND WEN-CHIN LIAW
n Cubic singular moduli
7
7
=
1
2
13
7
12
3
10
10
=
1
2
35
2+2
5
108
,
5/2
=
1
2
35
22
5
108
11
11
=
1
2
45
35
44
11
14
14
=
1
2
99
6+2
14
500
,
7/2
=
1
2
99
62
14
500
19
19
=
1
2
301
5113
3
4500
26
26
=
1
2
6930
2+5
26
19652
,
13/2
=
1
2
6930
25
26
19652
31
31
=
1
2
35113
37
93
121500
34
34
=
1
2
17420
17+35
2
143748
,
17/2
=
1
2
17420
1735
2
143748
59
59
=
1
2
6367095
177265
59
169413308
Table 3. Cubic singular moduli for h(12n) = 4.
NEW RAMANUJAN-TYPE SERIES FOR 1/ 231
n Cubic singular moduli
35
35
=
1
2
+
549
50
5 +
63
10
15
83
20
35
243
100
105
7/5
=
1
2
+
549
50
5
63
10
15 +
83
20
35
243
100
105
55
55
=
1
2
+
92995
88434
3
8259
9826
5
13755
19652
11 +
25879
176868
165
11/5
=
1
2
92995
88434
3 +
8259
9826
5
13755
19652
11 +
25879
176868
165
70
70
=
1
2
+
43971
608350
5
10023
121670
14
38621
1095030
21
383383
10950300
30
35/2
=
1
2
43971
608350
5
10023
121670
14 +
38621
1095030
21
383383
10950300
30
7/10
=
1
2
+
43971
608350
5
10023
121670
14 +
38621
1095030
21 +
383383
10950300
30
91
91
=
1
2
53880905
23706108
3 +
10182535
11853054
21
8256325
11853054
39 +
5523815
23706108
273
13/7
=
1
2
+
53880905
23706108
3 +
10182535
11853054
21
8256325
11853054
39
5523815
23706108
273
110
110
=
1
2
1204702947
25128011089
5
7922677455
50256022178
995983605
50256022178
33 +
1027373417
100512044356
110
55/2
=
1
2
+
1204702947
25128011089
5
7922677455
50256022178
995983605
50256022178
33
1027373417
100512044356
110
11/10
=
1
2
+
1204702947
25128011089
5
7922677455
50256022178
6
+
995983605
50256022178
33 +
1027373417
100512044356
110
115
115
=
1
2
132769793
1177056540
3 +
14283759
163480075
5 +
1293747
32696015
23
218554427
5885282700
345
23/5
=
1
2
+
132769793
1177056540
3 +
14283759
163480075
5
1293747
32696015
23
218554427
5885282700
345
119
119
=
1
2
+
50276655
165246443
17
162387225
660985772
51 +
19683565
165246443
119
45457245
660985772
357
17/7
=
1
2
50276655
165246443
17
162387225
660985772
51 +
19683565
165246443
119 +
45457245
660985772
357
154
154
=
1
2
1255233
5404300
2 +
3988079
121596750
21
14120327
668782125
66
254553
14861825
77
77/2
=
1
2
1255233
5404300
2
3988079
121596750
21
14120327
668782125
66 +
254553
14861825
77
11/14
=
1
2
1255233
5404300
2 +
3988079
121596750
21 +
14120327
668782125
66 +
254553
14861825
77
Table 4. Cubic singular moduli for h(12n) = 8.
232 HENG HUAT CHAN AND WEN-CHIN LIAW
n Cubic singular moduli
455
455
=
1
2
52602592750172050462677
248717948742554175611950
5
5668214189343349857381
49743589748510835122390
15
5696597990275946071461
49743589748510835122390
35 +
538462633924678371678
24871794874255417561195
65
682503637304416627557
9948717949702167024478
105 +
25146509927196138320763
497435897485108351223900
195
+
109593923135795012632
4974358974851083512239
455 +
1196360473602901817979
99487179497021670244780
1365
91/5
=
1
2
52602592750172050462677
248717948742554175611950
5 +
5668214189343349857381
49743589748510835122390
15
+
5696597990275946071461
49743589748510835122390
35
538462633924678371678
24871794874255417561195
65
682503637304416627557
9948717949702167024478
105 +
25146509927196138320763
497435897485108351223900
195
+
109593923135795012632
4974358974851083512239
455
1196360473602901817979
99487179497021670244780
1365
65/7
=
1
2
+
52602592750172050462677
248717948742554175611950
5
5668214189343349857381
49743589748510835122390
15
+
5696597990275946071461
49743589748510835122390
35 +
538462633924678371678
24871794874255417561195
65
682503637304416627557
9948717949702167024478
105
25146509927196138320763
497435897485108351223900
195
+
109593923135795012632
4974358974851083512239
455
1196360473602901817979
99487179497021670244780
1365
35/13
=
1
2
+
52602592750172050462677
248717948742554175611950
5 +
5668214189343349857381
49743589748510835122390
15
5696597990275946071461
49743589748510835122390
35
538462633924678371678
24871794874255417561195
65
682503637304416627557
9948717949702167024478
105
25146509927196138320763
497435897485108351223900
195
+
109593923135795012632
4974358974851083512239
455 +
1196360473602901817979
99487179497021670244780
1365
Table 5. Cubic singular moduli for h(12n) = 16.
NEW RAMANUJAN-TYPE SERIES FOR 1/ 233
5. Values of a
n
, b
n
, and H
n
.
The values of H
n
follow immediately from the values of
n
by (1.7). From
(1.6), it appears that we need to denest the expression
1 H
n
in order to
determine b
n
. The next simple lemma shows that this is not necessary.
Lemma 5.1. Let
n
be dened as in (4.3). Then
b
n
=
2
3n
3
2
n
1
2
n
+ 1
.
Proof. From (4.4), we deduce that
1
1
n
=
1
2
n
+ 1. (5.1)
Hence, by (1.7), (4.4), and (5.1), we conclude that
4
H
n
=
2
n
+
1
2
n
+ 2.
Hence,
_
1 H
n
=
1
4
2
n
+
2
n
+ 2
(5.2)
=
n
1
n
n
+
1
n
.
Substituting (5.2) into (1.6) completes our proof of the lemma.
Finally, to compute a
n
, we use the method outlined in Section 3, together
with the singular moduli given in Section 4. Our nal results are shown in
the following tables, grouped once again according to class numbers.
n a
n
b
n
H
n
2
1
3
3
2
3
1
2
5
8
15
3
22
5
3
4
125
Table 6. a
n
, b
n
, and H
n
for h(12n) = 2.
234 HENG HUAT CHAN AND WEN-CHIN LIAW
n a
n
b
n
H
n
7
10
27
+
7
27
7
13
9
7
7
9
17
27
+
13
54
7
10
25
243
15
8
243
6
70
81
15 +
10
81
6
223
1458
35
729
10
11
6
11
13
99
3
45
11
5
33
3
194
1331
+
225
2662
3
14
21
125
7
82
1125
3
198
125
7 +
28
375
3
1819
31250
198
15625
21
19
1654
3375
133
3375
19
5719
1125
13
1125
19
8522
421875
+
3913
843750
19
26
1118
14739
39
3967
44217
3
4620
4913
39 +
130
14739
3
249913
48275138
34650
24137569
13
31
14662
91125
+
7843
91125
31
217
30375
+
35113
30375
31
684197
307546875
+
245791
615093750
31
34
7157
323433
51 +
62896
323433
6
70
107811
51 +
296140
107811
6
3555313
2582935938
304850
1291467969
34
59
342786
717853
593849
6460677
3
6367095
717853
265
2153559
3
1461224894
30403462846931
+
1687280175
60806925693862
3
Table 7. a
n
, b
n
, and H
n
for h(12n) = 4.
NEW RAMANUJAN-TYPE SERIES FOR 1/ 235
n a
n
, b
n
, and H
n
35 a
35
=
558
5
364
15
21 42
7 +
577
9
3
b
35
=
1701
5
+
581
3
3 126
7
366
5
21
H
35
=
1210352
125
279531
50
3 +
91494
25
7 +
264132
125
21
55 a
55
=
1411054
132651
+
14375
4913
15 +
26884
14739
33
194150
132651
55
b
55
=
1423345
44217
+
50435
4913
15 +
27530
4913
33
185990
44217
55
H
55
=
40461639767
651714363
+
2329268305
144825414
15 +
782606510
72412707
33
5473886320
651714363
55
70 a
70
=
57239
1642545
7 +
217912
1642545
10 +
18154
182505
15
5432
60835
42
b
70
=
766766
547515
7 +
540694
547515
10 +
93548
60835
15
29314
60835
42
H
70
=
263701974157
999242250750
3413048639
55513458375
6 +
8992317139
499621125375
70
1429629212
55513458375
105
91 a
91
=
513055226
17779581
+
197125250
17779581
7
140862644
17779581
13 +
52944437
17779581
91
b
91
=
502667165
5926527
+
214664450
5926527
7
142555490
5926527
13 +
53880905
5926527
91
H
91
=
3020198045742832
11707907427243
+
1141527555432550
11707907427243
7
838583339971300
11707907427243
13
+
633909424388075
23415814854486
91
110 a
110
=
51466456301
226152099801
3
1605347400
25128011089
10 +
2302296150
25128011089
55
+
2180745776
75384033267
66
b
110
=
113011075870
75384033267
3 +
21911639310
25128011089
10 +
31690709820
25128011089
55
+
8031352980
25128011089
66
H
110
=
328032510163806603637
1262833882577813931842
34968343286005152660
631416941288906965921
22
26922173637682728405
631416941288906965921
30 +
11479638881035691730
631416941288906965921
165
Table 8. a
n
, b
n
, and H
n
for h(12n) = 8.
236 HENG HUAT CHAN AND WEN-CHIN LIAW
n a
n
, b
n
, and H
n
115 a
115
=
2453452114
882792405
58294124
98088045
15
7317496
32696015
69 +
139937129
882792405
115
b
115
=
5026751821
294264135
39674908
32696015
15
19045012
32696015
69 +
132769793
294264135
115
H
115
=
195193666847694106
144318968578830375
+
5599864542570116
16035440953203375
15
+
2653754247048632
16035440953203375
69
37000600425371947
288637937157660750
115
119 a
119
=
103789302
9720379
547343732
87483411
3 +
43409625
9720379
7
72149336
29161137
21
b
119
=
318200715
9720379
551139820
29161137
3 +
162387225
9720379
7
67035540
9720379
21
H
119
=
49978710596750603
1606258054361897
+
28855221888962700
1606258054361897
37979008521886575
3212516108723794
7 +
10963595445145200
1606258054361897
21
154 a
154
=
965168
4053225
6
3910004
182395125
7
28870936
182395125
22 +
142457
4053225
231
b
154
=
2375828
1351075
6 +
112962616
60798375
7
55833106
60798375
22 +
836822
1351075
231
H
154
=
7319532242037247
27107244286031250
14157410807176
301191603178125
33 +
8770226416943
301191603178125
42
206104571568818
13553622143015625
154
Table 8 (continuation). a
n
, b
n
, and H
n
for h(12n) = 8.
NEW RAMANUJAN-TYPE SERIES FOR 1/ 237
a
455
=
35958686812804845816546
4974358974851083512239
199639241839509967088008
44769230773659751610151
91866311009633295364887
24871794874255417561195
7 +
12114289251501127493868
4974358974851083512239
13
21170489873453104001440
14923076924553250536717
21 +
19045288924435485549578
14923076924553250536717
39
+
3501400086019335742242
4974358974851083512239
91 +
36482832707135043514012
74615384622766252683585
273
b
455
=
108868803097864065436089
4974358974851083512239
199460940107146922990240
14923076924553250536717
326904629053549798169919
24871794874255417561195
7 +
47775254611309163928990
4974358974851083512239
13
9333352321361091775752
4974358974851083512239
21 +
26584123954621081666818
4974358974851083512239
39
+
11336428378686699714762
4974358974851083512239
91 +
35068395166781366975118
24871794874255417561195
273
H
455
=
25593277575291678024530931497850444197001585383
12372123605340761245091680055188536031396560500
726431859849607816583487985232610666030207597
618606180267038062254584002759426801569828025
3839347276534358839899743258373310667699209791
4948849442136304498036672022075414412558624200
7
+
351649374516601338100434337317872458470333402
618606180267038062254584002759426801569828025
13
56196149792473665089405401522944398076147222
123721236053407612450916800551885360313965605
21
+
1032402621896013253168215818718780641551536294
3093030901335190311272920013797134007849140125
39
+
53580377184022523304261118862331927596113329
247442472106815224901833601103770720627931210
91
+
152525837164039389925036504420966961190929163
1237212360534076124509168005518853603139656050
273
Table 9. a
n
, b
n
, and H
n
for h(12n) = 16.
238 HENG HUAT CHAN AND WEN-CHIN LIAW
Concluding remarks.
The common feature of all our series computed here is that they involve
only simple quadratic numbers. The series corresponding to n = 455 gives
us approximately 33 additional digits per term and it is the fastest conver-
gent series belonging to the theory of q
2
known so far. It might also be the
fastest convergent series for 1/ which involves only real quadratic num-
bers. One should compare this with the spectacular series discovered by the
Borweins [4] which gives 25 digits per term using only real quadratics.
The fastest convergent series known so far is that given by the Borweins [4]
which gives roughly 50 additional digits per term.
Acknowledgment. The authors wish to thank W. Galway for helping
us in nding some of the a
n
s. We also thank the referee for comments to
improve the manuscript.
References
[1] B.C. Berndt, Ramanujans Notebooks, Part V, Springer-Verlag, New York, 1998.
[2] B.C. Berndt, S. Bhargava and F.G. Garvan, Ramanujans theories of elliptic functions
to alternative bases, Trans. Amer. Math. Soc., 347 (1995), 4163-4244.
[3] J.M. Borwein and P.B. Borwein, Pi and the AGM, Wiley, New York, 1987.
[4] , Class number three Ramanujan type series for 1/, J. Comput. Appl. Math.,
46 (1993), 281-290.
[5] H.H. Chan, On Ramanujans cubic transformation formula for
2
F
1
(1/3, 2/3; 1; z),
Math. Proc. Cambridge Philos. Soc., 124 (1998), 193-204.
[6] H.H. Chan and W.-C. Liaw, On Russell-type modular equations, Canad. J. Math., to
appear.
[7] D.A. Cox, Primes of the Form x
2
+ ny
2
, Wiley, New York, 1989.
[8] C.F. Gauss, Disquisitiones Arithmeticae, Springer-Verlag, New York, 1986.
[9] K.G. Ramanathan, Some applications of Kroneckers limit formula, J. Indian Math.
Soc., 52 (1987), 71-89.
[10] S. Ramanujan, Modular equations and approximations to , Quart. J. Math. (Oxford),
45 (1914), 350-372.
[11] C.L. Siegel, Advanced Analytic Number Theory, Tata Institute of Fundamental Re-
search, Bombay, 1980.
Received June 16, 1998 and revised July 16, 1998.
National University of Singapore
Kent Ridge, Singapore 119260
Republic of Singapore
E-mail address: chanhh@math.nus.sg
University of Illinois
Urbana, Illinois 61801
E-mail address: liaw@math.uiuc.edu
PACIFIC JOURNAL OF MATHEMATICS
Vol. 192, No. 2, 2000
ON THE STABILITY OF CANONICAL FORMS OF
SINGULAR LINEAR DIFFERENCE SYSTEMS
Guoting Chen and Abdelmajid Fahim
Using the formal reduction by a method of deformation of
orbits under the adjoint representation of GL(n, C), we have
proved the existence and uniqueness (up to equivalence under
GL(n, C)) of a formal canonical form of systems of singular lin-
ear dierence equations. In this paper we study the stability
of the irregular part of the canonical form under perturbation
of the matrix coecients.
1. Introduction and Notations.
The formal reduction of singular linear dierence systems or of dierence
equations is studied in many ways: Formal classication, canonical forms
or formal solutions (see [9], [4], [8], [2], [3]). One of the approaches is the
reduction to canonical forms given in [3] by using the method of Babbitt and
Varadarajan [1] for singular dierential systems. We study in this paper the
stability of the canonical forms of singular linear dierence systems. Similar
results for singular dierential systems can be found in [1] or [7].
We shall use the following notations.
Let K = C((1/x)) be the eld of formal power series with coecients in C.
is the C-automorphism of K dened by (x) = x +1. For q N
, x
1/q
is
a xed root of y
q
= x, O
q
= C[[x
1/q
]], K
q
= C((x
1/q
)) and K =
qN
K
q
is the eld of formal Puisieux power series over C. can be extended to
K by (x
1/q
) = x
1/q
(1 + x
1
)
1/q
. Let a K
q
be nonzero, then it can be
written in the form
a = a(x) = x
k/q
+
j=0
a
j
x
j/q
, a
0
= 0
where k is an integer. We write ord(a) for k/q, (ord(0) = +). For A
gl(n, K
q
), A = 0, we dene
ord(A) = max
_
r
q
| r Z, A x
r/q
gl(n, O
q
)
_
and ord(0) = +.
239
240 GUOTING CHEN AND ABDELMAJID FAHIM
We consider systems of linear dierence equations of the following type
(u) = Au (1)
where A GL(n, K
q
), q N
j=0
A
r+j
x
(r+j)/q
GL(n, K
q
) (2)
where r Z, A
r+j
gl(n, C) and A
r
= 0.
Recall ([3]) that a matrix A or its associate system is said of level 0 if
A = I +
m=q
A
m
x
m/q
;
of level 1 if
A = I +
m=0
A
r+m
x
(r+m)/q
, r N
, 1 r < q, A
r
= 0, (3)
where I denotes the n n identity matrix.
Let T GL(n, K
q
). The change u = Tu transforms the system (1) to
( u) =
A u where
A = T[A]
def
= (T)AT
1
.
We shall say that the matrices A,
A (or the corresponding dierence systems)
are equivalent (under GL(n, K
q
)).
We recall (cf. [3]) the denition of a canonical form for a matrix or its
associate linear dierence system.
Denition 1.1. Let p N
i=1
B
i
x
i
with
r Z,
i
1
p
N,
1
<
2
< <
s
,
B
i
GL(n
(i)
, O
p
), n
(i)
N
,
s
i=1
n
(i)
= n, B
i
=
t
i
=1
(i)
_
B
(i)
+
C
(i)
x
_
where
B
(i)
= I
(i)
+
D
(i)
,1
x
r
(i)
,1
+ +
D
(i)
,j
(i)
x
r
(i)
,j
(i)
and
(i)
,
(i)
=
(i)
for = ,
STABILITY OF CANONICAL FORMS 241
I
(i)
is the n
(i)
n
(i)
identity matrix, n
(i)
,
t
i
=1
n
(i)
= n
(i)
,
r
(i)
,j
1
p
N
, r
(i)
,1
< r
(i)
,2
< < r
(i)
,j
(i)
< 1, D
(i)
,j
gl
_
n
(i)
, C
__
1 j
j
(i)
_
are nonzero diagonal matrices,
C
(i)
gl
_
n
(i)
, C
_
commutes with the D
(i)
,j
for 1 j j
(i)
.
We make the convention that for j
(i)
= 0, B
(i)
= I
(i)
.
We will call
1
x
r/p
s
i=1
t
i
=1
(i)
B
(i)
i
the irregular part of the canonical
form. The aim of this paper is to study the dependency of the irregular part
in the canonical form of a singular linear dierence system on the matrix
coecients A
r+j
.
In [3] we have proved that for any matrix A GL(n, K
q
) there exist some
p qN
and T GL(n, K
p
) such that T[A] GL(n, K
p
) is in a canonical
form and its irregular part is unique up to equivalence in GL(n, C). It is
based on the formal reduction using the method of Babbitt and Varadarajan
[1], i.e., the method of deformation of orbits under the adjoint representation
of GL(n, C) in the nilpotent case of the leading matrix.
Recall that a canonical form for a matrix (or the associate dierence
system) of level 1 is in the form:
I +
D
1
x
r
1
+ +
D
k
x
r
k
+
C
x
where the D
j
(1 j k) are nonzero diagonal matrices, 0 < r
1
< <
r
k
are rational numbers and the matrix C commutes with the matrices
D
j
(1 j k). According to the convention of Denition 1.1, for k = 0 the
canonical form is reduced to I + Cx
1
. The canonical form of level 1 is
similar as in the dierential case (see [1]). But for general dierence systems
the canonical form is more complicated.
We study in this paper the stability of the irregular part of the canonical
form of a matrix or its associate linear dierence system under perturbation
of the matrix coecients. A perturbed system of (1) is
(u) =
_
A+
P
x
(r+N)/q
_
u (4)
with N N
and P gl(n, O
q
), i.e., ord(P) 0.
Note that in [2] the rst author of this paper has studied similar prob-
lems for formal solutions. More precisely it is proved that the irregular part
in a fundamental matrix of formal solutions of dierence systems associ-
ated to a matrix of level 1 (resp. of general systems) depends only on
A
r
, A
r+1
, . . . , A
r+n(qr)1
(resp. A
r
, A
r+1
, . . . , A
r++nq1
) where denotes
the integer such that
q
= ord(det x
r/q
A).
242 GUOTING CHEN AND ABDELMAJID FAHIM
We shall prove, by using the method of [1], similar results on canonical
forms for these two dierence systems. More precisely, we will prove in
Section 3 that for systems of level 1, if N n(q r), the two systems (1)
and (4) have the same irregular part in their canonical forms. This result is
similar to the dierential case [1]. In the general case the situation is more
complicated and is considered in Section 4. Basing on the method of [1] for
dierential systems, we use also frequently the formal reduction procedure
of linear dierence systems presented in [3]. We state some of the results of
[1] and [3] in Section 2 for the use in the sequel.
2. Preliminaries.
We present now some preliminary results which will be used in the next
sections (see also [1] and [3]).
For T GL(n, K
q
) we dene the lag (see also [1]) of T as
q
(T) = min
_
m
q
| m N, A 0(mod x
m/q
) =TAT
1
gl(n, O
q
)
_
.
It is clear that if
q
(T) m/q then
A B(mod x
m
/q
) =T[A] T[B](mod x
(m
m)/q
). (5)
Therefore if one controls the lag of a transformation matrix T, then one
controls the rst terms in the transformed system T[A].
One has immediately the following properties (see also [1], p. 10-11):
(i) If q
q
(T). We will write (T) for
q
(T) in the sequel.
(ii) (T) = 0 for T GL(n, O
q
) Z
q
where Z
q
is the group of elements of
the form x
k/q
1 for k Z.
(iii) (T
1
T
2
) (T
1
) +(T
2
) for T
1
, T
2
GL(n, K
q
).
(iv) (QT
Q) = (T), Q,
Q GL(n, O
q
) Z
q
and T GL(n, K
q
).
(v) If T = x
H
for some semi-simple matrix H in gl(n, C) with eigenvalues
i
1
q
Z (i = 1, . . . , n) then
(T) = max
1i,jn
{|
i
j
|}.
(vi) (T) = (T
1
) for T GL(n, K
q
).
Let O
qN
O
q
. We dene
}
where F may represent K, K
q
, O
q
etc. If H is semi-simple in gl(n, C) with
eigenvalues in Q, it is immediate that
x
H
GL(n, K) tr(H) = 0.
We then have (cf. [1], Proposition 1.2).
STABILITY OF CANONICAL FORMS 243
(vii) Let T =
m
i=1
T
i
where T
i
GL(n
i
, K
q
) and n =
m
i=1
n
i
. Then
T
GL(n, K
q
) and
(T) (T
1
) + +(T
m
).
Let G = gl(n, C). For M G, G
M
and [G, M] denote respectively the
kernel and the image of the adjoint homomorphism ad(M). d(M) is the
dimension of the GL(n, C)-orbit of M with respect to the adjoint represen-
tation of G.
Proposition 2.1 ([6], [1]). Let Y be a nonzero nilpotent in G; then we can
nd H, X sl(n, C) such that H is semi-simple, X is nilpotent and
[H, X] = 2X, [H, Y ] = 2Y, [X, Y ] = H.
(Y, H, X) is called a standard triple.
Proposition 2.2 ([1]). Let Y be a nonzero nilpotent and (Y, H, X) a stan-
dard triple. Let Z G
X
, Z = 0. Suppose that Y + Z is nilpotent. Then
d(Y +Z) > d(Y ).
For a standard triple (Y, H, X), we have G = G
X
[G, Y ]. Moreover there
exists a basis {Z
1
, . . . , Z
} of G
X
such that Z
1
= I, Z
j
sl(n, C) for j 2
(see [1], p. 15) and
[H, Z
j
] =
j
Z
j
,
j
N for 1 j .
In particular
1
= 0. Dene = max
1j
_
j
2
+ 1
_
, then 1 n.
{Z
1
, . . . , Z
, Z
+1
, . . . , Z
n
2} of G
with the following properties:
For all j > , [H, Z
j
] =
j
Z
j
,
j
Z, |
j
| max
1i
i
.
If M gl(n, C) is such that [H, M] = cM for some c Z then
x
H
Mx
H
= x
c
M, for Q.
In particular
x
H
Y x
H
= x
2
Y ; x
H
Z
j
x
H
= x
Z
j
. (6)
One has for Q
(x
H
) || max
1jn
2
{|
j
|} 2( 1)||. (7)
We need also the following lemmas.
Lemma 2.1 ([3]). Let a matrix A GL(n, K
q
) be in one of the following
forms,
(I) I +
j=0
A
r+j
x
(r+j)/q
, 1 r < q.
244 GUOTING CHEN AND ABDELMAJID FAHIM
or
(II) x
r/q
j=0
A
r+j
x
j/q
, r Z;
where A
r+j
G, A
r
= 0. Let L G be a linear subspace such that G =
L + [G, A
r
]. Then there exist sequences (T
j
)
j1
in G, (A
r+j
)
j1
in L,
T =
1
j=
_
I +
T
j
x
j/q
_
= lim
J
1
j=J
_
I +
T
j
x
j/q
_
such that A
r
= A
r
and
T[A] = I +
j=0
A
r+j
x
(r+j)/q
, in the case (I),
or
T[A] = x
r/q
j=0
A
r+j
x
j/q
, in the case (II).
Moreover A
r+j
only depends on A
r
, A
r+1
, . . . , A
r+j
.
Corollary 2.1 ([3]; Splitting lemma). Let notations be as in the above lem-
ma. Let be the set of eigenvalues of A
r
, P
r+j
commutes with P
for j 1;
moreover T[A] =
where
A
= I +
j=0
P
r+j
x
(r+j)/q
in the case (I),
A
= x
r/q
j=0
P
r+j
x
j/q
in the case (II).
Corollary 2.2 ([3]). Let notations be as in the above lemma. Assume that
A
r
is nilpotent and (A
r
, H, X) a standard triple. Let L = G
X
and let m 2
be an integer. There exists
T =
_
I +
T
m1
x
(m1)/q
_
_
I +
T
1
x
1/q
_
GL(n, K
q
)
such that
T[A] = I +
j=0
A
r+j
x
(r+j)/q
, in the case (I)
T[A] = x
r/q
j=0
A
r+j
x
j/q
, in the case (II)
STABILITY OF CANONICAL FORMS 245
with A
r
= A
r
, A
r+j
G
X
for 1 j < m. Furthermore for j N
, A
r+j
only depends on A
r
, . . . , A
r+j
.
Lemma 2.2 ([3]). Let a matrix B GL(n, K
p
) be in the form
B = I +
D
1
x
1/p
+ +
D
p1
x
(p1)/p
+
C
x
+R
B
where the D
j
gl(n, C) are diagonal matrices, C gl(n, C), ord(R
B
) > 1.
Then B is equivalent to a canonical matrix of the form I +
D
1
x
1/p
+ +
D
p1
x
(p1)/p
+
C
x
for some C
gl(n, C).
3. Dierence Systems of Level 1.
We consider at rst, as in [3], dierence systems of level 1, i.e., systems
(1) with matrix A in the special form (3):
A = I +
j=0
A
r+j
x
(r+j)/q
GL(n, K
q
)
with A
r
= 0, 1 r < q.
We will prove, using the method of [1], that the irregular part of a canon-
ical form of dierence systems of level 1 is determined by the matri-
ces A
r
, A
r+1
, . . . , A
r+n(qr)1
. Similar result for formal solutions has been
proved in [2] by a dierent method.
Recall (cf. [3]) that a canonical form for matrices of level 1 is in the
form:
A
cano
= I +
D
1
x
r
1
+ +
D
k
x
r
k
+
C
A
x
GL(n, K
p
) (8)
for some p N
j=0
A
r+j
x
(r+j)/q
GL(n, K
q
) with
A
r
= 0 be of level 1, i.e., 1 r < q. Then we can nd a matrix
U
GL(n, K
p
) for some p qN
j=0
A
s+j
x
(s+j)/p
GL(n, K
p
) where either U[A] is of
level 0 in which case s = p or
A
s
is not nilpotent.
246 GUOTING CHEN AND ABDELMAJID FAHIM
(2) (U) (n 1)(
s
p
r
q
).
Proof. If A
r
is not nilpotent then the proposition is true with s = r, p = q
and U = I, (U) = 0.
If A
r
is nilpotent we prove it by downward induction on d(A), the dimen-
sion of the GL(n, C)-orbit of A
r
.
Let (Y, H, X) be a standard triple with Y = A
r
. We apply at rst the
Corollary 2.2 (for the case (I)) with m = (q r). Recall that
T =
_
I +
T
(qr)1
x
(r+(qr)1)/q
_
_
I +
T
1
x
r/q
_
and
A
= T[A] = I +
A
r
x
r/q
+
A
r+1
x
(r+1)/q
+
where A
r+j
G
X
for 1 j < (q r). Then T
GL(n, K
q
) and (T) = 0
according to the property (ii) of Section 2.
We can write
A
r+j
=
i=1
a
r+j,i
Z
i
, 1 j < (q r),
A
r+j
=
n
2
i=1
a
r+j,i
Z
i
, j (q r).
Dene
E =
_
j
i
2
+ 1
1 j < (q r), 1 i , a
r+j,i
= 0
_
.
Let
=
_
inf E if E =
otherwise.
Dene = min{q r, } and S = x
H/(2q)
. It is clear that > 0, > 0.
According to (7), (S) ( 1)/q (n 1)/q. Since tr(H) = 0,
S
GL(n, K
p
). According to (6) we have,
S[Y ] =
_
1 +x
1
_
H/(2q)
x
/q
Y,
S[Z
j
] =
_
1 +x
1
_
H/(2q)
x
j
/(2q)
Z
j
, 1 j n
2
.
STABILITY OF CANONICAL FORMS 247
Therefore
A
def
= S[A
] =
_
1 +x
1
_
H/(2q)
_
I +x
(r+)/q
_
Y
+
1j<(qr)
1i
a
r+j,i
Z
i
x
1
q
[j(
i
2
+1)]
+
j(qr)
1in
2
a
r+j,i
Z
i
x
1
q
[j(
i
2
+1)]
__
.
If = q r then s = p = q and (S) (n 1)
_
s
p
r
q
_
. A
is of level 0.
If 0 < < q r, write = r
/q
. Then
A
= S[A
] = ST[A] = I +Y
x
r/ q
+ GL(n, O
q
)
where r = 2(q
r +r
), q = 2q
q and Y
= Y +
(j,i)
a
r+j,i
Z
i
with
=
_
(j, i)
1 j < (q r), 1 i , a
r+j,i
= 0, =
j
i
2
+ 1
_
.
Moreover Y
= Y + Z = Y with Z G
X
. We have (S) (n 1)
q
=
(n 1)
_
r
q
r
q
_
. There are two distinct cases.
(a) If Y
is not nilpotent (this case occurs when d(Y ) has the maximal
dimension, i.e., when Y is a principal nilpotent) then take p = q, s = r,
and
(S) (n 1)
_
s
p
r
q
_
.
Hence U = ST
GL(n, K
p
) has the claimed properties.
(b) If Y
GL(n, K
p
) for some p qN
) (n1)
_
s
p
r
q
_
and S
[A
ST
then U
GL(n, K
p
) and
(U) (S
) +(S) (n 1)
_
s
p
r
q
_
+ (n 1)
_
r
q
r
q
_
= (n 1)
_
s
p
r
q
_
.
The next proposition proves that for a system of level 1 one can obtain
the irregular part of a canonical form with a transformation matrix whose
lag is not greater than the number (n 1)
_
1
r
q
_
.
248 GUOTING CHEN AND ABDELMAJID FAHIM
Proposition 3.2. Let A be as in the above proposition. Then we can nd
a matrix U
GL(n, K
p
) for some p qN
, such that
(1) there exists a canonical form
A
cano
= I +
D
1
x
r
1
+ +
D
k
x
r
k
+
C
A
x
GL(n, K
p
)
such that U[A] = A
cano
+R
A
GL(n, K
p
) with ord(R
A
) > 1.
(2) (U) (n 1)(1
r
q
).
Remark. With the convention of Denition 1.1, if k > 0 then D
j
(1 j
k) are nonzero diagonal matrices and for k = 0, A
cano
= I + C
A
x
1
is of
level 0.
Proof. We prove the proposition by induction on n. For n = 1 one can take
U = 1. Suppose n > 1. We assume the assertion in dimension < n.
Assume at rst that A
r
has at least two distinct eigenvalues. By applying
the Corollary 2.1 we obtain a matrix T =
1
j=
(I +T
j
x
j/q
). Take
T
A
=
1
j=n(qr)1
_
I +
T
j
x
j/q
_
and A
= T
A
[A].
Let A
= A
A
(r)
with n
= dim
_
A
(r)
_
then A
. By induction
we can nd matrices U
GL(n
, K
p
) verifying the condition (1) and
(U
) (n
1)
_
1
r
q
_
. We now use the property (vii) (cf. Section 2) to
conclude that if U
, then U
GL(n, K
p
) and
(U
(U
(n
1)
_
1
r
q
_
(n 2)
_
1
r
q
_
and U
[A
[E]) > 1 by
using (5):
ord(U
[E])
r +n(q r)
q
(n 2)(q r)
q
> 1.
Then U = U
T
A
has the claimed properties.
We now consider the case where A
r
has a unique eigenvalue, A
r
= I +Y
where Y is nilpotent. We proceed by induction on the number k = k(A)
in the canonical form. If this number is 0 then by the above proposition
one can nd a matrix T
GL(n, K
p
) for some p qN
qN
.
Then
A
= U
1
[A] = I +A
s
x
s/p
+
and (U
1
) (n 1)
_
s
p
r
q
_
. Either A
s
is not nilpotent which we consider in the
following.
If A
s
has at least two distinct eigenvalues, the earlier result allows us
to nd a matrix U
GL(n, K
p
) for some p p
such that (U
)
(n 1)
_
1
s
p
_
and U
[A
U
1
, then one
has immediately the second assertion:
(U) (n 1)
_
1
s
p
_
+ (n 1)
_
s
p
r
q
_
= (n 1)
_
1
r
q
_
.
If A
s
has a single eigenvalue
= (1 +
x
s/p
)A
where k(A
) < k(A
= I +A
s
x
s
/p
+ (with s
s)
gives a matrix U
) (n 1)
_
1
s
_
(n 1)
_
1
s
p
_
.
As before we take U = U
U
1
and note that (U) (n 1)
_
1
r
q
_
. The
proof is thus complete.
Let A GL(n, K
q
) be a matrix of level 1 as in the above propositions.
We denote by (A, m) the set of matrices B GL(n, K
q
) of the same form
as A with B
r+j
= A
r+j
for all 0 j < m, i.e., B A(mod x
(r+m)/q
).
Corollary 3.1. Let notations be as in the proposition, m = n(q r). If
B (A, m) then
U[B] = I +
D
1
x
r
1
+ +
D
k
x
r
k
+
C
B
x
+R
B
GL(n, K
p
)
where ord(R
B
) > 1. If further B A(mod x
(r+m
)/q
) for some m
> m
then C
B
= C
A
.
250 GUOTING CHEN AND ABDELMAJID FAHIM
Proof. (U) (n 1)
_
1
r
q
_
. From B A(mod x
(r+m)/q
) and (5) we
have
U[A] U[B]
_
mod x
h
r+m
q
(n1)
1
r
q
i_
and
r+m
q
(n 1)
_
1
r
q
_
= 1, proving the rst assertion.
If B A
_
mod x
r+m
q
_
then U[A] U[B]
_
mod x
[1+
m
m
q
]
_
, proving
the second statement.
According to Lemma 2.2, the canonical form of A
cano
+ R
A
is I +
D
1
x
r
1
+
+
D
k
x
r
k
+
C
x
where only the matrix C
is in quasi-canonical form if B
=
1
x
r/p
s
i=1
B
i
x
i
with
B
i
=
t
i
=1
(i)
_
B
(i)
+
C
(i)
x
+R
(i)
_
where ord(R
(i)
) > 1.
Remark. A quasi-canonical form of a matrix of level 1 is simply a
matrix of the form A
cano
+R
A
where A
cano
is a canonical matrix of the form
(8) and ord(R
A
) > 1. It is clear that the matrices B and B
j=0
A
r+j
x
(r+j)/q
GL(n, K
q
) with r Z, A
r
= 0.
Let be the integer such that
q
= ord(det x
r/q
A). Then we can nd a matrix
U
GL(n, K
p
) for some p qN
so that
(1)
A = U[A] =
A
s
x
s/p
+ GL(n, K
p
), where
A
s
is not nilpotent.
(2) (U)
_
1
1
n
_
_
q
p
_
where
p
= ord(det x
s/p
A).
Proof. If A
r
is not nilpotent then s = r, p = q and U = I, (U) = 0.
If A
r
is nilpotent we prove it by induction on d(A
r
), the dimension of
the GL(n, C)-orbit of A
r
. Let Y = A
r
and (Y, H, X) a standard triple. We
apply at rst Corollary 2.2 (for the case II) with m = + 1. We have
T =
1
j=
_
I +
T
j
x
j/q
_
GL(n, K
q
)
and (T) = 0 according to the property (ii) of the Section 2. Let A
= T[A].
Then
A
= x
r/q
_
Y +
A
r+1
x
1/q
+ +
A
r+
x
()/q
+
_
with A
r+j
G
X
for j = 1, . . . , . Furthermore for j N
, A
r+j
only
depends on A
r
, . . . , A
r+j
.
Write
A
r+j
=
i=1
a
r+j,i
Z
i
, 1 j ,
A
r+j
=
n
2
i=1
a
r+j,i
Z
i
, j > .
Dene
E =
_
j
i
2
+ 1
1 j , 1 i , a
r+j,i
= 0
_
.
We claim that E = and inf E since det(
j=0
A
r+j
x
j/q
) = 0. Let
= inf E > 0 and S = x
H/(2q)
. By (7), (S) ( 1)/q. According to
(6),
S[Y ] = (1 +x
1
)
H/(2q)
x
/q
Y,
S[Z
i
] = (1 +x
1
)
H/(2q)
x
i
/(2q)
Z
i
, 1 i n
2
.
252 GUOTING CHEN AND ABDELMAJID FAHIM
Hence
S[A
] = (1 +x
1
)
H/(2q)
x
(r+)/q
_
_
Y +
1j
1i
a
r+j,i
Z
i
x
1
q
h
j
i
2
+1
i
+
j>
1in
2
a
r+j,i
Z
i
x
1
q
h
j
i
2
+1
i
_
_
.
Write =
r
with r
, q
i
2
+ 1
_
> 0. Then S[A
] GL(n, O
2qq
). More precisely,
with r
= 2(q
r +r
), q
= 2q
q,
A
= S[A
] = x
r
/q
_
Y
+O(x
1/q
)
_
where
Y
= Y +
(j,i)
a
r+j,i
Z
i
= Y.
The summation is over the (nonempty) set
=
_
(j, i)
1 j , 1 i , a
r+j,i
= 0, =
j
i
2
+ 1
_
.
Let
= ord(det x
r
/q
GL(n, K
p
) and we have also
(S) ( 1)
q
(n 1)
_
r
r
q
_
=
_
1
1
n
__
_
.
We distinguish two cases.
(a) Y
is not nilpotent (we have this case if d(Y ) is of the maximal di-
mension, i.e., if Y is a principal nilpotent). We take s = r
, p = q
,
U = ST. Then U
GL(n, K
p
).
A = U[A] = A
q
p
_
.
(b) Y
GL(n, K
p
) for some p q
such that (U
1
)
_
1
1
n
_
_
p
_
and
A = U
1
[A
] =
Y x
s/p
+ GL(n, K
p
)
STABILITY OF CANONICAL FORMS 253
with
Y non nilpotent. Let U = U
1
ST. Then U
GL(n, K
p
). U[A]
has the property (1) and (U) (U
1
) +(S)
_
1
1
n
_
_
q
p
_
.
The next proposition shows that one can obtain the irregular part of a
canonical form by a transformation matrix with a lag not exceeding a certain
number that depends only on n, q and .
Proposition 4.2. Let A and be as in the above proposition and m an
integer . We can nd a matrix T
GL(n, K
p
) for some p qN
such
that
(1) T[A] = A
qcano
+R
A
where A
qcano
is a quasi-canonical matrix as in
the Denition 4.1 and ord(R
A
) >
r+m
q
+ 1.
(2) (T) n 1 +
q
.
Proof. If r = 0 one considers x
r/q
A in the place of A. Then we can assume
that r = 0. We prove the theorem by induction on n. It is trivial for n = 1.
Suppose n > 1. We assume the assertion in dimension < n.
Assume that the leading matrix A
0
has at least two distinct eigenval-
ues. Then according to Corollary 2.1, there exists
T =
1
j=
_
I +
T
j
x
j/q
_
GL(n, O
q
) such that
T[A] =
j=0
A
j
x
j/q
where A
0
= A
0
and A
j
commutes with A
0
for all j 1. Take N = max{+
1, m+ +nq} and
T
A
=
1
j=N1
_
I +
T
j
x
j/q
_
GL(n, K
q
).
Then A
= T
A
[A] = A
+R where A
N1
j=0
A
j
x
j/q
and ord(R) N/q.
A
A
(0)
with n
= dim
_
A
(0)
_
then A
. Let
q
= ord(det x
r
/q
A
) 0
with r
/q = ord(A
) 0. We can nd matrices U
GL(n
, K
p
) such that
U
[A
] = A
()
qcano
+ R
with A
()
qcano
in quasi-canonical form of dimension
n
, ord(R
) >
r
+m
q
+ 1
m
q
+ 1 and
(U
) n
1 +
q
.
254 GUOTING CHEN AND ABDELMAJID FAHIM
Since
, n =
, then U
GL(n, K
p
) and
(U
(U
) n 2 +
q
.
And U
[A
[R])
N
q
(n 2)
q
>
m
q
+ 1.
Then T = U
T
A
has the claimed properties.
We now consider the case where A
0
has a unique eigenvalue, A
0
= I +Y
with Y nilpotent. Then either = 0 or not.
Case 1. = 0, then A
0
is nilpotent. According to Proposition 4.1, one can
nd a matrix U
GL(n, K
p
), p qN
with
(U)
_
1
1
n
__
q
p
_
q
p
such that
A = U[A] =
A
s
x
s/ p
+ GL(n, K
p
)
where
A
s
is not nilpotent and
p
= ord(det x
s/ p
A). Two cases may occur:
(a)
A
s
has at least two distinct eigenvalues.
According to the above result one can nd a matrix T
GL(n, K
p
) with
p pN such that (T
) n 1 +
p
and T
[
A] is in the desired form (1).
Let T = T
U then T
GL(n, K
p
) and
(T) (T
) +(U) n 1 +
q
.
(b)
A
s
= wI +Y has only one nonzero eigenvalue w so that = 0 and Y
is nilpotent.
If Y = 0 then
A = x
s/ p
wA
where A
= I +
j=0
A
+j
x
(r
+j)/p
GL(n, O
p
) is a matrix of level 1 with r
2 and p
= 2 p.
If Y = 0 then let (Y, H, X) be a standard triple. Let p
= 2n p and
S = x
H/(2n p)
. Then S
GL(n, K
p
) and (S)
n1
n p
. Write
A =
x
s/ p
(wI +B) with B = Y +
j=1
A
s+j
x
j/ p
.
Write
A
s+j
=
n
2
k=1
a
s+j,k
Z
k
. According to (6), one has
S[wI] = (1 +x
1
)
H/(2n p)
wI, S[Y ] = (1 +x
1
)
H/(2n p)
x
1/(n p)
Y
and
S[Z
k
] = (1 +x
1
)
H/(2n p)
x
k
/(2n p)
Z
k
for 1 k n
2
.
STABILITY OF CANONICAL FORMS 255
Hence
S[B] = (1 +x
1
)
H/(2n p)
x
1/n p
_
_
Y +
j1
n
2
k=1
a
s+j,k
Z
k
x
1
p
h
j
j
2
+1
1
n
i
_
_
.
For all j 1, since
j
2
+1 n, j
_
j
2
+ 1
_
1
n
0. Hence ord(S[B])
1
n p
=
2
p
where
A
= I +
j=0
A
+j
x
(r
+j)/p
GL(n, O
p
) with r
2
is a matrix of level 1.
If r
< p
to obtain an integer
p p
and a matrix U
GL(n, K
p
) such that (U
) (n 1)
_
1
r
_
and
U
[A
] = A
cano
+R
A
= A
qcano
GL(n, K
p
)
where A
cano
is a canonical matrix of the form (8) of level 1 and ord(R
A
) >
1. Let T = U
) +(S) +(U)
(n 1)
_
1
r
+
2
p
_
+
q
n 1 +
q
.
Case 2. If = 0 then = 0 and the treatment is the same as in the case
(b).
Theorem 4.1. Let notations be as above. Take m = and N = 2 + nq.
Let B be a matrix in the same form as A such that B
r+j
= A
r+j
for all
0 j < N. Let T be as in the above proposition. Then T[B] = B
qcano
+R
B
where B
qcano
is a quasi-canonical matrix and ord(R
B
)
r+
q
+1. The two
matrices A and B have the same irregular part in their canonical forms.
Proof. Since
q
=
s
i=1
n
(i)
i
and
i
0, for all i {1, . . . , s}, one has
i
q
, where n
(i)
and
i
are as in A
qcano
(see Denition 4.1 and Denition
1.1).
From (T) n 1 +
q
, B A(mod x
(r+N)/q
) and (5) we obtain
T[A] T[B]
_
mod x
h
r+N
q
(n1)
q
i_
.
Since
r+N
q
(n1)
q
=
r+
q
+1 and T[A] = A
qcano
+R
A
with ord(R
A
) >
r+
q
+1 the rst assertion follows since
i
q
for all 1 i s. The second
one follows from Lemma 2.2.
256 GUOTING CHEN AND ABDELMAJID FAHIM
References
[1] D.G. Babbitt and V.S. Varadarajan, Formal reduction theory of meromorphic dier-
ential equations: A group theoretic view, Pacic J. Math., 109 (1983), 1-80.
[2] G. Chen, Forme normale dArnold et reduction formelle de syst`emes dequations
lineaires aux dierences, Aequationes Math., 54 (1997), 264-288.
[3] G. Chen and A. Fahim, Formal reduction of linear dierence systems, Pacic J. Math.,
182 (1998), 37-54.
[4] A. Duval, Lemme de Hessel et factorisation formelle pour les operateurs aux die-
rences, Funkcial Ekvac., 26 (1983), 349-368.
[5] G. Immink, Asymptotics of analytic dierence equations, Springer L. N. in Math.,
1085 (1984).
[6] B. Kostant, The principal three-dimensional subgroup and the Betti numbers of a
complex simple Lie group, Amer. J. Math., 81 (1959), 973-1032.
[7] D.A. Lutz and R. Schafke, On the identication and stability of formal invariants for
singular dierential equations, Linear Alg. and Appl., 72 (1985), 1-46.
[8] C. Praagman, The formal classication of linear dierence operators, Proc. Kon. Ned.
Ac. Wet. Ser A, 86 (1983), 249-261.
[9] H.L. Turrittin, The formal theory of systems of irregular homogeneous linear dier-
ence and dierential equations, Bol. Soc. Math. Mexicana, 5 (1960), 255-264.
Received June 25, 1998.
URA CNRS 751
Universit
e de Lille I
59655 Villeneuve DAscq
France
E-mail address: Guoting.Chen@univ-lille1.fr
Universit
e de La Rochelle
17042 La Rochelle
France
E-mail address: afahim@cri.univ-lr.fr
PACIFIC JOURNAL OF MATHEMATICS
Vol. 192, No. 2, 2000
WEIGHTED GRAPH LAPLACIANS AND ISOPERIMETRIC
INEQUALITIES
F.R.K. Chung and Kevin Oden
We consider a weighted Cheegers constant for a graph and
we examine the gap between the rst two eigenvalues of Lapla-
cian. We establish several isoperimetric inequalities concern-
ing the unweighted Cheegers constant, weighted Cheegers
constants and eigenvalues for Neumann and Dirichlet condi-
tions.
1. Introduction.
The study of eigenvalue ratios and gaps has a long and prolic history. The
motivation stems not only from their physical relevance but also from the
signicance of their geometric content. The early seminal work of Polya and
S. Szego [29] lay the foundation for the geometric study of eigenvalues. One
of the main techniques involves deriving isoperimetric inequalities, in one
form or another, to associate geometric constraints with analytic invariants
of a given manifold. The isoperimetic methods have been developed by
Cheeger [13], among others, to bound the rst eigenvalue of a compact
manifold by the isoperimetric constant. Further generalizations of Cheegers
constant can be attributed to Yau [32], Croke [20], Brooks [6], and others.
The question of the extent to which the eigenvalues of the Laplace operator
characterize a compact manifold has been investigated by Yau [30], Sunada
[31], Brooks [7, 8, 9], Gordon, Webb and Wolpert [24], just to name a few.
Numerous related results can be found in [3, 5, 11, 14, 22, 23, 27].
The ideas developed in this paper have their roots in results of the contin-
uous setting which have been contributed by numerous people. For example,
the early work of Payne, Polya and Weinberger [28] used geomtric arguments
to develop quite general bounds on eigenvalue gaps. Hile and Protter [25]
and later Ashbaugh and Benguria [1] have obtained sharp upper bounds on
the ratio of the rst two Dirichlet eigenvalues of a compact manifold.
Davies [21] rst transformed the problem to a weighted L
2
space with
weighted operator in the continous setting and he considered eigenvalue gaps
for the weighted cases. In this paper, we introduce the weighted Cheeger
constant of a graph which is a discrete analogue of the results of Cheng and
Oden [15].
257
258 FAN CHUNG AND KEVIN ODEN
For an induced subgraph S of a graph G, the weighted Cheeger con-
stant arises quite naturally by considering a weighted Laplacian (using the
rst Dirichlet eigenfunction u). The following study parallels in many re-
spects the study in [15] for the continuous cases. We establish a weighted
Cheegers inequality for the rst eigenvalue
u
of the weighted Laplacian of
a graph. We derive several inequalities involving the unweighted eigenvalue
and weighted eigenvalues
u
as well as the Dirichlet eigenvalues
D,i
and
Neumann eigenvalues
N,i
. (The detailed denition will be given in Section
2.) For example, we show that the following relation between the weighted
eigenvalue
u
, and the spectral gap of the Dirichlet eigenvalues:
D,2
D,1
1
2
D,1
D,2
h
2
u
. (1)
We prove the following eigenvalue inequality involving the unweighted and
weighted eigenvalues, the Neumann eigenvalues and the Dirichlet eigenval-
ues.
u
D,1
N,1
. (2)
We also prove the following inequality involving the Dirichlet eigenvalues,
the unweighted Cheegers constant h and the weighted constant h
u
.
h 2h
u
+
D,1
+ 2
_
D,1
h
u
. (3)
For a strongly convex subgraph S of an abelian homogeneous graph , we
show that
D,2
D,1
1
8kD
2
where D denotes the diameter of S and k is the degree of (which is regular).
For undened terminology in graph theory and spectral geometry, the reader
is referred to [4, 16] and [12, 30], respectively.
The organizaton of this paper is as follows: In 2 we give basic deni-
tions and describe basic properties for the Laplacian of graphs. In 3 we
dene a weighted graph Laplacian and its associated rst eigenvalue and
the weighted Cheegers constant. In 4 we prove the weighted Cheegers
inequalities. In 5, we establish several isoperimetric inequalities concerning
Neumann and Dirichlet eigenvalues.
2. Preliminaries.
We consider a graph G = (V, E) with vertex set V (G) and edge set E(G).
The value of a function f : V (G) R at a vertex y is denoted by f
y
. For
y V (G), we let d
y
denote the degree of y (which is the number of vertices
adjacent to y). We dene the normalized Laplacian of G to be the following
GRAPH LAPLACIANS AND INEQUALITIES 259
matrix:
L(x, y) =
_
_
1 if x = y, and d
x
,= 0,
d
x
d
y
if x and y are adjacent,
0 otherwise.
The eigenvalues of L are denoted by 0 =
0
1
n1
and when G
is k-regular, it is easy to see that
L = I
1
k
A,
where A is the adjacency matrix of G. It is often convenient to write L as
a product of simpler matrices for a connected graph.
L = T
1/2
LT
1/2
,
where L denotes the combinatorial Laplacian dened as follows:
L(x, y) =
_
_
_
d
x
if x = y,
1 if x and y are adjacent,
0 otherwise,
and T is the diagonal matrix with the (x, x)-th entry having value d
x
.
For a regular graph of degree k, L is just a multiple k of L. For a general
graph, our denition of the normalized Laplacian leads to a clean version
of the Cheeger inequality for graphs [17] (also see (4)), while the Cheeger
inequality using the combinatorial Laplacian involves additional complica-
tions concerning scaling. The advantages of the normalized Laplacian are
perhaps due to the fact that it is consistent with the formulation in spectral
geometry and in stochastic processes. In the rest of the paper, we will call
L the Laplacian, for short.
Associated with L is the positive denite quadratic form Q(f) = f, Lf).
For any real-valued function f we have
Q(g)
g, g)
=
g, T
1/2
LT
1/2
g)
g, g)
=
f, Lf)
T
1/2
f, T
1/2
f)
=
xy
(f
x
f
y
)
2
x
f
2
x
d
x
where f = T
1/2
g, x y denotes x is adjacent to y, and the sum
xy
ranges
over all unordered adjacent pairs x and y.
260 FAN CHUNG AND KEVIN ODEN
Let denote the least nontrivial eigenvalue of L of a graph G. The
eigenvalue is closely related to the isoperimetric invariant, so called the
Cheeger constant, dened as follows:
In a graph G, the volume of a subset X of the vertex set V , denoted by
vol (X) , is dened to be
xX
d
x
.
Denition. The Cheeger constant of a graph G with vertex set V is dened
to be
h = min
XV (G)
e(X, V X)
minvol (X), vol (V X)
where e(X, V X) denotes the number of edges between X and V X.
The Cheeger inequality for a graph G states [17] that
2h
h
2
2
. (4)
We will establish several variations of the Cheeger inequality by consid-
ering eigenvalues of induced subgraphs of a graph.
In a graph G with vertex set V , an induced subgraph on a subset S of
V has vertex set S and edge set consisting of all edges with both endpoints
in S. We denote the induced subgraph determined by S also by S if there
is no confusion. The extension S
N,1
= inf
f0
{x,y}S
(f
x
f
y
)
2
xS
f
2
x
d
x
(5)
where the inmum is taken over all nontrivial functions f : S
S R
satisfying, for each x S,
yS
yx
(f
x
f
y
) = 0. (6)
We remark that (6) is called the Neumann boundary condition for a function
f : V R. It corresponds to the Neumann boundary condition in the
continuous setting. That is,
f
(x) = 0
GRAPH LAPLACIANS AND INEQUALITIES 261
for x S where is the normal direction orthogonal to the tangent hyper-
plane at x.
Denition. The Dirichlet Eigenvalues of S is dened as follows:
D,1
= inf
f|S=0
{x,y}S
(f
x
f
y
)
2
xS
_
f
x
f
x
_
2
d
x
(7)
where f[S = 0 means f(x) = 0 for x S. In general, we dene
D,i
= inf
f|S=0
sup
f
C
i1
{x,y}S
(f
x
f
y
)
2
xS
_
f
x
f
x
_
2
d
x
where C
i
is the subspace spanned by the jth eigenfunctions
j
with eigen-
value
D,j
for j i.
3. Weighted Graph Laplacian.
Let u be the rst eigenfunction for Dirichlet condtions of the induced sub-
graph S achieving
D,1
in (7). Here are some useful facts about u and
D,1
which follows from the denitions (see [16]):
Fact 1: u 0 on S and u = 0 on S.
Fact 2:
D,1
< 1.
Fact 3: For x S, we have
y
{x,y}S
(u
x
u
y
) =
D,1
u
x
d
x
.
Fact 4:
y
{x,y}S
(u
x
u
y
) =
{x,y}S
(u
x
u
y
)
2
.
We will use u to dene a weighted Laplacian L
u
as follows:
L
u
(x, y) =
_
_
u
2
x
if x = y and d
x
,= 0,
u
x
u
y
d
x
d
y
if x and y are adjacent,
0 otherwise,
where d
x
is the degree of x in G.
262 FAN CHUNG AND KEVIN ODEN
The rst eigenvalue
u
of L
u
satises
u
= inf
f0
P
xS
f
x
u
2
x
d
x
=0
{x,y}S
(f
x
f
y
)
2
u
x
u
y
xS
f
2
x
u
2
x
d
x
. (8)
We dene the weighted Cheegers constant h
u
to be
h
u
= min
xX, yS\X
xy
u
x
u
y
xX
u
2
x
d
x
where the minimum ranges over all X S and
xX
u
2
x
d
x
xS\X
u
2
x
d
x
.
4. Weighted Cheegers inequalities.
We will rst give a functional formulation of the weighted Cheegers constant
which will be used later. As in the continuous setting [30], this shows the
connection between the functional properties of a graph and its spectral
properties.
Theorem 1. Suppose u is a nonnegative vector in R
n
(i.e. u
i
0 for all
i) where n = V (S). Then
h
u
= inf
f0
sup
CR
xy
[f
x
f
y
[u
x
u
y
xS
[f
x
C[u
2
x
d
x
.
Proof. We choose C to satisfy:
1) For < 0 we have
f
x
C<
u
2
x
d
x
f
x
C
u
2
x
d
x
.
2) For > 0 we have
f
x
C<
u
2
x
d
x
f
x
C
u
2
x
d
x
.
Let g() =
{x,y}E
f
x
+Cf
y
u
x
u
y
. Then we have
xy
[f
x
f
y
[u
x
u
y
=
_
g()d
GRAPH LAPLACIANS AND INEQUALITIES 263
=
_
0
g()
f
x
C
u
2
x
d
x
f
x
C
u
2
x
d
x
d
+
_
0
g()
f
x
C
u
2
x
d
x
f
x
C
u
2
x
d
x
d
h
u
_
0
f
x
C
u
2
x
d
x
d +h
u
_
0
f
x
C
u
2
x
d
x
d
= h
u
xS
[f
x
C[u
2
x
d
x
.
Conversely, suppose X
0
S is a subset such that
h
u
=
xX
0
, yS\X
0
xy
u
x
u
y
xX
0
u
2
x
d
x
.
Dene f as follows:
f
x
=
_
1 x X
0
1 x S X
0
.
Then
inf
f
sup
CR
xy
[f
x
f
y
[u
x
u
y
xS
[f
x
C[u
2
x
d
x
sup
C
xX
0
,yS\X
0
xy
2u
x
u
y
xX
0
[1 C[u
2
x
d
x
+
xS\X
0
[1 +C[u
2
x
d
x
.
We consider inf
C
_
xX
0
[1 C[u
2
x
+
xS\X
0
[1 +C[ u
2
x
_
, for 1C1.
Dene
f(c) =
_
_
xS\X
0
u
2
x
d
x
xX
0
u
2
x
d
x
_
_
C +
_
_
xS\X
0
u
2
x
d
x
+
xX
0
u
2
x
d
x
_
_
on the interval 1 C 1. Since
xS\X
0
u
2
x
d
x
xX
0
u
2
x
d
x
0, f has
a minimum at C = 1 by elemetary calculation. Therefore,
inf
f
sup
CR
xy
[f
x
f
y
[u
x
u
y
xS
[f
x
C[u
2
x
d
x
xX
0
,yS\X
0
xy
2u
x
u
y
xX
0
2u
2
x
d
x
h
u
264 FAN CHUNG AND KEVIN ODEN
which completes the proof of Theorem 1.
The above theorem leads to several Cheeger-type inequalities concerning
eigenvalue gaps. We will show that the eigenvalue gap
D,2
D,1
is, in
fact, the rst eigenvalue of the weighted Laplacian dened in 3.
Proposition 1.1. Suppose u is the rst Dirichlet eigenfunction of the La-
placian on the induced subgraph S of G. Let
u
be the rst eigenvalue of the
u-weighted Laplacian, L
u
. Then,
u
=
D,2
D,1
.
Proof. For any function f : S S R
+
, by using Fact 3 in Section 3 we
have
D,1
x
f
2
x
u
2
x
d
x
=
x
f
2
x
u
x
yx
(u
x
u
y
)
=
xy
(u
x
u
y
)(f
2
x
u
x
f
2
y
u
y
)
=
xy
_
f
2
x
u
2
x
f
2
x
u
x
u
y
+f
2
y
u
2
y
f
2
y
u
y
u
x
+ 2f
x
f
y
u
x
u
y
2f
x
f
y
u
x
u
y
_
=
xy
(f
x
u
x
f
y
u
y
)
2
_
f
2
y
u
x
u
y
+f
2
x
u
x
u
y
2f
x
f
y
u
x
u
y
_
=
xy
(f
x
u
x
f
y
u
y
)
2
xy
(f
x
f
y
)
2
u
x
u
y
.
Therefore,
u
= inf
f0
P
xS
f
x
u
2
x
=0
xy
(f
x
f
y
)
2
u
x
u
y
x
f
2
x
u
2
x
d
x
= inf
f0
P
xS
f
x
u
2
x
=0
xy
(f
x
u
x
f
y
u
y
)
2
x
f
2
x
u
2
x
d
x
D,1
= inf
g0
P
xS
g
x
u
x
=0
xy
(g
x
g
y
)
2
x
g
2
x
d
x
D,1
=
D,2
D,1
.
xy
(f
x
u
x
f
y
u
y
)
2
x
f
2
x
u
2
x
d
x
= inf
g0
P
xS
g
x
u
x
d
x
=0
xy
(g
x
g
y
)
2
x
g
2
x
d
x
=
D,2
.
Proposition 1.2.
2(1
D,1
)
u
.
Proof. Let f denote the eigenfunction achieving the Dirichlet eigenvalue
D,2
. We consider
2
x
f
2
x
u
x
yx
u
y
xy
2(f
2
x
+f
2
y
)u
x
u
y
xy
(f
x
f
y
)
2
u
x
u
y
.
Using the fact that
yx
u
y
= (1
D,1
)u
x
d
x
we then have
2(1
D,1
)
x
f
2
x
u
2
x
d
x
xy
(f
x
f
y
)
2
u
x
u
y
.
This implies
2(1
D,1
)
xy
(f
x
f
y
)
2
u
x
u
y
x
f
2
x
u
2
x
d
x
=
D,2
.
We will use the above facts to prove several versions of weighted Cheegers
inequalities. The following proof is somewhat similar to the unweighted case
in [15].
Theorem 2.
D,2
D,1
1
2
D,1
D,2
h
2
u
.
266 FAN CHUNG AND KEVIN ODEN
Proof. Let f denote the function achieving
u
in (8). We label the vertices
of G, so that f
i
f
v
i
f
i+1
and let p denote the least integer such that
f
p
0. For each i, 1 i [S[ we consider the cut C
i
= v
j
, v
k
E(S) :
1 j i k n. We dene to be
= min
1i|S|
{j,k}C
i
2u
j
u
k
min
_
_
ji
u
2
j
d
j
,
j>i
u
2
j
d
j
_
_
.
It is clear that h
u
. Without loss of generality, we may assume
jp
u
2
j
d
j
j>p
u
2
j
d
j
.
We dene
V
+
= x : f
x
0
E
+
= x, y : x V
+
or y V
+
.
We dene
g
x
=
_
f
x
if x V
+
0 otherwise.
Since
u
f
x
u
x
d
x
=
yx
(f
x
f
y
)u
x
u
y
we have
u
=
xV
+
f
x
{x,y}E
+
(f
x
f
y
)u
x
u
y
xV
+
f
2
x
u
2
x
d
x
{x,y}E
+
(g
x
g
y
)
2
u
x
u
y
xV
+
f
2
x
u
2
x
d
x
= W.
GRAPH LAPLACIANS AND INEQUALITIES 267
Then we have
W =
_
_
{x,y}E
+
(g
x
g
y
)
2
u
x
u
y
_
_
_
_
{x,y}E
+
(g
x
+g
y
)
2
u
x
u
y
_
_
_
_
xV
+
f
2
x
u
2
x
d
x
_
_
_
_
{x,y}E
+
(g
x
+g
y
)
2
u
x
u
y
_
_
_
_
{x,y}E
+
[g
2
x
g
2
y
[ u
x
u
y
_
_
2
xV
+
f
2
x
u
2
x
d
x
_
_
2
xV
+
f
2
x
u
x
yx
u
y
{x,y}E
+
(g
x
g
y
)
2
u
x
u
y
_
_
.
Since
yx
u
y
= u
x
d
x
D,1
u
x
d
x
we have
W
_
_
{x,y}E
+
[g
2
x
g
2
y
[ u
x
u
y
_
_
2
_
_
xV
+
f
2
x
u
2
x
d
x
_
_
2
_
_
_
_
_
2(1
D,1
)
{x,y}E
+
(g
x
g
y
)
2
u
x
u
y
xV
+
f
2
x
u
2
x
d
x
_
_
_
_
_
_
_
{x,y}E
+
[g
2
x
g
2
y
[ u
x
u
y
_
_
2
_
_
xV
+
f
2
x
u
2
x
d
x
_
_
2
(2 2
D,1
W)
_
_
ip
[f
2
i+1
f
2
i
[
C
i
u
x
u
y
_
_
2
_
_
xV
+
f
2
x
u
2
x
d
x
_
_
2
(2 2
D,1
u
)
268 FAN CHUNG AND KEVIN ODEN
_
_
ip
[f
2
i
f
2
i+1
[
ji
u
2
j
_
_
2
_
_
xV
+
f
2
x
u
2
x
d
x
_
_
2
(2 2
D,1
u
)
2
_
_
xV
+
f
2
x
u
2
x
dx
_
_
2
(2 2
D,1
u
)
_
_
xV
+
f
2
x
u
2
x
d
x
_
_
2
2
2 2
D,1
u
h
2
u
2
D,1
D,2
by using Proposition 1.1.
Here is another analogue of the Cheeger inequality relating the spectral
gaps of Dirichlet eigenvalues to the weighted Cheegers constant. Its proof
follows immediately from Theorem 2 and Fact 2.
Corollary 2.1.
D,2
D,1
h
2
u
2(1
D,1
)
h
2
u
2
.
A theorem of Payne, Polya and Weinberger [28] gives
D,k+1
D,k
4
k
D,i
n k
for Dirichlet eigenvalues of a bounded domain R
n
. It would be of
interest to prove a similar inequality for graphs.
5. Several isoperimetric inequalities.
It was shown in [15] that the continous analogue of h
u
was bounded below by
c h, where c is a constant depending on the dimension of the manifold and its
rolling sphere radius and h is the unweighted Neumann Cheegers constant.
In the discrete setting, similar relationships can be found between the various
unweighted and weighted Cheegers constants as well as the unweighted and
weighted eigenvalues. The following results have their origins in the work of
GRAPH LAPLACIANS AND INEQUALITIES 269
Payne, Pol ya and Weinberger [28] as well as the subsequent developments
by Ashbaugh and Benguria [1], Hile and Protter [25], and Hile and Xu [26].
Theorem 3.
u
D,1
N,1
.
Proof. From the deniton, we have
N,1
= inf
f0
P
xS
f
x
d
x
=0
{x,y}S
(f
x
f
y
)
2
xS
f
2
x
d
x
,
subject to the Neumann boundary condition
y
yx
(f
x
f
y
) = 0 for any x S.
Using the Neumann boundary condition, we have
{x,y}S
(f
x
f
y
)
2
=
xS
f
x
y
yx
(f
x
f
y
).
Let h be the eigenfunction for the weighted Laplcian and set f = h u. Then
we have
N,1
xS
f
x
yx
(f
x
f
y
)
xS
f
2
x
d
x
=
x
h
x
u
x
yx
(h
x
u
x
h
y
u
y
)
x
h
2
x
u
2
x
d
x
=
{x,y}S
(h
x
h
y
)
2
u
x
u
y
xS
yx
h
2
x
u
x
(u
x
u
y
)
x
h
2
x
u
2
x
d
x
=
u
D,1
x
h
2
x
u
2
x
d
x
x
h
2
x
u
2
x
d
x
=
u
D,1
.
D,2
2
D,1
+
N,1
.
In particular the theorem implies that
D,2
D,1
N,1
. One of the au-
thors and S.T. Yau [18] studied
N,1
on subgraphs of homogeneous graphs.
When the induced subgraph S is strongly convex (i.e., all shortest paths in
the host graph joining two vertices in S are contained in S, see [19] for more
details) it was proved in [18] that
N,1
1
8kD
2
,
where k is the degree of the S and D is the diameter. This immediately
gives:
Corollary 3.2. Suppose S is a strongly convex subgraph of an invariant
abelian homogeneous graph with edge generating set K consisting of k
generators. Then
D,2
D,1
1
8kD
2
where D is the diameter of S.
The preceding results can be related to several bounds of
N,1
in terms
of the heat kernel, as examined in [19].
The weighted Cheegers constant incorporates more information about the
graph in its denition. So in principle one would expect estimates involving
the weighted Cheeger to be better than ones involving only the unweighted
Cheegers constants. In this respect, the following isoperimetric inequality
is of interest, and can be contrasted with the upper bounds developed by
Buser [10] in the continuous setting.
Theorem 4. Suppose h is the Cheegers constant of the induced subgraph
S and h
u
the weighted Cheegers constant. Then,
h 2h
u
+
D,1
+ 2
_
D,1
h
u
.
Proof. From Theorem 1, we have
h = inf
f0
sup
C
xy
[f
x
f
y
[
x
[f
x
C[d
x
.
Since V (S) and E(S) are nite sets, there is some X S which achieves h.
Using the rst Dirichlet eigenfunction u, we dene
f
x
C =
_
u
2
x
if x X,
u
2
x
if x S X.
GRAPH LAPLACIANS AND INEQUALITIES 271
where C is as dened in the proof of Theorem 1. Therefore we have
h
xy
xX,yS\X
(u
2
x
+u
2
y
) +
xy
{x,yX} or {x,yS\X}
[u
2
x
u
2
y
[
x
u
2
x
d
x
xy
_
(u
x
u
y
)
2
+ 2u
x
u
y
_
+
xy
{x,yX} or {x,yS\X}
([u
2
x
u
2
y
[ (u
x
u
y
)
2
)
x
u
2
x
d
x
=
D,1
+ 2h
u
+
xy
{x,yX} or {x,yS\X}
_
[u
2
x
u
2
y
[ (u
x
u
y
)
2
_
x
u
2
x
d
x
.
We note that
xy
{x,yX} or {x,yS\X}
_
[u
2
x
u
2
y
[ (u
x
u
y
)
2
_
xy
[u
x
u
y
[ 2 minu
x
, u
y
xy
[u
x
u
y
[
u
x
u
y
2
_
xy
(u
x
u
y
)
2
_
1/2
xy
u
x
u
y
_
1/2
2
_
x
u
2
x
d
x
_
_
D,1
h
u
,
by using the denition of
D,1
and h
u
. Therefore, we have
h 2h
u
+
D,1
+ 2
_
D,1
h
u
as claimed.
References
[1] M. Ashbaugh and R. Benguria, A sharp bound for the ratio of the rst two eigenvalues
of Dirichlet Laplacian and extensions, Annals of Math., 135 (1992), 601-628.
272 FAN CHUNG AND KEVIN ODEN
[2] M.S. Ashbaugh and R.S. Benguira, Isoperimetric inqualities for eigenvalue ratios,
Partical Dierential Equations of Elliptic Type (Cortona, 1992), Sympos. Math.,
XXXV, Cambridge Univ. Press, Cambridge, (1994), 1-36.
[3] P. Berard, Varietes Riemanniennes isospectrales non-isometriques, Seminaire Bour-
baki, 705 (March 1989).
[4] B. Bollobas, Extremal Graph Theory, Academic Press, London, 1978.
[5] H. Brascamp and E. Lieb, On extensions of the Brunn-Minkowski and Prekopa-
Leindler Theorems, including inequalities for log concave functions and with an ap-
plication to diusion equation, Journal of Functional Analysis, 22 (1976), 366-389.
[6] R. Brooks, The Bottom of the Spectrum of a Riemannian Covering, Crelles J., 357
(1985), 101-114.
[7] , Constructing isospectral manifolds, Amer. Math. Month., 95 (1988), 823-839.
[8] R. Brooks, P. Perry and V. Petersen, Finiteness of dieomorphism types of isospectral
manifolds, preprint.
[9] R. Brooks, P. Perry and P. Yang, Isospectral sets of conformally equivalent metrics,
Duke Math J., 58 (1989), 131-150.
[10] P. Buser, A note on the isoperimetric constant, Ann. Sci.
Ecole Norm. Sup., 15(4)
(1982), 213-230.
[11] S.Y.A. Chang and P. Yang, Isospectral conformal metrics on 3-manifolds, J. Amer.
Math. Soc., 3 (1990), 117-145.
[12] I. Chavel, Eigenvalues in Riemannian Geometry, Academic Press, 1984.
[13] J. Cheeger, A lower bound for the smallest eigenvalue of the Laplacian, Problems in
Analysis (R. C. Gunning, ed.), Princeton Univ. Press, (1970), 195-199.
[14] S.Y. Cheng, Eigenvalue comparison theorems and its geometric applications, Math.
Zeit., 143 (1975), 289-297.
[15] S.Y. Cheng and K. Oden, Isoperimetric Inequalities and the Gap Between the First
and Second Eigenvalues of an Euclidean Domain, The Journal of Geometric Analysis,
to appear.
[16] F.R.K. Chung, Spectral Graph Theory, CBMS Lecture Notes, 1997, AMS Publication.
[17] , Laplacians of graphs and Cheeger inequalities, Combinatorics, Paul Erdos is
Eighty, Vol. 2, edited by D. Miklos, V.T. Sos and T. Szonyi, Janos Bolyai Mathemat-
ical Society, Budapest, (1996), 157-172.
[18] F.R.K. Chung and S.-T. Yau, A Harnack inequality for homogeneous graphs and
subgraphs, Communications on Analysis and Geometry, 2 (1994), 628-639.
[19] , An eigenvalue inequality for convex subgraphs, Communications on Analysis
and Geometry, 5 (1998), 575-624.
[20] C.B. Croke, Some isoperimetric inequalities and eigenvalue estimates, Ann. Ec. Norm.
Sup., 13 (1980), 419-435.
[21] E.B. Davies, Heat Kernels and Spectral Theory , Cambridge University Press, 1989.
[22] J.J. Duistermaat and V. Guillemin, The spectrum of positive elliptic operators and
periodic bicharacteristics, Inv. Math., 29 (1975), 39-79.
[23] P.B. Gilkey, Spectral geometry of Riemannian manifolds, Recent Developments in
Geometry (S.Y. Cheng, H. Choi and R. Greene, eds.), Contemp. Math., 101 (1989),
147-153.
GRAPH LAPLACIANS AND INEQUALITIES 273
[24] C. Gordon, D. Webb and S. Wolpert, Isopectral plane domains and surfaces via Rie-
mannian orbifolds, Invent. Math., 110 (1992), 1-22.
[25] G.N. Hile and M.H. Protter, Inequalities for eigenvalues of the Laplacian, Indiana
Univ. Math. J., 29 (1980), 523-538.
[26] G.N. Hile and Z.Y. Xu, Inequalities for sums of reciprocals of eigenvalues, J. Math.
Anal. Appl., 180 (1993), 412-430.
[27] B. Osgood, R. Phillips and P. Sarnak, Compact isospectral sets of surfaces, J. Funct.
Anal., 80 (1988), 212-234.
[28] L.E. Payne, G. Polya and H.F. Weinberger, On the ratio of consecutive eigenvalues,
Journal of Math. and Physics, 35(3) (1956), 289-298.
[29] G. Polya and S. Szego, Isoperimetric inequalities in mathematical physics, Annals of
Math. Studies, 27, Princeton University Press, 1951.
[30] R.M. Schoen and S.-T. Yau, Lectures on Dierential Geometry, 1994, International
Press, Cambridge, Massachusettes.
[31] T. Sunada, Riemannian coverings and isospectral manifolds, Ann. Math., 121 (1985),
169-186.
[32] S.-T. Yau, Isoperimetric constants and the eigenvalue of a compact Riemannian man-
ifold, Ann. Scient. Ecole Norm. Sup., 8 (1975), 487-507.
Received June 12, 1996 and revised April 30, 1998. Research of the rst author was
supported in part by NSF Grant No. DMS 98-01446. Research of the second author was
supported in part by NSF Grant No. DMS 95-04834.
University of California, San Diego
La Jolla, CA 92093-0112
E-mail address: fan@euclid.ucsd.edu
Harvard University
Cambridge, MA 02138
PACIFIC JOURNAL OF MATHEMATICS
Vol. 192, No. 2, 2000
COTILTING MODULES AND BIMODULES
Riccardo Colpi and Kent R. Fuller
Cotilting modules and bimodules over arbitrary associative
rings are studied. On the one hand we nd a connection be-
tween reexive modules with respect to a cotilting (bi)module
U and a notion of U-torsionless linear compactness. On the
other hand we provide concrete examples of cotilting bimod-
ules over linearly compact noetherian serial rings.
Cotilting theory is a generalization of Morita duality in a sense that is
analogous to that in which tilting theory is a generalization of Morita equiv-
alence. Indeed, cotilting modules rst appeared as vector space duals of
tilting modules over nite dimensional algebras (see, e.g., [H, IV, 7.8]), just
as injective cogenerators are such duals of progenerators. Later, R.R. Colby
[Cb1] studied nitely generated cotilting bimodules over noetherian rings,
proving that they induce nitistic generalized Morita dualities, similar to
the nite dimensional algebra case. More recently, in [Cb2] he investigated
a more general class of representable dualities, namely (nonnitistic) gen-
eralized Morita dualities. He proved that the existence of such a duality
implies the existence of a second pair of functors between classes that com-
plement the reexive ones, obtaining a result which is close to a dual form
of the celebrated Tilting Theorem [BrBu], [HaRi].
For arbitrary rings R and S, a Morita duality between left S-modules
and right R-modules is given by the contravariant Hom functors induced by
a so called Morita bimodule
S
W
R
, namely, one such that (i) the classes of
W-reexive modules contain R
R
, W
R
,
S
S and
S
W, and are closed under
submodules, factor modules and extensions; or, equivalently, (ii)
S
W
R
is
balanced, and W
R
and
S
W are injective cogenerators. Colbys generalized
Morita dualities in [Cb2] are those induced by a bimodule
S
U
R
such that a
natural weakening of (i) holds (just closure under factor modules is left out).
Generalizing the notion of injective cogenerator, the authors of [CpDeTo]
and [CpToTr] dened a cotilting module U
R
over a ring R as one such that
Cogen(U
R
) = Ker Ext
1
R
(, U
R
). In [CpDeTo, Proposition 1.7] it is shown
that this notion is dual to that of tilting module by means of the following
Proposition. A module U
R
is a cotilting module if and only if it satises
the conditions
(1) inj dim(U
R
) 1,
275
276 RICCARDO COLPI AND KENT R. FULLER
(2) Ext
1
R
(U
R
, U
R
) = 0 for any cardinal ,
(3) Ker Hom
R
(, U
R
) Ker Ext
1
R
(, U
R
) = 0.
To obtain a homological generalization of (ii), as in [Cp], we say that a
balanced bimodule
S
U
R
in which both U
R
and
S
U are cotilting modules is
a cotilting bimodule.
In this paper we continue the study of cotilting (bi)modules over arbi-
trary rings that was begun in [Cp]. There it was shown that any cotilting
bimodule
S
U
R
induces a pair of dualities between quite large subcategories
of torsion-free and torsion modules in Mod-R and S-Mod, respectively. This
result naturally generalizes Morita dualities to torsion theories, and it is still
dual to the Tilting Theorem.
A third major component of Morita duality theory is B. M ullers theorem
[X, Corollary 4.2] that the reexive modules relative to a Morita bimodule
are precisely the linearly compact modules. In Section 1 we investigate the
related notion of torsionless linear compactness and its connection to the
reexivity of modules. This allows us to nd a bridge between Colbys gen-
eralized Morita duality and cotilting bimodules by showing that a cotilting
bimodule U induces a generalized Morita duality if and only if the classes
of the U-reexive modules coincide with those of the U-torsionless linearly
compact modules. This is accomplished, in part, by answering a question
posed in [Cp].
Perhaps the most accessible collection of examples of tilting modules over
non-artinian rings are those over hereditary noetherian serial rings. They
and their endomorphism rings were classied in [CbFu]. In Section 2 we
show that the Morita dual of a tilting module possesses most of the prop-
erties of a cotilting bimodule. Then in Section 3 we employ these results
and Warelds theorems on noetherian serial rings in [Wa] to show that the
dual of any tilting module over a noetherian serial ring with selfduality is a
cotilting bimodule. Thus we obtain a class of concrete examples of cotilting
bimodules that are not, in general, nitely generated.
We denote by R and S two arbitrary associative rings with unit, and by
Mod-R and S-Mod the category of all unitary right R- and left S-modules,
respectively. All the classes of modules that we introduce are to be con-
sidered as full subcategories of modules closed under isomorphisms. Given
a module U, we denote by add(U) the class of all direct summands of any
nite direct sum of copies of U, and by Cogen(U) the class of all modules
cogenerated by U, that is all the modules M such that there exists an ex-
act sequence 0 M U
= (
S
S)
= U
R
and
2
(
S
S)
= (U
R
)
=
S
S canonically.
(b) Let K M L 0 be an exact sequence in S-Mod, with M
-reexive. Then we have the exact sequence 0 (L) (M) I
0, where I embeds into (K), so that (I) = 0. Therefore we get the
commutative exact diagram
M L 0
2
(M)
2
(L) 0
which shows that
L
is epic.
(c) Clearly
S
L Cogen(
S
U) if and only if
L
is monic. We can conclude
by (b).
(d) By adjunction, we get (
M
)
(M)
= id
(M)
, so that (
M
) is epic.
Therefore, from the exact sequence 0M/ Rej
U
(M)
2
(M)Coker(
M
)
0 we see that (Coker(
M
))
2
(M) = 0.
(e) Again from the identity (
M
)
(M)
= id
(M)
, we see that if
M
is
an isomorphism, then
(M)
is too, and of course M Cogen(U
R
) = Ker .
278 RICCARDO COLPI AND KENT R. FULLER
Conversely, if
(M)
is an isomorphism, then (
M
) must be monic, i.e.,
Coker(
M
) Ker . Moreover Coker(
M
) Ker because of (d). Since
(Ker , Ker ) is a torsion theory, we conclude that Coker(
M
) = 0, i.e.,
M
is epic. Under the further assumption that M Ker , we conclude that
M
is an isomorphism.
(f) Since (M) Cogen(
S
U), (c) applies, giving (M) reexive. We
conclude by (e).
(g) From the exact sequence 0 L M M/L 0 in Mod-R, by
assumption we get the exact sequence 0 (M/L) (M) (L) 0.
We conclude using (e) and (f).
It is well known that linear compactness plays a fundamental role in the
study of duality. Here we introduce a concept of linear compactness with
respect to a torsion theory, drawing inspiration from [GpGaWi, 3]:
Denition 1.2. Let (T , F) be a torsion theory in Mod-R. Then a right
R-module M is called F-linearly compact if M F and for any inverse
system of morphisms {p
: M M
} with M
F and Coker(p
) T , for
all s, it happens that Coker(lim
) T .
If U
R
is a cotilting module, a module M Mod-R is called U-torsionless
linearly compact (U-tl.l.c., for short) if M is Ker -linearly compact.
Note that M Mod-R is linearly compact i M is Mod-R-linearly com-
pact, i.e., it is linearly compact with respect to the trivial torsion the-
ory ({0}, Mod-R). In particular if U
R
is a cotilting module, then the U-
torsionless linear compactness coincides with the usual linear compactness
i U
R
is an injective cogenerator.
Torsionfree linear compactness is inherited by any inverse limit of the
type in Denition 1.2, as the following result due to A. Tonolo shows:
Proposition 1.3. Let (T , F) be a torsion theory in Mod-R, and let M
Mod-R be F-linearly compact. Then for any inverse system {p
: M M
}
with M
F and Coker(p
is F-linearly compact
too.
Proof. First of all, let us note that lim
: lim
} be any inverse
system with M
F and Coker(p
lim
, for all
s, is in T , because it is an extension of a factor of Coker(lim
), which is
in T , by the torsion module Coker(p
(p
lim
= lim
lim
) T .
In [Cp, Proposition 10] it was proved that if
S
U
R
is a cotilting bimodule,
then any U-tl.l.c. module is -reexive; and the question of whether the
COTILTING MODULES AND BIMODULES 279
converse is true was posed. To give a partial answer, we start with a theorem
which generalizes a well known result, substantially due to M uller [Mu] (see
also [X, Theorem 4.1]):
Theorem 1.4. Let U
R
be a cotilting module, and let S = End(U
R
). Then
the following are equivalent for any M Mod-R:
(1) M is U-tl.l.c.
(2) M is -reexive, and for all L
i
(M) we have Coker (i) Ker .
Proof. (1) (2). Let M
R
be U-tl.l.c., let L be a submodule of (M)
and let {L
: L
L
i
(M) the canonical
inclusions, we get lim
= L and lim
= i. Let now p
= (i
)
M
:
M (L
). Then {p
(M) (M)
(M)
2
(L
)
2
(i
)
3
(M)
(
M
)
(M)
where
L
) = (
M
)
2
(i
) Ker .
Thus the hypothesis (1) applies, giving Coker(lim
) Ker . Moreover
() lim
= lim
(i
)
M
= (lim
)
M
= (i)
M
.
First, if we choose L = (M) we clearly get Coker(
M
)
= Coker(lim
)
Ker . On the other hand, since M Ker ,
M
is injective and Coker(
M
)
Ker because of Lemma 1.1(d). Therefore Coker(
M
) = 0, i.e., M is -
reexive.
Finally, in the case L is arbitrary, since
M
is an isomorphism, from ()
we get Coker((i))
= Coker(lim
) Ker .
(2) (1). Let {p
: M M
0
(ex1)
0 I
0
(ex2)
with K
= Ker(p
), I
= Im(p
), C
= Coker(p
) and
= p
.
First, let us prove that all the K
, I
, M
) : (I
0
?
?
y
=
?
?
y
=
?
?
y
0
2
(K
)
2
(M)
2
(
)
2
(I
) Coker(
2
(
)) 0
from which we get (thanks to Lemma 1.1(d)) Coker(
2
(
))
= Coker(
I
)
Ker . Thus Coker(
I
) = 0, i.e., I
2
(I
)
2
(
)
2
(M
) C
0
where C
) (M
)
(p
)
(M)
(ex3)
we get, by hypothesis, that Coker(
2
(p
)) Ker . From
2
(p
) =
2
(
)
2
(
) we see that C
= Coker(
2
(
)
(
)
(I
(
M
=
0
3
(M
)
3
(
)
3
(I
)
which shows that (
M
) is monic. Since (
M
)
(M
)
= id
(M
)
, we
conclude that
(M
)
is an isomorphism, so that M
is -reexive, because
of Lemma 1.1(e).
Finally, from (ex3), we derive the embedding lim
(p
) : lim
(M
)
(M), so that Coker((lim
(p
2
(M)
(lim
(p
))
(lim
(M
))
= lim
2
(M
) Coker((lim
(p
))) 0
=
x
?
?
M
=
x
?
?
lim
M
lim
lim
Coker(lim
) 0
which shows that Coker(lim
= Coker((lim
(p
))) Ker .
The next result points out some good properties of U-tl.l.c. modules.
Corollary 1.5. Let U
R
be a cotilting module.
COTILTING MODULES AND BIMODULES 281
(a) If M M
Ker and T Ker , then M
is U-tl.l.c. too.
(b) If M Mod-R is a factor of any U-tl.l.c. module, then
M
is surjective
and M/ Rej
U
(M) is U-tl.l.c. too.
Proof. (a) is an immediate consequence of Proposition 1.3. In order to prove
(b), let us consider an epimorphism L
M 0, with L U-tl.l.c. From
Theorem 1.4 we get that L is -reexive and, considering the embedding
0(M)
()
(L), also that Coker(
2
()) Ker . On the other hand,
from the commutative exact diagram
L
M 0
2
(L)
2
()
2
(M) Coker(
2
()) 0
we see that Coker(
2
())
= Coker(
M
) Ker , because of Lemma 1.1(d).
Hence Coker(
2
())
= Coker(
M
) = 0, so that
M
is surjective and
M/ Rej
U
(M)
=
2
(M) is U-tl.l.c. because of (a).
Proposition 1.6. Let U
R
be a cotilting module and let S = End(U
R
). Then
U
R
is U-tl.l.c. if and only if (S/I) = 0 for every left ideal I of S.
Proof. The module U
R
is -reexive because of Lemma 1.1(a). There-
fore, by Theorem 1.4, U
R
is U-tl.l.c. if and only if for any exact sequence
of the form 0 I
i
(U
R
)
=
S
S S/I 0 it happens that
Coker((i)) Ker . Finally, from the previous sequence we get the exact
sequence 0 (S/I) (S)
(i)
(I) (S/I) 0, which shows
that Coker((i))
= (S/I).
We switch now to the case of a cotilting bimodule.
Corollary 1.7. Let
S
U
R
be a cotilting bimodule and let
S
S (R
R
, respectiv-
ely) be noetherian. Then U
R
(
S
U, respectively) is U-tl.l.c.
Proof. By assumption, for any left ideal I of S the cyclic module S/I is
nitely presented, and so it belongs to the class C, as proved in [Cp, Propo-
sition 5 d)]. Moreover, from [Cp, Theorem 6 a)], we get (C) Ker , so
that (S/I) = 0. We nish the proof applying Proposition 1.6.
We are now ready to answer the question posed in [Cp, Remark 11].
Theorem 1.8. Let
S
U
R
be a cotilting bimodule. The following conditions
are equivalent for any module M
R
Ker :
(1) M
R
is U-tl.l.c.
(2) M
R
is -reexive and for all
S
L (M) we have ((M)/L) = 0.
282 RICCARDO COLPI AND KENT R. FULLER
(3) Any S-submodule of (M) is -reexive.
Proof. (1) (2). Since (M) = 0, for any embedding i :
S
L (M) we
get Coker((i)) = ((M)/L). Now apply Theorem 1.4.
(2) (3). For any
S
L (M) we get the exact sequence
2
(M)
(L) ((M)/L) 0 where, by assumption, since (1) (2),
2
(M) is
U-tl.l.c. and ((M)/L) Ker . So Corollary 1.5(a) applies, giving (L)
reexive. Since L is clearly in Ker , from Lemma 1.1(e) we obtain that L
is -reexive.
(3) (2). By assumption (M) is -reexive, and so M is -reexive
too, because of Lemma 1.1(e). Next, for any
S
L (M) we get the canoni-
cal exact sequence 0 L (M) (M)/L 0, with both L and (M)
-reexive. Then ((M)/L) = 0 because of [Cp, Lemma 4 d)].
Corollary 1.9. Let
S
U
R
be a cotilting bimodule. The following conditions
are equivalent:
(1) every -reexive right R-module is U-tl.l.c.,
(2) the class of all the -reexive left S-modules is closed under submod-
ules.
Proof. Apply (1) (3) of Theorem 1.8.
We now have the following connection between cotilting bimodules and
those bimodules
S
U
R
that induce Colbys generalized Morita dualities [Cb2]
in the sense that the classes of -reexive modules are closed under exten-
sions and submodules, and contain
S
S and R
R
, respectively.
Corollary 1.10. Let
S
U
R
be a cotilting bimodule. Then
S
U
R
induces a
generalized Morita duality if and only if the class of the -reexive modules
coincides with the class of the U-torsionless linearly compact modules, both
in S-Mod and in Mod-R.
Proof. For any cotilting bimodule
S
U
R
, the regular modules
S
S and R
R
are
-reexive, because of Lemma 1.1(a), and, similarly, any extension of two
-reexive modules is -reexive too, because of [Cp, Proposition 5 a)].
Now apply Corollary 1.9.
2. Morita duals of tilting bimodules.
Originally cotilting bimodules arose as k-duals of tilting bimodules. Namely,
consider two nite dimensional k-algebras R and S, and denote by D() the
vector space k-duality. In this context a cotilting bimodule is just the dual
D(
R
V
S
) of a nite dimensional tilting bimodule
R
V
S
, so cotilting theory for
nite dimensional algebras is just a perfect dual of tilting theory. Moreover,
since D(R
R
) is an injective cogenerator in R-Mod and adjunction induces
a natural isomorphism of left S-modules D(V
S
)
= Hom
R
(
R
V
S
, D(R
R
)), it
COTILTING MODULES AND BIMODULES 283
follows that D(V
S
) is a cotilting left S-module in our sense (see the proof
of 2.4 below). Arguing in the same way for D(
R
V ), we obtain that
S
U
R
=
D(
R
V
S
) is a cotilting bimodule in our sense.
Nevertheless, if we do not restrict our attention to nitely generated mod-
ules, cotilting theory is as far from tilting theory as Morita duality is from
Morita equivalence. Even in this classical case, the theory seems to be quite
hidden: We do not know, for instance, if the equivalent conditions of Corol-
lary 1.9 hold true.
Obviously, a natural way to generalize this construction is to consider
Morita duals of tilting bimodules. In this pursuit we are fortunate that stan-
dard methods yield the following extensions of the adjointness of the functors
Hom
A
(V, ) and V
S
and of the contravariant functors Hom
A
(, W) and
Hom
R
(, W) induced by bimodules
A
V
S
and
A
W
R
(see [AF, 20]):
Lemma 2.1. Let
S
N and M
R
be modules and
A
V
S
and
A
W
R
be bimodules.
(a) If
A
W is injective, then there are natural isomorphisms
Hom
A
(Tor
S
n
(V, N), W)
= Ext
n
S
(N, Hom
A
(V, W))
for n = 1, 2, . . . .
(b) If
A
W and W
R
are both injective, then there are natural isomorphisms
Ext
n
A
(V, Hom
R
(M, W))
= Ext
n
R
(M, Hom
A
(V, W))
for n = 1, 2, . . . .
Proof. (a) This is [CE, page 120, Proposition 5.1].
(b) Being unable to nd a reference for this part, we shall sketch a proof.
Let
P
2
P
1
P
0
A
V 0
be a projective resolution of
A
V , and note that the conditions on W yield
an injective resolution
0 Hom
A
(V, W) Hom
A
(P
0
, W) Hom
A
(P
1
, W)
Hom
A
(P
2
, W)
of Hom
A
(V, W)
R
. Then (see [R, Chapter 7]) one obtains the desired iso-
morphisms from the commutative diagram
Hom
A
(P
0
, Hom
R
(M, W)) Hom
A
(P
1
, Hom
R
(M, W))
Hom
R
(M, Hom
A
(P
0
, W)) Hom
R
(M, Hom
A
(P
1
, W))
M
= Ext
1
A
(M, ) and T
M
= Tor
B
1
(M, ).
Thus by adjointness we have
U
= H
V
W
: Mod-R S-Mod
and
U
=
W
T
V
: S-Mod Mod-R
and by Lemma 2.1
U
= H
V
W
: Mod-R S-Mod
and
U
=
W
T
V
: S-Mod Mod-R.
Also there are natural transformations
: id
Mod-R
U
U
and : id
S-Mod
U
U
and
:
U
U
id
Mod-R
and :
U
U
id
S-Mod
with the s via the usual evaluation maps, and the s derived from the
natural transformations of the Tilting Theorem [CbFu, 1.4] and the
W
s.
Thus we obtain
Duality 2.2. There are dualities
U
: Y
R
S
Y :
U
U
: X
R
S
X :
U
where the Ys and Xs are the full subcategories on whose objects the s and
the s, respectively, are isomorphisms.
Let us denote by
A
C and C
R
the classes of all linearly compact left A-
and right R-modules, respectively. Moreover, (
A
T ,
A
F) denotes the torsion
theory generated by the tilting module
A
V , and (
S
T ,
S
F) the torsion theory
cogenerated by the cotilting module
S
U = Hom
A
(
A
V
S
,
A
W) (see the proof
of 2.4 below).
By assumption, the bimodule
A
W
R
induces a duality of the form
W
: C
R
A
C :
W
COTILTING MODULES AND BIMODULES 285
and the tilting bimodule
A
V
S
induces the two equivalences
H
V
:
A
T
S
F : T
V
and H
V
:
A
F
S
T : T
V
.
Therefore, letting
A
X =
A
C
A
T and
A
Y =
A
C
A
F
we see that
X
R
W
(
A
Y), Y
R
W
(
A
X),
S
X H
V
(
A
Y),
S
Y H
V
(
A
X).
Since
A
V is a tilting module,
A
V and
A
W belong to
A
X. Thus
U
R
=
W
(
A
V ) Y
R
, R
R
=
W
(
A
W) Y
R
,
S
U = H
V
(
A
W)
S
Y,
S
S = H
V
(
A
V )
S
Y,
so, in particular, we have:
2.3. Balance. The bimodule
S
U
R
is faithfully balanced.
Since
A
V is a -module,
S
U = Hom
A
(
A
V
S
,
A
W) and
A
W is an injective
cogenerator, as in [CpToTr, 2.3 3)], we obtain:
2.4. Properties of
S
U.
S
U is a cotilting module.
One would hope that U
R
is one too. Perhaps not in general, but we do
have the following:
2.5. Properties of U
R
.
(a) There is an exact sequence 0 U
R
W
0, where
W
, W
add(W
R
). In particular U
R
is nitely cogenerated and
inj dim(U
R
) 1.
(b) There is an exact sequence 0 U
W
R
0, where U
, U
add(U
R
). In particular Ker
U
Ker
U
= 0.
(c)
W
(
A
X) Ker
U
. In particular Ext
1
R
(M, U) = 0 for all M
R
U
n
R
(n nite).
Proof. (a) Since
A
V is a tilting module, there is an exact sequence of the
form 0 A
A
V 0, with A
, A
add(
A
A). Now apply
W
.
(b) Similarly to the previous case, applying
W
to the exact sequence
0
A
A V
0, where V
, V
add(
A
V ), we obtain the re-
quired exact sequence. Finally, applying Hom
R
(M, ) to that, we see that
Hom
R
(M, U) = 0 = Ext
1
R
(M, U) implies Hom
R
(M, W) = 0, and so M = 0.
(c) For any M
W
(
A
X) we clearly have
W
(M)
A
X
A
T =
Ker Ext
1
A
(V, ). Therefore, we see by Lemma 2.1(b) that Ext
1
R
(M, U)
=
Ext
1
A
(V,
W
(M)) = 0.
From 2.3, 2.4 and 2.5 we immediately have:
Proposition 2.6. The bimodule
S
U
R
is a cotilting bimodule if and only if
Ext
1
R
(U
= X
()
.
Proof. Let Q = R/ Ann
R
(X). Then X
Q
is a faithful indecomposable mod-
ule over the artinian QF-3 ring Q. Thus X is the unique indecomposable
injective projective right Q-module (see [AF, 31 and 32]). But X
is both
injective and, since Q is artinian, projective. Moreover X
Q
is a direct sum
of indecomposable modules, since Q is artinian.
Lemma 3.2. Let R be a prime noetherian serial ring with indecomposable
injective modules E
1
, . . . , E
n
and E
0
as above. Then for any cardinal there
are cardinals , such that E
= E
()
i
E
()
0
.
Proof. Since R is semiperfect and J is nitely generated, we see that
Soc(E
i
) = Ann
E
i
(J) = Ann
E
i
(J)
= Soc(E
i
)
.
But if i = j then Soc(E
i
)e
j
= 0. Thus Soc(E
i
) = S
()
i
. So we see that
E
= E
()
i
E, with Soc(E) = 0. But the only indecomposable injective
with zero socle is E
0
, so E
= E
()
0
.
Proposition 3.3. If U is a nitely cogenerated module over a noetherian
serial ring R such that Ext
1
R
(U, U) = 0, then Ext
1
R
(U
, U) = 0 for any
cardinal .
Proof. Since U is nitely cogenerated, we have
U = E
i
1
E
i
k
X
1
X
l
where E
i
j
= E(S
i
j
), j = 1, . . . , k, and X
i
, i = 1, . . . , l, are uniserial modules
of nite length. Thus by Lemmas 3.1 and 3.2 we have
U
= E
(
1
)
i
1
E
(
k
)
i
k
E
()
0
X
(
1
)
1
X
(
l
)
l
.
Now, since Ext
1
R
(, X
i
) converts direct sums to direct products, we need
only check that Ext
1
R
(E
0
, X
i
) = 0 for all i = 1, . . . , l. To this end, consider
the minimal injective resolution
0 X
i
E
i
E
j
0.
Here we need to show that
Hom
R
(E
0
, E
i
)
Hom
R
(E
0
,)
Hom
R
(E
0
, E
j
) 0
is exact. So let 0 = Hom
R
(E
0
, E
j
) with K = Ker(). Then there is
m N such that E
0
/KJ
m
= E
i
1
. But Ext
1
R
(E
i
1
, X
i
) = 0, being a direct
summand of Ext
1
R
(U, U), so
Hom
R
(E
0
/KJ
m
, E
i
)
Hom
R
(E
0
/KJ
m
,)
Hom
R
(E
0
/KJ
m
, E
j
) 0
288 RICCARDO COLPI AND KENT R. FULLER
is exact. Thus, setting : E
0
E
0
/KJ
m
, we have a commutative diagram
E
0
E
j
E
0
/KJ
m
E
j
E
i
E
j
which shows that Hom
R
(E
0
, )( ) = = = .
Theorem 3.4. Let R be a noetherian serial ring with self-duality induced
by a bimodule
R
W
R
. If
R
V is a tilting module and S = End(
R
V ), then
S
U
R
= Hom
R
(V, W) is a cotilting bimodule.
Proof. According to Proposition 2.6, it only remains to observe that
Ext
1
R
(U
= End(
R/J
kA
k
)
= R/J
k
as rings. Now both {J
k
| k 1} and {Ann
S
(A
k
) | k 1} are down-
ward directed sets of ideals with
k
J
k
= 0 [Wa, Theorem 5.11] and so
k
Ann
S
(A
k
) = 0. Therefore, since
R
R and S
S
are both linearly compact,
we have
R
= lim
R/J
k
= lim
S/ Ann
S
(A
k
)
= S.
= P
T
(
1
)
1
T
(
l
)
l
,
it only remains to show that Ext
1
R
(P
, P) = 0 and Ext
1
R
(P
, T
i
) = 0 for
i = 1, . . . , l.
Let
R
W
R
induce a self-duality and observe that the canonical right R-
isomorphism
R Hom
R
(Hom
R
(R
R
, W), W) Hom
R
(
R
W
R
,
R
W
R
)
is also a left R-map. Now P
,
R
R)
= Ext
1
R
(P
, Hom
R
(
R
W
R
,
R
W))
= Hom
R
(Tor
R
1
(W, P
), W) = 0.
Thus, assuming, as we may, that P is a direct summand of
R
R, we do have
Ext
1
R
(P
, P) = 0.
On the other hand, if T
i
has length m, and A = R/J
m
, then
A
T
i
is
injective [AF, Theorem 32.6] and
R
T
i
= Hom
A
(
A
A
R
,
A
T
i
),
290 RICCARDO COLPI AND KENT R. FULLER
so that
Ext
1
R
(P
,
R
T
i
)
= Ext
1
R
(P
, Hom
A
(
A
A
R
,
A
T
i
))
= Hom
R
(Tor
R
1
(A
R
, P
),
A
T
i
) = 0.
Remark 3.8. (1) Krause and Saorn [KrSa, Proposition 3.8] have recently
shown that if M
R
is a nitely generated module, then every M
is isomorphic
to a direct summand of some M
()
if and only if S = End(M
R
) is left
coherent and right perfect and
S
M is nitely presented. Thus we see that if
R is right artinian and (hence) S is left artinian in a cotilting triple (S, U, R)
in the sense of [Cb1, 2], then Ext
1
R
(U
, U) = 0 = Ext
1
S
(U
, U) for any
cardinal .
(2) Over rings of nite representation type, cotilting triples yield more
examples of cotilting modules. Indeed, in a cotiling triple (S, U, R), if it hap-
pens that R is a ring of nite representation type (so that every R-module
is a direct sum of nitely generated modules), then since U
R
is a nitis-
tic cotilting module [Cb1, Theorem 3.3], we also have Ker Hom
R
(, U
R
)
Ker Ext
1
R
(, U
R
) = 0, so that U
R
is a cotilting module in the present sense.
If in addition S has nite representation type (in particular, if R is heredi-
tary [CbFu, Proposition 2.2]), then
S
U
R
is a cotilting bimodule.
Acknowledgments. This paper was written while R. Colpi was visiting the
University of Iowa in April-May 98, and he wishes to express his gratitude
to this University and especially to Kent Fuller for the great hospitality.
References
[AF] F.D. Anderson and K.R. Fuller, Rings and Categories of Modules (2nd edition),
Springer, New York, 1992.
[BrBu] S. Brenner and M. Butler, Generalizations of the Bernstein-Gelfand-
Ponomarev reection functors, in Proc. ICRA II (Ottawa, 1979), LNM 832,
Springer, Berlin, (1980), 103-169.
[CE] H. Cartan and S. Eilenberg, Homological Algebra, Princeton University Press,
1956.
[Cb1] R.R. Colby, A generalization of Morita duality and the tilting theorem, Comm.
Algebra, 17(7) (1989), 1709-1722.
[Cb2] , A cotilting theorem for rings, in Methods in Module Theory,
M. Dekker, New York, (1993), 33-37.
[CbFu] R.R. Colby and K.R. Fuller, Tilting, cotilting and serially tilted rings, Comm.
Algebra, 18(5) (1990), 1585-1615.
[Cp] R. Colpi, Cotilting bimodules and their dualities, to appear in 1998 Murcie
Euroconference Proceedings, Marcel Dekker.
[CpDeTo] R. Colpi, G. DEste and A. Tonolo, Quasi-tilting modules and counter equiva-
lences, J. Algebra, 191 (1997), 461-494.
COTILTING MODULES AND BIMODULES 291
[CpToTr] R. Colpi, A. Tonolo and J. Trlifaj, Partial cotilting modules and the lattices
induced by them, Comm. Algebra, 25 (1997), 3225-3237.
[DiMl] F. Dischinger and W. M uller, Einreihig zerlegbare Ringe sind selbstdual, Arch.
Math., 43 (1984), 132-136.
[GpGaWi] J.L. Gomez Pardo, P.A. Guil Asensio and R. Wisbauer, Morita dualities in-
duced by the M-dual functors, Comm. Algebra, 22 (1994), 5903-5934.
[H] D. Happel, Triangulated Categories in the Representation Theory of Finite
Dimensional Algebras, Cambridge Univ. Press, Cambridge, 1988.
[HaRi] D. Happel and C.M. Ringel, Tilted algebras, Trans. Amer. Math. Soc., 274
(1982), 399-443.
[KrSa] H. Krause and M. Saorn, On minimal approximations of modules, Preprint,
1998.
[Mu] B.J. M uller, Linear compactness and Morita duality, J. Algebra, 16 (1970),
60-66.
[R] J.J. Rotman, An Introduction to Homological Algebra, Academic Press, New
York, 1979.
[S] B. Stenstrom, Rings of Quotients, Springer-Verlag, Berlin, Heidelberg, New
York, 1975.
[Wa] R.B. Wareld, Serial rings and nitely presented modules, J. Algebra, 37
(1975), 187-222.
[Wb] J. Waschb usch, Self-duality of serial rings, Comm. Algebra, 14 (1986), 581-
589.
[X] W. Xue, Rings with Morita Duality, LNM 11523, Springer-Verlag, Berlin, Hei-
delberg, New York, 1992.
Received June 23, 1998 and revised October 20, 1998. Research partially supported by
grant CNR-GNSAGA.
Universit
`
a di Padova
Via Belzoni 7, 35131 Padova
Italy
E-mail address: colpi@math.unipd.it
University of Iowa
Iowa City, IA 52242
E-mail address: kfuller@math.uiowa.edu
PACIFIC JOURNAL OF MATHEMATICS
Vol. 192, No. 2, 2000
HARDYS UNCERTAINTY PRINCIPLE
ON SEMISIMPLE GROUPS
M. Cowling, A. Sitaram, and M. Sundari
A theorem of Hardy states that, if f is a function on R such
that |f(x)| C e
|x|
2
for all x in R and |
f()| C e
||
2
for
all in R, where > 0, > 0, and > 1/4, then f = 0.
Sitaram and Sundari generalised this theorem to semisimple
groups with one conjugacy class of Cartan subgroups and to
the K-invariant case for general semisimple groups. We ex-
tend the theorem to all semisimple groups.
1. Introduction.
The Uncertainty Principle states, roughly speaking, that a nonzero func-
tion f and its Fourier transform
f cannot both be sharply localised. This
fact may be manifested in dierent ways. The version of this phenomenon
described in the abstract is due to Hardy [3]; we call it Hardys Uncertainty
Principle. Considerable attention has been devoted recently to discovering
new forms of and new contexts for the Uncertainty Principle (see [2] for a
recent comprehensive survey). In particular, Sitaram and Sundari [4] gen-
eralised Hardys Uncertainty Principle to connected semisimple Lie groups
with one conjugacy class of Cartan subgroups and to the K-invariant case for
general connected semisimple Lie groups. We extend the theorem of Sitaram
and Sundari [4], and establish a form of Hardys Uncertainty Principle for
all connected semisimple Lie groups with nite centre.
2. The theorem.
Let G be a connected real semisimple Lie group with nite centre. Let KAN
be an Iwasawa decomposition of G, and let MAN be the associated minimal
parabolic subgroup of G. The Lie algebras of G and A are denoted by g
and a. The Killing form of g induces an inner product on a and hence on
the dual a
of C(M), the
space of continuous complex-valued functions on M. For such a , and in
293
294 M. COWLING, A. SITARAM, AND M. SUNDARI
the complexication a
C
of a
g G.
Note that such functions are determined by their restrictions to K, i.e.,
eectively we are dealing with a subspace of C(K). The representation
0
,
of G is the left-translation representation of G on this space. We dene the
inner product , of and in H
0
,
to be
K
(k) (k) dk;
denotes the associated norm.
Denote by H
,
the completion of H
0
,
with this norm, and by
,
the
extension of
0
,
to H
,
. The space H
,
may be identied with a subspace
of L
2
(K), and H
0
,
with the space of continuous functions in H
,
.
For in
M and in a
, the representation
,
is unitary. This repre-
sentation lifts to a representation of L
1
(G) by integration, as follows. First,
for f in L
1
(G) and and in H
,
, the integral
G
f(g)
,
(g), dg
converges, to B
f
(, ) say. Next, B
f
is a sesquilinear form on H
,
. Thus
there exists a unique bounded operator, denoted
,
(f), such that
,
(f), =
G
f(g)
,
(g), dg , H
,
.
We denote by the operator norm of such operators, relative to the given
norm on H
,
. If a
C
\ a
> 1/4
for all in
M, and that f is a measurable function on G such that
|f(kak
)| C exp(| log a|
2
) k, k
K a A
and
,
(f) C
exp(
||
2
)
M a
.
Then f = 0.
UNCERTAINTY ON SEMISIMPLE GROUPS 295
Proof. Let and be irreducible representations of K, with characters
and
. Dene f
,
by the formula
f
,
(g) = dim dim
(k)
(k
) f(kgk
) dk dk
.
By a straightforward estimate,
|f
,
(kak
)| C (dim dim)
2
exp(| log a|
2
) k, k
K a A.
Further,
,
(f
,
) is the composition P
,
(f)P
, where P
and P
are the
projections of L
2
(K) onto the -isotypic and -isotypic subspaces, so that
,
(f
,
) C
exp(
||
2
)
M a
.
Now the arguments of Section 3 of [4] show that, if
< and
,
(f
,
) C
,,
i Re()
(x) |f(x)| dx
C
,,
exp
||
2
4
M a
C
,
where
i Re()
denotes the usual elementary spherical function, and hence
that
,
(f
,
) = 0
M a
C
.
By Harish-Chandras subquotient theorem (see G. Warner [5, p. 452]), if is
any irreducible unitary representation of G on a Hilbert space H
, then there
exist in
M and in a
C
and closed subspaces S
0
and S
1
of H
,
such that
is Namark equivalent to the quotient representation
,
of
,
on S
1
/S
0
.
This means that there is an intertwining operator A
) and (
,
, S
1
/S
0
). Consequently (f
,
) = 0,
rst on the domain of A
by continuity. In summary,
(f
,
), = 0 , H
,
and therefore, summing over and , we see that
(f), = 0 , H
.
It follows that (f) = 0 for all in
G, the unitary dual of G, whence f = 0
by the Plancherel theorem.
The argument of this paper may also be applied in other contexts. For
instance, we may show the following: if f is a measurable function on G,
rapidly decreasing in the sense that for any B in R
+
there exists A in R
+
such that
|f(kak
K a A,
296 M. COWLING, A. SITARAM, AND M. SUNDARI
and if on each principal series induced from the minimal parabolic subgroup,
the group-theoretic Fourier transform vanishes on a set of positive Plancherel
measure, then f is zero. This is a qualitative uncertainty principle related
to [1].
References
[1] M. Cowling, J.F. Price and A. Sitaram, A qualitative uncertainty principle for semisim-
ple Lie groups, J. Austral. Math. Soc. Ser. A, 45 (1988), 127-132.
[2] G.B. Folland and A. Sitaram, The uncertainty principle: a mathematical survey, J.
Fourier Anal. Appl., 3 (1997), 207-238.
[3] G.H. Hardy, A theorem concerning Fourier transforms, J. London Math. Soc., 8 (1933),
227-231.
[4] A. Sitaram and M. Sundari, An analogue of Hardys theorem for very rapidly decreasing
functions on semi-simple Lie groups, Pacic J. Math., 177 (1997), 187-200.
[5] G. Warner, Harmonic Analysis on Semi-Simple Lie Groups I, Grundlehren der math.
Wissenschaft, Bd 188, Springer-Verlag, Berlin, Heidelberg, New York, 1972.
Received June 26, 1998. The rst and third authors were supported by the Australian
Research Council.
University of New South Wales
Sydney NSW 2052
Australia
E-mail address: m.cowling@unsw.edu.au
Indian Statistical Institute
Bangalore - 560 059
India
E-mail address: sitaram@isibang.ernet.in
University of New South Wales
Sydney NSW 2052
Australia
P.O. Box No. 5978
Jeddah 21432
Kingdom of Saudi Arabia
E-mail address: madhava@memrbksa.com
PACIFIC JOURNAL OF MATHEMATICS
Vol. 192, No. 2, 2000
RESTRICTIONS OF
m
(q)-MODULES TO ALTERNATING
GROUPS
William J. Husen
We consider the restriction of an irreducible F
m
(q)-mo-
dule M to a subgroup H where F
(H)
= A
n
and where F
is algebraically closed with (char(F), q) = 1. Given certain
restrictions on the highest weight of M, we show that if m >
n
6
, then M
H
is reducible.
1. Introduction.
In the study of the maximal subgroups of classical groups, the following
question arises: Given an absolutely irreducible module M for K and a
subgroup H, when does M
H
remain absolutely irreducible? In this article
K
=
m
(q) is the commutator subgroup of an m-dimensional orthogonal
group over F
q
, and F
(H)
= A
n
is the alternating group of degree n. We
treat the case that the eld of denition of M has characteristic dividing q.
Let F be an algebraically closed eld containing F
q
, the eld with q
elements, such that char(F) > 3. Then K < K where K
=
m
(F) and we
may assume that M is a FK-module. By [6, Theorem 43], every absolutely
irreducible FK-module is the restriction of an irreducible FK-module of
the same weight. So we may assume that M = M() is an irreducible FK-
module with highest weight . Let = m/2| be the Lie rank of K and
let
i
be the fundamental dominant weights of K. The labeling of these
weights corresponds to the labeling of the Dynkin diagrams for K as given
in [3].
Hypothesis 1.1. Assume the following are true:
(1) If m is even, then =
_
2
i=1
a
i
i
_
+a
1
(
1
+
); a
i
Z, a
i
0.
(2) If m is odd, then =
_
1
i=1
a
i
i
_
+ 2a
; a
i
Z, a
i
0.
(3) If
i
=
1
j=i
a
j
, m even or if
i
=
j=i
a
j
, m odd then
(a)
1
< p = char(F
q
);
(b) 1 <
i
= k < .
297
298 WILLIAM J. HUSEN
Conditions (1) and (2) imply that M is not a faithful module for any
proper covering group of K. We now state our main result:
Theorem 1.2. Assume that H, K and M = M() are as above with n, m
10 and (q, 6) = 1. Suppose further that satises Hypothesis 1.1. If m > n
6
,
then M
H
is reducible.
Our strategy is to produce a small subspace in M with a large stabilizer
in H and then, using Frobenius reciprocity, produce an upper bound for
dim(M). We produce a lower bound for dim(M) as an FK-module using
the length of the Weyl group orbit of a subdominant weight in M. The
result then follows by comparing these two bounds.
2. A construction of W().
In this section we construct the Weyl module W() of K with highest weight
. Then M is a homomorphic image of this module. Our construction
proceeds by rst constructing the Weyl module W() for a complex Lie
group G of the same type and rank as K, then we use Kostants Z-form
to produce W(). For notational convenience we assume that
i
are the
fundamental dominant weights for G as well as for K, and accordingly,
assume that is a dominant weight of G.
Let V be a complex, m-dimensional vector space possessing a non-degene-
rate orthogonal form f ( , ) and let B be a basis for V so that
B =
_
e
i
, f
i
[ 1 i if m is even
e
i
, f
i
, x [ 1 i if m is odd
with f (e
i
, e
j
) = f (f
i
, f
j
) = f (x, e
i
) = f (x, f
i
) = 0, f (e
i
, f
j
) =
i,j
and
f (x, x) = 2. We then dene G = (V ) and let T be the maximal torus
of G with respect to B. Set V
e
= e
i
[ 1 i ) and V
f
= f
i
[ 1 i ).
Suppose that satises hypothesis 1.1 and d = maxi [
i
,= 0 so that
= (
1
, . . . ,
d
) is a proper partition of k. Let T be the tableau of shape
with entries t
i,j
= j +
s<i
s
. Dene the following subgroups of the
symmetric group o
k
:
= o
k
[ (t
i,j
) lies in the same row as t
i,j
for all i, j
(
= o
k
[ (t
i,j
) lies in the same column as t
i,j
for all i, j
and elements of Co
k
:
r
and c
sgn()
RESTRICTIONS OF
m
(q)-MODULES 299
Dene
i,j
: V
k
V
(k2)
by
i,j
(v
l
1
v
l
k
) = f(v
l
i
, v
l
j
)(v
l
1
v
l
i
v
l
j
v
l
k
) for 1 i < j k and set
/ =
i,j
ker(
i,j
).
o
k
acts on V
k
by place permutation, specically:
(v
i
1
v
i
k
) = v
i
1
(1)
v
i
1
(k)
.
This action commutes with the diagonal action of G on V
k
.
Given v V
k
, we dene one additional element r
v
in
. Dene r
v
i
s
i
Notice that r
(v) = [
v
[r
v
(v).
By [2, Theorem 19.22], W() = c
_
V
k
_
/ is the Weyl module for
G with highest weight . Since V is a complex vector space, c
_
V
k
_
=
c
r
v
(v) [ v V
k
).
Dene V
Z
= Z[B] and let V = V
Z
Z
F. Then f ( , ) = f ( , ) 1
F
is a non-degenerate orthogonal form on V . Without loss of generality, we
may assume that K = (V ). Moreover if e
i
= e
i
1
F
, f
i
= f
i
1
F
and
x = x 1
F
, then
B =
_
e
i
, f
i
[ 1 i if m is even
e
i
, f
i
, x [ 1 i if m is odd
is a standard basis for V with respect to f ( , ). We identify r
and c
with
the elements r
1
F
and c
1
F
of Fo
k
.
Suppose that L End(V ) is the adjoint module for G so that L is a
complex Lie algebra of type D
or B
. Let = r
1
, . . . , r
be the set
of simple roots corresponding to the torus T and let be the root system
generated by . Set
0
= r
1
, . . . , r
1
and let
0
be the subset
generated by
0
. Using the setup of [1, 11.2],
r
, h
r
i
[ r , 1 i
is a Chevalley basis for L and
r
, h
r
i
[ r
0
, 1 i 1 is a
Chevalley basis for L
0
L where L
0
is a Lie algebra of type A
1
. Let
G
0
< N
G
(V
e
V
f
) such that G
0
= SL
Z
F = W() where W() is
the Weyl module for K with highest weight . By the previous remarks,
|
Z
(L
0
) |
Z
(L), which implies that |
Z
(L
0
)v
Z
F W().
Dene v
i
=
i
j=1
e
i
and v
d
i=1
v
i
.
300 WILLIAM J. HUSEN
Lemma 2.1. We have:
(1) c
(v
(v
) = c
_
V
k
e
_
Z[e
1
, . . . , e
]
k
.
Proof. First note that
v
so that r
v
(v
) = v
and that c
(v
) ,= 0.
This implies that c
(v
) c
_
V
k
_
. It is clear that c
(v
) / so we
have c
(v
, t
1
1
, . . . , t
1
)
or t = diag(t
1
, . . . , t
, t
1
1
, . . . , t
1
, t
(tc
(v
)) = c
_
d
i=1
t
i
i
v
i
_
=
_
d
i=1
t
i
i
_
c
(v
).
From the denition of it follows that c
(v
_
V
k
e
_
is the Weyl module for G
0
corresponding to the partition of k via the Schur functor. The argument
above restricted to t T G
0
shows that c
(v
_
V
k
e
_
. In particular |(L
0
)c
(v
) = c
_
V
k
e
_
. Using the proof of
[4, Theorem 8.3.1], we have
|
Z
(L
0
)c
(v
) = c
_
V
k
e
_
Z[e
1
, . . . , e
]
k
which completes our proof.
Lemma 2.2. Suppose v = v
i
1
v
i
k
where v
i
is a collection of mu-
tually orthogonal, linearly independent singular vectors. Then:
(1) If sgn(
c
)
c
r
(v) ,= v for all
c
,= 1 (
,
r
, then c
r
v
(v) ,= 0;
(2) c
r
v
(v) W().
Proof. Since v is a summand of c
r
v
r
v
(v)
have the form sgn(
c
)
c
r
(v), part (1) must hold. There is g K such that
g(v
i
j
) =
i
j
e
i
j
such that
i
j
,= 0 for all 1 i k. If w = e
i
1
e
i
k
,
then r
v
= r
w
. As
c
r
w
(w) c
_
V
k
e
_
Z[e
1
, . . . , e
]
k
,
Lemma 2.1 implies that c
r
w
(w) |
Z
(L)v. Writing w =
i
1
e
i
1
i
k
e
i
k
,
we then have
c
r
w
(w) |
Z
(L)v
Z
F = W().
Finally, as W() is a FK-module, g
1
c
r
w
(w) = c
r
v
(v) W().
Though W() is a irreducible module for G, W() may not be an ir-
reducible module for K; however, it does possess a unique maximal sub-
module by [6, Lemma 80] which we denote by Rad(W()). Moreover,
M
= W()/Rad(W()).
RESTRICTIONS OF
m
(q)-MODULES 301
We now discuss the orthogonal forms on V
k
and W(). Suppose v, w
V
k
where v = v
1
v
k
and w = w
1
w
k
. We dene f
k
( , ) by
f
k
(v, w) =
k
i=1
f (v
i
, w
i
).
f
k
( , ) is a non-degenerate, G-invariant orthogonal form on V
k
. This form
is also invariant under the action of o
k
. Note that
f
k
[c
(v), c
(w)] =
sgn()f
k
[(v), c
(w)]
=
sgn()f
k
[v,
1
c
(w)]
=
f
k
[v, c
(w)]
= [(
[f
k
[v, c
(w)].
We dene f
k
( , ) on c
_
V
k
_
by
f
k
[c
(v), c
(w)] = f
k
[v, c
(w)].
By a similar argument as above, we see that f
k
[v, c
(w)] = f
k
[w, c
(v)], so
this form is symmetric. Since f
k
( , ) is bilinear and G-invariant, f
k
( , ) is
also bilinear and G-invariant. Therefore f
k
( , ) is a G-invariant orthogonal
form on W() c
_
V
k
_
. As before, f
k
( , ) = f
k
( , ) 1
F
is a K-
invariant orthogonal form on V
k
and f
k
( , ) = f
k
( , ) 1
F
is a K-invariant
orthogonal form on W(). This form is possibly degenerate. We denote the
radical of this form as W()
.
Proof. Dene v
i
=
i
j=1
f
i
and v
d
i=1
v
i
. Noting that r
v
= 1,
c
(v
(v
(v
)] = c
(v
, W()/W()
must
possess a submodule isomorphic to M. Since M
= W()/Rad(W()) and
Rad(W()) does not possess a constituent which is isomorphic to M, we
must have Rad(W()) = W()
(v) ,= 0, c
(w) ,= 0;
(2) c
(v), c
(w) W();
(3) f
k
[c
(v), c
(w)] ,= 0.
Proof. Parts (1) and (2) follow from Lemma 2.2 since r
v
= r
w
= r
and the
v
i
are distinct, similarly for w
i
. If
1
,
2
o
k
, then
f
k
[
1
(v),
2
(w)] =
k
i=1
f [v
1
1
(i)
, w
1
2
(i)
] =
_
1 if
1
=
2
0 otherwise.
Recall that
= 1. Then we have
f
k
[c
(v), c
(w)] = f
k
[r
(v), c
(w)]
=
f
k
[(v), c
(w)]
=
f
k
[(v), (w)]
= [
[.
Part (3) then follows as [
[ =
d
i=1
i
! and
i
< char(F
q
) for all i.
Lemma 2.5. M possesses a vector of weight
k
.
Proof. Let e
i
, f
i
[ 1 i k be a subset of our standard basis B for V . By
part (2) of Lemma 2.4, c
(e
1
e
k
) W(). An argument similar to
that used in Lemma 2.1 shows that c
(e
1
e
k
) is a vector of weight
k
. Hence
k
is a subdominant weight of . Condition (3) of Hypothesis 1.1
insures that is p-restricted. Therefore using the results of [5], M possesses
a vector of weight
k
.
3. Elementary abelian 3-subgroup E
k
.
Assume that k n/3 2 and recall that F
(H) so that E
k
is an elementary abelian 3-group of rank k. Then
N
k
= N
H
0
(E
k
)
= S
3
/ S
k
S
n3k2
C
k
= C
H
0
(E
k
)
= E
k
S
n3k2
RESTRICTIONS OF
m
(q)-MODULES 303
and let H
k
< C
k
so that H
k
= S
n3k2
. Note that C
N
k
(H
k
)
= S
3
/ S
k
and
this subgroup controls fusion in E
k
. Let ,= 1 E
k
and assume that is the
product of k
1
disjoint 3-cycles. Then C
N
k
()
= Z
3
/S
k
1
S
3
/S
kk
1
S
n3k2
which implies [
N
k
[ = 2
k
1
_
k
k
1
_
.
Let E
k
= Hom(E
k
, F
). The group N
k
acts on this group by
g
:
(g
1
g) for g N
k
, E
k
. We abuse notation slightly and dene
1
by
1
: (
1
) for all E
k
. Recall that In
N
k
() = g N
k
[
g
=
is the inertia group of in N
k
and note that H
k
In
N
k
().
If E
k
is non-trivial, then the previous remarks concerning the action
of N
k
on E
k
imply that [N
k
: In
N
k
()] = 2
k
1
_
k
k
1
_
for some k
1
, 1 k
1
k
and that
1
N
k
. Since
_
k
k
1
_
k unless k = k
1
, in which case 2
k
1
2k,
we have [N
k
: In
N
k
()] 2k.
4. Decomposition of V
E
k
and C
k
-invariant subspace of W().
We continue to assume that k n/32 and we now consider the restriction
of V to E
k
. Since char(F) ,= 3, we have V
E
k
=
E
k
V
where V
is
the homogeneous component of . Let v
1
V
1
and v
2
V
2
. Then
(gv
1
, gv
2
) =
1
(g)
2
(g)(v
1
, v
2
) for all g E
k
. If
1
1
,=
2
then (v
1
, v
2
) = 0
which implies V
1
V
2
when
1
1
,=
2
. Since V is non-degenerate,
dim(V
1
) = dim(V ) dim(V
1
) and it follows that V
1 must be
non-degenerate and therefore possesses a hyperbolic basis.
Pick ,= 1 so that V
,= 0. Since gV
= V
g for g N
k
, we may consider
V
to be an FIn
N
k
()-module. Let E
k1
be a maximal subgroup of E
k
which
does not contain . Dene O
+
= E
k1
N
k
and O
=
1
E
k1
N
k
so that O
+
O
=
N
k
and [O
+
[ = [O
[ k. Moreover
i
O
+
if
and only if
1
i
O
. We assume that O
+
=
i
and that O
=
1
i
.
Then
_
i
O
+
V
i
_
1
i
O
1
i
_
is an FN
k
-submodule of V
N
k
. If
N
k
then, as C
N
k
(H
k
) also controls fusion in E
k
, there is a g C
N
k
(H
k
)
such that gV
= V
. In particular V
= V
as FH
k
-modules. Dene
D = dim(V
) so that D = dim(V
i
) for all i.
Given the above decomposition, we form the following:
V
+
=
k
i=1
V
i
and V
=
k
i=1
V
1
i
.
Recall that D = dim(V
i
) and assume that v
i,j
, w
i,j
[ 1 j
i
D is a
hyperbolic basis for V
i
V
1
i
. Dene v
j
1
,... ,j
k
=
k
i=1
v
i,j
i
and w
j
1
,... ,j
k
=
k
i=1
w
i,j
i
. Then v
j
1
,... ,j
k
, w
j
1
,... ,j
k
[ 1 j
i
D forms a hyperbolic basis
for V
+
V
. If o
k
, then (v
j
1
,... ,j
k
) = v
j
1
,... ,j
k
if and only if = 1
304 WILLIAM J. HUSEN
since the V
i
are distinct. Moreover, r
v
j
1
,... ,j
k
= r
for all v
j
1
,... ,j
k
V
+
.
Similarly for w
j
1
,... ,j
k
V
.
By parts (1) and (2) of Lemma 2.4, and as V
_
V
_
W(). By part (3) of Lemma 2.4, f
k
[c
(v
j
1
,... ,j
k
),
c
(w
j
1
,... ,j
k
)] ,= 0. Whenever (j
1
, . . . , j
k
) ,= (j
1
, . . . , j
k
), we have that
f
k
[c
(v
j
1
,... ,j
k
), c
(w
j
1
,... ,j
k
)] = 0. Therefore c
(v
j
1
,... ,j
k
),
c
(w
j
1
,... ,j
k
) [ 1 j
i
D is a hyperbolic basis for
c
_
V
+
_
_
V
_
.
Lemma 4.1. We have:
(1) V
= c
(V
) as FC
k
-modules;
(2) If k is even, then C
k
stabilizes a 1-dimensional subspace of M;
(3) If k is odd, then C
k
stabilizes a D-dimensional subspace of M.
Proof. Given the hyperbolic basis v
j
1
,... ,j
k
, w
j
1
,... ,j
k
[ 1 j
i
D for
V
+
V
(v
j
1
,... ,j
k
) is a C
k
-invariant
bijection. Therefore V
+
= c
(V
+
) as FC
k
-modules. The case for V
fol-
lows by a similar argument, proving part (1). Suppose that k is even and re-
call that V
= V
j
and V
1
i
= V
1
j
as FH
k
-modules. As H
k
= S
n3k2
and all irreducible FS
n2k2
are self-dual, H
k
stabilizes a 1-dimensional
subspace of V
i
V
j
. It follows by induction that H
k
stabilizes a 1-
dimensional subspace of V
+
. If k is odd, then the same argument leads to a
D-dimensional subspace being stabilized by H
k
. As E
k
acts as scalars on V
,
these spaces are, in fact, stabilized by C
k
. Using part (1), C
k
stabilizes a sub-
space W
0
of one of these dimensions in W(). Since c
_
V
+
_
c
_
V
_
possesses a hyperbolic basis, W
0
W()
= 0. If we let
M
0
=
_
W
0
+W()
_
/W()
= M, hence (2) and
(3).
5. Proof of Theorem 1.2.
We are now in a position to prove Theorem 1.2:
Since M possesses a vector v
k
of weight
k
by Lemma 2.5, we can produce
a lower bound for dim(M) as follows: Let Weyl(K) be the Weyl group of K
and recall that is the Lie rank of K. We compute C
Weyl(K)
(
k
) using [3,
13.1], and compute [
Weyl(K)
k
[, whence
dim(M) [
Weyl(K)
k
[ = 2
k
_
k
_
. (1)
RESTRICTIONS OF
m
(q)-MODULES 305
Case 1. First suppose that k n/3 1. We assume that dim(V ) 2n
4
,
so n
4
. Since dim(M)
_
[H[
k
_
n!.
Trivially, 2
n
4
/2
>
k
1
_
<
_
k
2
_
if k
1
< k
2
< /2,
_
k
_
will be minimal when k = n/3 1 and
= n
4
. Note also that
_
k
_
=
k
i=1
(i+1)
(ki+1)
(k+1)
k
k
k
. We have:
2
n/31
_
n
4
n/3 1
_
<
n!,
2
n/31
(n
4
n/3 + 2)
n/31
(n/3)
n/31
< (n
1/2
)
n1
,
2
n/31
(n
3
1)
n/31
< n
(n1)/2
,
n
n3
< n
(n1)/2
,
n 3 < (n 1)/2,
n < 5.
This contradicts our assumption that n 10, so that dim(V ) 2n
4
or
k < n/3 1.
Case 2. We assume that k < n/3 1 and that k is odd. Lemma 4.1
and Frobenius reciprocity imply dim(M) D[H : C
k
]. Since D
2k
and
[H : C
k
] =
n!
2(3
k
)(n3k2)!
, we have dim(M)
2k
n!
3
k
(n3k2)!
. Combining
with (1) we get:
2
k
_
k
_
2k
n!
2(3
k
)(n 3k 2)!
,
2
k
_
1
k 1
_
<
n
3k+2
3
k1
,
2
k
( k + 1)
k1
(k 1)
k1
<
n
3(k1)
n
5
3
k1
,
2
k
k 1
<
n
3
3
n
5/(k1)
.
Observing that (k 1)n
5/(k1)
< n
3
when k 3 and n 10, we have
2 <
n
6
+ 2n
3
< n
6
.
Case 3. Finally we assume that k < n/3 1 and that k is even. Again
Lemma 4.1 and Frobenius reciprocity imply that dim(M) [H : C
k
]
n!
2(3
k
)(n3k2)!
. Combining with (1) we get:
2
k
_
k
_
n!
3
k
(n 3k 2)!
,
306 WILLIAM J. HUSEN
2
k
( k + 1)
k
k
k
<
n
3k+2
3
k
=
n
3k
3
k
n
2
,
2
k
k
<
n
3
3
n
2/k
,
2 <
n
5
+ 3n
9
.
In all cases, 2 < n
6
, which implies that dim(V ) n
6
. This completes
the proof of Theorem 1.2.
Acknowledgements. The author would like to thank Prof. Kay Magaard
and Prof. Robert L. Griess for their help during the preparation of this arti-
cle, and the referee for suggested improvements, particularly for Lemma 2.5.
References
[1] R.W. Carter, Simple groups of Lie type, Wiley-Interscience, 1989.
[2] W. Fulton and J. Harris, Representation theory, Springer-Verlag, 1991.
[3] J.E. Humphreys, Introduction to Lie algebras and representation theory, Springer-
Verlag, 1972.
[4] G.D. James and A. Kerber, The Representation Theory of the Symmetric Groups,
Encyclopedia of Math. and its Appl., Vol. 16, Addison-Wesley, 1981.
[5] A. Premet, Weights of innitesimally irreducible representations of Chevalley groups
over a eld of prime characteristic, Math. USSR Sbornik, 61 (1988), 167-183 (English
translation).
[6] R. Steinberg, Lectures on Chevalley Groups, Yale University Mathematics Depart-
ment, 1968.
Received June 16, 1998 and revised November 11, 1998. This research was done at Wayne
State University, Detroit MI, 48202.
Ohio State University
Columbus, OH 43210
E-mail address: husen@math.ohio-state.edu
PACIFIC JOURNAL OF MATHEMATICS
Vol. 192, No. 2, 2000
HOW LARGE ARE THE SPECTRAL GAPS?
Alex Iosevich and Steen Pedersen
Let D be a bounded domain in R
n
whose boundary has a
Minkowski dimension < n. Suppose that E
={e
2ix
}
,
an innite discrete subset of R
n
, is a frame of exponentials
for L
2
(D), with frame constants A, B, A B. Then if
R C
_
B|D|
A|D|
_
1
n
,
where C depends only on the ambient dimension n and |D|
= {e
2ix
}, , a discrete
innite subset of R
n
, with frame constants A, B, A B, in the sense that
() A||f||
2
L
2
(D)
f()|
2
B||f||
2
L
2
(D)
,
where f : D C, and
f denotes the standard Fourier transform. In this
paper we will work with frames rather than exponential basis because L
2
of
every bounded domain has frames, whereas exponential basis are generally
hard to come by. (See [Fug].) The following quantities were introduced by
Beurling. See [Br].
(1) D
+
R
= max
xR
n
#{ Q
R
(x)},
where Q
R
(x) is a cube of sidelength 2R centered at x, and let
(2) D
R
= min
xR
n
#{ Q
R
(x)}.
It follows from results proved by Landau ([Lan], see also [GR]) that if D
is a bounded domain then
(3) limsup
R
D
R
(2R)
n
|D|.
If the set E
R
and D
+
R
. These results show that,
307
308 ALEX IOSEVICH AND STEEN PEDERSEN
asymptotically, a suciently large cube centered at any point contains the
number of elements of at least equal to its volume multiplied by the
Lebesgue measure of D. In this paper we will show that if the Minkowski
dimension, , of the boundary D is smaller than the ambient dimension n,
then there exists
(4) R = C
_
B|D|
A|D|
_ 1
n
,
where C only depends on n and |D|
= lim
0
n
|{x : d(x, D) < }|
denotes the -dimensional upper Minkowski content of D, such that a cube
of sidelength 2R centered at any point contains at least one element of .
Note that if D is, say, piecewise smooth, then = n 1 and R = C
B|D|
A|D|
.
A note on notation. The letter C below shall denote a generic constant
which may change from line to line. We shall give more precise information
about the constants when appropriate.
Our main result is the following.
Theorem 1. Let D denote a domain in R
n
with nite non-zero Lebesgue
measure whose boundary D has Minkowski dimension < n in the sense
that
(5) |{x R
n
: d(x, D) < }| C
n
.
Then there exists C depending only on n, such that if
(6) R C
_
B|D|
A|D|
_ 1
n
,
then
(7) Q
R
() =
for every R
n
, and any set such that E
n
j=1
a
j
= 1. We can take =
n
j=1
1
a
j
Z. It is not hard to see that the largest
cube that does not intersect has sidelength 2R =
1
a
n
. The measure of D
HOW LARGE ARE THE SPECTRAL GAPS? 309
is 2
n
j=1
1
a
j
. It follows that
(8)
1
4n
R
|D|
1
4
,
so R grows linearly with |D|.
Example 3. We now spice up the above example to illustrate the fractal
phenomenon. Let D be a domain constructed by taking a square [0, 1]
2
and replacing the upper and lower segments by identical fractal curves of
Minkowski dimension 1 < < 2. It is not hard to see that may be taken
to be Z
2
. (See [Fug].) We now blow up the domain by the factor of t > 1
(i.e., we apply the matrix tI, where I is the identity matrix). Let tD denote
the image of D under that mapping. The set must now be taken to be
(
1
t
Z)
2
, which tells us that R in Theorem 1 should be
1
t
. On the other
hand, |tD|
, and |tD| = t
2
, so Theorem 1 gives us R
_
t
t
2
_
1
2
=
1
t
.
The following example shows that if the Lebesgue measure |D| = 0 the
conclusion of Theorem 1 no longer holds.
Example 4. Let D [0, 1] denote the Cantor type set consisting of num-
bers that do not have 1 or 3 in their base 4 expansion. Let m denote the
unique probability measure supported on D (see [Fal]) given by the equation
(9)
_
f(t)dm(t) =
1
2
_
f
_
t
4
_
dm(t) +
1
2
_
f
_
t + 2
4
_
dm(t).
One can check that
(10) m(t) = e
i
2
3
t
j=0
cos
_
t
2 4
n
_
.
If is the set of non-negative integers whose base 4 expansion does not
contain 2 or 3, then E
is an orthonormal basis of L
2
(m). (See [JP].)
In particular this shows that the conclusion of Theorem 1 fails miserably
in this case.
Example 5. In this example we shall see that there exist families of do-
mains with piecewise smooth boundaries such that the volume of each do-
main is 1, the length of the boundary tends to innity, but R, in the sense
of Theorem 1, may always be taken to be
1
2
+, for any > 0.
Let D
k
denote the unit square in R
2
where the upper and lower edges are
replaced by a sawtooth function with k teeth where the height of each tooth
is
1
2
. The length |D
k
| goes to innity as k . The set for each D
k
is Z
2
, so R, in the sense of Theorem 1, may always be taken to be
1
2
+ ,
for any > 0. This says that the inequality (6) does not sharply describe
the behavior of R in this case. However, the proof of Theorem 1 (see the
discussion at the end of the proof of Theorem 1 below) shows that in some
310 ALEX IOSEVICH AND STEEN PEDERSEN
cases R may be taken to be C
diameter (D)
|D|
, where C depends only on n. We
shall see that the example given in this paragraph falls into that category.
In all the previous examples we used frames which were actually orthogo-
nal bases. However, this phenomenon persists in the cases when orthogonal
exponential basis do not exist and we have to make do with frames.
Example 6. Let B
r
denote the disc of radius r in R
2
centered at the origin.
It was shown in [JP2] that =
1
2r
Z
2
is frame for L
2
(B
r
) with constants
A = B = 4r
2
. Note that we do not have orthogonal basis becuase, in
particular, that would imply that A = B = |B
r
| = r
2
. It is well known
that B
r
does not have orthogonal basis of exponentials. See [Fug].
It is clear that R, in the sense of Theorem 1 must be taken to be greater
than
1
4r
, which is exactly what Theorem 1 predicts.
The key estimate (see Lemma 9 below) involved in the proof of Theorem
1 is
(11)
/ Q
R
()
|
D
( )|
2
C
|D|
R
n
,
for any R
n
, where C depends only on the dimension and on the frame
constant B.
This estimate is similar to the estimate that comes up in the theory of
irregularities of distributions, (see [Mgr, p. 110]), namely that for any
domain S whose boundary is a piecewise C
1
curve C
(12)
_
|t|R
|
S
(t)|
2
dt
|C|
2
2
R
.
In fact, our proof of the estimate (11) given in Lemma 9 below uses an
idea from the proof of the estimate (12) given by Brandolini, Colzani, and
Travaglini in [BCT].
The proof of Theorem 1 is based on the following sequence of lemmae.
Lemma 7. For any f L
2
(D) dene
(13) F
D
f() =
_
D
e
2ix
f(x)dx,
and let
f denote the standard Fourier transform
(14)
f() =
_
R
n
e
2ix
f(x)dx.
Let t
h
f(x) = f(x + h), and let
D
denote the characteristic function of
D. Then
F
D
t
h
D
() = e
2ih
DD+h
(), (15)
F
D
t
h
D
() =
DD+h
(), (16)
HOW LARGE ARE THE SPECTRAL GAPS? 311
and
F
D
D
() =
D
(). (17)
The proof is straightforward.
Lemma 8. Let D be as above. Then
_
D
|
D
(x +h)
D
(x h)|
2
dx C|h|
n
, (18)
and
_
D
|
D
(x)
D
(x h)|
2
dx C|h|
n
, (19)
with C C
|D|
, where C
depends only on n.
Remark. We note again that even though the estimate C C
|D|
is
best possible over all hs, for special choices of h, the estimate is frequently
much better. (See Example 5 above.)
To prove (19) note that the left hand side equals |{D(D+h)} {(D+
h) D}| |{x R
n
: d(x, D) < h}| C|D|
|h|
n
. The proof of (18) is
similar.
The key lemma is the following. (See [BCT] for a similar argument.)
Lemma 9. Let D be as above and let be such that E
is a frame of L
2
(D)
with frame constants A and B, A B. Then
(20)
{Q
2
k+1
Q
2
k
}
|
D
()|
2
CB2
k(n)
,
where Q
R
= Q
R
(0, . . . , 0), and C as in Lemma 8.
To prove Lemma 9 chose N boxes A
j
k
and N vectors h
j
such that 2
k
|h
j
| 2
k+1
,
N
j=1
A
j
k
= Q
2
k+1 Q
2
k, and
(21)
e
2ih
j
1
1, A
j
k
.
Clearly this can be done in any dimension n, for a suciently large N =
N(n).
Now, by triangle inequality
_
_
_
A
j
k
|
D
()|
2
_
_
_
1
2
(22)
_
_
_
A
j
k
|
DD+h
j
()|
2
_
_
_
1
2
+
_
_
_
A
j
k
|
D
()
DD+h
j
()|
2
_
_
_
1
2
312 ALEX IOSEVICH AND STEEN PEDERSEN
= I +II.
By Lemma 7, the frame property, and Lemma 8 we get
II
2
|
D
()
DD+h
j
()|
2
(23)
=
|F
D
D
() F
D
t
h
j
D
()|
2
B
_
D
|
D
(x)
D
(x h
j
)|
2
dx
CB|h
j
|
n
CB2
k(n)
.
On the other hand, by (21), Lemma 7, the frame property, and Lemma 8
we get
I
2
A
j
k
|
DD+h
j
()|
2
|e
2ih
1|
2
(24)
|
DD+h
j
()|
2
|e
2ih
j
1|
2
=
|F
D
t
h
j
D
() F
D
t
h
j
D
()|
2
B
_
D
|
D
(x +h
j
)
D
(x h
j
)|
2
dx
CB|h
j
|
n
CB2
k(n)
.
Proof of Theorem 1.
Since E
is a frame for L
2
(D) if and only if E
|F
D
D
()|
2
=
|
D
()|
2
(25)
=
Q
R
|
D
()|
2
+
{/ Q
R
}
|
D
()|
2
.
Using Lemma 9 we see that if R = 2
k
0
,
(26)
{/ Q
R
}
|
D
()|
2
=
k=k
0
Q
2
k+1
Q
2
k
|
D
()|
2
CB2
k
0
(n)
=
BC
R
n
.
HOW LARGE ARE THE SPECTRAL GAPS? 313
So by (25) and (26)
(27)
Q
R
|
D
()|
2
A|D|
BC
R
n
which proves that if R >
_
BC
A|D|
_ 1
n
, then
(28) Q
R
= .
Moreover, the above proof shows that C C
|D|
where C
depends
only on n.
Remark. In the proof above the key estimate is |{D D h}|
C|h|
n
|D|
function germs of R
n
to
R around the origin with maximal ideal m(n). The powers of m(n) will be
denoted by m(n)
k
and m(n)
k=1
m(n)
k
. For I = (i
1
, . . . , i
n
) a multi-
index of natural numbers and x = (x
1
, . . . , x
n
) we shall write x
I
= x
i
1
1
. . . x
i
n
n
and [I[ = i
1
+ + i
n
, also for a germ f,
|I|
f
x
I
=
|I|
f
x
i
1
1
x
i
n
n
.
For S a subset of R
n
, 0 S, cl(S) and int (S) will denote the closure
and the interior of S respectively and G
S
will be the group of germs of
dieomophisms at 0, such that (x) = x x S. Also d(x, S) will denote
the usual distance from the point x to the subset S.
Finally if f is a germ, f/x
i
will be the partial derivatives of f and
_
f
x
i
_
will be the ideal of c (n) generated by them. Also for a germ f,
315
316 LE
( m l).
We shall denote /(J) the minimum l satisfying this equality, therefore
/(J) /((J)).
Since m(n)
k
ker, then (J m(n)
k
) = (J) (m(n)
k
) = (J) M
k
.
If we apply the epimorphism to the equality (), we get M
m
(J) =
M
ml
(M
l
(J)), m l. Therefore /((J)) /(J) and hence /(J) =
/((J)).
In case I is a radical ideal of c (n), we get in fact I m(n)
m
=
m(n)
mk
(I m(n)
k
) m k.
FINITE RELATIVE DETERMINATION 317
Theorem 3. Consider I a nitely generated ideal of c (n) . Then for any
k < , I m(n)
k
is also nitely generated.
Proof. Consider g
1
, . . . , g
s
generators of I and let f =
s
i=1
h
i
g
i
. Then we
have
f =
s
i=1
h
(k)
i
g
i
+
s
i=1
h
[k]
i
g
i
,
where h
(k)
i
is the (k 1) jet of h
i
and h
[k]
i
= h
i
h
(k)
i
.
Hence as vector spaces I = V + m(n)
k
I, where V is the vector space
generated by
_
x
I
g
i
_
with [I[ k 1. Therefore I m(n)
k
= V m(n)
k
+
m(n)
k
I. It is clear that a basis of the subspace V m(n)
k
of V and the
generators of m(n)
k
I generate I m(n)
k
as an ideal of c (n).
Theorem 4. Suppose I is a radical ideal of c (n), I m(n)
k
a nitely gen-
erated ideal and I m(n)
k
Im(n)
_
f
x
i
_
with k /(I) . Then f is k-
determined relative to G
S
, where S = z(I).
Theorem 5. Let f be a k-determined germ relative to G
S
, S = z(I) and I
a radical nitely generated ideal. Then I m(n)
k+1
I
_
f
x
i
_
+m(n)
k+2
I.
Joining Theorems 4 and 5 we get for I a nitely generated ideal, the
following:
Theorem 6. Let f be a germ, I a nitely generated radical ideal, S = z(I),
and k /(I). Then f is nitely determined relative to G
S
if and only
if there exists a number l greater or equal than k such that m(n)
l
I I
_
f
x
i
_
.
The proofs of the above theorems can be found in [K], Theorems 11 and
15.
We can change Theorem 4 in the following way.
Theorem 7. Let I be a radical ideal, k = /(I) and suppose that I
m(n)
k
is nitely generated. Let l be a natural number such that m(n)
l
I
m(n)I
_
f
x
i
_
. Then f is (k + l 1) determined relative to G
S
, where
S = z(I).
Proof. Let g be a germ with g f on S and j
k+l1
g(0) = j
k+l1
f(0).
If we dene the trivial homotopy F(x, t) = (1 t) f(x) + tg(x) we get
F
t
= g f m(n)
k+l
I and
f
x
i
=
F
x
i
+ t
_
f
x
i
g
x
i
_
.
318 LE
f
x
i
) implies
f is l determined on the right ([M]).
(b) If I = x
1
, . . . , x
s
) then k = 1 and we get that m(n)
l
I m(n)I
f
x
i
)
implies f is l determined relative to G
S
, S = 0 R
ns
([P-L]).
Corollary 8. Let f be a germ, I a radical ideal, k = / (I) and I m(n)
k
be nitely generated. Suppose that m(n)
l
m(n)
f
x
i
). Then f is (k+l 1)-
determined relative to G
S
. Hence nite determination on the right implies
nite relative determination.
Proof. Since m(n)
l
m(n)
f
x
i
) then m(n)
l
I m(n)I
f
x
i
). We now use
Theorem 7.
We are now interested in determining for which ideals I we have the
converse of Corollary 8. For this purpose we need the following:
Theorem 9. Let A be a commutative ring, I, J, K ideals of A with I =
g
1
, . . . , g
k
). Suppose that ag
i
= 0 for all i and a J
k
+ K implies a = 0.
Then if IJ IK hence J
k
K.
Proof. Let m
1
, . . . , m
k
be arbitrary elements of J, then g
i
m
i
=
k
j=i
g
j
d
ij
i with 1 i k (d
ij
K). In matricial notation we can write
() C
_
_
_
g
1
.
.
.
g
k
_
_
_
=
_
_
_
0
.
.
.
0
_
_
_
where C = (
ij
m
i
d
ij
).
If we multiply () by the adjoint of C we get (det C)g
i
= 0 i, but
det C = m
1
m
k
+ b with b K. Hence by hypothesis det C = 0 and
then m
1
m
k
K, therefore J
k
K.
FINITE RELATIVE DETERMINATION 319
Corollary 10. Let A = c (n), J = m(n)
l
, (or J = m(n)
), K =
f
x
i
)
and I ideal with
I
= g
1
, . . . , g
k
) . Suppose that hg
i
= 0 for all i and
h m(n)
lk
+
_
f
x
i
_
(or h m(n)
+
f
x
i
)) implies h 0. Then if
I
m(n)
l
I
_
f
x
i
_
(or
I
m(n)
I
f
x
i
)) hence m(n)
lk
f
x
i
) (or m(n)
f
x
i
))
and f is (lk + 1)-determined on the right ( - determined on the right).
This result motivates us to nd examples where I is a nitely generated
radical ideal satisfying the hypothesis of the above corollary.
Example 11. Let I be a radical ideal generated by a non trivial analytic
germ g. If hg = 0 then h 0 and we will have nite relative determination
implies determination on the right (hg 0 = h
1
(0) g
1
(0) = R
n
but
g
1
(0) is a closed set with empty interior, therefore h
1
(0) = R
n
).
Example 12. Consider in c (3) the ideal I generated by x
1
x
2
, x
1
x
3
, x
2
x
3
.
It is easy to see that I is radical and / (I) = 2. Moreover if we denote
P
1
= x
1
x
2
, P
2
= x
1
x
3
and P
3
= x
2
x
3
,we get for i ,= j, i ,= k, j ,= k that
the closure of z(P
i
) z (P
j
) z(P
k
) is a plane and does not contain z(I),
which is the union of the three axes, hence it does not satisfy the hypothesis
of Theorem 20 [K], but the conclusion is still true. We give a proof since it
is important for the converse of Corollary 8.
Proposition 13. With the above notation, if f is m-determined relative to
G
S
, where S = z(I) are the coordinate axes, then f is (2m2)-determined
on the right.
Proof. By Theorem 15 ([K]) we know that m(3)
m+1
I I
f
x
i
) which
in this case is equivalent to Im(3)
m1
I
f
x
i
). We shall show that
m(3)
2m1
f
x
i
)m(3)
2
and hence f is (2m 2)-determined on the right.
Any mixed monomial of m(3)
2m1
has a factor of I times a monomial of
degree 2m 3, hence for m 2 it is contained in the Jacobian ideal. We
now give the proof for x
2m1
1
, the other two are similar,
() x
m1
1
(x
1
x
2
) = x
1
x
2
3
j=1
f
x
j
h
1j
+ x
1
x
3
3
j=1
f
x
j
h
2j
+ x
2
x
3
3
j=1
f
x
j
h
3j
.
If we denote = x
m1
1
3
i=1
f
x
j
h
1j
we get that the zeroes of contain
x
3
= 0 and the zeroes of x
1
contain x
3
= 0x
1
= 0, hence x
1
I
=
I. From () and the denition of , x
m
1
= x
1
3
j=1
f
x
j
h
1j
+ x
1
, therefore
x
2m1
1
m(3)
2
f
x
i
) and f is (2m2)-determined.
320 LE
r
i=1
I
i
rad
r
i=1
I
i
r
i=1
rad I
i
=
r
i=1
I
i
.
Therefore the equality
r
i=1
I
i
= rad
r
i=1
I
i
.
Lemmas 15 and 16 generate a special collection of algebraic sets. They
are called bouquets of subspaces.
Example 17. Consider I c (3) the ideal generated by x and yz, hence
z(I) is the union of the yaxis and zaxis, they are not in general position
(in R
3
). By Lemma 16, I is clearly a radical ideal since I = I
1
I
2
where
I
1
= x, y) and I
2
= x, z) .
Denition 18. Let I be a nitely generated ideal of c (n), I = g
1
, . . . , g
k
) .
We say that I is integral if S = z(I) is nowhere dense.
We now arrive at the main theorem of this section.
Theorem 19. Let I be a nitely generated ideal of c (n) which is radical.
Then if f is nitely determined relative to G
S
, S = z(I), hence f is nitely
determined on the right.
Proof. Suppose I = g
1
, . . . , g
k
) and that hg
i
0 i. Therefore z(h)
z(g
i
) = R
n
i and hence z(h) z(I) = R
n
. Since I is an integral ideal,
see [R], z(h) = R
n
and hence h 0. On the other side there exists a
natural number p such that m(n)
p
I
f
x
i
)I. By Corollary 10, we get
m(n)
pk
f
x
i
) and therefore f is (pk + 1)-determined on the right.
FINITE RELATIVE DETERMINATION 321
Corollary 20. Consider I
1
, . . . , I
r
ideals each of them generated by lin-
early independent linear polynomials and S the union of their common ze-
roes (bouquet of subspaces). Then a germ f is nitely determined on the
right if and only if f is nitely determined relative to G
S
.
We nish this section with an observation about homogeneous polynomi-
als.
Proposition 21. Let h
1
, . . . , h
k
be homogeneous polynomials of degree
s
1
, . . . , s
k
respectively and let s be the maximum of these degrees. Hence
if I is the ideal generated by h
1
, . . . , h
k
we get /(I) s.
Proof. We have to show that (I m(n)
s
)m(n)
r
= I m(n)
s+r
r 0. Let
f I m(n)
s+r
, hence we have the following equalities.
f = h
1
g
1
+ . . . + h
k
g
k
()
0 = j
s+r1
f(0) = h
1
j
r+s1s
1
g
1
(0) + + h
k
j
r+s1s
k
g
k
(0). ()
Sustracting () from () we get f = h
1
g
1
+ + h
k
g
k
, where
g
i
m(n)
r+ss
i
.
Hence each
g
i
is a sum of elements of the form
h
j
i
h
j
i
, with
h
j
i
m(n)
r
and
h
j
i
is a homogeneous monomial of degree s s
i
.
Therefore f is a sum of elements of the form (h
i
h
j
i
)
h
j
i
,with (h
i
h
j
i
)
I m(n)
s
, so f (I m(n)
s
)m(n)
r
. We have shown that I m(n)
s+r
(I m(n)
s
) m(n)
r
r 0. The other inclusion is obvious.
2. Innite determinacy on germs of closed subsets of R
n
.
In this setction we will assume that S is a germ of a closed subset of R
n
such that the origin is an accumulation point of S.
Denition 22. Let S R
n
be a germ of a closed set such that
0
is an
accumulation point of S. We say that a germ f in c (n) is Sinnitely
determined if given a germ g such that j
g(x) = j
f(x) x S there
exists a germ of a dieomorphism such that g = f .
We denote by c (S, n) the ideal of c (n) consisting of the germs f such
that j
g(x) = 0
x ,= 0, x S and therefore c (S, n) c (S, n) m(n)
g(x) = j
f(x) x S. We de-
ne the homotopy F(x, t) = tg(x) + (1 t) f(x) . Consider the follow-
ing c(n + 1) modules N = c(n + 1)
f
x
i
) and K = c(n + 1)
F
x
i
). If
h N, we have h(x, t) =
n
i=1
f
x
i
(x) h
i
(x, t) =
n
i=1
F
x
i
(x, t) h
i
(x, t)+
t
n
i=1
(fg)
x
i
(x) h
i
(x, t). Since
(fg)
x
i
(x) c(S, n)
f
x
i
)c(S, n), we get
N K+c(S, n)N, and by Nakayamas lemma, N K and hence c(S R,
n + 1)
f
x
i
) c(n + 1) c(S R, n + 1)
F
x
i
)c(n + 1).
Since g f c(S R, n + 1)
f
x
i
)c(n + 1), hence
F
t
c(S R, n + 1)
F
x
i
)c(n + 1). We now proceed in the usual way.
Proposition 25. If f is a germ, nitely (innitely) determined on the right,
it is S-innitesimally stable and therefore S-innitely determined.
Proof. By our hypothesis we have
m(n)
_
f
x
i
_
and c(S, n) c(S, n)m(n)
c(S, n)
_
f
x
i
_
.
Example 26.
(a) Let f be a germ and k a natural number, denote by I
k
the ideal
generated by f
k
. Suppose c(S, n) I
k
, f
k
_
f
x
i
_
and j
f(x) ,=
0 x T, where the closure of T is S. If h c(S, n), h = f
k
g where
g c(S, n). Therefore c(S, n) c(S, n)
f
k
_
c(S, n)
_
f
x
i
_
, and
thus f is S-innitely determined.
(b) In particular let S = (x
1
, . . . , x
n
)[x
1
0. Then for the germs
f
1
(x
1
, . . . , x
n
) =
n
i=1
x
2
i
, and f
2
(x
1
, . . . , x
n
) = x
1
, we get j
f
1
(x) ,=
0 and j
f
2
(x) ,= 0 x int S. Since S = cl(int S) and c(S, n)
f
k
i
_
for i = 1, 2 (Proposition 5.4 of Chapter V, [T]), and clearly f
k
i
_
f
i
x
j
_
for i = 1, 2, we get that f
k
1
, f
k
2
are S-innitely determined. ([P-L]).
Denition 27.
(a) Let S be a closed subset of R
n
(containing the origin) and f a germ
with f
_
0
_
= 0. We say that f satises a Lojasiewicz inequality for S
if for any K, a germ of a compact set with
0
K, there exist constants
c > 0 and 0 such that [f(x)[ cd(x, S)
for all x K.
FINITE RELATIVE DETERMINATION 323
(b) Let I be a nitely generated ideal of c(n) and S the germ of its common
zeroes. We say that I is a Lojasiewicz ideal if there exists f in I
satisfying a Lojasiewicz inequality for S.
(c) Let f m(n) and S a closed subset of R
n
, we say that f satises
a Jacobi-Lojasiewicz condition for S if [f[ satises a Lojasiewicz
inequality for S.
Remark 4. If f
1
, . . . , f
s
is a set of generators of a Lojasiewicz ideal I,
then
s
i=1
f
2
i
,
s
i=1
[f
i
[ and max
_
f
2
1
, . . . , f
2
s
_
also satisfy a Lojasiewicz
inequality for S.
Denition 28. Let (b
i
) be a sequence of positive real numbers converging
to zero. We say that a sequence of real numbers (a
i
) is at along (b
i
) if
given r > 0 there exists a natural number N = N(r) such that [a
i
[ b
r
i
for i N. Sequences of vectors, matrices, jets are at along a sequence
(b
i
) if each entry is at along (b
i
). A sequence is at along a sequence (x
i
)
of nonzero vectors in R
n
converging to the vector 0 if it is at along the
sequence ([x
i
[). In the case of jets, we ask for a uniform N = N(r) for
all entries. Here we are identifying
(xx
0
)
!
with (a
).
Remark 5. We can change r > 0 for r = n, n a natural number since for
n > r, we get b
n
i
b
r
i
(0 b
i
1).
We state an interesting equivalence.
Lemma 29. A germ g does not satisfy a Lojasiewicz inequality for a closed
subset S if and only if there exists a sequence of vectors x
i
R
n
S con-
verging to the vector
0
such that (g(x
i
)) is at along (d(x
i,
S)).
Remark 6. For a germ g not identically zero we can choose g(x
i
) ,= 0 i.
Denition 30. Let S be a closed subset of R
n
. Then M(S, n) is the set of
maps : R
n
S R such that if K is a germ of a compact set and I is a
multi-index of natural numbers, there exist constants c > 0 and > 0 such
that
|I|
x
I
(x)
cd(x, S)
for all x K S.
We state the following proposition (Chapter IV, Proposition 4.2 of [T]).
Proposition 31. Let M(S, n) and f c(S, n). Then we can extend
f in a unique way to a germ in c(S, n), denoted also by f.
Theorem 32. Let f be a germ, S a germ of a closed subset of R
n
such that
0
is an accumulation point of S. Suppose that f satises a Jacobi-Lojasiewicz
condition for S. Then f is S-innitesimally stable and therefore S-innitely
determined.
Proof. Consider g = [f[
2
, we shall show that c(S, n)
_
f
x
i
_
c(S, n).
Let K be a germ of a compact subset and g
1
be a representative of g; for
324 LE
|I|
(
1
g
1
)
x
I
C
I
|g
1
(x)|
(|I|+1)
x K. Since g
1
satises a Lojasiewicz inequality for S, there exist c > 0
and 0 such that [g
1
(x)[ cd(x, S)
x KS and therefore
|I|
(
1
g
1
)
x
I
C
I
c
|I|+1
d(x,S)
(|I|+1)
x K S, hence
1
g
1
M(S, n). Now for h c(S, n) and
x / S we have h(x) =
h(x)
g
1
(x)
g
1
(x), extend
h(x)
g
1
(x)
to a germ H in c(S, n) and
h = Hg
1
in c(S, n)
_
f
x
i
_
. Therefore we get c(S, n) c(S, n)
_
f
x
i
_
and f
is S-innitesimally stable.
Lemma 33 ([W, Lemma 3.3]). Suppose there exist a sequence (w
i
) in
J
k
(n, 1), k , a sequence (x
i
) in R
n
_
0
_
converging to the origin
and a germ f such that q
i
= w
i
j
k
f(x
i
) is at along (x
i
). Then there
exists a germ g such that j
k
g(x
i
j
) = w
i
j
holds for (x
i
j
) subsequence of (x
i
),
and j
g(0) = j
f(0).
Lemma 34. Suppose there exist a sequence (w
i
) in J
k
(n, 1), k , a
sequence (x
i
) in R
n
S converging to zero and a germ f c(n) such that
(q
i
) = (w
i
j
k
f(x
i
)) is at along (d(x
i
, S)), where S is a closed subset
of R
n
(0 S). Then there exists a germ g c(n), such that j
g(x) =
j
f(x) x S and j
k
g(x
i
) = w
i
holds for a subsequence of (x
i
).
Proof. If k is nite, then we transform q
i
into an -jet in such a way that
all the terms of order greater than k of q
i
are zero. Thus we will assume
k = .
We dene Q, a Taylor eld, by q
i
at x
i
and by the zero series on S.
We want to show that Q is a C
|L|m|I|
Q
I+L
(y)
(xy)
L
L!
.
If x, y S then the proof is obvious. In the case x, y x
i
_
0
_
we
proceed as in the proof of Lemma 3.3 of [W]. If x, y = x
j
, s, s S, we
use the atness of (q
i
) along (d(x
i
, S)) to obtain for each natural number
l another N(l) such that [(R
m
s
)
I
(x
j
)[ = [q
I
j
[ d(x
j
, S)
l
d(x
j
, s)
l
and
[(R
m
x
j
)
I
(s)[
|L|m|I|
q
(I+L)
j
(sx
j
)
L
L!
Cd(x
j
, s)
l
for j N(l), where
C is a positive real number depending only on m and I. Let l = m + 1.
Hence, using Whitney Extension Theorem (Theorem 3.1 of Chapter IV,
[T]), there exists a smooth germ q such that j
q(x) = 0 x S and
j
q(x
i
) = q
i
. If g = f + q, we see that g has the desired properties.
Theorem 35. Let f be a germ, S a closed subset of R
n
and
0
an accumu-
lation point of S. Hence if f is S-innitely determined, then f satises a
Jacobi-Lojasiewicz condition for S.
FINITE RELATIVE DETERMINATION 325
Proof. We shall prove the theorem by contradiction. Then there is a se-
quence (x
j
) in R
n
S converging to the origin such that ([f(x
j
)[) is at
along (d(x
j
, S)). Choose (y
j
) a sequence of regular values of f converging
to zero and such that (f(x
j
) y
j
) is at along (d(x
j
, S)). It clearly follows
that (y
j
, 0) (f(x
j
), f(x
j
)) is at along (d(x
j
, S)).
If we denote q
j
= (y
j
, 0) (f(x
j
), f(x
j
)) and setting k = 1 in the
previous lemma, there exists a germ g such that j
1
g(x
j
) = (y
j
,
0
) and gf
c(S, n). Now since f is S-innitely determined, f and g must have the same
critical and regular values, which is not the case, since the points y
j
are
regular values for f but critical values for g.
As a consequence of Theorems 24, 32 and 35 we get the main theorem of
part II:
Theorem 36. Let f c(n). The concepts of S-innitesimally stability, S-
innite determinacy and the Jacobi-Lojasiewicz condition at S are equivalent
for the germ f and S a germ of a closed subset of R
n
with
0
an accumulation
point of S.
3. A special case.
Denition 37. Let S be a germ of a closed subset of R
n
such that
0
g(x) =
j
and
|I|
f
x
I
m(S) for all multi-index I. We also get in this case the equality m(S) =
m(n)
m(S).
326 LE
_
f
x
i
_
for some k , we
get that m(S)
_
f
x
i
_
m(S). We now use Theorem 44.
Denition 46. Let P be a polynomial in variables x
1
, . . . , x
n
. We say that
P is quasihomogeneous of degree l and weights k
1
, . . . , k
n
if
P(t
k
1
x
1
, . . . , t
k
n
x
n
) = t
l
P(x
1
, . . . , x
n
).
For P quasihomogeneous we get
P
x
j
(t
k
1
x
1
, . . . , t
k
n
x
n
) =
t
lkj P
x
j
(x
1
, . . . , x
n
).
Also if we write P =
a
I
x
I
, for a quasihomogeneous polynomial we
obtain for any multi-index I = (i
1
, . . . , i
n
), i
1
k
1
+ . . . + i
n
k
n
= l (a
I
,= 0).
Theorem 47. Let P(x) be a quasihomogeneous polynomial and S a closed
subset of R
n
containing the origin and such that S = cl(int S). Suppose that
m(S) P) and that z(P) int S = . Then P is S-innitesimally stable.
In the case S = x[P(x) 0 is a good semialgebraic set, we can skip the
equality z(P) int S = .
Proof. By hypothesis we get m(S) P) and P
_
P
x
i
_
, this together with
z(P) int S = give the result using Example 26. For the second part it is
obvious that z(P) int S = since S is a good semialgebraic set.
As in the previous section, we get the following:
Theorem 48. Let f c(n), S be a closed subset of R
n
such that the origin
is an accumulation point of S and S = cl(int S). Then the concepts for f of
S-innitesimally stability, S-stability and the Jacobi-Lojasiewicz condition
for S are equivalent.
Proof. Our Theorem 44 shows that S-innitesimally stability implies S-
stability. Now as in Theorem 32, we show that the Jacobi-Lojasiewicz con-
dition at S implies S-innitesimally stability. Since Lemma 34 is true for
any closed subset of R
n
, the proof of Theorem 35 will be true in the case
S = cl(int S), and hence S-stability implies the Jacobi-Lojasiewicz condition
of f for S.
References
[K] L. Kushner, Finite determination on algebraic sets, Trans. of the Amer. Math.
Soc., 331(2) (1992), 553-561.
328 LE
exico
E-mail address: kushner@servidor.unam.mx
Universidade Federal de S
ao Carlos
13560-905 Sao Carlos, S.P.
Brasil
E-mail address: bterra@power.ufscar.br
PACIFIC JOURNAL OF MATHEMATICS
Vol. 192, No. 2, 2000
BRAIDED-LIE BIALGEBRAS
Shahn Majid
We introduce braided Lie bialgebras as the innitesimal
version of braided groups. They are Lie algebras and Lie
coalgebras with the coboundary of the Lie cobracket an in-
nitesimal braiding. We provide theorems of transmutation,
Lie biproduct, bosonisation and double-bosonisation relating
braided Lie bialgebras to usual Lie bialgebras. Among the
results, the kernel of any split projection of Lie bialgebras is
a braided-Lie bialgebra. The Kirillov-Kostant Lie cobracket
provides a natural braided-Lie bialgebra on any complex sim-
ple Lie algebra, as the transmutation of the Drinfeld-Sklyanin
Lie cobracket. Other nontrivial braided-Lie bialgebras are as-
sociated to the inductive construction of simple Lie bialgebras
along the C and exceptional series.
1. Introduction.
Braided geometry has been developed in recent years as a natural gener-
alisation of super-geometry with the role of Z/2Z grading played by braid
statistics. It is also the kind of noncommutative geometry appropriate to
quantum group symmetry because the modules over a strict quantum group
(a quasitriangular Hopf algebra [3]) form a braided category, hence any ob-
ject covariant under the quantum group is naturally braided. In particular,
one has braided groups [6] as generalisations of super-groups or super-Hopf
algebras. The famous quantum-braided plane with relations yx = qxy is
a braided group with additive coproduct [7]. We refer to [8], [10] for in-
troductions to the 50-60 papers in which the theory of braided groups is
developed.
In a dierent direction, Drinfeld [2] has introduced Lie bialgebras as an
innitesimalisation of the theory quantum groups. This concept has led (on
exponentiation) to an extensive theory of Poisson-Lie groups, as well as to a
Yang-Baxter theoretic approach to classical results of Lie theory, such as a
new proof of the Iwasawa decomposition and the structure of Bruhat cells;
see for example [9], [5]. For an introduction to quantum groups and Lie
bialgebras, see [10].
We now combine these ideas for the rst time by introducing the inn-
itesimal theory of braided groups. All computations and results will be in
329
330 SHAHN MAJID
the setting of Lie algebras, although motivated from the theory of braided
groups. In fact, there are several dierent concepts of precisely what one
may mean by the innitesimal theory of braided groups. Firstly, one may
keep the braided category in which one works xed and look at algebras
which depart innitesimally from being commutative. In the category of
vector spaces this leads to Drinfelds notion of Poisson-Lie group. Then one
can consider the coalgebra also in an innitesimal form, which leads in the
category of vector spaces to Drinfelds notion of Lie bialgebra. In the case
of a braided category one already has the notion of braided-Lie algebra [11]
and, adding to this, one could similarly consider a Lie bialgebra in a braided
category. By contrast, we now go further and let the braiding also depart
innitesimally from the usual vector space transposition. In principle, the
degree of braiding is independent of the degree of algebra commutativity
or coalgebra cocommutativity. Thus one could have innitesimally braided
algebras, coalgebras and Hopf algebras as well. However, the case which ap-
pears to be of most interest, on which we concentrate, is the case in which all
three aspects are made innitesimal simultaneously, which we call a braided-
Lie bialgebra. The formal denition appears in Section 2. It consists of a
Lie algebra b equipped with further structure.
In Section 3 we provide the Lie version of the basic theorems from the
theory of braided groups. These basic theorems connect braided groups and
quantum groups by transmutation [12], [6] and bosonisation [13], [14] pro-
cedures, thereby establishing (for example) the existence of braided groups
associated to all simple Lie algebras. The theorems in Section 3 likewise
connect braided-Lie bialgebras with quasitriangular Lie bialgebras and es-
tablish the existence of the former. The Lie versions of biproducts [20] and
of the more recent double-bosonisation theorem [15] are covered as well. For
example, the Lie version of the theory of biproducts states that the kernel
of any split Lie bialgebra projection g f is a braided-Lie bialgebra b, and
g = b> f.
In Section 4 we study some concrete examples of braided-Lie bialgebras,
including ones not obtained by transmutation. The simplest are ones with
zero braided-Lie cobracket as the innitesimal versions of the q-ane plane
braided groups in [7]. As an application of braided-Lie bialgebras, their
bosonisations provide maximal parabolic or inhomogeneous Lie bialgebras.
Meanwhile, double-bosonisation allows the formulation in a basis-free way
of the notion of adjoining a node to a Dynkin diagram. For every simple Lie
bialgebra g and braided-Lie bialgebra b in its category of modules we obtain
a new simple Lie bialgebra b> g <b
op
as its double-bosonisation. This pro-
vides the inductive construction of all complex simple Lie algebras, complete
with their Drinfeld-Sklyanin quasitriangular Lie bialgebra structure (which
is built up inductively at the same time). Some concrete examples are given
in detail.
BRAIDED-LIE BIALGEBRAS 331
These results have been briey announced in [16, Sec. 3], of which the
present paper is the extended text. We work over a general ground eld k
of characteristic not 2.
Acknowledgements. The results were obtained during a visit in June 1996
to the Mathematics Dept., Basel, Suisse. I thank the chairman for access to
the facilities.
2. Braided-Lie bialgebras.
We will be concerned throughout with the Lie version of braided categories
obtained as module categories over quantum groups. In principle one could
also formulate an abstract notion of innitesimal braiding as a Lie version
of a general braided category, but since no examples other than the ones
related to quantum groups are known we limit ourselves essentially to this
concrete setting. Some slight extensions (such as to Lie crossed modules)
will be considered as well, later on.
As a Lie version of a strict quantum group we use Drinfelds notion of a
quasitriangular Lie bialgebra [2], [3]. We recall that a Lie bialgebra is a Lie
algebra g equipped with linear map : g g g forming a Lie coalgebra
(in the nite dimensional case this is equivalent to a Lie bracket on g
) and
being a 1-cocycle with values in g g as a g-module by the natural extension
of ad. It is quasitriangular if there exists r g g obeying dr = in the
Lie algebra complex, obeying the Classical Yang-Baxter Equation (CYBE)
[r
(1)
, r
(1)
] r
(2)
r
(2)
+ r
(1)
[r
(2)
, r
(1)
] r
(2)
+ r
(1)
r
(1)
[r
(2)
, r
(2)
] = 0
(1)
and having ad-invariant symmetric part 2r
+
= r + (r), where is trans-
position. We use the conventions and notation similar to [10, Ch. 8], with
r = r
(1)
r
(2)
denoting an element of g g (summation understood) and r
(2)
to denote
the output in g g for g (summation understood). A quasitriangular
Lie bialgebra is called factorisable if 2r
+
is surjective when viewed as a map
g
g.
In view of the discussion above, we are interested in Lie-algebraic objects
living in the category
g
/ of modules over a quasitriangular Lie bialgebra
g. If V is a g-module, we dene its innitesimal braiding to be the operator
: V V V V, (v w) = 2r
+
(v w wv)
where denotes the left action of g.
Lemma 2.1. Let b
g
/ be a g-covariant Lie algebra. Then the associated
: b b b b is a 2-cocycle Z
2
ad
(b, b b).
332 SHAHN MAJID
Proof. The proof that d = 0 is a straightforward computation in Lie alge-
bra cohomology. We use covariance of b in the form: [x, y] = [x, y] +
[x, y] for all g. Then,
(d)(x, y, z)
= ([x, y], z) + ([x, z], y) ([y, z], x) + ad
x
(y, z)
ad
y
(x, z) + ad
z
(x, y)
= 2r
+
([x, y] z + z [x, y]) + [x, 2r
+
(1)
y] r
+
(2)
z
+ 2r
+
(1)
y [x, r
+
(2)
z]
[x, 2r
+
(1)
z] r
+
(2)
y 2r
+
(1)
z [x, r
+
(2)
y] + cyclic
= [2r
+
(1)
x, y] r
+
(2)
z + 2r
+
(1)
y [x, r
+
(2)
z] [x, 2r
+
(1)
z] r
+
(2)
y
+ 2r
+
(1)
z [r
+
(2)
x, y] + cyclic = 0
on using the cyclic invariance in x, y, z and antisymmetry of the Lie bracket.
Note that this works for any element 2r
+
g g in the denition of .
Denition 2.2. A braided-Lie bialgebra b
g
/is a g-covariant Lie algebra
and g-covariant Lie coalgebra with cobracket : b b b obeying x, y b,
([x, y]) = ad
x
y ad
y
x (xy); = 2r
+
(id ),
i.e., obeys the coJacobi identity and d = .
The denition is motivated from that of a braided group, where the co-
product fails to be multiplicative up to a braiding [6]. The results in the
next section serve to justify it further.
3. Lie versions of braided group theorems.
The existence of nontrivial quasitriangular Lie bialgebra structures is known
[3] for all simple g at least over C. Our rst theorem ensures likewise the
existence of braided-Lie bialgebras.
Theorem 3.1. Let i : g f be a map of Lie bialgebras, with g quasitrian-
gular. There is a braided-Lie bialgebra b(g, f), the transmutation of f, living
in
g
/. It has the Lie algebra of f and for all x f, g,
x = x + r
(1)
xi(r
(2)
) i(r
(2)
) r
(1)
x, x = [i(), x].
Proof. We rst verify that as stated is indeed a g-module map. Thus
(x)
= [i(), x] + r
(1)
xi(r
(2)
) i(r
(2)
) r
(1)
x
= x [x, i(
(1)
)] i(
(2)
) i(
(1)
) [x, i(
(2)
)]
+ r
(1)
xi(r
(2)
) i(r
(2)
) r
(1)
x
= x [x, [i(), i(r
(1)
)]] i(r
(2)
) [x, i(r
(1)
)] [i(), i(r
(2)
)]
BRAIDED-LIE BIALGEBRAS 333
[i(), i(r
(1)
)] [x, i(r
(2)
)] i(r
(1)
) [x, [i(), i(r
(2)
)]]
+ r
(1)
xi(r
(2)
) i(r
(2)
) r
(1)
x
= x + [, r
(1)
]xi(r
(2)
) [x, i(r
(1)
)] [i(), i(r
(2)
)]
[i(), i(r
(1)
)] [x, i(r
(2)
)] + i(r
(1)
) [, r
(2)
]x
+ r
(1)
xi(r
(2)
) i(r
(2)
) r
(1)
x
= x + r
(1)
x[i(), i(r
(2)
)] + [i(), i(r
(1)
)] r
(2)
x
+ r
(1)
xi(r
(2)
) i(r
(2)
) r
(1)
x
= x
where we used the denitions of and and the fact that g is quasitriangular,
so that = [, r
(1)
] r
(2)
+ r
(1)
[, r
(2)
].
Antisymmetry of the output of is clear. Next we verify the coJacobi
identity,
(id )x
+ cyclic = (id )x + r
(1)
xi(r
(2)
) i(r
(2)
) (r
(1)
x) + cyclic
= x
(1)
r
(1)
x
(2)
i(r
(2)
) x
(1)
i(r
(2)
) r
(1)
x
(2)
+ r
(1)
xi(r
(2)
)
+ r
(1)
xi([r
(1)
, r
(2)
]) i(r
(2)
) r
(1)
xi(r
(2)
) i([r
(1)
, r
(2)
])
i(r
(2)
) r
(1)
x
(1)
x
(2)
i(r
(2)
) x
(1)
r
(1)
x
(2)
i(r
(2)
) r
(1)
r
(1)
xi(r
(2)
) i(r
(2)
) r
(1)
xi([r
(1)
, r
(2)
])
+ i(r
(2)
) i([r
(1)
, r
(2)
]) r
(1)
x + i(r
(2)
) i(r
(2)
) r
(1)
r
(1)
x + cyclic
using the denition of and the previous covariance result. Several of the
resulting terms cancel immediately. Using the quasitriangular form of on
r
(2)
and the further freedom to cyclically rotate all tensor products so that
x appears in the rst factor, our expression becomes
= r
(1)
xi([r
(2)
, r
(1)
]) i(r
(2)
) + r
(1)
xi(r
(1)
) i([r
(2)
, r
(2)
])
+ r
(1)
xi([r
(1)
, r
(2)
]) i(r
(2)
) r
(1)
xi(r
(2)
) i([r
(1)
, r
(2)
])
+ [r
(1)
, r
(1)
]xi(r
(2)
) i(r
(2)
)
+ r
(1)
xi(r
(2)
) i([r
(1)
, r
(2)
]) r
(1)
xi([r
(1)
, r
(2)
]) i(r
(2)
) + cyclic
= (( )xi i)
_
[r
(1)
, r
(1)
] r
(2)
r
(2)
+ r
(1)
[r
(2)
, r
(1)
] r
(2)
+ r
(1)
r
(1)
[r
(2)
, r
(2)
]
_
+ cyclic
= 0
by the CYBE (1).
Finally, we prove that d = . Thus,
([x, y]) = [x, y] + r
(1)
[x, y] i(r
(2)
) i(r
(2)
) r
(1)
[x, y]
334 SHAHN MAJID
= ad
x
y ad
y
x + [r
(1)
x, y] i(r
(2)
) + [x, r
(1)
y] i(r
(2)
)
i(r
(2)
) [r
(1)
x, y] i(r
(2)
) [x, r
(1)
y]
= ad
x
y ad
y
x r
(1)
y [x, i(r
(2)
)] + r
(1)
x[y, i(r
(2)
)]
+ [x, i(r
(2)
)] r
(1)
y [y, i(r
(2)
)] r
(1)
x
= ad
x
y ad
y
x 2r
+
(xy y x)
as required. We used the denitions of and .
Corollary 3.2. Every quasitriangular Lie bialgebra g has a braided version
g
g
/ by ad, the same Lie bracket, and
x = 2r
+
(1)
[x, r
+
(2)
]. (2)
Proof. We take the identity map i = id : g g and g = b(g, g). Its braided-
Lie cobracket from Theorem 3.1 is x = [x, r
(1)
] r
(2)
+ r
(1)
[x, r
(2)
] +
r
(1)
xr
(2)
r
(2)
r
(1)
x using the quasitriangular form of .
The corollary ensures the existence of non-trivial braided-Lie bialgebras
since nontrivial quasitriangular Lie bialgebras are certainly known.
Example 3.3. Let g be a nite-dimensional factorisable Lie bialgebra.
Then in Corollary 3.2 is equivalent under the isomorphism 2r
+
: g
=g
to the Kirillov-Kostant Lie cobracket on g
acquires a
natural Poisson bracket structure. Considering g as a subset of the functions
on g
.
Let K() = 2r
+
(1)
, r
+
(2)
) denote the isomorphism K : g
=g resulting
from our factorisability assumption. Then
, (K
1
K
1
)K())
= , K
1
(2r
+
(1)
)), K
1
([K(), r
+
(2)
]))
= K
1
(), 2r
+
(1)
)[, K()], K
1
(r
+
(2)
))
= [, K()], K
1
()) = K(), K
1
([, ])) = , [, ]).
We used symmetry and ad-invariance of K as an element of g g, with its
corresponding property , K
1
([, ])) = [, ], K
1
()) , , g, for
the map K : g
g.
BRAIDED-LIE BIALGEBRAS 335
Next, we give g
(1)
(2)
(dual to the Lie algebra of g) a Lie bracket and g-module structure
[, ] =
(1)
2r
+
(,
(2)
), =
(1)
(2)
, ) (3)
for all g, , g
and with 2r
+
viewed as a map g
k. Then
g
. Its Lie
cobracket is = the dual of the Lie bracket of g (since this is the same as
that of g), and its Lie bracket is dual to the Lie cobracket of g in Corollary 3.2
since
, [, ]) = ,
(1)
)2r
+
(),
(2)
) = [, 2r
+
()], )
= 2r
+
(2)
[, r
+
(1)
], ) = , )
for all g and , g
g
/. It is built on the vector space
b
with action ()(x) = (x) and Lie (co)bracket structure maps given
by dualisation.
Proof. The Jacobi and coJacobi (and antisymmetry) axioms are clear by
dualisation, as is the specied left action on b
. The proof is identical to the proof that the dual of a usual Lie bialgebra
is a Lie bialgebra (see [10] for details). Hence d = for b
by dualisation
of this relation for b.
336 SHAHN MAJID
Note also that if b
g
/ is any braided-Lie bialgebra then so is b
op/cop
with opposite bracket and cobracket, in the same category. This is because
the covariance conditions on the Lie bracket and cobracket are each linear
in those structures and hence valid even with the additional minus signs
in either case. Meanwhile, d is linear in and linear in the Lie bracket,
hence invariant when both are changed by a minus sign. The innitesimal
braiding does not involve either the bracket or cobracket and is invariant.
Applying this observation to b
=b
by
x x, so can work entirely with b
x = x [, r
(2)
] r
(1)
x r
(2)
r
(1)
x + r
(1)
xr
(2)
+ r
(1)
x[, r
(2)
]
ad
x
=
(1)
x
(2)
(1)
(2)
x
= [r
(1)
, ]xr
(2)
+ r
(1)
x[r
(2)
, ] + [r
(1)
, ] r
(2)
x
+ r
(1)
[r
(2)
, ]x.
We used the denitions and, in the last line, the form of as a quasitrian-
gular Lie bialgebra. Adding these expressions, we obtain
([, x]) ad
x + ad
x
= x x + [2r
+
(2)
, ] r
+
(1)
x + 2r
+
(2)
[r
+
(1)
, ]x = 0
BRAIDED-LIE BIALGEBRAS 337
by covariance of and ad-invariance of 2r
+
. The remaining case is
([x, y]) = ([x, y]) + r
(2)
r
(1)
[x, y] r
(1)
[x, y] r
(2)
= (ad
x
y + r
(2)
ad
x
(r
(1)
y) ad
x
(r
(1)
y) r
(2)
(xy))
(xy)
= (ad
x
y ad
x
(r
(2)
) r
(1)
y + r
(1)
y ad
x
(r
(2)
)
(xy)) (xy)
= ad
x
y ad
y
x
on writing ad
x
(r
(2)
) = r
(2)
x and comparing with the denition of . We
used the braided-Lie bialgebra property of .
The construction in the bosonisation theorem can also be viewed as a
special case of a more general construction for Lie bialgebras which are
semidirect sums as Lie algebras and Lie coalgebras by a simultaneous Lie
action and Lie coaction. We call such Lie algebras bisum Lie algebras. They
are the analogue of biproduct Hopf algebras in [20]. In the general case one
only needs covariance under a Lie bialgebra, not necessarily quasitriangular.
However, any Lie bialgebra has a Drinfeld double [2] which is quasitriangu-
lar. In order to explain these topics we need quite a bit more formalism.
Firstly, if f is any Lie coalgebra, we have a notion of left Lie coaction on a
vector space V . This is a map : V f V such that [10],
( id) = ((id ) id) (id ) . (5)
The category of left Lie comodules is denoted
g
/and is monoidal in the ob-
vious derivation-like way. Morphisms are dened as linear maps intertwining
the Lie coactions, again in the obvious way.
Lemma 3.6. Let f be a Lie bialgebra. There is a monoidal category of
Lie crossed modules
f
f
/ having as objects vector spaces V which are are
simultaneously f-modules : f V V and f-comodules : V f V
obeying f, v V ,
(v) = ([, ] id + id )(v) + ()v.
It can be identied when f is nite-dimensional with the category
D(f)
/
where D(f) is the Drinfeld double [2]. Writing (v) = v
(1)
v
(2)
, the corre-
sponding innitesimal braiding on any object V
f
f
/ is
(v w) = w
(1)
v w
(2)
v
(1)
wv
(2)
w
(2)
w
(1)
v + v
(2)
v
(1)
w.
Proof. Morphisms in
f
f
/ are maps intertwining both the Lie action and the
Lie coaction. We start with f nite-dimensional and use Drinfelds formulae
for D(f) in the conventions in [10], where it contains the Lie algebras f and
f
op
with the cross relations [, ] =
(1)
(2)
, )+
(1)
,
(2)
) for all f and
338 SHAHN MAJID
f
(2)
, ) +
(1)
v,
(2)
). Next we view the right action of f
a
f
a
e
a
+e
a
f
a
where e
a
is a basis of f and
f
a
is a dual basis. Hence the innitesimal braiding in
D(f)
/ is
(v w) = 2r
+
(v w wv)
= f
a
, v
(1)
)v
(2)
e
a
w + e
a
v f
a
, w
(1)
)w
(2)
(vw)
when the left action of f
op
is of the form given by a left coaction of f.
This gives as stated. Note that the use of a coaction to reformulate an
action of the dual in the innite dimensional case is a completely routine
procedure in Hopf algebra theory; we have given the details here since the
Lie version is less standard; the category of Lie crossed modules
f
f
/ should
not be viewed as anything other than a version of the ideas behind Drinfelds
double construction.
The resulting map is well-dened even in the innite dimensional case;
we call it the innitesimal braiding of the category
f
f
/ of crossed f-modules
and dene a braided-Lie bialgebra in
f
f
/ with respect to this.
Theorem 3.7. Let f be a Lie bialgebra and let b
f
f
/ be a braided-Lie
bialgebra. The bisum Lie bialgebra b> f has semidirect Lie bracket/cobracket
and projects onto f. Conversely, any Lie bialgebra g with a split Lie bialgebra
projection g
i
f is of this form, with b = ker and braided-Lie bialgebra
structure given by b g as a Lie algebra and
= ad
i()
, = ( id) , = (id i )
2
.
Proof. In the forward direction, since b is covariant under an action of f we
can make, as usual, a semidirect sum b>f. The bracket on general elements
of the direct sum vector space is [x , y ] = ([x, y] +y x) [, ]
as usual. On the other hand, since the Lie coalgebra of b is covariant under
a Lie coaction of f, one may make a semidirect Lie coalgebra b>f with [10]
(x ) = + x + (id ) (x) (6)
BRAIDED-LIE BIALGEBRAS 339
where is the Lie cobracket of b. The required covariance of the Lie coal-
gebra under the coaction here is
(id ) = (id (id )) ( id) (7)
and ensures that on b>f obeys the coJacobi identity.
The further covariance assumptions on b are that its Lie bracket is co-
variant under the Lie coaction and its Lie cobracket is covariant under the
Lie action. These assumptions are all needed to show that the semidirect
Lie bracket and Lie cobracket form a Lie bialgebra b> f. The case ([, ])
is clear since f is a Lie subalgebra and Lie subcoalgebra. The mixed case is
([, x]) = (x) = (x) + (x)
(1)
(x)
(2)
(x)
(2)
(x)
(1)
= x + ad
((id ) (x)) +
(1)
(2)
x
(2)
x
(1)
= ad
x ad
x
(x) = x =
ad
x
() when viewed in the Lie algebra b>f. The remaining case is
([x, y])
= ([x, y]) + (id )([x, y])
= ad
x
y + y
(2)
y
(1)
x y
(1)
xy
(2)
+ x
(1)
[x
(2)
, y]
[x
(2)
, y] x
(1)
(xy)
= ad
x
y y
(1)
xy
(2)
+ y
(1)
[x, y
(2)
] [x, y
(2)
] y
(1)
+ y
(2)
y
(1)
x (xy)
= ad
x
y + [x, y
(1)
] y
(2)
+ y
(1)
[x, y
(2)
] [x, y
(2)
] y
(1)
y
(2)
[x, y
(1)
] (xy)
= ad
x
y ad
y
x
where we used ([x, y]) = y
(1)
[x, y
(2)
] x
(1)
[y, x
(2)
] (covariance of the Lie
bracket of b under the Lie coaction) and the assumption that is a braided-
Lie bialgebra in
f
f
/ with innitesimal braiding from Lemma 3.6. We then
used the crossed module compatibility condition also from Lemma 3.6 and
x = ad
x
() to recognise the required result. It is easy to see that the
projection b> f f dened by setting elements of b to zero is a Lie bialgebra
map covering the inclusion f b> f.
In the converse direction, we assume a split projection, i.e. a surjection
: g f between Lie bialgebras covering an inclusion i : f g of Lie
bialgebras (so that i = id). We dene b = ker . Since this is a Lie
ideal, it both forms a sub-Lie algebra of g and is covariant under the action
of f given by pull-back along i of ad. Moreover, g coacts on itself by its Lie
cobracket (the adjoint coaction of any Lie bialgebra on itself) and hence
340 SHAHN MAJID
push-out along is an f-coaction = ( id) , which restricts to b since
(id )(x) = ( )x = (x) = 0 for x ker . This Lie action and Lie
coaction t together to form a Lie crossed module,
(x) = ( id) ([i(), x]) = ( id)(ad
i()
x ad
x
i())
= ([i(), x
(1)
]) x
(2)
+ (x
(1)
) [i(), x
(2)
] ([x, i()
(1)
]) i()
(2)
(i()
(1)
) [x, i()
(2)
]
= [x, (x
(1)
)] x
(2)
+ (x
(1)
) x
(2)
+
(1)
(2)
x
= ([, ] id + id )(x) + ()x
as required. We used that i is a Lie coalgebra map and a Lie algebra map,
along with x ker to kill the term with ([x, i()
(1)
]).
Finally, we give b a Lie cobracket as stated. Writing p = id i , we
have
(id ) x
= p(x
(1)
) p(p(x
(2)
)
(1)
) p(p(x
(2)
)
(2)
)
= (p p p)(id )x p(x
(1)
) p(i (x
(2)
)
(1)
) p(i (x
(2)
)
(2)
)
= (p p p)(id )x
since i is a Lie coalgebra map and p i = 0. Hence obeys the
coJacobi identity since does. Moreover, for all x, y ker ,
([x, y]) = (id i ) (id i )[x, y]
= [x, y
(1)
] y
(2)
+ y
(1)
[x, y
(2)
] [x, y
(1)
] i (y
(2)
)
i (y
(1)
) [x, y
(2)
] (xy)
since i ([x, y
(2)
]) = 0 etc., as i is a Lie algebra map. Also, from
Lemma 3.6 and the form of and antisymmetry of we have
(xy) = [i (y
(1)
), x] y
(2)
+ y
(1)
[i (y
(2)
), x] (xy).
Then,
ad
x
y ad
y
x
= [x, (id i )(y
(1)
)] (id i )(y
(2)
)
+ (id i )(y
(1)
) [x, (id i )(y
(2)
)] (xy)
= [x, y
(1)
] y
(2)
+ y
(1)
[x, y
(2)
] [x, i (y
(1)
)] y
(2)
[x, y
(1)
] i (y
(2)
)
i (y
(1)
) [x, y
(2)
] y
(1)
[x, i (y
(2)
)] (xy)
= (xy) + ([x, y])
as required. The additional terms i (y
(1)
) [x, i (y
(2)
)] etc. vanish as
i is a Lie coalgebra map and x, y ker . Hence b = ker becomes
a braided-Lie bialgebra in
f
f
/. One may then verify that the bisum Lie
BRAIDED-LIE BIALGEBRAS 341
bialgebra b> f coincides with g viewed as a direct sum bf of vector spaces
according to the projection i .
This is the Lie analogue of the braided groups interpretation [14] of Rad-
fords theorem [20]. It tells us that braided-Lie bialgebras are rather com-
mon as they arise whenever we have a projection of ordinary Lie bialgebras.
Finally, we provide the Lie analogue of the functor [17] which connects
biproducts and bosonisation.
Lemma 3.8. Let g be a quasitriangular Lie bialgebra. There is a monoidal
functor
g
/
g
g
/ respecting the innitesimal braidings. It sends an action
to a pair (, ) where = r
21
, the induced Lie coaction. The bosonisation
of b
g
/in Theorem 3.5 can thereby be viewed as an example of a biproduct
in Theorem 3.7.
Proof. We rst verify that (v) = r
(2)
r
(1)
v denes a Lie coaction for any
g-module V v. This follows immediately from the identity (id )r =
[r
(1)
, r
(1)
] r
(2)
r
(2)
holding for any quasitriangular Lie bialgebra (follow-
ing from the CYBE and = dr). Thus,
(id )(v) = r
(2)
r
(2)
r
(1)
r
(1)
v r
(2)
r
(2)
r
(1)
r
(1)
v
= ((id ) id) (id ) (v)
as required. This ts together with the given action to form a Lie crossed
module as
(v) = r
(2)
r
(1)
v = r
(2)
[r
(1)
, ]v + r
(2)
r
(1)
v
= ()v + [, r
(2)
] r
(1)
v + (id )(v)
as required, using the quasitriangular form of . More trivially, a morphism
: V W in
g
/ is automatically an intertwiner of the induced coactions
(since r
(2)
r
(1)
(v) = r
(2)
(r
(1)
v)) and hence a morphism in
g
g
/. It is
also clear that the functor respects tensor products. In this way,
g
/ is a
full monoidal subcategory.
Finally, we check that the innitesimal braidings coincide. Computing
from Lemma 3.6 in the image of the functor, we have
(v w) = r
(2)
v r
(1)
w + r
(1)
v r
(2)
w (vw)
= 2r
+
(v w wv)
as required. From the form of the Lie cobracket in the bosonisation con-
struction, it is clear that it can be viewed as a semidirect Lie coalgebra by
the induced action, i.e., it can be viewed as a nontrivial construction for
examples of bisum Lie algebras.
There is a dual theory of dual quasitriangular (or coquasitriangular) Lie
bialgebras [10] where the Lie bracket has a special form
[, ] =
(1)
r(
(2)
, ) +
(1)
r(,
(2)
), , g, (8)
342 SHAHN MAJID
dened by a dual quasitriangular structure r : g g k. This is required
to obey the CYBE in a dual form
r(,
(1)
)r(
(2)
, ) + r(
(1)
, )r(
(2)
, ) + r(,
(1)
)r(,
(2)
) = 0, , , g
(9)
and 2r
+
is required to be invariant under the adjoint Lie coaction (= , the
Lie cobracket) according to r
+
(,
(1)
)
(2)
+ r
+
(
(1)
, )
(2)
= 0. All of the
above theory goes through in this form. Thus,
g
/ has, by denition, an
innitesimal braiding dened by
(v w) = r(v
(1)
, w
(1)
)(v
(2)
w
(2)
w
(2)
v
(2)
) (10)
with respect to which we dene a braided-Lie bialgebra in
g
/. The Lie
comodule transmutation theory associates to a map f g of Lie bialgebras
with g dual quasitriangular, a braided-Lie bialgebra b(f, g)
g
/.
For example, the Lie comodule version of Corollary 3.2 is g
g
/ with
the same Lie cobracket as g, the adjoint coaction and
[, ] =
(1)
2r
+
(,
(2)
) , g. (11)
A concrete example is provided by g
coincides
with (g)
> g is isomorphic
as a Lie bialgebra to the Drinfeld double D(g).
Proof. The required isomorphism : D(g) g
> g is ()=r
(2)
, r
(1)
)
and () = for g and g
(2)
, ) +
(1)
,
(2)
) + r
(2)
, [, r
(1)
])
BRAIDED-LIE BIALGEBRAS 343
= (
(1)
(2)
, ) +
(1)
,
(2)
)) = ([, ])
where [ , ]
bos
is the Lie bracket of g
(2)
, ) for g
bos
()
=
bos
r
(2)
, r
(1)
)
= + r
(2)
r
(1)
r
(1)
r
(2)
r
(2)
r
(2)
, [r
(1)
, r
(1)
])
= + r
(2)
(1)
(2)
, r
(1)
)
(1)
(2)
, r
(1)
) r
(2)
r
(2)
r
(2)
, [r
(1)
, r
(1)
])
= (
(1)
r
(2)
(1)
, r
(1)
)) (
(2)
r
(2)
(2)
, r
(1)
)) = ( )
using the Lie cobracket
bos
on g
(2)
, x) + 2r
+
(1)
, r
+
(2)
x)
x = x + r
(2)
r
(1)
x r
(1)
xr
(2)
,
= + r
(2)
r
(1)
r
(1)
r
(2)
x b, g and c. Here x = x
(1)
x
(2)
.
Proof. Here b, g clearly form the bosonisation Lie bialgebra b> g from The-
orem 3.5. In the same way, we recognise g <c
op
as the bosonisation of c
op
as a braided-Lie bialgebra in the category of g-modules with opposite in-
nitesimal braiding (see the remark below Lemma 3.4). Since these are
already known to form Lie bialgebras, the coJacobi identity for the double-
bosonisation holds, as well as the 1-cocycle axiom for all cases except ([x, ])
mixing b, c. We outline the proof of this remaining case. From the denition
of b> g <c
op
, we have
([x, ])
= (x
(1)
, x
(2)
) +
(1)
(2)
, x) + 2r
+
(1)
, r
+
(2)
x))
= x
(1)(1)
x
(1)(2)
, x
(2)
) + r
(2)
r
(1)
x
(1)
, x
(2)
) r
(1)
x
(1)
r
(2)
, x
(2)
)
+
(1)(1)
(1)(2)
(2)
, x) + r
(2)
(1)
r
(1)
(2)
, x) r
(1)
r
(2)
(1)
(2)
, x)
+ 2r
+
(1)
, r
+
(2)
x)
ad
x
= [x,
(1)
]
(2)
+
(1)
[x,
(2)
] + [x, r
(2)
] r
(1)
BRAIDED-LIE BIALGEBRAS 345
+ r
(2)
[x, r
(1)
] [x, r
(1)
] r
(2)
r
(1)
[x, r
(2)
]
= x
(1)
(1)
, x
(2)
)
(2)
+
(1)(1)
(1)(2)
, x)
(2)
+ 2r
+
(1)
(1)
, r
+
(2)
x)
(2)
+
(1)
x
(1)
(2)
, x
(2)
) +
(1)
(2)(1)
(2)(2)
, x)
+
(1)
2r
+
(1)
(2)
, r
+
(2)
x) r
(2)
r
(1)
x + r
(1)
xr
(2)
+ [x, r
(2)
] r
(1)
r
(1)
[x, r
(2)
].
In a similar way, one has
ad
x
= x
(1)(1)
, x
(1)(2)
) x
(2)
+
(1)
(2)
, x
(1)
) x
(2)
+ 2r
+
(1)
, r
+
(2)
x
(1)
) x
(2)
+ x
(1)
x
(2)(1)
, x
(2)(2)
) + x
(1)
(2)
, x
(2)
)
(1)
+ x
(1)
2r
+
(1)
, r
+
(2)
x
(2)
)
+ r
(2)
r
(1)
x r
(1)
xr
(2)
+ r
(2)
[r
(1)
x, ] [r
(1)
x, ] r
(2)
.
Adding the latter two expressions and comparing with ([x, ]) we see that
the terms of the form r
(2)
r
(1)
x etc. immediately cancel, the terms of
the form
(1)
(2)
, x
(2)
) x
(2)
etc. (involving Lie cobrackets of both x and
) cancel by antisymmetry of the Lie cobrackets, and the terms of the form
x
(1)(1)
, x
(1)(2)
) x
(2)
etc. (involving iterated Lie cobrackets of either x or )
cancel using antisymmetry of the Lie cobrackets and the coJacobi identity
(id ) + cyclic = 0 for b and c. Hence the 1-cocycle identity for this case
reduces to the more manageable
r
(2)
r
(1)
x
(1)
, x
(2)
) + r
(2)
(1)
r
(1)
(2)
, x)
+ [2r
+
(1)
, r
(1)
] r
(2)
, r
+
(2)
) ip
= 2r
+
(1)
(1)
, r
+
(2)
x)
(2)
+ 2r
+
(1)
, r
+
(2)
x
(1)
) x
(2)
+ [x, r
(2)
] r
(1)
[r
(1)
x, ] r
(2)
ip
where -ip means to subtract all the same expressions with the opposite
tensor product. We used antisymmetry of the Lie cobrackets and the qua-
sitriangular of g for r
+
(1)
. One then has to put in the stated denitions of
the Lie brackets [x, r
(2)
] and [r
(1)
x, ] and use g-covariance of the pairing,
and of the braided-Lie brackets and cobrackets to obtain equality.
Note that by comparing the Lie bosonisation formulae with the braided
group case, we can read o the Lie double-bosonisation formulae from the
braided group case given in the required left-module form in the appendix
of [18]. The only subtlety is that in the Lie case we can eliminate the
categorical pairing ev (corresponding to the categorical dual b
in the nite-
dimensional case): c, b are categorically paired by ev : c b k i , ) =
ev is a (g-equivariant) ordinary duality pairing. Then one obtains the [x, ]
346 SHAHN MAJID
relations as stated. Finally, in [15] it is explicitly shown that the double-
bosonisation is built on the tensor product vector space. The analogous
arguments now prove that the Lie double bosonisation is built on the direct
sum vector space.
Proposition 3.11. Let b
g
/ be a nite-dimensional braided-Lie bialge-
bra with dual b
a
f
a
e
a
,
where e
a
is a basis of b and f
a
is a dual basis, and r is the quasitrian-
gular structure of g. If g is factorisable then so is the double-bosonisation.
Proof. We show rst that the Lie cobracket of the double-bosonisation has
the form = dr
new
. With summation over a understood, we have
[, r
(1)
new
] r
(2)
new
+ r
(1)
new
[, r
(2)
new
]
= [, r
(1)
] r
(2)
+ r
(1)
[, r
(2)
] [, f
a
]
b
e
a
+ f
a
[, e
a
]
= [, f
a
]
b
e
a
r
(1)
r
(2)
r
(1)
r
(2)
f
a
e
a(1)
, e
a(2)
) f
a
(1)
(2)
, e
a
) f
a
2r
+
(1)
, r
+
(2)
e
a
)
= r
(1)
r
(2)
r
(1)
r
(2)
+ 2r
+
(2)
r
+
(1)
=
as required. Here [f
a
, ]
b
e
a
= f
a
e
a(1)
, e
a(2)
) since both evaluate
against x b to x
(1)
, x
(2)
). The sux b
to
(1)
(2)
, x). Since the Lie cobracket of the double-bosonisation is antisym-
metric, we conclude also that 2r
+new
is ad-invariant.
Finally, we verify the CYBE for r
new
. Actually, once = dr
new
has been
established, the CYBE is equivalent to
( id)r
new
= r
(1)
new
r
(1)
new
[r
(2)
new
, r
(2)
new
]
(see [10]). Note that
f
a
e
a
= f
a
f
b
[e
a
, e
b
]
BRAIDED-LIE BIALGEBRAS 347
(sum over a, b) since evaluation against x, y b gives [x, y] in both cases.
Then
( id)r
new
= ( id)r + f
a
e
a
= r
(1)
r
(1)
[r
(2)
, r
(2)
] + f
a
f
b
[e
a
, e
b
]
+ r
(2)
f
a
r
(1)
e
a
r
(1)
r
(2)
f
a
e
a
= r
(1)
r
(1)
[r
(2)
, r
(2)
] + f
a
f
b
[e
a
, e
b
]
f
a
r
(1)
[r
(2)
, e
a
] + r
(1)
f
a
[r
(2)
, e
a
]
as required. We used g-covariance of the pairing, so that f
a
e
a
=
f
a
e
a
= f
a
[, e
a
] for all g.
If g is factorisable then 2r
+new
as a map (b> g <b
op
)
b> g <b
op
has g in its image, by restricting to g. It has b in its image by restricting
to b, and b
. So the double-bosonisation
is again factorisable. Explicitly, if we denote by K the bilinear form on g
corresponding to the inverse of 2r
+
as a map, we have
K
new
(x , y ) = , x) + K(, ) +, y).
. Consider I b g C b
an ideal of the
double-bosonisation. Let I
b
, I
b
, I
g
, I
C
be the components of I in the direct
sum. By the relation [, x] = x, I
b
b is a subrepresentation under g.
Since b is irreducible, I
b
is either zero or b. Similarly for I
b
. Likewise I
g
is
zero or g as g is simple. Finally, I
C
is zero or C by linearity. We therefore
have 16 possibilities to consider for whether C, g, b, b
, x] = y, [X
, y] = 0
[H, x] = x, [H, y] = y, [, H] = 0, [, X
] = 0,
[, x] = x, [, y] = y
where x, y are a basis of C
2
and H, X
=
1
2
X
H, x =
1
2
x h
350 SHAHN MAJID
where h =
1
2
H
3
2
and = (id ) .
Proof. Note that we work over C, but there is are natural real forms justify-
ing the notation. Here
2
b is the 1-dimensional (i.e., spin 0) representation
of su
2
. The standard quasitriangular structure of su
2
is
r =
1
4
HH + X
+
X
.
Then c = r
+
(1)
r
+
(2)
is twice the quadratic Casimir in its usual normalisation.
Hence its value in the (2j +1) dimensional (i.e., spin j) irreducible represen-
tation is j(j+1). In the present case, we have = 0.(0+1)2.
1
2
(
1
2
+1) =
3
2
in Proposition 4.1. We therefore make the central extension to g and apply
Theorem 3.5. The Lie algebra of the bosonisation is given by the action of g.
Its explicit form in the representation (X
+
) =
_
0 1
0 0
_
, (X
) =
_
0 0
1 0
_
and (H) =
_
1 0
0 1
_
is X
+
x = 0, X
+
y = x, X
x = y, X
y = 0,
Hx = x and Hy = y, giving the Lie bracket stated. The Lie cobracket
is x = 0+ r
(2)
r
(1)
x =
1
4
HHx+
3
4
x =
1
2
xh as stated. We identify
X
= X
1
, H = H
1
as a sub-Lie algebra of su
3
and x = X
2
, h = H
2
as the
remaining Chevalley generators of its standard maximal parabolic. Finally,
let b
, ] = 0, [X
, ] =
[H, ] = , [H, ] = , [, ] = , [, ] = .
Among the further b, b
k
ijk
x
k
, [, x
j
] = x
j
BRAIDED-LIE BIALGEBRAS 351
where i, j, k = 1, 2, 3 and is the totally antisymmetric tensor with
123
= 1.
Here e
i
are the vector basis of so
3
. The Lie cobracket is
e
1
= e
1
e
3
, e
2
= e
2
e
3
, e
3
= 0, = 0,
x
1
= (e
1
+ e
2
) x
3
+ x
2
e
3
+ x
1
x
2
= x
3
(e
1
e
2
) + e
3
x
1
+ x
2
,
x
3
= (e
1
e
2
) x
2
+ x
1
(e
1
+ e
2
) + x
3
.
Proof. Here
2
b is also the 3-dimensional (i.e., spin 1) representation. Hence,
from the rst part of the proof of Proposition 4.1, we have = 1.(1 + 1)
2.1.(1 +1) = 2. The Lie algebra so
3
in the vector basis is [e
1
, e
2
] = e
3
and
cyclic rotations of this, and the Drinfeld-Sklyanin quasitriangular structure
in this basis is [10, Ex. 8.1.13]
r =
i
e
i
e
i
+ (e
1
e
2
e
2
e
1
).
We add to give the quasitriangular structure r. The action on C
3
with basis x
i
is [e
1
, x
2
] = x
3
and cyclic rotations of this. This immedi-
ately provides the Lie algebra of the bosonisation. The Lie cobracket from
Theorem 3.5 is
x
i
= e
2
[e
1
, x
i
] e
1
[e
2
, x
i
] +
j,k
e
j
ijk
x
k
+ x
i
with computes as stated.
This example is manifestly the Lie algebra of motions plus dilation of R
3
,
as a sub-Lie algebra of the conformal Lie algebra so(1, 4), equipped now
with a Lie bialgebra structure. At the level of complex Lie algebra, it is
the maximal parabolic of so
5
. The generator is called the dilaton in the
corresponding quantum groups literature. We likewise obtain natural maxi-
mal parabolics for the whole ABD series by bosonisation of the fundamental
representation b.
On the other hand, these steps for other Lie algebras can involve less
trivial braided-Lie bialgebras b (with non-zero bracket and cobracket). The
general case is as follows. We consider simple Lie algebras g associated
to root systems in the usual conventions. Positive roots are denoted ,
with length d
and Cartan
generators H
i
corresponding to the simple roots
i
. We dene d
i
n
i
d
i
H
i
if =
i
n
i
i
. We take the Drinfeld-Sklyanin quasitriangular
structure in its general form
r =
+
1
2
ij
A
ij
H
i
H
j
, (12)
352 SHAHN MAJID
where A
ij
= d
i
(a
1
)
ij
. Here a is the Cartan matrix. The corresponding Lie
cobracket is X
i
=
d
i
2
X
i
H
i
and H
i
= 0 on the generators.
Proposition 4.5. Let i
0
be a choice of simple root such that its deletion
again generates the root system of a simple Lie algebra, g
0
. Let b
g
be the standard (negative) Borel and let f b
f, (H
i
) = H
i
, (X
) =
_
0 if contains
i
0
X
else
is a split Lie bialgebra projection. Then b = ker is the Lie ideal generated
by X
i
0
in b
=
d
2
X
+=
c
,
X
where the sum is over positive root , adding up to and the c are
constants. The proof is by induction (being careful about signs). From the
Lie bialgebra cocycle axiom and the induction hypothesis,
([X
i
, X
])
=
d
2
[X
i
, X
] H
+
d
2
X
[X
i
, H
] +
+=
c
,
[X
i
, X
] X
+=
c
,
X
[X
i
, X
]
d
i
2
[X
, X
i
] H
i
d
i
2
X
i
[X
, H
i
]
=
d
+
i
2
[X
i
, X
] H
+
i
+ (d
i
H
i
)X
i
X
+=
c
,
[X
i
, X
] X
+=
c
,
X
[X
i
, X
]
if +
i
is a positive root. We used the identities [d
, X
i
] = (d
i
H
i
)X
i
and [d
i
H
i
, X
] = (d
i
H
i
)X
,
so this becomes a sub-Lie bialgebra of g (this is well-known). Moreover, if
does not involve
i
0
then neither can positive , such that + = .
Hence f is a Lie sub-bialgebra of b
.
BRAIDED-LIE BIALGEBRAS 353
Finally, is clearly a Lie algebra map by considering the cases separately.
For elements of f f we know that [ , ] = [ , ] = [( ), ( )] since f is
closed, while if involves
i
0
then so does + and ([X
, X
]) = 0 =
[(X
), (X
where contains
i
0
.
The braided-Lie cobracket of b from Theorem 3.7 is
X
c
,
X
,
the part of the Lie cobracket X
) =
2
H
c
,
X
) = 0. Then b = n
) = (, )X
,
hX
= (h), (X
) =
d
2
H
,
BRAIDED-LIE BIALGEBRAS 355
for all h t. The coaction here is induced from the action as in Lemma 3.8,
where t is a quasitriangular Lie algebra with zero bracket, zero cobracket
and r =
1
2
ij
A
ij
H
i
H
j
. In this way we can also view n
t
/ and
g = n
> t <n
+
via Theorem 3.10, where we identify (n
)
op
= n
+
via the
Killing form.
5. Concluding remarks.
We have given here the basic theory of braided-Lie algebras, obtained by
innitesimalising the existing theory of braided groups. We also outlined in
Section 4 its application to the inductive construction of simple Lie algebras
with their standard quasitriangular structures. Further variations of these
constructions are certainly possible, and by making them one should be able
to also obtain the other strictly quasitriangular Lie bialgebras structures
in the Belavin-Drinfeld classication [1]. For example, there is a twisting
theory of quantum groups [4] and braided groups [19]. An innitesimal
version of the latter would allow one to introduce additional twists at each
stage of the inductive construction of the simple Lie algebra.
Also, although we have (following common practice) named our Lie alge-
bras by their natural real forms, our Lie algebras in Section 4 were complex
ones. There is a theory of -braided groups (real forms of braided groups)
as well as their corresponding bosonisations and double-bosonisations [19],
[18]. The innitesimal version of these should yield, for example, so(1, 4)
as a real form arising from the double-bosonisation of the 3-dimensional
braided-Lie bialgebra in Example 4.4. The construction of natural compact
real forms and the classication of real forms would be a further goal. These
are some directions for further work.
Finally, just as Lie bialgebras extend to Poisson-Lie groups, so braided-Lie
bialgebra structures typically extend to the associated Lie group B of b, at
least locally. First, one needs to exponentiate Z
2
ad
(b, b b) to a group
cocycle Z
2
Ad
(B, b b). Since d = , we should likewise exponentiate
to the group as a map D : B b b with coboundary , and dene
from this a braided-Poisson bracket. The latter will not, however, respect
the group product in the usual way but rather up to a braiding obtained
from . Details of these braided-Poisson-Lie groups and the example of the
Kirillov-Kostant braided-Poisson bracket from Example 3.3 extended to the
group manifold (e.g., to SU
2
) will be developed elsewhere.
References
[1] A.A. Belavin and V.G. Drinfeld, On the solutions of the classical Yang-Baxter equa-
tions for simple Lie algebras, Func. Anal. Appl., 16 (1982), 1-29.
356 SHAHN MAJID
[2] V.G. Drinfeld, Hamiltonian structures on Lie groups, Lie bialgebras and the geometric
meaning of the classical Yang-Baxter equations, Sov. Math. Dokl., 27 (1983), 68.
[3] , Quantum groups, in Proceedings of the ICM, A. Gleason, editor, AMS,
Rhode Island, (1987), 798-820.
[4] , Quasi-Hopf algebras, Leningrad Math. J., 1 (1990), 1419-1457.
[5] J-H. Lu and A. Weinstein, Poisson Lie groups, dressing transformations and Bruhat
decompositions, J. Di. Geom., 31 (1990), 501-526.
[6] S. Majid, Braided groups, J. Pure and Applied Algebra, 86 (1993), 187-221.
[7] , Braided momentum in the q-Poincare group, J. Math. Phys., 34 (1993),
2045-2058.
[8] , Algebras and Hopf algebras in braided categories, Lec. Notes in Pure and
Appl. Math., 158 (1994), 55-105, Marcel Dekker.
[9] , Matched pairs of Lie groups associated to solutions of the Yang-Baxter equa-
tions, Pac. J. Math., 141 (1990), 311-332.
[10] , Foundations of Quantum Group Theory, Cambridge Univeristy Press, 1995.
[11] , Quantum and braided Lie algebras, J. Geom. Phys., 13 (1994), 307-356.
[12] , Transmutation theory and rank for quantum braided groups, Math. Proc.
Camb. Phil. Soc., 113 (1993), 45-70.
[13] , Cross products by braided groups and bosonization, J. Algebra, 163 (1994),
165-190.
[14] , Braided matrix structure of the Sklyanin algebra and of the quantum Lorentz
group, Commun. Math. Phys., 156 (1993), 607-638.
[15] , Double bosonisation and the construction of U
q
(g), Math. Proc. Camb. Phil.
Soc., 125 (1999), 151-192.
[16] , Braided geometry and the inductive construction of Lie algebras and quantum
groups, in Deformation Theory and Symplectic Geometry (S. Gutt et al., eds),
Kluwer, (1997), 339-344.
[17] , Doubles of quasitriangular Hopf algebras, Commun. Algebra, 19(11) (1991),
3061-3073.
[18] , Braided geometry of the conformal algebra, J. Math. Phys., 37 (1996), 6495-
6509.
[19] , Quasi- structure on q-Poincare algebras, J. Geom. Phys., 22 (1997), 14-58.
[20] D. Radford, The structure of Hopf algebras with a projection, J. Algebra, 92 (1985),
322-347.
Received May 26, 1998. Royal Society University Research Fellow and Fellow of Pembroke
College, Cambridge. Most of the paper was written when I was on Leave 1995-1996 at
the Dept. of Mathematics, Harvard University, Cambridge, MA 02138.
University of Cambridge
Cambridge CB3 9EW
United Kingdom
E-mail address: majid@damtp.cam.ac.uk
PACIFIC JOURNAL OF MATHEMATICS
Vol. 192, No. 2, 2000
REDUCIBLE DEHN SURGERY AND ANNULAR DEHN
SURGERY
Ruifeng Qiu
Let M be a compact, orientable, irreducible, -irreducible,
anannular 3-manifold with one component T of M a torus.
Suppose that r
1
and r
2
are two slopes on T. In this paper, we
shall show that if M(r
1
) is reducible while M(r
2
) contains an
essential annulus, then (r
1
, r
2
) 2.
1. Introduction.
Let M be a compact, orientable, irreducible, -irreducible, anannular 3-
manifold with one component T of M a torus. A slope r on T is a T-
isotopy class of essential, unoriented, simple closed curves on T, and the
distance between two slopes r
1
and r
2
, denoted by (r
1
, r
2
), is the minimal
geometric intersection number among all the curves representing the slopes.
For a slope r on T, we denote by M(r) the surgered manifold obtained by
attaching a solid torus J to M along T so that r bounds a disk in J. Now
consider two distinct slopes r
1
, r
2
on T. There are many results showing
how constraints on the topology of M(r
1
) and M(r
2
) put constraints on
(r
1
, r
2
). For example, C. Gordon and J. Luecke [5] have shown that if
M(r
1
) and M(r
2
) are reducible, then (r
1
, r
2
) 1. C. Gordon [3] has
shown that if M contains no essential torus, and M(r
i
) contains an essential
torus, i = 1, 2, then (r
1
, r
2
) 8. Y-Q Wu [8] has shown that if M(r
1
) and
M(r
2
) are -reducible, then (r
1
, r
2
) 1. In this paper, we shall estimate
(r
1
, r
2
) when M(r
1
) is reducible, and M(r
2
) contains an essential annulus.
The main result is the following theorem:
Theorem 1. Let M be a compact, orientable, irreducible, -irreducible,
anannular 3-manifold with one component T of M a torus. If r
1
and r
2
are two slopes on T, such that M(r
1
) is reducible while M(r
2
) contains an
essential annulus, then (r
1
, r
2
) 2.
An example has been given by Hayashi and Motegi [6] showing that the
bound 2 in Theorem 1 is the best possible in general.
Another proof of Theorem 1 has been obtained independently by Y-Q
Wu.
357
358 RUIFENG QIU
2. Scharlemann cycle and parallel edges.
In what follows, we shall assume that M(r
1
) is reducible, and M(r
2
) contains
an essential annulus. We may assume further that M(r
2
) is irreducible, -
irreducible (see [5], [7]).
Let V
i
be the solid torus attached to M in forming M(r
i
), i = 1, 2.
Consider the family of essential 2-spheres in M(r
1
) which intersect V
1
in a
family of meridional discs, and let S M(r
1
) be such a 2-sphere chosen
so that S V
1
has the minimal number, say n
1
, of components. Similarly,
let A M(r
2
) be an essential annulus which intersects V
2
in a collection
of meridian discs, the number of which, say n
2
, is minimal among all such
annuli. By assumptions, n
1
> 2 and n
2
> 0.
Now suppose that F
1
= M S and F
2
= M A. Then F
1
is an es-
sential planar surface in M with boundary slope r
1
while F
2
is an essential
punctured annulus in M with boundary slope r
2
. We may assume that the
number of components of F
1
F
2
is minimal subject to these conditions.
Then no circle component of F
1
F
2
bounds a disk in F
1
or F
2
, and no arc
component of F
1
F
2
is boundary parallel in F
1
or F
2
. Each component of
F
i
lying in T is called an inner component of F
i
, i = 1, 2.
Let
1
(
2
) be the graph in S(A) obtained by taking the arc components
of F
1
F
2
as edges and taking the inner components of F
1
( F
2
) as fat
vertices.
We shall use the indices and to denote 1 or 2, with the convention
that, when they are used together,
,
1, 2
.
Number the inner components of F
,
1
F
, . . . ,
n
, so that they
appear consecutively on T. By construction, each inner component
i
F
of
F
of F
in exactly (r
1
, r
2
) points.
The ends of the edges in
1, 2, . . . , n
i
F
, such that
x e
i
F
k
F
, . . . , 1, . . . , n
(repeated (r
1
, r
2
) times).
Now x an orientation on F
of F
are said to be
parallel if they, when given the induced orientation by F
, are homologous on
T; otherwise they are antiparallel. Two vertices of
if and only if it
connects antiparallel vertices of
.
Two edges of
, bound a disk in F
. A cycle in
is a subgraph of
homeomorphic
to a circle. The length of a cycle is the number of edges contained in it. A
REDUCIBLE DEHN SURGERY AND ANNULAR DEHN SURGERY 359
cycle in
=
e
1
, e
in
in
such that e
i
and e
i
are adjacent parallel edges
in
, i = 1, 2.
Let x be a vertex of
. An edge of
the subgraph of
consisting of
all the vertices of
. A
disk face D of
x
is called an x-face.
Lemma 2.1. Let W be a compact, irreducible, -irreducible 3-manifold,
and let A
0
be a non-separating annulus properly embedded in W. Then
A
0
is essential in W.
Proof. Let D be a compressing disk of A
0
. If one component of A
0
is essen-
tial on W, then W is -reducible, a contradiction. If the two components
of A
0
are trivial on W, then W contains a non-separating 2-sphere. Thus
W is reducible, a contradiction.
Now let D be a -compressing disk of A
0
, such that D = a b, where
a is an arc on A
0
, and b is an arc on W. If the two components of A
0
bounds an annulus A
1
on W, and b A
1
, then A
0
A
1
is non-separating
in W, and the surface obtained by doing a 2-surgery on A
0
A
1
along D is
a non-separating 2-sphere in W. Thus W is reducible, a contradiction. If
not, then the band connected sum of the two components of A
0
along b
on W, say C, bounds a disk in W, and C is essential on W. Thus W is
-reducible, a contradiction.
Lemma 2.2. If
is separating.
Proof. By Lemma 2.1 of [5], F
1
is separating when
2
contains a Scharle-
mann cycle.
Now let be a Scharlemann cycle of
1
with label pair
1, 2
, D be the
disk face bounded by in
1
, and let A
1
be the annulus bounded by
1
F
2
and
2
F
2
on T, such that the interior of A
1
is disjoint from A. Let D
i
be
the disks in A bounded by
i
F
2
, and let T
= (AD
1
D
2
) A
1
. Then T
along D, then A
is an annulus in M(r
2
), such that |A
V
2
| < n
2
. If
F
2
is non-separating, then A
is
essential, contradicting the minimality of n
2
.
Lemma 2.3. Let be a Scharlemann cycle of
1
, then the edges in can
not lie in a disk of A.
Proof. Suppose, otherwise, that the edges in lie in a disk of A. Then
M(r
2
) contains a lens space as a factor (by the proof of Lemma 2.8 of [1]).
360 RUIFENG QIU
Since M(r
2
) = , M(r
2
) is reducible, contradicting our assumptions on
M(r
2
).
Proposition 2.4.
2
, with label pairs
x, y
and
x
, y
, y
. By Lemma 2.2, n
2
is even.
Now consider the edges of
1
and
2
as they lie in
2
, joining the vertices
x, y and x
, y
be an annulus containing x
, y
is contained in intA. If
x, y
, y
= , then we
may assume E E
= .
Now let D be the face of
1
bounded by
1
, let H be the 3-cell in V
2
between the consecutive meridional disks of V
2
corresponding to x and y,
and let N be a regular neighborhood of E H D in M(r
2
). Then the
frontier of N is an annulus A
and
1
A
be the face of
1
bounded by
2
, let H
be the 3-cell in V
2
between
the consecutive meridional disks of V
2
corresponding to x
, y
, and let N
be a regular neighborhood of E
is an
annulus A
and
2
A
and D
0
[1, 1]
along
2
A [1, 1], say M
1
, is a punctured lens space whose fundamental
group has order the length of
2
, where D
0
is a disk. We may assume that
N N
= (moving N slightly o A if
x, y
, y
= ). We claim
that A
is essential in M(r
2
).
Suppose, otherwise, that A
is incompressible in M(r
2
). Now let D
1
be a -compressing
disk of A
, such that D
1
= a b, a M(r
2
), and b A
.
Case 1. a
1
A[1, 1].
REDUCIBLE DEHN SURGERY AND ANNULAR DEHN SURGERY 361
Now either M(r
2
) is reducible, or the union of N and D
0
[1, 1] along
1
A[1, 1] is a 3-cell, a contradiction.
Case 2. a M(r
2
)
1
A(1, 1).
If M
1
A(1, 1) is not an annulus, then either M(r
2
) is reducible, or
M(r
2
) is -reducible, a contradiction. If M
1
A(1, 1) is an annulus,
then M(r
2
) is a torus, and the union of M(r
2
) intN and D
0
[1, 1]
along M
1
A (1, 1) is a 3-cell, but it contains M
1
as a factor, a
contradiction.
By construction, |A
V
2
| = |AV
2
| 2, contradicting the minimality of
n
2
.
Lemma 2.5. (1)
2
contains no extended S-cycle.
(2)
2
contains at most n
1
/2+1 mutually parallel edges connecting parallel
vertices.
(3)
2
contains at most n
1
1 mutually parallel edges.
(4) If
1
contains a great cycle, then
1
contains a Scharlemann cycle.
(5) If n
3, and
2
, then the edges of
1
are disjoint from the edges of
2
.
Proof. (1) is Lemma 2.3 of [9]. (2) is Lemma 2.4 of [9]. (3) is Lemma
2.6 of [1]. See also [4, Proposition 1.3]. (4) is Lemma 2.6.2 of [2]. (5)
Suppose, otherwise, that one edge of
1
is contained in
2
. Then n
= 2, a
contradiction.
Lemma 2.6. Let y be a vertex of
.
(1) If
contains
a Scharlemann cycle.
(2) If F
is separating, and
contains a
Scharlemann cycle in f.
Proof. (1) Suppose that
contains a Scharle-
mann cycle. (2) is Lemma 2.2 of [5].
3. Reduced graph.
Let G be a graph in a surface S. The reduced graph of G is the graph
obtained from G by amalgamating each complete set of mutually parallel
edges of G to a single edge.
Lemma 3.1. One of
1
and
2
satises
(). Each vertex is incident to an edge connecting it to an antiparallel
vertex.
This follows immediately from the proof of [9, Lemma 2.6].
362 RUIFENG QIU
Let G
be the subgraph of
and the
edges connecting parallel vertices. We rst suppose that
2
has property
(). A component F
of G
2
is called an extremal component if there is a disc
D in
A such that DG
2
= F
, where
A is the 2-sphere obtained by capping
o the two components of A with disks. In this case F = D
2
is a
graph in D. If e is an edge in
2
connecting a vertex of F
to an antiparallel
vertex, then e D is an edge of F connecting that vertex to D. Such an
edge is called a boundary edge of F. Property () means that each vertex
of F belongs to a boundary edge.
Lemma 3.2. Let be a graph in a disk with no 1-sided disk face or two
sided disk face, such that every vertex of belongs to a boundary edge, then
either contains only one vertex, or there are at least two vertices of valency
at most 3, each of which belongs to a single boundary edge.
This follows immediately from the proof of [2, Lemma 2.6.5].
Lemma 3.3. If
2
has property (), then there exists at least one vertex of
2
, such that among the families of ends around it, there are at most two
families which are ends of edges connecting it to parallel vertices. Further-
more, if there are two such families, they are successive.
Proof. Since G
2
contains at least two extremal components, there is an ex-
tremal component F
of G
2
, such that the correspond disc D of F
in
A
contains at most one component of A, say
1
A, and the remaining com-
ponent of A is disjoint from D. Thus F contains at most one 1-sided disk
face. Furthermore, if F contains a 1-sided disk face f
1
, then
1
A intf
1
.
Let
F be the reduced graph of F in D. Let S be the graph obtained from
contains only
one vertex v, then v has valency 3 in
F
0
, and it belongs to a boundary edge.
If F
contains two vertices, then the one which does not belong to the edge
bounding f
1
, has valency at most 3 in
F
0
. Assume now that F
contains at
least three vertices.
REDUCIBLE DEHN SURGERY AND ANNULAR DEHN SURGERY 363
1
A
v
l l
.
........................
.......................
.......................
.......................
.......................
......................
.
......................
.......................
.......................
.......................
.......................
........................ . ........................
.......................
.......................
.......................
.......................
......................
.
......................
.......................
.......................
.......................
.......................
........................
.
. . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . .
.
. . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . .
.
. . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . .
.
.................
.................
.................
.................
.
................
...............
................
.................
.
.................
.................
.................
.................
.
................
...............
................
.................
.
. . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . .
.
. . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . .
.
. . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . .
.
. . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . .
Figure 1.
.
...................................................................
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
.
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
.
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
v
1
v
2
l
.
........................
.......................
.......................
.......................
.......................
......................
.
......................
.......................
.......................
.......................
.......................
........................ . ........................
.......................
.......................
.......................
.......................
......................
.
......................
.......................
.......................
.......................
.......................
........................
.
. . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . .
.
. . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . .
.
. . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . .
Figure 2.
Case 1. S contains no 2-sided disk face.
By Lemma 3.2, S contains at least two vertices of valency at most 3, say
v
1
and v
2
, each of which belongs to a single boundary edge.
Suppose that
1
A is contained in a fat edge l of S (as in Fig. 1). If
v
i
/
, v
, i =1, or 2, then v
i
has valency at most 3 in
F
0
. If
v
1
, v
2
, v
.
...................................................................
.
...................................................................
.
...................................................................
.
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
.
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
v
1
v
2
v
l
1
l
l
2
D
.
........................
.......................
.......................
.......................
.......................
......................
.
......................
.......................
.......................
.......................
.......................
........................ . ........................
.......................
.......................
.......................
.......................
......................
.
......................
.......................
.......................
.......................
.......................
........................
.
. . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . .
.
. . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . .
.
. . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . .
Figure 3.
.
...................................................................
.
...................................................................
.
...................................................................
.
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
.
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
v
1
v
2
v
l
1
l
l
2
D
.
........................
.......................
.......................
.......................
.......................
......................
.
......................
.......................
.......................
.......................
.......................
........................ . ........................
.......................
.......................
.......................
.......................
......................
.
......................
.......................
.......................
.......................
.......................
........................
.
. . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . .
.
. . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . .
.
. . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . .
Figure 4.
(2) the edges incident to one of v
1
and v
2
are as in one of Figures 3-4.
Now l
1
v
1
l v
2
l
2
separates D into two discs D
and D
. Since
1
A
is contained in l, the graph S D
contains at
least two vertices of valency at most 3, each of which belongs to a single
boundary edge, and the one which is not equal to v, has valency at most 3
in
F
0
.
Now we suppose that
1
A is not contained in a fat edge. Since S contains
no 2-sided disk face, the one of v
1
and v
2
which does not belong to the edge
bounding f
1
, has valency at most 3 in
F
0
.
Case 2. S contains a 2-sided disk face f
2
.
Now
1
A intf
2
. That means that
1
A is contained in a fat edge of
S.
By Lemma 3.2,
S contains at least two vertices of valency at most 3, each
REDUCIBLE DEHN SURGERY AND ANNULAR DEHN SURGERY 365
of which belongs to a single edge. Using
S to take place of S in Case 1, we
can proof that
F
0
contains a vertex of valency at most 3, which belongs to
a boundary edge.
Proposition 3.4. If
2
has property (), then
2
contains at least one ver-
tex, such that around it, all the endpoints of edges connecting it to parallel
vertices are successive, and there are at most n
1
+ 2 of them.
Proof. This follows immediately from Lemma 2.5(2) and Lemma 3.3.
Proposition 3.5. If
2
does not have property (), then
1
contains at
least one vertex, such that around it, all the endpoints of edges connecting
to parallel vertices are successive, and there are at most n
2
1 of them.
Proof. By Lemma 3.1,
1
has property (). Now let F
be an extremal
component of G
1
, and let D be the corresponding disc. In this case F =
D
1
is a graph in D. By Lemma 3.2,
F contains at least one vertex, say x,
of valency at most 3, which belongs to a single boundary edge. That implies
that in
1
, there are at most two families of parallel edges connecting x to
parallel vertices, and if there are two such families, then they are successive.
Since
2
does not have property (),
2
contains one vertex, such that each
of the edges incident to it connects it to a parallel vertex. That implies that
all edges in
1
with this vertex as a label connect nonparallel vertices, hence
the above two families of parallel edges contains at most n
2
1 edges.
4. The proof of Theorem 1.
Let G be a graph on a surface S. In this section, we shall denote by V the
number of vertices of G, E the number of edges of G and F the number of
disk faces of G. By the Euler characteristic formula, V E +F (S).
Proposition 4.1. If n
2
= 1, then (r
1
, r
2
) 1.
Proof. Suppose, otherwise, that (r
1
, r
2
) 2. Since n
2
= 1,
2
contains
only one vertex, say x. It is easy to see that x has valency 2 in
2
. Hence
2
contains n
1
mutually parallel edges, contradicting Lemma 2.5(3).
Proposition 4.2. If n
2
= 2, then (r
1
, r
2
) 2.
Proof. Suppose, otherwise, that (r
1
, r
2
) 3. Since n
2
= 2, each vertex of
2
has valency 4 as in Fig. 5 (otherwise
2
contains n
1
mutually parallel
edges). By Lemma 2.5(2), l
1
and l
2
contains at most n
1
+2 edges. If l
1
and
l
2
occupy at most n
1
+ 1 edges, then one of l
3
and l
4
occupies at least n
1
mutually parallel edges, contradicting Lemma 2.5(3).
366 RUIFENG QIU
.
............................
............................
............................
...........................
...........................
...........................
.
...........................
...........................
...........................
............................
............................
............................ . ............................
............................
............................
...........................
...........................
...........................
.
...........................
...........................
...........................
............................
............................
............................
.
. . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . .
.
. . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . .
.
. . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . .
.
.......................
.......................
.......................
.......................
.
......................
......................
.......................
........................
.
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
.
. . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . .
.
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . .
.
. . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . .
.
. . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
.
................
................
................
................
.
...............
...............
................
................
.
................
................
................
................
.
...............
...............
................
................
.
. . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . .
.
. . . . . . . . . . . . . . .
. . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . .
.
. . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . .
.
. . . . . . . . . . . . . . .
. . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . .
.
...............
...............
...............
.
...............
...............
............... . ...............
...............
...............
.
...............
...............
...............
.
............
............
............
.
...........
............
............
.
. . . . . . . . . . . .
. . . . . . . . . . . .
. . . . . . . . . . . .
.
. . . . . . . . . . .
. . . . . . . . . . . .
. . . . . . . . . . . .
l
4
l
3
l
2
l
1
2
F
1
F
Figure 5.
Now we suppose that that l
1
and l
2
occupy n
1
+2 edges. Then
2
contains
an S-cycle. By Lemma 2.2, F
1
is separating, and n
1
4. By Lemma 2.5(3),
each of l
3
and l
4
occupies n
1
1 edges.
Case 1.
1
F
2
and
2
F
2
are parallel.
Let x be a vertex of
1
. By the parity rule,
x
2
contains at least six (3n
2
)
edges. Since
2
has four edges,
x
2
contains a 2-sided disk face. By Lemma
2.5(1) and Lemma 2.5(3),
2
contains an S-cycle, one label of which is x,
for any given vertex x in
1
. Since n
1
3,
2
contains two Scharlemann
cycles with distinct label pairs, contradicting Proposition 2.4.
Case 2.
1
F
2
and
2
F
2
are antiparallel.
Let be the subgraph of
1
consisting of all the vertices of
1
and the
edges in l
3
. Since F
1
is separating, is not connected. By the Euler char-
acteristic formula, contains a disk face. By the proof of Proposition 1.3 of
[4], M contains an essential annulus, a contradiction.
In the following arguments, we shall assume that n
3.
Proposition 4.3. If
2
has property (), then (r
1
, r
2
) 2.
Proof. Suppose, otherwise, that (r
1
, r
2
) 3. By Proposition 3.4, there
exists a vertex of
2
, say y, such that
y
1
contains at least 2n
1
+ l edges,
where l 2. By the Euler characteristic formula,
y
1
contains at least
n
1
+l +2 n
1
disk faces. Since there are n
1
adjacent edges at y connecting
it to antiparallel vertices, there is a great y-cycle in
1
. By Lemma 2.5(4),
1
contains a Scharlemann cycle. By Lemma 2.2, F
2
is separating. By Lemma
2.6(2) and Proposition 2.4,
1
contains at least n
1
Scharlemann cycles with
the same label pair, say
1, 2
. Then m n
1
. By Lemma 2.5(5),
1
1
contains at least 2m edges. By the Euler characteristic formula,
1
1
contains
REDUCIBLE DEHN SURGERY AND ANNULAR DEHN SURGERY 367
at least 2mn
1
+ 2 m+ 2 disk faces. By Lemma 2.6(2),
1
contains at
least m + 2 Scharlemann cycles. Thus
1
contains two Scharlemann cycles
with distinct label pairs, contradicting Proposition 2.4.
Proposition 4.4. If
2
does not have property (), then (r
1
, r
2
) 2.
Proof. Suppose, otherwise, that (r
1
, r
2
) 3. By Proposition 3.5, there
exists a vertex of
1
, say x, such that
x
2
contains 2n
2
+l edges, where l 1.
By the Euler characteristic formula,
x
2
contains at least n
2
+ l disk faces.
By the parity rule and Proposition 3.5, there are n
2
l vertices of
2
, each
of which is incident to an edge connecting it to an antiparallel vertex. That
means that there are at least n
2
l edges of
x
2
, each of which is on the
boundary of only one disk face of
x
2
. We claim that
x
2
contains either a
2-sided disk face, or a 3-sided disk face.
If
x
2
contains no 2-sided disk face or 3-sided disk face. Then 4F
2(2n
2
+l) (n
2
l). Thus F 3/4(n
2
+l) < n
2
+l, a contradiction.
Now by Lemma 2.6(1),
2
contains a Scharlemann cycle. By Lemma
2.2, F
1
is separating. By Lemma 2.6(2) and Proposition 2.4,
2
contains at
least n
2
+ l Scharlemann cycles with the same label pair, say
1, 2
. Now
suppose that
2
contains m Scharlemann cycles with label pair
1, 2
. Then
m n
2
+l, where l 1. By Lemma 2.5(5),
1
2
contains at least 2m edges.
By the Euler characteristic formula,
1
2
contains at least m + l disk faces.
By Lemma 2.6(2),
2
contains at least m + l Scharlemann cycles. Thus
2
contains two Scharlemann cycles with distinct label pairs, contradicting
Proposition 2.4.
Theorem 1 follows immediately from Propositions 4.1-4.4.
I am grateful to the referee for his suggestions.
References
[1] S. Boyer and X. Zhang, Reducing Dehn lling and toroidal Dehn lling, Topology and
Its Appl., 68 (1996), 285-303.
[2] M. Culler, C. Gordon, J. Luecke and P. Shalen, Dehn surgery on knots, Ann. of Math.,
125 (1987), 237-300.
[3] C. Gordon, Boundary slopes of punctured tori in 3-manifolds, preprint.
[4] C. Gordon and R. Litherland, Incompressible planar surfaces in 3-manifolds, Topology
and Its Appl., 18 (1984), 121-144.
[5] C. Gordon and J. Luecke, Reducible manifolds and Dehn surgery, Topology, 35 (1996),
385-403.
[6] C. Hayashi and K. Motegi, Dehn surgery on knots in solid tori creating essential annuli,
Trans. AMS, to appear.
[7] M. Scharlemann, Producing reducible 3-manifolds by surgery on a knot, Topology, 29
(1990), 481-500.
368 RUIFENG QIU
[8] Y-Q Wu, Incompressibility of surfaces in surgered 3-manifold, Topology, 31 (1992),
271-279.
[9] Y-Q Wu, The reducibility of surgered 3-manifolds, Topology and its Appl., 43 (1992),
213-218.
Received December 8, 1996 and revised September 10, 1998. This research was supported
in part by National Natural Science Foundation of China.
Department of Mathematics
Jilin University
Changchun 130023
China
E-mail address: qrf@mail.jlu.edu.cn
PACIFIC JOURNAL OF MATHEMATICS
Vol. 192, No. 2, 2000
EXTENDING MAPS OF A CANTOR SET PRODUCT
WITH AN ARC TO NEAR HOMEOMORPHISMS OF THE
2-DISK
Michael D. Sanford and Russell B. Walker
We prove that a positive entropy map of the product of
a Cantor Set and an arc (which covers a homeomorphism)
cannot be embedded into a near homeomorphism of the 2-
disk. Thus a theorem of M. Brown cannot be used to embed
the induced shift map on the corresponding inverse limit space
into a 2-disk homeomorphism.
1. Introduction.
In 1990, M. Barge and J. Martin [BM90] proved that the shift map on the
inverse limit space ([0, 1], f), for any map f : [0, 1] [0, 1], can be realized as
a global attractor in the plane. In 1960, M. Brown [Bro60] proved that the
inverse limit space of any near homeomorphism (Denition 1.2) of a compact
metric space is homeomorphic to the original space. M. Barge and J. Martin
prove that, for all such f, there exists an embedding h : [0, 1] D
2
such that
hf h
1
can be extended to a near homeomorphism of the 2-disk, D
2
. They
then use M. Browns theorem to extend the induced shift homeomorphism
on h([0, 1]) to a homeomorphism of D
2
. With care in the construction of
the near homeomorphism of D
2
, the inverse limit space (h([0, 1]), hf h
1
)
becomes a global attractor.
The main goal of this paper is to show that analogous techniques for maps,
F : C[0, 1] C[0, 1], where C is a Cantor set, F(x, y) = (F
1
(x), F
2
(x, y))
is a surjective map with positive topological entropy (Denition 1.4), and
F
1
is a homeomorphism, do not work; no near homeomorphic extension of
h F h
1
to D
2
exists for any embedding h : C [0, 1] D
2
(Theorem
3.1). In our terminology, such F cannot be embedded into any 2-disk
homeomorphism (Denition 1.1). In the proof of Theorem 3.1 one rst
assumes that h is a tame embedding (Denition 3.1). But in recent work,
R. Walker proves that all embeddings of C [0, 1] into D
2
are tame [Wal].
Our study of maps of C[0, 1] and their embeddings has links to a central
problem in the dynamical systems of positive entropy homeomorphisms of
compact surfaces.
Does there exist a C
1
positive entropy 2-disk dieomorphism without shifts?
369
370 M. SANFORD AND R. WALKER
In 1980, A. Katok [Kat80] proved that all C
1+
, > 0, positive entropy
dieomorphisms of a compact surface, have transverse homoclinic points. So
some power of such a dieomorphism restricts to a shift map of nite type.
The next year M. Rees announced a minimal positive entropy homeomor-
phism of the 2-torus [Ree81]. So her homeomorphism has no periodic orbits
thus no shifts. Though not in print, it appears that techniques M. Rees used
can be adapted to build a positive entropy 2-disk homeomorphism which has
a xed point and no other periodic orbits. The C
1
case remains open. In
1993, M. Barge and R. Walker built a chainable continuum which is the
inverse limit space of a map of a Cantor comb [BW93]. The map restricted
to each tooth was a tent map over an adding machine base map. The
induced shift homeomorphism has positive entropy but all periodic orbits
are period a power of 2. Thus no shifts are present. All chainable continua
can be embedded into the 2-disk [Bin62]. Although their Cantor comb map
can be embedded into a near homeomorphism of the 3-ball, it cannot be em-
bedded into a near homeomorphism of the 2-disk. (To prove this M. Barge
and R. Walker rely on properties of the adding machine base map.) So their
induced shift homeomorphism cannot be used to build a new Rees-type 2-
disk homeomorphism. By our Theorem 3.1, a much larger class of maps (all
positive entropy maps of C [0, 1] which cover any homeomorphism) has
the same drawback.
In Section 2 we show that if F : C [0, 1] C [0, 1] is a surjective
map such that F(x, y) = (F
1
(x), F
2
(x, y)), F
1
is a homeomorphism and
F
2
(x
0
, ) : [0, 1] [0, 1] is nonmonotone (Denition 1.3) for some x
0
, then
there exists no embedding of F into a near homeomorphism (Denition of
the 2-disk). We will show this by assuming such a near homeomorphism
does exist and then obtaining a contradiction using a result of S. Schwartz
[Sch92] (Theorem 1.1) concerning nonmonotone maps.
Unless otherwise specied X, and Y are compact metric spaces. And
1
and
2
on X Y are the rst and second coordinate projection maps.
Denition 1.1. A map f : X X can be embedded into the map F :
Y Y if there exists a topological embedding h : X Y such that
F|
h(X)
= h f h
1
.
Denition 1.2. A map f : X Y is called a near homeomorphism pro-
vided there exists a sequence {f
k
: X Y }
k=1
of homeomorphisms which
uniformly converge to f.
Denition 1.3. A map f : X Y is monotone provided f
1
(V ) is con-
nected, whenever V Y is connected.
Theorem 1.1 (S. Schwarts [Sch92]). Suppose that X is a locally connected
compact metric space. If f : X X is a near homeomorphism then f is
monotone.
EXTENDING MAPS OF A CANTOR SET 371
As mentioned, in Section 3 we show that if F : C [0, 1] C [0, 1]
is a surjective map with positive topological entropy (Denition 1.4), which
is embedded in the 2-disk, then F cannot be extended to a near homeo-
morphism of the disk. The proof uses theorems of R. Bowen (Theorem 1.2)
[Bow71] and M. Barge (Theorem 1.3) [Bar87].
Denition 1.4 (Topological Entropy). Suppose that F : X Y X Y
is a surjective map and has the form F(x, y) = (F
1
(x), F
2
(x, y)). Fix x
0
and
let > 0. A set E Y is (n, )separated by F|
1
1
(x
0
)
if for all y
0
, y
1
E,
y
0
= y
1
, d(
2
F
k
(x
0
, y
0
),
2
F
k
(x
0
, y
1
)) > for some k [0, n), where d is the
Y metric. Since Y is compact and n < , card E < . Let the maximum
number of (n, )-separated orbits for each be
s(n, ) = max
_
card E
E Y such that
E is (n, ) separated by F|
1
1
(x
0
)
_
.
Now, let the growth rate of s(n, ) (or -topological entropy) be
h
top
_
F|
1
1
(x
0
)
,
_
= limsup
n
log s(n, )
n
.
Lastly we let 0 and dene topological entropy for F|
1
1
(x
0
)
.
h
top
_
F|
1
1
(x
0
)
_
= lim
0
h
top
_
F|
1
1
(x
0
)
,
_
.
The topological entropy h
top
(F
1
) of the homeomorphism F
1
is dened
similarly (see [Bow71]).
Theorem 1.2 (R. Bowen [Bow71]). If F : X Y X Y has the form
F(x, y) = (F
1
(x), F
2
(x, y)) then
h
top
(F) h
top
(F
1
) + sup
xX
_
h
top
_
F|
1
1
(x)
__
.
If h
top
(F
1
) = 0 then h
top
(F) = sup
xX
_
h
top
_
F|
1
1
(x)
__
.
Theorem 1.3 (M. Barge [Bar87]). If F : X [0, 1] X [0, 1] has the
form F(x, y) = (F
1
(x), F
2
(x, y)), F
2
(x, ) : [0, 1] [0, 1] is monotone for
each x and h
top
(F
1
) = 0, then h
top
(F) = 0.
2. Nonmonotone Maps of the Cantor Set Cross the Interval.
2.1. Preliminaries. Let C [0, 1] be a Cantor set. Let C [0, 1] and
{} [0, 1] R
2
for C. The goal of this section is to prove Theorem
2.1 to follow. But rst some preliminaries.
2.1.0.1. Assume F : C [0, 1] C [0, 1] is a surjective map that has the
form
F(, y) = (F
1
(), F
2
(, y))
372 M. SANFORD AND R. WALKER
where F
1
: C C is a homeomorphism. Furthermore, for a given
0
C,
F
2
(
0
, y) = t(y) where t : [0, 1] [0, 1] is a continuous nonmonotone map
(see Figure 1 for an example). Let
0
= F
1
(
0
).
1
0
1
m M
b
a
s
1
s
2
c
Figure 1. Example of a nonmonotone map.
It will be needed later, that because t is nonmonotone we can nd a point
that has at least two points in the the pre-image that can be seperated by
disjoint epsilon balls. We introduce this idea at this point so that we can
use the notation developed here throughout.
2.1.0.2. Since t is nonmonotone and continuous, there is an a (0, 1) such
that t
1
(a) is closed and not connected. Thus, there is an interval (m, M)
[0, 1] \ t
1
(a) such that a = t(m) = t(M), and t([m, M]) = [a, b] (or [b, a])
for some b = a. Without loss of generality we will assume that a < b. Let
t
1
(b). By the intermediate value theorem, t([m, M]) = [t(m), t()].
Now choose c =
1
2
(a + b). Since t is continuous there are s
1
(m, ) and
s
2
(, M) such that c = t(s
1
) = t(s
2
) (see Figure 1).
2.1.0.3. By the continuity of F, for any > 0 there is a
1
=
1
() > 0
such that F (x, y) B
(
0
, t(y)) when d(
0
, x) <
1
and y [0, 1]. Suppose
K
1
= K
1
() N is such that
1
K
1
<
1
.
Let D = {(x, y) R
2
|x
2
+ y
2
4}. Now let h
0
: C [0, 1] D
be an arbitrary topological embedding. Then there is a homeomorphism
h
1
: D D such that (h
1
h
0
)(
0
, y) = (
0
, y) and (h
1
h
0
)(
0
, y) = (
0
, y)
for all y [0, 1]. So h
1
straightens out h
0
(
0
[0, 1]) and h
0
(
0
[0, 1])
in a strong sense. Notice that C [0, 1] D.
2.1.0.4. By the uniform continuity of h
1
h
0
, for all > 0 there is a
2
=
2
() > 0 such that for all y [0, 1], h
1
h
0
(x, y) B
(
0
, y) and h
1
h
0
(x
, y) B
(
0
, y), for all (x, y) B
2
()
(
0
, y) and (x
, y) B
2
()
(
0
, y).
Let K
2
= K
2
() N be such that
1
K
2
<
2
.
EXTENDING MAPS OF A CANTOR SET 373
2.1.0.5. With a, b dened as in [2.1.0.2], let
d = min{a, 1 b, |a b|}. For
0 <
0
<
d
100
choose 0 <
0
min
_
1
(
0
),
2
(
0
),
Mm
100
_
. So in particular
[2.1.0.3] and [2.1.0.4] are satised. Note that t([m, M]) [
0
, 1
0
]. Let
K
0
max{K
1
(
0
), K
2
(
0
)} be such that
1
K
0
<
0
. Since C is perfect, there
is a sequence {
k
} C such that
k
0
as k , and
k
[0, 1]
N
0
(
0
[0, 1]), for all k > K
0
. Let
k
= F
1
(
k
). (Note that N
(S) is
a neighborhood of S.) It follows that
k
0
as k and
k
[0, 1] N
0
(
0
[0, 1]) for all k > K
0
. For a possibly larger K
0
, also called
K
0
, and o
k
B
0
(
0
, c), k > K
0
, there exist q
1
(k) and q
2
(k) such that
{q
1
(k), q
2
(k)} F
1
(o
k
), q
1
(k) B
0
(
0
, s
1
) and q
2
(k) B
0
(
0
, s
2
).
We now state our rst theorem.
2.1.1. Nonmonotone Nonextension Theorem.
Theorem 2.1. Let F : C[0, 1] C[0, 1] be a map of the form F(, y) =
(F
1
(), F
2
(, y)) where F
1
: C C is a homeomorphism. Furthermore,
assume F
2
(
0
, ) : [0, 1] [0, 1] is surjective but not monotone for some
0
.
Then there exists no extension of h
0
F h
1
0
to a near homeomorphism of
the disk D, for any topological embedding h
0
: C [0, 1] D.
Proof. Assume h, K
0
,
0
,
0
, {
k
}, {
k
}, q
1
(k), and q
2
(k) are dened as in
[2.1.0.1-5]. Suppose that H
0
: D D is a near homeomorphism such
that H
0
|
h
0
(C[0,1])
= h
0
F h
1
0
. And let H
1
: D D be given by
H
1
= h
1
H
0
h
1
1
. Thus H
1
is also a near homeomorphism. So the
diagram in Figure 2 commutes.
h
0
h
0
h
1
h
1
F
H
0
H
1
C [0, 1] C [0, 1]
D D
D D
? ?
-
-
? ?
-
Figure 2. Commuting Diagram.
2.1.1.1. Let () = h
1
h
0
( [0, 1]) for all C. By [2.1.0.3] h
1
h
0
is a
homeomorphism and if ({} [0, 1])
() = . Let
(k) =
374 M. SANFORD AND R. WALKER
(
k
)
and
(k) = (
k
)
. Because h
1
h
0
(
k
, 0) B
0
(
0
, 0),
h
1
h
0
(
k
, 1) B
0
(
0
, 1), and (
k
) is connected, then
0
(
0
, ).
(
k
)
(
0
)
(k)
B
0
(
0
, 0)
B
0
(
0
, 1)
(
k
) (
0
)
(k)
B
0
(
0
, 0)
B
0
(
0
, 1)
Figure 3. Intersection of (
k
) with
.
Lemma 2.1 follows from the continuity of h
1
, h
0
, and
1
.
Lemma 2.1. Choose p
k
k
,
1
2
_
=
0
for suciently large k. So either
card
_
k
1
_
h
1
h
0
_
k
,
1
2
__
>
0
_
=
or
card
_
k
1
_
h
1
h
0
_
k
,
1
2
__
<
0
_
= .
2.1.1.2. So without loss of generality we may assume there exist distinct
{k
n
}
n=1
such that k
n
as n , and
1
_
h
1
h
0
_
k
n
,
1
2
__
>
0
.
2.1.1.3. For the sake of simplicity we denote
k
n
by
n
, (
k
n
) by (
n
),
(
k
n
) by (
n
) and
k
n
by
n
.
EXTENDING MAPS OF A CANTOR SET 375
Lemma 2.2. Let N
0
be such that k
n
K
0
for all n N
0
. Then
(
n
)
__
x,
1
2
_
x <
0
_
= .
Proof. Fix n N
0
and assume there exists
p
1
(
n
)
__
x,
1
2
_
x <
0
_
,
and let p
2
=
_
h
1
h
0
_
k
n
,
1
2
__
. By [2.1.1.2]
1
(p
2
) > 0. Let A be the arc in
(
n
) with end points p
1
and p
2
. By [2.1.1.1], p
1
, p
2
B
0
_
0
,
1
2
_
. So by
[2.1.0.5],
d((h
1
h
0
)
1
(p
1
), (h
1
h
0
)
1
(p
2
)) <
0
.
Since (
n
)
(
0
) = , then using a Jordan Curve argument, it follows
A
{(
0
, y)|y > 1 or y < 0} = .
Let p
3
A
2
(h
1
h
0
)
1
(p
1
) <
2
(h
1
h
0
)
1
(p
3
) <
2
(h
1
h
0
)
1
(p
2
)
or
2
(h
1
h
0
)
1
(p
2
) <
2
(h
1
h
0
)
1
(p
3
) <
2
(h
1
h
0
)
1
(p
1
).
In either case d((h
1
h
0
)
1
(p
1
), (h
1
h
0
)
1
(p
3
)) <
0
. And so d(p
1
, p
3
) <
0
which is a contradiction.
2.1.1.4. Assume n N
0
. Let g
n
: [0, 1] (
n
) be the parameterization
of (
n
) dened by g
n
() = h
1
h
0
(
n
, ). Using , m from [2.1.0.2]
(
n
)
= and (
n
)
m
= (see Figure 4) so by Lemma [2.2]
and the connectivity of (
n
) there is a largest , call it
n
, such that
g
n
(
n
)
m
. Let a
n
= g
n
(
n
) (see Figure 4). Similarly there is a smallest
, call it
+
n
, such that g
n
(
+
n
)
m
. Let b
n
= g
n
(
+
n
).
2.1.1.5. If necessary, renumber the k
n
s so that if k
n
< k
n+1
then
1
(a
n
) >
1
(a
n+1
). It follows by an argument similar to that of Lemma [2.2] that
1
(b
n
) >
1
(b
n+1
). (Because h
1
h
0
may have scrambled the C[0, 1] order
in the rst coordinate, it may be necessary to relabel the k
n
s so that (
k
n
)
to be between (
k
n1
) and (
k
n+1
).)
Considering [2.1.1.5] and [2.1.1.2] and to simplify the notation assume
card
_
k|
1
(h
1
h
0
)
_
k
,
1
2
_
>
0
_
=
and
1
(a
k
) >
1
(a
k+1
) for all k.
376 M. SANFORD AND R. WALKER
b
n
= g
n
(
+
n
)
a
n
= g
n
(
n
)
(
n
)
(
0
)
Figure 4. First and Last Intersections.
Using [2.1.1.4], for k N
0
dene the four curves I (k, m), I (k, ), J
k1
,
and J
k+1
in the following manner (see Figure 5). Let I (k, m) be the line
segment in
m
between a
k+1
and a
k1
and I (k, ) be the line segment in
between b
k+1
and b
k1
. Let
J
k1
=
_
g
k1
()
k1
+
k1
_
, and
J
k+1
=
_
g
k+1
()
k+1
+
k+1
_
.
b
k+1
b
k1
I (k, )
a
k+1 a
k1
I (k, m)
J
k+1
J
k1
Figure 5. The Boundary of R
k
.
Lemma 2.3. I (k, )
J
k1
I (k, m)
J
k+1
is a simple closed curve.
Proof. Since (
k1
)
(
k+1
) = we have J
k1
J
k+1
= . By [2.1.1.1]
I (k, m)
I (k, m) and a
k+1
= J
k+1
I (k, m)
EXTENDING MAPS OF A CANTOR SET 377
and
b
k1
= J
k1
I (k, ) and b
k+1
= J
k+1
I (k, ) .
And so the lemma follows.
Let R
k
be the closed and bounded set with boundary
I (k, m)
_
J
k1
_
I (k, )
_
J
k+1
(see Figure 5). Recall from [2.1.0.2] that s
1
[m, ] and from [2.1.1.1] that
s
1
(k) = (
k
)
s
1
(see Figure 3).
Lemma 2.4. R
k
s
1
(k) = .
Proof. Let
k
be the arc
_
g
k
()|0
+
k
_
. Let S
k
= R
k
1
2
[s
1
, ]
(see Figure 6). So S
k
I (k, ) and by [2.1.1.5] b
k
I (k, ). But b
k
is
not an endpoint of I (k, ) because the endpoints of I (k, ) are b
k1
and
b
k+1
. And so there is an > 0 such that if p B
(b
k
) and
2
(p) < then
p int S
k
. Now, if q
k
\ {b
k
} then
2
(q) <
1
2
. And since
k
connects
h
1
h
0
(
k
, 0) to b
k
, we have that (
k
(b
k
)) \{b
k
} = . Thus there exists
p
0
k
(b
k
)
intS
k
. Let A
k
k
be the arc with endpoints p
0
and
h
1
h
0
(
k
, 0) (see Figure 6).
s
1
A
k
R
k
s
1
(k)
S
k
B
(b
k
)
m
(
0
)
p
0
Figure 6. The Arc A
k
.
Because p
0
intS
k
and h
1
h
0
(
k
, 0) S
k
then A
k
S
k
= . Since
A
k
(
k1
) = , A
k
(
k+1
) = , A
k
I (n, ) = and
s
1
R
k
S
k
,
we have that A
k
s
1
R
k
= , or R
k
s
1
(k) = .
2.1.1.6. Note that since
s
1
(k)
R
k
= then
s
1
(k) intR
k
.
Lemma 2.5.
_
(
l
)
H
1
1
(h
1
h
0
(
k
, y))
= for k = l.
378 M. SANFORD AND R. WALKER
Proof. Suppose that (
l
)
H
1
1
(h
1
h
0
(
k
, y)) for k = l. Then
H
1
() = h
1
h
0
(
k
, y) But H
1
(
l
) = (
l
). Thus H
1
() (
l
) So
h
1
h
0
(
k
, y) (
l
) Or (
k
, y) C
l
. Which contradicts, [2.1.1.1] since
k = l.
Proof of Theorem 2.1 continued. By Lemma [2.4] there exists p
1
(k)
R
k
s
1
(k) for all k N
0
. By [2.1.0.5] (h
1
h
0
)
1
(p
1
(k)) B
0
(
0
, s
1
).
Using [2.1.0.5], let o
k
= F (h
1
h
0
)
1
(p
1
(k)). So there exists {q
1
(k),
q
2
(k)} F
1
(o
k
) such that q
1
(k) B
0
(
0
, s
1
) and q
2
(k) B
0
(
0
, s
2
).
Choose q
1
(k) so that p
1
(k) = h
1
h
0
(q
1
(k)). And let p
2
(k) = h
1
h
0
(q
2
(k))
and r
k
= h
1
h
0
(o
k
) (see Figure 7). Because H
1
h
1
h
0
= h
1
h
0
F
then {p
1
(k), p
2
(k)} H
1
1
(r
k
). By the size of
0
chosen in [2.1.0.5], p
2
(k)
B
0
(
0
, s
2
) R
k
.
Recall that H
0
and H
1
are near homeomorphisms. Near homeomorphisms
are monotone on locally connected compact metric spaces ([Sch92]). Thus
pre-images of connected sets under H
1
are connected. So H
1
1
(r
k
) is a
connected set which contains p
2
(k) R
k
and by [2.1.1.6] p
1
(k) intR
k
.
Then H
1
1
(r
k
)
R
k
= . By Lemma [2.5] either H
1
1
(r
k
)
I (k, ) =
or H
1
1
(r
k
)
H
1
1
(r
k
j
) or
k
j
I (k, m)
H
1
1
(r
k
j
) for all j (see
Figure 7).
(
0
)
(
k
)
p
1
(k)
p
2
(k)
k
H
1
1
(r
k
)
Figure 7. Subsequence and Pre-image.
Now by Lemma [2.1] either
k
j
h
1
h
0
(
0
, ) or
k
j
h
1
h
0
(
0
, m)
as j . Since H
1
is continuous for all j, either
H
1
k
j
H
1
h
1
h
0
(
0
, ) or H
1
k
j
H
1
h
1
h
0
(
0
, m) .
Because H
1
h
1
h
0
= h
1
h
0
F, then either
r
k
j
h
1
h
0
(
0
, t()) or r
k
j
h
1
h
0
(
0
, t(m)) .
EXTENDING MAPS OF A CANTOR SET 379
Since h
1
h
0
is a homeomorphism either
o
k
j
(
0
, b) or o
k
j
(
0
, a)
which is a contradiction since {o
k
j
} B
0
(
0
, c).
3. Positive Entropy Maps of C [0, 1].
3.1. Introduction. Let C R be a Cantor set. In this chapter we use the
results of Chapter 2 to prove the following:
Theorem 3.1. Let F : C [0, 1] C [0, 1] be a surjective map such
that F(a, y) = (F
1
(a), F
2
(a, y)), where F
1
: C C is a homeomorphism. If
h
top
(F) > 0 then there exists no topological embedding h
0
: C[0, 1] D
R
2
such that h
0
F h
1
0
extends to a near homeomorphism of the disk D.
Recall that
1
: C [0, 1] C is the projection map onto the rst
coordinate. By work of R. Bowen [Bow71] we know that h
top
(F)
h
top
(F
1
) + sup
aC
_
h
top
_
F|
1
1
(a)
__
. It has been shown by M. Barge and
R. Walker [BW93] that any near homeomorphism that extends h
0
F
h
1
0
to the disk must preserve a certain local order on the set of bers
{h
0
(a [0, 1]) |a C}. But we will show that if h
top
(F
1
) > 0 no such local
order is preserved. So in fact h
top
(F
1
) = 0. Using [Bow71] and a result
of M. Barge [Bar87], if h
top
(F) > 0 then for some a
0
C, F
2
(a
0
, ) is a
nonmonotone map. Thus by Theorem 2.1, h
0
F h
1
0
cannot be extended
to a near homeomorphism of the disk.
3.2. Proof of Theorem 3.1.
Denition 3.1 (Tame Embedding). h
0
: C [0, 1] D R is a tame
embedding provided there is a homeomorphism h
1
: D D such that for all
a C, h
1
h
0
({a} [0, 1]) = ({a
}. If h
0
is a tame
embedding, using a theorem of E. Moise [Moi77], we may further require
that h
1
has the property: h
1
h
0
({a}, i) = ({a
for some
a
and
+
are arcs in the plane
with the properties:
has endpoints h
1
h
0
(a, 0) and h
1
h
0
(b, 0), and
is otherwise disjoint
from (a)
(b);
+
has endpoints h
1
h
0
(a, 1) and h
1
h
0
(b, 1) and
+
is
otherwise disjoint from (a)
(b); and
_
+
_
_
[0, 2]
_
1
2
__
= .
Such arcs
and
+
will be called admissible arcs joining (a) and (b).
Denition 3.3. Given a, b C, a = b, then (a) <
bw
(b) if there are
admissible arcs joining (a) and (b), as above, and the orientation
(b)
+
(a) is positive (counterclockwise) on the simple closed curve
(b)
(b)
(a)
1
((b)) = T(b)
1
((a)) = T(a)
y =
1
2
1
,
+
2
, and
2
such that (z)
+
1
(x)
1
and (y)
+
2
(z)
2
have positive orientation.
Sublemma 3.1. For > 0 there is a
1
> 0 such that if
(z)
1
((x)) =
then |x z| < .
Proof. By the continuity of (h
1
h
0
)
1
, if > 0 there is a
1
> 0 such that
if d(p, q) <
1
where p, q then d((h
1
h
0
)
1
(p), (h
1
h
0
)
1
(q)) < . So
if (z)
1
((x)) = there is p (x), q (z) such that d(p, q) <
1
.
Thus |xz| = |
1
((h
1
h
0
)
1
(p))
1
((h
1
h
0
)
1
(q))| d((h
1
h
0
)
1
(p), (h
1
h
0
)
1
(q)) < .
Choose
1
> 0 smaller so that if
(z)
1
((x)) = and
(z)
1
((y)) =
then |x z| < and |y z| < .
By the continuity of (h
1
h
0
) there is > 0 such that if |x y| < then
(x) N
1
((y)) and (y) N
1
((x)).
Suppose that (z)
1
((x))
1
((y)) = . So either
1
(z) <
1
(x) or
1
(y) <
1
(z). Thus either (z)
+
1
(x)
1
has
negative orientation or (z)
+
2
(y)
2
has positive orientation
which contradicts x <
C
z <
C
y.
382 M. SANFORD AND R. WALKER
Thus (z)
1
((x)) = and (z)
1
((y)) = . So by the choice
of then |x z| < and |y z| < as desired.
Lemma 3.2. Let f : (C, <
C
) (C, <
C
) be a local order preserving home-
omorphism. Then there is a > 0 such that if |x y| < then f([x, y]) =
[f(x), f(y)].
Proof. By Denition 3.6 there is an > 0 such that for any x, y C if
|x y| < , and x <
C
y then f(x) <
C
f(y). By Lemma 3.1 there is a > 0
such that if x <
C
z <
C
y and |x y| < then |x z| < and |y z| < .
Thus f(x) <
C
f(z) and f(z) <
C
f(y).
The proof of the following lemma was suggested by M. Barge.
Lemma 3.3. Let f : (C, <) (C, <) be a local order preserving homeo-
morphism. Then h
top
(f) = 0.
Proof. Recall that S C is an (n, )-spanning set, for f if for all x C
there is a y S such that |f
k
(x) f
k
(y)| < for all k = 0, 1, 2, . . . n 1.
Then (h
top
)
(f) = limsup
n
log card S(n, )
n
, and h
top
(f) = lim
0
(h
top
)
(f).
Choose as in Lemma 3.2 and suppose that S C is an (n, )-spanning
set where 0 < ( from the lemma). Let X be a nite set of C that is
-dense, let N = card X. Before proceeding with the proof of Lemma 3.3
we prove the following sublemma.
Sublemma 3.2. S
_
f
n
(X) is an (n + 1, )-spanning set.
Proof. Let x C. Suppose that y S is such that |f
k
(x) f
k
(y)| < for
k = 0, 1, 2, . . . n 1. There is a z X such that either z [f
n
(x), f
n
(y)]
or z [f
n
(y), f
n
(x)], and such that |f
n
(x) z| < . Then we have that
f
n
(z) S
f
n
(X) and z satises |f
k
(x) f
k
(z)| < for k = 0, 1, 2, . . . n
as desired.
Continuing with proof of Lemma 3.3, it follows from Sublemma 3.2 that
there exists a constant K > 0 such that for all n, card S(n.) K + nN.
Thus,
h
top
(f) = lim
0
(h
top
)
(f)
= lim
0
limsup
n
log card S(n, )
n
= lim
0
limsup
n
log(K + nN)
n
= 0.
a
linking h
1
h
0
({a
0
} {0}) to h
1
h
0
({a} {0}) and an
admissible arc,
+
a
linking h
1
h
0
({a
0
} {1}) to h
1
h
0
({a} {1}). Now
H is monotone on the simple closed curve = (a
0
)
(a)
+
a
.
Thus H can be approximated by a homeomorphism H
: D D such that
H
(a
0
) = H((a
0
)), H
(a) = H((a)), H
a
= H(
a
), and H
+
a
=
H(
+
a
). So the orientation of H() is identical to the orientation of H
().
For a suciently close to a
0
H
(or (H
)
2
) preserves <
bw
between (a
0
) and
(a) [BW93]. Thus H (or (H)
2
) does so as well.
Proof of Theorem 3.1 continued. We now complete the proof of Theorem 3.1.
First suppose that F
1
and F
2
1
do not locally preserve <
C
. Then by Denition
3.5 H and H
2
cannot locally preserve <
bw
on the bers {(a)| a C},
contradicting Lemma 3.4.
Next suppose F
1
locally order preserves <
C
. Then by Lemma 3.3 we have
that h
top
(F
1
) = 0. And if F
2
1
locally preserves <
C
, then h
top
_
F
2
1
_
= 0, thus
h
top
(F
1
)=0. So by [Bow71] h
top
(F) = h
top
(F
1
)+sup
aC
_
h
top
_
F|
1
1
(a)
__
=
sup
aC
_
h
top
_
F|
1
1
(a)
__
. But if h
top
(F) > 0 there is an a
0
C such that
h
top
_
F|
1
1
(a
0
)
_
> 0. Thus by Theorem 1.3 ([Bar87]) F
2
|
a
0
[0,1]
is not
monotone. So by Theorem 2.1 no such near homeomorphism extension H
of h
1
h
0
F (h
1
h
0
)
1
exists.
References
[Bar87] M. Barge, The topological entropy of homeomorphisms of Knaster continua,
Houston Journal of Mathematics, 13(4) (1987), 465-485.
[BM90] M. Barge and J. Martin, The construction of global attractors, Proc. Am. Math.
Soc., 110 (1990), 523-525.
[BW93] M. Barge and R. Walker, Nonwandering structures at the period-doubling in di-
mensions 2 and 3, Trans. Am. Math. Soc., 337(1) (1993), 259-278.
[Bin54] R.H. Bing, Locally tame sets are tame, Annals of Mathematics, 59 (1954), 145-
158.
[Bin62] , Embedding circle like continua in the plane, Canadian J. Math., 14
(1962), 113-128.
[Bow71] R. Bowen, Entropy for group endomorphisms and homogeneous spaces, Trans.
Am. Math. Soc., 153 (1971), 401-408.
[Bro60] M. Brown, Some applications of an approximation theorem for inverse limits,
Proc. Am. Math. Soc., 11 (1960), 478-483.
384 M. SANFORD AND R. WALKER
[Kat80] A. Katok, Lyaponov exponents, entropy, and periodic orbits for dieomorphisms,
Publications Mathematique, 51 (1980), 137-173.
[Moi77] E. Moise, Geometric Topology in Dimension 2 and 3, Springer-Verlag, New York,
NY, 1977.
[Ree81] M. Rees, A minimal positive entropy homeomorphism of the 2-torus, J. London
Math. Soc., 23(2) (1981), 537-550.
[Rus73] T.B. Rushing, Topological Embeddings, Academic Press, 1973.
[Sch92] S. Schwartz, Some Planar Embeddings of Chainable Continua can be Expressed
as Inverse Limit Spaces, Ph.D. thesis, Montana State Univ., 1992.
[Wal] R. Walker, Taming the Cantor Fence, Preprint.
Received July 18, 1996 and revised December 16, 1996. This research was supported in
part by NSF-OSR grant #9350546.
262 South Main St.
Lodi, NJ 07644
E-mail address: sanfordm@inet.felician.edu
Montana State University
Bozeman, MT 59715
E-mail address: walker@math.montana.edu
PACIFIC JOURNAL OF MATHEMATICS
Vol. 192, No. 2, 2000
K-TYPES OF SU(1, n) REPRESENTATIONS AND
RESTRICTION OF COHOMOLOGY
Mark R. Sepanski
This paper shows that the highest weights of the K-types
of any irreducible admissible representation of SU(1, n) are
determined by certain restriction maps from u to u k coho-
mology. In particular, the image of these maps determines a
set of points in a Cartan subalgebra. It is proved that the
highest weights of the K-types are given by intersecting a
translate of the root lattice with the closed convex hull of the
points determined by the restriction maps.
1. Introduction.
A basic idea often employed in the study of representations of real reductive
Lie groups is the notion of a K-type. In particular, if G is a real reductive
Lie group, K its maximally compact subgroup, and X an admissible repre-
sentation of G, then the representations of K appearing in X are called the
K-types of X. The point is that compact groups are well understood and
provide a powerful tool in the analysis of noncompact groups. The classical
application of these ideas is Bargmanns description in [1] of the represen-
tations of SL(2, R) based on the study on its SO(2)-types (see also [4], [2],
[3], and especially [6] for an extensive list).
In many cases formulas for the K-types are known. For instance Blat-
tners formula ([5]) provides a wonderfully explicit description of the K-types
of the discrete series (see [8] for a generalization). Unfortunately, even in
these cases the formulas are combinatorially complex and it is often hard to
determine whether a particular representation is a K-type.
A dierent approach, suggested by D. Vogan, is followed in [13]. There the
object of study is the closed convex hull of the set of highest weights of the K-
types. In the case of nite dimensional representations when G is SU(1, n)
or SO(1, n), an algebraic method is developed for nding the edges of this
closed convex hull. The point is that knowledge of just the edges is enough
to reconstruct the whole closed convex hull. This already provides fairly
sharp control of which representations can be K-types. Moreover, it can be
seen in [13] that intersecting a translate of the root lattice with the closed
convex hull recovers all the K-types.
385
386 MARK R. SEPANSKI
Let X be an irreducible admissible representation of G, write g for its
complexied Lie algebra, and choose a -stable parabolic subalgebra q =
l + u of g. The main algebraic tool used to construct the edges in [13] is a
restriction map on cohomology
: H
b
(u, X) H
b
(u k, X).
In a sense that is made precise, it is shown there that the image of (as
q varies) determines all the edges (not lying in a Weyl chamber wall of K)
and therefore determines all the K-types.
This paper generalizes [13]. The main result is Theorem 9. It says that
the edges of the closed convex hull of the set of K-types of any irreducible
admissible representation of G = SU(1, n) are completely determined by the
image of (in fact, only two parabolics are needed). Corollary 3 completes
the circle by showing that all K-types may be recovered from this closed
convex hull by intersecting it with a translate of the root lattice of K. All
notation necessary to understand the precise result is contained in Section
6.
The layout of the paper is as follows: Section 2 sets up the notation,
Section 3 lists the K-types of the induced representations of G, Section 4
lists the innitesimal characters and a reducibility criterion, Section 5 gives
the K-types of all irreducible representations of G, and Section 6 constructs
the K-types in terms of the image of .
2. Notation.
Let G = SU(1, n), n > 1, and write K
= U(n) for its maximally compact
subgroup embedded into G as
K =
__
x
X
_
| x U(1), X U(n), xdet(X) = 1
_
.
Let g
0
= su(1, n) be the Lie algebra of G and write g for its complexication.
This convention will be followed throughout the paper. For example, k
0
is
the Lie algebra of K, isomorphic to u(n), and k is its complexication. Also
write for the standard Cartan involution and g = k+p for the corresponding
Cartan decomposition.
Let T be the Cartan subgroup of K (and G) consisting of all diagonal
matrices in G. If x = (x
1
, . . . x
m
) R
m
, dene its trace by tr(x) = x
1
+
. . . x
m
. With this notation and the identication of it
0
with it
0
via the
standard dot product, the set of analytically integral weights on t is
T =
_
(
0
,
1
, . . .
n
)
_
1
n + 1
Z
_
n+1
| tr() = 0,
j
Z for 0 i < j n
_
.
K-TYPES OF SU(1, n) 387
Viewing T as a Cartan subgroup of K, we say
T is positive and write
T
+
if
1
2
. . .
n
. By taking highest weights,
T
+
parameterizes
the irreducible representations of K. We also write W
K
for the Weyl group
of K and W
G
for the Weyl group of G with respect to it
0
. W
K
acts on it
0
as the set of all permutations of the last n coordinates and W
G
acts on it
0
as the set of all permutations.
Let A be the subalgebra of G dened by exp(a
0
) where a
0
p is the
subalgebra given by
a
0
=
_
_
a
_
_
_
0
.
.
.
0
_
_
_
| R
_
_
.
By conjugation, we may pull back the standard trace form on the diagonal
matrices to a so that a
1
a
2
= 2
1
2
. We use this to identify a and a
. We
further identify C with a by mapping z C to a
z
a. By the identication
of a and a
, z acts on a by z a
= 2z.
Let = (g, a) be the restricted root system so = {
1
2
, 1} with mul-
tiplicities 2(n 1) and 1, respectively. Set
+
= {
1
2
, 1} and let P be the
corresponding parabolic subgroup with P = MAN its Langlands decompo-
sition. In particular,
M =
_
_
_
_
_
x
X
x
_
_
| x U(1), X U(n 1), x
2
det(X) = 1
_
_
_
and is a double cover of U(n 1).
Let S be the Cartan subgroup of M consisting of all diagonal matrices in
M and write H = SA as a Cartan subgroup of G. The set of analytically
integral weights on S is
S =
_
(x
0
, x
1
, . . . x
n
) R
n+1
| x
0
= x
n
, tr(x) = 0, x
0
x
1
1
2
Z,
x
i
x
j
Z for 1 i < j n 1
_
=
_
(x
0
, x
1
, . . . x
n
) | x
1
1
n + 1
Z, x
i
x
j
Z for 1 i < j n 1,
x
0
= x
n
=
1
2
n1
j=1
x
j
_
.
We say x
S is positive and write x
S
+
if x
1
x
2
. . . x
n1
. By taking
highest weights,
S
+
parameterizes the irreducible representations of M.
388 MARK R. SEPANSKI
3. K-types of induced modules.
For
T, let V
K
|
M
to signify restriction to M. Since the branching law for
restriction from U(n) to U(n1) is well known ([14]), it is easy to determine
how restriction works from K to M:
Theorem 1. Let = (
0
,
1
, . . .
n
)
T
+
. Then
V
K
|
M
=
V
M
x
where
= {(x
0
, x
1
, . . . x
n
) | x
0
= x
n
, tr(x) = 0, x
i
0
Z for 1 i n 1,
1
x
1
2
x
2
. . .
n1
x
n1
n
}.
By Frobenius reciprocity, we then have
Corollary 1. Let x = (x
0
, x
1
, . . . x
n
)
S
+
. Then
Ind
K
M
(V
M
x
) =
x
V
K
,
where
x
= {(
0
,
1
, . . .
n
) | tr() = 0,
i
x
1
Z for 0 i n,
1
x
1
2
x
2
. . .
n1
x
n1
n
}.
For Langlands parameters x
S and a
). Since I(x, )|
K
= Ind
K
M
(V
M
x
),
Corollary 1 describes the K-types of I(x, ). If Re() > 0, write J(x, ) for
the unique irreducible Langlands quotient of I(x, ).
4. Character Equalities.
For x
S
+
and C, write for the innitesimal character x +
M
+ of
I(x, ). After conjugating h to t, we may take t as
=
_
x
0
+, x
1
+
n
2
1, x
2
+
n
2
2, . . . x
n1
n
2
+ 1, x
n
_
. (4.1)
We say is nonsingular if no two coordinates are the same.
Using the action of the Weyl group, it is straightforward to write down
all induced modules with the same innitesimal character. The following
notation simplies the results.
K-TYPES OF SU(1, n) 389
Denition 1. Let s
1
: R
n+1
C R
n+1
by s
1
(x, v) = x where
x
i
=
_
_
_
x
0
+
n
2
if i = 0
x
i
if 1 i n 1
x
n
+
n
2
if i = n.
For each a, b Z with 0 a < b n, dene s
2,a,b
: R
n+1
R
n+1
C by
s
2,a,b
( x) = (x
) where
x
i
=
_
_
1
2
( x
a
+ x
b
a +n b) if i = 0, n
x
i1
+ 1 if 1 i a
x
i
if a + 1 i b 1
x
i+1
1 if b i n 1,
=
1
2
( x
a
x
b
a +b)
and dene s
a,b
: R
n+1
C R
n+1
C by the composition s
2,a,b
s
1
. It is easy
to verify that the inverse, s
1
a,b
, is determined by dening s
1
2,a,b
(y
, w
) = y
where
y
i
=
_
_
y
i+1
1 if 0 i a 1
y
0
+w
n
2
+a if i = a
y
i
if a + 1 i b 1
y
n
w
+
n
2
+b n if i = b
y
i1
+ 1 if b + 1 i n
and dening s
1
1
( y) = (y, w) where
w =
1
2
( y
0
y
n
+n),
y
i
=
_
_
_
y
0
w +
n
2
if i = 0
y
i
if 1 i n 1
y
n
+w
n
2
if i = n.
Denition 2. For x
S
+
and C, dene = (x, ) to be
=
_
x
0
+, x
1
+
n
2
1, x
2
+
n
2
2, . . . x
n1
n
2
+ 1, x
n
_
.
Observe that
1
>
2
> . . .
n1
.
If R
0
, dene c, d Z, 0 c < d n, so that
c
>
0
c+1
and
d1
n
>
d
. Dene
( x, ) = s
1
c,d
(x, ).
Observe that if
= ( x, ), then
1
>
2
> . . .
n1
n
.
390 MARK R. SEPANSKI
We say that C is non-negative if either Re(
S
+
and ,
C both non-negative. If
0
1
/ Z,
then I(x, ) and I(x
and =
. If
0
1
Z, then I(x, ) and I(x
) have the
same innitesimal character if and only if (x
) = s
a,b
( x, ) for some a, b
with 0 a < b n.
Using Theorem 2, the Subrepresentation theorem, and the Langlands
classication, it is easy to identify almost all the induced modules I(x, )
that are irreducible. The calculations later in this paper or a few R-group
calculations suce to clear up the remaining ambiguities. But since this is
known (Kraljevic, [11], Proposition 1, 3 from [12], Theorems 7.5 and 8.7)
and not so important for the purpose of this paper, we simply state the
result.
Theorem 3. For x
S
+
and C, let be the character (x, ). Then
I(x, ) is reducible if and only if
0
1
Z and either
0
c+1
= 0 or
d1
= 0.
Note that reducibility always implies
1
2
Z.
5. K-types of Langlands quotients.
In this section we record the K-types of each irreducible representation of
G. The Langlands classication says that every irreducible representation
is a discrete series representation, limit of discrete series representation, an
irreducible tempered representation of the form I(x, i) with x
S
+
and
R
0
, or one of the J(x, ) with x
S
+
and C with Re() > 0.
Hence Corollary 1 and Theorem 3 yield the K-types of most irreducible
representations. The only ones yet to be determined are the discrete series,
limit of discrete series, and the irreducible quotients J(x, ) in the cases
where I(x, ) is reducible.
We begin by studying the reducible I(x, ) with nonsingular character.
They may be parameterized as follows. Fix with
0
>
1
> . . .
n
and
i
j
Z, 0 i < j n. Choose the unique x
S
+
and
1
2
Z
>0
so
that = (x, ). Write
I
a,b
() = I(s
a,b
(x, ))
and
J
a,b
() = J(s
a,b
(x, ))
K-TYPES OF SU(1, n) 391
for a, b Z with 0 a < b n. In particular, I
0,n
() = I(x, ). By Section
4, the set of all I
a,b
() encompass the set of all reducible principal series
representations (Re() > 0) with nonsingular character.
The set of discrete series representations of G may be parameterized as
follows. Let
a
W
G
, 0 a n, be dened by
a
(t
0
, t
1
, . . . t
n
) = (t
a
, t
0
, t
1
. . . t
a1
, t
a+1
, . . . t
n
).
For each from the previous paragraph, write
J
a,a
()
for the discrete series representation with innitesimal character
a
associ-
ated to the G chamber determined by
a
. The set of all J
a,a
() encompass
all discrete series representations.. Dene
a
to be the highest weight of its
lowest K-type. It is easy to verify that
(
a
)
i
=
_
_
_
x
a
+n 2a if i = 0
x
i1
+ 1 if 1 i a
x
i
1 if a + 1 i n
(5.1)
and that the K-types of J
a,a
() are contained in the cone
a
+C
a
where
C
a
= {(t
0
, t
1
, . . . t
n
) | tr(t) = 0, t
1
, . . . t
a
R
0
, t
a+1
, . . . t
n
R
0
}. (5.2)
Using Corollary 1, it easy to explicitly write the K-types for each I
a,b
()
and to check the following.
Corollary 2. Fix with
0
>
1
> . . .
n
and
i
j
Z, 0 i < j n.
Let a, b Z with 0 a < b n. The K-types of I
a,b
() occur with
multiplicity one. I
a,b
() and I
a
,b
() have K-types in common if and only
if (a
, b
) {(a +
1
, b +
2
) |
1
,
2
{0, 1}, 0 a
< b
n}.
In the case n = 2, using only the Langlands classication, the information
about K-types already determined, and a basic embedding result on discrete
series ([9]), it is possible to give the semisimplication of each I
a,b
() and to
deduce the K-types of each J
a,b
(). However, things become too complicated
for this line of reasoning to be sucient for larger n. Thus we use the
following well known description of the composition series of I
a,b
() (see
[11] Proposition 3, 7, [12] Theorem 7.5, [15]).
Theorem 4. Fix with
0
>
1
> . . .
n
and
i
j
Z, 0 i < j n.
Let a, b Z with 0 a < b n. The socle ltration of I
a,b
() is
J
a,b
()
J
a,b1
() J
a+1,b
()
J
a+1,b1
()
where the bottom row does not occur if a + 1 = b.
392 MARK R. SEPANSKI
Combining this theorem with our knowledge of the K-types allows us to
prove:
Lemma 1. Fix with
0
>
1
> . . .
n
and
i
j
Z, 0 i < j n.
Let 0 a b n. Then J
a,b
() and J
a
,b
() have a K-type in common if
and only if a = a
and b = b
.
Proof. Corollary 1 and Equations 5.1 to 5.2 imply that if either a and a
or
b and b
,b
into some I
a
,b
and I
a
,b
where either a
and a
or b
and
b
dier by more than one so that Corollary 1 again says that they have no
K-types in common.
This allows us to determine the K-types for each J
a,b
() (which includes
the discrete series).
Theorem 5. Fix with
0
>
1
> . . .
n
and
i
j
Z, 0 i < j n.
Let a, b Z with 0 a b n. The K-types of J
a,b
() appear with
multiplicity one. Choose the unique x
S
+
and
1
2
Z
>0
such that =
(x, ). The highest weights of the K-types of J
a,b
() are
{(
0
,
1
, . . .
n
) | tr() = 0,
i
x
1
Z for 0 i n,
1
x
0
+ 1 . . .
a
x
a1
+ 1 > x
a
a+1
x
a+1
. . .
b
x
b
> x
b+1
1
b+1
x
b+2
1 . . .
n1
x
n
1
n
}.
This notation includes the natural collapsing of certain . For instance, if
a = b the above inequalities reduce to
1
x
0
+ 1 . . .
a
x
a1
+ 1 > x
a
x
a
> x
a+1
1
a+1
x
a+2
1 . . .
n1
x
n
1 .
Proof. This follows using Lemma 1, Theorem 4, and Corollary 1. For in-
stance, the K-types of J
a,b
for 0 < a b < n are the K-types that occur in
both I
a1,b
and I
a,b+1
. The other cases are handled similarly.
We turn our attention to the reducible I(x, ) with singular character.
They may be parameterized as follows. Fix with
0
>
1
> . . .
c
=
c+1
> . . .
n
, 0 c n 1, and
i
j
Z, 0 i < j n. Choose the
unique x R
n+1
and
1
2
Z
>0
so that = (x, ). Write
I
a,c+1
() = I(s
a,c+1
(x, ))
J
a,c+1
() = J(s
a,c+1
(x, ))
for each 0 a < c and
I
+
c,b
() = I(s
c,b
(x, ))
J
+
c,b
() = J(s
c,b
(x, ))
K-TYPES OF SU(1, n) 393
for each c + 1 < b n.
The set of discrete series representations of G may be parameterized as
follows. Continue with the same from the previous paragraph and recall
the elements
a
W
G
from the discussion of the discrete series. Write
J
c,c+1
()
for the limit of discrete series representation with innitesimal character
corresponding to the chamber determined by
c
. It is immediate that
Equation 5.1 (with a = c) gives the lowest K-type and that Equation 5.2
describes a cone containing all of its K-types. Similarly write
J
+
c,c+1
()
for the limit of discrete series representation with innitesimal character
corresponding to the chamber determined by
c+1
. It is immediate that
Equations 5.1 and 5.2 (with a = c + 1) describe its K-types. The set of all
J
c,c+1
() encompass all the limits of discrete series. Moreover, it is possible
to check that I(s
c,c+1
(x, )) (notice s
c,c+1
() = 0) splits as a direct sum of
J
c,c+1
()
J
+
c,c+1
().
The composition series of I(x, ) is also well know in the singular setting
(see [11] Proposition 3, 7, [12] Theorem 7.5) and closely parallels Theorem
4.
Theorem 6. Fix with
0
>
1
> . . .
c
=
c+1
> . . .
n
, 0 c n 1,
and
i
j
Z, 0 i < j n. For 0 a < c, the socle ltration of
I
a,c+1
() is
J
a,c+1
()
a+1,c+1
().
For c + 1 < b n, the socle ltration of I
+
c,b
() is
J
+
c,b
()
J
+
c,b1
().
As in Theorem 5, Theorem 6 allows us to immediately determine the K-
types for each J
a,c+1
() and J
+
c,b
() (which includes the limits of discrete
series).
Theorem 7. Fix with
0
>
1
> . . .
c
=
c+1
> . . .
n
, 0 c n 1,
and
i
j
Z, 0 i < j n. Choose the unique x R
n+1
and
1
2
Z
>0
394 MARK R. SEPANSKI
so that = (x, ). For 0 a c, the K-types of J
a,c+1
() are
{(
0
,
1
, . . .
n
) | tr() = 0,
i
x
1
Z for 0 i n,
1
x
0
+ 1 . . .
a
x
a1
+ 1 > x
a
a+1
x
a+1
. . .
c
x
c
c+1
x
c+2
1 . . .
n1
x
n
1
n
}.
For c + 1 b n, the K-types of J
+
c,b
() are
{(
0
,
1
, . . .
n
) | tr() = 0,
i
x
1
Z for 0 i n,
1
x
0
+ 1 . . .
c
x
c1
+ 1
c+1
x
c+1
. . .
b
x
b
> x
b+1
1
b+1
x
b+2
1 . . .
n1
x
n
1
n
}.
Corollary 1 and Theorems 5 and 7 give the K-types for all irreducible
representations of G.
6. Restriction of Cohomology.
We begin this section by recalling some notation from [13].
Denition 3. Fix a (g, K) module X and a -stable parabolic subalgebra
q = l +u of g where l is the Levi component and u is the nilradical of q. Let
be the map on cohomology
: H
b
(u, X) H
b
(u k, X)
induced by restricting Hom(
_
b
u, X) Hom(
_
b
u k, X).
Write
+
(k, t) for the positive roots of k corresponding to the choice of
T
+
and write
K
for the half sum these roots.
Denition 4. Fix a (g, K) module X and a -stable parabolic subalgebra
q = l + u of g. Let be the highest weight of an L K representation
appearing in H
b
(uk, X) and choose w W
K
so that w( +
K
) is positive.
Dene
K
= w( +
K
)
K
.
By Kostants Borel-Weil theorem ([10]), V
K
K
appears in X|
K
. We say that
K
is the associated K-type to the L K-type .
Denition 5. Fix a (g, K) module X. Let C be the closed convex hull in it
0
of the set of highest weights of the K-types appearing in X. Given
T
+
a
K-type of X, we say that lies on the geometric edge of the set of K-types
of X if it lies on the boundary of C as a subset of it
0
.
It is hoped that the associated K-types to the L K-types appearing
in the image of describe the K-types lying on geometrical edges (as long
the edge is not completely contained in a Weyl chamber wall of K). Thus
it is hoped that knowledge of the image of completely determines C and
therefore goes a long way towards describing all the K-types.
K-TYPES OF SU(1, n) 395
Denition 6. Fix a (g, K) module, X. If
T
+
, the multiplicity of in
X, m(), is the multiplicity of V
K
in X|
K
. Extend this denition as follows.
For
T with +
K
singular, dene m
e
() = 0. For
T with +
K
nonsingular, there exists a unique w W
K
so that w( +
K
)
K
T
+
.
Dene
m
e
() = (1)
l(w)
m(w( +
K
)
K
)
where l(w) is the length of w in W
K
.
If b is any Lie algebra and b
) for the
set of roots of b with respect to b
.
Denition 7. Fix a (g, K) module X and a -stable parabolic subalgebra
q = l + u of g. Choose any w W
K
so that
w(u k, t)
+
(k, t)
and write
m = min{dim(u p), l(w) + 1}.
Given a K-type
T of X, we say that lies on an algebraic q-edge of the
set of K-types if
m
e
( + 2
A
) = 0
for every nonempty collection A consisting of elements in (up, t) of order
at most m (here 2
A
is the sum of the roots in A).
Theorem 8. Fix a (g, K) module X and a -stable parabolic subalgebra
q = l +u of g. If
T is a K-type of X lying on an algebraic q-edge, then
there is another -stable parabolic and an L K-type in the image of
whose associated K-type is .
Proof. By Theorem 3.4 in [13] (using the notation in Denition 7 above),
: H
l(w)
(wu, X) H
l(w)
(wu k, X)
is surjective on the LK-types = w(+
K
)
K
. Choosing w of minimal
length, we may assume that appears in H
l(w)
(wu k, X) (by Lemma 2.2
in [13]) and therefore that appears in the image of . But since
K
= ,
we are done.
We now apply Theorem 8 to the irreducible representations of G to show
that the geometric edge can be constructed from the image of .
Theorem 9. Let X be an irreducible representation of G = SU(1, n). Then
any K-type of X lying on a geometric edge is the associated K-type to an
L K-type lying in the image of : H
(u, X) H
(u k, X) for some
-stable parabolic subalgebra q = l + u of g.
396 MARK R. SEPANSKI
Proof. The idea of the proof is to use the explicit description of K-types
(Corollary 1 and Theorems 5, 7, and 3) and to show that every K-type
of X lying on a geometric edge lies on an algebraic q-edge for some -
stable parabolic subalgebra q = l + u of g. Theorem 8 then nishes the
proof. Since this is merely a matter of coming up with q and checking the
appropriate denitions, we only give the details in the case of X = J
a,b
()
with
0
>
1
> . . .
n
,
i
j
Z for 0 i < j n, and a, b Z
with 0 a b n. The argument in the other cases is identical. Let
T
+
. Then is a K-type of J
a,b
() if and only if it satises the integrality
condition in Theorem 5 and
i
i
+
i
, 1 i n, where
i
(possibly
equal to ) are identied explicitly in Theorem 5. It is easy to verify that
+
i1
i
. Thus lies on a geometric edge if and only if either
i
=
i
or
i
=
+
i
, for some i, 1 i n. Write E
i
, respectively, for the set of
K-types lying on a geometric edge satisfying either
i
=
i
, respectively.
Let
i
= (0, . . . , 0, 1, 0, . . . 0) with i zeros before the one and let
i,j
=
i
j
be the usual root vector in (g, t). For each edge dene q
i
= l
i
+u
i
to be
the -stable parabolic subalgebra of g generated by
i
. In particular,
(q
, t) = {
r,s
|
i
r,s
0 where 0 r, s n, r = s}
and (u
p, t) = {
0,i
} so that dim(u
is supposed to
describe the direction of E
; i.e., if ,
should
point towards the outside of the K-types; i.e., if E
W
K
be dened by the cyclic permutations w
+
= (1, 2, . . . i)
and w
) = n i. These
elements are chosen so that w
(u
k, t)
+
(k, t). In particular,
+
q
+
is generated by
1
and
is generated by
n
. Let be a K-type in
E
satisfying
+
k1
>
k
, 1 k n, satisfying
k
k
+
k
and either
i
=
i
, and
y = ( r
0,i
) with r > 0. We show there is always some k, 1 k n,
so that y
k
fails to lie between
k
. View as a permutation and consider
the case where (i) = i rst. In this case y
i
=
i
r and therefore fails
to lie between
i
and
+
i
. On the other hand, say (i) = j = i. Then
K-TYPES OF SU(1, n) 397
y
j
=
i
and therefore lies between
i
. However, this makes it impossible
for y
j
to lie between
j
since
i
and
j
are disjoint intervals. This nishes
the proof.
Corollary 3. Let X be any irreducible representation of SU(1, n). Let q
i
=
l
i
+u
i
, i = 1, n, be the two maximal proper -stable parabolic subalgebras of g
generated by
1
and
n
, respectively. Let E be the set of associated K-types
to the L K-types in the images of
: H
(u
i
, X) H
(u
i
k, X).
Then the closed convex hull of the set of highest weights of the K-types of
X is equal to the closed convex hull of E. Moreover, t
is a K-type of
X if and only if lies in the closed convex hull of E and diers from some
element of E by an element of the root lattice of g.
Proof. This follows immediately from the Theorem 9 (noting that all the
parabolics used in the proof were K conjugate to either q
1
or q
n
) and the
explicit description of K-types.
While this is a strong statement, the generalization to other groups cannot
always be as nice. In particular, gaps may appear in the set of K-types.
But in any case, it is still conjectured that the image of is enough to
describe the closed convex hull of the set of K-types.
References
[1] V. Bargmann, Irreducible unitary representations of the Lorentz group, Ann. of Math.,
48 (1947), 568-640.
[2] R. Brylinski and B. Kostant, Minimal representations of E
6
, E
7
, and E
8
and the
generalized Capelli identity, Proc. Nat. Acad. Sci. USA, 91(7) (1994), 2469-2472.
[3] , Minimal representations, geometric quantization, and unitarity, Proc. Nat.
Acad. Sci. USA, 91(13) (1994), 6026-6029.
[4] J. Dixmier, Representations integrables du groupe de De Sitter, Bull. Soc. Math.
France, 89 (1961), 9-41.
[5] H. Hecht and W. Schmid, A proof of Blattners conjecture, Invent. Math., 31 (1975),
129-154.
[6] R. Howe and E. Tan, Homogeneous functions on the light cones: The innitesimal
structure of some degenerate principal series representations, Bull. Amer. Math. Soc.
(N. S.), 28(1) (1993), 1-74.
[7] A. Knapp, Representation Theory of Semisimple Groups: An Overview Based on
Examples, Princeton University Press, Princeton, 1986.
[8] A. Knapp and D. Vogan, Cohomological induction and unitary representations,
Princeton University Press, Princeton, 1995.
[9] A. Knapp and N. Wallach, Szego kernels associated with discrete series, Invent. Math.,
34 (1976), 163-200.
398 MARK R. SEPANSKI
[10] B. Kostant, Lie algebra cohomology and the generalized Borel-Weil theorem, Ann. of
Math., 74(2) (1961), 329-387.
[11] H. Kraljevic, On representations of the group of SU(n, 1), Trans. Amer. Math. Soc.,
221(2) (1976), 433-448.
[12] , On representations of the universal covering group of SU(n, 1), Glas. Mat.
Ser. III, 8 (1973), 23-72.
[13] M. Sepanski, Boundaries of K-types, restriction of cohomology, and the multiplicity
free case, Compositio Math., 106 (1997), 31-41.
[14] D.
Zelebenko, Compact Lie Groups and Their Representations, Amer. Math. Soc.,
Providence, 1973.
[15] R. Zierau, A construction of harmonic forms on U(p +1, q)/U(p, q) U(1), Pacif. J.
Math., 139(2) (1989), 377-399.
Received June 4, 1998 and revised August 19, 1998. The author was partially supported
by the National Science Foundation under Grant DMS 9623280.
Baylor University
Waco, TX 76798-7328
E-mail address: Mark Sepanski@Baylor.edu
PACIFIC JOURNAL OF MATHEMATICS
Vol. 192, No. 2, 2000
VERONESE SUBRINGS AND TIGHT CLOSURE
Anurag K. Singh
We determine when an N-graded ring has Veronese subrings
which are F-rational or F-regular. The results obtained here
give a better understanding of these properties, and include
various techniques of constructing F-rational rings which are
not F-regular.
1. Introduction.
Throughout this paper, all rings are commutative, Noetherian, and have an
identity element. By a graded ring, we mean a ring R =
n0
R
n
, which is
nitely generated over a eld R
0
= K.
The theory of tight closure was developed by Melvin Hochster and Craig
Huneke in [HH1], and has yielded many elegant and powerful results in
commutative algebra and related elds. The theory draws attention to rings
which have the property that all their ideals are tightly closed, called weakly
F-regular rings, and rings with the weaker property that parameter ideals
are tightly closed, called F-rational rings. The term F-regular is reserved
for rings with the property that all their localizations are weakly F-regular.
(The recent work of Lyubeznik and Smith shows that for graded rings the
properties of weak F-regularity and F-regularity are equivalent, see [LS].)
These properties turn out to be of signicant importance, for instance the
Hochster-Roberts theorem of invariant theory that direct summands of poly-
nomial rings are Cohen-Macaulay ([HR1]), can actually be proved for the
much larger class of F-regular rings.
While the property of F-rationality provides an algebraic analogue of the
notion of rational singularities, F-regularity, in general, is not so well under-
stood geometrically. One approach is to study the variety X = Proj R for a
graded F-regular ring R. The Veronese subrings of R are also homogeneous
coordinate rings for X, and so it is interesting to determine when graded
rings have F-rational or F-regular Veronese subrings. The question regarding
F-rational Veronese subrings is easily answered: Let (R, m, K) be a Cohen-
Macaulay graded domain of dimension d, with an isolated singularity at m.
We show that there exists a positive integer n such that the Veronese sub-
ring R
(n)
is F-rational if and only if [H
d
m
(R)]
0
= 0. With regard to F-regular
Veronese subrings, we show that if R is a normal ring generated by degree
one elements over a eld, then either R is F-regular, or else no Veronese
399
400 ANURAG K. SINGH
subring of R is F-regular. This leads us to the question: If (R, m, K) is a
normal graded domain, generated by degree one elements, with an isolated
singularity at m, then under what conditions is R F-regular? It is easily
seen that F-regularity forces the a-invariant, a(R), to be negative. For rings
of dimension two (although not in higher dimensions) this is also a sucient
condition for F-regularity. We construct rings R of dimension d 3 with
a(R) = 2 d which are not F-regular, while if a(R) < 2 d, Smith has
pointed out that Proj R is a variety of minimal degree, and R is indeed F-
regular. We also construct a rich family of F-rational rings of characteristic
zero, with isolated singularities, which have no F-regular Veronese subrings.
We would like to point out that although tight closure is primarily a char-
acteristic p notion, it has strong connections with the study of singularities
of algebraic varieties over elds of characteristic zero. Specically, let R be
a ring essentially of nite type over a eld of characteristic zero. Then R has
rational singularities if and only if it is of F-rational type, see [Ha, Sm3].
In the Q-Gorenstein case, we have some even more remarkable connections:
F-regular type is equivalent to having log-terminal singularities and F-pure
type implies (and is conjectured to be equivalent to) log-canonical singular-
ities, see [Sm5, Wa4].
2. Preliminaries.
Let R be a Noetherian ring of characteristic p > 0. We shall always use the
letter e to denote a variable nonnegative integer, and q to denote the e th
power of p, i.e., q = p
e
. We shall denote by F, the Frobenius endomorphism
of R, and by F
e
, its e th iteration, i.e., F
e
(r) = r
q
. For an ideal I =
(x
1
, . . . , x
n
) R, we let I
[q]
= (x
q
1
, . . . , x
q
n
). Note that F
e
(I)R = I
[q]
, where
q = p
e
, as always. Let S denote the ring R viewed as an R-algebra via F
e
.
Then S
R
is a covariant functor from R-modules to S-modules, and so is
a covariant functor from R-modules to R-modules! If we consider a map of
free modules R
n
R
m
given by the matrix (r
ij
), applying F
e
we get a map
R
n
R
m
given by the matrix (r
q
ij
). For an R-module M, note that the
R-module structure on F
e
(M) is r
(rm) = r
rm, and r
rm = r
r
q
m.
For R-modules N M, we use N
[q]
M
to denote Im(F
e
(N) F
e
(M)).
For a reduced ring R of characteristic p > 0, R
1/q
shall denote the ring
obtained by adjoining all q th roots of elements of R. The ring R is said to
be F-nite if R
1/p
is module-nite over R. Note that a nitely generated
algebra R over a eld K is F-nite if and only if K
1/p
is a nite eld extension
of K.
We shall denote by R
M
, the tight closure
of N in M, if there exists c R
such that cu
q
N
[q]
M
for all q = p
e
0.
It is worth recording this when M = R, and N = I is an ideal of R. An
element x of R is said to be in I
if there exists c R
such that cx
q
I
[q]
for all q = p
e
0. If I
. Furthermore, I
is always contained
in the integral closure of I, and is frequently much smaller. A weakly F-
regular ring is F-rational as well as F-pure. We next record some useful
results.
Theorem 2.2.
(1) Regular rings are F-regular. A ring which is a direct summand of an
F-regular ring is itself F-regular.
(2) An F-rational ring R is normal. If, in addition, R is the homomorphic
image of a Cohen-Macaulay ring, then it is Cohen-Macaulay.
(3) An F-rational Gorenstein ring is F-regular.
(4) Let (R, m) be a reduced excellent local ring of dimension d and char-
acteristic p > 0. If c R
H
d
m
(R)
= 0.
(5) Let R be a graded ring. Then R is weakly F-regular if and only if it is
F-regular.
Proof. For assertions (1)-(3), see [HH2, Theorem 4.2]. Part (4) is a result
of Velez, [Ve], and (5) is [LS, Corollary 4.4].
Remark 2.3. The equivalence of weak F-regularity and F-regularity, in
general, is a formidable open question. However in the light of Theorem
2.2 (5) above, we frequently have no reason to distinguish between these
notions.
By a graded ring (R, m, K), we shall always mean a ring R =
n0
R
n
nitely generated over a eld R
0
= K. We shall denote by m = R
+
, the
homogeneous maximal ideal of R. The punctured spectrum of R refers to the
set Spec R{m}. By a system of parameters for R, we shall mean a sequence
of homogeneous elements of R whose images form a system of parameters for
R
m
. In specic examples involving homomorphic images of polynomial rings,
402 ANURAG K. SINGH
lower case letters shall denote the images of the corresponding variables, the
variables being denoted by upper case letters.
For conventions regarding graded modules and homomorphisms, we follow
[GW]. For a graded R-module M, we shall denote by [M]
i
, the i-th graded
piece of M.
Denition 2.4. Let R =
i0
R
i
be a graded ring, and n be a positive
integer. We shall denote by R
(n)
, the Veronese subring of R spanned by all
elements of R which have degree a multiple of n, i.e., R
(n)
=
i0
R
in
.
Note that the ring R
(n)
is a direct summand of R as an R
(n)
-module
and that R is integral over R
(n)
. Hence whenever R is Cohen-Macaulay or
normal, so is R
(n)
. We record the following result, see [EGA, Lemme 2.1.6]
or [Mu, page 282] for a proof.
Lemma 2.5. Let R be a graded ring. Then there exists a positive integer n
such that the Veronese subring R
(n)
is generated over K by forms of equal
degree.
Recall that the highest local cohomology module H
d
m
(R) of R, where
dimR = d, may be identied with lim
R/(x
t
1
, . . . , x
t
d
) where x
1
, . . . , x
d
is a
system of parameters for R and the maps are induced by multiplication by
x
1
x
d
. If R is Cohen-Macaulay, these maps are injective. The R-module
H
d
m
(R) carries a natural graded structure, namely deg[r + (x
t
1
, . . . , x
t
d
)] =
deg r t
d
i=1
x
i
, where r and x
i
are homogeneous elements of R.
Denition 2.6. In the above setting, Goto and Watanabe dene the a-
invariant of R as the highest integer a(R) = a such that [H
d
m
(R)]
a
is
nonzero.
When R is a ring of characteristic p, the Frobenius homomorphism of R
gives a natural Frobenius action on H
d
m
(R) where
F : [r + (x
t
1
, . . . , x
t
d
)] [r
p
+ (x
pt
1
, . . . , x
pt
d
)], see [FW] or [Sm2].
For a graded R-module M, dene M
(n)
=
iZ
[M]
in
. With this notation,
it follows from [GW, Theorem 3.1.1] that
H
d
m
R
(n)
(R
(n)
)
= (H
d
m
(R))
(n)
.
The following theorem, [HH3, Theorem 7.12], indicates the importance
of the a-invariant in the study of graded F-rational rings.
Theorem 2.7. A graded Cohen-Macaulay normal ring R over a eld of
prime characteristic p is F-rational if and only if a(R) < 0 and the ideal
generated by some homogeneous system of parameters for R is Frobenius
closed.
VERONESE SUBRINGS AND TIGHT CLOSURE 403
3. F-rationality of Veronese subrings.
The following proposition, well-known to the experts, addresses the existence
of F-rational Veronese subrings.
Proposition 3.1. Let R be a graded Cohen-Macaulay domain of dimension
d, which is locally F-rational on the punctured spectrum Spec Rm. (This is
satised, in particular, if R has an isolated singularity.) Then [H
d
m
(R)]
0
= 0
if and only if the Veronese subring R
(n)
is F-rational for all integers n 0.
In particular if a(R) < 0, then R
(n)
is F-rational for all integers n 0.
Proof. Note that we have [H
d
m
(R)]
0
0
H
d
m
(R)
, since for z [H
d
m
(R)]
0
we
get cz
q
= 0 for all q = p
e
, when c m is of a suciently large degree.
Consequently if R
(n)
is F-rational for some n, we must have a(R
(n)
) < 0,
but then [H
d
m
(R)]
0
= 0.
For the converse rst note that since R is F-rational on the punctured
spectrum, Theorem 2.2 (4) says that 0
H
d
m
(R)
must be killed by a power of
the maximal ideal m, and so is of nite length. As [H
d
m
(R)]
0
= 0, for large
positive integers n we see that H
d
m
(R
(n)
)
= (H
d
m
(R))
(n)
contains no nonzero
element of 0
H
d
m
(R)
where m
H
d
m
(R
(n)
)
then u 0
H
d
m
(R)
H
d
m
(R
(n)
) and so u = 0. Hence R
(n)
is
F-rational for n 0.
Example 3.2. Let R = K[X, Y, Z]/(X
2
+ Y
3
+ Z
5
) where K is a eld of
prime characteristic p. We make this a graded ring by setting the weights
of x, y and z to be 15, 10 and 6 respectively. We determine the positive
integers n for which the Veronese subring R
(n)
is F-rational. This shall, of
course, depend on the characteristic p of R.
First note that a(R) = 1 with this grading. If p 7, it is easy to
verify that the ring R is F-regular. Consequently every Veronese subring
of R, being a direct summand of R, is also F-regular. For p = 2, 3 or
5, x
p
(y
p
, z
p
), and so R is not F-rational. It is also easily checked that
the action of the Frobenius on H
2
m
(R) is injective in degree 2 with the
one exception of p = 2 where elements in degree 7 are mapped to zero
under the action of the Frobenius, specically F(xy
1
z
2
) = 0 in H
2
m
(R).
Recall that H
2
m
R
(n)
(R
(n)
) is generated by elements of H
2
m
(R) whose degree
is a multiple of n. Consequently for n 2 the action of the Frobenius on
H
2
m
R
(n)
(R
(n)
) is injective, with the one exception. Using the arguments in
the proof of the above proposition, we see that R
(n)
is F-rational for all
n 2, excluding the case when p = 2 and n = 7.
404 ANURAG K. SINGH
4. Rational coecient Weil divisors.
We review some notation and results from [De], [Wa1] and [Wa3], as well
as make a few observations which we shall nd useful later in our study.
Denition 4.1. By a rational coecient Weil divisor (or a Q-divisor) on a
normal projective variety X, we mean a Q-linear combination of codimension
one irreducible subvarieties of X. For D =
n
i
V
i
with n
i
Q, we set
[D] =
[n
i
]V
i
, where [n] denotes the greatest integer less than or equal to
n, and dene O
X
(D) = O
X
([D]).
Let D =
(p
i
/q
i
)V
i
where the integers p
i
and q
i
are relatively prime and
q
i
> 0. We dene D
=
((q
i
1)/q
i
)V
i
to be the fractional part of D.
Note that with this denition of D
] for any
integer n.
Given an ample Q-divisor D (i.e., such that ND is an ample Cartier
divisor for some N N), we construct the generalized section ring:
R = R(X, D) =
n0
H
0
(X, O
X
(nD))T
n
K(X)[T].
With this notation, Demazures result ([De, 3.5]) is:
Theorem 4.2. Let R =
n0
R
n
be a graded normal ring. Then there exists
an ample Q-divisor D on X = Proj R such that
R =
n0
H
0
(X, O
X
(nD))T
n
K(X)[T],
where T is a homogeneous element of degree one in the quotient eld of R.
Example 4.3. Take the Q-divisor
D = (1/2)V (S) + (1/3)V (T) + (1/5)V (S +T)
on P
1
= Proj K[S, T] where V (S), e.g., denotes the irreducible subvariety
dened by the vanishing of S. Fix T as the degree one element. Then
R =
n0
H
0
(P
1
, O
P
1(nD))T
n
= K[X, Y, Z]/(X
2
+Y
3
+Z
5
), where
X = (S
8
T
10
)/(S +T)
3
, Y = (S
5
T
7
)/(S +T)
2
, and Z = (S
3
T
4
)/(S +T).
Remark 4.4. Let R = R(X, D) be as above. Then the Veronese subring
R
(n)
is given by R
(n)
= R(X, nD). For a rational function f K(X) we
have an isomorphism R(X, D)
= R(X, div(f) + D). If R is generated over
K by its elements of degree one, we have R = R(X, [D]). Note that [D] is a
Weil divisor, i.e., has integer coecients.
5. Results in dimension two.
In the following theorem, we summarize some familiar results about graded
rings of dimension two.
VERONESE SUBRINGS AND TIGHT CLOSURE 405
Theorem 5.1. Let R be a graded normal ring of dimension two, which is
generated by degree one elements over an algebraically closed eld. Then the
following statements are equivalent:
(1) R is isomorphic to a Veronese subring of a polynomial ring in two
variables.
(2) R is F-regular.
(3) R is F-rational.
(4) R has a negative a-invariant.
Proof. The implications (1) (2) (3) (4) follow easily. For (4) (1)
note that X = Proj R is a nonsingular projective curve. Since [H
2
m
(R)]
0
= 0,
we have H
1
(X, O
X
) = 0 and so X is of genus zero, i.e., P
1
. Consequently
R
= R(P
1
, D) where D is a Weil divisor on P
1
. Hence D is linearly equivalent
to O(m) for some m N and R
= R(P
1
, O(m))
= (K[X
0
, X
1
])
(m)
.
As an easy consequence of the above, we have:
Theorem 5.2. Let R be a graded domain of dimension two, with an isolated
singularity, which is nitely generated over an algebraically closed eld. If
a(R) < 0, there exists a positive integer n such that R
(n)
is isomorphic to a
Veronese subring of a polynomial ring in two variables over K. In particular,
some Veronese subring of R is F-regular.
Proof. Note that R is excellent and so R
) is primary to the
maximal ideal of R
, by which R
i
= R
i
for all i 0. We may therefore
choose a positive integer k such that R
(k)
is normal, and then choose an
appropriate multiple n of k, by Lemma 2.5, such that R
(n)
is generated
by elements of equal degree. We are now in a position to apply the above
theorem to conclude that R
(n)
is isomorphic to a Veronese subring of a
polynomial ring in two variables.
Example 5.3. Let S = K[X, Y, Z]/(X
3
Y Z(Y +Z)) where K is a eld
of characteristic p 1 (mod 3) and consider the subring
R = K[X, Y
3
, Y
2
Z, Y Z
2
, Z
3
]/(X
3
Y Z(Y +Z)).
It is proved in [HH3] that R is F-rational but not F-regular, see also [Wa3].
Since R
(3)
is generated by elements of equal degree, it must be isomorphic
to a Veronese subring of a polynomial ring by Theorem 5.1. Indeed,
R
(3)
= K[Y
3
, Y
2
Z, Y Z
2
, Z
3
].
Example 5.4. Let R = K[t, t
4
x, t
4
x
1
, t
4
(x + 1)
1
] where K is a eld of
prime characteristic p. This is one of the examples in [Wa2] of rings which
406 ANURAG K. SINGH
are F-rational but not F-pure; for a dierent proof see [HH3]. By mapping
a polynomial ring onto it, we may write R as
R = K[T, U, V, W]/(T
8
UV, T
4
(V W) V W, U(V W) T
4
W).
This is graded by setting the weights of t, u, v and w to be 1, 4, 4 and 4
respectively. Note that
R
(4)
= K[S, U, V, W]/(S
2
UV, S(V W) V W, U(V W) SW)
where we relabel T
4
as S. Then R
(4)
is generated by elements of equal degree,
and is isomorphic to K[X
3
, X
2
Y, XY
2
, Y
3
] by setting S = XY (X Y ),
U = XY
2
, V = X(X Y )
2
, and W = Y (X Y )
2
.
By Theorem 5.2 we know that a graded normal ring R of dimension two
over an algebraically closed eld has a Veronese subring R
(n)
which is F-
regular. We next show that if R is a hypersurface, there exists n such that
R
(n)
is actually an F-regular hypersurface.
Theorem 5.5. Let R be a graded normal hypersurface of dimension two
with a(R) < 0. Then there exists a positive integer n such that the Veronese
subring R
(n)
is an F-regular hypersurface.
Proof. Let R = K[X, Y, Z]/(f) where x, y and z have weights m, n and
r respectively. We may assume without any loss of generality that m, n
and r have no common factor. If d = gcd(m, n), then by our assumption d
and r are relatively prime. Therefore f must be a polynomial in x, y and
z
d
. Consequently R
(n)
is again a hypersurface, and satises all the initial
hypotheses, and so we may assume that R satises the extra hypothesis that
m, n and r are pairwise relatively prime. Assume further that m n r.
We consider the two cases: a) n = 1 and r = 1, and b) m > n > r. Note
that it suces to show that R is F-rational, since it is indeed a hypersurface.
We rst eliminate the case (#) when f is of the form XH(Y, Z)+G(Y, Z).
We may take a system of parameters of R of the form x, t where t is the
image in R of a polynomial T K[X, Y, Z] involving only Y and Z. If R is
not F-rational, then since a(R) < 0, (x, t) cannot be F-pure. Hence for some
q = p
e
, we have s
q
(x
q
, t
q
) while s / (x, t). Again, we may assume that s
is the image in R of a polynomial S K[X, Y, Z] involving only Y and Z.
This means that in K[X, Y, Z], we have S
q
(X
q
, T
q
, XH + G) but then
S
q
(T
q
, G
q
) and so S (T, G) in K[X, Y, Z], giving us the contradiction
s (x, t).
a) We have a(R) = deg f (m+n+r) < 0, and so deg f < m+2 since
n = r = 1. This forces f to be of the form (#).
b) Since a(R) = deg f (m+n+r) < 0, we have deg f < m+n+r < 3m.
Hence up to a scalar multiple, f is of the form XH(Y, Z) + G(Y, Z) or
X
2
+G(Y, Z). Note that the rst case has already been handled.
VERONESE SUBRINGS AND TIGHT CLOSURE 407
Now suppose f = X
2
+ G(Y, Z). Then deg f = 2m < m + n + r and so
3 < m < n + r, consequently G cannot involve a term of the form Y
2
Z
l
where l 2. If G has a term Y
k
, then 2m = kn and so n = 1 or 2. Since
n > r, we can only have n = 2 and r = 1, but this too is impossible. Hence
f can only be of the form f = X
2
+ aZ
k
+ bY Z
l
+ cY
2
Z where a, b and c
are scalars. R is normal, and so c must be non-zero since l 2 and k 2.
It follows that 2m = 2n + r. If a is non-zero, 2m = rk and since r is even,
we can only have r = 2. But then m = n + 1, and so r divides either m
or n, a contradiction. Hence a = 0, and so f = X
2
+ bY Z
l
+ cY
2
Z. If b
were non-zero, then we would have n +rl = 2n +r, i.e., n = r(l 1), which
forces r = 1. However we know r to be even, and so b = 0. We are left with
f = X
2
+cY
2
Z but this is ruled out since R is normal.
6. F-regular Veronese subrings.
We begin by recalling a theorem of Watanabe, [Wa3, Theorem 3.4]:
Theorem 6.1. Let D
1
and D
2
be ample Q-divisors on a normal projec-
tive variety X. If the fractional parts D
1
and D
2
are equal, then the ring
R(X, D
1
) is F-regular (F-pure) if and only if the ring R(X, D
2
) is F-regular
(F-pure).
A complete proof of the theorem, as stated above, relies on the char-
acterization of strong F-regularity in terms of the tight closure of the zero
submodule of the injective hull of the residue eld, [Sm1, Proposition 7.1.2],
as well as the results of [LS].
Corollary 6.2. Let R be a graded normal ring which is generated by degree
one elements over a eld. Then either R is F-regular (F-pure), or else no
Veronese subring of R is F-regular (F-pure).
Proof. Since R is generated by its elements of degree one, we have R =
R(X, D), where D is a Weil divisor, i.e., has D
= 0. Also, (nD)
= 0 where
n is any positive integer. By the above Theorem, R = R(X, D) is F-regular
(F-pure) if and only if R
(n)
= R(X, nD) is F-regular (F-pure).
As an application of this result, we now construct a family of rings with
negative a-invariants, which have no F-pure Veronese subrings. This shows
that a result corresponding to Theorem 5.2 is no longer true in higher di-
mensions.
Example 6.3. Let R = K[X
0
, . . . , X
d
]/(X
3
0
+ +X
3
d
) with d 3, where
K is a eld of characteristic 2. It is readily seen that x
2
0
(x
1
, . . . , x
d
)
,
since x
4
0
(x
1
, . . . , x
d
)
[2]
. Hence R is not F-pure, and since it is generated
by elements of degree one, Corollary 6.2 shows that R has no F-regular or
F-pure Veronese subrings. Note that a(R) = 2 d < 0.
408 ANURAG K. SINGH
We can also see that R
(n)
is not F-pure (for any n > 0) by showing that
the element x
d
0
(x
1
x
d
)
n1
is in the Frobenius closure of the ideal
x
d2
0
x
n
1
x
n1
2
x
n1
d1
, x
d2
0
x
n
2
x
n1
3
x
n1
d
, . . . , x
d2
0
x
n
d
x
n1
1
x
n1
d2
,
although not in the ideal itself.
For all n 2, the ring R
(n)
is an example of a graded ring generated by
degree one elements (with an isolated singularity and a negative a-invariant)
which is F-rational but not F-pure.
Remark 6.4. The examples above are not completely satisfactory as they
are not valid in the characteristic zero setting: in fact, for d 3, the ring
R = Q[X
0
, . . . , X
d
]/(X
3
0
+ +X
3
d
) is of F-regular type. Characteristic zero
examples turn out to be much more subtle, and we construct these in the
next section.
We again return to the ring R = K[X, Y, Z]/(X
2
+ Y
3
+ Z
5
), and this
time determine its F-regular and F-pure Veronese subrings.
Example 6.5. Let R = K[X, Y, Z]/(X
2
+ Y
3
+ Z
5
) where K is a eld of
prime characteristic p, and the grading is as before. For p 7 the ring R is
F-regular, and therefore so is any Veronese subring R
(n)
. We now determine
when R
(n)
is F-regular assuming p is either 2, 3 or 5.
Note that the Veronese subrings R
(2)
, R
(3)
and R
(5)
are in fact polynomial
rings. Therefore when n is divisible by one of 2, 3 or 5, R
(n)
is a direct
summand of a polynomial ring, and so is F-regular. We show that these are
the only instances when R
(n)
is F-regular, or even F-pure.
Recall from Example 4.3 that R = R(X, D) where X = Proj K[S, T] and
D = (1/2)V (S)+(1/3)V (T)+(1/5)V (S+T). If n is relatively prime to 30,
the Q-divisor nD has the same fractional part as D, and so R
(n)
= R(X, nD)
is not F-pure or F-regular by Theorem 6.1.
We can also construct explicit instances of Frobenius closure to illustrate
why R
(n)
is not F-pure when n is relatively prime to 30. Since n is relatively
prime to the weight of y, the ring R
(n)
has a unique monomial of the form
xy
l
with 0 < l < n. Similarly there is a unique integer m with 0 m < n
such that y
l+1
z
m
R
(n)
, and a unique integer r with 0 < r < n such that
x
r
z R
(n)
. We claim that
x
r+1
y
rl+l
z
d
(x
r
y
rl+l+1
z
m
, x
r
z
d+1
)
F
, and
x
r+1
y
rl+l
z
d
/ (x
r
y
rl+l+1
z
m
, x
r
z
d+1
).
The second statement is true in R and so also in R
(n)
, while the rst assertion
follows from
(x
r+1
y
rl+l
z
d
)
p
((x
r
y
rl+l+1
z
m
)
p
, (x
r
z
d+1
)
p
) for p = 2, 3 or 5.
VERONESE SUBRINGS AND TIGHT CLOSURE 409
Example 6.6. We saw that the F-purity and F-regularity of a ring R =
R(X, D) depend only on the fractional part D
A
K. Note that the
bers of the homomorphism A R
A
over maximal ideals of A are nitely
generated algebras over elds of prime characteristic.
410 ANURAG K. SINGH
Denition 7.1. Let R be a ring nitely generated over a eld of charac-
teristic zero. Then R is said to be of F-regular type if there exists a nitely
generated Z-algebra A K and a nitely generated A-algebra R
A
such
that R
= R
A
A
K, and for all maximal ideals in a Zariski dense subset
of Spec A, the ber rings R
A
A
A/ are F-regular.
Similarly, R is said to be of F-pure type if for all maximal ideals in a
Zariski dense subset of Spec A, the ber rings R
A
A
A/ are F-pure.
Remark 7.2. Some authors use the term F-pure type (F-regular type) to
mean that R
A
A
A/ is F-pure (F-regular) for all maximal ideals in a
Zariski dense open subset of Spec A.
All our positive results towards the existence of F-rational and F-regular
Veronese subrings in prime characteristic do have corresponding statements
in the characteristic zero situation. However we have so far not exhibited
a normal Cohen-Macaulay ring, generated by degree one elements over a
eld of characteristic zero, which has an isolated singularity and a negative
a-invariant but is not of F-regular type. N. Hara has pointed out to us a
geometric argument for the existence of such rings using a blow-up of P
2
at
nine points. In this section, we construct a large family of explicit examples
of such rings of dimension d 3.
Example 7.3. Take two relatively prime homogeneous polynomials F and
G of degree d in the ring Z[X
1
, . . . , X
k
], where k 3, such that G is monic in
X
k
and the monomial X
d
k
does not occur in F. Using F and G, construct the
hypersurface S = Q[S, T, X
1
, . . . , X
k
]/(SF TG) and let R be the subring
of S generated by the elements sx
1
, . . . , sx
k
, tx
1
, . . . , tx
k
.
For suitably general choices of the polynomials F and G of degree d = k
the ring R has only isolated singularities, and we show that it is Cohen-
Macaulay with a(R) = 1, and is not of F-regular type. For an explicit
example, take k = 3, F = X
1
X
2
X
3
and G = X
3
1
+X
3
2
+X
3
3
.
We shall prove that R is Cohen-Macaulay whenever d k. We rst
show that the Hilbert polynomial multiplicity of R is d(k 1) +1, and then
construct a system of parameters such that the ring obtained by killing this
system of parameters has length d(k 1) + 1.
We construct a basis for the vector space generated by the monomials
of degree n 0, s
i
t
ni
x
j
1
1
x
j
2
2
x
j
k
k
, where the j
r
are nonnegative integers
which add up to n. The relations permit us to express tx
d
k
in terms of other
monomials. Let [u
1
, . . . , u
m
]
i
denote the set S of monomials of degree i in
u
1
, . . . , u
m
, and for two such sets, let S T denote the product of all possible
pairs from S and T . In this notation, for n 0, the following monomials
VERONESE SUBRINGS AND TIGHT CLOSURE 411
constitute a basis for R
n
:
[s, t]
n
[x
1
, . . . , x
k1
]
n
,
[s, t]
n
[x
1
, . . . , x
k1
]
n1
[x
k
],
. . .
[s, t]
n
[x
1
, . . . , x
k1
]
nd+1
[x
k
]
d1
,
[s]
n
[x
1
, . . . , x
k
]
nd
[x
k
]
d
.
Consequently for large n the vector space dimension of R
n
is
(n + 1)
n +k 2
k 2
+ +
n d + 1 +k 2
k 2
n d +k 1
k 1
.
As a polynomial in n, the leading term of this expression is
n
n
k2
(k 2)!
+ +
n
k2
(k 2)!
+
n
k1
(k 1)!
=
n
k1
(d(k 1) + 1)
(k 1)!
,
and so the Hilbert polynomial multiplicity of R is d(k 1) + 1.
The sequence of elements sx
1
, sx
2
tx
1
, sx
3
tx
2
, . . . , sx
k
tx
k1
is a system of parameters for R. Since we have already veried that the
Hilbert polynomial multiplicity of R is d(k 1) + 1, to prove that R is
Cohen-Macaulay when d k, it suces to show that the length of the ring
T obtained by killing this system of parameters is at most d(k 1) + 1.
Relabel the generators of T as a
2
= sx
2
, a
3
= sx
3
, . . . , a
k
= sx
k
, a
k+1
=
tx
k
. Note that the relations amongst the a
i
include the size two minors of
the matrix
0 a
2
. . . a
k1
a
k
a
2
a
3
. . . a
k
a
k+1
.
Consequently a generating set for [T]
<d
is given by
deg 0 : 1,
deg 1 : a
2
, a
3
, . . . , a
k+1
,
deg 2 : a
2
a
k+1
, a
3
a
k+1
, . . . , a
2
k+1
,
deg 3 : a
2
a
2
k+1
, a
3
a
2
k+1
, . . . , a
3
k+1
,
. . .
deg d 1 : a
2
a
d2
k+1
, a
3
a
d2
k+1
, . . . , a
d1
k+1
.
In degree d the ring T has d additional independent relations coming from
the equations s
i
t
di
f s
i1
t
di+1
g, for 1 i d. Consequently we need
k d generators for the degree d piece of T, and one can check that there
are no nonzero elements in degree d + 1. Hence the length of T is bounded
by d(k 1) + 1, and this completes the proof that R is Cohen-Macaulay.
It only remains to show that R is not of F-regular type when k d.
Consider the ber A of the map Z R
Z
over an arbitrary closed point
412 ANURAG K. SINGH
pZ. Then A is a nitely generated algebra over the nite eld Z/pZ, and it
suces to show that A is not F-regular. Take the ideal
I = (sx
1
, sx
2
, . . . , sx
k1
, tx
1
, tx
2
, . . . , tx
k1
)A.
It is easily veried that (tx
k
)
d1
/ I, and we show that (tx
k
)
d1
I
.
To see (tx
k
)
d1
I
r=1
j
r
= q(d 1).
If
q
/ I
[q]
, then j
r
< q for 1 r k 1. However on summing these
inequalities we get q(d 1) < q(k 1), a contradiction.
Remark 7.4. Consider the polynomial ring K[X
1
, . . . , X
k
] where k 3.
It is worth noting that the ring R, as above, is isomorphic to a subring of
K[X
1
, . . . , X
k
],
R = K[X
1
F, X
2
F, . . . , X
k
F, X
1
G, X
2
G, . . . , X
k
G].
We can show that R is Cohen-Macaulay precisely when the degree d of F
and G is less than or equal to k. It would certainly be interesting to explore
generalizations of this construction.
Acknowledgments. The author wishes to thank Melvin Hochster and
Karen Smith for many valuable discussions.
References
[De] M. Demazure, Anneaux gradues normaux, in: Le D ung Trang (ed.) Introduction
a la theorie des singularities II, Hermann, Paris, (1988), 35-68.
[EH] D. Eisenbud and J. Harris, On varieties of minimal degree (A centennial account),
Proc. Sympos. Pure Math., 46 (1987), 3-13.
[FW] R. Fedder and K.-i. Watanabe, A characterization of F-regularity in terms of F-
purity, in: Commutative Algebra, Proc. Microprogram, June 15-July 12, 1987;
Math. Sci. Res. Inst. Publ., 15, Springer-Verlag, New York, Berlin, Heidelberg,
London, Paris, Tokyo, (1989), 227-245.
[GW] S. Goto and K.-i. Watanabe, On graded rings, I, J. Math. Soc. Japan, 30 (1978),
179-213.
[EGA] A. Grothendieck and J. Dieudonne,
Elements de geometrie algebrique; II
Etude
globale elementaire de quelques classes de morphismes, Inst. Hautes
Etudes Sci.
Publ. Math., 8 (1961), 5-222.
[Ha] N. Hara, A characterisation of rational singularities in terms of injectivity of Frobe-
nius maps, Amer. J. Math., 120 (1998), 981-996.
VERONESE SUBRINGS AND TIGHT CLOSURE 413
[HH1] M. Hochster and C. Huneke, Tight closure, invariant theory, and the Briancon-
Skoda theorem, J. Amer. Math. Soc., 3 (1990), 31-116.
[HH2] , F-regularity, test elements, and smooth base change, Trans. Amer. Math.
Soc., 346 (1994), 1-62.
[HH3] , Tight closure of parameter ideals and splitting in module-nite extensions,
J. Algebraic Geom., 3 (1994), 599-670.
[HH4] , Tight closure in equal characteristic zero, in preparation.
[HR1] M. Hochster and J. Roberts, Rings of invariants of reductive groups acting on
regular rings are Cohen-Macaulay, Adv. Math., 13 (1974), 115-175.
[HR2] , The purity of the Frobenius and local cohomology, Adv. Math., 21 (1976),
117-172.
[LS] G. Lyubeznik and K.E. Smith, Strong and weak F-regularity are equivalent for
graded rings, preprint.
[Mu] D. Mumford, The red book of varieties and schemes, Lecture notes in Math., 1358,
Springer-Verlag, Berlin, New York, 1988.
[Sm1] K.E. Smith, Tight closure of parameter ideals and F-rationality, Thesis, Univ. of
Michigan, 1993.
[Sm2] , Tight closure of parameter ideals, Invent. Math., 115 (1994), 41-60.
[Sm3] , F-rational rings have rational singularities, Amer. J. Math., 119 (1997),
159-180.
[Sm4] , Tight closure in graded rings, J. Math. Kyoto Univ., 37 (1997), 35-53.
[Sm5] , Vanishing theorems, singularities, and eective bounds in algebraic geom-
etry via prime characteristic local algebra, in: Algebraic geometry Santa Cruz,
1995, J. Kollar, R. Lazarsfeld and David R. Morrison (eds.) Proc. Sympos. Pure
Math., 62 (1997), 289-325.
[Ve] J.D. Velez, Openness of the F-rational locus and smooth base change, J. Algebra,
172 (1995), 425-453.
[Wa1] K.-i. Watanabe, Some remarks concerning Demazures construction of normal
graded rings, Nagoya Math. J., 83 (1981), 203-211.
[Wa2] , Study of F-purity in dimension two, in: Algebraic Geometry and Commu-
tative Algebra in honor of Masayoshi Nagata, Vol. II, Kinokuniya, Tokyo, (1988),
791-800.
[Wa3] , F-regular and F-pure normal grade rings, J. Pure Appl. Algebra, 71
(1991), 341-350.
[Wa4] , F-purity and F-regularity vs. log-canonical singularities, preprint.
Received June 23, 1998 and revised October 23, 1998.
University of Utah
Salt Lake City, UT 84112-0090
E-mail address: singh@math.utah.edu
PACIFIC JOURNAL OF MATHEMATICS
Vol. 192, No. 2, 2000
RATIONAL CURVES ON A COMPLETE INTERSECTION
CALABIYAU VARIETY IN P
3
P
3
Dag Einar Sommervoll
We study rational curves on the Tian-Yau complete inter-
section CalabiYau threefold (CICY) in P
3
P
3
. Existence
of positive dimensional families of nonsingular rational curves
is proved for every degree 4. The number of nonsingu-
lar rational curves of degree 1, 2, 3 on a general TianYau
CICY is nite and enumerated. The number of curves of
these degrees are also enumerated for the special TianYau
CICY. There are two 1-dimensional families of singular ratio-
nal curves of degree 3 on a general TianYau CICY, making
this degree a turning point between nite and innite num-
ber of curves. We also introduce a notion of equivalence of a
family of rational curves, and determine the equivalences of
the two 1-dimensional families on the TianYau CICY. The
equivalences equal the predicted numbers of curves obtained
by a power series expansion of the solution of a Picard-Fuchs
equation that arises in superconformal eld theory.
1. Introduction and basic denitions.
In the 1980s the physicists started considering supersymmetric theories for
a 10-dimensional universe. In these theories a CalabiYau threefold is at-
tached to every point of the Minkowski time-space. Moreover, certain invari-
ants of this CalabiYau threefold are linked to observables in our universe.
For example, the number of generations of elementary particles is 3 (elec-
tron, muon, tauon) in our universe. The superstring theory yields that the
absolute value of the Euler number of the manifold must be twice the number
of generations. So physicists were hoping to nd relatively easy examples of
manifolds with Euler number 6. The rst example was found by G. Tian
and S.-T. Yau ([23], [24]). Their starting point was the following complete
intersection CalabiYau threefold in P
3
P
3
:
X = Z
x
3
i
,
x
i
y
i
,
y
3
i
.
We shall call this the special TianYau CICY. This variety has Euler number
18. Furthermore, this variety allows a free action by a group G of order 3.
Hence, the quotient variety X/G is CY with Euler number 6.
415
416 DAG EINAR SOMMERVOLL
More generally, a TianYau CICY is dened by:
X = Z(f
1
, f
2
, g) P
3
1
P
3
2
where f
1
, f
2
, g are polynomials of bidegrees (3, 0), (0, 3) and (1, 1) respec-
tively, and such that X is nonsingular. By a general TianYau variety we
shall mean a generic choice of the polynomials f
1
, f
2
, g. We introduce the
following notation: F
1
(resp. F
2
) is the cubic surface in P
3
1
(resp. P
3
2
) dened
by the polynomial of bidegree (3, 0) (resp. (0, 3)). The variety G is dened
by the polynomial of bidegree (1, 1). In other words:
X = F
1
F
2
G.
All deformations of a general TianYau variety as an abstract variety, are
realisable as polynomial deformations of the dening equations. The family
of TianYau varieties in P
3
1
P
3
2
is complete in this sense.
After the example of G. Tian and S.-T. Yau was found, an intensive
search for more examples of complete intersections in multiprojective space
was undertaken ([2], [3], [8], [9]). It is proven that no complete intersection
CalabiYau threefold X in multiprojective space would have |(X)| = 6.
As in the example given by G. Tian and S.-T. Yau one may look for some
group acting freely on the variety. Such groups are of course hard to nd
and in many cases it is quite easy to prove that such groups cannot exist. In
fact, starting with the list of all the CICY threefold types (approximately
10,000) in multiprojective space, the nal result was that there were at most
3 types (including the TianYau CICY type) ([3]) that possibly could have
a free action by a group acting of the desired cardinality. Moreover, to the
present day a such group has only been found in the TianYau case.
The quintic in P
4
is the most studied CalabiYau threefold. Clemens
conjectured that there are only nitely many smooth rational curves on a
general quintic threefold for every degree. This has been proven for degrees
less than 10 ([13], [12]). The numbers have been computed for degrees less
than or equal to 4 using algebro-geometric techniques ([13], [6], [15]). Using
conformal eld theory, one is able to predict the number of rational curves
of every degree. More precisely, the predicted numbers appear in a power
series expansion of the solution of a Picard-Fuchs equation. In the case of
the quintic in P
4
, the power series looks like:
F(q) = 5 +
d
n
d
d
3
q
d
1 q
d
,
where n
d
is the (conjectured) number of rational curves of degree d. This
expansion rst appeared in ([4]), and started o a new branch of mathemat-
ics trying to understand the mathematical implications of mirror symmetry.
Recently progress has been made in achiving this goal ([7], [16]). For a
further discussion and a more complete reference list, see ([17]).
CURVES ON A CALABI-YAU VARIETY 417
In general it is hard to determine when the number of curves of a given de-
gree is nite. We address this question in case of a general TianYau CICY.
Every rational curve on a TianYau variety has a bidegree, and a degree
via the Segre embedding in P
15
. Furthermore the Hilbert scheme Hilb
dm+1
X
has a natural partition in open-closed disjoint subschemes Hilb
(i,j)m+1
X
with
i +j = d.
Our main result is the following:
Theorem 1.1. Let X be a general TianYau CICY. Let m > 3 be an in-
teger, and i {0, 1, 2, 3}. Then there exist positive dimensional families of
nonsingular rational curves of bidegree (m, mi).
There exist also positive dimensional families of nonsingular rational
curves of bidegrees (2, 2), (3, 3), (3, 2).
This has the following corollary:
Corollary 1.2. There exist positive dimensional families of nonsingular ra-
tional curves on a general TianYau CICY for every degree n, n 4.
However, this abundance of curves for innitely many bidegrees, does not
extend to all bidegrees, on the contrary, there are innitely many bidegrees
that do not allow any rational curves at all:
Theorem 1.3. There are no curves of bidegree (m, 1) or (m, 0) on a general
TianYau CICY for m 4.
The number rational curves on a general TianYau CICY is nite only for
degrees 1 and 2. An explicit enumeration of rational curves of the various
bidegrees shows that these numbers are in agreement with the numbers
worked out by S. Hosono, A. Klemm, S. Theisen, S.-T. Yau ([11]) and by
V.V. Batyrev and D. van Straten ([1]) using the Picard-Fuchs equation.
The number of nonsingular rational curves of degree 3 is nite, but there
are also two 1-dimensional families of singular rational curves on a general
TianYau CICY. We give an algebro-geometric denition of the equivalence
of a 1-dimensional family of rational curves, and apply this denition to the
two 1-dimensional families of degree 3 curves on a general TianYau CICY.
The author would like to thank Ragni Piene for numerous discussions and
encouragements. Moreover, the author would like to thank Sheldon Katz
and Duco van Straten for sharing their insights concerning equivalences
of families of rational curves. A previous version of this work was part
of the authors doctoral dissertation ([22]), written with support from the
Norwegian Research Council.
2. Preliminaries.
In this section we study the geometry and the rational curves of the variety
G = Z(
ij
x
i
y
j
). We start this section with two lemmas.
418 DAG EINAR SOMMERVOLL
Lemma 2.1. Set G = Z(
ij
x
i
y
j
) P
3
1
P
3
2
and let L be a line in P
3
2
(resp. P
3
1
). Then there exists a unique maximal linear space V (L) in P
3
1
(resp. P
3
2
) such that V (L) L (resp. L V (L)) is contained in G.
Proof. We prove the assertion in the case where the line L is in P
3
2
. Assume
L is dened by Z(y
2
, y
3
) P
3
2
. Set
G = G|
P
3
1
L
, then
G is dened in P
3
1
L
by the following equation:
i0
x
i
y
0
+
i1
x
i
y
1
= 0.
Obviously
V (L) = Z
i0
x
i
,
i1
x
i
ij
x
i
y
j
). The matrix [
ij
] is invertible if and
only if dimV (L) = 1 for all L P
3
i
, i = 1, 2.
Proof. Assume that [
ij
] is invertible. Introduce the notation xAy
t
=
ij
x
i
y
j
, where x = (x
0
, . . . , x
3
), y = (y
0
, . . . , y
3
), and
A =
00
. . .
03
.
.
.
.
.
.
30
. . .
33
.
We have to prove that for every line L in P
3
1
, V (L) is of minimal dimension.
Consider rst the special case where L = Z(y
2
, y
3
) P
3
2
. This gives
V (L) = Z
i1
x
i
,
i0
x
i
L P
3
1
P
3
2
.
Assume that V (L) is not of minimal dimension, i.e., dimZ(
i1
x
i
,
i0
x
i
)
2. This implies that
i1
x
i
=
i0
x
i
, giving
i1
=
i0
. In other
words, the rst two columns are proportional, which contradicts that A is
of maximal rank.
The nal step is reducing the general situation to the special case con-
sidered above. This is done in the following way: Choose any line L in
P
3
2
. It is possible to change the coordinates on the second factor, such
that L is equal to Z(y
2
, y
3
). Call this coordinate change matrix P (i.e.,
CURVES ON A CALABI-YAU VARIETY 419
(y
1
, . . . , y
3
)
t
= P (y
1
, . . . , y
3
)
t
). We make the following change of coordi-
nates on the rst factor:
x
t
= (A
1
)
t
P
t
A
t
x
t
.
This gives G = Z(
ij
x
i
y
j
) with respect to the new coordinates, since
ij
x
i
y
j
= xAy
t
= (x
APA
1
)A(P
1
y
t
) = x
Ay
t
=
ij
x
i
y
j
.
The result now follows from the special case considered above.
We also observe that dimV (L) = 1 for all L, implies that all columns
must be linearly independent (by considering the lines Z(y
2
, y
3
), Z(y
0
, y
3
),
and Z(y
0
, y
2
)). This gives the implication the other way.
When G is as in the previous proposition, we have a map:
l : Grass (1, P
3
) Grass (1, P
3
)
dened by sending L to V (L). This map is obviously bijective, since l(l(L))=
L by denition. In this case, we shall write l(L) for V (L) to signify that it
is a line. Moreover, note that then the dening equation can be brought to
diagonal form
3
i=0
x
i
y
i
by a suitable change of coordinates on P
3
1
and P
3
2
.
Proposition 2.4. Let G = Z(
ij
x
i
y
j
), where the matrix [
ij
] is invert-
ible, and set
G = G|
P
3
L
. Then
G is isomorphic to the blow-up of P
3
1
in
l(L).
Proof. We can, without loss of generality, assume that
G is dened by
Z(x
1
y
2
x
2
y
1
) P
3
(x
0
, . . . , x
3
) P
1
(y
1
, y
2
)
(by change of coordinates). Then l(L) is dened by x
1
= x
2
= 0. It is
enough to check the statement locally, take for instance x
0
= 1. Then we
have
Z(x
1
y
2
x
2
y
1
) A
3
P
1
.
This is in fact the blow-up of A
3
with center Z(x
1
, x
2
) ([10], II.7.12.1, p.
163).
Corollary 2.5. Let G = Z(
ij
x
i
y
j
), where the matrix [
ij
] is invertible,
and let
G = G|
HL
, where H is a hyperplane and L is a line. Let :
G
P
3
1
denote the blow-up of P
3
1
in l(L). Then
G is isomorphic to
1
(H). If
l(L) H, then
G is isomorphic to H blown up in the point H l(L).
We end this section by a proposition that we will use extensively in the
following sections. For its formulation we need a denition.
Denition 2.6. A rational curve in P
3
1
P
3
2
is of type ( m, n) if the image
of the rst (resp. second) projection is of degree m (resp. n).
420 DAG EINAR SOMMERVOLL
Proposition 2.7. Let L be a line in P
3
2
, and let C
1
be a rational curve
of degree m in P
3
1
. Furthermore, let G = Z(
ij
x
i
y
j
), where the matrix
[
ij
] is invertible, and denote G|
P
3
1
L
by
G. Let C be the unique irreducible
component of D = C
1
L
G such that
1
(C) = C
1
, where
1
is the
projection map on the rst factor. Let i = lg(C
1
l(L)). Then C is a
rational curve of bidegree (m, mi) and of type ( m,
ij
x
i
y
j
), where the matrix [
ij
] is in-
vertible. Then there are no nonsingular rational curves of bidegree (m, 0),
m 3, on G.
Proof. Assume for contradiction that C is a nonsingular rational curve of
bidegree (m, 0), m 3 on G, i.e., C = C
1
{p}, where C
1
is a nonsingular
rational curve in P
3
1
and p is a point in P
3
2
. Fix a line L in P
3
2
passing through
p. By the proof Proposition 2.7 l(L) has to be an m-secant to the curve C
1
.
This is impossible since a nonsingular rational curve of degree m has at most
an (m1) secant for m 3.
3. The number of rational curves of degree less than 4 on a
general TianYau CICY.
In this section we compute the number of nonsingular rational curves on a
general TianYau CICY for degrees less than 4. In the end of this section
we describe two 1-dimensional families of singular rational curves of degree
3.
Proposition 3.1. The numbers N
i,j
of nonsingular rational curves of bide-
gree (i, j) on a general TianYau variety are nite for i + j 3 and are
given by:
N
0,1
= N
1,0
= 81
N
0,2
= N
2,0
= 81
N
1,1
= 729
N
1,2
= N
2,1
= 2187
N
3,0
= N
0,3
= 0.
CURVES ON A CALABI-YAU VARIETY 421
Proof. All curves of bidegree (1, 0) have to be of the form L {b} where L
is a line on F
1
and b is a point on F
2
. The condition that this (1, 0) also
should be contained in G, gives three possible values for the point b given a
line L on F
1
. Since F
1
has 27 lines, the number of (1, 0) curves is 3 27 = 81.
A (2, 0) curve is of the form C
1
{b} H {b}, where C is a conic in a
hyperplane H in P
3
1
. It follows that HF
1
= C
1
L, since F
1
is of degree 3.
Hence, the number of (2, 0) curves has to be equal to the number of (1, 0)
curves, which is 81. The intersection L
L G is obviously an irreducible
rational curve of bidegree (1, 1), and every rational curve of bidegree (1, 1)
has to arise in this way. Hence, the number of bidegree (1, 1) curves is equal
to the number of pairs (L
, L) where L
F
1
and L F
2
. The number of
such pairs is 27 27 = 729.
We now want to nd the number of rational curves of bidegree (2, 1).
Denote the lines on F
i
by L
k
i
where k {1, . . . , 27}. In the generic situation
l(L
k
2
) L
k
1
= for all k, k
i
x
i
y
i
). The curve C is of the form C
1
{p}, where C
1
is on F
1
and
p = (p
0
, p
1
, p
2
, p
3
) is a point on F
2
. Furthermore, C
1
H
p
= Z(
i
p
i
x
i
).
By Bezouts theorem, F
1
H
p
is of degree 3, so C
1
= F
1
H
p
. Finally, that
422 DAG EINAR SOMMERVOLL
F
1
H
p
is a singular cubic implies that H
p
is a tangent hyperplane, i.e.,
H
p
F
P
3
1
1
F
2
} P
3
1
P
3
2
= P
3
2
P
3
2
hence with F
1
F
2
, where denotes the diagonal. This set is isomorphic
to F
1
F
2
P
3
2
.
Hence, we can represent the complete family of bidegree (3, 0) rational
curves as a curve in P
3
. We denote this curve by .
Since deg F
1
= 12, deg = 3 12 = 36. Note that is a (local) complete
intersection, so the dualising sheaf is given by:
=
2
N
/P
3
P
3|
.
Using N
|
P
3
= O
(d
1
) O
(d
2
), where d
1
= 12 and d
2
= 3 are the degrees
of F
1
and F
2
respectively, we get
deg
= 2p
a
2 = (d
1
+d
2
4)d
1
d
2
= 396
so that the arithmetic genus, p
a
, of is 199.
Finally, we want to determine the singularities of the curve . The dual
surface F
1
has a double curve of degree
1
2
d(d1)(d2)(d
3
d
2
d12) = 27
(see e.g. [18]), where d = deg F
1
= 3. The nodes on are precisely the
intersection points between F
2
and the double curve on F
1
. Let denote
the number of nodes. Then
= 3 27 = 81.
The surface F
1
also has a cuspidal edge of degree 4d(d1)(d2) = 24 ([18]).
The cusps on are the intersection points between F
2
and this curve. Let
denote the number of cusps, then
= 3 24 = 72.
Hence the geometric genus, p
g
, of is given by p
g
= p
a
= 46.
In the end of this section we will give a denition of the equivalence, that
apply to our two 1-dimensional families. We start out by reviewing the
denition given by S. Katz of the equivalence of a family of rational curves
([14]). Let X be a CalabiYau threefold and let H be a k-dimensional
nonsingular family of nonsingular rational curves on X. Consider
D H X
H
where D is the total space of the family and is the projection on the
rst factor. Let N
D/HX
denote the normal bundle of D in H X. The
CURVES ON A CALABI-YAU VARIETY 423
equivalence, e(H), of the family H is then dened to be:
e(H) = deg c
k
(R
1
N
D/HX
).
We are interested in determining the equivalence of a family of mildly sin-
gular curves, say local complete intersection curves with at most node and
cusp singularities. Furthermore, we shall allow the base space of the family
to have the same kind of mild singularities. Because of the singularities, we
can not apply Katz notion of equivalence directly. It is easy to see that
R
1
N
D/HX
is not a vector bundle in general.
Let C be a local complete intersection curve in X. Let I be its dening
ideal. The sheaf I/I
2
is locally free, hence so is the normal sheaf N
C/X
=
(I/I
2
)
2
N
C/X
X
|
C
=
C
where
C
is the dualising sheaf. Since X is CalabiYau, this gives:
2
N
C/X
=
C
.
Furthermore, since N
C/X
is a rank 2 bundle, we have the following perfect
pairing N
C/X
= N
C/X
2
N
C/X
. Hence
N
C/X
= N
C/X
C
.
By Serre Duality we get:
(5) H
1
(N
C/X
)
= H
11
(N
C/X
C
)
= H
0
(N
C/X
)
.
Consider a family : D H of local complete intersection curves with at
most cusps and nodes on X. The relative version of the isomorphism (5) is
(6) R
1
N
D/HX
= Hom(
N
D/HX
, O
H
).
Assuming that (6) holds and that the Kodaira-Spencer map
1
H
N
D/HX
is an isomorphism (e.g., if H is a component of the Hilbert scheme), we
obtain Kodaira-Spencer
R
1
N
D/HX
= Hom(
N
D/HX
, O
H
)
= Hom(
1
H
, O
H
) =
1
H
.
In the cases we are interested in, R
1
N
D/HX
is not necessarily isomorphic
to
1
H
, and
1
H
is not isomorphic to
1
.
We have the following exact sequence of sheaves on
:
(7)
1
e
1
e
/
0.
Let denote the image of
in
1
e
, i.e.,
(8) 0
1
e
1
e
/
0.
The sheaf can be considered as a modication of
1
in the following
way. Consider the following commutive diagram:
1
e
1
e
.
We dene the equivalence e() of as the (degree of the) rst Chern class
of the image of
in
1
e
. Note that
is surjective and
1
e
=
1
e
) deg
1
e
/
(9)
= 2p
g
(
) 2 #cusps.
Since the geometric genus is 46 and the number of cusps is 72, we get,
e() = 2 46 2 72 = 18.
Remark 3.4. This number is equal to the predicted number of curves cal-
culated by S. Hosono, A. Klemm, S. Theisen and S.-T. Yau ([11], p. 521)
and by D. van Straten and V. V. Batyrev ([1]). A general hypersurface of
bidegree (3, 3) in P
2
P
2
also has a 1-dimensional family of singular rational
curves of bidegree (3, 0). Applying the above denition of equivalence to
this family gives the number 162. This agrees with the conjectured number
in [11].
CURVES ON A CALABI-YAU VARIETY 425
4. Rational Curves of higher degree on a general TianYau
CICY.
In this section we study rational curves of degree higher than 3 on a general
TianYau CICY. We consider certain linear systems on P
2
, and use them to
prove the existence of positive dimensional families of curves of every degree
greater than 3 on a general TianYau CICY. In the rst part of the section
we rene this study, and give results concerning existence of rational curves
of various bidegrees.
We want to give a constructive proof of Theorem 1.1, and we start out
by two preliminary lemmas.
Lemma 4.1. Fix a point p in P
2
and let d 3. The linear system of curves
of degree d, with a point of order (d 1) at p, is of dimension 2d, and a
generic member is an irreducible rational curve.
Proof. The dimension of the linear system of curves of degree d is
d+2
2
1.
The condition that a curve has a given point p as a multiple point of order
(d 1), is equivalent to the vanishing of the (d 1) rst partial derivatives
at p. This gives 1 + +(d 1) conditions on the coecients, and the rst
statement follows. To prove prove the second statement it suces to show
that there exists an irreducible rational curve in the linear system. One can
construct one in the following way: Let
f : P
1
P
d
,
f(u, v) = (u
d
, u
d1
v, . . . , v
d
).
Let C = f(P
1
) P
d
. Choose d 1 points on C. These = points span a
linear subspace L of dimension (d 2). Let L
L be a linear subspace
with the following properties: dim L
= d 3 and L
= C = , and let
: P
d
P
2
be the projection from the linear subspace L
. Then
C = (C)
is a curve with the desired properties.
Lemma 4.2. Let F be a nonsingular cubic surface in P
3
. For every natural
number m 3, there exists a 2-dimensional family of nonsingular rational
curves of degree m on F.
Proof. The hypersurface F is isomorphic to P
2
blown up in six points p
0
, . . . ,
p
5
. Consider the linear system
0
of curves of degree d 3, and with a
multiple point of order (d 1) at p
0
in P
2
. We denote a generic curve
of
0
by C
0
. The strict transform of C
0
is a rational curve C
1
of degree
2d + 1. Since the dimensions of the linear systems considered down on
P
2
is at least 6 by the preceding lemma, the statement is proved for odd
degrees 7, 9, 11, . . . . For even degrees we take a sublinear system
1
of
0
,
by demanding the curve to pass through p
1
once. The strict transform of
a generic curve is a rational curve of degree 2d. The dimensions of these
426 DAG EINAR SOMMERVOLL
families of curves are at least 5. In the same manner we can take curves that
in addition to the requirements above also pass though p
2
and so on. In each
case the dimension drops by no more than one. Hence, we have inclusions
0
1
. . .
t
. . .
5
. (The linear system
t
constist of curves
of degree d passing through the points p
1
, . . . , p
t
.) This gives the desired
results for the remaining degrees 3, 4, 5. In the case m = 3 (corresponding
to d = 3 and t = 4) the dimension is at least equal to 2.
Now we give a constructive proof of Theorem 1.1.
Proof of the theorem. Let X = F
1
F
2
G be a general TianYau CICY, and
let L be one of the 27 lines on F
2
. Let q
1
, q
2
, q
3
be the points of intersection
of l(L) and F
1
. Furthermore, x a blowing down of the exceptional divisors
: F
1
P
2
, and let q
i
= (q
i
) for i = 1, 2, 3. We shall use the linear
systems of curves in P
2
considered in Lemma 4.1 and in Lemma 4.2.
Consider rst m 3 and i = 0. By Lemma 4.2 we have linear systems
t
with t base points p
1
, . . . , p
t
. A general member of this linear system
does not pass through any of the q
i
, i.e., it gives rise to a rational curve
of bidegree (m, m) on X by Proposition 2.7. Since these linear systems
are all positive dimensional, we get positive dimensional families of bidegree
(m, m), for m > 2, on X.
In order to prove the statement in the case m 3 and i = 1, we take
sublinear systems
t
1
of the
t
considered above, by assigning the basepoint
q
1
. The dimension of
t
1
is dim
t
1. Lemma 4.2 then gives dim
t
1
1,
and the result follows.
For i = 2 we take sublinear systems of
t
1
, by assigning q
2
as an additional
base point. By the same reasoning as above this gives positive dimensional
families, using Prop. 2.7, Lemma 4.1 and Lemma 4.2 for m > 3.
Finally, the case i = 3 is treated analogously by considering sublinear sys-
tems of
t
by assigning q
1
, q
2
, q
3
as base points. Using Prop. 2.7, Lemma 4.1
and Lemma 4.2 we obtain positive dimensional families of bidegree (m, m3)
curves for m > 3.
In the proof of Proposition 3.1 we gave all rational curves of bidegree
(2, 1). They were realised as degenerations of a 1-dimensional family of
bidegree (2, 2) rational curves of type (
2,
2,
1).)
Theorem 4.3. A general TianYau CICY contains no nonsingular rational
curves of bidegree (m, mi) and of type ( m,
1), for m i 4.
Proof. An i-secant of a curve when i 4, has to be contained in F
1
, by
Bezouts theorem. In other words, it has to be one of the 27 lines, but for
a general TianYau CICY, none of the 27 l(L)s are among the 27 lines on
F
1
.
CURVES ON A CALABI-YAU VARIETY 427
Remark 4.4. Note that Theorem 1.3 follows directly from the proof of
Theorem 4.3.
The last theorem states the non-existence of rational curves of certain
bidegrees on a general TianYau CICY. However, there may exist nongeneral
TianYau CICYs with rational curves of these bidegrees.
Proposition 4.5. Let d 3 and let t {0, 1, 2, 3, 4, 5}. There exist vari-
eties of TianYau CICY type with positive dimensional families of nonsin-
gular rational curves of bidegree (2d + 1 t, d + 2 t).
Proof. In Lemma 4.1 and Lemma 4.2 we constructed the linear systems of
curves
t
. The strict transforms of these curves have a (d 1)-secant, E
0
,
the exceptional divisor corresponding to p
0
. Furthermore, the degree of the
strict transform of a general member of
t
is 2d+1t. If F
i
is a nonsingular
cubic surface in P
3
i
, denote by L
k
i
, k {1, . . . , 27}, the 27 lines on F
i
. Now,
choose a pair of cubic surfaces F
1
, F
2
and a G = Z(
ij
x
i
y
j
), such that
the matrix [
ij
] is invertible, and with the property that there exists a pair
of lines L
j
1
and L
n
2
such that l(L
n
2
) = L
j
1
. Applying Proposition 2.7 gives the
desired result.
Remark 4.6. All of these curves are of type ( m,
LG, where L
F
1
and
L F
2
, by Proposition 2.7. We have 729 pairs of lines to consider. Com-
putation gives 567 irreducible intersections, hence 567 (1,1) curves. The
number of rational curves of bidegree (2, 1) is a little more complicated to
obtain. First, notice that there is no rational curves of bidegree (2, 1) and
of type (
1,
, where L
is a line in F
1
. Conversely, every line L
= Z(x
0
+ x
3
= x
1
+ x
3
). The pencil is
then given by: H
a
= Z(ax
0
+
2
x
1
+ x
2
+ aX
3
) (where we allow a = ).
Let C
a
be dened by: H
a
F
1
= C
a
L.
Demanding a point in l(L) F
1
to be in H
a
, gives the following condition
on a: a(1
j+l
) =
l
i2
. There are 27 cases to consider: i, j, l may take
values in {0, 1, 2}. A case by case study gives that only 12 of these give a
rational curve of bidegree (2, 1), i.e., where l(L) intersects an irreducible C
a
once. This accounts for 9 lines on F
1
(9 dierent pairs (i, j)). A similar study
of the remaining 18 lines, gives 24 rational curves of the desired bidegree
(out of 54 candidates). Hence the number of rational curves of bidegree
(2, 1) is 36. Note that all these curves are mapped to L F
2
by the second
projection. By symmetry the total number of rational curves of bidegree
(2, 1) is 27 36 = 972.
Corollary 5.2. Let N
d
denote the number of nonsingular rational curves
of degree d on a general TianYau variety. Then
N
1
= 162, N
2
= 729, N
3
= 1944.
Remark 5.3. The numbers of curves of bidegree (0, 1), (1, 0) and (1, 1) have
been calculated previously, using similar techniques ([5], [19]).
CURVES ON A CALABI-YAU VARIETY 429
Corollary 5.4. There exist positive dimensional families of rational curves
on the special TianYau CICY for every degree n, n 4.
Proof. This is a corollary of the proof of Corollary 1.2. The construction
of curves relied on the use of Proposition 2.7, i.e. on the fact that G =
Z(
ij
ij
x
i
y
j
), where the matrix [
ij
] is invertible, which is clearly satised
in this case. The last necessary ingredient in the proof is that none of the
l(L), where L is one of the 27 lines of F
2
, are tangent to the surface F
2
.
This is easily checked for the special TianYau CICY. The rest of the proof
is identical to the proof of Theorem 1.1.
Remark 5.5. By comparing Corollary 5.2 with Corollary 3.2, we see that
the number of curves of degree 2 and 3 on the special TianYau variety are
dierent from the corresponding numbers for the general TianYau variety.
For example for degree 2, the dierence stems from the number of (1, 1)
curves. The dierence between the numbers of (1, 1) curves on the general
and the special is 729 567 = 162. This dierence is explained by the
fact that we have 162 reducible intersections of type L
1
L
2
G (where
L
i
F
i
) on the special TianYau variety, and none on the general Tian
Yau variety. A reducible intersection L
1
L
2
G gives one (1, 0) curve and
one (0, 1) curve. It is easy to see that every rational curve of bidegree (1, 0)
is contained in precisely two distinct intersections of the type L
1
L
2
G.
Hence, we get 81 (1, 0) curves. By Proposition 5.1 we know that these are
in fact the only ones.
References
[1] V.V. Batyrev, D. van Straten, private communication.
[2] P. Candelas, A.M. Dale, C.A. L utken and R. Schimmrigk, Complete intersection
CalabiYau manifolds, Nucl. Phys., B298 (1988), 493-525.
[3] P. Candelas, C.A. L utken and R. Schimmrigk, Complete intersection CalabiYau
manifolds II: Three generation manifolds, Nucl. Phys., B306 (1987), 113-136.
[4] P. Candelas, X. C. de la Ossa, P. S. Green, L. Parkes, A pair of CalabiYau manifolds
as an exactly soluble superconformal theory, Nucl. Phys., B359 (1991), 21-74.
[5] J. Distler, B. Greene, K. Kirklin and P. Miron, Evaluation of 27
3
Yukawa couplings
in a three generation superstring model, Phys. Lett., 195B (1987), 41-57.
[6] G. Ellingsrud and S.A. Strmme, The number of twisted cubic curves on the general
quintic threefold, Math. Scand., 76 (1995), 5-34.
[7] A. Givental, Equivariant Gromov-Witten invariants, Int. Math. Res. Notices, 13
(1996), 613-663.
[8] P.S. Green and T. H ubsch, CalabiYau manifolds as complete intersections in prod-
ucts of complex projective spaces, Comm. Math. Phys., 109 (1987), 99-108.
[9] P.S. Green, T. H ubsch and C.A. L utken, All the Hogde numbers of all CalabiYau
complete intersections, Class. Quantum Gravity, 6 (1989), 105-124.
430 DAG EINAR SOMMERVOLL
[10] R. Hartshorne, Algebraic Geometry, GTM, 52, Springer-Verlag, 1977.
[11] S. Hosono, A. Klemm, S. Theisen and S.-T. Yau, Mirror symmetry, mirror map
and applications to CalabiYau complete intersections, Nucl. Phys., B 433 (1995),
501-552.
[12] T. Johnsen and S.L. Kleiman, Rational curves of degree at most 9 on a general quintic
threefold, Comm. Algebra, 24(8) (1996), 2721-2753.
[13] S. Katz, On niteness of rational curves on quintic threefolds, Compositio Math., 60
(1986), 151-162.
[14] , Enumerative geometry of curves on CalabiYau threefolds, ITEG-Workshop
on Enumerative Geometry, Dyrkolbotn, Norway, August 1993.
[15] M. Kontsevich, Enumeration of rational curves via torus actions, Progr. in Math.,
129, (1995), 335-368. Birkhauser, 1995.
[16] B.H. Lian, K. Liu and S.-T. Yau, Mirror Principle I, Asian J. of Math, 1(4) (1997),
729-763.
[17] R. Pandharipande, Rational curves on hypersurfaces (after A. Givental), Seminaire
Bourbaki, Exposes, (1997/1998), 835-849.
[18] R. Piene, Some formulas for a surface in P
3
, Algebraic Geometry, Troms, 1977 (ed.
L. Olson), LNM 687, Springer Verlag, (1978), 196-235.
[19] D.E. Sommervoll, Tredimensjonale CalabiYau mangfoldigheter, Hovedfagsoppgave,
University of Oslo, 1989.
[20] , Rational curves of low degree on a complete intersection CalabiYau threefold
in P
3
1
P
3
2
, Preprint Series, Univ. of Oslo, 11 (1993).
[21] , Rational curves of high degree on a complete intersection CalabiYau three-
fold in P
3
1
P
3
2
, Preprint Series, Univ. of Oslo, 2 (1994).
[22] , Rational curves on complete intersection CalabiYau threefolds, Ph. D. thesis,
University of Oslo, 1996.
[23] G. Tian and S.-T. Yau, Three-dimensional algebraic manifolds with c
1
= 0 and
E
= 6, Mathematical Aspects of String Theory (S.-T. Yau ed.), World Scientic,
Singapore, 1987, 543-559.
[24] S.-T. Yau, Compact three dimensional Kahler manifolds with zero Ricci curvature,
Symp. on Anomalies, Geometry, Topology, (W.A. Bardeen et al eds.), World Scien-
tic, Singapore, 1985, 395-406.
Received December 2, 1997.
Forskningsavdelingen
SSB P.b. 8131 Dep N-0033 Oslo
Norway
E-mail address: dag@ssb.no
PACIFIC JOURNAL OF MATHEMATICS
Vol. 192, No. 2, 2000
GENERALIZED CARTAN TYPE S LIE ALGEBRAS IN
CHARACTERISTIC 0 (II)
Kaiming Zhao
In this paper, we introduce a class of Lie algebras which
are subalgebras of generalized Cartan type S Lie algebras of
characteristic 0. We determine the necessary and sucient
conditions for such Lie algebras to be simple. And we give all
derivations of such simple Lie algebras.
1. Introduction.
This paper is a sequel to the paper [7] in which generalized Cartan type S
Lie algebras t
z
S(A, T, ) over a eld F of characteristic 0 were studied. We
have tried to make this paper independent of other papers. So in Section 2,
we give a description of relevant Lie algebras and some basic facts which will
be used in this paper. In Section 3 we introduce a class of Lie algebras which
are subalgebras of generalized Cartan type S Lie algebras, and determine
the necessary and sucient conditions for such Lie algebras to be simple.
We give all derivations of such simple Lie algebras in Section 4.
This research was carried out during the authors visit to University of
Wisconsin-Madison. He wishes to thank Professor J.M. Osborn for his hos-
pitality and helpful discussions.
2. Notations and related Lie algebras.
In this section, for the convenience of the reader, we recall the relevant Lie
algebra denitions and some basic facts which will be used later in this
paper. Throughout this paper we assume that F is a eld of characteristic
0, and that A is a nonzero abelian group written additively.
2.1 Generalized Witt algebras.
Let n be a positive integer, and t
1
, . . . , t
n
independent and commuting
indeterminates over F. Denote by P
n
and Q
n
the polynomial algebra
F[t
1
, . . . , t
n
], and the Laurent polynomial algebra F[t
1
1
, . . . , t
1
n
] respec-
tively. By W
n
= W
n
(F) we denote the Witt algebra, i.e., the Lie algebra of
431
432 KAIMING ZHAO
all formal vector elds
(2.1)
n
i=1
f
i
t
i
with coecients f
i
Q
n
. The bracket in W
n
is
_
f
t
i
, g
t
j
_
= f
(g)
t
i
t
j
g
(f)
t
j
t
i
,
where f, g Q
n
, and i, j 1, 2, . . . , n. The subalgebra W
+
n
= W
+
n
(F)
of W
n
consisting of all vector elds (2.1) with polynomial coecients, i.e.,
f
i
P
n
, is known as the general Lie algebra, or the Lie algebra of Cartan
type W. For more details, please refer to [13]. It is well known that W
n
and
W
+
n
are simple Lie algebras.
Let T be a vector space over F. We denote by FA the group algebra of
A over F. The elements t
x
, x A, form a basis of this algebra, and the
multiplication is dened by t
x
t
y
= t
x+y
. We shall write 1 instead of t
0
.
The tensor product W = FA
F
T is a free left FA-module in the natural
way. We denote an arbitrary element of T by (to remind us of dierential
operators). For the sake of simplicity, we shall write t
x
instead of t
x
.
We now choose a pairing : T A F which is F-linear in the rst
variable and additive in the second one. For convenience we shall also use
the following notations:
(, x) = , x) = (x)
for arbitrary T and x A. W becomes a Lie algebra under the following
bracket:
(2.2) [t
x
1
, t
y
2
] := t
x+y
(
1
(y)
2
2
(x)
1
),
for arbitrary x, y A and
1
,
2
T. We refer to this algebra W =
W(A, T, ) as a generalized Witt algebra.
The subspaces W
x
= t
x
T , x A, dene an A-gradation of W, i.e., W is
the direct sum of the W
x
s, and [W
x
, W
y
] W
x+y
for all x, y A.
It follows from (2.2) that ad() acts on W
x
as a scalar (x). Hence
each T is ad-semisimple, and T is a torus (i.e., an abelian subalgebra
consisting of ad-semisimple elements). In fact T is the only maximal torus
of W (see [3, Lemma 4.1]). Kawamoto proved in [11] that the Lie algebra
W = W(A, T, ) is simple if and only if A ,= 0 and is nondegenerate in
the sense that the conditions
(2.3) , x) = 0, T x = 0
and
(2.4) , x) = 0, x A = 0
hold.
GENERALIZED CARTAN LIE ALGEBRAS 433
Note that (2.3) implies that A is torsion free. This implies that FA is an
integral domain and it implies that the invertible elements of FA have the
form at
x
, where a F
, x A.
There is a natural structure of a left W-module on FA, namely the struc-
ture is such that
(2.5) t
x
t
y
= (y)t
x+y
for x, y A and T. Also we have the natural left FA-module structure
on W. These two module structures are related by the identity
(2.6) [fu, gv] = f(u g)v g(v f)u +fg[u, v]
where f, g FA and u, v W are arbitrary. The W-module structure on
FA gives rise to a homomorphism
(2.7) W Der (FA)
because each w W acts on FA as a derivation. Clearly (2.7) is also a
homomorphism of FA-modules. For more details about W(A, T, ), please
refer to [3].
2.2 Generalized Cartan type W Lie algebras.
Suppose that W = W(A, T, ) denotes a simple generalized Witt algebra.
Let I be an index set, d : I T an injective map, and write d
i
= d(i) for
i I. We say that d is admissible if the following two conditions hold:
(Ind) d
i
, i I, are linearly independent;
(Int) d
i
(A) = Z for all i I.
We assume throughout that an admissible d has been xed. We set
A
+
d
= x A : d
i
(x) 0, i I,
A
0
d
= x A : d
i
(x) = 0, i I,
A
d,i
= x A : d
i
(x) = 1; d
j
(x) 0, j Ii,
A
#
d,i
= x A : d
i
(x) = 1; d
j
(x) = 0, j Ii,
A
d
= A
+
d
(
iI
A
d,i
) .
We now introduce some subalgebras of W:
W
+
d
=
xA
+
d
W
x
;
W
d,i
=
_
_
xA
d,i
Ft
x
_
_
d
i
, i I;
434 KAIMING ZHAO
and
W
d
= W
d
(A, T, ) = W
+
d
+
iI
W
d,i
.
We also introduce the subalgebra FA
+
d
of FA, which is the span of all
elements t
x
with x A
+
d
. Since W is a left FA-module, we can view W also
as a left FA
+
d
-module. Then it is easy to see that the subspaces W
+
d
and
W
d
are FA
+
d
-submodules of W.
By restricting the action of W on FA, we can view FA as a left W
d
-
module, and then FA
+
d
is a W
d
-submodule of FA. When d is xed, and
there is no danger of confusion, we shall write
A
+
, A
i
, A
#
i
, W
+
, W
i
, FA
+
instead of
A
+
d
, A
d,i
, A
#
d,i
, W
+
d
, W
d,i
, FA
+
d
,
respectively. The following Theorem is proved in [5].
Theorem 2.1. The Lie algebra W
d
is simple if and only if the following
conditions hold:
(i) if T and (x) = 0 for all x A
d
, then = 0;
(ii) if x A
d
, then d
i
(x) = 0 for almost all i I;
(iii) A
#
i
,= for all i I.
The simple Lie algebra W
d
is called an algebra of generalized Cartan type
W. For more details on the Lie algebra W
d
, please refer to the papers [5]
and [12].
2.3 Generalized Cartan type S Lie algebras.
It is well known that the classical divergence Div:W
n
F[t
1
1
, . . . , t
1
n
]
maps
n
i=1
f
i
t
i
to
n
i=1
(f
i
)
t
i
. The kernel
S
n
of Div is a subalgebra of
W
n
. The algebra S
n
= (
S
n
)
and S
+
n
= S
n
W
+
n
are called Lie algebras of
Cartan type S.
Suppose that W = W(A, T, ) is a simple generalized Witt algebra. The
divergence div: W FA is the F-linear map such that
(2.8) div (t
x
) = (x)t
x
holds for all x A and T. It has the following two properties:
(2.9) div (fw) = fdiv (w) +w f
and
(2.10) div [u, v] = u div (v) v div (u)
GENERALIZED CARTAN LIE ALGEBRAS 435
where u, v, w W and f FA are arbitrary. The latter property shows
that div is a derivation of W with values in the W-module FA. Since
div : W FA is a derivation of degree 0, its kernel
S := ker (div) is a
homogeneous subalgebra of W:
(2.11)
S =
xA
S
x
,
S
x
:=
S W
x
.
For x A dene the F-linear function x : T F by x() = (x). The
condition (2.3) shows that, if T
sending x x is injective. If T
x
:= ker( x), then we have
S
x
= t
x
T
x
.
Hence
S
0
= W
0
= T and, for x ,= 0,
S
x
is a hyperplane of W
x
. In particular,
if dimT = 1, then
S = T. To avoid trivialities, we shall assume always that
dimT > 1.
Let
S := (
S)
S =
x=0
S
x
.
More generally, the subspaces t
z
S, z A, are subalgebras of W and their
derived algebras are given by
(2.12) (t
z
S)
= t
z
S =
x=z
t
x
T
xz
.
If dimT 3, all the subalgebras t
z
S are simple. If dimT = 2, then
S
itself is simple while the shifted algebras t
z
S , z ,= 0, are not. Their derived
algebras
(t
z
S)
x=z,2z
t
x
T
xz
, z ,= 0,
are simple.
We shall refer to the subalgebras S(A, T, , z) := t
z
S if dimT 3, and
S(A, T, , z) := (t
z
S)
xA\{z}
t
x
T
xz
, if dimT > 2,
xA\{z,2z}
t
x
T
xz
, if dimT = 2.
These algebras were studied in papers: [6] when dimT = 2 and z = 0, [4]
when dimT = 2 and z ,= 0, and [7] when dimT 3.
2.4 Generalized Block algebras.
We shall denote by Hom (A, F) the F-vector space of all additive (i.e., Z-
linear) maps A F. We now x an additive map : A F and a
skew-symmetric bi-additive map : AA F.
Let L = L(A, , ) be the vector space over F having a basis consisting
of all symbols e
x
, x A. We make L into a (non-associative) algebra over
436 KAIMING ZHAO
F by dening F-bilinear multiplication L L L by
(2.13) [e
x
, e
y
] = f(x, y)e
x+y
, x, y A,
where
(2.14) f(x, y) = (x, y) +(x y).
If = 0, then L is a Lie algebra. These Lie algebras were studied in [8]. It
was shown by Albert and Frank [1] that, under the assumption that ,= 0,
L is a Lie algebra if and only if there exists another additive map : A F
such that = , i.e.,
(2.15) (x, y) = (x)(y) (x)(y).
We shall assume throughout that such a exists, i.e. that L is a Lie algebra.
We also assume that
(2.16) K
= 0,
where K
A by
(2.18) A
= x A : (x) 1 ,
and denote by L
. It follows that L
is a subalgebra of L.
We shall denote by L
or L
/Z. The
Lie algebra L
(A, , ),
please refer to [4] and [8].
3. Lie algebra S
d
(A, T, , z).
Now we are ready to introduce our main object, the Lie algebra S
d
(A, T, , z).
We assume that a generalized Witt algebra W = W(A, T, ), a generalized
Cartan type W Lie algebra W
d
= W
d
(A, T, ) W, a generalized Car-
tan type S Lie algebra S(A, T, , z) W are given, and we also assume
throughout the paper that all of these algebras are simple. Our hypotheses
here imply that dimT 2. We dene
S
d
(A, T, , z) := W
d
S(A, T, , z).
GENERALIZED CARTAN LIE ALGEBRAS 437
Then S
d
= S
d
(A, T, , z) is a subalgebra of W. We shall also call the algebra
S
d
a generalized Cartan type S Lie algebra. It is clear that S
d
has an A-
gradation with the following components of degree x A:
(S
d
)
x
=
_
0, for x = z
(t
x
T
xz
) W
d
, for x A z
if dimT > 2,
(S
d
)
x
=
_
0, for x = z or x = 2z
(t
x
T
xz
) W
d
, for x A z, 2z
if dimT = 2. It follows that (S
d
)
x
= 0 for all x / A
d
.
For convenience, by U
1
) we denote the subspace of the vector space V
generated by U
1
V .
Theorem 3.1. Suppose that the Lie algebras W = W(A, T, ), W
d
=
W
d
(A, T, ) W, and S = S(A, T, , z) W are given, and that all of
them are simple. Then S
d
(A, T, , z) is simple if and only if the following
conditions hold:
(a) d
i
(z) = 1 for all i I;
(b) I is nite.
Proof. For simplicity we write L = S
d
(A, T, , z), so L
x
= (S
d
)
x
for x A.
() Suppose that L is simple. We shall rst show that (a) is true. For
contradiction we assume that there exists an element in I, say 1, such that
d
1
(z) ,= 1. If t
x
d
1
W
1
L (for denition of W
1
see Section 2.2), we know
that d
1
(x) = 1 and d
1
(x z) = 0, so 1 = d
1
(x) = d
1
(z) ,= 1. It is a
contradiction. Then W
1
L = 0, i.e., L
x
= 0 for all x A
1
. Theorem 2.1(iii)
assures that we can choose y A with y A
#
1
. So y A
+
d
. If z ,= y, then
L
y
= t
y
T
yz
,= 0. Let J =
x: d
1
(x)>1
L
x
. If z = y, we see that L
3y
,= 0,
let J =
x: d
1
(x)>3
L
x
. It is easy to verify that J is a nonzero proper ideal
of L in both cases, which contradicts the simplicity of L. Consequently (a)
holds.
On the other side we have z A
+
d
. From Theorem 2.1(ii) it follows that
I is nite. So (b) is true.
() Suppose (a) and (b) hold. We write I = 1, 2, , n. We know
that L
x
,= 0 if and only if x A
d
z. Fix u
1
, u
2
, , u
n
A such
that d
i
(u
j
) =
i,j
. Let A(d) = Zu
1
Zu
2
Zu
n
, and A
(d) = x
A[d
i
(x) = 0 i I. Then it follows from (b) that A = A(d) A
(d).
Case 1. Suppose that dimT = 2 and [I[ = 1, say I = 1. Choose d
2
T
such that T = Fd
1
Fd
2
. Denote e
x
= t
x
(d
1
(x z)d
2
d
2
(x z)d
1
) for
438 KAIMING ZHAO
x A
d
. It follows that e
x
[x A
d
z is a basis of L = S
d
(A, T, , z).
For any x, y A
d
we have
[e
x
, e
y
] = [t
x
(d
1
(x z)d
2
d
2
(x z)d
1
), t
y
(d
1
(y z)d
2
d
2
(y z)d
1
)]
= t
x+y
_
(d
1
(x z)d
2
(y) d
2
(x z)d
1
(y))
t
x
(d
1
(x z)d
2
d
2
(x z)d
1
)
(d
1
(y z)d
2
(x) d
2
(y z)d
1
(x))
t
y
(d
1
(y z)d
2
d
2
(y z)d
1
)
_
= t
x+y
_
d
1
(x z)d
2
(y z) d
2
(x z)d
1
(y z)
_
(d
1
(x +y z)d
2
d
2
(x +y z)d
1
)
=
d
1
(x z) d
1
(y z)
d
2
(x z) d
2
(y z)
e
x+y
.
Denote (x) = d
2
(x) d
2
(z)d
1
(x), (x) = d
1
(x) for x, y A. It is clear
that (A) = Z. By Section 2.4 we know that L L
(A, , ). Consequently
L is simple.
Case 2. Suppose that dimT = 2 and [I[ = 2, say I = 1, 2. Since
is nondegenerate, we get that A = A(d) = d
1
(A) d
2
(A) Z Z.
We may assume that d
i
(x) = x
i
for x = (x
1
, x
2
) Z Z. Thus A
d
=
_
(Z
+
1) (Z
+
1)
_
(1, 1). Same as Case 1, we dene e
x
, x A
d
.
Then we also have
[e
x
, e
y
] =
d
1
(x z) d
1
(y z)
d
2
(x z) d
2
(y z)
e
x+y
,
i.e.,
[e
x
, e
y
] =
x
1
+ 1 y
1
+ 1
x
2
+ 1 y
2
+ 1
e
x+y
, x, y A
d
.
It is well known that the Lie algebra S
+
2
W
+
2
has basis t
x
1
1
t
x
2
2
(t
2
t
1
t
1
t
2
)[(x
1
, x
2
) A
d
. It is easy to verify that the following linear map is an
isomorphism of Lie algebras:
S
d
S
+
2
, e
x
t
x
1
1
t
x
2
2
_
t
2
t
1
t
1
t
2
_
.
Thus S
d
is also simple in this case.
Case 3. Suppose that dimT > 2.
If [I[ = dimT = n, as we did in Case 2 we can deduce that L S
+
n
, the
special algebra of rank n. Thus in this case S
d
is simple. Next we assume
that dimT > [I[ = n > 0.
GENERALIZED CARTAN LIE ALGEBRAS 439
Let J be a nonzero ideal of L. It is suces to show that J = L. Choose
a nonzero element u J, say
(3.1) u =
m
i=1
t
x
i
i
, x
i
A
d
z,
i
T
x
i
z
with m is minimal. Then x
1
, , x
m
are distinct and
i
,= 0.
Claim 1. We have that m = 1.
Otherwise we suppose m > 1. Since T
z
L, from the minimality of m it
follows that
( x
1
)[
T
z
= ( x
2
)[
T
z
= = ( x
m
)[
T
z
.
Thus
(3.2) x
i
x
j
F z, i, j 1, 2, , m.
Subclaim. We can choose such an element u in (3.1) such that d
1
(x
1
) =
1.
Suppose d
1
(x
1
) 0. If
1
Fd
1
, let y
1
A
#
1
, then
u
= [t
y
1
d
1
, u] =
m
i=1
t
y
1
+x
i
(d
1
(x
i
)
i
i
(y
1
)d
1
) J 0
and d
1
(y
1
+ x
1
) < d
1
(x
1
). If
1
/ Fd
1
, then A
d,1
, ker(
1
), Otherwise we
can show that
1
(A
d
) = 0, it is impossible. We choose y
1
A
d,1
ker(
1
).
We deduce that
u
= [t
y
1
d
1
, u] =
m
i=1
t
y
1
+x
i
(d
1
(x
i
)
i
i
(y
1
)d
1
) J 0,
and also d
1
(y
1
+x
1
) < d
1
(x
1
). After nitely many steps of this kind process,
we get a nonzero element u in (3.1) such that d
1
(x
1
) = 1. Our subclaim
follows.
Without loss of generality we may assume that
1
= d
1
. From (3.2) it
follows that there exists
i
F
such that
x
i
=
i
z + x
1
, i 2, , m.
Since x
i
A
d
, we have d
1
(x
i
) 1, then
i
N, the set of natural
numbers. Thus d
1
(x
2
) 0. If
2
Fd
1
, then
[t
x
1
d
1
, t
x
2
2
] = t
x
1
+x
2
(d
1
(x
i
)
i
i
(y
1
)d
1
) ,= 0.
It follows that
u
= [t
x
1
d
1
, u] =
m
i=2
t
x
1
+x
i
(d
1
(x
i
)
i
i
(y
1
)d
1
) J 0.
440 KAIMING ZHAO
This contradicts the minimality of m. Consequently
2
/ Fd
1
. Choose z
1
A
d,1
ker(
2
), then [t
x
1
d
1
, t
z
1
d
1
] = 0 and [t
z
1
d
1
, t
x
2
2
] = t
z
1
+x
2
(d
1
(x
2
)
2
2
(z
1
)d
1
) ,= 0. Thus
u
= [t
z
1
d
1
, u] =
m
i=2
t
z
1
+x
i
(d
1
(x
i
)
i
i
(z
1
)d
1
) J 0.
It again contradicts the minimality of m. Therefore m = 1. Claim 1 follows.
Claim 2. We have T
z
J.
From Claim 1 we know that there exists a nonzero element t
x
0
J,
where x A
d
,
0
T
xz
.
Subcase 1. Suppose x A
#
d,i
for some i I, say x A
#
d,1
. Then
0
Fd
1
.
Thus t
x
d
1
J. For any T
xz
, from
[t
x
d
1
, t
x
] = (x)d
1
J,
and d
1
/ T
xz
, we know that d
1
, (x)d
1
[ T
xz
) = T. Hence
(x)d
1
[ T
xz
) = T
z
= T
z
. therefore T
z
J.
Subcase 2. Suppose x A
+
d
. Dene d(x) :=
n
i=1
d
i
(x). If d
1
(x) > 0,
choose y
1
A
#
d,1
. Since d
1
(x z) ,= 0 and
0
(x z) = 0, we know that d
1
and
0
are linearly independent. From the computation
[t
y
1
d
1
, t
x
0
] = t
x+y
1
(d
1
(x)
0
0
(y
1
)d
1
) ,= 0,
we get a nonzero element t
x+y
1
(d
1
(x)
0
0
(y
1
)d
1
) J with d(x + y
1
) =
d(x) 1. By repeatedly using this method, after nitely many steps we
deduce that there exists a nonzero element t
y
J with y A
0
d
. If A
#
d,i
ker() for all i I, we infer that (A
d
) = 0. It contradicts Theorem 2.1(i).
Thus there exists an i I such that A
#
d,i
, ker(), say A
#
d,1
, ker().
Choose z
1
A
#
d,1
ker(), then
[t
z
1
d
1
, t
y
] = t
z
1
+y
(z
1
)d
1
J 0.
By Subcase 1 we obtain again that T
z
J. Similarly T
z
J for x A
d,i
.
Thus Claim 2 is proved.
If x A
d
and x / F z, then T
z
, ker x. Choose T
z
ker x. From
[, t
x
] = (x)t
x
J we know that t
x
T
xz
J.
If y A
d
0 and y F z, it follows that y = kz for some positive
integer k. Since x[x A
#
d,1
, F z, we choose x
1
A
#
d,1
with x
1
/ F z.
Since y ,= z, then d
i
(y) > 0 for all i I. Note that t
yx
1
T
yx
1
z
J. Then,
for t
x
1
d
1
, t
yx
1
1
L, we have
[t
yx
1
1
, t
x
1
d
1
] = t
y
(d
1
(y x
1
)
1
1
(x)d
1
) J.
GENERALIZED CARTAN LIE ALGEBRAS 441
Because d
1
, d
1
(yx
1
)
1
1
(x)d
1
[
1
T
yx
1
z
) = T. Then d
1
(yx
1
)
1
1
(x)d
1
[
1
T
yx
1
z
) = T
yz
= T
z
, hence L
y
J. Therefore J = L. This
completes the proof of this theorem.
Note that z / A
d
if [I[ > 1, and z A
d
if [I[ = 1. The following corollary
follows directly from the above theorem.
Corollary 3.2. Suppose that S
d
= S
d
(A, T, , z) is simple. Then (S
d
)
x
,= 0
if and only if x A
d
z.
4. Derivations of S
d
(A, T, , z).
In this section we assume that the Lie algebra S
d
(A, T, , z) is simple, and
we shall determine all the derivations of S
d
(A, T, , z). From the proof of
Theorem 3.1 we know that:
(a) If I = , S
d
(A, T, , z) = S(A, T, , z), which was thoroughly studied
in [7];
(b) If [I[ = dimT = n, S
d
(A, T, , z) S
+
n
, which was studied in many
references, for example [12];
(c) If dimT = 2 and [I[ = 1, S
d
(A, T, , z) L
yA
D
y
,
where D
y
is a derivation of L of degree y, such that for each u L there
are only nitely many y A with D
y
(u) ,= 0.
First we construct some derivations of S
d
. For any additive function
: A F the linear map
D
(X) = (x)X, X (S
d
)
x
is a derivation of S
d
of degree 0.
The following Lemmas 4.2-4.5 will be useful in the sequel.
442 KAIMING ZHAO
Lemma 4.2. Let x, y A 0. Then T
x
= T
y
if and only if x F y.
This lemma is obvious.
Lemma 4.3. Let x
1
, x
2
A
d
z. If one of the following conditions holds:
(a) x
2
A
+
d
and x
1
A
d,i
for certain i I,
(b) x
1
, x
2
A
+
d
and T
x
1
z
,= T
x
2
z
,
then
(4.2) [L
x
1
, L
x
2
] = L
x
1
+x
2
.
Proof. Suppose (a) is satised. If d
i
(x
2
) > 0, then d
i
T
x
1
z
(T
x
2
T
x
2
z
),
hence Fd
i
+d
i
(x
2
) (x
1
)d
i
[ T
x
2
z
) = T. We deduce that d
i
(x
2
)
(x
1
)d
i
[ T
x
2
z
) = T
x
1
+x
2
z
. From
[t
x
1
d
i
, t
x
2
] = t
x
1
+x
2
(d
i
(x
2
) (x
1
)d
i
), T
x
2
z
,
we see that (4.2) follows.
If d
i
(x
2
) = 0, then x
1
+x
2
A
d,i
. If x
1
+x
2
= z further, it follows from
(S
d
)
z
= 0 that (4.2) follows. Suppose x
1
+ x
2
,= z, by Lemma 4.2 then
T
x
2
z
,= T
x
1
. Choose T
x
2
z
T
x
1
, so [t
x
1
d
i
, t
x
2
] = (x
1
)t
x
1
+x
2
d
i
.
Since (S
d
)
x
1
+x
2
= Ft
x
1
+x
2
d
i
, hence (4.2) follows again.
Suppose (b) is satised. We have
(4.3) [t
x
1
1
, t
x
2
2
] = t
x
1
+x
2
(
1
(x
2
)
2
2
(x
1
)
1
)
for all
1
T
x
1
z
,
2
T
x
2
z
.
If x
1
= 0, it follows from T
x
1
z
,= T
x
2
z
that x
2
/ F z. Then T
x
2
,= T
z
.
Choose
1
T
x
2
T
z
. Then (4.2) follows from (4.3). Now we assume that
x
1
, x
2
A
+
d
0.
We claim that T
x
1
z
,= T
x
2
or T
x
1
,= T
x
2
z
. Otherwise from T
x
1
z
= T
x
2
and T
x
1
= T
x
2
z
, by Lemma 4.2 we obtain
d
1
(x
2
)(x
1
z) = (d
1
(x
1
) + 1)x
2
, d
1
(x
1
)(x
2
z) = (d
1
(x
2
) + 1)x
1
.
Then x
1
= d
1
(x
1
)z and x
2
= d
1
(x
2
)z, it contradicts T
x
1
z
,= T
x
2
z
.
Hence our claim holds.
We assume that T
x
1
z
,= T
x
2
. If T
x
1
z
T
x
2
T
x
2
z
, then we deduce
that T
x
1
z
= T
x
2
, it is impossible. So T
x
1
z
(T
x
2
T
x
2
z
) ,= . Choose
1
T
x
1
z
(T
x
2
T
x
2
z
), then F
1
+
1
(x
2
)
2
2
(x
1
)
1
[
2
T
x
2
z
) = T.
Hence
1
(x
2
)
2
2
(x
1
)
1
[
2
T
x
2
z
) = T
x
1
+x
2
z
. From (4.3) it follows
that (4.2) holds.
Lemma 4.4. For a xed i I and a xed x
0
A
d
z with d
i
(x
0
) = 1,
the subspace
(S
d
)
x
0
+
xA
d
, d
i
(x)0
(S
d
)
x
generates S
d
as a Lie algebra.
GENERALIZED CARTAN LIE ALGEBRAS 443
Proof. Denote by M the subalgebra generated by the above subspace. We
shall show that (S
d
)
x
M for x A
d
by induction on k = d
i
(x). By
denition of M this is true for x A
d
with d
i
(x) < 1.
Claim 1. We can assume that x
0
A
0
d
.
If d
j
(x
0
) = 1 for one j I i, choose u
j
A
#
d,j
, by Lemma 4.3 we
deduce that [(S
d
)
x
0
, (S
d
)
u
j
] = (S
d
)
x
0
u
j
M. Note that d
j
(x
0
u
j
) = 0.
Then we may assume that x
0
A
+
d
.
If d
j
(x
0
) > 0 for some j I i, also using u
j
A
#
d,j
, by Lemma 4.3
we obtain that [(S
d
)
x
0
, (S
d
)
u
j
] = (S
d
)
x
0
+u
j
M. Note that d
j
(x
0
+ u
j
) =
d
j
(x
0
)1. Thus after nitely many such steps we may assume that x
0
A
0
d
.
This is our Claim 1.
Claim 2. There exists y
0
A
0
d
0 such that (S
d
)
x
0
, (S
d
)
x
0
+y
0
M.
Since 1 < [I[ < dimT we know that A
0
d
,= 0. Choose y
A
0
d
0. If
T
x
0
z
= T
y
z
, then x
0
z = 2y
2z, i.e., x
0
2y
+z = 0, we set y
0
= 2y
.
If T
x
0
z
,= T
y
z
we set y
0
= y
. Thus y
0
A
0
d
0 and T
x
0
z
,= T
y
z
.
Since (S
d
)
x
0
, (S
d
)
y
0
M and [(S
d
)
x
0
, (S
d
)
y
0
] = (S
d
)
x
0
+y
0
(Lemma 4.3), then
(S
d
)
x
0
, (S
d
)
x
0
+y
0
M. Claim 2 follows.
Suppose that x A
d
with d
i
(x) = 1. Note that T
x
0
z
= T
xx
0
z
implies
xx
0
z = 2(x
0
z), i.e., x3x
0
z = 0, and that T
x
0
+y
0
z
= T
xx
0
y
0
z
implies x 3x
0
3y
0
+ z = 0. Then T
x
0
z
,= T
xx
0
z
or T
x
0
+y
0
z
,=
T
xx
0
y
0
z
. Without loss of generality we assume that T
x
0
z
,= T
xx
0
z
.
By Lemma 4.3 we know that [(S
d
)
xx
0
, (S
d
)
x
0
] = (S
d
)
x
. By noting that
(S
d
)
xx
0
, (S
d
)
x
0
M, we get that (S
d
)
x
M. So (S
d
)
x
M for x A
d
with d
i
(x) 1.
Suppose that (S
d
)
x
0
M for all x
0
A
d
with d
i
(x
0
) = k for a xed k 1.
Consider x A
d
with d
i
(x) = k+1. Similar to the above argument we know
that T
x
0
z
,= T
xx
0
z
or T
x
0
+y
0
z
,= T
xx
0
y
0
z
, say T
x
0
z
,= T
xx
0
z
,
By Lemma 4.3 we obtain that [(S
d
)
xx
0
, (S
d
)
x
0
] = (S
d
)
x
. By noting that
(S
d
)
xx
0
, (S
d
)
x
0
M, we get that (S
d
)
x
M.
By induction we obtain that (S
d
)
x
M for all x A
d
. This completes
the proof of Lemma 4.4.
Lemma 4.5. (a) Suppose that D
1
, D
2
are derivations of a Lie algebra g,
and that M g generates g. If D
1
[
M
= D
2
[
M
, then D
1
= D
2
.
(b) Suppose that D
1
, D
2
Der(S
d
) are homogeneous of degree y A
d
. If
D
1
(u) = D
2
(u) for all u (S
d
)
x
with x A
+
d
, i.e., D
1
[
S
+
d
= D
2
[
S
+
d
,
then D
1
= D
2
.
Proof. (a) is obvious.
444 KAIMING ZHAO
(b) It suces to show that D
1
(t
x
d
1
) = D
2
(t
x
d
1
) for x A
d,1
. If x +y /
A
d
z, we see that D
1
(t
x
d
1
) = D
2
(t
x
d
1
) = 0. Suppose that x+y A
d
z,
and that D
1
(t
x
d
1
) = t
x+y
,= 0 and D
2
(t
x
d
1
) = t
x+y
. If ,=
we choose
x
A
+
d
with x
+x A
+
d
such that (x
(x
) ,= 0. For any
1
T
x
z
,
we have
[t
x
1
, t
x
d
1
] = t
x
+x
(
1
(x)d
1
d
1
(x
)
1
).
Then [t
x
1
, t
x+y
] = [t
x
1
, t
x+y
], i.e.,
1
(x +y) (x
)
1
=
1
(x +y)
(x
)
1
,
so, we deduce that
1
(x +y)(
) = ((x
(x
))
1
,
1
T
x
z
.
This is impossible since dimT
x
z
> 1. Thus we get a contradiction. Con-
sequently =
, i.e., D
1
(t
x
d
1
) = D
2
(t
x
d
1
).
Now we are ready to describe the homogeneous derivations D
y
in (4.1).
Proposition 4.6. If y / A
d
, then every homogeneous derivation D of S
d
of degree y is 0.
Proof. We shall divide the proof into three cases.
Case 1. Suppose d
i
(y) 3 for some i I.
From Corollary 3.2 we see that D((S
d
)
x
) = 0 for all x A
d
with d
i
(x) 1.
By Lemmas 4.4 and 4.5 it follow that D = 0.
Case 2. Suppose d
1
(y) = 2 and d
i
(y) 2 for all i I.
Then D((S
d
)
x
) = 0 for x A
d
with d
1
(x) 0. If [I[ = 1, since (S
d
)
z
= 0
then D((S
d
)
zy
) (S
d
)
z
= 0 and d
1
(z y) = 1. By Lemma 4.4 it follows
that D = 0.
Suppose [I[ > 1. Assume that u
1
, u
2
, , u
n
A such that d
i
(u
j
) =
i,j
. If d
i
(y) 0 for some i I 1, it follows from d
1
(u
i
u
1
+ y) =
1, d
i
(u
i
u
1
+ y) < 0 that D((S
d
)
u
i
u
1
) (S
d
)
u
i
u
1
+y
= 0. By Lemmas
4.4 and 4.5, we also have D = 0.
Suppose [I[ > 1 and d
i
(y) > 0 for all i I 1. Choose v
1
A
#
1
, v
2
A
#
2
. Let D(t
v
2
v
1
d
2
) = t
v
2
v
1
+y
d
1
for some F. From [t
v
2
v
1
d
2
, d
1
d
2
] = 2t
v
2
v
1
d
2
and D(d
1
d
2
) = 0 we get that [t
v
2
v
1
+y
d
1
, d
1
d
2
] =
2t
v
2
v
1
+y
d
1
. Then (d
1
(y) d
2
(y)) = 0. Since d
1
(y) d
2
(y) < 3 we
obtain that = 0, i.e., D((S
d
)
v
i
v
1
) = 0. By Lemmas 4.4 and 4.5, it follows
that D = 0.
Case 3. Suppose that [I[ 2, that d
i
(y) 1 for all i I and that
d
1
(y) = d
2
(y) = 1. We rst show that D((S
d
)
x
) = 0 for all x A
d
with
d
1
(x) 0. If x + y / A
d
then D((S
d
)
x
) (S
d
)
x+y
= 0. In particular,
D(T
z
) = 0. Suppose x A
d
with d
1
(x) = 0 and x+y A
d
. then x+y A
1
and d
2
(x) 1, d
i
(x) 0, i > 1. If y ,= z then y / F z. Choose
1
T
z
T
y
.
GENERALIZED CARTAN LIE ALGEBRAS 445
Let D(t
x
) = t
x+y
d
1
where F. Applying D to [
1
, t
x
] =
1
(x)t
x
,
we obtain that
1
(y) = 0. thus = 0. Consequently D((S
d
)
x
) = 0 in this
case.
Suppose that y = z. Recall that x, x + y(= x + z) A
d
, d
1
(x) = 0 and
d
i
(x) > 0 for all i I 1. Choose u
j
A
#
j
, j I. For any x
0
A
0
d
0
we have
(4.4) [t
x
0
, t
u
1
1
] = t
x
0
u
1
((u
1
)
1
+
1
(x
0
)),
where T
x
0
z
,
1
T
u
1
+z
. Let D(t
u
1
1
) =
1
t
zu
1
d
2
, where
1
F.
Since x
0
, z, u
1
are linearly independent, we can choose T
x
0
z
T
zu
1
T
z
.
Then (u
1
)
1
+
1
(x
0
)[
1
T
u
1
+z
) = T
x
0
u
1
z
since / T
u
1
+z
and
, (u
1
)
1
+
1
(x
0
)[
1
T
u
1
+z
) = T. Applying D to (4.4) we obtain that
D(t
x
0
u
1
((u
1
)
1
+
1
(x
0
))) = [t
x
0
,
1
t
zu
1
d
2
]
=
1
(z u
1
)t
zu
1
d
2
= 0,
for all
1
T
u
1
+z
. Then D((S
d
)
x
0
u
1
) = 0. Similarly we can get
D((S
d
)
x
0
u
2
) = 0. By [(S
d
)
x
0
, (S
d
)
x
0
u
2
] = (S
d
)
u
2
we have D((S
d
)
u
2
) =
0. By induction on k = d
2
(x) 2, and using [(S
d
)
u
2
, (S
d
)
x+u
2
] = (S
d
)
x
,
we can obtain that D((S
d
)
x
) = 0 for x A
d
with d
1
(x) = 0.
Next we claim that D((S
d
)
u
1
) = 0 for u
1
A.
If y u
1
/ A
d
we have D((S
d
)
u
1
) (S
d
)
yu
1
= 0.
Suppose that y u
1
A
d
. Then d
i
(y) 0 for all i 3. If y = z,
from the above argument we know that D((S
d
)
u
1
) = 0. Suppose also
that y ,= z. Choose
1
T
z
T
y
. Let D(t
u
1
) =
t
yu
1
d
2
for
T
z+u
1
, where
F. Then by [
1
, t
u
1
] =
1
(u
1
)t
u
1
we obtain that
[
1
,
t
yu
1
d
2
] =
1
(u
1
)t
u
1
d
2
. Thus
1
(y) = 0. Since
1
(y) ,= 0 we
infer that
= 0. Consequently D((S
d
)
u
1
) = 0 also. Therefore our claim
is true. By Lemmas 4.4 and 4.5 we conclude that D = 0 in this case. Hence
we have proved that D = 0 when y / A
d
.
Proposition 4.7. Suppose that y A
d
0, and that D Der(S
d
) is
homogeneous of degree y.
(a) If y ,= z, there exists t
y
0
(S
d
)
y
such that D = ad (t
y
0
).
(b) If y = z, we have [I[ = 1 and D F ad (t
y
d
1
).
Proof. For any x A
d
z, we dene the linear map D
x
: T
xz
T
x+yz
(or D
x
: Fd
i
T
x+yz
if x A
i
) by D(t
x
) = t
x+y
(D
x
). By applying D
to
[t
x
1
1
, t
x
2
2
] = t
x
1
+x
2
(
1
(x
2
)
2
2
(x
1
)
1
)
where x
1
, x
2
A
d
z and
1
T
x
1
z
,
2
T
x
2
z
, we obtain that
D
x
1
1
, x
2
)
2
D
x
2
2
, x
1
)
1
+
1
(x
2
+y)D
x
2
2
2
(x
1
+y)D
x
1
1
(4.5)
= D
x
1
+x
2
(
1
(x
2
)
2
2
(x
1
)
1
)
446 KAIMING ZHAO
holds for
1
T
x
1
z
,
2
T
x
2
z
, and x
1
, x
2
A
d
z.
Case 1. Suppose that y and z are linearly independent. By setting x
2
= 0
in (4.5) we obtain
(4.6)
1
(y)D
0
(
2
) =
2
(y)D
x
1
(
1
) +D
0
(
2
), x
1
)
1
.
By setting here x
2
= 0 in (4.6) we obtain that
1
(y)D
0
(
2
) =
2
(y)D
0
(
1
).
Choose
2
T
z
T
y
, and denote
0
=
2
(y)
1
D
0
(
2
). Then we have
0
T
yz
and
(4.7) D
0
(
1
) =
1
(y)
0
,
1
T
z
.
Hence we can rewrite (4.6) as
2
(y)(D
x
1
(
1
)
1
(y)
0
+
0
(x
1
)
1
) = 0.
Thus we deduce that
D
x
1
(
1
) =
1
(y)
0
0
(x
1
)
1
.
It follows that D = ad (t
y
0
). Note that by now we have not known that
t
y
0
S
d
yet. If
0
= 0 or y A
+
d
, from
0
T
yz
then t
y
0
S
d
. If
0
,= 0
and y A
i
for some i I, since A
i
, ker(
0
), choose x
0
A
i
ker(
0
).
Since D((S
d
)
x
0
) = 0 and D = ad (t
y
0
), we deduce that
[t
y
0
, t
x
0
d
i
] = t
x
0
+y
(
0
(x
0
)d
i
+
0
) = 0.
Thus
0
=
0
(x
0
)d
i
. Hence t
y
0
S
d
.
Case 2. Suppose that y = z. Then [I[ = 1 and z A
#
1
. Since D
0
((S
d
)
0
)
(S
d
)
z
= 0, we know that D
0
= 0.
Claim 1. For x
1
A
d
z with x
1
, F z, there exists a constant a
x
1
F
such that
(4.8) D
x
1
= a
x
1
, T
x
1
T
z
.
If x
1
A
d
A
+
d
, clearly (4.8) is true. Next we suppose that x
1
A
+
d
. By
setting
1
=
2
= T
x
1
T
z
and x
2
= z in (4.5), we obtain that
D
x
1
, z) D
z
, x
1
) = 0,
and so
(4.9) D
z
, x
1
) = D
x
1
, z)
holds. On the other hand, for x
2
= z,
2
= T
x
1
T
z
, and arbitrary
1
T
x
1
z
, (note that we allow x
1
A
1
here), (4.5) gives that
(4.10) D
x
1
1
, z) +D
z
, x
1
)
1
=
1
(z)D
x
1
z
.
GENERALIZED CARTAN LIE ALGEBRAS 447
By evaluating both sides at x
1
and using
1
(x
1
) =
1
(z), we obtain that
1
(z)[D
x
1
z
, x
1
) D
z
, x
1
)] = 0.
As x
1
, F z, we can choose
1
T
x
1
z
T
z
, and so
(4.11) D
x
1
z
, x
1
) = D
z
, x
1
), T
x
1
T
z
.
By substituting x
1
+z for x
1
in (4.9), and using D
z
, z) = 0, we infer that
D
x
1
z
, x
1
) = D
z
, x
1
)
holds for T
x
1
T
z
. By comparing this equation with (4.11), we conclude
that
(4.12) D
z
, x
1
) = 0, x
1
A
d
, T
x
1
T
z
.
Now (4.10) gives that
1
(z)D
x
1
z
= D
x
1
1
, z)
for
1
T
x
1
z
, T
x
1
T
z
, x
1
A
d
, with x
1
/ F z. By choosing
1
T
x
1
z
T
z
and setting a
x
1
z
=
D
x
1
1
,z
1
(z)
, then we have D
x
1
z
= a
x
1
z
,
thus
D
x
1
= a
x
1
, x
1
A
d
, T
x
1
T
z
.
Hence our rst claim is proved.
Claim 2. If x
1
, x
2
A
+
d
with x
1
, x
2
, x
1
+ x
2
/ F z, then
(4.13) a
x
1
+x
2
= a
x
1
+a
x
2
.
In order to prove this claim we shall consider rst the case where x
1
, x
2
,
and z are linearly independent. Then we can choose
1
(T
x
1
T
z
)T
x
2
and
2
(T
x
2
T
z
)T
x
1
. It follows that
1
(x
2
)
2
2
(x
1
)
1
is a nonzero
vector in T
x
1
+x
2
T
z
. By using the rst claim, (4.5) gives that
(4.13
) (a
x
1
+x
2
a
x
1
a
x
2
)[
1
(x
2
)
2
2
(x
1
)
1
] = 0,
and so (4.13) holds in this case.
Now assume that x
1
, x
2
and z are linearly dependent. Since dimT 3
we can choose w A such that x
1
, w and z are linearly independent. By
using the case already established, we have
a
x
1
= a
x
1
+w
+a
w
= a
x
1
+a
w
+a
w
,
and
a
x
1
+x
2
= a
x
1
+w
+a
x
2
w
= a
x
1
+a
w
+a
x
2
+a
w
,
and conclude again that (4.13) holds. Hence our second claim is proved.
Now let x A
+
d
with x F z and set a
x
= a
x+v
a
v
where v A
+
d
with
v , F z. By our second claim, a
x+v
a
v
is independent of the choice of v.
448 KAIMING ZHAO
With a
x
now dened for all x A
+
d
, it is easy to see that (4.13) is valid for
all x
1
, x
2
A
+
d
.
We shall now remove the restriction x
1
, F z in (4.8). Thus assume
that x
1
F z and x
1
A
+
d
. We choose x
2
A
+
d
so that x
2
, F z, and
let
1
T
z
and
2
T
x
2
T
z
. By using the rst claim and
2
(x
1
) =
1
(y) = 0, the equation (4.5) gives D
x
1
1
, x
2
)
2
= a
x
1
1
(x
2
)
2
. Hence
(D
x
1
a
x
1
)
1
, x
2
) = 0 for all x
2
A
+
d
with x
2
, F z, and so (4.8) holds
also for x
1
F z.
For x A
+
d
, we dene linear maps D
x
: T
xz
T by D
x
= D
x
a
x
.
By Claim 1, T
x
T
z
is contained in the kernel of D
x
. In particular, if x F z
and x ,= z, then D
x
= 0. If x , F z, then T
x
T
z
is a hyperplane in T
xz
,
and so the vector
(4.14)
x
:=
D
(y)
is independent of the choice of T
xz
T
z
(note that y = z). Thus we
have
(4.15) D
x
= (y)
x
, x A
+
d
, T
xz
.
If x F z, then
x
is not dened but (4.15) is also valid because D
x
= 0 and
(y) = 0 for T
xz
= T
z
.
By substituting D
x
1
+ a
x
1
for D
x
1
and making similar substitutions for
D
x
2
and D
x
1
+x
2
in (4.5), we obtain that
D
x
1
1
, x
2
)
2
D
x
2
2
, x
1
)
1
+
1
(x
2
+y)D
x
2
2
(x
1
+y)D
x
1
1
+a
x
2
1
(x)
2
a
x
1
2
(x)
1
= D
x
1
+x
2
(
1
(x
2
)
2
2
(x
1
)
1
)
holds for x
1
, x
2
A
+
d
,
1
T
x
1
z
and
2
T
x
2
z
.
By using (4.15) and similar expressions for D
x
2
and D
x
1
+x
2
, the last
equation can be rewritten as follows
1
(y)[a
x
2
+
x
1
(x
2
)]
2
2
(y)[a
x
1
+
x
2
(x
1
)]
1
(4.16)
= [
1
(x
2
)
2
(y)
2
(x
1
)
1
(y)]
x
1
+x
2
+
1
(y)
2
(x
1
+y)
x
1
2
(y)
1
(x
2
+y)
x
2
.
Claim 3. The vector
x
for x A
+
d
with x / F z are independent of x.
Suppose x
1
, x
2
A
+
d
with x
1
, x
2
, x
1
+ x
2
/ F z. For
1
T
x
1
T
z
0
and
2
T
x
2
z
T
z
, (4.16) gives
(4.17)
1
(x
2
)(
x
2
x
1
+x
2
) = [a
x
1
+
x
2
(x
1
)]
1
.
GENERALIZED CARTAN LIE ALGEBRAS 449
If x
1
, x
2
, z are linearly dependent, we see that
1
(x
2
) = 0. Thus we obtain
from (4.17) that a
x
1
=
x
2
(x
1
).
Assume that x
1
, x
2
, z are linearly independent. Then
1
T
x
1
T
z
can
be chosen so that
1
(x
2
) ,= 0. By evaluating both sides of (4.17) at x
1
, we
obtain that
1
(x
2
)(
x
2
(x
1
)
x
1
+x
2
(x
1
)) = 0. Since
1
(x
2
) ,= 0 we deduce
that
(4.18)
x
2
(x
1
) =
x
1
+x
2
(x
1
), x
1
, x
2
A
+
d
, with x
1
, x
2
, x
1
+ x
2
/ F z.
Symmetrically we have
x
1
(x
2
) =
x
1
+x
2
(x
2
), x
1
, x
2
A
+
d
, with x
1
, x
2
, x
1
+ x
2
/ F z.
By evaluating (4.17) at x
2
we get
a
x
1
=
x
2
(x
2
x
1
)
x
1
+x
2
(x
2
),
i.e.,
a
x
1
=
x
2
(x
2
x
1
)
x
1
(x
2
). (4.19)
By evaluating (4.17) at z, we nd
x
1
+x
2
(z) =
x
2
(z) =
x
1
(z).
By evaluating (4.16) at z, we nd that
1
(z)
2
(z)[
x
1
(x
2
)
x
2
(x
1
) +a
x
2
a
x
1
] = 0.
Since we can choose
1
T
x
1
z
and
2
T
x
2
z
such that
1
(z)
2
(z) ,= 0,
we infer that
a
x
1
a
x
2
=
x
1
(x
2
)
x
2
(x
1
).
By replacing x
1
with x
1
+x
2
and using Claim 2, we obtain the equation
a
x
1
=
x
1
+x
2
(x
2
)
x
2
(x
1
+x
2
) =
x
1
(x
2
)
x
2
(x
1
+x
2
).
Combining this with (4.19), we deduce that
x
1
(x
2
) =
x
2
(x
2
). Consider-
ing (4.19) also we have a
x
1
=
x
2
(x
1
). So
x
1
(x
2
) is independent of x
1
,
consequently
x
1
is independent of x
1
. Then Claim 3 is proved.
Denote
x
(x A
+
d
with x / F z) by
0
. So we have
a
x
=
0
(x), x A
+
d
, with x / F z.
By the denition of a
x
1
for x A
+
d
with x / F z, we also have
(4.20) a
x
=
0
(x), x A
+
d
.
Combining this with (4.15), we deduce that
(4.21) D
x
= (y)
0
+
0
(x), x A
+
d
, T
xz
.
So D[
S
+
d
= ad (t
y
0
)[
S
+
d
. By Lemma 4.5 we see that D = ad (t
y
0
).
450 KAIMING ZHAO
Choose x A
0
d
0, and T
xz
T
z
. We see that D((S
d
)
x
) (S
d
)
x+z
=
Ft
x+z
d
1
, i.e., D
x
=
d
1
for some
d
1
= (y)
0
+
0
(x).
Since d
1
/ T
xz
and dimT
xz
2 we conclude from the above equation that
0
(x) = 0. Furthermore we deduce that
0
=
(y)
1
d
1
, i.e.,
0
Fd
1
.
Case 3. Suppose that y = z, where N = 1, 2, 3, . . . .
By setting x
2
= 0 in (4.5) we obtain
1
(y)D
0
(
2
) = D
0
(
2
), x
1
)
1
,
for
1
T
x
1
z
,
2
T
z
. Since dimT
x
1
z
2 we deduce that D
0
(
2
), x
1
) =
0 for all x A
d
. It follows that D
0
(
2
) = 0 for
2
T
z
, i.e., D
0
= 0.
Now we show that Claim 1 is also true in this case.
First suppose that x
1
A
+
d
with x
1
A
d
and x / F z. Since x / F z
and x
1
z A
+
d
, we can choose
2
T
x
1
z
T
z
. By setting x
2
= x
1
and
1
= T
x
1
T
z
in (4.5) and using D
0
= 0 we obtain that
2
(x
1
+y)D
x
1
() +D
x
1
(), x
1
)
2
+D
x
1
(
2
), x
1
) = 0.
By evaluating the above equation at x
1
+y z and by using D
x
1
(), x
1
+
y z) = 0, (x
1
) = (y) = (z) = 0, we conclude that D
x
1
(), x
1
) = 0,
and consequently
2
(x
1
+y)D
x
1
() = D
x
1
(
2
), x
1
),
holds for all T
x
1
T
z
and
2
T
x
1
z
. Since
2
(x
1
+y) = (+1)
2
(z) ,=
0, then (4.8) holds for x
1
A
+
d
with x
1
A
d
and x / F z.
We shall show (4.8) for x
1
A
+
d
by induction on d(x
1
) :=
iI
d
i
(x
1
).
This has been proved for all x
1
with d(x
1
) 1. Now suppose (4.8)
holds for all x
1
with d(x
1
) k ( 1). Consider x
2
A
+
d
with d(x
2
) =
k + 1. Choose x
0
A
+
d
with d(x
0
) = 1 and with x
0
, x
2
, z being linearly
independent. By inductive hypothesis we have
D
x
0
= a
x
0
T
x
0
T
z
,
D
x
2
x
0
= a
x
2
x
0
, T
x
2
x
0
T
z
.
By replacing x
1
with x
0
, x
2
with x
2
x
0
,
1
with
0
,
2
with respectively
in (4.5), we obtain that
(4.22) D
x
2
(
0
(x
2
) (x
0
)
0
) = (a
x
0
+a
x
2
x
0
)(
0
(x
2
) (x
0
)
0
)
for all
0
T
x
0
T
z
, T
x
2
x
0
T
z
. It suces to show that
0
(x
2
) (x
0
)
0
[
0
T
x
0
T
z
, T
x
2
x
0
T
z
) = T
x
2
T
z
.
Choose
0
(T
x
0
T
z
)T
x
2
, then
F
0
+
0
(x
2
) (x
0
)
0
[ T
x
2
x
0
T
z
) = F
0
+T
x
2
x
0
T
z
GENERALIZED CARTAN LIE ALGEBRAS 451
is codimension 1 in T. Hence the subspace
0
(x
2
) (x
0
)
0
[ T
x
2
x
0
T
z
)T
x
2
T
z
is codimension 2 in T. Therefore (4.22) holds. Thus there exists a
x
2
F
such that D
x
2
= a
x
2
for all T
x
2
T
z
. Consequently Claim 1 is true.
Exactly the same as that in Case 2, Claim 2 is true in this case also.
Now same as in Case 2, we dene a
x
for x A
+
d
with x F z and we can
remove the restriction x / F z in (4.8), then we can also dene the linear
map D
x
, the vector
x
in (4.14) for x A
+
d
with x / F z, and same as before
we also get equations (4.15), (4.16).
Now we claim that Claim 3 is true in this case also. The proof is exactly
the same as it was in Case 2.
We also denote
x
(x A
+
d
with x / F z) by
0
. So we have
a
x
=
0
(x), x A
+
d
, with x / F z.
The same as in Case 2, Equations (4.20) and (4.21) are true. So D[
S
+
d
=
ad (t
y
0
)[
S
+
d
. By Lemma 4.5 we see that D = ad (t
y
0
). By denition of
0
we know that t
y
0
(S
d
)
y
.
By now we have completed the proof of Proposition 4.7.
Proposition 4.8. Suppose that D Der(S
d
) is homogeneous of degree 0.
Then there exists a hom(A, F) such that D = D
.
Proof. For any x A
+
d
, we dene the linear map D
x
: T
xz
T
xz
(or
D
x
: Fd
i
Fd
i
if x A
i
) by D(t
x
) = t
x+y
(D
x
). If T
x
1
z
, then
(x
1
) = (z) and D
x
1
, x
1
) = D
x
1
, z). As x = 0, the equation (4.5)
takes the form
D
x
1
1
, x
2
)
2
D
x
2
2
, x
1
)
1
+
1
(x
2
)D
x
2
2
2
(x
1
)D
x
1
1
(4.23)
= D
x
1
+x
2
(
1
(x
2
)
2
2
(x
1
)
1
)
where
1
T
x
1
z
,
2
T
x
2
z
, and x
1
, x
2
A
+
d
. By setting v = 0 in (4.23),
we obtain that D
0
2
, x
1
)
1
= 0. Hence D
0
= 0.
We claim that (4.8) holds for x
1
A
d
z and T
x
1
T
z
.
If x
1
A
d
A
+
d
, (4.8) holds clearly. Next suppose that x A
+
d
.
First suppose that x
1
A
+
d
with x
1
A
d
and x / F z. Since x / F z
and x
1
z A
+
d
, we can choose
2
T
x
1
z
T
z
. By setting x
2
= x
1
and
1
= T
x
1
T
z
in (4.23) and using D
0
= 0, we obtain that
2
(x
1
)D
x
1
() +D
x
1
(), x
1
)
2
+D
x
1
(
2
), x
1
) = 0.
452 KAIMING ZHAO
By evaluating the above equation at x
1
z and by using D
x
1
(), x
1
z) = 0,
(x
1
) = (z) = 0, we conclude that D
x
1
(), x
1
) = 0, and consequently
2
(x
1
)D
x
1
() = D
x
1
(
2
), x
1
),
holds for all T
x
1
T
z
and
2
T
x
1
z
. Since
2
(x
1
) =
2
(z) ,= 0, then
(4.8) holds for x
1
A
+
d
with x
1
A
d
and x / F z.
We shall show (4.8) for x
1
A
+
d
by induction on d(x
1
) :=
iI
d
i
(x
1
).
This has been proved for all x
1
with d(x
1
) 1. Now suppose (4.8)
holds for all x
1
with d(x
1
) k ( 1). Consider x
2
A
+
d
with d(x
2
) =
k + 1. Choose x
0
A
+
d
with d(x
0
) = 1 and with x
0
, x
2
, z being linearly
independent. By inductive hypothesis we have
D
x
0
= a
x
0
T
x
0
T
z
,
D
x
2
x
0
= a
x
2
x
0
, T
x
2
x
0
T
z
.
By replacing x
1
with x
0
, x
2
with x
2
x
0
,
1
with
0
,
2
with respectively
in (4.23), we obtain that
D
x
2
(
0
(x
2
) (x
0
)
0
) = (a
x
0
+a
x
2
x
0
)(
0
(x
2
) (x
0
)
0
)
for all
0
T
x
0
T
z
, T
x
2
x
0
T
z
. It suces to show that
0
(x
2
) (x
0
)
0
[
0
T
x
0
T
z
, T
x
2
x
0
T
z
) = T
x
2
T
z
.
Choose
0
(T
x
0
T
z
)T
x
2
, then
F
0
+
0
(x
2
) (x
0
)
0
[ T
x
2
x
0
T
z
) = F
0
+T
x
2
T
z
is codimension 1 in T. Hence the subspace
0
(x
2
) (x
0
)
0
[ T
x
2
x
0
T
z
)T
x
2
T
z
is codimension 2 in T. Therefore (4.23) holds. Thus there exists a
x
2
F
such that D
x
2
= a
x
2
for all T
x
2
T
z
. Consequently our claim about
(4.8) is true.
Exactly the same as that in Case 2 in the proof of Proposition 4.7, Claim
2 in the proof of Proposition 4.7 is true in this case also.
Now as in Case 2 of the proof of Proposition 4.7, we dene a
x
for x
A
d
z with x F z and we can remove the restriction x / F z in (4.8).
Then we have obtained that:
(a) For any x A
d
z, there exists a constant a
x
F such that
(4.8
) D
x
= a
x
, T
x
T
z
.
(b) For all x
1
, x
2
A
+
d
,
a
x
1
+x
2
= a
x
1
+a
x
2
.
Now we claim that, for any x A
d
z,
(4.24) D
x
= a
x
, T
xz
.
GENERALIZED CARTAN LIE ALGEBRAS 453
If x F z, this follows from (4.8
) since T
x
T
z
= T
x
T
z
= T
z
. If
x A
d
(A
+
d
z), (4.24) is clear. Next we suppose x A
+
d
with x / F z.
We shall show this by induction on d(x) :=
iI
d
i
(x).
If d(x) = 0, by replacing D with D + D
. For any
i I, u
i
A
i
, appling D to [t
x
, t
u
i
d
i
] = t
x+u
i
(u
i
)d
i
, we obtain that
[a
x
t
x
, t
u
i
d
i
] + [t
x
, a
u
i
t
u
i
d
i
] = (a
x
+a
u
i
)t
x+u
i
(u
i
)d
i
.
We deduce that a
x
(
)(u
i
)d
i
= 0. Thus (u
i
) =
(u
i
) for all u
i
A
i
and any i I. So we obtain that =
.
Suppose (4.24) holds for any x A
d
z with d(x) k where k 0.
Consider x
1
A
+
d
(Zz) with d(x
1
) = k + 1. Suppose t
x
1
(S
d
)
x
1
with
T
xz
T
z
. For any i I and any u
i
A
i
, we know that D[
(S
d
)
x
1
+u
i
=
a
x
1
+u
i
[
(S
d
)
x
1
+u
i
. By replacing D with D+D
. Appling D to
[t
x
1
, t
u
i
d
i
] = t
x
1
+u
i
((u
i
)d
i
d
i
(x
1
)d
i
), we obtain that
a
x
1
[t
x
1
, t
u
i
d
i
] + [t
x
1
, a
u
i
t
u
i
d
i
] = (a
x
1
+a
u
i
)t
x
1
+u
i
((u
i
)d
i
d
i
(x
1
)d
i
).
Then we infer that d
i
(x
1
)(
) = (
)(u
i
)d
i
. If (
)(u
i
) ,= 0, we
deduce that d
i
T
x
1
z
. This contradicts the fact that x
1
A
+
d
. So we
deduce that (u
i
) =
(u
i
) for all u
i
A
i
, any i I. Thus =
.
By induction we see that (4.24) is true. Dene hom(A, F) so that
(x) = a
x
for allx A
+
d
. Then we see that D[
S
+
d
= ad (t
y
0
)[
S
+
d
. By Lemma
4.5 we conclude that D = D
.
We now summarize the results on derivations of S
d
(A, T, , z) obtained
in this section.
Theorem 4.9. Every D Der(S
d
(A, T, , z)) has the form D =
yA
D
y
for degree y derivations D
y
, such that for each u S
d
(A, T, , z) there only
nitely many y A with D
y
(u) ,= 0, where
(a) D
y
= ad (t
y
0
) for some t
y
0
(S
d
)
y
if y A
d
0, z;
(b) D
y
= aad (t
y
d
1
) for some a F if y = z A
d
;
(c) D
y
= D
3
415
Christina Sormani: Harmonic functions on manifolds with nonnegative Ricci
curvature and linear volume growth 183
Katherine F. Stevenson with Amlcar Pacheco 143
Edgar Lee Stout with Josip Globevnik 65
M. Sundari with M. Cowling and A. Sitaram 293
B.A. Taylor with R udiger W. Braun and Reinhold Meise 201
M.K. Vamanamurthy with G.D. Anderson, S.-L. Qiu and M. Vuorinen 1
M. Vuorinen with G.D. Anderson, S.-L. Qiu and M.K. Vamanamurthy 1
Russell B. Walker with Michael D. Sanford 369
Allen Weitsman: On univalent harmonic mappings and minimal surfaces 191
Zhao Xuan with Duan Haibao 93
Kaiming Zhao: Generalized Cartan type S Lie algebras in characteristic 0 (II) 431
Guidelines for Authors
Authors may submit manuscripts at pjm.math.berkeley.edu/about/journal/submissions.html
and choose an editor at that time. Exceptionally, a paper may be submitted in hard copy to
one of the editors; authors should keep a copy.
By submitting a manuscript you assert that it is original and is not under consideration
for publication elsewhere. Instructions on manuscript preparation are provided below. For
further information, visit the web address above or write to pacic@math.berkeley.edu or
to Pacic Journal of Mathematics, University of California, Los Angeles, CA 900951555.
Correspondence by email is requested for convenience and speed.
Manuscripts must be in English, French or German. A brief abstract of about 150 words or
less in English must be included. The abstract should be self-contained and not make any
reference to the bibliography. Also required are keywords and subject classication for the
article, and, for each author, postal address, afliation (if appropriate) and email address if
available. A home-page URL is optional.
Authors are encouraged to use L
A
T
E
X, but papers in other varieties of T
E
X, and exceptionally
in other formats, are acceptable. At submission time only a PDF le is required; follow
the instructions at the web address above. Carefully preserve all relevant les, such as
L
A
T
E
X sources and individual les for each gure; you will be asked to submit them upon
acceptance of the paper.
Bibliographical references should be listed alphabetically at the end of the paper. All
references in the bibliography should be cited in the text. Use of BibT
E
X is preferred but
not required. Any bibliographical citation style may be used but tags will be converted to
the house format (see a current issue for examples).
Figures, whether prepared electronically or hand-drawn, must be of publication quality.
Figures prepared electronically should be submitted in Encapsulated PostScript (EPS) or
in a form that can be converted to EPS, such as GnuPlot, Maple or Mathematica. Many
drawing tools such as Adobe Illustrator and Aldus FreeHand can produce EPS output.
Figures containing bitmaps should be generated at the highest possible resolution. If there
is doubt whether a particular gure is in an acceptable format, the authors should check
with production by sending an email to pacic@math.berkeley.edu.
Each gure should be captioned and numbered, so that it can oat. Small gures occupying
no more than three lines of vertical space can be kept in the text (the curve looks like
this:). It is acceptable to submit a manuscript will all gures at the end, if their placement
is specied in the text by means of comments such as Place Figure 1 here. The same
considerations apply to tables, which should be used sparingly.
Forced line breaks or page breaks should not be inserted in the document. There is no point
in your trying to optimize line and page breaks in the original manuscript. The manuscript
will be reformatted to use the journals preferred fonts and layout.
Page proofs will be made available to authors (or to the designated corresponding author)
at a Web site in PDF format. Failure to acknowledge the receipt of proofs or to return
corrections within the requested deadline may cause publication to be postponed.
PACIFIC JOURNAL OF MATHEMATICS
Volume 192 No. 2 February 2000
Characterization of the homogeneous polynomials P for which (P + Q)(D)
admits a continuous linear right inverse for all lower order perturbations Q 201
RDIGER W. BRAUN, REINHOLD MEISE AND B.A. TAYLOR
Cubic modular equations and new Ramanujan-type series for 1/ 219
HENG HUAT CHAN AND WEN-CHIN LIAW
On the stability of canonical forms of singular linear difference systems 239
GUOTING CHEN AND ABDELMAJID FAHIM
Weighted graph Laplacians and isoperimetric inequalities 257
F.R.K. CHUNG AND KEVIN ODEN
Cotilting modules and bimodules 275
RICCARDO COLPI AND KENT R. FULLER
Hardys Uncertainty Principle on semisimple groups 293
M. COWLING, A. SITARAM AND M. SUNDARI
Restrictions of
m
(q)-modules to alternating groups 297
WILLIAM J. HUSEN
How large are the spectral gaps? 307
ALEX IOSEVICH AND STEEN PEDERSEN
Finite relative determination and relative stability 315
LEN KUSHNER AND BRASIL TERRA LEME
Braided-Lie bialgebras 329
SHAHN MAJID
Reducible Dehn surgery and annular Dehn surgery 357
RUIFENG QIU
Extending maps of a Cantor set product with an arc to near
homeomorphisms of the 2-disk 369
MICHAEL D. SANFORD AND RUSSELL B. WALKER
K-types of SU(1, n) representations and restriction of cohomology 385
MARK R. SEPANSKI
Veronese subrings and tight closure 399
ANURAG K. SINGH
Rational curves on a complete intersection CalabiYau variety in
3
3
415
DAG EINAR SOMMERVOLL
Generalized Cartan type S Lie algebras in characteristic 0 (II) 431
KAIMING ZHAO
P
a
c
i
c
J
o
u
r
n
a
l
o
f
M
a
t
h
e
m
a
t
i
c
s
2
0
0
0
V
o
l
.
1
9
2
,
N
o
.
2
Pacic
Journal of
Mathematics
Volume 192 No. 2 February 2000