Professional Documents
Culture Documents
org presents
PMATH 453/753
Functional Analysis
Dr. Nicolaas Spronk Fall 2012 (1129) University of Waterloo
www.math.uwaterloo.ca/nspronk
Disclaimer: These notes are provided as-is, and may be incomplete or contain errors.
Contents
1 Normed spaces and Banach spaces
3 `p -spaces
5 Separation by hyperplanes
12
15
20
22
25
10 Nets
27
31
34
13 Spectral theory
40
14 Adjoint operators
44
15 Compact operators
46
48
50
Let F be R or C. Define
(
|x| =
absolute value of x if x R
modulus of x
if x C.
n
X
!1/2
|xi |
i=1
where the integral is a Riemann integral. (C[0, 1], kkp ) is a normed space. It is not complete. Indeed, consider
the sequence of functions
if 0 t 21
0
fn (t) = some affine bit if 12 < t < 12 + n1
1
if 12 + n1 t 1.
This sequence is k kp -Cauchy but converges to no g C[0, 1], i.e. for no such g do we have limn kfn gkp = 0.
1 M147
2 PM351.
3 PM450/354.
(vi) Define
` = {x = (x1 , x2 , . . .) FN : xi = 0 for all but finitely many indices i}.
This is an F-vector space under pointwise operations
(x1 , x2 , . . .) + (y1 , y2 , . . .) = (x1 + y1 , x2 + y2 , . . .),
v
u
uX
kxk2 = t
|xi |2 ,
iN
kxk1 =
i=1
|xi |.
i=1
Note that all sums are finite by definition of `. Convince yourself that none of (`, k k ), (`, k k2 ), (`, k k1 )
are complete.
2.1 Definition. Given a non-empty set X, define its power set, P(X) = {A : A X}. A topology on X is a
subfamily P(X) such that
(i) , X .
(ii) {Ui }iI implies that
iI
n
\
i=1
We call the elements of ( -)open sets. The pair (X, ) is called a topological space.
2.2 Example. We have the following examples:
(i) triv = {, X} is called the trivial topology.
(ii) Let (X, ) be a metric space, i.e. : X X R0 satisfies
(i) (x, y) = 0 if and only if x = y (non-degeneracy).
(ii) (x, y) = (y, x) (symmetry).
(iii) (x, z) (x, y) + (y, z) (triangle inequality).
For > 0, we define the open ball of radius centered at x by B (x) = {x0 X : (x, x0 ) < }. We let
= {U X : for each x U , there is x > 0 such that Bx (x) U }.
(iii) d = P(X), called the discrete topology. Notice that d = , where is the discrete metric, i.e.
(
1 if x 6= y
(x, y) =
0 if x = y.
(iv) The Sorgenfrey line: Take X = R. Let
= {U R : for each x U , there is x > 0 such that [x, x + x ) U }.
Show there is no metric such that = .
(v) Recall, if 1 , 2 are metrics on X, we say 1 and 2 are equivalent (denoted 1 2 ) if there exist c, C > 0
such that
c1 2 C1 .
If 1 2 , then we have 1 = 2 . However, we may have 1 = 2 without 1 2 . As an example, consider
the metrics on R given by
|s t|
.
1 (s, t) = |s t|,
2 (s, t) =
1 + |s t|
The latter is bounded, whereas the former is unbounded, so there is no way these could be equivalent. However,
they induce the same topology on R.
Assignment #1 is posted; google Nico Spronk, hit PMATH 753 tab under Teaching Fall 2012. I would
advise trying most questions after Fridays lecture.
2.3 Definition. Let (X, ), (Y, ) be topological spaces. A function f : X Y is ( --)continuous at x0 X if
for each neighbourhood4 f (x0 ) V , there is a neighbourhood x0 U such that f (U ) V , i.e. f (x) V
for each x U .
2.4 Remark. Note that in metric topology, V plays the role of and U plays the role of .
2.5 Definition. Let F = R or C. Let
F
Cb (X, ) = Cb (X, ) = f : X F | f continuous, and bounded: kf k = sup |f (x)| < .
xX
(f )(x) = f (x)
x X
Let > 0. Choose N N such that if n, m N then kfn fm k < . Observe that for x X, we have
|fn (x) f (x)| = lim |fn (x) fm (x)| .
m
It follows that fn f uniformly on X. Also, let us note that f is bounded: we can pick = 1 and find some N so
that kfN f k < . However, this means that for any x X we have
|f (x)| |f (x) fN (x)| + |fN (x)| 1 + kfN (x)k .
Now we show f is continuous. Fix an arbitrary x0 X, > 0. Let n0 be such that for n, m n0 we have
kfn fm k < /3. Notice that
kfn f k = lim kfn fm k
m
,
3
for n n0 .
(we can do this by definition of continuity for fn0 ). Then for x U we have
+ + =
3 3 3
and hence f (U ) B (f (x0 )). Hence f is continuous at x0 . Thus f is continuous at each point in X.
|f (x) f (x0 )| |f (x) fn0 (x)| + |fn0 (x) fn0 (x0 )| + |fn0 (x0 ) f (x0 )| <
x = (x1 , x2 , . . .).
The map Cb (N, d ) ` , given by f 7 (f (n))nN is easily seen to be a surjective linear isometry. Hence
these spaces are isometrically isomorphic, hence (` , k k ) is complete. Verify the details.
(ii) Let
c = {x ` : lim xn exists}.
n
`p -spaces
3.1 Lemma. If I R is an open interval and : I R satisfies 00 (t) > 0 for all t I, then for a < c in I and
0 < s < 1 we have
((1 s)a + sc) < (1 s)(a) + s(c)
(*)
Any function satisfying (*) is called strictly convex.
Proof. By MVT, 0 is strictly increasing. Let b = (1 s)a + sc so that
s=
ba
,
ca
1s=
cb
.
ca
Another application of MVT allows us to find , such that a < < b < < c and
(b) (a)
(c) (b)
= 0 () < 0 () =
.
ba
cb
Thus,
((b) (a)) (c b) < ((c) (b)) (b a) = (b)(c a) < (c)(b a) + (a)(c b)
which reads ((1 s)a + sc) < s(c) + (1 s)(a).
5 PM351.
1
p
1
q
ab
with equality if and only if ap = bq .
Proof. Let (t) = log t for t (0, ), so 00 (t) = t12 > 0. Then
p
a
bq
1
1
log
+
log(ap ) log(bq ) = log a log b = log(ab).
p
q
p
q
We observe that < holds if ap 6= bq , whereas = holds if ap = bq . Now, apply the strictly decreasing function
x 7 ex to gain the desired result6 .
3.3 Theorem (Hlders Inequality). If a1 , . . . , an , b1 , . . . , bn C then
n
!1/p n
!1/q
n
X
X
X
p
q
ai bi
|ai |
|bi |
i=1
i=1
i=1
where p and q are conjugate indices. Moreover, equality holds if and only if either ai = 0 for all i or there is t 0
such that t|ai |p = |bi |q for all i, and |z| = 1 such that ai bi = z|ai bi | for all i.
P
P
1/p
1/q
Proof. Let A = ( |ai |p ) , B = ( |bi |q ) . Suppose that AB 6= 0. From the corollary above, we have
n
n
n
(1) 1 X
(2) X
1 X
|ai |p
|bi |q
1
1 1
1 p
a
b
|a
||b
|
+
A +
B q = + = 1.
=
i i
i i
p
q
p
q
AB i=1
AB i=1
pA
qB
pA
qB
p
q
i=1
Note that equality can hold at (1) if and only if |z| = 1 exists as claimed. Also, equality holds at (2) if
p
q
|ai |
|bi |
=
A
B
for all i (or at least one of A, B = 0), hence t = B q /Ap suffices.
As a corollary, we have the following generalisation of the triangle inequality.
3.4 Theorem (Minkowskis Inequality). If p > 1 and a1 , . . . , an , b1 , . . . , bn C then
!1/p
!1/p
!1/p
n
n
n
X
X
X
|ai + bi |p
|ai |p
+
|bi |p
i=1
i=1
i=1
with equality if and only if either ai = 0 for all i, or there is t 0 such that tbi = ai for all i.
P
Proof. Again, suppose
|ai + bi | =
6 0. Then by the triangle inequality and Hlders inequality,
!1/p
!1/p " n
#1/q
n
n
n
n
X
X
X
X
X
H.I.
p
p1
p
p
(p1)q
|ai + bi |
(|ai | + |bi |) |ai + bi |
|ai |
+
|bi |
|ai + bi |
.
i=1
Note that
(1)
(2)
i=1
i=1
i=1
i=1
1
1
p1
1 1
+ = 1 =
=1 =
= (p 1)q = p.
p q
q
p
p
P
1/q
Divide by ( |ai + bi |p ) , and note that 1
sgn ai = sgn bi , where
1
q
sgn z =
z
|z|
if z 6= 0
if z = 0
for some t1 , t2 0.
resultPis also a special case of a weighted AM-GM inequality (obtainable from Jensens inequality):
pi > 0 where
pi = 1. See Bollobs Linear Analysis, Chapter 1, Theorem 3.
ai i
ai pi for ai 0,
!1/p
|xi |p
i=1
<
3.6 Proposition. For 1 < p < , `p is a vector space, under pointwise operations, and k kp is a norm on `p .
Proof. If x = (x1 , x2 , . . .) let x(N ) = (x1 , . . . , xN , 0, 0, . . .). Then if x `p ,
!1/p
X
N
(N )
p
kx x kp =
|xi |
0
i=N +1
by the definition of convergence, and kxkp = limN kx(N ) kp . Hence, by Minkowskis Inequality, if x, y `p , then we
have
M.I.
kx + ykp = lim kx(N ) + y (N ) kp lim kx(N ) kp + ky (N ) kp = kxkp + kykp
N
and hence x + y `p with kx + ykp kxkp + kykp . Moreover, it is obvious that if F, and x `p , then
kxkp = ||kxkp < , so x `p .
3.7 Proposition. (`1 , k k1 ) is a normed space.
Proof. Easy exercise.
x X .
1
D(Y)
4.6 Theorem. If Y is a Banach space, then (CbY (X, ), k k ) is also a Banach space.
Proof. Trivial modifications of F-valued case.
4.7 Theorem. If X , Y are normed spaces, then (B(X , Y), k k) is a normed space. If Y is a Banach space, then
(B(X , Y), k k) is a Banach space.
Proof. First, if S, T B(X , Y) then
kS + T k = sup{k(S + T )xk : x D(X )} sup{kSxk : x D(X )} + sup{kT x0 k : x0 D(X )} = kSk + kT k.
|
{z
}
kSx+T xk
kSxk+kT xk
It is easy to show kSk = ||kSk for F. Now, suppose that Y is a Banach space. Let B(X ) have the usual
norm topology from X . Define
: B(X , Y) CbY (B(X )),
T 7 T |B(X )
that is, restricts T to the unit ball. Then is an isometry, i.e. for all T we have
k(T )k = sup{kT xk : x B(X )} = kT k.
F = lim (Tn )
n
4.8 Definition. For a vector space X , we define the algebraic dual space of X by X 0 = L(X , F); it consists of
the linear functionals on X . If X is a normed space7 , then X = B(X , F) denotes the continuous dual space, i.e.
the subspace of continuous linear functionals on X .
4.9 Proposition. If x `1 , then fx : c 0 F given by
fx (y) =
xi yi
i=1
is a bounded linear functional. Moreover, all bounded linear functionals arise in this way, with kfx k = kxk1 .
4.10 Remark. This identification x 7 fx is a linear isometric isomorphism c 0
= `1 .
Proof. Let x `1 , and y c 0 . We establish that
xi yi
i=1
i=m
i=m
X
X
X
x
y
|x
||y
|
i=1
i=1
This shows fx is well-defined. It is easy to check that it is linear. Observe that kfx k kxk1 is (essentially) already
shown. Let
y (N ) = (sgn x1 , . . . , sgn xN , 0, 0, . . .) c 0 ,
and ky (N ) k 1. Then
kfx k sup |fx (y (N ) )| = sup
N
X
|xi | =
N N n=1
|xi | = kxk1 .
n=1
Now suppose that f c 0 . Let en be the sequence with 0 everywhere except for a 1 in the nth position. Define x
by putting xi = f (ei ). We observe, by linearity, that
n
X
i=1
Thus,
|xi | = sup
i=1
n
X
|xi | < = x `1 .
nN i=1
Using the continuity of f and the fact that f = fx on ` (a dense subset), we have
f (y) = lim f ((y1 , . . . , yn , 0, 0, . . .)) = lim fx ((y1 , . . . , yn , 0, 0, . . .)) = fx (y)
n
so that f = fx .
7 More
A1, Due Monday. Office hours today 11-noon, Friday 2-3:30 pm, or by appointment. nspronk@uwaterloo.ca.
Grader needed for AM/PM 331 (Real Analysis), about 3 hours per week, $600 per term. Talk to me by 3pm
today.
4.11 Definition. A Hamel basis of an F-vector space X is any family B X such that
1. B is linearly independent i.e. for any finite distinct e1 , . . . , en B and 1 , . . . , n F, the equation
n
X
i ei = 0
i=1
n
X
i ei .
i=1
and F (x) p(x) for x X . We will refer to F as a Hahn-Banach extension of f (relative to p).
Proof. Step 1 : Suppose there is x X \ Y such that X = spanR {x, Y}. We observe for y+ , y Y, we have
f (y+ ) + f (y ) = f (y+ + y ) p(y+ + x + y x) p(y+ + x) + p(y x)
which implies that
f (y ) p(y x) p(y+ + x) f (y+ ).
Hence there is c R so that
sup{f (y ) p(y x) : y Y} c inf{p(y+ + x) f (y+ ) : y+ Y}.
0
whenever x M, for (, M) C.
f where (, M), (,
f C, with M M,
f say, then (x)
Then is well-defined: if x M and y M,
e M)
e
= (x).
Also, is clearly linear. Note that (, U) is an upper bound for C.
By Zorns Lemma, there exists a maximal element (F, M) for E. If it were the case that M ( X , then there would
0
be x X \ M. Then, by Step 1, we would find spanR {x, M} such that
|M = f , and p on spanR {x, M}.
Then (, spanR {x, M}) E and (F, M) < (, spanR {x, M}), which would violate the fact that (F, M) is maximal.
Thus M = X .
4.15 Lemma. Let X be a C-vector space, hence a R-vector space.
(i) If f XR 0 (i.e. a R-valued linear functional), then
fC (x) = f (x) if (ix)
is C-linear, i.e. fC XC 0 .
(ii) Conversely if g XC 0 and f = Re g, then f XR 0 and moreover fC = g.
(iii) If X is a normed C-space, hence a normed R-space, then f XR if and only if fC XC and kf k = kfC k.
Proof. (i) and (ii) are simple exercises.
(iii): For x X , let z(x) = sgn fC (x), so fC (x) = z(x)|fC (x)|. Then we assume f XR :
|fC (x)| = z(x)fC (x) = fC (z(x)x) = |f (z(x)x)| kf kkz(x)xk = kf kkxk
| {z }
R0
Proof. That x
(X ) is clear, and it is clear that (x 7 x
) L(X , (X ) ). Now we have for x X , f X
|
x(f )| = |f (x)| kf kkxk
so that k
xk kxk. However, by the last corollary (really, the Hahn-Banach theorem), given we have that there is
f X , kf k 1, such that
|
x(f )| = |f (x)| = kxk,
it follows that k
xk kxk. Thus k
xk = kxk.
4.19 Remark. We have:
(i) If Y is a linear subspace of a normed space X , then its closure Y X is also a linear subspace. Indeed,
observe that x 7 x and x 7 x + y are uniformly continuous on X , hence on Y.
(ii) If X is a normed space, but not complete, then we may define its completion as
X := Xb X .
Since X is a dual space, it is complete, and any closed subset of a complete space is itself complete. This
is, up to isometric isomorphism, the unique Banach space containing a dense copy of X .
Separation by hyperplanes
Indeed, if there were x D(X ) such that |f (x)| > 1 f (x0 ), then
f (x0 )
f (x) < 1
hence
f (x ) 1
0
x = f (x0 )
f (x)
| {z }
kk<1
5.3 Remark. If X is an infinite-dimensional normed space, then in fact X 0 \ X 6= (assuming the Axiom of
Choice). To see this, note that there exists a Hamel basis {e } for X , and we may assume ke k = 1. Find an
unbounded subset { } F. Let f X 0 be given by
X
X
f
e =
since A is convex, and hence sA + tA (s + t)A. On the other hand we always have
(s + t)A = {(s + t)a : a A} {sa + tb : a, b A} = sA + tA
so (s + t)A = sA + tA. Now for x, y X we have
p(x) + p(y) = inf{s 0 : x sA} + inf{t 0 : y tA} = inf{s + t : s, t 0, x sA, y tA}
inf{s + t : s, t 0, x + y sA + tA = (s + t)A}
= inf{r 0 : x + y rA} = p(x + y).
Let us prove the remaining claims:
(i) If p(x) < 1, then because A is absorbing, there is 0 < t < 1 so x tA, i.e. 1t x A, but then by convexity of
A, we have x = (1 t)0 + t 1t x A. Also, if x A, then x 1A so p(x) 1.
(ii) Let > 0 be such that D(X ) A. Then for 0 6= x X , > 0,
x (kxk + )D(X ) =
Hence p(x)
kxk+
kxk +
kxk +
D(X )
A.
1 kxk as 0. Let N = 1 .
8 If A X is not convex, the proof above still shows, at least, that the Minkowski functional of A is finite-valued, non-negative and
positive-homogeneous, with A {x X : p(x) 1}. Furthermore, if A is both convex and balanced, then p is a seminorm on X .
Converses of this result are also true; see Megginsons An Introduction to Banach Space Theory, Chapter 1, Section 9.
13
5.7 Theorem (Separation Theorem/Geometric Form of the Hahn-Banach Theorem). Suppose X is an F-vector space and A, B X are non-empty convex sets with A B = and A is absorbing about a
point a0 . Then there are f X 0 and R such that
Re f (a) Re f (b),
a A, b B.
a A, b B.
so |f (x)| N kxk so kf k N , i.e. f is continuous. Moreover, if A is open, we cannot have that f (a) = for any
a A, where is as above. Indeed there is > 0 so that a x0 A, which means that
f (a x0 ) = f (a) p(x0 )
but p(x0 ) 1 so we certainly cannot have f (a) = .
5.8 Definition. In an F-vector space, a (R)-half-space is any set of the form
for some 0 6= f X 0 , R.
H = {x X : Re f (x) },
14
We observe that if X is normed, H is closed if and only if f X 0 is continuous (if is easy because continuous
maps pull back closed sets to closed sets why is only if true?).
5.9 Definition. In an F-vector space, given a non-empty S X , we let its convex hull be given by
X
co S = {1 x1 + . . . + n xn : i [0, 1],
i = 1, xi S, n N}.
We note that
co S =
{C : C S and C is convex},
i.e. co S is the smallest convex set containing S. If X is normed then we let co S = co S, denote the closed convex
hull. We remark that if C X is convex, then its closure C is convex.
5.10 Theorem (Closed Convex Hull Theorem). If X is a normed space and S X is nonempty, then
\
co S = {H : H is a closed half-space containing S}.
Proof. The collection of all closed half-spaces containing S is a collection of closed convex sets, which proves the
direction (arbitrary intersections of closed/convex sets remain closed/convex). It remains to prove . It will
/ H.
suffice to see that for any x0
/ co S that there is a closed half-space H such that co S H and x0
If x0
/ co S, then there is > 0 such that
(x0 + D(X )) co S = ,
but x0 + D(X ) is open and convex, while co S is convex so by the Separation Theorem, there is f X and R
such that
Re f (c) > Re f (x),
c co S, x x0 + D(X ).
In particular, the closed half-space
H = {x X : Re f (x)}
contains co S, but misses x0 .
We now discuss the Banach-Steinhaus Theorem, the Open Mapping Theorem, and the Closed Graph Theorem.
These are traditionally proved as consequences of the Baire Category Theorem9 .
6.1 Theorem (Baire Category Theorem I). If (X, ) is a complete metric space and {Un }
n=1 is a collection
of dense open sets, then
\
Un is dense in X.
n=1
Proof. PM351.
6.2 Definition. Let (X, ) be a metric space.
A set F X is called nowhere dense if its closure F contains a neighbourhood of none of its points, i.e. if
x0 F , there is no > 0 such that
B (x0 ) = {x X : (x, x0 ) < } F .
In other words, F has empty interior. This is equivalent to saying that X \ F is dense in X.
A set M X is called meager (or 1st category) if
M=
Fn
n=1
where each Fn is nowhere dense, and non-meager (or 2nd category) otherwise.
9 In some cases, one can find non-Baire proofs, e.g. of Banach-Steinhaus via gliding hump. By the way, the Open Mapping Theorem
may be proved via Zabrekos Lemma: every countably subadditive seminorm on a Banach space is continuous (which follows from the
fact that every closed, convex, absorbing-about-0 subset in a Banach space contains a nbhd of 0, which is a special incarnation of Baire).
15
6.3 Theorem (Baire Category Theorem II). If (X, ) is a complete metric space, and 6= U X is
open, then U is non-meager.
Proof. If U was meager, i.e. we could write
U=
Fn
n=1
where each Fn is nowhere dense, then each Un = X \ F n would be open and dense, and hence
Un
n=1
x U,
T 1 (nB(Y))
T F
where T
Fn = U =
(Fn U ).
n=1
n=1
Since U is non-meager, Baire Category says that at least one Fn U has non-empty interior, so there is x0 X ,
> 0 so that
x0 + D(X ) Fn U Fn .
Now, for x D(X ), we can write x =
1
2
Tx =
1
[T (x0 + x) T (x0 x)],
2 | {z } | {z }
kkn
so kT xk
n
,
so kT k
n
,
T F
kkn
is meager in X .
6.6 Theorem (Open Mapping Theorem/Banach-Schauder Theorem). Let X , Y be Banach spaces
and T B(X , Y). Then if T is surjective, i.e. T (X ) = Y, then T is open, i.e. if U X is open, then T (U ) Y is
open.
6.7 Remark. We shall frequently use the following fact: if 6= A X, x X , 0 6= F, then x + A = x + A.
Indeed, ak a in X if and only if x + ak x + a in X .
6.8 Lemma (Main Lemma10 ). Suppose there is r > 0 that
T (D(X )) rD(Y)
then T (D(X )) rD(Y).
10 See
16
kzk
1
Proof of lemma. Let z rD(Y), and let > 0 be so that kzk < r(1 ) < r. Set y = 1
z, so kyk = 1
< r. We
1
1
will show that y 1 T (D(X )) i.e. there is x 1 D(X ) so T x = y. Then T ((1 )x) = z and so we will have
shown T (D(X )) rD(Y).
Put A = T (D(X )) rD(Y); it follows from our assumptions that A T (D(X )) rD(Y). We proceed inductively:
y rD(Y) A = y1 A (y + rD(Y))
y y1 + rD(Y) y1 + A = y2 (y1 + A) (y + 2 rD(Y))
y yn + n rD(Y) yn + n A = yn+1 (yn + n A) (y + n+1 rD(Y))
Thus, we found a sequence (yn )
n=1 Y. We note
yn+1 yn + n A
kyn+1 yk < n+1 r, kyn yk < n r = kyn+1 yn k < n+1 r + n r
yn+1 yn + n A = yn+1 yn n A
and A = T (D(X )) rD(Y) which yields
yn+1 yn n rD(Y), i.e. kyn+1 yn k < n r
yn+1 yn n T (D(X )) = T ( n D(X ))
i.e. x X , kxk < n s.t. yn+1 yn = T xn . Moreover, we similarly find x0 D(X ) such that y1 = T x0 . We also
note that yn+1 y + n+1 rD(Y) hence kyn+1 yk < n+1 r implies
lim yn = y.
Now
x=
xn ,
n=0
note that
kxk
kxn k <
n =
n=0
n=0
1
.
1
T xn = y1 +
n=0
(yn+1 yn ) = . . . = yN +
|{z}
n=1
by
kT x yk kyN yk +
X
n=N
(yn+1 yn ) = y
{z
}
|
kk< n r
n r 0
n=N
hence kT x yk = 0, as claimed.
Proof of Open Mapping Theorem. It suffices to show that T (D(X )) rD(Y) for some r > 0. Indeed, supposing
this, note that if U X is open and x U then x + D(X ) U for some > 0. Thus, U x D(X ). Hence
T (U x) T (D(X )) rD(Y),
so that
T x + rD(Y) T x + T (U x) = T (U ).
Thus, T (U ) is indeed open. Now, since T (X ) = Y we see that
Y=
nT (D(X ))
n=1
S
S
because nT (D(X )) = T ( nD(X )). Since Y is a Banach space, the Baire Category Theorem tells us that some
nT (D(X )) is not nowhere dense, hence there is y0 Y and > 0 such that
y0 + D(Y) nT (D(X )) = nT (D(X )).
17
1
[(y0 + y) (y0 y) ] nT (D(X ))
2 | {z } | {z }
y0 +D(Y)
so that
ny
n D(Y)
y0 +D(Y)
n D(Y)
T (D(X )).
6.9 Theorem (Inverse Mapping Theorem). If X , Y are Banach spaces and T B(X , Y) is bijective, then
T 1 B(Y, X ).
Proof. Since T L(X , Y) and bijective, we clearly have that T 1 L(Y, X ). Since T B(X , Y) and surjective,
the Open Mapping Theorem says there is r > 0 such that T (D(X )) rD(Y). Applying T 1 , we see that D(X )
rT 1 (D(Y)) so 1r D(X ) T 1 (D(Y)), i.e. kT 1 k 1r < , so that T 1 B(Y, X ).
6.10 Definition. Let X , Y be normed spaces and X Y denote their direct sum, i.e.
X Y = {(x, y) : x X , y Y}.
If 1 p < , let
k(x, y)kp = (kxkp + kykp )1/p ,
6.15 Remark (obvious projection). In general, if X Y is a subspace, let B be a basis for X . Then
there is B 0 Y so B B 0 is a basis for Y. Hence each y Y admits scalars {ye }eBB 0 such that ye 6= 0 for only
finitely many elements e, and
X
ye e.
y=
eBB 0
Define P : Y X by
Py =
ye e.
eB
Z, check
P2 = i P
| J{z }i P J = i P J = P
identity on X
6.17 Remark (Murray, 1940s). If 1 p (p 6= 2), then `p admits subspaces which are not boundedly
complemented11 .
6.18 Theorem. c 0 is not boundedly complemented in ` .
Proof. We suppose, for sake of contradiction, that P B(` ) exists with Im P = c 0 , P 2 = P . Let F P(N) be
an uncountable collection of infinite sets, such that for distinct sets F1 , F2 F, |F1 F2 | < . For each F F,
let yF = (I P )F . Note that c 0 = ker(I P ), and clearly F
/ c 0 , so yF 6= 0 for any F . Observe for distinct
F1 , . . . , Fm F and 1 , . . . , m F, we have
m
X
i=1
i Fi =
m
X
i Fi \Gi +
kk =
{z
max
i=1,...,m
where Gi =
i=1
m
X
|i |
{z
cc0
Fj . Hence
m
m
m
X
X
X
i yFi
=
(I P )
i Fi
kI P k
i Fi \Gi
= kI P k max |i |.
i=1,...,m
j6=i
i=1
i=1
i=1
1
n}
11 In fact, it was proved by J. Lindenstrauss and L. Tzafriri in 1971 (On the complemented subspaces problem, Israel J. Math. 9 (1971),
263-269; MR 43 #2474) that a Banach space is isomorphic to a Hilbert space if all of its closed subspaces are boundedly complemented!
19
where k ((xi )
i=1 ) = xk , so k ` with kk k = 1. Now, if F1 , . . . , Fm are distinct elements of Fn,k , then with
i = sgn k (yFi ), we have
m
m
m
X
X
X
m
i k (yFi ) =
|k (yFi )|
i yFi
kI P k
| {z }
n
i=1
i=1
i=1
1/n
Fn,k
n=1 k=1
7.1 Lemma. Given a finite-dimensional R-vector space, let {e1 , . . . , ed } be a basis, and let kxk1 =
Pd
x = i=1 xi ei . If B = Bkk1 (X ), then B is compact.
Pd
i=1
|xi | where
Proof. Let us accept the Bolzano-Weierstrass Theorem, any sequence in [1, 1] has a convergent subsequence. Let
(x(n) )
n=1 B. We will show that this has a convergent subsequence.
(nk(1) )
)k(1)=1
(n)
(x1 )
n=1 [1, 1] has a convergent subsequence (x1
(nk(1) )
)k(1)=1
(x2
(nk(2) )
)k(2)=1
.
..
(nk(d1) )
)k(d1)=1
(xd
(nk(d) )
)k(d)=1
kxk
d
X
kxi ei k =
i=1
d
X
i=1
d
X
fi
j ej = i
j=1
(1 , . . . , n F). Since X is finite-dimensional, with the norm inherited from Y, X 0 = X . We find Hahn-Banach
extensions F1 , . . . , Fd for f1 , . . . , fd respectively so each Fi Y . Then define P : Y Y by
Py =
d
X
Fi (y)ei .
i=1
21
d
X
i=1
We note that X , with norm from Y, is complete, hence closed. Alternatively, X = ker(I P ), which is closed.
7.5 Theorem. Let X be a normed space. Then B(X ) is compact if and only if X is finite-dimensional.
Proof. () Heine-Borel, above.
7.6 Lemma (Rieszs Lemma). If X is a normed space and Y is a proper closed subspace, then given > 0,
there is x0 B(X ) such that d(x0 , Y) 1 .
Proof. Given x X \ Y, we define f : Y + Fx F by f (y + x) = for y Y, F. We note that ker f = Y
which is closed so f is continuous. (Propn before Separation Theorem). Thus by Hahn-Banach Theorem, there is
an extension F X of f . Find x0 B(X ) such that |F (x0 )| (1 )kF k. We observe for any y in Y
1
|F (x0 )|
|F (x0 y)| =
1 .
kF k
kF k
kx0 yk
Proof of () of theorem. We show that if dim X 6< , then B(X ) is not compact. We perform an induction; given
0 < < 1, find
x1 S(X ), i.e. kx1 k = 1.
x2 B(X ) such that d(x2 , Fx1 ) 1 .
x3 B(X ) such that d(x3 , span{x1 , x2 }) 1 .
.
..
xn B(X ) such that d(xn , span{x1 , . . . , xn1 }) 1 .
Note, at each stage, span{x1 , . . . , xn } is finite-dimensional, and hence closed. Moreover, dim X 6< , so these
finite-dimensional subspaces are proper. We note if n > m then
kxn xm k d(xn , span{x1 , . . . , xn1 }) 1
and we see that (xn )
n=1 B(X ) admits no Cauchy subsequence.
8.1 Definition. Let X be a non-empty set, {(X , )}A be a collection of topological spaces, and {f : X
X }A be a family of functions. We define the initial topology = (X, {f }A ) as follows:
U for each x U , there exist U1 1 , . . . , Un n (n N) such that x
n
\
f1
(Ui ) U.
i
i=1
n
\
)
f1
(Ui )
i
: n N, each i A, each Ui i
i=1
form a base for , i.e. any U in is the union of such sets. We might call the sets {f1 (U ) : A, U } a
subbase, i.e. finite intersections of such sets form a base.
8.2 Remark. Each f : (X, ) (X , ) is - -continuous (A1Q1). In fact, is the coarsest topology on
X which allows each f to be continuous, i.e. if P(X) is any topology for which each f : X X is
- -continuous, then . We remark that is trivially a topology: it is closed under finite intersections, and
arbitrary unions.
8.3 Example (Metric Topology). Let : X X [0, ) be a metric on X. For each x X, let
x (y) = (x, y), so {x : X R}xX is a family of functions on X (we equip R with its usual topology, of course).
Its not difficult to show that
= (X, {x }xX ).
22
8.4 Example (Product Topology). Let {(X , )}A be a collection of topological spaces. Let
Y
X=
X = {x = (x )A : x X for all A}
A
denote the Cartesian product. For each , denote by : X X the canonical projections, i.e. x = x .
Then we define the product topology on X by
= (X, { }A ).
8.5 Remark. Note the following:
(i) If A = N, then basic open neighbourhoods of any point in X =
n
\
nN
Xn , are given by
p1
i (Ui ) = U1 U2 . . . Un Xn+1 Xn+2 . . .
i=1
(ii) Consider the set RR (functions from R to R) with the product topology . Were effectively selecting finitely
many points t1 , . . . , tn . [DIAGRAM]
8.6 Example (Linear Topologies). Let X be a normed space and Z X . We may write Z = {f : X
F}f Z . The initial topology (X , Z) is called the linear topology on X from Z. Note that (X , Z) kk . In
particular:
The weak topology is defined as w = (X , X ) P(X ).
The weak* topology is defined as w = (X , X ) P(X ).
8.7 Remark. Recall that X X = {
x : x X }, where x
(f ) = f (x). What do subbasic w-open sets of X look
like? Fix f X \ {0}, x0 X . Then
f 1 (f (x0 ) + D) = {x X : f (x) f (x0 ) + D} = {x X : f (x) f (x0 ) D} = {x X : |f (x) f (x0 )| < }
or alternatively, f 1 (f (x0 ) + D) = {x X : |g(x) g(x0 )| < 1} where g = 1 f . Here,
(
(1, 1)
if F = R
D = D(F) =
{z : |z| 1} if F = C.
8.8 Example (Relative Topology). Let (Y, ) a topological space. For X Y , let : X , Y denote the
canonical injection/embedding map. The relative (or relativized) topology on X is denoted
|X = (X, {}).
Note U |X if and only if there exists V such that U = X V .
8.9 Definition. If (X, ) is a topological space, a set K X is -compact if given any open cover of K, i.e. any
family O such that
[
K
U
U O
n
[
Ui .
i=1
S = S = {F X : F S and F is -closed}.
Since unions of open sets are open, intersections of closed sets are closed. Thus S is the smallest -closed set
containing S.
8.11 Proposition. S = {x X : for every U with x U , we have U S 6= }.
23
n
[
(X \ Fi ) = X \
i=1
n
\
Fi =
i=1
n
\
Fi = =
i=1
n
\
i=1
Fi
n
\
Fi =
i=1
(X \ Ui ) 6= .
i=1
(X \ U ) 6= ,
U O
but then
i=1
| {z }
6=
24
1
p
(V
)
U.
Hence
x
U
for
each
U
U
and
thus
x
i
i=1 i
U U U
F F F so
F F F 6= . Hence
(X, ) is compact.
Let F0 = {N \ E : E P(N) is finite}. Observe that F0 certainly has the Finite Intersection Property. Let U be
any ultrafilter containing F0 . Define U : P(N) {0, 1} R by
(
1 if A U
U (A) =
0 if A
/ U.
We note that U () = 0. Also, if A, B P(N) and A B = then at most one of A or B is in U, and hence
U (A) + U (B) = U (A B) (using the property that for E P(N), either E U or N \ E U). If we have a
partition
n
G
N=
Ei ,
i=1
then exactly one Ei U and the others are not. It follows that the variation V (U ) = 1. Hence U F A(N); see
A1Q4. By A1Q4, there is LU ` such that LU (S ) = U (S). We note the following facts (the proofs are similar
to A2Q4):
(i) LU (1) = 1, where 1 denotes the all-ones sequence, and kLU k = V (U ) = 1.
(ii) LU |c 0 = 0, LU (x) 0 if xn 0 for all n, and
lim inf xn LU (x) lim sup xn
n
x
i (Ui ) =
(
x1
(U
))
=
x1
(Ui ) (X )
i
i
i
i=1
i=1
{z
i=1
basic set in w
We prove, now, that (B(X )) is -closed in FX . Suppose g = (g(x))xX (B(X )) . Now for x, y in X , F
and for each n in N we have
1
6= (B(X )) x1 (g(x) + n1 D) y1 (g(y) + n1 D) x+y
(g(x + y) + n1 D)
26
1
1
||
2 + ||
+ +
=
n n
n
n
(0 = fn (x + y) (fn (x) + fn (y))) which holds for all n, so g, i.e. x 7 g(x), is linear. Likewise, we see that for
each x in X we find fn (B(X ))
|g(x) fn (x)| <
1
1
1
= |g(x)| |fn (x)| + kxk +
n
n
n
10
Nets
These are like sequences in some ways, but unlike sequences in other ways.
10.1 Definition. We make the following definitions.
A directed set is a pair (N, ) where is
a preorder on N (reflexive and transitive, but not necessarily antisymmetric: we allow 0 , 0
without = 0 ).
a cofinal ordering, i.e. 1 , 2 N , there exists 3 such that 1 3 , 2 3 .
Let X 6= . A net is a map ( 7 x ) : N X where (N, ) is a directed set.
If (M, ) is another directed set, a map : M N is directed if
0 in M implies () (0 ) in N .
is cofinal, i.e. for any in N , then there is M such that ().
If (x )N X is a net, and : M N is a directed map, then (x() )M is called a subnet of (x )N .
We often write = () and hence (x )M .
If (x )N X is a net and A X, we say that (x )N is
eventually in A if there is A N such that x A whenever A .
frequently in A if for any in N , there is 0 such that x 0 A.
10.2 Example. We have:
(i) Sequences are nets.
(ii) R, [0, ), [a, b] are directed sets via usual . So any map [a, b] X is a net.
(iii) (Riemann sums) Fix a compact interval [a, b] R. Let
N = {(s0 , s1 , . . . , sn ; t1 , . . . , tn ) : a = s0 < s1 < . . . < sn = b, ti [si1 , si ], i = 1, . . . , n, n N}.
27
We say that (s0 , . . . , sn ; t1 , . . . , tn ) (s00 , . . . , s0m ; t01 , . . . , t0m ) iff {s0 , . . . , sn } {s00 , . . . , s0m }. Inspect that this
is a cofinal preordering. If f : [a, b] F is a function, a Riemann sum is given for = (s0 , . . . , sn ; t1 , . . . , tn )
by
n
X
S (f ) =
f (ti )(si si1 ).
i=1
(iv) Let X 6= . A family of sets F P(X) is filtering if for any F1 , F2 F, there is 6= F3 F such that
F1 F2 F3 . Now let N = {(x, F ) : x F F }. We let (x, F ) (x0 , F 0 ) iff F F 0 . This is a cofinal
preordering. We let (x)(x,F )N X and this is a net.
10.3 Definition. Let X be a non-empty set, (x )N a net on X. We say that (x )N is an ultranet if for any
A X, either (x )N is eventually in A, or is eventually in X \ A.
10.4 Remark. In general, if (x )N is a net in X and A X, then either (x )N is frequently in A or is
eventually in X \ A.
10.5 Proposition. If (x )N is a net in X and A X is such that (x )N is frequently in A then there is a
subnet (x )M which is eventually in A. Moreover, this subnet can be arranged to be an ultranet.
Proof. Let = {F P(X) : A F, F is filtering, and for all F in F, (x )N is frequently in F }. Recall: filtering, F1 , F2 F, then there is 6= F3 such that F3 F1 F2 . Observe that {A} , if F = A {x 0 }N, 0
then {F }N , i.e. if 1 , 2 N , 3 1 , 2 and F3 = F1 F2 and since (x )N is frequently in A, there is
3 such that x A, so F3 6= .
Given F we let
M = {(, F ) : x F F}
0
We preorder M , (, F ) ( , F ) iff 0 in N , F F 0 . [We let ,F = . ()] We remark that this preordering is cofinal: if (1 , F1 ), (2 , F2 ) M , find 6= F3 F1 F2 and 3 1 , 2 such that x3 F3 , so
(3 , F3 ) (1 , F1 ), (2 , F2 ). [] It is obvious that (, F ) 7 is a directed map. Thus (x )(,F )M is a subnet of
(x )N . Moreover, (x )(,F )M is eventually in A. Indeed if (0 , F0 ) M such that F0 A (and F0 F), which
exists by assumption on , then for (, F ) (0 , F0 ) we have x0 F F0 A, so x , (, F ) (0 , F0 ) in M is in
A.
Now, if F , the Ultrafilter Lemma provides an ultrafilter U F. We observe for U U, that U F 6= for each
F in F, hence U (check!). Then (x )(,U )M (where M is as above, U replacing F) is an ultranet (check!).
10.6 Definition. Suppose (X, ) is a topological space, and (x )N is a net in X. If x0 X, we say that
x0 is a limit point for (x )N if each neighbourhood x0 U has that (x )N is eventually in U .
x0 is a cluster point for (x )N if each neighbourhood x0 U has that (x )N is frequently in U .
In the first case we write x0 = -lim x . We may abuse notation and write -lim rather than -lim if is a metric.
N
10.7 Proposition. If (x )N is a net in a topological space (X, ) and x0 in X is a -cluster point, then there is
a subnet (x )M such that x0 = -limM x .
Proof. We let Ox0 = {U : x0 U }. Then Ox0 is a filtering family of sets, and for U in Ox0 , (x )N is
frequently in U . Consider the subnet (x )M of the prior proposition. Check that for U in Ox0 , (x )M is
eventually in U .
10.8 Remark. By A3Q5, f : (X, ) (Y, ) is continuous if and only if it preserves net convergence.
(i) If , P(X) are topologies, then iff id : (X, ) (X, ) is continuous, iff -limN x = x0 whenever
-limN x = x0 .
(ii) In FX , the product topology is the topology of pointwise convergence. If (f (x))xX FX , (f )N FX is
a net, then f0 = -limN f iff for each x in X, f0 (x) = limN f (x) in F (F has metric topology). Check!
Q
(iii) X = A X , each (X , ) a topological space, then a net (x )N converges to x(0) in (product topology)
()
(0)
if and only if -limN x = x .
10.9 Theorem (Metrisation Theorem). Let X be a separable normed space. Then (B(X ), w ) is metrisable.
Proof. Since X is separable, B(X ) contains a dense countable set {xn }
n=1 (from this, we can extract a linearly
0
independent, countable set {x0n }
such
that
span{x
}
is
dense
in
X
. We can use x0n s in place of elements xn
n n=1
n=1
28
X
|f (xn ) g(xn )|
2n
n=1
X
k=n0 +1
2
< ,
k
2
2
f A, g B.
f A, g B.
We know B is w -open, i.e. B (X , X ). So for any f0 B, then there are x1 , . . . , xn X such that
f0 U =
n
\
i=1
Tn
Let Y = i=1 ker xbi , then xbi (Y + f0 ) = xbi (f0 ) = f0 (xi ) f0 (xi ) + D. So Y + f0 U B. For any f Y ,
Re F (f + f0 ) > since f + f0 B. Hence Re F (f ) > Re F (f0 ), which means F |Y = 0 (why?), so Y ker F .
Together with the next lemma will prove the result; the lemma will conclude that F span{c
x1 , . . . , x
cn }.
Tn
0
10.12 Lemma. In an F -vector space X, if f0 , . . . , fn X such that ker f0 i=1 ker fi , then f0 span{f1 , . . . , fn }.
Proof. Let T : X Fn be given by x 7 (f1 (x), . . . , fn (x)). Then
ker T =
n
\
ker fi .
i=1
Hence,
f0 =
n
X
i=1
i fi
dense) in B(X ).
w
Proof. Let A = B(X ) . Since B(X ) is w -closed (by Alaoglu) A B(X ). Let F0 X \ A, noting that
X \ A is w -open. Then there is w -open basic neighbourhood U of F0 , U X \ A but this means U A = .
Such U can be chosen to be convex i.e.
n
\
1
U=
fbi (F0 (fi ) + D)
i=1
for some f1 , . . . , fn X . Hence, by the w -Separation Theorem, there are f X and R such that
Re F (f ) > Re G(f )
F U, G A.
1
f.
Then
F U, G A.
(1)
for x B(X )
(2)
Hence
Re F0 (f 0 ) > 1 Re x
(f 0 )
now (2) implies k Re f 0 k 1, so kf 0 k 1. Hence (1) implies |F0 (f 0 )| > 1, thus kF0 k > 1. Thus F0
/ B(X ).
Therefore B(X ) = A.
10.15 Definition. A normed space X is called reflexive if X = X .
Since dual spaces are always complete, a reflexive normed space is Banach.
10.16 Theorem. Given a Banach space X , the following are equivalent:
1. X is reflexive.
2. B(X ) is weakly compact.
3. (X , X ) = (X , X ) i.e. weak and weak* topologies concide on X .
4. X is reflexive.
X normed is reflexive if X = X . We saw that any reflexive X is Banach.
Proof. Consider the map f : (B(X ), (X , X )) (B(X ), (X , X )) given by x 7 x
. This is a homeomorphism.
X (X ) = X
w-closed
B(X )
| {z }
= B(Y ) is w-compact
w-compact
Therefore, Y is reflexive.
11
11.1 Definition. Let X be a normed space and C X be a convex set. A face of C is a non-empty convex subset
F C, such that if x, y C, t [0, 1] and (1 t)x + ty F then both x, y F . Furthermore, a face E of C is
called an extreme point if E is a singleton. For example [DIAGRAM].
11.2 Theorem (Krein-Milman Theorem). Let X be a normed space and C X is w -compact and
Proof of Krein-Milman Theorem. C w -closed, convex implies cow ext(C) C. First we will prove that ext(C) 6=
. We take
F = {F C : F is a w -closed face of C}.
C F. Then (F, ) is a partially ordered set. Let C be a chain in F. Define
\
FC =
F
F C
and noting each F is w -closed we have that the intersection above is too. Furthermore, FC is a face: suppose
x, y C, t [0, 1] such that
(1 t)x + ty FC = (1 t)x + ty F for any F C
so indeed x, y F for any F C. So x, y FC . Hence FC F, so it is an upper bound with respect to , for C.
By Zorns lemma, there is a maximal element of F which we call E. E is w -compact, so for x X
Re x
(E) R
| {z }
compact
Hence Re x
(E) has a minimum value, say mx . Let Ex = {f E : Re f (x) = Re x
(f ) = mx } 6= , w -closed. And
Ex is a face (exercise). Hence Ex E but E was minimal so Ex = E. Now if f, g E, then for any x X
Re f (x) = mx = Re g(x) = f = g
Thus E = {f } is a singleton and f ext(C). Finally, we will prove that C cow ext(C). Let f0 X \cow ext(C).
We want to show f0
/ C.
C cow ext C. We proved that ext C 6= . We need to prove C cow ext C. Let f0 X \ cow ext C. Note
that cow ext C is convex, and X \ cow ext C is w -open. By the w -separation theorem, there is x X, R
such that Re f0 (x) < Re f (x) for any f ext C. C is w -compact, so mx = min Re x
(C) exists. Let
Cx = {f C : Re f (x) = mx }
noting that this is w -closed. Also, Cx is a face. By the first part of this proof applied to Cx , we conclude Cx admits
an extreme point, say f . Then f is also an extreme point of C. So Re f (x) = mx . Hence Re f0 (x) < mx thus
f0
/ C. Thus, C = cow ext C.
31
11.3 Corollary. If X is a normed space, and C X is a w-compact convex set, then C = cow ext C.
(X,
X
c )) (X , (X , X
c )) is a homeomorphism onto its range.
Proof. The embedding (X, (X, X )) (X,
w
C = cow ext C = co\
ext C.
1
f = f = f
/ ext B(X).
kf kp
| {z }
D(X)
/
Thus ext B(X) S(X). Conversely, f S(X) i.e. kf kp = 1, then {f } is a face: let f = (1 t)x + ty for some
x, y B(X) and t [0, 1]. Note
1 = kf kp = k(1 t)x + tykp (1 t) kxkp +t kykp = kxkp = kykp = 1.
| {z } |{z}
1
So, kf kp = (1 t)kxkp + tkykp . By the equality case of Minkowski we know that there is 0 with (1 t)x = ty.
Taking norms, (1 t) kxkp = t kykp . Hence (1 t) = t, so that x = y. Thus f = (1 t)x + tx = x = y. So
|{z}
| {z }
1
c0 ) and 21 (y + z) = x.
Then y = (yn )
n=1 , z = (zn )n=1 B(c
Can c 0 be a dual space? That is, is there a normed space X with X = c 0 ? Assume yes. So X = c 0 . Consider
We now prove that the extreme points of the set P (X) of probability measures on X is just the set of Dirac measures
x ; integration of a function f against such a x merely results in evaluation at x, that is, x (f ) = f (x). It is just
as nontrivial if we choose (X, ) to just be [0, 1] with the usual topology.
11.9 Theorem. For (X, ) a compact Hausdorff space, define
P (X) = { C R (X, ) : kk 1, (1) = 1}.
Then ext P (X) = {x : x X}, where for each x we define x (f ) = f (x).
32
Proof. Since we use symbols f, g for elements of C = C R (X, ) we shall use , , for elements of C . Observe that
1 ({1}) is w -compact (B(C ) is w -compact by Banach-Alaoglu, and 1
1 ({1}) is w -closed).
P (X) = B(C ) 1
Also, P (X) is evidently convex, so it admits extreme points by the Krein-Milman Theorem.
If P (X), we note for 0 f 1 in C, then 0 1 f 1 and hence |(f )| 1 and |1 (f )| = |(1 f )| 1
since kf k , k1 f k 1. Thus 0 (f ) 1. From this it follows that if g 0, then (g) 0.
Now, if g C, let g + = max{g, 0}, g = max{g, 0} so g = g + g and |g| = g + + g . If we fix 0 f 1 in C
and in P (X) we let (g) = (f g), then
0
z }| { z }| {
|(g)| = |(g + g )| = | (f g + ) (f g ) | (f g + ) + (f g ) = (f (g + + g ))
|{z}
|{z}
0
(*)
noting that f |g| f kgk . Now, fix ext P (X). Find 0 f 1 in C such that 0 < (f ) < 1 (since kk = 1, this
1
is always possible, exercise). Let (g) = (f
) (f g) for g in C. Then using (*),
|(g)| =
1
1
|(f g)|
(f )kgk = kgk
(f )
(f )
1
1
so kk 1. Also (1) = (f
) (f 1) = 1. Hence P (X). Similarly, if we put (g) = 1(f ) ((1 f )g), then
noting that 1 (f ) = (1 f ), we see that defines an element of P (X). We observe (f ) + (1 (f )) = .
1
In particular = , i.e for g in C, (g) = (f
) (f g) = (g) so (f g) = (f )(g). Since span{f : 0 f 1} = C, it
follows that (f g) = (f )(g) for f, g C. Now, suppose for each x in X Sthere is fx C such that fx (x) 6= 0 but
x Ux , hence X = xX Ux . Since X is compact, there are
fx ker . Let Ux = fx1 (R \ {0})
Sn so Ux is open andP
n
x1 , . . . , xn in X such that X = i=1 Uxi . Thus f := i=1 fx2i > 0 on X so 1/f C. Then
1 = (1) =
( f1 f )
( f1 )(f )
( f1 )
n
X
(fxi )2 = 0.
i=1
This is absurd. Hence there is some x in X such that x (f ) = f (x) = 0 whenever (f ) = 0, ker x ker . However,
by the lemma following w -Separation Theorem, this implies x = c for some c R. However 1 = x (1) = c(1) =
c, so x = . Thus ext P (X) {x }xX .
It remains to show that each x is an extreme point of P (X). If x = (1 t) + t for 0 < t < 1 and , P (X),
then computations similar to but much simpler than (*) show that for f in C,
t|(f )| t(|f |) tx (|f |) = t|f (x)|.
Hence if f ker x , then f ker so ker x ker and, as above, x = . Similarly x = , and x ext P (X).
11.10 Remark. w -closed convex hull of extreme points of a w -closed convex set in the dual space is that set
itself (this is the K-M theorem). Its very tricky to exhibit an example where it is not, a fortiori, the norm-closure.
11.11 Exercise. Prove (only the second equality needs proof) that
(
)
X
X
co ext P (X) = co{x }xX =
i xi : xi X, i 0 for all i and
i = 1 .
i=1
i=1
11.12 Exercise. Let (X, ) = ([0, 1], usual top.). Let m P ([0, 1]) be given by the Riemann integral (i.e. m is
integration against the usual Lebesgue measure, which reduces to Riemann integration because all f are continuous):
Z 1
m(f ) =
f.
0
Then m
/ co{t }t[0,1] (otherwise, we would have covered this in MATH 147).
11.13 Remark. Observe, however, that m cow {t }t[0,1] . We use Riemann sums; let
N = {(s0 , . . . , sn , t1 , . . . , tn ) : 0 = s0 < s1 < . . . < sn = 1, and ti [si1 , si ] for i = 1, . . . , n, and n N}.
Pre-order N by declaring that
= (s0 , . . . , sn , t1 , . . . , tn ) (s00 , . . . , s0n0 , t01 , . . . , t0n0 ) = 0 {s0 , . . . , sn } {s00 , . . . , s0n0 }.
33
If is as above,
S (f ) =
n
n
X
X
(si si1 )f (ti ) =
(si si1 )ti (f ) co{t }t[0,1] .
i=1
i=1
i.e.
m = w* -lim
N
n
X
(si si1 )ti .
i=1
12
z }| {
0 p(x y) = [x y, x y] = [x, x] [y, x] [x, y] + ||[y, y] = p(x)2 2 Re [x, y] + p(y)2
| {z }
2
|[x,y]|
34
p(x)2 +p(y)2
.
2
(iii) Note
p(x + y)2 = p(x)2 + 2 Re[x, y] + p(y)2
2
(as above)
(**)
P3
k=0
ik = 0,
P3
k=0
i2k = 0.
12.6 Definition. Suppose (, ) is an inner product with associated norm k k. We call a set S X orthogonal if
x 6= y S we have (x, y) = 0; write x y (read as is perpendicular to).
n
n
X
2 X
xi
=
kxi k2 .
12.7 Proposition (Pythagoras Law). If {x1 , . . . , xn } X is orthogonal, then
i=1
i=1
F F
35
F F
a < for a = (a )
Fn = { : a 6= 0}.
n=1
If
x y =
x y
x y
[
(x(n) )
x(n)
n=1 `2
n=1
and the countable union of countable sets is countable, from which it follows that `2 () is complete.
12.11 Proposition. Let (X , (, )) be an Euclidean space. For each x in X , the functional fx : X F given by
fx (y) = (y, x) is in X and kfx k = kxk.
Proof. fx X 0 by the properties of (, ). Also kfx k kxk is a consequence of the Cauchy-Schwarz inequality.
1
1
1
x) = ( kxk
x, x) = kxk
(x, x) = kxk, so kfx k kxk.
Observe if x = 0 then fx = 0; if x 6= 0, then fx ( kxk
12.12 Remark (Notation). Let X be an Euclidean space and 6= S X . We define
S = {y X : (y, x) = 0 for all x in S},
T
pronounced S-perp. Observe that S = xS ker fx , and each of these kernels is a closed subspace, so this is
itself a closed subspace of X .
12.13 Theorem (Complementation Theorem). Let X be an Euclidean space and Y X be a complete
subspace (e.g. X is Hilbert, Y closed; or in general Y is finite dimensional). Then for x in X then there is a unique
decomposition x = xY + xY such that xY Y, xY Y . Moreover, the map P : X X given by P x = xY is
linear, and Im P = Y, P 2 = P and if Y 6= {0}, kP k = 1.
Proof. First, let us find xY . Let d = dist(x, Y) = inf{y Y : kxyk}. Let for each n, yn Y be so kxyn k < d+ n1 .
The parallelogram law for n, m N gives
k(x yn ) (x ym )k2 + k(x yn ) + (x ym )k2 = 2kx yn k2 + 2kx ym k2
so
kym yn k2 = 2kx yn k2 + 2kx ym k2 k2x (yn + ym )k2
= 2kx yn k2 + 2kx ym k2 4kx 21 (yn + ym ) k2
|
{z
}
Y
{z
}
|
< 2(d +
1 2
n)
+ 2(d +
1 2
m)
4d2
2 n,m
+
4d 0 .
So (yn )
n=1 is Cauchy in Y. By completeness, xY = limn yn . Now, let xY = x xY . Observe that kxY k = d.
Suppose there were y in Y such that (xY , y) 6= 0. We let z = (xY , y)y, so z Y, (xY , z) = |(xY , y)|2 > 0.
Then for any > 0 we have
kx (xY + z )k2 = kxY zk2 = (xY z, xY z) = kxY k2 2 Re(xY , z) + 2 kzk2
| {z }
Y
2(xY , z)
kzk2
We first note, by Pythagoras law, kxk2 = kxyk2 + kxY k2 so kxY k kxk. If x, x0 are in Y and F, then
(x + x0 )Y + (x + x0 )Y = xY + xY + (x0Y + x0Y )
= xY + x0Y Y + xY + x0Y
| {z }
|
{z
}
Y
{x}
|{z}
span S {x} .
Now for any closed, hence complete, subspace Y we have Y 2 Y = H = (Y ) 2 Y . We have that Y (Y ) ,
and hence Y = (Y ) . We apply this to Y = span S.
12.16 Theorem (Riesz Representation Theorem). Let H be a Hilbert space, and f H . Then there
is a unique vector x0 H such that f = fx0 , i.e. f (x) = (x, x0 ) for x H, kfx0 k = kx0 k.
Proof. Say f 6= 0. We first note that (ker f ) is one-dimensional. Indeed, if x1 , x2 (ker f ) \ {0} then f (x1 ) 6=
0 6= f (x2 ), and
f (x2 )x1 f (x1 )x2 ker f (ker f ) = {0},
so {x1 , x2 } cannot be linearly independent. Hence (ker f ) = span{x1 }, kx1 k = 1. Let x0 = f (x1 )x1 . Let P = Pker f
be the orthogonal projection onto ker f . Now, if x H,
x = P x + x1 for some F.
|{z}
(IP )x
Thus
(x, x0 ) = (
Px
|{z}
ker f ={x1
+x1 , f (x1 )x1 ) = (x1 , f (x1 )x1 ) = f (x1 ) (x1 , x1 ) = f (x1 ) = f ( |{z}
P x +x1 ) = f (x).
| {z }
1
ker f
So we see indeed that f = fx0 . If (x, x0 ) = (x, y0 ) for all x in H, then with x = x0 y0 , we see (x0 y0 , x0 y0 ) = 0
so x0 = y0 .
Final exam date is Dec. 12, 12:30-3pm (see website). Possible talk topics posted soon.
If F = C, x 7 fx is conjugate linear, i.e. fx+x0 = fx + fx0 (passing scalars at cost of conjugation). Generally, we
where H
denotes the conjugate space. We have H = H
as sets, but scalar multiplication in H
is
write H
=H
governed by the rule x = x where the right-hand side is scalar multiplication in H, and (x, y)H = (y, x)H .
12.17 Remark (on the completion of an Euclidean space). We have:
kk
let (xn )
n , G = limn yn . We observe for n, m in N
n=1 , (yn )n=1 X be so F = limn x
|(xn , yn ) (xm , ym )| |(xn , yn ) (xn , ym )| + |(xn , ym ) (xm , ym )| kxn kkyn ym k + kxn xm kkym k
just using Cauchy-Schwarz and biadditivity of the form giving the inner product. So it is easy to see that
0
((xn , yn ))
n=1 F is Cauchy, and hence converges to something, call it (F, G). We observe if (xn )n=1 ,
0
0
0
(yn )n=1 X , F = limn x
n , G = limn yn , then the same technique as above shows that limn (xn , yn ) =
limn (x0n , yn0 ). Hence (F, G) is well-defined and may be shown to define an inner product on H for which
kF k = (F, F )1/2 .
(ii) Now, if f X , consider f X , so f|H H , so f|H = fG0 . Hence for x in X , f (x) = (
x, G0 ). It follows
H
=
H.
that H = X , i.e. X
H
so
X
H
=
=
=
37
12.18 Theorem (Gram-Schmidt Orthogonalisation). Let {x1 , x2 , x3 , . . .} be a linearly independent sequence in an Euclidean space X . Then there exists an orthogonal sequence {e1 , e2 , . . .} such that span{e1 , . . . , en } =
span{x1 , . . . , xn }.
Proof. Let En = span{x1 , . . . , xn } and Pn denote the orthogonal projection from X to En (complementation theorem). We define
e1 = x1
e 2 = x 2 P1 x 2
e3 = x3 P2 x3
..
.
en = xn Pn1 xn .
We observe, inductively, then en En . Hence span{x1 , . . . , xn } = span{e1 , . . . , en } holds. Moreover, each en
En1 , and it follows that {e1 , . . . , en } is orthogonal, hence {e1 , e2 , . . .} is orthogonal.
12.19 Theorem. We have:
(i) If X is a separable Euclidean space, then X contains an orthogonal sequence, E = {e1 , e2 , . . .} such that
span E = X .
(ii) If H is any Hilbert space, there is an orthogonal set E H such that span E = H.
Proof. We have:
(i) If {xn }
n=1 is a dense subset of X , then span{xn }n=1 = X . Now let
n1 = min{k N : xk 6= 0}
n2 = min{k N : xk
/ span{xn1 }}
..
.
nm = min{k N : xk
/ span{xn1 , . . . , xnm1 }}
sequence (em )m=1 . Observing that span{xn1 , . . . , xnm } = span{e1 , . . . , em }, we see that span{em }
m=1 =
span{xnm }
.
m=1
(ii) Let O = {S SH : O
/ S, S is orthogonal}. Partially order O by inclusion. If C O is a chain, it is easy
to verify that SC S is an orthgonal set, not including O. Hence by Zorns Lemma, there is a maximal
orthogonal set E. If we had span E ( H, we let PE denote the orthogonal projection onto span E. Then if
x H \ span E, we would see that 0 6= (I PE )x E, contradicting maximality.
12.20 Definition. A set E in an Euclidean space is orthonormal if for e, e0 in E,
(
1 if e = e0
0
(e, e ) =
0 if e 6= e0 .
12.21 Remark. If {e1 , . . . , en } is an orthonormal set, let E = span{e1 , . . . , en }. If x PX , we have orthogonal
n
decomposition x = PE x + (I PE )x, where (I PE )x E. Moreover PE x E, so PE x = i=1 i ei . Then for each
i,
n
X
(x, ei ) =
i ei + (I PE x), ei
| {z }
i=1
to E
Pn
Proof. We have:
(i) If F E is finite then
X
eF
(ii) () We let xn =
Pn
i=1
eF
i=m+1
i=m+1
So (xn )
n=1 X is Cauchy, hence converges.
(n)
() Let (x(n) )
}n=1 is separable. Hence, by the
n=1 be a Cauchy sequence in X . Then E = span{x
last theorem, there is an orthogonal (hence we can make it orthonormal) sequence {en }
n=1 such that E =
P
(n)
(n)
2
(n) 2
(n)
span{en }
.
From
(i)
above,
we
have
that
|(x
,
e
)|
kx
k
so
if
=
(x
, ei ), then (n) =
i
n=1
i
i=1
(n)
(i )
i=1 `2 . We observe for n, m in N that
i=1
(n)
so (P
)n=1 `2 is Cauchy, and hence converges to some in `2 , as `2
= `2 and is complete. Thus
(n)
x = i=1 i ei X , by assumption. Given > 0, find n0 such that k k2 < 2 for n n0 . Then, for
P
P
(n)
fixed n n0 , find N such that i=N +1 |i |2 < (/3)2 , i=N +1 |i |2 < (/3)2 . Then
N
X
X
X
(n)
(n)
kx x(n) k
(i i )ei
+
i ei
+
i ei
i=1
i=N +1
i=N +1
m
m
X
X
(n)
(n)
i ei
+ + = .
k k2 + lim
i ei
+ lim
m
m
3 3 3
i=N +1
i=N +1
{z
}
|
P
(
m
i=N +1
|i |2 )1/2
12.23 Theorem (Orthonormal Basis Theorem). Let H be a Hilbert space and E H be an orthonormal
set, with E = span E and PE = PE be the orthogonal projection on E. Then for x, y in H we have:
X
X
(i)
(x, e)e = lim
(x, e)e = PE x, where F = {F E : F finite} directed by F F 0 iff F F 0 .
F F
eE
(ii)
eF
eE
eE
(ii)
eE
(iii) (x, y) =
eE
Then span E = H.
39
Proof. We have:
P
(i) It follows from Bessels inequality (from the theorem above) that PeE |(x, e)|2 Pkxk2 , so Ex = {e E :
(x, e) 6= 0} is countable, hence a sequence, so by Riesz-Fischer x0 = eE (x, e)e = eEx (x, e)e converges in
H. Also, x0 E. Now if e0 E, then
!
X
X
0
0
0
(x, e)(e, e0 )
(x x , e) = (x, e )
(x, e)e, e = (x, e0 )
eEx
eEx
by the continuity of y 7 (y, e0 ), but this is just equal to (x, e0 ) (x, e0 ) = 0. It follows that x x0 E, hence
x x0 E. Also
(x x0 , x0 ) = (x, x0 ) (x0 , x0 )
!
X
= x,
(x, e)e
!
X
eE
(x, e)e,
eE
(x, e )e
e0 E
!
=
(x, e)(x, e)
eE
(x, e) e,
eE
X X
eE
(x, e )e
()
e0 E
eE
|(x, e)|2
e0 E
eE
(x,e)
F F
Now, if (i) holds, then H span E H. If (ii) holds then x = PE x + (I PE )x is an orthogonal decomposition
by which kxk2 = kPE xk2 so k(I PE )xk2 = 0, thus PE = I which gives (i). If (iii) holds, let x = y and obtain
(ii).
12.24 Corollary. If H is a Hilbert space and E H is an orthonormal set for which span E = H, then the map
U : H `2 (E) given by
U x = ((x, e))eE
defines a surjective linear isometry such that (U x, U y) = (x, y).
Proof. That (U x, U y)`2 = (x, y)H is Parsevals identity and hence kU xk2 = (U x, U x)`2 = (x, x)H = kxk2 , so U is
an isometry. Surjectivity of U is a consequence of the Riesz-Fischer Theorem.
Talk topics have been posted.
13
Spectral theory
k=0
T k = (I T )1 .
k=0
1
kS 1 k ,
k=m+1
So, (Sn )
n=1 is Cauchy, hence convergent. Now,
(I T )Sn =
n
X
(T k T k+1 ) = I T n+1 I.
k=0
Similarly, Sn (I T ) I. So
k=0
T k = (I T )1 .
(ii) Observe kS 1
T )k kS 1 kkS T k < 1 so T = (S (S PT )) = S(I S 1 (S T )) G(X ) from (i),
P(S
1
with T = k=0 (S 1 (S T ))k S 1 . Therefore, T 1 S 1 = k=1 (S 1 (S T ))k S 1 , so
kT 1 S 1 k
X
k=1
kS 1 kkS T k
k
(kS 1 kkS T k) kS 1 k =
kS 1 k,
{z
}
|
1 kS 1 kkS T k
<1
1
kS 1 k D(B(X ))
13.4 Definition. If X is a complex Banach space and T B(X ), the resolvent set of T is given by
(T ) = { C : (I T )1 B(X )} = C \ (T ).
If B is a complex Banach space and U C is open we call F : U B holomorphic if for each z0 in U ,
F 0 (z0 ) = lim
zz0
F (z) F (z0 )
z z0
exists.
41
G(X )
1
for |z| > kT k.
|z| kT k
Proof. We have:
(i) Let F : C B(X ), F (z) = zI T . Then F is continuous, and (T ) = F 1 (G(X )) is open.
(ii) If z, z0 (T ),
R(z) R(z0 ) = (zI T )1 (z0 I T )1 = (zI T )1 [(z0 I T ) (zI T )](z0 I T )1
= (z0 z)(zI T )1 (z0 I T )1
so that
R(z) R(z0 )
zz0
= (zI T )1 (z0 I T )1
((z0 I T )1 )2 .
z z0
(iii) If |z| > kT k, then k z1 T k < 1 so zI T = z(I z1 T ) is invertible. So (T ) {z C : |z| > kT k}. Also,
1
R(z) = (zI T )
so that
1
=
z
1
1X 1 k
1
=
T
I T
z
z
zk
k=0
1 X 1
1
kR(z)k
kT kk =
k
|z|
|z|
|z|
k=0
1
1
1
|z| kT k
1
.
|z| kT k
F (z) F (z0 )
lim
zz0
z z0
zC
Applying Liouvilles Theorem to f F we see for z, z 0 C that f (F (z)) = f (F (z 0 )). Now, applying this, for any
f X and appealing to the Hahn-Banach Theorem (that theres a sufficient quantity of linear functionals to
separate all points), then we see that F (z) = F (z 0 ) for all z, z 0 C.
13.10 Theorem. Let X be a C-Banach space and T B(X ). Then (T ) 6= .
42
Proof. Let R(z) = (zI T )1 denote the resolvent function, for z (T ). Suppose that (T ) = C. We observe
that
1
lim kR(z)k lim
= 0,
|z|
|z| |z| kT k
and that R, being holomorphic, is continuous, i.e. bounded on 2kT kB C. [diagram]. Hence R is bounded on all
of C, and is thus constant. Due to the limit above, this means that R(z) = 0 for all z. This contradicts that each
R(z) is invertible.
13.11 Theorem (Spectral Mapping Theorem). Let X be a C-Banach space. If T B(X ), and p(t) C[t]
(polynomials with complex coefficients) then
(p(T )) = p((T )) = {p() : (T )}.
Proof. We suppose p(t) 6= 0. If z0 C, write
n
Y
p(t) z0 =
(t zk )
k=1
where z1 , . . . , zn is the family of zeros of p(t) z0 (counting multiplicity) [Fundamental Theorem of Algebra]. Then
n
Y
p(T ) z0 I =
(T zk I).
k=1
Thus p(T ) z0 I is non-invertible, if and only if at least one T zk I is non-invertible (i.e. zk (T ) for some k).
Thus z0 (p(T )) if and only if p() z0 = 0 for some (T ).
13.12 Theorem (Spectral Radius Formula). Let X be a C-Banach space. Let T B(X ) and define the
spectral radius of T by r(T ) = max{|| : (T )}. Then
r(T ) = lim kT n k1/n .
n
Proof. Let us first see that limn kT n k1/n exists. Let (n) = log kT n k for n N. Observe that
kT n+m k kT n kkT m k = (n + m) (n) + (m).
Fix, for the moment, m and let n > m, so n = qn m + rn where qn = 0, 1, 2, . . ., rn = 0, 1, . . . , m 1 and then
(n)
(qn m + rn )
(qn m) + (rn )
qn (m) + (rn )
(m) (rn )
=
+
n
n
n
n
m
n
so lim sup
n
(m)
(n)
, hence
n
m
lim sup
n
so
limn (n)
n
(n)
(m)
(n)
inf
lim inf
n
mN m
n
n
Now, by the spectral mapping theorem, we have that r(T n ) = r(T )n . Also, we saw earlier that r(T n ) kT n k.
Thus
r(T ) kT n k1/n = r(T ) lim kT n k1/n .
n
We let f B(X ) and consider f R : (T ) C. Then f R has a Laurent series for all |z| > r(T ). We write
f R(z) =
X
fk
k=0
2k
(note we have omitted the polynomial part but we will prove we didnt need that anyway). We have for |z| > kT k,
R(z) =
X
k=1
43
1
T k,
2k+1
X
k=0
1
Tk
z k+1
converges for all |z| > r(T ). Now, if |z| < limn kT n k1/n , we can find > 0 such that
(1 + )|z| < lim kT n k1/n
n
and there is n0 such that (1 + )|z| < kT n k1/n for n n0 . But then (1 + )n <
1
n
|z|n kT k
X
1 k
T
zk
k=0
cannot converge, since the nth grows without bound. Thus we have that |z| > r(T ) implies |z| > limn kT n k1/n .
Hence r(T ) limn kT n k1/n .
13.13 Example. Suppose dim X < , where X is a C-vector space. Then L(X ) = B(X )
= Mdim X (C) for any
norm on X . Here, if T B(X ), (T ) = p (T ) = { C : det(I T ) = 0}. Observe r(T ) = max{|| : (T )}
depends only on algebraic structure of T , whereas kT k depends on our choice of norm on X . But, by SRF,
r(T ) = limn kT n k1/n depends only on algebraic structure.
14
Adjoint operators
14.1 Definition. Let X , Y be vector spaces and T L(X , Y). Define T L(Y 0 , X 0 ) by (T (f ))(x) = f (T x), so
T f = f T = f T . Linearity is clear. This is the adjoint of T .
14.2 Proposition (on the adjoint). Let X , Y, Z be normed spaces, and T, S B(X , Y), R B(Y, Z).
(i) T B(Y , X ), kT k = kT k.
(ii) If F, (S + T ) = S + T .
(iii) If T = (T ) B(X , Y ), then T x
= Tcx for all x X .
(iv) RS B(X , Z) implies (RS) = S R B(Z , X ).
Proof. We have:
(i) and (iii) If f Y ,
kT f k = sup{|T f (x)| : x B(X )} = sup{|f (T x)| : x B(X )} kf kkT k
so kT k kT k. If x X , f Y ,
T x
(f ) = x
(T f ) = T f (x) = f (T x) = Tcx(f ).
So kT k = kT |X k kT k kT k kT k. So kT k = kT k.
(ii) Obvious.
(iv) If f Z , (RS) f = f R S = S (f R) = S R f .
14.3 Definition. Let X be a normed space, Y X a subspace, and Z X a subspace.
\
The annihilator of Y in X is Y a = {f X : f|Y = 0} =
ker y (w -closed subspace of X ).
| {z }
yY w -closed
The pre-annihilator of Z on X is Za = {x X : x
|Z = 0} =
f Z
)a and (Za )a = Z
xi yi
i=1
= X .
1
k(T )1 k kxk
(iii) (i): If T is bounded below, then ker T = {0}. We also have that if T is bounded below, then Im T is closed.
Let (yn )
n=1 Im T with y = limn yn in Y. Write yn = T xn for some xn in X , and we observe
kxm xk k
1
kT (xm xk )k = kym yk k
m
1
n,
ap (T ) = { C : there is (xn )
0}.
n=1 S(X ) s.t. (I T )xn
{z
}
|
n
0
T xn xn
14.13 Example. Fix 1 p < , S B(`p ), S(x1 , x2 , . . .) = (0, x1 , x2 , . . .). We
showed earlier that p (S) = .
P
Lets compute S B(`q ), p1 + 1q = 1. Write for x `p , y `q , hx, yi = fy (x) = i=1 xi yi . We have
hx, S yi = fy (Sx) = hSx, yi =
xi1 yi =
i=2
xi yi+1
i=1
1
1
,
M
k(Tn )1 k
1
n
(T n I)
xn , so kyn k = 1. Compute
15
1
n
, so n .
n
1
n
xn + (n )yn 0.
n
Compact operators
15.1 Definition. Let X , Y be Banach spaces. An operator K L(X , Y) is compact if K(B(X )) Y is compact.
In particular, any compact operator is a fortiori bounded.
15.2 Remark. We observe for K B(X , Y) that the following are equivalent:
46
1. K is compact.
3. K(B(X )) is totally bounded, i.e. given > 0, there are x1 , . . . , xn B(X ) such that
K(B(X ))
n
[
(Kxi + D(Y)) .
i=1
15.3 Proposition. Let X , Y, Z be Banach spaces and K(X , Y) = {K B(X , Y) : K is compact}. Then
(i) K(X , Y) is a closed subspace of B(X , Y).
(ii) If S B(Y, Z) or T B(Z, X ) then for K K(X , Y) then SK, KT are both compact.
Proof. We have:
sequence in a compact space, admits a converging subsequence ((Kxnk , Lxnk ))k=1 and hence ((K+L)xnk )
k=1
converges in K(B(X )) Y.
+ + = ,
3 3 3
z
}|
{
SK(B(X )) = S(K(B(X)) S(K(B(X )))
| {z }
compact
kk
15.5 Remark. Often, F(X , Y) = K(X , Y) if X , Y have the approximation property [Grothendieck]. It
turns out the approximation property characterizes this equality. You can devise separable Banach spaces for
which this equality fails to be true.
(iii) Let I = [0, 1]. Let k C(I 2 ) (the kernel). Define K : C(I) C(I) by
Z 1
Kf (x) =
k(x, y)f (y) dy.
0
Weve claimed that C(I) is the codomain, which isnt completely obvious, so lets check that. Observe
Z 1
|Kf (x) Kf (x0 )|
|k(x, y) k(x0 , y)| |f (y)| dy sup |k(x, y) k(x0 , y)| kf k
yI
and k, being continuous on a compact space I 2 , is uniformly continuous, so Kf C(I). Its obvious that
R1
K is linear, and for all x I, |Kf (x)| 0 |k(x, y)| |f (y)| dy kkk kf k , so kKf k kkk kf k , so
kKk kkk .
47
These are the so-called integral operators. We will approximate K by finite-rank operators, and hence
show that K is compact. Let A = span{ : C(I)}, (x, y) = (x)(y). We observe that A is an
algebra of functions, which is point separating and conjugate closed (if F = C). Hence by Stone-Weierstrass
kk
Theorem, A
= C(I 2 ). Hence if k C(I 2 ) then there is (kn )
n=1 A such that limn kk kn k = 0.
R1
n
Hence if Kn f = 0 kn (, y)f (y) dy, then kK Kn k kk kn k 0. If
m(n)
kn =
n,i n,i ,
i=1
R1
0
R1
n,i ()n,i (y)f (y) dy = [ 0 n,i f ] n,i .
15.6 Remark. If k, K are as above, then by Hlders inequality for 1 < p < shows that kKf k
kkk kf kp , so K K(Lp (I), C(I)). The injection map J : C(I) Lp (I) is linear and contractive (kJk 1).
Hence JK K(Lp (I)).
15.7 Theorem. If X , Y are Banach spaces and K B(X , Y) then K K(X , Y) if and only if K K(Y , X ).
Proof. We will observe that K is w -norm continuous on bounded sets. Indeed, suppose (f )N M B(Y )
(M > 0) with f0 = limN f . Then f0 M B(Y ) (Alaoglu). Given > 0, let {Kx1 , . . . , Kxn } be an (/2M )-net
and for each N ,
of K(B(X )). Then for x B(X ), there is xj so kKx Kxj k < 2M
|f (Kx) f0 (Kx)| |f (Kx) f (Kxj )| + |f (Kxj ) f0 (Kxj )| + |f0 (Kxj ) f0 (Kx)|
kf kkKx Kxj k + |f (Kxj ) f0 (Kxj )| + kf0 kkKxj Kxk
< + |f (Kxj ) f0 (Kxj )|.
Hence
kK f K f0 k = sup |K f (x) K f0 (x)| = sup |f (Kx) f0 (Kx)|
xB(X )
xB(X )
N
K (B(X )); the first equality here is from the first proposition on adjoint operators. Also, the last space is compact.
\ )) is compact (
Hence K(B(X ))
= K(B(X
= denotes isometry).
15.8 Corollary. If X , Y are reflexive spaces, then for K B(X , Y) we have K K(X , Y) if and only if K is
w-norm continuous on bounded sets.
Proof. On reflexive spaces, w = w coincide. Also, K = (K ) . Appeal to the proof above.
16
16.1 Lemma (Key Lemma). Let X be a Banach space, and K K(X ). Suppose that there exist:
a sequence Y0 ( Y1 ( Y2 ( . . ., each Yn a closed subspace, and
scalars (n )
n=1 such that (n I K)Yn Yn1 .
Then limn n = 0.
Proof (notes 16.1). Suppose not. Then by dropping to a subsequence if necessary, we may suppose |n | > 0 for
each n, for some > 0. By the Riesz Lemma (used to prove B(X ) is not compact if dim X 6< ) there exists for each
48
i some xi Yi such that dist(xi , Yi1 ) > 21 , and kxi k 1. Let yi = (K i I)xi Yi1 , so Kxi = i xi + yi Yi .
If j < i, we have
kKxi Kxj k = ki xi + yi Kxj k = |i | kxi +
1
(yi Kxj )k |i | dist(xi , Yi1 ) .
i | {z }
2
Yi1
Hence (Kxi )
i=1 has no converging subsequence, which means that K is not compact.
16.2 Remark. We have:
(i) Let C be a compact set. If the interior 6= then || = c. Indeed, let z0 . For each z S
(i.e. |z| = 1) let rz = max{r > 0 : z0 + rz } > 0. Then {z0 + rz z} , so |S| ||. By the
Cantor-Bernstein-Schroeder Theorem, we have that || = c. Thus, if || 0 then = and hence
= is countable.
(ii) If T L(X ) (X a C-vector space) and {1 , . . . , n } is a collection of distinct eigenvalues of T and xi
ker(i I T ) \ {0} then {x1 , . . . , xn } is linearly independent.
Proof by induction. Let n = 2. If x1 = 2 x2 then 1 2 x2 = 1 x1 = T x1 = T (2 x2 ) = 2 2 x2 , so 2 (1
2 )x2 = 0 which contradicts that x2 6= 0. Hence having proved this for n 1, suppose x1 = 2 x2 + . . . + n xn .
Then 1 (2 x2 + . . . + n xn ) = 1 x1 = T x1 = T (2 x2 + . . . + n xn ) = 2 2 x2 + . . . + n n xn , so 2 (1
2 )x2 + . . . + n (1 n ) = 0, which contradicts the linear independence.
16.3 Theorem (Structure Theorem for Compact Operators). Let X be a C-Banach space, K
K(X ).
(i) If C\{0}, each ker[(I K)n ] is finite-dimensional and there is n0 so that ker[(I K)n ] = ker[(I K)n0 ]
for n n0 .
(ii) If dim X 6< , (K) = p (K) {0} and p (K) = {1 , 2 , . . .} where limn n = 0 if p (K) is infinite.
Proof. We have:
(i) Consider first the sequence ker(I K) ker(I K)2 . . .. If n0 , as suggested, does not exist, then let
n1 = 1 and nk+1 = min{n N : ker(I K)nk ( ker(I K)n }. Let Yk = ker(I K)nk , and Y0 = {0}, so
Y0 ( Y1 ( Y2 ( . . .. We have that (I K)Yj Yj1 , so by the Key Lemma we would obtain limn = 0,
but is constant which is absurd. Hence n0 exists, as claimed.
Now suppose dim ker(I K)n 6< for some n. Pick the nk , above, for which dim Yk1 < and dim Yk 6< .
Let {xi }
i=1 Yk \ Yk1 be linearly independent. Let V0 = Yk1 and Vn = span{V0 , x1 , . . . , xn }. We observe
that (I K)Vn V0 Vn1 . As well, V0 ( V1 ( V2 ( . . ., and hence by the Key Lemma we have limn = 0,
which is still absurd.
(ii) We first prove that ap (K) p (K) {0}. If ap (K) \ {0}, then there is (xn )
n=1 S(X ) such that
n
k(I K)xn k 0. Since K is compact, (Kxn )n=1 K(B(X )) admits a subsequence (Kxnj )
j=1 which
j
[
1
(K)
(K) \ D {0}
{z n }
n=1 |
finite
so that (K) is countable. Hence we have that (K) itself is countable (remark, last class). We thus have
(K) = (K) = ap (K) p (K) {0}. Thus, either |p (K)| < , in which case 0 p (K) since each
non-zero eigenvalue has finite-dimensional eigenspace, that is, dim ker[(I K)n ] < , by (i) above. And, if
|p (K)| <
6 , then 0 is the only cluster point of p (K). Because (K) is compact, 0 (K).
49
16.4 Aside. |p (K)| < . p (K) = {1 , . . . , n }. Let Vk = ker[(k I K)nk ] where nk is the maximal exponent
as in (i). dim Vk < . Then dim span{v1 , . . . , vn } < .
16.5 Example. It is possible for 0 6= K K(X ) with empty point spectrum.
Let K B(`p ), 1 p < , be given by K(x1 , x2 , . . .) = (0, x1 , 21 x2 , 13 x3 , . . .) (weighted shift).
K(x1 , x2 , . . .) = (0, x1 , 12 x2 , 31 x3 , . . .)
1
1
K 2 (x1 , x2 , . . .) = (0, 0, 12 x1 , 32
x2 , 43
x3 , . . .)
..
.
1
2
32
k!
K n (x1 , x2 , . . .) = (0, 0, . . . , 0, n!
x1 , (n+1)!
x2 , (n+2)!
x3 , . . . , (n+k)!
xk , . . .).
Note
1
n!
k!
(n+k)!
1
1
kxkp = kK n k
n!
n!
X
1
p
kKxkp =
|xn |
np
n=1
and necessarily at least one |xn |p is not zero, so this norm is not zero, which tells us that ker K = {0}. This gives
us a little bit of a warning: even as pretty as this theorem, the first part of it might essentially have no content;
there might be no eigenvalues to deal with. Because we were only able to get finite dimensionality of eigenspaces
with non-zero eigenvalues, we see that the zero part of the spectrum can contain all the craziness of the operator.
16.6 Remark. There are quite a few ways to manufacture operators that contain only 0 in the spectrum. Let
I = [0, 1], let K be a continuous function on K C(I 2 ) such that k(x, y) = 0 when x y. For example, let
(
0
if x y
k(x, y) =
x y if x > y
R1
R1
If we let K be the integral operator coming from this kernel, Kf (x) = 0 k(x, y)f (y) dy = x k(x, y)f (y) dy. Now
we can write down a formula for what happens when we iterate this operator. We can check
kK n f kp
kkkn
kf kp
n!
and hence r(K) = {0}. If k is continuous partially differentiable in the 1st coordinate, then one can show that
p (K) = .
16.7 Remark. If Kn (x1 , x2 , . . .) = (0, x, 21 x2 , . . . , n1 xn , 0, 0, . . .). Kn F(`p ) K(`p ). Check
k(K Kn )xkp
17
1
kxkp = lim kKn Kk = 0.
n
n+1
17.1 Definition. Let H, L be C-Hilbert space. Given T B(H, L). We define B(L, H) by (x, T y) := (T x, y) for
all x H, y L. Indeed, for fixed y L, x 7 (T x, y) is a bounded linear functional so by Riesz Representation
Theorem there exists T y H such that
(T x, y) = (x, T y).
Check that y 7 T y is linear.
50
Cost: (T ) = T for C.
Advantage: If L = H we can now compose T S in H.
Office hours this week. Thursday 2:30-4. Friday 2-3:30. Typo 5(b) p (Mf ) 6= .
Alternate exam time: if you wish to take advantage, send an email including all other exam times.
Recall: for H a Hilbert space, T B(H), we define T B(H) by
(T x, y) = (x, T y).
von Neumann preferred Hilbert spaces to model quantum mechanics. It plays host to non-commutative geometry;
I am involved with non-commutative harmonic analysis (a more general kind of harmonic analysis). The notion of
the adjoint is just mildly different than it is in the Banach space setting. If we have an operator on a Banach space,
then the adjoint will be defined its dual space. Hilbert spaces are essentially self-dual (we say essentially, because
the small cost of conjugation, in the complex realm).
17.2 Proposition. If H, K, L are Hilbert spaces and S, T B(H, K) and R B(K, L) then
(i) T B(K, H) and kT k = kT k.
(ii) (T + S) = T + S (meaningful only if F = C, but generally for spectral theory this is the domain in
which we wish to stay).
(iii) (RT ) = T R .
(iv) T = T (in Banach spaces, we couldnt quite say anything this nice; compare to the statement T x
= Tcx).
(v) If K = H, then kT T k = kT k2 (C -identity; makes a profound theory no study of Banach algebras is
complete without a substudy of C -algebras).
This is not very hard to prove. We will give very small hints at the details of C -algebra theory.
Proof. (i), (iii) and (iv) are pretty much just like the Banach space case. Also, (ii) is very straightforward (also
similar to the Banach space case).
(v): First, we have
kT T k kT kkT k = kT k2 .
Hence, it is really required to show kT T k kT k2 . We have
kT k2 = sup{kT xk2 : x B(H)} = sup{(T x, T x) : x B(H)} = sup{(T T x, x) : x B(H)}.
By Cauchy-Schwarz,
sup{kT T xkkxk : x B(H)} = kT T k.
This will be the driving force for the theorem on the spectral theorem for compact Hermitian operators on Hilbert
space.
We mention two theorems whose proofs are just like the Banach space case.
17.3 Theorem (Kernel-Annihilator Theorem). If H, K are Hilbert spaces, T B(H, K), then
ker T Im(T ).
17.4 Remark. In Hilbert spaces, if Y is a subspace of H, then Y = Y a = Ya (due to reflexivity).
17.5 Theorem (Schauder). If H, L are Hilbert spaces, and K B(H, L) then the following are equivalent:
(i) K K(H, L) K K(L, H).
(ii) K is w -norm continuous on bounded sets.
There is only a modest difference, because K is only slightly differently described as K from the Banach space
setting.
Lets talk about some particular classes of operators on a Hilbert space.
17.6 Definition. Let H be a Hilbert space, and T B(H). We say that
51
(i) T is positive if (T x, x) 0.
(ii) T is Hermitian (or self-adjoint) if T = T .
(iii) T is normal if T T = T T .
17.7 Proposition. If H is a Hilbert space, T B(H), then
(i) T is Hermitian if and only if [x, y] = (T x, y) is a Hermitian form.
(ii) T is positive implies T is Hermitian.
(iii) T T is positive.
(iv) If T is Hermitian, then T 2 is positive.
Proof. (i) ()
[y, x] = (T y, x) = (y, T x) = (y, T x) = (T x, y) = [x, y].
() First, observe that if [x, y] = (T x, y) defines a Hermitian form, then [x, x] = [x, x] and hence [x, x] R. Thus
we have the polarisation identity
(T x, y) = [x, y] =
k=0
k=0
1X k
1X k
1X k
i [x + ik y, x + ik y] =
i (T (x + ik y), x + ik y) =
i (x + ik y, T (x + ik y)) = (x, T y).
{z
} 4
|
4
4
R
k=0
Remark: In finite dimensions, represent T with respect to an orthonormal basis {e1 , . . . , ed } = , and we have
[T ] = [tij ] and [T ] = [T ] = [tji ] and tij = (T ej , ei ).
(ii): If T is positive, then [x, y] = (T x, y) is a positive form, i.e. [x, x] 0 for x H. Hence this form is Hermitian,
by polarisation identity.
(iii): (T T x, x) = (T x, T x) = kT xk2 0.
(iv): T = T implies that T T = T 2 .
17.8 Proposition. If T B(H) with H a Hilbert space, then there exist unique Hermitian operators Re T, Im T
such that T = Re T + i Im T .
1
Proof. Let Re T = 12 (T + T ) and Im T = 2i
(T T ). Check that (Re T ) = Re T and (Im T ) = Im T (recall
0 = T T = (T1 Re T ) + i(T2 Im T )
where T1 Re T , T2 Im T are each Hermitian. Thus, adding
0 = 0 + 0 = 2(T1 Re T ) = T1 = Re T,
and subtracting,
0 = 0 0 = 2i(T2 Im T ) = T2 = Im T.
Now we talk about why Hermitian and normal operators are absolutely as nice as an operator can be.
17.9 Corollary. If T B(H), T is normal if and only if Re T and Im T commute.
Proof. () Obvious.
() If Re T Im T = Im T Re T , then T = Re T + i Im T and Re T i Im T necessarily commute too.
17.10 Proposition. If H is a Hilbert space and T B(H), then T is normal if and only if kT xk = kT xk for all x
in H.
Proof. () kT xk2 = (T x, T x) = (T T x, x) = (T T x, x) = (T x, T x) = kT xk2 .
() 0 = kT xk2 kT xk2 = (T T x, x)(T T x, x) = ([T T T T ]x, x). Since T T T T is Hermitian, polarisation
identity shows that ([T T T T ]x, y) = 0 for all x, y H. It follows that T T T T = 0, by the Hahn-Banach
Theorem and Riesz Representation Theorem.
If we take a normal operator, its spectral radius is as large as can be: its always the norm of the operator. This is a
very magical property, and its exactly the sort of engine thats going to drive the spectral theory for both Hermitian
and normal operators. Well see this over the next two lectures (proof is just a nice mixture of the Spectral Radius
Formula, and the C -identity).
52
Office hours Th 2:30-4, F 2:00-3:30. A5 due Monday. Please make copies for study purposes. Final exam
questions posted soon.
17.11 Proposition. If H is a Hilbert space and T B(H) is normal (i.e. T T = T T ) then kT n k = kT kn .
Proof. Observe that kT k2 = kT T k by the C*-identity. Let H = T T so H = H and hence kHk2 = kH Hk =
kH 2 k. Thus for k in N:
k
k1
k1
k
kHk2 = kHk22
= kH 2 k2
= . . . = kH 2 k
Thus if n N, 2k n,
k
kHkn kHk2
= kHk2 = kH 2 k = kH n H 2
k kH n kkH 2
k kH n kkHk2
If H 6= 0, i.e. kHk =
6 0, we see that kHkn kH n k hence kH n k = kHkn . Thus
kT kn kT kn = (kT k2 )n = kT T kn = kHkn = kH n k = k(T T )n k = k(T )n T n k
where normality of T is used in the last equality.
k(T T )n k kT kn kT n k = kT kn kT n k.
Hence as above kT kn kT n k, so kT kn = kT n k.
17.12 Corollary. If T B(H) is normal, then r(T ) = kT k.
Proof. Recall Spectral Radius Formula: r(T ) = limn kT n k1/n .
This is a special property. For exame K K(`2 ) given by
Kx = (0, x1 , 21 x2 , 13 x3 , . . .)
has that kKk = 1 while r(K) = 0, i.e. (K) = {0}. Recall p (K) = .
17.13 Proposition. Let H be a Hilbert space and U B(H). Then the following are equivalent:
(i) (U x, U y) = (x, y) for x, y in H.
(ii) kU xk = kxk for x in H (i.e. U is an isometry).
(iii) U U = I.
Furthermore, if any of (i) (iii) holds, then the following are equivalent (we are not suggesting the following are
equivalent to those above):
(iv) U is surjective.
(v) U U = I (i.e. U is normal).
Recall if U U = I = U U , then U is a unitary.
Proof. (i) (ii) and (iii) (i): easy.
(ii) (iii): If kU xk2 = kxk2 , then
((U U I)x, x) = (U x, U x) (x, x) = 0
and by applying polarisation identity (U U I is Hermitian), we see that U U I = 0.
(iv) (v) (with (iii) holding).
(iii) U is injective. If U is also surjective and bounded below: kU xk kxk, by (ii), means that U 1 B(H)
exists. U 1 = IU 1 = U U U 1 = U (by (iv)).
(v) (iv): U U = I = U U (i.e. by (iii)) then U = U 1 so U is bijective.
In finite dimensions, U U = I implies U is invertible and hence U = U 1 . In infinite dimensions, this may not be
true:
U B(`2 ),
U (x1 , x2 , . . .) = (0, x1 , x2 , x3 , . . .)
53
Calculate:
U (x1 , x2 , . . .) = (x2 , x3 , . . .)
U U (x1 , x2 , . . .) = (0, x2 , x3 , . . .)
so kU U k = 1, (U U ) = U U = U U , (U U )2 = U |U{z
U} U = U U . One can calculate, moreover, that U U = I
I
T P = P T P,
ker(IH)
P
i.e. if x H then x has a unique decomposition x = n=1,2,... xn + x0 where xn ker(n I H), x0 ker H, with
P
xn xm for n 6= m, i.e. kxk2 = n=1,2,... kxn k2 + kx0 k2 . Thus, letting Pn = Pker(n IH) (orthogonal projection),
we have
X
H=
n Pn
n=1,2,...
54
Proof. Let E = span{ker(n I H) : n = 1, 2, . . .}. Then part (ii) of the above lemma tells us that
M
E = `2
ker(n I H)
n=1,2,...
We note that E contains all eigenspaces corresponding to nonzero eigenvalues of H. Now, let H0 = E and P0 = PH0
(orthogonal projection).
We observe, first, that HH0 H0 . Indeed ,if x H0 , and y E then (Hx, y) = (x, Hy) = 0 since HE E
so Hy E and x E by assumption. That is Hx H0 whenever x H0 . Thus H = HP0 = P0 HP0 (by
proof of part (iii) of lemma above) is a hermitian operator: H0 = (P0 HP0 ) = P0 HP0 = P0 HP0 . Since H0 is
compact and since r(H0 ) = kH0 k, there exists (H0 ) such that || = kH0 k. Further, if H0 6= 0, i.e. kH0 k =
6 0,
then p (H0 ). If 6= 0 and x ker(I H0 ) \ {0} then x E because x ker(0I H) = ker H and
E ker H0 = ker P0 HP0 ker P0 = E. Thus for x as above, P0 x = x. Thus Hx = HP0 x = H0 x = x. But this
contradicts the fact that E contains all eigenspaces
of H corresponding to nonzero eigenvalues. Thus H0 = 0 and
L
H0 = ker H. Hence H = E 2 E = `2 n=1,2,... ker(n I H) 2 ker H.
To see the operator form of this, let
H1 =
n Pn .
We observe that if {1 , 2 , . . .} is infinite then limn n = 0 by the structure theorem for general compact
operators and
2
2
!
2
N
N
X
X
X
n Pn x
n Pn x
= sup
n Pn
= sup
H1
H1
xB(H)
xB(H)
n=1
n=1
n=N +1
|n |2 kPn xk2
xB(H) n=N +1
sup |n |2 0.
CHECK nN +1
So
H1 = lim
N
X
n Pn
n=1
where P1 , P2 , . . . are from above. The sense of convergence above is pointwise (strong operator topology):
N
X
T x = lim
Pn T Pn x
n=1
(if {1 , 2 , . . .} is infinite).
Proof. Part (iii) of the Lemma shows that T Pn = Pn T Pn for n = 0, 1, 2, . . . Moreover
M
H = `2
Im Pn 2 Im(P0 )
n=1,2,...
Pn + P0
n=1,2,...
n=1,2,...
55
17.19 Corollary (Simultaneous Diagonalisation Theorem). If H1 , H2 K(H) are both Hermitian and
H2 H1 = H1 H2 then there are finite rank orthogonal projections {Q1 , Q2 , . . .} K(H) such that Im Qn Im Qm
for n 6= m and for any scalars 1 , 2 C we can write
X
1 H1 + 2 H2 =
n (1 , 2 )Qn
n=1,2,...
where each n (1 , 2 ) is a C-valued bilinear form and limn n (1 , 2 ) = 0, so the series converges in norm.
P
Proof. First, write H1 = n=1,2,... n Pn as in the spectral theorem. We have, by the Corollary above, that
H2 =
Pn H2 Pn + P0 H2 P0
n=1,2,...
where P0 = Pker H1 . We have that each Pn H2 Pn is compact (in fact finite rank if n 0). So in fact each Pn H2 Pn
can be written
X
Pn H 2 Pn =
i,n Qi,n
i=1,2,...
where
X
Qi,n = Pn
i=1,2,...
(in the pointwise sense if infinite i.e. n = 0) by the structure of Pn H2 Pn . We let {Q1 , Q2 , . . .} be an enumeration
of {Qi,n : i = 1, 2, . . . ; n = 1, 2, . . .} and let n (1 , 2 ) = 1 m + 2 i,m where Qn = Qi,m .
17.20 Corollary (Spectral Theorem for Compact Normal Operators). If N K(H) is normal,
then there is a sequence of finite rank projections {Q1 , Q2 , . . .} such that
X
N=
n Qn
n=1,2,...
1
2i (N
17.21 Theorem (Spectral Theorem for Compact Normal Operators). If N K(H) is normal,
then there exists a sequence (Qn )n=1,2,... of finite rank orthogonal projections with Im Qn Im Qm for m 6= n, and
a sequence (n )n=1,2,... C \ {0}, and limn n = 0 if infinite, such that
X
N=
n Qn
n=1,2,...
ker(
Pn I N ). Let (ei )i=1 be an enumeration of {enj : n = 0, 1, 2, . . . ; j = 1, . . . , m(n); j= 1, 2, . . . if n = 0}. Let
U i=1 xi ei = (xi )i=1 `2 . Check that U is unitary. Check that U N U = Ma , a = (ai )i=1 , ai = nj according to
ei = enj .
17.24 Remark. As usual, let C((N )) denote the continuous C-valued functions on (N ). Define N : C((N ))
B(H) to be the unique continuous linear operator such that
N (p) = p(N, N )
if p(z, z) is a polynomial in z, z. Check that kN (p)k = supz(N ) |p(z, z)|. By Stone-Weierstrass, such polynomials
are dense in C((N )). The operator N is called a functional calculus. Often we write f (N ) = N (f ). Note that
for f, g C((N )), f g(N ) = f (N )g(N ), and f (N ) = f (N ) .
56
17.25 Proposition. If T K(H) is positive, then there is a unique positive operator S K(H) such that S 2 = T .
We write S = T 1/2 .
Proof. Recall, T positive means that (T x, x) 0 for all x H. Hence (T ) [0, ). Indeed, if (T ) \ {0},
then p (T ) and we have, for x ker(I T ) \ {0} that
(x, x) = (x, x) = (T x, x) 0
so > 0. By the Spectral Theorem (Hermitian),
X
T =
n Pn .
n=1,2,...
P
Let S = n=1,2,... n Pn . Suppose S1 K(H) and S1 0, S12 = T . Then S1 S12 = S12 S1 , so S1 T = T S1 . Hence by
an earlier corollary,
X
S1 =
Pn S1 Pn + P0 S1 P0
n=1,2,...
where
spani=1,2,... Im(Qn,i ) Im Pn
i=1,2,...
Then
X
n=1,2,...
n Pn = T = S12 =
(Pn S1 Pn )2 =
n=1,2,...
n=1,2,... m=1,2,...
2n,i Qn,i
| {z }
Q2n,i =Qn,i
where (*) holds since Im Pn Im Pm for n 6= m. This forces 2n,1 = n , which means S1 = S, above.
17.26 Theorem (Polar decomposition). Let K K(H). Then let |K| = (K K)1/2 . There is a unique
operator U B(H) such that
U |K| = K,
ker U = ker K
and kU xk = kxk whenever x (ker K) . Here U is called a partial isometry.
Proof. First, for x H
k|K|xk2 = (|K|x, |K|x) = (|K|2 x, x) = (K Kx, x) = (Kx, Kx) = kKxk2 .
Hence we can define V0 : Im |K| H by V0 |K|x = Kx, so kV0 yk = kyk, i.e. V0 is an isometry. Let V : Im |K| H
be the unique continuous extension of V0 . As usual, V is an isometry and is linear. By the Kernel-Annihilator
Theorem, we know that Im |K| = (ker |K|) and we note from above that ker K = ker |K|. Let U : H H be
given by V P where P = P(ker K) . We observe that if y Im |K| = (ker K) then kU yk = kV P yk = kV yk since
P y = y, hence kU yk = kyk. Also, ker U ker P = ker K so ker U = ker K.
57