Professional Documents
Culture Documents
If you have printed out the page with the measuring tapes on them from the
CD, cut them out with the edges of the paper aligned with the measuring edges
of the printed tapes. Tape measures A and B should be taped together at a right
angle to one another with the measuring edges facing one another. We will call
this taped-together object the Side Measurer. The Side Measurer will be used to
measure the lengths of the two sides of a right triangle, while tape measure C,
which we will call the Hypotenuse Measurer, will be used to measure the length
of the hypotenuse, in the common set of units inscribed on the three measures.
1 Each chapter in Part I of the book starts with a “hands-on exercise.” These are intended to help the reader
develop spatial awareness. They are not supposed to be scientific experiments, rather they are exercises to
limber up the mind. The reader is free to skip them, of course.
Downloaded from https://www.cambridge.org/core. Columbia University Libraries, on 21 Aug 2018 at 12:58:29, subject to the Cambridge Core
terms of use, available at https://www.cambridge.org/core/terms. https://doi.org/10.1017/CBO9780511755811.002
4 From Pythagoras to spacetime geometry
Go to your desk or table and tape the corner of the Side Measurer onto some
convenient location on its surface. Now use the Hypotenuse Measurer to measure
the distances between the locations on the Side Measurer marked by the numbers
1, 2, 3, 4, 5, 6. In other words, measure the distances from 1 to 1, 2 to 2 and so on.
Make a table on your graph paper to record your measurements. Plot the results on
the graph paper with the side lengths on the x axis and the hypotenuse lengths on
the y axis.
Next untape the Side Measurer from the desk or table. Grab your large spherical
object (henceforth referred to as the LSO) and tape the corner of the Side Measurer
onto some convenient location on its surface, taking care to preserve the right
angle where tape measures A and B are taped together. Now use the Hypotenuse
Measurer to measure the same set of distances that you measured previously when
the Side Measurer was taped to the table or desk. Write them down in a table as
you did above, and then plot the data on the plot√ you made above.
Now on the same plot, draw the line y = 2 x. Write down any impressions
you have or conclusions you arrive at by looking at these data, and save them for
later.
Given a right triangle, the sum of the squares bounding the two legs of the triangle is equal
to the square bounding the hypotenuse of the triangle.
In Pythagoras’ time, there was only geometry – algebra was still 1300 years
in the future. Pythagoras wasn’t talking about the squares of the lengths of the
sides as numbers. He proved his theorem by cutting up the squares on the legs and
showing that the pieces could be reassembled into the square on the hypotenuse,
so that the two squares truly were equal. But now that we have algebra, we can say
that if the lengths of the two sides of a right triangle are denoted by A and B, then
the length C of the hypotenuse of the right triangle is given by solving the equation
A2 + B 2 = C 2 (1.1)
Downloaded from https://www.cambridge.org/core. Columbia University Libraries, on 21 Aug 2018 at 12:58:29, subject to the Cambridge Core
terms of use, available at https://www.cambridge.org/core/terms. https://doi.org/10.1017/CBO9780511755811.002
Pythagoras and the measurement of space 5
6
Hypotenuse
2 LSO
Desk
1 2 3 4 5 6
Legs
Fig. 1.1. Data from hands-on exercise.
As you should be able to see in the hands-on exercise, this formula works quite
reliably when we’re measuring right triangles on a desk or table but begins to fail
when we measure right triangles on the LSO. One set of data from this exercise is
plotted in Figure 1.1.
Now let’s use math to explore this issue further. Consider a right isosceles tri-
angle on a two-dimensional sphere of radius R with azimuthal angle θ and polar
angle φ. A triangle in flat space is determined by three straight lines. The closest
analog to a straight line on a sphere is a great circle. Let’s make the legs of our right
triangle extend from the north pole of the sphere along the great circles determined
by φ = 0 and φ = π/2, beginning at θ = 0 and terminating at θ = θ0 . The arc of
a great circle of radius R subtending an angle θ0 has arc length Rθ0 , so we can say
that A = B = Rθ0 . The arc length of the great circle serving as the hypotenuse is
given by
Downloaded from https://www.cambridge.org/core. Columbia University Libraries, on 21 Aug 2018 at 12:58:29, subject to the Cambridge Core
terms of use, available at https://www.cambridge.org/core/terms. https://doi.org/10.1017/CBO9780511755811.002
6 From Pythagoras to spacetime geometry
Desk
Sphere
Hypotenuse 3
For small values of θ, where C/R 1, the Pythagorean rule works fairly well,
although not exactly, on the sphere, but eventually the formula fails. We can see
how badly it fails in Figure 1.2. This is because the sphere is curved, and the
Pythagorean formula only works on flat surfaces. The formula works approxi-
mately on the sphere when the distance being measured is small compared to
the radius of curvature of the sphere. The mathematical way of saying this is that
the sphere is locally flat.
But what does this have to do with Einstein’s Special Theory of Relativity? In
this book we’re going to develop the concept of spacetime, but we’re only going to
study flat spacetime, because that’s what special relativity is all about. Everything
you will learn in this book will apply to flat spacetime in the same way that the
Pythagorean formula applies to flat space. In the real world we experience the force
Downloaded from https://www.cambridge.org/core. Columbia University Libraries, on 21 Aug 2018 at 12:58:29, subject to the Cambridge Core
terms of use, available at https://www.cambridge.org/core/terms. https://doi.org/10.1017/CBO9780511755811.002
Pythagoras and the measurement of space 7
The world we know seems to have three space dimensions, but this is no problem
because a rectangular coordinate system can be defined just as easily in any number
of dimensions. In D space dimensions we can describe each point P at which
an object could be located or an event could take place by a position vector r
representing a collection of D coordinates (x 1 , x 2 , . . . , x D ) in a D-dimensional
rectangular grid. The distance r12 between two points r1 and r2 is given by the
Pythagorean formula generalized to D dimensions
D
|r12 |2 = (x1i − x2i )2 . (1.5)
i =1
Any position vector r in this D-dimensional space can be written in terms of the
D coordinate components x i in a basis of D orthonormal vectors êi as
D
r = x i êi . (1.6)
i =1
Downloaded from https://www.cambridge.org/core. Columbia University Libraries, on 21 Aug 2018 at 12:58:29, subject to the Cambridge Core
terms of use, available at https://www.cambridge.org/core/terms. https://doi.org/10.1017/CBO9780511755811.002
8 From Pythagoras to spacetime geometry
4
P1
2
L12
∆y
0
P2
∆x
-2
-4
-4 -2 0 2 4
Fig. 1.3. Space as a flat rectangular coordinate grid.
êi · ê j = δi j , (1.7)
Any other vector V in this space can then be represented in this orthonormal
basis as
D
V = V i êi , V i = V · êi , (1.9)
i =1
where the set of D numbers V i are said to be the components of the vector V in
this specific basis. Note that the same vector can be represented in more than one
basis. This is an extremely important thing to remember, and we will return to it
again and again in this book, in greater and greater detail, because this is one of
the basic mathematical ideas behind the principle of relativity, both special and
general.
Downloaded from https://www.cambridge.org/core. Columbia University Libraries, on 21 Aug 2018 at 12:58:29, subject to the Cambridge Core
terms of use, available at https://www.cambridge.org/core/terms. https://doi.org/10.1017/CBO9780511755811.002
The differential version, in D dimensions 9
If we use the Kronecker delta function δi j as defined in Eq. (1.8), and adopt the
convention of summing over repeated indices, this expression can be rewritten as
dl 2 = δi j d x i d x j . (1.12)
In differential geometry this object is called the Euclidean metric in rectangular
coordinates. Euclidean space is another name for flat space. A metric is another
name for an infinitesimal line element. Using the methods of differential geom-
etry, the curvature of a given space can be calculated from the first and second
derivatives of metric functions gi j (x), which replace the constant δi j if the space
has nonzero curvature. If we were to calculate the curvature of the D-dimensional
space whose metric is Eq. (1.12), we would find that it is exactly zero, because all
of the derivatives of the components of δi j are zero. But that’s a subject for another
book.
Now that we have an infinitesimal line element, we can integrate it to find the
lengths of lines that are not straight but curved, using the differential version of
the Pythagorean formula, also known as the Euclidean metric in rectangular coor-
dinates. If we have some curve C between points P1 and P2 then the length L of
the curve is given by
P2
L = dl. (1.13)
P1
Downloaded from https://www.cambridge.org/core. Columbia University Libraries, on 21 Aug 2018 at 12:58:29, subject to the Cambridge Core
terms of use, available at https://www.cambridge.org/core/terms. https://doi.org/10.1017/CBO9780511755811.002
10 From Pythagoras to spacetime geometry
and
2π 2π
L = δi j ẋ i ẋ j dt = R dt = 2π R. (1.16)
0 0
Since it’s guaranteed that δi j ẋ i ẋ j ≥ 0, formula (1.14) for the length of a curve is
always positive as long as the curve itself is well-behaved. This won’t continue to
be the case when we graduate from space to spacetime.
The constants b1 and b2 represent a shift in the origin of the coordinate system.
Taking the differential of this expression automatically gets rid of b1 and b2 , and
this reflects the freedom with which we can set the origin of the coordinate system
anywhere in the space without changing the metric. This freedom is called trans-
lation invariance, and ultimately, as we shall show in a later chapter, it leads to
conservation of momentum for objects moving in this space.
Downloaded from https://www.cambridge.org/core. Columbia University Libraries, on 21 Aug 2018 at 12:58:29, subject to the Cambridge Core
terms of use, available at https://www.cambridge.org/core/terms. https://doi.org/10.1017/CBO9780511755811.002
Rotations preserve the Euclidean metric 11
If we require that the metric remain unchanged under the rest of the transforma-
tion, so that
d x̃ 2 + d ỹ 2 = d x 2 + dy 2 , (1.18)
then it must be true that
2
a11 + a12
2
= a21
2
+ a22
2
= 1, a11 a12 = −a21 a22 . (1.19)
We have three equations for four variables, so instead of a unique solution, we get
a continuous one-parameter family of solutions that can be written in terms of an
angular parameter θ as
a11 = a22 = cos θ, a12 = −a21 = sin θ. (1.20)
This describes a rotation by an angle θ. We also get a second family of solutions
a11 = −a22 = cos θ, a12 = a21 = sin θ, (1.21)
which represents a reflection about an axis characterized by θ . Transformations
like this will be discussed in more detail in Chapter 8.
Expanding out the matrix multiplication for the solution in (1.20), our coordi-
nate transformation becomes
x̃ = x cos θ + y sin θ
ỹ = −x sin θ + y cos θ. (1.22)
This transformation is a rotation of the rectangular coordinate system by an angle
θ about the origin at x = y = 0. To see this, look at the x̃ and ỹ axes, represented
by the line ỹ = 0 and x̃ = 0, respectively. In the (x, y) coordinate system, they
become
ỹ = 0 → y = x tan θ
x̃ = 0 → x = −y tan θ, (1.23)
and if we plot this we can see that the x̃ and ỹ axes are both rotated counterclock-
wise by an angle θ compared to the x and y axes.
As the angle from which we view an object changes, we don’t expect the object
itself to change. Suppose we have a position vector r pointing to some object in
two-dimensional Euclidean space. In the (x, y) coordinate system, r can be written
r = x êx + y ê y . (1.24)
If we require that the position vector itself remain unchanged as we rotate the
coordinate system in which the vector is described, so that
r = x êx + y ê y = x̃ êx̃ + ỹ ê ỹ , (1.25)
Downloaded from https://www.cambridge.org/core. Columbia University Libraries, on 21 Aug 2018 at 12:58:29, subject to the Cambridge Core
terms of use, available at https://www.cambridge.org/core/terms. https://doi.org/10.1017/CBO9780511755811.002
12 From Pythagoras to spacetime geometry
then the transformation of the coordinate components of the vector must cancel
the transformation of the unit basis vectors. Therefore the transformation rule for
the orthonormal basis vectors must be
êx̃ = êx cos θ + ê y sin θ
ê ỹ = −êx sin θ + ê y cos θ. (1.26)
The rotation transformation can be written in matrix form as
cos θ sin θ
R(θ) = . (1.27)
− sin θ cos θ
The inverse transformation is a rotation in the opposite direction
−1 cos θ − sin θ
R(θ) = R(−θ ) = . (1.28)
sin θ cos θ
The matrix R(θ) satisfies the conditions det R = 1 and R T I R = I , where I is the
identity matrix
1 0
I = . (1.29)
0 1
The first condition classifies R as a special, as opposed to a general, linear transfor-
mation. The second condition classifies R as an orthogonal matrix. The full name
of the group of linear transformations represented by R(θ) is the special orthogo-
nal group in two space dimensions, or S O(2)2 for short. Transformations by R(θ)
take place around a circle, which can be thought of as a one-dimensional sphere,
known as S 1 for short.
A transformation matrix can have the properties of being special and orthogonal
in any number of dimensions, so rotational invariance is easily generalized to D
space dimensions, although the matrices get more complicated as D increases. As
one would expect, since we have S O(2) for D = 2, we have S O(D) for arbitrary
D. A rotation in D space dimensions takes place on a (D − 1)-dimensional sphere,
called S D−1 for short. We will examine rotational invariance in D space dimen-
sions in greater detail in a later chapter. For now, everything we want to accomplish
in this chapter can be achieved using the simplest case of D = 2.
Downloaded from https://www.cambridge.org/core. Columbia University Libraries, on 21 Aug 2018 at 12:58:29, subject to the Cambridge Core
terms of use, available at https://www.cambridge.org/core/terms. https://doi.org/10.1017/CBO9780511755811.002
Infinitesimal rotations 13
x̃ x + θ y + O(θ 2 )
ỹ y − θ x + O(θ 2 ) (1.30)
In this infinitesimal limit, the rotation matrix R(θ) can be written in terms of a
matrix r
0 1
R(θ) I + θ r + · · · , r= , (1.31)
−1 0
which we will call the generator of the rotation transformation. Notice that r is an
antisymmetric matrix, that is rT = −r, where the matrix transpose is defined by
(M T )i j = M ji .
Written in this manner, an infinitesimal rotation looks like the first two terms in
the expansion of an exponential
eαx ∼ 1 + αx + · · · (1.32)
Downloaded from https://www.cambridge.org/core. Columbia University Libraries, on 21 Aug 2018 at 12:58:29, subject to the Cambridge Core
terms of use, available at https://www.cambridge.org/core/terms. https://doi.org/10.1017/CBO9780511755811.002
14 From Pythagoras to spacetime geometry
Downloaded from https://www.cambridge.org/core. Columbia University Libraries, on 21 Aug 2018 at 12:58:29, subject to the Cambridge Core
terms of use, available at https://www.cambridge.org/core/terms. https://doi.org/10.1017/CBO9780511755811.002
Could a line element include time? 15
the concept of absolute time did not present a challenge to devout Christians such
as Isaac Newton, who held it to be an obvious and unquestionable truth.
In this old picture of space and time, space and time are inherently separate, and
both absolute. An object at a location marked in absolute space by the position
vector r moves along a path in absolute space that can be written as a function of
time such that
r = r(t) = x i (t)êi , (1.38)
where the implied sum over the repeated index i runs over all of the dimensions
in the space. For now, let’s restrict space to one dimension, so we’re only dealing
with one function of time x(t).
In Newtonian physics in one space dimension, in the absence of any forces, the
motion of an object is determined by the solution to the differential equation
d 2 x(t)
= 0. (1.39)
dt 2
This equation is invariant under the transformation
t˜ = t
x̃ = x − vt, (1.40)
where v is the velocity of an observer in the x̃ coordinate system relative to an
observer in the x coordinate system.
This invariance principle was first proposed by Galileo based on general phys-
ical and philosophical arguments, long before Newton’s equation existed. Galileo
argued that when comparing a moving ship with dry land, the natural laws govern-
ing the motion of an object on the ship should not depend on the motion of the ship
relative to the dry land, as long as that motion was smooth motion at a constant
velocity.
This sounds suspiciously like what we learned about Euclidean space and
rotations – an object should not change as we rotate the coordinate system used to
describe it. But now we’re not talking about rotating space into space, we’re talk-
ing about space and time, and instead of a rotation, which involves a dimensionless
angle, we have motion, which involves the dimensionful quantity of velocity.
Let’s consider the possibility that Galilean invariance could be an infinitesimal
version of some full spacetime invariance principle, a version valid only for veloc-
ities close to zero. Suppose we use as our candidate spacetime transformation the
S O(1, 1) transformation developed in the previous section.
At first it would appear that Galilean invariance is not consistent with a small
rotation by L(ξ ). However, the units by which we measure space and time are not
the same. Time is measured in units of time such as seconds or years, and space
Downloaded from https://www.cambridge.org/core. Columbia University Libraries, on 21 Aug 2018 at 12:58:29, subject to the Cambridge Core
terms of use, available at https://www.cambridge.org/core/terms. https://doi.org/10.1017/CBO9780511755811.002
16 From Pythagoras to spacetime geometry
τ̃ = τ cosh ξ − x sinh ξ
x̃ = −τ sinh ξ + x cosh ξ. (1.42)
τ̃ τ − βx
x̃ x − β τ. (1.43)
So far, this looks nothing like a Galilean transformation. However, maybe one
of the terms is smaller than the others and can be neglected. If we rewrite the above
equations back in terms of t and c then we get:
vx
t˜ t − t, c→∞ (1.44)
c2
x̃ x − vt. (1.45)
ds 2 = −dτ 2 + d x 2 (1.46)
is invariant under the coordinate transformation given by L(ξ ) in (1.42) such that
−dτ 2 + d x 2 = −d τ̃ 2 + d x̃ 2 . (1.47)
Downloaded from https://www.cambridge.org/core. Columbia University Libraries, on 21 Aug 2018 at 12:58:29, subject to the Cambridge Core
terms of use, available at https://www.cambridge.org/core/terms. https://doi.org/10.1017/CBO9780511755811.002
The Lorentz transformation 17
The metric (1.46) gives us at last the analog of the Pythagorean rule for space-
time. If curvature comes from derivatives of the metric, then this strange met-
ric with a minus sign must be flat. This metric for flat spacetime is called the
Minkowski metric, and flat spacetime is also known as Minkowski spacetime. Note
that unlike the Euclidean metric, the Minkowski metric is not positive-definite.
This is extremely important and we will explore the implications of this fact in
greater detail in later chapters.
What happens for higher dimensions? A flat spacetime with D space dimensions
and one time dimension has a line element
ds 2 = −dτ 2 + dl 2 , dl 2 = δi j d x i d x j . (1.48)
The Latin indices (i, j) refer to directions in space, and by convention take the
values (1, 2, . . . , D). When dealing with spacetime, it has become the convention
to use a set of Greek indices (µ, ν) and appoint the 0th direction as being the time
direction so that d x 0 = dτ , in which case the Minkowski metric can be written
ds 2 = ηµν d x µ d x ν , (1.49)
where
η00 = −1,
η0i = ηi0 = 0,
ηi j = δi j . (1.50)
Downloaded from https://www.cambridge.org/core. Columbia University Libraries, on 21 Aug 2018 at 12:58:29, subject to the Cambridge Core
terms of use, available at https://www.cambridge.org/core/terms. https://doi.org/10.1017/CBO9780511755811.002
18 From Pythagoras to spacetime geometry
proportional to the velocity, for the sake of brevity we shall refer to it as the ve-
locity.) Therefore an observer measuring space and time in the (τ̃ , ỹ) coordinate
system sees the observer in the (τ, x) frame not as being at rest, but moving with
velocity −β so that
d x̃
= −β. (1.51)
d τ̃
Since the observer in question is at rest in the (τ, x) coordinate frame, for
her/him d x = 0 and therefore
ds 2 = −dτ 2 . (1.52)
ds 2 = −d τ̃ 2 + d x̃ 2 = −d τ̃ 2 + β 2 d τ̃ 2 . (1.53)
Because the two coordinate systems (τ, x) and (τ̃ , ỹ) differ only by a Lorentz
transformation, and because we’re in flat spacetime, where the metric is Lorentz-
invariant, it must be true that
dτ
d τ̃ = . (1.54)
1 − β2
Taking the differential of the Lorentz transformation relating the two frames in
Eq. (1.42) gives
d τ̃ = dτ cosh ξ − d x sinh ξ
d x̃ = d x cosh ξ − dτ sinh ξ, (1.55)
Downloaded from https://www.cambridge.org/core. Columbia University Libraries, on 21 Aug 2018 at 12:58:29, subject to the Cambridge Core
terms of use, available at https://www.cambridge.org/core/terms. https://doi.org/10.1017/CBO9780511755811.002
Exercises 19
Exercises
1.1 Verify that the rotation R(θ) leaves the Euclidean line element dl 2
invariant.
1.2 Let’s look at a two-sphere of radius R whose center is at the origin of
three-dimensional flat Euclidean space, with coordinates related by
(x, y, z) = (R sin θ cos φ, R sin θ sin φ, R cos θ). (E1.1)
Consider the three great circles passing through the north pole x0 =
(0, 0, R) and the points x1 = (R sin θ0 , 0, R cos θ0 ) and x2 = (0, R sin θ0 ,
R cos θ0 ) on the sphere. These three circles define the right triangle on the
sphere described at the beginning of the chapter. Recall that the Euclidean
dot product of two vectors xi · x j = |xi ||x j | cos θi j , where θi j is the angle
between the two vectors in the two-dimensional plane they determine.
(a) Use this result to verify that the arc length of the hypotenuse of this right
triangle is given by Eq. (1.2).
(b) Expand Eq. (1.2) for small θ0 to check whether the Pythagorean rule is obeyed
in that limit.
1.3 Using the transformation rule for basis vectors, compute the components
of the vector V in the (x̃, ỹ) coordinate system of Eq. (1.17).
1.4 On a sheet of graph paper, represent the (x, y) coordinate system as a
two-dimensional rectangular grid, with y on the vertical axis and x on the
horizontal axis. Using the transformation R(θ) to relate the two coordinate
systems (x̃, ỹ) and (x, y), plot the two lines x̃ = 0 and ỹ = 0 in the (x, y)
coordinate system for the values θ = π/4, π/2, 3π/4 and π . Then draw
another coordinate grid with (x̃, ỹ) on the axes, and plot the two lines
x = 0 and y = 0 in the (x̃, ỹ) coordinate system for the same values of θ.
1.5 On a sheet of graph paper, represent the (τ, x) coordinate system as a two-
dimensional rectangular grid, as you did above for the (x, y) coordinate
Downloaded from https://www.cambridge.org/core. Columbia University Libraries, on 21 Aug 2018 at 12:58:29, subject to the Cambridge Core
terms of use, available at https://www.cambridge.org/core/terms. https://doi.org/10.1017/CBO9780511755811.002
20 From Pythagoras to spacetime geometry
system, but with τ replacing y on the vertical axis, and x on the horizontal
axis. Using the transformation L(ξ ) to relate the two coordinate systems
(τ, x) and (τ̃ , x̃), plot the two lines τ̃ = 0 and x̃ = 0 in the (τ, x) coor-
dinate system for ξ = 1/3, 1/2, 1 and 2. Then draw another coordinate
grid with (τ̃ , x̃) on the axes, and plot the two lines τ = 0 and x = 0 in the
(τ̃ , x̃) coordinate system for the same values of ξ . Using (1.56), calculate
β and γ for ξ = 1/3, 1/2, 1 and 2.
1.6 Given the 2 × 2 antisymmetric matrix
0 1
A= , (E1.2)
−1 0
compute the first four terms in the Taylor expansion of the exponential eθ A
around θ = 0 and derive a general formula for the elements of eθ A as an
infinite sum of powers of θ.
1.7 Given the 2 × 2 symmetric matrix
0 −1
S= , (E1.3)
−1 0
compute the first four terms in the Taylor expansion of the exponential eξ S
around ξ = 0 and derive a general formula for the elements of eξ S as an
infinite sum of powers of ξ .
1.8 Multiply two rotation matrices R(θ1 ) and R(θ2 ). Is the result a third rota-
tion matrix R(θ3 )? If so, what is the angle θ3 of the resulting transforma-
tion in terms of θ1 and θ2 ?
1.9 Multiply two Lorentz transformation matrices L(ξ1 ) and L(ξ2 ). Is the
result another Lorentz transformation matrix L(ξ3 )? If so, what is the
transformation parameter ξ3 of the resulting matrix in terms of ξ1 and ξ2 ?
Downloaded from https://www.cambridge.org/core. Columbia University Libraries, on 21 Aug 2018 at 12:58:29, subject to the Cambridge Core
terms of use, available at https://www.cambridge.org/core/terms. https://doi.org/10.1017/CBO9780511755811.002