Physics Module On Electrostatics and Magnetostatics

ICT programme
S. Anantha Ramakrishna,
Department of Physics, Indian Institute of Technology Kanpur
Contents
1 Introduction to electromagnetism
1.1 Some numbers associated with electromagnetism
1.1.1 Electromagnetism in natural phenomena
1.1.2 Electromagnetic devices
1.2 The electric charge and the Coulomb law
1.3 The superposition principle
2 Mathematical essentials of Vectors and Vector Calculus
2.1 Scalars and Vectors
2.1.1 Multiplication with vectors
2.1.2 Representation of vectors in terms of specified vectors
2.1.3 Symmetries of vectors under transformations
2.1.4 Visualizing scalar functions of multiple variables
2.1.5 Visualizing vector functions
2.2 A brief review of Vector Calculus
2.2.1 Fundamental theorem of Gradients
2.2.2 Gauss's Divergence theorem
2.2.3 Stokes' Theorem
2.2.4 Helmholtz theorem
2.3 Curvilinear geometries and coordinates
2.3.1 The Cartesian coordinate system
2.3.2 Cylindrical coordinate system
2.3.3 Spherical coordinates
2.3.4 An orthogonal curvilinear coordinate system
2.4 The Dirac delta function
3 Static charges, electric field and electric potential
3.1 Concept of the electric field
3.1.1 Superposition of electric fields and Charge distributions
3.2 Gauss's law of Electrostatics
3.3 The curl of the electric field
3.3.1 The electrostatic Potential
3.4 Energy associated with charge distributions
3.4.1 Problems regarding energy of a point charge
3.5 Conductors and capacitance
3.5.1 A cavity within a conductor
3.5.2 Surface charge on a conductor
3.5.3 Surface force on a conductor
3.5.4 Energy of a capacitor
4 Calculating the Electric field and potential
4.1 Poisson and Laplace equations: Boundary Value problems
4.2 Boundary conditions on the electrostatic field across charged surfaces
4.3 Uniqueness theorems
4.4 The method of images
4.4.1 A conducting infinite plane
4.4.2 A conducting sphere
4.5 Sorting out some important problems
4.5.1 A conducting sphere placed in a homogeneous electric field
4.5.2 Screening: fields within an atom
4.5.3 Why is a colloid suspension stable?
4.5.4 Oscillations of a plasma
5 Approximate description of the electric field at far-off points
5.1 Multipole expansions
5.2 Fields and potential of dipoles
5.3 Force, torque and energy associated with a dipole in electrostatic fields
6 Electrostatics of material media
6.1 Ideas of homogenization of the electric field in a material medium
6.2 Bound charges, polarizability and macroscopic polarization
6.3 Electric field inside a material medium and the displacement field
6.4 Dielectric susceptibility and dielectric permittivity
6.5 The electrostatic energy inside a dielectric medium
6.6 Sorting out some important problems
6.6.1 The electric fields of a uniformly polarized dielectric sphere
6.6.2 A dielectric sphere placed in a homogeneous electric field
6.6.3 Making a composite material with high dielectric permittivity
6.6.4 Fields of a point charge placed near a flat, large dielectric medium
6.6.5 Placing a dielectric inside a capacitor
7 Magnetic fields
7.1 Biot-Savart law for the magnetic field
7.1.1 Experimental verification of Biot-Savart law
7.2 Calculating fields for simple current configurations
7.3 Current distributions and magnetic fields for distributed currents
7.4 Forces between current carrying conductors
7.5 Divergence and curl of the magnetic field and Ampere's Law
7.6 The magnetic vector potential
7.7 Boundary conditions on magnetic fields across current sheets
8 Forces on a charge in electric and magnetic fields: The Lorentz force
8.1 The Cyclotron frequency
8.2 Motion in parallel electric and magnetic fields
8.3 Motion in crossed electric and magnetic fields
8.3.1 Confining and focussing charges
9 Magnetostatics of material media
9.1 Multipole expansions for the magnetic field
9.2 Magnetic field and magnetic vector potential of a magnetic dipole
9.3 Magnetic fields and the H field in a material medium
9.4 Basic ideas of para-, dia-, ferro- magnetic media
9.5 Magnetic fields of magnetized objects
Notes
Author index
Subject index
1
Introduction to electromagnetism
Electromagnetic forces are all-prevalent in our daily lives, and at least 70% of our
sensory perceptions arise optically, i.e., come from electromagnetic waves. Thus, it
would not be an exaggeration to say that electromagnetic phenomena dominate
our lives and have even fashioned our lifestyles in modern times, where
electronic gadgetry is so ubiquitous. Understanding electromagnetism is therefore one
of the first tasks for a scientist and engineer. This module on electrostatics and
magnetostatics is the first part of a course on Electromagnetism, with the second
part being devoted to time-varying electromagnetic fields, optics and an elementary
introduction to quantum mechanics. This module will lay down the
fundamental principles and also the mathematical background that is essential to
understand the second part of the course. We will use SI units throughout for all
quantities.
In nature, there are four fundamental forces:
1. Gravitational forces
2. Electromagnetic forces
3. Weak nuclear forces
4. Strong nuclear forces
The student must have already encountered the gravitational force that occurs
between masses in a previous course. The electromagnetic forces occur due to
static and moving electric charges and are responsible for holding together atoms,
molecules, crystals and much of the structure that we see and interact with on a
daily basis. The strong and weak nuclear forces are responsible for the stability of
nuclei that comprise most of matter. Figure ?? tries to capture the relationship between
these forces, the historical origin of the studies of electromagnetism, its applications
and the possible future of our knowledge of the universe.
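All of electromagnetism is governed by four equations that involve the sources of
the electromagnetic fields: the charges and the currents. We will put them down
just to have our perspectives in sight:
$$\nabla \cdot \vec{E} = \frac{\rho}{\varepsilon_0}, \tag{1.1}$$
$$\nabla \cdot \vec{B} = 0, \tag{1.2}$$
$$\nabla \times \vec{E} = -\frac{\partial \vec{B}}{\partial t}, \tag{1.3}$$
$$\nabla \times \vec{B} = \mu_0 \left( \vec{J} + \varepsilon_0 \frac{\partial \vec{E}}{\partial t} \right). \tag{1.4}$$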
Here $\vec{E}$ is the electric field, $\vec{B}$ is the magnetic field, $\rho$ is the charge
density in space and $\vec{J}$ is the current density in space. The above equations describe
the variation of the electric and magnetic fields in space and time ($t$) for given sources
($\rho$ and $\vec{J}$). The operator $\nabla$ is described in Chapter ??, and $\varepsilon_0$
and $\mu_0$ are constants that will be described later. All electrostatic and magnetostatic
phenomena in this module can be described by simply setting the time derivatives in these
equations to zero, as there is no time dependence. These equations were originally written
down by J. C. Maxwell, and hence they are called Maxwell's equations or the Maxwell equations.
It is interesting to note that electromagnetism is the first fundamental force that
mankind understood properly: the four Maxwell equations were automatically
Lorentz invariant and relativistically correct.
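As a sketch of what setting the time derivatives to zero leads to, the static limit of the four equations can be written out as below. The symbols are the standard ones assumed here: $\rho$ for the charge density, $\vec{J}$ for the current density, and $\varepsilon_0$, $\mu_0$ for the two constants.

```latex
\begin{align}
\nabla \cdot \vec{E} &= \frac{\rho}{\varepsilon_0}, &
\nabla \times \vec{E} &= 0 && \text{(electrostatics)}, \\
\nabla \cdot \vec{B} &= 0, &
\nabla \times \vec{B} &= \mu_0 \vec{J} && \text{(magnetostatics)}.
\end{align}
```

In this limit the electric and magnetic equations decouple: the charges alone determine $\vec{E}$ and the steady currents alone determine $\vec{B}$.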
1.1 Some numbers associated with electromagnetism
In this section, I will present some numbers for electromagnetic quantities that are
associated with various phenomena of electromagnetism and electromagnetic devices.
The electromagnetic quantities that one is concerned with are typically the electric and
magnetic fields, the charges and currents that drive these fields, the frequencies at which
the electromagnetic fields vary, the energies associated with various effects and so
on. This discussion of some particular examples will hopefully raise the curiosity to
understand the phenomena better and might enable the student to appreciate the
magnitudes of the numbers that (s)he might calculate in this course.
There is another picture of electromagnetism that is placed in Figure ??. This
picture shows a frequency scale for the electromagnetic spectrum, indicating where
various electromagnetic phenomena happen. On the left, we have time-independent
or static phenomena that correspond to zero frequency. All of the discussion in this
module will concern these. Higher up we have the radio-frequency waves that are so
essential to communications, from about hundreds of kilohertz to tens of megahertz
(hertz is the unit of frequency, Hz being the symbol). At higher frequencies than these
are the microwaves, from hundreds of MHz to hundreds of gigahertz (GHz), which
have refashioned the way we live, from mobile phones and microwave ovens to radars
for airplanes and ships. At even higher frequencies, we have the terahertz (THz)
waves, a part of the electromagnetic spectrum that has not been utilized
by mankind very well so far, but which holds promise for detection, sensing and
security scanning applications using man-made materials (metamaterials). Above the
terahertz range, we have the infra-red part of the spectrum that is so essential for the
spectroscopy of molecules, by which we have learnt so much about molecular structure.
The near-infra-red part of the spectrum (200 THz or 1500 nm wavelength) has been very
fruitfully utilized for long distance communications through fibre optics. The human eye
can sense only a very small part of the spectrum, called the visible frequencies, extending
from 400 nm to about 700 nm in wavelength (roughly 750 to 430 THz). The blackbody
radiation of the sun peaks in the yellow-green part of the spectrum, and nature has
beautifully utilized it, with the human eye being most sensitive at about 555 nm wavelength.
At higher frequencies, we have the ultra-violet spectrum ($\sim 10^{15}$ Hz), in which most
of the atomic transitions emit, and X-rays at even higher frequencies ( THz). Both
ultra-violet radiation and X-rays have ionizing properties, and exposure to them beyond
certain safe limits can result in cancers in our bodies. However, X-rays have proved
essential for medical imaging of our bones and interior organs as well as for actually
understanding (imaging in a sense) the crystal structures of most of solid matter. Beyond
these frequencies, we have the gamma ($\gamma$) rays that are emitted by nuclear reactions
and the cosmic rays whose origins are not completely understood even today.
It must be emphasized that most of our knowledge about matter and this universe
has arisen from the detection of electromagnetic waves at various frequencies across
this spectrum.
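The wavelengths and frequencies quoted above are related by $\nu = c/\lambda$. The following minimal Python sketch converts between the two; the function names are chosen here only for illustration, and vacuum wavelengths are assumed.

```python
# Speed of light in vacuum in m/s (exact by the SI definition of the metre).
C = 299_792_458.0


def frequency_thz(lam_nm: float) -> float:
    """Frequency in THz of a wave whose vacuum wavelength is lam_nm nanometres."""
    return C / (lam_nm * 1e-9) / 1e12


def wavelength_nm(freq_thz: float) -> float:
    """Vacuum wavelength in nanometres of a wave of frequency freq_thz terahertz."""
    return C / (freq_thz * 1e12) / 1e-9


# Wavelengths quoted in the text; the labels are only descriptive.
for label, lam in [
    ("fibre-optic communications", 1500.0),
    ("red end of the visible range", 700.0),
    ("peak sensitivity of the eye", 555.0),
    ("violet end of the visible range", 400.0),
]:
    print(f"{label:<32} {lam:6.0f} nm = {frequency_thz(lam):6.1f} THz")
```

With these numbers, 1500 nm corresponds to very nearly 200 THz, and the visible range of 700 nm to 400 nm spans roughly 430 THz to 750 THz.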
1.1.1 Electromagnetism in natural phenomena
Let us consider some natural phenomena involving electromagnetism and talk about
the magnitudes of various electromagnetic quantities that are involved.
Earth's atmosphere
It might surprise the student to know that there is an almost uniform vertical
electric field at the earth's surface of about 100 V/m. While the presence of buildings
etc. that are somewhat conducting will deform this electric field, the field is readily
detectable in open areas or over the surface of a large water body such as a lake or
the sea. This electric field extends to several kilometres, although its magnitude does
reduce with increasing altitude because the higher regions of the atmosphere are
more conducting and tenuous. The total potential difference from the earth's surface
to the top of the atmosphere is about 400,000 volts, the earth being negatively
charged with respect to the atmosphere! Due to ionization caused by solar radiation
in the upper atmospheric stretches, currents flow in the atmosphere driven by this
potential difference. This current, integrated over the earth's surface, is about 1800
amperes, implying that the net power associated with this process is about 700
megawatts, which is the output of a large power plant.
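The quoted figure of about 700 megawatts follows from $P = VI$ with the two numbers given above. A minimal check in Python, with variable names chosen here for illustration:

```python
# Values quoted in the text.
potential_difference_v = 400_000.0   # volts, earth's surface to top of atmosphere
total_current_a = 1_800.0            # amperes, integrated over the earth's surface

# Electrical power delivered by a current I driven through a potential difference V.
power_w = potential_difference_v * total_current_a

print(f"Power = {power_w / 1e6:.0f} MW")
```

The product is 720 MW, consistent with the rounded figure of about 700 MW.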
So let us ask, what is the reason for this electric field, for the earth's negative
charge? The reason, it turns out, is that the lightning strikes during a thunderstorm
are mostly negatively charged, thereby inducing a net negative charge on
the earth's surface. The reason why lightning is, most of the time, negative is the
peculiar distribution of charges on a cloud, which is mostly negative
in the lower stretches while being positive in the upper region. The real reason
for this peculiar charge distribution is, however, not known yet. The student is
referred to Chapter 6 of Ref. ? for a more detailed description of atmospheric effects.
Lightning
Lightning is easily one of the most impressive natural phenomena. A large amount
of charge built up on clouds discharges to the earth in one sudden burst. The duration
of the lightning flash is about 0.2 seconds, in which a typical total charge of about
2.5 coulomb (it can range from 1 to 6 C) is discharged over a distance of about 2
km. This is a good place to point out how large the unit of charge is in SI units.
The peak currents are about 20 kA, with an energy dissipation of about 100 kJ/m
along the path. An important parameter here is the voltage at which air breaks
down and becomes effectively a conductor. Initially a few ions present in the air
accelerate down the voltage column and cause more ionization in the air due to
collisions with other atoms (avalanche process). This large density of ions effectively
creates a conducting column for lightning. Air at room temperature and pressure
breaks down when an electric field of about 30 kV/cm is applied across the air column.
This breakdown at a specific voltage has been utilized in making fast switches
(spark gaps) in automobile technology for a long time.
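To get a feeling for these numbers, the short Python sketch below works out a few derived quantities from the typical values quoted above. The variable names are chosen here for illustration; the elementary charge is the exact SI value.

```python
E_CHARGE = 1.602176634e-19  # elementary charge in coulomb (exact SI value)

# Typical values quoted in the text.
charge_c = 2.5                    # total charge transferred, coulomb
duration_s = 0.2                  # duration of the flash, seconds
path_length_m = 2_000.0           # length of the discharge path, metres
energy_per_metre_j = 100e3        # energy dissipated per metre of path, joules
breakdown_field_v_per_cm = 30e3   # breakdown field of air, volts per centimetre

mean_current_a = charge_c / duration_s                  # average over the whole flash
total_energy_j = energy_per_metre_j * path_length_m     # energy along the full path
elementary_charges = charge_c / E_CHARGE                # number of electronic charges
breakdown_field_v_per_m = breakdown_field_v_per_cm * 100.0

print(f"Mean current over the flash : {mean_current_a:.1f} A")
print(f"Total energy dissipated     : {total_energy_j / 1e6:.0f} MJ")
print(f"Elementary charges in 2.5 C : {elementary_charges:.2e}")
print(f"Breakdown field of air      : {breakdown_field_v_per_m:.1e} V/m")
```

The 2.5 C transferred corresponds to more than $10^{19}$ electronic charges, which illustrates how large the coulomb is. The mean current over the whole flash comes out far below the quoted peak of about 20 kA, which suggests that most of the charge is delivered in bursts much shorter than the 0.2 s duration.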
Earth's magnetic fields
Neurons and electricity
Inside our bodies an enormous number of signals, carrying information, are transmitted
via specialized cells called neurons. The signals are transmitted between neurons
across gaps called neuronal synapses. The neuron is held at a resting potential
of about -70 mV, which can be switched off suddenly, thereby transmitting a signal.
This resting potential is accomplished by segregating ions such as Na$^{+}$, K$^{+}$ and
Ca$^{2+}$ from negative chloride ions by semi-permeable membranes.
1.1.2 Electromagnetic devices
Now we will proceed to some modern devices that crucially depend on electromagnetism
and the typical values of electromagnetic quantities that arise in them. One
should remember that devices and the mechanisms utilized in them change
very quickly, and the numbers can be quite different in a matter of a few years.
Hence it would be important to keep in mind the year 2010 in which the present
section is being written.
Magnetic resonance imaging
Mobile Phones
X-ray machines
DC and induction motors
Magnetic memories
Semiconductor chips and computers
1.2 The electric charge and the Coulomb law
From various experiments (that historically involved rubbing various insulators such
as glass rods and cat's fur, i.e., frictional electricity), it has been deduced that there
are two kinds of charges: positive and negative. The result of such experiments
can be summed up as follows: like charges repel each other while unlike charges attract
each other.
Usually it is thought that a neutral object is composed of equal amounts of positive
and negative charges. At a more fundamental level, we know that subatomic
particles either have a positive charge (examples are the proton and the positron) or a
negative charge (examples are the electron, the antiproton, the muon etc.) or are
neutral (an example is the neutron). Modern Physics tells us that for every elementary
particle there is an anti-particle that has an equal mass but opposite
charge. For example, we have the positron that is the antiparticle of the electron,
the anti-proton for the proton and so on. We have even managed to make
anti-hydrogen in 2003 by getting a positron into a bound state with an anti-proton! A
particle and its anti-particle can annihilate, and their energy (including rest mass) is
converted into gamma radiation with the appropriate energy. For example, an electron
annihilates with a positron:
$$e^{-} + e^{+} \rightarrow \gamma, \tag{1.5}$$
where the gamma radiation has a minimum energy of 1.02 MeV that corresponds to
$2 m_e c^2$, the rest-mass energy of the electron and the positron. Note that the above
reaction should conserve energy and momentum too, which is why in free space the
energy is actually shared between at least two photons.
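The figure of 1.02 MeV can be checked directly from $2 m_e c^2$. A minimal sketch in Python, using CODATA 2018 values for the constants (the variable names are chosen here for illustration):

```python
M_E = 9.1093837015e-31       # electron rest mass in kg (CODATA 2018)
C = 299_792_458.0            # speed of light in m/s (exact SI value)
E_CHARGE = 1.602176634e-19   # joules per electron-volt (exact SI value)

# Rest-mass energy of an electron-positron pair, first in joules, then in MeV.
threshold_j = 2.0 * M_E * C**2
threshold_mev = threshold_j / E_CHARGE / 1e6

print(f"2 m_e c^2 = {threshold_j:.3e} J = {threshold_mev:.3f} MeV")
```

The result is about 1.022 MeV, in agreement with the value quoted in the text.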
In addition, in any such reaction the net charge should be conserved too. We
have experimentally found that the algebraic sum of the charges in an isolated
system is always constant. A stronger statement would be that the total charge is
the same in every inertial frame. Note that while notions such as length and time
intervals change under a Lorentz transformation, the total charge does not. A
much better justification of this claim is needed and will be given when we discuss
the transformation of electromagnetic fields later.
Another important aspect of the charge is that it is quantized. In any experiment it
always occurs in multiples of the electronic charge. The electronic charge
in SI units is $e = 1.602 \times 10^{-19}$ coulomb. Millikan established with his famous
oil-drop experiment that even in macroscopic situations, the charge is always a multiple
of $e$. The charges of a proton and an electron are identical in magnitude up to 1 part
in $10^{20}$ (atoms are neutral). The electric force is such a strong force that any larger
discrepancy would be very easily discernible at terrestrial scales. It is now known
that particles (hadrons) such as the proton and the neutron have internal structure
and are made of particles called quarks. These quarks have fractional charge: the
up-quark has charge $+2e/3$ while the down-quark has $-e/3$. But quarks are never
observed as free particles. They are always present in bound states represented by
other particles: for example, the proton is a bound state of two up-quarks and one
down-quark resulting in a net charge of $+e$, while the neutron is a bound state of
one up-quark and two down-quarks resulting in a net zero charge for the neutron.
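The charge bookkeeping for the proton and the neutron can be verified with exact fractions. In the sketch below the charges are in units of $e$, and the quark content is written as a string such as "uud"; this notation and the function name are conveniences assumed here.

```python
from fractions import Fraction

# Quark charges in units of the electronic charge e.
QUARK_CHARGE = {"u": Fraction(2, 3), "d": Fraction(-1, 3)}


def hadron_charge(quarks: str) -> Fraction:
    """Net charge (in units of e) of a bound state with the given quark content."""
    return sum((QUARK_CHARGE[q] for q in quarks), Fraction(0))


print("proton  (uud):", hadron_charge("uud"))
print("neutron (udd):", hadron_charge("udd"))
```

The proton content gives a net charge of exactly $+1$ and the neutron content gives exactly $0$, in units of $e$.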
In electrostatics, the fundamental question is: given a set of charges located at
certain positions, what would be the force on a given charge at a given position?
At the heart of the solution of this question is an empirical law that governs the
forces (of repulsion or attraction) between two charges: Coulomb's law. This
law states that the force exerted by one point charge (of zero spatial extent) of
magnitude $q_1$ on another point charge of magnitude $q_2$ is given by
$$\vec{F} = k \frac{q_1 q_2}{r_{21}^2} \, \hat{r}_{21}, \tag{1.6}$$
where $r_{21}$ is the distance from the charge $q_1$ to the charge $q_2$. The force is directed
along the unit vector $\hat{r}_{21}$ from charge 1 to charge 2 and is inversely proportional to the
square of the distance between them. The force is attractive or repulsive depending
on the relative signs of the two charges.
The numerical value of the proportionality constant $k$ depends on the
units used. In SI units,
$$k = \frac{1}{4\pi\varepsilon_0} = 8.99 \times 10^{9} \ \mathrm{N\,m^2/C^2}. \tag{1.7}$$
The coulomb, the unit of charge in SI units, is a very large unit. However,
it is very convenient to define the unit of current as ampere = coulomb per second,
which is a very convenient measure for the current in most everyday
situations, and the coulomb is hence retained in the SI system of units.
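Coulomb's law is straightforward to evaluate numerically. The following is a minimal sketch, assuming point charges with positions given as 3-tuples in metres; the function name and argument order are choices made here for illustration.

```python
import math

K = 8.9875517923e9  # 1 / (4 pi epsilon_0) in N m^2 / C^2


def coulomb_force(q1, r1, q2, r2):
    """Force in newtons on charge q2 (at r2) due to charge q1 (at r1)."""
    sep = tuple(b - a for a, b in zip(r1, r2))    # vector from charge 1 to charge 2
    dist = math.sqrt(sum(c * c for c in sep))
    if dist == 0.0:
        raise ValueError("the two point charges must be at different positions")
    # k q1 q2 / r^2 along the unit vector sep / r, i.e. k q1 q2 sep / r^3.
    scale = K * q1 * q2 / dist**3
    return tuple(scale * c for c in sep)


# Two charges of 1 C each, placed 1 m apart along the x axis.
repulsive = coulomb_force(1.0, (0.0, 0.0, 0.0), 1.0, (1.0, 0.0, 0.0))
# The same geometry with charges of opposite sign.
attractive = coulomb_force(1.0, (0.0, 0.0, 0.0), -1.0, (1.0, 0.0, 0.0))

print("like charges  :", repulsive)
print("unlike charges:", attractive)
```

Two charges of one coulomb each, a metre apart, push on each other with a force of about $9 \times 10^{9}$ N, which is another way of seeing how large a unit the coulomb is. The sign of the x-component shows whether the force on the second charge points away from the first charge (repulsion) or towards it (attraction).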
The validity of Coulomb's law has been subjected to intense scrutiny. The inverse
square behaviour with the charge separation distance appears almost exact. One
may write
$$F \propto r^{-(2+\delta)},$$
where $\delta$ is the deviation from the inverse square behaviour. Experimentally, one
may fix limits on the maximum magnitude of $\delta$, depending on the sensitivity and
accuracy of the experiment. This has been a matter of concern since the times of
Cavendish, who found that $\delta < 10^{-2}$, and Maxwell ($\delta < 10^{-6}$), down to modern times,
where experiments based on variants of the original Cavendish experiment
have obtained $\delta < 10^{-14}$, and geomagnetic measurements give $\delta < 10^{-16}$. Today,
we believe that the Coulomb force obeys an inverse square behaviour exactly.
Any deviation from an exact inverse square behaviour would have serious
repercussions: for example, the photon would then have a non-zero rest mass.
Another amazing aspect of the Coulomb law is the range of lengthscales over which it
has been tested and found valid. We have confirmed the Coulomb force law down
to lengthscales of $10^{-15}$ m (at lower lengthscales we have electro-weak corrections),
while measurements on the magnetic field of Jupiter have confirmed this law up to
large lengthscales of $10^{8}$ m. Thus, it can be said that we have enormous confidence
in this law.
1.3 The superposition principle
Coulomb's law of electrostatics describes the force exerted by one charge upon
another. However, what would be the force on a given charge at a given location
due to several charges?
It has been found that the electrostatic force exerted on one charge by another
is independent of the presence of a third charge. Also, it has been found that the
electrostatic forces on any one charge due to several others add up vectorially. Thus,
the net force on charge 1 due to the other charges (2, 3, ...) is given by
$$\vec{F}_1 = \vec{F}_{12} + \vec{F}_{13} + \vec{F}_{14} + \cdots \tag{1.8}$$
The superposition principle is a profound principle of Physics without which it
might have been almost impossible to calculate anything in electromagnetism. Imagine
the complicated situation that would result if the presence of a third charge
changed the interaction of two charges. Such situations can arise in some nonlinear
systems. But linear superposition works at a fundamental level with electric
and magnetic fields.
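The vector sum of pairwise Coulomb forces can be sketched in a few lines of Python. The helper below is hypothetical (its name and the representation of the other charges as (charge, position) pairs are choices made here); it simply adds the pairwise forces, each computed as if the remaining charges were absent.

```python
import math

K = 8.9875517923e9  # 1 / (4 pi epsilon_0) in N m^2 / C^2


def net_force(q, r, others):
    """Net force on charge q at position r due to a list of (charge, position) pairs."""
    total = [0.0, 0.0, 0.0]
    for q_i, r_i in others:
        sep = [a - b for a, b in zip(r, r_i)]     # from source charge i to the charge q
        dist = math.sqrt(sum(c * c for c in sep))
        scale = K * q * q_i / dist**3             # pairwise Coulomb force, unaffected by the rest
        for axis in range(3):
            total[axis] += scale * sep[axis]      # forces add vectorially
    return tuple(total)


micro = 1e-6  # one microcoulomb

# Two equal charges placed symmetrically about the test charge: the forces cancel.
balanced = net_force(micro, (0.0, 0.0, 0.0),
                     [(micro, (1.0, 0.0, 0.0)), (micro, (-1.0, 0.0, 0.0))])

# Two equal charges on the +x and +y axes: the net force points along -(x + y).
diagonal = net_force(micro, (0.0, 0.0, 0.0),
                     [(micro, (1.0, 0.0, 0.0)), (micro, (0.0, 1.0, 0.0))])

print("balanced:", balanced)
print("diagonal:", diagonal)
```

In the symmetric arrangement the two forces cancel, while in the second arrangement the x- and y-components are equal and negative, as expected from adding two perpendicular repulsive forces of equal magnitude.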
2
Mathematical essentials of Vectors and Vector
Calculus
Here, we will first introduce the reader to the basic mathematical knowledge and
develop some essential mathematical skills necessary to calculate and deal with the
electric and magnetic fields of more complicated charge and current distributions. Since
these fields are essentially vectorial in nature and the equations that we will find
to describe them involve derivatives of these quantities, we will quickly go through
the fundamental theorems and results of vectors and vector calculus. The student
is advised to work through this chapter in order to rigorously follow the rest of the
material.
2.1 Scalars and Vectors
Most students are aware that there are quantities such as the mass of an object,
the distance covered by a car in a year (look at the odometer), temperature etc.
that can be associated with a number, and that there are physical quantities such as the
displacement of an object or the velocity of a moving car that not only have a
number, but also a sense of direction associated with them. Further, the latter do not
add up like ordinary numbers. For example, if you went south for 4 km and east
for 3 km, the net displacement from the initial point would be 5 km. Further, the final
location is not uniquely specified unless the direction from the initial point is also
stated. However, we need to make these intuitive ideas more rigorous.
Mathematically, a vector (denoted by an overhead arrow, $\vec{A}$) is an abstract object
that belongs to a set. The essential conditions are that
1. You can associate by a given rule (add) any two of these vectors to produce
another vector that belongs to the same set: $\vec{C} = \vec{A} + \vec{B}$, where $\vec{A}$, $\vec{B}$, $\vec{C}$ belong
to the set.
2. Further, this association (addition) does not depend on the order in which you
consider them (addition is commutative): $\vec{A} + \vec{B} = \vec{B} + \vec{A}$.
3. There exists an identity (zero) vector $\vec{0}$ in the set which, when associated with
any vector in the set, produces the same given vector: $\vec{A} + \vec{0} = \vec{A}$.
4. For every vector in the set, there exists another vector in the set, such that they
add up to the zero vector: $\vec{A} + (-\vec{A}) = \vec{0}$.
An example of the above mathematical objects is provided by the real vectors in three
dimensional space that can be represented by arrows, with the length of the arrow
denoting the magnitude of the vector and the orientation denoting the direction.
The zero vector has zero magnitude or length. When we add two such vectors,
we imply that the tail of one vector must be placed at the head of the other (as
shown in Fig. ??), with the resulting vector being represented by the arrow that
joins the tail of the latter to the head of the former vector. Familiar examples of
such quantities are displacements, velocities, forces etc.
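The displacement example (4 km south followed by 3 km east) and the addition rules listed above can be checked with a few lines of Python. The sketch assumes a convention chosen here, with east along $+x$ and north along $+y$, and represents vectors by tuples of components in kilometres.

```python
import math


def add(a, b):
    """Component-wise sum of two vectors given as tuples."""
    return tuple(x + y for x, y in zip(a, b))


def negate(a):
    """The vector that adds up with a to give the zero vector."""
    return tuple(-x for x in a)


def length(a):
    """Magnitude of a vector."""
    return math.sqrt(sum(x * x for x in a))


south_4 = (0.0, -4.0)   # 4 km south
east_3 = (3.0, 0.0)     # 3 km east
zero = (0.0, 0.0)

net = add(south_4, east_3)

print("net displacement:", net, "of length", length(net), "km")
print("commutative     :", add(south_4, east_3) == add(east_3, south_4))
print("identity        :", add(net, zero) == net)
print("inverse         :", add(net, negate(net)) == zero)
```

The net displacement has length 5 km even though the distances travelled add up to 7 km, and the direction (3 km east, 4 km south of the starting point) is needed to fix the final location.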
2.1.1 Multiplication with vectors
We can envision the following forms of multiplication involving vectors:
1. Multiplication of a vector by a scalar.
Multiplication of a vector by a scalar gives a vector along the same line but with
a different magnitude. For real vectors in three dimensional space, we will
constrain the scalars to be real numbers. Then the resulting vector remains
along the same direction as the original vector (or along the opposite direction
if the scalar is negative), but with a different length:
$$a \, \vec{v} = a\vec{v},$$
where $a$ is a scalar and $\vec{v}$ and $a\vec{v}$ are vectors. For real vectors, the resulting
vector's length is equal to the length of the original vector multiplied by the
modulus of the scalar number.
2. Multiplication of two vectors to yield a scalar.
We can associate a unique scalar number by a given rule with every pair of
vectors. For real vectors, we can use the rule
$$\vec{a} \cdot \vec{b} = ab \cos\theta,$$
where $a$, $b$ are the lengths of the two vectors $\vec{a}$ and $\vec{b}$ respectively and $\theta$ is the
angle between the vectors. $\vec{a} \cdot \vec{b}$ is called the scalar product or dot product. The dot
product is commutative and distributive, i.e.,
$$\vec{a} \cdot \vec{b} = \vec{b} \cdot \vec{a}, \qquad \vec{a} \cdot (\vec{b} + \vec{c}) = \vec{a} \cdot \vec{b} + \vec{a} \cdot \vec{c}.$$
3. Multiplication of two vectors to yield a vector.
This is defined for two real vectors as
$$\vec{a} \times \vec{b} = ab \sin\theta \, \hat{n}, \tag{2.1}$$
where $\hat{n}$ is the unit vector perpendicular to both vectors (with its sense fixed by
the right-hand rule). This is called a cross product and is denoted by the symbol $\times$.
The resultant is a vector. The magnitude of the cross product corresponds to the area of
the parallelogram whose sides are defined by the two vectors, and the direction is
defined to be perpendicular to the parallelogram. It is obvious that the cross
product is zero if either of the two vectors is zero or if they are parallel. It is
common to utilize the cross product to define areas using the above idea.
The cross product is anti-commutative,
$$\vec{a} \times \vec{b} = -\vec{b} \times \vec{a},$$
and distributive,
$$\vec{a} \times (\vec{b} + \vec{c}) = \vec{a} \times \vec{b} + \vec{a} \times \vec{c}.$$
Note that the associative property does not hold true, i.e.,
$$\vec{a} \times (\vec{b} \times \vec{c}) \neq (\vec{a} \times \vec{b}) \times \vec{c}.$$
4. Scalar triple product.
Another frequently appearing quantity is the triple product defined by
$$[\vec{a}, \vec{b}, \vec{c}] = \vec{a} \cdot (\vec{b} \times \vec{c}). \tag{2.2}$$
The magnitude of this quantity corresponds to the volume of the parallelepiped
whose edges are defined by the three vectors. It is easy to show that the
following triple products are equal:
$$[\vec{a}, \vec{b}, \vec{c}] = [\vec{c}, \vec{a}, \vec{b}] = [\vec{b}, \vec{c}, \vec{a}],$$
i.e., any cyclic permutation of the order of the vectors yields the same triple
product.
5. Vector triple product.
It can be shown that
$$\vec{a} \times (\vec{b} \times \vec{c}) = \vec{b}(\vec{a} \cdot \vec{c}) - \vec{c}(\vec{a} \cdot \vec{b}).$$
Use the Cartesian representation of vectors to prove this (see below). This very
useful identity can easily be remembered by noting it as the BAC-CAB rule.
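The properties of the products listed above can be checked numerically on sample vectors. The sketch below uses Cartesian components with integer entries so that the comparisons are exact; the three sample vectors are arbitrary choices made here, and a numerical check on one example is of course not a proof.

```python
def dot(a, b):
    """Scalar product of two 3-component vectors."""
    return sum(x * y for x, y in zip(a, b))


def cross(a, b):
    """Vector product of two 3-component vectors."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def scale(s, a):
    """Product of the scalar s with the vector a."""
    return tuple(s * x for x in a)


def subtract(a, b):
    """Component-wise difference a - b."""
    return tuple(x - y for x, y in zip(a, b))


def triple(a, b, c):
    """Scalar triple product [a, b, c] = a . (b x c)."""
    return dot(a, cross(b, c))


a = (1, 2, 3)
b = (-2, 0, 5)
c = (4, -1, 2)

print("anti-commutative:", cross(a, b) == scale(-1, cross(b, a)))
print("cyclic triple   :", triple(a, b, c), triple(c, a, b), triple(b, c, a))
print("BAC-CAB         :",
      cross(a, cross(b, c)) == subtract(scale(dot(a, c), b), scale(dot(a, b), c)))
print("not associative :", cross(a, cross(b, c)) != cross(cross(a, b), c))
```

For these vectors all three cyclic orderings of the triple product give 59, and the two ways of bracketing the double cross product give different vectors, illustrating that the cross product is not associative.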
2.1.2 Representation of vectors in terms of specified vectors
A vector can be represented in terms of certain specified vectors. For example, we
can think of a given displacement, say from Kanpur to Jhansi,
as $x$ kilometres west and $y$ kilometres south. Then we can write the displacement
vector as $\vec{v} = x\,\hat{x} + y\,\hat{y}$, where the unit vectors $\hat{x}$ and $\hat{y}$ represent the directions
along the west and the south. The choice of the unit vectors is usually along orthogonal
directions for the sake of convenience, but it is not necessary that the unit vectors
be orthogonal. In three dimensional space, we will denote the unit vectors in the
three orthogonal directions as $\hat{x}$, $\hat{y}$ and $\hat{z}$. Any given vector can be resolved into
components along these unit vectors:
$$\vec{A} = A_x \hat{x} + A_y \hat{y} + A_z \hat{z}. \tag{2.3}$$
The components of a given vector are obtained by the scalar products with the
respective unit vectors:
$$A_x = \hat{x} \cdot \vec{A}, \qquad A_y = \hat{y} \cdot \vec{A}, \qquad A_z = \hat{z} \cdot \vec{A}. \tag{2.4}$$
2.1.3 Symmetries of vectors under transformations
We start by asking how vectors transform under various transformations. What
are the physical properties that essentially distinguish vectors from scalars?
Scalars have the same value regardless of how we view them; thus they are
invariant under physical transformations. However, can any
arbitrary ordered triplet such as $(n, m, l)$, where $n$ represents the number of apples
in a barrel, $m$ the number of oranges and $l$ the number of bananas, be considered
a vector? Clearly such a quantity remains a barrel with the same number of apples,
oranges and bananas however we view it. Let us understand how real vectors transform,
in contrast, under the following physical transformations:
1. Shift of origin: Clearly a vector is unaffected in direction and magnitude by a
shift of the origin. Hence the transformed vector is $\vec{v}\,' = \vec{v}$. The vector can be
shifted around while keeping the direction and magnitude unaffected.
2. A rotation of the coordinate system: Let us consider vectors in two dimensions
for clarity and a rotation of the axes (the $X$-$Y$ axes about the $Z$ axis) by an
angle $\theta$ as shown in Fig. ??. Clearly, the Cartesian components in the new set
of axes are related to the old components as
$$x' = x\cos\theta + y\sin\theta, \tag{2.5}$$
$$y' = -x\sin\theta + y\cos\theta. \tag{2.6}$$
Thus, writing the vectors as column matrices,
$\vec{v}\,' = \begin{pmatrix} x' \\ y' \end{pmatrix}$ and $\vec{v} = \begin{pmatrix} x \\ y \end{pmatrix}$,
the transformed vector is given by
$$\vec{v}\,' = R\,\vec{v},$$
where
$$R = \begin{pmatrix} \cos\theta & \sin\theta \\ -\sin\theta & \cos\theta \end{pmatrix} \tag{2.7}$$
is the rotation matrix. The vector $\vec{v}$ can be obtained from $\vec{v}\,'$ via the inverse matrix
$R^{-1}$. Clearly the manner in which the components of a vector combine to produce
the components in the rotated coordinate system is very different from the way
apples and oranges behave in our above example.
3. Inversion of the coordinates: Imagine that we reflected each point by mirrors placed
on the principal planes (the $X$-$Y$ plane, the $Y$-$Z$ plane and the $Z$-$X$ plane)
containing the origin, and assigned to each point in the new coordinate system
the coordinates of the reflected point:
$$x \to -x, \qquad y \to -y, \qquad z \to -z.$$
It is obvious that each vector in the transformed system is the negative of the
original vector, $\vec{v}\,' = -\vec{v}$.
Note that objects such as the cross product $\vec{a} = \vec{b} \times \vec{c}$ remain invariant under an
inversion; hence they are called pseudo-vectors. Similarly, the scalar triple product
changes sign under an inversion, unlike an ordinary scalar; hence such quantities are
called pseudo-scalars.
4. Time reversal: $t \to -t$.
This is another symmetry operation that is important in Physics, particularly
when analyzing the dynamical nature of systems. Objects such as the velocity
(the time derivative of the displacement) pick up a negative sign under the time
reversal transformation.
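The behaviour of vector components under a rotation of the axes and under an inversion can be illustrated numerically. The sketch below assumes the convention of a rotation of the axes by an angle $\theta$ about the $Z$ axis (so that $x' = x\cos\theta + y\sin\theta$ and $y' = -x\sin\theta + y\cos\theta$); the function names and the sample vectors are choices made here.

```python
import math


def rotate_axes(v, theta):
    """Components of the 2D vector v in axes rotated by theta radians about Z."""
    x, y = v
    return (x * math.cos(theta) + y * math.sin(theta),
            -x * math.sin(theta) + y * math.cos(theta))


def invert(v):
    """Components of an ordinary vector after inversion of the coordinates."""
    return tuple(-c for c in v)


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


# Rotation: the components mix, but the length does not change,
# and rotating by -theta (the inverse matrix) recovers the original components.
v = (3.0, 4.0)
theta = math.radians(30.0)
v_new = rotate_axes(v, theta)
v_back = rotate_axes(v_new, -theta)
print("rotated components:", v_new, "length:", math.hypot(*v_new))
print("recovered         :", v_back)

# Inversion: ordinary vectors flip sign, but their cross product does not,
# while the scalar triple product does change sign.
b = (1, 2, 3)
c = (-2, 0, 5)
d = (4, -1, 2)
print("cross product unchanged :", cross(invert(b), invert(c)) == cross(b, c))
print("triple product flips sign:",
      dot(invert(d), cross(invert(b), invert(c))) == -dot(d, cross(b, c)))
```

The length of the rotated vector stays at 5 although both components change, which is the sense in which a vector differs from an arbitrary ordered set of numbers such as the apples and oranges of the earlier example.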
2.1.4 Visualizing scalar functions of multiple variables
It is important to be able to visualize the functions of multiple variables that we will
continuously encounter in our study, for example, the electric potential that depends
on the three Cartesian coordinates of the point. In many cases, just by being able to
visualize the function we may be able to obtain a deep insight into the behaviour
of the physical system. If we have a function of two variables, we can plot
the value of the function along a third axis and visualize the behaviour of the function
as shown in Fig ??. An alternative mode, if the number of independent variables were
three, would be to give a shading or color coding as shown in Fig ??. The student is
encouraged to picture these functions and to learn the use of computer software such as
MATLAB or MATHEMATICA that would provide him/her with the capability for
visualization of quite complicated functions.
2.1.5 Visualizing vector functions
It is clear that for representing vector functions, one would need to represent the
direction as well as the magnitude at every point. One typically uses arrows at
representative points to do this: the length or thickness of the arrow at each point
could serve to indicate the magnitude, with the arrow directed along the vector at the
given point (an example being shown in Fig. ??). Another possible representation
is to draw streamlines, with the tangent to the streamline being along
the vector field at each point. The density of the streamlines would have to be
proportional to the magnitude of the field (an example is shown in Fig ??), with
colour being used as another possible label for the magnitude of the field.
2.2 A brief review of Vector Calculus
In this section, we will briefly describe some essential theorems of vector calculus that we will use in our description of electrodynamics. The student is urged to become familiar with these results, as we will require them repeatedly throughout our discussions. The proofs of the theorems will, however, not be presented here, and the student is referred to Ref. ?? for the proofs.
Consider a continuous function $f(x, y, z)$. The infinitesimal change in the function due to an infinitesimal change in the position, from $(x, y, z)$ to $(x + dx, y + dy, z + dz)$, is given by
\[
df = \frac{\partial f}{\partial x}\,dx + \frac{\partial f}{\partial y}\,dy + \frac{\partial f}{\partial z}\,dz, \tag{2.8}
\]
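where $\partial f/\partial x$ etc. are the partial derivatives of the function, i.e., they describe the rate of change of the function due to a change of that variable alone while the other variables are kept fixed. The above relation is the fundamental relation of differential calculus. Noting that the infinitesimal shift in the position can be described by the infinitesimal vector $d\vec{r} = \hat{x}\,dx + \hat{y}\,dy + \hat{z}\,dz$, we write
\[
\begin{aligned}
df &= \left( \frac{\partial f}{\partial x}\,\hat{x} + \frac{\partial f}{\partial y}\,\hat{y} + \frac{\partial f}{\partial z}\,\hat{z} \right) \cdot \left( \hat{x}\,dx + \hat{y}\,dy + \hat{z}\,dz \right) \\
&= \nabla f \cdot d\vec{r},
\end{aligned} \tag{2.9}
\]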
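where $\nabla$ is the vector operator called the gradient, defined to operate on a scalar function such that
\[
\nabla f = \frac{\partial f}{\partial x}\,\hat{x} + \frac{\partial f}{\partial y}\,\hat{y} + \frac{\partial f}{\partial z}\,\hat{z}. \tag{2.10}
\]
Since $df = |\nabla f|\,|d\vec{r}|\cos\alpha$, where $\alpha$ is the angle between the gradient and the displacement, the change in the function is maximum when the displacement is parallel to the gradient. We therefore see that the vector $\nabla f$ points along the direction of greatest increase in $f$, and $|\nabla f|$ gives the magnitude of the slope in this direction. Hence the name gradient for this object.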
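We can define a vector operator
\[
\nabla = \hat{x}\,\frac{\partial}{\partial x} + \hat{y}\,\frac{\partial}{\partial y} + \hat{z}\,\frac{\partial}{\partial z}, \tag{2.11}
\]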
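with the understanding that, to be meaningful, the operator has to operate on a function. Once this vector operator has been defined, we can define other operations with it, such as operation on a vector field through a scalar product or through a vector product. Operation through the scalar product yields the divergence:
\[
\begin{aligned}
\nabla \cdot \vec{A}(x, y, z) &= \left( \hat{x}\,\frac{\partial}{\partial x} + \hat{y}\,\frac{\partial}{\partial y} + \hat{z}\,\frac{\partial}{\partial z} \right) \cdot \left( A_x\,\hat{x} + A_y\,\hat{y} + A_z\,\hat{z} \right) \\
&= \frac{\partial A_x}{\partial x} + \frac{\partial A_y}{\partial y} + \frac{\partial A_z}{\partial z}.
\end{aligned} \tag{2.12}
\]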
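In a similar manner, operation of the gradient operator on a vector field $\vec{A}$ through a vector product yields the curl:
\[
\begin{aligned}
\nabla \times \vec{A} &= \begin{vmatrix} \hat{x} & \hat{y} & \hat{z} \\ \partial/\partial x & \partial/\partial y & \partial/\partial z \\ A_x & A_y & A_z \end{vmatrix} \\
&= \hat{x} \left( \frac{\partial A_z}{\partial y} - \frac{\partial A_y}{\partial z} \right) + \hat{y} \left( \frac{\partial A_x}{\partial z} - \frac{\partial A_z}{\partial x} \right) + \hat{z} \left( \frac{\partial A_y}{\partial x} - \frac{\partial A_x}{\partial y} \right).
\end{aligned} \tag{2.13}
\]
The divergence of a vector field is related to the sources or sinks of the field, while the curl is related to the rotational aspects of the field. These aspects will become clear below when we discuss the Divergence and Stokes theorems.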
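EXAMPLE:
1. Calculate $\nabla \left( \frac{1}{|\vec{r} - \vec{r}'|} \right)$, where $\vec{r}'$ is a fixed vector.
\[
\begin{aligned}
\nabla \left( \frac{1}{|\vec{r} - \vec{r}'|} \right) &= \left( \hat{x}\,\frac{\partial}{\partial x} + \hat{y}\,\frac{\partial}{\partial y} + \hat{z}\,\frac{\partial}{\partial z} \right) \left( \frac{1}{[(x - x')^2 + (y - y')^2 + (z - z')^2]^{1/2}} \right) \\
&= -\frac{(x - x')\,\hat{x}}{[(x - x')^2 + (y - y')^2 + (z - z')^2]^{3/2}} + (y\ \text{term}) + (z\ \text{term}) \\
&= -\frac{(\vec{r} - \vec{r}')}{|\vec{r} - \vec{r}'|^3}.
\end{aligned} \tag{2.14}
\]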
2. Obtain the divergence
3. Obtain the curl
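The result in Eq. (2.14) can be checked numerically. The following is a minimal sketch, not part of the original notes: it compares a central-difference estimate of the gradient of $1/|\vec{r} - \vec{r}'|$ with the closed form $-(\vec{r} - \vec{r}')/|\vec{r} - \vec{r}'|^3$. The function names, the sample points and the step size are illustrative assumptions.

```python
import math

def inverse_distance(point, source):
    """Scalar function f(r) = 1/|r - r'| for a fixed source point r'."""
    return 1.0 / math.dist(point, source)

def numerical_gradient(func, point, step=1e-5):
    """Central-difference estimate of (df/dx, df/dy, df/dz) at the given point."""
    gradient = []
    for axis in range(3):
        forward = list(point)
        backward = list(point)
        forward[axis] += step
        backward[axis] -= step
        gradient.append((func(forward) - func(backward)) / (2.0 * step))
    return gradient

def analytic_gradient(point, source):
    """Closed form of Eq. (2.14): -(r - r') / |r - r'|^3."""
    distance = math.dist(point, source)
    return [-(p - s) / distance**3 for p, s in zip(point, source)]

# Arbitrary sample points (hypothetical values chosen for illustration).
field_point = (1.0, 2.0, -0.5)
source_point = (0.2, -0.3, 0.4)

numeric = numerical_gradient(lambda p: inverse_distance(p, source_point), field_point)
exact = analytic_gradient(field_point, source_point)
max_error = max(abs(n - e) for n, e in zip(numeric, exact))

print("numerical:", numeric)
print("analytic: ", exact)
print("largest component difference:", max_error)
```

The two results should agree to many decimal places; the remaining difference comes from the finite step size and from floating-point rounding.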
2.2.1 Fundamental theorem of Gradients
The Fundamental theorem of Gradients states that the line integral of the gradient of a scalar function $f(\vec{r})$ depends only on the values of the function at the end points ($\vec{r}_1$ and $\vec{r}_2$):
\[
\int_{\vec{r}_1}^{\vec{r}_2} \nabla f(\vec{r}) \cdot d\vec{r} = f(\vec{r}_2) - f(\vec{r}_1). \tag{2.15}
\]
This is consistent with the interpretation of the gradient through the infinitesimal change in the function, $df = \nabla f \cdot d\vec{r}$. It also implies that the line integral over a closed loop $C$ is
\[
\oint_C \nabla f(\vec{r}) \cdot d\vec{r} = 0.
\]
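The path independence stated by the Fundamental theorem of Gradients can be illustrated numerically. The sketch below is only an illustration under assumed inputs: the scalar function $f = xy + \sin z$, the two paths (a helix and a straight line, both running from $(1, 0, 0)$ to $(1, 0, 2)$) and the midpoint-rule integrator are all choices made here, not taken from the notes.

```python
import math

def f(x, y, z):
    """Sample scalar function (an arbitrary choice for this sketch)."""
    return x * y + math.sin(z)

def grad_f(x, y, z):
    """Gradient of the sample function, worked out by hand."""
    return (y, x, math.cos(z))

def helix(t):
    """One full turn of a helix from (1, 0, 0) at t = 0 to (1, 0, 2) at t = 1."""
    return (math.cos(2.0 * math.pi * t), math.sin(2.0 * math.pi * t), 2.0 * t)

def straight(t):
    """Straight line between the same two end points."""
    return (1.0, 0.0, 2.0 * t)

def line_integral(grad, path, n=20000):
    """Midpoint-rule estimate of the integral of grad . dr along path(t), 0 <= t <= 1."""
    total = 0.0
    for k in range(n):
        start = path(k / n)
        end = path((k + 1) / n)
        middle = path((k + 0.5) / n)
        g = grad(*middle)
        total += sum(gi * (e - s) for gi, s, e in zip(g, start, end))
    return total

end_point_difference = f(*helix(1.0)) - f(*helix(0.0))
along_helix = line_integral(grad_f, helix)
along_line = line_integral(grad_f, straight)

print("f(r2) - f(r1):       ", end_point_difference)
print("integral along helix:", along_helix)
print("integral along line: ", along_line)
```

Both line integrals should agree with $f(\vec{r}_2) - f(\vec{r}_1)$ to within the discretization error, even though the two paths are very different.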
2.2.2 Gauss's Divergence theorem
This important result concerns the divergence of a vector field $\vec{A}(\vec{r})$ and states that the net flux of a vector field out of a closed surface $S$ enclosing a volume $V$ (see Fig. ??) is equal to the volume integral of the divergence of the field:
\[
\int_V (\nabla \cdot \vec{A})\,d^3r = \oint_S \vec{A} \cdot d\vec{s}. \tag{2.16}
\]
Note that $\vec{A}$ is expected to satisfy certain conditions of continuity, which are usually satisfied by the fields that describe physical quantities. If one thinks of the vector field as describing the velocity of flow of an incompressible fluid, then the divergence theorem essentially states that the net efflux of the fluid through a closed surface must come from sources that produce the fluid in the volume within the surface. In fact, the idea of the divergence as the source of the vector field becomes obvious on taking the limit of an infinitesimal volume:
\[
\nabla \cdot \vec{A} = \lim_{V \to 0} \frac{\oint \vec{A} \cdot d\vec{s}}{V}. \tag{2.17}
\]
There is an interesting aspect to the Divergence theorem. The value of the volume integral of $\nabla \cdot \vec{A}$, which should depend on the behaviour of the vector field in the interior of the volume, turns out to be entirely determined by the values of the field on the closed surface bounding the volume. This surprising fact arises primarily from the continuity of the field within the given volume.
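As an illustration of the Divergence theorem, the following sketch compares the two sides of Eq. (2.16) for a unit cube. The sample field $\vec{A} = (x^2, yz, xz)$, whose divergence is $3x + z$, and the midpoint-rule grids are assumptions chosen for this example only.

```python
def field(x, y, z):
    """Sample vector field A = (x^2, y*z, x*z), chosen only for illustration."""
    return (x * x, y * z, x * z)

def divergence(x, y, z):
    """Divergence of the sample field, worked out by hand: 2x + z + x."""
    return 2.0 * x + z + x

def volume_integral(n=40):
    """Midpoint-rule integral of div A over the unit cube 0 <= x, y, z <= 1."""
    h = 1.0 / n
    total = 0.0
    for i in range(n):
        for j in range(n):
            for k in range(n):
                total += divergence((i + 0.5) * h, (j + 0.5) * h, (k + 0.5) * h)
    return total * h**3

def surface_flux(n=40):
    """Outward flux of A through the six faces of the unit cube."""
    h = 1.0 / n
    total = 0.0
    for i in range(n):
        for j in range(n):
            u, v = (i + 0.5) * h, (j + 0.5) * h
            # Faces x = 1 and x = 0 (outward normals +x and -x).
            total += field(1.0, u, v)[0] - field(0.0, u, v)[0]
            # Faces y = 1 and y = 0.
            total += field(u, 1.0, v)[1] - field(u, 0.0, v)[1]
            # Faces z = 1 and z = 0.
            total += field(u, v, 1.0)[2] - field(u, v, 0.0)[2]
    return total * h * h

volume_side = volume_integral()
surface_side = surface_flux()
print("volume integral of div A:", volume_side)
print("flux through the surface:", surface_side)
```

For this field both integrals can also be done by hand and equal 2, so the two printed numbers should agree with each other and with that value.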
2.2.3 Stokes' Theorem
Another important and often used result is the Stokes theorem, which concerns the curl of a vector field. It relates the surface integral of the curl over an open surface ($S$) to the line integral of the field over the closed loop, $C$, bounding the open surface (see Fig. ??):
\[
\int_S \left( \nabla \times \vec{A}(\vec{r}) \right) \cdot d\vec{s} = \oint_C \vec{A} \cdot d\vec{r}. \tag{2.18}
\]
Here, once again, the vector field is assumed to satisfy the minimal conditions of continuity that are usually satisfied by fields representing most physical quantities. The Stokes theorem also interestingly relates the value of a surface integral of the curl of the vector field to the values of the vector field on the closed curve bounding the surface; thus, it is merely the values of the function on the loop $C$ that determine the integral. There is an important fact that should be pointed out: the loop $C$ bounds an infinite number of possible surfaces. Think about a balloon: the circular curve describing the mouth of the balloon can be taken to be $C$. A sheet stretched across this mouth is one possible surface. If the balloon is now blown out to different shapes and sizes, each of the resulting surfaces is also a possible surface bounded by the closed curve $C$. The really amazing thing about the Stokes theorem is that it holds true for each and every one of them: the flux of the curl through any of them, for a common bounding curve, is identical.
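The balloon picture can be tested numerically. The sketch below is an illustration under assumed inputs: the field $\vec{A} = (-y + z, x, xy)$, whose curl worked out by hand is $(x, 1 - y, 2)$, the unit circle in the X-Y plane as the loop $C$, and two surfaces bounded by it (the flat unit disk and the upper unit hemisphere).

```python
import math

def field_A(x, y, z):
    """Sample vector field A = (-y + z, x, x*y), chosen only for illustration."""
    return (-y + z, x, x * y)

def curl_A(x, y, z):
    """Curl of the sample field, worked out by hand."""
    return (x, 1.0 - y, 2.0)

def loop_integral(n=2000):
    """Line integral of A around the unit circle in the plane z = 0."""
    dt = 2.0 * math.pi / n
    total = 0.0
    for k in range(n):
        t = (k + 0.5) * dt
        ax, ay, _ = field_A(math.cos(t), math.sin(t), 0.0)
        total += (ax * (-math.sin(t)) + ay * math.cos(t)) * dt
    return total

def flux_through_disk(n=200):
    """Flux of curl A through the flat unit disk (normal along +z)."""
    dr, dphi = 1.0 / n, 2.0 * math.pi / n
    total = 0.0
    for i in range(n):
        r = (i + 0.5) * dr
        for j in range(n):
            phi = (j + 0.5) * dphi
            cz = curl_A(r * math.cos(phi), r * math.sin(phi), 0.0)[2]
            total += cz * r * dr * dphi
    return total

def flux_through_hemisphere(n=200):
    """Flux of curl A through the upper unit hemisphere (normal along r-hat)."""
    dtheta, dphi = 0.5 * math.pi / n, 2.0 * math.pi / n
    total = 0.0
    for i in range(n):
        theta = (i + 0.5) * dtheta
        for j in range(n):
            phi = (j + 0.5) * dphi
            normal = (math.sin(theta) * math.cos(phi),
                      math.sin(theta) * math.sin(phi),
                      math.cos(theta))
            c = curl_A(*normal)  # on the unit sphere the position equals the normal
            total += sum(ci * ni for ci, ni in zip(c, normal)) * math.sin(theta) * dtheta * dphi
    return total

loop_value = loop_integral()
disk_value = flux_through_disk()
hemisphere_value = flux_through_hemisphere()
print(loop_value, disk_value, hemisphere_value, 2.0 * math.pi)
```

All three numbers should be close to $2\pi$, the value obtained by hand for this field, even though the disk and the hemisphere are quite different surfaces.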
2.2.4 Helmholtz theorem
Consider a vector field $\vec{A}(\vec{r})$ in a given region of space $V$. The Helmholtz theorem states that if we specify the divergence and the curl of the vector field everywhere within $V$, and additionally specify the normal component of $\vec{A}$ on the bounding surface $S$ of the volume $V$, then the field $\vec{A}$ is uniquely specified. Following this theorem, we can separate the given vector field into two parts:
\[
\vec{A}(\vec{r}) = \vec{A}_D + \vec{A}_R, \tag{2.19}
\]
where $\vec{A}_D$ is an irrotational field with zero curl and only a non-zero divergence, and $\vec{A}_R$ is a divergenceless field with only a non-zero curl, such that $\nabla \cdot \vec{A} = \nabla \cdot \vec{A}_D$ and $\nabla \times \vec{A} = \nabla \times \vec{A}_R$. The Helmholtz theorem is of great help in those situations where we know the divergence and the curl of a vector field. Then we can be sure that there is a unique vector field that has this divergence and this curl, subject to the specification of the boundary terms.
2.3 Curvilinear geometries and coordinates
In our discussions of electromagnetism in this course, we will very often deal with geometries containing cylinders or spheres. The Cartesian coordinate system is not the best suited to handle spherical and cylindrical geometries. In particular, if there are symmetries associated with the problem, such as an invariance with angle or with distance from a given point, the calculations can simplify considerably if other coordinate systems are used. Usually it is simpler to consider coordinate systems with orthogonal axes. Here we will formally introduce and detail the three orthogonal coordinate systems that we will frequently use.
2.3.1 The Cartesian coordinate system
This is the coordinate system most familiar to the student. Consider space in three dimensions: let us choose one point and call it the Origin. Now choose three mutually perpendicular axes, which we will call the X, Y and Z axes, that intersect at the Origin (see Fig. ??). We label every point in space by three numbers, $(x, y, z)$, that correspond to the distances from the origin that one would have to travel parallel to the three axes. Three unit vectors $(\hat{x}, \hat{y}, \hat{z})$ are defined in the directions along the three principal axes. Note that these unit vectors are constant vectors and are the same when translated to any given point; this follows from the property that the principal surfaces are planes in this coordinate system (see Fig. ??). By definition, we have $\hat{x} \cdot \hat{y} = \hat{y} \cdot \hat{z} = \hat{z} \cdot \hat{x} = 0$.
We list the following quantities for the sake of completeness and comparison with other coordinate systems:
1. The infinitesimal line element: $d\vec{r} = \hat{x}\,dx + \hat{y}\,dy + \hat{z}\,dz$.
2. The infinitesimal volume element: $d^3r = dx\,dy\,dz$.
3. The infinitesimal surface elements: $d\vec{s}_x = dy\,dz\,\hat{x}$, $d\vec{s}_y = dz\,dx\,\hat{y}$, $d\vec{s}_z = dx\,dy\,\hat{z}$.
2.3.2 Cylindrical coordinate system
This becomes useful when the problem at hand has a preferred axis and when the fields depend primarily on the distance of the point from the preferred axis. In this system, we again label each point in space by three numbers, but only two of them correspond to distances while the third corresponds to an angle. First we take the preferred axis (direction) and call it the Z axis. Choose the origin on this axis, and arbitrarily choose another direction, the X axis, in the plane perpendicular to the Z axis (this is the X-Y plane). Now any point in three-dimensional space can be labelled by the radial distance, $r$, from the Z axis; the angle, $\phi$, that the radius makes with the X axis in the X-Y plane; and the height, $Z$, along the Z axis. This is depicted in Fig. ??. The values of these numbers are confined to the ranges $0 \le r < \infty$, $0 \le \phi < 2\pi$ and $-\infty < Z < \infty$ so that each point has a unique triplet that labels it. The relation to the Cartesian coordinates is
\[
x = r\cos\phi, \quad y = r\sin\phi, \quad z = Z, \tag{2.20}
\]
relations that can be easily inverted.
The unit vectors corresponding to each of these numbers point along the direction of increasing coordinate at each point, as shown in Fig. ??. These are easily related to the Cartesian unit vectors as
\[
\hat{r} = \cos\phi\,\hat{x} + \sin\phi\,\hat{y}, \tag{2.21}
\]
\[
\hat{\phi} = -\sin\phi\,\hat{x} + \cos\phi\,\hat{y}, \tag{2.22}
\]
\[
\hat{Z} = \hat{z}. \tag{2.23}
\]
It can be easily verified that the unit vectors are mutually perpendicular: $\hat{r} \cdot \hat{\phi} = \hat{\phi} \cdot \hat{Z} = \hat{Z} \cdot \hat{r} = 0$. It is clear from the above that the unit vectors change from point to point; in this case they depend on the location through the angle $\phi$. This is unlike the unit vectors in the Cartesian system. Thus, we cannot thoughtlessly move the unit vectors in or out across derivatives and integrals.
Another crucial difference comes from the consideration of the infinitesimal displacements along the three directions. Along the radial and axial directions, the infinitesimal displacements correspond to the changes in the coordinates ($dr$ and $dZ$), which have dimensions of length. Along the $\hat{\phi}$ direction, however, an infinitesimal change in the coordinate ($d\phi$) is an angle and translates to a length $r\,d\phi$ (see Fig. ??). Thus, there is a scale factor of $r$ that depends on the given point in space. The infinitesimal quantities in this coordinate system are therefore:
1. The infinitesimal line element: $d\vec{r} = \hat{r}\,dr + \hat{\phi}\,r\,d\phi + \hat{Z}\,dZ$.
2. The infinitesimal volume element: $d^3r = dr\;r\,d\phi\;dZ$.
3. The infinitesimal surface elements: $d\vec{s}_r = r\,d\phi\,dZ\,\hat{r}$, $d\vec{s}_\phi = dZ\,dr\,\hat{\phi}$, $d\vec{s}_Z = dr\;r\,d\phi\,\hat{Z}$.
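The relations between the cylindrical and Cartesian descriptions can be written as a few short functions. This is a minimal sketch with hypothetical function names; the inverse transformation uses the two-argument arctangent so that the angle lands in the correct quadrant and in the range $0 \le \phi < 2\pi$.

```python
import math

def cylindrical_to_cartesian(r, phi, z):
    """Eq. (2.20): x = r cos(phi), y = r sin(phi), z = Z."""
    return (r * math.cos(phi), r * math.sin(phi), z)

def cartesian_to_cylindrical(x, y, z):
    """Inverse of Eq. (2.20), with phi reduced to the range [0, 2*pi)."""
    r = math.hypot(x, y)
    phi = math.atan2(y, x) % (2.0 * math.pi)
    return (r, phi, z)

def cylindrical_unit_vectors(phi):
    """Eqs. (2.21)-(2.23): Cartesian components of r-hat, phi-hat and Z-hat."""
    r_hat = (math.cos(phi), math.sin(phi), 0.0)
    phi_hat = (-math.sin(phi), math.cos(phi), 0.0)
    z_hat = (0.0, 0.0, 1.0)
    return r_hat, phi_hat, z_hat

def dot(a, b):
    """Scalar product of two three-component vectors."""
    return sum(ai * bi for ai, bi in zip(a, b))

# A sample point in the third quadrant of the X-Y plane.
r, phi, z = cartesian_to_cylindrical(-1.0, -1.0, 2.0)
r_hat, phi_hat, z_hat = cylindrical_unit_vectors(phi)

print("r, phi, Z:", r, phi, z)
print("back to Cartesian:", cylindrical_to_cartesian(r, phi, z))
print("dot products:", dot(r_hat, phi_hat), dot(phi_hat, z_hat), dot(z_hat, r_hat))
```

The unit vectors returned depend on $\phi$, which is the point made above: they cannot be treated as constants inside derivatives or integrals.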
2.3.3 Spherical coordinates
When a given problem has complete angular symmetry, i.e., when no direction is preferable over any other, the spherical coordinate system is very useful. Typically, all properties of the system depend only on the distance from a specific point in space, which we choose to be the origin. Now we arbitrarily choose an axis, the z axis, and an x axis in the plane perpendicular to the z axis and containing the origin. Any point in (three-dimensional) space can then be labelled uniquely by a triplet of numbers: the radial distance ($r$) to the origin; an angle $\theta$ between the radial line joining the origin to the given point and the chosen z axis; and another angle $\phi$ between the projection of the radial line onto the X-Y plane and the chosen x axis (see Fig. ??). The values of these numbers are confined to the ranges $0 \le r < \infty$, $0 \le \theta \le \pi$ and $0 \le \phi < 2\pi$ so that each point corresponds to a unique triplet that labels it. The relation to the Cartesian coordinates is
\[
x = r\sin\theta\cos\phi, \quad y = r\sin\theta\sin\phi, \quad z = r\cos\theta, \tag{2.24}
\]
relations that can be easily inverted as
\[
r = (x^2 + y^2 + z^2)^{1/2}, \quad \theta = \cos^{-1}\left( \frac{z}{\sqrt{x^2 + y^2 + z^2}} \right), \quad \phi = \tan^{-1}\left( \frac{y}{x} \right). \tag{2.25}
\]
The unit vectors corresponding to each of these numbers point along the direction of increasing coordinate at each point, as shown in Fig. ??. These are easily related to the Cartesian unit vectors as
\[
\hat{r} = \sin\theta\cos\phi\,\hat{x} + \sin\theta\sin\phi\,\hat{y} + \cos\theta\,\hat{z}, \tag{2.26}
\]
\[
\hat{\theta} = \cos\theta\cos\phi\,\hat{x} + \cos\theta\sin\phi\,\hat{y} - \sin\theta\,\hat{z}, \tag{2.27}
\]
\[
\hat{\phi} = -\sin\phi\,\hat{x} + \cos\phi\,\hat{y}. \tag{2.28}
\]
As in the case of the cylindrical system, these unit vectors point in different directions at different points. This is a general property of all curvilinear coordinate systems, which we will briefly discuss later. Hence one has to be careful while differentiating or integrating expressions containing these unit vectors.
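The infinitesimal quantities in this system are:
1. The infinitesimal line element: $d\vec{r} = \hat{r}\,dr + \hat{\theta}\,r\,d\theta + \hat{\phi}\,r\sin\theta\,d\phi$.
2. The infinitesimal volume element: $d^3r = dr\;r\,d\theta\;r\sin\theta\,d\phi$.
3. The infinitesimal surface elements: $d\vec{s}_r = r\,d\theta\;r\sin\theta\,d\phi\,\hat{r}$, $d\vec{s}_\theta = dr\;r\sin\theta\,d\phi\,\hat{\theta}$, $d\vec{s}_\phi = dr\;r\,d\theta\,\hat{\phi}$.
It can be seen that the scale factor $r$ multiplies the infinitesimal change $d\theta$ to give an infinitesimal length $r\,d\theta$ along $\hat{\theta}$. Similarly, a scale factor $r\sin\theta$ (the projected length of the radial vector on the X-Y plane) accompanies the infinitesimal quantity $d\phi$ to give an infinitesimal length $r\sin\theta\,d\phi$ along $\hat{\phi}$.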
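The spherical relations can be sketched in code in the same way. The function names below are hypothetical. Two practical points are assumptions of this sketch rather than statements from the notes: the two-argument arctangent is used instead of $\tan^{-1}(y/x)$ so that $\phi$ falls in the correct quadrant, and the volume element $dr\;r\,d\theta\;r\sin\theta\,d\phi$ is checked by summing it over a sphere.

```python
import math

def spherical_to_cartesian(r, theta, phi):
    """Eq. (2.24)."""
    return (r * math.sin(theta) * math.cos(phi),
            r * math.sin(theta) * math.sin(phi),
            r * math.cos(theta))

def cartesian_to_spherical(x, y, z):
    """Eq. (2.25), with atan2 used so that phi lies in [0, 2*pi). Requires r > 0."""
    r = math.sqrt(x * x + y * y + z * z)
    theta = math.acos(z / r)
    phi = math.atan2(y, x) % (2.0 * math.pi)
    return (r, theta, phi)

def sphere_volume(radius, n=60):
    """Sum of the volume element dr * r dtheta * r sin(theta) dphi over a sphere.

    The integrand does not depend on phi, so that integral contributes 2*pi.
    """
    dr, dtheta = radius / n, math.pi / n
    total = 0.0
    for i in range(n):
        r = (i + 0.5) * dr
        for j in range(n):
            theta = (j + 0.5) * dtheta
            total += r * r * math.sin(theta) * dr * dtheta * (2.0 * math.pi)
    return total

# A sample point with x < 0, where a plain arctangent of y/x gives the wrong quadrant.
point = (-1.0, 1.0, 1.0)
r, theta, phi = cartesian_to_spherical(*point)
naive_phi = math.atan(point[1] / point[0])

print("r, theta, phi:", r, theta, phi)
print("plain arctangent of y/x:", naive_phi)
print("round trip:", spherical_to_cartesian(r, theta, phi))
print("sphere volume:", sphere_volume(2.0), "expected:", 4.0 / 3.0 * math.pi * 2.0**3)
```

For the sample point, `phi` comes out in the second quadrant while the plain arctangent returns an angle in the fourth, which is why the quadrant has to be fixed by hand when Eq. (2.25) is used literally.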
2.3.4 An orthogonal curvilinear coordinate system
We will not discuss general curvilinear coordinate systems in detail, but will only list some results that can be written down for orthogonal coordinate systems. For details, we refer the reader to Ref. ??. Consider an invertible mapping from the Cartesian coordinates to the coordinate system $(u_1, u_2, u_3)$ given by the functions
\[
u_1 = u_1(x, y, z), \quad u_2 = u_2(x, y, z), \quad u_3 = u_3(x, y, z). \tag{2.29}
\]
It can be shown that the unit vectors are given by
\[
\hat{u}_i = \frac{\nabla u_i}{|\nabla u_i|}. \tag{2.30}
\]
The infinitesimal displacement vector can be written as
\[
d\vec{r} = h_1\,du_1\,\hat{u}_1 + h_2\,du_2\,\hat{u}_2 + h_3\,du_3\,\hat{u}_3, \tag{2.31}
\]
where the scale factors $h_i$ are given by
\[
h_i^2 = |\nabla u_i|^{-2}. \tag{2.32}
\]
Now the infinitesimal volume is written as
\[
d^3r = h_1 h_2 h_3\,du_1\,du_2\,du_3. \tag{2.33}
\]
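In general, we can also write down expressions for the gradient, divergence and curl in the generalized coordinates using the scale factors:
\[
\nabla f = \hat{u}_1\,\frac{1}{h_1}\frac{\partial f}{\partial u_1} + \hat{u}_2\,\frac{1}{h_2}\frac{\partial f}{\partial u_2} + \hat{u}_3\,\frac{1}{h_3}\frac{\partial f}{\partial u_3}, \tag{2.34}
\]
\[
\nabla \cdot \vec{A} = \frac{1}{h_1 h_2 h_3} \left[ \frac{\partial (A_1 h_2 h_3)}{\partial u_1} + \frac{\partial (A_2 h_3 h_1)}{\partial u_2} + \frac{\partial (A_3 h_1 h_2)}{\partial u_3} \right], \tag{2.35}
\]
\[
\nabla \times \vec{A} = \frac{1}{h_1 h_2 h_3} \begin{vmatrix} h_1\hat{u}_1 & h_2\hat{u}_2 & h_3\hat{u}_3 \\ \partial/\partial u_1 & \partial/\partial u_2 & \partial/\partial u_3 \\ h_1 A_1 & h_2 A_2 & h_3 A_3 \end{vmatrix}. \tag{2.36}
\]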
It is easily seen that the scale factors for the cylindrical coordinates are given by
\[
h_r = 1, \quad h_\phi = r, \quad h_Z = 1, \tag{2.37}
\]
and for the spherical coordinates they are given by
\[
h_r = 1, \quad h_\theta = r, \quad h_\phi = r\sin\theta. \tag{2.38}
\]
Knowledge of the scale factors enables us to carry out all the calculations on vector fields in any desired coordinate system.
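The scale factors quoted in Eqs. (2.37) and (2.38) can be recovered numerically. The sketch below assumes the standard equivalent definition $h_i = |\partial \vec{r}/\partial u_i|$ (the length moved per unit change of the coordinate $u_i$), which is a choice made for this example; the function names and the sample points are hypothetical.

```python
import math

def cylindrical_position(r, phi, z):
    """Cartesian position of the point with cylindrical coordinates (r, phi, Z)."""
    return (r * math.cos(phi), r * math.sin(phi), z)

def spherical_position(r, theta, phi):
    """Cartesian position of the point with spherical coordinates (r, theta, phi)."""
    return (r * math.sin(theta) * math.cos(phi),
            r * math.sin(theta) * math.sin(phi),
            r * math.cos(theta))

def scale_factors(position, coordinates, step=1e-6):
    """Estimate h_i = |d(position)/d(u_i)| by central differences."""
    factors = []
    for i in range(3):
        up = list(coordinates)
        down = list(coordinates)
        up[i] += step
        down[i] -= step
        factors.append(math.dist(position(*up), position(*down)) / (2.0 * step))
    return factors

cylindrical_h = scale_factors(cylindrical_position, (1.5, 0.4, -2.0))
spherical_h = scale_factors(spherical_position, (2.0, 0.7, 1.1))

print("cylindrical (expect 1, r, 1):        ", cylindrical_h)
print("spherical (expect 1, r, r sin theta):", spherical_h)
```

With the sample points used here, the cylindrical factors should come out close to $(1, 1.5, 1)$ and the spherical ones close to $(1, 2, 2\sin 0.7)$.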
2.4 The Dirac $\delta$ function
Consider the function
\[
f(x) = \begin{cases} \dfrac{1}{2w}, & |x| < w, \\ 0, & |x| > w. \end{cases} \tag{2.39}
\]
It is evident that the integral $\int_{-L}^{L} f(x)\,dx = 1$ if $L > w$. Now examine this function in the limit $w \to 0$. It is clear that the function is zero everywhere except at the single point $x = 0$, where it diverges, and yet the integral is exactly unity. This is contrary to our usual understanding of (Riemann) integrals, where the value of the integral is zero unless the integration range has a non-zero width. In other words, a single point usually has zero measure. Yet this mathematical object, which results from a well-defined function in the limit $w \to 0$, has a non-zero integral.
We will often face such mathematical objects in our study of electromagnetism. Consider the following definition
\[
\delta(x - x_0) = \begin{cases} 0, & x \neq x_0, \\ \infty, & x = x_0, \end{cases} \tag{2.40}
\]
such that the integral
\[
\int_a^b \delta(x - x_0)\,dx = 1 \tag{2.41}
\]
if the interval $[a, b]$ includes the singular point $x_0$, and is zero otherwise. It is simple to show that $\delta(x - a) = \delta(a - x)$. Note that the principal properties of this object derive from the integral. The above mathematical construct was first formally discussed in connection with quantum mechanics by the physicist Dirac, and it is called the Dirac $\delta$ function. Although we call this object a function, it is not a function in the conventional sense and belongs to a generalized class called distributions by mathematicians.
The Dirac $\delta$ function can work as a sieve to pick out the values of functions at specific points. It is easily seen that
\[
\int_a^b \delta(x - x_0)\,f(x)\,dx = \lim_{\epsilon \to 0} \int_{x_0 - \epsilon}^{x_0 + \epsilon} \delta(x - x_0)\,f(x)\,dx = f(x_0), \tag{2.42}
\]
where $f(x)$ is an ordinary continuous function and $x_0$ lies within $[a, b]$. Sometimes it is convenient to work with certain functions that become a $\delta$ function in limiting cases. The rectangular function presented above is one example. Other possible examples include a Gaussian function
\[
\delta(x - x_0) = \lim_{\sigma \to 0} \frac{1}{\sqrt{2\pi}\,\sigma} \exp\left[ -\frac{(x - x_0)^2}{2\sigma^2} \right], \tag{2.43}
\]
and a Lorentzian function
\[
\delta(x - x_0) = \lim_{\sigma \to 0} \frac{1}{\pi}\,\frac{\sigma}{(x - x_0)^2 + \sigma^2}. \tag{2.44}
\]
In both of the cases above, $\sigma$ is linearly proportional to the full width of the functions at which the value of the singly peaked function falls to half its peak value. As this width falls to zero in the limit, the peak value rises, keeping the value of the integral constant (unity). The $\delta$ function can also be interpreted as the derivative of a step function at the point of discontinuity.
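The sifting action of a narrowing peak can be seen numerically. The sketch below is an illustration under assumed inputs: it uses the normalized Gaussian $\frac{1}{\sqrt{2\pi}\sigma}\exp[-(x - x_0)^2/2\sigma^2]$ as the approximating function, $\cos x$ as the test function, and a simple midpoint rule on the interval $[-5, 5]$.

```python
import math

def gaussian_delta(x, x0, sigma):
    """Normalized Gaussian of width sigma centred at x0."""
    norm = 1.0 / (math.sqrt(2.0 * math.pi) * sigma)
    return norm * math.exp(-((x - x0) ** 2) / (2.0 * sigma**2))

def sift(func, x0, sigma, a=-5.0, b=5.0, n=20000):
    """Midpoint-rule estimate of the integral of gaussian_delta(x) * func(x) over [a, b]."""
    h = (b - a) / n
    total = 0.0
    for k in range(n):
        x = a + (k + 0.5) * h
        total += gaussian_delta(x, x0, sigma) * func(x)
    return total * h

x0 = 0.3
widths = (0.5, 0.1, 0.02)
errors = [abs(sift(math.cos, x0, sigma) - math.cos(x0)) for sigma in widths]

for sigma, error in zip(widths, errors):
    print("sigma =", sigma, " |integral - f(x0)| =", error)
```

As the width shrinks, the integral approaches $f(x_0) = \cos(0.3)$, which is the behaviour expected of the $\delta$ function in the limit.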
Consider the Heaviside step function
\[
\Theta(x - x_0) = \begin{cases} 1, & x > x_0, \\ 0, & x < x_0; \end{cases} \tag{2.45}
\]
its derivative can be shown to be a $\delta$ function. We can do this by showing that it has the defining property of the $\delta$ function. The derivative is clearly zero everywhere except at $x_0$, and for an arbitrary function that is continuous at $x_0$, the integral
\[
\begin{aligned}
\int_a^b f(x)\,\frac{d}{dx}\Theta(x - x_0)\,dx &= \left[ f(x)\,\Theta(x - x_0) \right]_a^b - \int_a^b \Theta(x - x_0)\,\frac{df}{dx}\,dx \\
&= f(b) - \int_{x_0}^b \frac{df}{dx}\,dx \\
&= f(b) - [f(b) - f(x_0)] = f(x_0),
\end{aligned} \tag{2.46}
\]
where the interval $[a, b]$ is assumed to contain the point $x_0$, and we have integrated by parts. Clearly the derivative of the step function has all the essential properties of the $\delta$ function.
The idea of the $\delta$ function as a point of singularity with a finite integral is easily extended to higher dimensions. In three-dimensional space, we have the integral
\[
\int_V \delta(\vec{r} - \vec{r}_0)\,d^3r = 1 \tag{2.47}
\]
if the integration volume $V$ contains the point $\vec{r}_0$, and zero otherwise. In Cartesian coordinates, it is straightforward to represent the $\delta$ function as a product of one-dimensional $\delta$ functions,
\[
\delta(\vec{r} - \vec{r}_0) = \delta(x - x_0)\,\delta(y - y_0)\,\delta(z - z_0). \tag{2.48}
\]
The representation of higher-dimensional $\delta$ functions in other coordinate systems is discussed below. Note that the one-dimensional $\delta$ function has dimensions of inverse length. Hence the three-dimensional $\delta$ function has dimensions of inverse volume. This gives rise to the interpretation that the $\delta$ function is effectively a density.
In general curvilinear coordinate systems, the infinitesimal volumes depend on the point, and it becomes important to normalize the $\delta$ function to account for this change in the density. For a $\delta$ function located at the point $(u_1', u_2', u_3')$, we write
\[
\delta(\vec{r} - \vec{r}') = \frac{1}{h_1 h_2 h_3}\,\delta(u_1 - u_1')\,\delta(u_2 - u_2')\,\delta(u_3 - u_3'). \tag{2.49}
\]
Unless properly normalized, the $\delta$ function would have different weights depending on where it is placed. Overall, the $\delta$ function should be defined such that the integral over a volume containing the point where the singularity is located yields unity. Thus, in spherical coordinates the $\delta$ function would be written as
\[
\delta(\vec{r} - \vec{r}') = \frac{1}{r^2 \sin\theta}\,\delta(r - r')\,\delta(\theta - \theta')\,\delta(\phi - \phi'). \tag{2.50}
\]
Special mention must be made of singular points such as the origin, or points on the z axis, where the spherical coordinates $\theta$ and $\phi$ may become ill-defined, i.e., the point is multiply described by the curvilinear coordinates. In such cases, if the coordinate $u_3$ multiply describes the point where the $\delta$ function is located, there will be no factor such as $\delta(u_3 - u_3')$ in the representation of the $\delta$ function, since the value of $u_3'$ would be non-unique and ill-defined. Hence the representation in the curvilinear coordinate system would only appear as
\[
\delta(\vec{r} - \vec{r}') = \frac{1}{h_1 h_2 \int_a^b h_3\,du_3}\,\delta(u_1 - u_1')\,\delta(u_2 - u_2'), \tag{2.51}
\]
where the integral runs over the range $[a, b]$ of the coordinate $u_3$.
EXAMPLES:
1. Consider a point charge $q$ located at the origin. In cylindrical coordinates, the corresponding charge density would be described as
\[
\rho(\vec{r}) = q\,\frac{1}{2\pi r}\,\delta(r)\,\delta(z).
\]
In spherical coordinates, the representation would be
\[
\rho(\vec{r}) = q\,\frac{1}{4\pi r^2}\,\delta(r).
\]
2. Consider a thin charged disk of radius $R$, carrying a charge per unit area of $\sigma$ and lying in the X-Y plane. The volume charge density can be represented in cylindrical coordinates as
\[
\rho(\vec{r}) = \sigma\,\delta(z)\,\Theta(R - r),
\]
while the representation in spherical coordinates is
\[
\rho(\vec{r}) = \sigma\,\frac{1}{r}\,\delta(\theta - \pi/2)\,\Theta(R - r).
\]
Note that the Heaviside step function has been used to confine the charge to a radius smaller than $R$.
3. Consider a line charge with linear charge density $\lambda$ per unit length, located along the Z axis. In Cartesian coordinates, this is easily represented as
\[
\rho(\vec{r}) = \lambda\,\delta(x)\,\delta(y),
\]
while in cylindrical coordinates, we can write
\[
\rho(\vec{r}) = \lambda\,\frac{1}{2\pi r}\,\delta(r),
\]
and in spherical coordinates, the representation would be
\[
\rho(\vec{r}) = \lambda\,\frac{1}{2\pi r^2 \sin\theta}\,[\delta(\theta) + \delta(\theta - \pi)].
\]
3 Static charges, electric field and electric potential
3.1 Concept of the electric field
Coulomb's law gives the force exerted by one charge ($q_1$) on another charge ($q_2$) as
\[
\vec{F} = \frac{1}{4\pi\epsilon_0}\,\frac{q_1 q_2\,(\vec{r}_2 - \vec{r}_1)}{|\vec{r}_2 - \vec{r}_1|^3}, \tag{3.1}
\]
where $\vec{r}_1$ is the position vector of the charge $q_1$ and $\vec{r}_2$ is the position vector of the charge $q_2$. This is essentially action at a distance, whereby the force due to the first charge is felt instantaneously by the second charge. However, we know from the theory of special relativity that if one of the charges is moved suddenly, the information about the changed location, and hence the changed force, cannot be felt immediately. This information can only be known after a minimum time of $r_{21}/c$, which is the time that light takes to travel from one point to the other. Of course, we could argue that when we are discussing electrostatics, one cannot talk of time-dependent dynamic phenomena. However, this point makes us realize that if we took the view that the forces between the charges are instantaneous, then this view would need to be changed once we began to talk of time-dependent phenomena.
There is an alternative and more fruitful viewpoint to take in this respect. We can think of a charge as affecting the space around itself such that any other charge feels a force when placed in the space influenced by it. This property of the space around the point charge is called the Electric field. It is defined as the force felt by a unit charge. Hence, from Coulomb's law, we can write that the electric field at a point $\vec{r}$ due to a point charge $q_1$ located at $\vec{r}'$ is
\[
\vec{E}(\vec{r}) = \frac{1}{4\pi\epsilon_0}\,\frac{q_1\,(\vec{r} - \vec{r}')}{|\vec{r} - \vec{r}'|^3}. \tag{3.2}
\]
This is called the Coulomb field of a point charge. Once the electric field $\vec{E}(\vec{r})$ in a given region is known, the force felt by a (test) charge in the given region is given by
\[
\vec{F} = q_{\text{test}}\,\vec{E}(\vec{r}). \tag{3.3}
\]
It is assumed that the placement of the test charge in that region does not affect the charge configuration that gives rise to the electric field. For example, it should not cause the first charge to move; we have assumed that the first charge is fixed at the location $\vec{r}'$ in the above equations. If the test charge is small enough, then such assumptions are usually valid.
3.1.1 Superposition of electric fields and Charge distributions
Since the Coulomb forces due to various charges superpose (this is an experimental fact), it follows straightforwardly that the electric fields superpose linearly too. If $\vec{E}_1(\vec{r})$ is the electric field at the location $\vec{r}$ due to a point charge $q_1$, $\vec{E}_2(\vec{r})$ is the electric field due to a point charge $q_2$, and so on, then the net electric field at $\vec{r}$ is given by
\[
\vec{E}(\vec{r}) = \vec{E}_1(\vec{r}) + \vec{E}_2(\vec{r}) + \vec{E}_3(\vec{r}) + \cdots, \tag{3.4}
\]
i.e., it is the vector sum of the electric fields produced by the individual charges. Taking the location of the point charge $q_i$ to be $\vec{r}_i$, we can use the Coulomb law in the superposition above and write the net electric field from a set of $n$ point charges as
\[
\vec{E}(\vec{r}) = \sum_{i=1}^{n} \frac{1}{4\pi\epsilon_0}\,\frac{q_i\,(\vec{r} - \vec{r}_i)}{|\vec{r} - \vec{r}_i|^3}. \tag{3.5}
\]
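Eq. (3.5) translates directly into a short routine. The following is a minimal sketch in SI units; the function name, the representation of the charges as `(q, position)` pairs and the sample configuration are illustrative assumptions.

```python
import math

EPSILON_0 = 8.8541878128e-12  # vacuum permittivity in F/m
COULOMB_CONSTANT = 1.0 / (4.0 * math.pi * EPSILON_0)

def electric_field(point, charges):
    """Net field at `point` from a list of (q, position) pairs, following Eq. (3.5)."""
    field = [0.0, 0.0, 0.0]
    for q, position in charges:
        separation = [p - s for p, s in zip(point, position)]
        distance = math.sqrt(sum(c * c for c in separation))
        if distance == 0.0:
            raise ValueError("field point coincides with a point charge")
        for axis in range(3):
            field[axis] += COULOMB_CONSTANT * q * separation[axis] / distance**3
    return tuple(field)

# Two equal charges of 1 nC placed symmetrically on the x axis (hypothetical values).
charges = [(1e-9, (1.0, 0.0, 0.0)), (1e-9, (-1.0, 0.0, 0.0))]
field = electric_field((0.0, 0.0, 1.0), charges)
print("E at (0, 0, 1):", field)
```

For this symmetric configuration the x components of the two contributions cancel and only a z component survives, as expected from the vector sum.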
Next, we ask whether a point charge is actually found in nature or whether it is a mere idealization, like a point mass. We believe that electrons (as well as positrons and other leptons) are truly point charges; this is because we have never found any internal structure to the electron in any of our experiments$^1$. However, in most of our everyday experiments and devices, many thousands, if not millions or billions or more, of electrons are involved. Further, we never try to measure the charge at a given point. Usually the question we ask is: how much charge is present within a given volume? The volume depends on the resolution of the measurement we perform; although we can improve the spatial resolution by better measurement methods and techniques, the measurement volume is never really zero and is in fact large compared to atomic volumes in most cases. Thus, it makes better sense to talk of average charge densities at a slightly more coarse-grained level, where the graininess of the electron and the electronic charge is not really visible. For example, the volume of a typical capacitor is several cubic millimetres, and the charge density in that volume can be considered to vary almost continuously. This is much like a fluid, where the graininess of the atomic structure making up the fluid is averaged out by the very large number of atoms involved.
$^1$ These experiments involve colliding electrons against electrons at very high energies and observing the energies of the scattered electrons and other particles that are produced in the collision. The internal structure, if any, can be deduced from the variation of these energies and particles with direction.
In the above spirit, we will now consider continuous distributions of charge, where the amount of charge in a small (infinitesimal) volume around a given point (given by the position vector $\vec{r}$) is defined as
\[
dq = \rho(\vec{r})\,d^3r, \tag{3.6}
\]
where $\rho(\vec{r})$ is a smooth continuous function describing the charge density in space. It is apparent from the above definition that the charge density of a point charge would be singular. From our discussion of the Dirac $\delta$ function in Chapter 2, it is clear that the charge distribution of a point charge is a $\delta$ function at the given location ($\vec{r}'$) weighted by the magnitude of the charge: $q\,\delta(\vec{r} - \vec{r}')$. In most other physical macroscopic charge distributions, we have a continuous charge density that avoids all such problems.
Now we can generalize our expression for the electric field of a set of discrete charges to a continuous (but finite) charge distribution, as an integral over the charge density:
\[
\vec{E}(\vec{r}) = \frac{1}{4\pi\epsilon_0} \int \frac{\rho(\vec{r}')\,(\vec{r} - \vec{r}')}{|\vec{r} - \vec{r}'|^3}\,d^3r'. \tag{3.7}
\]
Note that the integral has to be carried out over the volume where the charges are present. This integral can be carried out analytically in a few situations. But in most cases, the integrals become extremely cumbersome to evaluate and have to be carried out numerically or by alternative methods that we will learn a bit later. Further, note that in many experiments, specifying the charge density is quite difficult, as we will learn in due course.
EXAMPLE: Consider a uniformly charged thin disk of radius $a$ and total charge $q$. We will find the electric field at points on the axis of the disk. We take the z axis to lie along the axis of the disk, which is taken to lie in the X-Y plane. The charge density of the disk can be written as $\rho(\vec{r}) = \frac{q}{\pi a^2}\,\delta(z)\,\Theta(a - r)$ in cylindrical coordinates. The electric field at points on the Z axis (taking $z > 0$) then comes out to be (see Fig. ??)
\[
\begin{aligned}
\vec{E}(z) &= \frac{1}{4\pi\epsilon_0} \iiint \frac{q}{\pi a^2}\,\frac{\delta(z')\,\Theta(a - r')\,(z\hat{z} - r'\hat{r}')}{(r'^2 + z^2)^{3/2}}\;r'\,dz'\,dr'\,d\phi' \\
&= \frac{q}{4\pi^2 a^2 \epsilon_0}\,2\pi z\,\hat{z} \int_0^a \frac{r'\,dr'}{(r'^2 + z^2)^{3/2}} \\
&= \frac{q}{2\pi a^2 \epsilon_0}\,\hat{z} \left[ 1 - \frac{z}{(z^2 + a^2)^{1/2}} \right],
\end{aligned} \tag{3.8}
\]
where we have written out the integral in cylindrical coordinates and used $\int_0^{2\pi} \hat{r}'\,d\phi' = 0$.
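The closed form in Eq. (3.8) can be compared with a direct numerical evaluation of the integral in Eq. (3.7) over the disk. The sketch below is an illustration only: the function names, the grid sizes and the sample values of $q$, $a$ and $z$ are assumptions, and the closed form is used for $z > 0$.

```python
import math

EPSILON_0 = 8.8541878128e-12  # vacuum permittivity in F/m

def disk_field_numeric(q, a, z, n_r=400, n_phi=64):
    """Midpoint-rule sum over the disk; returns (E_x, E_z) at the point (0, 0, z)."""
    surface_density = q / (math.pi * a * a)
    dr = a / n_r
    dphi = 2.0 * math.pi / n_phi
    e_x = 0.0
    e_z = 0.0
    for i in range(n_r):
        r = (i + 0.5) * dr
        # Charge of the surface element divided by the cube of its distance to the field point.
        weight = surface_density * r * dr * dphi / (r * r + z * z) ** 1.5
        for j in range(n_phi):
            phi = (j + 0.5) * dphi
            e_x += -r * math.cos(phi) * weight  # radial part, expected to cancel
            e_z += z * weight
    k = 1.0 / (4.0 * math.pi * EPSILON_0)
    return (k * e_x, k * e_z)

def disk_field_exact(q, a, z):
    """z component of Eq. (3.8), valid for z > 0."""
    return q / (2.0 * math.pi * a * a * EPSILON_0) * (1.0 - z / math.sqrt(z * z + a * a))

# Hypothetical sample values: 1 nC on a disk of radius 5 cm, field point 2 cm above it.
q, a, z = 1e-9, 0.05, 0.02
e_x, e_z = disk_field_numeric(q, a, z)
exact = disk_field_exact(q, a, z)
print("numerical E_z:", e_z, " closed form:", exact, " E_x:", e_x)
```

The x component should vanish to rounding error, reflecting the cancellation of the radial contributions, while the z component should match the closed form closely.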
3.2 Gauss's law of Electrostatics
3.3 The curl of the electric field
3.3.1 The electrostatic Potential
3.4 Energy associated with charge distributions
3.4.1 Problems regarding energy of a point charge
3.5 Conductors and capacitance
3.5.1 A cavity within a conductor
3.5.2 Surface charge on a conductor
3.5.3 Surface force on a conductor
3.5.4 Energy of a capacitor
4 Calculating the Electric field and potential
4.1 Poisson and Laplace equations: Boundary Value problems
4.2 Boundary conditions on the electrostatic field across charged surfaces
4.3 Uniqueness theorems
4.4 The method of images
4.4.1 A conducting infinite plane
4.4.2 A conducting sphere
4.5 Sorting out some important problems
4.5.1 A conducting sphere placed in a homogeneous electric field
4.5.2 Screening: fields within an atom
4.5.3 Why is a colloid suspension stable?
4.5.4 Oscillations of a plasma
5 Approximate description of the electric field at far-off points
5.1 Multipole expansions
5.2 Fields and potential of dipoles
5.3 Force, torque and energy associated with a dipole in electrostatic fields
6 Electrostatics of material media
6.1 Ideas of homogenization of the electric field in a material medium
6.2 Bound charges, polarizability and macroscopic polarization
6.3 Electric field inside a material medium and the displacement field
6.4 Dielectric susceptibility and dielectric permittivity
6.5 The electrostatic energy inside a dielectric medium
6.6 Sorting out some important problems
6.6.1 The electric fields of a uniformly polarized dielectric sphere
6.6.2 A dielectric sphere placed in a homogeneous electric field
6.6.3 Making a composite material with high dielectric permittivity
6.6.4 Fields of a point charge placed near a flat, large dielectric medium
6.6.5 Placing a dielectric inside a capacitor
7 Magnetic fields
7.1 Biot-Savart law for the magnetic field
7.1.1 Experimental verification of Biot-Savart law
7.2 Calculating fields for simple current configurations
7.3 Current distributions and magnetic fields for distributed currents
7.4 Forces between current carrying conductors
7.5 Divergence and curl of the magnetic field and Ampere's Law
7.6 The magnetic vector potential
7.7 Boundary conditions on magnetic fields across current sheets
8 Forces on a charge in electric and magnetic fields: The Lorentz force
8.1 The Cyclotron frequency
8.2 Motion in parallel electric and magnetic fields
8.3 Motion in crossed electric and magnetic fields
8.3.1 Confining and focussing charges
9 Magnetostatics of material media
9.1 Multipole expansions for the magnetic field
9.2 Magnetic field and magnetic vector potential of a magnetic dipole
9.3 Magnetic fields and the $\vec{H}$ field in a material medium
9.4 Basic ideas of para-, dia-, ferro- magnetic media
9.5 Magnetic fields of magnetized objects
Notes
Author index
Subject index
