We will begin today with the problem of charged
particle in an electromagnetic field. But before I do that, I would like to make sure
that we have clear, our ideas about basic electricity and magnetism. So I am going to
start with a bit of lengthy digression fifteen minutes to start with on basic electricity
and magnetism. It might go into fifty minutes, but it depends on what you already know and
what you do not. So let us start with the basic equations of
electromagnetism. And here is E and M in a nutshell. Let us do that first; E and M after
a lot of experimentation, it turns out that all of electricity and magnetism at the classical
level can actually be compressed into four equations, the four Maxwell equations. So
let us write them down and then let us see what we can do from them. So I start of by
saying the first equation is, the divergence of the electric field. This is the charge
density. I will give standard international units so it is rho over epsilon naught. The
second equation is the divergence of B that is equal to zero. The curl of E, this is equal
to minus delta B over delta t; and the curl of B, which is mu naught j plus mu naught
epsilon naught delta E over delta t. These are the four Maxwell equations in free space.
I am going to only deal with the free space case. As you can see they are two equations
for the divergence of the electric and magnetic fields and two equations for curl of these
two electric fields. Now the first question, I am going to do this
in a sarcastic method. So speak up, give loudly your answers, loud and clear. Is this equation
valid, even if this field is time dependent?. Yes indeed, it is. So these are actually field's
space and time. So this equation is valid for arbitrary electric fields and similarly
this too. How about this equation?. Is it true only for steady currents or is it true
only for steady magnetic fields?. Is it clear from the right hand side that, it should be
time dependent. Can the field change from point to point, is this true in general? Yes
indeed, it is so. These are field equations. Therefore, this is a function of r and t and
this is a function of r and t as well. And similarly here too completely arbitrary space
time dependent quantities here. And these are the four Maxwell's equations. I am going
to ask them series of questions about these equations. The first of them is, how many
unknowns are there? What are the known and what are the unknowns? rho and j are the sources
of the electromagnetic field. They are supposed to be specified. And you are supposed to calculate
E and B. How many unknowns are there in E and B? Six unknowns. And how many equations
are there? Well, if you have six equations and four unknowns. You are in deep trouble.
We do not know anything yet. This is all we know. These are the equations and this is
crystallized wisdom starting from Coulomb's law in a generalized form leads to this equation.
The fact that, there are no magnetic monopoles leads to this equation. This is Faradays law
of electromagnetic induction and this is the Ampere Maxwell's theorem.
In the absence of electric fields, we change with time. And for steady currents, this was
the original Ampere theorem and now it is Ampere Maxwell's theorem with this displacement
current. That is it. These are the equations that I have given to you and how many of them
are there, how many equations are there? There are eight equations, because these two are
vector equations. These are scalar equations. So if you go by components there are eight
equations. But then that is an embarrassment of which is you have eight equations and only
six unknowns. In general, you do not have a solution then,
too many equations, too few unknowns. No reason why there should be any solution at all. And
yet we know there are electric and magnetic fields revolve around us. And these equations
were so, what would you say is happening. There are some redundant equations or at least
there are connections between these equations, which are not apparently. So they really are
six pieces of information for six unknowns, but they are jumbled up and they are mixed
up in a crazy fashion. And we are going to unravel that, going to
see, in what fashion they are really mixed up. So there are really only six independent
equations. And the reason is there is a connection between the j and the rho. The source terms
here. They are not independent of each other. So, as soon as, you say charges cannot be
created or destroyed, there is a connection between j and rho. If you do not specify them independently,
what is the connection between them? The equation of continuity; this is certainly true, in
the absence of sources and sinks del dot j equal to 0.
There is such a connection. There is a similar connection between the magnetic charge density
which happens to be zero; and the magnetic current density which should be sitting here
under zero. So you really have another constraint equation which is equal in saying zero plus
zero is equal to zero and that is a constraint too, because it is not apparently, so you
really have only six pieces of information. We will see this in another way very shortly.
What is the difference, you can see immediately between these two equations on the one hand
and these two on the other. Well, given these equations there is no magnetic
charge and so on. Given these equations, there is a basic intrinsic difference between the
first and the fourth equations on the one hand and the second and third on the other.
There are no sources. These two equations involve the sources and these equations do
not involve sources at all. Therefore these equations are valid independent,
of the sources. All electric and magnetic fields must satisfy these equations. So let
us make that clear by writing this. Bringing this to the left hand side and writing this
plus delta B over delta t equal to 0; and this of course continuous to be zero.
So these two equations slightly different from the other two equations and let us bring
this E to this side. And let me write this E minus mu naught epsilon naught delta E over
delta t. Now there are no sources in these two equations. So there is another way of
saying that, these equations are different. What is that? The right hand side of these
equations are zero. Not so for, other two equations, so what do you call these equations,
in which Michener's equations. These are homogeneous and these are n homogeneous.
Now that we have homogenous equations let us exhaust the contents of these equations
first. Put in all the information that you can get from it and then substitute in the
first and fourth equations. Now what is the first conclusion, when you draw, when you
hear that a vector field B is identically divergence less. What conclusion do you draw
from it? We know that the divergence of a curl must
be zero identity. So you have this vector identity, which says that divergence of the
curl of any vector field, let me call it, V is equal to zero. This is identically zero.
Therefore this equation implies immediately, that B can always be expressed as a curl of
another vector field A. You exchanged your ignorance of B for your
ignorance of A, but at least you exhausted the contents of this equation. Because you
know, that the divergence of a curl is identically zero. And since the divergence of B is identically
zero, for all magnetic fields. B must be the curl of some other vector field always, with
no exception. We do not know the other vector field. But
it is always. So you will see the advantage of introducing this A. A has a name. It is
called the vector potential. Let me not call it, the magnetic vector potential. Because
you will see that A will play a role in E as well. It is just a vector potential. Why
do I call it a potential. It is not potential energy. It is a historical hang up. The fact
is that, you know, that in the old days, if you had a scalar potential V as a function
of possession and you took it is gradient. You got a force and this very often in mechanical
instances, had the implication of being the potential energy. And you differentiate it in order to find
the force the vector field. So that name got carried over and since you take this vector
field A and you differentiate it. After all the curl operation is some kind of derivative
and you produce another vector field B. You call it the vector potential, vector, because
A is a vector. That is the only reason for you.
So anything you differentiate to produce another field, you call it the potential, does not
have the connotation of physical potential or anything like that. It is not a potential
energy, but it is just a auxiliary field which you introduce, whose curl is going to give
you B. Now I put into you, that once you have written B in that form, A of course is also
r and t dependent. You have exhausted the contents of this equation; that is it. There
is no more information that this equation affords. But you put that in here. And you
promptly discover that, this implies that the curl of E plus delta A over delta t is
identically zero once again. Now notice that I have taken the curl operation
del cross and delta over delta t. And I have commuted the two. And the reason is, these
are partial derivatives. Therefore you can differentiate it in either order does not
matter. So curl of delta B over delta t is the same as delta over delta t of curl E.
So I mix the two up, then I get this. I interchange then, I get this. But this again affords us
some simplification. It says this vector field E plus delta A over delta t has no curl. It
is identically curl free or irrotational and what quantity has the gradient of a scalar
field is identically has a zero curl identically. So of the gradient of any scalar quantity
is identically zero. Therefore the contents of this equation are completely taken care
of, because once you write E plus delta A over delta t, as the gradient of some scalar
field. That is it. This equation is taken care of completely. So this would imply that
E plus delta A over delta t equal to the gradient of a scalar field. But let me put a minus
sign here, purely as a matter of convention. The reason is, in electro statics you know
that E can be written as minus the gradient field phi scalar field and this phi has the
connotation of being the potential energy, of a set of charges. So we keep that minus
sign for that reason. No other reason and write E plus delta A over delta t in this
form. What would I call phi? I call it, the scalar potential. Not the electro static potential.
This has nothing to do with electro statics. It is always true. So this becomes. Therefore
let us write our basic equations. One of them is, B equal to the curl of A and
E is equal to minus delta A over delta t minus the gradient of phi. That is it. The fact
that I can write these two equations, that these two fields in this form derives directly
from the two homogenous Maxwell's equations and that is it, their contents are completely
taken care of. So erase these two equations and I am left with those two. What have I
done so far? Well, I got rid of B and E in terms of auxiliary
quantities A and phi. And the idea is, if you solve for A and phi, you can solve for
E and B, by differentiation by these rules. But I have gained an advantage. The fact is,
these are two vectors. So that, really six numbers sitting here but A and phi between
them are count for only four of them. So it means that, these homogenous Maxwell's equations
have ensured that all E and all B can be found from four points, not six. That is enough
to find all of A and B. Incidentally these identities are completely trivial to prove
either component wise or in index notation. Because in index notation, the divergence
of the curl of V, the curl of a vector field, it is i th component would be epsilon i j
k del j V k where this is delta over delta x comma x sub j, the x component and then
the divergence of that would be epsilon i j k del i del j V k with the contraction over
i and j. Because the i th component is like del dot this. That is del i and then the i
th component of this curl. This is it. But this is a symmetric in i and j. This is anti
symmetric in i and j. And you have to sum over all possible i j k, in the contraction
of the symmetric with the anti symmetric identity is zero. So it is a trivial statement that
the divergence of the curl of the vector will be zero.
Likewise the curl of the gradient of u stands for the i th component of this curl, for an
instance would be epsilon i j k del j that takes care of this and then del k on u. That
is the k th component of the gradient of u. Once again this is anti symmetric in i j and
j and k. And you got this is symmetric gradient and therefore it is zero. This also tells
you that, there is no other possible vector identity. This is it. These are the only two
identities that you have to really remember. And we put that in and this is what we get.
Now let us go back a little bit and ask these four Maxwell's equations. They are first
order partial differential equations. They are first order for a very deep reason.
I will get back to it later. But they involve the divergence in the curl and I want to explain
to you, why they involve only the divergence and the curl. So please remember that, we
also had this equation divergence B is equal to zero and curl E plus delta B by delta t
is equal to zero. My question is why do we specify what is so
special about specifying that the divergence and the curl of a vector field?. Why not anything
else?. Because after all, if I have a vector field V which has got components V one V two
V three, then you can differentiate each component with respect to each of the coordinates and
really you have nine derivatives possible. So you will have del i V j or, if I write
this out you have delta V one over delta X one delta V one over delta X two delta V one
over delta X three. X one X two X three stand for X Y Z and V one V two V three stand for
the X Y Z components then you have delta V two over delta X one delta V two over delta
X two delta V two over delta X three. So you have these nine components, you could actually
put in a little matrix of this kind. The nine possible partial derivatives and
what does the divergence correspond to?. The sum of these diagonal elements, what does
the curl correspond to?. It is a vector. Well, it is, this minus that and that gives you
the three component of the curl. This minus that gives you the minus two components of
the curl and this minus that gives you the one component of the curl. Now why is that
out of the set of nine quantities. You choose the diagonal free. You call it something.
Call it the divergence. And it is a scalar and why do you choose the anti symmetric differences
of the out diagonal elements. And you call that a curl, one of the vector field. And
the answer is that, it turns out that, when you take the tensile of rank two of this kind.
The sum of the diagonal elements as you can see is going to be invariant under similarity
transformations. It is not going to change. We know that the trace does not change. And
this means under rotations this quantity would not change. That is the reason. The divergence
is a scalar and it turns out that the differences between the off diagonal elements, is a vector
transforms like a vector. So very specific transformation quantity law
is implied, we would like to have all physical quantities expressed in terms of objects,
whose transformation properties are known to you. That is why objects can become physical
laws, can become form invariant, the same for all observers. And this is the reason
why you choose these two particular combinations, but a deeper question still occurs. That is
how do we know we do not need any other combinations, any other derivatives. Why are these alone
enough, and the answer is buried in the theorem due to Helmoltz, which says, if you give me
in rough form it says in a region of space. If you give me the divergence and the curl
of a vector field, and you specify the normal component of the curl on the surface of this
volume element. Of this volume then the field is uniquely
determined. So we take request to this theorem. In this vector analysis which says, essentially
that specifying the divergence and curl is in a sense enough. It is sufficient to determine
the vector field completely but having said, that you still have to understand physically,
what does it mean to specify the divergence and curl. And here let me go back a little
bit and give an example where this comes from. What is the reason for it? So that is important
to understand. Because every equation in mathematical physics, that you see would always involve
vector fields with divergence and curl. And the reason is buried in this theorem. And
you would also like to understand why this is sufficient. So let us understand why it is sufficient
to specify the divergence and curl let us go back and ask what is the solution to an
equation of the form, d over d X f of X equal to lambda f of X. This is a eigen value equation
in which d over d X is the operator differential operator, f of X is the eigen function, lambda
is the eigen value and f of X is again the eigen function. What is the solution to this
equation. The solution is f of X is apart from a constant. It is e to the lambda X.
So you agree that e to the power lambda X is a eigen state or eigen function of the
derivative of X. Now let us generalize to three dimensions
and instead of lambda, I would like to take something like i k. So I ask if I took e to
the power i k dot r, but k is an arbitrary constant vector and r stands for the three
position components X Y Z. This is a scalar function. I put an i , because I am thinking
in terms of waves, in terms of oscillatory functions cos and sine functions and so on,
which i can combine into e to the i k dot r. This stands for e to the i k one X plus
k two Y plus k three Z and I ask what happens, if I took the gradient of this. That is a
scalar function. I am asking what happens if we took to the
gradient of this scalar function. What is the answer, i k times r. So you could now
say that e to the power i k dot r is an eigen function of the del operator with eigen value
i k, that is a vector operator. So it is eigen values also vector. That is a very simple
result. You can see what is the physical meaning of this result.
Well, if you put e to the i k dot r equal to constant, the surfaces you get correspond
to k dot r equal to constant that says k one X plus k two Y plus k three Z is equal to
constant. What sort of surface is that? It is a plane in space and the gradient is normal
to this plane, because the gradient of the scalar field is always normal to its level
surfaces. So it gives you the direction normal to this plane. That is what this is, but what
is this, del dot some constant vector. Let me call it, a e to the i dot r. What is
this equal to, a is a constant vector. k is another constant vector, the r dependence
the coordinate depends in e i k dot r and I do del dot this. What happens? So you must
take the X component of this guy which is, a one e to the i k one X plus k two Y differentiated
with respect to X. And Similarly for Y and Z and add up these three. This is a simple
matter to verify. It is a two line thing to verify that, you are going to get i k dot
a e to i k dot r. k is a vector and a is a vector. So you find the dot product of k dot
a and it is multiplied by the scalar function e to the i k dot. So in a sense it is turning
out that the eigen function of del dot is a e to the i k, for arbitrary a and arbitrary
k. And likewise it is not hard to see that del
cross a e to the i k dot r written out to be i k cross a e to the i k dot r. So even
that cross has exactly the same kind of eigen vectors. Now that is marvelous. Because it
is really telling you that if you give me a vector a e to the i k dot r, that is a vector
field, where a is any constant vector. Then this vector field is acting as an eigen function
for del dot and del cross, the divergence and the curl. So what it is doing is, to take
the divergence and curl operators and converting them to numbers. It is converting into algebraic
quantities. Differential equations are getting converted to algebraic equations. And what
is that trick called? When you convert to differential equations from algebraic equations,
it is called Fourier analysis. This is exactly what you do to convert the derivative operator
to multiplication by k, by the Fourier variable. And this is what has happened.
So now we know that, if you give me a arbitrary function of r, I can expand it in a Fourier
basis, if it is a respectable function. And since these are linear equations, I can superpose
the equations for each component and reconstruct the field. And for each component those things
are valid. So what is this telling you? It says that the divergence of this vector field
is, i k dot this guy. So it says the divergence, tells you the component of this vector field
along the direction of k. And this is the curl of this vector field, this i k cross
and therefore this curl operator, this guy here is telling you this is normal to the
direction a. So it says in sum that the divergence equation is going to give you along any arbitrary
direction you choose. The divergence equation is going to tell you
how much of that vector field points along this arbitrary direction and the curl equation
is going to tell you, how much is in the normal direction, in the transverse direction and
together they determine the entire field. So this is the reason why all these equations
involve divergences and curls, independent of coordinates, independent of observers,
independent of, what your coordinate system is.
So it is carrying a dictionary which is independent of the observer. No matter what k direction
you choose. The divergence will give you the longitudinal component and curl will give
you the transverse component. And that is true for every k and you superpose all possible
k's, you get your fields. So this is the reason why the Maxwell equations specify.
This is the reason for the theorem, the Helmoltz theorem which says, if you tell me the divergence
and the curl of a vector field diagram and suitable boundary conditions, I tell you the
field uniquely. The reason is the divergence equation gives
you the longitudinal components and the curl equations give the transverse components,
independent of the direction of flow for every k. So is it clear? That is the real reason
why you have divergences and curls everywhere in Physics. Now, we know that let us put these things
back and ask what do these equations tell us. I am going to pluck this equation back
in here and what does it give you. It says del dot E so it is minus delta over delta
t del dot A. That is the first part minus del dot del phi, but that is equal to minus
del square phi. Let us call the Laplacian of this phi and
it got its own interesting set of properties. We will come to that later and that is equal
to rho over epsilon naught. So let us put the minus sign here and put epsilon naught
here plus this, plus and that is the first equation. We took care of these two equations.
We need to take care of this equation here and it says the curl of E is of course minus
del. The curl of B; so it says del cross del cross A minus mu naught epsilon naught.
By the way let us anticipate things a little bit mu naught epsilon naught is the dimensions
of one over the square of the velocity. And conventionally it is called C square, speed
of light in vacuum. So we can just put that in. So this is equal to one over C square
time derivative. We have to differentiate the second time that becomes a plus one by
C square delta E over delta t gradient of delta phi over delta t equal to mu naught
j. Now we have this famous vector identity for
the curl of the curl of the vector field and what is that identity gradient of the divergence
of A minus del square A. So curl A del cross del cross A equal to the gradient of the divergence
of A minus del square A. Del square is a scalar operator it acts on each components of A,
so produces a vector field, which in Cartesian coordinates has components del square A one
del square A two and del square A three. That is what the del square stands for. In
case you have forgotten this identity it is not hard to derive, because we want the curl
here. So put an epsilon i j k and then del j of del cross A, the k component that is
equal to epsilon k l m del l A m. And then you move this thing back here and write this
as epsilon i j k epsilon l m k del j del l A m and that by the identity is delta i l
delta j m minus delta i m delta j l del j del l A m. Now you use the chronicle delta to simplify
this matter and then you get this. So it is very simple thing you do not have to remember
anything. You just have to remember the fact, that when you contract this epsilon symbol
once, the answer is in terms of delta. So let us put that back in here and you have
got del of del dot A. There is another del which is sitting here. So let us call it one
over C square delta phi by delta t, if I took care of this term. And the first term here
plus one over C square d two A d two minus del square A equal to mu naught j. These are
the two equations. I erase this and I have to deal with these two equations. Now that
exhausts all four Maxwell's equations. The price you paid is that, the equations order
is increased to second order differential equations.
The Laplacian of a scalar function simply says we have got phi. It is d two phi over
d X two plus d two phi over d Y two plus d two phi over d Z two. This is what del square
on a scalar function is, but then del square of A, a vector. A would be A one unit vector
in the X direction plus A two unit vector in the Y direction plus A three unit vector
in the Z direction and these are constant vectors in Euclidian space. So they do not
get differentiated and therefore it stands for a vector with components e X times del
square A one then e Y times del square A two times and so on.
So it is just a short hand notation for three del square acting on the A one A two A three
and it is a vector created out of these three. You do not have to do that. Divergence is
really not defined as del dot A, the divergence of A for a vector field A. The divergence
is defined as a flux per unit volume at each point in a limiting case and then algorithm
for calculating that quantity turns out to be del dot A.
Similarly the curl is not defined in terms of this ridiculous determinants or anything
like that it is not del cross A. But rather the circulation of this vector field over
small line each point divided by the area. So it is the circulation per unit area in
the limit, in which the area go to zero and algorithm for calculating it is del cross
A. So this question is, is there a similar thing for del square? What is the significance
of del square and so on. I would not answer that question directly.
It is a scalar operator del dot l del, but I would like to mention though, that the reason
the del square operator appears is very profound and very deep. It is really buried in differential
geometry. The del square operator or rather the generalization of del square operator
complicated spaces is a very basic problem. It probes very basic property, is of the space
itself. So I know it is an unsatisfactory answer at
the moment. But let me give you a physical picture of why the del square operator has
the kind of role that it plays. Let us go back to electro statics, what it does. What
the del square does a good digression, good to understand that. Incidentally, if you did
not have time dependent fields, but you had only electrostatics. Then of course you know
this goes away and you have got del square phi is minus rho over epsilon naught.
Imagine for a minute, rho is time independent j is time independent, what you call that
equation, Harsens equation. That is the fundamental equation of electrostatics and what is the
primary property of charge free region, where del square phi is zero. There is no absolute
maxima or minima. This is the primary property and you have the mean value theorem. And it
says the value of the field at any point, where del square phi is zero. The value of
this phi at any point is the arithmetic mean of all points, values of all points symmetrically
situated above it. Whenever you have Laplace equation; that is
true that is the statement that, I am making. Of course, you put a source so along true.
What I am trying to bring out here is, the significance of the Laplacian operator. When
you say del square phi is equal to zero. It says phi is a function and eigen function
of the del square operator with zero eigen value. It is called a harmonic function and
the question is what is so harmonic about it. And the answer I am giving is that, this
function is such that every solution of the Laplace equation is such that, the value of
the function at any point, is the arithmetic average of the values at point symmetrically
situated about it, whatever be the number of dimensions. Let me go one step further back. Let us look
at Laplaces equation in one dimension. That is my Laplace equation in one dimension, just
the second derivative is zero. Now what is the solution to this equation, a linear equation,
linear function. So it tells you phi is equal to a x plus b. So let us plot this phi as
a function of x. This is what it is. What is so special about a straight line. I want
the value at every point. The slope is constant what does it lack. Well the first derivative
is non zero. It has no maxima or minima. That is very true, but even functions which are
not straight lines will have known, what does it lack.
It is first derivative is non zero in general. But the second derivative is identically zero.
So what does the second derivative measure curvature. There is no curvature. This function
has no curvature. So the consequence of that, if you took the value at any point here, it
is guaranteed to be the arithmetic mean of values symmetrically situated about it, of
this value and this value. So this is x and this is x minus epsilon x
plus epsilon. You are absolutely guaranteed that f of x equal to f of x plus epsilon plus
f of x minus epsilon divided by two. You guaranteed that. On the other hand, if the function at
curvature, then the curvature like this, the value here is smaller. The value at this point
is larger than the arithmetic mean and it would curve the other way, the value is smaller
than the value than the arithmetic mean, only if the curvature is identically zero. You
get this mean value. And of course it does not take long to realize
that I could write this as, f of x plus epsilon minus twice f of x which, I will write f of
x minus f of x minus f of x minus epsilon and that is equal to zero. I divide this by
epsilon. And this is, of course the derivative from the right. And this is the derivative
from the left. And you divide one more by epsilon and take the limit. And what derivative
do I get, the second derivative. And that is exactly this. So you see a harmonic function.
This is a harmonic function in one dimension, only a straight line. But guaranteed that
value at any point, is the arithmetic average of points either side. Now two or three dimensions in two dimensions,
for example the same property would translate to something much more interesting. Because
if you have d two phi over d x two plus d two phi over d y two equal to zero. Then of
course, if you put phi equal to a x plus b plus c y plus d. This is true. That is a trivial
case. But you have more interesting possibilities. It is possible that, the x derivative d two
phi over d x two is some point t and the y derivative is minus that. So the functions
do not have to be linear. But they could still satisfy this point. And it is not very hard
to see, if I did the same trick as, what I did there, that the value at any point is
the arithmetic mean of the value here. Take these four points x minus epsilon x plus
epsilon y minus epsilon y plus epsilon divide by four after summing them. And I would end
up to that equation to the limit. But then the del square operator is spherically invariant.
It is a scalar operator. So, I could in fact have taken four points this, this, this and
this. I still have this. You take all these eight points and do one
eight. And I still have it. And in fact, I could do the whole lot and take all the points
on the circle and divide by the length of the circle. And I would have the value; that
is the harmonic problem. And in three dimension, it is a sphere. As you know, this is the property
of harmonic functions, the mean value problem any number of dimensions, so that is what
is so basic about the del square operator. It is eigen functions, give you the generalization
of a function without curvature in one dimension. It satisfies this mean value theorem. So that
is why, it is so special. There is other reasons, why this is very particular. In fact if you
know the spectrum of the Laplacian operator in a domain actually known everything about
the domain subsets. So that is the reason, it is a very basic
probe. By the way in electrostatics, this has the consequence, that a function which
satisfies this cannot have a maximum here, because it cannot be larger than these values.
And still be it is average, your average. It cannot have an absolute minimum here and
the consequence of that is that in a charge free region. You cannot put a charge held
together by electrostatic forces alone and expected to be in stable equilibrium. Could
be equilibrium but unstable. Some of these terms will be positive, some would be negative.
If it is a absolute minimum, all of them would have to be positive and they cannot be add
up to zero. That is an absolute maximum. All of them will be negative. We cannot add up
to zero. So it always have saddle point some positive, some negative and you have unstable
equilibrium. What is that theorem called Poisson's theorem. Just say that, del square function
satisfies the Laplace equation, cannot have absolute maximum or minimum. So let us come
back. Now we have to deal with these two equations. And the bad news is that this equation which
looks almost like Poisson's equation has this A sitting.
Now what does this equation look like, if you did not have this term, it looks like
the wave equation. It looks exactly like the wave equation. So there is a one over C square.
It is second derivative with respect to time minus the del square the three dimensional
wave equation. The source term on the right hand side and there are mathematical methods
for solving the wave equation, just like there are for solving of Laplacian equation or the
Poisson's equation. Unfortunately you cannot use them directly.
Because the A equation involves phi and the phi equation involves A. It could be glorious,
if you could put this equal to zero or you can put this equal to zero. And now I put
it to you that you can always put it equal to zero A is arbitrary. Well remember the
physical fields are E and B. They are my physical observables. They carry energy momentum and
so on. I can measure them. So these quantities are specified by the sources
and the suitable boundary conditions. But these quantities are auxiliary points. And
the question is, are they uniquely specified or not? It is quite clear, if I add a constant
vector to A, the fields are not going to change, because the derivative is going to act on
constants. Similarly, if I add a constant scalar to phi,
nothing is going to change. The gradient is not going to change. But there is more to
it, than that, you can add a lot more to it, than that. So let us look at that. This is
going to be exploited and I am going to finally tied down to non uniqueness of the Lagrangians. So let us look this carefully this is called
Gauge Invariance. Now the question we are going to ask is, I say B is the curl of A.
B is a physical field probably unique in every given situation. Is A unique or not? This
is the question you have to ask. Is this unique?. For a given B, is A unique or not?
This is the question apart from the constant I can always add trivial constants. But is
it unique. I can actually the fact is the curl of any gradient is identically zero.
Therefore to A, I can add the gradient of any arbitrary scalar field and B would not
change. So it is quite clear that A as well as A prime, which is A plus the gradient of
any scalar field of space as well as time does not matter, would lead to the same B.
Because the curl of this gradient could be identically zero. So A is not unique.
In classical physics it is A is not correctly physically measurable. It is not unique at
all. B very much is, but then, we have to be careful if I change A to A prime, I may
also change E unless, I compensate for it with phi. So what I should say is that, simultaneously
we are changing A to A prime, by adding the gradient of the scalar. I should also change
phi to phi prime which is phi minus delta x over delta t minus, because there is a minus
sign. This compensate and it is now trivial to check
that B and E would be the same, if you had the scalar and vector potential specified
by pair phi and A or the pair phi prime and A prime make note of this. This is called
Gauge Invariance. The word is historical. It was pointed out in the early part of the
twentieth century. It comes from something called high invariance which from a while
introduced got translated into English as gauge like the meter gauge or the broad gauge
and so on. That is a work on measure of some kind. Do not worry about the historical antecedents
of this word. But it is stuck. It is called Gauge Invariance.
At this level, it looks like a trivial mathematical ploy. But this principle of Gauge Invariance,
when you translate it to quantum mechanics and then quantum field theory turns out to
be perhaps, the most basic dynamical principle of all. Because all the fundamental forces
of nature are taken to be mediated by quantum fields, which are called Gauge fields.
And it arises from a generalization of this idea of electromagnetism. So it is not a trivial
mathematical artifact. It is very much rooted in reality. We will see even at this level
what it implies. So, I put it to you once again, that A phi and, well let me write it
the other way. Because there is a reason for it phi prime A it implies same E and B. Yes
it does, because you can see E is going to change. If I change A to A prime, I have added
an extra term which is minus delta over delta t gradient of phi. But B must be unique and
the only way, I can get rid of it is, to simultaneously change phi. Then you would have discovered
that, d is equal to minus delta A by delta t minus grad phi plus delta phi over delta
t. Quite right. Could used A prime there. You would have added this delta phi. It would
have turned out definitely, which two equations. These two we have not even come to it yet.
We have not even come to that yet. These equations would of course change. But these are not
the equations for E and B. The point is, should these equations also remain unchanged and
my answer is, no they would not change. They would not remain unchanged, they will very
much change and that is the whole point I want them to change, because they are equations
of A and phi. But the physical fields are E and B.
I simply have to make sure that, I no matter what pair A and phi, I choose subject to this
freedom. The same E and B are produced at the end. No need at all. No need why A prime
and phi should satisfy it. So A and phi satisfy this equation, I do not know what A prime
and phi prime satisfied at the moment. And I do not care. That is the whole point, I
am going to ensure that, I choose my ki, in such a way that A prime and phi prime satisfy
a simpler set of equations. But since I am guaranteed that, A prime and phi prime do
the same job as A and phi as far as E and B is concerned. I would like to solve an easier
problem. It is a little bit, just to give you a analogy. It is a little bit like, saying
I give you a set of vector equation. I could solve this in Cartesian coordinates
or spherical polar coordinates. There may be some symmetry in the problem which causes
the problem to be simpler and spherical polar coordinates, I use that and after I do that,
I translate the solution back to Cartesian coordinates. I do not loose anyone. So this
is exactly what is going to be done. I am going to exploit this freedom. Rho and j are
physical quantities absolutely A and phi are auxiliary quantities.
No, let us please understand these two are specified. You are not solving these equations,
in order to find rho and j. You are solving these equations in order to find A and phi
and therefore E and B. So rho and j are given to you. They are not changed. They are gauge
invariant. No question. E and B are gauge invariant, but A and phi are not. They are
intermediate quantities. They are auxiliary quantities and my claim now is that, this
freedom is what I am going to exploit in order to simplify that set of equations. So now
here is what I do, you have given me this freedom and I come back to this equation.
And I am going to do exactly what she suggested. Put this in there and see what happens.
Now I put it to you that, I can choose my chi in such a way. So let me not clutter up
the notation. But I will say it in words, it is enough to understand, I will choose
my ki, in such a way that, this equation becomes del square phi prime equal to minus rho over
epsilon naught. Without this term, how do I do that? Well, I first pretend that, I have
been stupid enough to choose some A and phi and that is my equations to start. And let
us suppose that del dot A is not zero,I have chosen an A and I calculate del dot A. And
I find it. It is not equal to zero. What can it be. Well it is equal to this del dot A,
is a scalar quantity. So it is equal to some f of r and E then,
what is del dot A prime. It is equal to del dot A which is f of r and t plus del dot del
ki, which is equal to del square ki. And I set this equal to zero. I set it equal to
zero. How would I do it, by choosing ki, to be a solution of del square chi equal to minus
f. So no matter, what A you give me. I find out what it is, divergence is and I find it
some function f of r and t. Then I immediately switch to A prime, which is A plus the gradient
of ki, where I choose ki. And that is up to me to choose this ki, such that it solves
the equation del square chi equal to minus f. That is Poisson's equation and I know
how to solve it. So choose ki. chi is called the gauge function, such that it is a solution
of del square chi is equal to minus f. Very good question, can that equation be solved.
Now I have to put conditions under which Poisson's equation can be solved. This is a well posed
problem in the area of partial differential equations. If you specify appropriate boundary
conditions, Poisson's equation can be solved. It is guaranteed that the solution exists.
I would not go to the technicalities of when it exists and what conditions and so on in
all the normal electromagnetic problems this would be solved. In fact we are going to write
the solution now, which second term this I have not done anything. I have not gone here
at all. I am just saying you, give me a del dot A. You give me an A. I calculate del dot
A. I discover it some function, not zero. I promptly choose an A prime related to A
through this, such that del dot A prime is identically zero. That means you give me f
and I tell you what I is. Not necessarily. Yes, in a closed analytical form.
Let me give you an instance. If this you have to specify things, is it differentiable, is
this f very bad. I have to after all integrate this equation. So is that f integrable? So,
I have to answer technical mathematical questions of this kind. But we are going to write physical
arguments to write down a solution to Poisson's equation. I will do that from coulomb's
law, which you already know. So we write down a free solution to the Poisson's equation.
But take it from me. At the moment, as an assertion, that you can solve Poisson's
equation. There is a table mathematical algorithm to solve Poisson's equation. Therefore in
principle, no matter what A you give me. I can rewrite this equation in terms of A
prime and phi prime, where del dot A prime is zero. Yes, I can make del dot A equal to
some constant. Yes instead of zero and then this is again zero here. And therefore, this
is certainly true. Yes I can do this. But if I can make it a constant, why do I want
this constant. Why do not I make it zero?. I can do just as easily. You are right. There
is nothing unique about the choice in this case. Yes. We are trying to find E and B.
We have written these equations, in terms of A and phi but A and phi are not unique,
because it is some derivatives. Of them, which gives you E and B and you can
put various arbitrary gradients and so on added to A, such that A and B do not change,
but the auxiliary equation for A and phi. I would like to use this freedom to write
in, as simpler form as possible in particular. It will be very nice, if I could decouple
the phi equation from A equation. And that is, what is achieved. If I put this equal
to zero and there is no A dependence. Here it is only phi. I should call it phi prime.
So I am going to take this, as I have given. And I can always choose my gauge such that
del dot A is zero, in which case the first equation becomes Poisson's equation. Again
for phi or phi prime, but I will drop the prime. I will assume that, you have chosen
del dot A is equal to zero. This is called the Coulomb gauge. Yes. No let me put that
in. So, I choose this. I choose this A prime and instead of A, I now write A prime minus
grad phi grad chi and instead of phi, I write phi prime plus delta chi over delta t. I put
that in here and then the statement is del dot A prime is zero.
So, where does it leave you? It leaves you with Poisson's equation for the scalar quantity.
So the sum of this entire thing is, you can use gauge invariance. The freedom in the choice
of the vector potential, to ensure that the first of these equations; this equation deduces
to Poisson's equation for the scalar quantities. Because the statement is, you can always choose
your gates such that del dot A is 0. If you want to call it A prime, that is ok. Good
point, once you simplify the first equation, you may be in deep trouble, in the second
equation. But let me say what happens. That is a good
point. You have to worry about that. But then, what I am going to say is, permit me to get
rid of this. And this is called the Coulomb gauge. So if I work in the Coulomb gauge,
then the first equation becomes Poisson's equation and I claim. I solve Poisson's
equation. I am going to show you how to write down a solution, then phi becomes known to
you. This is known to you and then you put it in here. This is known to you and this
is zero. So move this known to the right hand side
and then it says, A satisfies a wave equation with a known right hand side. And if I can
solve the wave equation, then the problem is done in principle at least in principle
and after that, if you find A and phi, then you differentiate it. So all of the electromagnetism
is solved in the Coulomb, but the important point is, I want to assert this again and
again, that you can always work in the coulomb gauge.
This is always possible, whether it is advantage to do or not, is a different question, whether
you want to do it or not is a question. But it decouples the two equations the A and phi
dependences and it reduces the problem, solving the Poisson equation for phi and the wave
equation for A. And these are well studied mathematically,
that solves all of Maxwell's equations. But you can take another tact you can say
I do not like, that can I not use the gauge freedom, to put this equal to zero. If I did
that, I would just have the wave equation for A. Can I always do that. The question
is can I always do this? And the answer is, what is that you have here. You have got del
dot, if I put A prime here and I got a del square chi minus one over C square d two chi
over d two equal to what ever this quantity is. So I give the same argument and I say.
Suppose you find that del dot A plus one over C square delta phi over delta t equal to some
g of r and t. suppose it is equal to some g not zero. But some zero, then I go to phi
prime and A prime and I choose my chi, such that one over C square d two chi over d t
two minus del square chi equals minus g. Just as I chose my chi, my chi, here to be
a solution of Poisson's equation, I choose my ki, there of the wave equation and the
assertion is, I can solve the wave equation subjected to conditions and so on and once
I do that I am guaranteed that del dot A prime plus one over C square delta phi prime over
delta t is zero. So in that gauge this quantity can be set equal to zero or rather with A
prime phi prime. My assertion is you can always set this to zero always. Can you do both.
Can you set this to zero and that to zero. No, the reason is gauge freedom gives you
the freedom to handle to choose only one scalar quantity chi. So if you choose that chi, to
be a solution to the wave equation, there is no reason why it should also satisfy the
Poisson equation and vice versa. So you have only one piece of freedom, can
I choose therefore a gauge in which A is always zero, the vector A. No, physically you know,
there is a magnetic field and it need not be zero in general. So you cannot do that.
Because A is a vector and chi is a scalar function. Incidentally gauge in which this is zero is
called the Lawrence gauge. There are cases, where this is more advantageous than this
gauge. Can you tell me why you would think that setting this to zero is better than that
to setting to zero?. You can always do, that in any given problem. You can do either this
or that. Why would one be preferable to another. Well ok. Again the same question.
Suppose, I solve the equation in the Lawrence gauge, then I find A; I put that in here.
So this becomes a known. I take it to that side and solve Poisson's equation. But why
is it conceivable that the Lawrence gauge is any better than
the coulomb gauge, then I have no problem.
I will just decide phi equal to zero and I am done.
If I have electrostatics, can I do this. I do not have a magnetic field. I have a constant
uniform electric field in the x direction independent of space and time everywhere in
space, I have constantly. Can I choose phi equal to zero. You really think you have to
have faith in you, because I just put into you, that there is one piece of information
ki. That is the scalar function, you can choose that in many ways and the claim is, yes you
can always make phi. You can always choose a gauge such that phi
is identically zero. May not be very good thing to do. But let us look at this example.
We talked about, I have E of r and t to be equal to E naught e x. That is it, where e
naught is a constant. No space dependence. No time dependence. And B equal to zero and
my question is, can we choose a gate in which phi is zero. Yes, I can choose phi of r t
to be identically zero and I choose A of r and t is equal to minus E naught e x.
So I have a vector potential, no scalar potential,
no magnetic field and of course del cross A is zero, because this is a constant vector.
This involves only time. So there is no magnetic field.
But then minus delta A over delta t that gives you this field. So, you can do electrostatics
using only a vector potential and no scalar potential, even in the simplest of the electrostatic
problems. You can use a scalar potential to be identically zero. It would, of course be
very stupid to do this and the reason it would be stupid is, because the whole purpose of
the game, is to find the fields. And now I am giving you a vector potential which constructed
to give you this field. So I have to tell you the field before I tell you, what the
field is. Then I go through the potential. This is not much sensible. But the point is
in principle. This is ok. So please remember that, all electric fields are not minus the
gradient of the scalar field, because these are not all irrotational. The moment you have
the magnetic field which is changing with time, you immediately will have a electric
field, whose curl is not identically zero. Therefore which cannot be written as the gradient
of the scalar and even in cases, where you have electrostatic fields of this kind. You
could work completely with a vector potential and no scalar potential at all.
Similarly you can choose a gauge, in which delta phi over delta t is zero. You can choose
a gauge where phi is zero. You can choose a gauge where del dot A is zero. You cannot
choose gauge in A is zero, because that is a vector. You cannot choose a gauge, in which
it is not common. You cannot choose a gauge in which the magnitude of A is the given constant,
because this would involve technicalities, because the magnitude involves square root
of A x square plus A y square plus A z square and once you bring in square roots and singularities,
problem becomes different. We have been talking about non singular gauges. There are singular
gauges. But the simplest possibilities are, as I said most standard possibilities, either
the Coulomb gauge and this is zero or the Lawrence gauge and now what I am going to
do, is to write down next time, the solution to Poisson's equation.
We will work backwards from Coulomb's law which we already know plus the superposition
principle and then I show you how to write equations. So this wave equation, so once
we do this, then we have to address the real problem and that problem is what happens,
if you have a charged particle moving in a electric gauge. What is the equation of motion?.
And our target is, to find the Lawrence force equation. This is the equation of motion and
the question is, how are you going to derive; so physical arguments to write down the Lagrangian
and then use that in Euler Lagrange equations, you derive the Lawrence force. And finally,
I will come into why the particular charge in Largrangian. It is again buried in relative,
all of electromagnetism is buried in that. And finally the last point I want to make
is, that these are first order differential equations, unlike Newton's equations of
motion. The original Maxwell equation is, the first
order differential equations and the structure of this equation is, such that there is Lawrence
invariance built in same for all, that requires this set of equations, we have at this moment.
You can see this in another way, in a very heuristic and crude way. Once you have relativity
spatial and temporal coordinates, get mixed up. So if you have first order derivatives
in space, you cannot have second order derivatives in time. So that is the reason why time equations
are also first order partial derivatives. This is fine as long as the problem is mathematically
well posed. So I hope, I have given you a sort of in a nutshell summary of, what basic
problem is in classical electromagnetism. Now we are going to couple it to the magnetic
fields to the charged particles. We will do that next class.