Quantum math: transformations

Pre-script (dated 26 June 2020): Our ideas have evolved into a full-blown realistic (or classical) interpretation of all things quantum-mechanical. In addition, I note the dark force has amused himself by removing some material. So no use to read this. Read my recent papers instead. šŸ™‚

Original post:

We’ve come aĀ veryĀ long way. Now we’re ready for the Big Stuff. We’ll look at the rules forĀ transformingĀ amplitudes from one ‘base’ to ‘another’. [In quantum mechanics, however, we’ll talk about a ‘representation’, rather than a ‘base’, as we’ll reserve the latter term for a ‘base’Ā state.] In addition, we’ll look at how physicists model how amplitudes evolve over time using the so-calledĀ HamiltonianĀ matrix. So let’s go for it.

Transformations:Ā how should we think about them?

In my previous post, I presented the following hypothetical set-up: we have an S-filter and a T-filter in series, but the T-filter at the angle α with respect to the first. In case you forgot: these ‘filters’ are modified Stern-Gerlach apparatuses, designed to split a particle beam according to the angular momentum in the direction of the gradient of the magnetic field, in which we may place masks to filter out one or more states.

tilted

The idea is illustrated in the hypothetical exampleĀ below. The unpolarized beam goes through S, but we have masks blocking all particles with zero or negative spin in the z-direction, i.e. with respect to S. Hence, all particles entering the T-filter are in the +S state. Now, we assume the set-up of the T-filter is such that it filters out all particles with positive or negative spin. Hence, only particles with zero spin goĀ through. So we’ve got something like this:

3

However, we need to be careful as what we are saying here. The T-apparatus is tilted, so the gradient of the magnetic field is different. To be precise, it’s got the same tilt as the T-filter itself (α). Hence, it will be filtering out all particles with positive or negative spinĀ with respect to T.Ā So, unlike what you might think at first, some fraction of the particles in the +S state will get through the T-filter, and come out in the 0T state. In fact, we know how many, because we have formulas for situations like this. To be precise, in this case, we should apply the following formula:

〈 0T | +S 〉 = Ā āˆ’(1/√2)Ā·sinα

This is a real-valued amplitude. As usual, we get the following probability by taking the absolute square, so P = |āˆ’(1/√2)Ā·sinα|2Ā =Ā (1/2)Ā·sin2α, which gives us the following graph of P:

graph probability

The probability varies between 0 Ā (for α = 0 or Ļ€) and 1/2 = 0.5 (for α = Ļ€/2 or 3Ļ€/2). Now, this graph may or may not make sense to you, so you should think about it. You’ll admit it makes sense to find P = 0 for α = 0, but what about the non-zero values?

Think about what this would mean in classical terms: we’ve got a beam of particles whose angular momentum is ‘up’ in the z-direction. To be precise, this means that JzĀ =Ā +ħ. [Angular momentum and the quantum of action have the same dimension: the jouleĀ·second.] So that’s the maximumĀ value out of the three permitted values, which are +ħ, 0 and –ħ. Note that the particles here must beĀ bosons. So you may think we’re talking photons, in practice but… Well… No. As I’ll explain in a later post, the photon is a spin-one particle but it’s quite particular, because it has noĀ ā€˜zero spin’-state. Don’t worry about it here – but it’s really quite remarkable. So, instead of thinking of a photon, you should think ofĀ some composite matter particle obeying Bose-Einstein statistics. These are not so rare as you may think: all matter-particles that contain an even number of fermions –Ā like elementary particles – haveĀ integer spin – but… Well… Their spin number is usually zero – not one. So… Well… Feynman’s particle here is somewhat theoretical – but it doesn’t matter. Let’s move on.Ā šŸ™‚

Let’s look at another transformation formula. More in particular, let’s look at the formula we (should)Ā get for 〈 0T | āˆ’S 〉 as a function of α. So we change theĀ set-up of the S-filter to ensureĀ all particles entering T have negative spin. The formula is:

〈 0T | āˆ’S 〉 = Ā +(1/√2)Ā·sinα

That gives the same probabilities: |+(1/√2)Ā·sinα|2Ā =Ā (1/2)Ā·sin2α. Adding |〈 0T | +S 〉|2Ā and |〈 0T | āˆ’S 〉|2Ā gives us aĀ totalĀ probability equal toĀ sin2α, which is equal to 1 if α = Ļ€/2 or 3Ļ€/2. We may be tempted toĀ interpret this as follows: if a particle is in the +S or āˆ’S state before entering the T-apparatus, and the T-apparatus is tilted at an angle α = Ļ€/2 or 3Ļ€/2 with respect to the S-apparatus, then this particle will come out of the T-apparatus in the 0T-state. No ambiguity here: P = 1.

Is this strange? Well… Let’s think about what it means to tilt the T-apparatus. You’ll have to admit that, if the apparatus is tilted at the angle Ļ€/2 or 3Ļ€/2, it’s going to measure the angular momentum in the x-direction. [The y-direction is the common axis of both apparatuses here.] So… Well… It’s pretty plausible, isn’t it? If all of the angular momentum is in the positive or negative z-direction, then it’s not going toĀ have any angular momentum in the x-direction, right? AndĀ not having any angular momentum in the x-direction effectively corresponds to being in the 0T-state, right?

Oh ! Is itĀ thatĀ easy?

Well… No! Not at all!Ā TheĀ reasoningĀ above shows how easy it is to be led astray.Ā We forgot to normalize. Remember, if we integrate the probability density function over its domain, i.e. α ∈ [0, 2Ļ€], then we have toĀ get one, as all probabilities have to add up to one. The definite integral ofĀ (1/2)Ā·sin2α over [0, 2Ļ€] is equal to Ļ€/2 (the definite integral of the sine or cosine squared over a full cycle is equal to π), so we need to multiply this function by 2/Ļ€ to get the actual probability density function, i.e. (1/Ļ€)Ā·sin2α. It’s got the same shape, obviously, but it gives us maximum probabilities equal toĀ 1/Ļ€ ā‰ˆ 0.32 for α = Ļ€/2 or 3Ļ€/2, instead of 1/2 = 0.5.

Likewise, theĀ sin2α function we got when adding |〈 0T | +S 〉|2Ā and |〈 0T | āˆ’S 〉|2Ā should also beĀ normalized. One really needs to keep one’s wits about oneself here. What we’re saying here is that we have a particle that is eitherĀ in the +S or theĀ āˆ’S state, so let’s say that the chance is 50/50 to be inĀ eitherĀ of the two states. We then have these probabilities |〈 0T | +S 〉|2Ā andĀ |〈 0T | āˆ’S 〉|2, which we calculated as (1/Ļ€)Ā·sin2α. So the totalĀ combinedĀ probability is equal toĀ 0.5Ā·(1/Ļ€)Ā·sin2α +Ā 0.5Ā·(1/Ļ€)Ā·sin2α =Ā (1/Ļ€)Ā·sin2α.Ā So we’re now weighing the two (1/Ļ€)Ā·sin2α functions – and it doesn’t matter ifĀ the weights are 50/50 or 75/25 or whatever, as long asĀ the two weights add up to one. The bottom line is:Ā we get the same (1/Ļ€)Ā·sin2α function for P, and the same maximum probability 1/Ļ€ ā‰ˆ 0.32 for α = Ļ€/2 or 3Ļ€/2.

So we don’t get unity: P ≠ 1 for α = Ļ€/2 or 3Ļ€/2. Why not? Think about it. The classical analysis made sense, didn’t it? If the angular momentum is all inĀ the z-direction (or in one of the two z-directions, I should say), then we cannotĀ have any of it in the x-direction, can it? Well… The surprising answer is: yes, we can.Ā TheĀ remarkable thingĀ is that, in quantum physics, we actually never have allĀ of the angular momentum in one direction. As I explained in my post on spin and angular momentum, theĀ classical concepts of angular momentum, and the related magnetic moment, have their limits in quantum mechanics. In quantum physics, we find that the magnitudeĀ of a vector quantity, like angular momentum, or the related magnetic moment, is generallyĀ not equal toĀ the maximum value of the component of that quantity in any direction. The general rule is that theĀ maximum value of any component of JĀ in whatever direction – i.e. +ħ in the example we’re discussing here – is smaller than theĀ magnitudeĀ of J – which I calculated in the mentioned post as |J| = J = +√2·ħ ā‰ˆ 1.414·ħ, so that’s almost 1.5 times ħ! So it’s quite a bit smaller!Ā The upshot is that we cannot associate any preciseĀ and unambiguousĀ direction with quantities like the angular momentumĀ JĀ or the magnetic moment μ. So the answer is:Ā the angular momentum canĀ neverĀ beĀ all in the z-direction, so we can alwaysĀ haveĀ some of itĀ in theĀ x-direction, and so that explains the amplitudes and probabilities we’re having here.

Huh?

Yep. I know. We never seem to get out of this ‘weirdness’, but then that’s how quantum physics is like. Feynman warned us upfront:

“Because atomic behavior is so unlike ordinary experience, it is very difficult to get used to, and it appears peculiar and mysterious to everyone—both to the novice and to the experienced physicist. Even the experts do not understand it the way they would like to, and it is perfectly reasonable that they should not, because all of direct, human experience and of human intuition applies to large objects. We know how large objects will act, but things on a small scale just do not act that way. So we have to learn about them in a sort of abstract or imaginative fashion and not by connection with our direct experience.”

As I see it, quantum physics is about explaining all sorts of weird stuff, like electron interference and tunneling and what have you, so it shouldn’t surprise us that the framework is as weird as the stuff it’s trying to explain. šŸ™‚ So… Well… All we can do is to try to go along with it, isn’t it? And so that’s what we’ll do here. šŸ™‚

Transformations: the formulas

We need to distinguish various cases here. The first case is the case explained above: the T-apparatus shares the same y-axis – along which the particles move – but it’s tilted. To be precise, we should say that it’s rotatedĀ about the common y-axis by the angle α. That implies we can relate the x’, y’, z’ coordinate system of T to the x, y, z coordinate system of S through the following equations:Ā z′ =Ā zĀ·cosα +Ā xĀ·sinα, x′ =Ā xĀ·cosĪ±Ā āˆ’Ā zĀ·sinα, andĀ y′ =Ā y. Then the transformation amplitudes are:

transformation 1

We used the formula for 〈 0T | +S 〉 and 〈 0T | āˆ’S 〉 above, and you can play with the formulas above by imagining the related set-up of the S and T filters, such as the one below:

1

If you do your homework (just check what formula and what set-upĀ this corresponds to), you should find the following graph for the amplitude and the probability as a function of α: the graph is zero for α = Ļ€, but is non-zero everywhere else. As with the other example, you should think about this. It makes sense—sort of, that is. šŸ™‚

probability 2

OK. Next case. Now we’re going to rotate the T-apparatus around the z-axis by some angle β. To illustrate what we’re doing here, we need to take a ‘top view’ of our apparatus, as shown below, which shows a rotation over 90°. More in general, for any angle β, the coordinate transformation is given byĀ z′ =Ā z, x′ =Ā xĀ·cosβ +Ā yĀ·sinβ, y′ =Ā yĀ·cosĪ²Ā āˆ’Ā xĀ·sinβ. [So it’s quite similar to case 1: we’re only rotating the thing in a different plane.]

rotation z

The transformation amplitudes are now given by:

transformation 2

As you can see, we getĀ complex-valuedĀ transformation amplitudes, unlike our first case, which yieldedĀ real-valuedĀ transformation amplitudes. That’s just the way it is. Nobody says transformation amplitudes have to be real-valued. On the contrary, one wouldĀ expectĀ them to be complex numbers. šŸ™‚ Having said that, the combined set of transformation formulas is, obviously, rather remarkable. The amplitude to go from theĀ +S stateĀ to, say, theĀ 0T state is zero. Also, when our particleĀ has zero spin when coming out of S, it will always haveĀ zero spinĀ when and if it goes through T. In fact, the absolute value of those e±iβ functions is also equal to one, so theyĀ are also associated with probabilities that are equal to one: |e±iβ|2 = 12 = Ā 1. So… Well… Those formulas are simple and weird at the same time, aren’t they? They sure give usĀ plenty of stuff to think about, I’d say.

So what’s next? Well… Not all that much. We’re sort of done, really. Indeed,Ā it’s just a property of space that we can getĀ anyĀ rotation of T by combining the two rotations above. As I only want to introduce the basic concepts here, I’ll refer you to Feynman for the details ofĀ howĀ exactly that’s being done. [He illustrates it for spin-1/2 particles in particular.] I’ll just wrap up here by generalizing our results fromĀ baseĀ states toĀ anyĀ state.

Transformations: generalization

We mentioned a couple of times already that the base states are like a particular coordinate system: we will usually describe a state in terms ofĀ baseĀ states indeed. More in particular, choosing S as our representation,Ā we’ll say:

The state φ is defined by the three numbers:

C+Ā = 〈 +S | φ 〉,

C0Ā = 〈 0S | φ 〉,

Cāˆ’Ā = 〈 āˆ’S | φ 〉.

Now, the veryĀ sameĀ state can, of course, also be described in the ‘T system’, so then our numbers – i.e. the ‘components’ of φ – would be equal to:

C’+Ā = 〈 +T | φ 〉, C’0Ā = 〈 0T | φ 〉, and C’āˆ’Ā = 〈 āˆ’T | φ 〉.

So how can we go from the unprimed ‘coordinates’ to the primed ones? The trick is to use the second of the three quantum math ‘Laws’ which I introduced in my previous post:

Capture

Just replace χ in [II] by +T, 0T and/or –T. More in general, if we denoteĀ +T, 0T or –T byĀ jT, we can re-write this ‘Law’ as:

transformation 3

So the 〈 jT | iS 〉 amplitudes are those nine transformation amplitudes. Now, we can represent those nine amplitudes in a nice three-by-three matrix and, yes, we’ll call that matrix the transformation matrix. So now you know what that is.

To conclude, I should note that it’s only because we’re talking spin-one particles here that we have three base states here and, hence, three ‘components’, which we denoted byĀ C+, Cāˆ’Ā and C0, which transform the way they do when going from one representation to another, and so that is very much like what vectors do when we move to a different coordinate system, which is why spin-one particles are often referred to as ‘vector particles. [I am just mentioning this in case you’d come across the term and wonder why they’re being called that way. Now you know.] In fact, if we have three base states, in respect to whatever representation, and we define some state φ in terms of them, then we can always re-define that state in terms of the following ‘special’ set of components:

special set

The set is ‘special’ because one can show (you can do that yourself that by using those transformation laws) that these components transformĀ exactlyĀ the way asĀ x, y, z transform to x′, y′, z′. But so I’ll leave at this.

[…]

Oh… What about the Hamiltonian? Well… I’ll save that for my next posts, as my posts have become longer and longer, and so it’s probably a good idea to separate them out. šŸ™‚

Post scriptum: transformations for spin-1/2 particles

You should actually really check out that chapter of Feynman. The transformation matrices for spin-1/2 particles look different because… Well… Because there’s only two base states for spin-1/2 particles. It’s a pretty technical chapter, but then spin-1/2 particles are the ones that make up the world. šŸ™‚

Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/

Quantum math: the rules – all of them! :-)

Pre-script (dated 26 June 2020): Our ideas have evolved into a full-blown realistic (or classical) interpretation of all things quantum-mechanical. In addition, I note the dark force has amused himself by removing some material. So no use to read this. Read my recent papers instead. šŸ™‚

Original post:

In my previous post, I made no compromise, and used allĀ of the rules one needs to calculate quantum-mechanical stuff:

Capture

However, I didn’t explain them. These rules look simple enough, but let’s analyze them now. They’re simple and not at the same time, indeed.

[I] The first equation uses the KroneckerĀ delta, which sounds fancy but it’s just a simple shorthand:Ā Ī“ijĀ =Ā Ī“jiĀ is equal to 1 if i = j, and zero if i ≠ j, withĀ iĀ andĀ jĀ representingĀ baseĀ states.Ā Equation (I) basically says that base states are all different.Ā For example, the angular momentum in the x-direction of a spin-1/2 particle – think of an electron or a proton – isĀ eitherĀ +ħ/2 orĀ āˆ’Ä§/2, not something in-between, or someĀ mixture. So 〈 +xĀ |Ā +x 〉 = 〈 āˆ’xĀ |Ā āˆ’x 〉 = 1 and 〈 +xĀ |Ā āˆ’x 〉 = 〈 āˆ’xĀ | +x 〉 = 0.

We’re talking base states here, of course. Base states are like a coordinate system: we settle on an x-, y- and z-axis, and a unit, and any point is defined in terms of an x-, y– and z-number. It’s the same here, except we’re talking ‘points’ in four-dimensional spacetime. To be precise, we’re talking constructs evolving in spacetime. To be even more precise, we’re talking amplitudesĀ with a temporal as well as a spatial frequency, which we’ll often represent as:

aĀ·eāˆ’iĀ·ĪøĀ =Ā eāˆ’iĀ·(ω·t āˆ’ kĀ āˆ™x)Ā =Ā aĀ·eāˆ’(i/ħ)Ā·(EĀ·t āˆ’ pāˆ™x)

The coefficient in front (a) is just a normalization constant, ensuring all probabilities add up to one. It may not be a constant, actually: perhaps it just ensure our amplitude stays within some kind of envelope, as illustrated below.

Photon wave

As for the ω = E/ħ and k =Ā p/ħ identities, these are the de Broglie equations for aĀ matter-wave, which the young ComteĀ jotted down as part of his 1924 PhD thesis. He was inspired by the fact that theĀ EĀ·t āˆ’ pāˆ™xĀ factor is an invariantĀ four-vector productĀ (EĀ·t āˆ’ pāˆ™x =Ā pμxμ) in relativity theory, and noted the striking similarity with the argument of any wave function in space and time (ω·t āˆ’ kĀ āˆ™x) and, hence, couldn’t resist equating both. Louis de Broglie was inspired, of course, by the solution to the blackbody radiation problem, which Max Planck and Einstein had convincingly solved by accepting that the ω = E/ħ equation holds for photons. As he wrote it:

“When I conceived the first basic ideas of wave mechanics in 1923–24, I was guided by the aim to perform a real physical synthesis, valid for all particles, of the coexistence of the wave and of the corpuscular aspects that Einstein had introduced for photons in his theory of light quanta in 1905.” (Louis de Broglie, quoted in Wikipedia)

Looking back, you’d of course want the phase of a wavefunction to be some invariant quantity, and the examples we gave our previous post illustrate how one would expect energy and momentum to impact its temporal and spatial frequency. But I am digressing. Let’s look at the second equation. However, before we move on, note thatĀ minusĀ sign in the exponent of our wavefunction:Ā aĀ·eāˆ’iĀ·Īø. The phase turns counter-clockwise. That’s just the way it is. I’ll come back to this.

[II] The φ and χ symbols do notĀ necessarily represent base states. In fact, Feynman illustrates this law using a variety of examples including both polarized as well asĀ unpolarizedĀ beams, or ‘filtered’ as well as ‘unfiltered’ states, as he calls it in the context of the Stern-Gerlach apparatuses he uses to explain what’s going on. Let me summarize his argument here.

I discussed the Stern-Gerlach experiment in myĀ post on spin and angular momentum, but the Wikipedia article on it is very good too. The principle is illustrated below: a inhomogeneous magnetic field – note the direction of the gradient āˆ‡B = (āˆ‚B/āˆ‚x, āˆ‚B/āˆ‚y, āˆ‚B/āˆ‚z) – will split a beam of spin-one particles into three beams. [Matter-particles with spin one are rather rare (Lithium-6 is an example),Ā but three states (rather than two only, as we’d have when analyzing spin-1/2 particles, such as electrons or protons) allow for more playĀ in the analysis. šŸ™‚ In any case, the analysis is easily generalized.]

stern-gerlach simple

The splitting of the beam is based, of course, on the quantized angular momentum in the z-directionĀ (i.e. the direction of the gradient): its value is either ħ, 0, or āˆ’Ä§. We’ll denote theseĀ baseĀ states as +, 0 orĀ āˆ’, and we should note they are defined in regard to an apparatus with a specific orientation. If we call this apparatus S, then we can denote theseĀ base states as +S, 0S and āˆ’S respectively.

The interesting thing in Feynman’s analysis is the imagined modifiedĀ Stern-Gerlach apparatus, which – I am using Feynman‘s words here šŸ™‚Ā ā€“ Ā “puts Humpty Dumpty back together.” It looks a bit monstruous, but it’s easy enough to understand. Quoting Feynman once more: “It consists of a sequence of three high-gradient magnets. The first one (on the left) is just the usual Stern-Gerlach magnet and splits the incoming beam of spin-one particles into three separate beams. The second magnet has the same cross section as the first, but is twice as long and the polarity of its magnetic field is opposite the field in magnetĀ 1. The second magnet pushes in the opposite direction on the atomic magnets and bends their paths back toward the axis, as shown in the trajectories drawn in the lower part of the figure. The third magnet is just like the first, and brings the three beams back together again, so that leaves the exit hole along the axis.”

stern-gerlach modified

Now, we can use this apparatus as aĀ filterĀ by inserting blocking masks, as illustrated below.

filter

But let’s get back to the lesson. What about the second ‘Law’ of quantum math? Well… You need to be able to imagine all kinds of situations now. The rather simple set-up below is one of them: we’ve got two of these apparatuses in series now, S and T, with T tilted at the angle α with respect to the first.

tilted

I know: you’re getting impatient. What about it? Well… We’re finally ready now. Let’s suppose we’ve got three apparatuses in series, with the first and the last one having the very same orientation, and the one in the middle being tilted. We’ll denote them by S, T and S’ respectively. We’ll also use masks: we’ll block the 0 andĀ āˆ’ state in the S-filter, like in that illustration above. In addition, we’ll block the + andĀ āˆ’ state in the T apparatus and, finally, the 0 andĀ āˆ’ state in the S’ apparatus. Now try to imagine what happens: how many particles will get through?

[…]

Just try to think about it.Ā Make some drawing or something. Please!Ā Ā 

[…]

OK… The answer is shown below. Despite the filtering in S, the +S particles that come out do have an amplitude to go through the 0T-filter, and so the number of atoms that come out will be some fraction (α) of the number of atoms (N) that came out of the +S-filter. Likewise, some other fraction (β) will make it through the +S’-filter, so we end up with βαN particles.

ratio 2

Now, I am sure that, if you’d tried to guess the answer yourself, you’d have saidĀ zeroĀ rather than βαN but, thinking about it, it makes sense: it’s not because we’ve got some angular momentum in one direction that we have none in the other. When everything is said and done, we’re talking componentsĀ of theĀ total angular momentum here, don’t we? Well… Yes and no. Let’s remove the masks from T. What do we get?

[…]

Come on: what’s your guess? N?

[…] You’re right. It’s N. Perfect. It’s what’s shown below.

ratio 3

Now,Ā thatĀ should boost your confidence. Let’s try the next scenario. We block the 0 andĀ āˆ’ state in the S-filter once again, and theĀ + andĀ āˆ’ state in the T apparatus, so the first two apparatuses are the same as in our first example. But let’s change the S’ apparatus: let’s close the + andĀ āˆ’ state there now. Now try to imagine what happens: how many particles will get through?

[…]

Come on! You think it’s a trap, isn’t it? It’s not. It’s perfectly similar: we’ve got some other fraction here, which we’ll write as γαN, as shown below.

ratio 1Next scenario: S has the 0 andĀ āˆ’ gate closed once more, and T is fully open, so it has no masks. But, this time, we set S’ so it filters the 0-state with respect to it. What do we get? Come on!Ā Think! Please!

[…]

The answer is zero, as shown below.

ratio 4

Does that make sense to you? Yes? Great! Because many think it’s weird: they think the T apparatus must ‘re-orient’ the angular momentum of the particles. It doesn’t: if the filter is wide open, then “no information is lost”, as Feynman puts it. Still… Have a look at it. It looks like we’re opening ‘more channels’ in the last example: the S and S’ filter are the same, indeed, and T is fully open, while it selected for 0-state particles before. ButĀ noĀ particles come through now, while with the 0-channel, we had γαN.

Hmm… It actuallyĀ isĀ kinda weird, won’t you agree?Ā Sorry I had to talk about this, but it will make you appreciate that secondĀ ‘Law’ now: we can always insert a ‘wide-open’ filter and, hence, split the beams into a completeĀ set of base statesĀ āˆ’ with respect to the filter, that isĀ āˆ’Ā and bring them back together provided our filter does not produce any unequal disturbances on the three beams. In short, the passage through the wide-open filter shouldĀ notĀ result in a change of the amplitudes. Again, as Feynman puts it: the wide-open filter should really put Humpty-Dumpty back together again. If it does, we can effectively apply our ‘Law’:

second law

For an example, I’ll refer you to my previous post. This brings me to the third and final ‘Law’.

[III]Ā The amplitude to go from state φ to state χ is the complex conjugateĀ of the amplitude to to go from state χ to state φ:

〈 χ |Ā Ļ†Ā āŒŖ = 〈 φ | Ļ‡Ā āŒŖ*

This is probably the weirdest ‘Law’ of all, even if I should say, straight from the start, we can actually derive it from the second ‘Law’, and the fact that all probabilities have to add up to one. Indeed, a probability is theĀ absoluteĀ square of an amplitude and, as we know, theĀ absoluteĀ square of a complex number is also equal to the product of itself and its complex conjugate:

|z|2Ā =Ā |z|Ā·|z| = zĀ·z*

[You should go through the trouble of reviewing the difference between the square and the absoluteĀ square of a complex number. Just write z as a + ib and calculate (a + ib)2Ā = a2Ā + 2abiĀ + b2 , as opposed to |z|2Ā = a2Ā + b2. Also check what it means when writing z as rĀ·eiĪøĀ = rĀ·(cosĪø + iĀ·sinĪø).]

Let’s applying the probability rule to a two-filter set-up, i.e. the situation with the S and the tilted T filter which we described above, and let’s assume we’ve got a pure beam of +S particles entering the wide-open T filter, so our particles can come out in either of the three base states with respect to T. We can then write:

〈 +TĀ |Ā +S 〉2Ā + 〈 0T |Ā +S 〉2Ā + 〈 āˆ’T | +S 〉2Ā = 1

⇔ 〈 +TĀ |Ā +S 〉〈 +TĀ |Ā +S 〉* + 〈 0T |Ā +S 〉〈 0T |Ā +S 〉* + 〈 āˆ’T | +S 〉〈 āˆ’T | +S 〉* = 1

Of course, we’ve got two other such equations if we start with a 0S or aĀ āˆ’S state. Now, we take the 〈 χ |Ā Ļ†Ā āŒŖ =Ā āˆ‘ 〈 χ | i 〉〈 i |Ā Ļ†Ā āŒŖ ‘Law’, and substitute χ and φ for +S, andĀ allĀ iĀ states for the base states with regard to T. We get:

〈 +S | +S 〉 = 1 = 〈 +S |Ā +T 〉〈 +TĀ |Ā +S 〉 + 〈 +S | 0T 〉〈 0T |Ā +S 〉 + 〈 +S | –T 〉〈 āˆ’T | +S 〉

These equations are consistent only if:

〈 +S | +T 〉 = 〈 +T | +S 〉*,

〈 +S | 0T 〉 = 〈 0T | +S 〉*,

〈 +S |Ā āˆ’T 〉 = 〈 āˆ’T |Ā +S 〉*,

which is what we wanted to prove. One can then generalize to any state φ and χ. However, provingĀ the result is one thing.Ā Understanding it is something else. One can write down a number of strange consequences, which all point to Feynman‘s rather enigmatic comment on this ‘Law’:Ā “If this Law were not true, probability would not be ‘conserved’, and particles would get ‘lost’.” So what doesĀ thatĀ mean?Ā Well… You may want to think about the following, perhaps. It’s obvious that we can write:

|〈 φ | Ļ‡Ā āŒŖ|2Ā = 〈 φ | Ļ‡Ā āŒŖāŒ© φ | Ļ‡Ā āŒŖ* = 〈 χ | φ 〉*〈 χ | φ 〉 =Ā |〈 χ | φ 〉|2

This says that the probability to go from the φ-state to the χ-state Ā isĀ the sameĀ as the probability to go from the χ-state to the φ-state.

Now, when we’re talking base states, that’s rather obvious, because the probabilities involved are either 0 or 1. However, if we substitute for +S and āˆ’T, or some more complicated states, then it’s a different thing. My guts instinct tells me this third ‘Law’ – which, as mentioned, can be derivedĀ from the other ‘Laws’ – reflectsĀ the principle ofĀ reversibilityĀ in spacetime, which you may also interpret as aĀ causalityĀ principle, in the sense that, in theory at least (i.e. not thinking about entropy and/or statistical mechanics), we can reverse what’s happening: we can go back in spacetime.

In this regard, we should alsoĀ remember that the complex conjugate of a complex number in polar form, i.e. a complex number written asĀ rĀ·eiĪø, is equal toĀ rĀ·eāˆ’iĪø, so the argument in the exponent gets aĀ minusĀ sign. Think about what this means for ourĀ aĀ·eāˆ’iĀ·ĪøĀ =Ā eāˆ’iĀ·(ω·t āˆ’ kĀ āˆ™x)Ā =Ā aĀ·eāˆ’(i/ħ)Ā·(EĀ·t āˆ’ pāˆ™x)Ā function. Taking the complex conjugate of this function amounts toĀ reversing the direction of t and x which, once again, evokes that idea of going back in spacetime.

I feel there’s some more fundamental principle here at work, on which I’ll try to reflect a bit more. Perhaps we can also do something with that relationship between the multiplicative inverse of a complex number and its complex conjugate, i.e.Ā zāˆ’1Ā = z*/|z|2. I’ll check it out. As for now, however, I’ll leave you to do that, and please let me know if you’ve got any inspirational ideas on this. šŸ™‚

So… Well… Goodbye as for now. I’ll probably talk about theĀ HamiltonianĀ in my next post. I think we really did a good job in laying the groundwork for the really hardcore stuff, so let’s go for that now. šŸ™‚

Post Scriptum: On the Uncertainty Principle and other rules

After writing all of the above, I realized I should add some remarks to make this post somewhat more readable. First thing: not all of the rules are there—obviously! Most notably, I didn’t say anything about the rules for adding or multiplying amplitudes, but that’s because I wrote extensively about that already, and so I assume you’re familiar with that. [If not, see my page on the essentials.]

Second, I didn’t talk about the Uncertainty Principle. That’s because I didn’t have to. In fact, we don’t need it here. In general, all popular accounts of quantum mechanics have an excessive focus on the position and momentum of a particle, while the approach in this and my previous post is quite different. Of course, it’s Feynman’s approach to QM really. Not ‘mine’. šŸ™‚ All of the examples and all of the theory he presents in his introductory chapters in the Third Volume of Lectures, i.e. the volume on QM, are related to things like:

  • What is the amplitude for a particle to go from spin state +S to spin state āˆ’T?
  • What is the amplitude for a particle to be scattered, by a crystal, or from some collision with another particle, in the Īø direction?
  • What is the amplitude for two identical particles to be scattered in the same direction?
  • What is the amplitude for an atom to absorb or emit a photon? [See, for example, Feynman’s approach to the blackbody radiation problem.]
  • What is the amplitude to go from one place to another?

In short, you read Feynman, and it’s only at theĀ veryĀ end of hisĀ exposĆ©, that he starts talking about the things popular books start with, such as the amplitude of a particle to be at point (x, t) in spacetime, or the Schrƶdinger equation, which describes the orbital of an electron in an atom. That’s where the Uncertainty Principle comes in and, hence, one can really avoid it for quite a while. In fact, oneĀ shouldĀ avoid it for quite a while, because it’s now become clear to me that simply presenting the Uncertainty Principle doesn’t help all that much to trulyĀ understandĀ quantum mechanics.

Truly understanding quantum mechanics involves understanding all of these weird rules above. To some extent, that involves dissociatingĀ the idea of the wavefunction with our conventional ideas of time and position. From the questions above, it should be obvious that ‘the’ wavefunction does actuallyĀ notĀ exist: we’ve got a wavefunction for anything we can and possibly want to measure. That brings us to the question of the base states: what areĀ they?

Feynman addresses this question in a rather verbose section of his Lectures titled:Ā What are the base states of the world?Ā I won’t copy it here, but I strongly recommend you have a look at it. šŸ™‚

I’ll end here with a final equation that we’ll need frequently: the amplitude for a particle to go from one place (r1) to another (r2). It’s referred to as aĀ propagatorĀ function, for obvious reasons—one of them being that physicists like fancy terminology!—and it looks like this:

propagator

The shape of theĀ e(i/ħ)Ā·(pāˆ™r12)Ā function is now familiar to you. Note theĀ r12Ā in the argument, i.e. the vectorĀ pointing fromĀ r1Ā to r2. The pāˆ™r12Ā dot product equals |p|āˆ™|r12|Ā·cosĪø = pāˆ™r12Ā·cosĪø, with Īø the angle between p and r12. If the angle is the same, then cosĪø is equal to 1. If the angle is Ļ€/2, then it’s 0, and the function reduces to 1/r12. So the angle Īø, through the cosĪø factor, sort of scalesĀ the spatial frequency. Let me try to give you some idea of how this looks like by assuming the angle between p and r12Ā is the same, so we’re looking at the space in the direction of the momentum only andĀ |p|āˆ™|r12|Ā·cosĪø = pāˆ™r12. Now, we can look at the p/ħ factor as a scalingĀ factor, and measure theĀ distance xĀ in units defined by that scale, so we write: x = pāˆ™r12/ħ.Ā The function then reduces to (ħ/p)Ā·eiāˆ™x/xĀ =Ā (ħ/p)Ā·cos(x)/xĀ + iĀ·(ħ/p)Ā·sin(x)/x, and we just need to square this to get the probability. All of the graphs are drawn hereunder: I’ll let you analyze them. [Note that the graphs do not include the ħ/p factor, which you may look at as yet another scaling factor.] You’ll see – I hope! – that it all makes perfect sense: the probability quickly drops off with distance, both in the positive as well as in the negative x-direction, while it’s going to infinity when veryĀ near.Ā [Note that the absolute square, usingĀ cos(x)/xĀ andĀ sin(x)/xĀ yields the same graph as squaring 1/x—obviously!]

graph

Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/

Working with amplitudes

Note (22 May 2020): I am in the process of reviewing many of my posts—including the one below—as a result of my realist interpretation of quantum mechanics, which is based on the ring current model of an electron. It is a delicate exercise because I would like to keep track of the old posts. Perhaps I should copy all of them to some new site. Any case, the point is: I would significantly rephrase some of the below because I now believe spin is very real in quantum mechanics as well.

Original post:

Don’t worry: I am not going to introduce the Hamiltonian matrix—notĀ yet, that is. But this post is going a step further than my previous ones, in the sense that it will be more abstract. At the same time, I do want to stick toĀ realĀ physical examples so as to illustrate what we’re doing when working with those amplitudes. The exampleĀ that I am going to use involvesĀ spin. So let’s talk about that first.

Spin, angular momentum and the magnetic moment

You know spin: it allows experienced pool players to do the most amazing tricks with billiard balls, making a joke of what a so-called elastic collision is actually supposed to look like. So it should not come as a surprise that spin complicates the analysis in quantum mechanics too. We dedicated several posts to thatĀ (see, for example, my post on spin and angular momentum in quantum physics) and I won’t repeat these here. Let me just repeat the basics:

1. Classical and quantum-mechanical spin do share similarities: the basicĀ ideaĀ driving the quantum-mechanical spin model is that of aĀ electricĀ charge – positive or negative – spinning about its own axis (this is often referred to as intrinsicĀ spin)Ā as well as having some orbital motion (presumably around some other charge, like an electron in orbit with a nucleus at the center). This intrinsic spin, and the orbital motion, give our charge someĀ angular momentumĀ (J) and, because it’s an electric charge in motion, there is a magnetic moment (μ). To put things simply: the classical and quantum-mechanical view of things converge in their analysis of atoms or elementary particles as tiny little magnets. Hence, when placed in anĀ externalĀ magnetic field, there is some interaction – a force – and their potential and/orĀ kineticĀ energy changes. The whole system, in fact, acquires extra energyĀ when placed in an external magnetic field.

Note: The formula for that magnetic energy is quite straightforward, both in classical as well as in quantum physics, so I’ll quickly jot it down: U =Ā āˆ’Ī¼ā€¢BĀ = āˆ’|μ|Ā·|B|Ā·cosĪø = āˆ’Ī¼Ā·BĀ·cosĪø.Ā So it’s just theĀ scalarĀ product of the magnetic moment and the magnetic field vector, with a minus sign in front so as to get the direction right. [Īø is the angle between the μ andĀ B vectorsĀ and determines whether U as a whole is positive or negative.

2. The classical and quantum-mechanical view also diverge, however. They diverge, first, because of the quantumĀ nature of spin in quantum mechanics. Indeed, while the angular momentum can take on any value in classical mechanics, that’s not the case in quantum mechanics: in whatever direction we measure, we get a discrete set of values only. For example, the angular momentum of a proton or an electron is either āˆ’Ä§/2 orĀ +ħ/2, in whatever direction we measure it. Therefore, they are referred to as spin-1/2 particles. All elementaryĀ fermions, i.e. the particles that constituteĀ matterĀ (as opposed to force-carrying particles, like photons), have spin 1/2.

Note:Ā Spin-1/2 particles include, remarkably enough, neutrons too, which has the same kind of magnetic moment that a rotating negative charge would have. The neutron, in other words, is not exactly ‘neutral’ in the magnetic sense. One can explain this by noting that a neutron is not ‘elementary’, really: it consists of three quarks, just like a proton, and, therefore, it may help you to imagine that the electric charges inside are, somehow, distributed unevenly—although physicists hate such simplifications. I am noting this becauseĀ the famous Stern-Gerlach experiment, which established the quantum nature of particle spin, used silver atoms, rather than protons or electrons. More in general, we’ll tend to forget about the electric charge of the particles we’re describing, assuming, most of the time, or tacitly, that they’re neutral—which helps us to sort of forget about classical theory when doing quantum-mechanical calculations!

3. The quantum nature of spin is related to another crucial difference between the classical and quantum-mechanical view of the angular momentum and the magnetic moment of a particle. Classically, the angular momentum and the magnetic moment can have any direction.

Note: I should probably briefly remind you that J is a so-called axial vector, i.e. a vector product (as opposed to a scalarĀ product)Ā of the radius vector r and the (linear) momentum vector p = mĀ·v, with v the velocity vector, which points in the direction of motion. So we write: J = rƗp = rƗmĀ·vĀ = |r|Ā·|p|Ā·sinĪøĀ·n. The n vector is the unit vector perpendicularĀ to the plane containing r and pĀ (and, hence,Ā v, of course) given by the right-hand rule. I am saying this to remind you that the direction of the magnetic moment and the direction of motion are notĀ the same: the simple illustration below may help to see what I am talking about.]

atomic magnet

Back to quantum mechanics: the image above doesn’t work in quantum mechanics. We doĀ notĀ have an unambiguous direction of the angular momentum and, hence, of the magnetic moment. That’s where all of the weirdness of the quantum-mechanical concept of spin comes out, really. I’ll talk about that when discussing Feynman’s ‘filters’ – which I’ll do in a moment – but here I just want to remind you of the mathematical argument that I presented in the above-mentioned post. Just like in classical mechanics, we’ll have a maximum (and, hence, also a minimum) value for J, like +ħ, 0 and +ħ for a Lithium-6 nucleus. [I am just giving this rather special example of a spin-1 article so you’re reminded we can have particles with an integer spin number too!] So, when we measure its angular momentum in any direction really, it will take on one of these three values: +ħ, 0 or +ħ. So it’s either/or—nothing in-between.Ā Now that leads to a funny mathematical situation: one would usually equate theĀ maximumĀ value of a quantity like this to the magnitudeĀ of the vector, which is equal to the (positive) square root of J2Ā = J•JĀ = Jx2Ā + Jy2Ā +Ā Jz2, with Jx, JyĀ and JzĀ the componentsĀ of J in the x-, y- and z-direction respectively. But we don’t have continuity in quantum mechanics, and so the concept of aĀ componentĀ of a vector needs to be carefully interpreted. There’s nothing definite there,Ā like in classical mechanics: all we have is amplitudes, and all we can do is calculateĀ probabilities, orĀ expected valuesĀ based on those amplitudes.

Huh? Yes. In fact, the concept of theĀ magnitude of a vector itself becomes rather fuzzy: all we can do really is calculate its expected value.Ā Think of it: in the classical world, we have a J2Ā =Ā J•JĀ product that’s independent of the direction of J. For example, if J is all in the x-direction, then JyĀ and JzĀ will be zero, and J2Ā = Jx2. If it’s all in the y-direction, then JxĀ and JzĀ will be zero and all of the magnitudeĀ of J will be in the y-direction only, so we write:Ā J2Ā = Jy2. Likewise, if J does not have any z-component, then our J•J product will only include the x- and y-components: J•J =Ā Jx2Ā + Jy2. You get the idea: theĀ J2Ā =Ā J•JĀ product is independent of the direction of JĀ exactly because, in classical mechanics, J actually has a precise and unambiguous magnitude and directionĀ and, therefore, actually hasĀ a precise and unambiguous component in each direction. So we’d measure Jx, Jy, andĀ JzĀ and, regardless of theĀ actualĀ direction of J, we’d find its magnitude |J| = J = +√J2Ā =Ā +(Jx2Ā + Jy2Ā +Ā Jz2)1/2.

In quantum mechanics, we just don’t have quantities like that. We say that Jx, JyĀ andĀ JzĀ have an amplitudeĀ to take on a value that’s equal to +ħ, 0 or +ħ (or whatever other value is allowed by the spin number of the system). Now that we’re talking spin numbers, please note that this characteristic number is usually denoted by j, which is a bit confusing, but so be it. SoĀ jĀ can be 0, 1/2, 1, 3/2, etcetera, and the number of ‘permitted values’ is 2j + 1 values, with each value being separated by an amount equal to ħ. So we have 1, 2, 3, 4, 5 etcetera possible values for Jx, JyĀ andĀ JzĀ respectively. But let me get back to the lesson. We just can’t do the same thing in quantum mechanics. For starters, we can’t measure Jx, Jy, andĀ JzĀ simultaneously: our Stern-Gerlach apparatus has a certain orientationĀ and, hence, measures one componentĀ of J only. So what can we do?

Frankly, we can only do some math here. The wave-mechanical approach does allow to think of the expectedĀ value ofĀ J2Ā =Ā J•JĀ = Jx2Ā + Jy2Ā +Ā Jz2Ā value, so we write:

E[J2] = E[J•J] = E[Jx2Ā + Jy2Ā +Ā Jz2] = ?

[Feynman’s use of the 〈 and 〉 brackets to denote an expected value isĀ hugelyĀ confusing, because these brackets are also used to denote an amplitude. So I’d rather use the more commonly used E[X] notation.]Ā Now, it is a rather remarkable property, but the expected value of the sum of two or more random variables is equal to the sum of the expected values of the variables, evenĀ if those variables may not be independent. So we can confidently use the linearity property of the expected value operator and write:

E[Jx2Ā + Jy2Ā +Ā Jz2] = E[Jx2] +Ā E[Jx2] +Ā E[Jx2]

Now we need something else. It’s also just part of the quantum-mechanical approach to things and so you’ll just have to accept it. It sounds rather obvious but it’s actually quite deep: if we measure the x-, y-Ā orĀ z-component of the angular momentumĀ of a random particle, then each of the possible values is equally likelyĀ to occur. So that means, in our case, that theĀ +ħ, 0 or +ħ values are equally likely, so their likelihood is one into three, i.e. 1/3. Again, that sounds obvious but it’s not. Indeed, please note, once again, thatĀ we can’t measure Jx, Jy, andĀ JzĀ simultaneously, so the ‘or’ in x-, y-Ā orĀ z-component is an exclusive ‘or’.Ā Of course, I must add this equipartitionĀ of likelihoods is validĀ only because we do notĀ have a preferred direction for J: the particles in our beam have random ‘orientations’. Let me give you the lingoĀ for this: we’re looking at an unpolarizedĀ beam. You’ll say: so what? Well… Again, think about what we’re doing here: we may of may not assume that theĀ Jx, Jy, andĀ JzĀ variables are related. In fact, in classical mechanics, they surely are: they’re determined by the magnitude and direction of J. Hence, they are notĀ random at all !Ā But let me continue, so you see what comes out.

Because theĀ +ħ, 0 and +ħ values are equally, we can write: E[Jx2] = ħ2/3 + 0/3 + (āˆ’Ä§)2/3 = [ħ2Ā + 0 + (āˆ’Ä§)2]/3 = 2ħ2/3. In case you wonder, that’s just the definitionĀ of the expected value operator: E[X] = p1x1Ā + p2x2Ā + … = āˆ‘pixi, with piĀ the likelihoodĀ of the possibleĀ valueĀ xiĀ . So we take a weighted average with the respective probabilities as the weights. However, in this case, with an unpolarizedĀ beam, the weighted average becomes aĀ simpleĀ average.

Now, E[Jy2] and E[Jz2] are – rather unsurprisingly – also equal to 2ħ2/3, so we find thatĀ E[J2] = E[Jx2] +Ā E[Jx2] +Ā E[Jx2] = 3Ā·(2ħ2/3) = 2ħ2Ā and, therefore, we’d say that the magnitudeĀ of the angular momentum is equal to |J| = J = +√2·ħ ā‰ˆ 1.414·ħ. Now thatĀ value isĀ notĀ equal to the maximum value of our x-, y-, z-component of J, or the component of J in whatever direction we’d want to measure it. That maximumĀ value is ħ, without the √2 factor, so that’s some 40% lessĀ than the magnitude we’ve just calculated!

Now, you’ve probably fallen asleep by now but, what this actually says, is that the angular momentum, in quantum mechanics, is never completely in anyĀ direction. We can state this in another way: it implies that, in quantum mechanics, there’s no such thing really as a ‘definite’ direction of the angular momentum.

[…]

OK. Enough on this. Let’s move on to a more ‘real’ example. Before I continue though, let me generalize the results above:

[I]Ā A particle, or a system, will have a characteristicĀ spinĀ number:Ā j. That number is always an integer or a half-integer, and it determines a discrete set of possible values for theĀ componentĀ of the angular momentum J in any direction.

[II]Ā The number of values is equal to 2j + 1, and these values are separated by ħ, which is why they are usually measured in units of ħ, i.e. Planck’s reducedĀ constant: Ä§Ā ā‰ˆĀ 1Ɨ10āˆ’34Ā JĀ·s, so that’s tiny but real. šŸ™‚ [It’s always good to remind oneself that we’re actually trying to describe reality.] For example, the permitted values for a spin-3/2 particle are +3ħ/2, +ħ/2, āˆ’Ä§/2 and āˆ’3ħ/2 or, measured in units of ħ, +3/2, +1/2, āˆ’1/2 and āˆ’3/2. When discussing spin-1/2 particles, we’ll often refer to the two possible statesĀ as the ‘up’ and the ‘down’ state respectively. For example, we may write the amplitudeĀ for an electron or a proton to have a angular momentum in the x-direction equal to +ħ/2 or āˆ’Ä§/2 as 〈+x〉 and āŒ©āˆ’x〉 respectively. [Don’t worry too much about it right now: you’ll get used to the notation quickly.]

[III]Ā The classical concepts of angular momentum, and the related magnetic moment, have their limits in quantum mechanics. The magnitudeĀ of a vector quantity like angular momentum is generallyĀ not equal toĀ the maximum value of the component of that quantity in any direction. The general rule is:

 J2 = j·(j+1)ħ2 > j2·ħ2

So the maximum value of any component of J in whatever direction (i.e. j·ħ) is smaller than the magnitude of J (i.e. √[ j·(j+1)]·ħ). This implies we cannot associate any precise and unambiguous direction with quantities like the angular momentum J or the magnetic moment μ. As Feynman puts it:

“That the energy of an atom [or a particle] in a magnetic field can have only certain discrete energies is really not more surprising than the fact that atoms in general have only certain discrete energy levels—something we mentioned often in VolumeĀ I. Why should the same thing not hold for atoms in a magnetic field? It does. But it is the attempt to correlate this with the idea of an oriented magnetic moment that brings out some of the strange implications of quantum mechanics.”

A real example: the disintegration of a muon in a magnetic field

I talked about muon integration before, when writing a much more philosophical piece on symmetriesĀ in NatureĀ and time reversal in particular. I used the illustration below. We’ve got an incoming muon that’s being brought to rest in a block of material, and then, as muons do, it disintegrates, emitting an electron and two neutrinos. As you can see, theĀ decay direction is (mostly) in the direction of theĀ axial vectorĀ that’s associated with the spin direction, i.e. the direction of the grey dashed line. However, there’s some angular distribution of the decay direction, as illustrated by the blue arrows, that are supposed to visualize the decay products, i.e. the electron and the neutrinos.

Muon decay

This disintegration process is very interesting from a more philosophical side. The axial vector isn’t ‘real’: it’s a mathematical concept—aĀ pseudovector. A pseudo- or axial vector is theĀ product of two so-calledĀ trueĀ vectors, aka as polarĀ vectors. Just look back at what I wrote about the angular momentum: the JĀ in the JĀ =Ā rƗp = rƗmĀ·vĀ formula is such vector, and its direction depends on the spin direction, which isĀ clockwise or counter-clockwise, depending from what side you’re looking at it. Having said that, who’s to judge if the product of two ‘true’ vectors is any less ‘true’ than the vectors themselves? šŸ™‚

The point is: the disintegration process does not respect what is referred to as P-symmetry. That’s because our mathematical conventions (like all of these right-hand rules that we’ve introduced) are unambiguous, and they tell us that the pseudovector in theĀ mirrorĀ image of what’s going on, has the opposite direction. It hasĀ to, as per our definition of a vector product. Hence, our fictitious muon in the mirror should send its decay products in the opposite direction too! So… Well… The mirrorĀ image of our muon decay process is actually something that’s not going to happen: it’sĀ physicallyĀ impossible. SoĀ we’ve got a process in Nature here that doesn’t respect ‘mirror’ symmetry. Physicists prefer to call it ‘P-symmetry’, forĀ parityĀ symmetry, because it involves a flip of sign of all space coordinates, so there’s aĀ parityĀ inversion indeed. So there’s processes in Nature that don’t respect it but, while that’s all very interesting, it’s not what I want to write about. [Just check that post of mine if you’d want to read more.] Let me, therefore, use another illustration—one that’s more to the point in terms of what weĀ doĀ want to talk about here:

muon decay Feynman

So we’ve got the same muon here – well… A different one, of course! šŸ™‚Ā ā€“ entering that blockĀ (A) and coming to a grinding halt somewhere in the middle, and then it disintegratesĀ in a few micro-seconds, which isĀ an eternity at the atomic or sub-atomic scale. It disintegrates into an electron and two neutrinos, as mentioned above, with some spread in the decay direction. [In case you wonder where we can find muons… Well… I’ll let you look it up yourself.] So we have:

Part-muonDeqn

Now it turns out that the presence of a magnetic field (represented by the B arrows in the illustration above) can drastically changeĀ the angular distribution of decay directions. That shouldn’t surprise us, of course, but how does it work,Ā exactly? Well… To simplify the analysis, we’ve got a polarized beam here: the spin direction of all muons before they enter the block and/or the magnetic field, i.e. at time t = 0, is in the +x-direction. So we filtered them just, before they entered the block. [I will come back to this ‘filtering’ process.] Now, if the muon’s spin would stay that way, then the decay products – and the electron in particular – would just go straight, because all of the angular momentum is in that direction. However, we’re in the quantum-mechanical world here, and so things don’t stay the same. In fact, as we explained, there’s no such things as a definite angular momentum: there’s just an amplitudeĀ to be in the +x state, and that amplitude changes in time and in space.

How exactly? Well… We don’t know, but we can apply some clever tricks here. The first thing to note is that our magnetic field will add to the energy of our muon. So, as I explained in my previous post, the magnetic field adds to the E in the exponent of our complex-valued wavefunctionĀ aĀ·eāˆ’(i/ħ)(EĀ·t āˆ’ pāˆ™x). In our example, we’ve got a magnetic field in the z-direction only, so thatĀ U =Ā āˆ’Ī¼ā€¢BĀ reduces to U =Ā āˆ’Ī¼zĀ·B, and we can re-write our wavefunction as:

aĀ·eāˆ’(i/ħ)[(E+U)Ā·t āˆ’ pāˆ™x]Ā =Ā aĀ·eāˆ’(i/ħ)(EĀ·t āˆ’ pāˆ™x)Ā·e(i/ħ)(μzĀ·BĀ·t)

Of course, the magnetic field only acts from t = 0 to when the muon disintegrates, which we’ll denote by the point t =Ā Ļ„. So what we get is that the probability amplitude of a particle that’s been in a uniform magnetic field changes by a factorĀ e(i/ħ)(μzĀ·BĀ·Ļ„).Ā Note that it’s a factor indeed: we use it to multiply. You should also note that this is a complex exponential, so it’sĀ a periodic function, with its real and imaginary part oscillating between zero and one. Finally, we know that μzĀ can take on only certain values: for a spin-1/2 particle, they are plusĀ orĀ minus some number, which we’ll simply denote as μ, so that’s without the subscript, so our factor becomes:

e(i/ħ)(±μ·B·t)

[The plusĀ orĀ minus sign needs to be explained here, so let’s do that quickly: we haveĀ twoĀ possible states for a spin-1/2 particle, one ‘up’, and the other ‘down’. But then we also know that the phase of our complex-valued wave function turns clockwise, which is why we have a minus sign in the exponent of our eāˆ’iĪøĀ expression. In short, for the ‘up’ state, we should take the positiveĀ value, i.e. +μ, but the minus sign in the exponent of our eāˆ’iĪøĀ function makes it negative again, so our factor isĀ eāˆ’(i/ħ)(μ·BĀ·t)Ā for the ‘up’ state, and e+(i/ħ)(μ·BĀ·t)Ā for the ‘down’ state.]

OK. We get that, but that doesn’t get us anywhere—yet. We need another trick first. One of the most fundamental rules in quantum-mechanics is that we can always calculate the amplitude to go from one state, say φ (read: ‘phi’), to another, say χ (read: ‘khi’), if we have a complete set of so-calledĀ base states, which we’ll denote by the index i or jĀ (which you shouldn’t confuse with the imaginary unit, of course), using the following formula:

〈 χ |Ā Ļ†Ā āŒŖ =Ā āˆ‘ 〈 χ | i 〉〈 i |Ā Ļ†Ā āŒŖ

I know this is a lot to swallow, so let me start with the notation. You should read 〈 χ |Ā Ļ†Ā āŒŖ from right to left: it’s the amplitude to go from state φ to state χ. This notation is referred to as the bra-ket notation, or the DiracĀ notation. [Dirac notation sounds more scientific, doesn’t it?] The right part, i.e. |Ā Ļ†Ā āŒŖ, is theĀ bra, and the left part, i.e. 〈 χ |Ā is the ket. In our example, we wonder what the amplitude is for our muon staying in the +x state. Because that amplitude is time-dependent, we can write it as A+(Ļ„) = 〈 +xĀ at time t = τ |Ā +xĀ at time t = 0 〉 = 〈 +xĀ at t = τ |Ā +xĀ at t = 0 〉or, using a veryĀ misleading shorthand, 〈 +xĀ |Ā +x 〉. [The shorthand is misleading because the +xĀ in the ket obviously means something else than theĀ +xĀ in the bra.]

But let’s apply the rule. We’ve got two states with respect to each coordinate axis only here. For example, in respect to the z-axis, the spin values are +z and āˆ’z respectively. [As mentioned above, we actually mean that the angular momentum in this direction is either +ħ/2 orĀ āˆ’Ä§/2, aka as ‘up’ or ‘down’ respectively, but then quantum theorists seem to like all kinds of symbols better, so we’ll use the +z and āˆ’zĀ notations for these twoĀ baseĀ states here. So now we can use our rule and write:

A+(t) = 〈 +xĀ |Ā +x 〉 = 〈 +xĀ |Ā +z 〉〈 +zĀ |Ā +x 〉 + 〈 +xĀ |Ā āˆ’z 〉〈 āˆ’zĀ |Ā +x 〉

You’ll say this doesn’t help us any further, but it does, because there is another set of rules, which are referred to asĀ transformationĀ rules, which gives us those 〈 +zĀ |Ā +x 〉 and 〈 āˆ’zĀ |Ā +x 〉 amplitudes. They’reĀ realĀ numbers, and it’s the same number for both amplitudes.

〈 +zĀ |Ā +x 〉 = 〈 āˆ’zĀ |Ā +x 〉 = 1/√2

This shouldn’t surprise you too much: the square root disappears when squaring, so we get two equalĀ probabilities – 1/2, to be precise – that add up to one which – you guess it – they have to add up to because of the normalization rule: the sum of all probabilities has toĀ add up to one, always. [I can feel your impatience, but just hang in here for a while, as I guide you through what is likely to be your very first quantum-mechanical calculation.]Ā Now, the 〈 +zĀ |Ā +x 〉 = 〈 āˆ’zĀ |Ā +x 〉 = 1/√2 amplitudes are the amplitudes at time t = 0, so let’s be somewhat less sloppy with our notation and write 〈 +zĀ |Ā +x 〉 as C+(0) and 〈 āˆ’zĀ |Ā +x 〉 asĀ Cāˆ’(0), so we write:

〈 +z | +x 〉 = C+(0) = 1/√2

〈 āˆ’zĀ |Ā +x 〉 = Cāˆ’(0) = 1/√2

Now we know what happens with those amplitudes over time: that e(i/ħ)(±μ·B·t) factor kicks in, and so we have:

C+(t) =Ā C+(0)Ā·eāˆ’(i/ħ)(μ·BĀ·t)Ā = eāˆ’(i/ħ)(μ·BĀ·t)/√2

Cāˆ’(t) = Cāˆ’(0)Ā·e+(i/ħ)(μ·BĀ·t)Ā = e+(i/ħ)(μ·BĀ·t)/√2

As for the plus and minus signs, see my remark on the tricky ± business in regard to μ. To make a long story somewhat shorter :-), our expression for A+(t) = 〈 +x at t | +x 〉 now becomes:

A+(t) = 〈 +xĀ |Ā +z 〉·C+(t)Ā + 〈 +xĀ |Ā āˆ’z 〉·Cāˆ’(t)

Now, you wouldn’t be too surprised if I’d just tell you that the 〈 +xĀ |Ā +z 〉 and 〈 +xĀ | āˆ’z 〉 amplitudes are also real-valued and equal to 1/√2, but you can actually use yet another rule we’ll generalize shortly: the amplitude to go from state φ to state χ is the complex conjugateĀ of the amplitude to to go from state χ to state φ, so we write 〈 χ |Ā Ļ†Ā āŒŖ = 〈 φ | Ļ‡Ā āŒŖ*,Ā and therefore:

〈 +x | +z 〉 = 〈 +z | +x 〉* = (1/√2)* = (1/√2)

〈 +xĀ |Ā āˆ’z 〉 = 〈 āˆ’zĀ |Ā +x 〉* =Ā (1/√2)* =Ā (1/√2)

So our expression for A+(t) = 〈 +x at t | +x 〉 now becomes:

A+(t) = eāˆ’(i/ħ)(μ·BĀ·t)/2 + e(i/ħ)(μ·BĀ·t)/2

That’s the sum of a complex-valued function and its complex conjugate, and we’ve shown more than once (see my page on the essentials, for example) that such sum reduces to the sum of the realĀ parts of the complex exponentials. [You should notĀ expect any explanation of Euler’s eiĪøĀ = cosĪø + iĀ·sinĪø rule at this level of understanding.] In short, we get the following grand result:

muon decay result

The big question, of course: what does this actuallyĀ mean? šŸ™‚ Well… Just square this thing and you get the probabilities shown below. [Note that the period of aĀ squaredĀ cosine function is π,Ā instead ofĀ 2Ļ€, which you can easily verify using an online graphing tool.]

graph

Because you’re tired of this business, you probably don’t realize what we’ve just done. It’s spectacular and mundane at the same time. Let me quote Feynman to summarize the results:

“We find that the chance of catching the decay electron in the electron counter varies periodically with the length of time the muon has been sitting in the magnetic field. The frequency depends on the magnetic moment μ. The magnetic moment of the muon has, in fact, been measured in just this way.”

As far as I am concerned, the key result is that we’ve learned how to work with those mysterious amplitudes, and the wavefunction, in aĀ practicalĀ way, thereby using all of the theoretical rules of the quantum-mechanical approach to real-life physical situations. I think that’s a great leap forward, and we’ll re-visit those rules in a more theoretical and philosophical dĆ©marche in the next post.Ā As for the example itself, Feynman takes it much further, but I’ll just copy the Grand Master here:

Feynman

Huh?Ā Well… I am afraid I have to leave it at this, as I discussed the precession of ‘atomic’ magnets elsewhereĀ (see my post on precession and diamagnetism), which gives you the same formula: ωpĀ = μ·B/JĀ (just substitute J for ±ħ/2). However, the derivation above approaches it from an entirely different angle, which is interesting. Of course, all fits. šŸ™‚ However, I’ll let you do your own homework now. I hope to see you tomorrow for the mentioned theoretical discussion. Have a nice evening, or weekend – or whatever ! šŸ™‚

Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/

Quantum mechanics and forces: the classical limit

Pre-script (dated 26 June 2020): Our ideas have evolved into a full-blown realistic (or classical) interpretation of all things quantum-mechanical. In addition, I note the dark force has amused himself by removing some material. So no use to read this. Read my recent papers instead. šŸ™‚

Original post:

This post continues along the same line as the two previous ones: the objective is to get a better ‘feel’ for those weird amplitudes by showing how they are being affected by a potential āˆ’ such as an electric potential, or a gravitational field, or a magnetic fieldĀ āˆ’ and how they relate to a classical analysis of a situation. It’s what Feynman tries to do too.Ā The best example of such ‘familiarization exercises’, by far, is the comparison between a classical analysis of a force field, and the quantum-mechanical one, so let’s quickly go through that.

The classical situation is depicted below. We’ve got a particle moving in the x-direction and entering a region where the potentialĀ āˆ’ again, think of an electrostatic field, for example, as it probably inspires the VĀ (forĀ voltage, I assume) – varies in the y-direction. As you know, the potentialĀ (energy) is measured by doing work against the force, and the work done by the force is the W = ∫F•ds integral along the path. Conversely, one can differentiate both sides of thatĀ W = ∫F•dsĀ equation to get the force, so we can write: F = dW/ds. That explains the F = āˆ’āˆ‚V/āˆ‚y expression in the illustration below. Again, think of a positively charged particle that’s being attracted from a high or positive voltage (high V) to a lower or negative voltage.

classical

As a result of the force, the particle is being acceleratedĀ transversely, i.e. in the y-direction, and so it adds some momentum in the y-direction (py) to the initial momentum p, which is all in the x-direction (p = px). Of course, the force only acts as the particle is passing through the area, so that’s during a time interval that’s equal to w/v, with w being the ‘width’ of the area. Now, the vertical momentum builds from 0 to py, and we can calculate pyĀ using Newton’s Law: F = mĀ·a = mĀ·(dv/dt) = d(mĀ·v)/dt = dp/dt. It goes as follows:

formula

We can then use a simple trigonometric formula to find the angle of deflection Γθ:

formula 2

Done! Now we need to show we get the same result with our quantum-mechanical approach or, at the very least, show how our quantum-mechanical approach actually works.

Of course, the assumption is that everything is on a very large scale as compared with the wavelength of our probability amplitudes. We then know that the presence of a potential V will just add to the energy E that we are to use in our wavefunction. Hence, in any smallĀ region, the amplitude will vary as:

aĀ·eāˆ’(i/ħ)[(EintĀ + p2/(2m) + V)Ā·t āˆ’ pāˆ™x]

Ā Now, in the previous post, we explained that the energy conservation principle does not have any impact on the temporal variation of the wavefunction. Hence,Ā EintĀ + p2/(2m) + V effectively remains constant, and the effect of a changing V is on p, and works out through theĀ āˆ’pāˆ™xĀ term in the exponent of our complex-valued wave function. To be specific, in a region where V is larger, p will be smaller and, therefore, we’ll have a larger wavelength Ī» = h/p.Ā [Sorry if you can’t follow here: please do check out what I wrote in my two previous posts. It’s not that difficult.]

The illustration below shows how it works. The wavelength is the distance between successive wave nodes, which you should think of as surfaces where the phase of the amplitude is zero, or as wavefronts. So we’ve got a change in the angle of the wave nodes as well. How does that come about? Look at the two paths a and b: there’s a difference in potential between them, which we denote byĀ Ī”V, and which can be approximated asĀ Ī”V = (āˆ‚V/āˆ‚y)Ā·D, with D the separation between the two paths.

wave mechanics

Now, if EintĀ + p2/(2m) + V is a constant, thenĀ Ī”V must be equal to āˆ’Ī”[p2/(2m)]. Now what can we do with that? We’re talking a differentialĀ really, so we should apply the Ī”f(x) = [df(x)/dx]Ā·Ī”x formula, which gives us:

Ī”[p2/(2m)] = (p/m)Ā·Ī”p = āˆ’Ī”V ⇔ Ī”p =Ā āˆ’(m/p)Ā·Ī”V

[Note that the d[f(x)] = [df(x)/dx]Ā·dx formula is an interesting one: you’ll surely have come across something likeĀ Ī”[1/Ī»], and wondered what you could do with that. Now you know: Ī”[1/Ī»] = Δλ/Ī»2.]

Now, we’re interested in Ī”x, so how can we get some equation for that? Here we need to use the wavenumber k, and the derivation is not so easy—unfortunately! TheĀ (p/m)Ā·Ī”p = āˆ’Ī”V equation tells us that the wave number will be different along the two paths, and we can write the differenceĀ asĀ Ī”k =Ā Ī”(p/ħ) =Ā Ī”p/ħ. Now, we also now that the wavenumber is expressed inĀ radians per unit distance. In addition, we also know that the wavenumber is the derivative of the phase with respect to distance, so we can approximately calculate the amount by which the phase on path b is ‘ahead’ of the phase on path a as the wave leaves the area, as:

Δ(phase) = Δk·w

Now, Ī”k =Ā Ī”p/ħ, so Ī”(phase) = Ī”pĀ·w/ħ = āˆ’[m/(p·ħ)]Ā·Ī”VĀ·w. Now, along path b, we will have some wavenumber k which, as per the definition of k, is equal to the derivative of the phase with respect to distance, so the ratio of theĀ change in phase along path and the incremental distance Ī”x should equal the wave number k along path b, so we can write:

Ā Ī”(phase)/Ī”x = k ⇔ Ī”x = kĀ·Ī”(phase) = (Ī»/2Ļ€)Ā·Ī”(phase) = (ħ/p)Ā·Ī”(phase)

Combining that withĀ Ī”(phase) = āˆ’[m/(p·ħ)]Ā·Ī”VĀ·w identity, we get:

Ī”x =Ā āˆ’(ħ/p)Ā·[m/(p·ħ)]Ā·Ī”VĀ·w =Ā āˆ’[m/p2]Ā·Ī”VĀ·w

Phew!Ā Just hang in there. We’re not done yet. I need to refer to the illustration now to show that Ī”x can also be approximated by D·Γθ, so we write: Ī”x =Ā D·Γθ.Ā [Again: it’s just plain triangle geometry, and the small angle approximation.] Combining both formulas for Ī”x yields:

 Γθ =Ā āˆ’[m/p2]Ā·Ī”VĀ·w/D

Now we can replace p/m by v and use our Ī”V = (āˆ‚V/āˆ‚y)Ā·D approximation to replaceĀ Ī”V/D by (āˆ‚V/āˆ‚y), so we get:

 Γθ =Ā āˆ’[1/(pĀ·v)]Ā·(āˆ‚V/āˆ‚y)Ā·w =Ā āˆ’[w/(pĀ·v)]Ā·(āˆ‚V/āˆ‚y)

So we get the veryĀ same equation for Γθ as the one we got when applying classical mechanics. That’s a great result—also I have to admit that the differential analysis here wasĀ veryĀ painful and, hence, difficult to reproduce. In any case, what you should be interested in, is the result: as Feynman summarizes it,

We get the same particle motions we get from Newton’sĀ F= mĀ·a law, provided we assume that a potential contributes a phase to the probability amplitude equal toĀ VĀ·t/ħ. Hence, in the classical limit, quantum mechanics agrees with Newtonian mechanics.”

However, he also cautions that “the result we have just got is correct only if the potential variations are slow and smooth”, which actually defines what is referred to as the “classical limit”.

Let me, to conclude this post, present another example out of Feynman’s ‘illustrations’ of what a field or a potential does with amplitudes. Trust me: we’ll need it later. As Feynman is really succinct here, I will just copy it. The quote describes what happens in aĀ magneticĀ field, as opposed to the electrostatic field which I mentioned above. Just read it: it’s easy. [If it’s too small, just click on it, and the text size will be OK.] However, while it’s easy to understand, the consequences, on which I’ll write a separate post, are quite deep.

text

So… Well… That’s really it for today. šŸ™‚

Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/

Potential energy and amplitudes: energy conservation and tunneling effects

Pre-script (dated 26 June 2020): Our ideas have evolved into a full-blown realistic (or classical) interpretation of all things quantum-mechanical. In addition, I note the dark force has amused himself by removing some material. So no use to read this. Read my recent papers instead. šŸ™‚

Original post:

This post is intended to help you think about, and work with, those mysterious amplitudes. More in particular, I’ll explore how potentialĀ differences change amplitudes. But let’s first recapitulate the basics.

In my previous post, I explained why the young FrenchĀ ComteĀ Louis de Broglie, when writing his PhD thesis back in 1924, i.e. before Schrƶdinger, Born, Heisenberg and others had published their work, boldly proposed to the ω·t āˆ’Ā kĀ·xĀ argument in the wavefunction of a particle with the relativistic invariant product of the momentum and position four-vectorsĀ pμ = (E, p) =Ā (E, px, py,Ā pz,) and xμ = (t, x) =Ā (t, x, y, z), provided the energy and momentum are re-scaled in terms of ħ. Hence, he wrote:

Īø = ω·t āˆ’Ā kĀ·x =Ā (pμxμ)/ħ = (Eāˆ™t āˆ’ pāˆ™x)/ħ = (E/ħ)āˆ™t āˆ’Ā (p/ħ)āˆ™x

As it’s usually instructive to do a quick dimensional analysis, let’s do one here too. Energy is expressed in joule, and dividing it by the quantum of action, which is expressed in jouleĀ·secondsĀ (JĀ·s) gives us the dimension of an (angular) frequency indeed, which, in turns, yields a pure number. Likewise, linear momentum can be expressed in newtonĀ·seconds which, when divided byĀ jouleĀ·secondsĀ (JĀ·s), yields a quantity expressed per meter.Ā Hence, the dimension of p/ħ is m–1, which again yields a pure number when multiplied with the dimension of the coordinates x, y or z.

In the mentioned post, I also gave an unambiguous answer to the question as to what energy concept should be used in the equation: it is theĀ totalĀ energy of the particle we are trying to describe, so that includes its kinetic energy, its rest mass energy and, finally, its potential energy in whatever force field it may find itself, such as a gravitational and/or electromagnetic force field. Now, while we know that, when talking potential energy, we have some liberty in choosing the zero point of our energy scale, this issue is easily overcome by noting that we are always talking about the amplitude to go from one state to another, or to go from one point in spacetime to another. Hence, what matters is the potential difference, really.

Feynman, in his description of the conservation of energy in a quantum-mechanical context, distinguishes:

  1. The rest energyĀ m0āˆ™c2, which he describes as the rest energy ‘of the parts of the particle’. [One should remember he wrote thisĀ beforeĀ the existence of quarks and the associated theory of matter was confirmed.]
  2. The energy ‘over and above’ the rest energy, which includes both the kinetic energy, i.e.Ā māˆ™v2/2 = p2/(2m), as well as the ‘binding and/or excitation energy’, which he refers to as ‘internal energy’.
  3. Finally, there is the potential energy, which we’ll denote by U.

In my previous post, I also gave you the relativistically correct formula for the energy of a particle with momentum p:

Capture

However, we will follow Feynman in his description, who uses the non-relativistic formula Ep = Eint + p2/(2m) + U. This is quite OK if we assume that the classical velocity of our particle does not approach the speed of light, so that covers a rather large number of real-life situations. Also, to make the example more real, we will assume the potential energy is electrostatic, and given by the formula U = q·Φ, with Φ the electrostatic potential (so just think of a number expressed in volt). Of course, q·Φ will be negative if the signs of q (i.e. the electric charge of our particle) and Φ are opposite, and positive if both have the same sign, as opposites attract and like repel when it comes to electric charge.

The illustration below visualizes the situation for Φ2Ā < Φ1. For example, we may assume Φ1Ā is zero, that Φ2Ā is negative, and that our particle is positively charged, so U2Ā = qΦ2Ā < 0. So it’s all rather simple really: we have two areas with a potential equal to U1Ā = qΦ1Ā and U2Ā = qΦ2Ā < 0 respectively. Hence, we need to use E1Ā = EintĀ + p12/(2m) + U1Ā to substitute ω1Ā forĀ E1/ħ in the first area, and then E2Ā = EintĀ + p22/(2m) + U2Ā to substitute ω2Ā forĀ E2/ħ in the second area, which U2 – U1Ā < 0.

potential

The corresponding amplitudes, or wavefunctions, are:

  1. ĪØ1(Īø1) = ĪØ1(x, t) = aĀ·eāˆ’iĪø1Ā = aĀ·eāˆ’i[(EintĀ + p12/(2m) + U1)Ā·t āˆ’ p1āˆ™x]/ħ 
  2. ĪØ2(Īø2) = ĪØ2(x, t) = aĀ·eāˆ’iĪø2Ā = aĀ·eāˆ’i[(EintĀ + p22/(2m)Ā + U2)Ā·t āˆ’ p2āˆ™x]/ħ 

Now how should weĀ thinkĀ about these two equations? We are definitely talkingĀ differentĀ wavefunctions. However, having said that, there is no reason to assume the different potentials would have an impact on the temporalĀ frequency. Therefore, we can boldly equate ω1Ā and ω2Ā and, therefore, write that:

EintĀ + p12/(2m) + U1Ā = Ā EintĀ + p22/(2m) + U2 ⇔ p12/(2m)Ā āˆ’Ā p22/(2m) = U2 – U1Ā < 0

⇒ p12Ā āˆ’Ā p22Ā < 0 ⇔ p2Ā >Ā p1

What this says is that the kinetic energy, and/or the momentum, of our particle is greater in the second area, which is what we would classically expect, as a positive charged particle will pick up speed – and, therefore, momentum and kinetic energy – as it moves from an area with zero potential to an area with negative potential. However, the λ = h/p relation then implies that Ī»2Ā = h/p2Ā is smallerĀ thanĀ Ī»1Ā = h/p2, which is what is illustrated by the dashed lines in the illustration above – which represent surfaces of equal phase, or wavefronts – and also by the second diagram in the illustration, which shows the real part of the complex-valued amplitude and compares the wavelengths Ī»1Ā andĀ Ī»2. [As you know, the imaginary part is just like the real part but with a phase shift equal to Ļ€/2. Ideally, we should show both, but you get the idea.]

To sum it all up, the classical statement energy conservation principle is equivalent to the quantum-mechanical statement that the temporal frequency f or ω, i.e. theĀ time-rate of changeĀ of the phase of the wavefunction, does notĀ change – as long as the conditions do not change with time, of course – but that theĀ spatialĀ frequency, i.e. the wave number k or the wavelength Ī» – changes as the potential energy and/or kinetic energy change.

Tunneling

TheĀ p12/(2m)Ā āˆ’Ā p22/(2m) = U2 – U1Ā equation may be re-written to illustrate the quantum-mechanical effect of tunneling, i.e. theĀ penetration of a potential barrier. Indeed, we can re-writeĀ p12/(2m)Ā āˆ’Ā p22/(2m) = U2 – U1Ā as

p22Ā = 2mĀ·[p12/(2m) āˆ’Ā (U2 – U1)]

and, importantly, try to analyze what happens if U2 – U1Ā is largerĀ thanĀ p12/(2m), so we get a negativeĀ value forĀ p22. Just imagine that Φ1Ā is zero again, and that our particle is positively charged, but that Φ2Ā is also positive (instead of negative, as in the example above), so our particle is being repelled. In practical terms, it means that our particle just doesn’t have enough energy to “climb the potential hill”. Quantum-mechanically, however, the amplitude is still given by that equation above, and we have a purely imaginary number for p2, as the square root of a negativeĀ number is a purely imaginary number, justĀ like āˆšāˆ’4 = 2i. So let’s denote p2Ā asĀ iĀ·p’ and let’s analyze what happens by breaking ourĀ aĀ·eiĪø2Ā function up in two separate parts by writing: aĀ·eāˆ’iĪø2Ā = aĀ·eāˆ’i[(E2/ħ)āˆ™t āˆ’Ā (iĀ·p’/ħ)x]Ā =Ā aĀ·eāˆ’i(E2/ħ)āˆ™tĀ·ei2Ā·p’Ā·x/ħ =Ā aĀ·eāˆ’i(E2/ħ)āˆ™tĀ·eāˆ’p’Ā·x/ħ.

Now, theĀ eāˆ’p’Ā·x/ħ factor in our formula for aĀ·eāˆ’iĪø2Ā is a real-valuedĀ exponential function, and it’s a decreasing function, with the same shape as the general eāˆ’xĀ function, which I depict below.

graph

This eāˆ’p’Ā·x/ħ basically ‘kills’ our wavefunction as we move in the positive x-direction, past the potential barrier, which is what is illustrated below.

potential barrier

However, the story doesn’t finish here. We may imagine that the region with the prohibitive potential is rather small—like a few wavelengths only—and that, past that region, we’ve got another region where p22Ā = 2mĀ·[p12/(2m) āˆ’Ā (U2 – U1)] is notĀ negative. That’s the situation that’s depicted below, which also shows whatĀ mightĀ happen: the amplitude decays exponentially, but doesĀ notĀ reach zero and, hence, there is a possibility that a particle might make it through the barrier, and that it will be found on the other side, with a real-valued and positive momentum and, hence, with a regular wavefunction.

potential barrier 2

Feynman gives a very interesting example of this: alpha-decay.Ā Alpha decayĀ is a type of radioactive decay in which an atomic nucleus emits an α-particle (so that’s aĀ helium nucleus, really), thereby transforming or ‘decaying’ into an atom with a reduced mass and atomic number. The Wikipedia article on it hais not bad, but Feynman’s explanation is more to the point, especially when you’ve understood all of the above. The graph below illustrates the basic idea as it shows the potential energy U of an α-particle as a function of the distance from the center. As Feynman puts it: “If one tried to shoot an α-particle with the energyĀ E into the nucleus, it would feel an electrostatic repulsion from the nucleusĀ and would, classically, get no closer than the distance r1,Ā where its total energy is equal to U. Closer in, however, the potential energy is much lower because of the strong attraction of the short-range nuclear forces.Ā How is it then that in radioactive decay we find α-particles which started out inside the nucleus coming out with the energyĀ E? Because they start out with the energyĀ E inside the nucleus and ā€œleakā€ through the potential barrier.”

potential energy

As for the numbers involved, theĀ mean life of an α-particle in the uranium nucleus is as long asĀ 4.5Ā billion years, according to Feynman, whereas the oscillations inside the nucleus are in the range of 1022Ā cycles per second! So how can one get a number like 109Ā years fromĀ 10āˆ’22Ā seconds? The answer, as Feynman notes, is that that exponential gives a factor of about eāˆ’45. So that gives the very small but definite probability of leakage. Once the α-particle is in the nucleus, there is almost no amplitude at all for finding it outside. However, if you take many nuclei and wait long enough, you’ll find one. šŸ™‚

Now, that should be it for today, but let meĀ end this post with something I should have told you a while ago, but then I didn’t, because I thought it would distract you from the essentials. If you’ve read my previous post carefully, you’ll note that I wrote the wavefunction as ĪØ(Īø) =Ā aĀ·eiĪø, rather asĀ aĀ·eāˆ’iĪø, with the minus sign in front of the complex exponent. So why is that?

There is a long and a short answer to that. I’ll give the short answer. You’ll remember that the phase of our wavefunction is like the hand of a stopwatch. Now we could imagine a stopwatch going counter-clockwise, and we could actually make one. Now, there is no arbitrariness here: it’s one way or the other, depending on our other conventions, and the phase of our complex-valued wavefunction does actually turn clockwiseĀ if we write things the way we’re writing them, rather than anti-clockwise. That’s a direction that’s actually not as per the usual mathematical convention: an angle in the unit circle is usually measured counter-clockwise. If you’d want it that way,Ā we can fix easily by reversing the signs inside of the bracket, so we could write Īø =Ā kĀ·xĀ āˆ’Ā Ļ‰Ā·t, which is actually what you’ll often see. But so there’s only way to get it right: there’s a direction to it, and if we use theĀ Īø = ω·t āˆ’Ā kĀ·x, then we need the minus sign in the ĪØ(Īø) =Ā aĀ·eāˆ’iĪøĀ equation.

It’s just one of those things that is easy to state, but actually gives us a lot of food for thought. Hence, I’ll probably come back to this one day. As for now, however, I think you’ve had enough. Or I’ve had enough, at least. šŸ™‚ I hope this was not too difficult, and that you enjoyed it.

Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/

Amplitudes, wavefunctions and relativity – or the de Broglie equation re-visited

Pre-script (dated 26 June 2020): Our ideas have evolved into a full-blown realistic (or classical) interpretation of all things quantum-mechanical. In addition, I note the dark force has amused himself by removing some material. So no use to read this. Read my recent papers instead. šŸ™‚

Original post:

My previous posts were rather technical and, hence, I thought I’d re-visit a topic on which I’ve written before – but represent it from another angle: the de Broglie equation. You know it by heart: itĀ associates a wavelength (Ī») with the momentum (p) of a particle: λ = h/p. It’s a simple relationship: the wavelength and the momentum are inversely proportional, and the constant of proportionality is Planck’s constant.Ā It’sĀ an equation you’ll find in all of theĀ popular accounts of quantum mechanics. However, I am of the opinion that the equation may actually notĀ help novices to understand what quantum mechanics is all about—at least not in an initial approach.

One barrier to a proper understanding is that the de Broglie relation is always being presented as the twin of the Planck-Einstein relation for photons, which relates the energy (E) of a photon to its frequency (n): E = hāˆ™Ī½ = Ä§āˆ™Ļ‰Ā [i]. It’s only natural, then, to try to relate the two equations, as momentum and energy are obviously related one to anotyher. But how exactly? What energy concept should we use? Potential energy? Kinetic energy? Should we include the equivalent energy of the rest mass?

One quickly gets into trouble here. For example, one can try the kinetic energy, K.E. = māˆ™v2/2 and use the definition of momentum (p = māˆ™v) to write E = p2/(2m), and then relate the frequency ν to the wavelength λ using the general rule that the traveling speed of a wave is equal to the product of its wavelength and its frequency (v = Ī»āˆ™Ī½). But if E = p2/(2m) and ν = v/Ī», we get:

p2/(2m) = hāˆ™v/Ī» ⇔ Ī» = 2āˆ™h/p

So that is almost right, but not quite: that factor 2 should not be there. In fact, it’s easy to see that we’d get de Broglie’s equation if we’d use E = māˆ™v2 rather than E = māˆ™v2/2. But E = māˆ™v2? How could we possibly justify the use of that formula? There’s something weird here—something deep, but I will probably die before I figure out exactly what. šŸ™‚

Note: I should make a reference to the argument of the wavefunction here: EĀ·t āˆ’pĀ·x. [The argument of the wavefunction has a 1/ħ factor in front, but we assume we measure both E as well as p in units of ħ here.] Ā So that’s an invariant quantity, i.e. it doesn’t change under a relativistic transformation of the reference frame. Now, if we measure time and distance in equivalent units, so cĀ = 1, then we can show that E/p = 1/v. [Remember: if c would not be one, we’d write: E·β = pĀ·c, with β = v/c, i.e. the relativeĀ velocity of our particle, as measured as aĀ ratioĀ of the speed of light.] As EĀ·t āˆ’ pĀ·x, is an invariant quantity, it’s some constantĀ that’s characteristic of the particle. But x and t change as the clock is tick. Well… Yes. But if we believe the particle is somehow real, and its velocity is v, then the ratio of theĀ realĀ positionĀ x and theĀ time t should be equal toĀ vĀ = x/t. Hence, for these very special positionsĀ x, i.e. theĀ realĀ position of the particle, we can equateĀ EĀ·t āˆ’pĀ·x to EĀ·t āˆ’pĀ·vĀ·t = EĀ·t āˆ’mĀ·vĀ·vĀ·t = (E āˆ’ māˆ™v2)Ā·t. So there we have the māˆ™v2Ā factor. ThereĀ Ā  must be something very deep about it, but, as mentioned above, I will probably die before I figure out exactly what. šŸ™‚

The second problem is the interpretation of Ī». Of course, Ī» is just a length in space, which we can relate to the spatial frequency or wavenumber k = 2Ļ€/λ [ii]. And, of course, the frequency and the wavelength are, once again, related through the traveling speed of the wave: v = Ī»āˆ™Ī½. But then, when you think about it, it’s actuallyĀ not that simple: the wavefunction of a particle is much more complicated. For starters, you should think of it as a wave packet, or wavetrain, i.e. a composite wave: a sum of a potentially infinite number of elementary waves. So we do not have a simple periodic phenomenon here: we need to distinguish the so-called group velocity from the phase velocity, and we’re also talking a complex-valued wavefunction, so it’s all quite different from what we’re used to.

But back to the energy concept. We have Einstein’s E = māˆ™c2 = m0āˆ™Ī³āˆ™c2 equation, of course, from which the relativistically correct momentum-energy relationship can be derivedĀ [iii]:

Capture

Ep is the energy of a particle with momentum p, and the relationship establishes a one-to-one relationship between the energy and the momentum of a particleĀ [iv], with the rest mass (or rest energy) E02 = m0āˆ™c2 appearing as a constant (the rest mass does not depend on the reference frame – per definition). However, you can try this formula too, but it will not give you the de Broglie relation. In short, it doesn’t help us in terms of understanding what the de Broglie relation is all about.

So how did this young nobleman, back in 1924, as he was writing his PhD thesis, get this Ī» = h/p or – using the wavenumber – the k = p/ħ equation?

Well… The relativity theory had been around for quite a while and, amongst other things (including the momentum-energy relationship above), it had also established the invariance of the four-vector product pμxμ = Eāˆ™t – pāˆ™x = pμ‘xμ‘ = E’āˆ™t’ – p’āˆ™x’.

Now, any regular sinusoidal wave is associated with a phase Īø = ωt – kāˆ™x, and then quantum theory had associated a complex-valued wavefunction ĪØ(Īø) = ĪØ(ωt – kāˆ™x) with a particle, and so the young count, Louis de Broglie, saw the mathematical similarity between the Eāˆ™t – pāˆ™x and ωt – kāˆ™x expressions, and then just took the bold step of substituting ω and k for E/ħ and p/ħ respectively in the ĪØ(ωt – kāˆ™x) function.Ā That’s it really: as the laws of physics should look the same, regardless of our frame of reference, he realized the argument of the wavefunction needed to be some invariant quantity, and so that’s what he gets through this substitution.

Of course, the substitution makes sense: it has to. To show why, let’s consider the limiting situation of a particle with zero momentum – so it’s at rest, really – but assuming that the probability of finding it at some point in space, at some point of time is equally distributed over space and time. So we have the rather nonsensical but oft-used wavefunction:

ĪØ(Īø) = ĪØ(x, t) = aĀ·eiĪøĀ = aĀ·ei(ωt – kāˆ™x)

Taking the absolute square of this yields a constant probability equal to a2Ā indeed. The equation is non-sensical or – to put it more politely – a limiting caseĀ only because it assumes perfect knowledge about the particle’s momentum p (we said it was zero, exactly) and, hence, about its energy. So we don’t need to worry about any Ī” here: Ī”p = Ī”E = 0 and Ep = E0. As mentioned,Ā we know that doesn’t make much sense, and that a particle in free space will actually be represented by a wave train with a group velocity v (i.e. the classical speed of the particle) and a phase velocity, and so that’s where the modeling of uncertainty comes in, but let’s just go along with the example now. [Note that, while p = 0, that does notĀ imply that E0Ā is equal to zero. Indeed, there’s energy in the rest mass and possibly potential energy too!]

Now, if we substitute ω and k for E/ħ and p/ħ respectively – note that we did away with the bold-face k and x, so we’ve reduced the analysis to one dimension (x) only – we get:

ĪØ(Īø) = ĪØ(x, t) = aĀ·eiĪøĀ = aĀ·ei(E0t – pāˆ™x)/ħ =Ā aĀ·ei(E0/ħ)t

What this means is that the phase Īø = (E0/ħ)Ā·t doesĀ notĀ depend on x: it only varies in time. Hence, the diagram below – don’t look at the x’ and t’ right now: that’s another reference frame that we’ll introduce in a moment – shows equal-phase lines parallel to the x-axis and, because of the Īø = (E0/ħ)Ā·t equation, they’re equally spaced in time, i.e. in the t-coordinate.

phase

Of course, a particle at rest in one reference frame – let’s say S – will appear to be moving in another – which we’ll denote as S’, so we have the primedĀ coordinates x’ and t’, which are related to the x and t by theĀ Lorentz transformationĀ rules. It’s easy to see that the points of equal phase have aĀ different spacing along the t’-axis, so the frequency in time must be different. Indeed, we’ll write that frequency as ω’ = Ep‘/ħ in the S’ reference frame.

Likewise, we see that the phase now does vary in space, so the probability amplitude does vary in space now, as the particle’s momentum in the primed reference frame is no longer zero: if we write it as p’, then weĀ can write that Īø = (E0/ħ)Ā·t = (Ep‘/ħ)Ā·t āˆ’ (p/ħ)āˆ™x = (Ep‘/ħ)Ā·t āˆ’ (p/ħ)āˆ™x. Therefore, our wavefunction becomes:

ĪØ(Īø) = ĪØ(x, t) = aĀ·eiĪøĀ = aĀ·ei(E0/ħ)tĀ =Ā ĪØ(x’, t’) =Ā aĀ·ei(Ep‘Ā·t’ āˆ’ p’āˆ™x’)/ħ 

We could introduce yet another reference frame, and we’d get similar results. The point is: the k in our ĪØ(Īø) = ĪØ(x, t) = aĀ·eiĪøĀ = aĀ·ei(ωt – kāˆ™x)Ā equation is, effectively, equal to k = p/ħ, and that identity holdsĀ in any reference frame.

Now, none of what I wrote above actuallyĀ provesĀ theĀ de BroglieĀ relation: it merelyĀ explainsĀ it. When everything is said and done, theĀ de BroglieĀ relation is aĀ hypothesis, but it is an important one—and it does fit into the overall quantum-mechanical or wave-mechanical approach, that physicists take for granted nowadays.

So that’s it, really. I have nothing more to write but I should, perhaps, just remind you about what I said about a ‘particle wave’: we should look at it as some composite wave. Indeed, there will be uncertainty, and the uncertainty in E implies a frequency range Δω = Ī”(E/ħ) = Ī”E/ħ. Likewise, the momentum will be unknown, and so we’ll have a spread in the wavenumber k as well. We write: Ī”k = Ī”(p/ħ) = Ī”p/ħ. We can try to reduce this uncertainty, but the Uncertainty Principle gives us the limits: theĀ Ī”pĀ·Ī”x and/or the Ī”EĀ·Ī”t products cannot be smaller than ħ/2.

So we’ll have a potentially infinite number of waves with slightly different values for ω and k, whose sum may be visualized as a complex-valued traveling wavetrain, like the lump below.

Photon wave

All of the component waves necessarily need to travel at the same speed, and this speed, which is the ratio of ω and k, will be equal to the so-called phase velocityĀ of the wave (vp), which we can calculate as vpĀ = ω/k = (E/ħ)/(p/ħ) = E/p = (mĀ·c2)/(mĀ·v) = c2/v. This speed is superluminal (c2/vĀ = c/β, with β = v/c < 1), but that isĀ not in contradiction with special relativity because the phase velocity carries no ā€˜signal’ or ā€˜information’. As for the group velocity (vg), we can effectively see that this is equal to the classical velocity of our ‘particle’ by noting that:

vgĀ = dω/dk = d(E/ħ)/d(p/ħ) = dE/dp = d[(p2/(2m)]/dp = p/m = v.

You may think there’s some cheating here, as we equate E with the kinetic energy only. You’re right: the total energy should also include potential energy and rest energy, so E is a sum, but then rest energy and potential energy are treated as constants and, hence, it’s only the kinetic energy that matters when taking the derivative with respect to p, and so that’s why get the result we get, which makes perfect sense.

I wanted this to be a very short post, and so I will effectively end it here. I hope you enjoyed it. It actually sets the stage for a more interesting discussion, and that’s a discussion on how aĀ changeĀ in potential energy effectivelyĀ changesĀ the phase and, hence, the amplitude. But so that’s for next time. I also need to devote a separate post on a discussion of the wave-mechanical framework in general, with a particular focus on the math behind. So… Well… Yes, the next posts are likely to be somewhat more technical again. :-/

[i] I should make a note on notations here, and also insert some definitions. I will denote a frequency by ν (nu), rather than by f, so as to not cause confusion with any function f. A frequency is expressed in cycles per second, while the angular frequency ω is expressed in radians per second. One cycle covers 2Ļ€ radians and, therefore, we can write: ν = ω/2Ļ€. Hence, hāˆ™Ī½ = hāˆ™Ļ‰/2Ļ€ = Ä§āˆ™Ļ‰. Both ν as well as ω measure the time-rate of change of the phase, as opposed to k, i.e. the spatial frequency of the wave, which depends on the speed of wave. I will also use the symbol v for the speed of a wave, although that is hugely confusing, because I will also use it to denote the classical velocity of the particle. However, I find the conventional use of the symbol of c even more confusing, because this symbol is also used for the speed of light, and the speed of a wave is not necessarily equal to the speed of light. In fact, both the group as well as the phase velocity of a particle wave are very different from the speed of light. The speed of a wave and the speed of light only coincide for electromagnetic waves and, even then, it should be noted that photons also have amplitudes to travel faster or slower than the speed of light.

[ii] Note that the de Broglie relation can be re-written as k = p/ħ, with ħ = h/2Ļ€. Indeed, λ = 2Ļ€/k and, hence, we get: λ = h/p ⇔ 2Ļ€/k = h/p ⇔ p = Ä§āˆ™k ⇔ k = p/ħ.

[iii] One gets the equation by equating m to m = m0γ (γ is the Lorentz factor) in the E2 = (mc2)2 equation, and re-arranging.

[iv] Note that this energy formula does not include any potential energy: it is (equivalent) rest mass plus kinetic energy only.

Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/

Atomic magnets: precession and diamagnetism

This and the next posts will further build on the concepts introduced in my previous post on particle spin. This post in particular will focus on some of the math we’ll need to understand what quantum mechanics is all about. The first topic is about the quantum-mechanical equivalent of the phenomenon of precession. The other topics are… Well… You’ll see… šŸ™‚

The Larmor frequency

The motion of a spinning object in a force field is quite complicated. In our post on gyroscopes, we introduced the concepts of precession and nutation. The concept of precession is illustrated below for the Earth as well as for a spinning top. In both cases, the external force is just gravity.

precession_earth

Nutation is an additional movement: on top of the precessional movement, a spinning object may wobble, as illustrated below.

17_Precession and Nutation

There seems to be no analog for nutation in quantum mechanics. In fact, the terms nutation and precession seem to be used interchangeably in quantum physics, although they are very different in classical physics. But let’s not complicate things and, hence, talk about the phenomenon of precession only.

We will not re-explain the phenomenon of precession here but just remind you that the phenomenon can be described in terms of (a) the angle between the symmetry axis and the momentum vector, which we’ll denote by Īø, and (b) the angular velocity of the precession, which we’ll denote by ω= dφ/dt, as shown below. The J in the illustration below is the angular momentum of the object. Hence, if we’d imagine it to be an electron, then J would be the spin angular momentum only, not its orbital angular momentum—although the analysis would obviously be valid for the orbital and/or total angular momentum as well.

precession

OK. Let’s look at what’s going on. The angular displacement – which is also, rather confusingly, referred to as the angle of precession ā€“ in the time interval Ī”t is, obviously, equal to Δφ = ωpĀ·Ī”t. Now, looking at the geometry of the situation, and using the small-angle approximation for the sine, one can also see that Ī”J ā‰ˆ (JĀ·sinĪø)Ā·(ωpĀ·Ī”t). In fact, going to the limit (i.e. for infinitesimally small Ī”φ and Ī”J), we can write:

dJ/dt = ωpĀ·JĀ·sinĪø

But the angular momentum cannot change if there’s no torque. In fact, the time rate of change of the angular momentum is equal to the torque. [You should look this up but, if you don’t want to do that, note that this is just the equivalent, for rotational motion, of the F = dp/dt law for linear motion.] Now, in my post on magnetic dipoles, I showed that the torque Ļ„ on a loop of current with magnetic moment Ī¼ in an external magnetic field B  is equal to Ļ„ = μ×B. So the magnitude of the torque is equal to |Ļ„| = |μ|Ā·|B|Ā·sinĪø = μ·BĀ·sinĪø. Therefore, Ļ‰pĀ·JĀ·sinĪø = μ·BĀ·sinĪø and, hence,

ω= μ·B/J

However, from the general Ī¼/J = –gĀ·(qe/2m) equation we derived in our previous post, we know that μ/J ā€“ for an atomic magnet, that is – must be equal to μ/J = gĀ·qe/2m. So we get the formula we wanted to get here:

ω= gĀ·(qe/2m)Ā·B

This equation says that the angular velocity of the precession is proportional to the magnitude of the external magnetic field, and that the constant of proportionality is equal to gĀ·(qe/2m). It’s good to do the math and actually calculate the precession frequency fp = ωp/2Ļ€. It’s easy. We had calculated qe/2m already: it was equal to 1.6Ɨ10āˆ’19 C divided by 2Ā·9.1Ɨ10āˆ’31 kg, so that’s 0.0879Ɨ1012  C/kg or 0.0879Ɨ1012 (CĀ·m)/(NĀ·s2), more or less. šŸ™‚ Now, g is dimensionless, and B is expressed in tesla: 1 T = (NĀ·s)/(CĀ·m), so we get the sāˆ’1 dimension we want for a frequency. For g = 2 (so we look at the spin of the electron itself only), we get:

fp = Ļ‰p/2Ļ€ = 2Ā·0.0879Ɨ1012/2Ļ€ ā‰ˆ 28Ɨ109 = 28 gigacycles per tesla = 28 GHz/T

This is a number expressed per unit of the magnetic field strength B. Note that you’ll often see this number expressed as 1.4 megacycles per gauss, using the older gauss unit for magnetic field strength: 1 tesla = 10,000 gauss. For a nucleus, we get a somewhat less impressive number because the proton (or neutron) mass is so much bigger: it’s a number expressed in megacycles per tesla, indeed, and for a proton (i.e. a hydrogen nucleus), it’s about 42.58 MHz/T.

Now, you may wonder about the numbers here. Are they astronomical? Maybe. Maybe not. It’s probably good to note that the strength of the magnetic field in medical MRI systems (magnetic resonance imaging systems) is only 1.5 to 3 tesla, so it’s a rather large unit. You should also note that the clock speed of the CPU in your laptop – so that’s the speed at which it executes instructions – is measured in GHz too, so perhaps it’s not so astronomic. I’ll let you judge. šŸ™‚

So… Well… That’s all nice. The key question, of course, is whether or not this classical view of the electron spinning around a proton is accurate, quantum-mechanically, that is. I’ll let Feynman answer that question provisionally:

“According to the classical theory, then, the electron orbits—and spins—in an atom should precess in a magnetic field. Is it also true quantum-mechanically? It is essentially true, but the meaning of the ā€œprecessionā€ is different. In quantum mechanics one cannot talk about the direction of the angular momentum in the same sense as one does classically; nevertheless, there is a very close analogy—so close that we continue to call it precession.”

To distinguish classical and quantum-mechanical precession, quantum-mechanical precession is usually referred to as Larmor precession, and the frequencies above are often referred to as Larmor frequencies. However, I should note that, technically speaking, the term Larmor frequency is actually reserved for the frequency I’ll describe in the next section. I should also note that the Ļ‰= gĀ·(qe/2m)Ā·B is usually written, quite simply, as Ļ‰= γ·B. Of course, the gamma is not the Lorentz factor here, but the so-called gyromagnetic ratio (aka as the magnetogyric ratio): Ī³ = gĀ·(qe/2m). Oh—just so you know: Sir Joseph Larmor was a British physicists and, yes, he developed all of the stuff we’re talking about here. šŸ™‚

At this point, you may wonder if and why all of the above is relevant. Well… There’s more than one answer to this question, but I’d recommend you start with reading the Wikipedia article on NMR spectroscopy. šŸ™‚ And then you should also read Feynman’s exposĆ© on the Rabi atomic or molecular beam method for determining the precession frequency. It’s really fascinating stuff, but you are sufficiently armed now to read those things for yourself, and so I’ll just move on. Indeed, there’s something else I need to talk about here, and that’s Larmor’s Theorem.

Larmor’s Theorem

We’ve been talking single electrons only so far. Now, you may fear that things become quite complicated when many electrons are involved and… Well… That’s true, of course. And then you may also think that things become even more complicated when external fields are involved, like that external magnetic field we introduced above, and that led our electrons to precess at extraordinary frequencies. Well… That’s not true. Here we get some help: Larmor proved a theorem that basically says that, if we can work out the motions of the electrons without the external field, the solution for the motions with the external field is the no-field solution with an added rotation about the axis of the field. More specifically, for an external magnetic field, the added rotation will have an angular frequency equal to:

ω= (qe/2m)Ā·B

So that’s the same formula as we found for the angular velocity of the precession if g = 1, so that’s very easy to remember. The Ļ‰L  frequency, which is the precession frequency for g = 1, is referred to as the Larmor frequency. The proof of the above is remarkably easy, but… Well… I don’t want to copy Feynman here, so I’ll just refer you to the relevant Lecture on it. šŸ™‚

Diamagnetism

I guess it’s about time we relate all of what we learned so far to properties of matter we can relate to, and so that’s what I’ll do here. We’re not going to talk about ferromagnetism here, i.e. the mechanism through which iron, nickel and cobalt and most of their alloys become permanent magnets. That’s quite peculiar and so we will not discuss it here. Here we’ll talk about the very weak quantum-mechanical magnetic effect ā€“ a thousand to a million times less than the effects in ferromagnetic materials ā€“ that occurs in all materials when placed in an external magnetic field.

While the effect is there in all materials, it’s stronger for some than for others. In fact, it’s usually so weak it is hard to detect, and so it’s usually demonstrated using elements for which the diamagnetic effect is somewhat stronger, like bismuth or antimony. The effect is demonstrated by suspending a piece of material in a non-uniform field, as illustrated below. The diamagnetic effect will cause a small displacement of the material, away from the high-field region, i.e. away from the pointed pole.

diamagnetism

I should immediately add that some materials, like aluminium, will actually be attracted to the pointed pole, but that’s because of yet another effect that not all materials share: paramagnetism. I’ll talk about that in another post, together with ferromagnetism. So… Diamagnetism: what is it?

The illustration below shows our spinning electron (q) once again. It also shows a magnetic field B but, unlike our analysis above, or the analysis in our previous post, we assume the external magnetic field is not just there. We assume it changes, because it’s been turned on or off—hopefully slowly: if not, we’d have eddy-current forces causing potentially strong impulses.

diagmagnetism 2But so we’ve got some change in the magnetic flux , and so we know, because of Faraday or Maxwell ā€“ you choose šŸ™‚ ā€“ that we’ll have some circulation of E, i.e. the electric field. The magnetic flux is B times the surface area, and the circulation is the average tangential component E times the length of the path. Because our model of the orbiting electron is so nice and symmetric, we can write Faraday’s Law here as:

EĀ·2π·r = āˆ’d(B·π·r2)/dt ⇔ E = āˆ’(r/2)Ā·dB/dt

A field implies a force and, therefore, a torque on the electron. The torque is equal to the force times the lever arm, so it’s equal to (āˆ’qeĀ·E)Ā·r = āˆ’qeĀ·EĀ·r. Of course, the torque is also equal to the rate of the change of the angular momentum, so dJ/dt must equal:

dJ/dt = āˆ’qeĀ·EĀ·r =  qeĀ·(r/2)Ā·(dB/dt)Ā·r = (qeĀ·r2/2)Ā·(dB/dt)

Now, the assumption is that the field goes from zero to B, so ΔB = B. Therefore, ΔJ must be equal to:

Ī”J = (qeĀ·r2/2)Ā·B

You should, in fact, derive this more formally, by integrating—but let’s keep things as simple as we can. šŸ™‚ What does this formula say, really? It’s the extra angular momentum from the ‘twist’ that’s given to the electrons as the field is turned on. Now, this added angular momentum makes an extra magnetic moment which, because it is an orbital motion, is just qe/2m times the angular momentum that’s already there. But more angular momentum means the magnetic moment has changed, according to the Ī¼ = (qe/2m)Ā·J formula we derived in our previous post, so we have:

Δμ = –(qe/2m)Ā·Ī”J

The minus sign is there because of Lenz’ law: the added momentum is opposite to the magnetic field—and, yes, I know: it’s hard to keep track of all of the conventions involved here. :-/ In any case, we get the following grand equation:

lens

So we found that the induced magnetic moment is directly proportional to the magnetic field B, and opposing it. Now that is what explains why our piece of bismuth does what it does in that non-uniform magnetic field. Of course, you’ll say: why is stronger for bismuth than for other materials? And what about aluminium, or paramagnetism in general? Well… Good questions, but we’ll tackle them in the next posts. šŸ™‚

Let me conclude this post by copying Feynman’s little exposĆ© on why the phenomenon of diamagnetism is so particular. In fact, he notes that, because we’re talking a piece of material here that can’t spin – so it’s held in place, so to say – we should have “no magnetic effects whatsoever”. The reasoning is as follows:

Capture

This is very interesting indeed. This classical theorem basically says that the energy of a system should not be affected by the presence of a magnetic field. However, we know magnetic effects, such as the diamagnetic effect, are there, so these effects are referred to as ‘quantum-mechanical’ effects indeed: they cannot be explained using classical theory only, even if all of what we wrote above used classical theory only.

I should also note another point: why do we need a non-homogeneous field? Well… The situation is comparable to what we wrote on the Stern-Gerlach experiment. If we would have a homogeneous magnetic field, then we would only have a torque on all of the atomic magnets, but no net force in one or the other direction. There’s something else here too: you may think that the forces pointing towards and away from the pointed tip should cancel each other out, so there should actually be no net movement of the material at all! Feynman’s analysis works for one atom, indeed, but does it still make sense if we look at the whole piece of material? It does, because we’re talking an induced magnetic moment that’s opposing the field, regardless of the orientation of the magnetic moment of the individual atoms in the piece of material. So, even if the individual atoms have opposite momenta, the extra induced magnetic moment will point in the same direction for all. So that solves that issue. However, it does not address Feynman’s own critical remark in regard to the supposed ‘impossibility’ of diamagnetism in classical mechanics.

But I’ll let you think about this, and sign off for today. šŸ™‚ I hope you enjoyed this post.

Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/

Spin and angular momentum in quantum mechanics

Note: A few years after writing the post below, I published a paper on the anomalous magnetic moment which makes (some of) what is written below irrelevant. It gives a clean classical explanation for it. Have a look by clicking on the link here !

Original blog post:

Feynman starts his Volume of LecturesĀ on quantum mechanics (so that’s Volume III of the whole series) with the rules we already know, so that’s the ‘special’ math involving probabilityĀ amplitudes, rather than probabilities. However, these introductory chapters assume theoretical zero-spinĀ particles, which means they don’t have any angularĀ momentum. While that makesĀ itĀ much easierĀ to understand the basics of quantum math, realĀ elementary particles do have angular momentum, which makes the analysis much more complicated. Therefore, Feynman makes it veryĀ clear, after his introductory chapters, that he expects all prospective readers of his third volume to first work their way through chapter 34 and 35 of the second volume, which discusses the angular momentum of elementary particles from both a classical as well as a quantum-mechanical perspective. So that’s what we will do here.Ā I have to warn you, though: while the mentioned two chapters are more generous with text than other textbooks on quantum mechanicsĀ I’ve looked at, the matter is still quite hard to digest. By way of introduction, Feynman writes the following:

“The behavior of matter on a small scale—as we have remarked many times—is different from anything that you are used to and is very strange indeed. Understanding of these matters comes very slowly, if at all. One never gets a comfortable feeling that these quantum-mechanical rules are ‘natural’. Of course they are, but they are not natural to our own experience at an ordinary level. The attitude that we are going to take with regard to this rule about angular momentum is quite different from many of the other things we have talked about. We are not going to try to ‘explain’ it but tell you what happens.”

I personally feel it’s not all as mysterious as Feynman claims it to be, but I’ll let you judge for yourself. So let’s just go for it and see what comes out. šŸ™‚

Atomic magnets and the g-factor

When discussing electromagnetic radiation, we introduced the concept of atomic oscillators. It was a very useful model to help us understand what’s supposed to be going on. Now we’re going to introduce atomic magnets. It is based on the classical idea of an electron orbiting around a proton. Of course, weĀ knowĀ this classical idea is wrong: we don’t have nice circular electron orbitals, and our discussion on the radius of an the electron in our previous post makes it clear that the idea of the electron itself is rather fuzzy.Ā Nevertheless, the classical concepts used to analyze rotation are also used, mutatis mutandis,Ā in quantum mechanics. Mutatis mutandis means: with necessary alterations. So…Ā Well… Let’s go for it. šŸ™‚Ā The basic idea is the following: an electron in a circular orbit is a circular current and, hence, it causes a magnetic field, i.e. a magnetic flux through the area of the loop—as illustrated below.

magneticdipole2

As such, we’ll have a magnetic (dipole) moment, and you may want to review my post(s) on that topicĀ so as to ensure you understand what follows. The magnetic moment (μ)Ā is the product of the current (I) and the area of the loop (π·r2), and its conventional direction is given by the μ vectorĀ in the illustration below, which also shows the other relevant scalar and/or vector quantities, such as the velocity v and the orbital angular momentum J. The orbital angular momentum is to be distinguished from theĀ spinĀ angular momentum, which results from the spin around its own axis. So the spin angular momentum – which is often referred to as theĀ spinĀ tout court – isĀ notĀ depicted below, and will only be discussed in a few minutes.

atomic magnet

Let me interject something on notation here. Feynman’s always uses J, for whatever momentum. That’s not so usual. Indeed, if you’d google a bit, you’ll see the convention is to useĀ SĀ andĀ LĀ respectively to distinguish spin and orbital angular momentum respectively. If we’d use S and L, we can write theĀ totalĀ angular momentum as J = S + L, and the illustration below shows how the S and L vectors are to be added. It looks a bit complicated, so you can skip this for now and come back to it later. But just try to visualize things:

  1. The L vector is moving around, so that assumes the orbital plane is moving around too. That happens when we’d put our atomic system in a magnetic field. We’ll come back to that. In what follows, we’ll assume the orbital plane isĀ notĀ moving.
  2. The S vector here is also moving, which also assumes the axis of rotation is notĀ steady. What’s going on here is referred to asĀ precession, and we discussed it when presenting the math one needs to understand gyroscopes.
  3. Adding S and L yields J, the totalĀ angular momentum. Unsurprisingly, this vector wiggles around too. Don’t worry about the magnitudes of the vectors here. Also, in case you’d wonder why the axis of symmetry for the movement of theĀ J vector happens to be theĀ JzĀ axis, the answer is simple: we choseĀ the coordinate system so as to ensure that was the case.

250px-LS

But I am digressing. I just inserted the illustration above to give you an inkling of where we’re going with this. Indeed, what’s shown above will make it easier for you to see how we can generalize the analysis that we’ll do now, which is an analysis of the orbital angular momentum and the related magnetic moment only. Let me copy the illustration we started with once more, so you don’t have to scroll up to see what we’re talking about.

atomic magnet

So we have a charge orbiting around some center. It’s a classical analysis, and so it’s really like a planet around the Sun, except that we should remember that likes repel, and opposites attract, so we’ve got a minus sign in the force law here.

Let’s go through the math. TheĀ magnetic moment is the current times the area of the loop. As the velocity is constant, the current is just the charge qĀ times the frequency of rotation. The frequency of rotation is, of course, the velocity (i.e. the distance traveled per second) divided by the circumference of the orbit (i.e. 2π·r). Hence, we write: I = (qeĀ·v)/(2π·r) and, therefore: μ =Ā (qe/Ā·v)·π·r2)/(2π·r) = qeĀ·vĀ·r/2. Note that, as per the convention, current is defined as a flow ofĀ positiveĀ charges, so the illustration above actually assumes we’re talking some protonĀ in orbit, so q = qeĀ would be the elementary charge +1. If we’d be talking an electron, then its charge is to be denoted as –qeĀ (minusĀ qe, i.e.Ā āˆ’1), and we’d need to reverse the direction of μ, which we’ll do in a moment. However, to simplify the discussion, you should just think of some positiveĀ charge orbiting the way it does in the illustration above.

OK. That’s all there’s to say about the magnetic moment—for the time being, that is. Let’s think about the angular momentum now. It’sĀ orbital angular momentum here, and so that’s the type of angular momentum we discussed in our post on gyroscopes. We denoted it as LĀ indeed – i.e. not asĀ J, but that’s just a matter of conventions – and we noted thatĀ L could be calculated as the vector cross product of the position vectorĀ r and the momentum vector p, as shown in the animation below, which also shows the torqueĀ vectorĀ Ļ„.

Torque_animation (1)

The angular momentum LĀ changesĀ inĀ the animation above. InĀ our J case above, it doesn’t. Also, unlike what’s happening with the angular momentum of that swinging ball above, the magnitudeĀ of our J doesn’t change. It remains constant, and it’s equal to |J| = J =Ā |rƗp| =Ā |r|Ā·|p|Ā·sinĪø = rĀ·p = rĀ·mĀ·v. One should note this is a non-relativistic formula, but as the relative velocity of an electron v/cĀ is equal to the fine-structure constant, so that’s Ī±Ā ā‰ˆ 0.0073 (see my post on the fine-structure constantĀ if you wonder where this formula comes from), it’s OK to notĀ include the Lorentz factor in our formulas as for now.

Now, as I mentioned already, the illustration we’re using to explain μ andĀ JĀ is somewhat unreal because it assumes a positive charge q, and so μ and JĀ point in the same direction in this case, which is notĀ the case if we’d be talking an actual atomic system with an electron orbiting around a proton. But let’s go along with it as for now and so we’ll put the required minusĀ sign in later. We can combine the J =Ā rĀ·mĀ·v and μ =Ā qĀ·vĀ·r/2 formulas to write:

μ = (q/2m)·J or μ/J = (q/2m) (electron orbit)

In other words, theĀ ratioĀ of the magnetic moment and the angular moment depends on (1) the charge (q) and (2)Ā the mass of the charge, and on those two variablesĀ only. So the ratio does not dependĀ on the velocity v nor on the radius r. It can be noted that the q/2m factor is often referred to as the gyromagnetic factor (not to be confused with the g-factor, which we’ll introduce shortly). It’s good to do a quick dimensional check of this relation: the magnetic moment is expressed in ampĆØreĀ perĀ secondĀ times the loop area, so that’s (C/s)Ā·m2. On the right-hand side, we have the dimension of the gyromagnetic factor, which is C/kg, times the dimension of the angular momentum, which is mĀ·kgĀ·m/s, so we have the same units on both sides: CĀ·m2/s, Ā which is often written as jouleĀ perĀ teslaĀ (J/T): the jouleĀ is the energy unit (1 J = 1 NĀ·m), and the tesla measures the strength of the magnetic field (1 T = 1 (NĀ·s)/(CĀ·m). OK. So that works out.

So far, so good. TheĀ story is a little bit different for the spinĀ angular momentum and theĀ spin magnetic moment.Ā The formula is the following:

μ = (q/m)·J (electron spin)

This formula says that the μ/J ratio is twiceĀ what it is for the orbital motion of the electron. Why is that? Feynman says “the reasons are pure quantum-mechanical—there is no classical explanation.” So I’d suggest we just leave that question open for the moment and see if we’re any wiser once we’ve worked ourselves through all of his LecturesĀ on quantum physics. šŸ™‚ Let’s just go along with it as for now.

Now,Ā we can write both formulas – i.e. the formula for the spin and the orbital angular momentum – in a more general way using the format below:

μ = –gĀ·(qe/2me)Ā·J

Why the minus sign? Well… I wanted to get the sign right this time. Our model assumed someĀ positiveĀ charge in orbit, but so we want a formula for a atomic system, and so our circling charge should be anĀ electron. So the formula above is the formula for a electron, and the direction of the magnetic moment and of the angular motion will be oppositeĀ for electrons: it just replaces q by –qe. The format above also applies to anyĀ atomic system: as Feynman writes, “For an isolated atom, the direction of the magnetic moment will always be exactly opposite to the direction of the angular momentum.”Ā So the g-factor will beĀ characteristic of the state of the atom. It will beĀ 1Ā for a pure orbital moment,Ā 2 for a pure spin moment, or some other number in-between for a complicated system like an atom, indeed.

You may have one last question: whyĀ qe/2m instead of qe/m in the middle? Well… If we’d take qe/m, then g would be 1/2 for the orbitalĀ angular momentum, and the initial idea with g was that it would be some integer (we’ll quickly see that’s an idea only). So… Well… It’s just one more convention. Of course, conventions are not always respected so sometimes you’ll see the expression above written without the minus sign, so you may see it as μ = gĀ·(qe/2me)Ā·J. In that case, the g-factor for our example involving the spinĀ angular momentum and the spinĀ magnetic moment, will obviously have to be written asĀ minusĀ 2.

Of course, it’s easy to see that the formula for the spin of aĀ protonĀ will look the same, except that we should take the mass of the proton in the formula, so that’sĀ mpĀ instead of me. Having said that, the elementary charge remains what it is, but so we write it without the minus sign here. To make a long story short, the formula for the proton is:Ā 

μ = g·(qe/2mp)·J

OK. That’s clear enough. For electrons, the g-factor is referred to as the LandĆ© g-factor, while the g-factor for protons or, more generally, for any spinningĀ nucleus, is referred to as theĀ nuclearĀ g-factor. Now, you may or may not be surprised, but there’sĀ a g-factorĀ for neutrons too, despite the fact that they do not carry a netĀ charge: the explanation for it must have something to do with the quarks that make up the neutron but that’s a complicated matter which we will notĀ get into here. Finally, there is a g-factor for a whole atom, or a whole atomic system, and that’s referred to as… Well… Just the g-factor. šŸ™‚ It’s, obviously, a number that’s characteristic of the state of the atom.

So… This was a big step forward. We’ve got all of the basics on that ‘magical’ spin number here, and so I hope it’s somewhat less ‘magical’ now. šŸ™‚ Let me just copy the values of the g-factor for some elementary particles. It also shows how hard physicists have been trying to narrow down the uncertainty in the measurement. Quite impressive! The table comes from the Wikipedia article on it. I hope the explanations above will now enable you to read and understand that. šŸ™‚

g-factor

Let’s now move on to the next topic.

Spin numbers and states

Of course, we’re talking quantum mechanics and, therefore, J can only take on a finite number of values. While that’s weird – as weird as other quantum-mechanical things, such as the boson-fermion math, for example – it should not surprise us. As we will see in a moment, the values of J will determine the magneticĀ energy our system will acquire when we put in some magnetic field and, as Feynman writes: “That the energy of an atom in the magnetic field can have only certain discrete energies is really not more surprising than the fact that atoms in general have only certain discrete energy levels. Why should the same thing not hold for atoms in a magnetic field? It does. It is just correlation of this with theĀ idea of an oriented magnetic moment that brings out some of the strange implications of quantum mechanics.” Yep. That’s true. We’ll talk about that later.

Of course, you’ll probably want some ‘easier’ explanation. I am afraid I can’t give you that. All I can say is that, perhaps, you should think of ourĀ discussion on the fine-structure constant, which made it clear that the various radii of the electron, its velocity and its mass and/or energy are all related one to another and, hence, that they can only take on certain values. Indeed, of all the relations we discussed, there’s two you should alwaysĀ remember. The first relationship is theĀ UĀ = (e2/r) = α/r. So that links the energy (which we can express in equivalent mass units), the electron charge and its radius. The second thing you should remember is that the Bohr radius and the classical electron radius are also related through α: α  Ā re/r = α2. So you may want to think of theĀ different values for J as being associated with different ‘orbitals’, so to speak. But that’s a very crude way of thinking about it, so I’d say: just accept the fact and see where it leads us. You’ll see, in a few moments from now, that theĀ whole thing is not unlike the quantum-mechanical explanation of the blackbody radiation problem, which assumes that the permitted energy levels (or states) are equally spaced and hĀ·f apart, with fĀ the frequency of the light that’s being absorbed and/or emitted. So the atom takes up energies onlyĀ hĀ·f at a time. Here we’ve got something similar: the energy levels that we’ll associate with the discrete values of J – or J‘sĀ componentsĀ , I should say – will also be equally spaced. Let me show you how it works, as that will make things somewhat more clear.

If we have an object with a given total angular momentum J in classical mechanics, then any of its components x, y or z, could take on any value from +J toĀ āˆ’J. That’s not the case here. The rule is that the ‘system’ – the atom, the nucleus, or anything really – will have a characteristic number, which is referred to as the ‘spin’ of the system and, somewhat confusingly, it’s denoted by jĀ (as you can,Ā it’s extremely important, indeed, to distinguish capital letters (like J) from small letters (like j)Ā if you want to keep track of what we’re explaining here). Now, if we have that characteristic spin number j, then any component of JĀ (think of theĀ z-direction, for example) can take on only (one of) the following values:

permitted values

Note that we will always have 2j + 1 values. For example, if j = 3/2, we’ll have 2Ā·(3/2) + 1 = 4 permitted values, and in the extreme case where j is zero, we’ll still have 2Ā·0 + 1 = 1 permitted value: zero itself. So that basically says we have no angular momentum. […] OK. That should be clear enough, but let’s pause here for a moment and analyze this—just to make sure we ‘get’ this indeed. What’s being written here? WhatĀ areĀ those numbers? Let’s do a quick dimensional analysis first. BecauseĀ j, jĀ āˆ’ 1, jĀ āˆ’ 2, etcetera are pure numbers, it’s only the dimension of ħ that we need to look at. We know ħ: it’s the Planck constant h, which is expressed inĀ jouleĀ·second, i.e. JĀ·s =Ā NĀ·mĀ·s, divided by 2Ļ€.Ā Ā That makes sense, because we get the same dimension for the angular momentum. Indeed, the L or J =Ā rĀ·mĀ·v formula also gives us the dimension of physical action, i.e. NĀ·mĀ·s. Just check it: [r]Ā·[m]Ā·[v] = mĀ·kgĀ·m/s = mĀ·(NĀ·s2/m)Ā·m/sĀ = NĀ·mĀ·s. Done!

So we’ve got some kind ofĀ unit of actionĀ once more here, even if it’s not h but ħ = h/2Ļ€. That makes it a quantum of action expressed for a radian, so that’s a unit of length, rather than for a full cycle.Ā Just so you know, ħ = h/2Ļ€ is 1Ɨ10āˆ’34Ā JĀ·sĀ ā‰ˆĀ 6.6Ɨ10āˆ’16Ā eVĀ·s, and we could chose to express the components of J in terms of h by multiplying the whole thing with 2Ļ€. That would boil down to saying that our unit length is not unity but the unit circle, which is 2Ļ€ times unity. Huh?Ā Just think about it: h is a fundamental unit linked to one full cycle of something, so it all makes sense. Before we move on, you may want to compare the value of h or ħ with the energy of a photon, which is 1.6 to 3.2 eV in the visible light spectrum, but you shouldĀ note that energy does not have the time dimension, and a second is an eternity in quantum physics, so the comparison is a bit tricky. So… […] Well… Let’s just move on.Ā What about those coefficients? What constraints are there?

Well… The constraint is that the difference between +j and āˆ’j must be some integer, so +jāˆ’(āˆ’j) = 2jĀ must be an integer. That implies that the spin number j is always an integer or a half-integer, depending on whether j is even or odd. Let’s do a few examples:

  1. A lithium (Li-7) nucleus has spin j = 3/2 and, therefore, the permitted values for the angular momentum around any axis (the z-axis, for example) are: 3/2, 3/2āˆ’1=1/2, 3/2āˆ’2=āˆ’1/2, andĀ āˆ’3/2—all times ħ of course! Note that the difference between +j and –j is 3, and each ‘step’ between those two levels is ħ, as we’d like it to be.
  2. The nucleus of the much rarer Lithium-6 isotope is one of the few stable nuclei that has spin j = 1, so the permitted values are 1, 0 and āˆ’1. Again, all needs to be multiplied with ħ to get the actual value for the J-component that we’re looking at. So each step is ‘one’ again, and the total difference (betweenĀ +j and –j) is 2.]
  3. An electron is a spin-1/2 particle, and so there are only two permitted values: +ħ/2 and āˆ’Ä§/2. So there is just one ‘step’ and it’s equal to the whole difference betweenĀ +j and –j. In fact, this is the most common situation, because we’ll be talking elementary fermions most of the time.
  4. Photons are an example of spin-1 ‘particles’, and ‘particles’ with integer spin are referred to as bosons. In this regard, you may heard of superfluid Helium-4, which is caused by Bose-EinsteinĀ condensationĀ near the zero temperature point, and demonstrates the integerĀ spin number ofĀ Helium-4, so it resembles Lithium-6 in this regard.

The four ‘typical’ examples makes it clear that the actual situations that we’ll be analyzing will usually be quite simple: we’ll only have 2, 3 or 4 permitted values only. As mentioned, there is this fundamental dichotomy betweenĀ fermionsĀ andĀ bosons. Fermions have half-integer spin, and all elementaryĀ fermions, such as protons, neutrons, electrons, neutrinos and quarks are spin-1/2 particles. [Note that a proton and a neutron are, strictly speaking, notĀ elementary, as their constituent parts are quarks.] Bosons have integer spin, and the bosons we know of are spin-one particles, (except for the newly discovered Higgs boson, which is an actual spin-zero particle). The photon is an example, but the helium nucleus (He-4) also has spin one, which – as mentioned above – gives rise to superfluidity when its cooled near the absolute zero point.

In any case, to make a long story short, in practice, we’ll be dealing almost exclusively with spin-1, spin-1/2 particles and, occasionally, with spin-3/2 particles. In addition, to analyze simple stuff, we’ll often pretend particles doĀ notĀ have any spin, so our ‘theoretical’ particles will often be spin zero. That’s just to simplify stuff.

We now need to learn how to do a bit of math with all of this. Before we do so, let me make some additional remarks on these permitted values. Regardless of whether or not J is ‘wobbling’ or movingĀ or not – let me be clear: J is not movingĀ in the analysis above, but we’ll discuss the phenomenon of precessionĀ in the next post, and that willĀ involve a J like that J circling around theĀ JzĀ axis, so I am just preparing the terrain here – J‘s magnitude will always be some constant, which we denoted by |J| = J.

Now there’s something really interesting here, which again distinguishes classical mechanics from quantum mechanics. As mentioned, in classical mechanics, any of J‘s components Jx,Ā JyĀ or Jz, could take on any value from +J toĀ āˆ’J and, therefore, the maximum value of any component of J – say Jz – would be equal to J. To be precise, J would be the value of the component of JĀ in the direction of J itself. So, in classical mechanics, we’d write: |J| = +√(JĀ·J) = +√J2Ā =Ā J, and it would be the maximum value of any component of J. But so we said that, if the spin number of J is j, then the maximum value of any component of J was equal to j·ħ.Ā So, naturally, one would think that J = |J| = +√(JĀ·J) = +√J2Ā = j·ħ.

However, that’s notĀ the case in quantum mechanics: the maximum value of any component of J is notĀ J = j·ħ but the square root of jĀ·(j+1)·ħ.

Huh?Ā Yes. Let me spell it out:Ā |J| = +√(JĀ·J) = +√J2 ≠ jħ. Indeed, quantum math has many particularities, and this is one of them. The magnitude of JĀ isĀ not equal to the largest possible value of any component of J:

J‘s magnitude is not jħ but √(j(j+1)ħ).

As for the proof of this, let me simplify my life and just copy Feynman here:

proof

The formula can be easily generalized for j ≠ 3/2. Also note that we used a fact that we didn’t mention as yet: all possible values of the z-component (or of whatever component) of J are equally likely.

Now, the result is fascinating, but the implications are even better. Let me paraphrase Feynman as he relates them:

  1. From what we have so far, we can get another interesting and somewhat surprising conclusion. In certain classical calculations the quantity that appears in the final result is the square of the magnitude of the angular momentumĀ J—in other words, Jā‹…JĀ = J2. It turns out that it is often possible to guess at the correct quantum-mechanical formula by using the classical calculation and the following simple rule: Replace J2Ā = Jā‹…JĀ byĀ j(j+1)ħ. This rule is commonly used, and usually gives the correct result.
  2. The second implication is the one we announced already: although we would think classically that the largest possible value of the anyĀ component ofĀ JĀ is just the magnitude ofĀ J, quantum-mechanically the maximum of any component of J is always less than that, because jħ is always less than √(j(j+1)ħ). For example, for j = 3/2 = 1.5, we haveĀ j(j+1) = (3/2)Ā·(5/2) = 15/4 = 3.75. Now, the square root of this value is √3.75 ā‰ˆ 1.9365, so the magnitude of J is about 30% largerĀ than the maximum value of any of J‘s components. That’s a pretty spectacular difference, obviously!Ā  Ā 

The second point is quite deep: it implies that the angular momentum is ‘never completely along any direction’. Why? Well… Think of it: “any of J‘s components” also includes the component in the direction of J itself! But if the maximum value of that component is 30% less than the magnitude of J, what does that mean really? All we can say is that it implies that the concept of the direction of the magnitude itself is actually quite fuzzy in quantum mechanics! Of course, that’s got to do with the Uncertainty Principle, and so we’ll come back to this later.

In fact, ifĀ you look at the math, you may think: what’s that business with those average or expected values? A magnitude is a magnitude, isn’t it? It’s supposed to be calculated from theĀ actualĀ values of Jx, JyĀ andĀ Jz, not from some average that’s based on the (equal) likelihoods of the permitted values. You’re right. Feynman’s derivation here is quantum-mechanical from the start and, therefore, we get a quantum-mechanical result indeed: the magnitude of JĀ is calculated as the magnitude of a quantum-mechanical variable in the derivation above, not as the magnitude of a classical variable.

[…] OK. On to the next.

The magnetic energy of atoms

Before we start talking about this topic, we should, perhaps, relate the angular momentum to the magnetic moment once again. We can do that using the μ = (q/2m)Ā·J and/or μ = (q/m)Ā·JĀ formula (so that’s the simple formulas for the orbital and spin angular momentum respectively) or, else, by using the more general μ = – gĀ·(q/2m)Ā·J formula.

Let’s use the simpler μ = (qe/2m)Ā·JĀ formula, which is the one for the orbitalĀ angular momentum.Ā What’s qe/2m? It should be equal to 1.6Ɨ10āˆ’19Ā C divided by 2Ā·9.1Ɨ10āˆ’31Ā kg, so that’s about 0.0879Ɨ1012Ā  C/kg, or 0.0879Ɨ1012Ā (CĀ·m)/(NĀ·s2). Now we multiply by ħ/2 ā‰ˆ 0.527Ɨ10āˆ’34Ā JĀ·s. We get something like 0.0463Ɨ10āˆ’22Ā m2Ā·C/s or J/T. These numbers are ridiculously small, so they’re usually measured in terms of a so-called naturalĀ unit: the Bohr magneton, which I’ll explain in a moment but so here we’re interested in its value only, which is μBĀ =Ā 9.274Ɨ10āˆ’24Ā J/T. Hence, μ/μBĀ = 0.5 = 1/2. What a nice number!

Hmm… This cannot be a coincidence… […] You’re right. It isn’t. To get the full picture, we need to include the spinĀ angular momentum, so we also need to see what the μ = (q/m)Ā·JĀ will yield. That’s easy, of course, as it’s twice the value of (q/2m)Ā·J, so μ/μBĀ = 1, and so the totalĀ is equal to 3/2. So the magnetic moment of an electron has the same value (when expressed in terms of the Bohr magneton) as the spinĀ (when expressed in terms of ħ). Now that’s justĀ sweet!

Yes, it is. All our definitions and formulas were formulated so as to make it sweet. Having said that,Ā we do have a tiny little problem. If we use the general μ = āˆ’gĀ·(q/2m)Ā·J to write the result we found for the spinĀ of the electron only (so we’re notĀ looking at the orbital momentum here), then we’d write: μ = 2Ā·(q/2m)Ā·JĀ = (q/m)Ā·JĀ and, hence, the g-factor here isĀ āˆ’2. Yes. We know that. You told me so already. What’s the issue? Well… The problem is: experiments reveal the actual value of g isĀ not exactly āˆ’2: it’s āˆ’2.00231930436182(52) instead, with the last two digits (in brackets) the uncertainty in the current measurements. Just check it for yourself on the NIST website. šŸ™‚ [Please do check it: it brings some realness to this discussion.]

Hmm…. The accuracy of the measurement suggests we should take it seriously, even if we’re talking a difference of 0.1% only. We should. It can be explained, of course: it’s something quantum-mechanical. However, we’ll talk about this later. As for now, just try to understand the basics here. It’s complicated enough already, and so we’ll stay away from the nitty-gritty as long as we can.

Let’s now get back to the magneticĀ energyĀ of our atoms.Ā From our discussion on theĀ torqueĀ on a magnetic dipole in an external magnetic field, we know that our magnetic atoms will have someĀ extraĀ magnetic energy when placed in an external field. So now we have an externalĀ magnetic field B, and we derived the formula for the energy is

UmagĀ =Ā āˆ’Ī¼Ā·BĀ·cosĪø = āˆ’Ī¼Ā·B

I won’t explain the whole thing once again, but it might help to visualize the situation, which we do below. The loop here is not circular but square, and it’s a current-carrying wire instead of an electron in orbit, but I hope you get the point.

Geometry 2

We need to chose some coordinate system to calculate stuff and so we’ll just choose our z-axis along the direction of the external magnetic field B so as to simplify those calculations. If we do that, we can just take the z-component of μ and then combine the interim result with our general μ = – gĀ·(q/2m)Ā·J formula, so we write:

UmagĀ =Ā āˆ’Ī¼zĀ·B =Ā gĀ·(q/2m)Ā·JzĀ·B

Now, we know that theĀ maximumĀ value of JzĀ is equal to j·ħ, and so the maximum value of UmagĀ will be equalĀ g(q/2m)jħB. Let’s now simplify this expression by choosing some natural unit, and that’s the unit we introduced already above: the Bohr magneton. It’s equal to (qeħ)/(2me) and its value is μBĀ ā‰ˆĀ 9.274Ɨ10āˆ’24Ā J/T. So we get the result we wanted, and that is:

formula

Let me make a few remarks here. First on that magneton: you should note there’s also something which is known as theĀ nuclearĀ magneton which, you guessed it, is calculated using theĀ proton charge and the proton mass: μNĀ = (qpħ)/(2mp)Ā ā‰ˆ 5.05Ɨ10āˆ’27Ā J/T. My second remark is a question: what does that formulaĀ mean, really? Well… Let me quote Feynman on that. The formula basically says the following:

“The energy of an atomic system is changed when it is put in a magnetic field by an amount that is proportional to the field, and proportional to Jz. We say that the energy of an atomic system is ‘split’ into 2jĀ + 1Ā ‘levels’ by a magnetic field. For instance, an atom whose energy is U0Ā outside a magnetic field and whose jĀ isĀ 3/2, will have four possible energies when placed in a field. We can show these energies by an energy-level diagram like that drawn below. Any particular atom can have only one of the four possible energies in any given field B. That is what quantum mechanics says about the behavior of an atomic system in a magnetic field.”

diagram 1

Of course, the simplest ‘atomic’ system is a single electron, which has spin 1/2 only (like most fermions really: the example in the diagram above, with spin 3/2, would be that Li-7 system or something similar). If the spin is 1/2, then there are only two energy levels, with JzĀ = ±ħ/2 and, as we mentioned already, the g-factor for an electron is āˆ’2 (again, the use of minus signs (or not) is quite confusing: I am sorry for that),Ā and soĀ our formula above becomes very simple:

Umag = ± μB·B

The graph above becomes the graph below, and we can now speak more loosely and say that the electron either has its spin ‘up’ (so that’s along the field), or ‘down’ (so that’s opposite the field).

diagram 2

By now, you’re probably tired of the math and you’ll wonder: how can weĀ proveĀ all of this permitted value business? Well… That question leads me to the last topic of my post: the Stern-Gerlach experiment.

The Stern-Gerlach experimentĀ 

Here again, I can just copy straight of out of Feynman, and so I hope you’ll forgive me if I just do that, as I don’t think there’s any advantage to me trying to summarize what he writes on it:

“The fact that the angular momentum is quantized is such a surprising thing that we will talk a little bit about it historically. It was a shock from the moment it was discovered (although it was expected theoretically). It was first observed in an experiment done in 1922 by Stern and Gerlach. If you wish, you can consider the experiment of Stern-Gerlach as a direct justification for a belief in the quantization of angular momentum. Stern and Gerlach devised an experiment for measuring the magnetic moment of individual silver atoms. They produced a beam of silver atoms by evaporating silver in a hot oven and letting some of them come out through a series of small holes. This beam was directed between the pole tips of a special magnet, as shown in the illustration below. Their idea was the following. If the silver atom has a magnetic moment μ, then in a magnetic fieldĀ BĀ it has an energyĀ āˆ’Ī¼zB, where zĀ is the direction of the magnetic field. In the classical theory, μzĀ would be equal to the magnetic moment times the cosine of the angle between the moment and the magnetic field, so the extra energy in the field would be

Ī”U =Ā āˆ’Ī¼Ā·BĀ·cosĪø

Of course, as the atoms come out of the oven, their magnetic moments would point in every possible direction, so there would be all values ofĀ Īø. Now if the magnetic field varies very rapidly withĀ z—if there is a strong field gradient—then the magnetic energy will also vary with position, and there will be a force on the magnetic moments whose direction will depend on whether cosineĀ ĪøĀ is positive or negative. The atoms will be pulled up or down by a force proportional to the derivative of the magnetic energy; from the principle of virtual work,

Capture

Stern and Gerlach made their magnet with a very sharp edge on one of the pole tips in order to produce a very rapid variation of the magnetic field. The beam of silver atoms was directed right along this sharp edge, so that the atoms would feel a vertical force in the inhomogeneous field. A silver atom with its magnetic moment directed horizontally would have no force on it and would go straight past the magnet. An atom whose magnetic moment was exactly vertical would have a force pulling it up toward the sharp edge of the magnet. An atom whose magnetic moment was pointed downward would feel a downward push. Thus, as they left the magnet, the atoms would be spread out according to their vertical components of magnetic moment. In the classical theory all angles are possible, so that when the silver atoms are collected by deposition on a glass plate, one should expect a smear of silver along a vertical line. The height of the line would be proportional to the magnitude of the magnetic moment. The abject failure of classical ideas was completely revealed when Stern and Gerlach saw what actually happened. They found on the glass plate two distinct spots. The silver atoms had formed two beams.

That a beam of atoms whose spins would apparently be randomly oriented gets split up into two separate beams is most miraculous. How does the magnetic moment know that it is only allowed to take on certain components in the direction of the magnetic field? Well, that was really the beginning of the discovery of the quantization of angular momentum, and instead of trying to give you a theoretical explanation, we will just say that you are stuck with the result of this experiment just as the physicists of that day had to accept the result when the experiment was done. It is an experimental fact that the energy of an atom in a magnetic field takes on a series of individual values. For each of these values the energy is proportional to the field strength. So in a region where the field varies, the principle of virtual work tells us that the possible magnetic force on the atoms will have a set of separate values; the force is different for each state, so the beam of atoms is split into a small number of separate beams. From a measurement of the deflection of the beams, one can find the strength of the magnetic moment.”

I should note one point which Feynman hardly addresses in the analysis above: why do we need a non-homogeneous field? Well… Think of it. The individual silver atoms are not like electrons in some electric field. They are tiny little magnets, and magnets do not behave like electrons. Remember we said there’s no such thing as a magnetic charge? So that applies here. If the silver atoms are tiny magnets, with a magnetic dipole moment, then the only thing they will do is turn, so as to minimize their energy U = āˆ’Ī¼BcosĪø.

That energy is minimized when μ and B are at right angles of each other, so as to make the cosĪø factor zero, which happens when Īø = Ļ€/2. Hence, in a homogeneous magnetic field, we will have a torque on the loop of current – think of our electron(s) in orbit here – but no net force pulling it in this or that direction as a whole. So the atoms would just rotate but not move in our classical analysis here.

To make the atoms themselves move towards or away one of the poles (with or without a sharp tip), the magnetic field must be non-homogeneous, so as to ensure that the force that’s pulling on one side of the loop of current is slightly different from the force that’s pulling (in the opposite direction) on the other side of the loop of current. So that’s why the field has to be non-homogeneous (or inhomogeneous as Feynman calls it), and so that’s why one pole needs to have a sharply pointed tip.

As for the force formula, it’s crucial to remember that energy (or work) is force times distance. To be precise, it’s the ∫Fāˆ™ds integral. This integral will have a minus sign in front when we’re doing work against the force, so that’s when we’re increasing the potential energy of an object. Conversely, we’ll just take the positive value when we’re converting potential energy into kinetic energy. So that explains the F = āˆ’āˆ‚U/āˆ‚z formula above. In fact, in the analysis above, Feynman assumes the magnetic moment doesn’t turn at all. That’s pretty obvious from the Fz = āˆ’āˆ‚U/āˆ‚z = āˆ’Ī¼āˆ™cosĪøāˆ™āˆ‚B/āˆ‚z formula, in which μ is clearly being treated as a constant. So the Fz in this formula is a net force in the z-direction, and it’s crucially dependent on the variation of the magnetic field in the z-direction. If the field would not be varying, āˆ‚B/āˆ‚z would be zero and, therefore, we would not have any net force in the z-direction. As mentioned above, we would only have a torque.

Well… This sort of covers all of what we wanted to cover today. šŸ™‚ I hope you enjoyed it.

Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/

Taking the magic out of God’s number: some additional reflections

Note: I have published a paper that is very coherent and fully explains this so-called God-given number. There is nothing magical about it. It is just a scaling constant. Check it out: The Meaning of the Fine-Structure Constant. No ambiguity. No hocus-pocus.

Jean Louis Van Belle, 23 December 2018

Original post:

In my previous post, I explained why the fine-structure constant α isĀ notĀ a ‘magical’ number, even ifĀ it relates all fundamental properties of the electron: its mass, its energy, its charge, its radius, its photonĀ scattering cross-sectionĀ (i.e. the Bohr radius, or the size of the atom really) and, finally, the couplingĀ constant for photon-electron interactions.Ā The key to such understanding of α was the model of an electron as a tiny ball of charge. As such, we have two energy formulas for it. One is the energy that’s needed to assemble the charge from infinitely dispersed infinitesimal charges, which we denoted as Uelec. The other formula is the energy of the fieldĀ of the tiny ball of charge, which we denoted as Eelec.

The formula for EelecĀ is calculated using the formula for the field momentum of a moving charge and, using the m = E/c2Ā mas-energy equivalence relationship, is equivalentĀ to the electromagnetic mass. We went through the derivation in our previous post, so let me just jot down the result:

emm - 2

The second formula depends on what ball of charge we’re thinking of, because the formulas for a charged sphere and aĀ sphericalĀ shellĀ of charge are different: both have the same structureĀ as the relationship above (so the energy is also proportional to the square of the electron charge andĀ inverselyĀ proportional to the radius a), but the constant of proportionality is different. For a sphere of charge, we write:

Ā f sphre

For a spherical shellĀ of charge we write:

shell

To compare the formulas, you need to note that the square of the electron chargeĀ eĀ in the formula for the field energy isĀ equal to e2Ā = qe2/4πε0Ā = keĀ·qe2. So we multiplyĀ the square of the actualĀ electron charge by the Coulomb constant keĀ = 1/4πε0. As you can see, the three formulas have exactlyĀ the same form then. It’s just the proportionality constant that’s different: it’s 2/3, 3/5 and 1/2 respectively.Ā It’s interesting to quickly reflect on the dimensions here: [ke] ā‰ˆ 9Ɨ109 NĀ·m2/C2, so e2Ā is expressed in NĀ·m2. That makes the units come out alright, as we divide by a (so that’s in meter) and so we get the energy inĀ jouleĀ (which is newtonĀ·meter). In fact, now that we’re here, let’s quickly calculate the value of e2: it’s that keĀ·qe2Ā product, so it’s equal to 2.3Ɨ10āˆ’28Ā NĀ·m2. We can quickly check this value because we know that the classical electron radius is equal to:

classical electron radius

So we divide 2.3Ɨ10āˆ’28Ā NĀ·m2Ā by mec2Ā ā‰ˆ 8.2Ɨ10āˆ’14 J, so we get r0Ā ā‰ˆ 2.82Ɨ10āˆ’15Ā m. So we’re spot on! Why did I do this check? Not really to check what I wrote. It’s more to show what’s going on.Ā We’ve got yet another formula relating the energy and the radius of an electron here, so now we have three. In fact we have more because the formula for UelecĀ depends on the finer details of our model for the electron (sphere versus shell, uniform versus non-uniform distribution):

  1. EelecĀ = (2/3)Ā·(e2/a): This is the formula for the energy of the field, so we may all it isĀ externalĀ energy.
  2. UelecĀ = (3/5)Ā·(e2/a), or UelecĀ = (1/2)Ā·(e2/a): This is the energy needed to assembleĀ our electron, so we might, perhaps, call it itsĀ internalĀ energy. The first formula assumes our electron is a uniformly chargedĀ sphere. The second assumes all charges sit on the surface of the sphere. If we drop the assumption of the charge having to be uniformly distributed, we’ll find yet another formula.
  3. mec2Ā =Ā e2/r0: This is the energy associated with the so-called classical electron radiusĀ (r0) and the electron’s rest mass (me).

In our previous posts, we assumed the last equation was the right one. Why? Because it’s the one that’s been verified experimentally. The discrepancies between the various proportionality coefficients – i.e. the difference between 2/3 and 1, basically – are to be explained because of theĀ binding forces within the electron, without which the electron would just ‘explode’, as the French physicist and polymath Henri PoincarĆ© famously put it.Ā Indeed, if the electron is a little ball of negative charge, the repulsive forces between its parts should rip it apart. So we will not say anything more about this. You can have fun yourself byĀ googlingĀ all the various theories that try to model these binding forces. [I may do the same some day, but now I’ve got other priorities: I want to move to Feynman’s third volume ofĀ Lectures, which is devoted to quantum physicsĀ only, so I look very much forward to that.]

In this post, I just wanted to reflect once more on what constants are reallyĀ fundamentalĀ and what constants are somewhat less fundamental. From all what I wrote in my previous post, I said there were three:

  1. The fine-structure constant α, which is a dimensionless number.
  2. Planck’s constant h, whose dimension is jouleĀ·second, so that’s the dimension ofĀ action.
  3. The speed of light c, whose dimension is that of a velocity.

The three are related through the following expression:

alpha re-expressed

This is an interesting expression. Let’s first check its dimension. We already explained thatĀ e2Ā is expressed inĀ NĀ·m2. That’s rather strange, because it means the dimension of e itself is N1/2Ā·m: what’s the square root of a force of oneĀ newton? In fact, to interpret the formula above, it’s probably better to re-writeĀ e2Ā asĀ e2Ā = qe2/4πε0Ā = keĀ·qe2. That shows you how the electron charge and Coulomb’s constant are related. Of course, they are part and parcel of one and the same force law:Ā Coulomb’s law. We don’t need anything else, except for relativity theory, because we need to explain theĀ magnetic forceĀ as well—andĀ thatĀ we can do because magnetism is just a relativistic effect. Think of the field momentum indeed: the magnetic field comes into play only when we start to move our electron. The relativity effect is captured by cĀ  in that formula for α above. As for ħ, ħ = h/2Ļ€ comes with the E = hĀ·fĀ equation, which links us to the electron’s Compton wavelengthĀ Ī» through theĀ de BroglieĀ relationĀ Ī» = h/p.

The point is: we should probably not look at α as a ‘fundamental physical constant’. It’s e2Ā that’s the third fundamental constant, besides h and c. Indeed, it’s fromĀ e2Ā that all the rest follows: the electron’s internal energy, its external energy, and its radius, and then all the rest by combining stuff with other stuff.

Now, we took the magic out of α by doing what we did in the previous posts, and that’s to combine stuff with other stuff, and so now you may think I am putting the magic back in with that formula for α, which seems to define α in terms of the three mentioned ‘fundamental’ constants. That’s not the case: this relation comes out of all of the other relationships we found, and so it’s nothing new really. It’s actually notĀ a definition of α: it just does what it does, and that’s toĀ relate α to the ‘fundamental’ physical constants behind.

So… No new magic. In fact, I want to close this post by taking away even more of the magic. If you read my previous post, I said that α was ‘God’s cut-off factor’ šŸ™‚ ensuring our energy functions do not blow up, but I also said it was impossible to sayĀ whyĀ he choseĀ 0.00729735256Ā as the cut-off factor. The question is actually easily answered by thinking about thoseĀ twoĀ formulas we had for the internal and external energy respectively. Let’s re-write them in natural units and, temporarily, two different subscripts for α, so we write:

  1. Eelec = αe/r0: This is the formula for the energy of the field.
  2. Uelec = αu/r0: This is the energy needed to assemble our electron.

Both energies are determined by the above-mentioned laws, i.e. Coulomb’s Law and the theory of relativity, so α has got nothing to do what that. However, both energies have to be the same, and so αeĀ has to be equal to αu. In that sense, α is, quite simply, aĀ proportionality constantĀ that achieves that equality. NowĀ thatĀ explains why we canĀ derive α from the three other constants which, as mentioned above, are probablyĀ moreĀ fundamental. In fact, we’ve got only three degrees of freedom here, so if we chose c,Ā h andĀ eĀ as ‘fundamental’, then α isn’t any more.

The underlying deep question behind it all isĀ whyĀ those two energies should be equal.Ā Why would our electron have some internal energy if it’s elementary? The answer to that question is: because it has some non-zero radius, and it has some non-zero radius because we don’t want our formula for the field energy (or the field momentum) to blow up. Now, if it has some radius, then itĀ hasĀ to have some internal energy.

You’ll say: that makes sense, but it doesn’t answer the question. Why would it haveĀ internal energy, with or without a zero radius? If an electron is an elementaryĀ particle, then it’s reallyĀ elementary, isn’t? And so then we shouldn’t try to ‘assemble’ it from an infinite number of infinitesimally small charges. You’re right, and here we can also note that the fact that the electron doesn’t blow up is firm evidence it’s veryĀ elementary indeed.

I should also note that Feynman actually doesn’t talk about the energy that’s needed to assemble a charge: he gets his UelecĀ = (1/2)Ā·(e2/a) by calculating the externalĀ field energy for a spherical shell of charge, and he sticks to it—presumably because it’s the same field for a uniform or non-uniformĀ sphere of charge. He only notes there has to be some radius because, if not, the formula he uses blows up, indeed. So – who knows? – perhaps he doesn’t quite believe that formula for theĀ internalĀ energy is relevant either.

So perhaps there is no internal energy indeed. Perhaps there’s just the energy of the field. So… Well… I can’t say much about this… Except… Well…Ā Perhaps just one more thing. Let me note somethingĀ that, I hope, you noticed as well: theĀ keĀ·qe2 is the numerator in Coulomb’s Law itself. You also know that energy equals force times distance. So if we divide both sides by r0, we get Coulomb’s Law itself FelecĀ = keĀ·qe2/r02. The only thing is: what’s the distance? It’s one charge only, and there is no distance between one charge, is there? Well… Yes and no. I have been thinking that the requirement of the internal and external energies being equal resembles the statement that the forces between two charges are equal and opposite. That ties in with the idea of the internal energy itself: remember we were basically talking forces between infinitesimally small elements of charge within the electron itself? So r0Ā is, perhaps, some averageĀ distance or so. There must beĀ someĀ way of thinking of it like that. But… Well… Which oneĀ exactly?

This kind of reflection may not make sense. Who knows? I obviously need to think all of this through and so this post is, indeed, just a bunch of reflections for which I will have more time later—hopefully. šŸ™‚ Perhaps we’re all just pushing the matter too far. Perhaps we should just accept that the external energy has that 2/3 factor but that the actual energy of the electron should also include the equivalent energy of some binding force that holds the electron together. Well… In any case. That’s all I am going to do on this extremely complicated matter. It’s time to move indeed! So the point to take home here is probably just this:

  1. When calculating the radius of an electron using classical theory, we get in trouble: not only do we find different radii, but the radii that we find do not respect the E =Ā mec2Ā law. It’s only theĀ mec2Ā =Ā e2/r0Ā that’s relativistically correct.
  2. That suggests the electron also has some non-electric mass, which are referred to as ‘binding forces’ or ‘PoincarĆ© stresses’, but which remain to be explained convincingly.
  3. All of this shouldn’t surprise us: for all we know, the electron is something fuzzy. šŸ™‚

So my next posts will focus on the ‘essentials’ preparing for Feynman’s Volume on quantum mechanics. Those ‘essentials’ will still involve some classical stuff but, as you will see, even more contradictions, that – hopefully! – will then be solved in the quantum-mechanicalĀ picture of itĀ all. šŸ™‚

Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 20, 2020 as a result of a DMCA takedown notice from Michael A. Gottlieb, Rudolf Pfeiffer, and The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 20, 2020 as a result of a DMCA takedown notice from Michael A. Gottlieb, Rudolf Pfeiffer, and The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/

Taking the magic out of God’s number

Note: I have published a paper that is very coherent and fully explains this so-called God-given number. There is nothing magical about it. It is just a scaling constant. Check it out: The Meaning of the Fine-Structure Constant. No ambiguity. No hocus-pocus.

Jean Louis Van Belle, 23 December 2018

Original post:

I think the post scriptum to my previous post is interesting enough to separate it out as a piece of its own, so let me do that here. You’ll remember that we were trying to find some kind of a model for the electron, picturing it like a tiny little ball of charge, and then we just applied the classical energy formulas to it to see what comes out of it. TheĀ key formula is the integral that gives us the energy that goes into assembling a charge. It was the following thing:

U 4

This is a double integral which we simplified in two stages, so we’re looking at an integral within an integral really, butĀ we can substitute the integral over the ρ(2)Ā·dV2Ā product by the formula we got for the potential, so we write that as Φ(1), and so the integral above becomes:

U 5Now, thisĀ integral integrates the ρ(1)·Φ(1)Ā·dV1Ā product over all of space, so that’s over all points in space, and so we just dropped the index and wrote the whole thing as the integral of ρ·Φ·dVĀ over all of space:

U 6

We thenĀ established that this integral was mathematically equivalent to the following equation:

U 7

So this integral is actually quite simple: it just integrates E•E = E2Ā over all of space. The illustration below shows E as a function of the distanceĀ rĀ for a sphere of radius R filledĀ uniformlyĀ with charge.

uniform density

So the field (E) goes asĀ rĀ for r ≤ R and as 1/r2Ā for r ≄ R. So, for r ≄ R, the integral will have (1/r2)2Ā = 1/r4Ā in it. Now, you know that the integral of some function is the surface under the graph of that function. Look at the 1/r4 function below: it blows up between 1 and 0. That’s where the problem is: there needs to be some kind of cut-off, because that integral will effectively blow up when the radius of our little sphere of charge gets ‘too small’.Ā So that makes it clear why it doesn’t make sense to use this formula to try to calculate the energy of a point charge. It just doesn’t make sense to do that.

graph

In fact, the need for a ‘cut-off factor’ so as to ensure our energy function doesn’t ‘blow up’ is not because of the exponent in theĀ 1/r4 expression: the need is also there for any 1/r relation, as illustrated below. AllĀ 1/rnĀ function have the sameĀ pivotĀ point, as you can see from the simple illustration below. So, yes, we cannot go all the way to zero from there when integrating: we have to stopĀ somewhere.

graph 2So what’s the ‘cut-off point’? What’s ‘too small’ a radius? Let’s look at the formula we got for our electron as a shellĀ of charge (so the assumption here is that the charge isĀ uniformly distributed on the surfaceĀ of a sphere with radius a):

energy electron

So we’ve got an even simpler formula here: it’s just a 1/rĀ relation (a is rĀ in this formula), not 1/r4. Why is that? Well… It’s just the way the math turns out: we’re integrating over volumes and so that involves an r3Ā factor and so it all simplifies to 1/r, and so that gives us this simple inverselyĀ proportional relationship between U and r, i.eĀ a, in this case. šŸ™‚Ā I copied the detail of Feynman’s calculation in my previous post, so you can double-check it. It’s quite wonderful, really. Look at it again: we have a very simpleĀ inverselyĀ proportional relationship between theĀ radiusĀ of our electron andĀ itsĀ energy as a sphere of charge. We could write it as:

Uelect  = α/a, with α = e2/2

Still… We need the cut-off point’. Also note that, as I pointed out, we don’t necessarily need to assume that the charge in our little ball of charge (i.e. our electron) sits on the surface only: if we’d assume it’s a uniformly charged sphere of charge, we’d just get another constant of proportionality: our 1/2 factor would become a 3/5 factor, so we’d write: UelectĀ Ā = (3/5)Ā·e2/a. But we’re not interested in finding the rightĀ model here. We know theĀ UelectĀ Ā = (3/5)Ā·e2/aĀ gives us a value forĀ aĀ that differs with a 2/5 factor as the classical electron radius. That’s not so bad and so let’s go along with it. šŸ™‚

We’re going to look at the simpleĀ structureĀ of this relation, and all of its implications. The simple equation above says that the energy of our electron is (a) proportional to the square of its charge and (b) inversely proportional to its radius. Now,Ā thatĀ is a very remarkable result. In fact,Ā we’ve seen something like this before, and we were astonished.Ā We saw it when we were discussing the wonderful properties of that magical number, theĀ fine-structure constant, which we also denoted by α. However, because we used α already, I’ll denote the fine-structure constant as αe here, so you don’t get confused. You’ll remember thatĀ the fine-structure constant is a God-like number indeed: it links allĀ of the fundamental properties of the electron, i.e. its charge, its radius, its distance to the nucleus (i.e. the Bohr radius), its velocity, its mass (and, hence, its energy), its de BroglieĀ wavelength. Whatever: all these physical constants are all related through the fine-structure constant.Ā 

In my various posts on this topic, I’ve repeatedly said that, but I never showed why it’s true, and so it was a very magical number indeed. I am going to take some of the magic out now. Not too much but… Well… You can judge for yourself how much of the magic remains after I am done here. šŸ™‚

So, at this stage of the argument, α can be anything, and αeĀ cannot, of course. It’s just that magical number out there, which relates everything to everything: it’s the God-given number we don’t understand,Ā or didn’t understand, I should say. Past tense. Indeed, we’re going to get some understanding here because we know that one of theĀ many expressions involving αe was the following one:

me = αe/re

This says that theĀ massĀ of the electron is equal to the ratio of the fine-structure constant and the electron radius. [Note that we express everything in natural units here, so that’sĀ Planck units. For the detail of the conversion, please see the relevant section on that in my one of my posts on this and other stuff.] In fact, the U =Ā (3/5)Ā·e2/aĀ andĀ meĀ = αe/reĀ relations looks exactlyĀ the same, because one of the other equations involving the fine-structure constant was: αeĀ =Ā eP2. So we’ve got the square of the charge here as well! Indeed, as I’ll explain in a moment, the difference between the two formulas is just a matter of units.

Now, mass is equivalent to energy, of course: it’s just a matter of units, so we can equate meĀ with EeĀ (this amounts to expressing the energy of the electron in aĀ kg unit—bit weird, but OK) and so we get:

Ee = αe/re

So there we have: the fine-structure constant αeĀ is Nature’s ‘cut-off’ factor, so to speak. Why? Only God knows. šŸ™‚ But it’s now (fairly) easy to see why all the relations involving αeĀ are what they are. As I mentioned already, we also know that αeĀ is the square of the electron charge expressed in Planck units, so we have:

 αe = eP2 and, therefore, Ee = eP2/re

Now, you can check for yourself: it’s just a matter of re-expressing everything in standard SI units, and relating eP2Ā to e2, and it should all work: you should get the EelectĀ Ā = (2/3)Ā·e2/aĀ expression. So… Well… At least this takes some of the magic out the fine-structure constant. It’s still a wonderful thing, but so you see that the fundamental relationship between (a) the energy (and, hence, the mass), (b) the radius and (c) the charge of an electron isĀ notĀ something God-given. What’s God-given are Maxwell’s equations, and so theĀ EeĀ = αe/reĀ = eP2/reĀ is just one of the many wonderful things that you can get out of Ā them.

So we found God’s ‘cut-off factor’ šŸ™‚ It’s equal to αeĀ ā‰ˆ 0.0073 = 7.3Ɨ10āˆ’3. So 7.3 thousands of… What? Well… Nothing. It’s just a pure ratio between the energy and the radius of an electron (if both are expressed in Planck units, of course). And so it determines the electron charge (again, expressed in Planck units). Indeed, we write:

eP = √αe

Really? Yes. JustĀ doĀ all these formulas:

ePĀ = √αeĀ ā‰ˆĀ āˆš0.0073Ā·1.9Ɨ10āˆ’18Ā coulombĀ ā‰ˆ 1.6Ā Ć—10āˆ’19 C

Just re-check itĀ withĀ all the known decimals: you’ll see it’s bang on. Let’s look at the EeĀ = meĀ = αe/reĀ ratio once again. What’s theĀ meaningĀ of it? Let’s first calculate the value of reĀ and me,Ā i.e. the electron radius and electron mass expressed in Planck units. It’s equal to the classical electron radius divided by the Planck length, and then the same for the mass, so we get the following thing:

reĀ ā‰ˆ (2.81794Ɨ10āˆ’15Ā m)/(1.6162Ɨ10āˆ’35Ā m) = 1.7435Ɨ1020Ā 

meĀ ā‰ˆ (9.1Ɨ10āˆ’31Ā kg)/(2.17651Ɨ10āˆ’8 kg) = 4.18Ɨ10āˆ’23

αeĀ = (4.18Ɨ10āˆ’23)Ā·(1.7435Ɨ1020) ā‰ˆ 0.0073

It works like a charm, but what does it mean? Well… It’s just a ratio between two physical quantities, and the scaleĀ you use to measure those quantities matters very much. We’ve explained that the Planck mass is a rather large unit at the atomic scale and, therefore, it’s perhaps not quite appropriate to use it here. In fact, out of the many interesting expressions for αe, I should highlight the following one:

αeĀ = e2/(ħ·c) ā‰ˆ (1.60217662Ɨ10āˆ’19 C)2/(4πε0Ā·[(1.054572Ɨ10āˆ’34 NĀ·mĀ·s)Ā·(2.998Ɨ108 m/s)])Ā ā‰ˆ 0.0073 once more šŸ™‚

Note that the elementary charge e is actually equal to qe/4πε0, which is what I am using in the formula. I know that’s confusing, but it what it is. As for the units, it’s a bit tedious to write it all out, but you’ll get there. Note that ε0Ā ā‰ˆ 8.8542Ɨ10āˆ’12Ā C2/(NĀ·m2) so… Well… All the units do cancel out, and we get a dimensionless number indeed, which is what αeĀ is.

The point is: this expression links αeĀ to the the de BroglieĀ relation (p =Ā h/Ī»), with Ī» the wavelengthĀ that’s associated with the electron. Of course, because of the Uncertainty Principle, we know we’re talking some wavelengthĀ rangeĀ really, so we should write the de Broglie relation asĀ Ī”p = hĀ·Ī”(1/Ī»). Now, that, in turn, allows us to try to work out the Bohr radius, which is the other ‘dimension’ we associate with an electron. Of course, now you’ll say: why would you do that. Why would you bring in the de BroglieĀ relation here?

Well… We’re talking energy, and so we have the Planck-EinsteinĀ relation first: the energy of some particle can always be written as the product of hĀ and some frequencyĀ f: E = hĀ·f. The only thing thatĀ de Broglie relation adds is the Uncertainty Principle indeed: the frequencyĀ fĀ will be some frequency range, associated with someĀ momentumĀ range, and so that’s what the Uncertainty Principle really says. I can’t dwell too much on that here, because otherwise this post would become a book. šŸ™‚ For more detail, you can check out one of my many posts on the Uncertainty Principle. In fact, the one I am referring to here has Feynman’s calculation of the Bohr radius, so I warmly recommend you check it out. The thrust of the argument is as follows:

  1. If we assume that (a) an electron takes some space – which I’ll denote by rĀ šŸ™‚ – and (b) that it has some momentum p because of its mass m and its velocity v, then the Ī”xĪ”p = ħ relation (i.e. the Uncertainty Principle in its roughest form) suggests that the order of magnitude of r and p should be related in the very same way. Hence, let’s just boldly write r ā‰ˆ ħ/p and see what we can do with that.
  2. We know that the kinetic energy of our electron equals mv2/2, which we can write as p2/2m so we get rid of the velocity factor.Well… Substituting our p ā‰ˆ ħ/r conjecture, we get K.E. = ħ2/2mr2. So that’s a formula for the kinetic energy. Next is potential.
  3. TheĀ formula for the potential energy is U = q1q2/4πε0r12. Now, we’re actually talking about the size of an atomĀ here, so one charge is the proton (+e) and the other is the electron (–e), so the potential energy is U = P.E. = –e2/4πε0r, with rĀ the ā€˜distance’ between the proton and the electron—so that’s the Bohr radius we’re looking for!
  4. We can now write the total energy (which I’ll denote by E, but don’t confuse it with the electric field vector!) asĀ E = K.E. + P.E. =  ħ2/2mr2 – e2/4πε0r. Now,Ā the electron (whatever it is) is, obviously, in some kind of equilibrium state. Why is that obvious? Well… Otherwise our hydrogen atom wouldn’t or couldn’t exist. šŸ™‚ Hence, it’s in some kind of energy ā€˜well’ indeed, at the bottom. Such equilibrium point ā€˜at the bottom’ is characterized by its derivative (in respect to whatever variable) being equal to zero. Now, the only ā€˜variable’ here isĀ rĀ (all the other symbols are physical constants), so we have to solve for dE/dr = 0. Writing it all out yields:Ā dE/dr = –ħ2/mr3Ā + e2/4πε0r2Ā = 0 ⇔ r =Ā 4πε0ħ2/me2
  5. We can now put the values in:Ā r =Ā 4πε0h2/me2Ā = [(1/(9Ɨ109) C2/NĀ·m2)Ā·(1.055Ɨ10–34Ā JĀ·s)2]/[(9.1Ɨ10–31Ā kg)Ā·(1.6Ɨ10–19Ā C)2]Ā = 53Ɨ10–12Ā m = 53 pico-meter (pm)

Done. We’re right on the spot.Ā The Bohr radius is, effectively, about 53 trillionthsĀ of a meter indeed!

Phew!

Yes… I know… Relax. We’re almost done. You should now be able to figure out why the classical electron radius and the Bohr radius can also be related to each other through the fine-structure constant. We write:

me = α/re = α/α2r = 1/αr

So we get that α/reĀ =Ā 1/αr and, therefore, we get re/r = α2, which explains why α is also equal to the so-called junction number, or the coupling constant, for an electron-photon coupling (see my post on the quantum-mechanical aspects of the photon-electron interaction). It gives a physical meaning to the probability (which, as you know, is the absolute square of the probability amplitude) in terms of the chance of a photon actually ‘hitting’ the electron as it goes through the atom. Indeed, the ratio of the Thomson scattering cross-section and the Bohr size of the atom should be of the same order asĀ re/r, and so that’s α2.

[Note: To be fully correct and complete, I should add that the coupling constant itself is not α2Ā but √α = eP. Why do we have this square root? You’re right: the fact that the probability is the absolute squareĀ of the amplitude explains one square root (√α2Ā = α), but not two. The thing is: the photon-electron interaction consists of twoĀ things. First, the electron sort of ‘absorbs’ the photon, and then it emits another one, that has the same or a different frequency depending on whether or not the ‘collision’ was elastic or not. So if we denote the coupling constant as j, then the whole interaction will have a probability amplitude equal to j2. In fact, theĀ value which Feynman uses in his wonderful popular presentation of quantum mechanics (The Strange Theory of Light and Matter), is āˆ’Ī± ā‰ˆ āˆ’0.0073. I am not quite sure why the minus sign is there. It must be something with the angles involved (the emitted photon will not be following the trajectory of the incoming photon) or, else, with the special arithmetic involved in boson-fermion interactions (we add amplitudes when bosons are involved, butĀ subtractĀ amplitudes when it’s fermions interacting.Ā I’ll probably find out once I am true through Feynman’s third volume ofĀ Lectures, which focus on quantum mechanics only.]

Finally, the last bit of unexplained ‘magic’ in the fine-structure constant is that the fine-structure constant (which I’ve started to write as α again, instead of αe) also gives us the (classical) relative speed of an electron, so that’s its speed as it orbits around the nucleus (according to the classical theory, that is), so we write

α = v/c = β

I should go through the motions here – I’ll probably do so in the coming days – but you can see we must be able to get it out somehow from all what we wrote above. See how powerfulĀ ourĀ Uelect  ∼ e2/a relation really is? It links the electron, charge, its radius and its energy, and it’s all we need to all the rest out of it: its mass, its momentum, its speed and – through the Uncertainty Principle – the Bohr radius, which is the size of the atom.

We’ve come a long way. This is truly a milestone. We’ve taken the magic out of God’s number—to some extent at least. šŸ™‚

You’ll have one last question, of course: if proportionality constants are all about theĀ scaleĀ in which weĀ measureĀ the physical quantities on either side of an equation, is there some way the fine-structure constant would come out differently? That’s the same as asking: what if we’d measure energy in units that are equivalent to the energy of an electron, and the radius of our electron just as… Well… What if we’d equate our unit of distance with the radius of the electron, so we’d write re = 1? What would happen to α? Well…Ā I’ll let you figure that one out yourself. I am tired and so I should go to bed now. šŸ™‚

[…] OK. OK. Let me tell you. It’s not that simple here. All those relationships involving α, in one form or the other, are very deep. They relate a lot of stuff to a lot of stuff, and we can appreciate that only when doing a dimensional analysis. A dimensional analysis of the EeĀ = αe/reĀ = eP2/rĀ yields [eP2/r] = C2/m on the right-hand side and [Ee] = J = NĀ·mĀ on the left-hand side. How can we reconcile both? The coulomb is an SI base unitĀ , so we can’t ‘translate’ it into something with N and m. [To be fully correct, for some reason, theĀ ampĆØreĀ (i.e. coulombĀ per second) was chosen as an SI base unit, but they’re interchangeable in regard to their place in the international system of units: they can’t be reduced.] So we’ve got a problem. Yes. That’s where we sort of ‘smuggled’ the 4πε0 factor in when doing our calculations above. That ε0 constant is, obviously, not ‘as fundamental’ as cĀ or α (just think of the cāˆ’2Ā = ε0μ0 relationship to understand what I mean here) but, still, it was necessary to make the dimensions come out alright: we need the reciprocalĀ dimensionĀ of ε0, i.e. (NĀ·m2)/C2, to make the dimensional analysis work.Ā We get: (C2/m)Ā·(NĀ·m2)/C2Ā = NĀ·m = J, i.e.Ā joule, so that’s the unit in which we measure energy or – using the E = mc2Ā equivalence – mass, which is the aspect of energy emphasizing itsĀ inertia.

So the answer is: no. Changing units won’t change alpha. So all that’s left is to play with it now. Let’s try to do that. Let me first plot that EeĀ = meĀ = αe/re = 0.00729735256/re:

graph 3Unsurprisingly, we find theĀ pivotĀ point of this curve is at the intersection of the diagonal and the curve itself, so that’s at the (0.00729735256,Ā 0.00729735256) point, where slopes are ± 1, i.e. plus or minus unity.Ā What does this show? Nothing much. What?Ā I can hear you: I should be excited because… Well… Yes! Think of it. If youĀ would have to chose a cut-off point, you’d chose this one, wouldn’t you? šŸ™‚ Sure, you’re right. How exciting! Let me show you. Look at it! It proves that God thinks in terms of logarithms. He has chosen α such that ln(E) = ln(α/r) = lnα – lnrĀ = lnα – lnrĀ = 0, so ln α = lnr and, therefore, α = r. šŸ™‚

Huh? Excuse me?

I am sorry. […] Well… I am not, of course… šŸ™‚ I just wanted to illustrate the kind of exercise some people are tempted to do. It’s no use. The fine-structure constant is what it is: it sort ofĀ summarizesĀ an awful lot of formulas. It basically shows what Maxwell’s equation imply in terms of theĀ structureĀ of an atom defined as a negativeĀ charge orbiting around some positive charge. It shows we can get calculate everything as a function of something else, and that’s what the fine-structure constant tells us: it relates everything to everything. However, when everything is said and done, the fine-structure constant shows us two things:

  1. Maxwell’s equations are complete: we canĀ construct a complete model of the electron and the atom, which includes: the electron’s energy and mass, its velocity, its own radius, and the radius of the atom. [I might have forgotten one of the dimensions here, but you’ll add it. :-)]
  2. God doesn’t want our equations to blow up. Our equations are all correct but, in reality, there’s a cut-off factor that ensures we don’t go to the limit with them.

So the fine-structure constant anchors our world, so to speak. In other words: of all the worlds that are possible, we live in this one.

[…] It’s pretty good as far as I am concerned. Isn’t it amazing that our mind is able to justĀ graspĀ things like that? I know my approach here is pretty intuitive, and with ‘intuitive’, I mean ‘not scientific’ here. šŸ™‚ Frankly, I don’t like the talk about physicists “looking into God’s mind.” I don’t think that’s what they’re trying to do. I think they’re just trying to understand the fundamentalĀ unityĀ behind it all. And that’s religion enough for me. šŸ™‚

So… What’s the conclusion? Nothing much. We’ve sort of concluded our description of the classical world… Well… Of its ‘electromagnetic sector’ at least. šŸ™‚ That sector can be summarized in Maxwell’s equations, which describe an infinite world of possible worlds. However, God fixed three constants:Ā h,Ā cĀ and α. So we live in a world that’s defined by this Trinity of fundamental physical constants. Why is it not two, or four?

My guts instinct tells me it’s because we live in three dimensions, and so there’s three degrees of freedom really. But what about time? Time is the fourth dimension, isn’t it? Yes. But time is symmetric in the ‘electromagnetic’ sector: we can reverse the arrow of time in our equations and everything still works. TheĀ arrow of timeĀ involves other theories: statistics (physicists refer to it as ‘statistical mechanics‘) and the ‘weak force’ sector, which I discussed when talking about symmetries in physics. So… Well… We’re not done. God gave us plenty of other stuff to try to understand. šŸ™‚

The classical explanation for the electron’s mass and radius

Feynman’s 28thĀ LectureĀ in his series on electromagnetism is one of the more interesting but, at the same time, it’s one of the fewĀ LecturesĀ that is clearly (out)dated. In essence, it talks about the difficulties involved in applying Maxwell’s equations to theĀ elementary chargesĀ themselves, i.e. the electron and the proton.Ā We already signaled some of these problems in previous posts. For example, in our post on the energy in electrostatic fields, we showed how our formulas for the field energy and/or the potential of a charge blow up when we use it to calculate the energy we’d need toĀ assembleĀ a point charge. What comes out is infinity: āˆž. So our formulas tell us we’d need an infinite amount of energy to assemble a point charge.

Well… That’s no surprise, is it? The idea itself is impossible: how can one have a finite amount of charge in something that’s infinitely small? Something that has no size whatsoever? It’s pretty obvious we get some division by zero there. šŸ™‚ The mathematicalĀ approach is often inconsistent. Indeed, a lot of blah-blahĀ in physics is obviously just about applying formulas to situations that are clearly notĀ within the relevant area of application of the formula.Ā So that’s why I went through the trouble (in my previous post, that is) of explaining you how we getĀ these energy and potential formulas, and that’s by bringing chargesĀ (note the plural) together. Now, we may assume these charges are point charges, but that assumption is not so essential. What I tried to say when being so explicitĀ was the following: yes, aĀ charge causes a field, but theĀ ideaĀ of a potential makes sense only when we’re thinking of placing someĀ other charge in that field. So point charges with ‘infinite energy’ should not be a problem. Feynman admits as much when he writes:

“If the energy can’t get out, but must stay there forever, is there any real difficulty with an infinite energy? Of course, a quantity that comes out infinite may be annoying, but what really matters is only whether there are any observable physical effects.”

So… Well… Let’s see. There’s another, more interesting, way to look at an electron: let’s have a look at the field it creates. A electron – stationary or moving – will create a field in Maxwell’s world, which we know inside out now. So let’s just calculate it. In fact, Feynman calculates it for the unit charge (+1), so that’s a positron. It eases the analysis because we don’t have to drag any minus sign along. So how does it work? Well…

We’ll have anĀ energy flux density vector – i.e. the Poynting vector S – as well as a momentum densityĀ vector g all over space. Both are related through the g = S/c2Ā equation which, as I explained in my previous post, is probably best written as cg = S/c, because we’ve got units then, on both sides, that we can readily understand, like N/m2 (so that’s force per unit area) or J/m3Ā (so that’s energy per unit volume). On the other hand, we’ll need something that’s written as a function of the velocity of our positron, so that’s v, and so it’s probably best to just calculate g, the momentum,Ā which is measured in NĀ·s or kgĀ·(m/s2)Ā·s (both are equivalent units for theĀ momentum p = mv, indeed) per unit volumeĀ (so we need to add a 1/Ā m3Ā to the unit).Ā So we’ll have some integral all over space, but I won’t bother you with it. Why not? Well… Feynman uses a rather particular volume element to solve the integral, and so I want you to focus on the solution. The geometry of the situation, and the solution for g, i.e. the momentum of the field per unit volume,Ā is what matters here.

So let’s look at that geometry. It’s depicted below. We’ve got a radial electric field—a Coulomb field really, because our charge is moving at a non-relativistic speed, so v << c and we can approximate with a Coulomb field indeed. Maxwell’s equations imply that B = vƗE/c2, so g = ε0EƗBĀ is what it is in the illustration below. Note that we’d have to reverse the direction of both E and B for an electron (because it’s negative), but g would be the same. It is directed obliquely toward the line of motion and its magnitude is g = (ε0v/c2)Ā·E2Ā·sinĪø. Don’t worry about it: Feynman integrates this thing for you. šŸ™‚ It’s notĀ thatĀ difficult, but still… To solve it, he uses the fact that the fields are symmetric about the line of motion, which is indicated by the littleĀ arrow around the v-axis, with the Φ symbol next to it (it symbolizes the potential). [The ‘rather particular volume element’ is a ring around the v-axis, and it’s because of this symmetry that Feynman picks the ring. Feynman’s Lectures are not onlyĀ great to learn physics: they’re a treasure drove of mathematical tricks too. :-)]

momentum field

As said, I don’t want to bother you with the technicalities of the integral here. This is the result:

Ā Ā emm

What does this say? It says that the momentum of the field – i.e. the electromagnetic momentum, integrated over all of space – is proportional to the velocity v of our charge. That makes sense: when v = 0, we’ll have an electrostatic field all over space and, hence, some inertia, but it’s only when we try toĀ moveĀ our charge that Newton’s Law comes into play: then we’ll needĀ someĀ forceĀ to overcome that inertia. It all works through the Poynting formula: S = EƗB/μ0. If nothing’s moving, then B = 0, and so we’ll have some EĀ and, therefore, we’ll have field energy alright, but the energyĀ flowĀ will be zero. But when we move the charge, we’re moving the field, and so then B ≠ 0 and so it’s through B that the E in our S equation start kicking in. Does that make sense? Think about it: it’s good to try to visualize things in your mind. šŸ™‚

The constants in the proportionality constant (2e2)/(3ac2) of our p ∼ v formula above are:

  • e2Ā =Ā qe2/(4πε0), withĀ qeĀ the electron charge (without the minus sign) and ε0Ā our ubiquitous electric constant. [Note that, unlike Feynman, I prefer toĀ notĀ write e in italics, so as toĀ notĀ confuse it with Euler’s numberĀ eĀ ā‰ˆĀ 2.71828 etc. However, I know I am not always consistent in my notation. :-/ We don’t need Euler’s number in this post, so e or eĀ is always an expression for the electron charge,Ā notĀ Euler’s number. Stupid remark, perhaps, but I don’t want you to be confused.]
  • a is the radius of our charge—see we got away from the idea of a point charge? šŸ™‚
  • c2Ā is just c2, i.e. our weird constant (the square of the speed of light) which seems to connect everything to everything. Indeed, think about stuff like this: S/g = c2Ā = 1/(ε0μ0).

Now, p = mv, so that formula for p basically says that our elementary charge (as mentioned, g is the same for a positron or an electron: E and BĀ will be reversed, but g is not) has anĀ electromagnetic mass melecĀ equal to:

emm - 2

That’s an amazing result. We don’t need to give our electron anyĀ rest mass: just its charge and its movement will do!Ā Super!Ā So we don’t need any Higgs fields here! šŸ™‚ The electromagnetic field will do!

Well… Maybe. Let’s explore what we’ve got here.

First, let’s compare that radius a in our formula to what’s found in experiments.Ā Huh? Did someone ever try to measure the electron radius? Of course. There are all theseĀ scattering experiments in which electrons get fired at atoms. They can fly through or, else, hit something. Therefore, one can some statistical analysis and determine what is referred to as aĀ cross-section. A cross-section is denoted by the same symbol as the standard deviation: σ (sigma). In any case… So there’s something that’s referred to as the classical electron radius, and it’s equal to the so-calledĀ Thomsom scattering length.Ā Thomson scattering, as opposed to ComptonĀ scattering, is elastic scattering, so it preserves kinetic energy (unlike Compton scattering, where energy gets absorbed and changes frequencies). So… Well… I won’t go into too much detail but, yes, this is theĀ electronĀ radius we need. [I am saying this rather explicitly because there are two other numbers around: the so-called Bohr radiusĀ and, as you might imagine, the ComptonĀ scattering cross-section.]

The Thomson scattering length isĀ 2.82 femtometer (so that’sĀ 2.82Ɨ10āˆ’15Ā m), more or less that is :-), and it’s usually related to theĀ observedĀ electron massĀ meĀ through the fine-structure constant α. In fact, using Planck units, we can write:Ā  reĀ·me= α, which is an amazing formula but, unfortunately, I can’t dwell on it here. Using ordinary m, s, C and what have you units, we can write reĀ as:

classical electron radius

That’s good, because if we equateĀ meĀ and melecĀ and switch melecĀ and a in our formula for melec, we get:

a

So, frankly, we’reĀ spot on!Ā Well… Almost. The two numbers differ by 1/3. But who cares about a 1/3 factor indeed? We’re talking rather fuzzy stuff here – scattering cross-sections and standard deviations and all that – so… Yes. Well done! Our theory works!

Well… Maybe. Physicists don’t think so. They think the 1/3 factor isĀ an issue. It’sĀ sad because it really makes a lot of sense. In fact, the Dutch physicist Hendrik Lorentz – whom we know so well by now šŸ™‚Ā ā€“ had also worked out that, because of the length contraction effect, our spherical charge would contract into an ellipsoid and… Well… He worked it all out, and it was not a problem: he found that the momentum was altered by the factorĀ (1āˆ’v2/c2)āˆ’1/2, so that’s the ubiquitous Lorentz factor γ! He got this formula in the 1890s already, so that’sĀ long before the theory of relativity had been developed. So, many years before Planck and Einstein would come up with theirĀ stuff, Hendrik AntoonĀ Lorentz had the correct formulas already: the mass, or everything really, all should vary with that γ-factor. šŸ™‚

Why bother about the 1/3 factor? [I should note it’s actually referred to as the 4/3 problem in physics.] Well… The critics do have a point: if we assume that (a) an electron isĀ notĀ a point charge – so if we allow it to have some radius a – and (b) that Maxwell’s Laws apply, then we should go all the way. TheĀ energyĀ that’s needed toĀ assembleĀ an electron should then, effectively, be the same as the value we’d get out of those field energy formulas. So what doĀ we get when we apply those formulas? Well… Let me quickly copy Feynman as he does the calculation for an electron, notĀ looking at it as a point particle, but as a tinyĀ shellĀ of charge, i.e. a sphere with all charge sitting on the surface:

Feynman energy

Ā Let me enlarge the formula:

energy electron

Now, if we combine that with our formula forĀ melecĀ above, then we get:

4-3 problem

So that formula does notĀ respect Einstein’s universal mass-energy equivalence formula E = mc2. Now, youĀ willĀ agree that we really want Einstein’s mass-energy equivalence relation to be respected by all, so our electron should respect it too. šŸ™‚ So, yes, we’ve got a problem here, and it’s referred to as theĀ 4/3 problem (yes, the ratio got turned around).

Now, you may think it got solved in the meanwhile. Well… No. It’s still a bit of a puzzle today, and the current-day explanation is not really different from what the French scientist Henri Poincaré proposed as a ‘solution’ to the problem back in the 1890s. He basically told Lorentz the following: “If the electron is someĀ littleĀ ball of charge, then it should explode because of the repulsive forces inside. So there should be some binding forces there, and so that energy explains the ‘missing mass’ of the electron.” So these forces are effectively being referred to as PoincarĆ© stresses, and the non-electromagnetic energy that’s associated with them – which, of course,Ā has to be equal to 1/3 of the electromagnetic energy (I am sure you see why) šŸ™‚Ā ā€“ adds to theĀ totalĀ energy and all is alright now. We get:

U = mc2 = (melec + mPoincaré)c2

So… Yes… Pretty ad hoc.Ā Worse, according to the Wikipedia article on electromagnetic mass, that’s still where we are. And, no, don’t read Feynman’s overview of all of the theories that were around then (so that’s in the 1960s, or earlier). As I said, it’s theĀ oneĀ LectureĀ you don’t want to waste time on. So I won’t do that either.

In fact, let me try to do something else here, and that’s to de-construct the whole argument really. šŸ™‚Ā Before I do so, let me highlight theĀ essenceĀ of what was written above. It’s quite amazing really.Ā Think of it: we say that the mass of an electron – i.e.Ā its inertia, or the proportionality factor in Newton’sĀ F = mĀ·aĀ law of motion – is the energy in the electric and magnetic field it causes. So the electron itself is just aĀ hook for the force law, so to say. There’s nothing there, except for the charge causing the field. But so its mass is everywhere and, hence, nowhere really. Well… I should correct that: the field strength falls of as 1/r2Ā and, hence, the energy flow and momentum density that’s associated with it, falls of as 1/r4, so it falls of veryĀ rapidly and so the bulk of the energy is pretty near the charge. šŸ™‚

[Note: You’ll remember that the field that’s associated with electromagneticĀ radiationĀ falls of as 1/r, not as 1/r2, which is why there is an energy flux there which is never lost, which can travel independently through space. It’s not the same here, so don’t get confused.]

So that’s something to note: theĀ melecĀ = (2cāˆ’2/3)Ā·(e2/a) has the radiusĀ aĀ in it, but that radius is only theĀ hook, so to say. That’s fine, because it is not inconsistent with theĀ ideaĀ of the Thomson scattering cross-section, which is the area that oneĀ canĀ hit. Now, you’ll wonder how one can hit an electron: you can readily imagine an electron beam aimed at nuclei, but how would one hit electrons? Well… You can shoot photons at them, and see if they bounce back elastically or non-elastically. The cross-section area that bounces them off elastically must be pretty ‘hard’, and the cross-section that deflects them non-elastically somewhat less so. šŸ™‚

OK… But… Yes?Ā Hey! How did we get that electron radius in that formula?Ā 

Good question! Brilliant, in fact! You’re right: it’s here that the whole argument falls apart really. We did a substitution. That radius a is the radius of a sphericalĀ shell of chargeĀ with an energy that’s equal to UelecĀ = (1/2)Ā·(e2/a), so there’s another way of stating the inconsistency: the equivalent energy of melecĀ = (2cāˆ’2/3)Ā·(e2)/a) is equal to E =Ā melecĀ·c2Ā = (2/3)Ā·(e2/a) and that’s notĀ the same as UelecĀ = (1/2)Ā·(e2/a). If we take the ratio of UelecĀ and melecĀ·c2Ā =, we get the same factor: (1/2)/(2/3) = 3/4. But… Your question is superb! Look at it: putting it the way we put it reveals the inconsistency in the whole argument. We’re mixing two things here:

  1. We first calculate the momentum density, and the momentum, that’s causedĀ by the unit charge,Ā so we get some energy which I’ll denote as EelecĀ =Ā melecĀ·c2
  2. Now, we then assume this energy must beĀ equal to the energy that’s needed to assembleĀ the unit charge from an infinite number of infinitesimally small charges, thereby also assumingĀ the unit charge is a uniformly charged sphere of charge with radius a.
  3. We then use this radius a to simplify our formula for EelecĀ =Ā melecĀ·c2

Now thatĀ is not kosher, really! First, it’s (a) a lot of assumptions, both implicit as well as explicit, and then (b) it’s, quite simply, not a legit mathematical procedure: calculating the energy in the field, or calculating the energy we need to assemble a uniformly charged sphere of radius a are twoĀ veryĀ different things.

Well… Let me put it differently. We’re using the same laws – it’s all Maxwell’s equations, really – but we should be clear about what we’re doing with them, and those two things are veryĀ different.Ā The legitimate conclusion must be that our a is wrong. In other words, we should not assume that our electron is spherical shell of charge. So then what? Well… We could easily imagine something else, like a uniform or even a non-uniformly charged sphere. Indeed, if we’re just filling empty space with infinitesimally small charge ‘elements’, then we may want to think the density at the ‘center’ will be much higher, like what’s going on when planets form: the density of the inner core of our own planet Earth is more than four times the density of its surface material. [OK. Perhaps not very relevant here, but you get the idea.] Or, conversely, taking into account PoincarĆ©’s objection, we may want to think all of the charge will be on the surface, just like on a perfect conductor, where all charge is surface charge!

Note that the field outside of a uniformly charged sphere and the field of a sphericalĀ shellĀ of charge is exactly the same, so we wouldĀ notĀ find a different number forĀ EelecĀ =Ā melecĀ·c2, but we surely would find a different number for Uelec. You may want to look up some formulas here: you’ll find that the energy of a uniformly distributed sphereĀ of charge (so we do notĀ assume that all of the charge sits on the surface here)Ā is equal to (3/5)Ā·(e2/a). So we’d already have much less of a problem, because the 3/4 factor in the UelecĀ = (3/4)Ā·melecĀ·c2Ā becomes a (5/3)Ā·(2/3) = 10/9 factor. So now we have a discrepancy of some 10% only. šŸ™‚

You’ll say: 10% is 10%. It’s huge in physics, as it’s supposed to be anĀ exact science.Ā Well… It is and it isn’t. Do you realize we haven’t even started to talk about stuff likeĀ spin? Indeed, in modern physics, we think ofĀ electrons as something that also spinsĀ around one or the other axis, so there’s energy there too, and we didn’t include that in our analysis.

In short, Feynman’s approach here is disappointing. NaiveĀ even, but then… Well… Who knows? Perhaps he didn’t do thisĀ LectureĀ himself. Perhaps it’s just an assistant or so.Ā In fact, I should wonder why there’s still physicists wasting time on this!Ā I should also note that naively comparing that a radius with the classical electron radius also makes little or no sense. Unlike what you’d expect, the classical electron radius reĀ and the Thomson scattering cross-section σeĀ are notĀ related like you might think they are, i.e. like σeĀ = π·re2Ā or σeĀ = π·(re/2)2Ā or σeĀ =Ā re2Ā or σeĀ = π·(2Ā·re)2Ā or whatever circular surface calculation rule that might make sense here.Ā No. The Thomson scattering cross-section is equal to:

σeĀ =Ā (8Ļ€/3)Ā·re2Ā = (2Ļ€/3)Ā·(2Ā·re)2Ā =Ā (2/3)·π·(2Ā·re)2Ā ā‰ˆ 66.5Ɨ10āˆ’30Ā m2Ā = 66.5 (fm)2

Why? I am not sure. I must assume it’s got to do with the standard deviation and all that. The point is, we’ve got a 2/3 factor here too, so do we have a problem really? I mean… The a we got was equal to a = (2/3)Ā·re, wasn’t it? It was. But, unfortunately, it doesn’t mean anything. It’s just a coincidence. In fact, looking at the Thomson scattering cross-section, instead of the Thomson scattering radius, makesĀ the ‘problem’ a little bit worse. Indeed, applying the π·r2Ā rule for a circular surface, we get that the radiusĀ would be equal to (8/3)1/2Ā·reĀ ā‰ˆ 1.633Ā·re, so we get something that’s muchĀ largerĀ rather than something that’s smaller here.

In any case, it doesn’t matter. The point is: this kind of comparisons should not be taken too seriously. Indeed, when everything is said and done, we’re comparing three very different things here:

  1. The radius that’s associated with the energy that’s needed toĀ assembleĀ our electron from infinitesimally small charges, and so that’s based onĀ Coulomb’sĀ law and the model we use for our electron: is it a shell or a sphere of charge? If it’s a sphere, do we want to think of it as something that’s of uniform of non-uniform density.
  2. The second radius is associated with the field of an electron, which we calculate using Poynting’s formula for the energy flow and/or the momentum density. So that’s notĀ about the internal structure of the electron but, of course, it would be nice if we could find some model of an electron that matchesĀ thisĀ radius.
  3. Finally, there’s the radius that’s associated with elasticĀ scattering, which is also referred to asĀ hardĀ scattering because it’s like the collision of two hard spheres indeed. But so that’s some value that has to be established experimentallyĀ and so it involves judicious choices because there’s probabilities and standard deviations involved.

So should we worry about the gaps between these three different concepts? In my humble opinion: no. Why? Because they’re all damn close and so we’re actually talking about the same thing. I mean: isn’t terrific that we’ve got a model that brings the first and the second radius together with a difference of 10% only? As far as I am concerned, that shows the theory works. So what Feynman’s doing in that (in)famous chapter is some kind of ‘dimensional analysis’ which confirms rather than invalidatesĀ classical electromagnetic theory. So it shows classical theory’s strength, rather than its weakness. It actually shows our formulaĀ doĀ work where we wouldn’t expect them to work. šŸ™‚

The thing is: when looking at the behavior of electrons themselves, we’ll need a different conceptual framework altogether. I am talking quantum mechanics here. Indeed, we’ll encounter other anomalies than the ones we presented above. There’s the issue of the anomalous magnetic moment of electrons, for example. Indeed, as I mentioned above, we’ll also want to think as electrons as spinning around their own axis, and so that implies some circulationĀ of EĀ that will generate a permanent magnetic dipole moment… […]Ā OK, just think of some magnetic field if you don’t have a clue what I am saying here (but then you should check out my post on it). […] The point is: here too, the so-called ‘classical result’, so that’s its theoreticalĀ value, will differ from the experimentally measured value. Now, the difference here will be 0.0011614, so that’s about 0.1%, i.e. 100 timesĀ smallerĀ than my 10%. šŸ™‚

Personally, I think that’s not so bad. šŸ™‚ But then physicists need to stay in business, of course. So, yes, itĀ isĀ a problem. šŸ™‚

Post scriptum on the math versus the physics

The key to the calculation of the energy that goes into assembling a charge was the following integral:

U 4

This is a double integral which we simplified in two stages, so we’re looking at an integral within an integral really, butĀ we can substitute the integral over the ρ(2)Ā·dV2Ā product by the formula we got for the potential, so we write that as Φ(1), and so the integral above becomes:

U 5Now, thisĀ integral integrates the ρ(1)·Φ(1)Ā·dV1Ā product over all of space, so that’s over all points in space, and so we just dropped the index and wrote the whole thing as the integral of ρ·Φ·dVĀ over all of space:

U 6

We thenĀ established that this integral was mathematically equivalent to the following equation:

U 7

So this integral is actually quite simple: it just integrates E•E = E2Ā over all of space. The illustration below shows E as a function of the distanceĀ rĀ for a sphere of radius R filledĀ uniformlyĀ with charge.

uniform density

So the field (E) goes asĀ rĀ for r ≤ R and as 1/r2Ā for r ≄ R. So, for r ≄ R, the integral will have (1/r2)2Ā = 1/r4Ā in it. Now, you know that the integral of some function is the surface under the graph of that function. Look at the 1/r4 function below: it blows up between 1 and 0. That’s where the problem is: there needs to be some kind of cut-off, because that integral will effectively blow up when the radius of our little sphere of charge gets ‘too small’.Ā So that makes it clear why it doesn’t make sense to use this formula to try to calculate the energy of a point charge. It just doesn’t make sense to do that.

graph

What’s ‘too small’? Let’s look at the formula we got for our electron as a spherical shellĀ of charge:

energy electron

So we’ve got an even simpler formula here: it’s just a 1/rĀ relation. Why is that? Well… It’s just the way the math turns it out. I copied the detail of Feynman’s calculation above, so you can double-check it. It’s quite wonderful, really. We have a very simpleĀ inverselyĀ proportional relationship between theĀ radiusĀ of our electron andĀ itsĀ energy as a sphere of charge. We could write it as:

Uelect  = α/a , with α = e2/2

But – Hey!Ā Wait a minute! We’ve seen something like this before, haven’t we?Ā We did. We did when we were discussing the wonderful properties of that magical number, theĀ fine-structure constant, which we also denoted by α. šŸ™‚ However, because we used α already, I’ll denote the fine-structure constant as αe here, so you don’t get confused. As you can see, the fine-structure constant links allĀ of the fundamental properties of the electron: its charge, its radius, its distance to the nucleus (i.e. the Bohr radius), its velocity, and its mass (and, hence, its energy). So, at this stage of the argument, α can be anything, and αeĀ cannot, of course. It’s just that magical number out there, which relates everything to everything: it’s the God-given number we don’t understand. šŸ™‚ Having said that, it seems like we’re going to get some understanding here because we know that, one theĀ many expressions involving αe was the following one:

me = αe/re

This says that theĀ massĀ of the electron is equal to the ratio of the fine-structure constant and the electron radius. [Note that we express everything in natural units here, so that’sĀ Planck units. For the detail of the conversion, please see the relevant section on that in my one of my posts on this and other stuff.] Now, mass is equivalent to energy, of course: it’s just a matter of units, so we can equate meĀ with EeĀ (this amounts to expressing the energy of the electron in aĀ kg unit—bit weird, but OK) and so we get:

Ee = αe/re

So there we have: the fine-structure constant αeĀ is Nature’s ‘cut-off’ factor, so to speak. Why? Only God knows. šŸ™‚ But it’s now (fairly) easy to see why all the relations involving αeĀ are what they are. For example, we also know that αeĀ is the square of the electron charge expressed in Planck units, so we have:

 α = eP2 and, therefore, Ee = eP2/re

Now, you can check for yourself: it’s just a matter of re-expressing everything in standard SI units, and relating eP2Ā to e2, and it should all work: you should get theĀ UelectĀ Ā = (1/2)Ā·e2/aĀ expression. So… Well… At least this takes some of the magic out the fine-structure constant. It’s still a wonderful thing, but so you see that the fundamental relationship between (a) the energy (and, hence, the mass), (b) the radius and (c) the charge of an electron isĀ notĀ something God-given. What’s God-given are Maxwell’s equations, and so theĀ EeĀ = αe/reĀ = eP2/reĀ is just one of the many wonderful things that you can get out of Ā them.Ā šŸ™‚

Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 17, 2020 as a result of a DMCA takedown notice from Michael A. Gottlieb, Rudolf Pfeiffer, and The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/

Field energy and field momentum

This post goes to the heart of the E = mc2, equation. It’sĀ kindaĀ funny, because Feynman just compresses all of it in a sub-section of hisĀ Lectures. However, as far as I am concerned, I feel it’s a very crucial section. Pivotal, I’d say, which would fit with its place in all of the 115 LecturesĀ that make up the three volumes, which is sort of mid-way, which is where we are here.Ā So let’s get go for it. šŸ™‚

Let’s first recall what we wrote about the Poynting vector S, which we calculate from the magnetic and electric field vectors E and B by taking their cross-product:

S formula

This vector represents the energy flow, per unit area and per unit time, in electrodynamical situations. If E and/or BĀ are zero (which is the case in electrostatics, for example, because we don’t have magnetic fields in electrostatics), then S is zero too, so there is no energy flow then. That makes sense, because we have no moving charges, so where would the energy go to?

I also made it clear we should think of S as something physical, by comparing it to the heat flow vectorĀ h, which we presented when discussing vector analysis and vector operators. The heat flow out of a surface element daĀ is the area times the component ofĀ hĀ perpendicular to da, so that’s (h•n)Ā·da = hnĀ·da. Likewise, we can writeĀ (S•n)Ā·da = SnĀ·da. The units of S and h are also the same:Ā joule per second and per square meterĀ or, using the definition of theĀ wattĀ (1 W = 1 J/s), in watt per square meter.Ā In fact, if you google a bit, you’ll find that both hĀ and S are referred to as aĀ flux density:

  1. The heat flow vector hĀ is the heat flux densityĀ vector, from which we get the heat flux through an area through the (h•n)Ā·da = hnĀ·daĀ product.
  2. The energy flow SĀ is the energy flux density vector, from which we get the energy flux through the (S•n)Ā·da = SnĀ·daĀ product.

So that should be enough as an introduction to what I want to talk about here. Let’s first look at the energy conservation principle once again.

Local energy conservation

In a way, you can look atĀ my previous postĀ as being all about the equation below, which we referred to as the ‘local’ energy conservation law:

energy flux

Of course, it is notĀ theĀ completeĀ energy conservation law. The local energy is not only in the field. We’ve got matter as well, and so that’s what I want to discuss here: we want to look at the energy in the field as well as the energy that’s in the matter. Indeed, field energy is conserved, and then it isn’t: if the field is doing work on matter, or matter is doing work on the field, then… Well… Energy goes from one to the other, i.e. from the field to the matter or from the matter to the field. So we need to include matter in our analysis, which we didn’t do in our last post. Feynman gives the following simple example: we’re in a dark room, and suddenly someone turns on the light switch. So now the room is full of field energy—and, yes, I just mean it’s not dark anymore. :-). So that means someĀ matter out thereĀ must have radiated its energy out and, in the process, it must have lost the equivalent mass of that energy. So, yes, we had matter losing energy and, hence, losing mass.

Now, we know that energy and momentum are related. Respecting and incorporating relativity theory, we’ve got two equivalentĀ formulas for it:

  1. E2Ā āˆ’ p2c2Ā = m02c4
  2. pcĀ = EĀ·(v/c) ⇔ p = vĀ·E/c2Ā = mĀ·v

The EĀ = mc2Ā and m = Ā·m0Ā·(1āˆ’v2/c2)āˆ’1/2Ā formulas connect both expressions. So we canĀ look at it in either of two ways. We couldĀ useĀ the energy conservation law, but Feynman prefers the conservation of momentum approach, so let’s see where he takes us. If the field has some energyĀ (and, hence, some equivalent mass) per unit volume, and if there’s someĀ flow, so if there’s someĀ velocityĀ (which there is: that’s what our previous post was all about), then it will have a certain momentumĀ per unit volume. [Remember: momentum isĀ massĀ timesĀ velocity.] That momentum will have aĀ direction, so it’s a vector, just like p = mv. We’ll write it as g, so we define g as:

g is the momentum of the field per unit volume.

What units would we express it in? We’ve got a bit of choice here. For example, because we’re relating everything to energy here, we may want to convert our kilogram into eV/c2Ā or J/c2Ā units, using the mass-energy equivalence relation EĀ = mc2. Hmm… Let’s first keep the kg as a measure of inertia though. So we write: [g] = [m]Ā·[v]/m3Ā = (kgĀ·m/s)/m3. Hmm… That doesn’t show it’s energy, so let’s replace the kg with a unit that’s got newton and meter in it, cf. the F = ma law. So we write:Ā [g] = (kgĀ·m/s)/m3Ā =Ā (kg/s)/m2Ā = [(NĀ·s2/m)/s]/m2Ā =Ā NĀ·s/m3. Well… OK. The newtonĀ·second is the unit of momentum indeed, and we can re-write it including the joule (1 J = 1 NĀ·m), so then we get [g] = (JĀ·s/m4), so what’s that? Well… Nothing much. However, I do note it happens to be the dimension of S/c2, so that’s [S/c2] = [J/(sĀ·m2)]Ā·(s2/m2) =Ā (JĀ·s/m4). šŸ™‚ Let’s continue the discussion.

Now, momentum is conserved, and eachĀ componentĀ of it is conserved. So let’s look at the x-direction. We should have something like:

formula

If you look at this carefully, you’ll probably say: “OK. I understood the thing with the dark room and light switch. Mass got converted into field energy, but what’s that second term of the left?”

Good. Smart. RightĀ remark. Perfect. […] Let me try to answer the question. While all of the quantities above are expressedĀ per unit volume, we’re actually looking at the sameĀ infinitesimal volume element here, so the example of the light switch is actually an example of a ‘momentum outflow’, so it’s actually an example of that second term of the left-hand side of the equation above kicking in! šŸ™‚

Indeed, the first term just sort of reiterates the mass-energy equivalence: the energy that’s in the matter can become field energy, so to speak, in our infinitesimal volume element itself, and vice versa. But if it doesn’t, then it should get out and, hence, become ‘momentum outflow’. Does that make sense? No?

Hmm… What to say? You’ll need to look at that equation a couple of times more, I guess. :-/ But I need to move on, unfortunately. [Don’t get put off when I say things like this: I am basically talking to myself, so it means I’ll need to re-visit this myself. :-/]

Let’s look at all of the three terms:

  1. The left-hand side (i.e. the time rate-of-change of the momentum of matter) isĀ easy. It’s just the force on it, which we know is equal to F =Ā q(E+vƗB). Do we know that? OK… I’ll admit it.Ā Sometimes it’s easy to forget where we are in an analysis like this, but so we’re looking at the electromagnetic force here. šŸ™‚ As we’re talking infinitesimals here and, therefore, chargeĀ densityĀ rather than discreteĀ charges, we should re-write this as the force per unit volume which is ρE+jƗB. [This is an interesting formula which I didn’t use before, so you should double-check it. :-)]
  2. The first term on the right-hand side should be equally obvious, or… Well… Perhaps somewhat less so. But with all my rambling on the Uncertainty PrincipleĀ and/or the wave-particle duality, it should make sense. If we scrap the second term on the right-hand side, we basically have an equation that is equivalent to the E = mc2Ā equation. No? Sorry. Just look at it, again and again. You’ll end up understanding it. šŸ™‚
  3. So it’s that second term on the right-hand side. What the hell does thatĀ say? Well… I could say: it’s the local energy or momentum conservation law. If the energy or momentum doesn’t stay in, it has to go out. šŸ™‚ But that’s not very satisfactory as an answer, of course. However, please just go along with thisĀ ‘temporary’ answer for a while.

So what isĀ that second term on the right-hand side? As we wrote it, it’s an x-component – or, let’s put it differently, it is or was part of the x-component of the momentum density – but, frankly, we should probably allow it to go out in any direction really, as the only constraint on the left-hand side is a per secondĀ rate of change of something. Hence, Feynman suggest to equate it to something like this:

general

What a, b and c? The components of some vector? Not sure. We’re stuck. This pieceĀ reallyĀ requiresĀ veryĀ advanced math. In fact, as far as I know, this is the onlyĀ time where Feynman says: “Sorry. This is too advanced. I’ll just give you the equation. Sorry.” So that’s what he does. He explains the philosophyĀ of the argument, which is the following:

  1. On the left-hand side, we’ve got the time rate-of-change of momentum, so that obeys the F = dp/dt = d(mv)/dt law, with the force F,Ā per unit volume, being equal to F(unit volume) = ρE+jƗB.
  2. On the right-hand side, we’ve got something that can be written as:

general 2

So we’d need to find a way to ρE+jƗBĀ in terms of EĀ andĀ B only – eliminating ρ andĀ jĀ by using Maxwell’s equations or whatever other trick  – and then juggle terms and make substitutions to get it into a form that looks like the formula above, i.e. the right-hand side of that equation. But so Feynman doesn’t show us how it’s being done. He just mentions some theorem in physics, which says that the energy that’s flowing through a unit area per unit time divided byĀ c2 – so that’s E/c2Ā per unit area and per unit time – must be equal to the momentum per unit volume in the space, so we write:

g = S/c2

HeĀ illustratesĀ the general theorem that’s used to get the equation above by giving two examples:

example theorem

OK. Two good examples. However, it’s still frustrating toĀ notĀ see how we get the g = S/c2Ā in the specificĀ context of the electromagnetic force, so let’s do a dimensional analysis at least. In my previous post, I showed thatĀ the dimension ofĀ S mustĀ be J/(m2Ā·s), so [S/c2] = [J/(m2Ā·s)]/(m2/s2) = [NĀ·m/(m2Ā·s)]Ā·(s2/m2) = [NĀ·s/m3]. Now, we know that the unit of mass is 1 kg = N/(m/s2). That’s just the force law: a force of 1 newton will give a mass of 1 kg an acceleration of 1 m/s per second, so 1 N = 1 kgĀ·(m/s2). So theĀ [NĀ·s/m3] dimension is equal to [kgĀ·(m/s2)Ā·s/m3] = [(kgĀ·(m/s)/m3] =Ā [(kgĀ·(m/s)]/m3, which is the dimension of momentum (p = mv) per unit volume, indeed.Ā So, yes, the dimensional analysis works out, and it’s also in line with theĀ p = vĀ·E/c2Ā = mĀ·v equation, but… Oh… We did a dimensional analysis already, where we also showed that [g]Ā = [S/c2] = (JĀ·s/m4). Well… In any case… It’s a bit frustrating to notĀ see the detail here, but let us note the the Grand ResultĀ once again:

The Poynting vector S gives us the energy flow as well as the momentum density gĀ = S/c2.

But what does it all mean, really? Let’s go through Einstein’s illustration of the principle. That will help us a lot. Before we do, however, I’d like to note something. I’ve always wondered a bit about that dichotomy between energy and momentum. Energy is force times distance: 1Ā jouleĀ is 1Ā newtonĀ Ć— 1Ā meterĀ indeed (1 J = 1 NĀ·m). Momentum is force times time, as we can express it in NĀ·s. Planck’s constant hĀ combines all three in the dimension ofĀ action, which isĀ forceĀ timesĀ distanceĀ timesĀ time: hĀ ā‰ˆĀ 6.6Ɨ10āˆ’34Ā NĀ·mĀ·s, indeed. I like that unity. In this regard, you should, perhaps, quickly review that post in which I explain that hĀ isĀ the energy per cycle, i.e. per wavelength or per period, of a photon, regardless of its wavelength. So it’s really something very fundamental.

We’ve got something similar here: energy and momentum coming together, and being shown as one aspect of the same thing: some oscillation. Indeed, just see what happens with the dimensions when we ‘distribute’ the 1/c2Ā factor on the right-hand side over the two sides, so we write:Ā cĀ·gĀ = S/c and work out the dimensions:

  1. [cĀ·g ]Ā = (m/s)Ā·(NĀ·s)/m3Ā = N/m2Ā = J/m3.
  2. [S/c] =Ā (s/m)Ā·(NĀ·m)/(sĀ·m2) = N/m2Ā = J/m3.

Isn’t that nice? Both sides of the equation now have a dimension like ‘the force per unit area’, or ‘the energy per unit volume’. To get that, we just re-scaled g and S, by c and 1/c respectively. As far as I am concerned, this shows an underlying unity we probably tend to mask with our ‘related but different’ energy and momentum concepts. It’s like E and B: I just love it we can write them together in our Poynting formula SĀ = ε0c2EƗB. In fact, let me show something else here, which you should think about. You know that c2Ā = 1/(ε0μ0), so we can write SĀ also as S =Ā EƗB/μ0. That’s nice, but what’s nice too is the following:

  1. S/c = cĀ·gĀ = ε0cEƗBĀ = EƗB/μ0c
  2. S/g = c2 = 1/(ε0μ0)

So, once again, Feynman may feel the Poynting vector is sort of counter-intuitive when analyzing specific situations but, as far as I am concerned, I feel the Poyning vector makes things actuallyĀ easierĀ to understand. Instead of two E and B vectors, and two concepts to deal with ‘energy’ (i.e. energy and momentum), we’re sort of unifying things here. In that regard – i.e in regard of feeling we’re talking the same thing really – I’d really highlight the S/g = c2 = 1/(ε0μ0) equation. Indeed, the universal constantĀ cĀ acts just like the fine-structure constant here: it links everything to everything. šŸ™‚

And, yes, it’s also about time we introduce the so-called principle of least action to explain things, becauseĀ action, as a concept, combines force, distance and time indeed, so it’s a bit more promising than just energy, of just momentum. Having said that, you’ll see in the next section that it’s sometimes quite useful to have the choice between one formula or the other. But… Well…Ā Enough talk. Let’s look at Einstein’s car.

Einstein’s car

Einstein’s car is a wonderful device: it rolls without any friction and it moves with a little flashlight. That’s all it needs. It’s pictured below. šŸ™‚ So the situation is the following: the flashlight shoots some light out from one side, which is then stopped at the opposite end of the car. When the light is emitted, there must be some recoil. In fact, we know it’s going to be equal to 1/c times the energy because all we need to do is apply theĀ pcĀ = EĀ·(v/c) formula forĀ v = c, so we know that p = E/c. Of course, this momentum now needs to move Einstein’s car. It’s frictionless, so it should work, but still… The car has some mass M, and so that will determine its recoil velocity: v = p/M. We just apply the general p = mv formula here, and v is notĀ equal toĀ c here, of course! Of course, then the light hits the opposite end of the car and delivers the same momentum, so that stops the car again. However, itĀ didĀ move over some distance x = vt. So we could flash our light again and get to wherever we want to get. [Never mind the infinite accelerations involved!] So… Well… Great! Yes, but Einstein didn’t like this car when he first saw it. In fact, he still doesn’t like it, because he knows it won’t take you very far. šŸ™‚

Einsteins' car

The problem is that we seem to be moving the center of gravity of this car by fooling around on the inside only. Einstein doesn’t like that. He thinks it’s impossible. And he’s right of course. The thing is: the center of gravity didĀ notĀ change. What happened here is that we’ve got someĀ blob of energy, and so that blob has some equivalent mass (which we’ll denote by U/c2), and so that equivalent mass moved all the way from one side to the other, i.e. over the length of the car, which we denote by L. In fact, it’s stuff like this that inspired the whole theory of the field energy and field momentum, and how it interacts withĀ matter.

What happens here is like switching the light on in the dark room: we’ve got matter doing work on the field, and so matter loses mass, and the field gains it, through its momentum and/or energy. To calculate how much, we could integrate S/cĀ or cĀ·gĀ over the volume of our blob, and we’d get something in jouleĀ indeed, but there’s a simpler way here. The momentum conservation says that the momentum of our car and the momentum of our blob must be equal, so if T is the time that was needed for our blob to go to the other side – and so that’s, of course, also the time during which our car was rolling – then MĀ·v = MĀ·x/T must be equal to (U/c2)Ā·cĀ =Ā (U/c2)Ā·L/T. The 1/T factor on both sides cancel, so we write:Ā MĀ·x = (U/c2)Ā·L.Ā Now, what is x? Yes. In case you were wondering, that’s what we’re looking for here. šŸ™‚ Here it is:

x = vT = vL/c = (p/M)Ā·(L/c) = [U/c)/M]Ā·(L/c) = (U/c2)Ā·(L/M)

So what’s next? Well… Now we need to show that the center-of-mass actually didĀ notĀ move with this ‘transfer’ of the blob. I’ll leave the math to you here: it should all work out. And you can also think through the obvious questions:

  1. Where is the energy and, hence, the mass of our blob after it stops the car? Hint: think about excited atoms and imagine they might radiate some light back. šŸ™‚
  2. As the car did move a little bit, we should be able to move it further and further away from its center of gravity, until the center of gravity is no longer in the car. Hint: think about batteries and energy levels going down while shooting light out. It just won’t happen. šŸ™‚

Now, what about a blob of light going from the top to the bottom of the car? Well… That involves the conservation ofĀ angularĀ momentum: we’ll have more mass on the bottom, but on a shorter lever-arm, soĀ angularĀ momentum is being conserved. It’s aĀ veryĀ good question though, and it led Einstein to combine the center-of-gravity theorem with the angular momentum conservation theorem to explain stuff like this.

It’s all fascinating, and one can think of a great many paradoxes that, at first, seem to contradict the Grand Principles we used here, which means that they would contradict all that we have learned so far. However, a careful analysis of those paradox reveals that they are paradoxes indeed:Ā propositions which soundĀ true but are, in the end,Ā self-contradictory. In fact, when explaining electromagnetism over his variousĀ Lectures, Feynman tasks his readers with a rather formidable paradox when discussing the laws of induction, he solves it here, ten chapters later, after describing what we described above. You can busy yourself with it but… Well… I guess you’ve got something better to do. If so, just take away the key lesson: there’s momentum in the field, and it’s also possible to build upĀ angularĀ momentum in a magneticĀ field and, if you switch it off, the angular momentum will be given back, somehow, as it’s stored energy.

That’s also why the seemingly irrelevant circulation of SĀ we discussed in my previous post, where we had a charge next to an ordinary magnet, and where we found that there was energy circulating around, is not so queer. The energy is there, in the circulating field, and it’sĀ real. As real as can be. šŸ™‚

crazy

Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/

The energy of fields and the Poynting vector

For some reason, I always thought that Poynting was a Russian physicist, like Minkowski. He wasn’t. I just looked it up. Poynting was anĀ Englishman, born near Manchester, and he teached in Birmingham. I should have known. Poynting is a very English name, isn’t it? My confusion probably stems from the fact that it was someĀ Russian physicist,Ā Nikolay Umov, who first proposed the basic concepts we are going to discuss here, i.e. the speed and direction of energy itself, or its movement. And as I am double-checking, I just learned that Hermann Minkowski is generally considered to be German-Jewish, not Russian. Makes sense. With Einstein and all that. His personal life story is actually quite interesting. You should check it out. šŸ™‚

Let’s go for it. We’ve done a few posts on the energy in the fields already, but all in the contexts of electrostatics. Let me first walk you through the ideas we presented there.

The basic concepts: force, work, energy and potential

1.Ā A charge q causes an electric field E, and E‘sĀ magnitude E is a simple function of the charge (q) and its distance (r) from the point that we’re looking at, which we usually write as P = (x, y, z). Of course, the origin of our reference frame here is q. The formula is the simple inverse-square law that you (should) know: E ∼ q/r2, and the proportionality constant is just Coulomb’s constant, which I think you wrote as keĀ in your high-school days and which, as you know, is there so as to make sure the units come out alright. So we could just writeĀ E = keĀ·q/r2. However, just to makeĀ sure it doesĀ notĀ look like a piece of cake šŸ™‚ physicists writeĀ the proportionality constant as 1/4πε0, so we get:

E 3

Now, the field is the force on any unit charge (+1) we’d bring to P. This led us to think of energy,Ā potentialĀ energy, because… Well… You know: energy is measured by work, so that’s some force acting over some distance. The potential energy of a charge increases if we move it againstĀ the field, so we wrote:

formula 1

Well… We actually gave the formula below in that post, so that’s the work done per unit charge.Ā To interpret it, you just need to remember that F = qE, which is equivalent to saying that E is the force per unit charge.

unit chage

As for the F•ds or E•ds product in the integrals, that’s a vector dot product, which we need because it’s only the tangential component of the force that’s doing work, as evidenced by the formula F•ds =Ā |F|Ā·|ds|Ā·cosĪø = FtĀ·ds, and as depicted below.

illustration 1

Now, thisĀ allowed us to describe the field in terms of the (electric)Ā potential Φ and the potential differencesĀ between two points, like the points a and b in the integral above. We have to chose some reference point, of course, someĀ P0Ā defining zero potential, which is usually infinitely far away. So we wrote our formula for the work that’s being done on a unit charge, i.e. W(unit) as:

potential

2.Ā The world is full of charges, of course, and so we need to add all of their fields. But so now you need a bit of imagination. Let’s reconstruct the world by moving all charges out, and then we bring them back one by one. So we take q1Ā now, and we bring it back into the now-empty world. NowĀ that does notĀ require any energy, because there’s no field to start with. However, when we take our second chargeĀ q2, we will be doing work as we move it against the field or, if it’s an opposite charge, we’ll be taking energy out of the field.Ā Huh?Ā Yes. Think about it. All is symmetric.Ā Just to make sure you’re comfortable with every step we take, let me jot down the formula for the force that’s involved. It’s just theĀ CoulombĀ force of course:

Coulomb's law

F1Ā is the force on charge q1, andĀ F2Ā is the force on charge q2. Now, q1Ā and q2. may attract or repel each other but the forces will alwaysĀ be equal and opposite. The e12Ā vector makes sure the directions and signs come out alright, as it’s the unit vector from q2Ā to q1Ā (not from q1Ā to q2, as you might expect when looking at the order of the indices). So we would need to integrate this forĀ rĀ going from infinity to… Well… The distance betweenĀ q1Ā and q2 – wherever they end up as we put them back into the world – so that’s what’s denoted by r12. Now I hate integrals too, but this is an easy one. Just note that ∫ rāˆ’2dr = 1/r and you’ll be able to figure out that what I’ll write now makes sense (if not, I’ll do a similar integral in a moment): the work done in bringing two charges together from a large distance (infinity) is equal to:

U 1So now we should bring in q3Ā and then q4, of course. That’s easy enough.Ā Bringing the first two charges into that world we had emptied took a lot of time, but now we can automate processes. Trust me: we’ll be done in no time. šŸ™‚ We just need to sum over all of theĀ pairsĀ of chargesĀ qiĀ andĀ qj. So we writeĀ the total electrostatic energy UĀ as the sum of the energies of all possible pairs of charges:

U 3

Huh?Ā Can we do that?Ā I mean… Every new charge that we’re bringing in here changes the field, doesn’t it? It does. But it’s the magic of the superposition principle at work here. Our third charge q3Ā is associated with two pairs in this formula. Think of it: we’ve got theĀ q1q3Ā and theĀ q2q3Ā combination, indeed. Likewise, our fourh chargeĀ q4Ā is to be paired up withĀ three charges now: q1,Ā q1Ā andĀ q3. This formula takes care of it, and theĀ ‘all pairs’ mentionĀ under the summationĀ sign (Ī£) reminds us we should watch we don’t double-count pairs:Ā theĀ q1q3Ā and q3q1Ā combination, for example, count for one pair only, obviously. So, yes, we write ‘all pairs’ instead of the usual i, j subscripts. But then, yes, this formula takes care of it. We’re done!

Well… Not really, of course. We’ve still got some way to go before I can introduce the Poynting vector. šŸ™‚ However, to make sure you ‘get’ the energy formula above, let me insert an extremely simple diagram so you’ve got a bit of a visual of what we’re talking about.

U system

3.Ā Now, let’s take a step back. We just calculated the (potential)Ā energy of the world (U), which is great. But perhaps we should also be interested in the world’sĀ potential Φ, rather than its potential energy U. Why? Well, we’ll want to know what happens when we bring yet another charge in—from outer space or so. šŸ™‚ And so then it’s easier to know the world’s potential, rather than its energy, because we can calculate the field from it using the E =Ā āˆ’āˆ‡Ī¦ formula.Ā So let’s de- and re-construct the world once again šŸ™‚ but now we’ll look at what happens with the field and the potential.

We know our first charge created a field with a field strength we calculated as:

E 3

So, when bringing in our second charge, we can use our Φ(P) integral to calculate the potential:

potential

[Let me make a note here, just for the record. You probablyĀ think I am being pretty childish when talking about my re-construction of the world in terms of bringing all charges out and then back in again but, believe me, there will be a lot of confusion when we’ll start talking about the energy of oneĀ charge, and that confusion can be avoided, to a large extent, when you realize that the idea (I mean the concept itself, really—not its formula)Ā of a potential involves two charges really. Just remember: it’s theĀ firstĀ charge that causes the field (and, of course, anyĀ charge causes a field),Ā but calculating a potential only makes sense when we’re talking someĀ other charge.Ā Just make a mental note of it. You’ll be grateful to me later.]

Let’s nowĀ combine the integral and the formula for E above. Because you hate integrals as much as I do, I’ll spell it out: the antiderivative of the Φ(P) integral is ∫ q/(4πε0r2)Ā·dr. Now, let’s bring q/4πε0 out for a while so we can focus on solving ∫(1/r2)dr. Now, ∫(1/r2)drĀ is equal to –1/r + k, and so the whole antiderivative is –q/4πε0rĀ + k. Now, we integrate from r = āˆž to r, and so the definite integral is [–q/(4πε0)]Ā·[1/āˆž āˆ’ 1/r] = [–q/(4πε0)]Ā·[0 āˆ’ 1/r]Ā = q/(4πε0r). Let me present this somewhat nicer:

E 4

You’ll say: so what? Well… We’re done! The only thing we need to do now is add up the potentials of all of the charges in the world. So the formula for the potential Φ at a point which we’ll simply refer to as point 1, is:

P 1

Note that our index jĀ starts at 2, otherwise it doesn’t make sense: we’d have a division by zero for the q1/r11Ā term. Again, it’s an obvious remark, butĀ notĀ thinking about it can cause a lot of confusion down the line.

4.Ā Now, I am veryĀ sorry but I have to inform you that we’ll be talking charge densitiesĀ and all that shortly, rather than discreteĀ charges, so I have to give you the continuumĀ version of this formula, i.e. the formula we’ll use when we’ve got charge densities rather than individual charges. That sum above then becomes an infinite sum (i.e. an integral), and qjĀ becomes a variable which we write as ρ(2). [That’s totally in line with our index jĀ starts at 2, rather than from 1.] We get:

U 2

Just look at this integral, and try to understand it: we’re integrating over all of space – so we’re integrating the whole world, really šŸ™‚Ā ā€“ and the ρ(2)Ā·dV2Ā product in the integral is just the charge of an infinitesimally small volumeĀ of our world. So the whole integral is just the (infinite) sum of the contributions to the potential (at point 1) of all (infinitesimally small) charges that are around indeed. Now, there’s something funny here. It’s just a mathematical thing: we don’t need to worry about double-counting here. Why? We’re not having products of volumes here. Just make a mental note of it because it will be different in a moment.

Now we’re going to look at the continuum version for our energy formula indeed. Which energy formula? That electrostatic energy formula, which said thatĀ the total electrostatic energy UĀ as the sum of the energies of all possible pairs of charges:

U 3

Its continuum version is the following monster:

U 4

Hmm… What kind of integral is that? We’ve gotĀ two variables here: dV2Ā and dV1. Yes. And we’ve also got a 1/2 factor now, because we do notĀ want to double-count and, unfortunately, there is no convenient way of writing an integral like this that keeps track of the pairs. It’s a so-called double integral, but I’ll let you look up the math yourself. In any case, we can simplify this integral so you don’t need to worry about it too much. How do we simplify it? Well… Just look at that integral we got for Φ(1): we calculated the potential at point 1 by integratingĀ the ρ(2)Ā·dV2Ā product over all of space, so the integral above can be written as:

U 5But so thisĀ integral integrates the ρ(1)·Φ(1)Ā·dV1Ā product over all of space, so that’s over all points in space. So we can just drop the index and write the whole thing as the integral of ρ·Φ·dVĀ over all of space:

U 6

5. It’s time for the hat-trick now. The equation above is mathematically equivalent to the following equation:

U 7

Huh?Ā Yes. Let me make two remarks here. First on the math, the E =Ā āˆ’āˆ‡Ī¦ formula allows you to the integrand of the integral above asĀ E•EĀ =Ā (āˆ’āˆ‡Ī¦)•(āˆ’āˆ‡Ī¦) = (āˆ‡Ī¦)•(āˆ‡Ī¦). And then you may or may not remember that, when substitutingĀ E =Ā āˆ’āˆ‡Ī¦ in Maxwell’s first equation (āˆ‡ā€¢E = ρ/ε0), we got the following equality: ρ = ε0Ā·āˆ‡ā€¢(āˆ‡Ī¦) = ε0Ā·āˆ‡2Φ, so we can write ρΦ as ε0Ā·Ī¦Ā·āˆ‡2Φ. However, that still doesn’t show the two integrals are the same thing. The proof is actually rather involved, and so I’ll refer to that post I referred to, so you can check the proof there.

The second remark is much more fundamental. The two integrals are mathematically equivalent, but are they also physically? What do I mean with that? Well… Look at it. The second integral implies that we can look at (ε0/2)Ā·E•E = ε0E2/2 as an energy density, which we’ll denote by u, so we write:

D 6

Just to make sure you ā€˜get’ what we’re talking about here: u is the energy density in the little cube dV in the rather simplistic (and, therefore, extremely useful) illustration below (which, just like most of what I write above, I got from Feynman).

Capture

Now the question: what is theĀ realityĀ of that formula? Indeed, what we did when calculating U amounted to calculating theĀ Universe with some numberĀ U – and that’s kinda nice, of course! – but then what? Is u = ε0E2/2 anything real? Well… That’s what this post is about. So we’re finished with the introduction now. šŸ™‚

Energy density and energy flow in electrodynamics

Before giving you any more formulas, let me answer the question: there is no doubt, in the classical theory of electromagnetism at least, that the energy density u is somethingĀ very real. ItĀ hasĀ to be because of the charge conservation law. Charges cannot just disappear in space, to then re-appear somewhere else. The charge conservation law is written asĀ āˆ‡ā€¢j =Ā āˆ’āˆ‚Ļ/āˆ‚t, and that makes it clear it’s aĀ localĀ conservation law. Therefore, charges can only disappear and re-appear through some current. We write dQ1/dt = ∫ (j•n)Ā·da = āˆ’dQ2/dt, and here’s the simple illustration that comes with it:

charge flow

So we doĀ notĀ allow for any ‘non-local’ interactions here! Therefore, we say that, if energy goes away from a region, it’s because itĀ flowsĀ away through the boundaries of that region.Ā So that’s what the Poynting formulas are all about, and so I want to be clear on that from the outset.

Now, to get going with the discussion, I need to give you the formula for the energy density inĀ electrodynamics. Its shape won’t surprise you:

energy density

However, it’s just like the electrostatic formula: it takes quite a bit of juggling to get this from our electrodynamic equations, so, if you want to see how it’s done, I’ll refer you to Feynman. Indeed, I feel the derivation doesn’t matter all that much, because the formula itself is veryĀ intuitive: it’s really the thing everyone knows about a wave, electromagnetic or not: the energy in it is proportional to the square of its amplitude, and so that’s E•EĀ =Ā E2Ā and B•BĀ = B2. Now, you also know that the magnitudeĀ of B is 1/c of that of E, so cB = E, and so that explains the extra c2Ā factor in the second term.

The second formula is also very intuitive. Let me write it down:

energy flux

Just look at it: uĀ is the energy density, so that’s the amount of energy per unit volumeĀ at a given point, and so whatever flows out of that point must represent itsĀ time rate of change. As for the ā€“āˆ‡ā€¢S expression… Well… Sorry, I can’t keep re-explaining things: theĀ āˆ‡ā€¢ operator is the divergence, and so it give us the magnitude of a (vector) field’s source or sinkĀ at a given point. āˆ‡ā€¢CĀ is a scalar, and if it’sĀ positive in a region, then that region is a source. Conversely, if it’s negative, then it’s a sink. To be precise, the divergence represents the volume density of the outward fluxĀ of a vector field from an infinitesimal volume around a given point. So, in this case, it gives us the volume density of the flux of S. As you can see, the formula has exactly the same shape asĀ āˆ‡ā€¢j =Ā āˆ’āˆ‚Ļ/āˆ‚t.

So what is S? Well… Think about the more general formula for the flux out of some closed surface, which we get from integrating over the volume enclosed. It’s just Gauss’ Theorem:

Gauss Theorem

Just replace C by E, and think about what it meant: the flux of E was the field strength multiplied by the surface area, so it was the totalĀ flow of E. Likewise, SĀ represents the flow of (field) energy. Let me repeat this, because it’s an important result:

SĀ represents the flow of field energy.

Huh?Ā What flow? Per unit area? Per second? How do you define such ‘flow’? Good question. Let’s do a dimensional analysis:

  1. E is measured inĀ newton per coulomb, so [E•E] = [E2] = N2/C2.
  2. B is measured in (N/C)/(m/s). [Huh?Ā Well… Yes. I explained that a couple of times already. Just check it in my introduction to electric circuits.] So we get [B•B] = [B2] = (N2/C2)Ā·(s2/m2) but the dimension of ourĀ c2Ā factor is (m2/s2) so we’re left with N2/C2. That’s nice, because we need to add in the same units.
  3. Now we need to look at ε0. That constant usually ‘fixes’ our units, but can we trust it to do the same now? Let’s see… One of the many ways in which we can express its dimension is [ε0] =Ā C2/(NĀ·m2), so if we multiply that with N2/C2, we find that u is expressed inĀ N/m2.Ā Wow!Ā That’sĀ kindaĀ neat. Why? Well… Just multiply with m/m and its dimension becomes NĀ·m/m3Ā = J/m3, so that’s Ā joule per cubic meter, so… Yes: uĀ has got the right unit for something that’s supposed to measure energy density!
  4. OK. Now, we take the time rate of change of u, and so both the right and left of our āˆ‚u/āˆ‚t = āˆ’āˆ‡ā€¢SĀ formula are expressed in (J/m3)/s, which means that the dimension ofĀ S itself mustĀ be J/(m2Ā·s). Just check it by writing it all out: āˆ‡ā€¢SĀ = āˆ‚Sx/āˆ‚x + āˆ‚Sy/āˆ‚x + āˆ‚Sz/āˆ‚z, and so that’s something per meterĀ so, to get the dimension of S itself, we need to go from cubic meter to square meter. Done! Let me highlight the grand result:

S is the energy flow per unit area and per second.

Now we’ve got its magnitude and its dimension, but what is its direction? Indeed, we’ve been writing S as a vector, but… Well… What’s its direction indeed?

Well… Hmm… I referred you to FeynmanĀ for that derivation of that u = ε0E2/2 + ε0c2B2/2 formula energy forĀ u, and so the direction of S – I should actually say, its complete definition – comes out of that derivation as well. So… Well… I think you should just believe what I’ll be writing here for S:

S formula

So it’s the vector cross product of E and B with ε0c2Ā thrown in. It’s a simple formula really, and because I didn’t drag you through the whole argument, you should just quickly do a dimensional analysis again—just to make sure I am not talking too much nonsense. šŸ™‚ So what’s the direction? Well… You just need to apply the usual right-hand rule:

right hand rule

OK. We’re done! This S vector, which – let me repeat it – represents the energy flow per unit area and per second, is what is referred to as Poynting’s vector, and it’s a most remarkable thing, as I’ll show now. Let’s think about theĀ implicationsĀ of this thing.

Poynting’s vector in electrodynamics

The S vector is actually quite similar to the heat flow vectorĀ h, which we presented when discussing vector analysis and vector operators. The heat flow out of a surface element daĀ is the area times the component ofĀ hĀ perpendicular to da, so that’s (h•n)Ā·da = hnĀ·da. Likewise, we can writeĀ (S•n)Ā·da = SnĀ·da. The units of S and h are also the same:Ā joule per second and per square meterĀ or, using the definition of theĀ wattĀ (1 W = 1 J/s), in watt per square meter.Ā In fact, if you google a bit, you’ll find that both hĀ and S are referred to as aĀ flux density:

  1. The heat flow vector hĀ is the heat flux densityĀ vector, from which we get the heat flux through an area through the (h•n)Ā·da = hnĀ·daĀ product.
  2. The energy flow SĀ is the energy flux density vector, from which we get the energy flux through the (S•n)Ā·da = SnĀ·daĀ product.

The big difference, of course, is that we getĀ hĀ from a simpler vector equation:

h =Ā āˆ’Īŗāˆ‡T ⇔ (hx,Ā hy,Ā hz) =Ā āˆ’Īŗ(āˆ‚Tx/āˆ‚x,Ā āˆ‚Ty/āˆ‚y,āˆ‚Tz/āˆ‚x)

The vector equation for SĀ is more complicated:

S formula

So it’s a vector product. Note that S will be zero if E = 0 and/or if B = 0. So S = 0 in electrostatics, i.e. when there are no moving charges and only steady currents. Let’s examine Feynman’s examples.

The illustration below shows the geometry of theĀ E, B and SĀ vectors for a light wave. It’s neat, and totally in line with what we wrote on theĀ radiation pressure, or the momentum of light. So I’ll refer you to that post for an explanation, and to Feynman himself, of course.

light wave

OK. The situation here is rather simple. Feynman gives a few others examples that are notĀ so simple, like that ofĀ a charging capacitor, which is depicted below.

capacitor

The Poynting vector points inwards here, toward the axis. What does it mean?Ā It means the energy isn’t actually coming down the wires, but from the space surrounding the capacitor.Ā 

What?Ā I know. It’s completely counter-intuitive, at first that is. You’d think it’s the charges. But it actually makes sense. The illustration below shows how we should think of it. The charges outside of the capacitor are associated with a weak, enormously spread-out field that surrounds the capacitor. So if we bring them to the capacitor, that field gets weaker, and the field between the plates gets stronger. So the field energy which is way out moves into the space between the capacitor plates indeed, and so that’s what Poynting’s vector tells us here.

capacitor 2

Hmm… Yes. You can be skeptic. You should be. But that’s how it works. The next illustration looks at a current-carrying wire itself. Let’s first look at the B and E vectors. You’re familiar with the magnetic field around a wire, so the B vector makes sense, but what about the electric field? Aren’t wires supposed to be electrically neutral? It’s a tricky question, and we handled it in our post on the relativity of fields. The positive and negative charges in a wire should cancel out, indeed, but then it’s the negative charges that move and, because of their movement, we have the relativistic effect of length contraction, so the volumes are different, and theĀ positive andĀ negative charge density do notĀ cancel out: the wire appears to be charged, so we do have a mix of E and B! Let me quickly give you the formula: E = (2πε0)Ā·(Ī»/r), withĀ Ī» the (apparent) charge per unit length, so it’s the same formula as for a long line of charge, or for a long uniformly charged cylinder.

So we have a non-zeroĀ E and B and, hence, a non-zero Poynting vector S, whose direction isĀ radially inward, so there is a flow of energy into the wire, all around. What the hell?Ā Where does it go? Well… There’s a few possibilities here: the charges need kineticĀ energy to move, or as they increaseĀ theirĀ potentialĀ energy when moving towards the terminals of our capacitor to increase the charge on the plates or, much more mundane, the energy may be radiated out again in the form of heat. It looks crazy, but that’s how it is really. In fact, the more you think about, the more logical it all starts to sound. EnergyĀ mustĀ be conserved locally, and so it’s just field energy going in and re-appearing in some other form. So itĀ doesĀ make sense. But, yes, it’s weird, because no one bothered to teach us thisĀ in school. šŸ™‚

wire

The ‘craziest’ example is the one below: we’ve got a charge and a magnet here. All is at rest. Nothing is moving… Well… I’ll correct that in a moment. šŸ™‚ The charge (q) causes a (static) Coulomb field, while our magnet produces the usual magnetic field, whose shape we (should) recognize: it’s the usual dipole field. So E and B are not changing. But so when we calculate our Poynting vector, we see there is a circulation of S. TheĀ EƗBĀ product is notĀ zero.Ā So what’s going on here?

crazy

Well… There is no netĀ change in energy with time: the energy just circulates around and around. Everything which flows into one volume flows out again. As Feynman puts it: “It is like incompressible water flowing around.” What’s the explanation? Well… Let me copy Feynman’s explanation of this ‘craziness’:

“Perhaps it isn’t so terribly puzzling, though, when you remember that what we called a ā€œstaticā€ magnet is really a circulating permanent current. In a permanent magnet the electrons are spinning permanently inside. So maybe a circulation of the energy outside isn’t so queer after all.”

So… Well… It looks like we do need to revise some of our ‘intuitions’ here. I’ll conclude this post by quoting Feynman on it once more:

“You no doubt get the impression that the Poynting theory at least partially violates your intuition as to where energy is located in an electromagnetic field. You might believe that you must revamp all your intuitions, and, therefore have a lot of things to study here. But it seems really not necessary. You don’t need to feel that you will be in great trouble if you forget once in a while that the energy in a wire is flowing into the wire from the outside, rather than along the wire. It seems to be only rarely of value, when using the idea of energy conservation, to notice in detail what path the energy is taking. The circulation of energy around a magnet and a charge seems, in most circumstances, to be quite unimportant. It is not a vital detail, but it is clear that our ordinary intuitions are quite wrong.”

Well… That says it all, I guess. As far as I am concerned, I feel the Poyning vector makes things actuallyĀ easierĀ to understand. Indeed, the E and B vectors were quite confusing, because we had twoĀ of them, and the magnetic field is, frankly, a weirdĀ thing. Just think about the units in which we’re measuring B: (N/C)/(m/s). IĀ can’t imagine what a unit like that could possible represent, so I must assume you can’t either.Ā But so now we’ve got this Poynting vector that combines both E and B, and which represents the flow of the field energy. Frankly, I think that makes a lot of sense, and it’s surely much easier to visualize than EĀ and/or B. [Having said that, of course, you should note that E and B do have their value, obviously, if only because they represent theĀ lines of force, and so that’s something very physical too, of course.Ā I guess it’s a matter of taste, to some extent, but so I’d tend to soften Feynman’s comments on the supposed ‘craziness’ of S.

In any case… The next thing I should discuss isĀ field momentum. Indeed, if we’ve got flow, we’ve got momentum. But I’ll leave that for my next post. This topic can’t be exhausted in one post only, indeed. šŸ™‚ So let me conclude this post. I’ll do with a very nice illustration I got from the Wikipedia article on the Poynting vector. It shows the Poynting vector around a voltage source and a resistor, as well as what’s going on in-between. [Note that the magnetic field is given by the field vector H, which is related to B as follows: B = μ0(H + M), with M the magnetization of the medium. B and H are obviously just proportional in empty space, with μ0Ā as the proportionality constant.]

Poynting_vectors_of_DC_circuit

Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 17, 2020 as a result of a DMCA takedown notice from Michael A. Gottlieb, Rudolf Pfeiffer, and The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 20, 2020 as a result of a DMCA takedown notice from Michael A. Gottlieb, Rudolf Pfeiffer, and The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/

Re-visiting relativity and four-vectors: the proper time, the tensor and the four-force

Pre-script (dated 26 June 2020): Our ideas have evolved into a full-blown realistic (or classical) interpretation of all things quantum-mechanical. In addition, I note the dark force has amused himself by removing some material. So no use to read this. Read my recent papers instead. šŸ™‚

Original post:

My previous post explained how four-vectors transformĀ from one reference frame to the other. Indeed, a four-vector isĀ notĀ just some one-dimensional array of four numbers: it represent something—a physical vector that… Well… Transforms like a vector. šŸ™‚ So whatĀ vectorsĀ are we talking about? Let’s see what we have:

  1. We knew the position four-vector already, which we’ll write as xμ = (ct, x, y, z) = (ct, x).
  2. We also proved that Aμ = (Φ,Ā Ax, Ay, Az) = (Φ,Ā A)Ā is a four-vector: it’s referred to as the four-potential.
  3. We also know the momentumĀ four-vector from theĀ LecturesĀ on special relativity. We write it as pμ = (E, px, py, pz) = (E, p), with E = γm0, p = γm0v, and γ = (1āˆ’v2/c2)āˆ’1/2Ā or, forĀ cĀ = 1, γ =Ā (1āˆ’v2)āˆ’1/2

To show that it’s notĀ just a matter of adding some fourth t-component to aĀ three-vector, Feynman gives the example of the four-velocity vector. We have vxĀ = dx/dt,Ā vyĀ = dy/dt and vzĀ = dz/dt, but a vμ = (d(ct)/dt, dx/dt, dy/dt, dz/dt) = (c, dx/dt, dy/dt, dz/dt) ‘vector’ is, obviously, not a four-vector. [Why obviously? The inner productĀ vμvμ Ā is not invariant.] In fact,Ā Feynman ‘fixes’ the problem by noting that ct, x, y and z have the ‘right behavior’, but the d/dt operator doesn’t. The d/dt operator isĀ not an invariant operator. So how does he fix it then? He tries the (1āˆ’v2/c2)āˆ’1/2Ā·d/dt operator and, yes, it turns out we do get a four-vector then. In fact, we get that four-velocity vector uμ that we were looking for:four-velocity vector[Note we assume we’re using equivalentĀ time and distance units now, soĀ cĀ = 1 andĀ v/c reduces to a new variable v.]

Now how do we know this is four-vector? How can we prove this one? It’s simple. We can get it from ourĀ pμ = (E, p) by dividing it by m0, which is an invariantĀ scalar in four dimensions too. Now, it is easy to see that a division by an invariantĀ scalar doesĀ notĀ change the transformation properties. So just write it all out, and you’ll see that pμ/m0Ā = uμ and, hence, that uμ is a four-vector too. šŸ™‚

We’ve got an interesting thing here actually: division by an invariant scalar, or applying that (1āˆ’v2/c2)āˆ’1/2Ā·d/dt operator, which is referred to as an invariant operator, on a four-vector will give us another four-vector. Why is that? Let’s switch to compatible time and distance units soĀ c = 1 so to simplify the analysis that follows.

The invariant (1āˆ’v2)āˆ’1/2Ā·d/dt operatorĀ and the proper time s

Why is theĀ (1āˆ’v2)āˆ’1/2Ā·d/dt operator invariant? Why does it ‘fix’ things? Well… Think about the invariant spacetime interval (Ī”s)2Ā = Ī”t2Ā āˆ’ Ī”x2Ā āˆ’ Ī”y2Ā āˆ’Ā Ī”z2Ā going to the limit (ds)2Ā = dt2Ā āˆ’ dx2Ā āˆ’ dy2Ā āˆ’ dz2Ā . Of course, we can and should relate this to an invariant quantity s = ∫ ds. Just like Ī”s, this quantity also ‘mixes’ time and distance. Now, we could try to associate some derivative d/ds with it because, as Feynman puts it, “it should be a nice four-dimensional operation because it is invariant with respect to a Lorentz transformation.” Yes. It should be. So let’s relate ds to dt and see what we get. That’s easy enough: dx = vxĀ·dt,Ā dy = vyĀ·dt, dz = vzĀ·dt, so we write:

(ds)2Ā = dt2Ā āˆ’ vx2Ā·dt2Ā āˆ’ vy2Ā·dt2Ā āˆ’Ā vz2Ā·dt2 ⇔ (ds)2Ā = dt2Ā·(1 āˆ’ vx2Ā āˆ’ vy2Ā āˆ’Ā vz2) = dt2Ā·(1 āˆ’ v2)

and, therefore, ds = dtĀ·(1āˆ’v2)1/2. So our operator d/ds is equal to (1āˆ’v2)āˆ’1/2Ā·d/dt, and we can apply it toĀ anyĀ four-vector, as we are sure that, as an invariant operator, it’s going to give us another four-vector. I’ll highlight the result, because it’s important:

The d/ds = (1āˆ’v2)āˆ’1/2Ā·d/dt operator is an invariant operator for four-vectors.

For example, if we apply it to xμ = (t, x, y, z), we get the very same four-velocity vector μμ:

dxμ/ds = uμ = pμ/m0

Now, if you’re somewhat awake, you should ask yourself: what is this s, really,Ā and what is this operator all about? Our new function s = ∫ ds is notĀ the distance function, as it’s got both time and distance in it. Likewise, the invariant operatorĀ d/ds = (1āˆ’v2)āˆ’1/2Ā·d/dt has both time and distance in it (the distance is implicit in the v2Ā factor). Still, it is referred to as the proper timeĀ along the path of a particle. Now why is that? If it’s got distance andĀ time in it, why don’t we call it the ‘proper distance-time’ or something?

Well… The invariant quantity s actually is the time that would be measured by a clock that’s moving along, in spacetime, with the particle. Just think of it: in the reference frame of the moving particle itself, Ī”x, Ī”yĀ and Ī”zĀ must be zero, because it’s not moving in its own reference frame. So theĀ (Ī”s)2Ā = Ī”t2Ā āˆ’ Ī”x2Ā āˆ’ Ī”y2Ā āˆ’Ā Ī”z2Ā reduces to (Ī”s)2Ā = Ī”t2, and so we’re only adding time to s. Of course, this view of things implies that theĀ proper time itself is fixed only up to some arbitrary additive constant, namely the setting of the clock at some event along the ‘world line’ of our particle, which is its path in four-dimensional spacetime. But… Well… In a way, s is the ‘genuine’ or ‘proper’ time coming with the particle’s reference frame, and so that’s why Einstein called it like that. You’ll see (later) that it plays a very important role in general relativity theory (which is a topic we haven’t discussed yet: we’ve only touched special relativity, so no gravity effects).

OK. I know this is simple and complicated at the same time: the math is (fairly) easy but, yes, it may be difficult to ‘understand’ this in some kind of intuitiveĀ way. But let’s move on.

The four-force vector fμ

We know the relativistically correct equation for the motionĀ of some charge q. It’s just Newton’s Law F = dp/dt = d(mv)/dt. The only difference is that we areĀ not assuming that m is some constant. Instead, we use the pĀ = γm0vĀ formula to get:

motion

How can we get a four-vector for the force? It turns out that we get it when applying our new invariant operator to the momentum four-vector pμ = (E, p), so we write: fμ = dpμ/ds. But pμ = m0uμ = m0dxμ/ds, so we can re-write this as fμ = d(m0·dxμ/ds)/ds, which gives us a formula which is reminiscent of the Newtonian F = ma equation:

force formula

WhatĀ isĀ this thing? Well… It’s not so difficult to verify that the x, y and z-components are just our old-fashioned Fx,Ā FyĀ andĀ Fz, so these are the components of F. The t-component is (1āˆ’v2)āˆ’1/2Ā·dE/dt. Now, dE/dt is the time rate of change of energy and, hence, it’s equal to the rate of doing work on our charge, which is equal to F•v. So we can write fμ as:

froce

The force and the tensor

We will now derive that formula which we ended the previous postĀ with. We start with calculating the spacelike components of fμ from the Lorentz formula F = q(E + vƗB). [The terminology is nice, isn’t it? The spacelike components of the four-force vector!Ā Now thatĀ sounds impressive, doesn’t it? But so… Well… It’s really just the old stuff we know already.]Ā So we start with fxĀ =Ā Fx, and write it all out:

fx

What a monster! But,Ā hey! We can ‘simplify’ this by substituting stuff by (1) the t-, x-, y- and z-components of the four-velocity vector uμ and (2) the components of our tensor Fμν = [Fij] = [āˆ‡iAjĀ āˆ’Ā āˆ‡jAi] with i, j = t, x, y, z. We’ll also pop in the diagonal FxxĀ = 0 element, just to make sure it’s all there. We get:

fx 2

Looks better, doesn’t it? šŸ™‚ Of course, it’s just the same, really. This is just an exercise in symbolism. Let me insert the electromagnetic tensor we defined in our previous post, just as a reminder of what that Fμν matrix actually is:

electromagnetic tensor final

If you read my previous post, this matrix – or the concept of a tensor – has no secrets for you. Let me briefly summarize it, because it’s an important result as well. The tensor is (a generalization of) the cross-product inĀ four-dimensional space. We take two vectors:Ā aμ =Ā (at, ax, ay, az) and bμ =Ā (bt, bx, by, bz) and then we takeĀ cross-products of their components just like we did in three-dimensional space, so we write TijĀ = aibjĀ āˆ’Ā ajbi. Now, it’s easy to see that this combination implies that TijĀ = āˆ’ TjiĀ and that TiiĀ = 0, which is why we only have sixĀ independent numbers out of the 16 possible combinations, and which is why we’ll get a so-called anti-symmetric matrix when we organize them in a matrix. In three dimensions, the very same definition of the cross-product TijĀ gives us 9 combinations, and only 3 independent numbers, which is why we represented our ‘tensor’ as a vector too! In four-dimensional space we can’t do that: six things cannot be represented by a four-vector, so weĀ need to use thisĀ matrix, which is referred to as a tensor of the second rank in four dimensions. [When you start using words like that, you’ve come a long way, really. :-)]

[…]Ā OK. Back to our four-force. It’s easy to get a similar one-liner forĀ fyĀ and fzĀ too, of course, as well as for ft. But… Yes, ft… Is it the same thing really? Let me quickly copy Feynman’s calculation for ft:

ft

ItĀ does: remember that vƗB and v are orthogonal, and so their dot product is zero indeed. So, to make a long story short, the four equations – one for each component of the four-force vector fμ – can be summarized in the following elegant equation:

motion equation

Writing this all requires a few conventions, however. For example,Ā Fμν is a 4Ɨ4 matrix and so uν has to be written as a 1Ɨ4 vector. And the formula for theĀ fxĀ and ftĀ component also make it clear that we also want to use the +āˆ’āˆ’āˆ’ signature here, so the convention for the signs in the uνFμν product is the same as that for the scalar product aμbμ. So, in short, you really need to interpret what’s being written here.

A more important question, perhaps, is: what can we do with it? Well… Feynman’s evaluation of the usefulness of this formula is rather succinct: “Although it is nice to see that the equations can be written that way, this form is not particularly useful. It’s usually more convenient to solve for particle motions by using the F = q(E + vƗB) = (1āˆ’v2)āˆ’1/2Ā·d(m0v)/dtĀ equations, and that’s what we will usually do.”

Having said that, this formula really makes good on the promise I started my previous post with: we wanted a formula, someĀ mathematical construct, that effectively presents the electromagnetic force as oneĀ force, as one physical reality. So… Well… Here it is! šŸ™‚

Well… That’s it for today. Tomorrow we’ll talk about energy and about a veryĀ mysterious concept—the electromagnetic mass. That should be fun! So I’llĀ c u tomorrow! šŸ™‚

Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 20, 2020 as a result of a DMCA takedown notice from Michael A. Gottlieb, Rudolf Pfeiffer, and The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/

Relativistic transformations of fields and the electromagnetic tensor

Pre-script (dated 26 June 2020): Our ideas have evolved into a full-blown realistic (or classical) interpretation of all things quantum-mechanical. In addition, I note the dark force has amused himself by removing some material. So no use to read this. Read my recent papers instead. šŸ™‚

Original post:

We’re going to do a very interesting piece of math here. It’s going to bring a lot of things together. The key idea is to present aĀ mathematical construct that effectively presents the electromagnetic force as oneĀ force, as one physical reality. Indeed, we’ve been saying repeatedly that electromagnetism is oneĀ phenomenon only but we’ve been writingĀ it always as something involvingĀ twoĀ vectors: he electric field vector E and the magnetic field vector B. Of course, Lorentz’ force law F = q(E + vƗB) makes it clear we’re talking one force only but… Well… There is a way of writing it all up that is much more elegant.

I have to warn you though: this post doesn’t add anything to theĀ physicsĀ we’ve seen so far: it’s all math, really and, to a large extent, math only. So if you read this blog because you’re interested in the physics only, then you may just as well skip this post. Having said that, theĀ mathematical concept we’re going to present is that of the tensorĀ and… Well… You’ll have to get to know that animal sooner or later anyway, so you may just as well give it a try right now, and see whatever you can get out of this post.

The concept of a tensor further builds on the concept of the vector, which we liked so much because it allows us to write the laws of physics as vector equations, which do notĀ change whenĀ going from one reference frame to another. In fact, we’ll see that a tensor can be described as a ‘special’ vectorĀ cross productĀ (to be precise, we’ll show that a tensor is a ‘more general’ cross product, really). So the tensor and vector concepts areĀ veryĀ closely related, but then… Well… If you think about it, the concept of a vector and the concept of a scalar are closely related, too! So we’re just moving up the value chain, so to speak: from scalar fields to vector fields to… Well… Tensor fields! And in quantum mechanics, we’ll introduce spinors, and so we also have spinor fields!Ā Having said that, don’t worry about tensor fields. Let’s first try to understand tensorsĀ tout court.Ā šŸ™‚

So… Well… Here we go.Ā Let me start with it all by reminding you of the concept of a vector, and why we like to use vectors and vector equations.

The invariance of physics and the use of vector equations

What’s a vector? You may think, naively, that any one-dimensional array of numbers is a vector. But… Well… No! In math, we may, effectively, refer to anyĀ one-dimensionalĀ array of numbersĀ as a ‘vector’, perhaps, but in physics, a vector does represent something real, something physical, and so a vector is only a vector if it transforms like a vectorĀ underĀ theĀ transformationĀ rules that apply when going from one another frame of reference, i.e. one coordinate system, to another. Examples of vectors in three dimensions are: the velocity vector v, or the momentum vector p = mĀ·v, or the position vector r.

Needless to say, the same can be said of scalars: mathematicians may define a scalar as just any real number, but it’sĀ notĀ in physics. A scalar in physics refers to something real, i.e.Ā a scalar field, like the temperature (T) inside of a block of material. In fact, think about your first vector equation: it may have been the one determining the heat flow (h), i.e. h =Ā āˆ’ĪŗĀ·āˆ‡T = (āˆ’ĪŗĀ·āˆ‚T/āˆ‚x, āˆ’ĪŗĀ·āˆ‚T/āˆ‚y, āˆ’ĪŗĀ·āˆ‚T/āˆ‚z). It immediately shows how scalar and vector fields are intimately related.

Now, when discussing the relativistic framework of physics, we introduced vectors inĀ fourĀ dimensions, i.e.Ā four-vectors.Ā The most basic four-vector is the spacetime four-vector R = (ct, x, y, z), which is often referred to as an event, but it’s just aĀ point in spacetime, really. So it’s a ‘point’ with a time as well as a spatial dimension, so it also has t in it, besides x, y and z. It is also known as the position four-vectorĀ but, again, you should think of a ‘position’ that includes time! Of course, we can re-write R as R = (ct, r), with r = (x, y, z), so here we sort of ‘break up’ the four-vector in a scalar and a three-dimensional vector, which is something we’ll do from time to time, indeed. šŸ™‚

We also have aĀ displacement four-vector, which we can write asĀ Ī”R = (cĀ·Ī”t, Ī”r). There are other four-vectors as well, including theĀ four-velocity, theĀ four-momentum and theĀ four-forceĀ four-vectors, which we’ll discuss later (in the last section of this post).

So it’s just like using three-dimensional vectors in three-dimensional physics, or ‘Newtonian’ physics, I should say: the use of four-vectors is going toĀ allow us to write the laws of physics usingĀ vector equations, butĀ in four dimensions, rather than three, so we get the ‘Einsteinian’ physics, the realĀ physics, so to speak—or the relativistically correct physics, I should say. And so these four-dimensional vector equations will also notĀ change whenĀ going from one reference frame to another, and so our four-vector will be vectors indeed, i.e. they willĀ transformĀ like a vectorĀ underĀ theĀ transformationĀ rules that apply when going from one another frame of reference, i.e. one coordinate system, to another.

What transformation? Well… In Newtonian or Galilean physics, we had translations and rotations and what have you, but what weĀ are interested in right now areĀ ‘Einsteinian’ transformations of coordinate systems, so these have to ensure that allĀ of the laws of physics that we know of, including the principle of relativity,Ā still look the same.Ā You’ve seen these transformation rules. We don’t call them the ‘Einsteinian’ transformation rules, but the LorentzĀ transformation rules, because it was a Dutch physicist (Hendrik Lorentz) who first wrote them down. So these rules are veryĀ differentĀ from the Newtonian or Galilean transformation rules which everyone assumed to be valid until the Michelson-Morley experiment unequivocally established that the speed of lightĀ did notĀ respect the Galilean transformation rules. VeryĀ different? Well… Yes. In their mathematical structure, that is. Of course, when velocities are low, i.e.Ā non-relativistic, then they yield the same result,Ā approximately, that is. However,Ā I explained that in my post on special relativity, and so I won’t dwell on that here.

Let me just jot down both sets of rules assumingĀ that the two reference frames move with respect to each other along the x- axis only, so the y- and z-component of u is zero.

Capture

The Galilean or Newtonian rules are the simple rules on the right. Going from one reference frame to another (let’s call them S and S’ respectively) is just a matter of adding or subtracting speeds: if my car goes 100 km/h, and yours goes 120 km/h, then youĀ will see my car falling behind at a speed of (minus) 20 km/h. That’s it. We could alsoĀ rotateĀ our reference frame, and our NewtonianĀ vector equationsĀ would still look the same. As Feynman notes, smilingly, it’s what a lot of armchair philosophers think relativity theory is all about, but so it’s got nothing to do with it. It’s plain wrong!

In any case, back to vectors and transformations.Ā The key to the so-calledĀ invarianceĀ of the laws of physics is the use of vectors and vector operators that transform like vectors.Ā For example, if we defined A and B as (Ax, Ay, Az) and (Bx, By, Bz), then we knew that the so-called inner productĀ A•BĀ would look the same in all rotated coordinate systems, so we can write: A•B =Ā A’•B’. So we know that if we have a product like that on both sides of an equation, we’re fine: the equation will have the same formĀ in all rotated coordinate systems. Also, the gradient, i.e. ourĀ vector operatorĀ āˆ‡Ā = (āˆ‚/āˆ‚x, āˆ‚/āˆ‚y, āˆ‚/āˆ‚z), when applied to a scalar function, gave three quantities that also transform like a vector under rotation. We also defined a vectorĀ crossĀ product, which yielded aĀ vector (as opposed to the inner product, i.e. the vectorĀ dotĀ product, which yields a scalar):

cross product

So how does this thing behave under a Galilean transformation? Well… You may or may not remember that we used this cross-product to define theĀ angular momentum L, which was a cross product of the radius vector r and the momentum vector p = mv, as illustrated below. The animation also gives the torque Ļ„, which is, loosely speaking, a measure of the turning force: it’s the cross product of r and F, i.e. the force on the lever-arm.

Torque_animation

The components of L are:

momentum angular

Now, we find that these three numbers, orĀ objectsĀ if you want, transform inĀ exactly the same wayĀ as the components of a vector.Ā However, as Feynman points out, that’s a matter of ‘luck’ really. It’s something ‘special’. Indeed,Ā you may or may not remember that we distinguished axialĀ vectors fromĀ polar vectors. L is an axial vector, while r and p are polar vectors, and so we find that, in three dimensions, the cross product of two polar vectors will always yields an axial vector. Axial vectors are sometimes referred to asĀ pseudovectors, which suggests that they are ‘not so real’ as… Well… Polar vectors, which are sometimes referred to as ‘true’ vectors. However, it doesn’t matter when doing these Newtonian or Galilean transformations: pseudo or true, both vectors transform like vectors. šŸ™‚

But so… Well… We’re actually getting a bit of a heads-up here: if we’d be mixing (or ‘crossing’) polar and axial vectors, or mixing axial vectors only, so if we’d define something involvingĀ LĀ andĀ pĀ (rather than r and p), or something involvingĀ LĀ andĀ Ļ„, then we may notĀ be so lucky, and then we’d have to carefully examine our cross-product, or whatever other product we’d want to define, because its components mayĀ notĀ behave like a vector.

Huh? Whatever other product we’d want to define? Why are you saying that?Ā Well…Ā We actuallyĀ can think of other products. For example, if we haveĀ two vectors a = (ax, ay,Ā az) and b = (bx, by, bz), then we’ll haveĀ nine possible combinations of their components, which we can write as TijĀ = aibj. So that’s like Lxy, LyzĀ and LzxĀ really. Now, you’ll say: “No. It isn’t. We don’t have nine combinations here. Just three numbers.” Well… Think about it: we actually doĀ haveĀ nineĀ LijĀ combinations too here, as we can write: LijĀ = riĀ·pj – rjĀ·pi. It just happens that, with this definition, only threeĀ of these combinations LijĀ are independent. That’s because the other six numbers are either zero or the opposite. Indeed, it’s easy to verify thatĀ LijĀ = –LjiĀ , and LiiĀ  = 0. So… Well… It turns out that the three components of our LĀ = rƗp ‘vector’ are actually a subset of a set of nineĀ LijĀ numbers.Ā So… Well… Think about it. We cannot just do whatever we want with our ‘vectors’. We need to watch out.

In fact, I do not want to get too much ahead of myself, but I can already tell you that the matrix with these nineĀ TijĀ = aibjĀ combinations is what is referred to as the tensor. To be precise, it’s referred to as a tensor of the second rank in three dimensions. The ‘second rank’, aka as ‘degree’ or ‘order’ refers to the fact that we’ve got two indices, and the ‘three dimensions’ is because we’re using three-dimensional vectors. We’ll soon see that the electromagnetic tensor is also of the second rank, but it’s a tensor in four dimensions. In any case, I should not get ahead of myself. Just note what I am saying here: the tensor is like a ‘new’ product of two vectors, a new type of ‘cross’ product really (because we’re mixing the components, so to say), but it doesn’t yield a vector: it yields aĀ matrix. For three-dimensional vectors, we get a 3Ɨ3 matrix. For four-vectors, we’ll get a 4Ɨ4 matrix. And so the full truth about our angular momentum vector L, is the following:

  1. There is a thing which we call the angular momentum tensor. It’s a 3Ɨ3 matrix, so it has nine elements which are defined as: LijĀ = riĀ·pj – rjĀ·pi. BecauseĀ of this definition, it’s an antisymmetric tensor of the second order in three dimensions, so it’s got only three independentĀ components.
  2. The three independent elements are the components of our ‘vector’ L, and picking them out and calling these three components a ‘vector’ is actually a ‘trick’ that only works in three dimensions. They really just happen toĀ transform like a vector under rotation or under whatever Galilean transformation! [By the way, do you know understand why I was saying that we can look at a tensor as a ā€˜more general’ cross product?]
  3. In fact, in four dimensions, we’ll use a similar definition and define 16 elementsĀ FijĀ as FijĀ = āˆ‡iAjĀ āˆ’Ā āˆ‡jAi, using the two four-vectors āˆ‡Ī¼Ā and Aμ (so we have 4Ɨ4 = 16 combinations indeed), out of whichĀ only sixĀ will be independent for the very same reason: we have an antisymmetric vector combination here, FijĀ = āˆ’FjiĀ and FiiĀ = 0. šŸ™‚ However, because we cannotĀ represent six independent things by four things, we doĀ notĀ get some other four-vector, and so that’s why we cannot apply the same ‘trick’ in four dimensions.

However, here I amĀ getting way ahead of myself and so… Well… Yes. Back to the main story line. šŸ™‚ So let’s try to move to the next level of understanding, which is… Well…

Because of guys like Maxwell and Einstein, we now knowĀ that rotations are part of the Newtonian world, in which time and space are neatly separated, and that things are notĀ so simple in Einstein’s world, which is the real world, as far as we know, at least! Under a Lorentz transformation, the new ā€˜primed’ space and time coordinates are a mixture of the ā€˜unprimed’ ones. Indeed, the new x’Ā is a mixture of x and t, and the new t’Ā is a mixture of x and t as well. [Yes, please scroll all the way up and have a look at the transformation on the left-hand side!]

So youĀ don’t have that under a Galilean transformation: in the Newtonian world, space and time are neatly separated, and time is absolute, i.e. it is the same regardless of the reference frame. In Einstein’s world – our world – that’s not the case: time is relative, orĀ localĀ as Hendrik Lorentz termed it quite appropriately,Ā and so it’s space-time – i.e. ā€˜some kind of union of space and time’ as Minkowski termed it – that transforms.

So that’s why physicists useĀ four-vectorsĀ toĀ keep track of things. These four-vectors always have three space-like components, but they also include one so-calledĀ time-like component.Ā It’s the only way to ensure thatĀ the laws of physics are unchanged when moving with uniform velocity.Ā Indeed, any true law of physics we write down must be arranged so that the invariance of physics (as a “fact of Nature”, as Feynman puts it) is built in, and so that’s why we use Lorentz transformations and four-vectors.

In the mentioned post, I gave a few examples illustrating how the Lorentz rules work. Suppose we’re looking at some spaceship that is moving at half the speed of light (i.e. 0.5c) and that, inside the spaceship, some object is also moving at half the speed of light, as measured in the reference frame of the spaceship, then we get the rather remarkable result that, from ourĀ point of view (i.e. ourĀ reference frame as observer on the ground), that object is notĀ going as fast as light, as Newton or Galileo – and most present-day armchair philosophers šŸ™‚ – would predict (0.5cĀ +Ā 0.5cĀ = c). We’d see it move at a speed equal to vĀ =Ā 0.8c. Huh?Ā How do we know that? Well… We can derive a velocity formula from the Lorentz rules:

Capture

So now you can just put in the numbers now:Ā vxĀ = (0.5c + 0.5c)/(1 + 0.5Ā·0.5) = 0.8c. See?

Let’s do another example. Suppose we’re looking at a light beam inside the spaceship, so something that’s traveling at speed c itself in the spaceship. How does that look to us? The Galilean transformation rules say its speed should be 1.5c, but that can’t be true of course, and the Lorentz rules save us once more: vxĀ = (0.5cĀ + c)/(1 + 0.5Ā·1) = c, so it turns out that the speed of light doesĀ notĀ depend on the reference frame: it looks the same – both to the man in the ship as well as to the man on the ground. As Feynman puts it: “This is good, for it is, in fact, what the Einstein theory of relativity was designed to do in the first place—so it hadĀ betterĀ work!” šŸ™‚

So let’s now apply relativity to electromagnetism. Indeed, that’s what this post is all about! However, before I do so, let me re-write the Lorentz transformation rules forĀ cĀ = 1. We can equate the speed of light to one, indeed, when measure time and distance in equivalent units. It’s just a matter of ditching our seconds for meters (so our time unit becomes the time that light needs to travel a distance of one meter), or ditching our meters for seconds (so our distance unit becomes the distance that light travels in one second). You should be familiar with this procedure. If not, well… Check out my posts on relativity. So here’s the same set of rules for cĀ = 1:

Lorentz rules

They’re much easier to remember and work with, and so that’s good, because now we need toĀ look at how these rules work with four-vectors and the various operations and operators we’ll be defining on them. Let’s look at that step by step.

Electrodynamics in relativistic notation

Let me copy the UniversalĀ Set of Equations and Their Solution once more:

frame

The solution for Maxwell’s equations is given in terms of the (electric) potential Φ and the (magnetic) vectorĀ potential A. I explained that in my post on this, so I won’t repeat myself too much here either. The only point you should note is that this solution is the result of a special choice of Φ andĀ A, which we referred to as the Lorentz gauge.Ā We’ll touch upon this condition once more, so just make a mental note of it.

Now, E and B do not correspond to four-vectors: theyĀ dependĀ on x, y, z and t, but they haveĀ threeĀ components only:Ā Ex, Ey,Ā Ez, and Bx, By, and BzĀ respectively. So we have sixĀ independent terms here, rather thanĀ four things that, somehow, we could combine into some four-vector. [Does this ring a bell? It should. :-)] Having said that, it turns out that weĀ canĀ combine Φ andĀ AĀ into a four-vector, which we’ll refer to as the four-potentialĀ and which we’llĀ will write as:

Aμ = (Φ, A) = (Φ, Ax, Ay, Az) = (At, Ax, Ay, Az) with At = Φ.

So that’s a four-vector just likeĀ R = (ct, x, y, z).

How do we know that Aμ is a four-vector? Well… Here I need to say a few things about those Lorentz transformation rules and, more importantly, about the required condition ofĀ invarianceĀ under a Lorentz transformation. So, yes, here we need to dive into the math.

Four-vectors and invariance under Lorentz transformations

When you were in high-school, you learned how toĀ rotateĀ your coordinate frame. You also learned that the distance of a point from the origin does not change under a rotation, so you’d writeĀ r’2Ā = x’2Ā + y’2Ā + z’2Ā =Ā r2Ā =Ā x2Ā + y2Ā + z2, and you’d say that r2Ā is an invariant quantity under a rotation.Ā Indeed, transformations leave certain things unchanged. From the Lorentz transformation rules itself, it is easy to see that

cĀ·t’2 – x’2 – y’2 –z ‘2 = cĀ·t2 –x2 – y2Ā  – z2, or,

if cĀ = 1, thatĀ t’2 – x’2 – y’2 – z’2 = t2 – x2 – y2Ā  – z2,

is anĀ invariantĀ under a Lorentz transformation. We found the same for the so-calledĀ spacetimeĀ interval Ī”s2Ā  =Ā Ī”r2 – cĪ”t2, which we write as Ī”s2Ā  =Ā Ī”r2 – Ī”t2Ā as we chose our time or distance units such that cĀ = 1. [Note that, from now on, we’ll assume that’s the case, so cĀ = 1 everywhere. We can always change back to our old units when we’re done with the analysis.] Indeed, such invariance allowed us to define spacelike,Ā timelikeĀ and lightlikeĀ intervals using the so-called light cone emanating from a single event and traveling in all directions.

You should note that, for four-vectors, we do not have a simple sum of three terms. Indeed, we don’t write x2Ā + y2Ā + z2Ā but t2 – x2 – y2Ā  – z2. So we’ve got a +āˆ’āˆ’āˆ’ thing here or, it’s just another convention, we could also work with a āˆ’+++ sum of terms. The convention is referred to as the signature, and we will use the so-calledĀ metric signature here, which is +āˆ’āˆ’āˆ’. Let’s continue the story.Ā Now, all four-vectors aμ = (at, ax, ay, az) have this property that:

at2 – ax2 – ay2 – az2 = at2 – ax2 – ay2Ā  – az2.

[The primed quantities are, obviously, the quantities as measured in the other reference frame.]Ā So. Well… Yes. šŸ™‚ But… Well… Hmm… We can sayĀ that our four-potential vector is a four-vector, butĀ so we still have toĀ proveĀ that. So we need to prove that Φ’2 – Ax2 – Ay2 – Az2 = Φ2 – Ax2 – Ay2Ā  – Az2Ā for ourĀ four-potential vectorĀ Aμ = (Φ, A). So… Yes… How can we do that? The proof isĀ notĀ so easy, but you need to go through it as it will introduce some more concepts and ideas you need to understand.

In my post on the Lorentz gauge, I mentioned that Maxwell’s equations can be re-written in terms of Φ andĀ A, rather than in terms of E and B. The equations are:

Equations 2

The expression look rather formidable, but don’t panic: just look at it. Of course, you need to be familiar with the operators that are being used here, so that’s the Laplacian āˆ‡2Ā and the divergence operator āˆ‡ā€¢ that’s being applied to the scalar Φ and the vector A. I can’t re-explain this. I am sorry. Just check my posts on vector analysis.Ā You should also look at the third equation: that’s just the Lorentz gauge condition, which we introduced when derivingĀ these equations from Maxwell’s equations. Having said that, it’s the first and second equation which describe Φ and A as a function of the charges and currents in space, and so that’s what matters here. So let’s unfold the first equation. It says the following:

potential formula

In fact, if we’d be talkingĀ freeĀ or empty space, i.e. regions where there are no charges and currents, then the right-hand side would be zero and this equation would then represent a wave equation, so some potential Φ that is changing in time and moving out at the speed c. Here again, I am sorry I can’t write about this here: you’ll need to check one of my posts on wave equations. If you don’t want to do that, you should believe me when I say that, if you see an equation like this:

f8then the functionĀ ĪØ(x, t) must be some function

solution

Now, that’s a function representing a wave traveling at speed c, i.e. the phase velocity. Always? Yes.Ā Always! It’s got to do with the x āˆ’ ct and/or x +Ā ctĀ  argument in the function. But, sorry, I need to move on here.

The unfolding of the equation with Φ makes it clear that we have four equations really. Indeed, the second equation is three equations: one for Ax, one forĀ Ay, and one forĀ AzĀ respectively. The four quantities on the right-hand side of these equations are ρ, jx, jyĀ and jzĀ respectively, divided by ε0, which is a universal constant which does notĀ change when going from one coordinate system to another.Ā Now, the quantities ρ, jx, jyĀ and jzĀ transform like a four-vector. How do we know that? It’s just the charge conservation law. We used it when solving the problem of the fields around a moving wire, when we demonstrated theĀ relativityĀ of the electric and magnetic field. Indeed, the relevant equations were:

Lorentz j and rho

You can check that against the Lorentz transformation rules forĀ cĀ = 1. They’re exactly the same, but so we chose t = 0, so the rules are even simpler. Hence, the (ρ,Ā jx, jy, jz) vector is, effectively, aĀ four-vector, and we’ll denote it by jμ = (ρ, j).Ā I nowĀ need to explain something else. [And, yes,Ā I know this is becoming a veryĀ long story but… Well… That’s how it is.]

It’s about our operatorsĀ āˆ‡, āˆ‡ā€¢, āˆ‡Ć— and āˆ‡2Ā , so that’s the gradient, theĀ divergence, curlĀ and LaplacianĀ operator respectively: they all have a four-dimensional equivalent. Of course, that won’t surprise you. 😦 Let me just jot all of them down, so we’re done with that, and then I’ll focus on the four-dimensional equivalent of the LaplacianĀ Ā āˆ‡ā€¢āˆ‡ =Ā āˆ‡2Ā , which is referred to as theĀ D’Alembertian, and which is denoted byĀ ā–”2, because that’s the one we need to prove that our four-potential vector is a realĀ four-vector. [I know: ā–”2Ā is a tiny symbol for a pretty monstrous thing, but I can’t help it: my editor tool is pretty limited.]

Four-vectors

Now, we’re almost there. Just hang in for a little longer. It should be obvious that we can re-write those two equations with Φ, A, ρ and j, as:

Formula d'alembertian 2

Just to make sure, let me remind you that Aμ = (Φ, A) and thatĀ jμ = (ρ, j). Now, our new D’Alembertian operator is just an operator—a prettyĀ formidableĀ operator but, still, it’s an operator, and so itĀ doesn’t change when the coordinate system changes, so the conclusion is that,Ā IFĀ jμ = (ρ, j) is a four-vector – which it is – and, therefore, transforms like a four-vector,Ā THENĀ the quantities Φ, Ax, Ay, and AzĀ must also transformĀ like a four-vector, which means they areĀ (the components of) a four-vector.

So… Well… Think about it, but not too long, because it’s just an intermediate result we had to prove. So that’s done. But we’re not done here. It’s just the beginning, actually. :-/ Let me repeat our intermediate result:

Aμ = (Φ, A) is a four-vector. We call it the four-potential vector.

OK. Let’s continue. Let me firstĀ draw your attention to that expression with the D’Alembertian above. Which expression? This one:

Formula d'alembertian 2

What about it? Well… You should note thatĀ the physics of that equation is just the same as Maxwell’s equations. So it’s one equation only, but it’s got it all.

It’s quite a pleasure to re-write it in such elegant form. Why? Think about it: it’s a four-vector equation: we’ve got a four-vector on the left-hand side, and a four-vector on the right-hand side. Therefore, this equation is invariant under a transformation. So, therefore,Ā it directly shows the invariance of electrodynamics under the Lorentz transformation.

Huh? Yes. You may think about this a little longer. šŸ™‚

To wrap this up, I should also note that we can also express the gauge condition using our new four-vector notation. Indeed, we can write it as:

Lorentz condition

It’s referred to as the Lorentz condition and it is, effectively, a condition for invariance, i.e. it ensures that the four-vector equation above does stay in the form it is in for all reference frames. Note that we’re re-writing it using the four-dimensional equivalent of the divergence operatorĀ āˆ‡ā€¢, but so we don’t have a dot between āˆ‡Ī¼Ā and Aμ. In fact, the notation is pretty confusing, and it’s easy to think we’re talking some gradient, rather than the divergence. So let me therefore highlight the meaning of both once again. It looks the same, but it’s two veryĀ different things: the gradient operates on a scalar, while the divergence operates on a (four-)vector. Also note the +āˆ’āˆ’āˆ’ signature is only there for theĀ gradient, not for the divergence!

example

You’ll wonder why they didn’t use some • orĀ āˆ— symbol, and the answer: I don’t know. I know it’s hard to keep inventing symbols for all these different ‘products’ – the āŠ— symbol, for example, is reserved forĀ tensorĀ products, which we won’t get into – but… Well… I think they could have done something here. 😦

In any case… Let’s move on. Before we do, please note that we can also re-write our conservation law for electric charge using our new four-vector notation. Indeed, you’ll remember that we wrote that conservation law as:

conservation law

Using our new four-vector operator āˆ‡Ī¼, we can re-write that as āˆ‡Ī¼jμ = 0. So all of electrodynamics can be summarized in the two equations only—Maxwell’s law and the charge conservation law:

all

OK. We’re now ready to discuss the electromagnetic tensor. [I know… This is becoming an incredibly long and incredibly complicated piece but, ifĀ you get through it, you’ll admitĀ it’s really worth it.]

The electromagnetic tensor

The whole analysis above was done in terms of the Φ and A potentials. It’s time to get back to our field vectorsĀ E and B. We know we can easily get them from Φ and A, using the rules we mentioned as solutions:

E and B solutions

These two equations shouldĀ notĀ look as yet another formula. They are essential, and you should be able to jot them down anytime anywhere. They should be on your kitchen door, in your toilet and above your bed. šŸ™‚Ā For example, the second equation gives us theĀ components of the magnetic field vector B:

B field components

Now, look at these equations. The x-component is equal to a couple of terms that involve only y– and z-components. The y-component is equal to something involving only xĀ and z.Ā Finally, the z-component only involves x and y. Interesting. Let’s define a ‘thing’ we’ll denote by FzyĀ and define as:

F definition

So now we can write: BxĀ = Fzy,Ā ByĀ = Fxz, andĀ BzĀ = Fxy. Now look at our equation for E. It turns out the components of E are equal to things likeĀ Fxt, FytĀ and Fzt! Indeed,Ā FxtĀ =Ā āˆ‚Ax/āˆ‚tĀ āˆ’Ā āˆ‚At/āˆ‚x = Ex!

But… Well… No. 😦 The sign is wrong!Ā ExĀ = āˆ’āˆ‚Ax/āˆ‚tāˆ’āˆ‚At/āˆ‚x, so we need to modify our definition of Fxt. When the t-component is involved, we’ll define our ‘F-things’ as:

time f

So we’ve got a plus instead of a minus. It looks quite arbitrary but, frankly, you’ll have to admit it’s sort of consistent with our +āˆ’āˆ’āˆ’ signatureĀ for our four-vectors and, in just a minute, you’ll see it’s fully consistent with our definition of the four-dimensional vector operatorĀ āˆ‡Ī¼Ā = (āˆ‚/āˆ‚t, āˆ’āˆ‚/āˆ‚x,Ā āˆ’āˆ‚/āˆ‚y, āˆ’āˆ‚/āˆ‚z). So… Well… Let’s go along with it.

What about the Fxx, Fyy, FzzĀ and FttĀ terms? Well…Ā FxxĀ =Ā āˆ‚Ax/āˆ‚x āˆ’Ā āˆ‚Ax/āˆ‚x = 0, and it’s easy to see that FyyĀ and FzzĀ are zero too. ButĀ Ftt? Well… It’s a bit tricky but, applying our definitions carefully, we see that FttĀ must be zero too. In any case, theĀ FttĀ = 0 will become obvious as we will be arranging these ‘F-things’ in a matrix, which is what we’ll do now.Ā [Again: does this ring a bell? If not, it should. :-)]

Indeed, we’ve got sixteen possible combinations here, which Feynman denotes as Fμν, which is somewhat confusing, because Fμν usually denotes theĀ 4Ɨ4 matrixĀ representing all of these combinations. So let me use the subscripts i and j instead, and define FijĀ as:

FijĀ =Ā āˆ‡iAjĀ āˆ’Ā āˆ‡jAi

with āˆ‡iĀ being the t-, x-, y- orĀ z-component of āˆ‡Ī¼Ā =Ā (āˆ‚/āˆ‚t, āˆ’āˆ‚/āˆ‚x,Ā āˆ’āˆ‚/āˆ‚y,Ā āˆ’āˆ‚/āˆ‚z) and, likewise, AiĀ being the t-, x-, y- orĀ z-component of Aμ =Ā (Φ, Ax,Ā Ay,Ā Az). Just check it: FzyĀ = āˆ’āˆ‚Ay/āˆ‚z +Ā āˆ‚Az/āˆ‚y = āˆ‚Az/āˆ‚y āˆ’ āˆ‚Ay/āˆ‚z = Bx, for example, andĀ FxtĀ =Ā āˆ’āˆ‚Ī¦/āˆ‚x āˆ’Ā āˆ‚Ax/āˆ‚tĀ = Ex. So theĀ +āˆ’āˆ’āˆ’ conventionĀ works. [Also note that it’s easier now to see that FttĀ = āˆ‚Ī¦/āˆ‚t āˆ’ āˆ‚Ī¦/āˆ‚t = 0.]

We can now arrange the FijĀ in a matrix. This matrix is antisymmetric, because FijĀ = – Fji, and its diagonal elements are zero. [For those of you who love math: note that the diagonal elements of an antisymmetric matrix are always zero because of the FijĀ = – FjiĀ constraint: just use k = i = j in the constraint.]

Now that matrix is referred to as the electromagnetic tensor and it’s depicted below (we pluggedĀ cĀ back in, remember thatĀ B’s magnitude is 1/c times E’s magnitude).

electromagnetic tensor final

So… Well… Great ! We’re done! Well… Not quite. šŸ™‚

We can get this matrix in a number of ways. The least complicated way is, of course, just to calculate all FijĀ components and them put them in a [Fij] matrix using theĀ iĀ as the row number and theĀ jĀ as the column number. You need to watch out with the conventions though, and soĀ i and j startĀ onĀ t and end on z. šŸ™‚

The other way to do it is to write theĀ āˆ‡Ī¼Ā = (āˆ‚/āˆ‚t, āˆ’āˆ‚/āˆ‚x,Ā āˆ’āˆ‚/āˆ‚y,Ā āˆ’āˆ‚/āˆ‚z) operator as a 4Ɨ1 column vector, which you then multiply with the four-vector Aμ written as a 4Ɨ1 row vector. So āˆ‡Ī¼Aμ is then aĀ 4Ɨ4 matrix, which we combine with its transpose, i.e.Ā (āˆ‡Ī¼Aμ)T, as shown below. So what’s written below is (āˆ‡Ī¼Aμ) āˆ’ (āˆ‡Ī¼Aμ)T.

matrix

If you google, you’ll see there’s more than one way to go about it, so I’d recommend you just go through the motions and double-check the whole thing yourself—and please do let me know if you find any mistake! In fact, the Wikipedia article on the electromagnetic tensor denotes the matrix above asĀ Fμν, rather than asĀ Fμν, which is the sameĀ tensorĀ but in its so-calledĀ covariantĀ form, but so I’ll refer you to that article as I don’t want to make things even more complicated here! As said, there’s differentĀ conventionsĀ around here, and so you need to double-check what is what really. šŸ™‚

Where are we heading with all of this? The next thing is to look at theĀ LorentzĀ transformation of theseĀ FijĀ =Ā āˆ‡iAjĀ āˆ’Ā āˆ‡jAiĀ components, because then we know how our E and B fields transform. Before we do so, however, we should note the more general results and definitions which we obtained here:

1. The Fμν matrix (a matrix is just a multi-dimensional array, of course) is a so-called tensor. It’s a tensor of the second rank, because it has two indices in it. We think of it as a very special ‘product’ of two vectors, not unlike the vector cross product aĀ Ć— b, whose components were also defined by a similar combination of the components of a and b. Indeed, we wrote:

cross product

So one should think of a tensor as “another kind of cross product” or, preferably, and as Feynman puts it, as a “generalization of the cross product”.

2.Ā In this case, the four-vectors are āˆ‡Ī¼Ā =Ā (āˆ‚/āˆ‚t, āˆ’āˆ‚/āˆ‚x,Ā āˆ’āˆ‚/āˆ‚y,Ā āˆ’āˆ‚/āˆ‚z) and Aμ =Ā (Φ, Ax,Ā Ay,Ā Az). Now, you will probably say that āˆ‡Ī¼Ā is an operator, not a vector, and you are right. However, we know that āˆ‡Ī¼Ā behaves like a vector, and so this is just a special case. The point is: because the tensor is based on four-vectors, the Fμν tensor is referred to as a tensor of the second rank in four dimensions. In addition, because of the FijĀ = – FjiĀ result, Fμν is an asymmetric tensor of the second rank in four dimensions.

3.Ā Now, the whole point is to examine how tensors transform. We know that the vector dot product, aka the inner product, remains invariantĀ under a Lorentz transformation, both in three as well as in four dimensions, but what about the vector cross product, and what about the tensor? That’s what we’ll be looking at now.

The Lorentz transformation of the electric and magnetic fields

Cross products are complicated, and tensors will be complicated too. Let’s recall our example in three dimensions, i.e. the angular momentum vectorĀ L, which was a cross product of the radius vector r and the momentum vector p = mv, as illustrated below (the animation also gives the torque Ļ„, which is, loosely speaking, a measure of the turning force).

Torque_animation

The components of L are:

momentum angular

Now, this particular definition ensures that LijĀ turns out to be an antisymmetric object:

three-vector

So it’s a similar situation here. We have nineĀ possible combinations, but only threeĀ independent numbers. So it’s a bit like our tensor in four dimensions: 16 combinations, but only 6 independent numbers.

Now, it so happens that that these three numbers, orĀ objectsĀ if you want, transform inĀ exactly the same wayĀ as the components of a vector.Ā However, as Feynman points out, that’s a matter of ‘luck’ really. In fact, Feynman points out that, when we have two vectors a = (ax, ay,Ā az) and b = (bx, by, bz), we’ll haveĀ nineĀ products TijĀ = aibjĀ which will also form aĀ tensorĀ of the second rank (cf. the two indices) but which, in general, will not obey the transformation rules we got for theĀ angular momentumĀ tensor, whichĀ happenedĀ to be an antisymmetric tensor of the second rank in three dimensions.

To make a long story short, it’s not simple in general, and surely not here: with E and B, we’ve gotĀ sixĀ independent terms, and so we cannotĀ represent six things by four things, so the transformation rules for E and B will differ from those for a four-vector. So what areĀ they then?

Well… Feynman first works out the rules for the general antisymmetric vector combination GijĀ = aibjĀ āˆ’ ajbi, withĀ aiĀ and bjĀ the t-, x-, y- or z-component of the four-vectors aμ = (at, ax, ay, az) and bμ = (bt, bx, by, bz) respectively. The idea is to first get some general rules, and then replace GijĀ = aibjĀ āˆ’ ajbiĀ by FijĀ =Ā āˆ‡iAjĀ āˆ’Ā āˆ‡jAi, of course! So let’s apply the Lorentz rules, which – let me remind you – are the following ones:

Lorentz rules

So we get:

set 1

The rest is all very tedious: you just need to plug these things into the variousĀ GijĀ = aibjĀ āˆ’ ajbiĀ formulas. For example, for G’tx, we get:

G1

Hey! That’s justĀ G’tx, so we find that G’txĀ =Ā Gtx! What about the rest? Well… ThatĀ yields something different. Let me shorten the story by simply copying Feynman here:

resulsts

So… Done!

So what?

Well… Now we just substitute. In fact, thereĀ areĀ two alternative formulations of the Lorentz transformations of E and B. They are given below (note the units are such thatĀ c = 1):

result 1 result 2

In addition, there is a third equivalent formulation which is more practical, and also simpler, even if it puts theĀ c‘s back in. It re-defines the field components, distinguishing only two:

  1. The ‘parallel’ components E||Ā and B||Ā along theĀ x-direction ( because they are parallel to the relative velocity of the S and S’ reference frames), and
  2. The ‘perpendicular’ or ‘total transverse’ componentsĀ E⊄ and B⊄, which are the vector sums of the y- and z-components.

So that gives us four equations only:

result 3

And, yes, we areĀ done now. This is the Lorentz transformation of the fields. I am sure it has left you totally exhausted. Well… If not… […] It sure left me totally exhausted. šŸ™‚

To lighten things up, let me insert an image of how the transformed field E actually looks like. The first image is the reference frame of a charge itself: we have a simple Coulomb field. The second image shows the charge flying by. Its electric field is ‘squashed up’. To be precise, it’s just like the scale ofĀ xĀ is squashed up by a factorĀ ((1āˆ’v2/c2)1/2. Let me refer you to Feynman for the detail of the calculations here.

field

OK. So that’s it. You may wonder: what about that promise I made? Indeed, when I started this post, I said I’dĀ present aĀ mathematical construct that presents the electromagnetic force as oneĀ force only, as one physical reality, but so we’re back writing all of it in terms of twoĀ vectors—the electric field vector E and the magnetic field vector B. Well… What can I say? IĀ didĀ present the mathematical construct: it’s the electromagnetic tensor. So it’s that antisymmetric matrix really, which one can combine with aĀ transformation matrixĀ embodying the Lorentz transformation rules. So, IĀ didĀ what I promised to do. But you’re right: IĀ amĀ re-presenting stuff in the old style once again.

The second objection that you may have—in fact, that youĀ shouldĀ have, is that all of this has been rather tedious. And you’re right.Ā The whole thing just re-emphasizes the value of using the four-potential vector. It’s obviouslyĀ muchĀ easier to takeĀ thatĀ vector from one reference frame to another – so we just apply the Lorentz transformation rules to Aμ = (Φ, A) and get Aμ‘ = (Φ’, A’) from it – and then calculate E’ and B’ from it, rather than trying to remember those equations above.Ā However, that’s not the point, or…

Well… It is and it isn’t. We wanted to get away from thoseĀ twoĀ vectors E and B, and show thatĀ electromagnetism is reallyĀ oneĀ phenomenon only, and so that’s where the concept of the electromagnetic tensor came in. There were two objectives here: the first objective was to introduce you to the concept of tensors, which we’ll need in the future. The second objective was to show you that, while Lorentz’ force law – F = q(E + vƗB) makes it clear we’re talking one force only, there is a way of writing it all up that is much more elegant.

I’ve introduced the concept of tensors here, so the first objective should have been achieved. As for the second objective, I’ll discuss that in my next post, in which I’ll introduce the four-velocity vector μμ as well as the four-force vectorĀ fμ. It will explain the following beautiful equation of motion:

motion equation

NowĀ thatĀ looks very elegant and unified, doesn’t it? šŸ™‚

[…] Hmm… No reaction. I know… You’re tired now, and you’re thinking: yet another way of representing the same thing? Well… Yes! So…

OK… Enough for today. Let’s follow up tomorrow.

Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 20, 2020 as a result of a DMCA takedown notice from Michael A. Gottlieb, Rudolf Pfeiffer, and The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 20, 2020 as a result of a DMCA takedown notice from Michael A. Gottlieb, Rudolf Pfeiffer, and The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 20, 2020 as a result of a DMCA takedown notice from Michael A. Gottlieb, Rudolf Pfeiffer, and The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/

Electric circuits (2): Kirchoff’s rules, and the energy in a circuit

My previous post was long and tedious, and all it did was presenting the three (passive) circuit elements as well as the concept of impedance. It show the inner workings of these little devices are actually quite complicated.Ā Fortunately, the conclusions were very neat and short: for all circuit elements, we have a very simple relationship between (a) the voltage acrossĀ the terminals of the element (V) and (b) the current that’s goingĀ throughĀ the circuit element (I). We found they are always in some ratio, which is referred to as the impedance, which we denoted by Z:

Z =Ā V/I ⇔ V = Iāˆ—Z

So it’s a ‘simple’ ratio, indeed. But… Well… Simple and not simple. It’s a ratio of two complex numbers and, therefore, it’s aĀ complexĀ number itself. That’s why I use the āˆ— symbol when re-writing the Z =Ā V/IĀ formula as V = Iāˆ—Z, so it’s clear we’re talking a product of two complex numbers). This ‘complexity’ is best understood by thinking of the voltage and the current asĀ phase vectorsĀ (orĀ phasorsĀ as engineers call them). Indeed, instead of using the sinusoidal functions we are used to, so that’s

  • V = V0Ā·cos(ωt + ĪøV),
  • I = I0Ā·cos(ωt + ĪøI), and
  • Z = Z0Ā·cos(ωt + Īø) = (V0/I0)Ā·cos(ωt + ĪøV āˆ’Ā ĪøI),

we preferred theĀ complex orĀ vectorĀ notation, writing:

  • VĀ =Ā |V|ei(ωt +Ā ĪøV) = V0Ā·ei(ωt +Ā ĪøV)
  • IĀ =Ā |I|ei(ωt +Ā ĪøI)Ā = I0Ā·ei(ωt +Ā ĪøI)
  • Z = |ZIei(ωt +Ā Īø)Ā = Z0Ā·ei(ωt +Ā Īø)Ā = (V0/I0)Ā·ei(ωt +Ā ĪøV āˆ’Ā ĪøI)

For the three circuit elements, we found the followingĀ solutionĀ for ZĀ in terms of the previously definedĀ propertiesĀ of the respective circuit elements, i.e. their resistanceĀ (R),Ā capacitanceĀ (C), and inductanceĀ (L) respectively:

  1. For a resistor, we have Z(resistor) = ZRĀ = R
  2. For an capacitor, we have Z(capacitor) = ZCĀ = 1/iωC = –i/(ωC)
  3. For an inductor, we haveĀ Z(inductance) = ZL= iωL

We also explained what these formulas meant, using graphs like the ones below:

  1. The graph on the left-hand side gives you the ratio of theĀ peakĀ voltage andĀ peakĀ current for the three devices as a function of C, L, R and ω respectively.
  2. The graph on the right-hand side shows you the relationship between theĀ phaseĀ of the voltage and the current for a capacitor and for an inductor. [For a resistor, the phases are the same, so no need for a graph. Also note that the lag of theĀ phaseĀ of the current vis-Ć”-vis the voltage phase is 90 degrees for an inductor, while it’s 270 degrees for a capacitorĀ (which amounts to the current leading the voltage with a 90° phase difference).]

Capture800px-VI_phase

The inner workings of our circuit elements are all wonderful and mysterious, and so we spent a lot of time writing about them. That’s finished now. The summary about describes all of them in very simple terms, relating the voltage and currentĀ phasorĀ through the concept of impedance, which is just a ratio—albeit aĀ complexĀ ratio.

As the graphs above suggest, we can build all kinds of crazy circuits now, and the idea of resonanceĀ as we’ve learned it when studying the behavior of waves will be particularly relevant when discussing circuits that are designed to filter certain frequenciesĀ or, the opposite, to amplifyĀ some. We won’t go that far in this post, however, as I just want to explain the basic rules one needs to know when looking at a circuit, i.e.Ā Kirchoff’s circuit laws.Ā There are two of them:

1. Kirchoff’s Voltage Law (KCL): The sum of the voltage drops around any closed path is zero.

The principle is illustrated below. It doesn’t matter whether or not we have other circuits feeding into this one: Kirchoff’s Voltage LawĀ (KCL) remains valid.

KVR

We can write this law using the concept of circulation once again or, what you’ll probably like more, just using plain summation:Ā integral

KVL

2. Kirchoff’s Current Law (KCL): The sum of the currents into any node is zero.

This law is written and illustrated as follows:

KCL

KCL illus

KCR

This Law requires some definition of aĀ node, of course. Feynman defines a node as any set of terminals such as a, b, c, d in the illustration above which are connected. So it’s a set of connected terminals.

Now, I’ll refer you Feynman for some practicalĀ examples. The circuit below is one of them. It looks complicated but it all boils down to solving a set of linear equations. So… Well… That’s it, really. We’re done! We should do the exercises, of course, but then we’re too lazy for that, I guess. šŸ™‚ So we’re done!

example

Well… Almost. I also need to mention how one can reduceĀ complicated circuits by combining parallel impedances, using the following formula:

parallel Z

And then another powerful idea is the idea ofĀ equivalentĀ circuits. The rules for this are as follows:

  1. Any two-terminal network of passive elements is equivalent to (and, hence, can be replaced by) anĀ effectiveĀ impedance (Zeff).
  2. Any two-terminal network of passive elements is equivalent to (and, hence, can be replaced by) aĀ generator in series with an impedance.

These two principles are illustrated below: (a) is equivalent to (b) in each diagram.

P2P1

The related formulas are:

  1. I = Ɛ/Zeff
  2. VnĀ = ƐeffĀ āˆ’ Ināˆ—Zeff

Last but not least, I need to say something about the energyĀ in circuits. As we noted in our previous post, the impedance will consist of a real and an imaginary part. We write:

Z = R + iĀ·X

This gives rise to the following powerful equivalence:Ā any impedance is equivalent to a series combination of a pure resistance and a pure reactance, as illustrated below (the ≔ sign stands for equivalence):

energy

Of course, because this post risks becoming too short šŸ™‚ I need to insert some more formulas now. IfĀ Z = R + iĀ·X is the impedance of the whole circuit, then the whole circuit can be summarized in the following equation:

Ɛ = Iāˆ—ZĀ = Iāˆ—(R + iĀ·X)

Now, if we bring the analysis back to the realĀ parts of this equation, then we may write our current as I = I0Ā·cos(ωt). This implies we chose a t = 0 point soĀ ĪøIĀ = 0. [Note that this is somewhat different than what we usually do: we usually chose our t = 0 point such thatĀ ĪøVĀ = 0, but it doesn’t matter.] The real emf is then going to be the real part of Ɛ = Iāˆ—ZĀ = Iāˆ—(R + iĀ·X), so we’ll write it as ʐ (no bold-face), and it’s going to be the real part of that expression above, which we can also write as:

Ɛ = Iāˆ—ZĀ = I0Ā·ei(ωt)Ā āˆ—(R + iĀ·X)

So ʐ is the real part of this Ɛ and, you should check, it’s going to be equal to:

ʐ =Ā I0Ā·RĀ·cos(ωt) āˆ’ I0Ā·XĀ·sin(ωt)

The two terms in this equation represent the voltage drops across the resistance R and the reactance X in that illustration above. […] Now that I think of it, in line with the -or and -ance convention for circuit elements and their properties, should we, perhaps, sayĀ resistorĀ andĀ reactor in this case? šŸ™‚ […] OK. That’s a bad joke. [I don’t seem to have good ones, isn’t it?] šŸ™‚

Jokes aside, we see thatĀ the voltage drop across the resistance is in phase with the current (because it’s a simple cosine function of ωt as well), while the voltage drop across the purely reactive part is out of phase with the current (as you know, the sine and cosine are the same function, but with a phase difference of π/2 indeed).

You’ll wonder where are we going with this, so let me wrap it all up. You know the power is the emf times the current, and so let’s integrate this thing over one cycle to get the average rate (and then I mean aĀ time rate of change) of the energy that getsĀ lostĀ in the circuit. So we need to solve the following integral:

integral 2

This may look like a monster, but if you look back at your notes from your math classes, you should be able to figure it out:

  1. The first integral is (1/2)I02Ā·R..
  2. The second integral is zero.

So what? Well… Look at it! It means that the (average) energy loss in a circuit with impedance Z = R + iĀ·X onlyĀ depends on the real part of Z, which is equal to I02Ā·R/2. That’s, of course, how we want it to be: ideal inductances and capacitorsĀ storeĀ energy when being powered, and give whatever they stored when ‘charging’ back to the circuit when the current reverses direction.

So it’s a nice result, because it’s consistent with everything. Hmm… Let’s double-check though… Is it also consistent with the power equation for a resistor which, remember, is written as: P = VĀ·I = IĀ·RĀ·I = I2Ā·R. […] Well… What about the 1/2 factor?

Well… Think about it. I isĀ a sine or a cosine here, and so we want the average value of its square, so that’s 〈cos2(ωt)〉 = 1/2.

Done!Ā šŸ™‚

Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 17, 2020 as a result of a DMCA takedown notice from Michael A. Gottlieb, Rudolf Pfeiffer, and The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 17, 2020 as a result of a DMCA takedown notice from Michael A. Gottlieb, Rudolf Pfeiffer, and The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 20, 2020 as a result of a DMCA takedown notice from Michael A. Gottlieb, Rudolf Pfeiffer, and The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/

Electric circuits (1): the circuit elements

OK. No escape. It’s part of physics. I am not going to go into the nitty-gritty of it all (because this is a blog about physics, not about engineering)Ā but it’s good to review the basics, which are, essentially, Kirchoff’s rules. Just for the record, GustavĀ Kirchhoff was a German genius who formulated these circuit laws while he was still a student, when he was like 20 years old or so. He did it as aĀ seminar exerciseĀ 170 years ago, and then turned it into doctoral dissertation. Makes me think of that Dire Straits song—That’s the way you do it—Them guys ain’t dumb. šŸ™‚

So this post is, in essence, just an ‘explanation’ of Feynman’s presentation of Kirchoff’s rules, so I am writing this post basically for myself, so as to ensure I am not missing anything. To be frank, Feynman’s use of notation when working with complex numbers is confusing at times and so, yes, I’ll do some ‘re-writing’ here.Ā The nice thing about Feynman’s presentation of electrical circuits is that he sticks to Maxwell’s Laws when describing all ideal circuit elements, so he keeps using line integrals of the electric field EĀ around closed paths (that’s what a circuit is, indeed) to describe the so-calledĀ passive circuit elements, and he also recapitulates the idea of theĀ electromotive forceĀ when discussing the so-calledĀ activeĀ circuit element, so that’s the generator. That’s nice, because it links it all with what we’ve learned so far, i.e. theĀ fundamentalsĀ as expressed in Maxwell’s set of equations. Having said that, I won’t make that link here in this post, because I feel it makes the whole approach rather heavy.

OK. Let’s go for it. Let’s first recall the concept ofĀ impedance.

The impedance concept

There are three idealĀ (passive) circuit elements: the resistor, the capacitor and the inductor. RealĀ circuit elements usually combine characteristics of all of them, even if they are designed to work like ideal circuit elements. Collectively, these ideal (passive) circuit elements are referred to as impedances, because… Well… Because they have someĀ impedance. In fact, you should note that, if we reserve the terms ending with -ance for theĀ propertyĀ of the circuit elements, and those ending on -or for the objects themselves, then we should call them impedors. However, that term does not seem to have caught on.

You already know what impedance is. I explained it before, notably in my post on the intricacies related to self- and mutual inductance. Impedance basically extends the concept ofĀ resistance, as we know it fromĀ direct current (DC)Ā circuits, toĀ alternating currentĀ (AC) circuits. To put it simply, when AC currents are involved – so when the flow of charge periodically changes reverses direction – then it’s likely that, because of the properties of the circuit, the current signal will lag the voltage signal, and so we’ll have some phase differenceĀ telling us by how much. So, resistance is just a simple realĀ number R – it’sĀ the ratioĀ between (1) the voltage that is being applied across the resistor and (2) the current through it, so we write R = V/I – and it’s got aĀ magnitudeĀ only, butĀ impedance isĀ a ‘number’ that has both a magnitude as well as phase, so it’s aĀ complexĀ number, or a vector.

In engineering, such ‘numbers’ with a magnitude as well as a phase are referred to as phasors. A phasor represents voltages, currents and impedances as a phase vector (note the boldĀ italics: they explain how we got the pha-sorĀ term). It’s just a rotatingĀ vector really.Ā So a phasor has a varying magnitude (A) and phase (φ) , which is determined by (1) some maximum magnitude A0, (2) some angular frequency ω and (3) some initial phaseĀ (Īø). So we can write the amplitudeĀ A as:

A = A(φ) = A0Ā·cos(φ) =Ā A0Ā·cos(ωt + Īø)

As usual, Wikipedia has a nice animation for it:

Unfasor

In case you wonder why I am using aĀ cosineĀ rather than aĀ sineĀ function, the answer is that it doesn’t matter: the sine and the cosine are the same function except for a Ļ€/2Ā phase difference: just rotate the animation above by 90 degrees, or think about the formula: sinφ = cos(Ļ†āˆ’Ļ€/2). šŸ™‚

So A = A0Ā·cos(ωt + Īø) is the amplitude. It could be the voltage, or the current, or whatever realĀ variable.Ā The phase vectorĀ itself is represented by a complex number, i.e. a two-dimensional number, so to speak, which we can write as all of the following:

A = A0Ā·eiφ = A0Ā·cosφ + iĀ·A0Ā·sinφ = A0Ā·cos(ωt+Īø) + iĀ·A0Ā·sin(ωt+Īø)

=Ā A0Ā·ei(ωt+Īø)Ā = A0Ā·eiĪøĀ·eiωtĀ = A0Ā·eiωtĀ with A0Ā = A0Ā·eiĪø

That’s just Euler’s formula, and I am afraid I have to refer you to my page on theĀ essentialsĀ if you don’t get this. I know what you are thinking: why do we need the vector notation? Why can’t we just be happy with the A = A0Ā·cos(ωt+Īø) formula? The truthful answer is: it’s just to simplify calculations: it’s easier to work with exponentials than with cosines or sines. For example, writingĀ ei(ωt + Īø)Ā = eiĪøĀ·eiωtĀ is easier than writing cos(ωt + Īø) = … […] Well? […]Ā Hmm… šŸ™‚

See!Ā You’re stuck already. You’d have to use the cos(α+β) = cosα·cosĪ²Ā āˆ’ sinα·sinβ formula: you’d get the same results (just do it for the simple calculation of theĀ impedanceĀ below) but it takes a lot more time, and it’s easier to make mistake. Having said why complex number notation is great, I also need to warn you. There are a few things you have to watch out for. One of these things is notation. The other is the kind of mathematical operations we can do: it’s usually alright but we need to watch out with the i2Ā = –1 thing when multiplyingĀ complex numbers. However, I won’t talk about that here because it would only confuse you even more. šŸ™‚

Just for the notation, let me note that Feynman would write A0Ā as A0Ā with the little hatĀ orĀ caretĀ symbol (∧) on top of it, so as to indicate the complexĀ coefficient isĀ notĀ a variable. So he writes A0Ā asĀ Ć‚0Ā =Ā A0Ā·eiĪø.Ā However, I find that confusing and, hence, I prefer using bold-type for any complex number, variable or not. The disadvantage is that we need to remember that the coefficient in front of the exponential is not a variable: it’s a complex number alright, but not a variable. Indeed, do look at thatĀ A0Ā = A0Ā·eiĪøĀ equality carefully: A0Ā is a specificĀ complex number that captures theĀ initial phase Īø. So it’s not the magnitude of the phasor itself, i.e. |A| =Ā A0. In fact, magnitude, amplitude, phase… We’re using a lot confusing terminology here, and so that’s why you need to ‘get’ the math.

The impedance is not a variable either. It’s some constant. Having said that, this constant will depend on theĀ angular frequency ω. So… Well… Just think about this as you continue to read. šŸ™‚ So the impedance is someĀ number, just like resistance, but it’s a complexĀ number. We’llĀ denote it by ZĀ and, using Euler’s formula once again, we’ll write it as:

Z =Ā |Z|eiĪøĀ = V/I = |V|ei(ωt +Ā ĪøV)/|I|ei(ωt +Ā ĪøI)Ā = [|V|/|I|]Ā·ei(ĪøVĀ āˆ’Ā ĪøI)

So, as you can see, it is,Ā literally, some complexĀ ratio, just like R = V/I was some realĀ ratio: it is a complex ratio because itĀ has a magnitude and a direction, obviously. Also please do note that, as I mentioned already, the impedance is, in general, some function of the frequency ω, as evidenced by the ωt term in the exponential, but so we’re not looking at ω as a variable: V and I are variables and, as such, they depend on ω, but so you should look at ω as some parameter. I know I should, perhaps, notĀ be so explicit on what’s going on, but I want to make sure you understand.

So what’s going on? The illustration below (credit goes to Wikipedia, once again) explains. It’s a pretty generic view of a very simple AC circuit. So we don’t care what the impedance is: it might be an inductor or a capacitor, or a combination of both, but we don’t care: we just call it an impedance, or an impedorĀ if you want. šŸ™‚ The point is: if we apply an alternating current, then the current and the voltage will both go up and down, but the current signal will lag the voltage signal, and some phase factor Īø tells us by how much, so Īø will be the phase difference.

General_AC_circuit

Now, we’re dividing one complex number by another in that Z = V/I formula above, and dividing one complex number by another is not allĀ thatĀ straightforward, so let me re-write that formula for Z above as:

V = Iāˆ—ZĀ = Iāˆ—|Z|eiĪø

Now, while that V = Iāˆ—Z formulaĀ resembles the V = IĀ·R formula, you should note the bold-face type for V and I, and theĀ āˆ— symbol I am using here for multiplication. TheĀ bold-face for V and I implies they’re vectors, or complex numbers. As for theĀ āˆ— symbol, that’s to make it clear we’re not talking a vector cross product AƗB here, but a product of two complex numbers. [It’s obviously notĀ a vector dot product either, because a vector dot product yields a real number, not some other vector.]

Now we write V and IĀ as you’d expect us to write them:

  • VĀ =Ā |V|ei(ωt +Ā ĪøV) = V0Ā·ei(ωt +Ā ĪøV)
  • IĀ =Ā |I|ei(ωt +Ā ĪøI)Ā = I0Ā·ei(ωt +Ā ĪøI)

ĪøVĀ andĀ ĪøIĀ are, obviously, the so-called initialĀ phase of the voltage and the current respectively. These ‘initial’ phases are notĀ independent: we’re talking a phaseĀ differenceĀ really, between the voltage and the current signal, and it’s determined by the properties of the circuit. In fact, that’s the whole point here: theĀ impedanceĀ is a property of the circuit and determines how theĀ current signal varies as a function of the voltage signal. In fact, we’ll often choose the t = 0 point such that ĪøVĀ and so then we need to find ĪøI. […]Ā OK. Let’s get on with it. Writing out all of the factors in the V = Iāˆ—ZĀ = Iāˆ—|Z|eiĪøĀ equation yields:

VĀ =Ā |V|ei(ωt +Ā ĪøV)Ā =Ā Iāˆ—ZĀ = |I|ei(ωt +Ā ĪøI)āˆ—|Z|eiĪøĀ = |I||Z|ei(ωt +Ā ĪøIĀ +Ā Īø)Ā 

Now, this equation must hold for all values of t, so we can equate the magnitudes and phases and, hence, the following equalities must hold:

  1. |V| =Ā |I||Z| ⇔ |Z| =Ā |V|/|I|
  2. ωt + ĪøVĀ =  ωt + ĪøIĀ + θ ⇔ θ = ĪøVĀ āˆ’ ĪøI

Done!Ā 

Of course, you’ll complain once again about those complex numbers: voltage and current are something real, isn’t it? And so what isĀ really aboutĀ this complex numbers? Well… I can just say what I said already. You’re right. I’ve used the complex notation only to simplify the calculus, so it’s only theĀ real partĀ of those complex-valued functions that counts.

OK. We’re done with impedance. We can now discuss the impedors, including resistors (for which we won’t haveĀ such lag or phase difference, but the concept of impedance applies nevertheless).

Before I start, however, you should think about what I’ve done above: IĀ explainedĀ the concept of impedance, but I didn’t do much with it. The real-life problem will usually be that you get the voltage as a function of time, and then you’ll have to calculate the impedance of a circuit and, then, the current as a function of time. So I just showed the fundamental relations but, in real life, you won’t know what Īø andĀ ĪøIĀ could possibly be. Well… Let me correct that statement: we’ll give you formulas forĀ Īø as we discuss the various circuit elements and their impedance below, and so then you can use these formulas to calculateĀ ĪøI. šŸ™‚

Resistors

Let’s start with what seems to be the easiest thing: a resistor. A real resistor is actually not easy to understand, because it requires us to understand the properties ofĀ realĀ materials. Indeed, it may or may not surprise you, but the linear relation between the voltage and the current for realĀ materials is only approximate.Ā Also, the way resistors dissipateĀ energy is not easy to understand. Indeed, unlike inductors and capacitors, i.e. the other twoĀ passiveĀ components of an electrical circuit, a resistor does not storeĀ butĀ dissipatesĀ energy, as shown below.

Electric_load_animation_2

It’s a nice animation (credit for it has to go to Wikipedia once more), as it shows how energy is being used in an electric circuit. Note that the little moving plusesĀ are in line with the conventionĀ that a current is defined as the movement of positive charges, so we write I = dQ/dt instead ofĀ I =Ā āˆ’dQ/dt. That also explains the direction of the field line E, which has been added to show that the charges move with the field that is being generatedĀ by theĀ power source (which is not shown here). So, what we have here is that, on one side of the circuit, someĀ generator or voltage sourceĀ will create anĀ emfĀ pushing the charges,Ā and so the animation shows how someĀ load – i.e. the resistor in this case – willĀ consumeĀ their energy, so they lose their push (as shown by the change in color from yellow to black). SoĀ power, i.e.energy per unit time, is supplied, and is then consumed.

To increase the current in the circuit above, you need to increase the voltage, but increasing both amounts to increasing the powerĀ that’s being consumed in the circuit. Electric power is voltage times current, so P = VĀ·I (or vĀ·i, if I use the small letters that are used in the two animations below). Now, Ohm’s LawĀ (I = V/R)Ā says that, if we’d want to double the current, we’d need to double the voltage, and so we’re quadrupling the power then: P2Ā = V2Ā·I2Ā = (2Ā·V1)Ā·(2Ā·I1) = 4Ā·V1Ā·I1Ā = 22Ā·P1. So we have a square-cube law for the power, which we get by substituting V for RĀ·I or by substituting I for V/R, so we can write the power P as P = V2/R = I2Ā·R. This square-cube law says exactly the same: if you want to doubleĀ the voltageĀ orĀ the current, you’ll actually have to double both and, hence, you’ll quadruple the power.

But back to the impedance: Ohm’s Law is theĀ Z = V/I law for resistors, but we can simplify it because we know the voltage acrossĀ the resistorĀ and the current that’s going throughĀ areĀ in phase. Hence, ĪøVĀ andĀ ĪøIĀ are identical and, therefore, theĀ Īø = ĪøVĀ āˆ’ ĪøIĀ in Z =Ā |Z|eiĪøĀ is equal to zero and, hence, Z =Ā |Z|. Now,Ā |Z| =Ā |V|/|I| =Ā V0/I0. So the impedanceĀ ZĀ is just some real number R = V0/I0, which we can also write as:

R = V0/I0Ā = (V0Ā·ei(ωt + α))/(I0Ā·ei(ωt + α)) = V(t)/I(t), with α = ĪøVĀ = ĪøI

The equation above goes from R = V0/I0Ā to R = V(t)/I(t) = V/I. It’s note the same thing: the second equation says that,Ā at any point in time, the voltage and the current will be proportional to each other, with R or its reciprocal as the proportionality constant. In any case,Ā we have our formula for ZĀ here:

Z = R = V/I =Ā V0/I0

So that’s simple. Before we move to the next, let me note that the resistance of aĀ realĀ resistor may depend on itsĀ temperature, so in real-life applications one will want to keep its temperature as stable as possible. That’s why real-life resistors have power ratings and recommended operatingĀ temperatures. The image below illustrates how so-called heat-sink resistors can be mounted on a heat sink with a simple spring clip so as to ensureĀ the dissipated heat is transported away. These heat-sink resistors are rather small (10 by 15 mm only) but are rated for 35Ā watt – so that’s quite a lot for such small thing – ifĀ correctly mounted.

spring-clips-mounting-technigques

As mentioned, the linear relation between the voltage and the current is onlyĀ approximate, and the observed relation is also there only for frequencies that are not ‘too high’ because, if the frequency becomes very high, the free electrons will start radiating energy away, as they produce electromagnetic radiation. So one always needs to look at the tolerances of real-lifeĀ resistors, which may be ± 5%, ± 10%, or whatever.Ā In any case… On to the next.

Capacitors (condensers)

We talked at length about capacitors (aka condensers) in our post explaining capacitanceĀ or, the more widely used term, capacity: theĀ capacity of a capacitor is the observedĀ proportionality between (1) the voltage (V) across and (2) the charge (Q) on the capacitor, so we wrote it as:

C = Q/V

Now, it’s easy to confuse the C here with the C for coulomb, which I’ll also use in a moment, and so… Well… Just don’t! šŸ™‚ The meaning of the symbol is usually obvious from the context.

As for the explanation of this relation, it’s quite simple: a capacitor consists of two separateĀ conductors in space, with positive charge on one, and an equal and opposite (i.e. negative) charge on the other. Now, the logic of the superposition of fields implies that, if we double the charges, we will also double the fields, and so the work one needs to do to carry a unit charge from one conductor to the other is also doubled! So that’s why the potential difference between the conductors is proportional to the charge.

The C = Q/VĀ formula actually measures theĀ ability of the capacitor to store electric charge and, therefore, to store energy, so that’s why the term capacityĀ is really quite appropriate.Ā I’ll let youĀ googleĀ a few illustrations like the one below, that shows how a capacitor is actually being charged in a circuit. Usually, some resistance will be there in the circuit, so as to limit the current when it’s connected to the voltage source and, therefore, as you can see, the R times C factor (RĀ·C) determines how fast or how slow the capacitor charges and/or discharges. Also note that the current is equal to the time rate of change of the charge: I = dQ/dt.

images

In the above-mentioned post, we also give a few formulas for the capacity of specificĀ typesĀ of condensers. For example, for aĀ parallel-plate condenser, the formula was CĀ =  ε0A/d. We also mentioned its unit, which is isĀ coulomb/volt, obviously, but – in honor of Michael Faraday, who gave us Faraday’s Law, and many other interesting formulas – it’s referred to as theĀ farad: 1 F = 1 C/V. The C here isĀ coulomb, of course. Sorry we have to use C to denote two different things but, as I mentioned, the meaning of the symbol is usually clear from the context.

We also talked about how dielectrics actually work in that post, but we did notĀ talk about the impedance of a capacitor, so let’s do that now. The calculation is pretty straightforward. Its interpretation somewhat less so. But… Well… Let’s go for it.

It’s the current that’s charging the condenser (sorry I keep using both terms interchangeably), and we know that the current is the time rate of change of the charge (I = dQ/dt).Ā Now, you’ll remember that, in general, we’d write a phasor A as A = A0Ā·eiωtĀ with A0Ā = A0Ā·eiĪø, soĀ A0Ā is a complex coefficient incorporating the initial phase, which we wrote as ĪøVĀ andĀ ĪøIĀ for the voltage and for the current respectively. So we’ll represent the voltage and the current now using that notation, so we write: V = V0Ā·eiωtĀ and I = I0Ā·eiωt. So let’s now use that C = Q/V by re-writing it as Q = CĀ·V and, because C is some constant, we can write:

I = dQ/dt = d(CĀ·V)/dt = CĀ·dV/dt

Now, what’s dV/dt? Oh… You’ll say: V is the magnitude of V, so it’s equal toĀ |V| = |V0Ā·eiωt| = |V0|Ā·|eiωt| = |V0| = |V0Ā·eiĪø| =Ā |V0|Ā·|eiĪø| =Ā |V0| = V0. So… Well…Ā What?Ā V0Ā is some constant here! It’s theĀ maximumĀ amplitude of V, so… Well… It’s time derivative is zero: dV0/dt = 0.

Yes. Indeed. We did something very wrong here!Ā You really need to watch out with this complex-number notation, and you need to think about what you’re doing. V is not the magnitude of V but its (varying) amplitude. So it’s theĀ realĀ voltage V that varies with time: it’s equal to V0Ā·cos(ωt + ĪøV), which is the realĀ part of our phasorĀ V. Huh?Ā Yes.Ā Just hang in for a while. I know it’s difficult and, frankly, Feynman doesn’t help us very much here.Ā Let’s take one step back and so – you will see why I am doing this in a moment – let’s calculate the time derivative of our phasor V, instead of the time derivative of our realĀ voltageĀ V. So we calculate dV/dt, which is equal to:

dV/dtd(V0Ā·eiωt)/dt =Ā V0Ā·d(eiωt)/dt = V0Ā·(iω)Ā·eiωtĀ = iω·V0Ā·eiωtĀ = iω·V

Remarkable result, isn’t it? We take the time derivative of our phasor, and the result is the phasor itself multiplied with iω. Well… Yes. It’s a general property of exponentials, but still… Remarkable indeed! We’d get the same with I, but we don’t need that for the moment. What we doĀ need to do is go from ourĀ I =Ā CĀ·dV/dtĀ relation, which connects theĀ realĀ parts of I and V one to another, to theĀ I =Ā CĀ·dV/dtĀ relation, which relates theĀ (complex) phasors. So we write:

Ā I =Ā CĀ·dV/dt ⇔ I =Ā CĀ·dV/dt

Can we do that? Just like that? We just replace I and V by I and V? Yes, we can.Ā Why? Well… We know that I is the real part ofĀ I and so we can write I = Re(I)+ Im(I)Ā·i = I + Im(I)Ā·i, and then we can write the right-hand side of the equation as CĀ·dV/dt = Re(CĀ·dV/dt)+ Im(CĀ·dV/dt)Ā·i. Now, two complex numbers are equal if, and only if, their real and imaginary parts are the same, so… Well… Write it all out, if you want, using Euler’s formula, and you’ll see it all makes sense indeed.

So what do we get? TheĀ I =Ā CĀ·dV/dt gives us:

I =Ā CĀ·dV/dt = CĀ·(iω)Ā·V

That implies that I/V = CĀ·(iω) and, hence, we get – finally! – what we need to get:

Z = V/I = 1/(iωC)

This is a grand result and, while I am sorry I made you suffer for it, I think it did a good job here because, if you’d check Feynman on it, you’ll see he – or, more probably, his assistants, – just skate over this without bothering too much about mathematical rigor. OK.Ā All that’s left now is to interpret this ‘number’ Z = 1/(iωC). It is a purelyĀ imaginary number, and it’s a constant indeed, albeit a complex constant. ItĀ can be re-written as:

Z = 1/(iωC) = i-1/(ωC) = –i/(ωC) = (1/ωC)Ā·eāˆ’iĀ·Ļ€/2

[Sorry. I can’t be more explicit here. It’s just of the wonders of complex numbers: i-1 = –i. Just check one my posts on complex numbers for more detail.] Now, a –iĀ factor corresponds to a rotation ofĀ minusĀ 90 degrees, and so that gives you the true meaning of what’s usually said about a circuit with a capacitor:Ā the voltage across the capacitor willĀ lagĀ the current with a phase differenceĀ equal to π/2, as shown below. Of course, as it’s the voltage driving the current, we should say it’s the current that is lagging with a phase difference of 3Ļ€/2, rather than stating it the other way around! Indeed, i-1 = –i = –1Ā·i =Ā i2Ā·i = i3, so that amounts to three ‘turns’ of the phase in the counter-clockwise direction, which is the direction in which our ωt angle is ‘turning’.

800px-VI_phase

It is a remarkable result, though. The illustration above assumes the maximum amplitude of the voltage and the current are the same, soĀ |Z|Ā =Ā |V|/|I| = 1, but what if they are notĀ the same? What are the real bits then? I can hear you, indeed: “To hell with the bold-face letters: what’s V and I? What’s the realĀ thing?”

Well… V and I are the real bits ofĀ VĀ =Ā |V|ei(ωt+ĪøV) = V0Ā·ei(ωt+ĪøV)Ā and of IĀ =Ā |I|ei(ωt+ĪøI)Ā = I0Ā·ei(ωt+ĪøVāˆ’Īø)Ā =Ā I0Ā·ei(ωtāˆ’Īø)Ā = I0Ā·ei(ωt+Ļ€/2)Ā respectively so, assuming ĪøVĀ = 0 (as mentioned above, that’s just a matter of choosing a convenient t = 0 point), we get:

  • V = V0Ā·cos(ωt)
  • I = I0Ā·cos(ωt + Ļ€/2)

So the Ļ€/2 phase difference is there (you need to watch out with the signs, of course: Īø = āˆ’Ļ€/2, but so it’s the current that seems to leadĀ here)Ā but the V0/I0Ā ratio doesn’t have to be one, so the realĀ voltage and current could look like something below, where the maximumĀ amplitude of the current is only half of the maximumĀ amplitude of the voltage.

Capture

So let’s analyze this quickly: the V0/I0Ā ratio is equal to |Z| =Ā |V|/|I| = V0/I0Ā = 1/ωC = (1/ω)(1/C) (note that it’sĀ notĀ equal to V/I = V(t)/I(t), which is a ratio that doesn’t make sense because I(t) goes through zero as the current switches direction). So what? Well… It means the ratio is inversely proportional to both the frequency ω as well as the capacity C, as shown below. Think about this: if ω goes to zero, V0/I0Ā goes to āˆž, which means that, for a given voltage, the current must go to zero. That makes sense, because we’re talking DC current when ω → 0, and the capacitor charges itself and then that’s it: no more currents. Now, if C goes to zero, so we’re talking capacitors with hardly any capacity, we’ll also get tiny currents. Conversely, for large C, we’ll get huge currents, as the capacitor can take pretty much any charge you throw at it, so that makes for smallĀ V0/I0Ā ratios. The most interesting thing to consider is ω going to infinity, as the V0/I0Ā ratio is also quite small then. What happens? The capacitor doesn’t get the time to charge, and so it’s always in this state where it has large currents flowing in and out of it, as it can’t build the voltageĀ that would counter the electromotive force that’s being supplied by the voltage source.

graph 6OK. That’s it. Le’s discuss the last (passive) element.

Inductors

We’ve spoiled the party a bit with that illustration above, as it gives the phase difference for an inductor already:

Z = iωL = ωLĀ·eiĀ·Ļ€/2, with L the inductance of the coil

So, againĀ assuming thatĀ ĪøVĀ = 0, we can calculate I as:

IĀ =Ā |I|ei(ωt+ĪøI)Ā = I0Ā·ei(ωt+ĪøVāˆ’Īø)Ā =Ā I0Ā·ei(ωtāˆ’Īø)Ā = I0Ā·ei(ωtāˆ’Ļ€/2)Ā 

Of course, you’ll want to relate this, once again, to theĀ realĀ voltage and theĀ realĀ current, so let’s write the real parts of ourĀ phasors:

  • V = V0Ā·cos(ωt)
  • I = I0Ā·cos(ωt āˆ’ Ļ€/2)

Just to make sure you’re not falling asleep as you’re reading, I’ve made another graph of how things could look like. So now’s it’s the current signal that’sĀ lagging the voltage signalĀ with a phase differenceĀ equal toĀ Īø = Ļ€/2.

Capture

Also, to be fully complete, I should show you how the V0/I0Ā ratio now varies with L and ω. Indeed, here also we can write that |Z| =Ā |V|/|I| = V0/I0, but so here we find that V0/I0Ā =  ωL, so we have a simple linear proportionality here! For example, for a given voltage V0, we’ll have smallerĀ currents as ω increases, so that’s the opposite of what happens with our ideal capacitors. I’ll let you think about that… šŸ™‚

Capture

Now how do we get thatĀ Z = iωL formula? In my post on inductance, I explained what an inductor is: a coil of wire, basically. Its defining characteristic is thatĀ a changing current will cause a changingĀ magnetic field in it and, hence, some change in the flux of the magnetic field. Now, Faraday’s Law tells us that that will cause some circulation of the electric field in the coil, which amounts to an inducedĀ potential difference which is referred to as theĀ electromotive forceĀ (emf). Now, it turns out that the inducedĀ emfĀ is proportional to the change in current. So we’ve got another constant of proportionality here, so it’s like how we defined resistance, orĀ capacitance. So, in many ways, theĀ inductanceĀ is just another proportionality coefficient. If we denote it by L – the symbol is said to honor the Russian phyicistĀ Heinrich Lenz, whom you know from Lenz’ Law – then we define it as:

L = āˆ’Ę/(dI/dt)

The dI/dt factor is, obviously, the time rate of change of the current, and the negative sign indicates that the emfĀ opposesĀ the change in current, so it will tend to cause anĀ opposingĀ current. However, theĀ powerĀ of our voltage source will ensure the current does effectively change, so it will counter the ‘back emf’ that’s being generated by the inductor. To be precise, the voltage across the terminals of our inductor, which we denote by V, will be equal and opposite to ʐ, so we write:

V = āˆ’Ę = LĀ·(dI/dt)

Now, this very much resembles the I =Ā CĀ·dV/dt relation we had for capacitors, and it’s completely analogous indeed: we just need to switch the I and V, and C and L symbols. So we write:

Ā V = LĀ·dI/dt⇔ V = LĀ·dI/dt

Now, dI/dt is a similar time derivative as dV/dt. We calculate it as:

dI/dtd(I0Ā·eiωt)/dt =Ā I0Ā·d(eiωt)/dt = I0Ā·(iω)Ā·eiωtĀ = iω·I0Ā·eiωtĀ = iω·I

So we get what we want and have to get:

V = LĀ·dI/dt = iωLĀ·I

Now, Z = V/I, so Z =Ā iωL indeed!

Summary of conclusions

Let’s summarize what we found:

  1. For a resistor, we have Z(resistor) = ZRĀ = R = V/I =Ā V0/I0
  2. For an capacitor, we have Z(capacitor) = ZCĀ = 1/(iωC) = –i/(ωC)
  3. For an inductor, we haveĀ Z(inductance) = ZL= iωL

Note that the impedance of capacitors decreases as frequency increases, while for inductors, it’s the other way around. We explained that by making you think of the currents: for a given voltage, we’ll have large currents for high frequencies, and, hence, a smallĀ V0/I0Ā ratio. Can you think of what happens with an inductor? It’sĀ notĀ so easy, so I’ll refer you to the addendum below for some more explanation.

Let me also note that, as you can see, the impedance of (ideal) inductors and capacitors is a pure imaginary number, so that’s a complex number which has no real part. In engineering, the imaginary part of the impedance is referred to as theĀ reactance, so engineers will say that ideal capacitors and inductors have aĀ purely imaginary reactive impedance.Ā 

However, in real life, the impedance will usually have both a real as well as an imaginary part, so it will be some kind of mix, so to speak. The real part is referred to as the ‘resistance’ R, and the ‘imaginary’ part is referred to as the ‘reactance’ X. The formula for both is given below:

formula resistance and reactance

But here I have to end my post on circuit elements. It’s become quite long, so I’ll discuss Kirchoff’s rulesĀ in my next post.

Addendum: Why is V = āˆ’ ʐ?

Inductors are not easy to understand—intuitively, that is. That’s why I spent so much time writing on them in my other post on them, to which I should be referring you here. But let me recapitulate the key points. The key idea is that we’re pumping energy into an inductor when applying a current and, as you know, the time rate of change is power: P = dW/dt, so we’re talking power here too, which is voltageĀ timesĀ current: P = dW/dt = VĀ·I.Ā The illustration below shows what happens when anĀ alternating currentĀ is applied to the circuit with the inductor. So the assumption is that the current goes in one and then in the other direction, so I > 0, and then I < 0, etcetera. We’re also assuming some nice sinusoidal curve for the current here (i.e. the blue curve), and so we get what we get for U (i.e. the red curve), which is the energy that’s stored in the inductor really, as it tries to resist the changing current: the energy goes up and down between zero and some maximum amplitude that’s determined by the maximum current.

power 2

So, yes, building up current requires energy from some external source, which is used to overcome the ‘back emf’ in the inductor, and that energy is stored in the inductor itself. [If you still wonder why it’s stored in the inductor, think about the other question: where else would it be stored?]Ā HowĀ is stored? Look at the graph and think: it’s stored as kinetic energy of the charges, obviously. That explains why the energy is zero when the current is zero, and why the energy maxes out when the current maxes out. So, yes, it all makes sense! šŸ™‚

Let me give another example.Ā The graph below assumes the current builds up to some maximum. As it reaches its maximum, theĀ storedĀ energy will also max out. This example assumesĀ direct current, so it’s a DC circuit: the current builds up, but then stabilizes at some maximum that we can find by applying Ohm’s Law to the resistanceĀ of the circuit: I = V/R. Resistance? But we were talking an ideal inductor? We are. If there’s no other resistance in the circuit, we’ll have a short-circuit, so the assumption is that we doĀ have some resistance in the circuit and, therefore, we should also think of some energyĀ lossĀ to heat from the current in the resistance. If not, well… Your power source will obviously soon reach its limits. šŸ™‚

power

So what’s going on then?Ā We have someĀ changingĀ current in the coil but, obviously, some kind of inertia also: the coil itself opposes the change in current through the ‘back emf’. Now, it requires energy, or power, to overcome the inertia, so that’s the power that comes from our voltage source: it will offset the ‘back emf’, so we may effectively think of a little circuit with an inductorĀ and a voltage source, as shown below.

circuit with coil

But why do we write V = āˆ’Ā Ę? Our voltage source can have any voltage, can’t it? Yes. Sure. But so the coil will always provide anĀ emf that’s exactly theĀ oppositeĀ of this voltage. Think of it: we have someĀ voltage that’s being applied across the terminals of the inductor, and so we’ll have some current. A current that’s changing. And it’s that current will generate an emf that’s equal to Ɛ = –LĀ·(dI/dt). So don’t think of ʐ as some constant: it’s the self-inductance coefficient L that’s constant, but I (and, hence, dI/dt) and V are variable.

The point is: we cannot have any potential difference in a perfect conductor, which is what the terminals are: any potential difference, i.e. any electric field really, would cause huge currents. In other words, the voltage V and the emf Ɛ have to cancel each other out,Ā all of the time. If not, we’d have huge currents in the wires re-establishing the V = āˆ’Ę equality.

Let me use Feynman’s argument here. Perhaps that will work better. šŸ™‚ Our ideal inductor is shown below: it’s shielded by some metal box so as to ensure it does not interact with the rest of the circuit. So we have some current I, which we assume to be an AC current, and we know someĀ voltageĀ is needed toĀ causeĀ that current, so that’s the potential difference V between the terminals.

inductor

The total circulation of E – around the whole circuit – can be written as the sum of two parts:

Formula circulaton

Now, we know circulation of E can only be caused by someĀ changingĀ magnetic field, which is what’s going on in the inductor:

emf

So this change in the magnetic flux is what it causing the ‘back emf’, and so the integral on the left is, effectively, equal to ʐ, not minus Ɛ but +ʐ.Ā Now, the second integral is equal to V, because that’s the voltage V between the two terminals a and b. So the whole integral is equal toĀ 0 = ʐ +Ā V and, therefore, we have that:

VĀ = āˆ’Ā Ę = LĀ·dI/dt

Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/

Re-visiting the speed of light, Planck’s constant, and the fine-structure constant

Note: I have published a paper that is very coherent and fully explains what the fine-structure constant actually is. There is nothing magical about it. It’s not some God-given number. It’s a scaling constant – and then some more. But not God-given. Check it out: The Meaning of the Fine-Structure Constant. No ambiguity. No hocus-pocus.

Jean Louis Van Belle, 23 December 2018

Original post:

A brother of mine sent me a link to an article he liked. Now, because we share some interest in physics and math and other stuff, I looked at it and…

Well… I was disappointed. Despite the impressive credentials of its author – a retired physics professor – it was veryĀ poorly written. It made me realize how much badly written stuff is around, and I am glad I am no longer wasting my time on it. However, I do owe my brother some explanation of (a) why I think it was bad, and of (b) what, in my humble opinion, he should be wasting his time on. šŸ™‚ So what it is all about?

The article talks about physicists deriving the speed of light from “theĀ electromagnetic properties of the quantum vacuum.” Now, it’s theĀ term ‘quantum‘, in ‘quantum vacuum’, that made me read the article.

Indeed, derivingĀ the theoretical speed of light in empty space from the properties of theĀ classicalĀ vacuum – aka empty space – is a piece of cake: it was done by Maxwell himself as he was figuring out his equations back in the 1850s (see my post on Maxwell’s equations and the speed of light). And then he compared it to the measuredĀ value, and he saw it was right on the mark. Therefore, saying that the speed of light is a property of the vacuum, or of empty space, is like a tautology: we may just as well put it the other way around, and say that it’s the speed of light thatĀ definesĀ the (properties of the) vacuum!

Indeed, as I’ll explain in a moment: the speed of lightĀ determinesĀ both the electric as well as the magnetic constants μ0Ā and ε0, which are the (magnetic) permeability and the (electric) permittivity of the vacuum respectively. Both constants depend on the units we are working with (i.e. the units for electric charge, for distance, for time and for force – or for inertia, if you want, because force is defined in terms of overcoming inertia), but so they are just proportionality coefficients in Maxwell’s equations. So once we decide what units to use in Maxwell’s equations, then μ0Ā and ε0Ā areĀ just proportionality coefficientsĀ which we get from c. So they areĀ notĀ separate constants really – I mean, they are not separate from c – and all of the ‘properties’ of the vacuum, including these constants, are in Maxwell’s equations.

In fact, when Maxwell compared the theoreticalĀ value ofĀ cĀ with its presumed actual value, he didn’t compare c‘s theoretical valueĀ with the speed of light as measured by astronomers (like that 17th century Ole Roemer, to which our professor refers: he had a first go at it by suggesting some specific value for it based on his observations of the timing of the eclipses of one of Jupiter’s moons), but with c‘s value as calculated from the experimentalĀ values of μ0Ā and ε0!Ā So he knew veryĀ well what he was looking at. In fact, to drive home the point, it may also be useful to note that the Michelson-Morley experiment – which accuratelyĀ measured the speed of light – was done some thirty years later. So Maxwell had already left this world by then—very much in peace, because he had solved the mystery all 19th century physicists wanted to solve through his greatĀ unification: his set of equations covers it all, indeed: electricity, magnetism, light, and even relativity!

I think the article my brother likedĀ so much does a very lousy job in pointing all of that out, but that’s not why I wouldn’tĀ recommend it.Ā It got my attention because I wondered why one would try to derive the speed of light from the properties of the quantum vacuum. In fact, to be precise, I hoped the article would tell me what the quantumĀ vacuum actually is. Indeed, as far as I know, there’s only one vacuum—one ’empty space’: empty is empty, isn’t it? šŸ™‚ So I wondered: do we have a ‘quantum’ vacuum? And, if so, what is it, really?

Now, thatĀ is where the article is reallyĀ disappointing, I think. The professor dropsĀ a few names (like the Max Planck Institute, the University of Paris-Sud, etcetera), and then, promisingly, mentions ‘fleeting excitations of the quantum vacuum’ and ‘virtual pairs of particles’, but then he basically stops talking about quantum physics. Instead, he wanders off to share some philosophical thoughts on the fundamental physical constants. What makes it all worse is that even thoseĀ thoughts on the ‘essential’ constants are quite off the mark.

So… This post is just a ‘quick and dirty’ thing for my brother which, I hope, will be somewhat more thought-provoking than that article. More importantly, I hope that my thoughts will encourage him to try to grind through better stuff.

On Maxwell’s equations and the properties of empty space

Let me first say something about the speed of light indeed. Maxwell’s four equations mayĀ look fairlyĀ simple, but that’s only until one starts unpacking all those differential vector equations, and it’s only when going through all of their consequences that one starts appreciating their deep mathematical structure. Let me quickly copy how another bloggerĀ jotted them down: šŸ™‚

god-said-maxwell-equation

As I showed in my above-mentioned post, the speed of lightĀ (i.e. the speed with which an electromagnetic pulse or wave travels through space)Ā is just one of the many consequences of the mathematical structure of Maxwell’s set of equations. As such, the speed of light is a direct consequence of the ‘condition’, or the properties, of the vacuum indeed, as Maxwell suggested when he wrote thatĀ ā€œwe can scarcely avoid the inference that light consists in the transverse undulations of the same medium which is the cause of electric and magnetic phenomenaā€.

Of course, whileĀ Maxwell still suggests light needs some ‘medium’ here – so that’s a reference to the infamous aetherĀ theory – we now know that’s because he was a 19th century scientist, and so we’ve done away with the aether concept (because it’s a redundant hypothesis), and so now we also know there’s absolutely no reason whatsoever to try to “avoid the inference.” šŸ™‚ It’s all OK, indeed: light is some kind of “transverse undulation” of… Well… Of what?

We analyze light as traveling fields, represented by two vectors, E and B, whose direction and magnitude varies both in space as well as in time.Ā E and BĀ are field vectors, and represent the electric and magnetic field respectively. An equivalent formulation – more or less, that is (see my post on theĀ LiĆ©nard-Wiechert potentials) – for Maxwell’s equations when only one (moving) charge is involved is:

E

B

This re-formulation, which is Feynman’s preferred formula for electromagnetic radiation, is interesting in a number of ways. It clearly shows that, while we analyze the electric and magnetic field as separateĀ mathematicalĀ entities, they’re one and the same phenomenon really, as evidenced by the B = –erā€˜Ć—E/cĀ equation, which tells us the magnetic field from a single moving charge is always normal (i.e. perpendicular) to the electric field vector, and also that B‘s magnitude is 1/cĀ times the magnitude of E, soĀ |B| = B =Ā |E|/c = E/c. In short, B is fully determined by E, or vice versa: if we have one of the two fields, we have the other, so they’re ‘one and the same thing’ really—not in a mathematical sense, but in a real sense.

Also note that E and B‘s magnitude is just the same if we’re using natural units, so if we equateĀ cĀ with 1. Finally, as I pointed out in my post on the relativity of electromagnetic fields, if we would switch from one reference frame to another, we’ll have a different mix of E and B, but that different mix obviously describes the sameĀ physicalĀ reality. More in particular, if we’d be movingĀ with the charges, the magnetic field sort of disappears to re-appear as an electric field. So the Lorentz forceĀ F = FelectricĀ +Ā FmagneticĀ = qE + qvƗBĀ is one force really, and its ‘electric’ and ‘magnetic’ component appear the way they appear in our reference frame only. In some other reference frame, we’d have the same force, but its components would look different, even if they, obviously, would and should add up to the same.Ā [Well… Yes and no… You know there’s relativistic corrections to be made to the forcesĀ to, but that’s a minor point, really. The force surely doesn’t disappear!]

All of this reinforces what you know already: electricity and magnetism are part and parcel of one and the same phenomenon,Ā the electromagnetic force field, and Maxwell’s equations are the mostĀ elegant way of ‘cutting it up’. Why elegant? Well… Click the Occam tab. šŸ™‚

Now, after having praised Maxwell once more, I must say that Feynman’s equations above have another advantage. In Maxwell’s equations, we see two constants, the electric and magnetic constant (denoted by μ0Ā and ε0Ā respectively), andĀ Maxwell’s equations imply that the productĀ of theĀ electric and magnetic constant is the reciprocal of c2: μ0·ε0Ā = 1/c2. So here we see ε0Ā and cĀ only, so no μ0, so that makes it even more obvious that the magnetic and electric constant are related one to another through c.

[…] Let me digress briefly: why do we have c2Ā in μ0·ε0Ā = 1/c2, instead of justĀ c? That’s related to the relativistic nature of the magnetic force: think about that B = E/c relation. Or, better still, think about the Lorentz equation F = FelectricĀ +Ā FmagneticĀ = qE + qvƗB = q[EĀ + (v/c)Ɨ(EƗerā€˜)]: the 1/c factor is there because the magnetic force involves some velocity, and any velocity is always relative—and here I don’t mean relative to the frame of reference but relative to the (absolute) speed of light! Indeed, it’s the v/c ratio (usually denoted by β = v/c) that enters all relativistic formulas. So the left-hand side of the μ0·ε0Ā = 1/c2Ā equation is best written as (1/c)Ā·(1/c), with one of the two 1/c factors accounting for the fact that the ‘magnetic’ force is a relativistic effect of the ‘electric’ force, really, and the other 1/cĀ factor giving us the proper relationship between the magnetic and the electric constant.Ā To drive home the point, I invite you to think about the following:

  • μ0Ā is expressed in (VĀ·s)/(AĀ·m), while ε0Ā is expressed in (AĀ·s)/(VĀ·m), so the dimension in which the μ0·ε0Ā product is expressed is [(VĀ·s)/(AĀ·m)]Ā·[(AĀ·s)/(VĀ·m)] = s2/m2, so that’s the dimension of 1/c2.
  • Now, this dimensional analysis makes it clear that we can sort of distributeĀ 1/c2Ā over the two constants. All it takes is re-defining the fundamental unitsĀ we use to calculate stuff, i.e. the units for electric charge, for distance, for time and for force – or for inertia, as explained above. But so we could, if we wanted, equate both μ0Ā as well as ε0Ā with 1/c.
  • Now, if we would then equate c with 1, we’d have μ0Ā = ε0Ā = cĀ = 1. We’d have to define our units for electric charge, for distance, for time and for force accordingly, but it could be done, and then we could re-write Maxwell’s set of equations using these ‘natural’ units.

In any case, the nitty-gritty here is less important: the point is that μ0Ā and ε0Ā are also related through the speed of light and, hence, they are ‘properties’ of the vacuum as well. [I may add that this is quite obvious if you look at their definition, but we’re approaching the matter from another angle here.]

In any case, we’re done with this. On to the next!

On quantum oscillations, Planck’s constant, and Planck unitsĀ 

The second thought I want to develop is about the mentioned quantum oscillation. What is it? Or what could it be? An electromagnetic wave is caused by a moving electric charge. What kind of movement? Whatever: the charge could move up or down, or it could just spin around some axis—whatever, really. For example, if it spins around some axis, it will have a magnetic moment and, hence, the field is essentially magnetic, but then, again,Ā E and B are related and so it doesn’t really matter if the first cause is magnetic or electric: that’s just our way of looking at the world: in another reference frame, one that’s moving with the charges,Ā the field would essential be electric.Ā So the motion can be anything: linear, rotational, or non-linear in some irregular way. It doesn’t matter: any motion canĀ always be analyzed as the sum of a number of ‘ideal’ motions.Ā So let’s assume we have some elementaryĀ charge in space, and it moves and so it emits some electromagnetic radiation.

So nowĀ we need to think about that oscillation. The key question is: how small can it be? Indeed, in one of my previous posts, I tried to explain some of the thinking behind the idea of the ‘Great Desert’, as physicists call it. The whole idea is based on our thinking about the limit: what is the smallest wavelength that still makes sense? So let’s pick up that conversation once again.

The Great Desert lies between theĀ 1032Ā and 1043 Hz scale. 1032Ā Hz corresponds to aĀ photonĀ energy of Eγ = hĀ·f = (4Ɨ10āˆ’15Ā eVĀ·s)Ā·(1032Ā Hz) = 4Ɨ1017Ā eV = 400,000 tera-electronvolt (1 TeV = 1012Ā eV). I use the γ (gamma) subscript in my Eγ symbol for two reasons: (1) to make it clear that I am not talking the electric field E here but energy, and (2) to make it clear we are talking ultra-high-energy gamma-rays here.

In fact, γ-rays of this frequency and energy are theoretical only. Ultra-high-energy gamma-rays are defined as rays with photon energies higher than 100 TeV, which is the upper limit forĀ very-high-energy gamma-rays, which have been observed as part of the radiation emitted by so-called gamma-ray burstsĀ (GRBs):Ā flashes associated with extremely energetic explosions in distant galaxies. Wikipedia refers to them as the ‘brightest’ electromagnetic events know to occur in the Universe. These rays are not to be confused withĀ cosmicĀ rays, which consist of high-energy protonsĀ andĀ atomic nuclei stripped of their electron shells. Cosmic rays aren’t rays really and, because they consist of particles with a considerable rest mass, their energy is even higher. The so-called Oh-My-GodĀ particle, for example, which is the most energetic particle ever detected, had an energy ofĀ 3Ɨ1020Ā eV, i.e.Ā 300 million TeV. But it’s not a photon: its energy is largely kinetic energy, with the rest mass m0Ā counting for a lot in the mĀ in the E =Ā mĀ·c2Ā formula. To be precise: the mentioned particle was thought to be anĀ ironĀ nucleus, and it packed the equivalent energy of a baseball traveling at 100 km/h!Ā 

But let me refer you to another source for a good discussion on these high-energy particles, so I can get get back to the energy of electromagnetic radiation. When I talked about the Great Desert in that post, I did so using the Planck-Einstein relation (E = hĀ·f), which embodies the idea of the photon being valid always and everywhere and, importantly,Ā at every scale. I also discussed the Great Desert using real-life light being emitted by real-life atomic oscillators. Hence, I may have given the (wrong) impression that the idea of a photon as a ā€˜wave train’ is inextricably linked with these real-life atomic oscillators, i.e. to electrons going from one energy level to the next in some atom. Let’s explore these assumptions somewhat more.

Let’s start with the second point. Electromagnetic radiation is emitted by any accelerating electric charge, so the atomic oscillator model is an assumption that shouldĀ notĀ be essential. And it isn’t. For example, whatever is left of the nucleus after alpha or beta decay (i.e. a nuclear decayĀ process resulting in the emission of an α- or β-particle) it likely to be in an excited state, and likely to emit a gamma-ray for about 10āˆ’12 seconds, so that’s a burst that’s about 10,000 times shorterĀ than the 10–8Ā seconds it takes for the energy of a radiating atom to die out. [As for the calculation of that 10–8Ā sec decay time – so that’s like 10 nanoseconds – I’ve talked about this before but it’s probably better to refer you to the source, i.e. one of Feynman’s Lectures.]

However, what we’re interested in is not the energy of theĀ photon, but the energy of one cycle. In other words, we’re not thinking of the photon as some wave train here, but what we’re thinking about is the energy that’s packed into a space corresponding toĀ one wavelength. What can we say about that?

As you know, that energy will depend both on the amplitudeĀ of the electromagnetic wave as well as its frequency. To be precise, the energy is (1) proportional to theĀ squareĀ of the amplitude, and (2) proportional to the frequency. Let’s look at the first proportionality relation. It can be written in a number of ways, but one way of doing it is stating the following: if we know the electric field, then the amount of energy that passes per square meter per second through a surface that is normal to the direction in which the radiation is going (which we’ll denote by S – the s from surface – in the formula below), must be proportional to the average of the square of the field. So we write SĀ āˆĀ āŒ©E2〉, and so we should think about theĀ constant of proportionalityĀ now. Now, let’s not get into the nitty-gritty, and so I’ll just referĀ to Feynman for the derivation of the formula below:

S = ε0c·〈E2〉

So the constant of proportionality is ε0c. [Note that, in light of what we wrote above, we can also write this asĀ S = (1/μ0Ā·c)·〈(cĀ·B)2〉 = (c/μ0)·〈B2〉, so that underlines once again that we’re talking one electromagneticĀ phenomenon only really.] So that’sĀ a nice and rather intuitive result in light of all of the other formulas we’ve been jotting down. However, it isĀ a ‘wave’ perspective. The ‘photon’ perspective assumes that, somehow, the amplitude is given and, therefore, the Planck-Einstein relation only captures the frequency variable:Ā Eγ = hĀ·f.

Indeed, ‘more energy’ in the ‘wave’ perspective basically means ‘more photons’, but photons are photons: they have a definiteĀ frequencyĀ and a definiteĀ energy, and both are given by that Planck-Einstein relation.Ā So let’s look at that relation by doing a bit of dimensionalĀ analysis:

  • Energy is measured in electronvolt or, using SI units, joule: 1 eV ā‰ˆĀ 1.6Ɨ10āˆ’19Ā J. Energy is force times distance: 1 joule = 1 newtonĀ·meter, which means that a larger force over a shorter distance yields the same energy as a smaller force over a longer distance. The oscillations we’re talking about here involve very tinyĀ distances obviously. But the principle is the same: we’re talking some moving chargeĀ q, and theĀ power – which is the time rate of change of the energy – that goes in or out at any point of time is equal to dW/dt =Ā FĀ·v, with W the work that’s being done by the charge as it emits radiation.
  • I would also like to add that, as you know, forces are related to theĀ inertiaĀ of things. Newton’s Law basically defines a force as that what causes a mass to accelerate: F = mĀ·a =Ā mĀ·(dv/dt) = d(mĀ·v)/dt = dp/dt, with p theĀ momentumĀ of the object that’s involved. When charges are involved, we’ve got the same thing: aĀ potential differenceĀ will cause some currentĀ to change, and one of the equivalents of Newton’s LawĀ F = mĀ·a = mĀ·(dv/dt) in electromagnetism is V = LĀ·(dI/dt). [I am just saying this so you get a better ‘feel’ for what’s going on.]
  • Planck’s constant is measured inĀ electronvoltĀ·secondsĀ (eVĀ·s) or in, using SI units, inĀ jouleĀ·secondsĀ (JĀ·s), so its dimension is that of (physical) action, which is energy times time: [energy]Ā·[time]. Again, a lot of energy during a short time yields the same energy as less energy over a longer time. [Again, I am just saying this so you get a better ‘feel’ for these dimensions.]
  • The frequency f is the number of cycles per time unit, so that’s expressedĀ per second, i.e. inĀ herzĀ (Hz) = 1/second = sāˆ’1.

So… Well… It all makes sense: [xĀ joule]Ā = [6.626Ɨ10āˆ’34Ā joule]Ā·[1 second]Ɨ[fĀ cycles]/[1 second].Ā But let’s try to deepen our understanding even more: what’s the Planck-Einstein relationĀ reallyĀ about?

To answer that question, let’s think some more about the wave function. As you know, it’s customary to express the frequency as an angular frequency ω, as used in the wave function A(x, t) = A0Ā·sin(kx āˆ’ ωt).Ā The angular frequency is the frequency expressed inĀ radiansĀ per second. That’s because we need an angleĀ in our wave function, and so we need to relate x and t to some angle.Ā The way to think about this is as follows: one cycle takes a time T (i.e. the periodĀ of the wave)Ā which is equal to T = 1/f. Yes: one second divided by the number of cycles per second gives you the time that’s needed for one cycle. One cycle is also equivalent to our argument ωt going around the full circle (i.e. 2Ļ€), so we write:Ā  ω·T = 2Ļ€ and, therefore:

ω = 2Ļ€/T = 2π·f

Now we’re ready to play with the Planck-Einstein relation. WeĀ knowĀ it gives us the energy ofĀ oneĀ photon really, but what ifĀ we re-write our equation Eγ = hĀ·fĀ as Eγ/fĀ = h? The dimensions in this equation are:

[xĀ joule]Ā·[1 second]/[fĀ cyles] = [6.626Ɨ10āˆ’34Ā joule]Ā·[1 second]

⇔ xĀ =Ā 6.626Ɨ10āˆ’34Ā jouleĀ per cycle

So that means that the energy per cycle is equal toĀ 6.626Ɨ10āˆ’34Ā joule,Ā i.e. the value of Planck’s constant.

Let me rephrase trulyĀ amazing result, so you appreciate it—perhaps: regardless of the frequency of the light (or our electromagnetic wave, in general) involved, the energy per cycle, i.e. per wavelength or per period, is alwaysĀ equal to 6.626Ɨ10āˆ’34Ā joule or, using the electronvolt as the unit, 4.135667662Ɨ10āˆ’15Ā eV. So, in case you wondered,Ā thatĀ is the trueĀ meaning of Planck’s constant!

Now, if we have the frequency f, we also have the wavelength Ī», because the velocity of the wave is the frequency times the wavelength:Ā cĀ = λ·f and, therefore, Ī» = c/f.Ā So if we increase the frequency, the wavelength becomes smaller and smaller, and so we’re packing the same amount of energy – admittedly, 4.135667662Ɨ10āˆ’15Ā eV is a very tiny amount of energy – into a space that becomes smaller and smaller. Well… What’s tiny, and what’s small? All is relative, of course. šŸ™‚ So that’s where the Planck scale comes in. IfĀ we pack that amount of energy into some tiny little space of the Planck dimension, i.e. a ā€˜length’ of 1.6162Ɨ10āˆ’35Ā m, then it becomes a tiny black hole, and it’s hard to think about how that would work.

[…] Let me make a small digression here. I said it’s hard to think about black holes but, of course, it’s not because it’s ‘hard’ that we shouldn’t try it. So let me just mention a few basic facts. For starters, black holes do emit radiation! So they swallow stuff, but they also spit stuff out. More in particular, there is the so-called Hawking radiation,Ā as Roger Penrose and Stephen Hawking discovered.

Let me quickly make a few remarks on that: Hawking radiation is basically a form of blackbody radiation, so all frequencies are there, as shown below: theĀ distributionĀ of the various frequencies depends on the temperature of the black body, i.e. the black hole in this case. [The black curve is the curve that Lord Rayleigh and Sir James Jeans derived in the late 19th century, usingĀ classicalĀ theory only, so that’s the one that does notĀ correspond to experimental fact, and which led Max Planck to become the ‘reluctant’ father of quantum mechanics. In any case, that’s history and so I shouldn’t dwell on this.]

600px-Black_body

The interesting thing about blackbody radiation, includingĀ Hawking radiation, is that it reduces energy and, hence, the equivalent mass of our blackbody. SoĀ Hawking radiation reduces the mass and energy of black holes and is therefore also known as black hole evaporation. So black holes that lose more mass than they gain through other means are expected to shrink and ultimately vanish. Therefore, there’s all kind of theories that say why micro black holes, like that Planck scale black hole we’re thinking of right now, should be muchĀ larger net emitters of radiation than large black holes and, hence, whey they should shrink and dissipate faster.

Hmm… Interesting… What do we do with all of this information? Well… Let’s think about it as we continue our trek on this long journey to reality over the next year or, more probably, years (plural). šŸ™‚

The key lesson here is that space and time are intimately related because of the idea of movement, i.e. the idea of something having some velocity, and that it’s not so easy to separate the dimensions of time and distance in any hard and fast way. As energy scales become larger and, therefore, our natural time and distance units become smaller and smaller, it’s the energy concept that comes to the fore.Ā It sort of ‘swallows’ all other dimensions, and it does lead to limiting situations which are hard to imagine. Of course, that just underscores the underlying unity of Nature, and the mysteries involved.

So… To relate all of this back to the story that our professor is trying to tell, it’s a simple story really. He’s talking aboutĀ two fundamental constants basically, c and h, pointing out thatĀ cĀ is a property of empty space, andĀ h is related to something doing something. Well… OK. That’s really nothing new, and surely notĀ ground-breaking research. šŸ™‚

Now, let me finish my thoughts on all of the above by making one more remark. If you’ve read a thing or two about this – which you surely have – you’ll probably say: this is not how people usually explain it. That’s true, they don’t. Anything I’ve seen about this just associates theĀ 1043 Hz scale with the 1028Ā eV energy scale, using the same Planck-Einstein relation. For example, the Wikipedia article on micro black holes writes that “the minimum energy of a microscopic black hole is 1019Ā GeV [i.e. 1028Ā eV], which would have to be condensed into a region on the order of the Planck length.” So that’s wrong. I want to emphasize this point because I’ve been led astray by it for years.Ā It’s not the totalĀ photon energy, butĀ the energy per cycle that counts. Having said that, it is correct, however, and easy to verify, that theĀ 1043 Hz scale corresponds to a wavelength of the Planck scale: Ī» =Ā c/fĀ = (3Ɨ108Ā m/s)/(1043Ā sāˆ’1) = 3Ɨ10āˆ’35Ā m. The confusion between the photon energy and the energy per wavelengthĀ arises because of the idea of a photon: it travels at the speed of light and, hence, because of theĀ relativistic length contraction effect, it is said to be point-like, to have no dimension whatsoever. So that’s why we think of packing all of its energy in some infinitesimally small place. But you shouldn’t think like that. The photon is dimensionless in ourĀ reference frame: in its own ‘world’, it is spread out, so itĀ isĀ a wave train. And it’s in its ‘own world’ that the contradictions start… šŸ™‚

OK. Done!

My third and final point is about what our professor writes on the fundamental physical constants, and more in particular on what he writes on the fine-structure constant. In fact, I could just refer you toĀ my own post on it, but that’s probably a bit too easy for me and a bit difficult for you šŸ™‚ so let me summarize that post and tell you what you need to know about it.

The fine-structure constant

The fine-structure constant α is a dimensionlessĀ constant whichĀ also illustrates the underlying unity of Nature, but in a way that’sĀ much more fascinatingĀ than the two or three things the professor mentions. Indeed, it’s quite incredible how this number (α =Ā 0.00729735…, but you’ll usually see it written as its reciprocal, which is a number that’s close to 137.036…) links charge with the relative speeds, radii, and the mass of fundamental particles and, therefore, how this number also these concepts with each other. And, yes, the fact that it is, effectively, dimensionless, unlike h or c, makes it even more special.Ā Let me quickly sum up what the very same number α all stands for:

(1) α is the square of the electron charge expressed in Planck units: α = eP2.

(2) α is the square root of the ratio of (a) the classical electron radius and (b) the Bohr radius: α = √(reĀ /r). You’ll see this more often written asĀ reĀ = α2r. Also note that this is an equation that doesĀ notĀ depend on the units, in contrast to equation 1 (above), and 4 and 5 (below), which require you to switch to Planck units. It’s the square of a ratio and, hence, the units don’t matter. They fall away.

(3) α is the (relative) speed of an electron: α = v/c. [The relative speed is the speed as measured against the speed of light. Note that the ā€˜natural’ unit of speed in the Planck system of units is equal to c. Indeed, if you divide one Planck length by one Planck time unit, you get (1.616Ɨ10āˆ’35Ā m)/(5.391Ɨ10āˆ’44Ā s) =Ā cĀ m/s. However, this is another equation, just like (2), that does notĀ depend on the units: we can express v and c in whatever unit we want, as long we’re consistent and express both in the same units.]

(4) α is also equal to the product of (a) the electron mass (which I’ll simply write as meĀ here) and (b) the classical electron radius reĀ (if both are expressed in Planck units): α =Ā meĀ·re. Now IĀ thinkĀ that’s, perhaps, theĀ mostĀ amazing of all of the expressions for α. [If you don’t think that’s amazing, I’d really suggest you stop trying to study physics. :-)]

Also note that, from (2) and (4), we find that:

(5) The electron mass (in Planck units) is equal me = α/re = α/α2r = 1/αr. So that gives us an expression, using α once again, for the electron mass as a function of the Bohr radius r expressed in Planck units.

Finally, we can also substitute (1) in (5) to get:

(6) The electron mass (in Planck units) is equal to me = α/re  = eP2/re. Using the Bohr radius, we get me = 1/αr = 1/eP2r.

So… As you can see, this fine-structure constant really links allĀ of the fundamental properties of the electron: its charge, its radius, its distance to the nucleus (i.e. the Bohr radius), its velocity, its mass (and, hence, its energy),…

So… Why is what it is?

Well… We all marvel at this, but what can we say about it, really? I struggle how to interpret this, just as much – or probably much more šŸ™‚ – as the professor who wrote the article I don’t like (because it’s so imprecise, and that’s what made me write all what I am writing here).

Having said that, it’s obvious that it points toĀ a unity beyond these numbers and constants that I am only beginningĀ to appreciate for what it is: deep, mysterious, and very beautiful.Ā But so I don’t think that professor does a good job at showing how deep, mysterious and beautiful it all is. But then that’s up to you, my brother and you, my imaginary reader, to judge, of course. šŸ™‚

[…] I forgot to mention what I mean with ‘Planck units’. Well… Once again, I should refer you to one of my other posts. But, yes, that’s too easy for me and a bit difficult for you. šŸ™‚ So let me just note we get those Planck units by equating not less thanĀ five fundamental physical constants to 1, notably (1) the speed of light, (2) Planck’s (reduced) constant, (3) Boltzmann’s constant, (4) Coulomb’s constant and (5) Newton’s constant (i.e. the gravitational constant). Hence, we have a set of five equations here (cĀ = ħ = kBĀ = keĀ = G = 1), and so we can solve that to get the five Planck units, i.e. the Planck length unit, the Planck time unit, the Planck mass unit, the Planck energy unit, the Planck charge unit and, finally (oft forgotten), the Planck temperature unit. Of course, you should note that all mass and energy units areĀ directlyĀ related because of the mass-energy equivalence relationĀ E = mc2, which simplifies to E = m if cĀ is equated to 1. [I could also say something about the relation between temperature and (kinetic) energy, but I won’t, as it would only further confuse you.]

OK. Done! šŸ™‚

Addendum:Ā How to think about space and time?

If you read the argument on the Planck scale and constant carefully, then you’ll note that it does notĀ depend on the idea of an indivisible photon. However, it does depend on that Planck-Einstein relation being valid always and everywhere. Now, the Planck-Einstein relation is, in its essence, a fairly basic result from classical electromagnetic theory: itĀ incorporatesĀ quantum theory – remember: it’s the equation that allowed Planck to solve the black-body radiation problem, and so it’s why they call Planck the (reluctant) ‘Father of Quantum Theory’ – but it’sĀ notĀ quantum theory.

So the obvious question is: can we make this reflection somewhat more general, so we can think of theĀ electromagneticĀ force as an example only. In other words:Ā can we apply the thoughts above to any force and any movement really?

The truth is: I haven’t advanced enough in my little study to give the equations for the other forces. Of course, we could think of gravity, and I developed some thoughtsĀ on how gravity waves might look like, but nothing specific really. And then we have the shorter-range nuclear forces, of course: the strong force, and the weak force. The laws involved areĀ very different. The strong force involves colorĀ charges, and the way distances work is entirely different. So it would surely be some differentĀ analysis. However, the results should be the same. Let me offer some thoughts though:

  • We know that the relative strength of the nuclear force is much larger, because it pulls like charges (protons) together, despite the strong electromagnetic force that wants to push them apart! So the mentioned problem of trying to ‘pack’ some oscillation in some tiny little space should be worse with the strong force. And the strong force is there, obviously, at tiny little distances!
  • Even gravity should become important, because if we’ve got a lot of energy packed into some tiny space, its equivalent mass will ensure the gravitational forces also become important. In fact, that’s what the whole argument was all about!
  • There’s also all this talk about the fundamental forces becoming one at the Planck scale. I must, again, admit my knowledge is not advanced enough to explain how that would be possible, but I must assume that, if physicists are making such statements, the argument must be fairly robust.

So…Ā Whatever charge or whatever force we are talking about, we’ll be thinking of waves or oscillations—or simply movement, but it’s always a movement in a force field, and so there’s power and energy involved (energy is force times distance, and power is the time rate of change of energy). So, yes, we should expect the same issues in regard to scale. And so that’s what’s captured byĀ h.

As we’re talking the smallest things possible, I should also mention that there are also other inconsistencies in the electromagnetic theory, which should (also) have their parallel for other forces. For example, the idea of aĀ point chargeĀ is mathematically inconsistent, as I show in my post on fields and charges. Charge, any charge really, must occupyĀ someĀ space. It cannotĀ all be squeezed into one dimensionless point. So the reasoningĀ behind the Planck time and distance scale is surely valid.

In short, the whole argument about the Planck scale and those limits is veryĀ valid. However, does it imply our thinking about the Planck scale is actually relevant? I mean: it’s not because we canĀ imagineĀ how things might look like  – they may lookĀ like those tiny little black holes, for example – that these things actuallyĀ exist.Ā GUT or string theorists obviously think they are thinking about somethingĀ real. But, frankly, Feynman had a point when he said what he said about string theory, shortly before his untimely death in 1988: “I don’t like that they’re not calculating anything. I don’t like that they don’t check their ideas. I don’t like that for anything that disagrees with an experiment, they cook up an explanation—a fix-up to say, ‘Well, it still might be true.'”

It’s true that the so-called Standard Model doesĀ notĀ look very nice. It’s not like Maxwell’s equations. It’s complicated. It’s got various ‘sectors’: the electroweak sector, the QCD sector, the Higgs sector,… So ‘it looks like it’s got too much going on’, as a friend of mine said when he looked at a new design for mountainbike suspension. šŸ™‚ But, unlike mountainbike designs, there’s no real alternative for the Standard Model. So perhaps we should just accept it is what it is and, hence, in a way, accept Nature as we canĀ see it. So perhaps we should just continue to focus on what’s here, before we reach the Great Desert, rather than wasting time on trying to figure out how things might look like on the other side, especially because we’ll never be able to test our theories about ‘the other side.’

On the other hand, we can see where the Great Desert sort of startsĀ (somewhere near the 1032Ā Hz scale), and so it’s only natural to think it should alsoĀ stop somewhere. In fact, we knowĀ where it stops: it stops at the 1043Ā Hz scale, because everything beyond that doesn’t make sense. The question is: is there actually there? Like fundamental strings or whatever you want to call it. Perhaps we should just stop where the Great Desert begins. And what’s the Great Desert anyway? Perhaps it’s a desert indeed, and so then there is absolutelyĀ nothingĀ there. šŸ™‚

Hmm… There’s not all that much one can say about it. However, when looking at the history of physics, there’s one thing that’s really striking. Most of what physicists can think of,Ā in the sense that it made physical sense, turned out toĀ exist. Think of anti-matter, for instance. Paul Dirac thought it might exist, that it made sense to exist, and so everyone started looking for it, and Carl Anderson found in a few years later (in 1932). In fact, it had been observed before, but people just didn’t pay attention, so they didn’t want to see it, in a way. […] OK. I am exaggerating a bit, but you know what I mean. The 1930s are full of examples like that. There was a burst of scientific creativity, as the formalism of quantum physics was being developed, and the experimental confirmations of the theory just followed suit.

In the field of astronomy, or astrophysics I should say, it was the same with black holes.Ā No one could really imagine the existence of black holes until the 1960s or so: they were thought of a mathematical curiosity only, a logical possibility. However, the circumstantial evidence now is quite large and so… Well… It seems a lot of what we can think ofĀ actually has some existence somewhere. šŸ™‚

So… Who knows?Ā […] I surely don’t. And so IĀ need to get back toĀ the grind and work my way through the rest of Feynman’s LecturesĀ and the related math. However, this was a nice digression, and so I am grateful to my brother he initiated it. šŸ™‚

Some content on this page was disabled on June 17, 2020 as a result of a DMCA takedown notice from Michael A. Gottlieb, Rudolf Pfeiffer, and The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/

Self-inductance, mutual inductance, and the power and energy of inductors

As Feynman puts it, studying physics is not always about the ‘the great and esoteric heights’. In fact, you usuallyĀ have to come down from then fairly quickly – studying physics is similar to mountain climbing in that regard šŸ™‚ – and study ‘relatively low-level subjects’, such as electrical circuits, which is what we’ll do in this and the next post.

As I’ve introduced some key concepts in a previous post already, let me recapitulate the basics, which include the concept of theĀ electromotive force, which is basically theĀ voltage, i.e. the potential difference, that’s produced in a loop or coil of wire as the magnetic flux changes. I also talked about theĀ impedanceĀ in a AC circuit. Finally, we also discussed the powerĀ andĀ energiesĀ involved. Important results from this previous discussion include (but are not limited to):

  1. A constant speed AC generator will create an alternating current with the emf, i.e. the voltage, varying as V0Ā·sin(ωt).
  2. If we only have resistors as circuit elements, and the resistance in the circuit adds up to R, then the electric current in the circuit will be equal to I = ʐ/R = V/R = (V0/R)Ā·sin(ωt). So that’s Ohm’s Law, basically.
  3. The power that’s produced and consumed in an AC circuit is the product of the voltage and the current, so P = Ɛ·I = VĀ·I. We also showed this electrical power is equal to theĀ mechanicalĀ power dW/dt that makes the generator run.
  4. Finally, we explained the concept of impedance (denoted by Z) using Euler’s formula:Ā Z =Ā |Z|eiĪø, mentioning that, if other circuit elements than resistors are involved, such as inductors,Ā then it’s quite likely that the current signal will lag the voltage signal, with the phase factor Īø telling us by how much.

It’s now time to introduce those ‘other’ circuit elements. So let’s start with inductors here, and the concept ofĀ inductanceĀ itself. There’s a lot of stuff to them, and so let’s go over it point by point.

The concept of self-inductance

In its simplest form, an inductor is just a coil, but they come in all sizes and shapes. If you want to see how they might look like, justĀ googleĀ some images of micro-electronic inductors, and then, just to see the range of applications, some images of inductors used for large-scale industrial applications. If you do so, you’re likely to see images ofĀ transformersĀ too, because transformers work on the principle ofĀ mutualĀ inductance, and so they involveĀ twoĀ coils, i.e. two inductors.

Contrary to what you might expect, the concept of self-inductance (or inductance tout court) is quite simply: a changing current will cause a changingĀ magnetic field and, hence, some emf. Now, it turns out that the inducedĀ emfĀ is proportional to the change in current. So we’ve got another constant of proportionality here, so it’s like how we defined resistance, orĀ capacitance. So, in many ways, theĀ inductanceĀ is just another proportionality coefficient. If we denote it by L – the symbol is said to honor the Russian phyicistĀ Heinrich Lenz, whom you know from Lenz’ Law – then we define it as:

L = āˆ’Ę/(dI/dt)

The dI/dt factor is, obviously, the time rate of change of the current, and the negative sign indicates that the emfĀ opposesĀ the change in current, so it will tend to cause anĀ opposingĀ current. That’s why the emf involved is often referred to as a ‘back emf’. So that’sĀ Lenz’ LawĀ basically. As you might expect, the physicists came up with yet another derived unit, theĀ Henry, to honor yet another physicist, Joseph Henry, an American scientist who was a contemporary of Michael Faraday and independently discovered pretty much the same as Faraday: oneĀ henryĀ (H) equals one voltĀ·second per ampere: 1 H = 1 VĀ·s/A.

The concept of mutual inductance

FeynmanĀ introduces the topic of inductance with a two-coil set-up, as shown below, noting that a current in coil 1 will induce some emf in coil 2, which he denotes byĀ M12. Conversely, a current in coil 2 will induce some emf in coil 1, which he denoted by M21. M12Ā and M21Ā are also constants: they depend on the geometry of the coils, including the length of the solenoid (l), its surface area (S) and the number of loop turns of the coils (N1Ā and N2).

mutual inductance

The next step in the analysis is then to acknowledge that each coil should also produce a ‘back emf’ in itself, which we can denote by M11Ā and M22Ā respectively, but then these constants are, of course, equal to the self-inductanceĀ of the coils so, taking into account the convention in regard to the sign of the self-inductance, we write:

L1Ā = āˆ’ M11Ā and L1Ā =Ā āˆ’ M22

You will now wonder: what’s theĀ totalĀ emf in each coil, taking into account that we do not only have mutual inductance but also self-inductance? Frankly, when I was a kid, and my father tried to tell me one or two things about this, it confused me very much. I could not imagine what happened in one coil, let alone in twoĀ coils. I had this vision of a current producing some ‘back-current’, and then the ‘back-current’ producing ‘back-current’ again, and so I could not imagine how one could solve this problem. So the image in my head was very much like that baking powder box which Feynman evokes when talking about the method of imagesĀ to find the electric fields in situations with an easy geometry, so that’s the picture of a baking powder box which has on its label a picture of a baking powder box which has… Well… Etcetera. Unfortunately, my father didn’t push us to study math and, therefore, I knewĀ that one could solve such problems mathematically – we’re talking a converging series here – but I did notĀ know how, and that’s why I found it all veryĀ confusing.

Now I understand there’s one current only, and one potential difference only, and that the formulas doĀ notĀ involve some infinite series of terms. But… Well… I am not ashamed to say these problems are still testing the (limited) agility of my mind. The first thing to ‘get’ is that we’re talking a backĀ emf, and so that’s not a current but a potential difference. In fact, as I explained in my post on the electromotive force, the term ‘force’ in emf is actually misleading, and may lead to that same erroneous vision that I had as a kid: forces generating counter-forces, that generate counter-forces, that generate counter-forces, etcetera. It’s not like that: we have some current – oneĀ current – in a coil and we’ll have someĀ voltage – oneĀ voltage – across the coil. If the coil would be a resistorĀ instead of a coil, we’d find that the ratio of this voltage and the current would be some constant R = V/I. Now here we’re talking a coil indeed, so that’s aĀ differentĀ circuit element, and we find some other ratio, L =Ā āˆ’V/(dI/dt) = āˆ’Ę/(dI/dt). Why the minus sign? Well… As said, the inducedĀ emf will be such that it will tend to counter the current, and current flows fromĀ positiveĀ toĀ negativeĀ as per our convention.

But…Ā Yes?Ā So how does it work when we put this coil in some circuit, and how does the resistance of the inductor come into play? Relax. We’ve just been talking ideal circuit elementsĀ so far, and we’ve discussed only two: the resistor and the inductor. We’ll talk about voltage sources (or generators) and capacitors too, and then we’ll link all of these ideal circuit elements. In short, we’ll analyze some real-life electricalĀ circuitĀ soon, but first you need to understand the basics. Let me just note that an ideal inductor appears a zero-resistance conductor in a direct currentĀ (DC) circuit, so it’s a short-circuit really! Please try to mentallyĀ separate out those ‘ideal’ circuit components. Otherwise you’ll never be able to make sense of it all!

In fact, there’s a good reason why Feynman starts with explainingĀ mutual inductance before discussing a little circuit like the one below, which has an inductorĀ andĀ a voltage source. The two-coil situation above is effectively easier to understand, although it may not look like that at first. So let’s analyze that two-coil situation in more detail first. In other words, let me try to understand the situation that I didn’t understand as a kid. šŸ™‚

circuit with coil

Because of the law of superposition, we should add fluxes and changes in fluxes and, hence, we should also add the electromotive forces, i.e. the induced voltages. So, what we have here is that the total emf in coil 2 should be written as:

ʐ2Ā = M21Ā·(dI1/dt) +Ā M22Ā·(dI2/dt) =Ā M21Ā·(dI1/dt) – L2Ā·(dI2/dt)

What we’re saying here is that the emf, i.e. the voltage across the coil, will indeed depend on the change in current in the other coil, but also on the change in current of the coil itself. Likewise, the total emf in coil 1 should be written as:

ʐ1Ā = M12Ā·(dI2/dt) +Ā M11Ā·(dI1/dt) =Ā M12Ā·(dI2/dt) – L1Ā·(dI1/dt)

Of course, this does reduce to the simple L = āˆ’Ę/(dI/dt)Ā if there’s one coil only. But so you see where it comes from and, while we doĀ notĀ have some infinite series šŸ™‚ we do have a system of two equations here, and so let me say one or two things about it.

The first thing to note is that it is not so difficult to show thatĀ M21Ā is equal toĀ M12, so we can simplify and write that M21Ā = M12Ā = M. Now, while I said ‘not so difficult’, I didn’t mean it’s easy and, because I don’t want this post to become too long, I’ll refer you to Feynman for the proof of this M21Ā = M12Ā = M equation. It’s a general proof for any two coils or ‘circuits’ of arbitrary shape and it’s really worth the read. However, I have to move on.

The second thing to note is that this coefficientĀ M, which is referred to as theĀ mutual inductance now (so singular instead of plural) depends on the ‘circuit geometry’ indeed. For a simple solenoid, Feynman calculates it as

M = āˆ’(1/ε0c2)Ā·(N1Ā·N2)Ā·S/l,

withĀ lĀ the length of the solenoid, SĀ its surface area (S), and N1Ā and N2Ā the respective number of loop turns of the two coils. So, yes, only ‘geometry’ comes into play. [Note that’s quite obvious from the formula because a switch of the subscriptsĀ ofĀ N1Ā and N2Ā makes no difference, of course!]. Now, it’s interesting to note that M is the same for, let’s say, N1Ā = 100 and N2Ā = 10 and for N1Ā = 20 and N2Ā = 50. In fact, because you’re familiar with what transformers do, i.e. transforming voltages, you may think that’s counter-intuitive. It’s not. The thing with the number of coils does notĀ imply that ʐ1Ā and Ɛ2Ā remain the same. Our set of equations is Ɛ1Ā = MĀ·(dI2/dt) – L1Ā·(dI1/dt) and Ɛ2Ā = MĀ·(dI1/dt) – L2Ā·(dI2/dt), and so L1Ā and L2Ā clearly do vary asĀ N1Ā and N2Ā vary! So… Well… Yes. We’ve got a set of two equations with two independent variables (I1Ā and I2) and two dependent variables (ʐ1Ā and ʐ2). Of course, we could also phrase the problem the other way around: given two voltages, what are the currents? šŸ™‚

Of course, that makes us think of the power that goes in and out of a transformer. Indeed, you’ll remember that power is voltage times current. So what’s going on here in regard to that?

Well… There’s a thing with transformers, or with two-coil systems like this in general, that is referred to as coupling. The geometry of the situation will determineĀ how much flux from one coil is linked with the flux of the other coil. If most, or all of it, is linked, we say the two coils are ‘tightly coupled’ or, in the limit, that they are fully coupled. There’s a measure for that, and it’s called theĀ coefficient of coupling. Let’s first explore that concept of power once more.

Inductance, energy and electric power

It’s easy to see that we needĀ electric powerĀ to get some current going. Now, as we pointed out in our previous post, the power is equal to the voltage times the current. It’s also equal, of course, to the amount of work done per second, i.e. the time rate of change of the energy W, so we write:

dW/dt = Ɛ·I

Now, we defined the self-inductance as L = āˆ’Ę/(dI/dt) and, therefore, we know that Ɛ = āˆ’LĀ·(dI/dt), so we have:

dW/dt = āˆ’LĀ·IĀ·(dI/dt)

What is this? A differential equation? Yes and no. We’ve got not one but two functions of time here, W and I, and, while their derivatives with respect to time do appear in the equation, what we need to do is just integrate the two sides over time. We get: W =Ā āˆ’(1/2)Ā·LĀ·I2. Just check it by taking the time derivative of both sides. Of course, we can add any constant, to both sides in fact, but that’s just a matter of choosing some reference point. We’ll chose our constant to be zero, and also think about the energy that’sĀ storedĀ in the coil, i.e. U, which we define as:

U =Ā āˆ’W =Ā āˆ’(1/2)Ā·LĀ·I2Ā 

Huh?Ā What’s going on here? Well… It’s not an easy discussion, but let’s try to make sense of it. We have someĀ changingĀ current in the coil here but, obviously, some kind of inertia also: the coil itself opposes the change in current through the ‘back emf’. It requires energy, or power, to overcome the inertia. We may think of applying some voltage to offset the ‘back emf’, so we may effectively think of that little circuit with an inductorĀ and a voltage source. The voltage V we’d need to apply to offset the inertia would, obviously, be equal to the ‘back emf’, but with its sign reversed, so we have:

V = āˆ’Ā ĘĀ = LĀ·(dI/dt)

Now, it helps to think of what a current really is: it’s about electric charges that are moving at some velocity v because some force is being applied to them. As in any system, the power that’s being delivered is the dot product of the force and the velocity vectors (that ensures we only take the tangential component of the force into account), so if we have nĀ moving charges, the power that is being delivered to the circuit isĀ (FĀ·v)Ā·n. What is F? It’s obviously, qE, as the electric field is the force per unit charge, so E = F/q. But so we’re talking some circuitĀ here and we need to think ofĀ the power being delivered to some infinitesimal element ds in the coil, and so that’s (FĀ·v)Ā·nĀ·ds, which can be written as:Ā (FĀ·ds)Ā·nĀ·v. And then we integrate over the whole coil to find:power

Now, you may or may not remember that the emf (Ɛ) is actually defined as the line integral ∫ E·ds line, taken around the entire coil and, hence, noting that E = F/q, and that the current I is equal to I = q·n·v, we got our power equation. Indeed, the integrand or kernel of our integral becomes F·n·v·ds = q·E·n·v·ds = I·E·ds. Hence, we get our power formula indeed: P = V·I, with V the potential difference, i.e. the voltage across the coil.

I am getting too much into the weeds here. The point is: we’ve got a full and complete analog to the concept of inertia in mechanics here: instead of some forceĀ F causing someĀ massĀ m to change its velocity according to Newton’s Law, i.e. F = mĀ·a = mĀ·(dv/dt), we here have a potential difference V causing some current I to change according to the V = LĀ·(dI/dt) law.

This is very confusing but, remember, the same equations must have the same solutions! So, in an electric circuit, the inductance is really like what the mass is in a mechanics. Now, in mechanics, we’ll say that our mass has some momentum p = mĀ·v, and we’ll also say that its kinetic energy is equal to (1/2)mĀ·v2. We can do the same for our circuit: potential energy is continuously being converted into kinetic energy which, for our inductor, we write as U = (1/2)Ā·LĀ·I2.

Just think about by playing with one of the many online graphing tools. The graph below, for example, assumes the current builds up to some maximum. As it reaches its maximum, theĀ storedĀ energy will also max out. Now, you should not worry about the units here, or the scaleĀ of the graphs. The assumption is that I builds up from 0 to 1, and that L = 1, so that makes U what it is. Using a different constant for L, and/or different units for I, will change the scale of U too, but not its general shape, and that shape gives you the general idea.

powerThe example above obviously assumes some direct current, so it’s a DC circuit: the current builds up, but then stabilizes at some maximum that we can find by applying Ohm’s Law to the resistanceĀ of the circuit: I = V/R. Resistance? But we were talking an ideal inductor? We are. If there’s no other resistance in the circuit, we’ll have a short-circuit, so the assumption is that we doĀ have some resistance in the circuit and, therefore, we should also think of some energyĀ lossĀ to heat from the current in the resistance, but that’s not our worry here.

The illustration below is, perhaps, more interesting. Here we are, obviously, applying anĀ alternating current, and so the current goes in one and then in the other direction, so I > 0, and then I < 0, etcetera. We’re assuming some nice sinusoidal curve for the current here (i.e. the blue curve), and so we get what we get for U (i.e. the red curve): the energy goes up and down between zero and some maximum amplitude that’s determined by the maximum current.

power 2

So, yes, it is, after all, quite intuitive: building up a current does require energy from some external source, which is used to overcome the ‘back emf’ in the inductor, and that energy is stored in the inductor itself. [If you still wonder why it’s stored in the inductor, think about the other question: where else would it be stored?]Ā HowĀ is stored? Look at the graph and think: it’s stored as kinetic energy of the charges, obviously. That explains why the energy is zero when the current is zero, and why the energy maxes out when the current maxes out. So, yes, it all makes sense! šŸ™‚

Let’s now get back to that coupling constant.

The coupling constant

We can apply our reasoning to two coils. Indeed, we know that Ɛ1Ā = MĀ·(dI2/dt) – L1Ā·(dI1/dt) and Ɛ2Ā = MĀ·(dI1/dt) – L2Ā·(dI2/dt). So the power in the two-coils system is dW/dt = Ɛ1Ā·I1Ā + ʐ2Ā·I2, so we have:

dW/dt = MĀ·I1(dI2/dt) – L1Ā·I1Ā·(dI1/dt) + MĀ·I2Ā·(dI1/dt) – L2Ā·I2Ā·(dI2/dt)

= – L1Ā·I1Ā·(dI1/dt) – L2Ā·I2Ā·(dI2/dt) + MĀ·I1(dI2/dt)Ā·I2Ā·(dI1/dt)

Integrating both sides, and equating U with āˆ’W once more, yields:

U = (1/2)Ā·L1Ā·I12Ā + (1/2)Ā·L2Ā·I22Ā + MĀ·I1Ā·I2

[Again, you should just take the time derivative to verify this. If you don’t forget to apply the product rule for the MĀ·I1Ā·I2Ā term, you’ll see I am not writing too much nonsense here.] Now, there’s an interestingĀ algebraic transformation of this expression, and an equally interesting explanationĀ whyĀ we’d re-write the expression as we do. Let me copy it fromĀ Feynman so I’ll be using his fancier L and M symbols now. šŸ™‚

explanation coupling constant

So what? Well… Taking into account that inequality above, we can write the relation between M and the self-inductancesĀ L1Ā andĀ L2Ā using some constant k, which varies between 0 and 1 and which we’ll refer to as theĀ coupling constant:

formula coupling constant 2

We refer to k as the coupling constant, for rather obvious reasons: if it’s near zero, the mutual inductance will be very small, and if it’s near one, then the coils are said to be ‘tightly coupled’, and the ‘mutual flux linkage’ is then maximized. As you can imagine, there’s a whole body of literature out there relating this coupling constant to the behavior of transformers or other circuits where mutual inductance plays a role.

The formula for self-inductance

We gave the formula for theĀ mutualĀ inductance of two coils that are arranged as one solenoid on top of the other (cf. the illustration I started with):

M = āˆ’(1/ε0c2)Ā·(N1Ā·N2)Ā·S/l

It’s a very easy calculation, so let me quickly copy it from Feynman:

calculation solenoid

You’ll say: where is the M here? This is a formula for the emf! It is, but M is the constant of proportionality in front, remember? So there you go. šŸ™‚

Now, you would think that getting a formula for the self-inductance L of some solenoid would be equally straightforward. It turns out that that isĀ notĀ the case. Feynman needs two full pages and… Well… By now, you should now how ‘dense’ his writing really is: if it weren’t so dense, you’d be reading Feynman yourself, rather than my ‘explanations’ of him. šŸ™‚ So… Well.. If you want to see how it works, just click on the link hereĀ and scroll down to the last two pages of hisĀ exposé on self-inductance. I’ll limit myself to just jotting down the formula he does obtain when he’s through the whole argument:

solenoid formula

See why he uses a fancier L than ‘my’ L? ‘His’ L is the length of the solenoid. šŸ™‚ And, yes, r is the radius of the coil and n the number of turnsĀ per unit lengthĀ in the winding. Also note this formula is valid only if L >> R, so the effects at theĀ endĀ of the solenoid can be neglected. OK. Done. šŸ™‚

Well… That’s it for today! I am sorry to say but the next post promises to be as boring as this one because… Well… It’s on electric circuits again. 😦

Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 20, 2020 as a result of a DMCA takedown notice from Michael A. Gottlieb, Rudolf Pfeiffer, and The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/

Reconciling the wave-particle duality in electromagnetism

As I talked about Feynman’s equation for electromagnetic radiation in my previous post, I thought I should add a few remarks on wave-particle duality, but then I didn’t do it there, because my post would have become way too long. So let me add those remarks here. In fact, I’ve written about this before, and so I’ll just mention the basic ideas without going too much in detail.Ā Let me first jot down the formula once again, as well as illustrate the geometry of the situation:

formual

geometry

The gist of the matter is that light, in classical theory, is a traveling electromagnetic field caused by an accelerating electric charge and that, because light travels at speed c, it’s the acceleration at the retarded time t – r/c, i.e. a‘ = a(t – r/c), that enters the formula. You’ve also seen the diagrams that accompany this formula:

EM 1 EM 2

The two diagrams above show that the curve of the electric fieldĀ in spaceĀ is a ā€œreversedā€ plot of the acceleration as a function of time. As I mentioned before, that’s quite obvious from the mathematical behavior of a function with argument like the argument above, i.e. a function F(t – r/c). When we write t – r/c, we basically measureĀ distance units in seconds, instead of in meter. So we basically useĀ cĀ as theĀ scale for both time as well as distance. I explained that in a previous post, so please have a look there if you’d want so see how that works.

So it’s pretty straightforward, really. However, having said that, when I see a diagram like the one above, so all of these diagrams plotting anĀ E or BĀ waveĀ in space, I can’t help thinking it’s somewhat misleading: after all, we’re talking something traveling at the speed of light here and, therefore, its length – inĀ ourĀ frame of reference – should be zero. And it is, obviously. Electromagnetic radiation comes packed in point-like, dimensionless photons: the length of something that travels at the speed of lightĀ mustĀ be zero.

Now, I don’t claim to know what’s going onĀ exactly, but my thinking on it may not be far off the mark. We know that light is emitted and absorbed by atoms, as electrons go from one energy level to another, and the energy of theĀ photonsĀ of lightĀ corresponds to the differenceĀ between those energy levels (i.e. a few electron-volt only, typically: it’s given by the E = h·ν relation). Therefore, we can look at a photon as aĀ transient electromagnetic wave. It’s a very short pulse: the decay time for one such pulse ofĀ sodium light, i.e. one photon of sodium light,Ā isĀ 3.2Ɨ10–8Ā seconds. However, taking into account the frequency of sodium light (500 THz), that still makes for some 16 million oscillations, and a wave-train with a length of almost 10 meter. [Yes. Quite incredible, isn’t it?]Ā SoĀ the photon could look like the transient wave I depicted below, except… Well… This wavetrain is traveling at the speed of light and, hence, we will not see it as a ten-meter long wave-train. Why not? Well…Ā Because of the relativistic length contraction, it will effectively appear as a point-like particle to us.

Photon wave

So relativistic length contraction is whyĀ the wave and particle duality can be easily reconciled in electromagnetism: we can think of light as an irregular beam of point-like photons indeed, as one atomic oscillator after the other releases a photon, in no particularly organized way. So we can think of photons asĀ transient wave-trains, but we should remind ourselves that they are traveling at the speed of light, so they’ll look point-like to us.

Is such view consistent with theĀ results of the famous – of should I say infamous? – double-slit experiment. Well… Maybe. As I mentioned in one of my posts, it is rather remarkable that is actually hard to find actual double-slitĀ experiments that use actualĀ detectors near the slits, and even harder to find such experiments involving photons! Indeed, experiments involving detectors near the slits are usually experiments with ‘real’ particles, such as electrons, for example. Now, a lot of advances have been made in the set-up of these experiments over the past five years, and one of these experiments isĀ a 2010 experiment of an Italian teamĀ which suggests that it’s the interaction between the detector and the electron waveĀ that may cause the interference pattern to disappear. Now thatĀ throws some doubts on the traditional explanation of the results of the double-slit experiment.

The idea is shown below. The electron is depicted as an incoming plane wave which effectively breaks up as it goes through the slits. The slit on the left has no ā€˜filter’ (which you may think of as a detector) and, hence, the plane wave goes through as a cylindrical wave. The slit on the right-hand side isĀ covered by a ā€˜filter’ made of several layers of ā€˜low atomic number material’, so the electron goes through but, at the same time, the barrier creates a spherical wave as it goes through. The researchers note that ā€œthe spherical and cylindrical wave do not have any phase correlation, and so even if an electron passed through both slits, the two different waves that come out cannot create an interference pattern on the wall behind them.ā€ [I hope I don’t have to remind you that, while being represented as ā€˜real’ waves here, the ā€˜waves’ are, obviously, complex-valued psi functions.]

double-slit experiment

In fact, to be precise, the experimenters note that there still was an interference effect if the filter was thin enough. Let me quote the reason for that: ā€œThe thicker the filter, the greater the probability for inelastic scattering. When the electron suffers inelastic scattering, it is localized. This means that its wavefunction collapses and, after the measurement act, it propagates roughly as a spherical wave from the region of interaction, with no phase relation at all with other elastically or inelastically scattered electrons. If the filter is made thick enough, the interference effects cancels out almost completely.ā€

This does notĀ solve the ‘mystery’ of the double-slit experiment, but it throws doubt on how it’s usually being explained. The mystery in such experiments is that, when we put detectors, it isĀ eitherĀ theĀ detector at AĀ or the detector at B that goes off. They should never go off togetherā€”ā€at half strength, perhapsā€, as Feynman puts it. But so there areĀ doubts here now. Perhaps the electron doesĀ go through both slits at the same time! And soĀ that’s why I used italics when writing ā€œeven if an electron passed through both slitsā€: the electron, or the photon in a similar set-up, is not supposed to do that according to the traditional explanation of the results of the double-slit experiment! It’s one or the other, and the wavefunction collapsesĀ or reduces as it goes through.Ā 

However,Ā that’s where these so-called ā€˜weak measurement’ experiments now come in, like this 2010 experiment: it does not prove but indicates that interaction does not have to be that way. They strongly suggest that it is notĀ all or nothing, that our observations should not necessarily destroy the wavefunction. So, who knows, perhaps we will be able, one day, to show that the wavefunction does go through both slits, as it should (otherwise the interference pattern cannot be explained), and then we will have resolved the paradox.

I am pretty sure that, when that’s done, physicists will also be able to relate the image of a photon as a transient electromagnetic wave (cf. the diagram above), being emitted by an atomic oscillator for a few nanoseconds only (we gave the example for sodium light, for which the decay time was 3.2Ɨ10–8Ā seconds) with the image of a photon as a particle that can be represented by a complex-valued probability amplitude functionĀ (cf. the diagram below). I look forward to that day. I think it will come soon.

Photon wave

Here I should add two remarks. First, a lot hasĀ been said about the so-called indivisibility of a photon, but inelastic scattering implies that photons are notĀ monolithic: the photon loses energy to the electron and, hence, its wavelength changes. Now, you’ll say: the scattered photon is not the same photon as the incident photon, and you’re right. But… Well. Think about it. It does say something about the presumed oneness of a photon.

I15-72-Compton1

The other remark is on the mathematics of interference. Photons are bosons and, therefore, we have to add their amplitudes to get the interference effect. So you may try to think of an amplitude function, like ĪØ = (1/√2Ļ€)Ā·eiĪøĀ or whatever, and think it’s just a matter of ‘splitting’ this function before it enters the two slits and then ‘putting it back together’, so to say, after our photon has gone through the slits. [For the detailed math of interference in quantum mechanics, see my page on essentials.] Ā Well… No. It’s not that simple. The illustration with that plane wave entering the slits, and the cylindrical and/or spherical wave coming out, makes it obvious that something happens to our wave as it goes through the slit. As I said a couple of times already, the two-slit experiment is interesting, but the interference phenomenon – or diffraction as it’s called – involving one slit only is at least as interesting. So… Well… The analysis isĀ notĀ that simple. Not at all, really. šŸ™‚

Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 16, 2020 as a result of a DMCA takedown notice from The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/