The photon wavefunction

Post scriptum note added on 11 July 2016: This is one of the more speculative posts which led to my e-publication analyzing the wavefunction as an energy propagation. With the benefit of hindsight, I would recommend you to immediately the more recent exposé on the matter that is being presented here, which you can find by clicking on the provided link.

Original post:

In my previous posts, I juxtaposed the following images:

Both are the same, and then they’re not. The illustration on the left-hand side shows how the electric field vector (E) of an electromagnetic wave travels through space, but it does not show the accompanying magnetic field vector (B), which is as essential in the electromagnetic propagation mechanism according to Maxwell’s equations:

∂B/∂t = –∇×E
∂E/∂t = c²∇×B = ∇×B for c = 1

The second illustration shows a wavefunction eⁱ^{(kx − ωt)}= cos(kx − ωt) + i∙sin(kx − ωt). Its propagation mechanism—if we can call it like that—is Schrödinger’s equation:

∂ψ/∂t = i·(ħ/2m)·∇²ψ

We already drew attention to the fact that an equation like this models some flow. To be precise, the Laplacian on the right-hand side is the second derivative with respect to x here, and, therefore, expresses a flux density: a flow per unit surface area, i.e. per square meter. To be precise: the Laplacian represents the flux density of the gradient flow of ψ.

On the left-hand side of Schrödinger’s equation, we have a time derivative, so that’s a flow per second. The ħ/2m factor is like a diffusion constant. In fact, strictly speaking, that ħ/2m factor is a diffusion constant, because it does exactly the same thing as the diffusion constant D in the diffusion equation ∂φ/∂t = D·∇²φ, i.e:

As a constant of proportionality, it quantifies the relationship between both derivatives.
As a physical constant, it ensures the dimensions on both sides of the equation are compatible.

So our diffusion constant here is ħ/2m. Because of the Uncertainty Principle, m is always going to be some integer multiple of ħ/2, so ħ/2m = 1, 1/2, 1/3, 1/4 etcetera. In other words, the ħ/2m term is the inverse of the mass measured in units of ħ/2. We get the terms of the harmonic series here. How convenient! 🙂

In our previous posts, we studied the wavefunction for a zero-mass particle. Such particle has zero rest mass but – because of its movement – does have some energy, and, therefore, some mass and momentum. In fact, measuring time and distance in equivalent units (so c = 1), we found that E = m = p = ħ/2 for the zero-mass particle. It had to be. If not, our equations gave us nonsense. So Schrödinger’s equation was reduced to:

∂ψ/∂t = i·∇²ψ

How elegant! We only need to explain that imaginary unit (i) in the equation. It does a lot of things. First, it gives us two equations for the price of one—thereby providing a propagation mechanism indeed. It’s just like the E and B vectors. Indeed, we can write that ∂ψ/∂t = i·∇²ψ equation as:

Re(∂ψ/∂t) = −Im(∇²ψ)
Im(∂ψ/∂t) = Re(∇²ψ)

You should be able to show that the two equations above are effectively equivalent to Schrödinger’s equation. If not… Well… Then you should not be reading this stuff.] The two equations above show that the real part of the wavefunction feeds into its imaginary part, and vice versa. Both are as essential. Let me say this one more time: the so-called real and imaginary part of a wavefunction are equally real—or essential, I should say!

Second, i gives us the circle. Huh? Yes. Writing the wavefunction as ψ = a + i·b is not just like writing a vector in terms of its Cartesian coordinates, even if it looks very much that way. Why not? Well… Never forget: i²= −1, and so—let me use mathematical lingo here—the introduction of i makes our metric space complete. To put it simply: we can now compute everything. In short, the introduction of the imaginary unit gives us that wonderful mathematical construct, eⁱ^{(kx − ωt)}, which allows us to model everything. In case you wonder, I mean: everything! Literally. 🙂

However, we’re not going to impose any pre-conditions here, and so we’re not going to make that E = m = p = ħ/2 assumption now. We’ll just re-write Schrödinger’s equation as we did last time—so we’re going to keep our ‘diffusion constant’ ħ/2m as for now:

Re(∂ψ/∂t) = −(ħ/2m)·Im(∇²ψ)
Im(∂ψ/∂t) = (ħ/2m)·Re(∇²ψ)

So we have two pairs of equations now. Can they be related? Well… They look the same, so they had better be related! 🙂 Let’s explore it. First note that, if we’d equate the direction of propagation with the x-axis, we can write the E vector as the sum of two y- and z-components: E = (E_y, E_z). Using complex number notation, we can write E as:

E = (E_y, E_z) = E_y + i·E_z

In case you’d doubt, just think of this simple drawing:

The next step is to imagine—funny word when talking complex numbers—that E_y and E_zare the real and imaginary part of some wavefunction, which we’ll denote as ψ_E = eⁱ^{(kx − ωt)}. So now we can write:

E = (E_y, E_z) = E_y + i·E_z= cos(kx − ωt) + i∙sin(kx − ωt) = Re(ψ_E) + i·Im(ψ_E)

What’s k and ω? Don’t worry about it—for the moment, that is. We’ve done nothing special here. In fact, we’re used to representing waves as some sine or cosine function, so that’s what we are doing here. Nothing more. Nothing less. We just need two sinusoids because of the circular polarization of our electromagnetic wave.

What’s next? Well… If ψ_E is a regular wavefunction, then we should be able to check if it’s a solution to Schrödinger’s equation. So we should be able to write:

Re(∂ψ_E/∂t) = −(ħ/2m)·Im(∇²ψ_E)
Im(∂ψ_E/∂t) = (ħ/2m)·Re(∇²ψ_E)

Are we? How does that work? The time derivative on the left-hand side is equal to:

∂ψ_E/∂t = −iω·eⁱ^{(kx − ωt)}= −iω·[cos(kx − ωt) + i·sin(kx − ωt)] = ω·sin(kx − ωt) − iω·cos(kx − ωt)

The second-order derivative on the right-hand side is equal to:

∇²ψ_E= ∂²ψ_E/∂x²= −k²·eⁱ^{(kx − ωt)}= −k²·cos(kx − ωt) − ik²·sin(kx − ωt)

So the two equations above are equivalent to writing:

Re(∂ψ_E/∂t) = −(ħ/2m)·Im(∇²ψ_E) ⇔ ω·sin(kx − ωt) = k²·(ħ/2m)·sin(kx − ωt)
Im(∂ψ_E/∂t) = (ħ/2m)·Re(∇²ψ_E) ⇔ −ω·cos(kx − ωt) = −k²·(ħ/2m)·cos(kx − ωt)

Both conditions are fulfilled if, and only if, ω = k²·(ħ/2m). Now, assuming we measure time and distance in equivalent units (c = 1), we can calculate the phase velocity of the electromagnetic wave as being equal to c = ω/k = 1. We also have the de Broglie equation for the matter-wave, even if we’re not quite sure whether or not we should apply that to an electromagnetic wave. In any case, the de Broglie equation tells us that k = p/ħ. So we can re-write this condition as:

ω/k = 1 = k·(ħ/2m) = (p/ħ)·(ħ/2m) = p/2m ⇔ p = 2m ⇔ m = p/2

So that’s different from the E = m = p equality we imposed when discussing the wavefunction of the zero-mass particle: we’ve got that 1/2 factor which bothered us so much once again! And it’s causing us the same trouble: how do we interpret that m = p/2 equation? It leads to nonsense once more! E = m·c²= m, but E is also supposed to be equal to p·c = p. Here, however, we find that E = p/2! We also get strange results when calculating the group and phase velocity. So… Well… What’s going on here?

I am not quite sure. It’s that damn 1/2 factor. Perhaps it’s got something to do with our definition of mass. The m in the Schrödinger equation was referred to as the effective or reduced mass of the electron wavefunction that it was supposed to model. Now that concept is something funny: it sure allows for some gymnastics, as you’ll see when going through the Wikipedia article on it! I promise I’ll dig into it—but not now and here, as I’ve got no time for that. 😦

However, the good news is that we also get a magnetic field vector with an electromagnetic wave: B. We know B is always orthogonal to E, and in the direction that’s given by the right-hand rule for the vector cross-product. Indeed, we can write B as B = e_x×E/c, with e_x the unit vector pointing in the x-direction (i.e. the direction of propagation), as shown below.

E and b

So we can do the same analysis: we just substitute E for B everywhere, and we’ll find the same condition: m = p/2. To distinguish the two wavefunctions, we used the E and B subscripts for our wavefunctions, so we wrote ψ_Eand ψ_B. We can do the same for that m = p/2 condition:

m_E= p_E/2
m_B= p_B/2

Should we just add m_Eand m_E to get a total momentum and, hence, a total energy, that’s equal to E = m = p for the whole wave? I believe we should, but I haven’t quite figured out how we should interpret that summation!

So… Well… Sorry to disappoint you. I haven’t got the answer here. But I do believe my instinct tells me the truth: the wavefunction for an electromagnetic wave—so that’s the wavefunction for a photon, basically—is essentially the same as our wavefunction for a zero-mass particle. It’s just that we get two wavefunctions for the price of one. That’s what distinguishes bosons from fermions! And so I need to figure out how they differ exactly! And… Well… Yes. That might take me a while!

In the meanwhile, we should play some more with those E and B vectors, as that’s going to help us to solve the riddle—no doubt!

Fiddling with E and B

The B = e_x×E/c equation is equivalent to saying that we’ll get B when rotating E by 90 degrees which, in turn, is equivalent to multiplication by the imaginary unit i. Huh? Yes. Sorry. Just google the meaning of the vector cross product and multiplication by i. So we can write B = i·E, which amounts to writing:

B = i·E = eⁱ^(π/2)·eⁱ^{(kx − ωt)} = eⁱ^{(kx − ωt + π/2)} = cos(kx − ωt + π/2) + i·sin(kx − ωt + π/2)

So we can now associate a wavefunction ψ_B with the field magnetic field vector B, which is the same wavefunction as ψ_E except for a phase shift equal to π/2. You’ll say: so what? Well… Nothing much. I guess this observation just concludes this long digression on the wavefunction of a photon: it’s the same wavefunction as that of a zero-mass particle—except that we get two for the price of one!

It’s an interesting way of looking at things. Let’s look at the equations we started this post with, i.e. Maxwell’s equations in free space—i.e. no stationary charges, and no currents (i.e. moving charges) either! So we’re talking those ∂B/∂t = –∇×E and ∂E/∂t = ∇×B equations now.

Note that they actually give you four equations, because they’re vector equations:

∂B/∂t = –∇×E ⇔ ∂B_y/∂t = –(∇×E)_y and ∂B_z/∂t = –(∇×E)_z
∂E/∂t = ∇×B ⇔ ∂E_y/∂t = (∇×B)_y and ∂E_z/∂t = (∇×B)_z

To figure out what that means, we need to remind ourselves of the definition of the curl operator, i.e. the ∇× operator. For E, the components of ∇×E are the following:

(∇×E)_z = ∇_xE_y– ∇_yE_x= ∂E_y/∂x – ∂E_x/∂y
(∇×E)_x = ∇_yE_z– ∇_zE_y= ∂E_z/∂y – ∂E_y/∂z
(∇×E)_y = ∇_zE_x– ∇_xE_z= ∂E_x/∂z – ∂E_z/∂x

So the four equations above can now be written as:

∂B_y/∂t = –(∇×E)_y = –∂E_x/∂z + ∂E_z/∂x
∂B_z/∂t = –(∇×E)_z = –∂E_y/∂x + ∂E_x/∂y
∂E_y/∂t = (∇×B)_y = ∂B_x/∂z – ∂B_z/∂x
∂E_z/∂t = (∇×B)_z= ∂B_y/∂x – ∂B_x/∂y

What can we do with this? Well… The x-component of E and B is zero, so one of the two terms in the equations simply disappears. We get:

∂B_y/∂t = –(∇×E)_y = ∂E_z/∂x
∂B_z/∂t = –(∇×E)_z = – ∂E_y/∂x
∂E_y/∂t = (∇×B)_y = – ∂B_z/∂x
∂E_z/∂t = (∇×B)_z= ∂B_y/∂x

Interesting: only the derivatives with respect to x remain! Let’s calculate them:

∂B_y/∂t = –(∇×E)_y = ∂E_z/∂x = ∂[sin(kx − ωt)]/∂x = k·cos(kx − ωt) = k·E_y
∂B_z/∂t = –(∇×E)_z = – ∂E_y/∂x = – ∂[cos(kx − ωt)]/∂x = k·sin(kx − ωt) = k·E_z
∂E_y/∂t = (∇×B)_y = – ∂B_z/∂x = – ∂[sin(kx − ωt + π/2)]/∂x = – k·cos(kx − ωt + π/2) = – k·B_y
∂E_z/∂t = (∇×B)_z= ∂B_y/∂x = ∂[cos(kx − ωt + π/2)]/∂x = − k·sin(kx − ωt + π/2) = – k·B_z

What wonderful results! The time derivatives of the components of B and E are equal to ±k times the components of E and B respectively! So everything is related to everything, indeed! 🙂

Let’s play some more. Using the cos(θ + π/2) = −sin(θ) and sin(θ + π/2) = cos(θ) identities, we know that B_y and B_z= sin(kx − ωt + π/2) are equal to:

B_y= cos(kx − ωt + π/2) = −sin(kx − ωt) = −E_z
B_z= sin(kx − ωt + π/2) = cos(kx − ωt) = E_y

Let’s calculate those derivatives once more now:

∂B_y/∂t = −∂E_z/∂t = −∂sin(kx − ωt)/∂t = ω·cos(kx − ωt) = ω·E_y
∂B_z/∂t = ∂E_y/∂t = ∂cos(kx − ωt)/∂t = −ω·sin(kx − ωt) = −ω·E_z

This result can, obviously, be true only if ω = k, which we assume to be the case, as we’re measuring time and distance in equivalent units, so the phase velocity is c = 1 = ω/k.

Hmm… I am sure it won’t be long before I’ll be able to prove what I want to prove. I just need to figure out the math. It’s pretty obvious now that the wavefunction—any wavefunction, really—models the flow of energy. I just need to show how it works for the zero-mass particle—and then I mean: how it works exactly. We must be able to apply the concept of the Poynting vector to wavefunctions. We must be. I’ll find how. One day. 🙂

As for now, however, I feel we’ve played enough with those wavefunctions now. It’s time to do what we promised to do a long time ago, and that is to use Schrödinger’s equation to calculate electron orbitals—and other stuff, of course! Like… Well… We hardly ever talked about spin, did we? That comes with huge complexities. But we’ll get through it. Trust me. 🙂

The photon wavefunction

Fiddling with E and B

Published by Jean Louis Van Belle

4 thoughts on “The photon wavefunction”

Leave a comment Cancel reply

Fiddling with E and B

Share this:

Related

Published by Jean Louis Van Belle

4 thoughts on “The photon wavefunction”

Leave a comment Cancel reply