The Hamiltonian of matter in a field

In this and the next post, I want to present some essential discussions in Feynman’s 10th, 11th and 12th Lectures on Quantum Mechanics. This post in particular will actually present the Hamiltonian for the spin state of an electron, but the discussion is much more general than that: it’s a model for any spin-1/2 particle, i.e. for all elementary fermions—so that’s the ‘matter-particles’ which you know: electrons, protons and neutrons. Or, taking into account protons and neutrons consists of quarks, we should say quarks, which also have spin 1/2. So let’s go for it. Let me first, by way of introduction, remind you of a few things.

What is it that we are trying to do?

That’s always a good question to start with. 🙂 Just for fun, and as we’ll be talking a lot about symmetries and directions in space, I’ve inserted an animation below of a four-dimensional object, as its author calls it. This ‘object’ returns to its original configuration after a rotation of 720 degrees only (after 360 degrees, the spiral flips between clockwise and counterclockwise orientations, so it’s not the same). For some rather obscure reason 🙂 he refers to it as a spin-1/2 particle, or a spinor.

Spin_One-Half_(Slow)

Are spin one-half particles, like an electron or a proton, really four-dimensional? Well… I guess so. All depends, of course, on your definition or concept of a dimension. 🙂 Indeed, the term is as well – I should say, as badly, really – defined as the ubiquitous term ‘vector’ and so… Well… Let me say that spinors are usually defined in four-dimensional vector spaces, indeed. […] So is this what it’s all about, and should we talk about spinors?

Not really. Feynman doesn’t push the math that far, so I won’t do that either. 🙂 In fact, I am not sure why he’s holding back here: spinors are just mathematical objects, like vectors or tensors, which we introduced in one of our posts on electromagnetism, so why not have a go at it? You’ll remember that our electromagnetic tensor was like a special vector cross-product which, using the four-potential vector Aμ and the ∇μ = (∂/∂t, −∂/∂x, −∂/∂y, −∂/∂z) operator, we could write as (∇μAμ) − (∇μAμ)T.

Huh? Hey! Relax! It’s a matrix equation. It looks like this:

matrix

In fact, I left out above, and so we should plug it in, remembering that B’s magnitude is 1/c times E’s magnitude. So the electromagnetic tensor – in one of its many forms at least – is the following matrix:

electromagnetic tensor final

Why do we need a beast like this? Well… Have a look at the mentioned post or, better, one of the subsequent posts: we used it in very powerful equations (read: very concise equations, because that’s what mathematicans, and physicists, like) describing the dynamics of a system. So we have something similar here: what we’re trying to describe the dynamics of a quantum-mechanical system in terms of the evolution of its state, which we express as a linear combination of ‘pure’ base states, which we wrote as:

|ψ〉 = |1〉C|2〉C= |1〉〈1|ψ〉 + |2 〉〈2|ψ〉

C1 and C2 are complex-valued wavefunctions, or amplitudes as we call them, and the dynamics of the system are captured in a set of differential equations, which we wrote as:

System

The trick was to know or guess our Hamiltonian, i.e. we had to know or, more likely, guess those Hij coefficients (and then find experiments to confirm our guesses). Once we got those, it was a piece of cake. We’d solve for C1 and C2, and then take their absolute square so as to get probability functions. like the ones we found for our ammonia (NH3) molecule: P1(t) = |C1(t)|2 = cos2[(A/ħ)·t] and P2(t) = |C2(t)|= sin2[(A/ħ)·t]. They say that, if we would take a measurement, then the probability of finding the molecule in the ‘up’ or ‘down’ state (i.e. state 1 versus state 2) varies as shown:

graph

So here we are going to generalize the analysis: rather than guessing, or assuming we know them (from experiment, for example, or because someone else told us so), we’re going to calculate what those Hamiltonian coefficients are in general.

Now, returning to those spinors, it’s rather daunting to think that such a simple thing as being in the ‘up’ or ‘down’ condition has to be represented by some mathematical object that’s at least as complicated as these tensors. But… Well… I am afraid that’s the way it is. Having said that, Feynman himself seems to consider that’s math for graduate students in physics, rather than the undergraduate public for which he wrote the course. Hence, while he presented all of the math in the Lecture Volume on electromagnetism, he keeps things as simple as possible in the Volume on quantum mechanics. So… No. We will not be talking about spinors here.

The only reason why I started out with that wonderful animation is to remind you of the weirdness of quantum mechanics as evidenced by, for example, the fact I almost immediately got into trouble when trying to associate base states with two-dimensional geometric vectors when writing my post on the hydrogen molecule, or when thinking about the magnitude of the quantum-mechanical equivalent of the angular momentum of a particle (see my post on spin and angular momentum).

Thinking of that, it’s probably good to remind ourselves of the latter discussion. If we denote the angular momentum as J, then we know that, in classical mechanics, any of J‘s components Jx, Jy or Jz, could take on any value from +J to −J and, therefore, the maximum value of any component of J – say Jz – would be equal to J. To be precise, J would be the value of the component of J in the direction of J itself. So, in classical mechanics, we’d write: |J| = +√(J·J) = +√JJ, and it would be the maximum value of any component of J.

However, in quantum mechanics, that’s not the case. If the spin number of J is j, then the maximum value of any component of J is equal to j·ħ. In this case, the spin number will be either +1/2 or −1/2. So, naturally, one would think that J, i.e. the magnitude of J, would be equal to J = |J| = +√(J·J) = +√J= j·ħ = ħ/2. But that’s not the case: J = |J| ≠ j·ħ = ħ/2. To calculate the magnitude, we need to calculate J= Jx+ Jy+ Jz2. So the idea is to measure these repeatedly and use the expected value for Jx2, Jy2 and Jz2 in the formula. Now, that’s pretty simple: we know that Jx, Jy or Jz are equal to either +ħ/2 or −ħ/2, and, in the absence of a field (i.e. in free space), there’s no preference, so both values are equally likely. To make a long story short, the expected value of Jx2, Jy2 and Jz2 is equal to (1/2)·(ħ/2)+ (1/2)·(−ħ/2)= ħ2/4, and J= 3·ħ2/4 = j(j+1)ħ, with j = 1/2. So J = |J| = +√J= √(3·ħ2/4) = √3·(ħ/2) ≈ 0.866·ħ. Now that’s a huge difference as compared to ħ/2 = ħ/2.

What we’re saying here is that the magnitude of the angular momentum is √3 ≈ 1.7 times the maximum value of the angular momentum in any direction. How is that possible? Thinking classically, this is nonsensical. However, we need to stop thinking classically here: it means that, when we’re atomic or sub-atomic particles, their angular momentum is never completely in one direction. This implies we need to revise our classical idea of an oriented (electric or magnetic) moment: to put it simply, we find it’s never in one direction only! Alternatively, we might want to re-visit our concept of direction itself, but then we do not want to go there: we continue to say we’re measuring this or that quantity in this or that direction. Of course we do! What’s the alternative? There’s none. You may think we didn’t use the proper definition of the magnitude of a quantity when calculating J as √3·(ħ/2), but… Well… You’ll find yourself alone with that opinion. 🙂

This weird thing really comes with the experimental fact that, if you measure the angular momentum, along any axis, you’ll find it is always an integer or half-integer times ħ. Always! So it comes with the experimental fact that energy levels are discrete: they’re separated by the quantum of energy, which is ħ, and which explains why we have the 1/ħ factor in all coefficients in the coefficient matrix for our set of differential equations. The Hamiltonian coefficients represent energies indeed, and so we’ll want to measure them in units of ħ.

Of course, now you’ll wonder: why the −i? I wish I could you a simple answer here, like: “The −factor corresponds to a rotation by −π/2, and that’s the angle we use to go from our ‘up’ and ‘down’ base states to the ‘Uno‘ and ‘Duo‘ (I and II) base states.” 🙂 Unfortunately, this easy answer isn’t the answer. :-/ I need to refer you to my post on the Hamiltonian: the true answer is that it’s got to do with the in the e(i/ħ)·(E·t − pxfunction: the E, i.e. the energy, is real – most of the time, at least 🙂 – but the wavefunction is what it is: a complex exponential. So… Well…

Frankly, that’s more than enough as an introduction. You may want to think about the imaginary momentum of virtual particles here – i.e. ‘particles’ that are being exchanged as part of a ‘state switch’ –  but then we’d be babbling for hours! So let’s just do what we wanted to do here, and that is to find the Hamiltonian for a spin one-half particle in general, so that’s usually in some field, rather than in free space. 🙂

So here we go. Finally! 🙂

The Hamiltonian of a spin one-half particle in a magnetic field

We’ve actually done some really advanced stuff already. For example, when discussing the ammonia maser, we agreed on the following Hamiltonian in order to make sense of what happens inside of the maser’s resonant cavity:

states

State 1 was the state with the ‘upper’ energy E0 + με, as the energy that’s associated with the electric dipole moment of the ammonia molecule was added to the (average) energy of the system (i.e. E0). State 2 was the state with the ‘lower’ energy level E0 − με, implying the electric dipole moment is opposite to that of state 1. The field could be dynamic or static, i.e. varying in time, or not, but it was the same Hamiltonian. Of course, solving the differential equations with non-constant Hamiltonian coefficients was much more difficult, but we did it.

We also have a “flip-flop amplitude” – I am using Feynman’s term for it 🙂 – in that Hamiltonian above. So that’s an amplitude for the system to go from one state to another in the absence of an electric field. For our ammonia molecule, and our hydrogen molecule too, it was associated with the energy that’s needed to tunnel through a potential barrier and, as we explained in our post on virtual particles, that’s usually associated with a negative value for the energy or, what amounts to the same, with a purely imaginary momentum, so that’s why we write minus A in the matrix. However, don’t rack your brain over this as it is a bit of convention, really: putting +A would just result in a phase difference for the amplitudes, but it would give us the same probabilities. If it helps you, you may also like to think of our nitrogen atom (or our electron when we were talking the hydrogen system) as borrowing some energy from the system so as to be able to tunnel through and, hence, temporarily reducing the energy of the system by an amount that’s equal to A. In any case… We need to move on.

As for these probabilities, we could see – after solving the whole thing, of course (and that was very complicated, indeed) – that they’re going up and down just like in that graph above. The only difference was that we were talking induced transitions here, and so the frequency of the transitions depended on με0, i.e. on the strength of the field, and the magnitude of the dipole moment itself of course, rather than on A. In fact, to be precise, we found that the ratio between the average periods was equal to:

Tinduced/Tspontaneous = [(π·ħ)/(2με0)]/[(π·ħ)/(2A)] = A/με0

But… Well… I need to move on. I just wanted to present the general philosophy behind these things. For a simple electron which, as you know, is either in a ‘up’ or a ‘down’ state – vis-á-vis a certain direction, of course – the Hamiltonian will be very simple. As usual, we’ll assume the direction is that z-direction. Of course, this ‘z-direction” is just a short-hand for our reference frame: we decide to measure something in this or that direction, and we call that direction the z-direction.

Fine. Next. As our z-direction is currently our reference direction, we assume it’s the direction of some magnetic field, which wel’ll write as B. So the components of B in the x– and y-direction are zero: all of the field is in the z-direction, so B = Bz. [Note that the magnetic field is not some quantum-mechanical quantity, and so we can have all of the magnitude in one direction. It’s just a classical thing.]

Fine. Next. The spin or the angular momentum of our electron is, of course, associated with some magnetic dipole moment, which we’ll write as μ. [And, yes, sometimes we use this symbol for an electric dipole moment and, at other times, for a magnetic dipole moment, like here. I can’t help that. You don’t want a zillion different symbols anyway.] Hence, just like we had two energy levels E0 ± με, we’ll now have two energy levels E0 ± μBz. We’ll just shift the energy scale so E0 = 0, so that’s as per our convention. [Feynman glosses over it, but this is a bit of a tricky point, really. Usually, one includes the rest mass, or rest energy, in the E in the argument of the wavefunction, but so here we’re equating m0 c2 with zero. Tough! However, you can think of this re-definition of the zero energy points as a phase shift in all wavefunctions, so it shouldn’t matter when taking the absolute square or looking at interference. Still… Think about it.]

Fine. Next. Well… We’ve got two energy levels, +μBz and +μBz, but no A to put in our Hamiltonian, so the following Hamiltonian may or may not make sense:

electron

Hmm… Why is there no flip-flop amplitude? Well… You tell me. Why would we have one? It’s not like the ammonia or hydrogen molecule here, so… Well… Where’s the potential barrier? Of course, you’ll now say that we can imagine it takes some energy to change the spin of an electron, like we were doing with those induced transitions. But… Yes and no. We’ve been selecting particles using our Stern-Gerlach apparatus, or that state selector for our maser, but were we actually flip-flopping things? The changing electric field in our resonant cavity is changes the transition frequency but, when everything is said and done, the transition itself has to do with that A. You’ll object again: a pure stationary state? So the electron is either ‘up’ or ‘down’, and it stays like that foreverReally?

Well… I am afraid I have to cut you off, because otherwise we’ll never get to the end. Stop being so critical. 🙂 Well… No. You should be critical. However, you’re right in saying that, when everything is said and done, these are all hypotheses that may or may not make sense. However, Feynman is also right when he says that, ultimately, the proof of the pudding is in the eating: at the end of this long, winding story, we’ll get some solutions that can be tested in experiment: they should give predictions, or probabilities rather, that agree with experiment. As Feynman writes: “[The objective is to find] “equations of motion for the spin states” of an electron in a magnetic field. We guess at them by making some physical argument, but the real test of any Hamiltonian is that it should give predictions in agreement with experiment. According to any tests that have been made, these equations are right. In fact, although we made our arguments only for constant fields, the Hamiltonian we have written is also right for magnetic fields which vary with time.”

So let’s get on with it: let’s assume the Hamiltonian above is the one we should use for a magnetic field in the z-direction, and that we have those pure stationary states with the energies they have, i.e. −μBz and +μBz. One minor technical point, perhaps: you may wonder why we write what we write and do not switch −μBz and +μBz in the Hamiltonian—so as to reflect these ‘upper’ and ‘lower’ energies in those other Hamiltonians. The answer is: it’s just convention. We choose state 1 to be the ‘up’ state, so its spin is ‘up’, but the magnetic moment is opposite to the spin, so the ‘up’ state has the minus sign. Full stop. Onwards!

We’re now going to assume our B field is not in the z-direction. Hence, its Bx and By components are not zero. What we want to see now is how the Hamiltonian looks like. [Yes. Sorry for regularly reminding you of what it is that we are trying to do.] Here you need to be creative. Whatever the direction of the field, we need to be consistent. If that Hamiltonian makes sense, i.e. if we’d have two pure stationary states with the energies they have, if the field is in the z-direction, then it’s rather obvious that, if the field is in some other direction, we should still be able to find two stationary states with exactly the same energy levels. As Feynman puts it: “We could have chosen our z-axis in its direction, and we would have found two stationary states with the energies ±μBz. Just choosing our axes in a different direction doesn’t change the physics. Our description of the stationary states will be different, but their energies will still be ±μBz.” Right. And because the magnetic field is a classical quantity, the relevant magnitude is just the square root of the squares of its components, so we write:

formula 1So we have the energies now, but we want the Hamiltonian coefficients. Here we need to work backwards. The general solution for any system with constant Hamiltonian coefficients always involves two stationary states with energy levels which we denoted as Eand EII, indeed. Let me remind you of the formula for them:

energies

[If you want to double-check and see how we get those, it’s probably best to check it in the original text, i.e. Feynman’s Lecture on the Ammonia Maser, Section 2.]

So how do we connect the two sets of equations? How do we get the Hij coefficients out of these square roots and all of that? [Again. I am just reminding you of what it is that we are trying to do.] We’ve got two equations and four coefficients, so… Well… There’s some rules we can apply. For example, we know that any Hij coefficient must equal Hji*, i.e. complex conjugate of Hji. [However, I should add that’s true only if i ≠ j.] But… Hey! We can already see that H11 must be equal to minus H22. Just compare the two sets. That comes out as a condition, clearly. Now that simplifies our square roots above significantly. Also noting that the absolute square of a complex number is equal to the product of the number with its complex conjugate, the two equations above imply the following:

formula 2

Let’s see what this means if we’d apply this to our ‘special’ direction once more, so let’s assume the field is in the z-direction once again. Perhaps we can some more ‘conditions’ out of that. If the field is in the z-direction itself, the equation above reduces to:

formula 3

That makes it rather obvious that, in this special case, at least, |H12|2 = 0. You’ll say: that’s nothing new, because we had those zeroes in that Hamiltonian already. Well… Yes and no! Here we need to introduce another constraint. I’ll let Feynman explain it: “We are going to make an assumption that there is a kind of superposition principle for the terms of the Hamiltonian. More specifically, we want to assume that if two magnetic fields are superposed, the terms in the Hamiltonian simply add—if we know the Hij for a pure Band we know the Hij for a pure Bx, then the Hij for a both Band Btogether is simply the sum. This is certainly true if we consider only fields in the z-direction—if we double Bz, then all the Hij are doubled. So let’s assume that H is linear in the field B.”

Now, the assumption that H12 must be some linear combination of Bx, Band Bz, combined with the |H12|2 = 0 condition when all of the magnitude of the field is in the z-direction, tells us that H12 has no term in Bz. It may have – in fact, it probably should have – terms in Bx and By, but not in Bz. That does take us a step further.

Next assumption. The next assumption is that, regardless of the direction of the field, H11 and H22 don’t change: they remain what they are, so we write: H11 = −μBz and H22 = +μBz. Now, you may think that’s no big deal, because we defined the 1 and 2 states in terms of our z-direction, but… Well… We did so assuming all of the magnitude was in the z-direction.

You’ll say: so what? Now we’ve got some field in the x– and y-directions, so that shouldn’t impact the amplitude to be in a state that’s associated with the z-direction. Well… I should say two things here. First, we’re not talking about the amplitude to be in state 1 or state 2. These amplitudes are those C1 and Cfunctions that we can find once we’ve got those Hamiltonian coefficients. Second, you’d surely expect that some field in the x– and y-directions should have some impact on those C1 and Cfunctions. Of course!

In any case, I’ll let you do some more thinking about this assumption. Again, we need to move on, so let’s just go along with it. At this point, Feynman‘s had enough of the assumptions, and so he boldly proposes a solution, which incorporates that the H11 = −μBz and H22 = +μBz assumption. Let me quote him:

Formula 4

Of course, this leaves us gasping for breath. A simple guess? One can plug it in, of course, and see it makes sense—rather quickly, really. But… Nothing linear is going to come out of that expression for |H12|2, right? We’ll have to take a square root to find that H12 = ±μ·(Bx+ By2)1/2. Well… No. We’re working in the complex space here, remember? So we can use complex solutions. Feynman notes the same and immediately proposes the right solution:

final 1

To make a long story, we get what we wanted, i.e. those “equations of motion for the spin states” of an electron in a magnetic field. I’ll let Feynman summarize the results:

Final 3

It’s truly a Great Result, especially because, as Feynman notes, (almost) any problem about two-state systems can be solved by making a mathematical analog to the system of the spinning electron. We’ll illustrate that as we move ahead. For now, however, I think we’ve had enough, isn’t it? 🙂

We’ve made a big leap here, and perhaps we should re-visit some of the assumptions and conventions—later, that is. As for now, let’s try to work with it. As mentioned above, Feynman shied away from the grand mathematical approach to it. Indeed, the whole argument might have been somewhat fuzzy, but at least we got a good feel for the solution. In my next post, I’ll abstract away from it, as Feynman does in his next Lecture, where he introduces the so-called Pauli spin matrices, which are like Lego building blocks for all of the matrix algebra which – I must assume you sort of sense that’s coming, no? 🙂 – we’ll need to master so as to understand what’s going on.

So… That’s it for today. I hope you understood “what it is that we’re trying to do”, and that you’ll have some fun working on it on your own now. 🙂

One thought on “The Hamiltonian of matter in a field

  1. Pingback: Pauli’s spin matrices | Reading Feynman

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s