When Decay Statistics Become Ontology

Or: why the Standard Model feels so solid — and yet so strangely unsatisfying

I recently put a new paper online: A Taxonomy of Instability. It is, in some sense, a “weird” piece. Not because it proposes new particles, forces, or mechanisms — it does none of that — but because it deliberately steps sideways from the usual question:

What are particles made of?

and asks instead:

How do unstable physical configurations actually fail?

This shift sounds modest. In practice, it leads straight into a conceptual fault line that most of us sense, but rarely articulate.

What is actually being classified in particle physics?

The Standard Model is extraordinarily successful. That is not in dispute. It predicts decay rates, cross sections, and branching fractions with astonishing precision. It has survived decades of experimental scrutiny.

But it is worth noticing what it is most directly successful at describing:

lifetimes,
branching ratios,
observable decay patterns.

In other words: statistics of instability.

Yet when we talk about the Standard Model, we almost immediately slide from that statistical success into an ontological picture: particles as entities with intrinsic properties, decaying “randomly” according to fundamental laws.

That slide is so familiar that it usually goes unnoticed.

The quiet assumption we almost never examine

Consider how decay is presented in standard references (PDG tables are the cleanest example). For a given unstable particle, we are shown:

a list of decay “channels”,
each with a fixed branching fraction,
averaged over production mechanisms, environments, and detectors.

Everything contextual has been stripped away.

What remains is treated as intrinsic.

And here is where a subtle but radical assumption enters:

The same unstable particle is taken to be capable of realizing multiple, structurally distinct decay reactions, with no further individuation required.

This is not an experimental result.
It is an interpretive stance.

As long as one stays in calculational mode, this feels unproblematic. The formalism works. The predictions are right.

The discomfort only arises when one asks a very basic question:

If all environment variables are abstracted away, what exactly is it that is decaying?

Statistical determinism sharpens the problem

Decay statistics are not noisy or unstable. They are:

reproducible,
environment-independent (within stated limits),
stable across experiments.

That makes them look law-like.

But law-like behavior demands clarity about what level of description the law applies to.

There are two logically distinct possibilities:

Intrinsic multivalence
A single physical entity genuinely has multiple, mutually exclusive decay behaviors, realized stochastically, with no deeper individuation.
Hidden population structure
What we call “a particle” is actually an equivalence class of near-identical configurations, each with a preferred instability route, unresolved by our current classification.

The Standard Model chooses option (1) — implicitly, pragmatically, and very effectively.

But nothing in the data forces that choice.

Why this can feel like being “duped”

Many people only experience discomfort after they start thinking carefully about what the Standard Model is claiming to describe.

The sense of being “duped” does not come from experimental failure — it comes from realizing that a philosophical commitment was made silently, without being labeled as such.

Probability, in this framework, is not treated as epistemic (what we don’t know), but as ontologically primitive (what is). Identity is divorced from behavior. The ensemble description quietly replaces individual determinism.

This is a perfectly legitimate move — but it is a move.

And it has a cost.

What my taxonomy does — and does not — claim

A Taxonomy of Instability does not propose new physics. It does not challenge the predictive success of the Standard Model. It does not deny quantum mechanics.

What it does is much quieter:

it treats decay landscapes, not particles, as the primary objects of classification;
it groups unstable configurations by how they fail, not by assumed internal structure;
it keeps the description strictly operational: lifetimes, observable final states, branching structure.

In doing so, it exposes something we usually gloss over:

Treating statistically distinct instability morphologies as attributes of a single identity is already an ontological decision.

Once that decision is made explicit, it becomes optional rather than compulsory.

Why this feels “weird” — and why that’s a good sign

The paper feels strange because it does not do what most theoretical work does:

it does not explain,
it does not unify,
it does not speculate about deeper mechanisms.

Instead, it asks whether our classification layer has quietly hardened into ontology.

That kind of question always feels uncomfortable, because it sits between theory and philosophy, and because it removes a tacit compromise rather than proposing a new belief.

But it is also the kind of question that matters precisely when a theory works extremely well.

A broader resonance (human and artificial)

There is an additional reason this question feels timely.

Modern AI systems are, at their core, pattern classifiers and compressors. They turn data into “things” by grouping outcomes under labels. Ontologies emerge automatically unless we are careful.

Seen from that angle, particle physics is not an outlier — it is an early, highly successful example of how statistical regularities become reified as entities.

The taxonomy I propose is not only about particles. It is about how thinking systems — human or artificial — turn data into objects.

A calm conclusion

The Standard Model is an extraordinarily successful theory of decay statistics. Its difficulties are not primarily empirical, but philosophical.

Those difficulties arise only when we forget that:

classification is not explanation,
identity is not forced by statistics,
and ontology is not delivered for free by predictive success.

My hope is not to replace any existing framework, but to invite both human readers and artificial “thinking machines” to pause and ask again:

What is being measured — and what, exactly, are we saying exists?

Sometimes, the most productive form of progress is not adding a new layer, but noticing where an old one quietly became invisible.

Statistical mechanics re-visited

Quite a while ago – in June and July 2015, to be precise – I wrote a series of posts on statistical mechanics, which included digressions on thermodynamics, Maxwell-Boltzmann, Bose-Einstein and Fermi-Dirac statistics (probability distributions used in quantum mechanics), and so forth. I actually thought I had sort of exhausted the topic. However, when going through the documentation on that Stern-Gerlach experiment that MIT undergrad students need to analyze as part of their courses, I realized I did actually not present some very basic formulas that you’ll definitely need in order to actually understand that experiment.

One of those basic formulas is the one for the distribution of velocities of particles in some volume (like an oven, for instance), or in a particle beam – like the beam of potassium atoms that is used to demonstrate the quantization of the magnetic moment in the Stern-Gerlach experiment. In fact, we’ve got two formulas here, which are subtly – as subtle as the difference between v (boldface, so it’s a vector) and v (lightface, so it’s a scalar) 🙂 – but fundamentally different:

velocity-distribution

Both functions are referred to as the Maxwell-Boltzmann density distribution, but the first distribution gives us the density for some v in the velocity space, while the second gives us the distribution density of the absolute value (or modulus) of the velocity, so that is the distribution density of the speed, which is just a scalar – without any direction. As you can see, the second formula includes a 4π·v² factor.

The question is: how are these formulas related to Boltzmann’s f(E) = C·e^−energy/kT Law? The answer is: we can derive all of these formulas – for the distribution of velocities, or of momenta – by clever substitutions. However, as evidenced by the two formulas above, these substitutions are not always straightforward. So let me quickly show you a few things here.

First note the two formulas above already include the e^−energy/kTfunction if we equate the energy E with the kinetic energy: E = K.E. = m·v²/2. Of course, if you’ve read those June-July 2015 posts, you’ll note that we derived Boltzmann’s Law in the context of a force field, like gravity, or an electric potential. For example, we wrote the law for the density (n = N/V) of gas in a gravitational field (like the Earth’s atmosphere) as n = n₀·e^−P.E./kT. In this formula, we only see the potential energy: P.E. = m·g·h, i.e. the product of the mass (m), the gravitational constant (g), and the height (h). However, when we’re talking the distribution of velocities – or of momenta – then the kinetic energy comes into play.

So that’s a first thing to note: Boltzmann’s Law is actually a whole set of laws. For example, the frequency distribution of particles in a system over various possible states, also involves the same exponential function: F(state) ∝ e^−E/kT. E is just the total energy of the state here (which varies from state to state, of course), so we don’t distinguish between potential and kinetic energy here.

So what energy concept should we use in that Stern-Gerlach experiment? Because these potassium atoms in that oven – or when they come out of it in a beam – have kinetic energy only, our E = m·v²/2 substitution does the trick: we can say that the potential energy is taken to be zero, so that all energy is in the form of kinetic energy. So now we understand the e^{−m·v²/2kT} function in those f(v) and f(v) formulas. Now we only need to explain those complicated coefficients. How do we get these?

We get them through clever substitutions using equations such as:

f_v(v)·dv = f_p(p)·dp

What are we writing here? We’re basically combining two normalization conditions: if f_v(v) and f_p(p) are proper probability density functions, then they must give us 1 when integrating over their domain. The domain of these two functions is, obviously, the velocity (v) and momentum (p) space. The velocity and momentum space are the same mathematical space, but they are obviously not the same physical space. But the two physical spaces are closely related: p = m·v, and so it’s easy to do the required transformation of variables. For example, it’s easy to see that, if E = m·v²/2, then E is also equal to E = p²/2m.

However, when doing these substitutions, things get tricky. We already noted that p and v are vectors, unlike E, or p and v – which are scalars, or magnitudes. So we write: p = (p_x, p_y, p_z) and |p| = p, and v = (v_x, v_y, v _z) and |v| = v. Of course, you also know how we calculate those magnitudes:

magnitude

Note that this also implies the following: p·p = p²= p_x²+ p_y²+p_z²= p². Trivial, right? Yes. But have a look now at the following differentials:

d³p
dp
dp = d(p_x, p_y, p_z)
dp_x·dp_y·dp_z

Are these the same or not? Now you need to think, right? That d³p and dp are different beasts is obvious: d³p is, obviously, some infinitesimal volume, as opposed to dp, which is, equally obviously, an (infinitesimal) interval. But what volume exactly? Is it the same as that dp = d(p_x, p_y, p_z) volume, and is that the same as the dp_x·dp_y·dp_z volume?

Fortunately, the volume differentials are, in fact, the same – so you can start breathing again. 🙂 Let’s get going with that d³p notation for the time being, as you will find that’s the notation which is used in the Wikipedia article on the Maxwell-Boltzmann distribution – which I warmly recommend, because – for a change – it is a much easier read than other Wikipedia articles on stuff like this. Among other things, the mentioned article writes the following:

f_E(E)·dE = f_p(p)·d³p

What is this? Well… It’s just like that f_v(v)·dv = f_p(p)·dp equation: it combines the normalization condition for both distributions. However, it’s much more interesting, because, on the left-hand side, we multiply a density with an (infinitesimal) interval (dE), while on the right-hand side we multiply with an (infinitesimal) volume (d³p). Now, the (infinitesimal) energy interval dE must, obviously, correspond with the (infinitesimal) momentum volume d³p. So how does that work?

Well… The mentioned Wikipedia article talks about the “spherical symmetry of the energy-momentum dispersion relation” (that dispersion relation is just E = |p|²/2m, of course), but that doesn’t make us all that wiser, so let’s try a more heuristic approach. You might remember the formula for the volume of a spherical shell, which is simply the difference between the volume of the outer sphere minus the volume of the inner sphere: V = (4π/3)·R³− (4π/3)·r³= (4π/3)·(R³− r³). Now, for a very thin shell of thickness Δr, we can use the following first-order approximation: V = 4π·r²·Δr. In case you wonder, I hereby copy a nice explanation from the Physics Stack Exchange site:

approximation

Perfect. That’s all we need to know. We’ll use that first-order approximation to re-write d³p as:

d³p = dp = 4π·|p|²·d|p| = 4π·p²·dp

Note that we’ll have the same formula for d³v, of course: d³v = dv = 4π·|v|²·d|v| = 4π·v²·dv, and also note that we get that same 4π·v² factor which we mentioned when discussing the f(v) and f(v) formulas. That is not a coincidence, of course, but – as I’ll explain in a moment – it is not so easy to immediately relate the formulas. In any case, we’re now ready to relate dE and dp so we can re-write that d³p formula in terms of m, E and dE:

substitution-2

We are now – finally! – sufficiently armed to derive all of the formulas we want – or need. Let me just copy them from the mentioned Wikipedia article:

momenta

energy

velocity

As said, you’ll encounter these formulas regularly – and so it’s good that you know how you can derive them. Indeed, the derivation is very straightforward and is done in the same article: the tips I gave you should allow you to read it in a couple of minutes only. Only the density function for velocities might cause you a bit of trouble – but only for a very short moment: just use the p = m·v equation to write d³p as d³p = 4π·p²·dp = 4π·m²·v²·m·dv = 4π·m³·v²·dv = m³·d³v, and you’re all set. 🙂

Of course, you will recognize the formula for the distribution of velocities: it’s the f(v) we mentioned in the introduction. However, you’re more likely to need the f(v) formula (i.e. the probability density function for the speed) than the f(v) function. So how can we derive get the f(v) – i.e. that formula for the distribution of speeds, with the 4π·v² factor – from the f(v) formula?

Well… I wish I could give you an easy answer. In fact, the same Wikipedia article suggests it’s easy – but it’s not. It involves a transformation from Cartesian to polar coordinates: the volume element dv_x·dv_y·dv_z is to be written as v²·sinθ·dv·dθ·dφ. And then… Well… Have a look at this link. 🙂 It involves a so-called Jacobian transformation matrix. If you want to know more about it, then I recommend you read some article on how to transform distribution functions: here’s a link to one of those, but you can easily google others. Frankly, as for now, I’d suggest you just accept the formula for f(v) as for now. 🙂 Let me copy it from the same article in a slightly different form: density-formula Now, the final thing to note is that you’ll often want to use so-called normalized velocities, i.e. velocities that are defined as a v/v₀ ratio, with v₀the most probable speed, which is equal to √(2kT/m). You get that value by calculating the df(v)/dv derivative, and then finding the value v = v₀ for which df(v)/dv = 0. You should now be able to verify the formula that is used in the mentioned MIT version of the Stern-Gerlach experiment: mit-formula Indeed, when you write it all out – note that π/π^3/2= 1/√π 🙂 – you’ll see the two formulas are effectively equivalent. Of course, by now you are completely formula-ed out, and so you probably don’t even wonder what that f(v)·dv product actually stands for. What does it mean, really? Now you’ll sigh: why would I even want to know that? Well… I want you to understand that MIT experiment. 🙂 And you won’t if you don’t know what f(v)·dv actually represents. So think about it. […]

[…] OK. Let me help you once more. Remember the normalization condition once again: the integral of the whole thing – over the whole range of possible velocities – needs to add up to 1, so f(v)·dv is really the fraction of (potassium) atoms (inside the oven) with a velocity in the (infinitesimally small) dv interval. It’s going to be a tiny fraction, of course: just a tiny bit larger than zero. Surely not larger than 1, obviously. 🙂 Think of integrating the function between two values – say v₁ and v₂ – that are pretty close to each other.

So… Well… We’re done as for now. So where are we now in terms of understanding the calculations in that description of that MIT experiment? Well… We’ve got the meat. But we need a lot of other ingredients now. We’ll want formulas for the intensity of the beam at some point along the axis measuring its deflection from its main direction. That axis is the z-axis. So we’ll want a formula for some I(z) function.

Deflection? Yes. There are a lot of steps to go through now. Here’s the set-up: set-up First, we’ll need some formula measuring the flux of (potassium) atoms coming out of the oven. And then… Well… Just have a look and try to make your way through the whole thing now – which is just what I want to do in the coming days, so I’ll give you some more feedback soon. 🙂 Here I only wanted to introduce those formulas for the distribution of velocities and momenta, because you’ll need them in other contexts too.

So I hope you found this useful. Stuff like this all makes it somewhat more real, doesn’t it? 🙂 Frankly, I think the math is at least as fascinating as the physics. We could have a closer look at those distributions, for example, by noting the following:

1. The probability density function for the momenta is the product of three normal distributions. Which ones? Well… The distribution of p_x, p_y and p_z respectively: three normal distributions whose variance is equal to mkT. 🙂

2. The f_E(E) function is a chi-squared (χ²) distribution with 3 degrees of freedom. Now, we have the equipartition theorem (which you should know – if you don’t, see my post on it), which tells us that this energy is evenly distributed among all three degrees of freedom. It is then relatively easy to show – if you know something about χ² distributions at least 🙂 – that the energy per degree of freedom (which we’ll write as ε below) will also be distributed as a chi-squared distribution with one degree of freedom: chi-square-2 This holds true for any number of degrees of freedom. For example, a diatomic molecule will have extra degrees of freedom, which are related to its rotational and vibrational motion (I explained that in my June-July 2015 posts too, so please go there if you’d want to know more). So we can really use this stuff in, for example, the theory of the specific heat of gases. 🙂

3. The function for the distribution of the velocities is also a product of three independent normally distributed variables – just like the density function for momenta. In this case, we have the v_x, v_y and v_z variables that are normally distributed, with variance kT/m.

So… Well… I’m done – for the time being, that is. 🙂 Isn’t it a privilege to be alive and to be able to savor all these little wonderful intellectual excursions? I wish you a very nice day and hope you enjoy stuff like this as much as I do. 🙂

First Principles of Statistical Mechanics

Pre-script (dated 26 June 2020): This post has become less relevant (even irrelevant, perhaps) because my views on all things quantum-mechanical have evolved significantly as a result of my progression towards a more complete realist (classical) interpretation of quantum physics. I keep blog posts like these mainly because I want to keep track of where I came from. I might review them one day, but I currently don’t have the time or energy for it. 🙂

Original post:

Feynman seems to mix statistical mechanics and thermodynamics in his chapters on it. At first, I thought all was rather messy but, as usual, after re-reading it a couple of times, it all makes sense. Let’s have a look at the basics. We’ll start by talking about gases first.

The ideal gas law

The pressure P is the force we have to apply to the piston containing the gas (see below)—per unit area, that is. So we write: P = F/A. Compressing the gas amounts to applying a force over some (infinitesimal) distance dx. This will change the internal energy (U) of the gas by an infinitesimal amount dU. Hence, we can write:

dU = F·(−dx) = – P·A·dx = – P·dV

However, before looking at the dynamics, let’s first look at the stationary situation: let’s assume the volume of the gas does not change, and so we just have the gas atoms bouncing of the piston and, hence, exerting pressure on it. Every gas atom or particle delivers a momentum 2mv_xto the piston (the factor 2 is there because the piston does not bounce back, so there is no transfer of momentum). If there are N atoms in the volume N, then there are n = N/V in each unit volume. Of course, only the atoms within a distance v_x·t are going to hit the piston within the time t and, hence, the number of atoms hitting the piston within that time is n·A·v_x·t. Per unit time (i.e. per second), it’s n·A·v_x·t/t = n·A·v_x. Hence, the total momentum that’s being transferred per second is n·A·v_x·2mv_x.

So far, so good. Indeed, we know that the force is equal to the amount of momentum that’s being transferred per second. If you forget, just check the definitions and units: a force of 1 newton gives an mass of 1 kg an acceleration of 1 m/s per second, so 1 N = 1 kg·m/s²= 1 kg·(m/s)/s. [The kg·(m/s) unit is the unit of momentum (mass times velocity), obviously. So there we are.] Hence,

P = F/A = n·A·v_x·2mv_x/A = 2nmv_x²

Of course, we need to take an average 〈v_x²〉 here, and we should drop the factor 2 because half of the atoms/particles move away from the piston, rather than towards it. In short, we get:

P = F/A = nm〈v_x²〉

Now, the average velocity in the x-, y- and z-direction are all the same and uncorrelated, so 〈v_x²〉 = 〈v_y²〉 = 〈v_z²〉 = [〈v_x²〉 + 〈v_y²〉 + 〈v_z²〉]/3 = 〈v²〉/3. So we don’t worry about any direction and simply write:

P = F/A = (2/3)·n·〈m·v²/2〉

[As Feynman notes, the math behind this is not difficult but, at the same time, it is also less straightforward than one might expect.] The last factor is, obviously, the kinetic energy of the (center-of-mass) motion of the atom or particle. Multiplying by V gives:

P·V = (2/3)·N·〈m·v²/2〉 = (2/3)·U

[If this confuses you, note that n = N/V, so V = N/n.] Now, that’s not a law you’ll remember from your high school days because… Well… This U – the internal energy of a gas – how do you measure that? We should link it to a measure we do know, and that’s temperature. The atoms or molecules in a gas will have an average kinetic energy which we could define as… Well… That average should have been defined as the temperature but, for historical reasons, the scale of what we know as the ‘temperature’ variable (T) is different. We need to apply a conversion factor, which is usually written as k. In fact, the conversion factor will be (3/2)·k. The 3/2 factor has been thrown in here to get rid of it later (in a few seconds, that is). To make a long story short, we write the mean atomic or molecular energy as (3/2)·k·T = 3kT/2.

Now, you should also remember that we have three independent directions of motion. Hence, the kinetic energy associated with the component of motion in any of the three directions x, y or z is only 1/2 kT = (3kT/2)/3 = kT/2. [This seems trivial, but the idea of associating energy with some direction is actually quite fundamental.] Now, I said we’d get rid of that 3/2 factor. Indeed, applying the above-mentioned definition of temperature, we get:

P·V = (2/3)·N·〈m·v²/2〉 = (2/3)·N·3kT/2 = N·k·T

Now that is a formula you may or may not remember from your high school days! 🙂 The k factor is a constant of proportionality, which makes the units come out alright. The P·V = (2/3)·U formula tells us both sides of the equation must be expressed in joule (J), i.e. the dimension of energy. Now, N is a pure number, so our k in that N·k·T expression must be expressed in joule per degree (Kelvin). To be precise, k is (about) 1.38×10⁻²³joule for every degree Kelvin, so it’s a very tiny constant: it’s referred to as the Boltzmann constant and it’s usually denoted with a capital B as subscript (k_B). As for how the product of pressure and volume can (also) yield something in joule, you can work that out for yourself, remembering the definition of a joule. […] Well… OK. Let me do it for you: [P]·[V] = (N/m²)·m³ = N·m = J. 🙂

One immediate implication of the formula above is that gases at the same temperature and pressure, in the same volume, must consist of an equal number of atoms/molecules. You’ll say: of course – because you remember that from your high school classes. However, thinking about it some more – and also in light of what we’ll be learning a bit later on gases composed of more complex molecules (diatomic molecules, for example) – you’ll have to admit it’s not all that obvious as a result.

Now, the number of atoms/molecules is usually measured in moles: one mole (or mol) is 6.02×10²³units (more or less, that is). To be somewhat more precise, its CODATA value is 6.02214129(27)×10²³. That number is Avogadro’s number (or constant), after the Italian mathematical physicist Amedeo Avogadro – who stated that law above, which is referred to as Avogradro’s Law: gases at the same temperature and pressure, in the same volume, must consist of an equal number of atoms/molecules. Avogadro’s number is defined as the amount of any substance that contains as many elementary entities (e.g. atoms, molecules, ions or electrons) as there are atoms in 12 grams of pure carbon-12 (¹²C), the isotope of carbon with relative atomic mass of exactly 12 (also by definition). Avogadro’s constant is one of the base units in the International Systems of Units, usually denoted by N_A or – as Feynman does – N₀.

Now, if we reinterpret N as the number of moles, rather than the number of atoms, ions or molecules in a gas, we can re-write the same equation using the so-called universal or ideal gas constant, which is equal to R = (1.38×10⁻²³joule)×(6.02×10²³/mol) per degree Kelvin = 8.314 J·K⁻¹·mol⁻¹. In short, the ideal gas constant is the product of two other constants: the Boltzmann constant (k_B) and the Avogadro number (N₀). So we get:

P·V = N·R·T with N = no. of moles and R = k_B·N₀

As you can see, you need to watch out with all those different constants and notations in use.

The ideal gas law and internal motion

There’s an interesting and essential remark to be made in regard to complex molecules in a gas. A complex molecule is any molecule that is not mono-atomic. The simplest example of a complex molecule is a diatomic molecule, consisting of two atoms, which we’ll denote by A and B, with mass m_Aand m_Brespectively. A and B are together but are able to oscillate or move relative to one another. In short, we also have some internal motion here, in addition to the motion of the whole thing, which will also has some kinetic energy. Hence, the kinetic energy of the gas consists of two parts:

The kinetic energy of the so-called center-of-mass motion of the whole thing (i.e. the molecule), which we’ll denote by M = m_A+ m_B, and
The kinetic energy of the rotational and vibratory motions of the two atoms (A and B) inside the molecule.

We noted that for single atoms the mean value of the kinetic energy in one direction is kT/2 and that the total kinetic energy is 3kT/2, i.e. three times as much. So what do we have here? Well… The reasoning we followed for the single atoms is also valid for the diatomic molecule considered as a single body of total mass M and with some center-of-mass velocity v_CM. Hence, we can write that

M·v_CM²/2 = (3/2)·kT

So that’s the same, regardless of whether or not we’re considering the separate pieces or the whole thing. But let’s look at the separate pieces now. We need some vector analysis here, because A and B can move in separate directions, so we have v_Aand v_B(note the boldface used for vectors). So what’s the relation between v_Aand v_Bon the one hand, and v_CM on the other? The analysis is somewhat tricky here but – assuming that the v_Aand v_Brepresentations themselves are some idealization of the actual rotational and vibratory movements of the A and B atoms – we can write:

v_CM = (m_Av_A+ m_Bv_B)/M

Now we need to calculate 〈v_CM²〉, of course, i.e. the average velocity squared. I’ll refer you to Feynman for the details which, in the end, do lead to that M·v_CM²/2 = (3/2)·kT equation. The whole calculation depends on the assumption that the relative velocity w = v_A– v_Bis not any more likely to point in one direction than another, so its average component in any direction is zero. Indeed, the interim result is that

M·v_CM²/2 = (3/2)·kT + 2m_Am_B〈v_A·v_B〉/M

Hence, one needs to prove, somehow, that 〈v_A·v_B〉 is zero in order to get the result we want, which is what that assumption about the relative velocity w ensures. Now, we still don’t have the kinetic energy of the A and B parts of the molecule. Because A and B can move in all three directions in space, their average kinetic energy 〈m_A·v_A²/2〉 and 〈m_B·v_B²/2〉 is also 3·k·T/2. Now, adding 3·k·T/2 and 3·k·T/2 yields 3kT. So now we have what we wanted:

The kinetic energy of the center-of-mass motion of the diatomic molecule is (3/2)·k·T.
The total energy of the diatomic molecule is the sum of the energies of A and B, and so that’s 3·k·T/2 + 3·k·T/2 = 3 k·T.
The kinetic energy of the internal rotational and vibratory motions of the two atoms (A and B) inside the molecule is the difference, so that’s 3·k·T – (3/2)·k·T = (3/2)·k·T.

The more general result can be stated as follows:

A r-atom molecule in a gas will have a kinetic energy of (3/2)·r·k·T, on average, of which:
3/2·k·T is kinetic energy of the center-of-mass motion of the entire molecule,
The rest, (3/2)·(r−1)·k·T, is internal vibrational and rotational kinetic energy.

Another way to state is that, for an r-atom molecule, we find that the average energy for each ‘independent direction of motion’, i.e. for each degree of freedom in the system, is kT/2, with the number of degrees of freedom being equal to 3r.

So in this particular case (example of a diatomic molecule), we have 6 degrees of freedom (two times three), because we have three directions in space for each of the two atoms. A common error is to consider the center-of-mass energy as something separate, rather than including it as a part of the total energy. So always remember: the total kinetic energy is, quite simply, the sum of the kinetic energies of the separate atoms, which can be separated into (1) the kinetic energy associated with the center-of-mass motion and (2) the kinetic energy of the internal motions.

You see? It is not that difficult, is it? Let’s move on to the next topic.

The exponential atmosphere

Feynman uses this rather intriguing title to introduce Boltzmann’s Law, which is a law about densities. Let’s jot it down first:

n = n₀·e^−P.E/kT

In this equation, P.E. is the potential energy, k is our Boltzmann constant, and T is the temperature expressed in Kelvin. As for n₀, that’s just a constant which depends on the reference point (P.E. = 0). What are we calculating here? Densities, so that’s the relative or absolute number of molecules per unit volume, so we look for a formula for a variable like n = N/V.

Let’s do an example: the ‘exponential’ atmosphere. 🙂 Feynman models our ‘atmosphere’ as a huge column of gas (see below). To simplify the analysis, we make silly assumptions. For example, we assume the temperature is the same at all heights. That’s assured by the mechanism for equalizing temperature: if the molecules on top would have less energy than those at the bottom, the molecules at the bottom would shake the molecules at the top, via the rod and the balls. That’s a very theoretical set-up, of course, but let’s just go along with it. The idea is that – when thermal equilibrium is reached – the average kinetic energy of all molecules is the same.

So, if the temperature is the same, then what’s different? The pressure, of course, which is determined by the number of molecules per unit volume. The pressure must increase with lower altitude because it has to hold, so to speak, the weight of all the gas above it. Conversely, as we go higher, the atmosphere becomes more tenuous. So what’s the ‘law’ or formula here?

We’ll use our gas law: PV = NkT, which we can re-write as P = nkT with n = N/V, so n is the number of molecules per unit volume indeed. What’s stated here is that the pressure (P) and the number of molecules per unit volume (n) are directly proportional, with kT the proportionality factor. So we have gravity (the g force) and we can do a differential analysis: what happens when we go from h to h + dh? If m is the mass of each molecule, and if we assume we’re looking at unit areas (both at h as well as h + dh), then the gravitational force on each molecule will be mg, and ndh will be the total number of molecules in that ‘unit section’.

Now, we can write dP as dP = P_h+dh− P_h and, of course, we know that the difference in pressure must be sufficient to hold, so to speak, the molecules in that small unit section dh. So we can write the following:

dP = P_h+dh− P_h = − m·g·n·dh

Now, P is P = nkT and, hence, because we assume T to be constant, we can write the whole equation as dP = k·T·dn = − m·g·n·dh. From that, we get a differential equation:

dn/dh = −(m·g)/(k·T)·n

We all hate differential equations, of course, but this one has an easy solution: the equation basically states we should find a function for n which has a derivative which is proportional to itself. Of course, we know that the exponential function is such function, so the solution of the differential equation is:

n = n₀·e^−mgh/kT

The n₀ factor is the constant of integration and is, as mentioned above, the density at h = 0. Also note that mgh is, indeed, the potential energy of the molecules, increasing with height. So we have a Boltzmann Law indeed here, which we can write as n = n₀·e^−P.E/kT. Done ! The illustration below was also taken from Feynman, and illustrates the ‘exponential atmosphere’ for two gases: oxygen and hydrogen. Because their mass is very different, the curve is different too: it shows how, in theory and in practice, lighter gases will dominate at great heights, because the exponentials for the heavier stuff have all died out.

Generalization

It is easy to show that we’ll have a Boltzmann Law in any situation where the force comes from a potential. In other words, we’ll have a Boltzmann Law in any situation for which the work done when taking a molecule from x to x + dx can be represented as potential energy. An example would be molecules that are electrically charged and attracted by some electric field or another charge that attracts them. In that case, we have an electric force of attraction which varies with position and acts on all molecules. So we could take two parallel planes in the gas, separated by a distance dx indeed, and we’d have a similar situation: the force on each atom, times the number of atoms in the unit section that’s delineated by dx, would have to be balanced by the pressure change, and we’d find a similar ‘law’: n = n₀·e^−P.E/kT.

Let’s quickly show it. The key variable is, once again, the density n: n = N/V. If we assume volume and temperature remain constant, then we can use our gas law to write the pressure as P = NkT/V = kT·n, which implies that any change in pressure must involve a density change. To be precise, dP = d(kT·n) = kT·dn. Now, we’ve got a force, and moving a molecule from x to x + dx involves work, which is the force times the distance, so the work is F·dx. The force can be anything, but we assume it’s conservative, like the electromagnetic force or gravity. Hence, the force field can be represented by a potential and the work done is equal to the change in potential energy. Hence, we can write: Fdx = –d(P.E.). Why the minus sign? If the force is doing work, we’re moving with the force and, hence, we’ll have a decrease in potential energy. Conversely, if the surroundings are doing work against the force, we’ll increase potential energy.

Now, we said the force must be balanced by the pressure. What does that mean, exactly? It’s the same analysis as the one we did for our ‘exponential’ atmosphere: we’ve got a small slice, given by dx, and the difference in pressure when going from x to x + dx must be sufficient to hold, so to speak, the molecules in that small unit section dx. [Note we assume we’re talking unit areas once again.] So, instead of writing dP = P_h+dh− P_h = − m·g·n·dh, we now write dP = F·n·dx. So, when it’s a gravitational field, the magnitude of the force involved is, obviously, F = m·g.

The minus sign business is confusing, as usual: it’s obvious that dP must be negative for positive dh, and vice versa, but here we are moving with the force, so no minus sign is needed. If you find that confusing, let me give you another way of getting that dP = F·n·dx expression. The pressure is, quite simply, the force times the number of particles, so P = F·N. Dividing both sides by V yields P/V = F·N/V = F·n. Therefore, P = F·n·V and, hence, dP must be equal to dP = d(F·n·V) = F·n·dV = F·n·dx. [Again, the assumption is that our unit of analysis is the unit area.] […] OK. I need to move on. Combining (1) dP = d(kT·n) = kT·dn, (2) dP = F·n·dx and (3) Fdx = –d(P.E.), we get:

kT·dn = –d(P.E.)·n ⇔ dn/d(P.E.) = −[1/(kT)]·n

That’s, once again, a differential equation that’s easy to solve. Indeed, we’ve repeated it ad nauseam: a function which has a derivative proportional to itself is an exponential. Hence, we have our grand equation:

n = n₀·e^−P.E/kT

If the whole thing troubles you, just remember that the key to solving problems like this is to clearly identify and separate the so-called ‘dependent’ and ‘independent’ variables. In this case, we want a formula for n and, hence, it’s potential energy that’s the ‘independent’ variable. That’s all. In case of doubt: just do the derivation: d(n₀·e^−P.E./kT)/d(P.E.) = −n₀·e^−P.E/kT·1/(kT) = −n/(kT).

The graph looks the same, of course: the density is greatest at P.E. = 0. To be precise, the density there will be equal to n = n₀·e⁰= n₀ (don’t think it’s infinity there!). And for higher (potential) energy values, we get lower density values. It’s a simple but powerful graph, and so you should always remember it.

Boltzmann’s Law is a very simple law but it can be applied to very complicated situations. Indeed, while the law is simple, the potential energy curve can be very complicated. So our Law can be applied to other situations than gravity or the electric force. The potential can combine a number of forces (as long as they’re all conservative), as shown in the graph below, which shows a situation in which molecules will attract each other at a distance r > r₀(and, hence, their potential energy decreases as they come closer together), but repel each other strongly as r becomes smaller than r₀(so potential energy increases, and very much so as we try to force them on top of each other).

Again, despite the complicated shape of the curve, the density function will – in essence – follow Boltzmann’s Law: in a given volume, the density will be highest at the distance of minimum energy, and the density will be much less at other distances. So, yes, Boltzmann’s Law is pretty powerful !

Some content on this page was disabled on June 17, 2020 as a result of a DMCA takedown notice from Michael A. Gottlieb, Rudolf Pfeiffer, and The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 17, 2020 as a result of a DMCA takedown notice from Michael A. Gottlieb, Rudolf Pfeiffer, and The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/
Some content on this page was disabled on June 17, 2020 as a result of a DMCA takedown notice from Michael A. Gottlieb, Rudolf Pfeiffer, and The California Institute of Technology. You can learn more about the DMCA here:

https://wordpress.com/support/copyright-and-the-dmca/