Why does F = ma?
By Michael Nielsen, November
23, 2019
Note: Rough working notes, me thinking out loud. Thoughtful,
well-informed further ideas and corrections welcome.
The laws of physics don’t need justification, they just are. In that
sense, “why does F = ma?” is a ridiculous
question. Certainly, it can’t be proved, it’s not a mathematical
theorem, or in any sense inevitable. Indeed, it’s easy to imagine
universes in which F = ma is not true: we live in such a universe,
since F = ma only arises as an approximation to a deeper quantum
mechanical reality.
Nonetheless, there’s a sense in which “why does F = ma?”
is a stimulating question. It’s a challenge to deepen one’s
understanding of Newton’s second law of motion, and to understand how
the universe would be different if we replaced the second law by
something else.
As a student, Newton’s second law bugged me. Physicists often seemed
to use it almost tautologically, as a definition of what a force was.
I wondered if there was any non-tautological, non-trivial physical
content? Or was it really just a definition? Of course, it turns out
there is deep physical content, but it’s left implicit in many
discussions of Newton’s second law. Let’s make it more explicit.
(Incidentally, the equation in Newton’s second law isn’t F = ma, but
rather the more subtle statement that force is equal to the rate of
change of momentum of a body. When the mass isn’t changing, that
reduces to F = ma, and that’s the form I’ll discuss here.)
One way to get insight into the second law is to consider
variations. How would the world be different if instead of F = ma,
we instead had F = mv, that is, force is equal to mass times
velocity? Or perhaps F = mj, where j is the “jerk”,
the third derivative of position. Is there some reason Newton’s law
involves the second time derivative of position, rather than the first
or the third (or some more exotic) derivative?
There’s a conventional answer to this question. The key to this answer
is that if we take some fixed configuration of (say) gravitating
bodies, and then consider a test particle, its subsequent motion: (a)
is completely determined by its initial position and velocity; but (b)
the initial position and velocity are free variables which can be
changed relatively easily.
So, for instance, you can’t have F = mv, because that would mean the
initial velocity would be entirely determined by the configuration of
surrounding matter. It would actually be impossible(!) for us to set
the velocity of (for instance) a projectile. But in practice we find
that initial velocities are things which we have a lot of freedom to
adjust. So F = mv is ruled out.
In more mathematical terms: suppose we believe the motion of a test
particle is completely determined by its initial position and
velocity, but also that those quantities are free variables which we
can choose. If we know just a little about differential equations this
suggests some kind of second-order differential equation must be
controlling the behavior of the particle. In particular, the
acceleration of the test particle should somehow be a function of the
other configuration of matter. F = ma is very nearly the simplest
equation we can imagine of this form; the mass is the only slightly
unexpected feature in the equation.
This is a pretty conventional story. It’s one I remember reading in
textbooks as a student. It has some insight worth remembering, but
it’s wrong in important ways. For one thing, test particles don’t
all behave in the same way. Two test particles with the same initial
position and velocity, but different electric charges, can behave
quite differently in the same electric field.
One possible response is to say “oh, maybe our notion of force
should really be something like F = mj, where j is the jerk, i.e.,
the third derivative of position”.
I’ve never worked it out in detail, but wouldn’t be surprised if such
an approach can be made to work. Essentially, it’d make acceleration
into a free (possibly constrained) parameter of the particle, rather
than something completely determined by the distribution of matter and
fields. That free parameter would implicitly contain what (in the
conventional approach) we think of as the charge information. Indeed,
the new equations of motion would have a conserved quantity,
corresponding to the charge. But the resulting force laws would be
quite a bit uglier.
(Actually, if we ever saw a situation in nature where charges seemed
to change over time, this jerk-based approach might be worth
exploring!)
So what then really is the content of Newton’s second law?
The right-hand side of F = ma is at least moderately clear, though
it bears more examination.
But the left-hand side, the very notion of a force, is subtle
indeed. There’s an underlying implicit set of assertions: matter
produces forces on test particles; those forces control the behavior
of the test particles; those forces can be computed as a reasonably
simple universal function of the configuration of matter and fields,
notably of positions, velocities, and charges.
(Just to make the last assertion more concrete: Newton’s law of
gravitation, for instance, asserts that you can compute the force on a
test particle as the integral over mass density throughout the
universe, in accord with the inverse square law. And, of course, other
people have figured out other ways of computing force as a function of
the distribution of matter and fields.)
None of these implicit assertions has anything a priori to do with
ma. Rather, they’re a remarkable set of assertions about how we
should describe nature. And they’re all implicitly part of the content
of the second law, though often not so explicitly stated. If these
things weren’t true, the second law wouldn’t be a useful statement;
indeed, no-one would ever have heard of it.
Putting it in somewhat fuzzier terms, and at the risk of repeating
myself: F = ma derives its power from the (implicit) assertion that
there is a simple universal force law that lets us figure out F for a
particular configuration of matter. And so the configuration of matter
completely determines the acceleration of a test particle. There is no
a priori reason this ought to be true. It’s an absolutely incredible
fact of nature.
Let’s condense our observations into a single paragraph: a reasonable
answer to “why does F = ma?” is: the behavior of test
particles is somehow determined by a quantity which we’ll call a
“force”. This force is a simple function of the
configuration of matter and fields, notably of the positions,
velocities and charges of all particles. In practice, we find it’s
possible to change the initial position and velocity of test
particles, without changing the rest of the matter configuration. But
it doesn’t seem so easy to change the initial acceleration, without
changing the rest of the matter configuration. That suggests the force
should somehow determine the acceleration. At this point, F = ma
seems a good candidate law of motion.
Personally, I find this all a very helpful line of thought. Of course,
there’s still much that’s mysterious. For instance, I haven’t said
anything about why m appears in the second law, or even where the
notion of mass comes from. Of course, mass is very familiar to us from
childhood, and so seems innocuous, but it’s an incredibly deep and
subtle idea. What’s it doing in the second law? If F is a universal
function, then m is almost like a resistance, something that makes a
test particle respond less to the applied force. It’s remarkable this
is a fixed constant for particles in nature.
A fun question: how does the universe change if the mass isn’t a
scalar, but rather is a matrix, and so a = m-1F is the
acceleration? What would this world look like? Is it plausible?
Another fun question: how does the universe change if F = mw, where
w is a fractional time derivative of position? Say, for instance,
the 1.5th time derivative of position. Is there any sensible
formulation of (classical) physics where this kind of thing can be
used as a law of motion?
And one more fun question: is there any connection to evolutionary
psychology? Human beings can see (and manipulate) the position and
velocity of everyday objects quite well; much, much better than they
can see the acceleration. People routinely get the sign(!) wrong
when estimating acceleration; it’s hard to imagine that happening for
velocity, outside of rather contrived circumstances. Is there some
evolutionary reason for this, connecting Newton’s law to facts about
our nervous system?
Of course, it’s possible to deepen our thinking much further. We can
start to think about F = ma as a consequence of the Euler-Lagrange
or Hamilton’s equations; or as a consequence of the Schroedinger
equation, or of Feynman’s sum-over histories approach to
physics. Indeed, I suspect it is possible to in some sense deduce the
second law of motion from thermodynamics. (Cf the work by Ted Jacobson
on the Einstein field equations as equations of state, and more
recent followups). And we can think much more deeply about notions
like “test particles”, or what reference frame to measure
acceleration in. And so on – a panoply of great questions!
Newton’s laws are incredibly deep.
An interesting feature of the discussion above is that it’s written
for people already familiar with Newton’s laws. It takes as given a
lot of pre-existing intellectual structure. I assume you have a basic
comfort with differential equations, with test particles, with
gravitating bodies, with acceleration, and so on. That’s a huge amount
of background. And then we leave most of it fixed, and poke hard in a
few places, seeing what happens when you change those things around,
but leave most of the intellectual edifice unchanged.
This is a good strategy for building insight, if you’re already
knowledgeable about a theory. But it’s likely not so helpful for
newcomers. Perhaps this is why these questions weren’t discussed in
my introductory physics classes! It’d be fun to find an approach that
also works for newcomers. The notion of universal force laws is one of
the most beautiful and audacious ideas humans have ever developed.
Perhaps one approach could be to write a piece of discovery fiction
explaining how the second law could have come to be discovered. I find
it truly remarkable just how much Newton and his contemporaries
needed to get right. There’s so many different, subtle ideas; each
needs to sit in the right relationship to the others. It’s remarkable
they were able to bootstrap them all collectively into a useful form.
Finally, let me emphasize that, considered as a theory of physics,
Newton’s laws are wrong. At best they’re an approximation to certain
parts of a theory we hope may be correct, quantum mechanics. I find it
astounding that a theory like quantum mechanics can have inside it
another theory, an approximation, also extremely beautiful
mathematically, but radically different. It’s like taking Bach, adding
some noise, and getting the best of the Beatles out. I wish I
understood better why this can happen.
Acknowledgments: Many thanks to David Chapman and Andy Matuschak for
the conversation which instigated this essay.
Please help support my work on Patreon, and
please follow me on Twitter.