This argument has been useful to me in the past, as a way of constructing the nonstandard real numbers (real numbers with infinitesimals). Of course, infinitesimals are useful for cheesing epsilon-delta arguments where the explicit values of the constants that appear aren’t important, but one often feels “guilty” about using them because they come off as “less rigorous.” This is probably because the proof that nonstandard reals exist is usually thought of as “difficult to explain” to a non-logician, but the proof that I will give is meant to be “measure-theoretic” in nature, so much friendlier to someone with a background in analysis.
First, an incomprehensible definition. We will define the nonstandard reals to be any ultrapower , where
is the set of natural numbers and
is a free ultrafilter on the powerset
of
.
An ultrafilter is nothing more than a finitely additive measure valued in {0, 1}, which is said to be free if it is non-atomic. (Throughout this post, we will take measures to be finitely additive, rather than countably additive.) If a set X is sent to 1, we write (so we identify the measure with the set of all sets which it sends to 1). Of course atomic ultrafilters on
exist (just take any Dirac mass.) But as we will see, atomic ultrafilters are not very interesting for our purposes here.
Let be an infinite set, k an ordered field, and
an ultrafilter on
. (This can be defined in much greater generality than ordered fields, but that’s all we’ll need for our purposes.) As usual
is the set of all maps
(so if
, it is the sequence space of k). Now
determines an equivalence relation on
, by declaring that two maps x and y are equivalent if they are equal almost everywhere. The resulting quotient is known as the ultrapower
of k.
We give the structure of an ordered field, as follows. First, 0 and 1 are nothing more than the equivalence classes of 0 and 1. We now observe that + and
lift to pointwise operations on
, and then descend to operations on
, just as we can define operations on the space
of Lebesgue-almost everywhere equivalence classes p-integrable functions by taking quotients of operations on the space
of p-integrable functions. Finally, we declare
if
for almost every
.
We have to use some logical machinery somewhere in this proof, and so now we will. Specifically, we need Łoś’s theorem (also known as the fundamental theorem of ultrapowers, or the transfer principle.) To do this, we need the notion of a first-order formula in the language of ordered fields (or I will say “formula” for short). This is any mathematical statement about a given ordered field k which can be expressed using only finitely many copies of the symbols ,
,
,
(i.e. “not”), +,
,
, =, 0, 1, and parentheses and variables, where
means that there exists an element
of k (and not a subset of k, or a set of subsets of k, or so on) and
is negation. For example, the statement that
exists is a formula, namely
. A more sophisticated example of a formula is the quadratic formula, as would be a nice exercise to verify. We need infinitely many formulae to express that k is characteristic 0, namely
for every length of 1’s. And we cannot express that k is archimedean even with infinitely many formulae.
Łoś’s theorem says that if k is an ordered field and is any ultrapower of k, then for every formula
that is true in k,
is true in F as well (and if this conclusion holds, then we say that k and F are elementarily equivalent). Note carefully that elementary equivalence is weaker than isomorphism (indeed, by Łoś’s theorem, $\mathbb R$ is elementarily equivalent to the nonstandard reals, which are nonarchimedean).
To prove Łoś’s theorem, we use induction on the complexity of . For so-called “atomic formulae”, those that can be expressed without use of
,
, or
, this is an easy “almost everywhere” argument. For negation, notice that if
, then
is true almost everywhere iff
is false almost everywhere by the inductive hypothesis. Similarly for implication. For quantification, assume
. If
is true in k, then there is an
be such that a satisfies
. So the map which is constantly a satisfies
by the inductive hypothesis. On the other hand, if
is true in
, then find a map
satisfying
. Then there is a
in the image of
which satisfies
, since
satisfies
almost everywhere. This proves Łoś’s theorem.
We are almost done. The problem is that if we choose any ultrapower of , we have not accomplished anything. In fact if we take
, then the map
is an isomorphism
, so we have not shown the existence of infinitesimals. This is why we assumed that the ultrafilter
was free.
To prove that a free ultrafilter exists, we define a filter to be a measure on a subalgebra
of
valued in {0, 1}, and declare that a filter is free if it is not atomic. Clearly the filter which sends finite sets to 0 and cofinite sets to 1 is non-atomic, so one exists. We order filters by saying that
if there is an inclusion of algebras
, and
agrees with
if they agree whenever both are defined, just like in the proof of the Hanh-Banach theorem. By taking limits we see that the supremum of filters is a filter. Now we apply Zorn’s lemma to the space of free filters, which is easily seen to be an ultrafilter.
(Unimportant aside: The existence of a -additive free ultrafilter on
is a very special property of
. It is not true for
or most other infinite sets. In fact, if
is uncountable and has a
-additive free ultrafilter on
, we say that
is a measurable cardinal. The existence of a measurable cardinal cannot be proven from the axioms of set theory, and in fact is much stronger than the consistency of said axioms.)
Now if is a free ultrafilter, then the ultrapower
is by definition the ordered field of nonstandard real numbers. By Łoś’s theorem, any statement which does not require quantifying over infinitely many reals or over sets of reals and is true in
is still true in
. Moreover, the reals embed in
by the map
. On the other hand, the sequence
, descends to an positive element
of
which is smaller than any standard real number, since it is smaller than any standard real number almost everywhere. So we do have infinitesimals, and the inclusion
is not an isomorphism, as desired.