One of the clean cardinal-arithmetic facts about infinite sets is that an infinite set has the same cardinality as its Cartesian square. In symbols, if is infinite, then
Equivalently, there is a bijection
For , this is familiar. One can enumerate the lattice
by diagonals, or use a coding such as
with the usual harmless adjustment depending on whether one starts at
or
.
But for an arbitrary infinite set , there need not be a visible grid. There may be no natural ordering, no preferred coordinates, no obvious diagonal traversal. The proof I want to describe instead uses a maximality argument. Its basic idea is this:
Find the largest possible subset which already knows how to enumerate its own square.
That is, we look for subsets equipped with a bijection
Then we make such a pair maximal by Zorn’s lemma. If the leftover set is small, then
absorbs it and hence
. If the leftover set is large, then there is enough fresh material outside
to enlarge
, contradicting maximality.
The proof has a slightly strange texture because it does not directly build a global bijection . Instead, it builds a maximal self-similar island inside
, and then proves that maximality forces the island to be all of
up to bijection.
The strange texture is exactly the point.
A small absorption lemma
I will use the following notation. Write to mean that there is an injection from
into
, and write
to mean that there is a bijection between
and
.
We first record a useful lemma.
Lemma. Suppose is infinite and
Then, for every positive finite integer ,
In particular,
Proof. Since is infinite, choose distinct elements
Define an injection
by sending
Since , this gives an injection
On the other hand, there is an obvious injection
for example
By Cantor-Bernstein,
This proves the lemma.
The lemma is the small but important compression move. Once a set can absorb its own square, it can also absorb finitely many copies of itself.
The theorem
Theorem. Assuming Zorn’s lemma, if is infinite, then
Proof. Since we are using Zorn’s lemma, we are working in the usual choice context. Thus every infinite set contains a countably infinite subset. Choose a countably infinite subset
Since , we know
Fix one bijection
Now consider the collection of pairs
such that
and
is a bijection extending . That is,
We partially order by extension. Thus
means that
and
The collection is nonempty because it contains
Now let be a chain in
. Define
and define
Because is a chain and the maps are ordered by extension, these maps agree on overlaps. Therefore
is a well-defined function.
We claim that
is a bijection.
First, it is injective. Suppose
Then belongs to some member
of the chain, and
belongs to some member
of the chain. Since the chain is linearly ordered, one of these two sets contains the other. Hence
and
both lie in a common member of the chain, where the corresponding map is injective. Therefore
Now we check surjectivity. Take any pair
Then belongs to some
in the chain, and
belongs to some
in the chain. Again, since
is a chain, one of
contains the other. Thus both
and
lie in a common member
of the chain. Since the corresponding map
is surjective, there is some such that
Therefore
So is surjective.
Thus every chain has an upper bound in . By Zorn’s lemma,
has a maximal element. Fix one and call it
So
and
is a bijection, maximal with respect to extension.
Since and
is countably infinite, the set
is infinite.
Let
We now analyze the complement .
First case: the complement is small
Suppose
So there is an injection
We claim that in this case
Choose two distinct elements
Define an injection
as follows.
For , set
For , set
This is injective: the second coordinate separates the two pieces and
, and
is injective on
.
Since
we obtain an injection
The inclusion
is also an injection. By Cantor-Bernstein,
Now conjugate the bijection along a bijection
. Define
by
Since each piece of this composition is a bijection, is a bijection. Hence
So the theorem is proved in this case.
Second case: the complement is large
Now suppose instead that
Since we are in the choice context, cardinalities are comparable. Therefore
Thus we may choose a subset
such that
Let
Since , this is a disjoint union.
Now decompose the square into four blocks:
The original map
already accounts for the old block .
So, to extend from
to
, it remains only to find a bijection from the new elements
onto the three new blocks
But . Therefore
and
Since , each of these three blocks has cardinality
. Hence their disjoint union has cardinality
By the absorption lemma,
And since , we conclude that
Choose a bijection
Now define
as follows.
For , set
For , set
The range of is exactly
, and the range of
is exactly
These two ranges are disjoint, and together they cover all of . Therefore
is a bijection.
Moreover, extends
, and
strictly contains
, because
is nonempty. Thus
is a strictly larger element of than
This contradicts the maximality of .
Therefore the second case is impossible.
Conclusion
The large-complement case cannot occur. Therefore the complement must satisfy
As shown in the first case, this implies
But by construction,
Conjugating this bijection along gives
Hence every infinite set has the same cardinality as its Cartesian square.
What the proof is really doing
The proof is best understood through the block decomposition
The old set already knows how to enumerate
. That is the role of the bijection
If we try to enlarge by adding a fresh copy
of
, then the newly added elements
do not need to enumerate all of
. They only need to enumerate the new part of the square, namely
And if , then this new part has the size of three copies of
. But
, and three copies of
still have size
. Thus the fresh copy
is exactly large enough to pay for all the new ordered pairs created by adding
.
That is the heart of the proof.
The maximality argument says: keep enlarging the self-similar island as long as possible. Once you cannot enlarge it anymore, the outside cannot contain another full copy of
. If it did, that copy could be used to absorb all the new square-blocks and produce a larger self-similar island. Therefore the outside is no larger than
, and so the whole set
is absorbed by
.
In compressed cardinal notation, the extension step is the calculation
But the proof above spells out why this calculation is legitimate without assuming in advance the theorem we are trying to prove.
This is the attractive feature of the argument: instead of proving by global cardinal arithmetic, it proves that any maximal partial instance of the identity must already be global up to bijection. The proof is not a diagonal enumeration. It is a self-similarity trap.
Leave a comment