I have at least four planned posts left in the series on symmetric functions, but unfortunately I’ll be very busy for the next two weeks. In the meantime, here are some thoughts on the primes.
Euclid’s proof of the infinitude of the primes is often held up as a shining example of mathematical proof. (Whether this reputation is deserved is a matter of opinion.) Euler’s proof via the zeta function is also classic. An even showier proof by Furstenburg is phrased in the language of topology. Today I’d like to share my personal favorite proof and discuss one of its possible consequences.
Special cases of Dirichlet’s theorem
Euclid’s proof generalizes to certain special cases of Dirichlet’s theorem. The general recipe is as follows. First, one finds an integer polynomial with the property that its integer values, with finitely many exceptions, are only divisible by primes in some congruence class. For example, the polynomial
is only divisible by primes congruent to
. Then one uses the following result.
Theorem: The set of primes dividing the integer values of an integer polynomial is infinite.
The standard proof of this result is a generalization of Euclid’s proof: if (which we can assume WLOG), we suppose that there are finitely many such primes
and considers
. After dividing out by
, we obtain a number coprime to
; contradiction. (Edit, 2/21/22: As Alexander Stasinski says in the comments, we need to do a little more work here to make sure that
is not equal to
or
. Every way I can think of to do this is slightly inelegant but we can, for example, instead consider
, which can only be equal to
or
for finitely many values of
, and pick
not equal to any such value.)
One obtains Euclid’s result with and Dirichlet’s theorem for primes congruent to
using the cyclotomic polynomials
; I’ve written about this proof in an old post. A paper of Conrad discusses the generalization of this method to primes congruent to
where
; it is known that one can do no better by this method.
Polynomials are not important
The limitation of this method is that is required to be a polynomial, and the reason I no longer like the above proof is that the highly “algebraic” method does not suggest the following generalization at all.
Theorem: Let be a strictly increasing sequence of positive integers which grows slower than
for every
(in particular, any sequence of polynomial growth rate). Then the set of primes dividing some
is infinite.
Proof. We prove the following more precise converse: any strictly increasing sequence of positive integers divisible by at most distinct primes
grows at least as fast as
.
But this is almost obvious. Let denote the number of numbers of the form
less than or equal to
. Clearly any such number must satisfy
and it follows that . In particular,
for every
. (This is not the strongest bound, but it’s enough.) It follows that
.
This is equivalent to the claim we wanted to prove. (Note that if is required to be strictly increasing we get an honest inequality
; otherwise, we can still say
.)
I find this proof the best motivated out of any of the proofs I know, especially because ultimately it reduces to simple counting. It is also constructive in the sense that a failure to satisfy the above inequality for some (for a strictly increasing sequence) implies that you have already hit at least
prime factors.
Motivated by the above result, a positive solution to the following problem would imply a proof of Dirichlet’s theorem that totally avoids the machinery of zeta functions.
Problem: Explicitly describe a sequence growing slower than
for every
that is divisible, with finitely many exceptions, only by primes in a specified arithmetic progression.
Of course we already know by the strong form of Dirichlet’s theorem that this is true of the sequence consisting precisely of the numbers divisible by primes in a specified arithmetic progression. But this is a “high-complexity” sequence whereas, say, the integer values of a cyclotomic polynomial
are “low-complexity.” In the spirit of Polymath4 what would be nice is a sequence describable in polynomial time, but as long as the proof that both of the above conditions are met is elementary it really doesn’t matter. Ideally
would be some perturbation of a polynomial, so the first condition would be trivial to verify. I have no idea how one would go about constructing such a sequence.