Having some more free time than usual, I figured I would read a recent paper that looked interesting. However, I’m something of a noob at math, so I figure it’s worth it to take it slowly and painstakingly think through the details. This will be a sort of stream-of-conciousness post where I do just that.
The paper I’ll be reading will be Terry Tao’s “Sendov’s conjecture for sufficiently high degree polynomials.” This paper looks interesting to me because it applies “cheap nonstandard” methods to prove a result in complex analysis. In addition it uses probability-theoretic methods, which I’m learning a lot of right now. Sendov’s conjecture is the following:
Sendov’s conjecture Let be a polynomial of degree that has all zeroes . If is a zero then has a zero in .
Without loss of generality, we may assume that is monic and that . Indeed, if is a polynomial in the variable we divide by the argument of and then rescale by the top coefficient.
Tao notes that by Tarski’s theorem, in principle it suffices to get an upper bound on the degree of a counterexample and then use a computer-assisted proof to complete the argument. I think that for every you’d get a formula in the theory of real closed fields that could be decided in time, where is an absolute constant (which, unfortunately, is exponential in the number of variables of the formula, and so is probably quite large). Worse, Tao is going to use a compactness argument and so is going to get an astronomical bound . Still, something to keep in mind — computer-assisted proofs seem like the future in analysis.
More precisely, Tao proves the following:
Proposition 1 For every in a monotone sequence in , let be a monic polynomial of degree with all zeroes in and let satisfy . If for every , has no zeroes in , then .
It’s now pretty natural to see how “cheap nonstandard” methods apply. One can pass to a subsequence countably many times and still preserve the hypotheses of the proposition, so by diagonalization and compactness, we may assume good (but ineffective) convergence properties. For example, we can assume that , where does not depend on and is with respect to .
Using overspill, one can view the proposition model-theoretically: it says that if is a nonstandard natural number, a monic polynomial of degree with a zero , and there are no zeroes of in , then . Tao never fully takes this POV, but frequently appeals to results like the following:
Proposition 2 Let be a first-order predicate. Then:
- (Overspill) If for every sufficiently small standard , , then there is an infinitesimal such that .
- (Underspill) If for every infinitesimal , , then there are arbitrarily small such that .
- (-saturation) If is where is compact, then the implied constant in the statement of is independent of .
Henceforth we will use asymptotic notation in the nonstandard sense; for example, a quantity is if it is infinitesimal. This is equivalent to the cheap nonstandard perspective where a quantity is iff it is with respect to , where is ranging over some monotone sequence of naturals. I think the model-theoretic perspective is helpful here because we are going to pass to subsequences a lot, and at least in the presence of the boolean prime ideal theorem, letting be a fixed nonstandard natural number is equivalent to choosing a nonprincipal ultrafilter on that picks out the subsequences we are going to pass to, in the perspective where ranges over a monotone sequence of standard naturals.
This follows because the Stone-Cech compactification is exactly the space of ultrafilters on . Indeed, if ranges over a monotone sequence of standard naturals, then in , converges to an element of , which then is a nonprincipal ultrafilter. If denotes the ultrapower of with respect to , then I think the equivalence class of the sequence in is exactly the limit of . Conversely, once a nonprincipal ultrafilter has been fixed, we have a canonical way to pass to subsequences: only pass to a subsequence which converges to . This is possible since is compact.
I think it will be at times convenient to go back to the “monotone sequence of standard naturals” perspective, especially when we’re doing computations, so I reserve the right to go between the two. We’ll call the monotone sequence perspective the “cheap perspective” and the model-theoretic perspective the “expensive perspective”.
I’m not familiar with the literature on Sendov’s conjecture, so I’m going to blackbox the reduction that Tao carries out. The reduction says that, due to the Gauss-Lucas theorem and previously existing partial results on Sendov’s conjecture, to prove Proposition 1, it suffices to show:
Proposition 3 Let be a nonstandard natural, let be a monic polynomial of degree with all zeroes in and let satisfy . Suppose that has no zeroes in and
- (Theorem 3.1 in Tao) either , or
- (Theorem 5.1 in Tao) there is a standard such that
In the former case we have and in the latter we have . We’ll call the former “case zero” and the latter “case one.”
Tao gives a probabilistic proof of Proposition 3, and that’s going to be the bulk of this post and its sequels. Let be a random zero of , drawn uniformly from the finite set of zeroes. Le denote a random zero of chosen independently of .
In the cheap perspective, depends on , we are going to study properties of the convergence of as , by using our chosen ultrafilter to repeatedly pass to subsequences to make converge in some suitable topology. The probability spaces that lives in depend on , but as long as we are interested in a deterministic limit that does not depend on , this is no problem. Indeed will converge to uniformly (resp. in probability) provided that for every , almost surely (resp. ), and this makes sense even though the probability space we are studying depends on . The usual definition of convergence in distribution still makes sense even for random variables converges to a random variable that deos not depend on provided that their distributions converge vaguely to .
Okay, it’s pretty obvious what being infinitesimally close to a deterministic standard real is in the uniform or probabilistic sense. Expanding out the definition of the vague topology of measures, a nonstandard measure on a locally compact Hausdorff space is infinitesimally close to a standard measure provided that for every continuous function with compact support,
This induces a definition of being infinitesimally close in distribution.
Okay, no more model-theoretic games, it’s time to start the actual proof.
Definition 4 Let be a bounded complex random variable. The logarithmic potential of is
Here denotes expected value. Tao claims but does not prove that this definition makes sense for almost every . To check this, let be a compact set equipped with Lebesgue measure and let be the distribution of . Then
and the integrand is singular along the set , which has real codimension in . The double integral of a logarithm makes sense almost surely provided that the logarithm blows up with real codimension (to see this, check the double integral of log on ) so this looks good.
Definition 5 Let be a bounded complex random variable. The Stieltjes transform of is
Then is “less singular” than , so this definition is inoffensive almost everywhere.
Henceforth Tao lets denote the distribution of , where is any bounded complex random variable. Then and where are the complex derivative and Cauchy-Riemann operators respectively. Since we have
This just follows straight from the definitions. Of course might not be an absolutely continuous measure, so this only makes sense if we use the calculus of distributions.
Does this make sense if is deterministic, say almost surely? In that case is a Dirac measure at and . Everything looks good, since .
For the next claims I need the Gauss-Lucas theorem:
Theorem 6 (Gauss-Lucas) If is a polynomial on , then all zeroes of belong to the convex hull of the variety .
Lemma 7 (Lemma 1.6i in Tao) surely lies in and surely lies in .
Proof: The first claim is just a tautology. For the second, by assumption on all zeroes of lie in the convex set , so so does their convex hull. In particular lies in almost surely by the Gauss-Lucas theorem. Our contradiction hypothesis says that .
Lemma 8 (Lemma 1.6ii-iv in Tao) One has . For almost every ,
Proof: We pass to the cheap perspective, so is a large standard natural. Since is large, in particular , if is the coefficient of in then the roots of sum to . The roots of sum to by calculus. So .
Taking of both sides and then dividing by we immediately get and . Then we take the complex derivative of both sides of the and formulae to get the formulae for and .
Now the formula for follows by subtracting the above formulae, as does the formula for .
Since the distributions of and have bounded support (it’s contained in ) by Prokhorov’s theorem we can find standard random variables and such that is infinitesimal in distribution and similarly for . The point is that and give, up to an infinitesimal error, information about the behavior of , , and by the above lemma and the following proposition.
Proposition 9 (Theorem 1.10 in Tao) One has:
- In case zero, and are identically distributed and almost surely lie in , so is infinitesimal in probability. Moreover, for every compact set ,
- In case one, is uniformly distributed on and is almost surely zero. Moreover,
The proposition gives quantitative bounds that force the zeroes to all be in certain locations. Looking ahead in the paper, it follows that:
- In case zero, the Stieltjes transform of is infinitesimally close to , so by a stability-of-zeroes argument to show that has no zeroes near the origin, even though .
- In case one, if is the standard deviation of , then we have control on the zeroes of up to an error of size , which we can then use to deduce a contradiction.
Next time I’ll start the proof of this proposition. Its proof apparently follows from the theory of Newtonian potentials, which is not too surprising since if is the Laplacian of . It needs the following lemma:
Lemma 10 (Lemma 1.6vi in Tao) If is a curve in that misses the zeroes of and then
Proof: One has
by the previous lemma. Breaking up into finitely many parts one can assume that is a contractible curve in a simply connected set, in which case we have a branch of the logarithm along . Now apply the fundamental theorem. The case for is the same.