7 A proof of Marton’s conjecture in
We shall prove the following theorem.
Theorem 7.1 (Green, Manners, Tao, Gowers).
There is a polynomial
with the following
property:
If and
is such that
, then there is a
subspace of size at
most such that
is contained in the union
of at most translates
of . (Equivalently,
there exists ,
such
that ).
This is known as “Polynomial Freiman–Ruzsa”.
In fact, we shall prove the following statement.
Theorem 7.2 (Entropic Polynomial Freiman–Ruzsa).
There exists an absolute constant
satisfying the
following: Let
and let be
-valued random variables.
Then there exists a subsgroup
of such
that
where is the uniform
distribution on .
Lemma 7.3.
Assuming that:
Then there exists
such that
.
Proof.
If not, then
|
contradiction. □
Proof.
Let ,
. Let
and
be independent
copies of . Then by
Theorem 7.2, there exists
(a subgroup) such that
so
But
So .
Therefore
Therefore, by Lemma 7.3, there exists
such that
|
But
|
(using characteristic 2). So there exists
such that
|
Let . By the Ruzsa covering
lemma, we can cover
by at most
translates of .
But so
, so
can be covered
by at most
translates of .
But using ,
So
|
Since is
contained in ,
so , so
If then we are done.
Otherwise, since ,
so .
Pick a subgroup
of of size
between
and . Then
is a union
of at most
translates of ,
so is a union
of at most
translates of .
□
Now we reduce further. We shall prove the following statement:
Theorem 7.5 (EPFR).
There is a constant
such that if and
are any two
-valued random
variables with , then
there exists -valued
random variables
and
such that
|
Proof.
By compactness we can find ,
such
that
|
is minimised. If
then by EPFR()
there exist ,
such that .
But then
Contradiction.
It follows that .
So there exists
such that and
are uniform
on cosets of ,
so
|
which gives us EPFR().
□
Definition.
Write
for
|
Definition.
Write
for
|
Remark.
If we can prove EPFR
for conditional random variables, then by averaging we get it for some pair of random variables (e.g. of the
form
and ).
Lemma 7.7 (Fibring lemma).
Assuming that:
Then
|
Proof.
But the last line of this expression equals
|
We shall be interested in the following special case.
Corollary 7.8.
Assuming that:
Then
Proof.
Apply Lemma 7.7 with ,
and .
□
We shall now set .
Recall that Lemma 6.11 says
|
Equivalently,
|
Applying this to the information term (),
we get that it is at least
which simplifies to
So Corollary 7.8 now gives us:
Now apply this to ,
and
and
add.
We look first at the entropy terms. We get
where we made heavy use of the observation that if
are some
permutation of ,
then
This also allowed use e.g. to replace
|
by
|
Therefore, we get the following inequality:
Lemma 7.9.
Now let
be copies of
and copies of
and apply
Lemma 7.9 to
(all independent), to get this.
Lemma 7.10.
Assuming that:
-
satisfy:
and
are copies of ,
and
are copies of ,
and all of them are independent
Then
OR? TODO: figure out which is correct
Recall that we want
such that
Lemma 7.10 gives us a collection of distances (some conditioned), at least one of which is at most
. So it
will be enough to show that for all of them we get
for some absolute constant .
Then we can take .
Definition (-relevant).
Say that is
-relevant
to if
Proof.
.
□
Lemma 7.12.
Assuming that:
Then
|
Proof.
Corollary 7.13.
Assuming that:
-
-
are independent copies of
Proof.
□
Corollary 7.14.
is -relevant
to .
Corollary.
Assuming that:
Proof.
By Lemma 7.12,
Corollary 7.15.
Assuming that:
Proof.
Similarly for .
□
Lemma 7.16.
Assuming that:
Then
|
Proof.
But , so
it’s also
|
Averaging the two inequalities gives the result (as earlier). □
Corollary 7.17.
Assuming that:
Then
Proof.
Use Lemma 7.16. Then as soon as it is used, we are in exactly the situation we were in when
bounding the relevance of
and .
□
It remains to tackle the last two terms in Lemma 7.10. For the fifth term we need to bound
|
But first term of this is at most (by Lemma 7.12)
|
By The entropic Ruzsa triangle inequality and independence, this is at most
Now we can use Lemma 7.16, and similarly for the other terms.
In this way, we get that the fifth and sixth terms have relevances bounded above by
for an absolute
constant .