Introduction to
Additive Combinatorics
Daniel Naylor
Contents
1 Combinatorial methods
Definition 1.1 (Sumset).
Let
be an abelian group. Given ,
define the sumset
to be
|
and the difference set
to be
|
If and
are
finite, then certainly
|
Example 1.2.
Let .
Then
|
Lemma 1.3.
Assuming that:
Then , with equality
if and only if
is an arithmetic progression.
Proof.
Let
with .
Then
|
so . But
we could also have written
|
When ,
these two orderings must be the same. So
for all .
□
Exercise: If , then
with equality
if and only if
and are
arithmetic progressions with the same common difference.
Example 1.4.
Let
with prime.
Then .
Indeed,
(note that
means ).
But ,
|
Theorem 1.5 (Cauchy-Davenport).
Assuming that:
-
is a prime
-
nonempty
Then
|
Proof.
Assume .
Without loss of generality assume that
and that .
Apply induction on .
The case
is trivial. Suppose ,
and let .
Since
and ,
there must exist
such that
but .
Let ,
so ,
,
.
But , so the inductive
hypothesis applies to
and .
Since
|
we have
|
This fails for general abelian groups (or even general cyclic groups).
Example 1.6.
Let be
(fixed, small) prime, and let
be a subspace. Then ,
so . In fact,
if is such
that ,
then
must be a coset of a subspace.
Example 1.7.
Let
be such that .
Then there exists
a subspace such that
and is contained
in a coset of .
See Example Sheet 1.
Definition 1.8 (Ruzsa distance).
Given finite sets
, we define the
Ruzsa distance
between
and by
|
Note that this is symmetric, but is not necessarily non-negative, so we cannot prove that it is a metric. It
does, however, satisfy triangle inequality:
Lemma 1.9 (Ruzsa’s triangle inequality).
Assuming that:
Then
|
Proof.
Observe that
Indeed, writing each
as with
,
, the
map
is injective. The triangle inequality now follows from the definition. □
Definition 1.10 (Doubling / difference constant).
Given a finite
, we
write
for the doubling constant of
and
for the difference constant of .
Then Lemma 1.9 shows, for example, that
|
So , or
.
Notation.
Given
and , we
write
|
Theorem 1.11 (Plunnecke’s Inequality).
Assuming that:
Proof.
Choose a non-empty subset
such that the ratio
is minimised, and call this ratio .
Then ,
,
and ,
.
Claim: For every finite ,
.
Let’s complete the proof of the theorem assuming the claim. We first show that
,
. Indeed, the case
is trivial, and
is true by assumption.
Suppose and the
inequality holds for .
By the claim with ,
we get
|
But as in the proof of Ruzsa’s triangle inequality,
, we can
show
|
Hence ,
which completes the proof (assuming the claim).
We now prove the claim by induction on .
When
the statement follows from the assumptions. Suppose the claim is true for
, and
consider for
some .
Observe that
|
with .
By definition of ,
,
so
We apply this argument a second time, writing
|
where .
We conclude that
|
so
|
proving the claim. □
We are now in a position to generalise Example 1.7.
Theorem 1.12 (Freiman-Ruzsa).
Assuming that:
Then is contained
in a subspace
of size .
Proof.
Choose maximal
such that the translates
with are disjoint.
Such a set cannot
be too large: ,
, so by Plunnecke’s
Inequality, since ,
|
So . We
next show
Indeed, if
and ,
then by maximality of ,
for some
(and if ,
then clearly ).
It follows from ()
by induction that ,
|
since
|
Now let be the
subgroup generated by ,
which we can write as
|
where
is the subgroup generated by .
But every element of can
be written as a sum of
elements of with
coefficients amongst ,
hence .
To conclude, note that
|
where we use Plunnecke’s Inequality or even Ruzsa’s triangle inequality. □
Example 1.13.
Let
where
is a subspace of dimension
and
consists of
linearly independent vectors not in .
Then
|
and
|
But any subspace
containing must have
size at least , so the
exponential dependence on
is necessary.
Theorem 1.14 (Polynomial Freiman-Ruzsa, due to Gowers–Green–Manners–Tao 2024).
Assuming
that:
Then there exists a subspace
of size at most
such
that for some
,
|
where
and are
polynomial in .
Proof.
Omitted, because the techniques are not relevant to other parts of the course. See Entropy
Methods in Combinatorics next term. □
Definition 1.15.
Given we
define the additive energy between
and to
be
|
We refer to the quadruples
such that
as additive quadruples.
Example 1.16.
Let
be a subspace. Then .
On the other hand, if is chosen at
random from (each element chosen
independently with probability ),
then with high probability
|
Lemma 1.17.
Assuming that:
Then
|
Proof.
Define (and notice
that this is the same as ).
Observe that
but
(As usual,
here means the indicator function). □
In particular, if ,
then
|
The converse is not true.
Example 1.18.
Let
be your favourite (class of) abelian group(s). Then there exist constants
such that for all
sufficiently large ,
there exists ,
with
satisfying
and .
Theorem 1.19 (Balog–Szemeredi–Gowers, Schoen).
Assuming that:
Then there exists
of size at least
such that ,
where
and are
polynomial in .
Idea: Find
such that such
that has many
representations as
with .
We first prove a technical lemma, using a technique called “dependent random choice”.
Definition 1.20 (gamma-popular differences).
Given
and
, let
|
be the set of -popular
differences of .
Lemma 1.21.
Assuming that:
-
is finite
-
-
Then there is a subset
of size such that for
all but a -proportion
of pairs ,
.
Proof.
Let .
Then
For , let
|
and set .
Then
Let .
Then
Hence there exists
such that
|
Let ,
,
. So
|
Find such
that is
large.
Given ,
let .
Then
|
Let .
Then
Therefore,
So there exists
such that ,
in which case we have
and .
□
Proof of Theorem 1.19.
Given
with , apply
Lemma 1.21 with
to otain of
size such that
for all but
of pairs ,
. In
particular, the bipartite graph
|
has at least
edges. Let .
Clearly, . For
any , there
are at least
elements
such that
().
Thus
has at least
|
representations of the form
with .
It follows that
Thus .
□
2 Fourier-analytic techniques
In this chapter we will assume that
is finite abelian.
comes equipped with
a group of characters,
i.e. homomorphisms .
In fact, is
isomorphic to .
See Representation Theory notes for more information about characters and proofs of this as well as some of
the facts below.
Example 2.1.
-
(i)
If ,
then for any ,
we have an associated character ,
where .
-
(ii)
If ,
then any
can be associated to a character .
Notation.
Given
nonempty, and any function ,
let
|
Lemma 2.2.
Assuming that:
Then
|
and for all ,
|
Proof.
The first equality in eqch case is trivial. Suppose
. Then there
exists
with .
Then
So .
For the second part, note that given ,
there must by such
that , for otherwise
would act trivially on
, hence would also be
the dual group for ,
a contradiction. □
Definition 2.3 (Fourier transform).
Given ,
define its Fourier transform
by
It is easy to verify the inversion formula: for all ,
Indeed,
Given , the indicator or
characteristic function of ,
is
defined as usual.
Note that
|
The density of
in (often
denoted by ).
Definition (Characteristic measure).
Given non-empty ,
the characteristic measure
is defined by .
Note that .
Definition (Balanced function).
The balanced function
is given by
. Note
that .
Example 2.4.
Let
be a subspace. Then for ,
we have
where is the
annihilator of .
In other words, .
Example 2.5.
Let
be such that each
lies in independently
with probability .
Then with high probability
|
This follows from Chernoff’s inequality: Given
-valued independent
random variables
with mean ,
then for all ,
we have
|
Example 2.6.
Let
with .
Then
and .
Given ,
we write
|
Consequently,
|
Lemma 2.7.
Assuming that:
Then
-
(i)
(Parseval’s identity)
-
(ii)
(Plancherel’s identity)
Proof.
Exercise (hopefully easy). □
Definition 2.8 (Spectrum).
Let
and . Define
the -large
spectrum of
to be
|
Example 2.9.
By Example 2.4, if
with ,
then ,
|
Lemma 2.10.
Assuming that:
Then
|
Proof.
By Parseval’s identity,
In particular, if
for ,
then
|
so .
Definition 2.11 (Convolution).
Given ,
we define their convolution
by
|
Example 2.12.
Given ,
|
In particular, .
Lemma 2.13.
Assuming that:
Then
|
Proof.
Example 2.14.
|
In particular,
for any .
Theorem 2.15 (Bogolyubov’s lemma).
Assuming that:
Then there exists
of codimension
such that .
Proof.
Observe
|
so wish to find
such that
for all . Let
with
and let
. By
Lemma 2.10, .
Fix .
Note
since
for all
and
hence (in
fact, )
for all
and .
□
Example 2.16.
The set
(where counts the
number of 1s in ) has
density , but there is no
coset of any subspace
of codimension
such that .
Lemma 2.17.
Assuming that:
-
of density
-
-
Then there exists
of codimension
and
such that
|
Proof.
Let be
such that ,
and let .
Write for
for the
distinct
cosets
of .
Then
By triangle inequality, .
But note that
so , hence
there exists
such that .
Then .
□
Notation.
Given ,
write
|
Notation.
Given ,
write
to be distinguished from .
Lemma 2.18.
Assuming that:
-
prime
-
of density
-
Then the number of 3-term arithmetic progressions in
differs from
by at
most .
Proof.
The number of 3-term arithmetic progressions in
is
times
By Plancherel’s identity and Lemma 2.13, we have
but
by Parseval’s identity. □
Theorem 2.19 (Meshulam’s Theorem).
Assuming that:
Then .
Proof.
By assumption,
|
But as in (the proof of) Lemma 2.18,
|
so provided ,
i.e.
we have .
So by Lemma 2.17 with ,
there exists
of codimension 1 and
such that .
We iterate this observation: let ,
,
. At the
-th step, we are
given a set
of density
with no non-trivial 3 term arithmetic progressions. Provided that
, there
exists of
codimension ,
such
that
|
Set ,
has density ,
and is free of non-trivial 3 term arithmetic progressions.
Through this iteration, the density increases from
to
in at most
steps.
to
in at most
steps and so on.
So reaches
in at most
|
steps. The argument must end with ,
at which point we must have had ,
or else we could have continued.
But we may assume that
(or )
whence ,
or .
□
At the time of writing, the largest known subset of
containing no non-trivial 3 term arithmetic progressions has size
.
We will prove an upper bound of the form .
Theorem 2.20 (Roth’s theorem).
Assuming that:
Then .
Example 2.21 (Behrend’s example).
There exists
of size
at least
containing no non-trivial 3 term arithmetic progressions.
Lemma 2.22.
Assuming that:
Then one of the following holds:
-
(i)
(where the Fourier coefficient is computed in )
-
(ii)
There exists an interval
of length
such that
Proof.
We may assume that
since otherwise
so we would be in Case (ii) with .
Let . Note that all 3 term arithmetic
progressions of the form are in
fact arithmetic progressions in .
If or
were at least
, we would again be in case
(ii). So we may assume that .
Now as in Lemma 2.18 and Theorem 2.19,
where
and . So
as before,
|
provided that ,
i.e. .
(Check this is satisfied).
Hence
|
Lemma 2.23.
Assuming that:
Then there exists a partition of
into progressions
of length
such that
|
for all .
Proof.
Let
and consider .
By Pigeonhole, there exists such
that .
Set ,
so .
Divide
into residue classes modulo ,
each of which has size at least .
But each residue class can be divided into arithmetic progressions of the form
with .
The diameter of the image of each progression under
is .
□
Lemma 2.24.
Assuming that:
Then there exists a progression
of length at least
such that .
Proof.
Let , and use
Lemma 2.23 to partition
into progressions
of length
|
and . Fix
one from
each of the .
Then
So
|
Since
has mean zero,
|
hence there exists
such that
|
and so
|
Definition 2.25 (Bohr set).
Let
and . By the
Bohr set we
mean the set
|
We call
the rank of ,
and
its width or radius.
Example 2.26.
When ,
then for all
sufficiently small .
Lemma 2.27.
Assuming that:
Proposition 2.28 (Bogolyubov in a general finite abelian group).
Assuming that:
Then there exists
of size at most
such that .
Proof.
Recall .
Let , and note
that, for
and ,
. Hence,
for ,
|
and
|
3 Probabilistic Tools
All probability spaces in this course will be finite.
Theorem 3.1 (Khintchine’s inequality).
Assuming that:
-
-
independent random variables
-
Then
|
Proof.
By nesting of norms, it suffices to prove the case
for some .
Write ,
and assume .
Note that in fact ,
hence .
By Chernoff’s inequality (Example 2.5), for all
we have
|
and so using the fact that
we have
We shall show by induction on
that . Indeed,
when ,
|
For ,
integrate by parts to find that
Corollary 3.2 (Rudin’s Inequality).
Let
be a linearly independent set and let .
Then for any ,
|
Corollary 3.3.
Let be a
linearly independent set and let .
Then for all ,
|
Proof.
Let
and write .
Then
which is bounded above by
where ,
using Hölder’s inequality.
By Rudin’s Inequality,
|
Recall that given
of density ,
we had .
This is best possible as the example of a subspace shows. However, in this case the large spectrum is highly
structured.
Theorem 3.4 (Special case of Chang’s Theorem).
Assuming that:
-
of density
-
Then there exists
of dimension
such that .
Proof.
Let be a maximal
linearly independent set. Let .
Clearly . By
Corollary 3.3, for all ,
|
so
|
Set
to get .
□
Definition 3.5 (Dissociated).
Let
be a finite abelian group. We say
is dissociated if
for ,
then .
Clearly, if ,
then is
dissociated if and only if it is linearly independent.
Theorem 3.6 (Chang’s Theorem).
Assuming that:
Then .
We may bootstrap Khintchine’s inequality to obtain the following:
Theorem 3.7 (Marcinkiewicz-Zygmund).
Assuming that:
Then
|
Proof.
First assume the distribution of the ’s
is symmetric, i.e. for
all . Partition the
probability space
into sets , write
for the induced
measure on
such that all ’s
are symmetric and take at most 2 values. By Khintchine’s inequality, for each
,
so summing over all and taking
-th roots gives the symmetric
case. Now suppose the ’s
are arbitrary, and let
be such that and
are all independent. Applying
the symmetric case to ,
But then
concluding the proof. □
Theorem 3.8 (Croot-Sisask almost periodicity).
Assuming that:
Then there exists
and a set
such that
and
|
where for all
, and as a reminder,
is the characteristic
measure of .
Proof.
The main idea is to approximate
|
by ,
where
are sampled independently and uniformly from ,
and
is to be chosen later.
For each , define
. For each
, these are independent
random variables with mean ,
so by Marcinkiewicz-Zygmund,
By Hölder with ,
we get
so
|
Summing over all ,
we have
|
with
by Young / Hölder (
where ).
So we have
|
Choose so that the
RHS is at most .
whence
|
Write
|
By Markov inequality, since
|
we have
|
so . Let
|
Now ,
whence
|
By Lemma 1.17,
|
so there are at least
pairs such that
. In particular,
there exists
and of size
such that for
all , there
exists such
that for all ,
. But then
for each , by
the triangle inequality,
by definion of .
□
Theorem 3.9 (Bogolyubov again, after Sanders).
Assuming that:
Then there exists a subspace
of codimension
such tht .
Almost periodicity is also a key ingredient in recent work of Kelley and Meka, showing that any
containing no non-trivial 3 term arithmetic progressions has size
.
4 Further Topics
In , we
can do much better.
Theorem 4.1 (Ellenberg-Gijswijt, following Croot-Lev-Pach).
Assuming that:
Then .
Notation.
Let be the set of
monomials in whose degree
in each variable is at most .
Let be the vector
space over
whose basis is .
For any , write
for the set of
monomials in of (total)
degree at most , and
for the corresponding
vector space. Set .
Lemma 4.2.
Assuming that:
Then
|
Proof.
Every can be written as a
linear combination of monomials in ,
so
|
for some coefficients .
Clearly at least one of
must have degree ,
whence
|
for some families of polynomials ,
.
Viewing
as a -matrix
,
we see that
can be written as the sum of at most
matrices, each of which has rank .
Thus .
But by assumption,
is a diagonal matrix whose rank equals .
□
Proposition 4.3.
Assuming that:
Then .
Proof.
Let be an integer to
be determined later. Let be
the space of polynomials in
that vanish on .
We have
|
We claim that there exists
such that . Indeed, pick
with maximal support.
If , then there would be
a non-zero polynomial
vanishing on , in which
case , contradicting
the choice of .
Now by assumption,
|
So any polynomial that vanishes on
vanishes on .
By Lemma 4.2 we now have that,
Hence . But the
monomials in are in
bijection with the ones in
via , whence
. Thus
setting , we
have .
□
You will prove Theorem 4.1 on Example Sheet 3.
We do not have at present a comparable bound for 4 term arithmetic progressions. Fourier techniques also
fail.
Example 4.4.
Recall from Lemma 2.18 that given
,
|
But it is impossible to bound
|
by . Indeed,
consider .
By Problem 11(ii) on Sheet 1,
and
|
But given a 3 term arithmetic progression ,
by the identity
|
automatically
lies in ,
so
|
which is not close to .
Definition 4.5 (-norm).
Given , define
its -norm by
the formula
|
Problem 1(i) on Sheet 2 showed that ,
so this is indeed a norm.
Problem 1(ii) asserted the following:
Lemma 4.6.
Assuming that:
Then
|
Note that
|
and thus by Parseval’s identity,
|
Hence
|
Moreover, if ,
then
|
We may therefore reformulate the first step in the proof of Meshulam’s Theorem as follows: if
, then
by Lemma 4.6,
|
It remains to show that if is non-trivial,
then there exists a subspace of
bounded codimension on which
has increased density.
Theorem 4.7 ( Inverse Theorem).
Assuming that:
-
-
-
-
Then there exists
such that
|
In other words,
for and
we say “
correlates with a linear phase function”.
Proof.
We have seen that
|
so
|
Definition 4.8 ( norm).
Given ,
define its
norm by
where and
denotes the
number of ones in .
It is easy to verify that
where .
Definition 4.9 ( inner product).
Given functions
for , define
their
inner product by
|
Observe that .
Lemma 4.10 (Gowers–Cauchy–Schwarz Inequality).
Assuming that:
Then
|
Setting
for and
otherwise, it
follows that
hence .
Proposition 4.11.
Assuming that:
Then
|
Proof.
We additionally assume
to make the proof easier to follow, but the same ideas are used for the general case. We additionally
assume ,
by rescaling, since the inequality is homogeneous.
Reparametrising, we have
Theorem 4.12 (Szemeredi’s Theorem for 4-APs).
Assuming that:
Then .
Idea: By Proposition 4.11 with ,
|
where
consists of
other terms in which between one and three of the inputs are equal to
.
These are controlled by
whence
|
So if contains no non-trivial 4
term arithmetic progressions and ,
then .
What can we say about functions with large
norm?
Example 4.13.
Let
be an symmetric
matrix with entries in .
Then
satisfies .
Theorem 4.14 ( inverse theorem).
Assuming that:
Then there exists a symmetric
matrix
with
entries in
and
such that
|
where is a
polynomial in .
In other words,
for and
we say “
correlates with a quadratic phase function”.
Proof (sketch).
Let
denote .
.
-
STEP 1:
Weak linearity. See reference.
-
STEP 2:
Strong linearity. We will spend the rest of the lecture discussing this in detail.
-
STEP 3:
Symmetry argument. Problem 8 on Sheet 3.
-
STEP 4:
Integration step. Problem 9 on Sheet 3.
STEP 1: If , then for
at least a -proportion
of ,
. So for each
such , there
exists such
that .
Proposition 4.15.
Assuming that:
-
-
-
-
Then there exists
with
and a
function
such that
-
(i)
;
-
(ii)
There are at least
quadruples
such that
and .
STEP 2: If and
are as above, then there
is a linear function
which coincides with
for many elements of .
Proposition 4.16.
Assuming that:
-
and
given as in Proposition
4.15
Then there exists
matrix with
entries in
and such
that
()
satisfies
for
elements .
Proof.
Consider the graph of ,
.
By Proposition 4.15,
has
additive quadruples.
By Balog–Szemeredi–Gowers, Schoen, there exists
with
and .
udefine
by
and note .
By Freiman-Ruzsa applied to ,
there exists a subspace
with
such that .
Denote by the projection
onto the first coordinates.
By construction, .
Moreover, since ,
|
We may thus partition
into cosets of some
subspace such that
is injective on each coset. By
averaging, there exists a coset
such that
|
Set ,
and define
accordingly.
Now
is injective and surjective onto .
This means there is an affine linear map
such that
for all .
□
Then do steps 3 and 4. □
What to do if you have lots of additive quadruples?
Balog-Szemeredi-Gowers
What to do if you have small doubling constant?
Freiman-Ruzsa (or Polynomial-Freiman-Ruzsa)
˙
Index
additive energy
additive quadruple
Bohr set
characteristic measure
Chernoff’s inequality
difference constant
dissociated
doubling constant
Parseval’s identity
Plancherel’s identity
Rusza distance
-large
spectrum of
sumset