1 Ramsey’s Theorem

Notation. ={1,2,}, [n]={1,2,,n} for a set X, r1, X(r)={AX,|A|=r}.

Given a 2-colouring of (2), are we guaranteed to have an infinite monochromatic set (i.e. M, M infinite such that the colouring is constant on M(2))?

Example.

  • (1) {i,j} red if i+j even, odd otherwise. Then M={n:n even} works.
  • (2) {i,j} red if max{n:2n|i+j} is even, blue otherwise. Then M={40,41,} works.
  • (3) {i,j} red if i+j has an even number of distinct prime divisors, and blue otherwise. No explicit M is known!

Theorem 1.1 (Ramsey’s Theorem for pairs). Assuming that:

  • (2) are 2-coloured (i.e. c:(2){1,2}).

Then there exists M infinite monochromatic.

Proof. Pick a1. Then there exists an infinite set A1 such that c(a1i)=c1 for all iA1. Pick a2A1 and find A2 (infinite) such that c(a2i)=c2 for all iA2. Keep on doing this. We end up with a1<a2<a3<<ak< and A1A2 such that c(aij)=ci for all jAi.

One colour appears infinitely many times ci1=ci2==cik==e. Now note M={ai1,ai2,ai3,} is a monochromatic set.

PIC

Remark.

  • (1) The same proof works for k colours. This is referred to as a “2-pass” proof. Alternatively: if we have colours 1,2,,k, then we can consider 1 to be red, and everything else to be blue. Then using the above result and induction, we get an alternative way to prove the theorem for greater than 2 colours.
  • (2) Infinite monochromatic is very different than arbitrarily large monochromatic.

    For example: suppose we write A1={1,2}, A2={3,4,5}, A3={6,7,8,9} and so on. Say {i,j} is red if there exists k such that i,jAk, and blue otherwise. Then there exist arbitrarily large monochromatic red sets, but no infinite monochromatic red set.

What about (r) with r=3?

Example. r=3, {i,j,k}, i<j<k red if and only if i|j+k. Then M={20,21,22,} is monochromatic.

Theorem 1.2 (Ramsey’s Theorem for r-sets). Assuming that:

  • (r) is finitely coloured.

Then there exists a monochromatic infinite set.

Proof. r=1 pigeonhole, r=2 is Theorem 1.1. Prove this by induction.

Assume it is true for r1. Given c:(r)[k], we must find M (infinite and monochromatic). Pick a1. Look at the r1 sets of {a1}. Define c:({a1})(r1)[k] via c(F)=c(F{a1}).

By induction there exists A1{a1} such that c is constant on it, say constantly equal to c1.

Now pick a2A1 and induce c:(A1{a2})(r1)[k] defined by c(F)=c(F{a2}). By induction there exists A2A1{a2} such that c is constant on it, say equal to c2.

Continuing this, we end up with a1,a2, and sets A1,A2, such that Ai+1Ai{ai+1} with c(F{ai})=ci for all FAi+1, |F|=r1.

Some colour must appear infinitely many times: say ci1=ci2=ci3==c. Check: M={ai1,ai2,} is monochromatic.

Example. Applications:

  • (1) In a totally ordered set, any sequence has a monotone subsequence.

    Proof. Let the sequence be x1,x2,. Say {i<j} is red if xixj, and blue otherwise. By Theorem 1.1, we may find M={i1<i2<} monochromatic. If M is red, then the sequence xi1,xi2,xi3, is increasing, and if M is blue then the sequence is strictly decreasing.

    PIC

  • (2) Using a slightly adjusted argument, we can insist that the function given by (ij,xij) is either concave or convex. We do this by: for a triple (ij1,ij2,ijk) we colour it convex or concave. Then apply Theorem 1.2.

From Theorem 1.2 we can deduce:

Theorem 1.3 (Finite Ramsey). Assuming that:

  • r1, k1, m1

Then there exists n such that whenever [n](r) is k-coloured, we can find a monochromatic set of size m.

Proof. Suppose not. Then for each n we can find cn:[n](r)[k] with no monochromatic m-sets. Note that there are only finitely many ways to k-colour [r](r). So infinitely many cn will agree on [r](r). Pick A1 such that for all nA1,

cn|[r](r)=dr:[r](r)[k].

We can do the same on [r+1](r) and produce some A2A1 such that cn|[r+1](r) is constant on A2.

Continuing this, we get AnAn1A1. They satisfy:

  • (1) There is no monochromatic m-set for any dn:[n](r)[k] (because dn=ci|[n](r)).
  • (2) These dn’s are nested: dj|[i](r)=di for j>i.

Finally: colour (r) via c(F)=dn(F), where n is any integer nmaxF. One can see that this is well defined, and gives a contradiction to Theorem 1.2.

Remark.

  • (1) This proof gives no bound on this n(k,m). There are other proofs that give some bounds.
  • (2) This is a “proof by compactness”: what we (essentially) showed is that {0,1} with the product topology is (sequentially) compact. If you prefer, the product topology can be thought of as the topology derived from the metric
    d(f,g)={0f=g1min{n:f(n)g(n)}otherwise.

What happens if we have c:(2)X with X being potentially infinite?

Theorem 1.4 (Canonical Ramsey Theorem). Assuming that:

  • (2) is coloured (possibly with an infinite number of colours)

Then there exists an infinite set M such that one of the following holds:
  • (i) c is constant on M.
  • (ii) c is injective on M.
  • (iii) c({i,j})=c({k,l}) if and only if i=k for i<j, k<l in M.
  • (iv) c({i,j})=c({k,l}) if and only if j=l, for all i<j, k<l.

Proof. We colour an element {i<j<k<l} of (4) as follows: We say that it is red if c(ij)=c(kl), and blue otherwise.

By Ramsey’s Theorem for r-sets, there exists an infinite set M1 that is monochromatic under this colouring.

  • (1) Suppose M1 is red. Then c is constant on M1.

    Let i<j, k<l. Pick m<n (in M1) bigger than all 4. Then i<j<m<n hence c(ij)=c(mn). Also, k<l<m<n, so c(kl)=c(mn). So c is constant on M1.

  • (2) Now let’s assume M1 is blue. So for i<j<k<l we have c(ij)c(kl). Next: colour M1(4) as follows: we will say that {i,j,k,l} is green if c(il)=c(jk), and purple otherwise. By Ramsey’s Theorem for r-sets we can pick infinite M2M1 monochromatic.

    We claim that M2 cannot be green. This is because if M2 is green, let i<j<k<l<m<n in M2. Then:

    • {i<j<k<n} gives us that c(in)=c(jk)

    • {i<l<m<n} gives us that c(in)=c(lm)

    But using these we get c(jk)=c(lm), which contradicts the fact that {i<j<k<l} is blue.

    Therefore M2 is purple: for i<j<k<l we have c(il)c(jk).

    Next we colour M2(4) as follows: {i<j<k<l} is orange if c(ik)=c(jl), and white otherwise. Again, by Ramsey’s Theorem for r-sets we can pick an infinite M3M2 such that it is monochromatic with respect to this colouring.

    We claim that M3 cannot be orange. If it is, then we again consider i<j<k<l<m<n:

    • {j<l<m<n} gives that c(jm)=c(ln)

    • {i<j<k<m} gives that c(jm)=c(ik)

    Hence c(ik)=c(ln), which contradicts the fact that {i<j<l<n} is blue.

    Therefore M3 is white. This finally tells us (using earlier working) that given any pair disjoint edges, the colours must be different.

    Now, we colour M3(3) via: {i<j<k} yellow if c(ik)=c(jk), and pink otherwise. By Ramsey’s Theorem for r-sets, there is an infinite M4M3 that is monochromatic.

    We claim that M4 is not yellow. If it is, then given i<j<k<l, we have c(ij)=c(jk)=c(kl), which contradicts blueness.

    Thus for any i<j<k in M4, we have c(ij)c(jk).

    Finally: we colour M4(3) with 4 colours, with {i<j<k} coloured according to:

    • turquoise if c(ij)=c(ik) and c(ik)=c(jk)

    • magenta if c(ij)=c(ik) and c(ik)c(jk)

    • cyan if c(ij)c(ik) and c(ik)=c(jk)

    • maroon if c(ij)c(ik) and c(ik)c(jk)

    By Ramsey’s Theorem for r-sets, there exists a monochromatic set M5M4. It cannot be turquoise because c(ij)=c(jk) contradicts M4. Then:

    • magenta implies case (iii)

    • cyan implies case (iv)

    • maroon implies (ii), i.e. injective.

Theorem 1.5. Assuming that:

  • (r) abitrarily coloured

Then we can find an infinite set M and I[r] such that for any x1<x2<<xr in M, and y1<y2<<yr in M we have c(x1x2xr)=c(y1y2yr) if and only if xi=yi for all iI.

Example. In the previous theorem:

  • (i) I=
  • (ii) I={1,2}
  • (iii) I={1}
  • (iv) I={2}

These 2r colourings are call the “canonical colourings” of (r).

Proof. Exercise. Note that this proof is examinable (because the ideas are exactly the same as those in the previous theorem).

1.1 Van der Waerden’s Theorem

We will colour .

Aim 1: Whenever we 2-colour , we find a monochromatic arithmetic progression of length m for any m.

The abbreviation A.P. can be used to mean “arithmetic progression”, i.e. a sequence of the form {a,a+d,,a+(m1)d}.

Aim 2: For any m, there exists n such that whenever [n] are 2-coloured, there exists a monochromatic arithmetic progression of length m.

This is equivalent to Aim 1, by using a proof by compactness argument like before:

If Aim 2 is not true, then we can find cn:[n]{1,2} such that infinitely many agree on {1}. Of those infinitely many agree on {2}, etc. Keep going (as before), and then get a colouring of without a monochromatic arithmetic progression of length m.

The other direction is easier.

We will show something a bit stronger (because it turns out to be easier): we will prove Aim 2 but with k colours.

This is in contrast with the earlier theorems, where the proofs were slightly easier to think about with just 2 colours.

Definition 1.6 (Focused). Let A1,A2,,Ak be arithmetic progressions of length m. Say

Ai={ai,ai+di,,ai+(m1)di}.

We say that these arithmetic progressions are focused if ai+mdi=f for all i[k].

If we have a colouring of and A1,,Ak are focused arithmetic progressions if each Ai is monochromatic but they all have different colours, then we call them colour-focused.

Example. {4,8} and {10,11} are focused at 12.

PIC

Theorem 1.7. Assuming that:

  • is k-coloured

Then we can find a monochromatic arithmetic progression of length 3 (equivalently, for any k we find an n that works).

Proof. Claim: For any rk there exists an n such that if [n] is k-coloured then either:

The proof is by induction on r.

Base case r=1 we can take n=k+1, since 2 numbers will have the same colour.

Suppose the result is true for r1 and let n be an n that satisfies the property in the claim.

We will show that N=(k2n+1)2n works for r. Let c:[(k2n+1)2n][k] be a colouring. We will split the ground set into k2n+1 blocks of length 2n. Call the blocks Bi.

If there exists a monochromatic arithmetic progression of length 3 in this colouring, then we are done. So assume not.

By the induction hypothesis, the first half of each Bi has r1 colour-focused arithmetic progressions of length 2. Because |Bi|=2n, each block also contains their focus.

For a set |M|=2n, there are exactly k2n ways to k-colour it. So there exists two blocks Bp and Bp+t that are identically coloured.

PIC

Let {ai,ai+di} be the r1 colour-focused arithmetic progressions in Bp. Then {ai+2nt,ai+di+2nt} are the corresponding ones in Bp+t. Let f be the focus in Bp, so therefore f+nt is the focus in Bp+t.

Now take {ai,ai+di+2nt} for i[r1], and {f,f+2nt}. Since ai+2di=f, we have ai+2di+4nt=f+4nt. So all r of these sequences are focused at f+4nt.

We know that {ai,ai+di+nt} and {f,f+2nt} are monochromatic by the choice of Bp, Bp+t. Why colour focused? {ai,ai+2nt} have different colours by induction hypothesis. Also, because c was assumed to have no monochromatic arithmetic progression of length 3, the colours of {f,f+2nt} must be different to all the colours of the above r1 arithmetic progressions of length 2. Thus we have r colour-focused arithmetic progressions of length 2 in [(k2n+1)2n].

Remark. The idea of looking at all the possible colouring of a set is referred to as the “product argument”.

The Van der Waerden number W(k,m) is the smallest n such that whenever [n] is k-coloured, there exists a monochromatic arithmetic progression of length m.

Proof above claims that W(k,3)kk...msupk4k for a tower of size k1. “tower-type bound”.

Theorem 1.8 (Van der Waerden). Assuming that:

  • k,m

Then there exists an n such that whenever we k-colour [n] we can find a monochromatic arithmetic progression of length m.

Recall that we defined W(k,m) to be the smallest n (if it exists) such that whenever [n] is k-coloured, there exists a monochromatic arithmetic progression of length m.

Proof. This will be by induction on m. For any k:

m=1 is trivial.

m=2 is pigeonhole.

m=3 is Theorem 1.7.

Assume that this is true for some m1 fixed, but for any k. In other words, W(k,m1) exists for all k1.

Claim: For every rk there exists n such that we always have one of the following:

When r=k we are done by looking at the focus. Now we prove the claim. We will prove it by induction on r.

For r=1 we can take n=W(k,m1).

Now assume that the result is true for r1 and that there does not exist a monochromatic arithmetic progression of length m. We will show that n works for r1, then W(k2n,m1)2n will work for r.

Aim: whenever we k-colour [W(k2n,m1)2n] we can find r colour-focused arithmetic progressions of length m1. Let B1={1,2,,2n}, B2={2n+1,,4n} etc, i.e. Bi=[2n(i1)+1,2ni] for i{1,,W(k2n,m1)}.

Let us look at the indices of these blocks. I colour i with k2n colours like so:

c(i)=(c(2n(i1)+1),c(2n(i1)+2),,c(2ni)).

We therefore colour [W(k2n,m1)] with k2n colours. By the definition of W, there exists a monochromatic arithmetic progression of length m1 (with respect to c). Say α,α+t,,α+(m2)t.

So the respective blocks Bα,Bα+2t,,Bα+(m2)t are identically coloured. Look at Bα. It has length 2n, so by induction Bα contains r1 colour-focused arithmetic progressions of length m1 together with their focus.

Let Ai={ai,ai+di,,ai+(m2)di} for i{1,,r1}. Let f be their focus. Look at Bi={ai,ai+di+2nt,ai+2di+4nt,,ai+(m2)di+2(m2)t}, i=1,,r1.

They are monochromatic because the blocks are identically coloured and the Bis are monochromatic. Since the colour of Ai is the colour of Bi and the Ais are colour-focused, we must have that the Bis have pairwise distinct colours.

Remember that the Ais are are focused at fi and the colour of fi is different than the colour of all the Ais. Note fi=ai+(m1)di.

Look at A={f,f+2nt,2+4nt,,f+2n(m2)t}. This is an arithmetic progression of length m1 and monochromatic and of a different colour from all of the Ais.

Enough to show ai+(m1)(di+2nt)=f+2n(m1)t for all i, which is equivalent to ai+(m1)di=f, which is true as all the Bis are focused at f.

Non-examinable: what about bounds?

We define the Ackermann hierarchy to be the seqeunce of functions

f1,f2,,fn:

by

f1(x)=2xfn+1(x)=fn(x)(1)=fn(fn(fn(1)))x times

Observe:

f2(x)=xf3(x)=222...msup2x timesf4(1)=2f4(2)=22=4f4(3)=2222=65536f4(4)=22...msup265536 times

These functions grow very fast.

We say that a function f: is of type n if there exists c,d such that

f(cx)fn(x)f(dx).

Our bound on W(k,3) was of type 3.

If you check our proof carefully, then W(k,m) (as a function of k) is bounded by a “type m” bound.

Define: W(m)=W(2,m). Then our proof gives a bound that grows faster than any fn.

Remark. This is often a feature of a double induction proof.

It was believed that W(m) does indeed grow this fast.

Shelah (1987) found a proof by just induction on m, and showed that W(k,m)f4(m+k).

A prize of $1000 was placed by Graham to show that W(m)f3(m).

Gowers (1998) showed that W(m)22222m+9, which is “almost type 2”.

The best lower bound is W(m)2m8m.

Corollary. Whenever is finitely coloured, there exists a colour class that contains arbitrarily long arithmetic progressions.

What about infinite monochromatic arithmetic progressions

No, for example:

Theorem 1.9 (Strengthened Van Waerden). Assuming that:

  • m,k

Then there exists n such that whenever [n] is k-coloured, there exists a monochromatic arithmetic progression of length m together with their common differences, i.e. the set {d,a,a+d,,a+(m1)d} is monochromatic.

Proof. By induction on the number of colours. k=1 is trivial.

Assume that the case for k1 colours is true. So there exists n that works for m and k1.

We will show that W(k,n(m1)+1) works for m and k. If [W(k,n(m1)+1) are k-coloured, then there exists a monochromatic arithmetic progression (say red) of length n(m1)+1, say a,a+d,,a+dn(m1).

If any of d,2d,,nd is red, then we are done: e.g. a,a+rd,,a+r(m1)d for some r[n].

If not, the set {d,2d,,nd} is k1-coloured.. This involves a (k1) colouring on [n], therefore there exist b,b+r,,b+(m1)r and r the same colour. This translates to db,db+r,,db+d(m1)r,dr monochromatic.

Remark. m=2 is known as Schur’s Theorem: we can always find a,a+d,d monochromatic (for finite colouring of ). In other words, there exists a monochromatic solution to x+y=z.

Can deduce Schur from Ramsey for pairs: c:[k] then we induce c:(2)[k] as follows: c(ij)=c(ji). By Ramsey’s Theorem for r-sets, there exists i<j<k such that c(ij)=c(ik)=c(jk). Then

c(jix)=c(kiz)=c(kjy).

Get x+y=z and x,y,z monochromatic.

1.2 The Hales-Jewett Theorem

Let X be a finite set and Xn is words of length n on the alphabet X.

Definition 1.10 (Combinatorial line). A combinatorial line in Xn is a set of the following form:

{(x1,,xn)Xn:I[n],aiX,I,xi=ai iI,xi=xj i,jI}.

Example. X=[3] and we want combinatorial lines in [3]2. If I={1}, we get:

L1={(1,1),(2,1),(3,1)}L2={(1,2),(2,2),(3,2)}L2={(1,3),(2,3),(3,3)}

If I={2} we get:

L4={(1,1),(1,2),(1,3)}L5={(2,1),(2,2),(2,3)}L6={(3,1),(3,2),(3,3)}

I={1,2}, then

L7={(1,1),(2,2),(3,3)}.

PIC

Example. [3]3, I={1} (1,2,3),(2,2,3),(3,2,3), or if I={1,3} (1,3,1),(2,3,2),(3,3,3).

Note. The definition of a combinatorial line is invariant under permutations of X.

Theorem 1.11 (The Hales-Jewett Theorem). Assuming that:

  • m,k

Then there exists n such that whenever we k-colour [m]n there exists a monochromatic combinatorial line.

Remark.

Exercise: Suppose you play Naughts and Crosses with m in a line and you play it in high enough dimensions. Show it cannot be a draw (assuming optimal play). Moreover, it is a first player win. Hint: Strategy stealing.

Definition 1.12. If I have a combinatorial line L in [m]n, then I can order xy if and only if xiyi for all i. Let L denote the first point in this ordering, and let L+ denote the last point in this ordering.

Definition 1.13. Let L1,,Lk be combinatorial lines. We call them focussed if Li+=f for all i[k]. For a fixed colouring, they are colour focused if they are focused and Li{Li+} monochromatic for each i, and they have different colours.

Example. [4]2

PIC

are 3 colour-focused combinatorial lines [4]2.

Proof of The Hales-Jewett Theorem. The proof is by induction on the size of the alphabet, i.e. m.

m=1 is trivially true.

Assume m1 and assume HJ(m1,k) exists for all k.

Claim: for every 1rk there exists nsuch that in [m]n

Then we are done by r=k and looking at the focus.

Now we prove the claim:

r=1: look in [m1]n[m]n We can take HJ(m1,k).

Now assume r>1 and that n is suitable for r1. We will show that n+HJ(m1,kmn) is suitable for r. Let N=HJ(m1,kmn) for convenience.

Need: given c a k-colouring of [m]n+N with no monochromatic combinatorial lines, we can find r colour-focused combinatorial lines. Look at [m]n+N as [m]n×[m]N, with [m]n={a1,,amn}.

Let us colour [m]N as follows:

c:[m]Nkmnc(b)=(c(a1,b),c(a2,b),,c(amn,b)).

By The Hales-Jewett Theorem there exists a combinatorial lines L with active coordinates I such that

c(a,b)=c(a,b)a[m]n,b,bL{L+}.

But now this induces c:[m]nk where c(a)=c(a,b) for any bL{L+}. By the definition of n, there exist r1 colour-focused combinatorial lines (for c) L1,L2,,Lr1, focused at f, and with active coordinates I1,I2,,Ir1.

Finally: look at the combinatorial lines that start at Li×L and active coordinates IiI. These give r1 combinatorial lines, and the combinatorial lines that starts at f×L with active coordinates I. All focused at f×L+. Then done.

Definition 1.14 (d-dimensional space). A d-dimensional space SXn or a d-point parameter set is a set such that there exists I1,I2,,Id[n] disjoint and aiX i[n](I1Id), and xS if and only if:

  • xi=ai for all i[n](I1Id)

  • xi=xi if i,jId for some l[d]

Theorem 1.15 (The Extended Hales-Jewett Theorem). Assuming that:

  • m,k,d positive integers

Then there exists n such that whenever [m]n is k-coloured, there exists a d-point parameter set monochromatic.

Proof. In Xnd=(Xd)n=Yn what does a combinatorial line in Yn look like?

PIC

So a monochromatic line in Yn is a monochromatic d-point parameter set in Xnd.

Letting n=dHJ(md,d) works.

Definition 1.16 (Homothetic copy). Let S be a finite set of points in Xn. A homothetic copy of S is a set of the form x+λS.

Theorem 1.17 (Gallai’s Theorem). Assuming that:

  • finite Sd

  • k-colouring of d

Then there exists a monochromatic homothetic copy of S.

Proof. S={S1,,Sm}. Let c:dk be a colouring.

We colour [m]n (for n large enough) as follows:

c((x1,,xn))=c(Sx1+Sx2++Sxm).

By The Hales-Jewett Theorem, there exists a monochromatic combinatorial line in [m]n with active coordinates I. Then

c(inactiveSi+|I|Sj)

has the same colour for all j[m].

Done as this is a copy of S translate by inactiveSi, and dilation factor |I|.

Remark.