1 The regularity lemma and applications

Theorem 1.1 (Szemerédi). For all 𝜀>0, there exists k=k(𝜀) such that the vertex set of any sufficiently large graph G=(V,E) can be partitioned into V1,,Vs, sk such that for all but an 𝜀-proportion of pairs (Vi,Vj), G(Vi,Vj) is 𝜀-regular.

PIC

Remark.

  • 𝜀-regular means: looks like a “random” graph. We will define it more thoroughly at the end of the lecture.

  • In Theorem 1.1, it is also possible to ensure that all the Vi are almost the same size.

  • k is known to have a bad dependence on 𝜀: we have

    222...msup2,

    where the tower of exponentials is of size 𝜀C.

Theorem 1.2 (Removal). For all δ>0, there exists η>η(δ) such that the following holds. If G is a graph on n vertices with at most ηn3 triangles, then it is possible to remove δn2 edges to make it triangle-free.

Sketch proof. Apply Theorem 1.1 with 𝜀=δ4 to obtain V1,,Vs, sk=k(δ) of almost equal size.

PIC

Remove all edges between Vi and Vj where G(Vi,Vj) fails to be 𝜀-regular, or where the density of G(Vi,Vj)2𝜀. The number of edges removed in this way is 𝜀n2+2𝜀n2<δn2.

Claim: The “reduced” graph is triangle-free.

Indeed, if x,y,z form a triangle in G, then (x,y,z) must lie in Vi×Vj×Vk with G(Vi,Vj), G(Vi,Vk), G(Vj,Vk) all regular and dense (density 2𝜀). Hence we in fact have 𝜀3(nk)3=δ326n3k(δ) triangles in G. This is a contradiction if η<δ326k(δ).

Theorem 1.3 (Corners). For all α>0, there exists N0=N0(α) such that for all NN0, the following holds. Let A[N]2 of density α (|A|N2=α). Then A contains a triple of the form (x,y), (x+d,y), (x,y+d) with d>0.

PIC

Remark. The theorem as stated can be fairly easily deduced from a version where we only ask that d0. To do this: note that if A is symmetric, then existence of a triangle with d<0 implies existence of one with d>0. Then one can “make A symmetric” (in exchange for a loss in density) by intersecting A with a reflection of A through a suitably chosen point.

Proof. Let

X={vu={(x,y):x=u}:u[N]}Y={ht={(x,y):y=t}:t[N]}Z={ds={(x,y):x+y=s}}

We define a tripartite graph on parts X,Y,Z, where vertices are joined by an edge if and only if the intersection of the corresponding lines lies in A. A triangle in this graph corresponds to three points

(u,t),(u,su),(st,t)A.

Setting d=sut, these points are (u,t), (u,t+d), (u+d,t).

PIC

If A contains no corner with d0, then the only triangles in this graph are the degenerate ones (d=0). There are αN2=|A| many of these, and they are edge disjoint. Pick δ=α2, then by triangle-removal, there exists η=η(α) such that we can destroy ηN3 triangles by removing at most δN2 edges.

If αN2<ηN3, then should be able to remove all triangles. But this is a contradiction since all the triangles are edge disjoint.

Theorem 1.4 (Roth). For all α>0, there exists N0=N0(α) such that for all NN0, every A[N] of density α (=|A|N) contains a non-trivial 3-AP (a triple x,x+d,x+2d).

Sketch proof. Let B={(x,y)[N]2:xyA}. By Theorem 1.3, B contains (x,y),(x,y+d),(x+d,y) with d0. Then xy,x(y+d)=xy+d,x+dy=xy+dA.

Proposition 1.5. Let X,Y be sets, and f:X×Y[1,1]. Then the following are equivalent:

  • (i) For any pair of sets XX, YY,
    𝔼xX,yYf(x,y)c1|X||X||Y||Y|,

    where 𝔼xX=1|X|xX.

  • (ii) For any two functions u:X[1,1], v:Y[1,1],
    |𝔼xX,yYf(x,y)u(x)v(y)|c2.
  • (iii) 𝔼x,xX𝔼y,yYf(x,y)f(x,y)f(x,y)f(x,y)c3.

Proof.

  • (ii) (i) Obvious (let u=𝟙X, v=𝟙Y). c1=c2.
  • (i) (ii) Example sheet. c2=c112.
  • (ii) (iii) Suppose (iii) is false, i.e.
    𝔼x,xX𝔼y,yYf(x,y)f(x,y)f(x,y)f(x,y)>c2,

    with c3=c2. We can rewrite as:

    𝔼xX,yY𝔼xX,yYf(x,y)f(x,y)v(y)f(x,y)f(x,y)u(x)>c2.

    So there exist xX, yY such that

    𝔼xX,yY𝔼xX,yYf(x,y)v(y)u(x)>c2,

    contradiction. c3=c2

  • (iii) (ii) |𝔼xX𝔼yYf(x,y)u(x)v(y)|2=|𝔼yYv(y)𝔼xXf(x,y)u(y)|2𝔼yY|𝔼xXf(x,y)u(x)|2=𝔼xXu(x)u(x)𝔼yYf(x,y)f(x,y)

    So

    |𝔼xX𝔼yYf(x,y)u(x)v(y)|4𝔼x,xX|𝔼yYf(x,y)f(x,y)|2c3

    c2=c314

Definition 1.6. Say f:X×Y[1,1] is 𝜀-quasirandom if Proposition 1.5(iii) holds with c3=𝜀4. Say a graph G on X×Y of density δ>0 is 𝜀-quasirandom if its balanced function f(x,y)=𝟙G(x,y)δ is 𝜀-quasirandom (note that 𝔼f(x,y)=0).

Proposition 1.7 (Counting Lemma). Let G be a tripartite graph on vertex sets X,Y,Z. Suppose G(X,Y),G(X,Z),G(Y,Z) are 𝜀-quasirandom, with densities δXY,δXZ,δYZ respectively. Then

|𝔼xX𝔼yY𝔼zZ𝟙G(x,y)𝟙G(x,z)𝟙G(y,z)δXYδXZδYZ()|4𝜀.

Proof.

()=𝔼xX𝔼yY𝔼zZ(fXY(x,y)+δXY)(fXZ(x,z)+δXZ)(fYZ(y,z)+δYZ)δXYδXZδYZ=seven terms

For example,

𝔼xX𝔼yY𝔼zZfXY(x,y)fXZ(x,z)fYZ(y,z)=𝔼xZ𝔼xX,yYfXY(x,y)uz(x)vz(y)𝜀,

by (iii) (ii). Similarly for the other terms, and hence we get ()7𝜀.

We can improve it to ()4𝜀 by noticing that 3 of the terms are zero (any of the terms of the form fδδ).

Definition 1.8. Suppose X=X1Xn and Y=Y1Ym. Then the mean square density (msd) of a bipartite graph G on X×Y relative to this partition is

i=1nj=1m|Xi||Yj||X||Y|d(G(Xi,Yj))2,

where d(G(Xi,Yj)) is the density of G(Xi,Yj).

Example. With respect to the trivial partition, msd(G)=d(G(X,Y))2.

Lemma 1.9. Let G be a bipartite graph on X×Y of density δ>0, and suppose it fails to be 𝜀-quasirandom. Then there are partitions X=X1X2, Y=Y1Y2 such that the msd of G relative to the new partition is δ2+(𝜀2)8.

Proof. Proposition 1.5(i) (iii) provides us with X1X, Y1Y such that

𝔼xX1,yY1f(x,y)>𝜀412|X||X1||Y||Y1|.

Let X2=XX1, Y2=YY1.

The msd relative to the new partition is

i=12j=12|Xi||Yi||X||Y|d(G(Xi,Yj))2,

where

d(G(Xi,Yj))=𝔼xXi,yYj𝟙G(x,y)=𝔼xXi,yYj(f(x,y)+δ)=δ+φ(Xi,Yj).

Know: φ(X1,Y1)>𝜀412|X||X1||Y||Y1|. So

i=12j=12|Xi||Yj||X||Y|(δ2+2δφ(Xi,Yj)+φ(Xi,Yj)2)δ2+0+i=12j=12|Xi||Yj||X||Y|φ(Xi,Yj)2δ2+(𝜀2)8

Note. “Hypergraph” in these lectures will usually mean 3-uniform hypergraph.