4 Shearer’s lemma and applications

Notation. Given a random variable X=(X1,,Xn) and A={a1,,ak}[n] with a1<a2<<ak, write XA for the random variable (Xa1,Xa2,,Xak).

Lemma 4.1 (Shearer). Assuming that:

  • X=(X1,,Xn) a random variable

  • A a family of subsets of [n] such that every i[n] belongs to at least r of the sets AA

Then
H[X1,,Xn]1rAAH[XA].

Proof. For each a[n], write X<a for (X1,,Xa1).

For each AA, A={a1,,ak} with a1<<ak, we have

H[XA]=H[Xa1]+H[Xa2|Xa1]++H[Xak|Xa1,,Xak]H[Xa1|X<a1]+H[Xa2|X<a2]++H[Xak|X<ak](Lemma 1.15)=aAH[Xa|X<a]

Therefore,

AAH[XA]AAaAH[Xa|X<a]ra=1nH[Xa|X<a]=rH[X]

Alternative version:

Lemma 4.2 (Shearer, expectation version). Assuming that:

  • X=(X1,,Xn) a random variable

  • A[n] a randomly chosen subset of [n], according to some probability distribution (don’t need any independence conditions!)

  • for each i[n], [iA]μ

Then
H[X]μ1𝔼AH[XA].

Proof. As before,

H[XA]aAH[Xa|X<a].

So

𝔼AH[XA]𝔼AaAH[Xa|X<a]μa=1nH[Xa|X<a]=μH[X]

Definition (PA). Let En and let A[n]. Then we write PAE for the set of all uA such that there exists v[n]A such that [u,v]E, where [u,v] is u suitably intertwined with v (i.e. uv as functions).

Corollary 4.3. Assuming that:

  • En

  • A a family of subsets of [n] such that every i[n] is contained at least r sets AA

Then
|E|AA|PAE|1r.

Proof. Let X be a uniform random element of E. Then by Shearer,

H[X]1rAAH[XA].

But XA tkaes values in PAE, so

H[XA]log|PAX,

so

log|E|1rA log |PAE|.

If A={[n]{i}:i=1,,n} we get

|E|i=1n|P[n]{i}E|1n1.

This case is the discrete Loomis-Whitney theorem.

Theorem 4.4. Assuming that:

  • G a graph with m edges

Then G has at most (2m)326 triangles.

Is this bound natural? Yes: if m=n2, and we consider a complete graph on n vertices, then we get approximately (2m)236 triangles.

Proof. Let (X1,X2,X3) be a random ordered triangle (without loss of generality G has a triangle so that this is possible).

Let t be the number of triangles in G. By Shearer,

log(6t)=H[X1,X2,X3]12(H[X1,X2]+H[X1,X3]+H[X2,X3]).

Each edge H[Xi,Xj] is supported in the set of edges G, given a direction, i.e.

12(H[X1,X2]+H[X1,X3]+H[X2,X3])32log(2m).

Definition. Let X be a set of size n and let G be a set of graphs with vertex set X. Then G is Δ-intersecting (read as “triangle-intersecting”) if for all G1,G2G, G1G2 contains a triangle.

Theorem 4.5. Assuming that:

Then G has size at most 2n22.

Proof. Let X be chosen uniformly at random from G. We write V(2) for the set of (unordered) pairs of elements of V. Think of any GG as a function from V(2) to {0,1}. So X=(Xe:eV(2)).

For each RV, let GR be the graph KRKVR

PIC

For each R, we shall look at the projection XGR, which we can think of as taking values in the set {GGR:GG}=:GR.

Note that if G1,G2G, R[n], then G1G2GR, since G1G2 contains a triangle, which must intersect GR by Pigeonhole Principle.

Thus, GR is an intersecting family, so it has size at most 2|E(GR)|1. By Shearer, expectation version,

H[X]2𝔼RH[XGR](since each e belongs to GR with probability 12)2𝔼R(|E(GR)|1)=2(12m21)=n22

Definition (Edge-boundary). Let G be a graph and let AV(G). The edge-boundary A of A is the set of edges xy such that yA.

If G=n or {0,1}n and i[n], then the i-th boundary iA is the set of edges xyA such that xy=±ei, i.e. iA consists of deges pointing in direction i.

Theorem 4.6 (Edge-isoperimetric inequality in Zn). Assuming that:

  • An a finite set

Then |A|2n|A|n1n.

Proof. By the discrete Loomis-Whitney inequality,

|A|i=1n|P[n]{i}A|1n1=(i=1n|P[n]{i}A|1n)nn1(1ni=1n|P[n]{i}A|)nn1

But |iA|2|P[n]{i}A| since each fibre contributes at least 2.

So

|A|(12ni=1n|iA|)nn1=(12n|A|)nn1

Theorem 4.7 (Edge-isoperimetric inequality in the cube). Assuming that:

  • A{0,1}n (where we take the usual graph)

Then |A||A|(nlog|A|).

Proof. Let X be a uniform random element of A and write X=(X1,,Xn). Write Xi for (X1,,Xi1,Xi+1,,Xn). By Shearer,

H[X]1n1i=1rH[Xi]=1n1i=1nH[X]H[Xi|Xi]

Hence

i=1nH[Xi|Xi]H[X].

Note

H[Xi|Xi=u]={1|P[n]{i}1(u)|=20|P[n]{i}1(u)|=1

The number of points of the second kind is |iA|, so H[Xi|Xi]=1|iA||A|. So

H[X]i=1n(1|iA||A|)=n|A||A|

Also, H[X]=log|A|. So we are done.

Definition (Lower shadow). Let A be a family of sets of size d. The lower shadow A is {B:|B|=d1,AA,BA}.

Notation. Let h(x)=xlog1x+(1x) log 11x (for x[0,1]).

Theorem 4.8 (Kruskal-Katona). Assuming that:

  • |A|=td=t(t1)(td1)d! for some real number t

Then |A|td1.

Proof. Let X=(X1,,Xd) be a random ordering of the elements of a uniformly random AA. Then

H[X]=log(d!td).

Note that (X1,,Xd1) is an ordering of the elements of some BA, so

H[X1,,Xd1]log((d1)!|A|).

So it’s enough to show

H[X1,,Xd1]log((d1)!td1).

Also,

H[X]=H[X1,,Xd1]+H[Xd|X1,,Xd1]

and

H[X]=H[X1]+H[X2|X1]++H[Xd|X1,,Xd1].

We would like an upper bound for H[Xd]X<d. Our strategy will be to obtain a lower bound for H[Xk|X<k] in terms of H[Xk+1|X<k+1]. We shall prove that

2H[Xk|X<k]2H[Xk+1|X<k+1]+1k.

Let T be chosen independently of X1,,Xk1 with

T={0probability p1probability 1p

(p will be chosen and optimised later).

Given X1,,Xk1, let

X={Xk+1T=0XkT=1

Note that Xk and Xk+1 have the same distribution (given X1,,Xk1), so X does as well. Then

H[Xk|X<k]=H[X|X<k]H[X|Xk](Submodularity)=H[X,T|Xk](Xk and X determine T)=H[T|Xk]+H[X|T,Xk](additivity)=H[T]+pH[Xk+1|X1,,Xk]   +(1p)H[Xk|X1,,Xk]=h(p)+ps

where h(p)=plog1p+(1p) log 11p and s=H[Xk+1|X1,,Xk].

It turns out that this is maximised when p=2s2s+1. Then we get

2s2s+1(log(2s+1) log 2s)+ log (2s+1)2s+1+s2s2s+1= log (2s+1).

This proves the claim.

Let r=2H[Xd|X1,,Xd1]. Then

H[X]=H[X1]++H[Xd|X1,,Xd1]logr+ log (r+1)++ log (r+d1)=log((r+d1)!(r1)!)=log(d!r+d1d)

Since H[X]=log(d!td), it follows that

r+d1t,rt+1d.

It follows that

H[X1,,Xd1]=log(d!td) log rlog(d!t!d!(td)!(t+1d))=log((d1)!td1)