5 The KKL theorem and Friedgut’s junta theorem

A dictator was a single variable that dictates the function. A junta is a small set of variables that dictate the function.

We view {1,1}n as a graph, where a pair of vertices is connected by an edge if they differ in a single coordinate.

The edge boundary of a subset A of a graph is the set of edges with one vertex in A and one in Ac. We denote the edge boundary by A.

Theorem 5.1 (The edge-isoperimetric inequality in the cube – approximate version). Let A{0,1}n. Then

|A||A|(nlog2|A|).

Remark. Sharp for subcubes of dimension k.

Proof. Let 𝜃A be the set of internal edges of A – i.e. edges ab with a,bA. Then n|A|=2|𝜃A|+|A|, so the theorem is equivalent to showing that

|𝜃A|12|A|log2|A|

for every A.

Induction on n. If n=1 the result is true (3 cases to check).

Let A0={xA:xn=0}, A1={xA:xn=1}. Then

|𝜃A||𝜃A0|+|𝜃A1|+min{|A0|,|A1|}12|A0| log 2|A0|+12|A1| log 2|A1|+min{|A0|,|A1|}

by induction hypothesis. Want 12(|A0|+|A1|)log(|A0|+|A1|). So it’s enough to prove

12xlog2x+12y log 2y+x12(x+y) log 2(x+y)

whenever 0xy, or equivalently

xlogx+y log y+2x log 2(x+y) log (x+y).

If x=y, we need

2xlogx+2x log 22x log (2x),

which is indeed true.

Now differentiate with respect to y. Left hand side becomes logy+1, and right hand side becomes log(x+y)+1. Since x0, the left hand derivative is to the right hand derivative.

Remark. If |A|=2n1, this tells us that the edge-boundary is minimised by a half space. Let f:{1,1}n{1,1} be a function with 𝔼f=0, and therefore Varf=1. If I(f)=1, then A|A|f^(A)2=1, but f^()=0 and also Af^(A)2=1 by Parseval, so equality holds only if f is linear, so f is a dictator. Similarly, if equality almost holds, then by FKN f is almost a dictator.

The above remark says that to minimise I(f), the best thing is to have a dictator: just one variable contributing to I(f). What if we forbid this, for example by asking that each variable has the same influence?

Might guess that taking majority vote is best for this, but as mentioned before this has I(f)n. It turns out the following is much better:

The “tribes” function of Ben–Or and Linial: Let k,m. Let n=km, write [n]=A0Ai, |Ai|=k. Define f(x)=1 if and only if there exists i such that xj=1 for every jAi.

This achieves I(f)=clognn (once we optimise k and m).

Theorem 5.2 (The Kahn–Kalai–Linial edge-isoperimetric inequality). Let f:{1,1}n{1,1} be a Boolean function. Then there exists i such that Infif9Ĩ(f)29Ĩ(f)29Ĩ(f), where Ĩ(f)=I(f)Varf.

Proof. We shall obtain upper and lower bounds or i=1nStab13(Dif) (magical idea).

Upper bound: iStab13(Dif)iDif432. But Dif4343=Infif, so

iDif432=i(Infif)32(maxiInfif)12I(f).

Lower bound: iStab13(Dif)=iA(13)|A|Dif^(A)2. But

Dif^(A)={f^(A{i})iA0iA

(using the formula Dif^=Bif^(B)xB{i}). So

iA(13)|A|Dif^(A)=iAi(13)|A|f^(A{i})2=B(13)|B|1|B|f^(B)23VarfB(13)|B|f^(B)2Varf

But

Bf^(B)2=f22f^(0)2=𝔼f2(𝔼f)2=Varf,

so Bf^(B)2Varf=1.

The function x(13)x is convex, so by Jensen,

B(13)|B|f^(B)2Varf(13)B|B|f^(B)2Varf=(13)Ĩ(f).

Therefore,

(maxiInfif)12I(f)3Varf(13)Ĩ(f),

which rearranges to the result.

Corollary 5.3 (The KKL theorem). Let f:{1,1}n{1,1}. Then there exists i such that InfifclognnVarf (for some absolute constant c>0).

Proof. If Ĩ(f)clogn, then the result follows trivially by averaging.

Otherwise, by ?? 35, there exists i such that Infi(f)9(clogn)29clogn. For small enough c, that’s much bigger than clognn.

This shows that the “tribes” example from last lecture is the best possible.

Motivation for why Stab is natural to define in order to tackle the The KKL theorem: We can assume λI(f)cλlogn. Then eλI(f)ncλ. LHS is eλA|A|f^(A)2, and by Jensen we get Af^(A)2eλ|A|=Stabeλf. So Stab comes up somewhat naturally.

Definition 5.4 (Junta). Let f:{1,1}n, and let J[n]. Then f is a J-junta if f(x) depends only on x|J. We also say that J is a junta. f is a k-junta if f is a J-junta for some J of size k.

Friedgut’s junta theorem states that a Boolean function with small total influence can be approximated by a m-junta for some small m.

Theorem 5.5 (Friedgut’s junta inequality). Let f:{1,1}n{1,1}, let 𝜀>0, and let k. Suppose that f(k)221𝜀. Then there exists an m-junta g:{1,1}n such that fg22𝜀 with m32kI(f)3𝜀2.

The proof has some similarities with ?? 35, so we include less detail in this proof.

Proof. Let τ>0 a constant to be chosen later and let

J={i[n]:Infifτ}.

This time we estimate iJStab13(Dif).

iJStab13(Dif)iJ(Infif)32τ12I(f).

(The first inequality is proved using the same technique as in ?? 35.)

In the other direction,

iJStab13(Dif)=iJ(13)|A|f^(A{i})2=3B|BJ|(13)|B|f^(B)23BJ(13)|B|f^(B)2

But

3BJ|B|k(13)|B|f^(B)23BJ|B|k(13)kf^(B)2=(13)k1BJ|B|kf^(B)2.

Let g=BJ|B|kf^(A)xA. Then

Then

fg22AJ|A|kf^(A)2+|A|>kf^(A)2.

By hypothesis,

|A|>kf^(A)2=f(>k)22c.

But

3BJ|BJ|(13)|B|f^(B)23BJ|B|k(13)kf^(B)2=3(k1)AJ|A|kf^(A)2.

So

AJ|A|kf^(A)23k1τ12I(f).

If we choose τ=𝜀232kI(f), then this is at most 𝜀, so fg222𝜀. But |J|I(f)τ=32kI(f)3𝜀2.

Corollary 5.6 (Friedgut’s junta theorem). Let f:{1,1}n{1,1} and let 𝜀>0. Then there exists an m-junta g:{1,1}n with fg222𝜀 and

m=exp(O(I(f)𝜀)).

Proof.

I(f)=A|A|f^(A)2|A|>k|A|f^(A)2kf(>k)22

So if kI(f)𝜀, then f(k)221𝜀. By Theorem 5.5 we get a junta with m32I(f)𝜀I(f)3𝜀2=exp(O(I(f)𝜀)).

Remark. Let f:{1,1}n{1,1}, g:{1,1}n and suppose that fg22𝜀. Let

h(x)={1g(x)01g(x)<0

Then if h(x)f(x), then g(x) has a different sign from f(x), so |g(x)f(x)|21. Since 𝔼|f(x)g(x)|2𝜀, we have [f(x)h(x)]𝜀.

In other words, we can find a Boolean function that approximates f, and if g is a J-junta, then so is h.