Intersecting families - Analysis of Boolean Functions

7 Intersecting families

A family $A$ of subsets of $[n]$ is intersecting if $A \cap B \neq \emptyset$ for every $A, B \in A$ . Since we can’t include both $A$ and $A^{c}$ , $| A | \leq 2^{n - 1}$ for any intersecting family $A$ . Equality holds if $A = {A \subset [n] : i \in A}$ for some $i$ . If $n$ is odd, another example is ${A \subset [n] : | A | > n ∕ 2}$ . If $n$ is even, we can take ${A \subset [n] : | A | > n ∕ 2}$ and add in exactly one of $A$ and $A^{c}$ for each $A \in {[n]}^{(n ∕ 2)}$ .

There exist plenty of other extremal examples (e.g. $A$ is extremal for $m$ , then $B = {B \subset [n] : | B \cap [m] | \in A}$ is extremal for $n$ ).

Notation. $[n] = {1, \dots, n}$ , $S^{(r)} = {A \subset S : | A | = r}$ .

What if we look at families of sets of size $r$ for some given $r$ ?

If $r > \frac{n}{2}$ we can take all of ${[n]}^{(r)}$ , so we get $(\binom{n}{r})$ . If $r = \frac{n}{2}$ , we can take one set from each ${A, A^{c}}$ to get $\frac{1}{2} (\binom{n}{r})$ . When $r < \frac{n}{2}$ , it gets more interesting.

Theorem 7.1 (Erdős–Ko–Rado Theorem). Let $r, n \in ℕ$ with $r < \frac{n}{2}$ . Let $A \subset {[n]}^{(r)}$ be an intersecting family. Then $| A | \leq (\binom{n - 1}{r - 1})$ , with equality if and only if $A$ is of the form ${A \in {[n]}^{(r)} : i \in A}$ .

Proof (Katona). Let $A \subset {[n]}^{(r)}$ be an intersecting family. We need to prove that if $A \subset {[n]}^{(r)}$ is chosen uniformly, then

ℙ [A \in A] \leq \frac{r}{n} (= \frac{(\binom{n - 1}{r - 1})}{(\binom{n}{r})}) .

Let $x_{1}, x_{2}, \dots, x_{n}$ be a random cyclic ordering of ${1, 2, \dots, n}$ . An interval in this cyclic ordering is a set of the form ${x_{i}, x_{i + 1}, \dots, x_{i + r - 1}}$ for some $i$ (addition modulo $n$ ).

There are $n$ intervals, so we will be done if we can prove that at most $r$ of them belong to $A$ . (That is because we can choose a random element of ${[n]}^{(r)}$ by first choosing a random cyclic ordering and then choosing a random interval in that ordering.)

For each $i$ , let $I_{x_{i}}^{+}$ be the interval ${x_{i + 1}, x_{i + 2}, \dots, x_{i + r}}$ and let $I_{x_{i}}^{-}$ be the interval ${x_{i - r + 1}, \dots, x_{i}}$ .

Without loss of generality ${x_{1}, \dots, x_{r}} \in A$ . Then any other interval in $A$ must be $I_{x_{i}}^{+}$ or $I_{x_{i}}^{-}$ for some $i \in {1, \dots, r - 1}$ . We can’t have both. So at most $r - 1$ more intervals.

In the equality case, we must have equality for every cyclic ordering. In the above argument, we cannot have $i$ with $I_{x_{i}}^{-} \in A$ and $I_{x_{i + 1}}^{+} \in A$ . Therefore, if we have $r$ sets, then there must exist $t$ such that the intervals are $I_{x_{1}}^{+}, \dots, I_{x_{t - 1}}^{+}, I_{x_{t}}^{-}, \dots, I_{x_{r - 1}}^{-}$ , together with ${x_{1}, \dots, x_{r}}$ . So there exists $t$ such that they all contain $x_{t}$ .

Now let us show that $A = {A \in {[n]}^{(r)} : x_{t} \in A}$ . Without loss of generality $t = r$ so our intervals are ${x_{i}, \dots, x_{i + r - 1}}$ for $i = 1, 2, \dots r$ . Let $u = x_{n}$ . Define a cyclic ordering that goes like this:

\begin{array}{l} p_{1} : & u, then \\ p_{2} : & elements of {x_{1}, \dots, x_{r - 1}} ∖ A, then \\ p_{3} : & elements of {x_{1}, \dots, x_{r - 1}} \cap A, then \\ p_{4} : & x_{r}, then \\ p_{5} : & elements of A ∖ {x_{1}, \dots, x_{r}}, then \\ p_{6} : & the rest \end{array}

In this cyclic ordering, ${u} \cup I_{u}^{+} ∖ {x_{r}} = (p_{1}, p_{2}, p_{3}) = {x_{n}, x_{1}, \dots, x_{r - 1}} \notin A$ (sinc it is an interval in the other cyclic ordering). Next, note $I_{u}^{+} = (p_{2}, p_{3}, p_{4}) = {x_{1}, \dots, x_{r}} \in A$ . So by the property we have proved (that in any cyclic ordering we have $r$ consecutive intervals), $A = (p_{3}, p_{4}, p_{5}) \in A$ since $A$ is also an interval.

The proof is complete. □

Our aim now is to prove a theorem of Friedgut and Dinur that shows that if $0 < p < \frac{1}{2}$ and $n$ is sufficiently large, then for every intersecting family $A$ of sets of size $p n$ , there exists a subset $J \subset [n]$ and an intersecting family $B \subset P (J)$ such that very few sets in $A$ do not contain a set in $B$ . “Very few” here means relative to $(\binom{n}{p n})$ (rather than being relative to $| A |$ ).

Note. In this section, Boolean functions are from ${0, 1}^{n}$ to ${0, 1}$ , and the $μ_{p}$ -biased measure gives probability $p$ to $x_{i} = 1$ and $q$ to $x_{i} = 0$ .

Definition 7.2. A Boolean function $f : {0, 1}^{n} \to {0, 1}$ is $(𝜀, p, r)$ -quasirandom if for every $J \subset [n]$ of size $r$ , and every $u \in {0, 1}^{J}$ ,

| 𝔼 [f^{(p)} | x |_{J} = u] - 𝔼 f^{(p)} (x) | \leq 𝜀 .

Notation. Let $f : {0, 1}^{n} \to ℝ$ . Let $J \subset [n]$ and let $n \in {0, 1}^{J}$ .

Define $f_{u} : {0, 1}^{[n] ∖ J} \to ℝ$ by $f_{u} (y) = f (x)$ , where $x |_{J} = u$ , $x |_{J^{c}} = y$ .

Define the averaging projection $E_{J}$ by $E_{J} f^{(p)} (x) = 𝔼 f_{u}^{(p)}$ , where $u = x |_{J}$ . Note that $E_{J} f$ depends only on the coordinates in $J$ .

Lemma 7.3. Each $E_{J}$ is an orthogonal projection, and if $J \subset K$ , then $E_{J} E_{K} = E_{J}$ .

Proof. Since ${(f_{u})}_{u} = f_{u}$ , we have $E_{J}^{2} = E_{J}$ , so $E_{J}$ is a projection. Now we would like to show that

⟨ f - E_{J} f, E_{J} f ⟩ = 0

for every $f$ , or equivalently that $⟨ f, E_{J} f ⟩ = ⟨ E_{J} f, E_{J} f ⟩$ . Since $E_{J} f = E_{J}^{2} f$ , it is enough to prove that $E_{J}$ is self-adjoint.

But

\begin{array}{l} ⟨ E_{J} f, g ⟩ & = 𝔼_{x} E_{J} f (x) g (x) \\ = 𝔼_{u \in {0, 1}^{J}} 𝔼_{y \in {0, 1}^{J^{c}}} (𝔼 f_{u}) g_{u} (y) \\ = 𝔼_{u} (𝔼 f_{u}) 𝔼_{y} g_{u} (y) \\ = 𝔼_{u} (𝔼 f_{u}) (𝔼 g_{u}) \\ = 𝔼_{x} 𝔼_{J} f (x) 𝔼_{J} g (x) \\ = ⟨ E_{J} f, E_{J} g ⟩ \\ = ⟨ f, E_{J} g ⟩ & (the last step by symmetry) \end{array}

This proves that $E_{J}$ is self-adjoint, which is interesting property, but we in fact we didn’t need to write down the last line in the calculation above if all we want to deduce is that $⟨ f, E_{J} f ⟩ = ⟨ E_{J} f, E_{J} f ⟩$ .

Now let $J \subset K$ and let $L = K ∖ J$ . Write $x = (u, v, z)$ where $u \in {0, 1}^{J}$ , $v \in {0, 1}^{L}$ , $z \in {0, 1}^{K^{c}}$ . Then

\begin{array}{l} E_{J} E_{K} f (x) & = E_{J} (𝔼_{z \in K^{c}} f (u, v, z)) \\ = 𝔼_{\begin{array}{c} v \in L \\ z \in K^{c} \end{array}} f (u, v, z) \\ = E_{J} f (x) □ \end{array}

Lemma 7.4 (Regularity lemma for Boolean functions). For every $(𝜀, ρ, r, δ)$ , there exists some $T$ such that for every Boolean function $f^{(p)} : {0, 1}^{n} \to {0, 1}$ , there exists $J \subset [n]$ with $| J | \leq T$ such that if $u$ is chosen randomly ( $μ_{p}$ randomly) from ${0, 1}^{J}$ , then

ℙ [f_{u} is (𝜀, ρ, r) -quasirandom] \geq 1 - δ .

Example. Define $f : {0, 1}^{n} \to {0, 1}$ randomly with

ℙ (f (x_{1}, \dots, x_{n - 1}, 0) = 1) = \frac{1}{3}

and

ℙ (f (x_{1}, \dots, x_{n - 1}, 1) = 1) = \frac{2}{3} .

Then $f$ is not quasirandom, but the conclusion of the above lemma holds if we take $J = {n}$ .

Proof. For any $J \subset [n]$ , define the mean-square density of $J$ to be $∥ E_{J} f ∥_{2}^{2}$ .. Note that if $J \subset K$ , then

∥ E_{J} f ∥_{2}^{2} = ∥ E_{J} E_{K} f ∥_{2}^{2} \leq ∥ E_{K} f ∥_{2}^{2} .

Suppose now that $J$ does not satisfy the conclusion of the lemma. Let $u \in {0, 1}^{J}$ . Then for any $K \subset [n] ∖ J$ , $∥ E_{J} f_{u} ∥_{2}^{2} \geq {(𝔼 f_{u})}^{2}$ . If $f_{u}$ is not $(𝜀, ρ r)$ -quasirandom, then there exists $K_{u} \subset [n] ∖ J$ and some $v \in {0, 1}^{K_{u}}$ such that

| 𝔼 f_{u, v}^{(p)} - 𝔼 f_{u}^{(p)} | \geq 𝜀 .

Let $ζ = \min {p, 1 - p}$ . Also, $| K_{u} | \leq r$ . If we choose a random element of ${0, 1}^{K_{u}}$ , then it equals $v$ with probability $\geq ζ^{r}$ . Therefore,

𝔼_{w} | 𝔼 f_{u, w}^{(p)} - 𝔼 f_{u}^{(p)} |^{2} \geq 𝜀^{2} ζ^{r} .

But

𝔼_{w} 𝔼 f_{u, w} = 𝔼 f_{u}

so the LHS is $Var (𝔼 f_{u, w} - 𝔼 f_{u})$ so it equals

𝔼_{w} {(𝔼 f_{u, w})}^{2} - {(𝔼_{w} 𝔼 f_{u, w})}^{2} = ∥ E_{K_{u}} f_{u} ∥_{2}^{2} - {(𝔼 f_{u})}^{2} .

So $∥ 𝔼_{K_{u}} f_{u} ∥_{2}^{2} \geq {(𝔼 f_{u})}^{2} + ζ^{r} 𝜀^{2}$ in this case.

Let $K = ⋃_{u} K_{u}$ . Then $p n o r m ∥ E_{K} f_{u} ∥_{2}^{2} \geq {(𝔼 f_{u})}^{2} + ζ^{r} 𝜀^{2}$ in this case.

Averaging over $u$ , we deduce that

𝔼_{u} ∥ E_{K} f_{u} ∥_{2}^{2} \geq 𝔼_{u} {(𝔼 f_{u})}^{2} + δ ζ^{r} 𝜀^{2},

i.e. $∥ E_{J \cup K} f ∥_{2}^{2} \geq ∥ E_{J} f ∥_{2}^{2} + δ ζ^{r} 𝜀^{2}$ .

We can now do an iteration. Start with $J_{0} = \emptyset$ . At $i$ -th stage, if $J_{i}$ doesn’t work, then replace it by $J_{i + 1} = J_{i} \cup K_{i}$ , using the argument just given, with $| K_{i} | \leq r \cdot 2^{| J_{i} |}$ . At each stage, mean square density goes up by at least $δ ζ^{r} 𝜀^{2}$ , so the process must terminate in a bounded number of steps. □

Lemma 7.5. For every $ζ, α > 0$ and $p \in [ζ, \frac{1}{2} - ζ]$ , there exists $𝜀 > 0$ and $r$ such that if $f$ is an $(𝜀, ρ, r)$ -quasirandom monotone Boolean function from ${0, 1}^{n}$ to ${0, 1}$ with $𝔼 f^{(p)} = α$ , then $𝔼 f^{(1 ∕ 2)} > \frac{1}{2}$ .

Proof. Suppose that $𝔼 f^{(1 ∕ 2)} \leq \frac{1}{2}$ . By the Mean Value Theorem, there exists $s \in (p, \frac{1}{2})$ such that

\frac{d}{d s} 𝔼 f^{(s)} \leq \frac{\frac{1}{2} - α}{\frac{1}{2} - p} \leq \frac{1}{ζ} .

By the main corollary to the The Margulis–Russo formula (Corollary 6.10), it follows that $I (f^{(s)}) \leq \frac{1}{ζ}$ . By the $p$ -biased Friedgut junta theorem, we can find a Boolean $J$ -junta $h$ such that $ℙ [f^{(s)} \neq h^{(s)}] \leq 𝜀$ and $| J | \leq r (ζ, 𝜀)$ .

But $𝔼 f^{(s)} \leq 𝔼 f^{(1 ∕ 2)}$ by monotonicity, so $ℙ [f^{(s)} = 1] \leq \frac{1}{2}$ $⟹$ $ℙ [h^{(s)} = 1] \leq \frac{1}{2} + 𝜀 \leq \frac{3}{4}$ (as long as $𝜀 \leq \frac{1}{4}$ ) $⟹$ $ℙ [h^{(s)} = 0] \geq \frac{1}{4}$ . Therefore $ℙ [f^{(s)} = 1 | h^{(s)} = 0] \leq 4 𝜀$ . Therefore, there exists $u$ such that $h_{u} \equiv 0$ and $ℙ [f_{u}^{(s)} = 1] \leq 4 𝜀$ . By monotonicity, $ℙ [f_{u}^{(p)} = 1] = 𝔼 f_{u}^{(p)} \leq 4 𝜀$ , so choosing $𝜀 < \frac{α}{5}$ , we have that

| 𝔼 f_{u}^{(p)} - 𝔼 f^{(p)} | > 𝜀,

contradicting $(𝜀, ρ, r)$ -quasirandom, where $r = r (ζ, 𝜀)$ . □

Notation. Let $B$ be a family of subsets of $[n]$ . Write $\bar{B} = {A \subset [n] : \exists B \in B, B \subset A}$ , the upward closure of $B$ .

Theorem 7.6 (Dinur, Friedgut). For every $p \in (0, \frac{1}{2})$ and $𝜀 > 0$ , there exists $T$ such that for every intersecting family $A$ of subsets of $[n]$ there exists $J \subset [n]$ of size at most $T$ and an intersecting family $B$ of subsets of $J$ such that

μ_{p} (A ∖ \bar{B}) \leq 𝜀 .

Proof. It is convenient to reformulate the statement in terms of Boolean functions. So call $f : {0, 1}^{n} \to {0, 1}$ intersecting if for all $x, y$ , $f (x) = f (y) = 1 ⟹ \exists i, x_{i} = y_{i} = 1$ .

Let $f : {0, 1}^{n} \to {0, 1}$ be a Boolean function and apply the regularity lemma with parameters $(\frac{𝜀}{10}, p, r, \frac{𝜀}{2})$ , where $r$ is to be chosen. That gives us $J$ of size at most $T (𝜀, p, r)$ such that if $u$ is chosen $μ_{p}$ -randomly from ${0, 1}^{J}$ , then

ℙ [f_{u}^{(p)} is not (\frac{𝜀}{10}, p, r) -quasirandom] \leq \frac{𝜀}{2} .

Define $g : {0, 1}^{J} \to {0, 1}$ by setting $g (u) = 1$ if $f_{u}^{(p)}$ is $(\frac{𝜀}{10}, p, r)$ -quasirandom, and $𝔼 f_{u}^{(p)} \geq \frac{𝜀}{2}$ and $0$ otherwise. Then

ℙ [f^{(p)} (x) = 1 and g (x |_{J}) = 0] \leq \frac{𝜀}{2} + \frac{𝜀}{2} .

(This corresponds to the statement that $| A ∖ \bar{B} | \leq 𝜀$ . $A = {A : f (𝟙_{A}) = 1}$ , $B = {B : g (𝟙_{B}) = 1}$ .)

Note that $\bar{A}$ is an intersecting family and $| \bar{A} ∖ \bar{B} | \geq | A ∖ B |$ , so we may assume that $f$ is monotone, and hence that each $f_{u}$ is monotone. It remains to prove that $g$ is intersecting. Let $u, v \in {0, 1}^{J}$ such that $g (u) = g (v) = 1$ . Then $f_{u}$ and $f_{v}$ are $(\frac{𝜀}{10}, p, r)$ -quasirandom and $𝔼 f_{u}^{(p)}, 𝔼 f_{v}^{(p)} \geq \frac{𝜀}{2}$ . By Lemma 7.5 for appropriate $r$ , it follows that $𝔼 f_{u}^{(1 ∕ 2)}, 𝔼 f_{v}^{(1 ∕ 2)} > \frac{1}{2}$ .

By averaging, we can find $y, z \in {0, 1}^{[n] ∖ J}$ such that $y_{i} = 1 ⟺ z_{i} = 0$ and $f_{u} (y) = f_{v} (z)) = 1$ . Since $f$ is intersecting, there must exist $j$ such that $u_{i} = v_{j} = 1$ , so $g$ is intersecting. □

Lemma 7.7 (The LYM inequality). Let $1 \leq r < s \leq n$ and let $A \subset {[n]}^{(r)}$ . Write $\partial_{s} A$ for ${B \in {[n]}^{(s)} : \exists A \in A, A \subset B}$ . Then $\frac{| \partial_{s} A |}{(\binom{n}{s})} \geq \frac{| A |}{(\binom{n}{r})}$ .

Proof. Let $α = \frac{| A |}{(\binom{n}{r})}$ and $β = \frac{| \partial_{s} A |}{(\binom{n}{s})}$ . Define a bipartite graph by joining $A \in {[n]}^{(r)}$ to $B \in {[n]}^{(s)}$ if and only if $A \subset B$ . Now pick a random edge. The probability that it joins $A$ to $\partial_{s} A$ is exactly $α$ . It is also at most $β$ . So $α = ℙ [joins A to \partial_{s} A] \leq β$ . □

Definition 7.8 (Modified upper shadow). Let $n \in ℕ$ , let $J \subset [n]$ , let $r < s$ and let $A \subset {[n]}^{(r)}$ . Define the modified upper shadow $\partial_{J}^{s} A$ to be ${B \in {[n]}^{(s)} : A \subset B, (B ∖ A) \cap J = \emptyset}$ .

Lemma 7.9. Let $n, r, s, J, A$ be as above. Let $α = \frac{| A |}{(\binom{n}{r})}$ and $β = \frac{| \partial_{J}^{s} A |}{(\binom{n}{s})}$ . Assume that $r \leq \frac{n}{2}$ . Then

β \geq α {(1 - \frac{2 | J |}{n})}^{s - r} .

Proof. Pick a random pair $(A, B)$ with $A \in {[n]}^{(r)}$ , $B \in {[n]}^{(s)}$ , $A \subset B$ . Then

ℙ [(A, B) \in A \times \partial_{J}^{s} A] \geq α {(1 - \frac{2 | J |}{n})}^{s - r} .

Also, the probability is at most $β$ . The result follows. □

Corollary 7.10. For every $p \in (0, \frac{1}{2})$ , $𝜀 > 0$ , there exists $m$ such that for every $n \in ℕ$ and every intersecting family $A \subset {[n]}^{(r)}$ where $r = p n$ , there exists $J \subset [n]$ , $| J | \leq m$ , and an intersecting family $B$ of subsets of $J$ such that $| A ∖ \bar{B} | \leq 𝜀 (\binom{n}{r})$ .

Proof. Suppose not. Let $C = h a c a l ∖ \bar{B}$ . Then $C$ has density $\geq 𝜀$ . Apply the Dinur, Friedgut Theorem to $A$ to obtain $J$ and an intersecting family $B$ of subsets of $J$ with $μ_{p} (\bar{A} ∖ \bar{B}) \leq \frac{𝜀}{4}$ .

Note that since $C \cap \bar{B} = \emptyset$ , $\partial_{J}^{s} C \cap \bar{B} = \emptyset$ for every $s > r$ . Also, if $s \leq r + n^{2 ∕ 3}$ , then

\frac{| \partial_{J}^{s} C |}{(\binom{n}{s})} \geq 𝜀 {(1 - \frac{2 | J |}{n})}^{n^{2 ∕ 3}} \geq 𝜀 (1 - \frac{2 | J |}{n^{1 ∕ 3}}) \geq \frac{3 𝜀}{4},

for $n$ sufficiently large.

But (by law of total probability),

μ_{p} (\bar{C}) = \sum_{s \geq r} μ_{p} ({[n]}^{(s)}) \cdot [\frac{| \partial_{J}^{s} C |}{(\binom{n}{s})}] \geq \frac{3 𝜀}{4} \sum_{r \leq s \leq r + n^{2 ∕ 3}} μ_{p} ({[n]}^{(s)}) \geq \frac{3 𝜀}{4} (1 - o (1)) \geq \frac{5 𝜀}{16} .

But $\bar{C} \subset \bar{A} ∖ \bar{B}$ , so this is a contradiction. □