1 Set Systems

Definition 1 (Set system). Let X be a set. A set system on X (or a family of subsets of X) is a family AP(X).

Notation. We will use the notation

X(r)={AX:|A|=r}.

We call an element of X(r) an r-set. We will usually be using X=[n]={1,,n}, so |X(r)|=nr.

Example.

[4](2)={12,13,14,23,24,34}.

Definition 2 (Discrete cube). Make P(X) into a graph by joining A and B if |AΔB|=1, i.e. if A=B{i} for some i, or vice versa. We call this ths discrete cube Qn (if X=[n]).

Example. Q3:

PIC

In general:

PIC

Alternatively, can view Qn as an n-dimensional unit cube {0,1}n, by identifying e.g. {1,3} with 10100000 (i.e. identify A with 𝟙A, the characteristic function of A).

Example.

PIC

Definition 3 (Chain). Say AP(X) is a chain if A,BA, either AB or BA.

PIC

Example. For example,

A={23,12357,123567}

is a chain.

Definition 4 (Antichain). Say A is an antichain if A,BA, AB, we have AB.

PIC

How large can a chain be? Can achieve |A|=n+1, for example using

A={,1,12,123,,[n]}.

Cannot beat this: for each 0rn, A contains 1 r-set.

How large can an antichain be? Can achieve |A|=n, for example A={1,2,,n}. More generally, can take A=X(r), for any r – best out of these is X(n2).

Can we beat this?

Theorem 5 (Sperner’s Lemma). Assuming that:

Then |A|nn2.

Idea: Motivated by “a chain meets each layer in 1 point, because a layer is an antichain”, we will try to decompose the cube into chains.

PIC

Proof. We’ll decompose P(X) into n12n chains – then done. To achieve this, it is sufficient to find:

  • (i) For each r<n2, a matching from X(r) to X(r+1) (recall that a matching here means a set of disjoint edges, one for each point in X(r)).
  • (ii) For each rn2, a matching from X(r) to (r1).

We then put these together to form our chains, each passing through X(n2).

PIC

By taking complements, it is enough to prove (i).

Let G be the (bipartite) subgraph of Qn spanned by X(r)X(r+1): we seek a matching from X(r) to X(r+1). For any SX(r), the number of SΓ(S) edges in G is |S|(nr) (counting from below) and |Γ(S)|(r+1) (counting from above).

PIC

Hence, as r<n2,

|Γ(S)||S|(nr)r+1|S|.

Thus by Hall’s Marriage theorem, there exists a matching.

Equality in Sperner’s Lemma? Proof above tells us nothing.

Aim: If A is an antichain then

r=0n|AX(r)|nr1.

PIC

“The percentages of each layer occupied add up to 1.”

Trivially implies Sperner’s Lemma (think about it).

Definition 6 (Shadow). For AX(r) (1rn), the shadow of A is A=AX(r1) defined by, A={BX(r1):AA,BA}.

Example. If A={123,124,134,137}X(3), then A={12,13,23,14,24,34,17,37}X(2).

Proposition 7 (Local LYM). Assuming that:

  • AX(r)

  • 1rn

Then
|A|nr1|A|nr.

“The fraction of the level occupied by A is the fraction for A”.

Remark. LYM = Lubell, Meshalkin, Yamamoto.

Proof. The number of AA edges in Qn is |A|r (counting from above) and is |A|(nr+1) (counting from above). So

|A||A|rnr+1.

But nr1nr=rnr+1, so done.

Equality in Local LYM? Must have that AA, iA, jA have A{i}{j}A. So A= or X(r).

Theorem 8 (LYM Inequality). Assuming that:

Then
r=0n|AX(r)|nr1.

Notation. We will now start writing Ar for AX(r).

Proof 1. “Bubble down with Local LYM”.

Have |An|nn1. Now, An and An1 disjoint (as A is an antichain), so

|An|nn1+|An1|nn1=|AnAn1nn11,

whence

|An|nn+|An1|nn11

by Local LYM.

Now, note (AnAn1) is disjoint from An2 (since A is an antichain), so

|(AnAn1)|nn2+|An2|nn21,

whence

|AnAn1|nn1+|An2|nn2.

(Local LYM) so

|An|nn+|An1|nn1+|An2|nn21.

Continue inductively.

Equality in LYM Inequality? Must have had equality in each use of Local LYM. Hence equality in LYM Inequality needs: max r with Ar has Ar=X(r).

So: equality in Local LYM A=X(r) for some r.

Hence: equality in Sperner’s Lemma if and only if A=X(r)n2 (if n even), and A=X(n2) or A=X(n2).

Proof 2. Choose, uniformly at random, a maximal chain C (i.e. C0C1C2Cn, with |Cr|=r for all r).

PIC

For any r-set A, (AC)=1nr (all r-sets are equally likely). So (C meets Ar)=|Ar|nr (as events are disjoint) and hence

1(C meets A)=r=0n|Ar|nr.

Equivalently: (if you want to lose the intuition about how this works) then: #maximal chains=n!, and #through any fixed r-set=r!(nr)!, hence

r|Ar|r!(nr)!n!.

1.1 Shadows

For AX(r), know |A|rnr+1. Equality is rare – only for A= or X(r). What happens in between?

PIC

In other words, given |A|, how should we choose AX(r) to minimise |A|?

Believable that if |A|=kr then we sholud take A=[k](r).

What if kr<|A|<k+1r?

Believable that should take [k](r) plus some r-sets in [k+1](r). For example, for AX(r) with |A|=83+42, take A=[8](3){9B:B[4](2)}.

1.2 Two total orders on X(r)

Let A and B be distinct r-sets: say A=a1,,ar, B=b1,,br where a1<<ar and b1<<br.

Say that A<B in the lexicographic (or lex) ordering if for some j we have ai=bi for i<j and aj<bj.

Slogan: “Use small elements” (“dictionary order”).

Example. lexicographic on [4](2): 12,13,14,23,24,34.

lexicographic on [6](3): 123,124,125,126,134,135,136,145,146,156,234,235,236,245,256,345, 346,356,456.

Say that A<B in the colexicographic (or colex) ordering if for some j we have ai=bi for all i>j and aj<bj.

Slogan: “Avoid large elements” (note that this is not quite the same as “use small elements”, which is what we had before).

Example. colexicographic on [4](2): 12,13,23,14,24,34.

colexicographic on [6](3): 123,124,134,234,125,135,235,145,245,345,126,136,236,146,246,346, 156,256,356,456.

Note that, in colexicographic, [n1](r) is an initial segment (first t elements, for some t) of [n](r).

This is false for lex.

So we could view colexicographic as an enumeration of (r).

Remark. A<B in colexicographic if and only if Ac<Bc in “lexicographic with ground set order reversed”.

Aim: colexicographic initial segments are best for , i.e. if AX(r) and CX(r) is the initial segment of colexicographic with |C|=|A|, then |C||A|.

In particular, |A|=kr|A|kr1.

1.3 Compressions

Idea: try to transform AX(r) into some AX(r) such that:

Ideally, we’d like a family of such ‘compressions’: AAAAB such that either B=C or B is so similar to C that we can directly check that |B||C|.

colexicographic prefers 1 to 2” inspires:

Definition 9 (ij-compression). Fix 1i<jn. The ij-compression Cij is defined as follows:

For AX(r), set

Cij(A)={Aijif jAiAAotherwise,

and for AX(r), set

Cij(A)={Cij(A):AA}{AA:Cij(A)A}.

Note that the second part of the union in Cij(A) is because we need to make sure that we “replace j by i where possible”.

PIC

Example. If A={123,134,234,235,146,567} then C12(A)={123,134,234,135,146,567}.

So Cij(A)X(r), and |Cij(A)|=|A|.

Say A is ij-compressed if Cij(A)=A.

Lemma 10. Assuming that:

  • AX(r)

  • 1i<jn

Then |Cij(A)||A|.

Proof. Write A for Cij(A). Let BAA. We’ll show that iB, jB and BjiAA. [Then done].

PIC

Have BxA for some x, with BxA (as BA). So iBx, jBx, and (Bx)jiA.

Cannot have x=i, else (Bx)ji=Bj, giving BA, contradiction.

Hence we have iB, jB.

Also, BjiA, since (Bx)jiA.

Suppose BjiA: so (Bji)yA for some y. Cannot have y=i, else BjA – so BjA (as jBj), contradicting BA. Hence j(Bji)y and i(Bji)y.

Whence both (Bji)y and By belong to A (by definition of A), contradicting BA.

Remark. Actually showed that Cij(A)CijA.

Definition 11 (Left-compressed). Say AX(r) is left-compressed if Cij(A)=A for all i<j.

Corollary 12. Let AX(r). Then there exists a left-compressed BX(r) with |B|=|A| and |B||A|.

Proof. Define a sequence A0,A1, as follows. Set A0=A. Having defined A0,,Ak, if Ak left-compressed then stop the sequence with Ak.

If not, choose i<j such that Ak is not ij-compression, and set Ak+1=Cij(Ak).

This must terminate, because for example AAkiA is strictly decreasing in k.

Final term B=Ak satisfies |B|=|A|, and |B||A| (by Lemma 10)

Remark.

  • (1) Or: among all BX(r) with |B|=|A| and |B||A|, choose one with minimal ABiAi.
  • (2) Can choose order of the Cij so that no Cij applied twice.
  • (3) Any initial segment of colexicographic is left-compressed. Converse false, for example {123,124,125,126} (initial segment of lexicographic).

These compressions only encode the idea “colexicographic prefers i to j (i<j)”, but this is also true for lexicographic.

So we try to come up with more compressions that encode more of what colexicographic likes.

colexicographic prefers 23 to 14” inspires:

Definition 13 (UV-compression). Let U,VX with |U|=|V|, UV= and maxV>maxU. We define the UV-compression as follows: for AX,

CUV(A)={AUVif VAUA=Aotherwise,

and for AX(r), set

CUV(A)={CUV(A):AA}{AA:CUVA}.

Example. If

A={123,124,147,237,238,149},

then

C23,14(A)={123,124,147,237,238,239}.

So CUV(A)X(r), and |CUV(A)|=|A|.

Say A is UV-compressed if CUV(A)=A.

Sadly, we can have |CUV(A)|>|A|:

Example. A={147,157} has |A|=5, but C23,14(A)={237,147} has |C23,14(A)|=6.

Despite this, we at least we do have the following:

Lemma 14. Assuming that:

  • AX(r) is UV-compression for all U,V with |U|=|V|, UV=, maxV>maxU

Then A is an initial segment of colexicographic.

Proof. Suppose not. So there exists A,BX(r) with B<A, in colexicographic but AA, BA.

PIC

Put V=AB, U=BA.

Then |V|=|U|, and U,V disjoint, and maxV>maxU (since max(AΔB)A, by definition of colexicographic).

So CUV(A)=B, contradicting A is UV-compression.

Lemma 15. Assuming that:

  • U,VX

  • |U|=|V|

  • UV=

  • maxU<maxV

  • AX(r)

  • uU vV such that A is (Uu,Vv)-compressed(∗)

Then |CUV(A)||A|.

Proof. Let A=CUV(A). For BAA, we’ll show UB, VB=, and BVUAA. (Then done).

PIC

Have BxA for some x, and BxA, so UBx, V(Bx)=, and (Bx)VUA (by definition of CUV).

If xU: there exists yU such that A is (Ux,Vy)-compressed, so from (Bx)VUA we have ByA – contradicting BA, contradiction.

Thus xU, and so UB, VB=. Certainly BVUA (because (Bx)VUA), so just need to show that BVUA.

Suppose BVUA: so (BVU)wA, for some w. Also have (BVU)wA (for example, as V contained in it).

If wU: know A is (Uw,Vz)-compressed for some zV, so BzA – contradicting B.

If wU: have V(BVU)w, U((BVU)w)=, so by definition of CUV we must have that both (BVU)w and BwA – contradicting BA, a contradiction.

Theorem 16 (Kruskal-Katona). Assuming that:

  • AX(r), 1rn

  • C is the initial segment of colexicographic on X(r) with |C|=|A|

Then |C||A|. In particular: if |A|=kr, then |A|kr1.

Proof. Let Γ={(U,V):|U|=|V|>0,UV=,maxU<maxV}{(,)}. Define a sequence A0,A1, of set systems in X(r) as follows:

  • Set A0=A.

  • Having chosen A0,,Ak, if Ak is UV-compressed for all (U,V)Γ then stop. Otherwise, choose U,VΓ with |U|=|V|>0 minimal such that Ak is not UV-compressed.

    Note that uU vV such that (Uu,Vv)Γ (namely, use v=minV). So () is satisfied.

    So Lemma 15 tells us that |CUV(Ak)||Ak|.

    Set Ak+1=CUV(Ak), and continue.

Must terminate, as AAkiA2i is strictly decreasing. The final term B=Ak satisfies |B|=|A|, |B||A| and is UV-compressed for all (U,V)Γ.

So B=C by Lemma 14.

Remark.

  • (1) Equivalently: if
    |A|=krr+kr1r1++kss,

    where kr>kr1>>ks, and s1, then

    |A|krr1+kr1r2++kss1.
  • (2) Equality in Kruskal-Katona? Can check that if |A|=kr and ||=kr1 (i.e. equality in a step of the proof of Kruskal-Katona), then A=Y(r), for some YX with |Y|=k.
  • (3) However, not true in general that |A|=|C| implies that A is isomorphic to C. (uset systems A,B are isomorphic if there exists permutation of the ground set X sending A to B).

For AX(r), 0rn, the upper shadow of A is

+A={Ax:AA,xA}X(r+1).

Corollary 17. Let AX(r), where 0rn, and let C be the initial segment of lexicographic on X(r) with |C|=|A|. Then |+A||+C|.

Proof. From Kruskal-Katona, since A<B in colexicographic if and only if Ac<Bc in lexicographic with ground-set order reversed.

Note that the shadow of an initial segment of colexicographic on X(r) is an initial segment of colexicographic on X(r1) – as if C={AX(r):Aa1ar in colexicographic} then C={BX(r1):Ba2ar in colexicographic}.

PIC

This fact gives:

Corollary 18. Let AX(r), and C is the initial segment of colexicographic on X(r) with |C|=|A|. Then |tC||tA| for all 1tr.

Proof. If |tC|tA|, then |t+1C||t+1A|, because tC is an initial segment of colexicographic. Done by induction.

Note. If |A|=kr, then |tA|krt.

Remark. Proof of Kruskal-Katona used Lemma 14 and Lemma 15, but not Lemma 10 or Corollary 12.

1.4 Intersecting Families

Say AP(X) intersecting if AB for all A,BA.

How large can an intersecting family be? Can have |A|=2k1, by taking A={A:1A}.

Proposition 19. Assuming that:

Then |A|2k1.

Proof. For any AX, at most one of A,Ac can belong to A.

Note. Many other extremal examples. For example, for n odd take {A:|A|>k2}.

What if AX(r)?

If r>n2, take A=X(r).

If r=n2: just choose one of A,Ac for all AX(r): gives |A|=12nr.

So interesting case is r<n2.

Could try A={AX(r):1A}. Has size n1r1=rnnr (while this identity can be verified by writing out factorials, a more useful way of observing it is by noting that (random r-set contains 1)=rn).

Could also try B={AX(r):|A{1,2,3}|2}.

Example. n=8, r=3. Then |A|=72=21 and

|B|=1|B[3]|=3+3251|B[3]|=2=16<21.

Theorem 20 (Erdos-Ko-Rado Theorem). Assuming that:

Then |A|n1r1.

Proof 1 (“Bubble down with Kruskal-Katona”). Note that ABABc.

PIC

Let A¯={Ac:AA}X(nr). Have n2rA¯ and A are disjoint families of r-sets.

Suppose |A|>n1r1. Then |A¯|=|A|>n1r1=n1nr. Whence by Kruskal-Katona we have |n2rA¯|n1r.

So |A|+|n2rA¯|>n1r1+n1r=nr, a contradiction.

Remark. Calculation at the end had to give the right answer, as the calculations would all be exact if A={AX(r):1A}.

Proof 2. Pick a cyclic ordering of [n] i.e. a bijection c:[n]n.

PIC

How many sets in A are intervals (r consecutive elements) in this ordering?

Answer: r. Because say C1,,CrA. Then for each 2i1, at most one of the two intervals CiCi+1Ci+r1 and CirCir+1Ci1 can belong to A (subscrpits are modulo n).

For each r-set A, in how many of the n! cyclic orderings is it an interval?

Answer: nr!(nr)! (n= where, r!= order inside A, (nr)!= order outside A).

Hence  A|nr!(nr)!n!r, i.e. |A|n!rnr!(nr)!=n1r1.

Remark.

  • (1) Numbers had to work out, given that we get equality A={AX(r):1A}.
  • (2) Equivalently, we are double-counting the edges in the bipartite graph, with vertex classes A and all cycling orderings, with A joined to c if A is an interval in c.
  • (3) This method is caled averaging or Katona’s method.
  • (4) Equality in Erdos-Ko-Rado Theorem? Our example is actually unique – if AX(r) is intersecting and |A|=n1r1, then A={AX(r):iA} for some 1in.

    Can get this from Proof 1 (and equality in Kruskal-Katona) or from Proof 2 (with a bit of care).