Forcing and the Continuum Hypothesis
Daniel Naylor

Contents

1The Continuum Hypothesis
2Transitive Models
2.1Absoluteness for transitive models
2.2Non-absoluteness
2.3The constructible hierarchy
3Forcing
3.1Cohen Forcing
Index

1 The Continuum Hypothesis

1398: Gödel showed Con(ZFC)Con(ZFC+CH).

1962: Cohen showed Con(ZFC)Con(ZFC+¬CH).

Theorem (Hartogs’s Theorem). For every set X, there is a (least) cardinal α such that there is no injection from α to X.

We denote this by Hartogs’s aleph of X, (X).

Theorem. For every set X, there is no injection from P(X) into X. We denote this by 2|X|.

Using Axiom of Choice, well-order P(X) and get ordinal 2|X|.

We have

(X)2|X|.

Notation. Define:

0:=α+1:=(α)λ:=α<λα0=α+1=2αλ=α<λα

Clearly, αα.

Continuum Hypothesis (CH): 1=1.

Generalised Continuum Hypothesis (GCH): αα=α.

Why “continuum”?

Lemma. CH if and only if X, X is uncountable X ( means “there is a bijection”).

Proof.

Reminder:

Gödel showed Con(ZFC)Con(ZFC+CH).

Cohen showed Con(ZFC)Con(ZFC+¬CH).

Relative consistency proofs.

By Completeness Theorem, this means:

If there is MZFC, then I can construct NZFC+(¬)CH.

Analogy from algebra:

Lfields={0,1,+,,,1}.

Axioms of fields: Fields.

Let φ2:=x(xx=1+1). Note ¬φ2.

Idea: Start with and extend to get FFields+φ2.

Construct by algebraic closure (not in the usual sense – here we just mean adding in 2 and then adding everything else that this requires us to add).

Obtain (2)Fields+φ2.

This is easy because everything that matters (Fields and φ2) is determined by equations; all formulas we need to check are atomic.

Definition 1.1 (absolute). If MN and M,N are L-structures and φ an L-formula, then we say φ is absolute between M and N if for all x1,,xmM,

Mφ(x1,,xn)Nφ(x1,,xn).

If the direction holds, then we say “upwards absolute”, and if the direction holds, then we say “downwards absolute”.

Theorem (Substructure Lemma). All atomic formulas are absolute between substructures.

WHat if we have models of ZFC? Have

L={}.

No function symbols nor constant symbols. So: almost nothing is atomic.

MN if and only if M is an L-substructure of N.

And: the formulas that we care about are definitely not atomic, but instead very complex.

Try to imagine a proof of:

If MZFC then there is NM such that NZFC+CH.

Without loss of generality M¬CH (else we are done). For simplicity, let’s consider the case 1=2.

PIC

What can we do to “get rid of 1”?

Maybe a surjection f:1. Maybe we can form M[f]M to get a smallest model M[f]ZFC.

Clearly, in M[f], 1M is not a cardinal anymore.

Does that show CH?

All sorts of things can happen

Assuming it is actually possible to form this smallest model M[f], there are lots of ways that this might not end up being enough to deduce CH. For example:

A fundamental problem of non-absoluteness

Consider φ(x):=z(zx), which means “x is empty”.

Consider MZFC. Therefore there are e,e such that Mφ(e) and Mz(zez=e).

Consider N:=M{e}.

N is an L-substructure of M. But Nφ(e), even though M¬φ(e).

So φ is not absolute between substructures.

Instead of substructures, we will restrict out attention to transitive substructures: in particular, to M transitive (x,xMxM or equivalently xMyxyM) such that MZFC.

Next time: theorems about absoluteness for transitive substructures.

Definition (Absolute formula). We say φ is absolute for M if for all x1,,xnM, we have

Mφ(x1,,xn)φ(x1,,xn) is true.

Clearly, if φ is absolute for M and absolute for N, then it’s absolute between M and N.

Cohen’s proof becomes:

If M is a countable transitive set such that MZFC, then there is a a countable transitive set NM such that NZFC+¬CH.

2 Transitive Models

Observation: If M is transitive and Me is empty”, then e=. This is because if we, then we and eM gives us that wM by transitivity, so Mwe, so Me is not empty”.

Lemma. Assuming that:

  • M is transitive

Then
MExtensionality+Foundation.

Proof. Extensionality: xy(w(wxwy)x=y). Take xy, x,yM. Without loss of generality take zxy. Then zx and xM so zM. So Mzxzy. So M¬w(wxwy).

Foundation: x(xm(mxw¬(wmwx))). Take xM. Mx so x is not empty. So find mx which is -minimal. Then since xM as well, we have mM. Therefore x has an -minimal element in M.

2.1 Absoluteness for transitive models

Definition (Bounded quantifier). We call a quantifier of the form xy,φ or xy,φ a bounded quantifier.

(Defined by xy,φ:=x(xyφ) and xyφ:=x(xyφ)).

Definition (Closed under bounded quantification). A class of formulas Γ is closed under bounded quantification if whenever φ is in Γ, then so are xy,φ and xy,φ.

Definition (Delta0). Δ0 is the smallest class of formulas containing the atomic formulas that is closed under propositional connectives and bounded quantifiers.

Let T be any theory. Then Δ0T is the class of formulas equivalent to a Δ0 formula in T.

Theorem. Δ0 formulas are absolute for transitive models.

Proof. By induction:

Corollary. Assuming that:

  • T is any theory

  • MT is transitive

Then Δ0T-formulas are absolute for M.

Definition (Sigma1, Pi1). A formula is called Σ1 if it is of the form x1,xn,φ where φ is Δ0.

It is called Π1 if it is of the form x1,,xn,φ where φ is Δ0.

(same for Σ1T, Π1T).

Proof. Just definition of the semantics of ,.

Example. What is Δ0?

Definition (Absolute function). Let M a transitive set, and F:MnM (note that this means that M is closed under F). We say F is absolute for M if there is a formula Φ which is absolute for M such that

F(x1,,xn)=zΦ(x1,,xn,z).

Observation: So, if M is closed under pairing (x,yM,{x,y}M), then the pairing operation x,y{x,y} is absolute, and therefore MPairing.

Similarly for union.

Lemma. Assuming that:

Then ψ(x1,,xm):=φ(G1(x1,,xm),,Gn(x1,,xm))H(x1,,xm):=F(G1(x1,,xm),,Gn(x1,,xm))

are absolute for M.

Proof. Check the definitions!

Example. More examples:

Ordinals

x is an ordinal” means x is transitive and (x,) is a well-order.

We know: being well-founded is not expressible in first-order logic (see Example Sheet 1).

Because all transitive models satisfy Foundation, we have that if M is transitive, then

Mx is transitive(x,) is linearly ordered.

characterises ordinals. But this is clearly in Δ0.

So: being an ordinal is absolute for transitive models.

Thus MOrd={xM:Mx is an ordinal}. This is transitive, thus there is αOrd such that α=MOrd.

Also absolute:

Cardinals

x is a cardinal” if and only if

x is an ordinalf,yx,f:yxf is not a surjection

Note that f is not bounded (while yx is bounded).

Observe: this is Π1 and therefore downwards absolute.

Remark.

2.2 Non-absoluteness

Assume that MZFC is transitive and countable. Then

MOrd=α<ω1.

However, MZFC implies Mthere are uncountable cardinals.

Let β<α be such that Mβ is the least uncountable cardinal.

But β is a countable ordinal, so not a cardinal.

Consequence: All cardinals in M except 0 are going to be fake.

So “x is a cardinal” can’t be absolute.

Note. This also shows that “x=P(y)” cannot be absolute:

Take y such that My=P(ω). Then yP(ω), but is countable since yM.

Thus yP(ω). Therefore “x=P(y)” is not absolute.

Recall the general proof strategy mentioned before:

If M is a countable transitive set such that MZFC, then there is a a countable transitive set NM such that NZFC+¬CH.

Question: Is this really solving the original problem? i.e. Con(ZFC)Con(ZFC+¬CH).

It’s not obvious that Con(ZFC) implies that there is a countable transitive model (ctm) of ZFC.

Answer: That’s not only not obvious, but fake.

Let’s prove that Con(ZFC) there is a countable transitive model of ZFC.

Why? Note that Con(ZFC), or Con(T) for any T is Δ0. So, it’s absolute for transitive models.

So if M is a countable transitive model of ZFC, then Con(ZFC) is true, so by absoluteness, MCon(ZFC). So MZFC+Con(ZFC). This contradicts Gödel’s Incompleteness Theorem.

We can get a proof of

Con(ZFC)Con(ZFC+¬CH)

via a trick ( Example Sheet 1).

Lemma (Cohen Lemma). Assuming that:

  • TZFC

Then there is finite TZFC such that if M is a countable transitive model of T, then there is NM such that N is a countable transitive model of T+¬CH.

This reduces the problem to:

Find countable transitive models of T for sufficiently large finite TZFC.

Definition (Hierarchy). We call an assignment αZα a hierarchy if

  • (i) Zα is a transitive set
  • (ii) OrdZα=α
  • (iii) α<βZαZβ
  • (iv) λ limitZλ=α<λZα

If {Zα:αOrd} is a hierarchy, we can define Z:=αOrdZα. This is a proper class as OrdZ. We also define ρZ(x):=min{α:xZα}, a notion of Z-rank.

Paradigmatic example: von Neumann hierarchy Vα, and V is the entire universe.

Theorem 2.1 (Levy Reflection Theorem). Assuming that:

  • Z is a hierarchy

  • φ is a formula

Then there are unboundedly many 𝜃 such that φ is absolute between Z𝜃 and Z.

Proposition 2.2 (Tarski-Vaught Test). Assuming that:

  • M is a substructure of N

Then M is an elementary substructure if and only if for any formula ϕ(v,w¯) and a¯M, if there is bN such that Nϕ(b,a¯), then there is cM such that Nϕ(c,a¯).

TVTΦ:

Let MN and Φ be a collection of formulas closed under subformulas. Then the following are equivalent:

Warm-up: let (M,)ZFC. Find countable NM such that (N,)(M,).

Suppose p¯=(p0,,pn)M and My,ψ(y,p¯). Let w(ψ,p¯) be a witness for this:

Mψ(w(ψ,p¯),p¯)

(if necessary, use Axiom of Choice).

(if M¬y,ψ(y,p¯), then let w(ψ,p¯)=).

Set:

N0:=Ni+1:={w(ψ,p¯):ψ formula and p¯N1<ω}N:=iωNi

Note:

Remark. In general, even if M is transitive, N is not.

For example, if ω1M, then

x,(x is the least countable ordinal)

is true in M.

w(ψ,)=ω1.

So ω1N. But ω1N, since N is countable.

Relevant later!

Also see Example Sheet 1.

Proof of Levy Reflection Theorem. Fix φ and let Φ be its collection of subformulas. This is a finite set!

Need to show: α,𝜃>α such that Z𝜃φZφ.

For each ψΦ and p¯=(p0,,pn), write

o(ψ,p¯):={least α such that yZα with Zψ(y,p¯)if it exists0otherwiseo(p¯):=maxψΦo(ψ,p¯)𝜃0:=α+1𝜃i+1:=sup{o(p¯):p¯Z𝜃i<ω}𝜃:=supiω𝜃i

Then Tarski-Vaught Test implies that Z𝜃 and Z agree on φ.

Corollary. If TZFC is finite, then there is M transitive such that MT.

Proof. Let φ:=ψTψ. Since ZFCφ, we have that φ is true. By Levy Reflection Theorem, we can find 𝜃 such that V𝜃φ. Note V𝜃 is transitive.

Remark about the proof of Levy Reflection Theorem:

Can you do the same if Φ is infinite?

Of course not: otherwise we colud prove that there exists 𝜃 such that V𝜃ZFC, and hence get Con(ZFC).

The problem is the case distinction in the definition of o(ψ,p¯): it requires to check whether y,ψ is true.

Next goal: Obtain some MV𝜃 countable such that Mφ and M is transitive.

TODO

Theorem (Mostowski’s Collapsing Theorem). Let r be a relation on a set a that is well-founded and extensional. Then there exists a transitive set b, adn a bijection f:ab such that (x,ya)(x r yf(x)f(y)). Moreover, b and f are unique.

Proof. See Logic and Set Theory.

Corollary. For every TZFC finite, there is a countable transitive model of T.

Proof. Without loss of generality that T contains the axiom of extensionality. Form MT transitive by Levy Reflection Theorem.

Use warm-up to obtain NM countable. This is extensional and well-founded, so by Mostowski find W transitive such that

(W,)(N,).

Then WT adn |W|=||, so W is countable.

The next few lectures will be spent proving Con(ZFC+CH) using Gödel’s constructible universe.

Absoluteness is preserved under transfinite recursion.

Let F,G,H be three operations.

R(0,x¯):=F(x¯)R(α+1,x¯):=G(α,R(α,x¯),x¯)R(λ,x¯):=H(λ,{R(α,x¯):α<λ},x¯)()

Proof. Attempts: set functions satisfying the ().

R(α,x¯):=y if and only if there exists attempt r with (α,x¯)dom(r) and r(α,x¯)=y.

Note that for F,G,H fixed, there is a finite fragment TF,G,HZFC that proves the recursion theorem instance for F,G,H.

Theorem. If TTF,G,H and F,G,H are absolute for transitive models of T, then so is R defined by ().

Want TF,G,HL1(F,G,H), TF,G,HL2(F,G,H) and TF,G,H proves existence of R.

Convention: We say “T is sufficiently strong” if TZFC is finite and T proves hte existence of all relevant operations such that they are absolute for transitive models of T.

Proof. Observe that by assumption, being an “attempt” is absolute for transitive models of T.

Let MT be transitive.

Note. This uses the fact that “Δ1” concepts are absolute.

Definition (Delta1T property). A property is called Δ1T if it’s both Σ1T and Π1T.

Observe: Δ0T concepts are absolute (upwards from Σ1 and downwards from Π1).

Typical Applications

Bounding a quantifier by operation.

Let F be an operation and T strong enough to prove F is an operation and absolute.

Tx,z,F(x)=zTx,z,z,F(x)=zF(x)=zz=z

Then the quantifiers yF(x) and yF(x) preserve absoluteness.

yF(x)ψz(z=F(x)yzψ)absoluteupwards absolutez(z=F(x)yzψ)absolutedownwards absolute
Applications

2.3 The constructible hierarchy

Fix a set X. Define for each φFml and each pX<ω (parameter)

D(φ,p,X):={wX:Xφ(p,w)}

the subset of X defined by φ with parameter p.

For a sufficiently strong TZFC finite, we have that T proves that D is an absolute operation (see Example Sheet 1).

D(X):={D(φ,p,X):φFml,pX<ω}.

This is absolute for a sufficiently strong theory (use Replacement to get D(X)).

This D(X) is sometimes (misleadingly) called the “definable power set of X” (it is misleading because it is more like a “definable (by X) power set of X”).

(αD(X)φFml,pX<ω,a=D(φ,p,X))

Obvious: D(X)P(X). Also: If X is transitive, then so is D(X).

L0:=Lα+1:=D(Lα)Lλ=α<λLα The constructible hierarchy.

We usually write L:=αOrdLα.

Claim: L is a hierarchy (in the sense of Lecture). See Example Sheet 1.

By closure of absoluteness under transfinite recursion, the L-hierarchy is absolute for transitive models of TZFC where T is strong enough to prove that it exists.

i.e. if MT transitive and αOrdM and MX=Lα, then X=Lα. So

αOrdMLαM.

The main theorem of next lecture will be:

If LZF and MZF transitive, then

αOrdMLαZF.

(Minimal ZF-model).

Some first idea of what the L-hierarchy is like

Clearly, by induction, LαVα, and clearly for nω, Ln=Vn. So Lω=Vω.

Lα+1:=φFmlpLα<ω{D(φ,p,Lα)}.

If αω, then

|Lα+1|0|Lα<ω|=0|Lα|.

Thus |Lα|=|Lα+1|.

Therefore α<ω1, |Lα|=0 and |Lω1|=1.

This means: Vω+1Lω+1 (since the first has size 20, while the second has size 0).

Note: This does not mean VL. (V=L means x,α,xLα).

V=L is called the “axiom of constructibility”.

There is a finite fragment T𝕃 of ZF[!] that proves that all of the operations occuring in the definition of 𝕃, i.e. Fml,X<ω,,D,D are well-defined and absolute.

Thus, if M is a transitive model of T𝕃, then

αOrdM,𝕃αM

and thus 𝕃αM.

So αOrdM𝕃αM.

Axioms of ZF

Structural axioms:

Functional axioms:

Now we check that these hold in 𝕃.

In Lecture 2, we proved Extensionality and Foundation in all transitive structures, so also in 𝕃.

Note that ω satisfies the condition of the axiom of infinity, so any M transitive with ωM will satisfy the axiom of infinity. TODO

Now do pairing and union.

Since the definitions of pairs and unions

z={x,y}z=x

are absolute for transitive models, it’s enough to show that

x,y𝕃,{x,y}𝕃x𝕃x𝕃

If x,y𝕃α, φ(w,x,y):=w=xw=y,

D(φ,(x,y),Lα)={wLα:Lαφ(w,x,y)}={wLα:Lαw=xw}TODO

Powerset axiom.

x,p,w,(wpwx)

The problem here is that is not obviously absolute. In particular, z=P(x) is not absolute.

Consider 𝕃ω+1: we have ω𝕃ω+1 and P𝕃ω+1 is countable.

In 𝕃ω+2, we find

{a𝕃ω+1:aω}

which is the best possible answer to the question “what is the power set of ω?” that 𝕃ω+1 can give, but unlikely to be the correct answer.

Consider instead P(ω)𝕃=:P and define

Ω:={ρ𝕃(a):aP}.

(reminder: ρ𝕃(a) is the least α such that a𝕃α+1)

By Replacement, Ω is a set of ordinals, so find α>Ω. Then P𝕃α.

Therefore P={a𝕃α:aω}𝕃α+1, so P𝕃.

Separation:

p¯,x,s,w,(wswxφ(w,p¯)φ(w,x,p¯))

If x𝕃α, then

D(φ,x,Lα):={wLα:Lαφ(w,x,p¯)}={wLα:Lαwxφ(w,p¯)}=?{w𝕃not a problem:𝕃this is a problemwxφ(w,p¯)}

If φ is not absolute between Lα and L, this won’t work.

Levy Reflection Theorem to the rescue: φ,α,𝜃>α such that φ is absolute between L𝜃 and L.

Thus: form

D(φ,x,L𝜃)={wL𝜃:L𝜃wxφ(w,p¯)}=absolute{wL𝜃:Lwxφ(w,p¯)}

Replacement:

This will be on Example Sheet 2. The proof is a combination of the ideas from power set and separation.

Corollary 2.3 (Minimality). Assuming that:

  • T is a transitive model of ZF

Then for all αTOrd, 𝕃αT. Axiom of TODO.

Remark. Remark on the Axiom of Choice.

Gödel (1938): Con(ZF)Con(ZFC).

Note first that everything we did so far only needed ZF in the universe. We will sketch that 𝕃AC. In fact a strong version of AC known as GLOBAL CHOICE: there is an absolutely definable bijective operation between 𝕃 and Ord.

Sketch: Recursive construction of bijections πα:𝕃αηα for some ordinal ηα, and such that for β<α we have πα|𝕃β=πβ.

If λ is a limit and πα is defined for α<λ, then let

πλ(x):=πα(x)

if x𝕃α.

Suppose α=β+1 and πβ is given by πβ:𝕃βηβ.

Consider Fml×Lβ<ω. Well-order it in order type ηβ via the induced πβ well-order. Then if xLα, say

πα(x):={πβ(x)if xLβηβξif ξ is the ordinal corresponding to the least (φ,p¯) such that x=D(φ,p¯,TODO)

TODO

Cohen:

TZFC finite, TZFC finite such that if M is a countable transitive model of T, then there is NM countable transitive model of T+¬CH. ()

We have seen ( Example Sheet 1) that () implies Con(ZFC)Con(ZFC+¬CH).

Simplified: If M is a countable transitive model of ZFC, then there is NM countable transitive model of ZFCh=¬CH.

Idea: If M is a countable transitive model of ZFC: α:=ω1M; β:=ω2M; f:βP(ω) injection. Force N such that fN and MN.

Observe that there is a countable transitive model N such that fN and MN. Mtcl({f}) is transitive and countable. Thus LSM gives N transitive countable with Mtcl({f})N. Know Ng:βP(ω) is an injection, but no clue what 1 and 2 in N are.

How do we control what we add?

3 Forcing

Definition (Forcing). (,,𝟙) is called a forcing poset or forcing if it is a partial order and 𝟙 and 𝟙 is the largest element. Elements of are called condition.

pq is interpreted as

p is stronger than q

q is weaker than p

Note. Unconventional.

Alternative: “Jerusalem convention”. Interpret pq as “q is stronger than p”.

We are not following the Jerusalem convention.

Definition (Compatible). p and q are compatible if there is rp,q. Otherwise, we say they are incompatible (which we write as pq).

Definition (Antichain). AP is an antichain if any two distinct elements of A are incompatible.

Definition (Dense). D is dense if pTODO.

Definition (Filter). F is called a filter if

  • (a) p,qF,rF,rp,q
  • (b) pF,q,qpqF

If F only has property (a), we call it a filter base, and then

{p:qF,qp}

is the filter generated from F.

Note. Filters cannot contain incompatible elements.

Definition (D-generic). If D is a family of dense sets, then F is called D-generic if F is a filter and DD,DF.

Theorem 3.1. Assuming that:

  • D is countable

Then there is a D-generic filter.

Proof. Let D={Dn:n}. Pick p0D0 arbitrarily and define by recursion pi+1 by picking some qpi with qDi+1.

Then {pi:i} is a filter base, so let F be the filter generated by it. Then it is D-generic by construction.

Main example

Fix any sets X and Y

Fn(X,Y):={p:p is a finite fucntion with dom(p)X and range(p)Y}.

Define pqpq and 𝟙:=.

What does pq mean?

pqxX,xdom(p)dom(q)p(x)q(x).

Lemma 3.2. Assuming that:

Then F is a function.

Lemma 3.3. Dx:={p:xdom(p)} is a dense set. If D:={Dx:xX}, and F is D-generic, then dom(F)=X.

Proof. If xX, find pFDx, then xdom(p), TODO

3.1 Cohen Forcing

Example 1

:=Fn(ω,2).

If F is D-generic, then F:ω2 (by above).

Lemma 3.4. Assuming that:

  • Fix f:ω2 and define

    Nf:={p:u,p(u)f(u)}.

Then Ff.

Corollary 3.5. There is no F that is D{Nf|f:ω2}-generic.

However: if M is a countable transitive model and you consider

Nf:={Nf:fM},

then by Theorem 3.1, there is a DNM-generic. And for this F, TODO

Example 2

Fn(X,Y)=: as before.

Ry:={p:yrange(p)}R:={Ry:yY}

Lemma 3.6. Assuming that:

Then F:XY is a surjection.

Corollary 3.7. Assuming that:

  • |Y|>|X|

Then there is no DR-generic.

However: if M is a countable transitive model and, e.g. TODO

Example 3

Fn(X×Y,2). Assume X is infinite. Consider

Ey,y:={p:xX,p(x,y)p(x,y)}.

This is dense for yy.

E:={Ey,y:yyY}.

Lemma 3.8. Assuming that:

Then there is an injection from Y into P(X).

Proof. Fix y and define

Ay:={xX:(F)(x,y)=1}.

Ey,y guarantees that yAy is an injection.

Corollary 3.9. Assuming that:

  • |Y|>|P(ω)|

Then there is no DE-generic.

However: if M is a countable transitive model of ZFC+CH, α is countable and so a DE-generic exists.

Recap: M a countable transitive model of ZFC (or a sufficiently large finite fragment).

M: dense / filter / D-generic.

Example. Fn(ω,2) produces a new function f:ω2. Cohen forcing.

Example. Fn(X,Y) produces a surjection f:XY. Fn(ω,Y) COLAPSE of Y.

Example. Fn(X×Y,2) produces an injection f:YP(X).

If M is a countable transitive model of ZFC, then DM:={DP dense:DM} is countable, so by Theorem, we have a DM-generic.

Definition 3.10 (P-generic over M). We say F is -generic over M if it is DM-generic. These always exist if M is a countable transitive model.

GOAL: Build an extension M[G] such that MM[G], M[G] a countable transitive model of ZFC, GM[G] and M[G] is minimal.

Names

Idea: Think of elements of as “truth values” for the von Neumann construction.

Name0:=Nameα+1:={τ:τNameα×}Nameλ:=α<λNameα Then
Name:=αOrdNameα

is the proper class of all names.

Consider: ={0,1}. Then NameV.

Since this is a recursive definition using absolute concepts, being a name is absolute for transitive models:

{τ:Mτ is a -name}=NameM.

TODO

Examples

Name1,

τp:={(,p)}Name2

“The name for a set that contains with value p.”

τpq:={(τp,q)}.

“The name for whatever τp describes with value q”.

Interpretation

If F, we interpret a -name as follows:

val(τ,F):={val(σ,F):pF,(σ,p)τ}.

Important: This is a recursive definition.

Thus: the valuation is absolute for transitive models containing τ and F.

Example.

Definition 3.11 (Generic extension). The (generic) extension for any countable transitive model M and any F where M is

M[F]:={val(τ,F):τNameM}.

Obviously, M[F] is a countable set with M[F] (Example (1)).

Also, by definition, M[F] is transitive.

Note:

M[F]Extensionality+Foundation.
Need to show

Canonical Names

Definition 3.12 (Canonical name). Let xM. Define by recursion the canonical name for x by

wˇ:={(yˇ,𝟙):yx}.

Lemma 3.13. val(xˇ,F)=x if 𝟙F.

Proof. Induction.

Corollary 3.14. MM[F] if 𝟙F.

Alternative construction of canonical names without 𝟙 is on Example Sheet 2.

Γ:={(pˇ,p):p}.

Lemma 3.15. val(Γ,F)=F.

Proof. Calculate:

val(Γ,F)={val(pˇ,F):pF}={p:pF}(by previous lemma)=F

Corollary 3.16. FM[F].

Remark. If N is a countable transitive model with MN and FN, then M[F]N. (by absoluteness of val(τ,F)).

Warm-up: Suppose σ,τName. Define

up(σ,τ):={(σ,𝟙),(τ,𝟙)}.

Pairing: Then

val(up(σ,τ),F))={val(σ,F),val(τ,F)}.

(“up” stands for unordered pair).

Corollary 3.17. M[F]Pairing (if 𝟙F).

Union: If τ is a name, define

uτ:={(σ,r):σ,p,q,s.t.(σ,p)τ,(σ,q)σ,rp,q}.

Claim: val(uτ,F)=val(τ,F) if F is a filter.

Proof. Suppose zval(uτ,F).

So z=val(σ,F) for some (σ,r)uτ with rF.

So σ,p,q with (σ,p)τ, (σ,q)σ, rp,q.

So p,qF. Hence val(σ,F)val(τ,F), z=val(σ,F)val(σ,F).

So zval(τ,F).

Conversely, suppose zval(τ,F). Then y,zyval(τ,F) (z(σ,q)σ with qF, y(σ,ρ)τ with pF).

Hence since F a filter, find rp,q with rF.

Then (σ,r)uτ, so zval(uτ,F).

TODO

Further Recap

We proved

M[G]Extensionality + Foundation + Pairing + Union.

Homework was: Think about why power set is not easy.

“Union” proof was:

collect all natural candidates of names for elements and assign the natural values.

Problem: If you try to do this for power set, neither the “natural candidates for names” nor the “natural values” are obvious. It’ll turn out that they are obvious in the end, but that requires some assistance.

Note: Separation and Replacement are even worse.

One remaining easy axiom: AC. By well-ordering theorem, AC holds if and only if x,α,i,i:xα injection. If xM[F], then there is σName such that x=val(σ,F).

Notation. I write dom(σ):={τ:p,(τ,p)σ}.

Consider dom(σ). In M, I have i:dom(σ)α for some ordinal α. In M[F], define

ymin{i(τ):val(τ,F)=y,τdom(σ)}.

Call this i. Then i:xα is an injection. So, AC holds in M[F].

Forcing

Definition (Forcing language). Fix M a countable transitive model of ZFC, M a forcing poset.

We call the language

Lforcing:=L{τ:-names}

the forcing language (over M).

Definition (Interpretation of forcing language). If G is a -generic over M and φ is in the forcing language, we say

M[G]φ

if and only if

M[G],vφ

where v(τ):=val(τ,G).

Definition (Semantic forcing predicate). pM,φ: Forall G -generic over M with pG, M[G]φ.

p forces φ

We often omit M,.

Two theorems at the heart of forcing:

We are going to do the following:

Lectures 11 and 12.

Observations (under the assumption of the Forcing Theorem):

Proof of Separation in M[G]. Let φ(x,x1,,xn) be an L-formula (not in the forcing language) and xM[G]. Need a name for

A:={zx:M[G]φ(z,p¯)}.

[For readability, we now drop parameters p¯].

Fix σ such that val(σ,G)=x. Define

ϱ:={(π,p):πdom(σ)pπσφ(π)}.

By the Definability Theorem, ϱ is a name in M.

Claim: val(ϱ,G)=A.

Proof of power set. Example Sheet 3.

Proof of Replacement.

Since we already have Separation, it’s enough to show the following:

if φ is a functional formula and xM[G], then there is RM[G] such that

M[G]yx,zR,φ(y,z,p¯)(∗)

(as before, we suppress parameters for notational ease).

We work in M and identify a name ρ for R. Fix σ such that x=val(σ,G). Find α large enough such that dom(σ)Vα. Consider ψ(p,π):=μ,pφ(π,μ) (p, π a name). By Definability Theorem, this is an L formula.

By Levy Reflection Theorem, find 𝜃>α such that ψ is absolute between V𝜃 and V=M. Define

ρ:={(μ,1):μV𝜃}

and R:=val(ρ,G).

Now we verify () holds: Fix yx, y=val(π,G). Since φ is functional, we know that φ(π,μ) holds in M[G] for some μ. By Forcing Theorem, there is pG such that pφ(π,μ). So Mψ(p,π).

By absoluteness, V𝜃ψ(p,π). This means μV𝜃 such that pφ(π,μ).

Thus: val(μ,G)val(ρ,G)=R.

Definition (Dense below p). D is called dense below p if qp,rq,rD.

Lemma 3.18. Assuming that:

Then

Proof. Example Sheet 3

Definition of the syntactic forcing relation

pφ(τ¯).

Two recursions:

First “=” by recursion on Nameα.

Then “” without recursion.

Then the rest by recursion on formula complexity.

pτ0=τ1: if and only if

(π0,s0)τ0{qp:qs0(π1,s1)τ1,(qs1qπ0=π1)}is dense below p

and

(π1,s1)τ1{qp:qs1(π0,s0)τ0,(qs0qπ0=π1)}is dense below p

Remark. This is a recursion on Nameα.

pτ0τ1: if and only if

{qp:(π,s)τ1,(qsqπ=τ0)}

is dense below p.

Remark. No recursion involved, just “=”.

Recursion on complexity of formulas:

Remark. These definitions remind us of Kripke semantics for intuitionistic logic.

Lemma 3.19. The following are equivalent:

  • (i) pφ.
  • (ii) rp,rφ.
  • (iii) {r:rφ} is dense below ρ.

Proof. If this is true for φ of the form τ0=τ1 and of the form τ0τ1, then it’s true for all formulas.

For φ atomic, we get that (ii) (i) and (ii) (iii) are trivial.

(i) (ii) follows from Lemma 3.18(i).

(iii) (i) follows from Lemma 3.18(ii).

Strategy:

Theorem 3.20 (Syntactic Forcing Theorem). M[G]φ if and only if pG,Mrφ.

Corollary 3.21 (Definability Theorem). pφ if and only if pφ.

(pMφMpφ)

Corollary 3.22 (Forcing Theorem). M[G]φ if and only if pG,pφ.

This corollary is immediate from combining the two previously stated results.

Proof of Definability Theorem from Syntactic Forcing Theorem.

Proof of the Syntactic Forcing Theorem. The Theorem for = can be proved by induction on Nameα.

The Theorem for can be proved once the Theorem has been established for =.

Rest is induction on formula complexity.

Let’s do ¬ and leave the rest to Example Sheet 3. Assume we have proved it for φ. We will prove it for ¬φ.

Note. The Forcing Theorem is very useful! The inner details of the proof are not very important though. We won’t really discuss these details once we have proved the theorem.

Continuing the proof of Syntactic Forcing Theorem. Assume Syntactic Forcing Theorem for “=” and prove if for “”.

Want to show:

pτ0τ1 if and only if

{q:(π,s)τ1,(qsqπ=τ0)}

is dense below p.

Proof.

Now we prove it for “=”.

We prove this by induction on name rank, so assume that Syntactic Forcing Theorem for “=” is true for names π0,π1 with smaller name rank.

Reminder: we need to show M[G]τ0=τ1 if and only if pG, pτ0=τ1.

TODO: double check the below

Proof.

Summary: If G is Fn(ω×2M,2)-generic, then M[G]ZFC+there is an injection from 2M into P(ω).

This implies M[G]ZFC+202 if we have 1M=M[G], 2M=2M[G]. This is the goal for today’s lecture.

Note that our proof above is not good enough for the statement () from Lecture 8:

For all TZFC finite, there exists TZFC finite such that if M is a countable transitive model of T, then there is NM countable transitive model of T+¬CH.

For this, we need to look more carefully at the proof of the GMT:

MZFCM[G]ZFC.

The proof proceeds AXIOM BY AXIOM and thus for each φZFC, we find finite Sφ such that MSφ implies M[G]φ.

Let SZFC finite such that S proves that all relevant notions (name, value, …) are well-defined and absolute. Then for TZFC finite, define

T:=SφTSφ.

Then, the proof shows ().

Let’s prove ¬CH in M[G].

Definition (Preserves cardinals). We say preserves cardinals if G -generic over M, “κ is a cardinal” is absolute between M and M[G].

Definition 3.23 (Countable chain condition). We say has the countable chain condition (c.c.c.) if every antichain (note the anti!) in is countable.

Theorem 3.24. Assuming that:

Theorem 3.25. Fn(ω×2M,2) has the countable chain condition.

Corollary 3.26. Assuming: - G is Fn(ω×2M,2)-generic over M, then

M[G]ZFC+202.

Lemma 3.27. Assuming that:

Then there is FM such that xX,F(x)Y, xX,f(x)F(x) and MxX,F(x) is countable.

Proof of Theorem 3.24. Suppose Mκis a cardinal, M[G]κ is not a cardinal, so there is λ<κ and fM[G], f:λκ, f is a surjection. Apply the lemma to get F.

Define R:=α<λF(α).

Since x,f(x)F(x), R=κ.

But M|R|=0λ=λ<κ. But then M thinks that κ is not a cardinal, contradiction.

Now we prove the lemma:

Proof. Let F(x):={yY:p,pτ(xˇ)=yˇ}.

Fix τ a name for f.

By Definability Theorem , FM.

Then xX,F(x)Y follows from definition.

xX,f(x)F(x) follows from Forcing Theorem.

Let’s look at MF(x) is countable: If yF(x), let py be such that pyτ(xˇ)=yˇ.

If yy, then pypy. Thus

{py:yF(x)}

is an antichain.

By countable chain condition, it’s countalbe. So F(x) was countable.

TODO

Proof of Theorem 3.25. Actually, we prove Fn(X,Y) has countable chain condition whenever Y is countable.

Δ-systems form Example Sheet 3 Q33 are also called quasi-disjoint families.

(A family of finite sets D is called a Δ-system if there is a finite set R (called the root of the Δ-system) such that for all D,DD, if DD then DD=R).

Δ-system lemma: Any countable family of finite sets contains an uncountable Δ-system.

Take any A uncountable and prove that it’s not an antichain.

If pA, then dom(p)X finite. Consider

S:={dom(p):pA}.

That’s an uncountable family of finite sets, so by Δ-system lemma, find Δ-system DS uncountable.

Let rX finite be the root of D.

Since Y is countable, there are only countably many functions q:rY. Since D is uncountable, by pigeonhole principle, there are p,q such that dom(p),dom(q)D and p|r=q|r. But since dom(p)dom(q)=r, p and q are compatible.

So A is not an antichain.

We got M[G]202.

Next time: What is the size of 20 in M[G]?

Remember: LST Example Sheet 4: ZFC20ω.

What if 2 was one of the forbidden values?

Remark. Obtaining M[G]20=2 cannot be quite as general as this proof: if M20>2, then this will remain true in M[G].

Remark. If G is Fn(ω×αM,2)-generic over M, then

M[G]20α.

Previous lecture: Suppose M a countable transitive model of ZFC.

TODO

Question: () What are the possible values for 20?

Mentioned last lecture: not all values are possible. In particular, 20ω.

Definition (Cofinal). Cκ is cofinal (= unbounded) if λ<κ,γC,γλ. We can then define:

cfκ:={|C|:C is cofinal}.

Example 3.28. cf1=1, cfω=0.

Lemma 3.29 (Kőnig’s Lemma). κcfκ>κ.

Then LST Example Sheet 4 Q10 is the special case κ=ω, cfκ=0 of Kőnig’s Lemma.

Consequence for 2cfκ: (2cfκ)cfκ=2cfκ, thus 2cfκκ.

Preview of answer to (): Every value not prohibited by Kőnig’s Lemma is possible.

Now: 2!

Definition. If is any forcing, let

OL:={An:nω}

be any ω-sequence of antichains in .

Let

τOL:={(ň,p):pAn}.

We call these nice names.

If ||=κ and has countable chain condition, then there are at most κ0 many antichains and thus at most (κ0)0=κ00=κ0 many ω-sequences of antichains and thus nice names.

Theorem 3.30. Assuming that:

  • M[G]xω

Then there is a nice name τ such that val(τ,G)=x.

Note. The Theorem does not need any assumptions about .

Proof. Start with M such that val(μ,G)=x (possibly not nice). Fix nω. Fix either a well-ordering of (possibly using AC to get one) and build a maximal antichain An such that pAn, pňμ.

TODO.

Claim: val(μ,G)=val(τOL,G).

: If nval(τOL,G), then there is pG such that (ň,p)τOL, and then pAn, so pňμ. So nval(μ,G).

: If nval(μ,G), then by Forcing Theorem, get qG such that qňμ.

Subclaim: AnG. Indeed, by our lemma on compatibility ( Example Sheet 3), get qG such that qp for all pAn. Find rq,q. Then rňμ. But that is in contradiction to An being maximal.

So find pGAn. By definition, (ň,p)τOL and pG. So nval(τOL,G).

Corollary 3.31. Assuming that:

Then M[G]20λ.

Proof. Follows directly from:

Main Application

If =Fn(ω×2M,2), then ||=2M.

Calculate in M, 20.

Hausdorff’s Formula:

α+1β=α+1αβ.

So in particular:

10=100=2020=210=22

So 20=max(2,20).

By this calculation, if M202, then M[G]202.

Corollary 3.32. If MCH, then M[G]20=2.

Remark. This proof also shows that if

M20n

and G is -generic over M where =Fn(ω×2M,2), then

M[G]20n.

Corollary: If MCH, then M[G]20=n.

Remark. What happens at ω?

:=Fn(ω×ωM,2).

By general theory, M[G]20ω, but Kőnig’s Lemma gives 20ω+1.

What about the lower bound?

Our theorem and counting of nice names yields

M[G]20ω0>ω.

If MGCH, then

ω0ωω=ω+1.

So by Kőnig’s Lemma, ω0=ω+1.

Therefore, if MGCH, then M[G]20=ω+1.

TODO

First limit cardinal that is a possible value of 20 is ω1.

Clearly, =Fn(ω×ω1M,2) adds injection from ω1M into P(ω). So M[G]20ω1.

Count nice names: ||0=ω10.

If for all α<ω1, α0ω1 (), then ω10=ω1.

ω10=ω1={f|f:ωω1}=:X=α<ω1{f|f:ωα}

(since ω1 has no cofinal ω-sequence).

Thus |X|1ω1=ω1 (by assumption ()).

Thus M[G]20=ω1.

TODO

What about 21?

More on nice names:

Definition (lambda-nice names). Generalise “nice names” to λ-nice names:

Let

OL={Aα:α<λ}

be a family of λ many maximal -antichains:

τOL:={(αˇ,p):pAα}

for αλ.

These are names for subsets of λ.

Observe that our theorem “every Aλ in M[G] has a λ-nice name” still goes through.

If κ is an M-cardinal such that every antichain of has size κ. [On Example Sheet 3, this is called the κ+-chain condition.]

Let μ:=|| (in M). Then

(μκ)λ=μκλ

is an upper bound on the number of λ-nice names.

Thus

M[G]2λ(μκλ)M.

Question: Forcing with :=Fn(ω×2,2) and calculate 21. Assume MGCH. Then:

μ=||=2λ=1λ=0

(since has countable chain condition). So

M[G]2λ210=(21)M.

Calculate (21)M:

21=Hausdorff’s formula221=GCH22=2.

Together:

M[G]21=2=20.

Question: Is it possible to get PSA, i.e.

κ,λ,  κ<λ2κ<2λ

without CH.

In particular, can we get

20=221=3

First idea: Force with Fn(1×3,2).

Unfortunately,

M[G]20=3.

Interpret the generic object G as fα12 for α<3.

Define gα:=fα|ω.

Claim: For αα, gαgα. This is since

Dα,α:={p:uω,gα(u)gα(u)}

is still dense.

So, forcing with Fn(1×3,2) gives the same situation as forcing with Fn(ω×3,2) for 20, 21.

Idea:

Fn(X,Y,κ):={p:dom(p)X,range(p)Y,|p|κ}.

Thus Fn(X,Y)=Fn(X,Y,0).

Consider

:=Fn(1×3,2,1).

Properties:

First goal: What about preserving cardinals?

Clearly, Fn(1×3,2,1) does not have the countable chain condition anymore.

With example (38) (on Example Sheet 3), we need to figure out the chain condition of Fn(1×3,2,1).

We need a Δ-system lemma for this: If λ is regular (cfλ=λ) and OL is a family of sets of size <λ of size λ+. Then there is a Δ-system DOL of size λ+.

This Δ-system lemma gives with the same proof as before:

Fn(1×3,2,1) has the 2M-chain condition.

So: Fn(1×3,2,1) preserves cardinals 2M.

Closure

Definition (lambda-closed). A forcing is called λ-closed if any family {pα:α<γ} for γ<λ that is a descending chain:

α<βpβ<pα

there is q such that qpα for all α<γ.

Example. Fn(1×3,2,1) is 1-cloesd.

[If {pα} is a descending chain, then α<γpα is a condition in Fn(1×3,2,1).]

Theorem 3.33. Assuming that:

Then
P(κ)M=P(κ)M[G].

Corollary 3.34. Forcing with Fn(1×3,2,1)

  • (a)
    Does not change P(ω).
  • (b)
    Therefore preserves 1 (see Example Sheet 1 and the relation between codes for countable well-orders and preserving 1).

Summary: Forcing with Fn(1×3,2,1) over a model of GCH gives M[G] with:

Hence 21=3.





Forcing

Property

Preservation

Arithmetic





Fn(ω×2,2)

countable chain condition

all cardinals

20=2, 21=2





Fn(ω×3,2)

countable chain condition

all cardinals

20=3, 21=3, 22=3





Fn(1×3,2,1)

1-closed.

If MCH, then 2-chain condition

1 preserved. (Closure lemma [not yet proved])

κ2 preserved

If MCH, then 20=1, 21=3.





Note GCHPSA, but our model of ¬CH fails PSA.

Question: Can we have 1<20<21?

Closure lemma:

Theorem. If is λ-closed and κ<λ, then P(κ)M=P(κ)M[G].

λ-closed: every descending sequence of length <λ has a lower bound.

Proof. Let fM[G], f:κ2 and assume towards contradiction that fM. fB.

B:={fM|f:M2}.

Let τ be a name for f.

By Forcing Theorem, there is pG such that pτ:κˇ2ˇτBˇ.

Construct a κ-sequence of conditions pα TODO

TODO

The sequence {pα:ακ} is defined in M (by Definability Theorem), so we can define

g(α)=1:pα+1τ(αˇ)=1ˇ.

Then gM.

But now pκτ(αˇ)=1ˇ or pκτ(αˇ)=0ˇ for all α.

uso pkτ=ǧ. Hence pκτBˇ.

But pκpτBˇ. Contradiction.

Note that while Fn(X,Y,1) is always 1-closed, the chain condition depended on the value of 10.

The partial order Fn(1×3,2,1) has in general the (20)+-chain condition.

CH implies (20)+=2, so all cardinals are preserved.

However, if 20>1, then there is a gap and we do not know whether TODO.

If M20=2, does

Fn(1M×3M,2,1M)

preserve 2M?

Answer: Fn(λ+×κ,2,λ+) always adds a surjection from λ+ to 2λM. ()

Application: If λ=2 and M=20=2, then Fn(1×κ,2,1M adds a surjection from 1M onto P(0)M, i.e. 2M. So |2M|=1M[G].

[Proof of (). ] A generic for “is” a map f:λ+×κ2.

Define h:λ+2λ by

h(α)(β)=1:f(α,β)=1.

Claim: h is a surjection onto 2λM.

If gM, g:λ2, consider

Dg:={p|α<λ+,β<λ,p(α,β)=1g(β)=1}.

This is dense, and thus grange(h).

Back to our question: Can we get 1<20<21?

Start with MGCH. Consider

:=Fn(1×3,2,1)M

and let G be -generic over M. Consider

:=Fn(ω×2,2)M[G],

and let H be -generic over M[G].

Claim: M[G][H]1<20<21.

(M[G][H] is a forcing iteration).

This proves the claim.

Important: The order of forcings matters!

Suppose MGCH.

:=Fn(ω×2,2)M.

H -generic over M.

:=Fn(1×3,2,1)M[H].

G -generic over M[H].

Consider M[H][G]. Then

M[H]20=2=21.

But that means that forcing with will collapse 2M=2M[H].

Since is 1-closed,

P(ω)M[H]=P(ω)M[H][G].

In particular, M[H][G]CH. So, this order does not achieve what we want.

Final Remark on Forcing CH

Assume M20=2.

Question: Can you obtain M[G]CH?

The natural forcing would be

:=Fn(ω,1M).

This collapses 1M; it does not have the countable chain condition, but since it has size 1M, it has the 2M-chain condition, so all cardinals 2M are preserved.

Clearly therefore:

M[G]|P(ω)M|=2M=1M[G].

But: is |P(ω)M|=|P(ω)M[G]|?

Nice names: gives upper bound of

(1M)1M0=(21M)M.

That’s not surprising, since any A1 in M becomes a new subset of ω in M[G] via the new bijection between ω and 1M.

˙

Index

D-generic

absolute

absolute

absolute

antichain

countable chain condition

cofinal

compatible

countable transitive model

downwards absolute

dense below

dense

filter

filter base

forcing language

forcing

hierarchy

incompatible

λ-closed

λ-nice name

nice name

preserves cardinals

upwards absolute