8.12. Separation TheoremsΒΆ

Separation theorems provide us ways to find a separating hyperplane between two disjoint convex sets. Hyperplanes require the notion of an inner product.

Throughout this section, we assume that V is a finite dimensional real vector space endowed with norm β€–β‹…β€–:Vβ†’R and a real inner product βŸ¨β‹…,β‹…βŸ©:VΓ—Vβ†’R. The norm in general is not necessarily induced by the inner product. The Euclidean norm is denoted by β€–β‹…β€–2.

8.12.1. Types of Separating HyperplanesΒΆ

Definition 8.59 (Separating hyperplane)

Let V be a real n-dimensional vector space. For any two subsets C and D of V, a hyperplane H is said to separate C and D if C is contained in one of the closed halfspaces corresponding to H and D is contained in the other closed halfspace.

Let H be given by

H={x∈V|⟨x,a⟩=b}

where a∈Vβˆ— and b∈R.

If C and D are separated by H, then either

⟨x1,aβŸ©β‰€bβ‰€βŸ¨x2,aβŸ©βˆ€x1∈C,x2∈D

or

⟨x2,aβŸ©β‰€bβ‰€βŸ¨x1,aβŸ©βˆ€x1∈C,x2∈D

holds true.

We can also say that the two sets C and D can be separated by a hyperplane or there exists a hyperplane separating C and D if there exists a nonzero vector a∈Vβˆ— such that

supx∈C⟨x,aβŸ©β‰€infx∈D⟨x,a⟩.

Any choice of b∈[l,u] where l=supx∈C⟨x,a⟩ and u=infx∈D⟨x,a⟩ describes a separating hyperplane.

This definition allows for some degenerate possibilities where both CβŠ†H and DβŠ†H since the closed halfspaces contain H (as their boundary).

Definition 8.60 (Proper separation)

Let V be a real n-dimensional vector space. For any two subsets C and D of V, a hyperplane H is said to properly separate C and D if H separates them and both are not entirely contained in H; i.e., either C⊈H or D⊈H or both.

Proper separation still allows for the possibility that parts of C or D lies inside H. But it ensures that Cβ–³Dβ‰ βˆ…. C∩DβŠ†H is the common part.

If C and D are properly separated by H, then they are also separated by H. Hence

⟨x1,aβŸ©β‰€bβ‰€βŸ¨x2,aβŸ©βˆ€x1∈C,x2∈D

or

⟨x2,aβŸ©β‰€bβ‰€βŸ¨x1,aβŸ©βˆ€x1∈C,x2∈D

holds true. Also, there exists at least one x1∈C and x2∈D such that at least one of these inequalities is a strict inequality thus ensuring that either C or D or both are not contained entirely in H.

We can also say that the two sets C and D can be properly separated by a hyperplane or there exists a hyperplane properly separating C and D if there exists a nonzero vector a∈Vβˆ— such that

supx∈C⟨x,aβŸ©β‰€infx∈D⟨x,a⟩.

and

infx∈C⟨x,a⟩<supx∈D⟨x,a⟩.

Any choice of b∈[l,u] where l=supx∈C⟨x,a⟩ and u=infx∈D⟨x,a⟩ describes a properly separating hyperplane. This characterization is proved in Theorem 8.161 below.

If a convex set has a nonempty interior, then it cannot be contained in a hyperplane. Two disjoint sets one of which has a nonempty interior can indeed be properly separated.

Definition 8.61 (Strong separation)

Let V be a real n-dimensional vector space. For any two subsets C and D of V, a hyperplane H is said to strongly separate C and D if there exists an r>0 such that C+rB is contained in one of the open halfspaces corresponding to H and D+rB is contained in the other open halfspace.

Under these assumptions, one set lies in the interior of H++ and the other set lies in the interior of Hβˆ’βˆ’. Under strong separation, C∩D=βˆ… is guaranteed since H++∩Hβˆ’βˆ’=βˆ….

We can also say that the two sets C and D can be strongly separated by a hyperplane or there exists a hyperplane strongly separating C and D if there exists a nonzero vector a∈Vβˆ— such that

supx∈C⟨x,a⟩<infx∈D⟨x,a⟩.

Any choice of b∈(l,u) where l=supx∈C⟨x,a⟩ and u=infx∈D⟨x,a⟩ describes a strongly separating hyperplane.

Strong separation is called as strict separation in [7].

Remark 8.9

Separation is translation invariant. For any a∈V and convex sets C,DβŠ†V,

C and D are separated by a hyperplane H if and only if Cβˆ’a, Dβˆ’a are separated by the hyperplane Hβˆ’a.

8.12.2. Disjoint Affine and Convex SetsΒΆ

Separation between sets can be discussed at several levels. We start with a simple case where an affine set and a relatively open convex set are disjoint.

Theorem 8.160 (Disjoint affine and relatively open convex sets)

Let V be a real n-dimensional vector space. Let C be a nonempty relatively open convex set of V. Let MβŠ†V be an affine subspace such that M∩C=βˆ…. Then, there exists a hyperplane H such that MβŠ†H and one of the two open halfspaces associated with H contains C.

Proof. Consider the case where M is a hyperplane. Then, H=M and the result is immediate.

  1. If for contradiction, C is not contained in one of the open halfspaces of M.

  2. Then, C∩Hβ‰ βˆ… since H is the boundary of the halfspaces.

  3. It will contradict our hypothesis that C∩M=βˆ….

  4. Thus, C must be in one of the open halfspaces of H.

Now, assume that M is not a hyperplane.

Without loss of generality, assume that 0∈M. We can do this by picking any element m of M and replacing M by Mβˆ’m and C by Cβˆ’m. Thus, M is a subspace.

The proof is constructive. We iteratively construct subspaces of increasing dimension which contain M but don’t contain C till we reach a hyperplane.

  1. Let dimM=d where d<n.

  2. Let S=M be the initial subspace.

  3. Let r=nβˆ’1βˆ’d be the number of iterations (if M is a hyperplane, no iterations are needed).

  4. For i=1,…,r:

    1. Replace S by a subspace Sβ€² such that dimSβ€²=dimS+1 and Sβ€²βˆ©C=βˆ….

    2. Let S=Sβ€²

  5. We have arrived with a hyperplane S such that S∩C=βˆ….

Towards this, let us look at the procedure to construct Sβ€² from S such that dimSβ€²=dimS+1 and S∩C=βˆ…. Here is the outline

  1. Let SβŠ₯ be the orthogonal complement of S.

  2. Pick any 2D subspace P of SβŠ₯.

  3. Pick a line L in P which passes through origin but doesn’t intersect with C; i.e. L∩C=βˆ….

  4. Construct the subspace Sβ€²=SβŠ•L. The direct sum is valid since S∩L={0}.

  5. Note that dimSβ€²=dimS+1 and Sβ€²βˆ©C=βˆ… since S∩C=βˆ… as well as L∩C=βˆ….

We now elaborate the procedure for finding the line L:

  1. We have S as a linear subspace with dimS<nβˆ’1 and S∩C=βˆ….

  2. dimSβŠ₯=nβˆ’dimS>nβˆ’(nβˆ’1)=1.

  3. Since dimSβŠ₯>1, it contains a two dimensional subspace P.

  4. Consider the set Cβ€²=P∩(Cβˆ’S).

  5. The set Cβˆ’S has some properties.

    1. Cβˆ’S is convex since it is the sum of two convex sets.

    2. Since 0∈S, hence C=Cβˆ’0βŠ†Cβˆ’S.

    3. Since C∩S=βˆ…, hence 0βˆ‰Cβˆ’S.

    4. Note that due to Theorem 8.157:

      ri(Cβˆ’S)=riCβˆ’riS=Cβˆ’S

      since C is relatively open and S is affine, hence relatively open.

    5. Thus, Cβˆ’S is relatively open.

  6. Consequently, the set Cβ€² also has some properties.

    1. Cβ€² is convex since it is the intersection of two convex sets.

    2. Since 0βˆ‰Cβˆ’S, hence, 0βˆ‰Cβ€²=P∩(Cβˆ’S).

    3. Cβ€² is relatively open.

      1. If Cβ€² is empty, then Cβ€² is relatively open vacuously.

      2. Otherwise, by Theorem 8.155:

      riCβ€²=ri(P∩(Cβˆ’S))=P∩ri(Cβˆ’S)=P∩(Cβˆ’S).

      Thus, riCβ€²=Cβ€² implies that Cβ€² is relatively open.

  7. We now seek a line LβŠ†P passing through the origin 0 that doesn’t intersect Cβ€².

  8. We note that L∩Cβ€²=βˆ…βŸΉL∩C=βˆ….

    1. Suppose for contradiction that L∩Cβ€²=βˆ… but L∩Cβ‰ βˆ….

    2. Let x∈L∩C.

    3. Then, x∈LβŠ†P.

    4. Also, x∈CβŠ†Cβˆ’S.

    5. Thus, x∈Cβ€²=P∩(Cβˆ’S).

    6. This means that x∈L∩Cβ€².

    7. In other words, L∩Cβ€²β‰ βˆ… which contradicts our assumption.

  9. How to choose L? There are three possibilities:

    1. Cβ€² is empty.

    2. Cβ€² is contained in a line; i.e., dimaffCβ€²=1.

    3. affCβ€²=P and dimaffCβ€²=2.

    affCβ€² cannot be larger than P since Cβ€²βŠ†P and P is affine.

  10. If Cβ€² is empty, then any line in P passing through origin will do.

  11. If affC is a line.

    1. If affCβ€² passes through the origin, we can take a line perpendicular to affCβ€² passing through origin. In this case, L∩affCβ€²={0}. Since 0βˆ‰Cβ€², hence L∩Cβ€²=βˆ….

    2. If affCβ€² is a line that doesn’t pass through 0, we can simply take a line that is parallel to affCβ€² and passes through 0. In this case L∩affCβ€²=βˆ…. Consequently, L∩Cβ€²=βˆ….

  12. If affCβ€² is the entire 2D subspace P.

    1. Consider the set K=⋃{tCβ€²|t>0}.

    2. Cβ€²βŠ†K (specifically, for t=0).

    3. K is a convex cone containing Cβ€² but not containing 0.

    4. K is relatively open since Cβ€² is relatively open.

    5. A boundary ray of K doesn’t intersect with Cβ€² since K is relatively open.

    6. Take L to be any of the two boundary rays of K and extend it to a line passing through 0.

8.12.3. Proper Separation CharacterizationΒΆ

Theorem 8.161 (Characterization of proper separation)

Let V be an n-dimensional real vector space. Let S and T be nonempty subsets of V. There exists a hyperplane H that separates S and T properly if and only if there exists a vector a∈V such that

inf{⟨x,a⟩|x∈S}β‰₯sup{⟨x,a⟩|x∈T}

and

sup{⟨x,a⟩|x∈S}>inf{⟨x,a⟩|x∈T}.

Proof. Assume that there is some vector a∈V satisfying the conditions above.

  1. Since a satisfies the second strict inequality, hence a≠0. Otherwise, the second inequality cannot hold.

  2. Choose any b∈R such that

    infx∈S{⟨x,a⟩}β‰₯bβ‰₯supx∈S{⟨x,a⟩}.
  3. Then, the set H={x|⟨x,a⟩=b} is a hyperplane.

  4. Let H+={x|⟨x,a⟩β‰₯b} and Hβˆ’={x|⟨x,aβŸ©β‰€b}.

  5. Clearly, SβŠ†H+ and TβŠ†Hβˆ’.

  6. The second strict inequality ensures that both S and T cannot be contained in H.

    1. For contradiction, assume SβŠ†H and TβŠ†H

    2. Then sup{⟨x,a⟩|x∈S}=b and inf{⟨x,a⟩|x∈T}=b.

    3. Thus, the second inequality will not hold.

  7. Thus, S and T are properly separated.

Now, assume that S and T are properly separated by some hyperplane H.

  1. Let H={x|⟨x,a⟩=b} be the specification of the said hyperplane.

  2. Let H+={x|⟨x,a⟩β‰₯b} and Hβˆ’={x|⟨x,aβŸ©β‰€b}.

  3. Without loss of generality, assume that SβŠ†H+ and TβŠ†H+.

  4. Then, ⟨x,a⟩β‰₯b for every x∈S.

  5. Thus, inf{⟨x,a⟩|x∈S}β‰₯b.

  6. Similarly, ⟨x,aβŸ©β‰€b for every x∈T.

  7. Thus, sup{⟨x,a⟩|x∈T}≀b.

  8. Thus, the first inequality is satisfied.

  9. Since H properly separates S and T, hence either S⊈H or T⊈H.

  10. If S⊈H, then there exists x∈S such that ⟨x,a⟩>b.

  11. Consequently, sup{⟨x,a⟩|x∈S}>b.

  12. If T⊈H, then there exists x∈T such that ⟨x,a⟩<b.

  13. Consequently, inf{⟨x,a⟩|x∈T}<b.

  14. Combining,

    sup{⟨x,a⟩|x∈S}>inf{⟨x,a⟩|x∈T}

    must be true.

8.12.3.1. Proper Separation : Set and PointΒΆ

Remark 8.10 (Proper separation between a set and a point)

Let V be an n-dimensional real vector space. We consider the case of proper separation between a set S and a point p. We can form a set T={p}. Then the proper separation between S and p can be described in terms of proper separation between S and T.

Then

sup{⟨x,a⟩|x∈T}=inf{⟨x,a⟩|x∈T}=⟨p,a⟩.

Hence, S and p are properly separated if and only if there exists a vector a∈V such that

inf{⟨x,a⟩|x∈S}β‰₯⟨p,a⟩ and sup{⟨x,a⟩|x∈S}>⟨p,a⟩.

We now consider a hyperplane

H={x∈V|⟨x,a⟩=⟨p,a⟩}.

And let H+ denote one of its closed half-spaces

H+={x∈V|⟨x,a⟩β‰₯⟨p,a⟩}.

We can clearly see that

  1. a∈H.

  2. SβŠ†H+. S is contained entirely in the closed half-space H+.

  3. S⊈H. S is not contained entirely in H.

Thus, if a set and a point are properly separated, then there exists a hyperplane that passes through the point, contains the set in one of its half-spaces but does not fully contain the set itself.

8.12.4. Separating Hyperplane TheoremsΒΆ

Theorem 8.162 (Separating hyperplane theorem I)

Let V be an n-dimensional real vector space. Let S and T be nonempty convex subsets of V. There exists a hyperplane H that separates S and T properly if and only if riS∩riT=βˆ….

In other words, two sets are properly separated if and only if their relative interiors are disjoint.

Proof. Let C=Sβˆ’T.

  1. By Theorem 8.142, riS and riT are nonempty since S and T are nonempty convex sets.

  2. By Theorem 8.157, riC=riSβˆ’riT and it is nonempty.

  3. Note that, 0βˆ‰riC if and only if riS∩riT=βˆ….

    1. If 0∈riC then there exists x∈riS∩riT such that xβˆ’x=0.

Now assume that riS∩riT=βˆ….

  1. Thus, 0βˆ‰riC.

  2. Consider the affine set M={0}.

  3. Clearly, M∩riC=βˆ… and riC is relatively open.

  4. By Theorem 8.160, there exists a hyperplane containing M such that riC is a subset of one of its associated open halfspaces.

  5. Let H be this hyperplane given by H={x|⟨x,a⟩=0}. Since the hyperplane contains M, hence it is a subspace.

  6. By Theorem 8.149, CβŠ†clCβŠ†clriC.

  7. Thus, C is contained in the corresponding closed halfspace.

  8. Without loss of generality, assume that C is contained in the nonnegative halfspace H+.

  9. Thus, inf{⟨x,a⟩|x∈C}β‰₯0.

  10. This means that

    inf{⟨x,a⟩|x∈C}=inf{⟨x,a⟩|x∈S}βˆ’sup{⟨x,a⟩|x∈T}β‰₯0.
  11. Thus,

    inf{⟨x,a⟩|x∈S}β‰₯sup{⟨x,a⟩|x∈T}.
  12. Since riC∈H++, there exists x∈C, such that ⟨x,a⟩>0.

  13. Thus, sup{⟨x,a⟩|x∈C}>0.

  14. But then,

    sup{⟨x,a⟩|x∈C}=sup{⟨x,a⟩|x∈S}βˆ’inf{⟨x,a⟩|x∈T}>0.
  15. Thus,

    sup{⟨x,a⟩|x∈S}>inf{⟨x,a⟩|x∈T}.
  16. Then, by Theorem 8.161, S and T are properly separated.

Now, for the converse, assume that S and T are properly separated. Thus, there exists a vector a∈V such that

inf{⟨x,a⟩|x∈S}β‰₯sup{⟨x,a⟩|x∈T}

and

sup{⟨x,a⟩|x∈S}>inf{⟨x,a⟩|x∈T}.
  1. From the first inequality, we get that inf{⟨x,a⟩|x∈C}β‰₯0.

  2. From the second inequality, we get that sup{⟨x,a⟩|x∈C}>0.

  3. Thus, there exists a hyperplane H given by H={x|⟨x,a⟩=0} such that the corresponding nonnegative closed halfspace H+={x|⟨x,a⟩β‰₯0} contains C.

  4. Note that

    intH+=riH+=H++={x|⟨x,a⟩>0}.
  5. Since sup{⟨x,a⟩|x∈C}>0, hence H++∩Cβ‰ βˆ….

  6. CβŠ†clH++=H+ but C⊈clH++βˆ–riH++=H+βˆ–H++=H.

  7. Thus, by Theorem 8.156, riCβŠ†riH++=H++.

  8. Thus, 0βˆ‰riC.

  9. Thus, riS∩riT=βˆ….

We are done.

Corollary 8.10 (Separating hyperplane theorem II)

Let V be an n-dimensional real vector space. Let S and T be nonempty disjoint convex subsets of V; i.e., S∩T=βˆ….

Then, there exists a hyperplane that properly separates them.

Proof. Since S∩T=βˆ…, hence riS∩riT=βˆ…. Then, applying Theorem 8.162, there exists a hyperplane that properly separates S and T.

This result is stronger than proposition 2.4.2 in [7].

Disjointness of convex sets is not enough for strong separation as their closures might meet.

Theorem 8.163 (Separating hyperplane theorem III)

Let V be an n-dimensional real vector space. Let S be a nonempty convex subset of V. Let a∈V be a vector. There exists a hyperplane H that separates S and a properly if and only if aβˆ‰riS.

In other words, a point can be properly separated from a convex set if and only if it doesn’t belong to its relative interior.

Proof. .

  1. We form a set T={a}.

  2. Then riT={a} (Theorem 8.133).

  3. By Theorem 8.162, S and T are properly separated if and only if

    riS∩riT=βˆ….
  4. In other words,

    riS∩{a}=βˆ….
  5. In other words, aβˆ‰riS.

8.12.5. Strong Separation CharacterizationΒΆ

Theorem 8.164 (Characterization of strong separation)

Let V be an n-dimensional real vector space. Let S and T be nonempty convex subsets of V. There exists a hyperplane H that separates S and T strongly if and only if

inf{β€–xβˆ’yβ€–|x∈S,y∈T}>0.

In other words, H strongly separates S and T if and only if 0βˆ‰cl(Sβˆ’T).

Proof. Let C=Sβˆ’T. Then

inf{β€–xβˆ’yβ€–|x∈S,y∈T}=inf{β€–vβ€–|v∈C}.

By Theorem 4.53, inf{β€–vβ€–|v∈C}=0 if and only if 0∈clC.

Thus, inf{β€–vβ€–|v∈C}>0 if and only if 0βˆ‰clC.

Assume that S and T are strongly separated. Let H be the hyperplane that separates S and T. Let H++ be the positive open halfspace. Let Hβˆ’βˆ’ be the negative open halfspace.

  1. Then, there exists an r>0 such that S+rBβŠ†H++ and T+rBβŠ†Hβˆ’βˆ’.

  2. Since H++∩Hβˆ’βˆ’=βˆ…, hence (S+rB)∩(T+rB)=βˆ….

  3. Then, for any x∈S and y∈T and every u,v∈B,

    β€–x+ruβˆ’(y+rv)β€–>0

    as x+ru=y+rv would mean that (S+rB)∩(T+rB)β‰ βˆ…

  4. Note that, β€–x+ruβˆ’(y+rv)β€–=β€–(xβˆ’y)βˆ’r(vβˆ’u)β€–.

  5. Then, β€–xβˆ’yβ€–β‰₯2r must be true for every x∈S and y∈T.

    1. For contradiction, assume that β€–xβˆ’yβ€–<2r for some x∈S and y∈T.

    2. Let u=12r(yβˆ’x).

    3. Let v=12r(xβˆ’y).

    4. Note that β€–uβ€–<1 and β€–vβ€–<1.

    5. Thus, u,v∈B.

    6. Then, r(vβˆ’u)=xβˆ’y.

    7. Then, β€–(xβˆ’y)βˆ’r(vβˆ’u)β€–=β€–0β€–=0.

    8. This contradicts the condition that β€–x+ruβˆ’(y+rv)β€–>0 for every x∈S, y∈T and u,v∈B.

  6. Thus, inf{β€–xβˆ’yβ€–|x∈S,y∈T}>0.

Conversely, assume that inf{β€–xβˆ’yβ€–|x∈S,y∈T}>0.

  1. Let inf{β€–xβˆ’yβ€–|x∈S,y∈T}=2r where r>0.

  2. Then, β€–xβˆ’yβ€–β‰₯2rβˆ€x∈S,y∈T.

  3. Then, (S+rB)∩(T+rB)=βˆ….

    1. For contradiction, assume that (S+rB)∩(T+rB)β‰ βˆ….

    2. Let a∈(S+rB)∩(T+rB).

    3. Then, there exists x∈S and u∈B such that a=x+ru.

    4. And, there exists y∈T and v∈B such that a=y+rv.

    5. Then, x+ru=y+rv.

    6. Thus, xβˆ’y=r(vβˆ’u).

    7. Thus,

      β€–xβˆ’yβ€–=rβ€–vβˆ’u‖≀r(β€–vβ€–+β€–uβ€–)<r(1+1)=2r.
    8. This contradictions our hypothesis that β€–xβˆ’yβ€–β‰₯2rβˆ€x∈S,y∈T.

  4. Note that S+rB and T+rB are convex and disjoint.

  5. Then, due to Corollary 8.10, there exists a hyperplane H which separates S+rB and T+rB properly.

  6. We can choose H so that S+rBβŠ†H+ and T+rBβŠ†Hβˆ’.

  7. But then, S+r2B∈H++ and T+r2B∈Hβˆ’βˆ’ lie in the opposite open halfspaces.

  8. Thus, S and T are strongly separated.

8.12.6. Strong Separation ConditionsΒΆ

We describe several conditions which are sufficient to achieve strong separation between two sets. See also strict separation theorem (proposition 2.4.3) of [7].

Theorem 8.165 (Strong separation: closed subtraction)

Let V be an n-dimensional real vector space. Let S and T be two disjoint nonempty convex subsets of V. There exists a hyperplane H that separates S and T strongly if Sβˆ’T is closed.

Proof. We proceed as follows.

  1. Since S and T are disjoint hence 0βˆ‰Sβˆ’T.

  2. Since Sβˆ’T is closed, hence 0βˆ‰cl(Sβˆ’T)=Sβˆ’T.

  3. By Theorem 8.164, S and T are strongly separated.

Theorem 8.166 (Strong separation: closed and compact sets)

Let V be an n-dimensional real vector space. Let S and T be two disjoint nonempty convex subsets of V. There exists a hyperplane H that separates S and T strongly if S is closed and T is compact.

Proof. We proceed as follows.

  1. Since S and T are disjoint hence 0βˆ‰Sβˆ’T.

  2. Since Sβˆ’T is closed, hence 0βˆ‰cl(Sβˆ’T)=Sβˆ’T.

  3. By Theorem 8.164, S and T are strongly separated.

8.12.6.1. Recession ConesΒΆ

The conditions below are based on the theory of recession cones and lineality spaces of convex sets. See Recession Cones for a background.

Theorem 8.167 (Strong separation: recession cones)

Let V be an n-dimensional real vector space. Let S and T be two disjoint nonempty closed and convex subsets of V. There exists a hyperplane H that separates S and T strongly if

RS∩RT=LS∩LT

where RX denotes the recession cone and LX denotes the lineality space of a set X.

Proof. Due to Corollary 8.18, Sβˆ’T is closed. Then due to Theorem 8.165, S and T are strongly separated.

8.12.6.2. Strongly Separating HyperplaneΒΆ

Recall that the (orthogonal) projection of a vector v on a convex set C is the vector x∈C which is nearest to v under the (Euclidean) norm. In particular, if v∈C then v is its own projection on C. The projection of the vector 0 on a convex set C is this the vector of C with the minimum norm. See Projection on Convex Sets for details.

Theorem 8.168 (Strongly separating hyperplane)

Let V be an n-dimensional real vector space. Let S and T be two disjoint nonempty and convex subsets of V. Assume that Sβˆ’T is closed.

Consider the vector of minimum norm (projection of the origin) in Sβˆ’T given by v=sβˆ’t where s∈S and t∈T.

  1. Let a=12v=12(sβˆ’t).

  2. Let b=12(s+t).

  3. Let c=⟨b,a⟩.

Then the hyperplane H given by

H={x∈V|⟨x,a⟩=c}

strongly separates S and T. In other words,

⟨x1,a⟩>c>⟨x2,aβŸ©βˆ€x1∈S,x2∈T.

Proof. .

  1. Since S and T are disjoint hence sβˆ’tβ‰ 0.

  2. Hence a≠0.

  3. s is the nearest point in clS from clT.

  4. t is the nearest point in clT from clS.

  5. The line segment [s,t] connects these nearest points.

  6. b lies on this line segment.

  7. Accordingly, s is projection of b in clS.

  8. Similarly t is projection of b in clT.

  9. Then, due to Theorem 9.7 (orthogonal projection characterization), for every x∈S

    ⟨xβˆ’s,bβˆ’sβŸ©β‰€0.
  10. But

    bβˆ’s=12(s+t)βˆ’s=12(tβˆ’s)=βˆ’a.
  11. Hence ⟨xβˆ’s,a⟩β‰₯0.

  12. Hence ⟨x,a⟩β‰₯⟨s,a⟩.

  13. Further

    ⟨s,a⟩=⟨sβˆ’b+b,a⟩=⟨a+b,a⟩=β€–aβ€–22+⟨b,a⟩=β€–aβ€–22+c>c.

    since a≠0.

  14. Hence for every x∈S, we have ⟨x,a⟩>c.

  15. A similar argument shows that for every x∈T, we have ⟨x,a⟩<c.

8.12.7. Closed Convex SetsΒΆ

Theorem 8.169 (Strict separation theorem)

Let V be a real n-dimensional vector space. Let CβŠ†V be a nonempty closed convex set. Let yβˆ‰C. Then, there exists p∈Vβˆ— and α∈R such that

⟨y,p⟩>Ξ± and βŸ¨x,pβŸ©β‰€Ξ±βˆ€x∈C.

In other words, there exists a separating hyperplane such that C is contained in one of its (closed) halfspaces and y is not in that halfspace (i.e., it is in the opposite open halfspace).

By choosing any β∈(α,⟨y,p⟩), we also have

⟨y,p⟩>Ξ² and βŸ¨x,p⟩<Ξ²βˆ€x∈C.

Proof. This is an application of strong separation.

  1. Define a set D={y}.

  2. Since C is closed and yβˆ‰C, hence y is not a closure point of C.

  3. Thus, there exists r>0 such that B(y,r)∩C=βˆ….

  4. Thus, d(C,D)β‰₯r.

  5. Then, by Theorem 8.164, C and D are strongly separated.

  6. Let H be a hyperplane which strongly separates them.

  7. Then one of the closed halfspaces of H contains C but not y.

  8. Let H be described by

    H={x∈V|⟨x,p⟩=α}.
  9. We can negate p,Ξ± if necessary so that

    CβŠ†Hβˆ’={x∈V|⟨x,pβŸ©β‰€Ξ±}.
  10. Accordingly, ⟨y,p⟩>α.

Theorem 8.170 (Closed Convex = Intersection of Halfspaces)

Let V be a real n-dimensional vector space. A closed convex set C of V is the intersection of all the closed halfspaces that contain it.

Proof. The set V is closed and convex but no halfspace contains it. So the statement is vacuously true.

The empty set is closed and convex and every halfspace contains it. The intersection of all halfspaces
is the empty set. Thus, the statement is vacuously true.

Let us assume that C is nonempty and not equal to V.

  1. Let a∈V such that aβˆ‰C.

  2. Since C is closed, hence a is not a closure point of C.

  3. Thus, there exists r>0 such that B(a,r)∩C=βˆ….

  4. Thus, β€–aβˆ’xβ€–>0βˆ€x∈C. Specifically, inf{β€–aβˆ’xβ€–|x∈C}β‰₯r.

  5. Thus, by Theorem 8.164, {a} and C are strongly separated by some hyperplane H.

  6. Thus, one of the corresponding closed halfspaces contains C but not a.

  7. In other words, for every aβˆ‰C, there exists a closed halfspace that doesn’t contain a but contains C.

  8. Thus, the intersection of all halfspaces containing C cannot contain any element from Vβˆ–C.

    1. If the intersection contained an element aβˆ‰C, then there would be a closed halfspace containing C but not a which would mean that the intersection of all halfspaces containing C cannot contain a.

  9. Hence, the intersection of all closed halfspaces containing C is exactly equal to C.

We now have two different characterizations of closed convex sets.

  1. From Theorem 8.122, a closed and convex set is the closure of the union of all the line segments connecting the points of the set.

  2. From Theorem 8.170, a closed and convex set is the intersection of all the closed half-spaces containing it.

8.12.8. Supporting HyperplanesΒΆ

Definition 8.62 (Supporting hyperplane and halfspaces)

Let V be a real n-dimensional vector space. Let SβŠ†V. Let x0∈bdS be a point on its boundary.

If there exists a nonzero vector a∈V such that ⟨x,aβŸ©β‰€βŸ¨x0,a⟩ for every x∈S, then the hyperplane H given by

H={x|⟨x,a⟩=⟨x0,a⟩}

is called a supporting hyperplane of S at x0.

H separates {x0} and S and H contains x0. The halfspace {x|⟨(xβˆ’x0),aβŸ©β‰€0} corresponding to H is called a supporting halfspace of S at x0.

Convex sets have this beautiful property that there exists a supporting hyperplane at every point in the boundary of the set.

Theorem 8.171 (Supporting hyperplane theorem)

Let V be a real n-dimensional vector space. Let C be a nonempty convex subset of V. Let x∈bdC be any point in the boundary of C. Then, there exists a supporting hyperplane to C at x.

Proof. Consider the case where intC=βˆ….

  1. Then, due to Theorem 8.125, dimaffC<n.

  2. Thus, there exists a hyperplane H such that affCβŠ†H.

  3. Since V is finite dimensional, hence H is closed (Theorem 4.202).

  4. Thus, clCβŠ†H.

  5. Thus, x∈H.

  6. This H trivially separates {x} and C as both are contained in H.

Now, assume that intCβ‰ βˆ….

  1. dimaffC=n.

  2. riC=intC.

  3. Since x∈bdC, hence the sets {x} and intC are disjoint.

  4. ri{x}={x} (Theorem 8.133).

  5. We have ri{x}∩riC=βˆ….

  6. By Theorem 8.162, there exists a hyperplane H that separates {x} and C properly.

  7. Consequently C lies entirely in one of the closed halfspaces of H.

Corollary 8.11

Let V be a real n-dimensional vector space. A closed convex set C of V is the intersection of all the supporting halfspaces that contain it.