8.4. Cones¶

Main references for this section are [5, 10, 31].

Throughout this section, V is a real vector space. Some material is specific to Rn. Rest of the material is applicable for any real vector space. Wherever necessary, V is endowed with a norm ‖⋅‖:V→R or a real inner product ⟹⋅,⋅⟩:V×V→R. It is also equipped with a metric d(x,y)=‖x−y‖. The norm in general is not necessarily induced by the inner product (if the vector space is indeed endowed with an inner product).

We start with the basic definition of a cone.

8.4.1. Cones¶

Definition 8.22 (Cone)

A set C is called a cone or nonnegative homogeneous, if for every x∈C and t≄0, we have tx∈C.

In other words, a set is a cone if it is closed under nonnegative scalar multiplication.

  • By definition we have 0∈C.

  • Some authors ([31]) prefer to restrict the definition to t>0 thus the origin is not included in the cone by default.

  • In our definition, a cone always includes the origin.

  • Thus, a cone is always nonempty.

  • When we think of cones, ice-cream cones naturally come to mind. Hence, we kind of think that cones happen to be pointed (at origin) and emanate in one general direction from there. However, our definition above is more general. It allows for lines (passing through origin) and linear subspaces to be cones too.

  • Pointed cones are a special class of cones which are discussed below. An ice-cream cone is a pointed cone.

8.4.2. Properties of Cones¶

8.4.2.1. Intersection¶

Property 8.3 (Intersection of cones)

Let {Ci|i∈I} be a collection of cones. Then their intersection ⋂i∈ICi is a cone.

Proof. Let x∈⋂i∈ICi and let t≄0.

  1. x∈Ci for every i∈I.

  2. Hence tx∈Ci for every i∈I since every Ci is a cone.

  3. Hence tx∈⋂i∈ICi.

  4. Hence ⋂i∈ICi is a cone.

8.4.2.2. Cartesian Product¶

Property 8.4 (Cartesian product of cones)

Let V and W be real vector spaces. Let C1⊆V and C2⊆W be cones. Then their Cartesian product C1×C2 is a cone.

Proof. Let x∈C1×C2 and t≄0.

  1. Then there exist y∈C1 and z∈C2 such that x=(y,z).

  2. Then ty∈C1 and tz∈C2 since both are cones.

  3. Hence (ty,tz)∈C1×C2.

  4. But (ty,tz)=t(y,z)=tx.

  5. Hence tx∈C1×C2.

  6. Hence C1×C2 is a cone.

8.4.2.3. Set Addition¶

Property 8.5 (Set addition of cones)

Let C1,C2⊆V be cones. Then their vector sum C1+C2 is a cone.

Proof. Let x∈C1+C2 and t≄0.

  1. Then there exist x1∈C1 and x2∈C2 such that x=x1+x2.

  2. Then tx1∈C1 and tx2∈C2 since they are cones.

  3. Hence tx=t(x1+x2)=tx1+tx2∈C1+C2.

  4. Hence C1+C2 is a cone.

8.4.2.4. Closure¶

Property 8.6 (Closure)

Let C⊆V be a cone. Then its closure clC is a cone.

Proof. Let x∈clC and t≄0.

  1. Then there exists a sequence {xk} of C such that xk→x.

  2. Then txk∈C for every k since C is a cone.

  3. Consider the sequence {txk} of C.

  4. We have limk→∞txk=tx.

  5. Hence tx∈clC.

  6. Hence clC is a cone.

8.4.2.5. Linear Transformation¶

Property 8.7 (Linear transformation )

The image and inverse image of a cone under a linear transformation is a cone.

Proof. Let V and W be real vector spaces and T:V→W be a linear transformation.

Image of a cone is a cone.

  1. Let C∈V.

  2. Let D=T(C).

  3. Let v∈D and t≄0.

  4. Then there exists u∈C such that v=T(u).

  5. Since C is a cone, hence tu∈C.

  6. Then T(tu)=tT(u)=tv∈D.

  7. Hence D is a cone.

Inverse image of a cone is a cone.

  1. Let D∈W.

  2. Let C=T−1(D).

  3. Let x∈C and t≄0.

  4. Then y=T(x)∈D.

  5. Also T(tx)=tT(x)=ty∈D since D is a cone.

  6. Hence tx∈C.

  7. Hence C is a cone.

8.4.3. Convex Cones¶

Definition 8.23 (Convex cone)

A set C is called a convex cone if it is convex and a cone.

Example 8.11 (Convex cones)

  • A ray with its base at origin is a convex cone.

  • A line passing through origin is a convex cone.

  • A plane passing through origin is a convex cone.

  • Any subspace is a convex cone.

Theorem 8.45 (Subspace as convex cone)

A linear subspace is a convex cone.

Proof. Let V⊆V be a subspace. We know that V is convex since V contains all its linear combinations and every convex combination is a linear combination. Now, let v∈V. Then, tv∈V for every t≄0 since V is closed under scalar multiplication. Thus, V is a cone too. Thus, V is a convex cone.

Theorem 8.46 (Half spaces as convex cone)

Let V be a real vector space. Let a∈V∗. Consider the set

H={x∈V|⟹x,a⟩≀0}.

Then H is a convex cone.

Proof. Half space as a cone

  1. Let x∈H and t≄0.

  2. Then ⟹tx,a⟩=t⟹x,a⟩≀0.

  3. Hence tx∈H.

Half space as convex

  1. Let x,y∈H and t∈[0,1].

  2. Then

⟹tx+(1−t)y,a⟩=t⟹x,a⟩+(1−t)⟹y,a⟩≀0.
  1. Hence H is convex.

8.4.3.1. Characterization¶

Theorem 8.47 (Convex cone characterization)

A set is a convex cone if and only if it is closed under addition and nonnegative scalar multiplication.

In other words, C is a convex cone if and only if for every x1,x2∈C and t1,t2≄0, the following holds true:

t1x1+t2x2∈C.

Proof. Let C⊆V. Let x1,x2∈C.

If C is a convex cone, then:

  1. x=12x1+12x2∈C.

  2. But then, 2x=x1+x2∈C since C is a cone.

  3. Thus, C is closed under addition.

  4. C being a cone, it is closed under nonnegative scalar multiplication.

  5. Combining, we see that t1x1+t2x2∈C.

Now, assume that C is closed under addition and nonnegative scalar multiplication.

  1. C is a cone since it is closed under nonnegative scalar multiplication.

  2. In particular tx1∈C for all t∈[0,1] and (1−t)x2∈C for all t∈[0,1].

  3. Since C is closed under addition, hence tx1+(1−t)x2∈C for all t∈[0,1].

  4. Thus, C is convex too.

Theorem 8.48 (Set addition characterization)

A cone C is convex if and only if C+C⊆C.

Proof. Assume that a cone C satisfies C+C⊆C.

  1. Let x,y∈C and t∈[0,1].

  2. Since C is a cone, hence tx and (1−t)y belong to C.

  3. Hence tx+(1−t)y∈C+C⊆C.

  4. Hence C is convex.

For converse, assume that C is a convex cone.

  1. Let x,y∈C.

  2. Since C is a cone, hence 2x,2y∈C.

  3. Since C is convex, hence

    122x+122y=x+y∈C.
  4. Hence C+C⊆C.

8.4.3.2. Intersection of Convex Cones¶

Theorem 8.49 (Intersection of arbitrary collection of convex cones)

Let {Ai}i∈I be a family of sets such that Ai is a convex cone for all i∈I. Then ∩i∈IAi is a convex cone.

Proof. Let x1,x2 be any two arbitrary elements in A=∩i∈IAi.

x1,x2∈AâŸčx1,x2∈Ai∀i∈IâŸčt1x1+t2x2∈Ai∀t1,t2≄0∀i∈I since Ai is a convex coneâŸčt1x1+t2x2∈A.

Hence A is a convex cone.

As a consequence, an arbitrary intersection of half-spaces (at origin) and hyperplanes (at origin) is a convex cone. Thus, the solution set of a system of linear equations and inequalities is a convex cone if the equations and inequalities are homogeneous.

8.4.3.3. Containing and Contained Subspaces¶

Theorem 8.50 (Containing and contained subspaces)

Let C be a convex cone. Then, there is a smallest linear subspace containing C given by:

U≜C−C={x−y|x,y∈C}=affC.

And, there is a largest linear subspace contained in C given by

L≜(−C)∩C.

Proof. We show that U=C−C is a subspace containing C.

Since 0∈C, hence 0=0−0∈U. Also, this means that C−0=C⊆U.

Let u,v∈U.

  1. u=x−y for some x,y∈C.

  2. v=z−w for some z,w∈C.

  3. u+v=(x+z)−(y+w).

  4. Since C is a convex cone, it is closed under addition.

  5. Hence, x+z,y+w∈C.

  6. Hence, u+v∈U (as it is a difference of two vectors in C).

  7. Thus, U is closed under vector addition.

Let t∈R, u∈U.

  1. u=x−y for some x,y∈C.

  2. tu=tx−ty.

  3. If t≄0, then tx,ty∈C, thus tu∈U.

  4. If t<0, then −tx,−ty∈C. We can write tu=(−ty)−(−tx). Thus, tu∈U.

  5. Thus, U is closed under scalar multiplication.

Next, we show that it is the smallest subspace containing C.

  1. Let V be a subspace that contains C

  2. Then V contains −C also.

  3. Then, V contains C+(−C)=C−C=U also.

  4. Thus, U is the smallest subspace containing C.

In summary, C is closed under addition and nonnegative scalar multiplication. C contains 0. To be a subspace, a set must be closed under multiplication by −1 too. C−C is the smallest such subspace.

Since C contains 0, hence its affine hull contains 0 too. Thus, its affine hull must be a linear subspace. The affine hull then is the smallest subspace containing C. Thus,

affC=C−C.

We show that L=(−C)∩C is a subspace contained in C.

[zero vector]

  1. Since 0∈C and 0∈−C, hence 0∈L.

[Scalar multiplication]

  1. Let a nonzero x∈L. Then, x∈C and x∈−C.

  2. x∈−CâŸč−x∈C.

  3. x∈CâŸč−x∈−C.

  4. This means that −x∈C and −x∈−C too.

  5. Thus, −x∈L.

  6. In summary x∈L implies x,−x∈C and x,−x∈C.

  7. Then, for any t≄0

    1. x∈CâŸčtx∈C.

    2. −x∈CâŸč−tx∈C.

    3. −tx∈CâŸčtx∈−C.

    4. Thus, tx∈L.

  8. And, for any t<0

    1. x∈CâŸč−tx∈C.

    2. −x∈CâŸčtx∈C.

    3. −tx∈CâŸčtx∈−C.

    4. Thus, tx∈L.

  9. Thus, L is closed under scalar multiplication.

[Vector addition]

  1. Let u,v∈L.

  2. Then, u,v∈C and u,v∈−C.

  3. Thus, −u,−v∈C.

  4. Since C is closed under addition, hence u+v∈C and −u−v∈C.

  5. But then, −u−v∈−C and u+v∈−C.

  6. Thus, u+v∈L. −u−v∈L too.

  7. Thus, L is closed under addition.

Thus, L is a vector space contained in C. L is the largest such subspace since any subspace contained in C should be contained in −C too.

8.4.4. Conic Combinations¶

Definition 8.24 (Conic combination)

A point of the form t1x1+⋯+tkxk with t1,
,tk≄0 is called a conic combination (or a non-negative linear combination) of x1,
,xk.

  • A convex cone is closed under non-negative linear/conic combinations.

  • One way to prove that a set is a convex cone is to show that it contains all its conic combinations.

Theorem 8.51 (Convex cone characterization with conic combinations)

Let C be a convex cone. Then for every x1,
,xk∈C, every conic combination t1x1+⋯+tkxk with ti≄0 belongs to C.

Conversely, if a set C contains all conic combinations of its points, then it is a convex cone.

In other words, C is a convex cone if and only if it is closed under conic combinations.

Proof. Assume that C is a convex cone. Then it is closed under addition and nonnegative scalar multiplication.

  1. Let x1,
,xk∈C.

  2. Then, t1x1,
,tkxk∈C for all t1,
,tk≄0 since C is closed under nonnegative scalar multiplication.

  3. Then, t1x1+⋯+tkxk∈C since C is closed under addition.

  4. Thus, C contains all its conic combinations.

For the converse, assume that C contains all its conic combinations.

  1. Let x∈C.

  2. Then, tx∈C for all t≄0 since tx is a conic combination.

  3. Thus, C is a cone.

  4. Now, let x,y∈C and t∈[0,1]. Then, 1−t∈[0,1] too.

  5. Thus, tx+(1−t)y is a conic combination of x,y.

  6. Hence, tx+(1−t)y∈C.

  7. Thus, C is convex.

  8. Combining, C is a convex cone.

Here is another proof that a linear subspace is a convex cone using the idea of conic combinations.

  1. Every subspace contains the 0 vector.

  2. Every conic combination is also a linear combination.

  3. A subspace is closed under linear combinations.

  4. Hence, it is also closed under conic combinations.

  5. Hence, it is a convex cone.

Remark 8.5 (Conic combinations and nonnegative orthant)

We recall that the nonnegative orthant of Rk is given by

R+k={t∈Rk|tâȘ°0}={t∈Rk|t1,
,tk≄0}.

Thus, the coefficients for conic combinations of k points are drawn from R+k.

Theorem 8.52

A conic combination of conic combinations is a conic combination.

Proof. Let S⊆V. Note that S is arbitrary (no convexity or conic structure assumed).

  1. Consider n points yi, i=1,
,n described as below.

  2. Let yi=∑j=1mjti,jxi,j be conic combinations of mj points:

    • xi,1,
,xi,mj∈S.

    • ti,j≄0.

  3. Consider the conic combination y=∑i=1nriyi. with ri≄0.

  4. We need to show that y is a conic combination of points of S.

Towards this:

y=∑i=1nriyi=∑i=1nri∑j=1mjti,jxi,j=∑i=1n∑j=1mjriti,jxi,j.

Consider the terms:

si,j=riti,j.

Since ri≄0 and ti,j≄0, hence si,j≄0. Hence,

y=∑i,jsi,jxi,j

is a conic combination of points of S.

The idea of conic combinations can be generalized to infinite sums and integrals.

8.4.5. Conic Hulls¶

Definition 8.25 (Conic hull)

The conic hull of a set S is the set of all conic combinations of points in S. i.e.

cone⁡(S)≜{t1x1+
tkxk|xi∈S,t∈R+k,i=1,
,k,k∈N}.

Property 8.8

A conic hull is a convex cone.

Proof. Let C be the conic hull of a set S.

  1. Let x,y∈C.

  2. Then, x,y are conic combinations of S.

  3. Let z=t1x+t2y with t1,t2≄0.

  4. Then, z is a conic combination of conic combinations of S.

  5. By Theorem 8.52, z is a conic combination of S.

  6. Since C contains all conic combinations of S, hence C contains z.

  7. Thus, for any x,y∈C, z=t1x+t2y with t1,t2≄0 is in C.

  8. Thus, C is a convex cone.

Property 8.9

Conic hull of a set is the smallest convex cone that contains the set.

Proof. Let S be an arbitrary set and C be its conic hull.

  1. We have already shown that C is a convex cone.

  2. Assume D to be a convex cone such that S⊆D.

  3. Then, D contains every conic combination of S since a convex cone is closed under conic combinations.

  4. Thus, C⊆D since C is the set of all conic combinations of S.

Theorem 8.53 (Conic hull of a convex set)

Let C be a convex set. Let K be defined as:

K≜{tx|t≄0,x∈C}.

Then, K is the conic hull of C.

Proof. Let H be the conic hull of C; i.e., H is the set of all conic combinations of C. We show that K⊆H and H⊆K.

K⊆H

  1. For any x∈C and t≄0, tx is a conic combination of C.

  2. Hence tx∈H.

  3. Thus, K⊆H.

H⊆K

  1. Let x=t1x1+⋯+tkxk be a conic combination of C.

  2. Thus, ti≄0 and xi∈C.

  3. By definition of K, 0∈K.

  4. If ti=0 for 0≀i≀k, then x=0. So x∈K.

  5. Now consider the case where at least one ti>0.

  6. Let t=∑ti. Clearly, t>0.

  7. Consider z=t1tx1+⋯+tktxk.

  8. Note that z is a convex combination of x1,
,bxk∈C.

  9. Since C is convex, hence z∈C.

  10. Then, bx=tz∈S since t>0 and z∈C.

  11. Thus, K contains all conic combinations of C.

  12. Thus, H⊆K.

Property 8.10 (Conic hull of a convex hull)

Let S⊆V be a nonempty set. Let C=convS. Then

coneS=coneC.

In other words, the conic hull of the convex hull of a set is same as the conic hull of the set.

Proof. Since S⊆C, hence coneS⊆coneC. We now prove the converse.

  1. Let x∈coneC.

  2. Then x is a nonnegative combination of some vectors in C. There exist a positive integer p, p vectors x1,
,xp∈C and p scalars t1,
,tp such that

    x=∑i=1ptixi.
  3. Each xi is a convex combination of some vectors in S.

  4. Hence x is a nonnegative combination of some vectors in S.

  5. Hence x∈coneS.

  6. Hence coneC⊆coneS.

8.4.5.1. Unique Conic Representations¶

Recall from Carathéodory theorem that in an n dimensional vector space, every point in the convex hull of a set can be represented as a convex combination of n+1 points belonging to the set.

Similar representation is possible in conic hulls too.

Theorem 8.54 (Conic representation theorem)

Let V be an n-dimensional real vector space. Let S⊆V. Let x∈cone⁡(S). Then, there exist k linearly independent vectors x1,
,xk∈S such that x∈cone⁡({x1,
,xk}); i.e., there exists t∈R+k such that

x=∑i=1ktixi.

Since the k vectors are linearly independent, hence k≀n.

Proof. The proof is similar to Carathéodory theorem.

  1. Let x∈cone⁥(S).

  2. Then, there exist m points x1,
,xm∈S and t∈R+m such that

    x=∑i=1mtixi.
  3. We can assume that ti>0 for every i∈1,
,m. Otherwise, we can simply drop the corresponding points from the conic combination.

  4. If x1,
,xm are linearly independent, then k=m, m≀n and we are done.

  5. Let us consider the case when they are linearly dependent.

  6. Then, there exists a nontrivial linear combination equally zero vector:

    r1x1+⋯+rmxm=0.
  7. Then, for any α∈R

    x=∑i=1mtixi+α∑i=1mrixi=∑i=1m(ti+αri)xi.
  8. This representation is a conic combination if ti+αri≄0 for every i=1,
,m.

  9. Since ti>0 for every i, hence, this set of inequalities is satisfied for all α∈A where A is a closed interval with a nonempty interior.

    1. Let Ai be the solution set for i-th inequality.

    2. We have A=⋂iAi.

    3. α=0 satisfies every inequality. Thus, 0∈Ai. Thus, A≠∅.

    4. If ri=0, then Ai=R.

    5. If ri>0, then Ai=[−tiri,∞).

    6. If ri<0, then Ai=(−∞,tiri].

    7. Since not all ri=0, hence there are several Ai with finite endpoints (either left or right).

    8. Thus, there are three possibilities for A: [a,b], or [a,∞) or (−∞,b].

    9. Both a and b correspond to an endpoint of one of the Ai.

  10. If we pick α as one of the endpoints of A, then, ti+αri≄0 for every i and tj+αrj=0 for some j∈1,
,m.

  11. Thus, we obtain a conic representation of x of at most m−1 vectors.

  12. This process can be carried out repeatedly until we obtain a conic representation of x of k linearly independent vectors.

  13. Since the k vectors x1,
,xk so obtained are linearly independent, hence k≀n.

8.4.6. Pointed Cones¶

Definition 8.26 (Pointed cone)

A cone C⊂V is called pointed if x∈C and −x∈C implies x=0.

In other words, a pointed cone, doesn’t contain a line.

Example 8.12 (The nonnegative orthant is a pointed convex cone)

Recall from Definition 8.14 that the nonnegative orthant is defined as:

R+n={x∈Rn|xi≄0,∀1≀i≀n}.

In other words, for x∈R+n, every component is non-negative.

Let x,y∈R+n. Let α,ÎČ≄0 and consider their conic combination

z=αx+ÎČy.

It is obvious that all components of z are also nonnegative. Hence z∈R+n. Thus, R+n is closed under conic combinations. Hence, R+n is a convex cone.

Finally, R+n is pointed as x∈R+n and −x∈R+n both hold true only if x=0.

8.4.7. Proper Cones¶

Definition 8.27 (Proper cone)

A cone K∈V is called a proper cone if it satisfies the following:

  • K is convex.

  • K is closed.

  • K is solid; i.e., it has a nonempty interior.

  • K is pointed.

Example 8.13 (Non-empty interior)

Consider the following sets in R2:

C1={(x1,x2)|x1≄0,x2=0}
C2={(x1,x2)|x1,x2≄0}

Both are closed convex cones. C1 doesn’t have an interior. All points in C1 are on the boundary of C1.

C2 has a non-empty interior; e.g., the point (1,1)∈C2 is not on the boundary.

8.4.8. Norm Cones¶

Definition 8.28 (Norm cone)

Let ‖⋅‖:V→R be any norm on V. The norm cone associated with the norm ‖⋅‖ is given by the set

C≜{(x,t)|‖x‖≀t}

C lies in the product space V×R.

If V=Rn, then a norm cone belongs to Rn+1.

Theorem 8.55

A norm cone is convex. Moreover, it is a convex cone.

Example 8.14 (Second order cone)

The second order cone is the norm cone for the Euclidean norm in the Euclidean space Rn, i.e.

C={(x,t)|‖x‖2≀t}.

From definition, C⊆Rn+1.

This can be rewritten as

C={[xt]|[xt]T[I00−1][xt]≀0,t≄0}

8.4.9. Barrier Cones¶

Definition 8.29 (Barrier vector)

Let C be a convex set of V. A vector v∈V∗ is called a barrier vector to C if for some ÎČ∈R,

⟹x,v⟩≀ÎČ∀x∈C.

In other words, the set of inner products of points in C with v is bounded from above.

Definition 8.30 (Barrier cone)

The set of all barrier vectors to a convex set C is called its barrier cone.

Theorem 8.56

The barrier cone of a convex set is convex.

Proof. Let C be a convex set and let B be its barrier cone. Let u,v∈B. Let t≄0.

⟹x,0⟩=0≀0∀x∈C.

Thus, 0∈B.

⟹x,u⟩≀αâŸč⟹x,tu⟩≀tα∀x∈C.

Thus, the set of inner products with tu is bounded from above by tα. Thus, tu∈B.

⟹x,u⟩≀α and âŸšx,v⟩≀ÎČâŸč⟹x,u+v⟩≀α+ÎČ∀x∈C.

Thus, the set of inner products with u+v is bounded from above by α+ÎČ. Thus, u+v∈B.

Thus, B is closed under nonnegative scalar multiplication and vector addition. B is a convex cone.

8.4.10. Properties of Convex Cones¶

We consider operations on convex cones which generate convex cones.

Property 8.11 (Closure under set intersection)

If K1 and K2 are convex cones, then K=K1∩K2 is convex cone.

Proof. We show that K is closed under nonnegative scalar multiplication.

  1. Let x∈K and t≄0.

  2. Then, x∈K1 and x∈K2.

  3. Hence, tx∈K1 and tx∈K2 since both are closed under nonnegative scalar multiplication.

  4. Thus, tx∈K.

  5. Hence, K is closed under nonnegative scalar multiplication.

We show that K is closed under vector addition too.

  1. Let x,y∈K.

  2. Then, x,y∈K1 and x,y∈K2.

  3. But then, x+y∈K1 and x+y∈K2 since both are closed under vector addition.

  4. Thus, x+y∈K.

  5. Hence, K is closed under vector addition.

Thus, K is closed under nonnegative scalar multiplication and vector addition. K is a convex cone.

Property 8.12 (Closure under set addition)

If K1 and K2 are convex cones, then K=K1+K2 is convex cone.

Proof. We show that K is closed under nonnegative scalar multiplication.

  1. Let x∈K and t≄0.

  2. Then, x=x1+x2 where x1∈K1, and x2∈K2.

  3. Then, tx1∈K1 and tx2∈K2 since K1 and K2 are cone.

  4. Then, tx=t(x1+x2)=tx1+tx2∈K.

  5. Thus, K is closed under nonnegative scalar multiplication.

We show that K is closed under vector addition too.

  1. Let x,y∈K.

  2. Then, x=x1+x2 with some x1∈K1 and x2∈K2.

  3. And, y=y1+y2 with some y1∈K1 and y2∈K2.

  4. Then, x+y=(x1+y1)+(x2+y2).

  5. Now, x1+y1∈K1 and x2+y2∈K2 since K1 and K2 are closed under addition (they are convex cones).

  6. Thus, x+y∈K.

  7. Thus, K is closed under vector addition.

Thus, K is closed under nonnegative scalar multiplication and vector addition. K is a convex cone.

We mention that by Theorem 8.34, K is convex. Hence, we just needed to show that K is a cone too.

Property 8.13 (Positive scalar multiplication)

Let K be a convex cone. Then

tK=K∀t>0.

Proof. We proceed as follows:

  1. Let t>0.

  2. Let x∈tK.

  3. Then, 1tx∈K.

  4. But then, t1tx=x∈K too since K is closed under nonnegative scalar multiplication.

  5. Thus, tK⊆K.

  6. Similarly, x∈K implies x∈tK.

  7. Thus, K⊆tK.

  8. Hence, K=tK for all t>0.

Note that 0K={0}≠K.

Property 8.14 (Convex hull of the union)

If K1 and K2 are convex cones, then

K1+K2=conv⁥(K1âˆȘK2).

Proof. By Corollary 8.3,

conv⁥(K1âˆȘK2)=⋃t∈[0,1][(1−t)K1+tK2].

Now for t∈(0,1), by Property 8.13, (1−t)K1=K1 and tK2=K2.

Thus, for t∈(0,1):

(1−t)K1+tK2=K1+K2.

For, t=0 we are left with K1 and for t=1, we are left with K2.

Since 0∈K1 and 0∈K2, hence K1⊆K1+K2 and K2⊆K1+K2. Thus,

conv⁥(K1âˆȘK2)=⋃t∈[0,1][(1−t)K1+tK2]=K1+K2.

Property 8.15 (Intersection as union)

If C1 and C2 are convex cones, then

C1∩C2=⋃t∈[0,1](tC1∩(1−t)C2).

Proof. We first show that for every t∈(0,1)

tC1∩(1−t)C2=C1∩C2.
  1. Let x∈C1∩C2.

  2. Then x∈C1 and x∈C2.

  3. Since 0<t<1, hence due to Property 8.13, C1=tC1 and C2=(1−t)C2.

  4. Hence x∈tC1 and x∈(1−t)C2.

  5. Hence x∈tC1∩(1−t)C2.

  6. Hence C1∩C2⊆tC1∩(1−t)C2.

  7. Conversely, let x∈tC1∩(1−t)C2.

  8. Then x∈tC1 and x∈(1−t)C2.

  9. Hence x∈C1 and x∈C2.

  10. Hence x∈C1∩C2.

  11. Hence tC1∩(1−t)C2⊆C1∩C2.

If t=0 or t=1, then

tC1∩(1−t)C2={0}⊆C1∩C2

since every convex cone contains the origin. Together,

C1∩C2=⋃t∈[0,1](tC1∩(1−t)C2).

8.4.11. Cone Generated by a Convex Set¶

The direct sum vector space V⊕R has been described in Definition 8.4.

Observation 8.3 (Convex sets as cross sections of cones)

A convex set C⊆V can be regarded as a cross section of some convex cone K⊆V⊕R.

Let K be conic hull of points (x,1)∈V⊕R such that x∈C. Then,

K={(tx,t)∈V⊕R|x∈C,t≄0}.

Now consider the hyperplane in V⊕R given by:

H={(y,t)∈V⊕R|t=1}.

The intersection of H with K can be regarded as C.

H∩K={(tx,t)∈V⊕R|x∈C,t=1}={(x,1)∈V⊕R|x∈C}.

The projection of H∩K on V is given by C (by dropping the last coordinate).

Remark 8.6

For every convex set C⊆V, there is precisely one convex cone K⊆V⊕R generated by the set {(x,1)∈V⊕R|x∈C} (its conic hull).

These convex cones have only (0,0) in common with the half space {(x,t)∈V⊕R|t≀0}.

We shall call this class of convex cones in V⊕R generated by the convex sets in V as C.

An operation that is closed under the class C corresponds to an operation on the convex sets in V; e.g., if C1 and C2 are convex sets with corresponding cones K1 and K2, then C1∩C2 is another convex set corresponding to a different convex cone K3. It is natural to ask if there is a way to construct K3 from K1 and K2 directly in V⊕R.

Each vector (x,t)∈V⊕R can be split as a direct sum with x∈V and t∈R. Thus, it is possible to define different kinds of partial sums on V⊕R. Recall that partial sums on convex sets preserve the convexity. It turns out that they can do more. We can define partial sums which are closed under the family C of convex cones in V⊕R generated by the convex sets in V.

We can define four types of partial sums:

  1. Addition in V, intersection in R.

  2. Addition in V, addition in R.

  3. Intersection in V, intersection in R.

  4. Intersection in V, addition in R.

Suppose that K1 and K2 are convex cones generated by the convex sets C1 and C2 respectively. Let K be their partial sum. Let us find out what is the corresponding convex set C in V based on the type of partial sum in V⊕R.

[Addition in V, intersection in R.]

  1. In this case, (x,1)∈K if and only if x=x1+x2 for some (x1,1)∈K1 and (x2,1)∈K2.

  2. Thus, the convex set corresponding to K is C=C1+C2.

[Addition in V, addition in R.]

  1. (x,1)∈K if and only if x=x1+x2 and 1=t1+t2 with t1≄0 and t2≄0 for some (x1,t1)∈K1 and (x2,t2)∈K2.

  2. Thus, C is the union of the sets t1C1+t2C2 over t1≄0, t2≄0 and t1+t2=1.

  3. But, this is same as C=conv⁥(C1âˆȘC2) as per Theorem 8.38.

[TODO] Clarify this further. It is not obvious.

[Intersection in V, intersection in R]

  1. (x,1)∈K if and only if (x,1)∈K1 as well as (x,1)∈K2.

  2. Thus, C=C1∩C2.

[Intersection in V, addition in R]

  1. (x,1)∈K if and only if (x,t1)∈K1 and (x,t2)∈K2 for some t1,t2≄0 with t1+t2=1.

  2. In this case, we can write C as:

    C=⋃{t1C1∩t2C2|t1,t2≄0,t1+t2=1}=⋃{(1−t)C1∩tC2|t∈[0,1]}.

[TODO] Clarify this further. It is not obvious.

8.4.12. Positive Semidefinite Cone¶

Theorem 8.57 (The convex cone of positive semidefinite matrices)

The set of positive semidefinite matrices S+n is a convex cone.

Proof. Let A,B∈S+n and Ξ1,Ξ2≄0. We have to show that Ξ1A+Ξ2B∈S+n.

A∈S+nâŸčvTAv≄0∀v∈Rn.
B∈S+nâŸčvTBv≄0∀v∈Rn.

Now

vT(Ξ1A+Ξ2B)v=Ξ1vTAv+Ξ2vTBv≄0∀v∈Rn.

Hence Ξ1A+Ξ2B∈S+n.

8.4.13. Linear System with Nonnegativity Constraints¶

Consider the system

P={x∈Rn|Ax=b,xâȘ°0}

where A∈Rm×n and b∈Rm. Without loss of generality, we shall assume that the rows of A are linearly independent. This is a linear system Ax=b with the nonnegativity constraint xâȘ°0. If we write A in the form of column vectors as

A=[a1
an]

Then, the set Q={Ax|x∈R+n} can be written as

Q=cone{a1,
an}.

In other words, Q is the conic hull of the column vectors of A. We can think of A as a linear mapping of the nonnegative orthant (a convex cone) R+n from Rn to another convex cone in Rn given as a conic hull of the columns of A.

We can now see that P is nonempty if b∈Q.

Definition 8.31 (Basic feasible solution)

Let P={x∈Rn|Ax=b,xâȘ°0} where A∈Rm×n and b∈Rm. Assume that the rows of A are linearly independent. Then, v∈Rn is a basic feasible solution (in short “bfs”) of P if the columns of A corresponding to the positive entries of v are linearly independent.

Consequently, v has at most m has positive entries. All other entries of v are 0.

Theorem 8.58 (Existence of basic feasible solution)

Let P={x∈Rn|Ax=b,xâȘ°0} where A∈Rm×n and b∈Rm. Assume that the rows of A are linearly independent.

If P is nonempty; i.e. P≠∅, then it contains at least one basic feasible solution.

Proof. Recall that

Q={Ax|x∈R+n}=cone{a1,
an}

where a1,
,an are columns of the matrix A.

  1. If P≠∅, then b∈Q.

  2. In other words, b is a conic combination of columns of A.

  3. By the conic representation theorem, there exists a subset of k linearly independent vectors among {a1,
,an} such that b is their conic combination.

  4. In other words, there exist k indices 1≀i1<⋯<ik≀n and k numbers vi1,
,vik>0 such that

    b=∑j=1kvijaij

    and {ai1,
,aik} are linearly independent.

  5. Consequently k≀m since columns of A belong to Rm.

  6. Let

    v=∑j=1kvijeij

    where eij are unit vectors of Rn.

  7. Clearly, vâȘ°0 and Av=b.

  8. Therefore, v∈P and v is a basic feasible solution.

The basic feasible solutions of P are the extreme points of P. Recall that a point is an extreme point if it cannot be expressed as a nontrivial convex combination of two distinct points of a set.

Theorem 8.59 (Equivalence between basic feasible solutions and extreme points)

Let P={x∈Rn|Ax=b,xâȘ°0} where A∈Rm×n and b∈Rm. Assume that the rows of A are linearly independent.

Then v is a basic feasible solution of P if and only if v is an extreme point of P.

Proof. Let v be a basic feasible solution of P.

  1. Then b=Av and v has k positive entries with k≀m.

  2. Without loss of generality, assume that first k entries of v are positive. This can be easily achieved by shuffling the columns of A in the linear system Ax=b.

  3. Therefore, v1,
,vk>0 and vk+1,
,vn=0.

  4. Also, the first k columns a1,
,ak of the matrix A are linearly independent since v is a basic feasible solution.

  5. For contradiction, assume that v is not an extreme point of P; i.e., v∉extP.

  6. Then, there exist y,z∈P with y≠z and t∈(0,1) such that v=ty+(1−t)z.

  7. Since y,z∈P, hence yâȘ°0 and zâȘ°0.

  8. Since the last n−k entries of v are zero, hence the last n−k entries of y and z also must be zero as they have to be nonnegative.

  9. Since y,z∈P, hence Ay=b and Az=b.

  10. Therefore,

    b=∑i=1kyiai=∑i=1kziai.
  11. This implies that

    ∑i=1k(yi−zi)ai=0.
  12. But, a1,
,ak are linearly independent by hypothesis.

  13. Thus, yi=zi for i=1,
,k must hold.

  14. Then, y=z.

  15. We arrive at a contradiction.

  16. Thus, v must be an extreme point of P.

For the converse, assume that v is an extreme point of P.

  1. Again, by contradiction, assume that v is not a basic feasible solution.

  2. Thus, the columns of A corresponding to the positive entries of v are linearly dependent.

  3. Assume that there are k positive entries in v and WLOG, assume that they correspond to first k columns of A.

  4. Then, since the corresponding columns are linearly dependent, hence there exists a nonzero vector t∈Rk such that

    ∑i=1ktiai=0.
  5. We can extend t to Rn by appending n−k zeros such that At=0.

  6. Since the first k entries of v are positive, we can choose a sufficiently small r>0 such that y=v−rtâȘ°0 and z=v+rtâȘ°0.

  7. Note that Ay=Az=b.

  8. Therefore, y,z∈P.

  9. At the same time, it is easy to see that

    v=12y+12z.
  10. Thus, v is a convex combination of two distinct points of P.

  11. This contradicts our hypothesis that v is an extreme point of P.

  12. Thus, v must be a basic feasible solution.