9.12. Quadratic ProgrammingΒΆ

9.12.1. Quadratic FunctionsΒΆ

Definition 9.41 (Quadratic function)

A function f:Rn→R of the form

f(x)=12xTAx+bTx+c

where A∈Sn, b∈Rn and c∈R, is known as a quadratic function.

The matrix A is known as the matrix associated with the quadratic function.

Remark 9.8 (Gradient and Hessian of a quadratic function)

Let

f(x)=12xTAx+bTx+c

be a quadratic function. Then, the gradient is given by:

βˆ‡f(x)=Ax+b.

And, the Hessian is given by:

βˆ‡2f(x)=A.

See Example 5.8 and Example 5.13 for reference.

9.12.1.1. Stationary PointsΒΆ

Theorem 9.86 (Stationary points of quadratic functions)

Let a quadratic function f:Rn→R be given by

f(x)=12xTAx+bTx+c

where A∈Sn, b∈Rn and c∈R.

  1. x∈Rn is a stationary point if and only if Ax=βˆ’b.

  2. If Aβͺ°O, then x is a global minimum point of f if and only if Ax=βˆ’b.

  3. If AsuccO, then x=βˆ’Aβˆ’1b is a strict global minimum point of f.

  4. If AsuccO, then the minimum value of f is cβˆ’12bTAβˆ’1b.

Proof. (1) is a direct implication of the fact that βˆ‡f(x)=0 if and only if Ax+b=0.

(2) We are given that βˆ‡2f(x)=Aβͺ°O.

  1. Thus, βˆ‡2f(x)βͺ°O for every x∈Rn.

  2. By Theorem 7.4, if x is a stationary point of f, then it is a global minimum point.

  3. By first part, x is a stationary point if and only if Ax=βˆ’b.

(3) We are given that AsuccO.

  1. Then, A is invertible.

  2. Hence, x=βˆ’Aβˆ’1b is the unique solution to the equation Ax=βˆ’b.

  3. By parts (1) and (2), it is the unique (hence strict) global minimizer of f.

(4) We know that strict global minimum point of f is given by a=βˆ’Aβˆ’1b with Aa=βˆ’b. Therefore,

f(a)=12aTAa+bTa+c=βˆ’12aTb+bTa+c=c+12bTa=cβˆ’12bTAβˆ’1b.

9.12.1.2. CoercivenessΒΆ

Theorem 9.87 (Coerciveness of quadratic functions)

Let a quadratic function f:Rn→R be given by

f(x)=12xTAx+bTx+c

where A∈Sn, b∈Rn and c∈R.

f is coercive if and only if AsuccO; i.e., A is positive definite.

Proof. Assume that A is positive definite.

  1. Then, all eigenvalues of A are positive.

  2. Let Ξ» be the smallest eigenvalue of A.

  3. Then, xTAxβ‰₯Ξ»β€–xβ€–2 for every x∈Rn.

  4. Thus,

    f(x)β‰₯Ξ»2β€–xβ€–2+bTx+cβ‰₯Ξ»2β€–xβ€–2βˆ’β€–bβ€–β€–xβ€–+c Cauchy Schwartz inequality =Ξ»2β€–xβ€–(β€–xβ€–βˆ’2Ξ»β€–bβ€–)+c.
  5. We can see that f(x)β†’βˆž as β€–xβ€–β†’βˆž.

  6. Thus, f is coercive.

Now, assume that f is coercive.

  1. We need to show that A must be positive definite.

  2. Thus, all eigenvalues of A must be positive.

  3. For contradiction, assume that an eigenvalue of A is negative.

  4. Let Ξ»<0 be such an eigenvalue with the corresponding normalized eigenvector v such that Av=Ξ»v.

  5. Then, for any t∈R,

    f(tv)=t22vTAv+tbTv+c=Ξ»t22+tbTv+c.
  6. Clearly, f(tv)β†’βˆ’βˆž as tβ†’βˆž since Ξ» is negative.

  7. Thus, it contradicts the hypothesis that f is coercive.

  8. We now consider the possibility where there is a 0 eigenvalue.

  9. Then, there exists a normalized eigenvector v such that Av=0.

  10. Then, for any t∈R,

    f(tv)=tbTv+c.
  11. If bTv=0, then f(tv)=c for every t∈R.

  12. If bTv>0, then f(tv)β†’βˆ’βˆž as tβ†’βˆ’βˆž.

  13. If bTv<0, then f(tv)β†’βˆ’βˆž as tβ†’βˆž.

  14. In all the three cases, f(tv) does not go to ∞ as β€–tvβ€–β†’βˆž.

  15. Thus, f is not coercive. A contradiction to the hypothesis.

  16. Hence, the eigenvalues of A must be positive.

  17. Hence, A must be positive definite.

9.12.1.3. Nonnegative QuadraticsΒΆ

It is useful to work with quadratic functions which are nonnegative on the entire Rn.

The basic quadratic form f(x)=12xTAx is nonnegative on entire Rn if A is positive semidefinite.

For the general quadratic function, we need to incorporate the contribution from b and c terms also.

Theorem 9.88 (Nonnegativity of quadratic function)

Let a quadratic function f:Rn→R be given by

f(x)=12xTAx+bTx+c

where A∈Sn, b∈Rn and c∈R.

The following statements are equivalent.

  1. f(x)β‰₯0 for every x∈Rn.

  2. [AbbT2c]βͺ°O; i.e., this n+1Γ—n+1 symmetric matrix is positive semidefinite.

Proof. Assume that (2) is true.

  1. Then, for every x∈Rn

    [x1]T[AbbT2c][x1]β‰₯0

    due to positive semidefiniteness.

  2. But

    [x1]T[AbbT2c][x1]=[xT1][Ax+bbTx+2c]=xTAx+2xTb+2c=2(12xTAx+bTx+c)=2f(x).
  3. Thus, f(x)β‰₯0 for every x∈Rn.

For the converse, assume (1) is true.

  1. We need to show that [AbbT2c] is positive semidefinite.

  2. We shall first show that A is positive semidefinite.

  3. For contradiction, assume that A is not positive semidefinite.

  4. Then, there exists a negative eigenvalue Ξ»<0 and corresponding normalized eigenvector v for A such that Av=Ξ»v.

  5. Then, for any t∈R

    f(tv)=Ξ»t22+tbTv+c.
  6. Then, f(tv)β†’βˆ’βˆž as tβ†’βˆ’βˆž.

  7. This contradicts the hypothesis that f is nonnegative everywhere.

  8. Thus, A must be positive semidefinite.

  9. We now need to show that for any y∈Rn and any t∈R,

    [yt]T[AbbT2c][yt]β‰₯0.
  10. This condition is equivalent to

    12yTAy+tbTy+ct2β‰₯0

    for every y∈Rn and t∈R.

  11. If t=0, then this condition reduces to

    yTAyβ‰₯0βˆ€y∈Rn.
  12. This is valid for every y∈Rn since A is p.s.d.. as established earlier.

  13. For t≠0, we have

    t2f(yt)=t2(12t2yTAy+1tbTy+c)=12yTAy+tbTy+ct2.
  14. By hypothesis, t2f(yt)β‰₯0 for every y∈Rn and tβ‰ 0.

  15. Thus, 12yTAy+tbTy+ct2β‰₯0 for every y∈Rn and t∈R.

  16. Thus, [AbbT2c] is indeed p.s.d..

9.12.2. Quadratic Optimization ProblemsΒΆ

We consider the following possibilities:

  1. The objective function is a quadratic function.

  2. Both the objective function as well as the inequality constraints function are quadratic.

9.12.2.1. Quadratic ProgramΒΆ

Definition 9.42 (Quadratic program)

A convex optimization problem is known as a quadratic program (QP) if the objective function is a (convex) quadratic and the constraint functions are affine.

A general quadratic program has the following form:

(9.30)ΒΆminimize 12xTPx+qTx+rsubject to Gxβͺ―hAx=b

where

  1. x∈Rn is the optimization variable.

  2. P∈S+n is a symmetric positive semidefinite matrix.

  3. q∈Rn and r∈R.

  4. f(x)=12xTPx+qTx+r is a (convex) quadratic objective function.

  5. G∈RmΓ—n and h∈Rm describe the m affine inequality constraints.

  6. A∈RpΓ—n and b∈Rp describe the p affine equality constraints.

9.12.2.2. Quadratically Constrained Quadratic ProgramΒΆ

Definition 9.43 (Quadratically constrained quadratic program)

A convex optimization problem is known as a quadratically constrained quadratic program (QCQP) if the objective function and the inequality constraint functions are (convex) quadratic while the equality constraint functions are affine.

A general quadratic program has the following form:

(9.31)ΒΆminimize 12xTP0x+q0Tx+r0subject to 12xTPix+qiTx+ri≀0i=1,…,mAx=b

where

  1. x∈Rn is the optimization variable.

  2. Pi∈S+n are symmetric positive semidefinite matrices for i=0,…,m.

  3. qi∈Rn and ri∈R for i=0,…,m.

  4. f0(x)=12xTP0x+q0Tx+r0 is a (convex) quadratic objective function.

  5. fi(x)=12xTPix+qiTx+ri are (convex) quadratic inequality constraint functions for i=1,…,m.

  6. A∈RpΓ—n and b∈Rp describe the p affine equality constraints.