4.15. Affine Sets and TransformationsΒΆ

Primary references for this section are [7, 10, 31].

In this section V denotes a vector space on some field F which can be either R (real numbers) or C (complex numbers). Much of the section will not require any other structure on the vector space.

Some results in this section are applicable for normed linear spaces or inner product spaces. We shall assume that V is endowed with an appropriate norm β€–β‹…β€–:Vβ†’R or an inner product βŸ¨β‹…,β‹…βŸ©:VΓ—Vβ†’F wherever applicable.

Note

The notion of lines in a complex vector space may sound very confusing as a complex line is topologically equivalent to a real plane, not a real line. If you are getting lost while reading this section, just think of F as R and visualize everything in a real vector space. The algebraic presentation of affine sets and spaces is equally valid for complex vector spaces.

A key property of R is that R is totally ordered. Hence, the scalars from R can be compared. There is no natural order in C, the field of complex numbers. As you study this section, you will notice that scalar comparison is never needed in the treatment of affine sets, subspaces and transformations in this section.

4.15.1. LinesΒΆ

Definition 4.175 (Line)

Let x1 and x2 be two points in V. Points of the form

y=ΞΈx1+(1βˆ’ΞΈ)x2 where ΞΈβˆˆF

form a line passing through x1 and x2.

  • at ΞΈ=0 we have y=x2.

  • at ΞΈ=1 we have y=x1.

We can also rewrite y as

y=x2+ΞΈ(x1βˆ’x2)βˆ€ΞΈβˆˆF.

In this definition:

  • x2 is called the base point for this line.

  • x1βˆ’x2 defines the direction of the line.

  • y is the sum of the base point and the direction scaled by the parameter ΞΈ.

  • As ΞΈ goes from 0 to 1, y moves from x2 to x1.

Remark 4.29

An alternative notation for the line as a set is x2+F(x1βˆ’x2) following the notation in Definition 4.42.

4.15.2. Affine SetsΒΆ

Definition 4.176 (Affine set)

A set CβŠ†V is affine if the line through any two distinct points in C lies in C.

In other words, for any x1,x2∈C, we have ΞΈx1+(1βˆ’ΞΈ)x2∈C for all θ∈F.

Another way to write this is:

βˆ€ΞΈβˆˆF,C=ΞΈC+(1βˆ’ΞΈ)C.

Different authors use other names for affine sets like β€œaffine manifold”, β€œaffine variety”, β€œlinear variety” or β€œflat”.

Example 4.30

The empty set βˆ… is affine vacuously as it contains no points. Hence, every line passing through the points in βˆ… is inside it vacuously.

Example 4.31

For any x∈V, the singleton set {x} is affine vacuously. It contains only one point. Hence, every line passing through two distinct points in {x} is inside it vacuously.

In fact:

ΞΈx+(1βˆ’ΞΈ)x=xβˆ€ΞΈβˆˆF.

Example 4.32

Any line in V is an affine set.

Example 4.33

Any vector space V is affine. It is so since a vector space is closed under vector addition and scalar multiplication. Hence, for any two points in the vector space, the line passing through it is contained inside the space.

Theorem 4.166 (Linear subspaces are affine)

The linear subspaces of a vector space V are affine sets containing the zero vector.

Proof. Let W be a linear subspace of V.

  1. Then W contains 0.

  2. Let x,y∈W.

  3. Then, by linearity, any αx+βy∈W.

  4. In particular, for some θ∈F, ΞΈx+(1βˆ’ΞΈ)w∈W holds too.

  5. Thus, W is affine.

For the converse, let A be an affine set containing 0.

  1. For any x∈A and t∈F,

    tx=(1βˆ’t)0+tx∈A

    since A is affine. Thus, A is closed under scalar multiplication.

  2. Let x,y∈A. Since A is affine, hence

    12(x+y)=12x+(1βˆ’12)y∈A.
  3. But then, x+y∈A holds too since A is closed under scalar multiplication.

  4. Thus, A is closed under vector addition.

  5. Since A is closed under scalar multiplication and vector addition, hence A must be a subspace.

4.15.3. Affine CombinationsΒΆ

If we denote Ξ±=ΞΈ and Ξ²=(1βˆ’ΞΈ) we see that Ξ±x1+Ξ²x2 represents a linear combination of points in C such that Ξ±+Ξ²=1. The idea can be generalized in following way.

Definition 4.177 (Affine combination)

A point of the form x=ΞΈ1x1+β‹―+ΞΈkxk where ΞΈ1+β‹―+ΞΈk=1 with ΞΈi∈F and xi∈V, is called an affine combination of the points x1,…,xk.

Note that the definition only considers finite number of terms in the affine combination.

It can be shown easily that an affine set C contains all affine combinations of its points.

Theorem 4.167 (Affine set contains affine combinations)

If C is an affine set, x1,…,xk∈C, and ΞΈ1+β‹―+ΞΈk=1, then the point x=ΞΈ1x1+β‹―+ΞΈkxk also belongs to C.

Proof. We shall call ΞΈ1x1+β‹―+ΞΈkxk=βˆ‘i=1kΞΈixi with βˆ‘i=1kΞΈi=1 as k term affine combinations.

Our proof strategy is as follows:

  1. We show that an affine set contains all its 2 term affine combinations.

  2. We then show that if an affine set contains all its kβˆ’1 term affine combinations then it must contain all its k term affine combinations.

  3. Thus, by principle of mathematical induction, it contains all its affine combinations.

An affine combination of two points is of the form ΞΈ1x1+ΞΈ2x2 where ΞΈ1+ΞΈ2=1. By definition an affine set contains all its 2 term affine combinations.

Now, assume that C contains all its kβˆ’1 term affine combinations.

  1. Consider points x1,…,xkβˆ’1,xk∈C.

  2. Let ΞΈ1,…,ΞΈkβˆ’1,ΞΈk∈F such that ΞΈ1+β‹―+ΞΈkβˆ’1+ΞΈk=1.

  3. Without loss of generality, assume that ΞΈkβ‰ 1. Thus, 1βˆ’ΞΈkβ‰ 0.

  4. Note that ΞΈ1+β‹―+ΞΈkβˆ’1=1βˆ’ΞΈk.

  5. Thus, ΞΈ11βˆ’ΞΈk+β‹―+ΞΈkβˆ’11βˆ’ΞΈk=1.

  6. We can then write:

    x=βˆ‘i=1kΞΈixi=βˆ‘i=1kβˆ’1ΞΈixi+ΞΈkxk=(1βˆ’ΞΈk)βˆ‘i=1kβˆ’1ΞΈi1βˆ’ΞΈkxi+ΞΈkxk.
  7. Note that the term y=βˆ‘i=1kβˆ’1ΞΈi1βˆ’ΞΈkxi is an affine combination of kβˆ’1 terms.

  8. Thus, by inductive hypothesis, y∈C.

  9. We are left with

    x=(1βˆ’ΞΈk)y+ΞΈkxk.
  10. This is a two term affine combination. Since y,xk∈C, hence x∈C.

  11. Thus, we established that if C contains its kβˆ’1 term affine combinations, it contains its k term affine combinations too.

Theorem 4.168

An affine combination of affine combinations is an affine combination.

Proof. Let u=βˆ‘i=1kΞΈixi and v=βˆ‘j=1lΞ»jyj where xi,yj∈V and βˆ‘ΞΈi=1 and βˆ‘Ξ»j=1.

We claim that w=Ξ³u+(1βˆ’Ξ³)v is also an affine combination.

w=Ξ³u+(1βˆ’Ξ³)v=Ξ³βˆ‘i=1kΞΈixi+(1βˆ’Ξ³)βˆ‘j=1lΞ»jyj=βˆ‘i=1kΞ³ΞΈixi+βˆ‘j=1l(1βˆ’Ξ³)Ξ»jyj.

Notice that:

βˆ‘i=1kΞ³ΞΈi+βˆ‘j=1l(1βˆ’Ξ³)Ξ»j=Ξ³βˆ‘i=1kΞΈi+(1βˆ’Ξ³)βˆ‘j=1lΞ»j=Ξ³1+(1βˆ’Ξ³)1=1.

Thus, w is an affine combination of the points xi and yj.

We can use the mathematical induction to show that arbitrary affine combinations of affine combinations are affine combinations.

4.15.4. Connection with Linear SubspacesΒΆ

Theorem 4.169 (affine - point = linear)

Let C be a nonempty affine set and x0 be any element in C. Then the set

V=Cβˆ’x0={xβˆ’x0|x∈C}

is a linear subspace of V.

To show that V is indeed a linear subspace, we can show that every linear combination of two arbitrary elements in V belongs to V.

Proof. Let v1 and v2 be two elements in V. Then by definition, there exist x1 and x2 in C such that

v1=x1βˆ’x0 and v2=x2βˆ’x0.

Thus

av1+v2=a(x1βˆ’x0)+x2βˆ’x0=(ax1+x2βˆ’ax0)βˆ’x0βˆ€a∈F.

But since a+1βˆ’a=1, hence x3=(ax1+x2βˆ’ax0)∈C (an affine combination).

Hence av1+v2=x3βˆ’x0∈V [by definition of V].

Thus, any linear combination of elements in V belongs to V. Hence, V is a linear subspace of V.

Observation 4.9 (affine = linear + point)

With the previous result, we can use the following notation:

C=V+x0={v+x0|v∈V}

where V is a linear subspace of V and x0∈V. In other words, a nonempty affine set is a linear subspace with an offset.

We need to justify this notation by establishing that there is one and only linear subspace associated with an affine set. This is done in the next result.

Theorem 4.170 (Uniqueness of associated subspace)

Let C be a nonempty affine set and let x1 and x2 be two distinct elements in C. Let V1=Cβˆ’x1 and V2=Cβˆ’x2, then the linear subspaces V1 and V2 are identical.

Proof. We show that V1βŠ†V2 and V2βŠ†V1.

  1. Let v∈V1.

  2. There exists x∈C such that v=xβˆ’x1.

  3. Then, v=xβˆ’x1+x2βˆ’x2.

  4. Let y=xβˆ’x1+x2. Note that x,x1,x2∈C and y is an affine combination of x,x1,x2.

  5. Thus, y∈C.

  6. We can now write v=yβˆ’x2.

  7. Thus, v∈V2 as V2=Cβˆ’x2.

  8. Thus, V1βŠ†V2.

  9. An identical reasoning starting with some v∈V2 gives us V2βŠ†V1.

  10. Thus, V1=V2.

Thus the subspace V associated with a nonempty affine set C doesn’t depend upon the choice of offset x0 in C.

Corollary 4.30

If an affine set contains 0 then it is a linear subspace.

We have already shown this in Theorem 4.166. This is an alternative proof.

Proof. The linear subspace associated with an affine set C is given by V=Cβˆ’x0 for any x0∈V.

In particular, if C contains 0, then

V=Cβˆ’0=C.

Thus, C is a linear subspace.

4.15.5. Affine Subspaces and DimensionΒΆ

Definition 4.178 (Affine subspace)

A nonempty affine set is called an affine subspace. An affine subspace is a linear subspace with an offset.

Another way to express this is as follows. C is an affine subspace of V if:

Cβ‰ βˆ… and βˆ€ΞΈβˆˆF,C=ΞΈC+(1βˆ’ΞΈ)C.

Definition 4.179 (Affine proper subspace)

An affine subspace A in a vector space V is called a proper subspace if the linear subspace associated with A is a proper subspace of V.

In other words, A is affine, Aβ‰ βˆ…, and Aβ‰ V.

Definition 4.180 (Affine dimension)

We define the affine dimension of an affine subspace C as the dimension of the associated linear subspace V=Cβˆ’x0 for some x0∈C if the subspace V is finite dimensional.

The dimension of βˆ… (empty affine set) is βˆ’1 by convention.

The definition is consistent since V is independent of the choice of x0∈C.

Example 4.34 (Singletons as affine subspaces)

For any x∈V, the singleton set {x} can be expressed as

{x}=x+{0}.

Its corresponding linear subspace is {0} of zero dimension.

Thus, the singleton set has an affine dimension of 0.

Remark 4.30 (Points, lines, planes, flats)

The affine sets of dimension 0, 1 and 2 are called points, lines and planes respectively.

An affine set of dimension k is often called a k-flat.

Example 4.35 (More affine sets)

  • The euclidean space Rn is affine.

  • Any line is affine. The associated linear subspace is a line parallel to it which passes through origin.

  • Any plane is affine. If it passes through origin, it is a linear subspace. The associated linear subspace is the plane parallel to it which passes through origin.

Theorem 4.171

An affine subspace is closed under affine combinations.

Proof. This is from the definition of affine sets and Theorem 4.167.

Observation 4.10 (Affine - affine = Linear)

Let C be an affine subspace. Let V be the linear subspace associated with C given by V=Cβˆ’x. Then every vector v∈V can be written as v=yβˆ’x where y∈C. Since V doesn’t depend on the choice of x, hence V is the set of all vectors of the form yβˆ’x where y,x∈C.

Thus, following the notation in Definition 4.42, we can write V as:

V=Cβˆ’C.

One way to think of affine sets as collections of points in an arbitrary space and the associated linear subspace as the collection of difference vectors between points.

vector ab=point bβˆ’point a.

4.15.6. Affine HullΒΆ

Definition 4.181 (Affine hull)

The set of all affine combinations of points in some arbitrary nonempty set SβŠ†V is called the affine hull of S and denoted as affS:

affS={ΞΈ1x1+β‹―+ΞΈkxk|x1,…,xk∈S and ΞΈ1+β‹―+ΞΈk=1}.

Theorem 4.172

An affine hull is an affine subspace.

Proof. Let SβŠ‚V be nonempty. Let T=affS. Let u,v∈T. Then

u=βˆ‘i=1kΞΈixiand v=βˆ‘j=1lΞ»jyj

where xi,yj∈S, βˆ‘ΞΈi=1 and βˆ‘Ξ»j=1.

Then, as shown in Theorem 4.168,

w=Ξ³u+(1βˆ’Ξ³)v

is an affine combination of points xi,yj∈S.

Thus, w∈T. Hence, T is an affine set. Since T is nonempty, hence T is an affine subspace.

Theorem 4.173 (Smallest containing affine subspace)

The affine hull of a nonempty set S is the smallest affine subspace containing S. More specifically, let C be any affine subspace with SβŠ†C. Then affSβŠ†C.

Proof. Let C be an arbitrary affine subspace such that SβŠ†C.

  1. From Theorem 4.171, C is closed under affine combinations.

  2. Thus, C contains all affine combinations of points of S.

  3. Thus, affSβŠ†C.

  4. We established in Theorem 4.172 that affS is an affine subspace.

  5. Thus, it is the smallest affine subspace containing S.

Corollary 4.31 (Affine hull as intersection)

The affine hull of a set is the intersection of all affine subspaces containing it.

Theorem 4.174 (Affine hull of a finite set)

Let S={v0,v1,…,vk} be a finite set of vectors from a vector space V. Let A=affS be their affine hull. Then, the linear subspace associated with A is given by

L=span{v1βˆ’v0,…,vkβˆ’v0}.

Consequently, the dimension of affS is at most k.

Proof. Since L=Aβˆ’v0, hence v1βˆ’v0,…,vkβˆ’v0∈L. Thus, span{v1βˆ’v0,…,vkβˆ’v0}βŠ†L.

Now, let v∈L. Then, there exist t0,…,tk with t0+β‹―+tk=1 such that

v=t0v0+β‹―+tkvkβˆ’v0.

But then

v=(1βˆ’t1βˆ’β‹―βˆ’tk)v0+t1v1+β‹―+tkvkβˆ’v0=t1(v1βˆ’v0)+β‹―+tk(vkβˆ’v0).

Thus, v∈span{v1βˆ’v0,…,vkβˆ’v0}. Thus, LβŠ†span{v1βˆ’v0,…,vkβˆ’v0}.

Combining:

L=span{v1βˆ’v0,…,vkβˆ’v0}.

Since L is a span of k vectors, hence dimL≀k. Thus, dimA≀k.

Theorem 4.175 (Containment)

If AβŠ†B, then affAβŠ†affB.

Proof. We proceed as follows:

  1. By definition, affB contains all affine combinations of points in B.

  2. Thus, it contains all affine combinations of points in A since AβŠ†B.

  3. But that is affA.

  4. Thus, affAβŠ†affB.

Theorem 4.176 (Tight containment)

If AβŠ†BβŠ†affA, then affA=affB.

Proof. We proceed as follows:

  1. Note that affA is an affine set containing B.

  2. But affB is the smallest affine set containing B, hence affBβŠ†affA.

  3. But AβŠ†B implies that affAβŠ†affB.

  4. Thus, affA=affB.

4.15.7. Affine IndependenceΒΆ

Definition 4.182 (Affine independence)

A set of vectors v0,v1,…,vk∈V is called affine independent, if the vectors v1βˆ’v0,…,vkβˆ’v0 are linearly independent.

If the associated subspace has dimension l then a maximum of l vectors can be linearly independent in it. Hence a maximum of l+1 vectors can be affine independent for the affine set.

Definition 4.183 (Affine dependence)

A set of vectors v0,v1,…,vk∈V is called affine dependent, if it is not affine independent. In other words, the vectors v1βˆ’v0,…,vkβˆ’v0 are linearly dependent.

Theorem 4.177 (Basis for the linear subspace associated with affine independent set)

Let v0,v1,…,vk∈V be a set of affine independent, points in V.

Let S={v0,v1,…,vk}. Let A=affS. Let L be the linear subspace associated with A.

Then, v1βˆ’v0,…,vkβˆ’v0 form a basis for L.

Proof. By definition of affine independence, v1βˆ’v0,…,vkβˆ’v0 are linearly independent.

By Theorem 4.174

L=span{v1βˆ’v0,…,vkβˆ’v0}.

Since, they are linearly independent and span L, hence they form a basis for L.

Theorem 4.178 (Affine independence and dimension)

A set of vectors v0,v1,…,vk∈V is affine independent if and only if their affine hull aff{v0,v1,…,vk} is k dimensional.

Proof. Assume v0,v1,…,vk to be affine independent.

  1. Then, by Definition 4.182, v1βˆ’v0,…,vkβˆ’v0 are linearly independent.

  2. Let L=span{v1βˆ’v0,…,vkβˆ’v0}.

  3. By Theorem 4.174

    dimaff{v0,v1,…,vk}=dimL=k

    since L is a span of k linearly independent vectors.

Now, assume A=aff{v0,v1,…,vk} is k dimensional.

  1. By Theorem 4.174, the linear subspace associated with A is given by L=span{v1βˆ’v0,…,vkβˆ’v0}.

  2. Thus, L is k dimensional since dimL=dimA=k.

  3. But, L is a span of k vectors.

  4. Hence, the k vectors v1βˆ’v0,…,vkβˆ’v0 must be linearly independent.

[31] defines v0,v1,…,vk∈V as affine independent if their hull is k dimensional. As we can see above, our definition is equivalent.

Theorem 4.179 (Affine independent points in an affine subspace)

Let A be an affine subspace of a vector space V such that dimA=k. Then, it is possible to choose a set of up to k+1 points in A which are affine independent. Any set of k+2 points in A is not affine independent.

Proof. Let L be the subspace associated with A and let v0∈A be some fixed point of A.

  1. We have dimL=dimA=k.

  2. Choose a basis B={x1,…,xk} of L.

  3. Let v1=x1+v0,…,vk=xk+v0.

  4. Then, the set of k+1 points v0,v1,…,vk are affine independent since v1βˆ’v0,…,vkβˆ’v0 are linearly independent.

  5. For less than k+1 points, we can choose less than k vectors from the basis B and construct accordingly.

We now show that any set of k+2 points cannot be affine independent.

  1. Let v0,v1,…,vk,vk+1 be an arbitrary set of k+2 points in A.

  2. Then, v1βˆ’v0,…,vkβˆ’v0,vk+1βˆ’v0∈L is a set of k+1 points in L.

  3. Since dimL=k, hence any set of k+1 points in L is linearly dependent.

  4. Thus, v0,v1,…,vk,vk+1 cannot be affine independent.

Theorem 4.180 (Affine set as an affine hull)

Let A be an affine subspace of a vector space V such that dimA=k. Let {v0,…,vk} be a set of k+1 affine independent points of A. Then,

A=aff{v0,…,vk}.

Proof. Let L be the linear subspace associated with A and let

H=aff{v0,…,vk}.

Since A being affine is closed under affine combinations, hence HβŠ†A.

We now show that AβŠ†H.

  1. Let v∈A.

  2. Then, vβˆ’v0∈L.

  3. By Theorem 4.177, v1βˆ’v0,…,vkβˆ’v0 form a basis for L.

  4. Thus,

    vβˆ’v0=t1(v1βˆ’v0)+β‹―+tk(vkβˆ’v0).
  5. But then,

    v=(1βˆ’t1βˆ’β‹―βˆ’tk)v0+t1v1+β‹―+tkvk

    which is an affine combination of {v0,…,vk}.

  6. Thus, v∈H.

  7. Thus, AβŠ†H.

Theorem 4.181 (Extending an affine independent set of points)

Let V be a finite dimensional vector space with n=dimV. Any set of m+1 affine independent points in V (where m<n) can be extended to a set of n+1 affine independent points.

Proof. Let S={v0,v1,…,vm} be a set of m affine independent points.

  1. Let A=affS.

  2. Let L be the linear subspace associated with A.

  3. Let x1=v1βˆ’v0,…,xm=vmβˆ’v0.

  4. The set {x1,…,xm} forms a basis for L.

  5. Extend {x1,…,xm} to {x1,…,xn} to form a basis for V.

  6. Compute the points vi=xi+v0 for i=m+1,…,n.

  7. Then, the set of points {v0,…,vn} is an affine independent set since the {x1,…,xn} are linearly independent.

4.15.8. Barycentric Coordinate SystemΒΆ

Theorem 4.182 (Unique representation from affine independent points)

Let V be a vector space and v0,v1,…,vk be a set of k+1 affine independent points in V.
Let S={v0,v1,…,vk}. Let A=affS.

Then, every point in A can be represented uniquely as

v=t0v0+β‹―+tkvk

such that t0+β‹―+tk=1.

Proof. By definition of affine hull, any point in the hull A is an affine combination of the points in S. We shall first find a suitable representation as an affine combination of S. Then, we shall prove the uniqueness of such a representation by showing that if the representation is not unique then the points in S cannot be affine independent.

Let L be the linear subspace associated with A. Let x1=v1βˆ’v0,…,xk=vkβˆ’v0. Then, by Theorem 4.177, the set B={x1,…,xk} forms a basis for L.

Let v∈A. Then, x=vβˆ’v0∈L.

Then, there is a unique representation of x in the basis B:

x=s1x1+…skxk.

Then,

v=x+v0=s1x1+…skxk+v0=s1(v1βˆ’v0)+…sk(vkβˆ’v0)+v0=(1βˆ’s1βˆ’β‹―βˆ’sk)v0+s1v1+…skvk.

Letting t0=1βˆ’s1βˆ’β‹―βˆ’sk and ti=si for i=1,…,k we arrive at a representation of v in terms of points in S such that t0+β‹―+tk=1 and

v=t0v0+t1v1+β‹―+tkvk.

We now claim that this representation is unique. Suppose, there was another representation

v=r0v0+r1v1+β‹―+rkvk

such that r0+β‹―+rk=1.

Then, we would have:

r0v0+r1v1+β‹―+rkvk=t0v0+t1v1+β‹―+tkvk⟺(1βˆ’r1βˆ’β‹―βˆ’rk)v0+r1v1+β‹―+rkvk=(1βˆ’t1βˆ’β‹―βˆ’tk)v0+t1v1+β‹―+tkvk⟺r1(v1βˆ’v0)+β‹―+rk(vkβˆ’v0)+v0=t1(v1βˆ’v0)+β‹―+tk(vkβˆ’v0)+v0⟺r1x1+β‹―+rkxk=t1x1+β‹―+tkxk⟺(r1βˆ’t1)x1+β‹―+(rkβˆ’tk)xk=0.

But, the set {x1,…,xk} is linearly independent since v0,v1,…,vk are affine independent.

Hence, r1=t1,…,rk=tk must be true.

Thus, r0=t0 must be true since r0=1βˆ’r1βˆ’β‹―βˆ’rk and t0=1βˆ’t1βˆ’β‹―βˆ’tk.

Thus, v has a unique representation in S.

This unique representation can be used to define a coordinate system in an affine set.

Definition 4.184 (Barycentric coordinate system)

Let V be a vector space and v0,v1,…,vk be a set of k+1 affine independent points in V.
Let S={v0,v1,…,vk}. Let A=affS.

Then, every point in A can be represented uniquely as

v=t0v0+β‹―+tkvk

such that t0+β‹―+tk=1. This representation is known as the barycentric coordinate system.

If A is an arbitrary finite dimensional affine subspace of V with dimA=k, then we can select k+1 affine independent points v0,v1,…,vk∈A (thanks to Theorem 4.179). Any such set of k+1 affine independent points of A affords A with a barycentric coordinate system.

4.15.9. TranslationsΒΆ

Definition 4.185 (Translation operator)

Let V be a vector space. An operator Ta:V→V is called a translation operator if

Ta(x)=x+aβˆ€x∈X

where a∈X is a fixed (translation) vector.

It can be easily seen that Ta(C)=a+C=C+a.

Definition 4.186 (Translate)

Let CβŠ†V. The translate of C by some a∈V is defined to be the set C+a.

Observation 4.11 (Translating the vector space)

V+a=Vβˆ€a∈V.

Translating the whole vector space doesn’t change it.

βˆ…+a=βˆ….

This follows from the definition of the set vector addition.

{0}+a={a}.

The translate of the trivial subspace is a singleton set.

Theorem 4.183 (Affine translate)

A translate of an affine set is affine.

Proof. Let C be affine and a∈V.

  1. Let x,y∈C+a.

  2. Then, x=u+a and y=v+a for some u,v∈C.

  3. Then for some t∈F,

    tx+(1βˆ’t)y=t(u+a)+(1βˆ’t)(v+a)=tu+(1βˆ’t)v+ta+(1βˆ’t)a=tu+(1βˆ’t)v+a.
  4. But w=tu+(1βˆ’t)v∈C since C is affine.

  5. Hence, tx+(1βˆ’t)y=w+a∈C+a.

  6. Thus, C+a is affine.

Definition 4.187 (Parallel affine sets)

Two affine sets C and D are called parallel to each other if

D=C+a

for some a∈V. We denote this by Cβˆ₯D.

Clearly, every affine set is parallel to its associated linear subspace.

This definition of parallelism is more restrictive as it allows comparing only those affine sets which have the same dimension. Thus, we cannot compare a line with a plane.

Every point is parallel to every other point.

Theorem 4.184 (Parallelism equivalence relation)

Consider the class of all affine subsets of a vector space V. The relation Cβˆ₯D is an equivalence relation.

Proof. [Reflexivity]

  1. C=C+0.

  2. Hence Cβˆ₯C.

[Symmetry]

  1. Let Cβˆ₯D.

  2. Then, there exists a∈V such that D=C+a.

  3. But then, C=D+(βˆ’a).

  4. Thus, Dβˆ₯C.

[Transitivity]

  1. Let Cβˆ₯D and Dβˆ₯E.

  2. Then, D=C+a and E=D+b for some a,b∈V.

  3. But then, E=C+(a+b).

  4. Thus, Cβˆ₯E.

Theorem 4.185 (Existence and uniqueness of a parallel linear subspace)

Every affine subspace (nonempty affine set) A is parallel to a unique subspace. The subspace is given by:

W=Aβˆ’A.

This result is a restatement of Observation 4.10.

Proof. From Theorem 4.170, there is a unique linear subspace L associated with A given by L=Aβˆ’a for some a∈A.

Since A=L+a hence, A and L are parallel to each other.

Two linear subspaces are parallel to each other only if they are identical. Thus, L is the unique linear subspace parallel to A.

Now, notice that:

W=Aβˆ’A=⋃a∈AAβˆ’a.

But L=Aβˆ’a for any a∈A as L is independent of the choice of a∈A.

Thus,

W=⋃a∈AAβˆ’a=⋃a∈AL=L.

Thus, the unique linear subspace parallel to A is given by W=Aβˆ’A.

4.15.10. Affinity Preserving OperationsΒΆ

We discuss some operations which preserve the affine character of its inputs

4.15.10.1. IntersectionΒΆ

Theorem 4.186 (Intersection of affine sets)

If S1 and S2 are affine sets then S1∩S2 is affine.

Proof. Let x1,x2∈S1∩S2. We have to show that

tx1+(1βˆ’t)x2∈S1∩S2,βˆ€t∈F.

Since S1 is affine and x1,x2∈S1, hence

tx1+(1βˆ’t)x2∈S1,βˆ€t∈F.

Similarly

tx1+(1βˆ’t)x2∈S2,βˆ€t∈F.

Thus

tx1+(1βˆ’t)x2∈S1∩S2,βˆ€t∈F.

Thus, S1∩S2 is affine.

We can generalize it further.

Theorem 4.187 (Intersection of arbitrary collection of affine sets)

Let {Ai}i∈I be a family of sets such that Ai is affine for all i∈I. Then ∩i∈IAi is affine.

Proof. Let x1,x2 be any two arbitrary elements in ∩i∈IAi.

x1,x2∈∩i∈IAi⟹x1,x2∈Aiβˆ€i∈I⟹tx1+(1βˆ’t)x2∈Aiβˆ€t∈Fβˆ€i∈I since Ai is affine βŸΉtx1+(1βˆ’t)x2∈∩i∈IAi.

Hence ∩i∈IAi is affine.

4.15.11. Hyper PlanesΒΆ

Recall from Definition 4.104 that a set of the form:

Hf,aβ‰œ{x∈V|f(x)=a}

where f is a nonzero linear functional on V and a∈F is called a hyperplane.

Theorem 4.188

Every hyperplane is affine.

Proof. We proceed as follows:

  1. Let x,y∈Hf,a.

  2. Then, f(x)=a and f(y)=a.

  3. Consider any t∈F and let z=tx+(1βˆ’t)y.

  4. Then, due to linearity of f,

    f(z)=f(tx+(1βˆ’t)y)=tf(x)+(1βˆ’t)f(y)=ta+(1βˆ’t)a=a.
  5. Thus, z∈Hf,a.

  6. Thus, Hf,a is an affine set.

Theorem 4.189 (Linear subspace parallel to a hyperplane)

Let H be a hyperplane given by

H={x∈V|f(x)=a}

where f is a nonzero linear functional on V and a∈F.

Then, the linear subspace parallel to H is given by the kernel of the linear functional f:

L=fβˆ’1(0)={x∈V|f(x)=0}.

Proof. Let V be the linear subspace parallel to H.

  1. Then, any v∈V can be written as v=xβˆ’y for some x,y∈H.

  2. But then,

    f(v)=f(xβˆ’y)=f(x)βˆ’f(y)=aβˆ’a=0.
  3. Thus, v∈L and hence VβŠ†L.

For the converse, we proceed as follows.

  1. Let v∈L and x∈H.

  2. Let y=xβˆ’v.

  3. Then, f(y)=f(x)βˆ’f(v)=aβˆ’0=a.

  4. Thus, y∈H.

  5. Thus, v=xβˆ’y where x,y∈H.

  6. Thus, v∈Hβˆ’H=V.

  7. Thus, LβŠ†V.

Combining, L=H.

Theorem 4.190 (Dimension of a hyperplane)

Let H be a hyperplane given by

H={x∈V|f(x)=a}

where f is a nonzero linear functional on V and a∈F.

If V is finite dimensional, then

dimH=dimVβˆ’1.

Proof. From Theorem 4.189, the linear subspace parallel to H is given by

L=fβˆ’1(0)={x∈V|f(x)=0}.

From Theorem 4.101, the dimension of the kernel of a linear functional in a finite dimensional vector space is given by:

dimL=dimVβˆ’1.

From Definition 4.180,

dimH=dimL=dimVβˆ’1.

Theorem 4.191 (Hyperplanes in inner product spaces)

If V is an inner product space over F, then a set of the form

H={x|⟨x,a⟩=b}

where a∈V is a nonzero vector and b∈F; is a hyperplane.

Moreover, every hyperplane of V can be represented in this form, with a and b unique up to a common non-zero multiple.

Proof. By Theorem 4.104, the mapping Ta:V→F defined by:

Ta(x)β‰œβŸ¨x,aβŸ©βˆ€x∈V

is a linear functional. Thus, H is a hyperplane.

By Theorem 4.106, every linear functional can be identified as an inner product with a vector a∈V. Thus, every hyperplane can be written as

H={x|⟨x,a⟩=b}.

This representation is not unique since the set

{x|⟨x,t―a⟩=tb}

is identical to H for any t∈F such that tβ‰ 0.

Theorem 4.192 (Affine = Intersection of hyperplanes)

Let V be a finite dimensional vector space. Then, every proper affine subset of V is a finite intersection of hyperplanes.

Proof. If C=βˆ…, we can choose any two non-intersecting parallel hyperplanes and C is their intersection.

Let C be a proper affine subspace of V such that 1≀dimC<dimV.

  1. Let L be the linear subspace parallel to C.

  2. Then C=L+a for some fixed a∈C.

  3. Let n=dimV and m=dimL.

  4. Since L is a proper subspace of V hence m<n.

  5. Let {x1,…,xm} be a basis for L.

  6. Then, every v∈C can be written as:

    v=βˆ‘i=1mtixi+a.
  7. We can extend this basis to construct a basis {x1,…,xn} for V.

  8. We can construct a dual basis for the dual space Vβˆ—. For each i=1,…,n, define a linear functional fi:Vβ†’F by setting:

    fi(xj)={1 if i=j0 if iβ‰ j.
  9. Let ai=fi(a).

  10. Consider a family of hyperplanes defined as:

    Hi={x∈V|fi(x)=ai}

    where i=m+1,…,n.

  11. Consider their intersection

    H=β‹‚i=m+1nHi={x∈V|fi(x)=ai,i=m+1,…,n}.
  12. We claim that C=H.

We shall first show that CβŠ†H.

  1. Let v∈C.

  2. Then, v=βˆ‘j=1mtjxj+a.

  3. Then, fi(v)=βˆ‘j=1mtjfi(xj)+fi(a)=ai for i=m+1,…,n.

  4. Thus, v∈Hi for every i=m+1,…,n.

  5. Thus, v∈H.

  6. Thus, CβŠ†H.

We now show that HβŠ†C. Note that this is same as showing Hβˆ’aβŠ†L=Cβˆ’a.

  1. Let v∈Hβˆ’a.

  2. Hence, v=xβˆ’a such that x∈H.

  3. We can write v in terms of the basis {x1,…,xn} as

    v=βˆ‘j=1ntjxj.
  4. Then fi(v)=ti (by definition of fi).

  5. But, for any i∈[m+1,…,n]

    fi(v)=fi(xβˆ’a)=fi(x)βˆ’fi(a)=aiβˆ’ai=0

    since x∈HβŠ†Hi.

  6. Thus, ti=0 for every i=m+1,…,n.

  7. Thus,

    v=βˆ‘j=1mtjxj.
  8. Thus, v∈L since {x1,…,xm} is a basis for L.

  9. Thus, Hβˆ’aβŠ†L.

  10. Thus, HβŠ†L+a=C.

Combining these observations, we have H=C.

We are now left with the case of singleton sets C={a} where dimC=0 since the associated linear subspace is {0}.

  1. Choose any basis B={x1,…,xn} for V.

  2. Construct a dual basis F={f1,…,fn} for Vβˆ— as before.

  3. Let ai=fi(a) for i=1,…,n.

  4. Consider a family of hyperplanes defined as:

    Hi={x∈V|fi(x)=ai}

    where i=1,…,n.

  5. Consider their intersection

    H=β‹‚i=1nHi={x∈V|fi(x)=ai,i=1,…,n}.
  6. Now, it is straightforward to show that H={a}=C.

Corollary 4.32 (Affine sets in inner product space)

Let V be a finite dimensional inner product space over field F. Let A be a proper affine subset of V.

Then, there exist r (where r<dimV) nonzero vectors ai∈V and scalars bi∈F such that A is the intersection of the hyperplanes given by

Hi={x∈V|⟨x,ai⟩=bi}.

Specifically,

A=β‹‚i=1mHi={x∈V|⟨x,ai⟩=bi,i=1,…,m}.

Proof. It follows from Theorem 4.192 that A is a finite intersection of hyperplanes with r<n where n=dimV.

Since V is an inner product space, hence, due to Theorem 4.191, each hyperplane can be represented as

Hi={x∈V|⟨x,ai⟩=bi}

where ai∈V and bi∈F.

Procedure to select the hyperplane parameters.

  1. Pick a vector a∈A.

  2. Identify the linear subspace L=Aβˆ’a.

  3. Pick an orthonormal basis for L: {v1,…,vm}.

  4. Extend the orthonormal basis to V.

  5. Pick the basis vectors for LβŠ₯: {a1,…,ar} with m+r=n.

  6. Compute bi=⟨a,ai⟩.

4.15.12. Linear EquationsΒΆ

Example 4.36 (Solution set of linear equations)

We show that the solution set of linear equations forms an affine set.

Let C={x|Ax=b} where A∈FmΓ—n and b∈Fm.

Let C be the set of all vectors x∈Fn which satisfy the system of linear equations given by Ax=b. Then C is an affine set.

Let x1 and x2 belong to C. Then we have

Ax1=b and Ax2=b

Thus

ΞΈAx1+(1βˆ’ΞΈ)Ax2=ΞΈb+(1βˆ’ΞΈ)b⟹A(ΞΈx1+(1βˆ’ΞΈ)x2)=b⟹(ΞΈx1+(1βˆ’ΞΈ)x2)∈C

Thus, C is an affine set.

The subspace associated with C is nothing but the null space of A denoted as N(A).

Every affine set of Fn can be expressed as the solution set of a system of linear equations. If the system of equations is infeasible, then its solution set is βˆ…. Otherwise, its solution set is an affine subspace. If the system of equations has a unique solution, then the solution set is a singleton set which is an affine subspace of dimension 0.

Theorem 4.193 (Affine set = system of linear equations in Fn)

Let b∈Fm. Let A be an mΓ—n matrix in FmΓ—n. Consider the solution set of the system of linear equations Ax=b:

C={x∈Fn|Ax=b}.

Then, C is an affine set.

Moreover, every affine set in Fn can be represented as a system of linear equations.

Proof. If C=βˆ… (i.e., the system of equations is infeasible), then C is affine (since empty sets are affine by definition).

If the system of equations has a unique solution, then C={v} where v is the unique solution of the system of equations, then C is affine since singleton sets are affine.

We now consider the case that the system of equations has more than one solutions.

Let x1,x2∈C be distinct solutions of the system of linear equations and let t∈F. Then,

Ax1=b and Ax2=b

Consider x=tx1+(1βˆ’t)x2. Then,

Ax=A(tx1+(1βˆ’t)x2)=tAx1+(1βˆ’t)Ax2=tb+(1βˆ’t)b=b.

This means that x∈C. Thus, C contains all its affine combinations. Hence, C is affine.

We next show that every affine set of Fn can be represented as a system of linear equations. Note that Fn is an inner product space with the standard inner product given by ⟨x,y⟩=y―x.

Let C be an arbitrary affine set in Fn.

  1. If C=βˆ…, we can pick any infeasible system of linear equations as a representation of C.

  2. If C={v} is a singleton, we can pick the system Ix=v where I is an identity matrix in FnΓ—n.

  3. If C=Fn, we can choose A to be any mΓ—n zero matrix and b=0∈Fm. Then, the solution set of Ox=0 is all of Fn.

  4. We shall now consider the case of affine C with more than one elements and CβŠ‚Fn (proper subset).

  5. Let L be the subspace parallel to C (Theorem 4.185).

  6. Let LβŠ₯ be the orthogonal complement of L.

  7. Let B={v1,…,vm} be a basis for LβŠ₯ (where m<n).

  8. Since Fn is finite dimensional, hence L=(LβŠ₯)βŠ₯ (Theorem 4.90).

  9. Thus, due to Theorem 4.86,

    L={x|xβŠ₯v1,…,xβŠ₯vm}.
  10. Thus,

    L={x|⟨x,vi⟩=0,i=1,…,m}={x|Ax=0}

    where A is the mΓ—n matrix whose rows are v1,…,vm.

  11. Since C is parallel to L, there exists an a∈Fn such that

    C=L+a={x|A(xβˆ’a)=0}={x|Ax=b}

    where b=Aa.

4.15.13. Affine TransformationsΒΆ

Definition 4.188 (Affine transformation)

Let X and Y be vector spaces (on some field F). A (total) function T:Xβ†’Y is called an affine transformation if for every x,y∈X and for every t∈F

T(tx+(1βˆ’t)y)=tT(x)+(1βˆ’t)T(y).

An affine transformation is also known as an affine function or an affine operator.

An affine transformation preserves affine combinations. An affine combination in input leads to an identical affine combination in output.

4.15.13.1. Relation with Linear TransformationsΒΆ

We next show that a linear transformation followed by a translation is affine.

Theorem 4.194 (Linear + Translation ⟹ Affine)

Let X and Y be vector spaces (on some field F). Let L:Xβ†’Y be a linear transformation and let a∈Y. Define T:Xβ†’Y as

T(x)=L(x)+a.

Then, T is an affine transformation.

Proof. Let x,y∈X and t∈F. Then

T(tx+(1βˆ’t)y)=L(tx+(1βˆ’t)y)+a=tL(x)+(1βˆ’t)L(y)+a=tL(x)+ta+(1βˆ’t)L(y)+(1βˆ’t)a=t(L(x)+a)+(1βˆ’t)(L(y)+a)=tT(x)+(1βˆ’t)T(y).

Thus, T is affine.

We now prove a stronger result that every affine function is a linear transformation followed by a translation.

Theorem 4.195 (Affine = Linear + Translation)

Let X and Y be vector spaces (on some field F). Let T:Xβ†’Y be some mapping. Then, T is affine if and only if the mapping x↦T(x)βˆ’T(0) is linear.

In other words, an affine transformation can be written as a linear transformation followed by a translation and vice-versa.

Proof. Define:

L(x)=T(x)βˆ’T(0).

Notice that L(0)=T(0)βˆ’T(0)=0. Thus, L maps zero vector from X to the zero vector of Y.

We need to show that

L linear βŸΊT affine.

We shall show it in two steps.

  1. Show that if T is affine, then L must be linear.

  2. Show that if L is linear, then T must be affine.

Assume T to be affine. We shall show that L is linear.

Let x,y∈X and t∈F.

[Scalar multiplication]

L(tx)=T(tx)βˆ’T(0)=T(tx+(1βˆ’t)0)βˆ’T(0)=tT(x)+(1βˆ’t)T(0)βˆ’T(0)=t(T(x)βˆ’T(0))=tL(x).

[Vector addition]

L(x+y)=T(x+y)βˆ’T(0)=T(122x+122y)βˆ’T(0)=12T(2x)+12T(2y)βˆ’T(0)=12(T(2x)βˆ’T(0))+12(T(2y)βˆ’T(0))=12(L(2x)+L(2y))=12(2L(x)+2L(y))=L(x)+L(y).

Thus, L is linear. Here, we used the fact that L(2x)=T(2x)βˆ’T(0) and L was already shown to be homogeneous above giving L(2x)=2L(x).

Now, assume L to be linear. We shall show that T is affine.

Let x,y∈X and t∈F. Then

T(tx+(1βˆ’t)y)=L(tx+(1βˆ’t)y)+T(0)=tL(x)+(1βˆ’t)L(y)+T(0)=tL(x)+tT(0)+(1βˆ’t)L(y)+(1βˆ’t)T(0)=t(L(x)+T(0))+(1βˆ’t)(L(y)+T(0))=tT(x)+(1βˆ’t)T(y).

Thus, T is affine.

4.15.13.2. Affine Combinations and HullsΒΆ

We show that affine functions distribute over arbitrary affine combinations.

Theorem 4.196 (Affine functions on affine combinations)

Let X and Y be vector spaces on a field F. Let T:X→Y be affine.

Let x0,x1,…,xk∈X and t0,t1,…,tk∈F such that βˆ‘i=0kti=1. Then,

T(βˆ‘i=0ktixi)=βˆ‘i=0ktiT(xi).

Proof. Define:

L(x)=T(x)βˆ’T(0).

We know that L is linear. We have T(x)=L(x)+T(0).

Now,

T(βˆ‘i=0ktixi)=L(βˆ‘i=0ktixi)+T(0)=βˆ‘i=0ktiL(xi)+T(0)=βˆ‘i=0ktiL(xi)+(βˆ‘i=0kti)T(0)=βˆ‘i=0kti(L(xi)+T(0))=βˆ‘i=0ktiT(xi).

Theorem 4.197 (Preservation of affine hulls)

Let X and Y be vector spaces on a field F. Let T:Xβ†’Y be affine. Let SβŠ†X. Then,

affT(S)=T(affS);

i.e., the affine hull of T(S) is same as T(A) where A is the affine hull of S.

Proof. We first show that affT(S)βŠ†T(affS)

  1. Let y∈affT(S).

  2. Then, there exist y0,…,yk∈T(S) and t0,…,tk∈F such that βˆ‘i=0kti=1 and

    y=βˆ‘i=0ktiyi.
  3. But then, yi=T(xi) for some xi∈S for every i=0,…,k since yi∈T(S).

  4. Then, due to Theorem 4.196

    y=βˆ‘i=0ktiyi=βˆ‘i=0ktiT(xi)=T(βˆ‘i=0ktixi)

    since T preserves affine combinations.

  5. But, x=βˆ‘i=0ktixi∈affS since xi∈S and x is their affine combination.

  6. Thus, y=T(x) where x∈affS.

  7. Thus, y∈T(affS).

  8. Thus, affT(S)βŠ†T(affS).

We now show that T(affS)βŠ†affT(S).

  1. Let y∈T(affS).

  2. Then, there exists x∈affS such that y=T(x).

  3. Then, there exist x0,…,xk∈S and t0,…,tk∈F such that βˆ‘i=0kti=1 and

    x=βˆ‘i=0ktixi.
  4. Then, due to Theorem 4.196

    y=T(x)=T(βˆ‘i=0ktixi)=βˆ‘i=0ktiT(xi)

    since T preserves affine combinations.

  5. Let yi=T(xi) for i=0,…,m.

  6. Since xi∈S, hence yi∈T(S).

  7. Then,

    y=βˆ‘i=0ktiyi.
  8. But then, y is an affine combination of points in T(S).

  9. Thus, y∈affT(S).

  10. Thus, T(affS)βŠ†affT(S).

Combining these results:

T(affS)=affT(S).

4.15.13.3. Invertible Affine TransformationsΒΆ

Theorem 4.198 (Affine invertible = linear invertible)

An affine map is invertible if and only if its corresponding linear map as described in Theorem 4.195 is invertible.

The translation map is invertible. Composition of invertible maps is invertible. Since affine is composition of linear with translation hence affine is invertible if linear is invertible. Similarly, linear is also a composition of affine with translation. Hence, linear is invertible if affine is invertible.

Proof. Formally, let X and Y be vector spaces on a field F. Let T:X→Y be an affine map. Let L:X→Y be the linear map given by:

L(x)=T(x)βˆ’T(0).

Let T(0)=a and write

T(x)=L(x)+a and L(x)=T(x)βˆ’a.

Define a parameterized translation map Gv:Y→Y as:

Gv(y)=y+vβˆ€y∈Y.

Note that the inverse of the translation operator is given by:

Gvβˆ’1(y)=yβˆ’v=Gβˆ’v(y)

which is another translation operator. Thus, all translation operators are invertible.

Then,

L=Gβˆ’a∘T and T=Ga∘L.

Clearly, if T is invertible then so is L and if L is invertible then so is T.

Theorem 4.199 (Inverse of affine map is affine)

Let X and Y be vector spaces on a field F. Let T:X→Y be an affine map. If T is invertible, then its inverse is also an affine map.

Proof. We are given that T is affine and its inverse exists. Let S:Y→X be the inverse of T. We need to show that S is affine.

Since T is invertible, it is bijective.

  1. Let y1,y2∈Y and t∈F.

  2. Then, there exist x1,x2∈X such that

    y1=T(x1),y2=T(x2)

    and x1β‰ x2.

  3. Since S=Tβˆ’1, hence S(y1)=x1 and S(y2)=x2.

  4. Let y=ty1+(1βˆ’t)y2 and x=tx1+(1βˆ’t)x2.

  5. Then

    T(x)=T(tx1+(1βˆ’t)x2)=tT(x1)+(1βˆ’t)T(x2)=ty1+(1βˆ’t)y2=y

    since T is affine.

  6. Thus, T(x)=y.

  7. Consequently, S(y)=x.

  8. But then,

    S(ty1+(1βˆ’t)y2)=S(y)=x=tx1+(1βˆ’t)x2=tS(y1)+(1βˆ’t)S(y2).

We have shown that for any y1,y2∈Y and t∈F,

S(ty1+(1βˆ’t)y2)=tS(y1)+(1βˆ’t)S(y2).

Therefore, S is affine.

4.15.13.4. Affine Mapping between Affine SetsΒΆ

Theorem 4.200 (Affine mapping between affine independent sets)

Let V be a finite dimensional vector space with n=dimV. Let {a0,…,am} and {b0,…,bm} be two affine independent sets in V where m≀n. Then, there exists an invertible affine map, T:Vβ†’V such that Tai=bi for i=0,…,m. If m=n, then T is unique.

Proof. If m=0; i.e., we need to find a mapping between {a0} and {b0}, then we can choose any affine map

T(x)=A(x)+b0βˆ’A(a0)

where A:Vβ†’V is a linear map. Now consider the case where 1≀m≀n.

By Theorem 4.181, we can extend {a0,…,am} to an affine independent set {a0,…,an} and similarly {b0,…,bm} to {b0,…,bn}.

Both of these sets span the entire V (as affine hull).

The sets Ba={a1βˆ’a0,…,anβˆ’a0} and Bb={b1βˆ’b0,…,bnβˆ’b0} are two different bases for V.

Then, there exists a unique linear transformation A:V→V which carries Ba to Bb; i.e.

A(aiβˆ’a0)=biβˆ’b0,βˆ€i=1,…,n.

Now, consider the affine map given by

T(x)=A(x)+b0βˆ’A(a0)

where b0βˆ’A(a0) is a fixed translation.

Then,

T(ai)=A(ai)+b0βˆ’A(a0)=A(aiβˆ’a0)+b0=biβˆ’b0+b0=bi.

Thus, T is the desired affine transformation.

If m=n, then the linear transformation A is uniquely determined by the bases Ba and Bb which are given. Thus, T is unique too.

Corollary 4.33 (Affine mapping between affine sets)

Let V be a finite dimensional vector space with n=dimV. Let A and B be two affine subspaces of V of the same dimension; i.e., dimA=dimB=m where m≀n. Then, there exists a bijective affine transformation T:Vβ†’V such that T(A)=B.

Proof. If both A and B are singletons given by A={a} and B={b}, then any affine transformation given by

T(x)=A(x)+bβˆ’A(a)

where A:V→V is a linear transformation will do.

For 1≀m≀n, by Theorem 4.179 we can choose m+1 affine independent points {a0,…,am} and {b0,…,bm} in A and B respectively such that

A=aff{a0,…,am} and B=aff{b0,…,bm}

as per Theorem 4.180.

Then, by Theorem 4.200, an affine mapping T:V→V exists which maps ai to bi. Since affine mappings preserve affine hulls (Theorem 4.197), hence

T(A)=B.

4.15.13.5. GraphΒΆ

Theorem 4.201 (Graph of an affine map is affine)

Let X and Y be vector spaces on a field F. Let T:Xβ†’Y be an affine map. Let XβŠ•Y be the direct sum of X and Y.

Let GβŠ†XβŠ•Y be the graph of T given by

graT={(x,T(x))|x∈X}.

Then, G is an affine subset of XβŠ•Y.

In other words, graph of an affine map is affine.

Proof. If y=T(x) then z=(x,y)∈G.

Now, let z1,z2∈G and t∈F.

  1. z1=(x1,y1) such that y1=T(x1).

  2. z2=(x2,y2) such that y2=T(x2).

  3. Let z=tz1+(1βˆ’t)z2.

  4. Then,

    z=t(x1,y1)+(1βˆ’t)(x2,y2)=(tx1+(1βˆ’t)x2,ty1+(1βˆ’t)y2).
  5. Since T is affine, hence

    T(tx1+(1βˆ’t)x2)=tT(x1)+(1βˆ’t)T(x2)=ty1+(1βˆ’t)y2.
  6. Thus, z=(tx1+(1βˆ’t)x2,ty1+(1βˆ’t)y2)∈G.

  7. Thus, G is closed under affine combinations.

  8. Thus, G is affine.

As an implication, we can see that the graph of a linear map must be an affine set too since every linear map is an affine map. But a linear map maps 0x to 0y. Thus, its graph contains the origin (0x,0y) of XβŠ•Y. Thus, the graph of a linear map must be a subspace of XβŠ•Y.

4.15.14. Topology in Normed SpacesΒΆ

We next consider the special case of a vector space V endowed with a norm ‖⋅‖:V→R, which induces a metric d:V×V→R given by:

d(x,y)=β€–xβˆ’yβ€–.

V equipped with this metric becomes a metric space and is endowed with a metric topology. Useful topological properties of affine sets and transformations are discussed below.

Readers are encouraged to review the material in Normed Linear Spaces before proceeding further as the results presented here develop on the material presented in that section.

Our discussions are restricted to finite dimensional normed linear spaces as linear subspaces are closed (Theorem 4.66) and linear transformations are continuous (Theorem 4.65) in the finite dimensional spaces.

4.15.14.1. Affine SetsΒΆ

Theorem 4.202 (Affine sets are closed)

Every affine subset of a finite dimensional normed linear space V is a closed set.

Proof. βˆ… and V are closed by definition. Singletons {x} are closed due to Theorem 3.8.

All other affine sets are translations of a linear subspace.

  1. By Theorem 4.66, linear subspaces are closed in a finite dimensional normed linear space.

  2. By Theorem 4.48, translations preserve closed sets.

  3. Hence, affine sets of dimension greater than zero which are translates of the linear subspaces are also closed.

Theorem 4.203

Every proper affine subspace of a normed linear space V has an empty interior.

Proof. We proceed as follows:

  1. By Corollary 4.10, every proper linear subspace of V has an empty interior.

  2. A proper affine subspace is a translate of a proper linear subspace.

  3. By Theorem 4.48, if a set has an empty interior, then so does its translate.

Theorem 4.204 (Affine hull and closure)

Let V be a finite dimensional normed linear space. Let CβŠ†V. Then,

aff(clC)=affC.

Proof. Since CβŠ†clC, hence affCβŠ†aff(clC).

  1. Let A=affC.

  2. By Theorem 4.202, A is closed.

  3. By definition clC is the smallest closed set that contains C.

  4. By Proposition 3.6, any closed set that contains C also contains clC.

  5. Thus, clCβŠ†affC.

  6. Now, affC is an affine set.

  7. By definition, the affine hull is the smallest affine set that contains a set.

  8. Hence, aff(clC)βŠ†affC.

Together, we have:

aff(clC)=affC.

4.15.14.2. Affine TransformationsΒΆ

Theorem 4.205 (Affine transformations from finite dimensional spaces are continuous)

Let (V,‖⋅‖v) and (W,‖⋅‖w) be normed linear spaces. Let T:V→W be an affine transformation.

If V is finite dimensional, then T is continuous.

Proof. We can write T as the composition of a linear transformation followed by a translation.

  1. By Theorem 4.65, the linear transformation is continuous since V is finite dimensional.

  2. By Theorem 4.47, translations are continuous.

  3. By Theorem 3.46, composition of continuous functions is continuous.

  4. Hence, T is continuous.

Theorem 4.206 (Affine transformation and closure)

Let (V,‖⋅‖v) and (W,‖⋅‖w) be normed linear spaces. Let T:V→W be an affine transformation.

Assume that V is finite dimensional. Let AβŠ†V. Then,

T(clA)βŠ†clT(A).

Proof. By Theorem 4.205, T is continuous.

By Theorem 3.42 (4)

T(clA)βŠ†clT(A)

holds true for every subset A of V.

Recall from Definition 3.64 that a real valued function is closed if every sublevel set is closed.

Theorem 4.207 (Real valued affine functions are closed)

Let (V,‖⋅‖) be an n-dimensional normed linear space. Let T:V→R be an affine function. Then, T is closed.

Proof. 1. By Theorem 4.205, f is continuous.

  1. Let a∈R.

  2. The sublevel set for a is given by Sa={x∈V|T(x)≀a}.

  3. This is nothing but Tβˆ’1(βˆ’βˆž,a].

  4. The set (βˆ’βˆž,a] is a closed set.

  5. Since T is continuous, hence Tβˆ’1(βˆ’βˆž,a] is also closed.

  6. Thus, Sa is closed for every a∈R.

  7. Thus, T is closed.

4.15.14.3. Affine HomeomorphismsΒΆ

Theorem 4.208

Let V be a finite dimensional normed linear space. A bijective affine transformation T:V→V is a homeomorphism.

Proof. We proceed as follows:

  1. By Theorem 4.205, T is continuous.

  2. Since T is bijective, hence, Tβˆ’1 exists.

  3. By Theorem 4.199, Tβˆ’1 is affine.

  4. Again, by Theorem 4.205, Tβˆ’1 is continuous.

  5. Thus, T is a homeomorphism.

Theorem 4.209

Let V be a finite dimensional normed linear space. A bijective affine transformation T:V→V preserves closures.

In other words, for any AβŠ†V:

T(clA)=cl(T(A)).

Proof. By Theorem 4.208, T is a homeomorphism.

By Theorem 3.51, homeomorphisms preserve closures.

Thus, for any AβŠ†V

T(clA)=cl(T(A)).

Theorem 4.210

Let V be a finite dimensional normed linear space. A bijective affine transformation T:V→V preserves interiors.

In other words, for any AβŠ†V:

T(intA)=int(T(A)).

Proof. By Theorem 4.208, T is a homeomorphism.

By Theorem 3.52, homeomorphisms preserve interiors.

Thus, for any AβŠ†V

T(intA)=int(T(A)).

4.15.15. Real Valued Affine FunctionsΒΆ

In this subsection, we look at affine functions from a vector space V to the real line R.

Theorem 4.211 (Level sets of real valued affine functions)

Let V be a vector space. Let h:Vβ†’R be an affine function. Then, for any c∈R, the set hβˆ’1(c) is an affine set where

hβˆ’1(c)={x∈V|h(x)=c}.

Proof. We are given that h:V→R is affine.

  1. Let c∈R.

  2. If hβˆ’1(c) is empty, then it is affine and there is nothing to prove. So assume that it is nonempty.

  3. Let x,y∈hβˆ’1(c).

  4. Thus, h(x)=h(y)=c.

  5. Let t∈F.

  6. Let z=tx+(1βˆ’t)y.

  7. Then, by affine nature of h

    h(z)=h(tx+(1βˆ’t)y)=th(x)+(1βˆ’t)h(y)=tc+(1βˆ’t)c=c.
  8. Thus, z∈hβˆ’1(c).

  9. Thus, for any x,y∈hβˆ’1(c) and t∈F, z=tx+(1βˆ’t)y∈hβˆ’1(c).

  10. Thus, hβˆ’1(c) is an affine set.