40. Images

Definition:

The image of a linear map $f\colon \mathbf{R}^n \to \mathbf{R}^m$ is the set of vectors $b \in \mathbf{R}^m$ such that $b = f(v)$ for some $v \in \mathbf{R}^n$. It is written as $\operatorname{im}(f)$.

If you think of applying a map as "following light rays" (like in some earlier examples), you can think of the image as the shadow your map casts.

If the map $f$ is the vertical projection $f\begin{pmatrix} x \\ y \\ z \end{pmatrix} = \begin{pmatrix} x \\ y \\ 0 \end{pmatrix}$ then the image of $f$ is the $xy$-plane. That is, $\operatorname{im}(f) = \left\{ \begin{pmatrix} x \\ y \\ 0 \end{pmatrix} : x, y \in \mathbf{R} \right\}$.

Consider the 3-by-2 matrix $A = \begin{pmatrix} 1 & 1 \\ 2 & 0 \\ 0 & 1 \end{pmatrix}$. The image of the corresponding linear map is the set of all vectors of the form $A\begin{pmatrix} x \\ y \end{pmatrix} = \begin{pmatrix} x + y \\ 2x \\ y \end{pmatrix}$. We studied this example earlier and even drew a picture of its image: it is the grey plane in the figure below. (There's a slight "videographic typo", i.e. a mistake, in the video; see if you can spot it.)

The plane of vectors of the form (x+y,2x,y) in 3D
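As a quick sanity check of the formula for the image, here is a minimal Python sketch. The helper name `apply` is our own choice (it is not from the lecture), and the sample points are arbitrary.

```python
def apply(matrix, v):
    """Multiply a matrix (given as a list of rows) by a vector (a list of numbers)."""
    return [sum(a * x for a, x in zip(row, v)) for row in matrix]


# The 3-by-2 matrix A from the example above.
A = [[1, 1],
     [2, 0],
     [0, 1]]

# A few arbitrary sample inputs (x, y).
for x, y in [(1, 0), (0, 1), (2, 3), (-1, 4)]:
    b = apply(A, [x, y])
    # Every vector in the image has the form (x + y, 2x, y).
    assert b == [x + y, 2 * x, y]
    print((x, y), "->", b)
```

Each printed vector lies on the plane described in the caption.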

Remarks

  • $0 \in \operatorname{im}(f)$ because $0 = f(0)$.

  • If $f$ is invertible then $\operatorname{im}(f) = \mathbf{R}^m$. This is because if $b \in \mathbf{R}^m$ then $b = f(f^{-1}(b))$, so $b \in \operatorname{im}(f)$.

Image is a subspace

Lemma:

The image of f is a subspace.

If $b_1, b_2 \in \operatorname{im}(f)$ then so is $b_1 + b_2$. To see this, observe that if $b_1, b_2 \in \operatorname{im}(f)$ then $b_1 = f(v_1)$ and $b_2 = f(v_2)$ for some $v_1, v_2 \in \mathbf{R}^n$. This means that $b_1 + b_2 = f(v_1) + f(v_2) = f(v_1 + v_2)$ (since $f$ is linear), so $b_1 + b_2 \in \operatorname{im}(f)$.

Similarly, for any scalar $\lambda$, $\lambda b_1 = \lambda f(v_1) = f(\lambda v_1)$ (since $f$ is linear), so $\lambda b_1 \in \operatorname{im}(f)$.
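The two closure properties can be checked on a concrete example. The following is only an illustration, not a proof: it uses the 3-by-2 matrix from earlier, and the vectors `v1`, `v2`, the scalar `lam` and the helper names are arbitrary choices of ours.

```python
def apply(matrix, v):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(a * x for a, x in zip(row, v)) for row in matrix]


def add(u, w):
    """Add two vectors entry by entry."""
    return [a + b for a, b in zip(u, w)]


def scale(lam, u):
    """Multiply every entry of a vector by the scalar lam."""
    return [lam * a for a in u]


A = [[1, 1],
     [2, 0],
     [0, 1]]

v1, v2, lam = [1, 2], [3, -1], 5   # arbitrary sample data
b1, b2 = apply(A, v1), apply(A, v2)

# b1 + b2 is again in the image: it is the image of v1 + v2.
assert add(b1, b2) == apply(A, add(v1, v2))
# lam * b1 is again in the image: it is the image of lam * v1.
assert scale(lam, b1) == apply(A, scale(lam, v1))
```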

Relation with simultaneous equations

Lemma:

$Av = b$ has a solution if and only if $b \in \operatorname{im}(f)$ where $f(v) = Av$.

This is a tautology from the definition of image! $Av = b$ has a solution if and only if there is a $v$ such that $f(v) = Av = b$.

So putting this together with the last lecture, we see that $Av = b$ has a solution if and only if $b \in \operatorname{im}(f)$ and, if it has a solution, then the space of solutions is a translate of $\ker(f)$.
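One way to test whether $b$ lies in the image on a computer is the following sketch. It relies on a standard fact that is not stated in these notes: $b$ is in the image of $A$ exactly when appending $b$ to $A$ as an extra column does not increase the rank. Rank here means the dimension of the image; the code computes it by counting leading entries after row reduction, which agrees with that dimension. The function names `rank` and `has_solution` are our own.

```python
from fractions import Fraction


def rank(matrix):
    """Count the leading entries found by Gaussian elimination (exact arithmetic)."""
    rows = [[Fraction(x) for x in row] for row in matrix]
    pivot_row = 0
    for col in range(len(rows[0])):
        # Look for a row at or below pivot_row with a nonzero entry in this column.
        pivot = next((r for r in range(pivot_row, len(rows)) if rows[r][col] != 0), None)
        if pivot is None:
            continue  # no leading entry in this column
        rows[pivot_row], rows[pivot] = rows[pivot], rows[pivot_row]
        # Clear the entries below the leading entry.
        for r in range(pivot_row + 1, len(rows)):
            factor = rows[r][col] / rows[pivot_row][col]
            rows[r] = [a - factor * b for a, b in zip(rows[r], rows[pivot_row])]
        pivot_row += 1
    return pivot_row


def has_solution(A, b):
    """Return True if A v = b has a solution, i.e. if b is in the image of A."""
    augmented = [row + [entry] for row, entry in zip(A, b)]
    return rank(augmented) == rank(A)


A = [[1, 1],
     [2, 0],
     [0, 1]]

assert has_solution(A, [5, 4, 3])        # this is A applied to (2, 3)
assert not has_solution(A, [1, 0, 0])    # would need 2x = 0, y = 0 but x + y = 1
```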

Rank

Definition:

The rank of a linear map/matrix is the dimension of its image.

Theorem:

(Rank-nullity theorem) If $A$ is an $m$-by-$n$ matrix (or $f\colon \mathbf{R}^n \to \mathbf{R}^m$ is a linear map) then $\operatorname{rank}(A) + \operatorname{nullity}(A) = n$. Here $n$ is the number of columns of $A$ (or the dimension of the domain of $f$).

The 3-by-3 matrix $A = \begin{pmatrix} 0 & 0 & 0 \\ 0 & 0 & 0 \\ 0 & 0 & 0 \end{pmatrix}$ sends everything to zero, so its image is a single point, which has dimension zero, so $\operatorname{rank}(A) = 0$. The kernel is the set of things which map to zero, and since everything maps to zero the kernel is $\mathbf{R}^3$. Therefore the nullity (dimension of the kernel) is three. Note that $0 + 3 = 3$ and $n = 3$ ($A$ is a 3-by-3 matrix) so the rank-nullity theorem holds.

The 3-by-3 matrix $B = \begin{pmatrix} 1 & 0 & 0 \\ 0 & 0 & 0 \\ 0 & 0 & 0 \end{pmatrix}$ sends $\begin{pmatrix} x \\ y \\ z \end{pmatrix}$ to $\begin{pmatrix} x \\ 0 \\ 0 \end{pmatrix}$, so its image is the $x$-axis. Therefore the rank (dimension of the image) is 1. The nullity is the number of free variables ($B$ is in reduced echelon form already) which is 2 (there is one leading entry). Again, $1 + 2 = 3$, which is good. We can see that as the rank increases, the nullity goes down (as required by the rank-nullity theorem).

The matrix $C = \begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 0 \end{pmatrix}$ is the vertical projection to the $xy$-plane, so its rank is 2 (image is the $xy$-plane). Its nullity is 1 (one free variable). Again, $2 + 1 = 3$.

The 3-by-3 identity matrix $I = \begin{pmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix}$ has rank 3 (for any $v$ we have $v = Iv$ so every vector is in the image) and the nullity is 0 (only the origin maps to the origin). Again, $3 + 0 = 3$.
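The four examples above can be reproduced with a short Python sketch. It counts columns with a leading entry (the rank, as the proof in these notes shows) and columns without one (the free variables, i.e. the nullity). Because every column is counted exactly once, the sum is $n$ by construction, so this illustrates the bookkeeping rather than proving the theorem. The function names are our own.

```python
from fractions import Fraction


def pivot_columns(matrix):
    """Row-reduce a matrix (list of rows) and return the columns with a leading entry."""
    rows = [[Fraction(x) for x in row] for row in matrix]
    pivots = []
    pivot_row = 0
    for col in range(len(rows[0])):
        pivot = next((r for r in range(pivot_row, len(rows)) if rows[r][col] != 0), None)
        if pivot is None:
            continue  # this column corresponds to a free variable
        rows[pivot_row], rows[pivot] = rows[pivot], rows[pivot_row]
        for r in range(pivot_row + 1, len(rows)):
            factor = rows[r][col] / rows[pivot_row][col]
            rows[r] = [a - factor * b for a, b in zip(rows[r], rows[pivot_row])]
        pivots.append(col)
        pivot_row += 1
    return pivots


def rank_and_nullity(matrix):
    """Return (number of leading entries, number of free variables)."""
    n = len(matrix[0])
    pivots = pivot_columns(matrix)
    free = [c for c in range(n) if c not in pivots]
    return len(pivots), len(free)


examples = {
    "A": [[0, 0, 0], [0, 0, 0], [0, 0, 0]],
    "B": [[1, 0, 0], [0, 0, 0], [0, 0, 0]],
    "C": [[1, 0, 0], [0, 1, 0], [0, 0, 0]],
    "I": [[1, 0, 0], [0, 1, 0], [0, 0, 1]],
}

for name, matrix in examples.items():
    r, k = rank_and_nullity(matrix)
    assert r + k == 3
    print(name, "rank:", r, "nullity:", k)
```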

The rank-nullity theorem is basically saying that the map $f$ is taking $\mathbf{R}^n$, crushing down some of the dimensions (those in the kernel), and mapping the rest faithfully onto the image (so the $n$ dimensions of $\mathbf{R}^n$ either contribute to the kernel or to the image).

Proof of rank-nullity theorem

The nullity of A is the number of free variables of A when you put it into reduced echelon form. If we can show that the rank is the number of dependent variables then we're done: there are n variables which are either free (contributing to kernel) or dependent (contributing to rank). Recall that the dependent variables correspond to the columns with leading entries in reduced echelon form.

So we need to show that the rank is the number of leading entries of A in reduced echelon form.

First step: we prove that the rank doesn't change when we do a row operation. Suppose we start with a matrix $A$ and do a row operation to get a matrix $A'$. We know there is an elementary matrix $E$ such that $A' = EA$. This tells us immediately that $\operatorname{im}(A)$ and $\operatorname{im}(A')$ have the same dimension: if $b = Av$ then $Eb = EAv = A'v$, so $b \mapsto Eb$ sends $\operatorname{im}(A)$ onto $\operatorname{im}(A')$, and since $E$ is invertible this gives us an "isomorphism" (invertible linear map) from the image of $A$ to the image of $A'$.

As the rank doesn't change under row operations, we may assume without loss of generality that A is in reduced echelon form.
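The first step can be illustrated on a small example. In this sketch the matrix `A` is the 3-by-2 matrix from earlier in the notes, and `E` is a hypothetical elementary matrix of our choosing (subtract twice row 1 from row 2). The helper names are also ours. The rank is computed by counting leading entries after row reduction.

```python
from fractions import Fraction


def matmul(P, Q):
    """Multiply two matrices given as lists of rows."""
    return [[sum(P[i][k] * Q[k][j] for k in range(len(Q)))
             for j in range(len(Q[0]))] for i in range(len(P))]


def apply(matrix, v):
    """Multiply a matrix by a vector."""
    return [sum(a * x for a, x in zip(row, v)) for row in matrix]


def rank(matrix):
    """Count the leading entries found by Gaussian elimination."""
    rows = [[Fraction(x) for x in row] for row in matrix]
    pivot_row = 0
    for col in range(len(rows[0])):
        pivot = next((r for r in range(pivot_row, len(rows)) if rows[r][col] != 0), None)
        if pivot is None:
            continue
        rows[pivot_row], rows[pivot] = rows[pivot], rows[pivot_row]
        for r in range(pivot_row + 1, len(rows)):
            factor = rows[r][col] / rows[pivot_row][col]
            rows[r] = [a - factor * b for a, b in zip(rows[r], rows[pivot_row])]
        pivot_row += 1
    return pivot_row


A = [[1, 1],
     [2, 0],
     [0, 1]]

# Elementary matrix for the row operation "row 2 minus 2 times row 1".
E = [[1, 0, 0],
     [-2, 1, 0],
     [0, 0, 1]]

A_prime = matmul(E, A)
assert A_prime == [[1, 1], [0, -2], [0, 1]]

# The row operation does not change the rank.
assert rank(A) == rank(A_prime) == 2

# b -> E b carries a vector in im(A) to a vector in im(A'): E(A v) = A' v.
v = [2, 3]
assert apply(E, apply(A, v)) == apply(A_prime, v)
```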

Second step: if A is in reduced echelon form then it has k nonzero rows (for some k ) followed by m - k zero rows. Now:

  • The number k is the number of leading entries (because each nonzero row has a leading entry and each zero row doesn't).

  • Recall that $Av = b$ has a solution if and only if $b_{k+1} = b_{k+2} = \cdots = b_m = 0$: these are the necessary and sufficient conditions for solving the simultaneous equations. If $A$ has a zero row then $b$ has to have a zero in that row, and if all these later entries of $b$ are zero then the other rows of $A$ just give us equations which determine the dependent variables.

Since the image of $A$ is the set of $b$ for which $Av = b$ has a solution, this means that $\operatorname{im}(A)$ is the set of $b$ for which $b_{k+1} = \cdots = b_m = 0$, i.e. those $b$ of the form $\begin{pmatrix} b_1 \\ \vdots \\ b_k \\ 0 \\ \vdots \\ 0 \end{pmatrix}$. This is a $k$-dimensional space, so we see that the rank equals $k$, the number of leading entries.
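To make the second step concrete, here is a small sketch with a hypothetical 3-by-4 matrix `R` in reduced echelon form (our own example, with $k = 2$ nonzero rows and leading entries in the first and third columns). It checks that the first $k$ standard basis vectors of $\mathbf{R}^3$ are in the image, and that every output has zeros in its last $m - k$ entries.

```python
def apply(matrix, v):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(a * x for a, x in zip(row, v)) for row in matrix]


def unit(n, j):
    """The j-th standard basis vector of R^n (0-indexed)."""
    return [1 if i == j else 0 for i in range(n)]


# Hypothetical example in reduced echelon form: m = 3 rows, n = 4 columns.
R = [[1, 2, 0, 3],
     [0, 0, 1, 4],
     [0, 0, 0, 0]]
k = 2                 # number of nonzero rows = number of leading entries
pivot_cols = [0, 2]   # columns containing the leading entries (0-indexed)

# Feeding in the basis vector for the i-th leading column gives the i-th basis
# vector of R^3, so the image contains e_1, ..., e_k.
for i, col in enumerate(pivot_cols):
    assert apply(R, unit(4, col)) == unit(3, i)

# Every vector in the image has zeros in its last m - k entries.
for v in [[1, 1, 1, 1], [2, -1, 0, 5], [0, 3, -2, 1]]:
    b = apply(R, v)
    assert all(entry == 0 for entry in b[k:])
    print(v, "->", b)
```

Together these two checks say the image is exactly the set of vectors whose last $m - k$ entries vanish, which has dimension $k$.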

This completes the proof of the rank-nullity theorem.