A map is linear if there exists an -by- matrix such that for all .
38. Linear maps
38. Linear maps
Two definitions of linearity
At the outset of this course, we talked about the geometric transformations coming from matrices (rotations, reflections, shears etc). These geometric transformations have a name: they are called linear maps. In this video we'll give two definitions of linear maps and show they're equivalent. The first definition encapsulates how we've been dealing with linear maps so far:
However, some linear maps are more natural to describe in another way, without giving the matrix .
Fix . Consider the space of polynomials of degree at most . Differentiation gives a map from this space to itself. This map is linear. To understand why, we need to understand polynomials as vectors. We encode a polynomial as its vector of coefficients . Then which corresponds to the vector of coefficients . This is the same as so differentiation is linear (given by a matrix). In other words, if is the vector of coefficients of the polynomial then , where is this matrix.
This way of encoding polynomials as vectors is a bit artificial. For example, I could have chosen to write the vector with at the top and at the bottom, and the matrix would have ended up looking quite different. The fact that differentiation of polynomials is a linear map is an intrinsic fact about differentiation, and our proof above obscures that. So here's an equivalent definition of linearity which is more intrinsic.
A map is linear if:
-
for all
-
for all and for all .
I haven't specified the domain and target of because I want to be intentionally vague: this definition makes sense whenever the domain and target of admit operations of addition and rescaling (e.g. spaces of polynomials or functions as well as just ). In the final video of the course, we'll see that the natural setting for this definition is the setting of vector spaces.
Differentiation of polynomials is linear because and for any constant and polynomials .
The function which converts metres to feet is linear. Since 1 metre is feet, . If you double the number of metres, you double the number of feet. If you take two distances metres and metres you can add them and then convert to feet ( ) or you can convert and then add ( ) and you get the same answer. So is linear.
The function which converts Celsius to Kelvin is not linear. Recall that . Any linear map satisfies , because (some of those s are numbers, some are vectors!).
I'm told the way they used to mark exams in Oxford was to take the marks from each question, square them and add them up. For example, if there were two questions and you got marks and then your final score would be . This rewards those who do very well on a couple of questions (instead of scatter-gunning a little bit of knowledge over all questions). This function is not a linear map! For example, if you score and then you get in total, but if you double your score for then you quadruple your total. Sadly for those taking the exam, .
Equivalence
These two definitions of linearity are equivalent. In other words, the conditions
-
and
imply there exists a matrix such that , and any map of this form satisfies these conditions.
If for some matrix then and
Conversely, consider the basis vectors Let be the matrix whose columns are .
Then Since is the th column of , it agrees with (also the th column of ). Therefore which shows that is linear in the sense that it has the form for some matrix .