11. Dot product, 3

11. Dot product, 3

Orthogonal matrices

Definition:

An n -by- n matrix M is orthogonal if M T M = I .

This is equivalent to saying that:

  • the columns of M , considered as column vectors, are orthogonal to one another, and

  • each column has length one when considered as a vector.

Example:

The matrix M = ( cos θ - sin θ sin θ cos θ ) is orthogonal: M T M = ( cos θ sin θ - sin θ cos θ ) ( cos θ - sin θ sin θ cos θ ) = ( cos 2 θ + sin 2 θ - sin θ cos θ + cos θ sin θ sin θ cos θ - cos θ sin θ cos 2 θ + sin 2 θ ) = ( 1 0 0 1 ) = I .

Lemma:

If M is orthogonal then the geometric transformation of 𝐑 n defined by M preserves dot products, i.e. ( M v ) ( M w ) = v w .

( M v ) ( M w ) = v T M T M w = v T I w = v T w = v w .

In particular, orthogonal matrices preserve lengths of vectors. This is because | v | = v 1 2 + + v n 2 = v v and orthogonal matrices preserve dot products.

The sorts of transformations you should have in mind when you think of orthogonal matrices are rotations, reflections and combinations of rotations and reflections.

We now turn to the proof of the formula v w = | v | | w | cos θ where θ is the angle between v and w .

The "proof" I'll give is more of an intuitive argument. I'll point out where the dodgy bits are as we go on.

The angle between v and w is defined by taking the plane spanned by v and w and taking the angle between them inside that plane. Therefore without loss of generality, we can assume v and w are in the standard 𝐑 2 . ( This is where the dodgy bit is: technically, we need to check that the plane spanned by v and w is the same, geometrically (isometric to) the standard 𝐑 2 ; this would require us to appeal to something like Gram-Schmidt orthogonalisation, which you'll see in a later course on linear algebra.)

Further, we can rotate using our 2-by-2 rotation matrix so that v points in the positive x -direction. We can do this without changing the angle θ (because rotations preserve angles) and without changing the dot product (because rotations are orthogonal, so don't change dot products).

Vectors v (horizontal) and w (at an angle theta)

Finally, compute v w = v 1 w 1 + v 2 w 2 = v 1 w 1 because v 2 = 0 . We have v 1 = | v | because v points along the positive x -axis. We also have w 1 = | w | cos θ . So overall we have v w = | v | | w | cos θ .

If you were disturbed by my claim that rotations preserve angles without justification, you should have been even more disturbed by the fact that I didn't actually define angles in the plane at all. This is not intended as a complete rigorous derivation from axioms, but as an appeal to your geometric intuition.