Example: O(n)

The orthogonal group

Definition

Let O ( n ) denote the group of n -by- n orthogonal matrices. Recall that a matrix M is called orthogonal if M T M = I . Geometrically, this means that M preserves dot products. This is because v w = v T w , so ( M v ) ( M w ) = v T M T M w , which equals v T w for all v and w if and only if M T M = I . Preserving dot products means you preserve lengths of and angles between vectors, so orthogonal matrices represent things like rotations and reflections.

Lie algebra

By definition, the Lie algebra 𝔬 ( n ) of O ( n ) is 𝔬 ( n ) = { X 𝔤 𝔩 ( n , 𝐑 ) : exp ( t X ) O ( n ) t 𝐑 } . This means ( exp ( t X ) ) T exp ( t X ) = I . We can take the transpose term-by-term in the exponential power series, and, since ( M N ) T = N T M T , we get ( M n ) T = ( M T ) n , so ( exp ( t X ) ) T = exp ( t X T ) .

Therefore exp ( t X T ) = ( exp ( t X ) ) - 1 and we've seen that ( exp ( t X ) ) - 1 = exp ( - t X ) , so we deduce that X 𝔬 ( n )  if and only if  exp ( t X T ) = exp ( - t X ) t 𝐑 .

Certainly if X T = - X then exp ( t X T ) = exp ( - t X ) , so X 𝔬 ( n ) . Therefore any antisymmetric matrix is in the Lie algebra of the orthogonal group. It will turn out that this is everything. To see this, note that by taking t to be sufficiently small we can assume that t X T and - t X lie in our favourite neighbourhood of the zero matrix and exp ( t X T ) and exp ( - t X ) are in our favourite neighbourhood of the identity. Our favourite neighbourhood of the identity is the one on which the local logarithm is defined, so we can take the logarithm of each side and deduce that t X T = - t X for all sufficiently small t , hence X T = - X .

Remark:

This is why we require that exp ( t X ) O ( n ) for all t : we want to be able to put ourselves near to the identity to take logs.

We have now shown that:

Lemma:

𝔬 ( n ) is the set of antisymmetric matrices, that is the matrices X such that X T = - X .

Example

Example:

We saw that exp ( 0 - θ θ 0 ) = ( cos θ - sin θ sin θ cos θ ) . Here, X = ( 0 - θ θ 0 ) is antisymmetric and its exponential is a rotation matrix (hence orthogonal).

Additional remarks

Remark:

Another, nicer, way to deduce that exp ( t X T ) = exp ( - t X ) for all t implies X T = - X , is to differentiate the equation exp ( t X T ) exp ( t X ) = I with respect to t . The right-hand side is constant, so its derivative is zero. The left-hand side can be evaluated using the product rule, and we get: 0 = d d t ( exp ( t X T ) exp ( t X ) ) = X T exp ( t X T ) exp ( t X ) + exp ( t X T ) X exp ( t X ) for all t . If we now set t = 0 we get 0 = X T + X , so X T = - X .

Remark:

Note that O ( n ) is topologically closed. To see this, consider the map F : 𝔤 𝔩 ( n , 𝐑 ) 𝔤 𝔩 ( n , 𝐑 ) defined by F ( M ) = M T M . We have O ( n ) = F - 1 ( I ) by definition. Note that F is continuous because the matrix entries of M T M are polynomials in the matrix entries of M (and polynomials are continuous). Therefore if M k O ( n ) is a sequence of orthogonal matrices converging to a matrix M then F ( M k ) = I for all k , so lim k F ( M k ) = I , and since F is continuous we have F ( M ) = lim k F ( M k ) = I , so M O ( n ) .

Remark:

Whenever your group is cut out by a continuous equation like this it will be a topologically closed group of matrices.

Pre-class exercise

Exercise:

If X , Y 𝔬 ( n ) then [ X , Y ] 𝔬 ( n ) .