7.1 Diagonalization of symmetric matrices

This post is a set of reading notes on *Linear algebra and its applications*.

Diagonalization of symmetric matrices

A symmetric matrix is a matrix $A$ such that $A^T = A$. Such a matrix is necessarily square.

To begin the study of symmetric matrices, it is helpful to review the diagonalization process of Section 5.3.

THEOREM 1
If $A$ is symmetric, then any two eigenvectors from different eigenspaces are orthogonal.
PROOF
Let $\boldsymbol v_1$ and $\boldsymbol v_2$ be eigenvectors that correspond to distinct eigenvalues, say, $\lambda_1$ and $\lambda_2$.

$$\lambda_1\boldsymbol v_1\cdot\boldsymbol v_2=(\lambda_1\boldsymbol v_1)^T\boldsymbol v_2=(A\boldsymbol v_1)^T\boldsymbol v_2=\boldsymbol v_1^TA^T\boldsymbol v_2=\boldsymbol v_1^T(A\boldsymbol v_2)=\boldsymbol v_1^T(\lambda_2\boldsymbol v_2)=\lambda_2\boldsymbol v_1\cdot\boldsymbol v_2$$
Hence $(\lambda_1-\lambda_2)\boldsymbol v_1\cdot\boldsymbol v_2=0$.
Since $\lambda_1\neq\lambda_2$, $\boldsymbol v_1\cdot\boldsymbol v_2=0$.
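As a quick sanity check, here is a minimal numpy sketch of Theorem 1; the $2\times 2$ matrix is an arbitrary choice of mine, not from the book:

```python
import numpy as np

# An arbitrary symmetric matrix for the check
A = np.array([[1., 2.],
              [2., 1.]])

lam, V = np.linalg.eigh(A)   # eigh is numpy's eigensolver for symmetric matrices

# The eigenvalues -1 and 3 are distinct, so by Theorem 1 the
# corresponding eigenvectors must be orthogonal.
v1, v2 = V[:, 0], V[:, 1]
print(lam)                        # [-1.  3.]
print(np.isclose(v1 @ v2, 0.0))   # True
```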

An $n\times n$ matrix $A$ is said to be orthogonally diagonalizable if there are an orthogonal matrix $P$ (with $P^{-1}=P^T$) and a diagonal matrix $D$ such that

$$A=PDP^T=PDP^{-1}\qquad(1)$$
Such a diagonalization requires $n$ linearly independent and orthonormal eigenvectors.

If $A$ is orthogonally diagonalizable, then

$$A^T=(PDP^T)^T=P^{TT}D^TP^T=PDP^T=A$$
Thus $A$ is symmetric! Theorem 2 below shows that, conversely, every symmetric matrix is orthogonally diagonalizable. The main idea for a proof will be given after Theorem 3.
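A minimal numpy sketch of this direction; the random construction of $P$ and $D$ is my own, just to exercise the identity above:

```python
import numpy as np

rng = np.random.default_rng(0)

# Build an arbitrary orthogonal P via the QR factorization of a random
# matrix, and an arbitrary diagonal D.
P, _ = np.linalg.qr(rng.standard_normal((4, 4)))
D = np.diag(rng.standard_normal(4))

A = P @ D @ P.T                          # orthogonally diagonalizable by construction
print(np.allclose(A, A.T))               # True: A is forced to be symmetric
print(np.allclose(P.T @ P, np.eye(4)))   # True: P^{-1} = P^T
```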

THEOREM 2
An $n\times n$ matrix $A$ is orthogonally diagonalizable if and only if $A$ is a symmetric matrix.
This theorem is rather amazing, because the work in Chapter 5 would suggest that it is usually impossible to tell when a matrix is diagonalizable. But this is not the case for symmetric matrices.

EXAMPLE 3
Orthogonally diagonalize the matrix

$$A=\begin{bmatrix}3&-2&4\\-2&6&2\\4&2&3\end{bmatrix},$$
whose characteristic equation is
$$0=-\lambda^3+12\lambda^2-21\lambda-98=-(\lambda-7)^2(\lambda+2)$$
SOLUTION

The usual calculations produce a basis for each eigenspace:
$$\lambda=7:\ \boldsymbol v_1=\begin{bmatrix}1\\0\\1\end{bmatrix},\ \boldsymbol v_2=\begin{bmatrix}-1/2\\1\\0\end{bmatrix};\qquad\lambda=-2:\ \boldsymbol v_3=\begin{bmatrix}-1\\-1/2\\1\end{bmatrix}$$

Although $\boldsymbol v_1$ and $\boldsymbol v_2$ are linearly independent, they are not orthogonal. Subtract from $\boldsymbol v_2$ its projection onto $\boldsymbol v_1$ to produce an orthogonal set:

$$\boldsymbol z_2=\boldsymbol v_2-\frac{\boldsymbol v_2\cdot\boldsymbol v_1}{\boldsymbol v_1\cdot\boldsymbol v_1}\boldsymbol v_1=\begin{bmatrix}-1/2\\1\\0\end{bmatrix}-\frac{-1/2}{2}\begin{bmatrix}1\\0\\1\end{bmatrix}=\begin{bmatrix}-1/4\\1\\1/4\end{bmatrix}$$
Then $\{\boldsymbol v_1,\boldsymbol z_2\}$ is an orthogonal set in the eigenspace for $\lambda=7$. (Note that $\boldsymbol z_2$ is a linear combination of the eigenvectors $\boldsymbol v_1$ and $\boldsymbol v_2$, so $\boldsymbol z_2$ is in the eigenspace.)

Normalize $\boldsymbol v_1$ and $\boldsymbol z_2$ to obtain the following orthonormal basis for the eigenspace for $\lambda=7$:

$$\boldsymbol u_1=\begin{bmatrix}1/\sqrt2\\0\\1/\sqrt2\end{bmatrix},\qquad\boldsymbol u_2=\begin{bmatrix}-1/\sqrt{18}\\4/\sqrt{18}\\1/\sqrt{18}\end{bmatrix}$$
An orthonormal basis for the eigenspace for $\lambda=-2$ is

$$\boldsymbol u_3=\frac{1}{\|\boldsymbol v_3\|}\boldsymbol v_3=\begin{bmatrix}-2/3\\-1/3\\2/3\end{bmatrix}$$
By Theorem 1, $\boldsymbol u_3$ is orthogonal to the other eigenvectors $\boldsymbol u_1$ and $\boldsymbol u_2$. Hence $\{\boldsymbol u_1,\boldsymbol u_2,\boldsymbol u_3\}$ is an orthonormal set. Let

$$P=\begin{bmatrix}\boldsymbol u_1&\boldsymbol u_2&\boldsymbol u_3\end{bmatrix}=\begin{bmatrix}1/\sqrt2&-1/\sqrt{18}&-2/3\\0&4/\sqrt{18}&-1/3\\1/\sqrt2&1/\sqrt{18}&2/3\end{bmatrix},\qquad D=\begin{bmatrix}7&0&0\\0&7&0\\0&0&-2\end{bmatrix}$$
Then $P$ orthogonally diagonalizes $A$, and $A=PDP^{-1}$.
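The whole example can be checked with a short numpy sketch, using the matrix and the hand-computed eigenvectors from above:

```python
import numpy as np

# The matrix and the hand-computed eigenvectors from Example 3
A = np.array([[3., -2., 4.],
              [-2., 6., 2.],
              [4., 2., 3.]])
v1 = np.array([1., 0., 1.])      # lambda = 7
v2 = np.array([-0.5, 1., 0.])    # lambda = 7
v3 = np.array([-1., -0.5, 1.])   # lambda = -2

# Gram-Schmidt step: z2 = v2 - proj_{v1} v2 = (-1/4, 1, 1/4)
z2 = v2 - (v2 @ v1) / (v1 @ v1) * v1

# Normalize to get the orthonormal columns of P
u1, u2, u3 = (v / np.linalg.norm(v) for v in (v1, z2, v3))
P = np.column_stack([u1, u2, u3])
D = np.diag([7., 7., -2.])

print(np.allclose(P.T @ P, np.eye(3)))   # True: P is orthogonal
print(np.allclose(P @ D @ P.T, A))       # True: A = P D P^{-1} = P D P^T
```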

The Spectral Theorem

The set of eigenvalues of a matrix $A$ is sometimes called the *spectrum* of $A$, and the following description of the eigenvalues is called a *spectral theorem*.

THEOREM 3 (The Spectral Theorem for Symmetric Matrices)
An $n\times n$ symmetric matrix $A$ has the following properties:
a. $A$ has $n$ real eigenvalues, counting multiplicities.
b. The dimension of the eigenspace for each eigenvalue $\lambda$ equals the multiplicity of $\lambda$ as a root of the characteristic equation.
c. The eigenspaces are mutually orthogonal, in the sense that eigenvectors corresponding to different eigenvalues are orthogonal.
d. $A$ is orthogonally diagonalizable.

  • Part (a) follows from the Supplementary Exercises in Section 5.5.
  • Part (b) follows easily from part (d).
  • Part (c) is Theorem 1.
  • Because of (a), a proof of (d) can be found in the Appendix: proof of Theorem 3 (d).

Spectral Decomposition

Suppose $A=PDP^{-1}$, where the columns of $P$ are orthonormal eigenvectors $\boldsymbol u_1,...,\boldsymbol u_n$ of $A$ and the corresponding eigenvalues $\lambda_1,...,\lambda_n$ are in the diagonal matrix $D$. Then, since $P^{-1}=P^T$,

$$A=PDP^T=\begin{bmatrix}\boldsymbol u_1&\cdots&\boldsymbol u_n\end{bmatrix}\begin{bmatrix}\lambda_1&&0\\&\ddots&\\0&&\lambda_n\end{bmatrix}\begin{bmatrix}\boldsymbol u_1^T\\\vdots\\\boldsymbol u_n^T\end{bmatrix}=\begin{bmatrix}\lambda_1\boldsymbol u_1&\cdots&\lambda_n\boldsymbol u_n\end{bmatrix}\begin{bmatrix}\boldsymbol u_1^T\\\vdots\\\boldsymbol u_n^T\end{bmatrix}$$
Using the column-row expansion of the product, we can write
$$A=\lambda_1\boldsymbol u_1\boldsymbol u_1^T+\lambda_2\boldsymbol u_2\boldsymbol u_2^T+\cdots+\lambda_n\boldsymbol u_n\boldsymbol u_n^T\qquad(2)$$
This representation of $A$ is called a spectral decomposition of $A$ because it breaks up $A$ into pieces determined by the spectrum (eigenvalues) of $A$.

  • Each term in (2) is an $n\times n$ matrix of rank 1. For example, every column of $\lambda_1\boldsymbol u_1\boldsymbol u_1^T$ is a multiple of $\boldsymbol u_1$.
  • Furthermore, each matrix $\boldsymbol u_j\boldsymbol u_j^T$ is a projection matrix in the sense that for each $\boldsymbol x$ in $\mathbb R^n$, the vector $(\boldsymbol u_j\boldsymbol u_j^T)\boldsymbol x$ is the orthogonal projection of $\boldsymbol x$ onto the subspace spanned by $\boldsymbol u_j$ (see the numerical sketch after this list).
    PROOF
    $(\boldsymbol u_j\boldsymbol u_j^T)\boldsymbol x=\boldsymbol u_j(\boldsymbol u_j^T\boldsymbol x)=(\boldsymbol u_j^T\boldsymbol x)\boldsymbol u_j=(\boldsymbol u_j\cdot\boldsymbol x)\boldsymbol u_j$ ($\boldsymbol u_j^T\boldsymbol x$ is a scalar). Since $\boldsymbol u_j$ is a unit vector, this is the orthogonal projection of $\boldsymbol x$ onto $\boldsymbol u_j$.
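A numpy sketch of the decomposition (2) and of the projection property, reusing the matrix from Example 3:

```python
import numpy as np

A = np.array([[3., -2., 4.],
              [-2., 6., 2.],
              [4., 2., 3.]])

lam, U = np.linalg.eigh(A)   # columns of U: orthonormal eigenvectors of A

# Equation (2): A is the sum of the rank-1 pieces lambda_j u_j u_j^T
pieces = [lam[j] * np.outer(U[:, j], U[:, j]) for j in range(len(lam))]
print(np.allclose(sum(pieces), A))       # True

# Each u_j u_j^T is a projection matrix: (u_j u_j^T) x = (u_j . x) u_j
x = np.array([1., 2., 3.])
u = U[:, 0]
print(np.allclose(np.outer(u, u) @ x, (u @ x) * u))   # True
```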

EXERCISE
Let $A$ be an $n\times n$ symmetric matrix of rank $r$. Explain why the spectral decomposition of $A$ represents $A$ as the sum of $r$ rank 1 matrices.
SOLUTION
Since $A$ is symmetric, the Spectral Theorem says the dimension of the eigenspace for $\lambda=0$, which is exactly $\mathrm{Nul}\,A$, equals the multiplicity of $0$ as an eigenvalue. Because $\dim\mathrm{Nul}\,A=n-r$, exactly $n-r$ of the eigenvalues $\lambda_1,...,\lambda_n$ are zero, and the corresponding terms of (2) vanish. The decomposition therefore reduces to a sum of the $r$ rank 1 matrices $\lambda_j\boldsymbol u_j\boldsymbol u_j^T$ with $\lambda_j\neq0$.
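A small numerical illustration; the rank-2 symmetric matrix below is my own choice:

```python
import numpy as np

# A symmetric 3x3 matrix of rank r = 2 (its first two rows are equal)
B = np.array([[1., 1., 0.],
              [1., 1., 0.],
              [0., 0., 2.]])

lam, U = np.linalg.eigh(B)
print(np.round(lam, 10))     # [0. 2. 2.]: exactly n - r = 1 zero eigenvalue

# Only the r = 2 terms with nonzero eigenvalue survive in (2)
terms = [lam[j] * np.outer(U[:, j], U[:, j])
         for j in range(3) if abs(lam[j]) > 1e-12]
print(len(terms))                    # 2
print(np.allclose(sum(terms), B))    # True
```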

Appendix: proof of Theorem 3 (d)

The Schur factorization of an $n\times n$ matrix $A$ is a factorization of the form $A=URU^T$, where $U$ is an orthogonal matrix and $R$ is an $n\times n$ upper triangular matrix.

THEOREM
Let $A$ be an $n\times n$ matrix with $n$ real eigenvalues, counting multiplicities, denoted by $\lambda_1,...,\lambda_n$. Then $A$ admits a (real) Schur factorization.
PROOF
Parts (a) and (b) show the key ideas in the proof. The rest of the proof amounts to repeating (a) and (b) for successively smaller matrices, and then piecing together the results.
a. Let $\boldsymbol u_1$ be a unit eigenvector corresponding to $\lambda_1$, let $\boldsymbol u_2,...,\boldsymbol u_n$ be any other vectors such that $\{\boldsymbol u_1,...,\boldsymbol u_n\}$ is an orthonormal basis for $\mathbb R^n$, and then let $U=\begin{bmatrix}\boldsymbol u_1&\boldsymbol u_2&\cdots&\boldsymbol u_n\end{bmatrix}$. The first column of $U^TAU$ is $U^TA\boldsymbol u_1=\lambda_1U^T\boldsymbol u_1=\lambda_1\boldsymbol e_1$, where $\boldsymbol e_1$ is the first column of the $n\times n$ identity matrix (the last equality holds because $\boldsymbol u_i^T\boldsymbol u_1$ is $1$ for $i=1$ and $0$ otherwise).
b. Part (a) implies that $U^TAU$ has the form shown below.

$$U^TAU=\begin{bmatrix}\lambda_1&\boldsymbol x^T\\\boldsymbol 0&A_1\end{bmatrix},\qquad\text{where }A_1\text{ is }(n-1)\times(n-1)$$

Since
$$\begin{aligned}\det(U^TAU-\lambda I)&=\det(U^TAU-\lambda U^TU)=\det(U^T(A-\lambda I)U)\\&=\det(U^T)\det(A-\lambda I)\det(U)=\det(U^{-1})\det(A-\lambda I)\det(U)=\det(A-\lambda I),\end{aligned}$$
the characteristic polynomials of $U^TAU$ and $A$ are the same. Thus $U^TAU$ and $A$ have the same eigenvalues, which shows that the eigenvalues of $A_1$ are $\lambda_2,...,\lambda_n$.
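A quick numerical check of steps (a) and (b), reusing the matrix from Example 3 with $\lambda_1=7$; the QR-based completion of $\boldsymbol u_1$ to an orthonormal basis is one convenient choice of mine:

```python
import numpy as np

A = np.array([[3., -2., 4.],
              [-2., 6., 2.],
              [4., 2., 3.]])
u1 = np.array([1., 0., 1.]) / np.sqrt(2.)   # unit eigenvector for lambda_1 = 7

# Extend u1 to an orthonormal basis of R^3: the QR factorization of a
# full-rank matrix whose first column is u1 gives an orthogonal U whose
# first column spans u1 (the sign does not matter for the block form).
U, _ = np.linalg.qr(np.column_stack([u1, np.eye(3)[:, :2]]))

B = U.T @ A @ U
print(np.round(B, 10))               # first column is 7 * e_1
print(np.linalg.eigvals(B[1:, 1:]))  # A_1 has the remaining eigenvalues 7, -2
```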

Similar to (a), let $\boldsymbol u_2'$ be a unit eigenvector of $A_1$ corresponding to $\lambda_2$, let $\boldsymbol u_3',...,\boldsymbol u_n'$ be any other vectors such that $\{\boldsymbol u_2',...,\boldsymbol u_n'\}$ is an orthonormal basis for $\mathbb R^{n-1}$, and then let $U_1=\begin{bmatrix}\boldsymbol u_2'&\boldsymbol u_3'&\cdots&\boldsymbol u_n'\end{bmatrix}$. As before, the first column of $U_1^TA_1U_1$ is $\lambda_2\boldsymbol e_1'$, where $\boldsymbol e_1'$ is the first column of the $(n-1)\times(n-1)$ identity matrix. So $U_1^TA_1U_1$ has a form similar to $U^TAU$.

Suppose $U^TAU=\begin{bmatrix}\lambda_1&\boldsymbol x^T\\\boldsymbol 0&A_1\end{bmatrix}$; then
$$\begin{aligned}\begin{bmatrix}1&\boldsymbol 0\\\boldsymbol 0&U_1^T\end{bmatrix}U^TAU\begin{bmatrix}1&\boldsymbol 0\\\boldsymbol 0&U_1\end{bmatrix}&=\begin{bmatrix}1&\boldsymbol 0\\\boldsymbol 0&U_1^T\end{bmatrix}\begin{bmatrix}\lambda_1&\boldsymbol x^T\\\boldsymbol 0&A_1\end{bmatrix}\begin{bmatrix}1&\boldsymbol 0\\\boldsymbol 0&U_1\end{bmatrix}\\&=\begin{bmatrix}\lambda_1&\boldsymbol x^TU_1\\\boldsymbol 0&U_1^TA_1U_1\end{bmatrix}\end{aligned}$$

Let $U'=U\begin{bmatrix}1&\boldsymbol 0\\\boldsymbol 0&U_1\end{bmatrix}$; then $U'$ is an orthogonal matrix, being a product of orthogonal matrices. So
$$U'^TAU'=\begin{bmatrix}\lambda_1&*&*&\cdots&*\\0&\lambda_2&*&\cdots&*\\0&0&&&\\\vdots&\vdots&&A_2&\\0&0&&&\end{bmatrix}$$

Continuing this process, we finally obtain a (real) Schur factorization of $A$.
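Assuming scipy is available, scipy.linalg.schur computes a real Schur factorization directly. A sketch; the test matrix is a random similarity transform of an upper triangular matrix, my own choice to force real eigenvalues without symmetry:

```python
import numpy as np
from scipy.linalg import schur

rng = np.random.default_rng(1)

# A matrix with n real eigenvalues but no symmetry: similar to an upper
# triangular T0, so its eigenvalues are the (real) diagonal of T0.
T0 = np.triu(rng.standard_normal((4, 4)))
S = rng.standard_normal((4, 4))          # generically invertible
A = S @ T0 @ np.linalg.inv(S)

# Real Schur factorization A = U R U^T
R, U = schur(A, output='real')

print(np.allclose(A, U @ R @ U.T))       # True
print(np.allclose(U.T @ U, np.eye(4)))   # True: U is orthogonal
print(np.allclose(R, np.triu(R)))        # True: with distinct real eigenvalues,
                                         # R is genuinely upper triangular
```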


With the theorem above, the proof of Theorem 3 (d) is quite easy.

Let $A$ be a symmetric matrix. Since $A$ has $n$ real eigenvalues, counting multiplicities, $A$ has a real Schur factorization $A=URU^T$. Then $A^T=UR^TU^T$ and $A=URU^T$, so $R^T=U^TA^TU=U^TAU=R$, which means the upper triangular matrix $R$ is in fact diagonal, with the eigenvalues of $A$ on its main diagonal.

Thus $A=URU^{-1}$, where $U$ is an orthogonal matrix and $R$ is a diagonal matrix. So $A$ is orthogonally diagonalizable.
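Again assuming scipy, a sketch of this last argument: for a symmetric matrix, the computed Schur factor $R$ does come out diagonal.

```python
import numpy as np
from scipy.linalg import schur

rng = np.random.default_rng(2)
M = rng.standard_normal((4, 4))
A = M + M.T                      # an arbitrary symmetric matrix

R, U = schur(A, output='real')

# R = R^T forces the upper triangular factor to be diagonal, so the
# Schur factorization of a symmetric matrix is an orthogonal diagonalization.
print(np.allclose(R, np.diag(np.diag(R))))   # True
print(np.allclose(A, U @ R @ U.T))           # True
```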
