6.3 Orthogonal projections

This post is a set of reading notes for *Linear Algebra and Its Applications*.

The orthogonal projection of a point in $\mathbb R^2$ onto a line through the origin has an important analogue in $\mathbb R^n$.

Given a vector $\boldsymbol y$ and a subspace $W$ in $\mathbb R^n$, there is a vector $\hat{\boldsymbol y}$ in $W$ such that (1) $\hat{\boldsymbol y}$ is the unique vector in $W$ for which $\boldsymbol y - \hat{\boldsymbol y}$ is orthogonal to $W$, and (2) $\hat{\boldsymbol y}$ is the unique vector in $W$ closest to $\boldsymbol y$. See Figure 1.

(Figure 1)
EXAMPLE 1
Let $\{\boldsymbol u_1,\dots,\boldsymbol u_5\}$ be an orthogonal basis for $\mathbb R^5$ and let

$$\boldsymbol y = c_1\boldsymbol u_1 + c_2\boldsymbol u_2 + c_3\boldsymbol u_3 + c_4\boldsymbol u_4 + c_5\boldsymbol u_5$$
Consider the subspace $W = \mathrm{Span}\{\boldsymbol u_1,\boldsymbol u_2\}$, and write $\boldsymbol y$ as the sum of a vector $\boldsymbol z_1$ in $W$ and a vector $\boldsymbol z_2$ in $W^\perp$.
SOLUTION

Write
$$\boldsymbol y = \underbrace{c_1\boldsymbol u_1 + c_2\boldsymbol u_2}_{\boldsymbol z_1} + \underbrace{c_3\boldsymbol u_3 + c_4\boldsymbol u_4 + c_5\boldsymbol u_5}_{\boldsymbol z_2}$$
Then $\boldsymbol z_1 = c_1\boldsymbol u_1 + c_2\boldsymbol u_2$ is in $\mathrm{Span}\{\boldsymbol u_1,\boldsymbol u_2\} = W$, and $\boldsymbol z_2 = c_3\boldsymbol u_3 + c_4\boldsymbol u_4 + c_5\boldsymbol u_5$ is in $W^\perp$, because each of $\boldsymbol u_3,\boldsymbol u_4,\boldsymbol u_5$ is orthogonal to both $\boldsymbol u_1$ and $\boldsymbol u_2$, and hence to every vector in $W$.
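As a quick numerical illustration of this kind of decomposition, here is a small numpy sketch; the basis vectors and $\boldsymbol y$ below are made-up values (and $\mathbb R^3$ stands in for $\mathbb R^5$ to keep it short), not the data of Example 1:

```python
import numpy as np

# A made-up orthogonal (not orthonormal) basis of R^3
u1 = np.array([1.0, 1.0, 0.0])
u2 = np.array([1.0, -1.0, 0.0])
u3 = np.array([0.0, 0.0, 2.0])
y  = np.array([3.0, 1.0, 4.0])

# Coefficients c_i = (y . u_i) / (u_i . u_i)
c = [y @ u / (u @ u) for u in (u1, u2, u3)]

z1 = c[0] * u1 + c[1] * u2          # component in W = Span{u1, u2}
z2 = c[2] * u3                      # component in W-perp

print(z1 + z2)                      # reproduces y
print(z2 @ u1, z2 @ u2)             # both 0: z2 is orthogonal to W
```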

The next theorem shows that the decomposition $\boldsymbol y = \boldsymbol z_1 + \boldsymbol z_2$ in Example 1 can be computed without having an orthogonal basis for $\mathbb R^n$. It is enough to have an orthogonal basis only for $W$.

THEOREM 8 (The Orthogonal Decomposition Theorem)
Let $W$ be a subspace of $\mathbb R^n$. Then each $\boldsymbol y$ in $\mathbb R^n$ can be written uniquely in the form
$$\boldsymbol y = \hat{\boldsymbol y} + \boldsymbol z \tag{1}$$
where $\hat{\boldsymbol y}$ is in $W$ and $\boldsymbol z$ is in $W^\perp$. In fact, if $\{\boldsymbol u_1,\dots,\boldsymbol u_p\}$ is any orthogonal basis of $W$, then
$$\hat{\boldsymbol y} = \frac{\boldsymbol y\cdot\boldsymbol u_1}{\boldsymbol u_1\cdot\boldsymbol u_1}\boldsymbol u_1 + \cdots + \frac{\boldsymbol y\cdot\boldsymbol u_p}{\boldsymbol u_p\cdot\boldsymbol u_p}\boldsymbol u_p \tag{2}$$
and $\boldsymbol z = \boldsymbol y - \hat{\boldsymbol y}$.
The vector $\hat{\boldsymbol y}$ in (1) is called the orthogonal projection of $\boldsymbol y$ onto $W$ and is often written as $\mathrm{proj}_W\boldsymbol y$. See Figure 2. When $W$ is a one-dimensional subspace, the formula for $\hat{\boldsymbol y}$ matches the formula given in Section 6.2.
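Formula (2) is straightforward to compute directly. Below is a minimal numpy sketch; the helper name `proj_W` and the test vectors are my own choices, with $\boldsymbol u_1\cdot\boldsymbol u_2 = 0$ so that the basis of $W$ is orthogonal:

```python
import numpy as np

def proj_W(y, basis):
    """Orthogonal projection of y onto W = Span(basis).

    `basis` must be a list of pairwise-orthogonal, nonzero vectors;
    each term is the 1-D projection (y.u / u.u) u from formula (2).
    """
    return sum((y @ u) / (u @ u) * u for u in basis)

u1 = np.array([2.0, 5.0, -1.0])
u2 = np.array([-2.0, 1.0, 1.0])     # orthogonal to u1
y  = np.array([1.0, 2.0, 3.0])

y_hat = proj_W(y, [u1, u2])
z = y - y_hat
print(y_hat)                        # the orthogonal projection of y onto W
print(z @ u1, z @ u2)               # both 0, so z is in W-perp
```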

(Figure 2)
PROOF

We may assume that $W$ is not the zero subspace, for otherwise $W^\perp = \mathbb R^n$ and (1) is simply $\boldsymbol y = \boldsymbol 0 + \boldsymbol y$. The next section will show that any nonzero subspace of $\mathbb R^n$ has an orthogonal basis.

Let $\{\boldsymbol u_1,\dots,\boldsymbol u_p\}$ be any orthogonal basis for $W$, and define $\hat{\boldsymbol y}$ by (2). Let $\boldsymbol z = \boldsymbol y - \hat{\boldsymbol y}$. Then

$$\boldsymbol z\cdot\boldsymbol u_1 = (\boldsymbol y - \hat{\boldsymbol y})\cdot\boldsymbol u_1 = \boldsymbol y\cdot\boldsymbol u_1 - \frac{\boldsymbol y\cdot\boldsymbol u_1}{\boldsymbol u_1\cdot\boldsymbol u_1}\,\boldsymbol u_1\cdot\boldsymbol u_1 - 0 - \cdots - 0 = \boldsymbol y\cdot\boldsymbol u_1 - \boldsymbol y\cdot\boldsymbol u_1 = 0$$
since $\boldsymbol u_1$ is orthogonal to $\boldsymbol u_2,\dots,\boldsymbol u_p$.
Thus $\boldsymbol z$ is orthogonal to $\boldsymbol u_1$. Similarly, $\boldsymbol z$ is orthogonal to each $\boldsymbol u_j$ in the basis for $W$. Hence $\boldsymbol z$ is orthogonal to every vector in $W$. That is, $\boldsymbol z$ is in $W^\perp$.

To show that the decomposition in (1) is unique, suppose $\boldsymbol y$ can also be written as $\boldsymbol y = \hat{\boldsymbol y}_1 + \boldsymbol z_1$, with $\hat{\boldsymbol y}_1$ in $W$ and $\boldsymbol z_1$ in $W^\perp$. Then $\hat{\boldsymbol y} + \boldsymbol z = \hat{\boldsymbol y}_1 + \boldsymbol z_1$, and so
$$\hat{\boldsymbol y} - \hat{\boldsymbol y}_1 = \boldsymbol z_1 - \boldsymbol z$$

This equality shows that the vector $\boldsymbol v = \hat{\boldsymbol y} - \hat{\boldsymbol y}_1$ is in $W$ and in $W^\perp$. Hence $\boldsymbol v\cdot\boldsymbol v = 0$, which shows that $\boldsymbol v = \boldsymbol 0$. This proves that $\hat{\boldsymbol y} = \hat{\boldsymbol y}_1$ and also $\boldsymbol z_1 = \boldsymbol z$.

EXERCISES
Suppose that $\{\boldsymbol u_1, \boldsymbol u_2\}$ is an orthogonal set of nonzero vectors in $\mathbb R^3$. How would you find an orthogonal basis of $\mathbb R^3$ that contains $\boldsymbol u_1$ and $\boldsymbol u_2$?
SOLUTION
First, find a vector $\boldsymbol v$ in $\mathbb R^3$ that is not in the subspace $W$ spanned by $\boldsymbol u_1$ and $\boldsymbol u_2$. Let $\boldsymbol u_3 = \boldsymbol v - \mathrm{proj}_W\boldsymbol v$. Then $\{\boldsymbol u_1, \boldsymbol u_2, \boldsymbol u_3\}$ is an orthogonal basis for $\mathbb R^3$.
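A minimal numpy sketch of this construction, using made-up vectors (any $\boldsymbol v$ outside $W$ works):

```python
import numpy as np

u1 = np.array([1.0, 1.0, 0.0])
u2 = np.array([1.0, -1.0, 0.0])      # orthogonal to u1
v  = np.array([1.0, 2.0, 3.0])       # any vector not in Span{u1, u2}

# u3 = v - proj_W v, where W = Span{u1, u2}
proj = (v @ u1) / (u1 @ u1) * u1 + (v @ u2) / (u2 @ u2) * u2
u3 = v - proj

print(u3)                # the component of v orthogonal to W
print(u3 @ u1, u3 @ u2)  # both 0, so {u1, u2, u3} is an orthogonal basis of R^3
```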

EXERCISE 23
Let $A$ be an $m \times n$ matrix. Prove that every vector $\boldsymbol x$ in $\mathbb R^n$ can be written in the form $\boldsymbol x = \boldsymbol p + \boldsymbol u$, where $\boldsymbol p$ is in $\mathrm{Row}\,A$ and $\boldsymbol u$ is in $\mathrm{Nul}\,A$. Also, show that if the equation $A\boldsymbol x = \boldsymbol b$ is consistent, then there is a unique $\boldsymbol p$ in $\mathrm{Row}\,A$ such that $A\boldsymbol p = \boldsymbol b$.
SOLUTION
By the Orthogonal Decomposition Theorem, each $\boldsymbol x$ in $\mathbb R^n$ can be written uniquely as $\boldsymbol x = \boldsymbol p + \boldsymbol u$, with $\boldsymbol p$ in $\mathrm{Row}\,A$ and $\boldsymbol u$ in $(\mathrm{Row}\,A)^\perp$. By Theorem 3 in Section 6.1, $\boldsymbol u$ is in $\mathrm{Nul}\,A$.
Next, suppose that $A\boldsymbol x = \boldsymbol b$ is consistent. Let $\boldsymbol x$ be a solution, and write $\boldsymbol x = \boldsymbol p + \boldsymbol u$ as above. Then $A\boldsymbol p = A(\boldsymbol x - \boldsymbol u) = A\boldsymbol x - A\boldsymbol u = \boldsymbol b - \boldsymbol 0 = \boldsymbol b$. So the equation $A\boldsymbol x = \boldsymbol b$ has at least one solution $\boldsymbol p$ in $\mathrm{Row}\,A$.
Finally, suppose that $\boldsymbol p$ and $\boldsymbol p_1$ are both in $\mathrm{Row}\,A$ and satisfy $A\boldsymbol x = \boldsymbol b$. Then $\boldsymbol p - \boldsymbol p_1$ is in $\mathrm{Nul}\,A$, because $A(\boldsymbol p - \boldsymbol p_1) = A\boldsymbol p - A\boldsymbol p_1 = \boldsymbol b - \boldsymbol b = \boldsymbol 0$.

The equations $\boldsymbol p = \boldsymbol p_1 + (\boldsymbol p - \boldsymbol p_1)$ and $\boldsymbol p = \boldsymbol p + \boldsymbol 0$ both decompose $\boldsymbol p$ as the sum of a vector in $\mathrm{Row}\,A$ and a vector in $(\mathrm{Row}\,A)^\perp = \mathrm{Nul}\,A$. By the uniqueness of the orthogonal decomposition (Theorem 8), $\boldsymbol p_1 = \boldsymbol p$, so $\boldsymbol p$ is unique.
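A small numpy sketch of this decomposition; the matrix $A$ and the vector $\boldsymbol x$ are made up, and the orthonormal basis for $\mathrm{Row}\,A$ is obtained from the SVD rather than by any method from this section:

```python
import numpy as np

A = np.array([[1.0, 2.0, 1.0],
              [2.0, 4.0, 2.0]])          # rank 1, so Row A is a line in R^3
x = np.array([3.0, 0.0, 1.0])
b = A @ x                                # Ax = b is consistent by construction

# Orthonormal basis for Row A: rows of Vt with nonzero singular values
_, s, Vt = np.linalg.svd(A)
V = Vt[: np.sum(s > 1e-12)]              # shape (rank, n)

p = V.T @ (V @ x)                        # projection of x onto Row A
u = x - p                                # lies in (Row A)-perp = Nul A

print(np.allclose(A @ u, 0))             # True: u is in Nul A
print(np.allclose(A @ p, b))             # True: p alone already solves Ax = b
```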

A Geometric Interpretation of the Orthogonal Projection

When $W$ is a one-dimensional subspace, formula (2) for $\mathrm{proj}_W \boldsymbol y$ contains just one term. Thus, when $\dim W > 1$, each term in (2) is itself an orthogonal projection of $\boldsymbol y$ onto a one-dimensional subspace spanned by one of the $\boldsymbol u$'s in the basis for $W$. Figure 3 illustrates this when $W$ is a subspace of $\mathbb R^3$ spanned by $\boldsymbol u_1$ and $\boldsymbol u_2$.
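To see the term-by-term interpretation numerically, the sketch below reuses the made-up orthogonal pair from the earlier sketch and prints each one-dimensional projection separately:

```python
import numpy as np

u1 = np.array([2.0, 5.0, -1.0])
u2 = np.array([-2.0, 1.0, 1.0])            # orthogonal to u1
y  = np.array([1.0, 2.0, 3.0])

# Each term of formula (2) is the Section 6.2 projection onto one spanning vector
proj_onto_u1 = (y @ u1) / (u1 @ u1) * u1   # projection onto Span{u1}
proj_onto_u2 = (y @ u2) / (u2 @ u2) * u2   # projection onto Span{u2}

print(proj_onto_u1)                        # [ 0.6  1.5 -0.3]
print(proj_onto_u2)                        # [-1.   0.5  0.5]
print(proj_onto_u1 + proj_onto_u2)         # proj_W y, the sum of the 1-D projections
```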

(Figure 3)

Properties of Orthogonal Projections

If $\boldsymbol y$ is in $W = \mathrm{Span}\{\boldsymbol u_1,\dots,\boldsymbol u_p\}$, then $\mathrm{proj}_W \boldsymbol y = \boldsymbol y$.
This fact also follows from the next theorem.

THEOREM 9 (The Best Approximation Theorem)
Let $W$ be a subspace of $\mathbb R^n$, let $\boldsymbol y$ be any vector in $\mathbb R^n$, and let $\hat{\boldsymbol y}$ be the orthogonal projection of $\boldsymbol y$ onto $W$. Then $\hat{\boldsymbol y}$ is the closest point in $W$ to $\boldsymbol y$, in the sense that
$$\|\boldsymbol y - \hat{\boldsymbol y}\| < \|\boldsymbol y - \boldsymbol v\| \tag{3}$$
for all $\boldsymbol v$ in $W$ distinct from $\hat{\boldsymbol y}$.

The vector $\hat{\boldsymbol y}$ in Theorem 9 is called the best approximation to $\boldsymbol y$ by elements of $W$.

Later sections in the text will examine problems where a given $\boldsymbol y$ must be replaced, or approximated, by a vector $\boldsymbol v$ in some fixed subspace $W$. The distance $\|\boldsymbol y - \boldsymbol v\|$ can be regarded as the "error" of using $\boldsymbol v$ in place of $\boldsymbol y$. Theorem 9 says that this error is minimized when $\boldsymbol v = \hat{\boldsymbol y}$.
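A minimal numpy sketch of this minimization property, again with the same made-up vectors; the competing vectors $\boldsymbol v$ are random points of $W$:

```python
import numpy as np

rng = np.random.default_rng(0)

u1 = np.array([2.0, 5.0, -1.0])
u2 = np.array([-2.0, 1.0, 1.0])
y  = np.array([1.0, 2.0, 3.0])

y_hat = (y @ u1) / (u1 @ u1) * u1 + (y @ u2) / (u2 @ u2) * u2
best_error = np.linalg.norm(y - y_hat)

# Any other point v of W gives an error ||y - v|| at least as large
for _ in range(5):
    a, b = rng.normal(size=2)
    v = a * u1 + b * u2
    print(best_error <= np.linalg.norm(y - v))   # always True
```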

Inequality (3) leads to a new proof that $\hat{\boldsymbol y}$ does not depend on the particular orthogonal basis used to compute it.

PROOF
Take $\boldsymbol v$ in $W$ distinct from $\hat{\boldsymbol y}$. See Figure 4. Then $\boldsymbol y - \hat{\boldsymbol y}$ is orthogonal to $\hat{\boldsymbol y} - \boldsymbol v$ (which is in $W$). Since

$$\boldsymbol y - \boldsymbol v = (\boldsymbol y - \hat{\boldsymbol y}) + (\hat{\boldsymbol y} - \boldsymbol v)$$
the Pythagorean Theorem gives

$$\|\boldsymbol y - \boldsymbol v\|^2 = \|\boldsymbol y - \hat{\boldsymbol y}\|^2 + \|\hat{\boldsymbol y} - \boldsymbol v\|^2$$
Now $\|\hat{\boldsymbol y} - \boldsymbol v\| > 0$, and so inequality (3) follows immediately.

(Figure 4)

The final theorem in this section shows how formula (2) for $\mathrm{proj}_W \boldsymbol y$ is simplified when the basis for $W$ is an orthonormal set.

THEOREM 10
If $\{\boldsymbol u_1,\dots,\boldsymbol u_p\}$ is an orthonormal basis for a subspace $W$ of $\mathbb R^n$, then
$$\mathrm{proj}_W \boldsymbol y = (\boldsymbol y\cdot\boldsymbol u_1)\boldsymbol u_1 + (\boldsymbol y\cdot\boldsymbol u_2)\boldsymbol u_2 + \cdots + (\boldsymbol y\cdot\boldsymbol u_p)\boldsymbol u_p \tag{4}$$
If $U = [\,\boldsymbol u_1\ \boldsymbol u_2\ \cdots\ \boldsymbol u_p\,]$, then
$$\mathrm{proj}_W \boldsymbol y = UU^T\boldsymbol y \quad \text{for all } \boldsymbol y \text{ in } \mathbb R^n \tag{5}$$

Suppose $U$ is an $n \times p$ matrix with orthonormal columns, and let $W$ be the column space of $U$. Then

$$U^TU\boldsymbol x = \boldsymbol x \quad \text{for all } \boldsymbol x \text{ in } \mathbb R^p$$
$$UU^T\boldsymbol y = \mathrm{proj}_W \boldsymbol y \quad \text{for all } \boldsymbol y \text{ in } \mathbb R^n$$
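A short numpy sketch of these two identities; the matrix $U$ is produced with `np.linalg.qr` purely to get orthonormal columns, and the data is made up:

```python
import numpy as np

rng = np.random.default_rng(1)

# Build an n x p matrix U with orthonormal columns spanning W = Col U
A = rng.normal(size=(5, 2))         # any full-rank 5 x 2 matrix
U, _ = np.linalg.qr(A)              # U has orthonormal columns, Col U = Col A

x = rng.normal(size=2)
y = rng.normal(size=5)

print(np.allclose(U.T @ U @ x, x))  # U^T U x = x  (U^T U is the 2 x 2 identity)

proj_y = U @ (U.T @ y)              # UU^T y = proj_W y
print(np.allclose(U.T @ (y - proj_y), 0))   # y - proj_W y is orthogonal to W
```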
EXAMPLE
Let $W$ be a subspace of $\mathbb R^n$. Let $\boldsymbol x$ and $\boldsymbol y$ be vectors in $\mathbb R^n$, and let $\boldsymbol z = \boldsymbol x + \boldsymbol y$. If $\boldsymbol u$ is the projection of $\boldsymbol x$ onto $W$ and $\boldsymbol v$ is the projection of $\boldsymbol y$ onto $W$, show that $\boldsymbol u + \boldsymbol v$ is the projection of $\boldsymbol z$ onto $W$.
SOLUTION
Let $U$ be a matrix whose columns form an orthonormal basis for $W$. Then
$$\begin{aligned}\mathrm{proj}_W\boldsymbol z &= UU^T\boldsymbol z \\ &= UU^T(\boldsymbol x + \boldsymbol y) \\ &= UU^T\boldsymbol x + UU^T\boldsymbol y \\ &= \mathrm{proj}_W\boldsymbol x + \mathrm{proj}_W\boldsymbol y \\ &= \boldsymbol u + \boldsymbol v\end{aligned}$$
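The same computation is easy to check numerically; a minimal sketch with made-up data, reusing a QR-generated $U$ for the orthonormal basis:

```python
import numpy as np

rng = np.random.default_rng(2)
U, _ = np.linalg.qr(rng.normal(size=(4, 2)))   # orthonormal columns spanning W

x, y = rng.normal(size=4), rng.normal(size=4)
proj = lambda w: U @ (U.T @ w)                 # proj_W via formula (5)

print(np.allclose(proj(x + y), proj(x) + proj(y)))   # True: projection is linear
```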


Reposted from blog.csdn.net/weixin_42437114/article/details/108883427