
Singular Value Decomposition

COS 323

Underconstrained Least Squares


What if you have fewer data points than parameters in your function?
- Intuitively, can't do standard least squares
- Recall that the solution takes the form A^T A x = A^T b
- When A has more columns than rows, A^T A is singular: can't take its inverse, etc.

Underconstrained Least Squares


More subtle version: more data points than unknowns, but the data poorly constrains the function
- Example: fitting to y = ax^2 + bx + c (if, say, all the samples lie in a narrow range of x, the coefficients are poorly determined)

Underconstrained Least Squares


Problem: if the problem is very close to singular, roundoff error can have a huge effect
- Even on well-determined values!

Can detect this:
- Uncertainty proportional to covariance C = (A^T A)^-1
- In other words, unstable if A^T A has small values
- More precisely, trouble if x^T (A^T A) x is small for some x

Idea: if part of the solution is unstable, set that part of the answer to 0
- Avoids corrupting the good parts of the answer

Singular Value Decomposition (SVD)

Handy mathematical technique that has application to many problems

Given any m×n matrix A, there is an algorithm to find matrices U, V, and W such that

A = U W V^T

- U is m×n and orthonormal
- W is n×n and diagonal
- V is n×n and orthonormal

SVD

W is the diagonal matrix of singular values:

$$W = \begin{pmatrix} w_1 & & 0 \\ & \ddots & \\ 0 & & w_n \end{pmatrix}$$

Treat the SVD as a black box: code is widely available
- In Matlab: [U,W,V] = svd(A,0)
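Below is a minimal Matlab sketch of that call (the matrix A is just a random example) showing the shapes of the factors and checking the factorization:

  % Minimal sketch: economy-size SVD of a random 5x3 matrix,
  % then check the factorization A = U*W*V'.
  A = rand(5, 3);
  [U, W, V] = svd(A, 0);     % U: 5x3 orthonormal, W: 3x3 diagonal, V: 3x3
  err = norm(A - U*W*V');    % ~1e-15: just roundoff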

SVD
- The w_i are called the singular values of A
- If A is singular, some of the w_i will be 0
- In general rank(A) = number of nonzero w_i
- SVD is mostly unique (up to permutation of singular values, or if some w_i are equal)
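A small sketch of the rank rule, using a tolerance-based test for "nonzero" (the tolerance below mirrors common numerical practice and is an assumption):

  % Sketch: rank = number of singular values above a tolerance.
  A = [1 2; 2 4; 3 6];               % rank-1 example: second column is 2x the first
  w = svd(A);                        % singular values only
  tol = max(size(A)) * eps(max(w));  % assumed cutoff for "zero"
  r = sum(w > tol);                  % r = 1, matching rank(A)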

SVD and Inverses


Why is SVD so useful?

Application #1: inverses
- A^-1 = (V^T)^-1 W^-1 U^-1 = V W^-1 U^T
  - Using the fact that inverse = transpose for orthogonal matrices
- Since W is diagonal, W^-1 is also diagonal, with the reciprocals of the entries of W

SVD and Inverses


A^-1 = (V^T)^-1 W^-1 U^-1 = V W^-1 U^T
- This fails when some w_i are 0
- It's supposed to fail: the matrix is singular

Pseudoinverse: if w_i = 0, set 1/w_i to 0 (!)
- "Closest" matrix to the inverse
- Defined for all matrices (even non-square, singular, etc.)
- Equal to (A^T A)^-1 A^T if A^T A is invertible
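A minimal sketch of building the pseudoinverse this way (the tolerance is an assumption; the result agrees with Matlab's built-in pinv):

  % Sketch: pseudoinverse via SVD, setting 1/wi to 0 for tiny wi.
  A = [1 2; 2 4; 3 6];               % singular example
  [U, W, V] = svd(A, 0);
  w = diag(W);
  tol = max(size(A)) * eps(max(w));  % assumed cutoff for "zero"
  winv = zeros(size(w));
  winv(w > tol) = 1 ./ w(w > tol);   % leave 1/wi = 0 where wi ~ 0
  Apinv = V * diag(winv) * U';       % matches pinv(A)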

SVD and Least Squares


Solving Ax = b by least squares:
- x = pseudoinverse(A) times b
- Compute the pseudoinverse using SVD
- Lets you see if the data is singular
- Even if not singular, the ratio of max to min singular values (the condition number) tells you how stable the solution will be
- Set 1/w_i to 0 if w_i is small (even if not exactly 0)
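A sketch of the whole recipe on a made-up system; the cutoff value is an illustrative assumption to be tuned per problem:

  % Sketch: least-squares solve of Ax = b via a truncated-SVD pseudoinverse.
  A = [1 1; 1 2; 1 3];  b = [1; 2; 2];   % made-up overdetermined system
  [U, W, V] = svd(A, 0);
  w = diag(W);
  condA = max(w) / min(w);               % condition number: large => unstable
  cutoff = 1e-10 * max(w);               % assumed threshold for "small" wi
  winv = zeros(size(w));
  winv(w > cutoff) = 1 ./ w(w > cutoff); % zero out unstable directions
  x = V * (diag(winv) * (U' * b));       % least-squares solution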

SVD and Eigenvectors


Let A = U W V^T, and let x_i be the i-th column of V

Consider A^T A x_i:

$$A^T A x_i = V W^T U^T U W V^T x_i = V W^2 V^T x_i = V W^2 \begin{pmatrix} 0 \\ \vdots \\ 1 \\ \vdots \\ 0 \end{pmatrix} = V \begin{pmatrix} 0 \\ \vdots \\ w_i^2 \\ \vdots \\ 0 \end{pmatrix} = w_i^2 x_i$$

So the elements of W are the square roots of the eigenvalues of A^T A, and the columns of V are the eigenvectors of A^T A
- Exactly what we wanted for robust least-squares fitting!
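A quick numerical check of this relationship (random A, for illustration):

  % Sketch: W.^2 and V from the SVD diagonalize A'*A.
  A = rand(6, 3);
  [~, W, V] = svd(A, 0);
  lam = sort(eig(A' * A), 'descend');  % eigenvalues of A'A, largest first
  err = norm(lam - diag(W).^2);        % ~0: eigenvalues = squared singular values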

SVD and Matrix Similarity


One common definition for the norm of a matrix is the Frobenius norm:

$$\|A\|_F^2 = \sum_i \sum_j a_{ij}^2$$

The Frobenius norm can be computed from the SVD:

$$\|A\|_F^2 = \sum_i w_i^2$$

So changes to a matrix can be evaluated by looking at changes to its singular values
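A quick check of this identity (random matrix for illustration):

  % Sketch: Frobenius norm from the entries vs. from the singular values.
  A = rand(4, 5);
  nf_entries = norm(A, 'fro');     % sqrt of the sum of squared entries
  nf_svd = sqrt(sum(svd(A).^2));   % sqrt of the sum of squared singular values
  % nf_entries and nf_svd agree to roundoff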

SVD and Matrix Similarity


Suppose you want to find the best rank-k approximation to A
- Answer: set all but the largest k singular values to zero
- Can form a compact representation by eliminating the columns of U and V corresponding to the zeroed w_i
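A sketch of the truncation in its compact form, keeping only the first k columns of U and V (A and k below are illustrative):

  % Sketch: best rank-k approximation by dropping all but the top k wi.
  A = rand(8, 6);  k = 2;
  [U, W, V] = svd(A, 0);
  Ak = U(:, 1:k) * W(1:k, 1:k) * V(:, 1:k)';   % compact rank-k representation
  % norm(A - Ak, 'fro')^2 equals the sum of the discarded wi.^2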

SVD and PCA


Principal Components Analysis (PCA): approximating a high-dimensional data set with a lower-dimensional subspace

[Figure: scatter of data points, showing the original axes together with the first and second principal components.]

SVD and PCA


- Take the data matrix with points as rows and compute its SVD
  - Subtract out the mean first ("whitening", i.e., centering the data)
- Columns of V_k (the first k columns of V) are the principal components
- The value of w_i gives the importance of each component
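A minimal PCA-via-SVD sketch on synthetic data (the data matrix and k below are illustrative):

  % Sketch: PCA of a data matrix X with one point per row.
  X = randn(100, 3) .* [2 1 0.1];  % synthetic data with anisotropic spread
  Xc = X - mean(X, 1);             % subtract out the mean first
  [~, W, V] = svd(Xc, 0);
  k = 2;
  Vk = V(:, 1:k);                  % first k principal components
  proj = Xc * Vk;                  % data expressed in the k-dim subspace
  % diag(W) ranks the components by importance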

PCA on Faces: Eigenfaces


[Figure: the average face, the first principal component, and the other components. For all images except the average: gray = 0, white > 0, black < 0.]

Using PCA for Recognition


Store each person as the coefficients of the projection onto the first few principal components:

$$\mathrm{image} = \sum_{i=0}^{i_{\max}} a_i \, \mathrm{Eigenface}_i$$

Compute the projections of the target image and compare them to the database (nearest-neighbor classifier), as in the sketch below
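A sketch of that recognition step; every name here is an illustrative assumption (Vk holds eigenfaces as columns, avg is the average face, db stores one row of coefficients per person, img is the face to identify), with random stand-ins so the snippet runs:

  % Sketch: nearest-neighbor recognition in eigenface space.
  Vk = orth(randn(64*64, 10));        % assumed: 10 eigenfaces as columns
  avg = randn(64*64, 1);              % assumed: average face
  db = randn(5, 10);                  % assumed: 5 people x 10 coefficients
  img = randn(64, 64);                % assumed: target image to identify
  coeffs = (img(:) - avg(:))' * Vk;   % project target onto the eigenfaces
  dists = sum((db - coeffs).^2, 2);   % squared distance to each person
  [~, best] = min(dists);             % index of the closest match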

Total Least Squares


One final least squares application

Fitting a line: vertical vs. perpendicular error

Total Least Squares


Distance from point to line:

$$d_i = \begin{pmatrix} x_i \\ y_i \end{pmatrix} \cdot n - a$$

where n is the normal vector to the line and a is a constant

Minimize:

$$\sum_i d_i^2 = \sum_i \left( \begin{pmatrix} x_i \\ y_i \end{pmatrix} \cdot n - a \right)^2$$

Total Least Squares


First, let's pretend we know n, and solve for a:

$$\sum_i d_i^2 = \sum_i \left( \begin{pmatrix} x_i \\ y_i \end{pmatrix} \cdot n - a \right)^2$$

Setting the derivative with respect to a to zero gives

$$a = \frac{1}{m} \sum_i \begin{pmatrix} x_i \\ y_i \end{pmatrix} \cdot n$$

Then

$$d_i = \begin{pmatrix} x_i \\ y_i \end{pmatrix} \cdot n - a = \begin{pmatrix} x_i - \frac{\sum_i x_i}{m} \\ y_i - \frac{\sum_i y_i}{m} \end{pmatrix} \cdot n$$

Total Least Squares


So, let's define

$$\tilde{x}_i = x_i - \frac{\sum_i x_i}{m}, \qquad \tilde{y}_i = y_i - \frac{\sum_i y_i}{m}$$

and minimize

$$\sum_i \left( \begin{pmatrix} \tilde{x}_i \\ \tilde{y}_i \end{pmatrix} \cdot n \right)^2$$

Total Least Squares


Write as a linear system, A n = 0:

$$\begin{pmatrix} \tilde{x}_1 & \tilde{y}_1 \\ \tilde{x}_2 & \tilde{y}_2 \\ \tilde{x}_3 & \tilde{y}_3 \\ \vdots & \vdots \end{pmatrix} \begin{pmatrix} n_x \\ n_y \end{pmatrix} = 0$$

Problem: lots of n are solutions, including n = 0
- Standard least squares will, in fact, return n = 0

Constrained Optimization
Solution: constrain n to be unit length

So, try to minimize |An|^2 subject to |n|^2 = 1:

$$\|An\|^2 = (An)^T (An) = n^T A^T A n$$

Expand in the eigenvectors e_i of A^T A:

$$n = \mu_1 e_1 + \mu_2 e_2$$

$$n^T A^T A n = \lambda_1 \mu_1^2 + \lambda_2 \mu_2^2, \qquad \|n\|^2 = \mu_1^2 + \mu_2^2$$

where the λ_i are the eigenvalues of A^T A

Constrained Optimization
To minimize $\lambda_1 \mu_1^2 + \lambda_2 \mu_2^2$ subject to $\mu_1^2 + \mu_2^2 = 1$, set the $\mu_i$ corresponding to $\lambda_{\min}$ to 1 and all other $\mu_i$ to 0

That is, n is the eigenvector of A^T A with the smallest corresponding eigenvalue
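Putting the whole total-least-squares derivation together, a minimal sketch on made-up points; the smallest right singular vector of A is exactly the eigenvector of A^T A with the smallest eigenvalue:

  % Sketch: total-least-squares line fit via the smallest singular vector.
  x = [0; 1; 2; 3; 4];  y = [0.1; 1.9; 4.2; 5.9; 8.1];  % made-up points
  xt = x - mean(x);  yt = y - mean(y);   % subtract out the means
  [~, ~, V] = svd([xt yt], 0);
  n = V(:, end);                         % singular vector with the smallest wi
  a = [mean(x) mean(y)] * n;             % offset: the line is (x y).n = a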
