Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
Yichuan Wang
October 6, 2013
Introduction
We set out to prove the following theorem:
Theorem (Inverse Function Theorem) Let, A R be an open. Let
f Cr ( A, Rn ). Then for any a A, an open neighborhood U of
a such that f U is a Cr diffeomorphism.
This proof proceeds in several acts.
Contents
Introduction
Reduction
1
1
Lipschitz Magic
4
5
Reduction
The first step is to convert the analysis of an arbitrary f with these
properties to a specific g ( x ) located around the origin. Therefore we
define
g := D f a1 T f (a) f Ta
By assumption, D f a1 exists and is clearly invertible. Since g is the
composition of various invertible functions, once it is shown to be
invertible, then we will have an inverse for f . So instead of showing
that f is a Cr -diffeomorphism around a, it is instead sufficient to
show that g is a Cr diffeomorphism around 0.
We were asked to confirm two things about this map.
1. g (0) = 0
Proof:
D f a1 T f (a) f Ta (0)
D f a1 T f (a) f ( a)
D f a1 (0)
= 0
2. Dg (0) = Id
Proof: We expand the form of the multivariable chain rule
Dg (0) = D D f a1 T f (a) f Ta Id D f ( Ta (0)) Id
= D D f a1 D f a
D f a1 D f a (Recall that the total derivative of a linear map is the map itself)
Id
Lipschitz Magic
Since the goal is to use the contraction mapping theorem, we will
have to spend some time making sure things are Lipschitz and are
truely contractions. To do so, we define a helper function
h := g Id
Observe that Dh (0) = 0[1] .
It is very important that the total derivative of h at 0 is 0. Since h is
r
C , then Dh is at least continuous. As a result, for any e > 0, > 0
such that x U (0; ), k Dh ( x )k < e. We now evoke the Mean
Value Theorem-like result from Homework 4. Since the ball U (0; ) is
convex, then for any points a, b U (0; ), kh ( a) h (b)k e k a bk.
From this helper function format, we can find upper and lower
Lipschitz bounds for g. Fix a, b U (0; ). For an upper bound,
observe that
k g ( a) g (b)k = kh ( a) h (b) a + bk
kh ( a) h (b)k + k a bk
= (1 + e ) k a b k
For a lower bound, we use a reverse triangle inequality and find
that
k g ( a) g (b)k = kh ( a) h (b) a + bk
k a bk kh ( a) h (b)k
k a bk e k a bk
(1 e ) k a b k
At this point, we should take a breath. These bounds give us a few
results on g:
1. g is injective on U (0; ) - for any two different points a, b
U (0; e), k g ( a) g (b)k (1 e) k a bk > 0, hence they
get mapped to two different points. This is important because it
makes it possible to define a local inverse.
[1]
Dg (0) Id
Id Id
[2]
Some Notes
The whole idea of a contraction is to find some complete space on
which a function is defined that takes the space to itself. For example,
we already know that
h : U (0; )
x
U (0; )
7 h ( x )
[3]
bound was (1 e) .
Proof: Fix (0, ). Fix y U 0; (1 e) . For such a y, we
define a new function y : x 7 y h ( x ) [4] . Do note that
x U (0; ),
Dy ( x )
= k D (y h ( x ))k = k Dhk e. As a
result, for any x U 0; ,
[4]
y ( x )
= ky h ( x )k kyk + kh ( x )k (1 e) + e =
Hence, y is a map from U 0; to itself. Since
Dy
< e,
then by the MVT like result from HW4, we have that y is Lipschitz with constant e < 1. Hence it is a contraction mapping on U (0; ). Therefore there exists x0 U (0; ) such that
y ( x0 ) = x0 . But this means that
y ( x0 )
= y h ( x0 ) = y ( g ( x0 ) x0 ) = x0
= g ( x0 ) = y
The contraction mapping theorem also gives us that such an
x0 is unique.
Do observe that in the above proof, both x0 and y are in the closed
ball U 0; . So now we can go through some boring technical
steps[5] , as we can now show things work for the open ball U (0; (1 e) ).
Claim: y U (0; (1 e) )[6] , unique x U (0; ) such that
g ( x ) = y.
Proof: Fix y U (0; (1 e) ). Then since y is in the interior of
the ball, > 0 such that U (y; ) U (0; (1 e) ). Hence
we can set = kyk + < , and then y U 0; (1 e) .
By the previous claim, there is a unique x U 0; such that
g ( x ) = y. But since < , then x U (0; ).
This gives us two things:
1. g (0) = 0 Int ( g (U (0; (1 e) ))), as there is a neighborhood
around 0 that on which g1 is defined.
2. g1 (y) = x0 makes sense. I.e. we can associate each value in the
image with a unique value in the pre-image.
[5]
{z
( a)
(c)
/ A0
Df
/ GLn (R)
invert /
GLn (R)
6
Dg
g is C1 by the base case. D f is Cr1 by definition. Matrix inversion is just a rational function with a non-zero denominator, and
is therefore C . Then we can just go up the chain and have that
Dg is Cr1 = g = f 1 is Cr .
[7]
f ( a) + k
g (b) + h
a+h
b+k
f ( g (b + k))
f ( a + h)