Strang 1973

CENTRO INTERNAZIONALE MATEMATICO ESTIVO l(C. I. M. E.
f3.
STRANG
AND
G.
FIX
A FOURIER ANALYSIS O F T H E FINITE E L E M E N T VARIATIONAL METHOD
Corso tenuto ad Erice
dal
2 7 giugno a1 7 luglio
1971
These lectures were prepared f o r a CIME Advanced Summer Institute held in 1971. ? h ' e 'first author has been supported by the Office of Naval Research and the National Science Foundation (GP - 13778), and the second by AEC Contract
7158
9 . Apologia
.This paper has been taken from a preliminary d r a f t of our book "An Analysis of t h e F i n i t e Element Method", t o be published by Prentice-Hall about the end of 1972. In this f i r s t draft w e developed the theory of
f i n i t e elements on a regular mesh, w i t h Fourier -analysis a s the principal tool, and we were able t o discuss the connections w i t h f i n i t e differerne equations and t o include a p a r t of the theory of splines. we now c a l l the "abstract f i n i t e element method".
lhis framework
In our book, the emphasis will be s h i f t e d to the "nodal f i n i t e element

methodn a s developed by s t r u c t u r a l engineers, i n which i r r e g u l a r elements a r e more the r u l e than the exception.
In t h i s case splines a r e muoh l e s s

However t h e basic theorems the f i r s t author i n
convenient, and Fourier analysis i s impossible. remain valid
- we r e f e r t o a forthcoming paper of
Numerische Mathematik on "Approximations in t h e F i n i t e Element Method". Furthermore it becomes possible to examine t h e e r r o r s due to the presence
of curved boundaries and inhomogeneous boundary data. Ve hope that the book w i l l give a reasonably complete and r e a l i s t i c treatment of t h e e s s e n t i a l f i n i t e element theory f o r l i n e a r problems, and t h a t the reader w i l l accept the present paper a s an interim report.
821
G . Strang-G. Fix
The c r u c i a l i d e a i n t h i s synthesis i s t o choose t h e b a s i s such a way t h a t t h e variati.ona1 eauations f o r t h e . v j be difference eouations. way below.
in wj t u r n out t o
The customary description of t h e r e s u l t i n g
method i s i n v a r i a t i o n a l terms, and we s h a l l introduce i t i n t h i s To think- of it a t t h e same time a s a difference scheme

w i l l require a c e r t a i n tolerance on t h e reader's p a r t , since a t
f i r s t s i g h t t h e f i n i t e element method seems t o depart a t several
points from conventional difference equations.
I n f a c t it i s J u s t
these p o i n t s which represent f o r u s t h e main contrjbutions of the\ method t o f i n i t e difference theory; they a r e innovations which could have been devised independently, but never were. Our goal w i l l be t o decide when t h e f i n i t e element method i s convergent and numerically s t a b l e , and t o estimate t h e e r r o r ,
Al-
though ne do not discuss a t t h i s point t h e i r r e g u l a r meshes and general boundary conditions which a r e met i n applications, we have t r i e d t o r e t a i n t h e mathematical e s s e n t i a l s of t h e method; t h e s e we .study i n some generality. Of course t h e ultimate question i s whether t h e f i n i t e element method i s more e f f e c t i v e than i t s competitors, namely those techniques which l i e ou$side t h e above i n t e r s e c t i o n . The evidence sugGests t h a t although difference scliemes can be constructed which require fewer operations f o r a given order of accuracy, nevertheless t h e v a r i a t i o n a l approach has an important coherence which derives from t h e f a c t t h a t , once t h e b a s i s i s chosen, t h e r e s t i s l a r g e l y automatic. This coherence seems
'Pj
t o be r c f l e c t c d i n a more regular behavior both of t h e e r r o r and of t h e user, who has othcrv:ise t o make a separate choice of f i n i t e
difference replacement f o r each term i n t h e d i f f e r e n t i a l equation
G. Strang-G. Fix
and boundary cqnditions.
Thc evidence i n t h i s comparison i s s t i l l
very l i m i t e d , however, and we s h a l l t r y t o remain n e u t r a l . The nenie we have adopted was o r i g i n a l l y chosen by engineers [ I ] , who decompose a continuous s t r u c t u r e , f o r numerical purposes, i n t o a s e t of " f i n i t e elements". mathematics i s l e s s c l e a r . The h i s t o r y of t h e underlying Both Courant [2] and p61ya [3] commented
on t h e merits, i n c e r t a i n v a r i a t i o n a l problems, of seeking ap-. proximate s o l u t i o n s which a r e l i n e a r within each ( t r i a n g u l a r ) element; accuracy i s . improved by i n c r e a s i n g t h e number of elements r a t h e r than t h e .complexity of t h e approximating functions. t h i s t r i a l space, t h e Laplace operator a c t i n g on vector v u
With
induces i t s
f a m i l i a r 5-point d i f f e r e n c e analogue, a c t i n g on t h e c o e f f i c i e n t
Such t r i a l f u n c t i o n s t h e r e f o r e make t h e Ritz method
esgeclally s i ~ p l e t o execute, and it seems very l i k e l y t h a t t h i s
idea was proposed even e a r l i e r . The development of t h e method has l e d n a t u r a l l y from p i c e c u i s e l i n e a r functions t o s p i i n e s and o t h e r piecewise polynomials of f i x e d dcgree p ; each i n c r e a s e i n p adds both t o t h e accuracy
A s usual, t h e e x t r a accuracy
and t o t h e complexity of t h e method.
i s i n i t i a l l y .worth t h e p r i c e ; b u t j u s t a s Newton1 s method i s more
popular than i t s higher-order analogues, questions of convenience soon become paramount, cubic approximants point. I n a p p l i c a t i o n s t o second-order equations, (p = 3 ) . a r e apparently c l o s e t o t h e t u r n i n g
Thc e s s e n t i a l f e a t u r e s of t h e method a r e t h e subdivision
of t h e region i n t o f i n i t e elements, and t h e choice of a so-called local b a s i s f o r t h e space of approximating f u n c t i o n s b a s i s con~posedof functions which vanish over a l l b u t
- that
tL
is, a
few elements.
G. Strang-G. Fix
W e analyze i n t h i s paper t h e case of subdivision by a regular mesh, of width functions functions variable h-30
, with
a l o c a l b a s i s constructed i n t h e
w i l l be t h e number of b a s i s functions
following systematic way. vl(~),

mi
W e s t a r t with a fixed s e t of t r i a l
...,uN(~) ; N
f o r each meshpoint.
To ensure a l o c a l basis, we i n s i s t thak t h e s e
vanish f o r l a r g e h
1x1
x by t h e mesh width
, and
r e s c a l e t h e independent
The eventual Rayleigh-Ritz
equations w i l l take t h e form of difference equations i f t h e b a s i s h functions vi, associated with each meshpoint ( jlh,. ,jnh)
..
a r e simply t r a n s l a t e s of these rescaled functions b a s i s i s thus composed, f o r each h
cDi(~/h)
The
, of
t h e functions
Of course t h e r e have t o be modifications a t boundaries. I n t h e piecevrise l i n e a r case t h e r e i s a s i n g l e parameter f o r each meshpoint, namely t h e value of t h e function a t t h a t point; thus of
N =l
The graph
v1
i s a pyramid with vertex a t t h e o r i g i n and with base
formed from t h e neighboring-triangular elements.
G . Strang- G . Fix
It should be c l e a r t h a t t h i s
cpl
and 3.ts t r a n s l a t e s span t h e space
of a l l continuous functions i n t h e plane which a r e l i n e a r within each element. I n some exemples our description (1.1) of t h e b a s i s
w i l l seem l e s s transparent than a description of t h e space i t s e l f ,
i n terms of t h e functions admitted within each element and t h e compatibility conditions across element boundaries. Nevertheless, both f o r p r a c t i c a l computations and f o r t h e general theory, t h e d e f i n i t i o n of the b a s i s i s crucial; it i s from combinations of t h a t t h e Rayleigh-Ritz-Gal-erkin p r i n c i p l e w i l l s e l e c t an approximation uh The fundamental questions f o r a numerical analyst a r e those of convergence and s t a b i l i t y :
i) k a c c u r e t e i s t h e apnroxixftc s o l u t i o n
these functions
oi,
uh
, and
ii)
how well conditioned a r e the eauations from vrhich uh i s determined numcrica.lly?
The answers can only come from t h e connections between t h e given d i f f e r e n t i a l problem, t h e t r i a l functions tpl,...,qN
, and
the W e
norms i n which accuracy and condition number a r e measured. order eh = uh

2m
want t o study these questions f o r e l l i p t i c o ~ e r a t o r s of a r b i t r ~ r y
, for
q u i t e general
with t h e spaces
-u
xS
and
I I ~
qi
, and
f o r t n e norms associated
These norms measure t h e e r r o r
and i t s d e r i v a t i v e s
of order
In1 = xuj ( s
I
i n t h e mean-square and po5.ntwise senses
respectively:
In Sobolevts notation the space 9fS is written : W
Our arguments make constant use of the Fourier"transfo?m, which operates at full strength only on problems which are either periodic, or defined on the whole of Euclidean space R "
We
are convinced that (as in the theory of elliptic differential operators) the investigation of these special problems is fundamental to the understanding of more general boundary conditions. Thus we regard the present work as a necessary first step in analyzing the wide variety of problems, with irregular meshes and boundaries, vihich are actually being solved. Fortunately,
most of the second step in the analysis is already complete;

J.-P. Aubin, follotring the work of ~ 6 and a others in the French
school, has successfully analyzed the solution of boundary problems by means of splines
In his terms, our contrifunctions to which his theory can
bution is to determine extend. the reference [ 4 ] .)
all trial
(He mention, in addition to his forthcoming manuscript, The third step is the study of more general
meshes, particularly those formed by an arbitrary triangulation of the region. This is a major point in our forthcoming book.
G . Strang-G. Fix
For problems on t h e whole space runs over t h e s e t 2 "
Rn
, the
index (jl,'.
of a l l m u l t i - i n t e g e r s
..,jn) .
i n (1.1) W e
adopt t h e d e f i n i t i o n
f o r t h e Fourier transform, where xlcl
... + xntn..
= (,...,E~)
and
xg
denotes
AS a f i r s t a p p l i c a t i o n of t h e Fourier transform,
P a r s e v a l t s formula can be used t o replace (1.2) by t h e equivalent and more convenient norm
W e f i r s t describe t h e app1ication:of t h e Ritz-Galerkin method a b s t r a c t l y , t o e l i n e a r e l l i p t i c problem on ventional i n n e r product with a b i l i n e a r form

( f , g) = /f (x) E(x) dx
R*
,.t h e problem begins
Using t h e con-
I f the coefficients
a r e bounded, t h e form
i s defined when-
ever u
and
l i e i n t h e space
7j"
t h a t is, whenever a l l lie in L ~ ( R ~ )
p a r t i a l d e r i v a t i v e s of order not exceeding m The fgrm i s c a l l e d ? ? - e l l i p t i c provided t h a t
G .Strang-G. F i x
The most familiar example is the form associated with the Laplace equation, a(ualv) = $ au ai;
au a ? ... + axn ax, dx .
As it stands this is not ??'-elliptic, corresponding to the fact that Laplace' s equation has non-zero solutions, e , g . u = constant.
To satisfy (1.4), and thereby eliminate this non-uniqueness of the

solution, we need'to add some positive multiple of the zero-order term (u,w)
An elliptic form induces the follovring variational problea: given f
, find
u in
so that
(1.5)
a(u,w)
( f , v , )
for all w
in
i p
.
the in-
This problem has one and only one solution u
, provided
homogeneous data f is such that the right side makes sense; since w ranges over %? , this places f . in the adJoint space
IC-"
, so that
The elliptic problem (1.5) can equally well be put into the more familiar opcrationcl form
G. Strang-G. Fix
For t h i s we i n t e g r a t e t h e l e f t s i d e of (1.5) by p a r t s , s h i f t i n g a d e r i v a t i v e s from w onto q D u The r e s u l t i s ( L U , ~ ) = (f,~),
which i s equivalent t o (1.6); L given by
i s t h e map from
ji)n t o u - ~
Many applications l e a d a l s o t o problems of I n such c a s e s the form of a(u,u) a

i s self-adjoint,
mininlization
and t h e minimization is
- ( f , u ) - (u,f)
l e a d s exactly t o t h e same v a r i a t i o n a l
LU = f
problem (1.5).
I n f a c t t h e operational equation
nothing but t h e x e r eauation from t h e c a l c u l u s of variations. The Ritz-Galerkin technique i s now simple and very familiar; t h e space i s replaced i n t h e v a r i a t i o n a l statement (1.5) by
a sequence of closed subspaces
sh
Thus t h e approximating uh in in
problem, w r i t t e n v a r i a t i o n a l l y , i s t o f i n d a ( uh ,wh ) = (f,wh) for a l l
sh
so t h a t
(1.7)
wh
sh
.
fdr
E l l i p t i c i t y implies the existence and uniqueness of concerned with i t s computation. problem a l s o i n t o operational form. Choosing a b a s i s
uh ; we a r e
Therefore we put t h e approximate
sh , we
expand
vh u
G . Strang-G. Fix
and compute t h e vector i n t o (1.7))
vh of unlmovrn c o e f f i c i e n t s .
Substituting
h Since t h i s holds f o r a l l c o e f f i c i e n t s wv
Thus t h e v e c t o r . vh
s a t i s f i e s t h e d i s c r e t e operational equation
where t h e e n t r i e s i n t h e coefficient matrix and t h e inhomogeneous vector a r e given by
Since a l l t h e s e e n t r i e s have t o Be calculated, e i t h e r a n a l y t i c a l l y o r by numerical quadrature, one wants a s simple a b a s i s a s possible. The use of t h e c l a s s i c a l special functions, i n o t h e r words a r e t u r n t o stage one, i s by no means obsolete; both Urabe and Clenshaw have made successful application of Chebyshev polynomials. interested, however, i n t h e b a s i s functions for'problems on R"
f ,u,
M e are if
i,J
Z"
defined e a r l i e r ;
...
t h e index
runs over
j
, whereas
a l l have period
0 ( jv < h-'
1 , we require t h a t
h-I
be an i n t e g e r I n t h i s periodic
and ' t h a t
f o r cach component of
case
sh
has f i n i t e dimension
i
N h-n
The index
assumes t h e values
1,. .,N
, so
~t is n a t u r a l
t o take t h e b a s i s functions i n groups
of
a t a time.
1 1
This p a r t i t i o n s t h e matrix
Ah
i n t o blocks of
order
,a
t y p i c a l block being
Thus t h e f i 3 i t e element r e l a t i o n
Ah vh = f h
i s a coupled system
of
ct
d i s c r e t e equations.
Normally ' t h i s system i s analogous t o D(LU) = Df
continuous one, i n which t h e o r i g i n a l d i f fe r c n t i a l equation i s coupled t o some of i t s d i f f e r e n t i a t e d forms t h e d i s c r e t e system can be formally
Lu = f
I n t h e Hernite case
recombined t o y i e l d a s i n g l e s p l i n e - l i k e equation, J u s t a s t h e Hcrmite b a s i s functions, with small support, can be combined w i t h t h e i r t r a n s l a t e s t o yield the spline basis. .
We want now t o summarize our r e s u l t s .

i n c a r h a t i o n of M.I.T.fs -Au u .= f and i t s
A more extended summary
a rehas already appeared [5] i n Studies i n Applied ~~lathematics, Journal of Mathematics and Physics. and W e
hope t h a t our discussion there-, i n terms of t h e model problem

-1-
5-
9- point d i f f e r e n c e analogues, w i l l
For t h e moment wc
be a u s e f u l supplement t o t h e present pzper.
set nsidc extensions t o eigenvnlue problems and parabolic equations,
and dcscribc our conclusions only f o r t h c problems of convergence and s t a b i l i t y s t a t e d above.
G. Strang-G. Fix ii) The problem 6f s t a b i l i t y i s t h e simpler, Hcre t h e
fundamental question i s whether o r not t h e independencc of t h e b a s i s h elements qi, i s uniform a s h e 0 ; i n L2 ,with our normalization (1.1) of t h e b a s i s elements, t h i s means
For t h i s uniform independence we f i n d t h e folloering necessary and s u f f i c i e n t condition:

I' =
t h e r e e x i s t s no n o n - t r i v i a l combination such t h a t t h e Fourier transform of

Y
ci qi
and r e a l
satisfies
(1.14)
u(so
2sj) = 0
for a l l
j e
zn
.
This i s an
qi ; only gross f e a t u r e s of t h e d i f f e r e n t i a l problem, i t s order and e l l i p t i c i t y , a r e r e l e v a n t t o t h e condition number. a t t r a c t i o n , a t l e a s t t o t h e a n a l y s t , which should not disappear
The reader w i l l n o t i c e t h a t everything depends on t h e
i n more general boundary problems.

are
For d i f f e r e n c e equations which
not
derived v s r i a t i o n a l l y , Schacfferf s poererful work [6] has
shovm how deep t h e s t a b i l i t y problem a c t u a l l y i s , even i n comparison with t h e 'corresponding question i n t h e general theory of e l l i p t i c boundary problems. be e l l i p t i c . i) Thc r a t c of -convergence of uh depends on t h e
S terminology [ 7 ] I n ~homgel
,.
(1.14) i s
ncccssary and s u f f i c i e n t f o r t h e d i f f crence equations (1.9) t o
"density" of t h e spaces
sh ; t h e r c f o r c
w e begin with a discussion
G. Strang- G . Fix
of approximation theory. t h e following form:

with error
Our main r e s u l t f o r t h e case norm, s(p
N = l
takes
smooth functions can be approximated from
sh
O(hp+l-') i n l i ~ egs xl,.
no~nialsi n
..,xn
Q
--
,if
and only i f a l l poly-
of degree ( p
can be w r i t t e n a s l i n e a r Fourier a n a l y s i s l e a d s
; i t must have zeros
cor;lb:inatlons of
and i t s t r a n s l a t e s .
t o an equivalent condition on t h e transform of order

is in
p -1- 1 a t a l l t h e p o i n t s
4 = 2nj
,j #
(0,.
.,,0)
Here
w e have assumed t h e conditions most commonly met i n p r a c t i c e r t h a t

cp
fJP
and s t a b i l i t y holds; more p r e c i s e r c s u l t s a r e proved
i n 52. With
N > 1 , the
qi
and l i n e a r combinations of t h e i r t r a n s I n fact thcre are Suppose f o r (n=1)
l a t e s may have d i f f e r i n g degrees of smoothness. important c a s e s i n which t h i s i s bound t o happen.

o.r'-.:,?
.?
thnt
sh
:is comyriscA of pccc&rise cubic functions
which have continuous f i r s t d e r i v a t i v e s a t each j o i n t .
Since t h i s
Hermite space contains t h e s p l i n e subspace,. whose elements have .continuous second d e r i v c t i v e s a s well, t h e s p l i n e b a s i s f u n c t i o n s must be combinations of t h e Hermite b a s i s ; t h e former i s i n and t h e l a t t e r only i n only t h a t t h e
??
2 (
Thus, i n t h e general case, ure a s s w e agbin we prove t h a t

s( q)
mi
are i n
vq , q(p , and
approximation of order from t h e
hP'l-S
i s p o s s i b l e i n ?lS ( f o r
p
if and only i f a l l polynomials of degree

cpi
can be produced
and t h e i r t r a n s l a t e s .
These approximation r e s u l t s a r e of course a l r e a d y known f o r many s p c c i f i c choices of
sh
\re mention i n p i r t i c u l a r t h e e a r l y
estimatcs f o r s p l i n c s by Birl<hoff, de Boor, and others; t h e approx:inationo constructcd by Aubin i n [ 8 1
, and
the detailed
G . Strang-G. Fix
treatment of multi-dimensional spline and Hermite functions in Birkhoff, Schultz, and Varga. [ 9 ] and in recent papers by Schultz (cf. [lo])
The order p+l-s of approximation, starting with a

QY
single arbitrary
, has also been
established elsewhere; in fact
this problem was so striking that it was attacked independently by di Guglielmo [U], ~abuxka[12], and the two of us. (We regret is small.)
that the order just given is chronological, although At Our o r v n contributions to the theory of approximation on possible constants in the error estimate c hPS1-s \ 1u + possible when polynomials can be manufactured from
?? with
'
J
N = 1 seem to be these: we have found the exact lower bound of
we have proved the converse result, that such an estimate is only

tp
, .
and
Obviously
this gives a special plzce to piece~iisepolynomial approximatingfunctions; they not only lead LO ~inplecu~npu-l;etions, which has until now been their real justification, but they also provide the most efficient basis from which to produce pol~momials. (Of course one may use polynomials themselves in the Ritz method, but this produces major difficulties. If monomials xJ are used as the basis, then the matrix Ah is both .hopelesslyill-conditioned and full of non-zero entries. The alternative, a basis of orthogonal polynomials, is awkward to compute numerically.) We note that our converse result was anticipated by ~051 [13], who considered' only a very limited class of app'roximating functions, and that di GuQielmo has discovered
[u] a
basis which makes Ah as sparse
as possible. Splines are optimal in one dimension, as is the pyramid function drn~mabove when n = 2 ,p = 1 "triangulates"
In general he
nn
and uses splincs whosc continuity properties
are imposcd between the triangular elements, or simplexds; this
G . Strang- G. Fix
leads to
In the,splines over rectangular elements introdued by de Boor and Birkhoff, the last factor disappears and the remaining exponents have to be increased to p + l
sh
becomes essentially a tensor
product of the one-dimensional case. at .the points

f
In either case, the zeroes
= 205
# 0 are evident.
W For approximation .in the pointvise norms' :
, the proofs
are
different but the results are the same. Here we construct an explicit approximation
C V , ~ a ~
~uasi-interpolate,by combining interpolation
with a generalized Taylor expansion:
The 6,
are those combinations of the basic functions v ,l
...,wN
which are used in the construction of polynomials up to degree P; if N

=
1 they arc suitable multiples of cpl
The estimate
G. Strang -G. Fix
then follows f o r smooth u
by comparing Taylor s e r i e s .
Again
t h i s bound i s a l r e a d y known i n s p e c i a l cases.

W e can novr s t a t e our p r i n c i p a l r e s u l t f o r problen ( i ) :
.
if
f -
5s s u f f i c i e n t l y smooth, t h e e r r o r i n e i t h e r t h e norm s a t i s f i e s
?fS
o r the
. : w
Apparently t h i s exponent for s
i s new, although i n
rfS , a t l e a s t
that the
, it
can be deduced by. v a r i a t i o n a l arguments from t h e
order of a p p r o x i l ~ ~ a t i ogiven n ~ b o v e . It 5 s rcnar!:able
e r r o r i n t h e R i t z method o f t e n i s not of t h e same order a s t h e e r r o r i n t h e b e s t a7proxiriation; t h e second expression i n well be smaller than t h e f i r s t . For t h e s p e c i a l value
r may
the
s = m
two o r d e r s autolnaticslly coincide, s i n c e e l l i p t i c i t y implies t h a t f o r any choice of t h e Ritz spaces
sh ,
inf s c sh \\u s\ltpl
h \\u u\\# 5 constant
W e always assume
p2m
, which
t o g e t h e r with s t a b i l i t y i s necessarx
and s u f f i c i e n t f o r t h e f i n i t e el.ement method t o succeed; t h i s
a p p l i e s t o eigenvalue p r o b l e G a s well. of t h e minimal degfce .p=m r e c a r d l e s s of
W e note t h a t f o r s p l i n e s and
, the
error i s in
i s only
0(h2)
~ ( h )in 0 = L 2 (R n ) ?l
Our proof of t h e e r r o r estimate (1.17) i s severely c l a s s i c a l ; w e regard (1.8-1.9) a s inducing a f i n i t e d i f f e r e n c e scheme and This allows us t o t r e a t t h e
estimate t h e l o c a l t r u n c a t i o n e r r o r . non-reflexive spaces
WE , which i n v a r i a t i o n a l arguments a r e
h Furthernore, t h e l o c a l e r r o r i s well-defined
avrkvard a t best; of course Sobolev's imbedding theorems may be used t o deduce pointwise estimates, b u t t h e c o s t i n powers of i s unacceptably high.
even i n t h e absence of g l o b a l p r o p e r t i e s l i k e e l l i p t i c i t y ; our technique should extend f o r example t o p a r a b o l i c and hyperbolic operators. It may a l s o be used t o j u s t i f y "Richardson extrapolation", Be n o t e t h a t
which i s a u s e f u l t o o l f o r i n c r e a s i n g t h e accuracy.
t h e order of t h i s l o c a l e r r o r i s normally t r i v i a l t o compute, by a comparison of Taylor expansions; b u t f o r a system of

c q ~ ~ ~ t ; o ?tz h,i n i s no 3or.ger so.
N
difference
W e cannot r e s i s t p o i n t i n g t o a r a t h e r paradoxical consequence of our estimates. derivatdves of u It i s concerned with t h e approximation of
, in
v~hichRitz methods a r e generally thought
t o be superior; they provide s o l u t i o n s which can be d i f f e r e n t i a t e d a n a l y t i c a l l y , whereas more conventional d i f f e r e n c e schemes y i e l d only mesh functions. However, ~hom&e has shown [ l b ] t h a t t h e order of accuracy i s not reduced when t h e s e mesh functions a r e differenced, while our p r e s e n t estimates (which a r e b e s t p o s s i b l e ) imply t h a t t h e accuracy i s . g e n e r a l l y decreased by each d i f f e r e n t i a t i o n and f i n a l l y disappears, The paradox e n t e r s i n t o t h e f i n i t e u at
element method, which i s both a Ritz system and a d i f f e r e n c e scheme: appprently t h e d e r i v a t i v e s of by differentiating uh
+ ,
should not be computed xo a s the center
, but
r a t h e r by regarding
G . Strang.-G. Fix
of an h-mesh and applying a n a c c u r a t e d i f f e r e n c e o p e r a t o r !

fortunately f o r t h e theory,
Un-
round-off e r r o r gener.ally r e v e r s e s
t h i s recommendation. One f u r t h e r problem, of obvious i n t e r e s t b u t questionable importance, i s t h i s : among a l l d i f f e r e n c e schemes c o n s i s t e n t
with a given d i f f e r e n t i a l equation, v;hich can b e produced by t h e f i n i t e elemcnt method? Certainly not a l l , since self-adJoint
(L = L+ 0) l e a d o n l y t o s e l f -
n o n n e g a t i v e - d e f i n i t e problecls
ad j o i n t n o n n e g a t i v e - d e f i n i t e f i n i t e element equations Furthermore, t h e inhomogeneous d a t a

f
(fib =
L 0)
e n t e r s t h e X i t z eauations,
n o t through i t s v a l u e s a t meshpoints, b u t through t h e i n t e g r a l s
f'pj
T h i s smoothing b e & r s on a remark of Zlamal [15]
, which
we discussed w i t h IJidlund and :4orton, t o t h e e f f e c t t h a t v a r i a t i o n a l

e r r o r bou.nds r e q u i r e fewer deri-.ratives o f standard f i n i t e d i f f e r e n c e estirnstes.
f
t h a n so:ne of t h e
W e n o t e t h a t Herbold,
S c h u l t ~ jand Varga [ 16J have observed how t h e u s u a l numerical quadratures of t h e s e i n t e g r a l s l e a d t o familf a r 4 , ~ ' f e r e n c e schemes. I f we c o n s i d e r only t h e l e a d i n g term -uxx
f o r second-crder
equations i n one dimension, a complete answer i s p o s s i b l e :
all
consistent self-adjoint nonnegative-definite difference matrices can b e produced. p r o p e r t i e s of The proof dcpends on t r a n s l a t i n g t h e s e a c t i o n i s t y p i c a l l y given by
, whose
i n t o p r o p e r t i e s of i t s symbol
G. Strang- G. F i x
The correspondcnce is a familiar one: Consistency <=> a(e) Self-adjointness <=> a

=
e2
o ( 0 3 ), so that a(~) = a( (0) = 0
is real
Non-negativity <=> a(0)
for all
The dj,er-~iesz theorem allorvs us to facsor such a nonnegative trigonometric polynomial into
Now we let cp
be the piecewise linear function which vanishes
for x ( 0 and has slope gives C P j

x = M
Bj in
cp
<x<
Consistency at
0
= 0
, so
that
returns to the value 0
For larger x we keep it zero, so that
satisfies it vanishes
the csscntial condition for the finite element method: outside a conpact set. Comparing coefficients of
eiee in (1.19)~
the Ritz-Galcr1;in coefffcients are indeed the a
Unhappily, the result we have just proved is misleading; the proper analoee of the differential equation Lu simply the explicit system vh = fh
f is Instead it is the
=
not
combination of this system with the expansion (1.18), expressing

uh
in tcrrnn of the given basis for
sh ; in
our context this
cxpnnsion is
G. Strang-G. F i x
Suppose we consider only t h e values points
uh(kh)
assumed a t t h e mesh-
x = kh ; these values a r e t h e components of a v e c t o r h O n t h i s mesh t h e previous equation vhich we denote by u
becomes
where t h e e n t r i e s i n t h e matrix Altogether, then, data

f
B~
a r e t h e values
ri(k-J)
uh
i s computed from t h e inhomogeneous
by solving
by t h e This i s t h e i m p l i c i t f i n j . t e d t f f c r e n c e eouation cle~l-ved
f i n i t e element method; it i s t h e t r u n c a t i o n e r r o r a s s o c i a t e d with
this equation
which y i e l d s an estimate of
uh
-u .
There i s a
s i m i l a r equation corresponding t o every choice of h-mesh i n t h e plane; our matrix

B~
r e f e r s only t o t h e mesh through t h e o r i g i n ,
Thus t h e f i n i t e elcment method i s f i n a l l y exposed a s a family of d i f f e r e n c e equations, each an i m p l i c i t system of

N
equations
but othcrv:j.se more o r l e s s convcntionsl, and a l l descended from t h c common a n c c s t o r s

c p . ,l
..,vN .
G.Strang- G. Fix
9 2.
Approxir,m.L;ion
I n t h i s s e c t i o n w e s h a l l i s o l a t e those conditions on t h e
f i n i t e elemcnt spaces
approximation.
sh
which determine t h e accurrtcy of
Althou~) th ~e proofs a r e r a t h e r technPcal, t h e S t a r t i n g with a consider ap-
c o n d i t i o n s thexselves @ r c remarkably simple. s5.n;:l.s function in

(?
, that
i s with
N = 1
, we
pro;.frnal;ion
?lS by
expansions of t h e form
'P j nunbe]- of elements, vre a s a w e t h a t
I n ordcr that each
s h a l l be non-zero within on1.y s . finite

p
vmishes f o r large
1x1
This property of conpact support ensures t h a t

n i n l ~ i x , s i n c c a l l prociucts
eve:.:
i s a band
v ! : W
a r e i d e n t i c a l l y zero vhenW e let

?la
C
1 j-kl
. (
i s sui'f'icicatly l a r g e .
denote t h e
spncc.? of Punctions with co:npact support whose d e r i v a t i v e s of
orc!cx
lic in
L2
q
.;
i n many a p p l i c a t i o n s t h e d e r i v a t i v e s
of order l e s s than
v ; i l l be contlnu.ous, and t h e
q th
vrriting
d e r i v n t i v e s p i eceivise c o n t i ~ ~ u o u s . W e adopt %he standard n o t a t i o n f o r x u l t i - i n t e g e r s ,

j ,
Z:
...
f o r clcments of
zn
j
and
a,$,
...
for. e l e n e n t s o$ uv
Thus each component
i s an integer,. each
is a
non-negative int.eger; and
G. Strang- G. Fix
> p -
i f and only i f each
a ,
2 P,
.
t h e range
The range of surnnztion i s understood t o bc
zn , and
of i n t c c r a t i o n i s
TiiEORFki I.
R"
, when
m
nothing i s s a i d t o t h e contrary.
Suppose
is i n
?fz .
Then t h e following
conditions a r e equivalent:
(5.)
v ( ~# ) 0
, but
co
has zeros of order a t l e a s t 27.2" :
p+1
n . t t h e o t h e r p o i n t s of
(ii)
for
l a 1
, ) ,
?!p-kl
ja cp(t-J)
is a polynomial i n
tl'
...,tn
(iff) h
jczn
with l e a d i n g terrn f o r each

-*
eta , C #
in
t h e r e a r e weights
that as
wh S
such
The c o n s t a n t s Proof.
cs
and
a r e independent of
( i =
This equivalence, l i k e much of t h e a n a l y s i s
l a t e r i n t h i s paper, depends on t h e Poisson f o r r ~ u l a : t h e
G. Strang- G. Fix
vnlucs of a function
on t h e l a t t i c e
A
zn
a r e connected with 2azn by
those of i t s Fouricr transform
on t h e l a t t i c e
If
Y'
has compact support, t h e f i r s t swn involves only a f i n i t e
number of terms,. and t h e secolld i s absolutely convergent. a Applied t o t h e function Y(x) = x ~ ( t - x ) , this y i e l d s
Suppose f i r s t t h a t ( i ) holds. terms on t h e r i g h t s i d e with

j
Then f o r any
l a 1 (p
, the
# 0 a l l vanish.
Therefore we
have only t o compute t h e contribution from
j = 0 :
The leading term i s c l e a r l y required.
eta , with
C = $(o)
# 0 as
NOVI we supposc t h a t ( i i ) ho1d.s.
Taking
a = 0
, this
means from (2.4) t h a t
i s a non-zero constant function. for

j
A Therefore ~ ( 0# ) 0
# 0
.
a = (1,0,.
A ,m ( 2 ~ j )=
Next we considcr
..,0) ; t h e
r l g h t s i d e of (2.4) i s
G . Strang-G. Fix
{lccording to (ii) this is a polynomial, and therefore this time

we
have
A arn/asl
( 2 1 . j ) = 0 for ' j # 0
Proceeding in the seme remaining
way, in order of increasing a conditions in (2.1)

(i) =
, we,establish the
.
Parseval's formula, into inequalities for
> (iii)
Our first step is to convert the inequalities
(2.2) and (2.3)
, by
Fourier transforms. We note first that
Therefore the transform of C wh vh is
We denote the function in braclrets by Wh ( 5 ) it has period

C/11
2r/h
in each variable
F ,
zy
, and
remark that
Thus if we let stability condition
denote t-hecube - ~ / h <
_I h/h
, the
(2.3) on the weights becomes
Using the definition (1.3) of the cctiinnte (2.2) is converted into
us
norm, the required
G . Strang- G. F i x
(2.7)
nn
lu(5)
- vh(5)$(h?) 1
s a t i s f y i n g these conditions, we f i r s t choose
To construct coefficients
wh
a ,
la1
5p
so t h a t
Recalling t h a t
6(0) #
by ( i )
, we
begin with
Then each remaining
is
chosen t o make t h e Pth derivative

= 0
of t h e l e f t s i d e of (2.8) vanish a t only t h e a t h derivative of r u l e 3etermines the

qa
I n f a c t , since
q = 0
i s non-zero a t
, this
i n order of increasing
by
N o w we choose the weights

(2.9)
wh
so t h a t % ( h ~ )i ~ n t h e period cube
C/h
wh(5) = b ( 5 ) la19
The s t a b i l i t y condition (2.3) i s s a t i s f i e d , s i n c e
in C/h
1 ~ (~ cIGI 1
(kle s h a l l denote a l l constants i n t h e following c .) Therefore Ve have t o show t h a t t h e i n t e g r a l s
estimates by
G.Strang-Gy Fix
are a l l bounded by
c h2(p+1-s)l/ u 2 \ / ~ ~ W + e ~begin with

,
L C
J 16(5)121hz12(pi1-s)(l+lr,12)Sh5 s i n c e lhc,l C/h
( r
For
o u t s i d e 'C/h
, we
notice t h a t
Ih%l 2 1 ; t h e r e f o r e
In
we use t h e p e r i o d i c i t y of
wh
t o change variables:
G. Strang-G. Fix
To show that the sun in this integral is O( 1 h< 1 2p+2)

A
, we
expand cp ,in s Taylor'series around 2 ~ jj since by condition

(i) it has a
p+l-fold zero there, there is only the usual
remainder t e n
The evalu~tionpoint
O j = 0j(h<)
lies on the line between

C/h
2sj and 28j+h6 ; since 5
is in
Therefore the sum in (2.10) is bounded by cl hs1 2D+2 ~(5) ,where -
To bound thcorcm:
S uniformly in 5
E
we apply first the Paley-W5.ener and its derivatdves are
?fE
A inlplies that g
entire functions of exponential type, and that for any s ( p they satisfy
The thcory of entire functions allows us to estimate the sums

S
in tcrlns of thc integral
St
, as
long as the evaluation
G. Strang-G. Fix
p o i n t s a r e s u f f i c i e n t l y vrell spaced; t h i s i s a straightforward

extension of
heo or em 6.7.15 i n Boas [17]

21rj
I n our case
always l i e s i n t h e cube centered a t

2~
, with
' j
s i d e s of l e n g t h
, and
vre do g e t
S(2) ( c
S u b s t i t u t i n g back i n t o (2.10)
( i i => ( i )
A t t h i s s t e p we use, not t h e f u l l s t r e n g t h
of '(iii)
, but
only t h e . f a c t t h a t approximation i n defined by
?fp
is
possible f o r t h e particular function U

( 1
A
i n the b a l l
B = (111 5
11
'(')
Since U (2.11)
E
outside
J3
9lP"
, (iii)
J
provides I\$~( 1 ~C ) S I
wh
such t h a t
5 constant
C/h
I f we consider t h e s e two i n t e g r a l s only over $(gh) = Q(0)
, where
~ ( h ) , it follows t h a t
G . Strang-G. Fix
Appealing again to the crucial constraint (2.11), we must ]lave
$(o)
$0.
Next we consider (2.12) over R"
- C/h , where
0 :
J#O C / h
'
l~~(~)$(h; + 2vj) I
' ( l + l-lYdt
(periodicity of W)
for 'every j # 0
Therefore
Comparing each power of h

wc
in turn, and recalling (2.11),
conclude that ~~$(2vj) = 0 for

la1 ( p
,j#
.
impl lest function
Remark 1 . In one dimension, n = 1
, the
which has the zeros required by condition (i) is the.one which ceneratcs the local -basis for splines of degree -p :
Furthermore, any
which s a t i s f i e s .(i)
i s a multiple of
6P '
where E
i s a s u i t a b l e e n t i r e function.
Thus every such cp
which may o r may not be a piecewise polynomial, can be found
a I f n > 1 (or N > 1) P ' no such common d i v i s o r e x i s t s , but condition (i)should make 1-b
from convolution with t h e "B-spline" possible t o i d e n t i f y a l l u s e f u l bases. W e have learned t h a t t h i s condition vras known mcch e a r l i e r t o Schoenberg; it appears i n h i s fundamental paper [18] a s t h e condition f o r a smoothing formula t o map onto i t s e l f t h e space of polynomials of degree The Poisson formula, which i s t h e c r u c i a l connection between transforms an p
zn
and
R"
, also
f i g u r e s i n t h e valuable
recent work of Bramble and H i l b e r t 1191 Remark 2.
Theorem I d i f f e r s a l i t t l e from t h e r e s u l t In t h e
s t a t e d i n t h e introduction, where we assumed ' s t a b i l i t y but gave weaker forms f o r t h e conditions i n t h e theorem. t h i r d condition, f o r exemple, t h e weaker statement imposed no r e s t r i c t i o n (2.3) on t h e weigizts; given t h e s t a b i l i t y condition (1.14), however, t h i s r e s t r i c t i o n i s automatic. polynomials
In t h e second
condition, we assumed no p a r t i c u l a r form f o r t h e expansion of
only t h a t they could somehow be represented a s and i t s t r a n s l a t e s , e.g.
combinations of
G . Strang-G. Fix
We, t h e r e f o r e want t o show t h a t s t a b i l i t y f o r c e s these weights presentation (2.14) unique; t h e only rcpresent.ation of tllc zero .function has a l l weights zero. vect4r ev ' i n t h e
t,,
- t o be equal.
The r o l e of s t a b i l i t y i s t o make any r e Novr .we use t r a n s l a t i o n
invariance, s h i f t i n g both s i d e s of (2.14) through the. u n i t direction:
By uniqueness
p j-ev
= Pj
for a l l
j and
, and. t h e
weights
a r e equal
For an expension
t h e same argument gives s h i f t i n g through el
o S-e, we add
= oj
for
v = 2,. ..,n
; after
1 t o find
Uniqueness now gives
uo
jlp
, .so t h a t
This means t h a t t h e l a s t sum i s a polynomial of t h e form required by ( i i ) t t h e induction i s obvious. Thus t h e statement i n t h e introduction may be rcgarded a s a corol.lary t o Theorem I Remnrk.3.
Thc condition G(0) # 0 i s i n general not s , if the necessary f o r clpproximation t o be possible i n ?I
G . Stkang-G. Fix
r e s t r i c t i o n (2.3) on t h e weights i s removed. example t h a t origin. n = 1 , and

tp
Suppose f o r a t the p
has a zero of o r d e r
p
Then i f (2.1) holds with

W~
replaced by h-'
+
W e
one can c o n s t r u c t ireights holds.
of order
such t h a t (2.2)
A
A converse of t h i s r e s u l t is, a l s o poqsible.

( I
emphasize tha,t i n such a s i t u a t i o n t h e transform at
vanishes Therefore
all
2nd
, and
(1.14) i s s a t i s f i e d a t
so
= 0
t h e associated Rite-Galerkin system w i l l be & e r i c a l l y s t a b l e , and such a choice of Remark 4.

y
un-
i s t o be avoided.
Theorem I can be made much more p r e c i s e i n a (These refinements may be o f l i t t l e
number of d i r e c t i o n s .
i n t e r e s t t o t h e s e n s i b l e reader, who wants to. .get on with t h e p l o t ; he can s a f e l y disregard Thcorem It. ) show t h a t t h e exponent pkl-s b e s t p o s s i b l e f o r any u f 0
First, we can
in t h e e r r o r estimate (2.2) i s
, and
f i n d t h e infinum of constznts
c s f o r which t h i s estimate holds. Second, we note t h a t t h e A smoothness of y ' a n d t h e order of t h e d e r i v a t i v e s . o f tp
which vanish a t t h e p o i n t s 27rj were s p e c i f i e d i n Theorem I

by t h e same i n t e g e r
p
This r e l a t i o n i s t h e most e f f i c i e n t
I n p r a c t i c e , anc! consequently t h e most common, but t h e r e i s n o . a n r i o r i reason why t h e two i n d i c e s must agree. folloiring we allovr rp where
.q
q (p
Insthe i n W:
t o be l e s s smooth, say 'cp
.
cp
There i s n o a s e i n permitting e x t r a smoothness, Third, we
>
, since
t h e e s t i n ~ o t e sa r e n o t improved. r a t h e r than assunling it.
strengthen t h e converse p a r t of t h e theorem by deducing t h i s smootlmess of
A f u r t h e r generalization,' which Ire s h a l l forego, i s t o
give estimates a l s o i n f r a c t i o n a l and negative norms.
In
f a c t t h e reader can v e r i f y t h a t such r e s u l t s follow d i r e c t l y from our proofs5 t h e norm (1.3)
, involving
s
p f 2
t h e Fourier t r a n s W e a l s o omit any
form,applics equally well t o a l l r e a l discussion of ' L ~ estimates f o r
,m
The more p r e c i s e version of Theorem I i s T H E O R E M 1 ' . For any i n t e g e r s p

q
, the
following
conditions a r e equivalent: (I)

rp
l i e s i n :(?
, {(o) #
, and
(ii)
. cp
lies in
1~:
, and
for
la1
(p
ja * ( t - j)
i s a polynomial Xn
term
eta , c + o .
u in
XP+l
tl ,.
..,tn
, the
&th
function
leading
(iii) f o r each h+O

J
i s a d i s t r i b u t i o n with compact support, and t h e r e a r e weights
wh
such t h a t a s
The exponent every u
pi-1-s p
i s b c s t p o s s i b l e . f o r every
and
,if
i s t h e l a r g c s t i n t e g e r f o r which ( i )
G. Stran.g:G. Fix
holds.
I n one dimension, t h e g r e a t e s t lower bound of possible
constants cs i s
With' n
>
1 t h e l a s t f a c t o r becomes
where " :a
w
i s t h e d e r i v a t i v e of order
Cs
p+l
i n the direction
Q
(This constant
i s computed i n [Mlfor several
;)'
M e must emphasize t h a t we have estimate (2.2) holds, with know only t h a t f o r every u range h cs = Cs
not
established t h a t t h e
, for
each h ; vre
<
b(e,u)
i n ?ip+l , it w i l l hold i n some a r e i n f a c t nothing
Our constants Cs
b u t t h e constants involved i n approximating polynomials of degree p+l
Ve know from condition ( i i ) i n Theorem I t h a t p can 'be reproduced exactly, and any smooth function w i l l Therefore it should be no u
t h e polynomials of degree
t h e point i s t h a t f o r small h resemble a polynomial l o c a l l y . t h e telms of degree p+l function u
s u r p r i s e t h a t t h e e r r o r . i s asymptotically a t t r i b u t a b l e t o i n a l o c a l Taylor expansion of
I n f a c t we make t h e f o l l o v i n g conSecture:
f o r every snooth
of one variable, t h e r a t i o of e r r o r i n b e s t
approximation in constant Cs
~ ' ' ~ h
~\fl~u)) approaChes this kame L
This means that the constents Cs are a very useful criterion for the comparison of two different finite element ) functions cp (given that they have the same value of p
j
the constants are relevant not just to some extreme choice of

u
, but
rather to all choices. With n
>
, the situation
be a tiny subset namely
seems to be essentially the samej there of u

A
for which the asymptotic constant exceeds Cs
those with u supported away from the optimal direction w
(We hope to show elsewhere that this conjecture extends also to ripproximation in the ,maximumnorm.) 'Novr we consider the approximation problem when the space is generated by several functions p l . h case there are N unknowns viJJ i lJ...JN
S "
; :
...,*
In this
, to be
computed at each meshpoint from the finite element equations vh = fh
The merit of such an extension'is to make high
accuracy p possible.xvithreiatively.sLmple functions cpi
their support can be small, so that frequently the required inner products are easier to compute and boundary conditions simpler to match, and they can have additional interpolating propcrtics. each In the one-dimensional Hermitc case, for example,
vi
is supported on the interval [-1,1]
, and the
1-1 st
is the only one of the first N - 1 derivatives to be non-zero
G. Strang-G. Fix.
at the origin. This means that the quantities vi,j
which
satisfy the finite element equation have physical significance in themselves, as the "displacement", "slope", "stress", etc. of the approximate solution at the meshpoint x has been found very attractive by users.
=
jh
This
TIEOREM 11.
Suppose
" , ... e r " ,N
in ?lz
Then the
following conditions are equivalent:

(i)
there are linear combinations Pa
of the q i which
satisfy (2.194
A
~~(= 0 la, ) y0(2nj)
= 0
for j #
for all j E z n , 1 s l a 1 L p
(ii) there are linear combinations Y , satisfy
of the cpi
which
(iii) for each u that for s = D , l , .
in ?tP+l there,areweights w
. .q,
i, j
wch
G. Strang- G. Fix
Remark 5 .
Babuska has asked us whether t h e following
condition on t h e ' (iv) ql,.
vi
i s equivalent t o those i n Theorem 11:

0
t h e r e i s a f i n i t e l i n e a r combination and t h e i r t r a n s l a t e s
of
..,%
via
which s a t i s f i e s t h e
conditions imposed' i n t h e case I?.= 1 : ~ ~ f i ( 2 r= j )0 f o r "la1 ( p
(2.23)
,j #
W e s h a l l prove t h a t
t h e simpler case
( i ) => ( i v ) => ( i i i )
, so
t h a t Babuska's
i n s i g h t allolvs t h e reduction of many 'N-dimensional p r o b l e m t o

N = 1
W e note t h a t t h e t r a n s l a t e s admitted
i n . ( i v ) a r e t h e functions
It i s obvious t h a t ( i v ) => ( i i i ) : i f t h e function s a t i s f i e s t h c conditions of Theorem I, then combinations
G. strang-G. Fix
C w?
J .
oh can be used t o
To prove t h a t
Jy-l 5 P
J
approximate
u a s required i n (2.21-22).
e l e t t, 191, denote t h e unique such t h a t
( i ) => ( i f f ) p
, we
polynomialof degree for
jointlyin
..., e
Then we define t h e required
C l by giving i t s Fourier transform:
Since t, Y, l a t e s of
i s t h e trznsform of a f i n i t e combination of t r a n s -
Y ,
, this
n i s indeed a f i n i t e l i n e a r combination
and W e i r t r a n s l a t e s .
t,
of t h e o r i g i n a l qi
By t h e p e r i o d i c i t y of t h e
which by (2.lga) equals one i f To v e r i f y (2.23) of a product:
j = 0
and zero otherwise.
, we
use t h e Leibniz r u l e f o r . t h e d e r i v a t i v e
G..Strang-G. Fix
replacinp, a
by
-P .
h
This vanishes, by t h e fundamental

Ya
property (2.19~)of t h c
, 2nd
condition (iv) i s verified.

j= 0
W e note t h a t (2.23) holds even f o r

1 ( In1 ( p
, when
Tnus t h e
qa
which were constructed i n t h e
~ i - o o fof Theoren I reduce -to boa u
, when
rve a r e approximating
by combinations of t h e functions
".
h
W e t u r n now from t h e mean-square zpproximation. of a f u n c t i o n
and i t s d c r i v a t i v c s t o t h e problem of p o i n t a i s e approximation.
The l a t t e r i s c r u c i a l nwncrically, even th0up.h t h e origfr:al d i f f e r e n t i a l problem ( e . ~ .t h a t of minimizing a quadratic f u n c t i o n a l ) l e s d s more n a t u r a l l y t o t h e former. tv:o p toqcthcr.
It i s a
funda!ncntal orovcrty of Lhe f i n i t e elernent method t h a t t h e

GO
By t h i s vie mean not only t h a t t h e order
o r t h e b a s t approximetion i s t h e sane i n t h e t m
nonns, a s t h e next theorem shov~s, but a l s o t h a t t h e o r d e r of t h e a c t u a l approxi?nntion by t h e Ritz-Galerkin function
r
uh
G. Strang-G. Fix
i s t h e same i n both norms. of p+l-s and 2(p+l-m)
This exponent
i s t h e smaller
.
cpi
The following estimate includes known r e s u l t s f o r spline approximation, i n t h e special case of equally spaced knots. Splines a r e t y p i c a l of many important choices of t h e t h a t t h e i r derivatives of some order
q
,in
have jump discontiunities;
( I n t h e spline w e note t h a t t h i s leaves them s a f e l y i n W ; case, q i s the. degree of t h e polynomial i n each interval, and coincides h%th t h e accuracy exponent T H E O R E M 111. Then i f Suppose p .) s a t i s f y t h e conditions of
q
Q1,...,vN
t h e previous theorem, and have bounded derivatives of order
i t follows t h a t
Proof. part of x/h i n powers of
W e write
and
x=kh+ th
, where
i s the integral
O l t v < l
l i e s i n t h e u n i t cube
Expanding
G . Strang-G. Fix
We know from the s t a r t that uI1 cannot be c l o s e r to u than the optimal

h approximation from the space S o, r: w and 11.
Therefore the e r r o r , measured i n H~
will be a t best of the o r d e r hptl-s determined i n Theorems I The question i s whether the approsimation produced by the finite Thcre i s no a p r i o r i reason
(3 i n
clement is actually of this optimal o r d e r .
why this should always be so; i n fact, if we fix
H~ and i n c r e a s e the
It exceeds
o r d e r 2m of the equation, t h e r e i s every reason to think otherwise. s e e m s unlikely that the e r r o r in 14' would stay of o r d e r p
+ 1 until m
p, a t which point cp would no longer lead to admissible t r i a l functions and

the method would collapse. Therefore we anticipate that tlie o r d e r of
accuracy will depend on m a s a c l l a s p and s . Thc c o r r e c t o r d e r can in fact he d e t e r ~ ~ l i n e over d the range
0
Q
s 4 m by an elegant variational argument which me lkarned from
Marlin Schultz. A special but still typical c a s e of this argument h a s been published by Nitsche ( 2 1 1 , and Aubin has shown us all alternative route to the s a m e result. We begin with the fundamciltal result (cf. Varga 1221) that i n the case s
= m, the c r r o r uh
- u i s indeed of the optimal o r d e r hp t l -m

Repeating the standard a r g u -
allowcd by Lhc approximation theorems.
equations (1. 5) and (1. 7) that ment, we deduce from the v a r i a t i o ~ l a l a(uh
- u, wh ) = (f, wh ) 2 -ullHm
h (f, w ) = h
o
-u)
1 1. h for a l l w ~n s
(2.31)
Therefore by ellipticity plluh

6 ~ e a ( u " u,u
G. Strang-G. Fix
ior a l l w
h
h Cancelling the f i r s t i a c t o r on thc right, and choosing w
a s the optimal approximation to u,
p 1-1 IIerc we ticed u i n 13 i n o r d e r l o apply T h e o r e m s I and 11, and t h e r e fore we a s s u m e that f l i c s in Hp'1-2m. If i is l e s s smooth, the e s t i It i s t o the e s t i m a t e
m a t e s of this section a r e not difficult t o r e v i s e .
(2.32)that we m a y apply o u r calculation i n Theorem I' of the m i n i m a l constant C m i n approximation.
Now we give Schultz's a r g u i l ~ c n t for s
< m. I t begins with t h e
adjoint problem LQv = g, which i s equivalent to the variatioilal equation

'm a(y, v) = (y, g) f o r a l l y i n N
(2.33)
Again the cllipticjty of a guarantees a unique solution v f o r g i n H - ~ , and i u r t h e r ~ n o r c that
Taking y = ul' h (U h . i o r a l l w in
U,
u, and recalling (5. I ) ,

11 g) = a ( u
- u,
I)
= a(uh
- u, v - wh ) .
(2.35)
sh.
Therefore
I(u"-u,E)~
K~~u''-uII~~~~h v -llHm w
(2.36)
To e s t i m a t e the l a s t t e r m , we choose wh a s the b e s t Hm a p p r o x i m a ~ i o n t o v, and appeal to T h c o r c m s I and 11:
r
iipf 1 4 2m-s (2.37) i f p f l > Z m - s
I1
G. Strang-G. Fix
(In tLc second c a s e we reduced p t o 2m
- s r - 1 before applying the
approximation theorems; if t h e i r hypotheqes hold f o r a given p they

certainly hold for a s m a l l e r on;. ) Substituting @.32)and@:37) into ( 2 . 3 6 )
and using ( 2 . 3 4 ) , we have
r = min(p
+ 1 - s, 2(p + 1 - in)) .
(2.38)
Now a s g r u n s over thc unit b a l l i n I - I - ~ , the s u p r e m u m of the left side is exactly the n o r m of uh e r r o r estimate i s T h e r e f o r e the finai - u i n t h e d u d space H ~ .
We notice that with s = m the f i r s t expression i n r is the s m a l l e r , and
r =p
+ 1 - rn in agreement with
(2.32).
G . Strang-G. Fix
C. C , Zienkiewicz, "The f i n i t e element method i n s t r u c t u r a l and continuum mechanics, " London: McGraw-Nil1 (1967)
R. Courant, "Variationel methods f o r the s o l u t i o n of problems of e u i l i b r i m znd vibrations," Bull. Amer. Math. Soc. 49, 1-23?1943)
f i n i e s c j u i peut fournir des bornes superieures ou i n f e r i e u r e s , " Comptes Hendus 235, 995-997 (1952)
G. Polya, "Sur une i n t e r p r e t a t i o n de l a mcthode cles differences
J. P. Aubin, "Behavior of t h e e r r o r of . t h e approximate
solutions of boundary value problems f o r l i n e a r e l l i p t i c operators by Galerkin's and f i n i t e difference methods," Rem. Sem. Mat. Pado-ra.
G. Fix and 6 . Strang, "Fourier analysis of t h e f i n i t e elernent method i n Ritz-Galerkin theory," Studies i n Appl. Math. 48, 265-273 (1969)
D. Schaeffer, "Approxination of e l l i p t i c boundary value problemllby difference equations; I. Factorization of t h e symbol, J. Functional Analysis 1970.
V. Thomee, " E l l i p t i c difference operators and D i r i c h l e t ' s
problem," Contr. Diff. Eqns. 3, 301-324 (1964).
dea operateurs d i f f e r e n t i a l s , I' Bull. Soc Math. Prance, Memoire 12 (1957).
J . P. Aubin, "Approxim9tion des espaces de d i s t r i b u t i o n s e t
G. Birkhoff, M. H. Schultz and R. S. Varga, "Hermite i n t e r polation i n one and more v a r i a b l e s with applications t o p a r t i a l d i f f e r e n t i a l equations, " Elmer. Math., 11, 232-256 (1968)
M. II. Schultz, "Rayleigh-Ritz-Galerkin methods f o r multi-
dimensional problems,
"
SUM Numer. Anal. 6, 523 -538 (1969)
F. DiGuglielmo, "Methode des elements f i n i s : une famille dlapproximations des espaces de Sobolev par l e s t r a n s l a t e s de p-fonctions," Manuscript, 1970.
I. Babuska,
"Approximation by H i l l functions,
''
t o appear.
J . J. Goel, "Construction of basic functions f o r numericAl u t i l i z a t i o n of R i t z l s method," Numer. Math. 12, 435-447 (1968). V. Thomee,"On t h e convergence of difference quotients i n
)I.
e l l i p t i c problems," Univ. of Maryland, Note BPI-537 -(1968).
Zlamal, "On the. f i n i t e el ernent method, " Numer. Math. 12, 394-!+09 (19.58)
G. Strang-G. Fix
. Varga, "Quzdrature [16] R. J. lierbold, M. H. Schultz, and R. S schemes for the numerical ,solution of boundary value problems by variational techniques, Aequationes Mathenaticae 3, 96-119 (1959). .
[1.7] R. Boas, :Entire functions," New York, Academic Press (1954).

[18] I . 3. Schoenberg, "Contributions to the problem of approximation of eauiclistant dcta by analytic functions, Parts A and B.," Quart. Appl. ihth. 4, 45-99>112-141 (1946).
[lg]
3. H. Bramble, and S . R. Hilbert, "Bounds for a class of
linear functionals with applications to Hermite interpolation." Numer Mathematik.
r2.01 G. Strang, "The finite element method and approximation theory,
Numerical Solution of Partial Differential Equations I1 (SYI'TSPADE), Academic Press, 1971. [21] J. Nitsche, "Ein Kriterium fur die auasi-optimilifat des Ritzschen verfahrens," Numer. Math. .11, 346-348 (1968).
1221
R. S. Varga, 'Hermite interpolation-type Ritz methods for

two-point boundary value problems," in: "Numerical solution of partial differential equations," 3. H. Bramble, Ed., New York: Academic Press, 365-373 (1965).

Strang 1973

Caricato da

Informazioni sul documento

Copyright

Formati disponibili

Condividi questo documento

Condividi o incorpora il documento

Opzioni di condivisione

Hai trovato utile questo documento?

Questo contenuto è inappropriato?

Copyright:

Formati disponibili

Strang 1973

Caricato da

Copyright:

Formati disponibili

CENTRO INTERNAZIONALE MATEMATICO ESTIVO l(C. I. M. E.

A FOURIER ANALYSIS O F T H E FINITE E L E M E N T VARIATIONAL METHOD

Corso tenuto ad Erice

In our book, the emphasis will be s h i f t e d to the "nodal f i n i t e element

In t h i s case splines a r e muoh l e s s

convenient, and Fourier analysis i s impossible. remain valid

The customary description of t h e r e s u l t i n g

method i s i n v a r i a t i o n a l terms, and we s h a l l introduce i t i n t h i s To think- of it a t t h e same time a s a difference scheme

points from conventional difference equations.

and boundary cqnditions.

Thc evidence i n t h i s comparison i s s t i l l

Such t r i a l f u n c t i o n s t h e r e f o r e make t h e Ritz method

esgeclally s i ~ p l e t o execute, and it seems very l i k e l y t h a t t h i s

and t o t h e complexity of t h e method.

i s i n i t i a l l y .worth t h e p r i c e ; b u t j u s t a s Newton1 s method i s more

Thc e s s e n t i a l f e a t u r e s of t h e method a r e t h e subdivision

following systematic way. vl(~),

To ensure a l o c a l basis, we i n s i s t thak t h e s e

The eventual Rayleigh-Ritz

a r e simply t r a n s l a t e s of these rescaled functions b a s i s i s thus composed, f o r each h

i s a pyramid with vertex a t t h e o r i g i n and with base

formed from t h e neighboring-triangular elements.

and 3.ts t r a n s l a t e s span t h e space

how well conditioned a r e the eauations from vrhich uh i s determined numcrica.lly?

norms i n which accuracy and condition number a r e measured. order eh = uh

want t o study these questions f o r e l l i p t i c o ~ e r a t o r s of a r b i t r ~ r y

These norms measure t h e e r r o r

i n t h e mean-square and po5.ntwise senses

In Sobolevts notation the space 9fS is written : W

most of the second step in the analysis is already complete;

In his terms, our contrifunctions to which his theory can

bution is to determine extend. the reference [ 4 ] .)

For problems on t h e whole space runs over t h e s e t 2 "

f o r t h e Fourier transform, where xlcl

W e f i r s t describe t h e app1ication:of t h e Ritz-Galerkin method a b s t r a c t l y , t o e l i n e a r e l l i p t i c problem on ventional i n n e r product with a b i l i n e a r form

,.t h e problem begins

t h a t is, whenever a l l lie in L ~ ( R ~ )

p a r t i a l d e r i v a t i v e s of order not exceeding m The fgrm i s c a l l e d ? ? - e l l i p t i c provided t h a t

au a ? ... + axn ax, dx .

To satisfy (1.4), and thereby eliminate this non-uniqueness of the

An elliptic form induces the follovring variational problea: given f

This problem has one and only one solution u

For t h i s we i n t e g r a t e t h e l e f t s i d e of (1.5) by p a r t s , s h i f t i n g a d e r i v a t i v e s from w onto q D u The r e s u l t i s ( L U , ~ ) = (f,~),

which i s equivalent t o (1.6); L given by

Many applications l e a d a l s o t o problems of I n such c a s e s the form of a(u,u) a

a sequence of closed subspaces

problem, w r i t t e n v a r i a t i o n a l l y , i s t o f i n d a ( uh ,wh ) = (f,wh) for a l l

Therefore we put t h e approximate

and compute t h e vector i n t o (1.7))

where t h e e n t r i e s i n t h e coefficient matrix and t h e inhomogeneous vector a r e given by

t o take t h e b a s i s functions i n groups

Normally ' t h i s system i s analogous t o D(LU) = Df

We want now t o summarize our r e s u l t s .

A more extended summary

hope t h a t our discussion there-, i n terms of t h e model problem

be a u s e f u l supplement t o t h e present pzper.

set nsidc extensions t o eigenvnlue problems and parabolic equations,

and dcscribc our conclusions only f o r t h c problems of convergence and s t a b i l i t y s t a t e d above.

G. Strang-G. Fix ii) The problem 6f s t a b i l i t y i s t h e simpler, Hcre t h e

For t h i s uniform independence we f i n d t h e folloering necessary and s u f f i c i e n t condition:

t h e r e e x i s t s no n o n - t r i v i a l combination such t h a t t h e Fourier transform of

The reader w i l l n o t i c e t h a t everything depends on t h e

i n more general boundary problems.

For d i f f e r e n c e equations which