Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
9 by Springer-Verlag 1980
Numerische Mathematik
Summary. Dominated solutions of a linear recursion (i.e. solutions which are outgrown by other ones) cannot be computed in a stable way by forward recursion. We analyze this dominance phenomenon more closely and give practically significant characterizations for dominated and dominant solutions. For a dominated solution, in particular, this leads to a stable computational method.
such a solution x is said to be dominated by y (cf. [13]). Since the computation of dominated solutions by means of (1.1) is unstable (cf. [4; 5]), other ways have to be explored. A well known way to find approximations to the "most dominated solution", is Miller's algorithm (cf. [4; 9], see also w an interesting alternative for the summation of such a solution in the scalar second order case can be found in [3]). Quite a new approach was given in [11] (see also [12; 10]), where it was indicated how a dominated solution could be determined in a stable way using an approximating boundary value problem. Although this has in part solved the problem of computing "intermediate solutions" for the scalar case, there is still need for a more general approach; moreover these results do not lend themselves to a straightforward generalization to MVRs (these MVRs seem particularly interesting from the point of view of discretized differential equations).
0029-599X/80/0035/0421/$04.40
422
R.M.M. Mattheij
In this paper we therefore consider the question of developing techniques for computing a solution f of [A], which is dominated by a certain number, k say, of independent solutions, and which is not dominated by ( n - k - l ) other independent solutions (the last set of solutions then is also dominated by the first mentioned set). We start with investigating the dominance of solutions more closely. It turns out that not only the magnitudes of the iterands of a solution are important quantities for having a practically useful kind of dominance but also their directions (93)- The latter fact (which makes the problem really more difficult than e.g. in the scalar second order case) suggests to us a geometrical approach, which is described in a special section (94). With the concepts of dominance, and our geometrical tools, we show in 99 5, 6 how a fairly general solution can be decomposed into a component having the characteristics of solutions that dominate f and a component having the characteristics of dominated solutions. This leads to a stable computational method for f (see 97). In this paper we do not give computational details of this method; these can be found in a subsequent paper [8]. Our intention here is to describe a general approach for ruling out instabilities caused by dominant solutions. This generality is such that it also allows for a geometrical interpretation for some other existing methods, like computing special functions or solving boundary value problems. Again, this will not be worked out here. For an example we refer to [7, 919], where it is shown how Olver's algorithm can be looked upon in our way.
2. Some Conventions
where x I =(~1 .... , ~k)T and x 2 =(~k+ t,---, ~,)r. Let A be a matrix; then this partitioning induces a partitioning for matrices for which we use the following notations:
{A 11 A12~
A = \A21
A22]=(A1]A2),
(2.2)
if q __> p if q<p
(2.3)
423
f i Ai={Aq..;Ap,
i=p
*
if if
q>p q<p
(2.4)
the matrix having v t .... , vk as respective columns. Generally a matrix having k columns (each being an element of (17") is called a k-matrix. F o r a matrix V (whose columns are elements of ') we denote the subspace of C", spanned by its columns, by span (V). Let X = {X~}i~=o denote a sequence of k-matrices. If the columns of the X i are
D
the iterands of respective solutions of the M V R [A] and m o r e o v e r if X 0 has rank k then X is called a k-solution of [A]. (Note that the sequences of respective columns of a k-solution are independent solutions.) Especially, an nsolution is often called a fundamental solution. F o r a sequence X as above and fixed matrices M and N of suitable dimensions, the sequence {MXiN}i ~ o is denoted by MXN. For each i let X i be given by
(2.6)
then span(X) denotes the linear space generated by the sequences {xi(l)}~=o, l = 1 .... ,k. If x e s p a n ( X ) is not the sequence {0,0,...} then we indicate this by x4=0.
(2.7)
N o t e that glb(.) is also unitarily invariant. If for matrices L and M both glb(L)=glb(M) and lub(L)=lub(M) then we shall say that L and M have the same magnitude and likewise if these quantities are of the same order in some context, then L and M are said to have magnitudes of the same order.
424
R.M.M. Mattheij
3. Dominance
In w1 we introduced a notion of dominance for solutions of [A]. If a solution g dominates a solution f then ag will also dominate flf for any scalar a and fl (a~0), i.e. any g~span(g), g + 0 , will dominate any f e s p a n ( f ) . For matrix solutions we therefore have the following possible generalization.
Definition 3.1. Let G be a k-solution of [A] and F an/-solution of [A]. Then G is said to dominate F if any gEspan(G), g:t=0, dominates any f e s p a n ( F ) . In fact 3.1 defines dominance for solution subspaces and G and F are merely used to indicate bases for these subspaces. The existence of such subspaces is reported e.g. in [7; 13] assuming some very mild conditions. Especially for the case l = n - k one may ask whether they are unique, and if so, how one can find numerically appropriate representations for them, which will then allow a stable computation of solutions. The second of these questions - the main topic of this paper - will be dealt with in w5 ff. We first consider the unicity problem. For the second order scalar case with n-- 2 and k = 1 it has been shown in [4, p. 25] that any dominated solution is a multiple of a certain f say, i.e. span(F) is unique, whereas any linear combination of some f~span(F), f 4=0, with a gespan(G), g4:0, defines a dominant linear subspace, so span(G) is not unique anyway. We shall prove that span(F) is also unique in our more general situation. For this we need the following:
L e m m a 3.2. Let G and F be as in 3.1. 7hen (G]F) is a (k +l)-solution of [A].
?t
V2 '
ViGiv I = - V / v 2
If v ~ is zero, then Fo v2 =0=~v 2 =0. This contradicts the assumption v 4:0.
(1)
If v I is nonzero, then lim IJFiv21[ =0, which contradicts (1). Hence there
i~
l[Givl II
[]
Corollary 3.3. Let G and F be as in 3.1 and l = n - k , then (GIF) is a fundamental solution.
Theorem 3.4. Let G and F be as in 3.1 and l = n - k .
7hen span(F) is unique, i.e. any other dominated (n-k)-solution P equals FP, where P is a nonsingular ( n - k ) x (n - k) matrix (which depends on the choice of F). Proof: Let (~ be a k-solution of [A] that dominates an (n-k)-solution te of [A]. Then for some nonsingular matrix M there holds (d[F)=(G[F)M (cf. 3.3). Write [M 11 M12~ M = \M21 M22] (M 11 k-th order).
425
(a) Suppose M 11 is singular. Let the k dimensional vector v I # 0 be such that and define g=(~v 1. Then ~,=FM21v1. Since gespan((~), ~ must dominate F. On the other hand we also have gespan(F), so G dominates ~. From 3.2 it now follows that the rank of (go]fro) equals ( n - k + l ) , and that the rank of (Gol~o fif0) equals (n+ 1), which is absurd. Hence M 11 is nonsingular and, consequently, G dominates F. (b) Suppose M12#0. Then there exists an f~span(P), which has a nonzero component in span(G). This, however, contradicts the result of part (a). So M 12 =0.
Mllv1=O
[]
A well known method for approximating a solution f of [A], which is dominated by some (n-1)-solution of [A], is Miller's algorithm [9]. We briefly sketch it here: Let ViAi be nonsingular and let, for some (suitably large) N, a N be N o a vector which is not orthogonal to fN. Then a sequence {f~( )}i=N, obeying fN(N) = aNand JI(N)= A F lf~+I(N) ' i = N - 1, ..., 0 is computed. Thinking of it as a generalization of inverse iteration, one may expect that f~(N) becomes directionally close to f~, the better the larger (N-i). Therefore, an approximation {~}~'=o to {fi}f=o (P suitably smaller than N) may be found by normalizing the sequence {f~(N)}f=o, say using the initial condition fo; define e.g. f~(N) =fi(N)Ilfoll/Nfo(N)l[. For a certain set {as} (e.g. all equal to some fixed vector), we can say that the algorithm is convergent if lira ~(N)=f~, for i fixed.
N~oo
However, dominance according to 3.1 may not be sufficient to ensure convergence; we demonstrate this phenomenon by an example (cf. also [9, w3]).
i>=0
Let
Gi=
(i+1,i2;)1 +
0 ~
and 0 0
-1=
f / = Fi =
i>_O.
Then G is a 2-solution and f = F is a 1-solution of [A]. Obviously G dominates f. It can be verified that
N--1
N~1)2 ~ 0
N N+Io
(j~=oAs)
N+IN1
1) 1N
426 Define
[N-1 \--1
R.M.M. Mattheij
and
Thinking of inverse iteration, a reasonable requirement for convergence would seem: VuauCspan(Gu), i.e. 7+0. However, writing out fo(N) explicitly, we find: fo(N) =
fl-~)N+74(N+l)~,(fl-7)N+fl, 7
gence if fl=7 +0. It is clear that the latter requirement is a lot more restrictive than the former. An other point is that the choice fl = ~ is unsatisfactory from a stability point of view, being very sensitive to rounding errors. [] In order to understand the ill conditioning in 3.5 one should observe that the small angle between GN ((N +01)- 2) and GN ( ( N +0) _ I ) (both vectors having l about unit length) may cause cancellation; i.e. a N has large multiples of each of these vectors compared to the component in the fN direction. Apparently, the (]~[N-1 Aj )-1 dominance does not ensure that ~-~_ a N has a comparably large com\j=o_
ponent in the f0 direction. This can also be put in the following way: Define
g(N)=G((No
1)-2 ) and
h(N)=G(-1N+l).
Then gN(N)and
hN(N)
form an
orthonormal basis for span(GN). Generally therefore a N will contain multiples of gN(N), hu(N ) and fn (which also does not make a small angle with the other two vectors) which are of comparable size. However, 1 ]]go(U)[l=(N+l)2 and llfoll=]/3 but ]tho(U)[] = ] / 2 N z + 2 N + l ;
so the component in the dominant space has grown about N times as fast as the one in the desired direction (viz of fo). [] In order to exclude situations like above, we introduce a stronger dominance concept:
IIf01t/tIIg0HJ
Itfi[[ /[][gitt ] = 0
In 4.6 we shall show that an appropriate separation of the directions of the columns of the G i together with dominance of sequences of these columns with respect to similar sequences of span(F) implies uniform dominance. For practical use of 3.6 the following property is very helpful.
Property3.8. G dominates
F uniformly ifflim
lub(Fi) O. i ~ glb(Gi)
427
Proof." If: Let gespan(G) and fespan(F). Then there exist vectors v and w (with k and l coordinates respectively) such that g = Gv and f =Fw. We obtain: []fll /[Ngi[[ ] < lub(F~) [lw[[ lub(Go) [[vl[ Ilfol[/LilgoIl/=glb(Fo) Ilwii glb(Gi) llvtl' from which the assertion follows using a limit argument. Only if: Let w(i), with }tw(i)[I= 1, be such that lub(F~)= llF~w(i)[I and v(i), with ]lv(i)l[ = 1, be such that glb(Gi)= IIGiv(i)]l. Then
lub(Fi) ]]riw(i)ll<lub(fo) ]IFiw(i)[I / [ IlG~v(i)[[ ] glb(Gi)- [[a~ v(i)l[ = glb(ao lifo w(i)ll L ~ v~(_l"
Since we can find for each i solutions Gv(i) and Fw(i) (i.e. sequences {Gjr(i)} and {F~w(i)} respectively) of which the i-th iterands satisfy this inequality, the proof again follows from a limit argument. []
Remark 3.9. We like to show that requiring 3.6 comes up to our expectations
regarding the convergence of Miller's algorithm. Let a N be a starting vector as before, and write aN=GNaN+fsa^2 (here G is a (n-1)-solution which dominates uA1 f uniformly). We assume that aN4=0 has a not too small component in the direction of fN, i.e. we assume that there is some e>0, such that VN[IGNfi~I[ <e}l/Nfi~lI (note that his was not sufficient in 3.5, and is very mild, cf. [9,w Then
i.e.
[]
It is hoped that the preceding examples have convinced the reader that the stronger dominance concept 3.6 is a more satisfactory one than 3.1. Therefore it is reasonable to reformulate the problem-setting of w1, by requiring that the desired solution f is uniformly dominated by some k-solution G, whereas f is not dominated by ( n - k + 1) other independent solutions (which - together with f constitute a uniformly dominated (n-k)-solution). This will be our new starting point. (At some places we shall also give a justification of the stronger requirements.)
428
R.M.M. Manheij
to use this notion for a geometrical interpretation of transformations that will play an important role furtheron. We have (cf. [2, p. 9])
Ix*yl
IIxll IIYll "
Then 0, with 0 < , 9 < 2 is called the angle between X and Y..
Remark 4.2. Obviously ~9 in 4.1 does not depend on any particular basis for
span(X) and span(Y). Since 4.1 is based on o r t h o g o n a l projections we can translate cos 0 into a m o r e accessible quantity which uses orthogonal bases for span(X) and span(Y). Therefore, define for any /-matrix M by M a matrix with o r t h o n o r m a l column vectors and (possibly) null vectors, such that for some upper triangular l-th order matrix U there holds M =MU (4.3)
( M U can e.g. be thought of as obtained by a G r a m - S c h m i d t process, cf. [14, p. 242 ff ]). We now have:
Property 4.4. Let 0, X and Y be as in 4.1. Then cos 0 = ItX*u Proof." F o r any v and w of suitable dimensions there holds:
IlXv,l=llvll
So IIX*YI[= max v,w,O tv*X*Ywl IIv[I[Iwl[
and
IIYwll=llwll.
[]
Property4.5. Let 0, X and Y be as in 4.1. Let Z be an /-matrix such that span(X[ Y) = span(XIZ), and Z* X = 0. Then sin 0 = g lb(Z* Y). Proof: We can write out Y in c o m p o n e n t s of X and Z : Y = X X * Y + Z Z * Y .
Hence for any 1 dimensional vector x we obtain from Pythagoras' theorem: tlX*Yxll2 + IIZ*Yxll2 = [IYxll= = llxll 2. So for x4=0 IIZ*YxII ~ 1 / 1 - cos2 0 = sin o lix[I (cf. 4.4). If x is such that ilX*Yx[I = IIX*YII Ilxll, then the equality sign holds, i.e.
glb(Z*Y)=sinO.
1
[]
429
The next theorem now gives a sufficient condition for uniform d o m i n a n c e in terms of basis solutions and angles, as announced above: T h e o r e m 4.6. Let G and F be a k- and l-solution respectively of [A]. Write Gi=(gi(1)[...[gi(k)) and Fi=(fi(l)[...Ifi(1)). Let O~(j) be the angle between span(giU)) and span(g,(1)l...l&(j- 1)]gi(j+ 1)[...Igi(k)). I f lim i~
t=
then G dominates F uniformly. Proof: Let h~espan(Gi) be such that h~(g~(1)]...lgi(k-1))=O. Since for any k
dimensional x
glb(G,)>= min
t= 1..... k
IIg~(t)llsinO,(t).
(1)
M o r e o v e r we have
(2)
In view of 3.8, c o m b i n a t i o n of (1) and (2) and a limit a r g u m e n t prove the assertion. []
Remark 4.7. Obviously, the condition in 4.6 is not fulfilled in example 3.5, where
we have m a x IIf/(s) ll min(llg,(t) It sin 0,(t)) = (i + 1) 1/3. An other application of 4.1 is a m o r e or less quantitative interpretation of the skewness of a transformation (a matrix T is called skew if for some directionally well separated vectors their images under T m a k e a small angle). Indeed, let T be a nonsingular transformation and 1/ be the smallest angle r/ between the images of two o r t h o g o n a l linear subspaces, then c o t g ~ = tl Tll I1T - i l l . This result follows from the well k n o w n result in [1], where it was shown for vectors (note that our definition applies to spaces). The quantity 2 I1"I[1 denotes the linear norm
430
R.M.M. Mattheij
iI TIT IIT - 111 is called the condition number. In the sequel we shall use sequences of transformations (for the solutions of [A]), which should not be too skew in order to preserve dominance properties (cf. e.g. 5.9, 5.11). This motivates the following:
Definition 4.8. Let {T/} be a set of nonsingular matrices. If 3~>1 ~ IIT~ll IITi-~ II _-<~, then {T~} is said to be of bounded condition. Especially for partitioned matrices T~=(T~I[T~2) we thus have: Property4.9. Let {T~} be a set of nonsingular matrices. Let for some ~:, and all i, []Ti]] ]1Ti-1 l] < ~c. Let 0 i be the angle between T/a and T/2. Then
t~2--1 ViCOSOi <= 2 + 1 9
Proof: Let xespan(Ti 1) and yespan(T~2), x,y:l:O. Then there exist v, wcR, with
v w, such that x = T/v and y = T/w. Hence, with ti as above (viz ~c=cotg ~), w e xZ_l have cos 0 i< cos r/= x2 + 1" []
Remark 4.10. The estimate in 4.9 does not pretend to give a sharp estimate, but only to indicate that bounded condition implies that the angle between T~1 and T~ is uniformly bounded away from zero. This property wilt be used e.g. in 5.17, 2 5.18.
Dominant and Dominated Solutions of Linear Recursions L i is unitary. Specifically, let H i be such that (Gi[Hi) is unitary and (GilF~)=(GilHi) p22],P1 of order k.
431
(5.1)
Clearly p12 and p22 represent projections of F~ into G i and H i respectively. If the angle between G i and F~ is not small, then the projection of F~ into G i will not be large with respect to the projection of F~ into Hi; so the magnitudes of F~ and H * F i are expected to be of the same order. We work this out in more detail for the case that (GilF~) has full rank; note that it follows from 4.5 that glb(H*F/)>0, so [ H * F i ] 1 exists.
Property5.2. Let (Gill:i) have full rank. Let 0 i be the angle between Gi and F/.
Then I u b(G* Fi[H* Fi] - 1 = cotg 0J-
Let x be such that HG*FixN=cosOiI[x[I. Define yl=G*Fix and y2=H*Fix. Then ]]yl ]]2 + [[y2][2 = [[X[[2. S o
IIG.Fi[H.E]-ly2
[]y2 ii
(5.3)
for
some
e?22 ] = ( i 0 0 i ) ( ~ 1 p/2
0 H.F.~22).
(5.6)
432
R.M.M. Mattheij
If the angle between G~ and F~ is not large, i.e. if [lOll[ is not large (see 5.2) then ~22, p/22 (see (5.6)) and hence p z2 and F i (see (5.5)) have indeed magnitudes of the same order. If we drop the requirement that (G~[F~) has full rank, a similar interpretation of the decomposition (5.1) can be given. We shall not carry this out explicitly, but consider - more generally - decompositions with not skew (hence nonsingular) transformation matrices T~ (T~=L] 1 above). So assume we can decompose (G~IFi) as (GiIF~)= T~
(5.7)
Remark 5.8. By assumption T~~ has full rank, so Vii1~ is nonsingular and span(G~)
= span(Til). The next theorem shows how the magnitudes of G~ and F~ are related to those of V~11 and Vii22 respectively. Theorem 5.9. Let T~ be as in (5.7). Let O~denote the angle between F~ and G~. 7hen 1 <glb(V/l') IIV/111[<
(1)
(2)
glb(U=~) -- i/liEUt']-~
Since V 2 = T - tF, we find using (1)
v 22
(3)
(4)
= E~']*F--(u~)*ET~]*F
(5)
Dominant and Dominated Solutions of Linear Recursions From (4) we find the following inequalities
433
g l b(U 22) g I b([]723 * F)IIV II< ]1V22 II --< IIU 22 II II[i~2] * F II IlF II
(6) (7)
glb(V22)glb([~F2]*F)glb(F)<glb(V22)<IIU221111[TE]*Fllglb(F)
Using (2), (3), (5) and the estimate ]l[~'2]*FH----< 1, the second part of the assertion then follows from (6) and (7). The first part is analogous, but somewhat simpler: Indeed, from G = T 1 V 11 we can show similar estimates as (6) and (7) to hold, by interchanging the roles of T and T-1. []
Remark 5.10. The estimates in 5.9 clearly depend on the angle 0 i, the norm of T~ and of the conditionnumber IlTil[ ]lT~-iill, i.e. of the skewness of Ti. If ]lT/]l and I[Ti[[ I[T/-1H are of the order of unity then Vili and G i on one hand and V/22 and F~ on the other hand have respective magnitudes of the same order, Vii and F~ 22 with the additional assumption that 0 i should not be small. As a consequence of 5.9 and Property 3.8 we have:
Corollary 5.11. Let Ti be as in (5.7) and let {Ti} be of bounded condition. If G dominates F uniformly there holds lim lub(Vi22) i ~ g/b(V/'1) 0.
In order to find a characterization for span(T~-lF~) we would like to have a quantity like O~ in (5.3). Therefore we introduce:
Definition 5.12. Let S denote the ( n - k) dimensional space on which Viz2 (cf. (5.7)) is defined. Then the direction (2~ is defined as a mapping on Vi22S by
If T~=(G~IH~) as in (5.4) i.e. if T~ is unitary, then [tf2~ll equals the cotg of the angle between G~ and F~ (see 5.2). Also more generally, O~ indicates something like a direction of T~-IFi with respect to T~-1G~, which explains the name in 5.12 (see also 5.15).
Remark 5.14. Since F~= T~ (Oi~) Vi22, it follows from 3.4, that f2~ is unique for a
given T~, i.e. for any f, dominated by some k-solution, there holds (T/-lf~)~ =s ~- ~f~)2. For such an f we therefore call f2 i the direction of T~-lfl. F r o m 5.14 we see that the direction f2~ induces a subspace cII~" of i-th iterands of dominated solutions. This important characterization will be used further on for their approximation.
434
R.M.M. Mattheij
The next theorem gives a more or less quantitative geometrical interpretation for the direction. Theorem 5.15. Let Ti be as in (5.7). Let rli be the angle between Ti 1 and Ti 2 and let
0 i be the angle between Fi and G i. Then
(G{F)=(G,H) (I 0 If r a n k ( F ) = n - k ,
G'F\
then A = O , see 5.2 and (5.3). Similar to 5.2 we can prove: ]IA II= cotg 0 (2)
As in the proof of 5.9 we define T - l = [-~x]~.2],; so we find from (3) and (5.13)
(m 22 (IT1] *~ I)V =\
[~']*n~ [,,2],H]
and
( A ) H * F P 22
(4)
02 = H * F p 2 2 s
(5)
Since both span(T 2) and span(H) are the orthogonal complement of span(G), we have span(7 "2) = span(H), so [,~-2], H is nonsingular. From (4) and (5) it now follows that v2= [,T2]*H/32. Hence: tIQII = m a x 11(2v211- m a x II{[~r~]* GA + [TI]*H}/321I 11/321[ v2.o IIv21~-v2.o 1t/3211 II[T2]*H/3211 We have the following estimates: II[~r~]*Gl[ < 11T - 1 II,
1 1 1 - - >-g l b ( [ ' T 1 ] * G ) = t1([7"1]*G)-111 IIG*Tlll --IITll
(6)
(7)
(8)
435
(9)
1
g 1b(([~.2], H ) - 1 ) _
IIEW2q*all = 11T-1 II
>
(10)
Finally, since ~1
2 we
II[~]*HII ~ II[~?a]*HII I1~'111= ~ 1 - II[Te]*HII= II~a I1~ cos r/ll T - : II Application of (7), (9) and (11) to (6) yields Ill211--<-- T - all Ilzl II + IIT - all cos t/} I1T]I {ll and application of (8), (10) and (11) to (6) yields
(12)
11~2tl
>={1
}1
[]
(13)
Remark 5.16. The upper and lower bounds in 5.15 are sharp in the sense that
they are equal for unitary T~. Apparently, the lower bound might be negative, i.e. useless if T~ is too skew. Anyway, we can draw the conclusion that a skew T~ may be the cause of a fairly large llf2ill, even if 8~ is not unduly small. A large llf2ill is an indication that the norm of the first part of the vector T~- aft, i.e. ll(T~- :f~)l II (fGspan(F)) might be large with respect to IIf~ll ; this in turn is undesirable if the magnitude of Vii has to be of the same order as that of F~ (cf. 5.10), as appears 22 to be useful further on (viz w7). We can give an analogue of 5.9 which estimates f2iV~22 in terms of T~ and F~: Theorem 5.17. Let Ti be as in (5.7). Let tli be the angle between Ti: and Ti 2 and 0 i be the angle between F1 and G i. Then
-IIBII {cos 0 , - c o s ~,11T~-: II IIT~II}~ IIO, V~22 II ~ IIGII IIT~- '11 {cos ~i+ cos ~i} IIT, ll
Proof." We omit the indices i. Since T I ~-2 (for 7" see proof of 5.9) we have
436
R.M.M. Mattheij
The estimates can now be derived in a similar way as in the proof of 5.9 and 5.15, from the observation that y2V22 = [~el]* F = ([~v13"I"2) ([~v2]*F) + ([~l]*TX) (IT*]* F) and
Theorem 5.17 confirms what has already been observed above, viz. that a skew transformation matrix may give relatively large first components of a transformed dominated solution. As a consequence of 5.17 we have (cf. 5.11)
Property 6.2. Let K, (~ and P be as in 6.1. Then K is k-consistent with ((~[P) iff
span(K~) c~span(F0) =0.
Proof. Suppose L11 (see 6.1) is nonsingular. Then for any wespan(K~) there exists a k dimensional v such that w=K~v=Go Lll V+PoL2~v. If wEspan(Fo)
then v must be zero.
437
Suppose on the other hand that L 11 is singular, then for some non null v we have d o L 1 1 v = 0 . Define w=K~v. Clearly w=ffoL21v. Moreover, since rank(KX)=k, w must be nonzero, i.e. span(K~)c~span(/eo)4=0. [] We now presuppose (as before) the existence of a k-solution G which dominates an ( n - k ) - s o l u t i o n F uniformly and also assume k-consistency of some fundamental solution K with (GIF). We obtain: Theorem 6.3. Let (GIF) be a fundamental solution and let G dominate F uniformly. Let K be k-consistent with (G]F). Then K 1 dominates F uniformly and there exists
Proof. Let L = ( G o l F o ) - I K o . Since L 11 is nonsingular (cf 6.1) and (Gi[Fi)L=Ki, we find glb(K~) > glb(L 11) glb(Gi)_ lub(Fi) lub(I~l). Hence
lira glb(K~) i~o~ lub(Fi)
cO,
and K 1 dominates F uniformly by 3.8. For any ( n - k ) dimensional vector y2 define yl = _ [L 1 i ] - 1 L12 y2. Then
0
L y = L ( ~ ) = ( _ L 2 1 [ L l l ] - I L12 + L22) y2
If ~I22~-L22-L21[Llt]-IL12
D
not; so ~r 22 is nonsingular. Since K I ( = G U 1 + F L 21) and F constitute a fundamental solution we can d e c o m p o s e K 2( = GL 12 + FL 22) as K 2 = K 1M 12 + F M 22, for some matrices M t2 and M 22. Writing out gives M 1 2 = [ L l l ] - I L 12 and M 22
=~22.
[]
Remark 6.4. Since span(F) is unique (cf 3.4), the k-consistency property does not
depend on the particular choice of F neither on that of G, cf 6.2. Therefore we shall simply call K k-consistent if it obeys 6.2.
~12~
I ]'
(6.6)
K-
438
R.M.M. Mattheij
for some /~lz. Now let T~ be a transformation matrix as in (5.7) (i.e. one which brings (GilFi) into block upper triangular form), then W/(0)~ T~-IKi will have a block upper triangular form. We obtain, using 5.13: K , = T~ (W~lf (0) W/12(0)~
(6.7,
Remark 6.8. Since V~11= Will(0) and V~Z2= W/22(0), we see that the magnitudes of W/11(0) and W/22(0) are of the same order as those of G i and F~ respectively, if the angle between G i and Fi is not unduly small (cf 5.9 and 5.10). In order to show how the O~ can be approximated, we introduce matrices Wi(l) as follows: Definition 6.9. For any i and all l < i, let W~(1)be a block upper triangular matrix (i.e. with W~21(l)=0) such that W~(0)= W~(l) W~(0). From 6.9 we derive the following relations:
i~oo
(I)
Since W/11(1) = V/I1 IV/11]-i, we obtain for the first term on the right in (1), using the explicit expression for W~12(0) in (6.7):
(2)
(3) (4)
w,12(0)= v, 'I ~ r ~ 2 + o , v, 2~
Since WlZZ(0)= Vl2z we therefore obtain from (1), (3) and (4) [W/11(l)] -1 W/12(1) Vl22--+ --~, Vt22, 1 fixed, i--~oo. []
439
Remark 6.12. In the proof of 6.11 we used a result of 5.18 which is based on uniform dominance. If G should be only dominant then 6.11 is not necessarily true. This can be demonstrated using the M V R in 3.5. Let K=(G[F) and V~T~ = I. Then
[_W//ll(0)]_I=
1ti'1 1
=
i+1) 2 0
1 ; W/~2(0)=
i+1
[W~11(0)]- ~ W~12(0) d~
[]
Remark 6.13. In the formulation (6.6) one important problem is camouflaged: the actual choice of K o. It may be that the angle between K~ and F o is small which means that IIf~011 is large, cf 5.15 (this is likely to happen if L11 is nearly singular, i.e. if 11[Lll]- 111 is large, cf proof of 6.3). It is therefore very likely that also the angle between K~ and Fi then is fairly small, i.e. Ill, IT fairly large. As remarked in 5.16 this may imply that the magnitudes of V~ and F i are not of 22 the same order, which might affect the effectivity of the algorithm proposed in w (since the latter makes use of an appropriate dominance of glb(Vi 11) over lub(Vi22)). Therefore it may be important to choose such a K that the angle between K o and F 0 is not too small. Obviously, a more careful choice of K o does only make sense if there exists at least some dominant K-solution (7 of l-A] such that d e and F~ do not make small angles. On the other hand, if there exists a dominant k-solution d of which the iterands outgrow F very rapidly, K i will fairly rapidly become directionally close to Gi, i.e. after a few steps Itf2ill will be expected to be moderate anyway. Remark 6.14. The matrix Vii is not necessarily nonsingular, because (Gi[F~) may 22 not have full rank, so that rank(Fi) may be smaller than n - k . This also implies that the matrices W~22(/) are not necessarily nonsingular; in the singular case they even are not unique. In (7.6) we indicate a useful choice for them.
440
R.M.M. Mattheij
Then determine recursivety nonsingular {Ti} and block upper triangular (Ui} (i.e. with Ui21 = 0) by A~T~=T~+ 1U,, Since K i = ('17i--1Aj) T O we obtain from (7.2) ,,*_
\j=O--
i>O
(7.2)
K i = T~
. .s = 0
(7.3)
from which we see that the T~ decompose the matrix Ki (which is not computed actually) in the desired way. A most recommendable way for finding the above indicated set {T~}~>__, is based on a Q - U decomposition by the methods of Givens or Householder (cf [14, pp. 240, 236]), in which case the T~ are unitary, so {T~} is of bounded condition. The triangularization then runs as follows: At step i multiply the recursively found matrix T~ on the left by A~ and decompose A~ T~ into a unitary and an upper triangular matrix (as is known the matrix T~+, is not necessarily derived in its explicit form). Since unitary matrices are very stable it is hoped that the upper matrices U~ are fairly accurate; it can indeed be shown that the error is bounded in norm by a quantity of the order of 2 - ' ]IAiHF (where 2 - ' is the usual symbol for the relative computer accuracy, cf [7, p. 148 ff] and II. 1l2 is the sum of squares of the elements). An other way for finding sequences {T~} and {U~} might be based on some kind of L - U decomposition, i.e. where the matrices T~ are lower triangular (with a possible exception for To). There are two main disadvantages of such a L - U decomposition in general. First, it is not directly clear, whether dominance and k-consistency are sufficient conditions for existence of a set {T~} without pivoting (at least without interchanging one of the first k columns on the one hand and one of the last ( n - k ) columns on the other hand during a decomposition). Second, the bound for {llr~tl IIr~-~ll} may be fairly large, if it exists, which may cause larger errors in the computed U~ (and also in transforming vectors like f~ or their approximants by T~ or T~- 1). We now come to the approximation of a transformed dominated solution f , say. Define Bi 0 Ci~, E i] B i of k-th order (7.4)
Ui=
Bj
l=p
Bj
CI 1~ Ej . j=p
(7.5)
441
(7.6a)
(7.6b)
(b) W/22(1)= H Ej
j=l i-1
Now introduce for the solution f: J~= T~- 1f~, i>0. (7.8)
(7.9a) (7.9b)
It is seen that the triangularization results in a decoupled recursion for {J~/}. In our problem setting we assumed that f was not dominated by any other element of span(F), which means that it is not unrealistic to expect Ilf~lI to be of the order of
llf~ll,
unless the angle between G~ and Fi is unduly small (cf 5.9). Since j~l =Q~j~2 (cf 5.14), it then follows from 5.17 that IIJ~ll will not be large with respect to IIf~tl, i.e. IU~211 is of the order of IIf~ll. Therefore our conclusion is that (7.9b) is expected to be stable in forward direction. Once {~2} has been found, (7.9a) can be used to determine {j~l} (of course, computation of f l via an explicit approximation of ~?~ is very inefficient, if at all stable). Note that considering V~ 2 as a known vector, (7.9a) is an inhomogeneous recursion for {~1}. Its particular solution {~1} is uniformly dominated by the solutions of its homogeneous part (see e.g. 5.18). Hence we can expect an analogue of Miller's algorithm (see w3) to give arbitrarily accurate approximations to ~1, i.e. we have to use backward recursion. Details as well as error analyses of such an algorithm can be found in [8].
Remark 7.10. Since we only need a block upper triangular Ui, it is not necessary to perform the triangular decompositions completely. So e.g. in Householder's method, which requires the computation of ( n - 1 ) elementary hermitians, we can restrict ourselves to only computing the first k. This gives a factorization of the form:
442
R.M.M. Mattheij
of triangularizing at each step i, we may also triangularize subsequent products of the A i. So e.g. if T~ has been found, we can form the product Ai+lAi+t_ 1... Ai Ti, where l is some integer, and decompose this matrix.
i
Of course this may cause substantially larger errors (cf Householder's method, where the expected error in the upper triangular matrix will now be of the order
i+l \
whereas such a strategy is more efficient. The computation of a solution via upper triangular matrices, as sketched above (cf (7.9)) wiI1 only give the values of the thus found iterands at indices i for which also the matrices T~ have been found; values at intermediate indices may be found using [A] in forward direction. A method along these lines can be regarded as an analogue to the shooting method, which is used for boundary value problems in ODE.
References
1. Bauer, F.L., Householder, A.S.: Some inequalities involving the euclidean condition of a matrix. Numer. Math. 2, 308-311 (1960) 2. Davis, C., Kahan, W.M.: The rotation of eigenvectors by a perturbation III. SIAM J. Numer. Anal. 7, 1-45 (i970) 3. Deuflhard, P.: A summation technique for minimal solutions of linear homogeneous difference equations. Computing 18, 1-13 (1977) 4. Gautschi, W.: Computational aspects of three-term recurrence relations. SIAM Review 9, 24-82 (1967) 5. Gautschi, W.: Zur Numerik rekurrenter Relationen. Computing 9, 107-127 (1972) 6. Mattheij, R.M.M.: Accurate estimates of solutions of second order recursions. Linear Algebra and Appl. 12, 29-54 (1975) 7. Mattheij, R.M.M.: Estimating and determining solutions of Matrix Vector Recursions. Thesis, Utrecht (1977) 8. Mattheij, R.M.M.: Stable computation of dominated solutions of linear recursions. Report 7803, Mathematisch Instituut, Nijmegen (1978) 9. Mattheij, R.M.M., van der Sluis, A.: Error estimates for Miller's algorithm. Numer. Math. 26, 61-78 (1976) 10. Oliver, J.: The numerical solution of linear recurrence relations. Numer. Math. 11, 349-360 (1968) 11. Olver, F.WJ.: Numerical solution of second order linear difference equations. J. Res. Nat. Bur. Standards 71 B, 111-129 (1967) 12. Olver, F.WJ., Sookne, DJ.: Note on backward recurrence algorithms. Math. Comput. 26, 941947 (1972) 13. van der Sluis, A.: Estimating the solutions of slowly varying recursions. SIAM J. Math. Anal. 7, 662-695 (1976) 14. Wilkinson, J.H.: The algebraic eigenvalue problem, Oxford: Clarendon Press 1965