4v: '1': ‘ 17.9‘ J. '- Vsé‘. v ‘7‘ 15?.” J A 3‘. 1 V6,“ ‘ - . v ’ "V. ... l , ' Hz - . "EL , ‘ l ,5 ‘y , 1 ‘ a "K“ I .37" -"-_ \lllllllllllllllllll fillzllllll W 293 01399 l \\\\l\\ m ls l This is to certify that the dissertation entitled GUASI-LAGUERRE ITERATJON AND ITS APPLI CATION w SOLVIIV6 SYMMETRIC TRIDIAGONAL ElGENVALUE PROBLEMS presented by MING JIM has been accepted towards fulfillment of the requirements for PH.D degreein MATHEMATICS 4s %’ Majo4proessof Date JULY 204 L995 s: LlBRARY Michigan State University PLACE ll RETURN BOX to remove this chockoulfmm your record. TO AVOID FINES Mam on or baton date duo. DATE DUE DATE DUE DATE DUE MSU I. An Afflnnafivo ActlonlEqual Opponunlty Institution cum m1 ——-___ QUASI-LAGUERRE ITERATION AND ITS APPLICATION IN SOLVING SYMMETRIC TRIDIAGONAL EIGENVALUE PROBLEMS Ming J in A DISSERTATION Submitted to Michigan State University in partial fulfillment of the requirements for the degree of DOCTOR OF PHILOSOPHY Department of Mathematics 1995 ABSTRACT QUASI-LAGUERRE ITERATION AND ITS APPLICATION IN SOLVING SYMMETRIC TRIDIAGONAL EIGENVALUE PROBLEMS By Ming J in In this work, the quasi-Laguerre iteration is established in the spirit of Laguerre’s iteration for solving polynomial f with all real zeros. The new iteration, which maintains the monotonicity and global conver- gence of the Laguerre iteration, no longer needs to evaluate f". The ultimate convergence rate is \/§ + 1. Based on the quasi-Laguerre iteration, an algorithm is developed and tested for solving the eigenvalue problem of symmetric tridiagonal matrices. In general, it improves the speed of Laguerre’s iteration sub- stantially and the accuracy is at least as good as any standard algorithm for the symmetric tridiagonal eigenproblem. More preciously, the al- gorithm, like the Laguerre iteration, enjoys the flexibility in evaluating partial spectrum. Although the quasi-Laguerre iteration is derived for solving polyno- mials with only real zeros, the iteration itself can be extended to solve polynomials with complex zeros and the local convergence can still be guaranteed with the same convergence rate \/§ + 1 as the real case. To my parents ii ACKNOWLEDGMENTS I would like to express my deepest gratitude to my thesis advisor, Professor Tien- Yien Li, for his guidance through the years I have spent at Michigan State University. His encouragement, helpful directions and lively discussions with me make this work possible. I would like to thank my dissertation committee members Professor Chichia Chin, Professor Richard Hill, Professor William T. Sledd and Professor Zhengfang Zhou for their valuable suggestions and their time. I would also like to thank Dr. Zhonggang Zeng for his precious advice and collaboration. iii Contents 0 Introduction 1 1 Derivation of the quasi-Laguerre iteration 5 2 The formulation and convergence 13 2.1 The formulation ....................... 13 2.2 The convergence properties ................. 15 3 On symmetric tridiagonal eigenvalue problems 26 3.1 Evaluation of the logarithmic derivative f’ / f of the de— terminant .......................... 26 3.2 The split-merge process ................... 29 3.3 Cluster and deflation .................... 32 3.4 Partial spectrum ...................... 33 3.5 Practical considerations in implementation ........ 34 3.5.1 Stopping criteria .................. 34 3.5.2 Initial points for the quasi-Laguerre iteration . . . 35 3.5.3 An alternative approach for clusters of eigenvalues 39 3.6 Numerical tests ....................... 42 3.6.1 Testing matrices .................. 45 3.6.2 Speed test in evaluating eigenvalues without com- puting eigenvectors ................. 46 3.6.3 Speed test in evaluating eigenpairs ........ 49 3.6.4 Accuracy test .................... 52 4 The complex quasi-Laguerre iteration 54 4.1 The formulation ....................... 54 4.2 Convergence properties ................... 56 iv List of Tables 1 Accelerated quasi-Laguerre iteration ............ 44 2 Execution time for evaluating eigenpairs ......... 51 3 Accuracy test of various algorithms ............ 53 List of Figures OOxlovmrhOOMl—A Algorithm DETEVL ..................... 28 Split and merge processes ................. 31 Cluster of eigenvalues .................... 32 The sign pattern of —f’/f ................. 36 Algorithm INIPTS for the initial points .......... 38 Algorithm ACCQ-LAG for acceleration .......... 43 Execution time in evaluating eigenvalues only (I) . . . . 47 Execution time in evaluating eigenvalues only (II) . . . . 48 vi 0 INTRODUCTION 1 0 Introduction The globally and monotonically convergent Laguerre’s iteration 111(17): :1: + n (1) (4(5)) i \ (n ‘1)[(" ‘ 1) (4(5) )2 ’ " (ffl) )] has been successfully used to find the eigenvalues of an n x n symmetric tridiagonal matrix T with nonzero sub-diagonal entries by approximat— ing the zeros, always real and simple, of its characteristic polynomial f(/\) = det[T — AI]. (2) Remarkable numerical results, in terms of both speed and accuracy, on a substantial variety of matrices have been obtained [8]. The algorithm employs the split-merge process, similar to Cuppen’s divide and con- quer strategy, which provides an excellent set of starting values making the algorithm naturally parallel. Impressive speedups of the parallel version of the algorithm were reported in [13]. For 2,000 x 2,000 ran- dom matrices, the algorithm can reach a speedup of 52 on an N -cube with 64 nodes. Laguerre’s iteration, or modified Laguerre’s iteration when clusters occur, converges ultimately with a very fast cubic convergence rate. Nonetheless, the most important advantage of Laguerre’s iteration in solving (2), in contrast to Newton’s iteration, is its monotone conver- gence. However, comparing with Newton’s iteration, a major disadvan- tage of Laguerre’s iteration is the evaluation of f”, which is relatively time consuming. In this work, a new algorithm is designed in the spirit of Laguerre’s iteration for finding zeros of (2). The algorithm, without evaluating f”, maintains the monotonicity of Laguerre’s iteration and 0 INTRODUCTION 2 converges globally with ultimate convergence rate \/2 + 1 in finding zeros of any polynomial with all real zeros. The formula (1) of Laguerre’s iteration can be derived in diverse ways. The best one seems to be to answer the following question [7]: QUESTION 1: Among all polynomials p(:1:) of degree n with 72 real zeros and with p(:1:0) = f(a:0) aé 0,p'(:z:0) = f’(:r0) and p”(a:0) = f”(a:0) at a specified real 1130, which one has a zero closest to 11:0? and where? In general, L+(a:0) in (1) gives the closest zero from the right of all those polynomials and L_(:1:0) gives the closest one from the left. To avoid the evaluation of f” , we revise the above optimization prob- lem as follows: QUESTION 2: Given two specified reals a < b, among all polyno- mials p(:r) of degree n with n real zeros, none of which lie in [a, b], and with 11(3)._.f_'(_‘1_) d fl9=fl (3) p‘f f(b)’ which one has a zero closest to a from the right or from the left? and where? The optimization problem for this class of polynomials can be easily solved and the solution has a closed form. To convert the solution of this optimization problem to an iteration scheme, which we call the quasi-Laguerre iteration, with the assumption that x(k‘1) and 2:“) are available, we may (2) let a = $(k—1), b = 17““) and let 513““) be the closest zero to 11:“) from the right when sew-1) < 116(k) and a zero of f is bigger than (k) a: 3 0 INTRODUCTION 3 (“l let a = 113(k), b = (Cw-1) and let 13““) be the closest zero to 22““) from the left when 550-1) > 27““) and a zero of f is less than 17““). In this manner, the sequence generated will converge, in fact, mono- tonically to a zero of f. The optimization problem in Question 2 is formulated in a more general form in Chapter 1 to account for multiple zeros. The answer to this problem leads to the quasi-Laguerre iteration. In Section 2.1, the iteration is formulated. The ultimate rate of convergence of our iteration so obtained is \/2 + 1 which Will be shown in Section 2.2. One of our main goals is to use the newly derived quasi-Laguerre iteration for approximating eigenvalues of symmetric tridiagonal ma- trices. The details of this implementation are described in Chapter 3, including some practical considerations in implementation (Section 3.5) and comprehensive numerical results on diverse types of matrices (Sec- tion 3.6). Apparently, our algorithm improves the speed of Laguerre’s iteration considerably in general. Like Laguerre’s iteration, our algo- rithm is inherently parallel and has a great capacity for vectorization. An intensive experiment in this regard will be reported in a future work. Although the quasi-Laguerre iteration is derived for solving polyno- mials with only real zeros, the iteration itself can be extended to solve polynomials with complex zeros in the same way the Laguerre iteration is extended. In Chapter 4, we will first define and formulate the com- plex quasi-Laguerre iteration. Then in Section 4.2, we will show that the local convergence of our iteration is guaranteed for simple zeros and the ultimate convergence rate is \/2 + 1, similar to the real case. Comparing with Newton’s and Laguerre’s iteration, the most attractive aspect of our quasi-Laguerre iteration in complex case is its fast con- vergence as well as the elimination of evaluating the second derivative, 0 INTROD UCTION even though it no longer enjoys the monotone or global convergence. 1 DERIVATION OF THE QUASI—LAGUERRE ITERATION 5 1 Derivation of the quasi-Laguerre iteration Let f be a polynomial of degree n with all of its zeros being real. For a < b with f(a)f(b) 75 0, let .7: be the class of polynomials p(a:) of degree n that satisfy the following conditions: (i) all zeros of p(a:) are real; (ii) none of the zeros of p(a:) are in [a,b]; (M) pp at 0 and We pose an optimization problem more general than Question 2: QUESTION 3: For 19 E .7, consider its m-th (m < n) zero to the right (or to the left) of a. Which polynomial in f has the closest one to a from the right (or from the left)? and where? To answer this question, let z1,22, - - - , 2,, be all the real zeros of a given polynomial p(a:) in .7: such that Z1SZ2S“°SZk. (6) ForlSiSnJet b—Zi b—a + a—z, a—zi then, X,- > 0 for all i since no 2,: falls between a and b. Moreover, iXi=i(l+:—::-i)=n+(b—a)q(a)Er(a,b), () zznl 1:711 7 z 31: :- 2; (1+ 5:) = n+ (a — b)q(b) = r(b,a), ' 1 1 S N 1 DERIVAT ION OF THE QUASI—LAGUERRE ITERATION 6 and from (5), XkZXk—IZ"'ZX1>1>XnZXn—1Z"'2Xk+1>0- (8) For convenience, we rewrite (8) as W12W22~2Wk>1>Wk+1z-o-ZW. (9) with X _ f l < k m = k+l l OI’ _ (10) Xn+k+1_[ for l > k. From (5), the m-th zero of p(a:) to the left of a is zk_m+1 which, from (10), corresponds to Wm in (9), and minimizing (a —— zk_m+1) becomes maximizing Wm. Similarly, the m-th zero of p(:c) to the right of a is zk+m and minimizing (zk+m — a) becomes minimizing Wn_m+1. As a convention, if m > k then the m-th zero of p to the left of a will be taken as the (n — m + 1)-st zero to the right of a and if m > n — k then the m-th zero of p to the right of a is the (n — m +1)-st zero to the left of a. The optimization problem in Question 3 can now be converted to the following optimizations: (P1) Max Wm subject to Z W,- = r(a, b) 1 b (11) E W: -’ 74( 3a) 1 DERIVATION OF THE QUASI—LAGUERRE ITERATION 7 (P2) M2” I/Vn—m+1 subject to Z W,- = r(a, b) i=1 1.21% = r(b, a) W12W2.>_"'2Wn>0- The solution of (P1) gives the m-th closest zero to a from the left while the solution of (P2) gives the m—th closest zero to a from the right. We first consider the optimization problem (P1). Lemma 1.1 r(a,b) > 0,r(b,a) > 0 and r(a,b)r(b,a) 2 n2. Proof. By (8), we have r(a, b) > 0 and r(b, a) > 0. From the Cauchy- Schwartz inequality, r(a, b)r(b, a) (because of (11: ). D Lemma 1.2 The point W = (W1, - - - ,Wn) satisfying (11) with W1: W2=~~=Wm=~~=W—n_1 2 Wu > 0 and _ b Wm Z T(a7 ) (12) n is feasible to problem (P1). Proof. From (11), Wm and V7,, must satisfy (n—1)Wm+Wn = r(a,b). (13) —1 1 TL. + __ = r(b,a). (14) Wm Wn 1 DERIVATION OF THE QUASI—LAGUERRE ITERATION 8 Eliminating Wm in (13) and (14) yields r(b,a)Wi — [r(a,b)r(b,a) — n2 + 277.] W, + r(a,b) = 0. (15) By Lemma 1.1, it can be easily verified that both solutions WE,” and WE?) of (15) are real. In addition, they are positive since W£1)+W[,2) > 0 and Wian) > 0. Let W?) 3 W53). Since :(a, b): r,(a b)r( n n 0) WSW?) :- s( so, W[,1)< M. Substituting Wnl) into (13), 102) (because of Lemma 1. 1 ), W“) = 1 (r(a, b) — W‘”) > "I ”—1 n — n It follows that Wm > W41 and so, W = (an, --,W,(n), feasible. D n )9 Now, for any W 2 (W1, - -- ,W n) satisfying (11), we have (Wj—Wm)2 " (W; W) " ( W2) jg" Wj g1 WJ H]; WJ = r,(a b) — 2nWm + r(b, a)W,3,. (16) K}. On the other hand, 2(Wm—Wj) = (n-m)Wm- Z W,- j>m j>m = (n — m)Wm — r(a,b) + Z W]- JSm 2 nVVm — r(a, b). From (12), nWm — r(a,b) Z 0 for a certain feasible point, so we may assume, in general, Wm satisfies nWm — r(a,b) Z 0 1 DERIVATION OF THE QUASI—LAGUERRE ITERATION 9 since we are maximizing Wm. It follows that [nWm _ ,(a, 5)]? g [3;(Wm — Wj)[2 = [g (Es/$1 . [HZ-Hz s z: (W: X“) 2%. W]. (by Cauchy-Schwartz inequality) 3 [r(a,b) -— 2nWm + r(b, a)W3,] (r(a, b) — me) (because of (16) ). Or, mr(b, a)W3, — [7°(a,b)r(b,a) — n2 + 2mm] Wm + mr(a, b) g 0 (17) and the equality holds iff W, = Wm for all j < m and W; = Wn for all l>m. Similarly, when solving the optimization problem (P2), one can show that, with the assumption TU), a’)Wn—m+l T" n S O which we may assume without loss of generality, Wn_m+1 satisfies the same inequality, that is, mr(b, a)W3_m+1 — [r(a, b)r(b, a) — n2 + 2mn]Wn_m+1+ mr(a, b) S 0 (18) and the equality holds iff W, = Wn_m+1 for all j > n — m + 1 and l/V1=W1foralll 0 and r(a,b) r(b,a) so, Ymi > 0. On the other hand, it follows from (17) and (18) that Ym+-Y _= >0, )fm- S Wm _<_. Ym+ (19) and Ym— S Wn—m+1 S Ym+- (20) Accordingly, W 2 (W1,- - - ,Wn) with W1 2 2 Wm = Ym+ and Wm+1 = = W" is the solution of the maximization problem (P1), and, from (11), mYm+ + (n — m)Wn = r(a, b), m n—m — —:— = b . Ym+ + Wn T( ’0’) Solving for W”, one can easily see that Wu is actually I/(n_m)_. So, _ Ym ' < m WJ = + J _ (21) 3/(n—m)— .7 > m° Similarly, the solution of the minimization problem (P2) is W 2 (W1, --,Wn) with (22) Y(n—m)+ j< n—m'l'l Ym_ jZn—m+1. 1 DERIVATION OF THE QUASI-LAGUERRE ITERATION 11 Recall that Wj’s are defined by b — a a—uj Wj=1+ j=1,---,n where uj’s are zeros of a certain polynomial in 3:. Thus any polynomial satisfying (21) must be in the following form P103) = C(13 - urn—WU? — ”On—m (23) where C is a constant, and b—a b—a 1 —— = m ' 1 t1 m- = — —— 24 + a _ Um_ + or equlva en y u a Ym+ _ 1 ( ) and b — b _ 1 + a = I/(n_m)_ or equivalently v1 = a — a . a _ U1 }/(n—m)— _ 1 And any polynomial satisfying (22) must be in the form 192(17) 2 C(1: — um+)m(.r — v2)"_’" (25) where C is a constant, and b - a b — a 1 + —— = K”. or u'val tl m = — —— 26 a—um+ eq1 eny 11+ a Ym——1 ( ) and b — a b — a 1 + = Yn_m or e uivalentl v = — . a "' ’02 ( )+ q y 2 a nn—m)+ _ 1 From (24) and (26), b — um_ b — m ——= m+>0 and —u—+-=Ym_>0, a — Um_ (I — Um+ so both um+ and um_ are outside the interval (a, b). Similarly, both 121 and 112 are outside the interval (a, b) since Y(,,_m)i > 0. Consequently, both 101(1)) and 192(3)) are in .73. 1 DERIVATION OF THE QUASI—LAGUERRE ITERATION 12 Moreover, from (24) and (26) b — a Ym; — 1 ’llmi = a — — 2mr(b,a)(b—a) _ r(a,b)r(b,a)—n2+2mn—2mr(b,a)2}:\/[r(a,b)r(b,a)—n2][r(a,b)r(b,a)—(n—2m)2] 2min — (b — 604(5)] = a + —(b — a)R — 2mq(b) :l: \/R[(b — a)2R + 4m(n — m)] (27) where _fW0 _fw)m, =naM%20)_ a gm)—fmy am—fw, ciR ( ,_a ) q(mo) The solutions of the optimization problems in Question 3 can now be described as follows: (i) If f has at least m (counting multiplicities) zeros to the left of a, then the zero um_ in (27) of the polynomial mb)=ce—us)%x—mvv: in (23) is the closest m-th zero to the left of a among all polyno- mials in .77 . And it is clear that zm_§um__<_---_<_u2_Su1_ 2M. 1] The rate of convergence of both sequences {xiii} is given in the following theorem. Theorem 2.3 (i) When m < M, the convergence of {xiii} to 2M is linear and the convergence ratio (1+1 _Z . mi 11/! w_11m 11—100 $(3t-ZM is the only real solution of :1: —a: —a: :0 n—M + M in (0,1). (ii) When m = M, the convergence of {$153,} to 2M is super-linear with convergence rate V5 + 1. Proof. Again we shall prove the theorem for the sequence {mlfilfiil and write yk for 2:55). when there is no ambiguity. Similar arguments hold for the sequence {xifiiflir Recall that yk+1 = Elk— 2m[n - (vi—1 - yk)q(yk_1)] (we-1 — 110113 + 2mg(yk—1)+ \/R[(3/k—1 — 11021? + 4mm - 171)] (33) where _ f’(3/k—1) _ 9(91—1)- f(y1s_1)’ Q(yk)— f(yk) and _ (1(3/kl-Q(yk—1) R—n( sis—y). )—q(yk)q(yk-1). 2 THE FORMULATION AND CONVERGENCE 17 Let z“) ,z(" M be the rest of the zeros of f besides 2M. Then f'(yk_1) M _ : : + _ , q(yk 1) f(yk—1) yk—1 — ZM Qty]: 1) f,(3/k M . "’M and R = n (MEL—133?”) — 1(ys)q(y1_1) : MW “ M) _ MQ(yk—1) _ M62011) (yk—ZM)(yk-1—2M) yk — 2M — Q(y1)Q(y1_1) yk—l-ZM R ' h R EM 1 +n W1t E . . . 1 1 121 (3111—1 — 2(1))(yk — 3(1)) Since yk —1 2M as k —) 00, so 1 1 k- 2 )—* < oo, Q 0, \/(1 — (E)2 + 476 so, from (36), C(e) is monotonically decreasing. Now, (i) For m < .M, let M — m m(n — m) (3 A4 €(0,) an 7 M(n—M)>0 Since C(e) is monotonically decreasing for e E [0,1] and 6k 6 (0,1) for all k 2 1, we have fl 0< =G1 zM, there exists an integer N > 1 such that G(1) — C(yN_.1— 2M) > 0 and C(0) + C(yN—l — 2M) < 1. (38) Let 01 2: G(1) — C(yN_1— 2M), d1: G(0) + C(yNfll —- 2M), Ck+1 = C(dk) " C(yN+k—1 — 2M), dk+1 = C(Ck) + C(yN+k—1 — 2M)» for k 2 1. Then, 2 THE FORMULATION AND CONVERGENCE 21 (i) {ck} is monotonically increasing and {dk} is monotonically de- creasing, namely, 0 G(dk-1) — C(yN+k—2 - ZM) = Ck dk+1 = C(01) + C(yN+k—1 — 2M) < G(Ck—1)+ C(yN+k—2 - 2M) = (1k- Furthermore, Ck+1 = G(dk)-C(yN+k—1-ZM) < C(6N+k)—C(yN+k_1—2M) (because of (39) ) < eN+k+1 < G(€N+k) + C(yN+k_1 — 2M) (because of (37)) < C(ck) + C(yN+k_1 — 2M) = dk+1. (because of (39) ) Hence, 0 d. Then C(c) = d and C(11) 2 Since C(e) is monotonically decreasing, G2 = G o G is monotonically increasing and can have at most one fixed point. It follows that c = d and consequently C(w) = w, where yk— w— — lim 6),: lim k—ioo k—mo yk_1 — ZM Namely, 2(M — m) w : All (1+w+\/(1 —w)2+%fi%w) or, w is a solution to the equation g(x)En;£lA;mx3—x2—E+MA_J-m=0. (40) This equation has exactly one real solution in (0,1), since M — m n—M—m M—m n—M—m M-—-m = —1—1 = —— — 9(1) n—M + M [n—M 1l+i M ll _ —m + —m _ —mn < 0 _ n — M M _ M(n — M) ’ and g1($)_3"__A_LMfl$2 _2$—1<0 forxe(01) 7?, (ii) For m = M, from (36), C(61) = 0 for k 2 1; accordingly, from (35), 61,11: 0(yk_1-— 2M) —) 0, as k —-) 00. Thus the convergence of {ykfiozl to 2M is super-linear. Further, from the derivation of yk+1 described in Chapter 1, yk_1, yk, and yk+1 satisfy the following two equations M n — M M —— + = 1011) = 621111) + 311 - yk+1 yk - v 91 - 2M 2 THE FORMULATION AND CONVERGENCE 23 M n — .M 11/! + = (1011—1) = Q(yk—1)+ Elk—1 - yk+1 Elk-1 — ”U Elk—1 — 2M for a certain number v. From the above two equations, we have n — M M Ms 1 — ZM = 12(1).) — ( + l (41) 21/1 - v (311 — 210(11): — 11m) 11 — M M yr — ZM ilk—1 — U (Elk—1 — ZM)(3/k—1 — yk+1) Subtracting (42) from (41) and dividing the result by (yk_1 — yk) yields, n—M =R1— M(yk+1-ZM)(yk-1+yk—yk+1—ZM) (yk—v)(1/k—1 -’U) (yk—ZM)(yk—yk+1)(yk—1 -ZM)(yk—1—yk+1)' (43) Substituting (41) and (42) into (43) and multiplying both sides by (n — 114), we have _ M(yk+1 - 2M) _ M(yk+1 - ZM) (Q(yk) (yk _ ZMlUlk — yk+1)) (ka-l) (yk—l — ZM)(1/k—1— yk+1)) _ ,,_ __ M(n—MXZ/kH—ZM)(yk-1+yk—yk+1-ZM) _ ( MRI (.111 - 2M)(yk - yk+1)(y1_1— 2M)(yk—1 — 111.11) ' Rewrite the above equation as follows: yk+1 — 2M (Elk - ZM)(3/k-yk+1)(yk—1—ZM) '13 =(n-M)R1— Q(y1)Q(;1/1_1) (44) where yk—l + yk — yk1- ZM E E M(n — M) + — MQ(yk—1)(yk—1 — ZM) yk—l — yk+1 -MQ(yk)(yk _ 2111) Me — yk+1 + M2 yk+1— ZM yk—l — yk+1 yk—l — yk+1 6/: = M(n — M)(1+ -,———) — M12 M(n—M) as k—)oo. 2 THE FORMULATION AND CONVERGENCE 24 It follows from (44) that yk+1- ZM = (.91 — yk+l> (n - MIRI - Q(3/k)Q(3/k-1) (31k _ ZM)2(yk—-1 " 2M) y), — ZM E _ _ (n — M)R1— Q(yk)Q(yk—1) — (1 €k+1) E (n — Mm - a? —)F <00 as k-—>oo. M(n—M) Iiere, 1 11—111 1 1 "—M 1 2 =_ . _ __._____ > . F M 1:21 (mu—£50»2 M(n—M) (1:21 2(M—2(2)) —O If F = 0, then 2“) = 2(1), for i : 1,---,n — Ill. That is, f(:1:) has only two distinct zeros: 2(1) and 2M. Hence 1:13,). = 2M and no convergence discussion is necessary. Consequently, only f (:13) which has more than two distinct zeros is considered. Then, F > 0. By the following lemma, the convergence rate of {yk}f_’__1 to 2M is indeed \/2 + 1. El Lemma 2.4 Suppose a sequence {ykfiil converges to 2. Let e;c be lyk - 2|- If 61,, = Mkezek_1 and 3320 M1, = M“ where 0 < M” < 00, then the convergence rate of {yk} is x/2+ 1, i.e., €k+1 __ * 1/\/§ 11—100 6,2544 — (M ) . Proof. We assume first M * = 1, that is, lim),_,OO Mk 2 1. Let dkzlnek+1—(\/2+1)lnek, 111721. From ek+1 = Mkeiek_1, 1n ek+1 = 2111 e), +lnek_1+1an. 2 THE FORMULATION AND CONVERGENCE 25 So, (1;, = 2lnek +lnek_1 +1an — (\/2+1)lnek (1 — \/2)lne1c +lnek_1+lan = (1 - \/2)[lnek — (Vi+ 1) 11161-1] +111 Mk (1 — \/2)dk—1+lan. Since0>1—\/2>—1andlanIk—>0ask——>oo,wehave dk——)0 as k—>oo. That is, €k+1 ex/‘§+1——)1 as k——>oo. k Now, if [11* 79 1, let Mk e[.=\/M*e/c and M,',=M*. Then, M 830” : VM*ek+1= MEoo Therefore, it follows from the above derivation, el+1 : k—>oo (ems/1+1 which implies 1. 6k+1 _ (VM’ilfl'H _ 11: 1/\/2 111120 ell/5+1 '— M T (M ) ' U 3 ON SYMMETRIC TRIDIAGONAL EIGENVALUE PROBLEMS 26 3 On symmetric tridiagonal eigenvalue problems In this chapter, we shall use our quasi-Laguerre iteration derived in pre- vious chapters to approximate all eigenvalues of symmetric tridiagonal matrices. 3.1 Evaluation of the logarithmic derivative f’ / f of the de- terminant Let T be a symmetric tridiagonal matrix of the form (01 H1 l 31 a2 52 O T=(fi._1,a.—,a)= If ,. . (45) O fin—2 an—l flit-1 \ 1811—1 an f We may assume, without loss of generality, that T is unreduced; that is, O, 74 0, j = 1, - - - ,n — 1. For an unreduced T, the characteristic polynomial f(/\) s det(T —— )1) (46) has only real and simple zeros ( [14], p300). In order to use our quasi- Laguerre iteration developed in previous Chapters for finding zeros of f (A), or the eigenvalues of T, it is necessary to evaluate f and f’ effi— ciently with satisfactory accuracy in the first place. It is well known that the characteristic polynomial f ()1) :— det(T — A1) of T = [fit—11011351] and its derivative with respect to A can be 3 ON SYMMETRIC TRIDIAGONAL EIGENVALUE PROBLEMS 27 evaluated by three-term recurrences ( [14], p423): = 1, = — A 100 P1 C11 2 . (47) Pi = (011‘- A)/)i—1 — (31-1P1—2, l = 2131'“ an I = 0, I = _1 pi) pl 1 2 1 - (48) pi : ((15 — A)pi—l _ pi—l — fii—lpi—21 2 = 2131- ‘ : an and f(/\) = pm f'()\) = pi;- However, these recurrences may suffer from a severe underflow/over- flow problem and require constant testing and scaling. The following modified recurrence equations avoid the above problem subtly by com- puting the quotient q(A )2 f(A ) / f ( ) directly [8]. After all, only q(A) is really needed in the formulae (30), (31) and (32). Let m 102- Si: 1 771: __1 101—1 P1 £1 = a1 _/\1 (49) 2 Si : Qi—A—?]:11,Z—2,3,‘ ana 1 770:0, 771:5, 2 (50) 771:3; (01i_)\)7li—1+1-(?:_11)77i—2]1 i:2939”°3n 2 and f’(/\) __ _ _ 77”. f (A) To prevent the algorithm from breaking down when 5,- : 0 for some 1 S i g 11, an extra check is provided: 0 If {1 = 0 (i.e., 011: A), set £1: 252; 312—152. oIf£,:0,i>1,set§,-= {_1 3 ON SYMMETRIC TRIDIAGONAL EIGENVALUE PROBLEMS Algorithm DET EV L: Input: T=[)’31—1.011,fii], A f' A Output: _f()\) 21],, (where f(A) Edet(T—AI)) and 11(A) = neg_count (=the number of eigenvalues of T less than A) Begin DETEVL £1 = 01 — /\ If 51: 0 then 51: (e2 1 z 0, : —, _ , t : 0 770 7]] £1 neg COUH If £1 < 0 then neg-count =1 For i=2:n 2 €1=(C¥1-A)-(%:—11) (3’2“) 2 If£,=0then§,= E: E If f,- < 0 then neg_count = neg-count +1 2 771: "El: [(011 - (1)771—1 +1 — (21:11) 771—2] End for End DETEVL Figure 1: Algorithm DETEVL where e is the machine precision. A determinant evaluation subroutine DETEVL (see Figure 1) is formulated according to the recurrences (49) and (50). When 5,, i = 1,---,n, are known, the Sturm sequence is available ([9], p47). Thus, as a by-product, DETEVL also evaluates the number of eigenvalues of T which are less than A. This number is denoted by 11(A). Let A1 0, for all i = 1, 2, - - - , n — 1, since in (47)—(50), Bis always appear in their square form. The following interlacing property for this rank-one tearing is important to our algorithm. Theorem 3.1 Let A1 < A2 < < An and A1 __<_ A2 _<_ S An be eigenvalues of T and T respectively. Then Sis/)1sissAQs-~~si\nsx\. A,-, we may let 2:10) = 3241, (129) = c and when 0 < A,-, let 2):?) = A,- and 512$) = c. Then A,- < :19) < 11:10) or .1739) < 33$) < A,. Respectively, equations (31) and (32) with m = 1 are used to generate 11$“) for k 2 1, namely, 1'1““) = 121% 3 ON SYMMETRIC TRIDIAGONAL EIGENVALUE PROBLEMS 31 00 And sequences {13%)} are obtained which converge monotonically, either increasingly or decreasingly, to A,- with ultimately convergence rate \/2 + 1 by Theorem 2.3. The eigenvalues of T in (52) consist of eigenvalues of To and T1 in (53). To find eigenvalues of To and T1, the split-merge process described above may be applied to T0 and T1 again. Indeed, the splitting process should be applied to T recursively (See Figure 2) until 2 x 2 and 1 x 1 matrices are reached. /\ 0 SP“! /\ /\ Merge 1’ Tm Tm T011) T011 T100 T1 1 T1 T111 Figure 2: Split and merge processes After T is well split into a tree structure as shown in Figure 2, the merging process in the reverse direction from 2 X 2 and 1 x 1 matrices can be started. More specifically, let T, be split into T 00 and T01. Let T00 0 ‘1’, - . - A3,, be eigenvalues of To = 0 T in ascending order. Then 01 the quasi-Laguerre iteration is applied to the polynomial equation A f,,(A) .2 det[T, — AI] = 0 from either 1)”) or Am to obtain the corresponding eigenvalue Ag, 1 = 1,2, - - - ,m, by the merging process described above. This process is continued until To and T1 are merged into T. That is, in the final step all the eigenvalues of T are obtained by applying the quasi-Laguerre iteration to f (A) = det(T — AI) from eigenvalues of T0 and T1. 3 ON SYMMETRIC TRIDIAGONAL EIGENVALUE PROBLEMS 32 3.3 Cluster and deflation By Theorem 3.1, A,- E (A,,A,-+1) for each i = 1,---,n with the con- vention An+1 = An + 2%. If A,“ — A,- is less than the error tolerance, then either A,- or A,“ can be accepted as A,. In general, if T has a cluster of r + 1 very close eigenvalues, for instance, Aj+, — A,- is less than the error tolerance for certain 1 S j S n — r, then r eigenvalues Aj, Aj+1, - - - ,Aj+,_1 of T can be obtained free of computations. They can be set to any one of Aj, - - - ,Aj+,. Since the matrix T in (45) is unreduced, its eigenvalues are all simple. Therefore using m = 1 in the quasi-Laguerre iterations given in (31) or (32) seems appropriate in all cases to obtain ultimate super-linear convergence with convergence rate x/2+ 1. However, in some occasions, there may exist a group of r > 1 eigenvalues of T, say, Ai+1 < /\i+2 < < /\i+r which are relatively close to each other, comparing to the distance from (0) (1) the two starting points, say 23+ < 23+ < A,-+1 as shown in Figure 3. Some of them may even be numerically indistinguishable. x2?) x1!) A l LLLLJJII >x A-i A1+1 11+] A'ivo-r Figure 3: converging to relatively closer eigenvalues Numerical evidence shows that to reach A,“ from 11318 )(= (9+1) and 333,1), it may take many steps of the quasi-Laguerre iteration with m = 1 before the super-linear convergence shows. In this situation, the quasi- Laguerre iteration with m = r in (31) may be used to speed up the convergence. Exhibited in (29), the sequences {rig} in (31) with 3 ON SYMMETRIC TRIDIAGONAL EIGENVALUE PROBLEMS 33 different m’s satisfy k— - k 1.- 11:1 1) <.1:[k+)4» Then 0 values among them can serve as the first staring points which will lead to all a eigenvalues of T in [a, b]. In some occasions, the eigenvalues of T of interest are identified in magnitude, say the largest 20%, then they can be evaluated by using the largest 20% eigenvalues of T as starting points without computing the other 80% eigenvalues of both T and T. i 3.5 Practical considerations in implementation 3.5.1 Stopping criteria The following stopping criterion was suggested by Kahan [7] and used for our implementation: [m(Hl) _ 2301)]? S 013(k) _ 3411-1)] _ [m(kH) _ $01)”, (54) where T is the error tolerance. This criterion is based on the following observations. Let 5,301+” _ m(k) m(k) _ m(k-l) 9k: then as {rm ”2:, converges to A when k —-) oo, q;c is normally decreasing. Thus < E ]$(k+2+i) __ $(k+1+i)| i=0 EWMHH) _ x(k+l+i)) i=0 i)‘ _ $(k+1)| : 3 ON SYMMETRIC TRIDIAGONAL EIGENVALUE PROBLEMS 35 oo k+1 __ k _<_ [$(k+1)_ $01)] 2 q: : (Iklml ) :1:( )] i=1 1 — qk lx(k+1) _ $(k)]2 : (1(1) _ $11—11] _ (x1141) _ m(kll' From (51), an obvious error tolerance at A, can be chosen as 55 . T = 7 m1X)e. 3.5.2 Initial points for the quasi-Laguerre iteration At every stage of the merging process, we evaluate the eigenvalues A1 < - - - < Am of an m x m submatrix, given m initial values A1 3 - . - 3 Am and an upper bound that interlace those 111 eigenvalues: 11311sighs-SESAME“ (see Theorem 3.1). To evaluate an eigenvalue A,- by the quasi-Laguerre iteration, two initial points are needed on the same side of A,- without any other eigenvalues between them. However, for i = 1, - - - ,m, the interlacing property 311 S A1 S 3\i+1 provides only one initial point from each side. The simplest way of obtaining the other initial point, preferably in the interval (A1, A5“), is to bisect the interval by choosing c = ’\1_+é\1_-1-_1_ and evaluate €532 along with the Sturm sequence at c by the algorithm DETEVL as described in Section 3.2. If c < A,- then A,- and c are a suitable pair of initial points for approximating A,-. Otherwise, the pair c and A,“ will be used. While the above approach is the simplest, it may not be the most effi- cient. Several improvements are made in our practical implementation. First, since f’ / f is evaluated at every initial value A,- (j = 1, - - - ,m) , one step of Newton’s iteration can be performed without extra cost and 3 ON SYMMETRIC TRIDIAGONAL EIGENVALUE PROBLEMS 36 critical point critical point 1 1 1 1 1 1 1 1 1 1 b ---— —----¢ ---- ----a :2 X v critical point critical point Figure 4: The two initial points should be between one of the nearby critical point of f and the zero. This can be identified by the sign pattern of —f’/f. the result can be considered as a candidate, besides c, for 512(1). Secondly and more importantly, from our computing experience, the high order convergence of the quasi-Laguerre iteration occurs only when there is no critical point of f (i.e. zero of f’) between 11(0) and A,- (see Figure 4). In other words, if 33(0) is to the left (resp. right) of A,-, then it is de— sirable that — f’ (17(0)) / f (1(0)) is positive (resp. negative). If there is one or more critical point in [A,-, Ai+1], then bisection or one step N ew- ton’s iteration is used repeatedly until the above requirement at 22(0) is satisfied. Based on these observations, our procedure for determin- ing a pair of initial points among different choices to achieve the best possible efficiency in practice is summarized below and the algorithm INIPTS is given in Figure 5. For notational convenience, we assume the eigenvalue A,- is in the interval [a, b] and there are no other eigenvalues in this interval. START. I I 1. If —%§)2 < 0 and —%(£)l > 0, then neither a nor b should be used as an initial point (there are critical points of f in both (a,A,-) and (A,,b)). Set c = 23—9. Evaluate —%3, the Sturm 3 ON SYMMETRIC TRIDIAGONAL EIGENVALUE PROBLEMS 37 sequence at c is available in the process. Reset [a, b] to be [c, b] or [a, c] in accordance with c < A,- or c > A,- respectively. Go to START. I 2. If —if-(%1 < 0 (there is a critical point between a and A;, so a I should not be used as an initial point ) and —%g)1 < 0, then I b can be one of the initial point. Set 0 = max {b — fl glib} f(b)’ 2 and evaluate —If,—((*CE)2. If c E (/\,-, b), then b and c will be used as a pair of initial points; go to END. Otherwise, reset [a,b] to be [c, b]; go to START. I I 3. If —If(%2 > 0 and _ngZ} > 0, then a can be an initial point but b can not (there is a critical point of f between b and A.) Set 0 = min{a — 1%3’97442} and evaluate —J}%§)l. If c E (a,/\,-), then a and c are a good pair of initial points; go to END. Otherwise, reset [a, b] to be [a, c]; go to START. ’ ' b . . . . 4. If — f(:) > 0 and —%b_)2 < 0, then there 18 no critical pomt of f in [a, b]. Thus either a or b can be used as one of the initial points. Set - _flel a_+_b - M £92 mi“ f(a)’ 2 i If ]f(a) S m] max {b — %§)21 9-3-9} , otherwise. and evaluate —%CE)2. Choose {a, c} or {0, b} to be the initial points depending on c < A,- or c > A,- respectively; go to END. END. 3 ON SYMMETRIC TRIDIAGONAL EIGENVALUE PROBLEMS 38 Algorithm IN IPTS Input: subscript i, initial end points 3,331,“. Output: starting iterates r(o),x(l) and m(x(0)),n(:z:(1)),— “flon— (Wm) Begin INIPTS (a, b): (;\,-,;\,-+1); Evaluate —Lé3], m(a), —fT’((—b%, m(b) by DETEVL; (1111) If —7{J) < 0 and —-(U > 0 then c: 42L; Evaluate _ff_’((2), m(c ) by DETEVL; (a,b)={ (c,b), if c<)\,- (i.e. m(c) /\,' (i..e K(c)2i) else if if} < 0 and ——(l < 0 then c=max{b— 'b, 4119]» )Evaluate ——((—%, I€( c) by DETEVL; If c> A; then (r(o),a:(l))- _ (b, c) ; Goto (t) ; else (a, b): (c, b); Goto (it) ; else if —%3)2 >0 and —fT(%l>0 then c: min {a— %,—3fi}f;( Evaluate —+],( K:( )by DETEVL; If (c< x\,) then (m(o),x(1))- — (a,c); Goto (it) ; else (a,b) = (a,c); Goto (It) ; C:{ min{a—-f7'£%l, 92%}, —f: , 3%}, otherwise <flb§l f(b > Al: End if ; (13) End INIPTS 1W) “13(0) ° Figure 5: Algorithm INIPTS 3 ON SYMMETRIC TRIDIAGONAL EIGENVALUE PROBLEMS 39 3.5.3 An alternative approach for clusters of eigenvalues The multiplicity estimation and the quasi-Laguerre iterations (31) and (32) with m > 1 can successfully overcome the slow convergence when a cluster of eigenvalues occur. An alternative approach without multi- plicity estimation as well as dynamical adjustments of the multiplicities can also work very efficiently in practice: By Theorem 2.3, when the quasi-Laguerre iteration for simple zero (m = 1) is used to approximate an M -fold zero :1:* (M _>_ 2), or a cluster of M simple zeros, of a polynomial f of degree n, it converges linearly with a ratio qu which is the only solution of a: — :1: — :1: = n — M M in (0, 1). The following theorem gives the properties of qn, M for M _>_ 2 ananM+1. 0 Theorem 3.3 (i) For M fired, qu increases as n increases (n> M). (ii) For a given n, qu Z qmg if 2 S M g n — 2. (iii) When M = n — 1, qn,n_1 increases as n increases (n 2 3). Proof. (2) For a fixed M, write n — M — 1 3 2 M — 1 :1: —a: —:1:+ n—M M 912(33) Then when n1 > n2, 9mm — g.. = ( SO,qn1’M > gn2,M. 1 1 TLQ—M nl—M >r3>0 for xE(0,1). (ii) For a given n, write cc—a: —:c+ . n—M M 9M0”) 3 ON SYMMETRIC TRIDIAGONAL EIGENVALUE PROBLEMS 40 When2SM_<_-3, 1 1 1 1 9M($)_92($) : (N—Z—R—M)x3+2—M >1+ 1 _1_ 1 - 2 n—2 M n—M n Tl = m_M(n—M) 20 for x€(0,1). 50, gn,M 2 (111,2- If g < M S n — 2, then 2 _<_ n — M < 721. It follows from above, qn,n_M Z qmg. On the other hand, gM(1‘) — gn_M(113) = (n 3M — fi) (1 — 333) > 0 for £1? 6 (0,1). Then gn,M > qn,n—M- Hence: qn,M > qn,n-—M 2 qn,2- (iii) If M = n — 1, write n—M—l3 2 M—1 2 9,,(17) n—M :1: —:1: —a:+ M ——a: —.7:+1—n_1. Then when n1 > n2, 1 1 gn1(a:)— gn2(:1:) = > 0 for :13 6 (0,1). n2—1 n1—1 SO, gnlml—l > gngmg—l' D For n 2 3 and M Z 2, by Theorem 3.3 (iii), (111,114 2 (13,2 When M = n -1, and by Theorem 3.3 (i), (ii), qn,M 2 (171,2 2 (13,2 when M S n — 2- In any case, q... E q3,2, the zero of 1:2 + x — 0.5 = 0 in (0, 1), is the smallest linear ratio for multiple zeros. Let 33(1),a:(2),- - - ,x("), - -- be 3 ON SYMMETRIC TRIDIAGONAL EIGENVALUE PROBLEMS 41 the monotone sequence of the quasi-Laguerre iterates. When {513(k)} converges super-linearly, then |$ q..., linear convergence seems apparent. In this case, for sufficiently large k, (Ic+l)_ =1: :1: a: ,.. ... I Q; qu or ]$(k+1)—$ I z q].”L‘(k)—.’C I where q E (171M, I13“) — 117*] and _ |:r(k+1)—:r(k)| _ |x(k)—a:*|—|x(k+1)—x*| ~ (l-q)I:B(k)—a:*| ~ 9k — |x(k)_x(k—l)| “‘ ]$(k—1)_x*|_|x(k)_x*| "’ (1_q)lx(k—1)_x¢| N q. In practice, qk’s can be very close to q after several iterations. If qj = q forj Z k then 0+1) Ix -$*|=q|:v(j)-~’v*|, for jzk and __1_($(k+1) _ aw). *:,(k) a: :L +1-q So, at the k—th iterate :13“), if linear convergence is revealed, namely qk > q..., then instead of performing the regular quasi-Laguerre iteration 130*“) : (Elk) + 6k where (5;. is the correction m(k“) — 56““) from 1:“) calculated by the quasi- Laguerre iteration with m = 1, one can accelerate the iteration by setting 5k m(w) z m(n + 1—%° The accelerated iteration can be summarized as follows: Assume we are evaluating the eigenvalue X" E [a, b] and initial points 23(0), 17(1) have 3 ON SYMMETRIC TRIDIAGONAL EIGENVALUE PROBLEMS 42 been obtained by the process described in Section 3.5.2 along with data I 13(0) I (1) _LLFJ and 475%} (#) For k = 1, 2, - - - do (while the stopping criteria is not satisfied), 1. Evaluate 6;, = 517““) — :13“) using the quasi-Laguerre formula with m =1. . If q (Le—i 5—3: < q,., then most likely the target eigenvalue x\* is neither a multiple zero nor a member of any cluster. Go to # and continue the loop. Otherwise, (a) Let c = mm + 11:15:; and check for a possible overshot by evaluating the Sturm sequence. If c overshoots, then re— set c = r(k) + w for l = 8,4,2,1 until overshooting disappears. (b) set 113““) = c and go to #. End for The algorithm ACCQ-LAG in Figure 6 summarizes the fundamental features of the acceleration process described in this section. Table 1 gives the comparison between the original quasi-Laguerre iteration and the accelerated quasi-Laguerre iteration. The Wilkinson matrix W9?) is used here with the starting points mm) = 11.5 and x“) = L_(2:(0)) = 11.27 targeting A23 -_- 11.0+5x 10-15 of W93, while A22 = 11.0—5x10"15. 3.6 Numerical tests Our algorithm is implemented and tested on SPARC stations with IEEE floating point standard. The machine precision is e z 2.2 x 10—16. 3 ON SYMMETRIC TRIDIAGONAL EIGENVALUE PROBLEMS 43 Algorithm ACCQ-LAG Input: subscript i, starting iterates 55(0), mm with Ii($(0)), m(rm), —%§50)1)2 and —%(§Tl;))- evaluated by INIPTS. output: the i-th eigenvalue A,- of T. Define q... = q3,2: the positive solution of x2 + :1: — 0.5 = 0 (the smallest linear ratio for multiple zeros); Begin ACCQ-LAG For k: 1,2,-~- U+, 1f 517(k) < Ai _ c = (iii are from quasi-Lag.formulae); 21-, if 1:0“) > A; (Ska-1““: q= 3%.: t01=2-5€[ma> 2 and 17517:;me < tol) Goto (it) ; If (q > q...) then I . c: m(k)+ {253; evaluate — f(cc) and m(c) by DETEVL; If (c is on the same side of A; as r(k),a:(k‘l)) then Goto (fit); else, For I: 8,4,2,1, l c = xa) + 311.011.). l—q ’ evaluate the Sturm sequence n(c) ; If (c is on the same side of A,- as m(k),a:(k'l)) Goto (ti); End for End if (it) 12”“) = c ; End for (11) End ACCQ-LAG Figure 6: Algorithm ACCQ-LAG 3 ON SYMMETRIC TRIDIAGONAL EIGENVALUE PROBLEMS Table 1: QOQNQO‘AOJNI— “(A)““QMNNNNNMNNNHHV-‘Hh—I—‘D—HHH #0383"OCDWNC3U‘AOONI-‘CCDWQC5U‘ACJNHO 513““) (original) 1 1 .500000000000000 1 1 .270700712327294 11 .149428542073966 11.0 63735435125402 1 1.0 26235830973979 11 .0 10578521008206 11.00 4261938886133 11.00 1715645778917 11.000 690622605815 11.000 277993423158 1 1 .000 111899071095 11.0000 45041905516 11.0000 18130368451 11.00000 7297872838 11.00000 2937554150 11.00000 1182429996 11.000000 475954003 11.000000 191581922 11 .0000000 77115924 1 1 .0000000 31040850 11.0000000 12494622 11.00000000 5029360 11.00000000 2024429 11.000000000 814877 1 1 .000000000 328006 1 1 .000000000 132029 1 1.0000000000 53145 1 1 .0000000000 21393 1 1.00000000000 8612 11.00000000000 3467 1 1.00000000000 1396 l 1.000000000000 565 11.000000000000 233 l 1 .000000000000 107 I(k+l)_ (1‘)] qk: |$(k)_$( (k— 1)] 0. 706618 0.4 37604 0.4 17533 0.40 3427 0.40 3112 0.4025 55 0.4025 56 0.40252 7 0.40252 5 0.40252 3 0.402522 0.402522 0.402522 0.402522 0.402522 0.402522 0.402522 0.402522 0.402522 0.402522 0.402522 0.402522 0.402522 0.402520 0.40252 1 0.40252 6 0.40252 1 0.402 477 0.40 1680 0. 399604 0. 381937 0. 305425 Acceleration of the quasi-Laguerre iteration for W55 the table, the digits before a space are correct. 3:0“) (accelerated) 1 1. 500000000000000 11. 270700712327294 11.0 13287361699255 11 .000 124299930121 11 .0000 61856842477 11.000000 198368395 11.0000000 99023529 1 1.0000000000 12077 11.00000000000 6040 1 1.00000000000 2304 1 1 .000000000000 156 1 1.0000000000000 64 44 For each number in 3 ON SYMMETRIC TRIDIAGONAL EIGENVALUE PROBLEMS 45 3.6. 1 Testing matrices There are 12 types of matrices employed in testing our algorithm. In the following description of these matrix types, 0;, i = 1, - - - ,n, denote the diagonal entries and 6;, i = 1, - - - ,n — 1, are the sub-diagonal entries. Matrices with known eigenvalues 1. Toeplitz matrices [b, a, b]. Exact eigenvalues: {a + 2b cos 'gkfillsksn ([5], Example 7.4, p137). 2. alza—b,a;=a for i=2,-~,n—1,an=a+b. fljzb, j=1,---,n—1. Exact eigenvalues: {a + 2b cos Wllgg ([5], Example 7.6, p138). { a for odd i (15:: , fl,- = 1. Exact eigenvalues : b for even i 2 "+1 (plus {a} if n is odd) 1§kSn/2 ([5], Example 7.8 and 7.9, p139). 4. a; = 0, fl; = \/z(—n——i_). Exact eigenvalues: {—n + 2k — lbs/,9. ([5], Example 7.10, p140). 5. oz.- = -[(2i—1)(n - 1) — 2(i—1)2], fl,- = i(n — i - 2). Exact eigenvalues: {—k(k — liilsksn ([5], Example 7.11, p141). {a+b :t \/(a—b)2+16cos2 k—” Wilkinson and random matrices , 6. Wilkinson matrices W; 3,- = 1, n/2—i+1 forevennandlgign/2 i—n/2 for evennand n/2 There are two values for u in the above formula. We will choose yk+1 (56) to be the one which is closer to yk, i.e., ka+1 -— ykl is smaller. More precisely, let A2 = Rl(yk—1 — 91023 + 4(n — 1)] and Q =(yk_1— yk)R + 2q(yk—1)- Since lo :l: M" = |0|2 + |A|2 :l: 2Re(r'iA), to make lyk+1 — ykl smaller or IO :1: A] larger, we choose yr. _ 2ln-(yk_§12;yAk)7(yk—1)l, if Re(§A) Z 0; yk+1 = (57) 3}]: _ 2[n_(yk—§12:yAk)q(yk—l)l, if R€(§A) < O To start the iteration, two initial points are required. For given yo, we always apply one step of Newton’s iteration at yo to obtain the second initial point; namely, 91 = 340 — f(yo) f '(310) 4 THE COMPLEX QUASI—LAGUERRE ITERATION 56 4.2 Convergence properties Theorem 4.1 The quasi-Laguerre iteration {yk} defined in the last section converges to 2 when yo is close to z. The ultimate convergence rate is x/2 + 1. Proof. Let yk — Z yk-i - Z, Clearly {yk} converges to 2 if for some r < 1, 16k] < r, Vk. From (56), VkZl. 6k: yk+1—Z :1_ €k+1 = 31k - Z 2 [n - (yr—1 — yr)q(yk—1)l . (58) (ilk-Z) {lyre—1-yk)R+24(yk—1)i\/Rl(yk-1—yk)QR+4(n—1)l} Let 2(1), - - - , 2("‘1) be the zeros of f besides 2. Then, f’(yk)_ _ 1 . _ "—1 1 q(yk)= m”)— yk _ Z+Q(yk) With 62(9) = 2:31 W _ f’—(yk 1)_ 21 (Am—1) — f(yk-l) ilk—1 - Z + Q0114) and R = n (dz/1:); E(::—l)) - (1(yk)(1(yk—1) _ n -1 _ Qty/H) _ Q(yk) _ — (yr—Z )(yk 1_Z) yk—Z yk_1_an(yk)Q(yk—1) — 1 +nR1(:l/k— 1, Me) With Rl(yk— 1, 31k) En i=1 l_(yk l—zli))(yk—z(ii)° Now, (a) (ilk—1 - yk)€1(yk—1) = (yk-1_ ilk) (yr—1 _ z + Q(yk—1)) =(1—6k)+Q(3/k)(1-€k)(yk—1-Z)=1-€k+0(yk—1—Z) (b) (yk—Z)(yk—1—Z)R = "—1—(311—1- Z)Q(yk—1) 4 THE COMPLEX QUASI—LAGUERRE ITERATION 57 —(yk - Z)Q QZU k—i Q(yk)Q (yrs—1)(yk—Z)(yk—1—Z) +7701}. — z)(y1_1— 2)R1(yk_1,y1)= Ti -1+ 0(71 1— Z) (C) (ilk - Z)(yk—1 - 310R = (1 — €k)(yk — Z)(3/k—1 — ZlR = (1 — €1)(n — 1) + 001-1 — 2) (because of (b) above) ((1) (yr: — Z)2R = 6143/4 - Z)(yk—1 — Z)R = 6k(n — 1) + O(yk_1 — 2) (because of (b) above) (8) (yr: - Z)QR [(yk—I - 31021? + 4(n — 1)] = [(y1. - Z)(yk—1 - 341:)ng + 4(" -1)(y1 - Z)QR = (l—ek)2(n—1)2 + 4(n—1)€k(n—l) + O(yk_1—z) = (n — 1)2(1+ 61)2 + O(yk—r - Z) (because of (c) and (d) above). If yo —) z and y1 —+ 2, then n—l 1 n—l 1 = . —) E . < , Q 0 and C2 > 0, independent of k, such that €k+1 =1— IOI1| _<_ Cllyk—l — 2| and |a2| S CQka—l — 2|- Let A = (n — 1)(1 — 6k) +2ek:l: (n — 1)(1 +61) 2(n — 1) + 26],. if ‘+’ is chosen; (4 — 2n)ek if ‘—’ is chosen. In (57), since (yr. — 201931: A) = 444—02, thus, making |yk+1 — ykl smaller is equivalent to making |A] larger. When lekl < 1, |2(n — l) + 2ek| > [(4 — 2n)ek|. Hence, to have |A| larger, we always choose ‘+’ for A, making A=2(n—1)+2€k and |A| = |2(n — 1) + 2ek| Z 2(n — 1) — 2|ck| 2 2n — 4. So, A+ozl 612—01 :1. z . 59 €k+1 A—I-CYQ A-i-ag ( ) If |61| < 1, I62] < 1,~-,|ek| < 1 and lyo - Z] < "(32" then |072| S C2|y1_1— 2] < Cglyo — 2| < 1. Hence, |A+072|Z |A|—|oz2| 22n—5>0. 4 THE COMPLEX QUASI-LAGUERRE ITERATION 59 From (59), |€k+1| = M<(§ifi|ly _, |A+072| _ 272—5 [CA-1 A In summary, we have shown the following: Lemma 4.2 If yo is close to 2, then there exists a constant C > 0 such that V14: 2 1, if |€1| < 1,---,|€k| < 1, then |€k+1| _<_ Clyk—l — 3| (60) Now, given a real number r < 1, when lgo — 2| < %, by (60) above, |€2| S Clyo — 2| < r or lyg — 2| < rlyl — 2|. Notice that from the way yl is chosen; namely, by applying one step of Newton’s iteration on given yo, we always have 191 — 2| < |y0 — 2|, or equivalently |€1| < 1 when yo is close to 2, say for instance, yo is in the convergence neigh- borhood of Newton’s iteration at 2. By (60) again, I153] S Clyl — 2| < Clyo — 2| < r, and so, |y3 — 2| < r|y2 - 2|. It follows recursively that |ek+1| S Clyk_1— 2| < r and |yk+1— 2| < rlyk — 2|, We 21 since C is independent of 16. Therefore {yk} converges to 2. Further- more, from (60), |ek+1| S C|y1_1— 2| —> 0, as k —-> 00. Thus the convergence of {ykfiozl to 2 is super-linear. The rest of the proof follows exactly the same argument as the proof of part (ii) of Theorem 2.3 with m = M = 1. D REFERENCES 60 References [1] E. ANDERSON, Z. 3.41, C. BISCHOF, J. DEMMEL, J. DONGARRA, J. Du CRoz, A. GREENBAUM, S. HAMMARLING, A. MCKENNEY, S. OSTROU- CHOV, and D. SORENSON, LAPACK User’s Guide, SIAM, Philadelphia, 1992. [2] J. J. M. CUPPEN, A divide and conquer method for the symmetric tridiagonal eigenproblem, Numer. Math., 36 (1981), pp. 177—195. [3] J. J. DONGARRA AND D. C. SORENSEN, A fully parallel algorithm for the symmetric eigenvalue problem, SIAM J. Sci. Stat. Comput., 8 (1987), pp. 139— 154. [4] G. H. GOLUB AND C. F. VAN LOAN, Matrix Computations, 2nd Ed., The Johns Hopkins University Press, Baltimore, MD, 1989. [5] R. T. GREGORY AND D. L. KARNEY, A Collection of Matrices for Testing Computational Algorithms, Robert E. Krieger Publishing Company, Hunting- ton, New York, 1978. [6] M. GU AND S. C. EISENSTAT, A divide-and-conquer algorithm for the sym- metric tridiagonal eigenproblem, SIAM J. Matrix Anal. Appl., Vol. 16, No. 1 (1995), pp. 172-191. [7] W. KAHAN, Notes On Laguerre’s Iteration, preprint, University of California, Berkeley (1992). [8] T. Y. Li AND Z. ZENG, Laguerre’s iteration in solving the symmetric tridiago- nal eigenproblem — revisited, SIAM J. Sci. Comput., Vol. 15, No. 5 (1994), pp. 1145-1173. [9] B. N. PARLETT, The Symmetric Eigenvalue Problem, Prentice-Hall, Englewood Cliffs, NJ, 1980. [10] B. N. PARLETT, The use ofa refined error bound when updating eigenvalues of tridiagonals, Lin. Alg. & Appls., Vol. 68 (1985), pp. 179-219. [11] J. RUTTER,A serial implementation of Cuppen’s divide and conquer algorithm for the symmetric eigenvalue problem, LAPACK lawn 69 (1994). REFERENCES 61 [12] D. C. SORENSEN AND P. T. P. TANG, On the orthogonality of eigenvectors computed by divide-and-conquer techniques, SIAM. J. Numer. Anal., 28 (1991), pp. 1752 - 1775. [13] C. TREFFTZ, C. C. HUANG, P. MCKINLEY, T. Y. L1, AND Z. ZENG, A scalable eigenvalue solver for symmetric tridiagonal matrices, to appear, Parallel Comput. [14] J. H. WILKINSON, The Algebraic Eigenvalue Problem, Oxford University Press, Oxford, 1965.