4v: '1':
‘ 17.9‘ J. '-
Vsé‘. v
‘7‘ 15?.” J A
3‘. 1 V6,“ ‘
- . v
’ "V.
...
l , '
Hz - .
"EL , ‘ l
,5 ‘y ,
1 ‘
a "K“ I
.37" -"-_
\lllllllllllllllllll fillzllllll
W
293 01399
l
\\\\l\\
m ls
l
This is to certify that the
dissertation entitled
GUASI-LAGUERRE ITERATJON AND ITS
APPLI CATION w SOLVIIV6 SYMMETRIC
TRIDIAGONAL ElGENVALUE PROBLEMS
presented by
MING JIM
has been accepted towards fulfillment
of the requirements for
PH.D degreein MATHEMATICS
4s %’
Majo4proessof
Date JULY 204 L995
s:
LlBRARY
Michigan State
University
PLACE ll RETURN BOX to remove this chockoulfmm your record.
TO AVOID FINES Mam on or baton date duo.
DATE DUE DATE DUE DATE DUE
MSU I. An Afflnnafivo ActlonlEqual Opponunlty Institution
cum
m1
——-___
QUASI-LAGUERRE ITERATION AND ITS
APPLICATION IN SOLVING SYMMETRIC
TRIDIAGONAL EIGENVALUE PROBLEMS
Ming J in
A DISSERTATION
Submitted to
Michigan State University
in partial fulfillment of the requirements
for the degree of
DOCTOR OF PHILOSOPHY
Department of Mathematics
1995
ABSTRACT
QUASI-LAGUERRE ITERATION AND ITS
APPLICATION IN SOLVING SYMMETRIC
TRIDIAGONAL EIGENVALUE PROBLEMS
By
Ming J in
In this work, the quasi-Laguerre iteration is established in the spirit
of Laguerre’s iteration for solving polynomial f with all real zeros. The
new iteration, which maintains the monotonicity and global conver-
gence of the Laguerre iteration, no longer needs to evaluate f". The
ultimate convergence rate is \/§ + 1.
Based on the quasi-Laguerre iteration, an algorithm is developed
and tested for solving the eigenvalue problem of symmetric tridiagonal
matrices. In general, it improves the speed of Laguerre’s iteration sub-
stantially and the accuracy is at least as good as any standard algorithm
for the symmetric tridiagonal eigenproblem. More preciously, the al-
gorithm, like the Laguerre iteration, enjoys the flexibility in evaluating
partial spectrum.
Although the quasi-Laguerre iteration is derived for solving polyno-
mials with only real zeros, the iteration itself can be extended to solve
polynomials with complex zeros and the local convergence can still be
guaranteed with the same convergence rate \/§ + 1 as the real case.
To my parents
ii
ACKNOWLEDGMENTS
I would like to express my deepest gratitude to my thesis advisor,
Professor Tien- Yien Li, for his guidance through the years I have spent
at Michigan State University. His encouragement, helpful directions
and lively discussions with me make this work possible.
I would like to thank my dissertation committee members Professor
Chichia Chin, Professor Richard Hill, Professor William T. Sledd and
Professor Zhengfang Zhou for their valuable suggestions and their time.
I would also like to thank Dr. Zhonggang Zeng for his precious advice
and collaboration.
iii
Contents
0 Introduction 1
1 Derivation of the quasi-Laguerre iteration 5
2 The formulation and convergence 13
2.1 The formulation ....................... 13
2.2 The convergence properties ................. 15
3 On symmetric tridiagonal eigenvalue problems 26
3.1 Evaluation of the logarithmic derivative f’ / f of the de—
terminant .......................... 26
3.2 The split-merge process ................... 29
3.3 Cluster and deflation .................... 32
3.4 Partial spectrum ...................... 33
3.5 Practical considerations in implementation ........ 34
3.5.1 Stopping criteria .................. 34
3.5.2 Initial points for the quasi-Laguerre iteration . . . 35
3.5.3 An alternative approach for clusters of eigenvalues 39
3.6 Numerical tests ....................... 42
3.6.1 Testing matrices .................. 45
3.6.2 Speed test in evaluating eigenvalues without com-
puting eigenvectors ................. 46
3.6.3 Speed test in evaluating eigenpairs ........ 49
3.6.4 Accuracy test .................... 52
4 The complex quasi-Laguerre iteration 54
4.1 The formulation ....................... 54
4.2 Convergence properties ................... 56
iv
List of Tables
1 Accelerated quasi-Laguerre iteration ............ 44
2 Execution time for evaluating eigenpairs ......... 51
3 Accuracy test of various algorithms ............ 53
List of Figures
OOxlovmrhOOMl—A
Algorithm DETEVL ..................... 28
Split and merge processes ................. 31
Cluster of eigenvalues .................... 32
The sign pattern of —f’/f ................. 36
Algorithm INIPTS for the initial points .......... 38
Algorithm ACCQ-LAG for acceleration .......... 43
Execution time in evaluating eigenvalues only (I) . . . . 47
Execution time in evaluating eigenvalues only (II) . . . . 48
vi
0 INTRODUCTION 1
0 Introduction
The globally and monotonically convergent Laguerre’s iteration
111(17): :1: + n (1)
(4(5)) i \ (n ‘1)[(" ‘ 1) (4(5) )2 ’ " (ffl) )]
has been successfully used to find the eigenvalues of an n x n symmetric
tridiagonal matrix T with nonzero sub-diagonal entries by approximat—
ing the zeros, always real and simple, of its characteristic polynomial
f(/\) = det[T — AI]. (2)
Remarkable numerical results, in terms of both speed and accuracy, on
a substantial variety of matrices have been obtained [8]. The algorithm
employs the split-merge process, similar to Cuppen’s divide and con-
quer strategy, which provides an excellent set of starting values making
the algorithm naturally parallel. Impressive speedups of the parallel
version of the algorithm were reported in [13]. For 2,000 x 2,000 ran-
dom matrices, the algorithm can reach a speedup of 52 on an N -cube
with 64 nodes.
Laguerre’s iteration, or modified Laguerre’s iteration when clusters
occur, converges ultimately with a very fast cubic convergence rate.
Nonetheless, the most important advantage of Laguerre’s iteration in
solving (2), in contrast to Newton’s iteration, is its monotone conver-
gence. However, comparing with Newton’s iteration, a major disadvan-
tage of Laguerre’s iteration is the evaluation of f”, which is relatively
time consuming. In this work, a new algorithm is designed in the spirit
of Laguerre’s iteration for finding zeros of (2). The algorithm, without
evaluating f”, maintains the monotonicity of Laguerre’s iteration and
0 INTRODUCTION 2
converges globally with ultimate convergence rate \/2 + 1 in finding
zeros of any polynomial with all real zeros.
The formula (1) of Laguerre’s iteration can be derived in diverse
ways. The best one seems to be to answer the following question [7]:
QUESTION 1: Among all polynomials p(:1:) of degree n with 72 real
zeros and with p(:1:0) = f(a:0) aé 0,p'(:z:0) = f’(:r0) and p”(a:0) =
f”(a:0) at a specified real 1130, which one has a zero closest to 11:0?
and where?
In general, L+(a:0) in (1) gives the closest zero from the right of all
those polynomials and L_(:1:0) gives the closest one from the left.
To avoid the evaluation of f” , we revise the above optimization prob-
lem as follows:
QUESTION 2: Given two specified reals a < b, among all polyno-
mials p(:r) of degree n with n real zeros, none of which lie in [a, b],
and with
11(3)._.f_'(_‘1_) d fl9=fl (3)
p‘f f(b)’
which one has a zero closest to a from the right or from the left?
and where?
The optimization problem for this class of polynomials can be easily
solved and the solution has a closed form. To convert the solution of
this optimization problem to an iteration scheme, which we call the
quasi-Laguerre iteration, with the assumption that x(k‘1) and 2:“) are
available, we may
(2) let a = $(k—1), b = 17““) and let 513““) be the closest zero to 11:“)
from the right when sew-1) < 116(k) and a zero of f is bigger than
(k)
a:
3
0 INTRODUCTION 3
(“l let a = 113(k), b = (Cw-1) and let 13““) be the closest zero to 22““)
from the left when 550-1) > 27““) and a zero of f is less than 17““).
In this manner, the sequence generated will converge, in fact, mono-
tonically to a zero of f.
The optimization problem in Question 2 is formulated in a more
general form in Chapter 1 to account for multiple zeros. The answer
to this problem leads to the quasi-Laguerre iteration. In Section 2.1,
the iteration is formulated. The ultimate rate of convergence of our
iteration so obtained is \/2 + 1 which Will be shown in Section 2.2.
One of our main goals is to use the newly derived quasi-Laguerre
iteration for approximating eigenvalues of symmetric tridiagonal ma-
trices. The details of this implementation are described in Chapter 3,
including some practical considerations in implementation (Section 3.5)
and comprehensive numerical results on diverse types of matrices (Sec-
tion 3.6). Apparently, our algorithm improves the speed of Laguerre’s
iteration considerably in general. Like Laguerre’s iteration, our algo-
rithm is inherently parallel and has a great capacity for vectorization.
An intensive experiment in this regard will be reported in a future work.
Although the quasi-Laguerre iteration is derived for solving polyno-
mials with only real zeros, the iteration itself can be extended to solve
polynomials with complex zeros in the same way the Laguerre iteration
is extended. In Chapter 4, we will first define and formulate the com-
plex quasi-Laguerre iteration. Then in Section 4.2, we will show that
the local convergence of our iteration is guaranteed for simple zeros
and the ultimate convergence rate is \/2 + 1, similar to the real case.
Comparing with Newton’s and Laguerre’s iteration, the most attractive
aspect of our quasi-Laguerre iteration in complex case is its fast con-
vergence as well as the elimination of evaluating the second derivative,
0 INTROD UCTION
even though it no longer enjoys the monotone or global convergence.
1 DERIVATION OF THE QUASI—LAGUERRE ITERATION 5
1 Derivation of the quasi-Laguerre iteration
Let f be a polynomial of degree n with all of its zeros being real. For
a < b with f(a)f(b) 75 0, let .7: be the class of polynomials p(a:) of
degree n that satisfy the following conditions:
(i) all zeros of p(a:) are real;
(ii) none of the zeros of p(a:) are in [a,b];
(M) pp at 0 and
We pose an optimization problem more general than Question 2:
QUESTION 3: For 19 E .7, consider its m-th (m < n) zero to the
right (or to the left) of a. Which polynomial in f has the closest
one to a from the right (or from the left)? and where?
To answer this question, let z1,22, - - - , 2,, be all the real zeros of a
given polynomial p(a:) in .7: such that
Z1SZ2S“°SZk. (6)
ForlSiSnJet
b—Zi b—a
+
a—z, a—zi
then, X,- > 0 for all i since no 2,: falls between a and b. Moreover,
iXi=i(l+:—::-i)=n+(b—a)q(a)Er(a,b), ()
zznl 1:711 7
z 31: :- 2; (1+ 5:) = n+ (a — b)q(b) = r(b,a),
' 1 1
S
N
1 DERIVAT ION OF THE QUASI—LAGUERRE ITERATION 6
and from (5),
XkZXk—IZ"'ZX1>1>XnZXn—1Z"'2Xk+1>0- (8)
For convenience, we rewrite (8) as
W12W22~2Wk>1>Wk+1z-o-ZW. (9)
with
X _ f l < k
m = k+l l OI’ _ (10)
Xn+k+1_[ for l > k.
From (5), the m-th zero of p(a:) to the left of a is zk_m+1 which, from
(10), corresponds to Wm in (9), and minimizing (a —— zk_m+1) becomes
maximizing Wm. Similarly, the m-th zero of p(:c) to the right of a is
zk+m and minimizing (zk+m — a) becomes minimizing Wn_m+1. As a
convention, if m > k then the m-th zero of p to the left of a will be
taken as the (n — m + 1)-st zero to the right of a and if m > n — k then
the m-th zero of p to the right of a is the (n — m +1)-st zero to the left
of a.
The optimization problem in Question 3 can now be converted to
the following optimizations:
(P1)
Max Wm
subject to Z W,- = r(a, b)
1 b (11)
E W: -’ 74( 3a)
1 DERIVATION OF THE QUASI—LAGUERRE ITERATION 7
(P2)
M2” I/Vn—m+1
subject to Z W,- = r(a, b)
i=1
1.21% = r(b, a)
W12W2.>_"'2Wn>0-
The solution of (P1) gives the m-th closest zero to a from the left
while the solution of (P2) gives the m—th closest zero to a from the
right.
We first consider the optimization problem (P1).
Lemma 1.1 r(a,b) > 0,r(b,a) > 0 and r(a,b)r(b,a) 2 n2.
Proof. By (8), we have r(a, b) > 0 and r(b, a) > 0. From the Cauchy-
Schwartz inequality,
r(a, b)r(b, a) (because of (11: ). D
Lemma 1.2 The point W = (W1, - - - ,Wn) satisfying (11) with W1:
W2=~~=Wm=~~=W—n_1 2 Wu > 0 and
_ b
Wm Z T(a7 ) (12)
n
is feasible to problem (P1).
Proof. From (11), Wm and V7,, must satisfy
(n—1)Wm+Wn = r(a,b). (13)
—1 1
TL. + __ = r(b,a). (14)
Wm Wn
1 DERIVATION OF THE QUASI—LAGUERRE ITERATION 8
Eliminating Wm in (13) and (14) yields
r(b,a)Wi — [r(a,b)r(b,a) — n2 + 277.] W, + r(a,b) = 0. (15)
By Lemma 1.1, it can be easily verified that both solutions WE,” and
WE?) of (15) are real. In addition, they are positive since W£1)+W[,2) > 0
and Wian) > 0. Let W?) 3 W53). Since
:(a, b): r,(a b)r( n
n 0)
WSW?) :-
s(
so, W[,1)< M. Substituting Wnl) into (13),
102) (because of Lemma 1. 1 ),
W“) = 1 (r(a, b) — W‘”) >
"I ”—1 n — n
It follows that Wm > W41 and so, W = (an, --,W,(n),
feasible. D
n )9
Now, for any W 2 (W1, - -- ,W n) satisfying (11), we have
(Wj—Wm)2 " (W; W) " ( W2)
jg" Wj g1 WJ H]; WJ
= r,(a b) — 2nWm + r(b, a)W,3,. (16)
K}.
On the other hand,
2(Wm—Wj) = (n-m)Wm- Z W,-
j>m j>m
= (n — m)Wm — r(a,b) + Z W]-
JSm
2 nVVm — r(a, b).
From (12), nWm — r(a,b) Z 0 for a certain feasible point, so we may
assume, in general, Wm satisfies
nWm — r(a,b) Z 0
1 DERIVATION OF THE QUASI—LAGUERRE ITERATION 9
since we are maximizing Wm. It follows that
[nWm _ ,(a, 5)]? g [3;(Wm — Wj)[2 = [g (Es/$1 . [HZ-Hz
s z: (W: X“) 2%. W].
(by Cauchy-Schwartz inequality)
3 [r(a,b) -— 2nWm + r(b, a)W3,] (r(a, b) — me)
(because of (16) ).
Or,
mr(b, a)W3, — [7°(a,b)r(b,a) — n2 + 2mm] Wm + mr(a, b) g 0 (17)
and the equality holds iff W, = Wm for all j < m and W; = Wn for all
l>m.
Similarly, when solving the optimization problem (P2), one can show
that, with the assumption
TU), a’)Wn—m+l T" n S O
which we may assume without loss of generality, Wn_m+1 satisfies the
same inequality, that is,
mr(b, a)W3_m+1 — [r(a, b)r(b, a) — n2 + 2mn]Wn_m+1+ mr(a, b) S 0
(18)
and the equality holds iff W, = Wn_m+1 for all j > n — m + 1 and
l/V1=W1foralll 0
and
r(a,b)
r(b,a)
so, Ymi > 0. On the other hand, it follows from (17) and (18) that
Ym+-Y _=
>0,
)fm- S Wm _<_. Ym+ (19)
and
Ym— S Wn—m+1 S Ym+- (20)
Accordingly, W 2 (W1,- - - ,Wn) with W1 2 2 Wm = Ym+ and
Wm+1 = = W" is the solution of the maximization problem (P1),
and, from (11),
mYm+ + (n — m)Wn = r(a, b),
m n—m
— —:— = b .
Ym+ + Wn T( ’0’)
Solving for W”, one can easily see that Wu is actually I/(n_m)_. So,
_ Ym ' < m
WJ = + J _ (21)
3/(n—m)— .7 > m°
Similarly, the solution of the minimization problem (P2) is W 2 (W1,
--,Wn) with
(22)
Y(n—m)+ j< n—m'l'l
Ym_ jZn—m+1.
1 DERIVATION OF THE QUASI-LAGUERRE ITERATION 11
Recall that Wj’s are defined by
b — a
a—uj
Wj=1+ j=1,---,n
where uj’s are zeros of a certain polynomial in 3:. Thus any polynomial
satisfying (21) must be in the following form
P103) = C(13 - urn—WU? — ”On—m (23)
where C is a constant, and
b—a b—a
1 —— = m ' 1 t1 m- = — —— 24
+ a _ Um_ + or equlva en y u a Ym+ _ 1 ( )
and
b — b _
1 + a = I/(n_m)_ or equivalently v1 = a — a .
a _ U1 }/(n—m)— _ 1
And any polynomial satisfying (22) must be in the form
192(17) 2 C(1: — um+)m(.r — v2)"_’" (25)
where C is a constant, and
b - a b — a
1 + —— = K”. or u'val tl m = — —— 26
a—um+ eq1 eny 11+ a Ym——1 ( )
and
b — a b — a
1 + = Yn_m or e uivalentl v = — .
a "' ’02 ( )+ q y 2 a nn—m)+ _ 1
From (24) and (26),
b — um_ b — m
——= m+>0 and —u—+-=Ym_>0,
a — Um_ (I — Um+
so both um+ and um_ are outside the interval (a, b). Similarly, both 121
and 112 are outside the interval (a, b) since Y(,,_m)i > 0. Consequently,
both 101(1)) and 192(3)) are in .73.
1 DERIVATION OF THE QUASI—LAGUERRE ITERATION 12
Moreover, from (24) and (26)
b — a
Ym; — 1
’llmi = a —
— 2mr(b,a)(b—a) _
r(a,b)r(b,a)—n2+2mn—2mr(b,a)2}:\/[r(a,b)r(b,a)—n2][r(a,b)r(b,a)—(n—2m)2]
2min — (b — 604(5)]
= a + —(b — a)R — 2mq(b) :l: \/R[(b — a)2R + 4m(n — m)] (27)
where
_fW0 _fw)m, =naM%20)_ a
gm)—fmy am—fw, ciR ( ,_a ) q(mo)
The solutions of the optimization problems in Question 3 can now
be described as follows:
(i) If f has at least m (counting multiplicities) zeros to the left of a,
then the zero um_ in (27) of the polynomial
mb)=ce—us)%x—mvv:
in (23) is the closest m-th zero to the left of a among all polyno-
mials in .77 . And it is clear that
zm_§um__<_---_<_u2_Su1_ 2M. 1]
The rate of convergence of both sequences {xiii} is given in the
following theorem.
Theorem 2.3 (i) When m < M, the convergence of {xiii} to 2M is
linear and the convergence ratio
(1+1 _Z
. mi 11/!
w_11m
11—100 $(3t-ZM
is the only real solution of
:1: —a: —a: :0
n—M + M
in (0,1).
(ii) When m = M, the convergence of {$153,} to 2M is super-linear
with convergence rate V5 + 1.
Proof. Again we shall prove the theorem for the sequence {mlfilfiil
and write yk for 2:55). when there is no ambiguity. Similar arguments
hold for the sequence {xifiiflir
Recall that
yk+1 = Elk—
2m[n - (vi—1 - yk)q(yk_1)]
(we-1 — 110113 + 2mg(yk—1)+ \/R[(3/k—1 — 11021? + 4mm - 171)]
(33)
where
_ f’(3/k—1) _
9(91—1)- f(y1s_1)’ Q(yk)— f(yk)
and
_ (1(3/kl-Q(yk—1)
R—n( sis—y). )—q(yk)q(yk-1).
2 THE FORMULATION AND CONVERGENCE
17
Let z“) ,z(" M be the rest of the zeros of f besides 2M. Then
f'(yk_1) M
_ : : + _ ,
q(yk 1) f(yk—1) yk—1 — ZM Qty]: 1)
f,(3/k M . "’M
and
R = n (MEL—133?”) — 1(ys)q(y1_1)
: MW “ M) _ MQ(yk—1) _ M62011)
(yk—ZM)(yk-1—2M) yk — 2M
— Q(y1)Q(y1_1)
yk—l-ZM
R ' h R EM 1
+n W1t E . . .
1 1 121 (3111—1 — 2(1))(yk — 3(1))
Since yk —1 2M as k —) 00, so
1 1
k- 2 )—* < oo,
Q)(yk—z) (zM—zw)?
Let 6k = Ali-Ta"— V]; Z 1 then yk—l-yk _
Elk—1 ZM
yk-I‘ZM — 1 — 6],. Obviously,
0 0,
\/(1 — (E)2 + 476
so, from (36), C(e) is monotonically decreasing.
Now,
(i) For m < .M, let
M — m m(n — m)
(3 A4 €(0,) an 7 M(n—M)>0
Since C(e) is monotonically decreasing for e E [0,1] and 6k 6 (0,1) for
all k 2 1, we have
fl
0< =G1 zM, there
exists an integer N > 1 such that
G(1) — C(yN_.1— 2M) > 0 and C(0) + C(yN—l — 2M) < 1. (38)
Let
01 2: G(1) — C(yN_1— 2M), d1: G(0) + C(yNfll —- 2M),
Ck+1 = C(dk) " C(yN+k—1 — 2M), dk+1 = C(Ck) + C(yN+k—1 — 2M)»
for k 2 1. Then,
2 THE FORMULATION AND CONVERGENCE 21
(i) {ck} is monotonically increasing and {dk} is monotonically de-
creasing, namely,
0 G(dk-1) — C(yN+k—2 - ZM) = Ck
dk+1 = C(01) + C(yN+k—1 — 2M) < G(Ck—1)+ C(yN+k—2 - 2M) = (1k-
Furthermore,
Ck+1 = G(dk)-C(yN+k—1-ZM)
< C(6N+k)—C(yN+k_1—2M) (because of (39) )
< eN+k+1 < G(€N+k) + C(yN+k_1 — 2M) (because of (37))
< C(ck) + C(yN+k_1 — 2M) = dk+1. (because of (39) )
Hence,
0 d. Then C(c) = d and C(11) 2
Since C(e) is monotonically decreasing, G2 = G o G is monotonically
increasing and can have at most one fixed point. It follows that c = d
and consequently C(w) = w, where
yk—
w— — lim 6),: lim
k—ioo k—mo yk_1 — ZM
Namely,
2(M — m)
w :
All (1+w+\/(1 —w)2+%fi%w)
or, w is a solution to the equation
g(x)En;£lA;mx3—x2—E+MA_J-m=0. (40)
This equation has exactly one real solution in (0,1), since
M — m
n—M—m M—m n—M—m M-—-m
= —1—1 = —— —
9(1) n—M + M [n—M 1l+i M ll
_ —m + —m _ —mn < 0
_ n — M M _ M(n — M) ’
and
g1($)_3"__A_LMfl$2 _2$—1<0 forxe(01)
7?,
(ii) For m = M, from (36), C(61) = 0 for k 2 1; accordingly, from
(35),
61,11: 0(yk_1-— 2M) —) 0, as k —-) 00.
Thus the convergence of {ykfiozl to 2M is super-linear. Further, from
the derivation of yk+1 described in Chapter 1, yk_1, yk, and yk+1 satisfy
the following two equations
M n — M M
—— + = 1011) = 621111) +
311 - yk+1 yk - v 91 - 2M
2 THE FORMULATION AND CONVERGENCE 23
M n — .M 11/!
+ = (1011—1) = Q(yk—1)+
Elk—1 - yk+1 Elk-1 — ”U Elk—1 — 2M
for a certain number v. From the above two equations, we have
n — M M Ms 1 — ZM
= 12(1).) — ( + l (41)
21/1 - v (311 — 210(11): — 11m)
11 — M M yr — ZM
ilk—1 — U (Elk—1 — ZM)(3/k—1 — yk+1)
Subtracting (42) from (41) and dividing the result by (yk_1 — yk) yields,
n—M =R1— M(yk+1-ZM)(yk-1+yk—yk+1—ZM)
(yk—v)(1/k—1 -’U) (yk—ZM)(yk—yk+1)(yk—1 -ZM)(yk—1—yk+1)'
(43)
Substituting (41) and (42) into (43) and multiplying both sides by
(n — 114), we have
_ M(yk+1 - 2M) _ M(yk+1 - ZM)
(Q(yk) (yk _ ZMlUlk — yk+1)) (ka-l) (yk—l — ZM)(1/k—1— yk+1))
_ ,,_ __ M(n—MXZ/kH—ZM)(yk-1+yk—yk+1-ZM)
_ ( MRI (.111 - 2M)(yk - yk+1)(y1_1— 2M)(yk—1 — 111.11) '
Rewrite the above equation as follows:
yk+1 — 2M
(Elk - ZM)(3/k-yk+1)(yk—1—ZM)
'13 =(n-M)R1— Q(y1)Q(;1/1_1) (44)
where
yk—l + yk — yk1- ZM
E E M(n — M) + — MQ(yk—1)(yk—1 — ZM)
yk—l — yk+1
-MQ(yk)(yk _ 2111) Me — yk+1 + M2 yk+1— ZM
yk—l — yk+1 yk—l — yk+1
6/:
= M(n — M)(1+ -,———) — M12 M(n—M) as k—)oo.
2 THE FORMULATION AND CONVERGENCE 24
It follows from (44) that
yk+1- ZM = (.91 — yk+l> (n - MIRI - Q(3/k)Q(3/k-1)
(31k _ ZM)2(yk—-1 " 2M) y), — ZM E
_ _ (n — M)R1— Q(yk)Q(yk—1)
— (1 €k+1) E
(n — Mm - a?
—)F
<00 as k-—>oo.
M(n—M)
Iiere,
1 11—111 1 1 "—M 1 2
=_ . _ __._____ > .
F M 1:21 (mu—£50»2 M(n—M) (1:21 2(M—2(2)) —O
If F = 0, then 2“) = 2(1), for i : 1,---,n — Ill. That is, f(:1:) has only
two distinct zeros: 2(1) and 2M. Hence 1:13,). = 2M and no convergence
discussion is necessary. Consequently, only f (:13) which has more than
two distinct zeros is considered. Then, F > 0. By the following lemma,
the convergence rate of {yk}f_’__1 to 2M is indeed \/2 + 1. El
Lemma 2.4 Suppose a sequence {ykfiil converges to 2. Let e;c be
lyk - 2|- If
61,, = Mkezek_1 and 3320 M1, = M“
where 0 < M” < 00, then the convergence rate of {yk} is x/2+ 1, i.e.,
€k+1 __ * 1/\/§
11—100 6,2544 — (M ) .
Proof. We assume first M * = 1, that is, lim),_,OO Mk 2 1. Let
dkzlnek+1—(\/2+1)lnek, 111721.
From ek+1 = Mkeiek_1,
1n ek+1 = 2111 e), +lnek_1+1an.
2 THE FORMULATION AND CONVERGENCE 25
So,
(1;, = 2lnek +lnek_1 +1an — (\/2+1)lnek
(1 — \/2)lne1c +lnek_1+lan
= (1 - \/2)[lnek — (Vi+ 1) 11161-1] +111 Mk
(1 — \/2)dk—1+lan.
Since0>1—\/2>—1andlanIk—>0ask——>oo,wehave
dk——)0 as k—>oo.
That is,
€k+1
ex/‘§+1——)1 as k——>oo.
k
Now, if [11* 79 1, let
Mk
e[.=\/M*e/c and M,',=M*.
Then,
M
830” : VM*ek+1= MEoo
Therefore, it follows from the above derivation,
el+1 :
k—>oo (ems/1+1
which implies
1. 6k+1 _ (VM’ilfl'H
_ 11: 1/\/2
111120 ell/5+1 '— M T (M ) ' U
3 ON SYMMETRIC TRIDIAGONAL EIGENVALUE PROBLEMS 26
3 On symmetric tridiagonal eigenvalue problems
In this chapter, we shall use our quasi-Laguerre iteration derived in pre-
vious chapters to approximate all eigenvalues of symmetric tridiagonal
matrices.
3.1 Evaluation of the logarithmic derivative f’ / f of the de-
terminant
Let T be a symmetric tridiagonal matrix of the form
(01 H1 l
31 a2 52 O
T=(fi._1,a.—,a)= If ,. . (45)
O fin—2 an—l flit-1
\ 1811—1 an f
We may assume, without loss of generality, that T is unreduced; that
is, O, 74 0, j = 1, - - - ,n — 1. For an unreduced T, the characteristic
polynomial
f(/\) s det(T —— )1) (46)
has only real and simple zeros ( [14], p300). In order to use our quasi-
Laguerre iteration developed in previous Chapters for finding zeros of
f (A), or the eigenvalues of T, it is necessary to evaluate f and f’ effi—
ciently with satisfactory accuracy in the first place.
It is well known that the characteristic polynomial f ()1) :— det(T —
A1) of T = [fit—11011351] and its derivative with respect to A can be
3 ON SYMMETRIC TRIDIAGONAL EIGENVALUE PROBLEMS 27
evaluated by three-term recurrences ( [14], p423):
= 1, = — A
100 P1 C11 2 . (47)
Pi = (011‘- A)/)i—1 — (31-1P1—2, l = 2131'“ an
I = 0, I = _1
pi) pl 1 2 1 - (48)
pi : ((15 — A)pi—l _ pi—l — fii—lpi—21 2 = 2131- ‘ : an
and
f(/\) = pm f'()\) = pi;-
However, these recurrences may suffer from a severe underflow/over-
flow problem and require constant testing and scaling. The following
modified recurrence equations avoid the above problem subtly by com-
puting the quotient q(A )2 f(A ) / f ( ) directly [8]. After all, only q(A)
is really needed in the formulae (30), (31) and (32). Let
m 102-
Si: 1 771: __1
101—1 P1
£1 = a1 _/\1 (49)
2
Si : Qi—A—?]:11,Z—2,3,‘ ana
1
770:0, 771:5,
2 (50)
771:3; (01i_)\)7li—1+1-(?:_11)77i—2]1 i:2939”°3n
2
and
f’(/\) __
_ _ 77”.
f (A)
To prevent the algorithm from breaking down when 5,- : 0 for some
1 S i g 11, an extra check is provided:
0 If {1 = 0 (i.e., 011: A), set £1: 252;
312—152.
oIf£,:0,i>1,set§,-= {_1
3 ON SYMMETRIC TRIDIAGONAL EIGENVALUE PROBLEMS
Algorithm DET EV L:
Input: T=[)’31—1.011,fii], A
f' A
Output: _f()\) 21],, (where f(A) Edet(T—AI))
and 11(A) = neg_count
(=the number of eigenvalues of T less than A)
Begin DETEVL
£1 = 01 — /\
If 51: 0 then 51: (e2
1
z 0, : —, _ , t : 0
770 7]] £1 neg COUH
If £1 < 0 then neg-count =1
For i=2:n
2
€1=(C¥1-A)-(%:—11)
(3’2“) 2
If£,=0then§,= E: E
If f,- < 0 then neg_count = neg-count +1
2
771: "El: [(011 - (1)771—1 +1 — (21:11) 771—2]
End for
End DETEVL
Figure 1: Algorithm DETEVL
where e is the machine precision. A determinant evaluation subroutine
DETEVL (see Figure 1) is formulated according to the recurrences (49)
and (50). When 5,, i = 1,---,n, are known, the Sturm sequence is
available ([9], p47). Thus, as a by-product, DETEVL also evaluates the
number of eigenvalues of T which are less than A. This number is
denoted by 11(A).
Let A1 0, for all i = 1, 2, - - - ,
n — 1, since in (47)—(50), Bis always appear in their square form. The
following interlacing property for this rank-one tearing is important to
our algorithm.
Theorem 3.1 Let A1 < A2 < < An and A1 __<_ A2 _<_ S An be
eigenvalues of T and T respectively. Then
Sis/)1sissAQs-~~si\nsx\.
A,-, we may let 2:10) = 3241, (129) = c and when 0 < A,-, let 2):?) = A,- and
512$) = c. Then A,- < :19) < 11:10) or .1739) < 33$) < A,. Respectively,
equations (31) and (32) with m = 1 are used to generate 11$“) for
k 2 1, namely,
1'1““) = 121%
3 ON SYMMETRIC TRIDIAGONAL EIGENVALUE PROBLEMS 31
00
And sequences {13%)} are obtained which converge monotonically,
either increasingly or decreasingly, to A,- with ultimately convergence
rate \/2 + 1 by Theorem 2.3.
The eigenvalues of T in (52) consist of eigenvalues of To and T1 in
(53). To find eigenvalues of To and T1, the split-merge process described
above may be applied to T0 and T1 again. Indeed, the splitting process
should be applied to T recursively (See Figure 2) until 2 x 2 and 1 x 1
matrices are reached.
/\ 0
SP“! /\ /\ Merge
1’ Tm Tm T011) T011 T100 T1 1 T1 T111
Figure 2: Split and merge processes
After T is well split into a tree structure as shown in Figure 2, the
merging process in the reverse direction from 2 X 2 and 1 x 1 matrices
can be started. More specifically, let T, be split into T 00 and T01. Let
T00 0
‘1’, - . - A3,, be eigenvalues of To = 0 T in ascending order. Then
01
the quasi-Laguerre iteration is applied to the polynomial equation
A
f,,(A) .2 det[T, — AI] = 0
from either 1)”) or Am to obtain the corresponding eigenvalue Ag, 1 =
1,2, - - - ,m, by the merging process described above. This process is
continued until To and T1 are merged into T. That is, in the final step
all the eigenvalues of T are obtained by applying the quasi-Laguerre
iteration to f (A) = det(T — AI) from eigenvalues of T0 and T1.
3 ON SYMMETRIC TRIDIAGONAL EIGENVALUE PROBLEMS 32
3.3 Cluster and deflation
By Theorem 3.1, A,- E (A,,A,-+1) for each i = 1,---,n with the con-
vention An+1 = An + 2%. If A,“ — A,- is less than the error tolerance,
then either A,- or A,“ can be accepted as A,. In general, if T has a
cluster of r + 1 very close eigenvalues, for instance, Aj+, — A,- is less
than the error tolerance for certain 1 S j S n — r, then r eigenvalues
Aj, Aj+1, - - - ,Aj+,_1 of T can be obtained free of computations. They
can be set to any one of Aj, - - - ,Aj+,.
Since the matrix T in (45) is unreduced, its eigenvalues are all simple.
Therefore using m = 1 in the quasi-Laguerre iterations given in (31)
or (32) seems appropriate in all cases to obtain ultimate super-linear
convergence with convergence rate x/2+ 1. However, in some occasions,
there may exist a group of r > 1 eigenvalues of T, say,
Ai+1 < /\i+2 < < /\i+r
which are relatively close to each other, comparing to the distance from
(0) (1)
the two starting points, say 23+ < 23+ < A,-+1 as shown in Figure 3.
Some of them may even be numerically indistinguishable.
x2?) x1!)
A l LLLLJJII >x
A-i A1+1 11+] A'ivo-r
Figure 3: converging to relatively closer eigenvalues
Numerical evidence shows that to reach A,“ from 11318 )(= (9+1) and
333,1), it may take many steps of the quasi-Laguerre iteration with m = 1
before the super-linear convergence shows. In this situation, the quasi-
Laguerre iteration with m = r in (31) may be used to speed up the
convergence. Exhibited in (29), the sequences {rig} in (31) with
3 ON SYMMETRIC TRIDIAGONAL EIGENVALUE PROBLEMS 33
different m’s satisfy
k— - k 1.-
11:1 1) <.1:[k+)4»
Then 0 values among them can serve as the first staring points which
will lead to all a eigenvalues of T in [a, b].
In some occasions, the eigenvalues of T of interest are identified in
magnitude, say the largest 20%, then they can be evaluated by using
the largest 20% eigenvalues of T as starting points without computing
the other 80% eigenvalues of both T and T. i
3.5 Practical considerations in implementation
3.5.1 Stopping criteria
The following stopping criterion was suggested by Kahan [7] and used
for our implementation:
[m(Hl) _ 2301)]? S 013(k) _ 3411-1)] _ [m(kH) _ $01)”, (54)
where T is the error tolerance. This criterion is based on the following
observations. Let
5,301+” _ m(k)
m(k) _ m(k-l)
9k:
then as {rm ”2:, converges to A when k —-) oo, q;c is normally decreasing.
Thus
< E ]$(k+2+i) __ $(k+1+i)|
i=0
EWMHH) _ x(k+l+i))
i=0
i)‘ _ $(k+1)| :
3 ON SYMMETRIC TRIDIAGONAL EIGENVALUE PROBLEMS 35
oo k+1 __ k
_<_ [$(k+1)_ $01)] 2 q: : (Iklml ) :1:( )]
i=1 1 — qk
lx(k+1) _ $(k)]2
: (1(1) _ $11—11] _ (x1141) _ m(kll'
From (51), an obvious error tolerance at A, can be chosen as
55 .
T = 7 m1X)e.
3.5.2 Initial points for the quasi-Laguerre iteration
At every stage of the merging process, we evaluate the eigenvalues A1 <
- - - < Am of an m x m submatrix, given m initial values A1 3 - . - 3 Am
and an upper bound that interlace those 111 eigenvalues:
11311sighs-SESAME“
(see Theorem 3.1). To evaluate an eigenvalue A,- by the quasi-Laguerre
iteration, two initial points are needed on the same side of A,- without
any other eigenvalues between them. However, for i = 1, - - - ,m, the
interlacing property
311 S A1 S 3\i+1
provides only one initial point from each side. The simplest way of
obtaining the other initial point, preferably in the interval (A1, A5“), is
to bisect the interval by choosing c = ’\1_+é\1_-1-_1_ and evaluate €532 along
with the Sturm sequence at c by the algorithm DETEVL as described in
Section 3.2. If c < A,- then A,- and c are a suitable pair of initial points
for approximating A,-. Otherwise, the pair c and A,“ will be used.
While the above approach is the simplest, it may not be the most effi-
cient. Several improvements are made in our practical implementation.
First, since f’ / f is evaluated at every initial value A,- (j = 1, - - - ,m) ,
one step of Newton’s iteration can be performed without extra cost and
3 ON SYMMETRIC TRIDIAGONAL EIGENVALUE PROBLEMS 36
critical point critical point
1
1
1
1
1
1
1
1
1
1
b
---— —----¢
---- ----a
:2
X
v
critical point critical point
Figure 4: The two initial points should be between one of the nearby critical point of
f and the zero. This can be identified by the sign pattern of —f’/f.
the result can be considered as a candidate, besides c, for 512(1). Secondly
and more importantly, from our computing experience, the high order
convergence of the quasi-Laguerre iteration occurs only when there is
no critical point of f (i.e. zero of f’) between 11(0) and A,- (see Figure 4).
In other words, if 33(0) is to the left (resp. right) of A,-, then it is de—
sirable that — f’ (17(0)) / f (1(0)) is positive (resp. negative). If there is
one or more critical point in [A,-, Ai+1], then bisection or one step N ew-
ton’s iteration is used repeatedly until the above requirement at 22(0)
is satisfied. Based on these observations, our procedure for determin-
ing a pair of initial points among different choices to achieve the best
possible efficiency in practice is summarized below and the algorithm
INIPTS is given in Figure 5. For notational convenience, we assume the
eigenvalue A,- is in the interval [a, b] and there are no other eigenvalues
in this interval.
START.
I I
1. If —%§)2 < 0 and —%(£)l > 0, then neither a nor b should be
used as an initial point (there are critical points of f in both
(a,A,-) and (A,,b)). Set c = 23—9. Evaluate —%3, the Sturm
3 ON SYMMETRIC TRIDIAGONAL EIGENVALUE PROBLEMS 37
sequence at c is available in the process. Reset [a, b] to be [c, b]
or [a, c] in accordance with c < A,- or c > A,- respectively. Go
to START.
I
2. If —if-(%1 < 0 (there is a critical point between a and A;, so a
I
should not be used as an initial point ) and —%g)1 < 0, then
I
b can be one of the initial point. Set 0 = max {b — fl glib}
f(b)’ 2
and evaluate —If,—((*CE)2. If c E (/\,-, b), then b and c will be used
as a pair of initial points; go to END. Otherwise, reset [a,b]
to be [c, b]; go to START.
I I
3. If —If(%2 > 0 and _ngZ} > 0, then a can be an initial point but
b can not (there is a critical point of f between b and A.) Set
0 = min{a — 1%3’97442} and evaluate —J}%§)l. If c E (a,/\,-),
then a and c are a good pair of initial points; go to END.
Otherwise, reset [a, b] to be [a, c]; go to START.
’ ' b . . . .
4. If — f(:) > 0 and —%b_)2 < 0, then there 18 no critical pomt of
f in [a, b]. Thus either a or b can be used as one of the initial
points. Set
- _flel a_+_b - M £92
mi“ f(a)’ 2 i If ]f(a) S m]
max {b — %§)21 9-3-9} , otherwise.
and evaluate —%CE)2. Choose {a, c} or {0, b} to be the initial
points depending on c < A,- or c > A,- respectively; go to END.
END.
3 ON SYMMETRIC TRIDIAGONAL EIGENVALUE PROBLEMS
38
Algorithm IN IPTS
Input: subscript i, initial end points 3,331,“.
Output: starting iterates r(o),x(l) and m(x(0)),n(:z:(1)),— “flon— (Wm)
Begin INIPTS
(a, b): (;\,-,;\,-+1); Evaluate —Lé3], m(a), —fT’((—b%, m(b) by DETEVL;
(1111) If —7{J) < 0 and —-(U > 0 then
c: 42L; Evaluate _ff_’((2), m(c ) by DETEVL;
(a,b)={ (c,b), if c<)\,- (i.e. m(c) /\,' (i..e K(c)2i)
else if if} < 0 and ——(l < 0 then
c=max{b— 'b, 4119]» )Evaluate ——((—%, I€( c) by DETEVL;
If c> A; then (r(o),a:(l))- _ (b, c) ; Goto (t) ;
else (a, b): (c, b); Goto (it) ;
else if —%3)2 >0 and —fT(%l>0 then
c: min {a— %,—3fi}f;( Evaluate —+],( K:( )by DETEVL;
If (c< x\,) then (m(o),x(1))- — (a,c); Goto (it) ;
else (a,b) = (a,c); Goto (It) ;
C:{ min{a—-f7'£%l, 92%},
—f: , 3%}, otherwise
<flb§l
f(b
> Al:
End if ;
(13) End INIPTS
1W) “13(0) °
Figure 5: Algorithm INIPTS
3 ON SYMMETRIC TRIDIAGONAL EIGENVALUE PROBLEMS 39
3.5.3 An alternative approach for clusters of eigenvalues
The multiplicity estimation and the quasi-Laguerre iterations (31) and
(32) with m > 1 can successfully overcome the slow convergence when
a cluster of eigenvalues occur. An alternative approach without multi-
plicity estimation as well as dynamical adjustments of the multiplicities
can also work very efficiently in practice:
By Theorem 2.3, when the quasi-Laguerre iteration for simple zero
(m = 1) is used to approximate an M -fold zero :1:* (M _>_ 2), or a cluster
of M simple zeros, of a polynomial f of degree n, it converges linearly
with a ratio qu which is the only solution of
a: — :1: — :1: =
n — M M
in (0, 1). The following theorem gives the properties of qn, M for M _>_ 2
ananM+1.
0
Theorem 3.3 (i) For M fired, qu increases as n increases (n> M).
(ii) For a given n, qu Z qmg if 2 S M g n — 2.
(iii) When M = n — 1, qn,n_1 increases as n increases (n 2 3).
Proof. (2) For a fixed M, write
n — M — 1 3 2 M — 1
:1: —a: —:1:+
n—M M
912(33)
Then when n1 > n2,
9mm — g.. = (
SO,qn1’M > gn2,M.
1 1
TLQ—M nl—M
>r3>0 for xE(0,1).
(ii) For a given n, write
cc—a: —:c+ .
n—M M
9M0”)
3 ON SYMMETRIC TRIDIAGONAL EIGENVALUE PROBLEMS 40
When2SM_<_-3,
1 1 1 1
9M($)_92($) : (N—Z—R—M)x3+2—M
>1+ 1 _1_ 1
- 2 n—2 M n—M
n Tl
= m_M(n—M) 20 for x€(0,1).
50, gn,M 2 (111,2-
If g < M S n — 2, then 2 _<_ n — M < 721. It follows from above,
qn,n_M Z qmg. On the other hand,
gM(1‘) — gn_M(113) = (n 3M — fi) (1 — 333) > 0 for £1? 6 (0,1).
Then gn,M > qn,n—M- Hence: qn,M > qn,n-—M 2 qn,2-
(iii) If M = n — 1, write
n—M—l3 2 M—1 2
9,,(17) n—M :1: —:1: —a:+ M ——a: —.7:+1—n_1.
Then when n1 > n2,
1 1
gn1(a:)— gn2(:1:) = > 0 for :13 6 (0,1).
n2—1 n1—1
SO, gnlml—l > gngmg—l' D
For n 2 3 and M Z 2, by Theorem 3.3 (iii),
(111,114 2 (13,2 When M = n -1,
and by Theorem 3.3 (i), (ii),
qn,M 2 (171,2 2 (13,2 when M S n — 2-
In any case, q... E q3,2, the zero of 1:2 + x — 0.5 = 0 in (0, 1), is the
smallest linear ratio for multiple zeros. Let 33(1),a:(2),- - - ,x("), - -- be
3 ON SYMMETRIC TRIDIAGONAL EIGENVALUE PROBLEMS 41
the monotone sequence of the quasi-Laguerre iterates. When {513(k)}
converges super-linearly, then
|$ q..., linear convergence seems apparent. In this case, for
sufficiently large k,
(Ic+l)_ =1:
:1: a: ,.. ...
I Q; qu or ]$(k+1)—$ I z q].”L‘(k)—.’C I where q E (171M,
I13“) — 117*]
and
_ |:r(k+1)—:r(k)| _ |x(k)—a:*|—|x(k+1)—x*| ~ (l-q)I:B(k)—a:*| ~
9k — |x(k)_x(k—l)| “‘ ]$(k—1)_x*|_|x(k)_x*| "’ (1_q)lx(k—1)_x¢| N q.
In practice, qk’s can be very close to q after several iterations. If qj = q
forj Z k then
0+1)
Ix -$*|=q|:v(j)-~’v*|, for jzk
and
__1_($(k+1) _ aw).
*:,(k)
a: :L +1-q
So, at the k—th iterate :13“), if linear convergence is revealed, namely
qk > q..., then instead of performing the regular quasi-Laguerre iteration
130*“) : (Elk) + 6k
where (5;. is the correction m(k“) — 56““) from 1:“) calculated by the quasi-
Laguerre iteration with m = 1, one can accelerate the iteration by
setting
5k
m(w) z m(n +
1—%°
The accelerated iteration can be summarized as follows: Assume we
are evaluating the eigenvalue X" E [a, b] and initial points 23(0), 17(1) have
3 ON SYMMETRIC TRIDIAGONAL EIGENVALUE PROBLEMS 42
been obtained by the process described in Section 3.5.2 along with data
I 13(0) I (1)
_LLFJ and 475%}
(#)
For k = 1, 2, - - - do (while the stopping criteria is not satisfied),
1. Evaluate 6;, = 517““) — :13“) using the quasi-Laguerre formula
with m =1.
. If q (Le—i 5—3: < q,., then most likely the target eigenvalue x\* is
neither a multiple zero nor a member of any cluster. Go to #
and continue the loop.
Otherwise,
(a) Let c = mm + 11:15:; and check for a possible overshot by
evaluating the Sturm sequence. If c overshoots, then re—
set c = r(k) + w for l = 8,4,2,1 until overshooting
disappears.
(b) set 113““) = c and go to #.
End for
The algorithm ACCQ-LAG in Figure 6 summarizes the fundamental
features of the acceleration process described in this section. Table 1
gives the comparison between the original quasi-Laguerre iteration and
the accelerated quasi-Laguerre iteration. The Wilkinson matrix W9?) is
used here with the starting points mm) = 11.5 and x“) = L_(2:(0)) =
11.27 targeting A23 -_- 11.0+5x 10-15 of W93, while A22 = 11.0—5x10"15.
3.6 Numerical tests
Our algorithm is implemented and tested on SPARC stations with
IEEE floating point standard. The machine precision is e z 2.2 x 10—16.
3 ON SYMMETRIC TRIDIAGONAL EIGENVALUE PROBLEMS
43
Algorithm ACCQ-LAG
Input: subscript i, starting iterates 55(0), mm with Ii($(0)),
m(rm), —%§50)1)2 and —%(§Tl;))- evaluated by INIPTS.
output: the i-th eigenvalue A,- of T.
Define q... = q3,2: the positive solution of x2 + :1: — 0.5 = 0
(the smallest linear ratio for multiple zeros);
Begin ACCQ-LAG
For k: 1,2,-~-
U+, 1f 517(k) < Ai _
c = (iii are from quasi-Lag.formulae);
21-, if 1:0“) > A;
(Ska-1““: q= 3%.: t01=2-5€[ma> 2 and 17517:;me < tol) Goto (it) ;
If (q > q...) then
I .
c: m(k)+ {253; evaluate — f(cc) and m(c) by DETEVL;
If (c is on the same side of A; as r(k),a:(k‘l)) then
Goto (fit);
else, For I: 8,4,2,1,
l
c = xa) + 311.011.).
l—q ’
evaluate the Sturm sequence n(c) ;
If (c is on the same side of A,- as m(k),a:(k'l))
Goto (ti);
End for
End if
(it) 12”“) = c ;
End for
(11) End ACCQ-LAG
Figure 6: Algorithm ACCQ-LAG
3 ON SYMMETRIC TRIDIAGONAL EIGENVALUE PROBLEMS
Table 1:
QOQNQO‘AOJNI—
“(A)““QMNNNNNMNNNHHV-‘Hh—I—‘D—HHH
#0383"OCDWNC3U‘AOONI-‘CCDWQC5U‘ACJNHO
513““) (original)
1 1 .500000000000000
1 1 .270700712327294
11 .149428542073966
11.0 63735435125402
1 1.0 26235830973979
11 .0 10578521008206
11.00 4261938886133
11.00 1715645778917
11.000 690622605815
11.000 277993423158
1 1 .000 111899071095
11.0000 45041905516
11.0000 18130368451
11.00000 7297872838
11.00000 2937554150
11.00000 1182429996
11.000000 475954003
11.000000 191581922
11 .0000000 77115924
1 1 .0000000 31040850
11.0000000 12494622
11.00000000 5029360
11.00000000 2024429
11.000000000 814877
1 1 .000000000 328006
1 1 .000000000 132029
1 1.0000000000 53145
1 1 .0000000000 21393
1 1.00000000000 8612
11.00000000000 3467
1 1.00000000000 1396
l 1.000000000000 565
11.000000000000 233
l 1 .000000000000 107
I(k+l)_ (1‘)]
qk: |$(k)_$( (k— 1)]
0. 706618
0.4 37604
0.4 17533
0.40 3427
0.40 3112
0.4025 55
0.4025 56
0.40252 7
0.40252 5
0.40252 3
0.402522
0.402522
0.402522
0.402522
0.402522
0.402522
0.402522
0.402522
0.402522
0.402522
0.402522
0.402522
0.402522
0.402520
0.40252 1
0.40252 6
0.40252 1
0.402 477
0.40 1680
0. 399604
0. 381937
0. 305425
Acceleration of the quasi-Laguerre iteration for W55
the table, the digits before a space are correct.
3:0“) (accelerated)
1 1. 500000000000000
11. 270700712327294
11.0 13287361699255
11 .000 124299930121
11 .0000 61856842477
11.000000 198368395
11.0000000 99023529
1 1.0000000000 12077
11.00000000000 6040
1 1.00000000000 2304
1 1 .000000000000 156
1 1.0000000000000 64
44
For each number in
3 ON SYMMETRIC TRIDIAGONAL EIGENVALUE PROBLEMS 45
3.6. 1 Testing matrices
There are 12 types of matrices employed in testing our algorithm. In the
following description of these matrix types, 0;, i = 1, - - - ,n, denote the
diagonal entries and 6;, i = 1, - - - ,n — 1, are the sub-diagonal entries.
Matrices with known eigenvalues
1. Toeplitz matrices [b, a, b]. Exact eigenvalues:
{a + 2b cos 'gkfillsksn ([5], Example 7.4, p137).
2. alza—b,a;=a for i=2,-~,n—1,an=a+b. fljzb,
j=1,---,n—1. Exact eigenvalues:
{a + 2b cos Wllgg ([5], Example 7.6, p138).
{ a for odd i
(15::
, fl,- = 1. Exact eigenvalues :
b for even i
2 "+1 (plus {a} if n is odd)
1§kSn/2
([5], Example 7.8 and 7.9, p139).
4. a; = 0, fl; = \/z(—n——i_). Exact eigenvalues:
{—n + 2k — lbs/,9. ([5], Example 7.10, p140).
5. oz.- = -[(2i—1)(n - 1) — 2(i—1)2], fl,- = i(n — i - 2). Exact
eigenvalues: {—k(k — liilsksn ([5], Example 7.11, p141).
{a+b :t \/(a—b)2+16cos2 k—”
Wilkinson and random matrices ,
6. Wilkinson matrices W; 3,- = 1,
n/2—i+1 forevennandlgign/2
i—n/2 for evennand n/2
There are two values for u in the above formula. We will choose yk+1
(56)
to be the one which is closer to yk, i.e., ka+1 -— ykl is smaller. More
precisely, let
A2 = Rl(yk—1 — 91023 + 4(n — 1)] and Q =(yk_1— yk)R + 2q(yk—1)-
Since
lo :l: M" = |0|2 + |A|2 :l: 2Re(r'iA),
to make lyk+1 — ykl smaller or IO :1: A] larger, we choose
yr. _ 2ln-(yk_§12;yAk)7(yk—1)l, if Re(§A) Z 0;
yk+1 = (57)
3}]: _ 2[n_(yk—§12:yAk)q(yk—l)l, if R€(§A) < O
To start the iteration, two initial points are required. For given yo, we
always apply one step of Newton’s iteration at yo to obtain the second
initial point; namely,
91 = 340 — f(yo)
f '(310)
4 THE COMPLEX QUASI—LAGUERRE ITERATION 56
4.2 Convergence properties
Theorem 4.1 The quasi-Laguerre iteration {yk} defined in the last
section converges to 2 when yo is close to z. The ultimate convergence
rate is x/2 + 1.
Proof. Let
yk — Z
yk-i - Z,
Clearly {yk} converges to 2 if for some r < 1, 16k] < r, Vk. From (56),
VkZl.
6k:
yk+1—Z :1_
€k+1 =
31k - Z
2 [n - (yr—1 — yr)q(yk—1)l . (58)
(ilk-Z) {lyre—1-yk)R+24(yk—1)i\/Rl(yk-1—yk)QR+4(n—1)l}
Let 2(1), - - - , 2("‘1) be the zeros of f besides 2. Then,
f’(yk)_ _ 1 . _ "—1 1
q(yk)= m”)— yk _ Z+Q(yk) With 62(9) = 2:31 W
_ f’—(yk 1)_ 21
(Am—1) — f(yk-l) ilk—1 - Z + Q0114)
and
R = n (dz/1:); E(::—l)) - (1(yk)(1(yk—1)
_ n -1 _ Qty/H) _ Q(yk) _
— (yr—Z )(yk 1_Z) yk—Z yk_1_an(yk)Q(yk—1)
— 1
+nR1(:l/k— 1, Me) With Rl(yk— 1, 31k) En i=1 l_(yk l—zli))(yk—z(ii)°
Now,
(a) (ilk—1 - yk)€1(yk—1) = (yk-1_ ilk) (yr—1 _ z + Q(yk—1))
=(1—6k)+Q(3/k)(1-€k)(yk—1-Z)=1-€k+0(yk—1—Z)
(b) (yk—Z)(yk—1—Z)R = "—1—(311—1- Z)Q(yk—1)
4 THE COMPLEX QUASI—LAGUERRE ITERATION 57
—(yk - Z)Q QZU k—i Q(yk)Q (yrs—1)(yk—Z)(yk—1—Z)
+7701}. — z)(y1_1— 2)R1(yk_1,y1)= Ti -1+ 0(71 1— Z)
(C) (ilk - Z)(yk—1 - 310R = (1 — €k)(yk — Z)(3/k—1 — ZlR
= (1 — €1)(n — 1) + 001-1 — 2) (because of (b) above)
((1) (yr: — Z)2R = 6143/4 - Z)(yk—1 — Z)R
= 6k(n — 1) + O(yk_1 — 2) (because of (b) above)
(8) (yr: - Z)QR [(yk—I - 31021? + 4(n — 1)]
= [(y1. - Z)(yk—1 - 341:)ng + 4(" -1)(y1 - Z)QR
= (l—ek)2(n—1)2 + 4(n—1)€k(n—l) + O(yk_1—z)
= (n — 1)2(1+ 61)2 + O(yk—r - Z)
(because of (c) and (d) above).
If yo —) z and y1 —+ 2, then
n—l 1 n—l 1
= . —) E . < ,
Q 0 and
C2 > 0, independent of k, such that
€k+1 =1—
IOI1| _<_ Cllyk—l — 2| and |a2| S CQka—l — 2|-
Let
A = (n — 1)(1 — 6k) +2ek:l: (n — 1)(1 +61)
2(n — 1) + 26],. if ‘+’ is chosen;
(4 — 2n)ek if ‘—’ is chosen.
In (57), since
(yr. — 201931: A) = 444—02,
thus, making |yk+1 — ykl smaller is equivalent to making |A] larger.
When lekl < 1, |2(n — l) + 2ek| > [(4 — 2n)ek|. Hence, to have |A|
larger, we always choose ‘+’ for A, making
A=2(n—1)+2€k
and
|A| = |2(n — 1) + 2ek| Z 2(n — 1) — 2|ck| 2 2n — 4.
So,
A+ozl 612—01
:1. z . 59
€k+1 A—I-CYQ A-i-ag ( )
If |61| < 1, I62] < 1,~-,|ek| < 1 and lyo - Z] < "(32" then
|072| S C2|y1_1— 2] < Cglyo — 2| < 1.
Hence,
|A+072|Z |A|—|oz2| 22n—5>0.
4 THE COMPLEX QUASI-LAGUERRE ITERATION 59
From (59),
|€k+1| =
M<(§ifi|ly _,
|A+072| _ 272—5 [CA-1 A
In summary, we have shown the following:
Lemma 4.2 If yo is close to 2, then there exists a constant C > 0 such
that V14: 2 1, if |€1| < 1,---,|€k| < 1, then
|€k+1| _<_ Clyk—l — 3| (60)
Now, given a real number r < 1, when lgo — 2| < %, by (60) above,
|€2| S Clyo — 2| < r or lyg — 2| < rlyl — 2|.
Notice that from the way yl is chosen; namely, by applying one step of
Newton’s iteration on given yo, we always have
191 — 2| < |y0 — 2|, or equivalently |€1| < 1
when yo is close to 2, say for instance, yo is in the convergence neigh-
borhood of Newton’s iteration at 2. By (60) again,
I153] S Clyl — 2| < Clyo — 2| < r, and so, |y3 — 2| < r|y2 - 2|.
It follows recursively that
|ek+1| S Clyk_1— 2| < r and |yk+1— 2| < rlyk — 2|, We 21
since C is independent of 16. Therefore {yk} converges to 2. Further-
more, from (60),
|ek+1| S C|y1_1— 2| —> 0, as k —-> 00.
Thus the convergence of {ykfiozl to 2 is super-linear.
The rest of the proof follows exactly the same argument as the proof
of part (ii) of Theorem 2.3 with m = M = 1. D
REFERENCES 60
References
[1] E. ANDERSON, Z. 3.41, C. BISCHOF, J. DEMMEL, J. DONGARRA, J. Du
CRoz, A. GREENBAUM, S. HAMMARLING, A. MCKENNEY, S. OSTROU-
CHOV, and D. SORENSON, LAPACK User’s Guide, SIAM, Philadelphia, 1992.
[2] J. J. M. CUPPEN, A divide and conquer method for the symmetric tridiagonal
eigenproblem, Numer. Math., 36 (1981), pp. 177—195.
[3] J. J. DONGARRA AND D. C. SORENSEN, A fully parallel algorithm for the
symmetric eigenvalue problem, SIAM J. Sci. Stat. Comput., 8 (1987), pp. 139—
154.
[4] G. H. GOLUB AND C. F. VAN LOAN, Matrix Computations, 2nd Ed., The
Johns Hopkins University Press, Baltimore, MD, 1989.
[5] R. T. GREGORY AND D. L. KARNEY, A Collection of Matrices for Testing
Computational Algorithms, Robert E. Krieger Publishing Company, Hunting-
ton, New York, 1978.
[6] M. GU AND S. C. EISENSTAT, A divide-and-conquer algorithm for the sym-
metric tridiagonal eigenproblem, SIAM J. Matrix Anal. Appl., Vol. 16, No. 1
(1995), pp. 172-191.
[7] W. KAHAN, Notes On Laguerre’s Iteration, preprint, University of California,
Berkeley (1992).
[8] T. Y. Li AND Z. ZENG, Laguerre’s iteration in solving the symmetric tridiago-
nal eigenproblem — revisited, SIAM J. Sci. Comput., Vol. 15, No. 5 (1994), pp.
1145-1173.
[9] B. N. PARLETT, The Symmetric Eigenvalue Problem, Prentice-Hall, Englewood
Cliffs, NJ, 1980.
[10] B. N. PARLETT, The use ofa refined error bound when updating eigenvalues of
tridiagonals, Lin. Alg. & Appls., Vol. 68 (1985), pp. 179-219.
[11] J. RUTTER,A serial implementation of Cuppen’s divide and conquer algorithm
for the symmetric eigenvalue problem, LAPACK lawn 69 (1994).
REFERENCES 61
[12] D. C. SORENSEN AND P. T. P. TANG, On the orthogonality of eigenvectors
computed by divide-and-conquer techniques, SIAM. J. Numer. Anal., 28 (1991),
pp. 1752 - 1775.
[13] C. TREFFTZ, C. C. HUANG, P. MCKINLEY, T. Y. L1, AND Z. ZENG, A
scalable eigenvalue solver for symmetric tridiagonal matrices, to appear, Parallel
Comput.
[14] J. H. WILKINSON, The Algebraic Eigenvalue Problem, Oxford University Press,
Oxford, 1965.