OPTIMAL STARTING APPROXIMATIONS
FOR ITERATIVE SCHEMES FROM
CLASSES 0F RATIONAL FUNCTIONS

Thesis for the Degree of Ph. D.
MICHIGAN STATE UNIVERSITY
JOHN BYRON GIBSON
1971

   
     

This is to certify that the

thesis entitled

Optimal Starting Approximations for
Iterative Schemes from Classes
of Rational Functions

presented by

John Byron Gibson

has been accepted towards fulﬁllment
of the requirements for

Ph . D . degree in Mathematics

 

 

 

Date 8-4- 71

 

0-7639

 

LIBRA R Y
Michigan 3m
University

  

 
 
    
 

 
 

‘5’
BINDING BY

HOAG & SONS‘ ;
MINI BINDER? INC. .

LIBRARY amog‘ni
SHIIGNIIT. IIOIIGI!

  
 
 

ABSTRACT
OPTIMAL STARTING APPROXIMATIONS

FOR ITERATIVE SCHEMES
FROM CLASSES OF RATIONAL FUNCTIONS

BY

John Byron Gibson

Given a compact subset X of the closed interval [a,b], let
C(X) denote the space of all continuous real-valued functions defined
on X, normed by Hf“ = max {‘w(x)f(x)\} where w 6 C(X) and w > O
on X. In the case w EXEX the norm is the usual Chebyshev norm and
is denoted by H'Nm° Let K be a convex subset of C(X) and Q
a continuous mapping of K into C(X). We shall then be interested
in the problem of approximating g E 6(K) by elements of ¢(M)
where M is a fixed subset of K. If we impose certain restrictions
on Q, K and M, a Chebyshev type theory can be developed for this
nonlinear approximation problem which is similar to the standard

Chebyshev theory. The theory obtained has application to many

iterative processes that can be used to approximate functions such as
1

exp(x) and x“, n = 2,3,... and consequently we make the following
definition:

Definition: An element p G M is called an optimal starting approx-

 

imation (or best starting approximation) for g, with respect to Q

and M, if Hg - §(p)“ S Hg - Q(q)“ for all q 6 M.

John Byron Gibson

In this paper we investigate the above problem for M a sub-
set of K consisting of certain classes of rational functions. In
Chapter I the aforementioned Chebyshev theory is developed. Classically,
one would apply the characterizing alternation theorem we obtain to
develop a Remez exchange algorithm for the computation of optimal
starting approximations. However, if the operator 6 satisfies
certain properties the optimal starting approximation for a function
f can be calculated as a constant multiple of the best relative
approximation to f in the case of relative error or as a translate
of the best uniform approximation to f in the case of uniform error.

In Chapter II the theory is applied to the Newton operator
and optimal starting approximations are calculated for the functions
xa, a 6 (0,1), exp(x) and Ln x in the manner described above. A
number of additional iterative schemes are presented in Chapter III
and analyzed for optimal starting approximations for the functions

1

x“, n = 2,3,... and exp(x).

Recently D. Moursund has introduced the concept of a gen-
eralized weight function which is a more general means of measuring
error than that described earlier. Chapter IV is concerned with a
development of the Chebyshev theory, analogous to that in Chapter I,
for this type of weight function.

Finally, in Chapter V an improved Newton iteration scheme
for approximating exp(x) is presented and is shown to be Optimal

in the sense of this paper.

OPTIMAL STARTING APPROXIMATIONS
FOR ITERATIVE SCHEMES
FROM CLASSES OF RATIONAL FUNCTIONS

BY

John Byron Gibson

A THESIS
Submitted to
Michigan State University

in partial fulfillment of the requirements
for the degree of

DOCTOR OF PHILOSOPHY
Department of Mathematics

1971

for his

paper.

imation

helpful

ACKNOWLEDGEMENTS

I am deeply indebted to my major professor Gerald D. Taylor
many suggestions and guidance in the preparation of this
His enthusiasm for and insights into the area of approx-
theory have inSpired me greatly.

I would also like to thank Dr. Edwin Kaufman for his many

comment S 0

ii

TABLE OF CONTENTS

Page
INTRODUCTION OOOOOOOOOOOCOCOOOOOOCOOOOOOOOOOOOOOOO 1
Chapter

I GENERAL THEORY FOR OPTIMAL STARTING APPROXIMATIONS 4
Section 1: Definitions and Basic Results ........ 4

Section 2: Existence, Uniqueness, and
Characterization Theorems ............ 7

Section 3: Computation of an Optimal
Starting Approximation ............... 34

II APPLICATION TO THE NEWTON OPERATOR AND
BSOCIATED RESULTS O0.000000000IOOOOOOOOOOOOCOOOO. 42
Section 1: Introduction and General Theory ...... 42

Section 2: Optimal Starting Approximations for
Roots, Exp(x) and Ln x ............. 47
Section 3: An Associated Operator ............... 51
III FURTHER EXAMPLES OF OPERATORS DEFINING ITERATIVE

SQ-IEWS 0....0..0....O00......COOOOOOOOOOOOCOOOOOO 55
Section 1: Definition and Basic Results ......... 55
Section 2: Iteration Functions of Order 2 ....... 57
Section 3: Iteration Functions of Order 3 ....... 58
Section 4: Iteration Functions of Order 4 ...... 63

Section 5: A Sequence of Schemes for
Approximating I/k .................... 70

IV OPTIMAL STARTING APPROXIMATIONS FOR A
GENERALIZEDWEIGHT FIINﬂION OOOOOOOOOOOOOOOOOOOOOO 74
Section 1: Introduction and Examples ............ 74

Section 2: Existence, Uniqueness, and
Characterization Theorems ............ 75

iii

Chapter Page

V AN IMPROVED NEWTON ITERATION SCHEME FOR
APPROXIMATING EXP(X) WHICH IS OPTIMAL 83
Section 1: IntrOdUCtion OOOOOOOOOOO...0.0.0.0.... 83
Section 2: The Improved Newton Iteration Scheme . 84
Section 3: Execution of the Scheme .............. 108
BIBLIOGRAHIY 0.0.0.0...0.000000000000000. 00000 O... 110

iv

INTRODUCTION

In [15] D.G. Moursund examined the problem of approximating
/k over an interval [a,b] (a > O) by applying the Newton-Raphson
iteration scheme to classes of polynomial and rational approximants.
For polynomial approximation the problem may be formulated in the
following way: set K = {f E C[a,b]: f > 0 on [a,b]}, M = Pn n K
where Pn consists of all polynomials of degree less than or equal
to n, and define O: K a C[a,b] via §(h)(x) = %-(h(x) +-E%;39.
Here 9 denotes a single Newton iteration for calculating /x
starting with h(x). Then we are interested in approximating /x
with the class {%-(p(x) + 3%;3)}, for p E M, using the relative

error (the notion of relative error will be defined later). Moursund

*
was able to show that there exists a unique solution p to

 

- *
inf“!x ‘/3122(3)" ; p gi} and moreover that p is also the
an

m
uniQue solution to infUI/X - 25mg) H t P E M} s m = 2.3...”

where §k(h) = §(§k-1(h)). More recently, P.H. Sterbenz and C.T. Fike

 

[24] and R.F. King and D.L. Phillips [7] both showed (independently)
that p* is a multiple of E, the best relative approximation to
/k on [a,b].

The above problem may be posed in a more general setting.
Given a compact subset X of the closed interval [a,b], let C(X)
denote the space of all continuous real-valued functions defined on
x, normed by Hf“ = max {Iw(x)£(x)|} where w e C(X) and w > o

XEX
1

on X. In the case w E l the norm is the usual Chebyshev norm and
is denoted by H-“m. Let K be a convex subset of C(X) and Q

a continuous mapping of K into C(X). We shall then be interested
in the problem of approximating g E 9(K) by elements of @(M) where
M is a fixed subset of K.

G. Meinardus and G.D. Taylor [11] have considered the above
more general problem for M a Subset of K consisting of members of
a Haar subspace of C[a,b]. (The notion of a Haar subspace will be
defined in the paper.) By imposing additional restrictions on a,

K and M they were able to deve10p a theory for the above nonlinear
approximation problem which is analogous to the classical Chebyshev
theory. Moreover they were able to preserve the behavior of the prob-
lem considered by Moursund and to obtain results similar to those
obtained by Sterbenz-Fike and King-Phillips in the more general setting.
Because of the application of this theory to iterative processes we
make the following definition:

Definition: p E M is an optimal starting approximation (or best
starting approximation) for g, with reSpect to Q and M, if

Hg - §(P)H S Hg - §(q)u for all q e M.

In this paper we shall investigate the above problem for M
a subset of K consisting of certain classes of rational functions.
In Chapter I we develop a Chebyshev type theory for this highly non-
linear approximation problem and in Chapter II we apply the theory
to the Newton operator. A number of iterative schemes are given in
Chapter III and analyzed for optimal starting approximations.

The concept of a generalized weight function was introduced

by D. Moursund in [13] and [14]. In Chapter IV we show that the

Chebyshev type theory can also be developed for this more general
means of measuring error.

Finally, in Chapter V we present an improved Newton iteration
scheme for approximating ex which is shown to be optimal in the sense

of this paper.

CHAPTER I

GENERAL THEORY FOR OPTIMAL STARTING APPROXIMATIONS

Section 1: Definitions and Basic Results

In our development we wish to make use of the results and
techniques of the classical Chebyshev rational approximation. Con-
sequently we introduce the following definitions and results from
the paper of Meinardus and Taylor [11]. Throughout this section we
shall assume that X, K, M and Q are as described in the intro-
duction.

Definition 1.1: The operator Q is called pointwise strictly mono-
tone at f e K if for each h,k e K we have |e(h)(x0) - {>(f)(x0)| <
‘§(k)(x0) - e(£)(xo)| for each x0 e x where either k(x0) < h(xo) 5

f(x or f(xo) S h(xo) < k(x0).

0)
Lemma 1.2: Let Q: KI~ C(X) be pointwise strictly monotone at

f E K. If k E K and at x E X, k(x0) # f(x then §(k)(x0) #

o o)
§(f)(xo).

Proof: Let h E f in Definition 1.1.I

Definition 1.3: The operator Q is said to be pointwise fixed at

 

f E K if h E K with h(x = f(xo) for x0 6 X implies

O)
§(h)(x0) = wxxo).

In general the notions of pointwise strictly monotone and

pointwise fixed are independent. Indeed, define K C C[0,1] by

K = {f(x) = ax2 : O 5 a s 1}. If Q1: K a C[O,l] is defined by
¢1(ax2) = x2 then Q1 is pointwise fixed at x2 yet not pointwise
strictly monotone at x2. Now, if p2: K a C[O,l] is defined by
62(ax2) = x2 + (l-a) then 62 is continuous and pointwise strictly
monotone at f(x) = x2. Indeed, if ax: < bx: 5 x3 then xO ¥ 0
and a < b s 1 so ‘§2(ax2)(x0) - x3‘ = 1 - a > 1 - b

‘§2(bx2)(xo) - xg| since Q2(x2) = x2. Each 8x2 6 K has the pro-
perty that ax2 = x2 at x = 0; but §2(ax2)(0) e o = a2(x2)(0)

2

if a # 1. Hence Q2 being pointwise strictly monotone at x

. . . . 2
does not imply 62 18 p01ntW1se fixed at x .
Next we shall show that the composition of two continuous

operators possessing the above properties is again such an operator

provided of course that domains and ranges mesh correctly.

Lemma 1.4: Let a: K a C(X) and Y: L a C(X) be continuous operators
with §(K) C L, Q pointwise strictly monotone and pointwise fixed

at f E K and Y pointwise strictly monotone and pointwise fixed

at Q(f) E L. Then Y6: K A C(X) is a continuous pointwise strictly
monotone operator at f which is also pointwise fixed at f.

.ggggf: It is clear that we: K a C(X) is continuous and pointwise
fixed at f E K. To establish the pointwise strict montonicity of

VS at f let us first show that if h,k E K with k(x0) < h(xo) s
f(xo) or f(xo) s h(xo) < k(x0) then either ¢(k)(x0) < Q(h)(x0) s
§(f)(xo) or §(f)(x0) s §(h)(xo) < §(k)(x0). Since Q is point-

wise strictly monotone at f we have that \§(k)(x0) - §(f)(x0)‘ >
I¢<h><xo> - ¢(f)(x0)l- Suppose that a<k><x0> < e<f>(x0) < ammo) or

§(h)(xo) < Q(f)(x0) < §(k)(x0). Then for O S a s l, the functions

La(x) = ah(x) +~(1-a)k(x) are members of K (since K is convex)
and furthermore Q(La)(x0) is a continuous function of a for each
fixed x0 6 X. Thus, by the intermediate value theorem, there exists
a0 6 (0,1) Such that Q(La0)(xo) = Q(f)(x0). Therefore, in view of

0) 0). This is a contradiction. Consequently

we have either Q(k)(x0) < 0(h)(xo) S §(f)(xo) or Q(f)(xo) 5

Lemma 1.2, Ld(x = f(x
Q(h)(x0) < §(k)(xo). An application of the pointwise strict mono-
tonicity of Y at Q(f) yields the desired result. I

In order to be able to iterate with the operator Q we must
necessarily demand that Q(K) C K. With this restriction we are able
to relate the notions of pointwise strictly monotone and pointwise

fixed according to the following lemma:

Lemma 1.5: If Q: R ~ K is a continuous operator on the convex set

K which is pointwise strictly monotone at f E K then Q is point-
wise fixed at f.

lggggf: Suppose h E K and h(xo) = f(xo) for some x0 6 X. Clearly,

if k(x = f(x for all k E K then Q(h)(x0) = Q(f)(x0) and so

0) 0)

Q is pointwise fixed at f. If this is not the case then there

0).
for a 6 (0,1). Now Hko - f“ a O as

exists k e K Such that k(xO) # f(x Set k0 = of + (1-0)k and

note that kd(xo) # f(xo)
o ~ 1 and so He<kd) - Q(f)H a 0 as a a 1. Thus Q(kd)(x0) s
Q(f)(xo) since norm convergence is stronger than pointwise convergence.
Now Q being pointwise strictly monotone at f implies

|s<kd)(x0> - s(f><xo)| > \Q(h)(x0) - s<f>(xo>l for all a e (0.1)
and so IQ(h)(x0) - Q(f)(x0)| = 0. Hence Q(h)(x0) = s(£)(x0) and

so Q is pointwise fixed at f.l

For the sake of any iteration processes we may be interested
in, we state the following corollary to Lemma 1.5:
Corollary 1.6: If Q: K a K is continuous, Q(f) = f for some f E K
and Q is pointwise strictly monotone at f E K then Qm: K a K
defined inductively by Qm(h) = Q(Qm-1(h)), m = 2,3,..., satisfies
§m(f) = f and am is pointwise strictly monotone at f.

Finally we shall state a general existence theorem for best

approximation in our setting. The theorem is very general and for

specific operators is sometimes difficult to check.

Theorem 1.7: Let K be a convex subset of C(X), M C K and

Q: K d C(X) be continuous. Then correSponding to g E Q(K) there
exists r* E M minimizing Hg - Q(r)H, r G M, provided there exists
a compact subset M1 of M and a positive constant n such that

s e M ~ M1 implies inf {Hg - Mr)“ : r e M} + n s Hg - Q(s)H.

Proof: A continuous function on a compact set assumes its minimum.l

Section 2: Existence, Uniqueness, and Characterization Theorens

In this section we shall specialize M and K and develop
an alternation theory for characterizing best approximations similar
to the standard Chebyshev theory. From this development we are able,
when we have existence, to conclude uniqueness using standard Chebyshev
theory arguments. Before proceeding to the first characterization
theorem we must establish some additional notation.

Let X be a compact Subset of the closed interval [a,b]
containing at least m + n +»2 points where m and n are fixed
for the discussion at hand. Let Pn denote the set of all polynomials

p with ap s n (5p denotes the degree of p) and set

R;[a,b] = {E 2 P 6 P“: q 6 Pm: (p:Q) = 13 q > 0 on [3,131}

where (p,q) 8 1 denotes the fact that the polynomials p and q
are relatively prime.
The following well-known lemma from Rice [20] will be used

extensively:

Lemma 1.8: Given r = 5-6 R:[a,b], T > 0 and any set

{Xiz i = 1,...,r, r < 1 + max {m + 5p, n + aq}, xi < xi+1}

. . . . . n
of p01nts in X, there is a rational function r E Rm[a,b] such
6

that
(i) Hre - rH m < w ,
L [a,b]
and (ii) Sgn (r(x) - r (x)) = (-l)1+1 x E (x x ) i = O 1 .. r
e , 1’ 1+1 9 3 3 '9
where x0 = a if t} # a and xr+1 =‘b if xr # b.
Proof: Set¢(x) = H (xi-x). Now p(x) and q(x) are relatively prime
i=1

and so there are polynomials p1(x) 6 Pn and q1(x) 6 PE such that
y(x) = p(x)q1(x) - p1(x)q(x). The proof of this fact depends on the
Euclidean Algorithm and can be found in Rivlin [21]. Let

p(X) - epl(X)
r€(x) _ Q(X) ‘—_

 

 

" €q1(x)
where e > 0. Then
_ = '6OIX)
f(x) re(x) q(x)<q(x> - eq1(X)) ° (1'1)

Since q(x) > 0 on [a,b], 6 may be chosen sufficiently small,
say 0 < e s 60’ so that the denominator of (1.1) is positive. Now,

given T > 0, select 6 such that O < e s 30 and ‘6‘ < T.

 

q(q-eql) m
I II

Then Hr - r H < T and, since (ii) is clear, the result is established.‘
6

Theorem 1.9: Let Q: K a C(X) be a continuous Operator where K is
a convex subset of C(X). Let M = K O R:[a,b] be a nonempty rel-
atively Open subset of R;[a,b]. Finally, assume that Q is point-
wise strictly monotone and pointwise fixed at f E K ~ M. Then

r E M is the best starting approximation for Q(f) if and only if
there exist points {xi}§=1 c X, N = 2 + max {m + 5p, n + aq}, for

which

(i) x1.< x2 <...< xN,

(ii) IW(xi)(Q(f)(Xi) - Q(r)(xi))| = HQ(f) - Q(r)H. i = 1.....N

and
(iii) sgn(f(xi) - r(xi)) = (-l)i+lsgn(f(x1) - r(x1)), i = 1,...,N.

Proof: (Sufficiency). Since f 4 M we know that there exists a

point e x for which f(xo) r r(x0). Thus “Q(f) - Q(r)H # 0.

x0
Suppose that |w(xi)(Q(f)(xi) - Q(r)(xi))‘ = HQ(f) - Q(r)H and
sgn(f(xi) - r(xi)) = (-1)i+lsgn(f(x1) - r(x1)), i = 1,...,N. Then
Q(f)(xi) # Q(r)(xi). Let r1 6 M be such that HQ(f) - Q(r1)H s
HQ(f) - Q(r)H. At xi, i = 1,...,N, ‘w(xi)(Q(f)(xi) - Q(r1)(xi))‘ s
‘w(xi)(Q(f)(xi) - Q(r)(xi))‘. Now at each xi either r(xi) > f(xi)
or r(xi)<f(xi) since Q is pointwise fixed at f. If

r(xi) > f(xi) then r(xi) 2 r1(xi) for otherwise r1(xi) > r(xi) >
f(xi), which implies \w(xi)(Q(f)(xi) - Q(r1)(xi))| >

\w(xi)(Q(f)(xi) - Q(r)(xi))‘ which is a contradiction. Similarly
r(xi) < f(xi) implies r(xi) s r1(xi). Now r(xi) > f(xi) implies
r(xi) 2 r1(xi) and r(xi+1) < f(xi+1) implies r(xi+1) s r1(xi+1).
Therefore (r-r1)(xi) 2 0 and (r-r1)(xi+1) s 0 (or conversely)

10

for i = 1,...,N-l. Set 8 = rl-r = (f-r) - (f-rl). If s(xi) ¢ 0
then sgn s(xi) = sgn(f(xi) - r(xi)) (since Sgn(f(xi) - r(xi)) =
sgn(r1(xi) - r(xi)), as noted above). If s(xi) # O and s(xi+1) =
s(xi+2) =...= s(xi+j_1) = 0, while s(xi+j) # 0, then since
x1,...,xN is an alternating set for f-r, (-l)k[f(xk) - r(xk)] is
of one sign for k = 1,2,...,N and so sgn s(xi) = (-l)jsgn S<xi+j)°
Therefore, if j is Odd, s(x) has an odd number of zeros (counting

multiple zeros according to multiplicity) in [xi’xi+j]’ while if j

is even s(x) has an even number of zeros in [xi,x, Since, by

i+j]°

definition, 5 has at least j-l zeros in [xi’xi+j]’ we can there-

fore conclude that s has at least j zeros in [xi,x . Thus,

he]
s has at least N-l zeros in [a,b]. But the zeros of s are the

zeros of its numerator plq - qlp, which is a polynomial of degree

at most max{ap1 + aq, Bql + 5p] SLN-2. Hence 3 E 0 and so r 5 r1.
(Necessity). Suppose that there exists {xi}§;1 :LX, N' <‘N, N'
maximal on which ‘w(xi)(Q(f)(xi) - Q(r)(xi))‘ = Hut) - Q(r)H and
Sgn(f(xi) - r(xi)) = (-l)i+lsgn(f(x1) - r(x1)). First let us diSpose
of the case where the error curve is constant and f-r has constant
sign on X. (A technical difficulty which results in the general

case requires us to consider this separately.) Without loss of

generality we shall assume w(x)(Q(f)(x) - Q(r)(x)) a HQ(f) - Q(r)H

and f(x) - r(x) > O for all x E X. Then there exists 31 > O
p(x) + 61 ‘
such that f(x) > r1(x) > r(x) where r1(x) = -—E?;7—-' Also, Since

M is relatively Open in R;[a,b], there exists 32 > 0 with
p(X) + 32
0 < 32 S 31 Such that r2(x) - W E M and f(x) > r2(x) > r(x).

This implies HQ(f) - Q(r2)H < HQ(f) - Q(r)H, which is a contradiction.

11

Now let Il,Iz,...,IN, be a collection Of relatively Open

intervals in [a,b] Such that xi 6 Ii’ T; n I. = ¢ for 1 ¥ j,

3
all extreme pointsN = {x E X : IW(X)(§(f)(X) ' Q(r)(x))I =

HQ(f) - Q(r)H} : iU I1 and for each extreme point in Ii the
=1

function f-r has constant Sign. Let
NI
=x n ( n I )
i=1
where Ii denotes the complement of Ii with respect to [a,b].
Y is a compact Subset of X and ‘w(x)(Q(f)(x) - Q(r)(x))\ <
HQ(f) - Q(r)H for all x E Y. By continuity there exists p > 0

for which max ‘w(x)(Q(f)(x) - Q(r)(x))‘ s HQ(f) - Q(r)H - p. Next,

 

XEY
let
wi = {x e x n f; : |w(x)(Q(f)(x) - Q(r)(x))‘ 2 HQ(f) ; §(r)ﬂ
and sgn(f(x) - r(x))= sgn(f(xi) - r(xi))}
._ N.
where Ii denotes the closure of Ii in [a,b]. W = U W1 is a
i=1

compact subset of X and so by continuity there exists an n > 0

such that ‘f(x) - r(x)‘ 2 n on w. Set

 

= {x e x n‘fi : \w(x)(Q(f)(x) - Q(r)(x))l 2 ﬂ§(f) ; Q(f)”

and sgn(f(x) - r(x)) s sgn(f(xi) - r(xi))l
NI
and let 2 = U 2. Observe that \w(x)(Q(f)(x) - Q(r)(x))‘ <
i=11

HQ(f) - Q(r)H for all x E Z by the construction of the intervals

{TH}. Finally, set

12

 

‘%={x€xnl1:WOHMDGO-QOHMHsJDG)-M0M

2
N.
and let U = U U1. Then, by continuity, there exists 6 >»0,
i=1

6 s p, such that
max Iw<x><¢<f><x) - Q(r)(X))| s HQ(f) - Q(r)H - a .
x€ZUU
Given T > 0 we can, using Lemma 1.8, select re 6 R;[a,b] such
that
(i) Hre - rH < T ,

Lw[a,b] (1 2)

and (ii) sgn(re(x) - r(x)) = Sgn(f(xi) - r(xi)) for all x E Ii’

i = 1,...,N'

By the continuity of Q we can select > 0 such that for

61
0 < e s 31, re satisfies (1.2) and

max IW(X)(Q(f)(X) - s(r >(x))I s us<f) - Q(r)H - g, .
XEYUZUU’ 6

Next, by continuity of f and r, we can select 32, O < 32 s 31,

Such that for 0 < e s 32, re lies strictly between f(x) and r(x)

N' .

on ‘W = U‘wi. Hence by the strict monotonicity of Q at f we
i-l

have that

max |w(x)(Q(f)(x) - Q(r )(x))| < HQ(f) - Q(r)H.
xEW 6

Thus for s with 0 < e s 32, re is such that HQ(f) - Q(re)H <
HQ(f) - Q(r)H. Finally, since M is relatively open in R;[a,b],

we can select a with 0 < c s s so that r 6 M and
3 3 2 e3
HQ(f) - Q(re )H < HQ(f) — Q(r)H. Thus re is a better starting
3 3
approximation than r which is a contradiction and this concludes

13

the proof of the theorenn I

Corollary 1.10: If Q(f) has a best starting approximation under

 

the setting of the above theorem than it is unique.
Proof: Uniqueness of best approximation is established in a manner
analogous to that in which sufficiency was verified in Theorem 1.9.

The technique of proof may be found in Rivlin [21, p. 123].I

Next we turn our attention to the questiOn of existence of
best approximation. J.R. Rice [20] points out that the standard
argument used to establish existence for classical rational approx-
imation may fail if X is not an interval. Since we wish to employ
the classical techniques we must also require that X be an interval.

Before stating the existence theorem.we prove the following lemmas:

Lemma 1.11: Let K be a convex Subset of C[a,b] and Q: K ~ C[a,b]
be a continuous Operator which is pointwise strictly monotone and
pointwise fixed at f E K ~ M where M = K n R:[a,b]. Further assume
that for each x 6 [a,b] there exists a sequence {r1} 5 M with
{ri(x)} unbounded from above and below and such that ‘Q(ri)(x)l a m
as i a m. Then given P > 0 there exists a number N(P) (inde-
pendent of x) Such that ‘w(x)(Q(f)(x) - Q(r)(x))‘ s p implies
‘r(x)‘ S N(P) where r E M and w is any continuous positive

weight function on [a,b].

Remark 1: It may well happen that all r E M are uniformly bounded
from above or below on [a,b]. In this case we will already have
the desired upper or lower uniform bound on r(x) (for all r 6 M).
The above lemma is easily modified to cope with the aforementioned

situations.

14

2593;: Let t a [a,b] be arbitrary but fixed. Then there exists
rt,st E M with rt(t) > f(t) and st(t) < f(t) such that ‘w(t)(Q(f)(t) -
Q(rt)(t))| > P +-l and \w(t)(Q(f)(t) - Q(st)(t))‘ > P + 1. By
continuity there exists 6(t) > 0 such that \x-t‘ < 6(t) and

x 6 [sub] imply Iw<x>m<f><x> - p(rt><x>>\ > P and

‘w(x)(Q(f)(x) - Q(st)(x))‘ > P. The neighborhoods (t-6(t), t + 6(t))
form an open cover of [a,b] and so by the compactness of [a,b]
there exists a finite subcover {(ti-6(ti), ti + 6(ti))}:=1. Set

N(P) = max {Hr H , HS H} to obtain the desired result.‘
lsisk ti ti

The next lemma is analogous to a result found in Rice [20,

 

 

 

p. 75]. n

30 + alx +...+-a x n
Lemma 1.12: Let r(x) = n 6 R [a,b], 0 S a < b

b +-b x +x..+-b xm m

0 l m m
where b +-b x +...+ b xm # 0 for all x 6 [a,b] and 2 b? = l.

0 l m . 1
i=0
If max Ir(x)\ s S then there exists N(S) such that
x6[a,b]
max lail S N(S), max Ibi‘ s N(S).
i i
n i n i
Iza.x| IEMI
i=0 1 i=0 1 h _ bm 1
Proof. ‘r(x)\ 2 m i 2 Om+l)B w ere B - max{ , ].
max Izb.x|
. i
xE[a,b] i=0
n .
Hence (m+l)BS 2 (m+l)B max Ir(x)l 2 max I Z aixll .
x6[a,b] xe[a,b) i=0
n .
Thus the polynomial Z a,xl is uniformly bounded on [a,b] and so,
i=0

by Markoff's Inequality, [1, p. 91], also the parameters ai. That
is, there exists N1(S) Such that max ‘ai‘ s N1(S). By the

i
normalization of the denominator, max Ibil s 1. Setting N(S) =

max{l,N1(S)} we have the desired result.l

15

Theorem 1.13: Let Q: C[a,b] a C[a,b] be a continuous Operator
which is pointwise strictly monotone at f E C[a,b] ~ M and point-
wise fixed at each R E C[a,b] where M = R:[a,b] (m,n fixed).
Assume that for each x E [a,b], [Q(ri)(x)l a m whenever {r1} 9 M
and ri(x) 4 12“: Then there exists a best starting approximation
for Q(f) from M.

12539;: Let {r1} C M be such that lim HQ(f) - Q(ri)H =

1-00)

inf HQ(f) - Q(r)H. Let us first normalize the functions ri(x) =
rEM

pics) m
(110‘) so that gobij

 

m .
= l where q,(x) = 2 b.,xJ. Clearly there
i j=0 i]

j
exists P > 0 such that HQ(f) - Q(ri)H s P for all i and so by
Lemma 1.11 there exists N(P) Such that HriH s N(P) for all 1.
According to Lemma 1.12 the coefficients Of the elements of {r1}
are uniformly bounded. Therefore there exists a subsequence of

{r1(x)} (which we shall also denote by {ri(x)}) such that

lim ri(X) = ;(x) for all x E [a,b] except for zeros of the
i—m

denominator of ;(x). If (xi: i = 1,...,k} denotes the zeros of
the denominator Of F. then it is clear that k s m. Using the

uniform boundedness of the sequence {r1}, it is straightforward

to Show that each x1 is a removable singularity Of r. Upon

*
cancellation, we Obtain a rational function r E R:[a,b] which we
claim is the desired solution. First, using the pointwise fixed

property of Q it is easy to show that if {ki} C C[a,b], k E C[a,b],

xo E [a,b] and lim ki(x0) = k(xo) then lim Q(ki)(x0) = Q(k)(x0).

1“” l-coo

Let 6 > 0 be given. Then there exists N = N(e) such that

HQ(f) - put)“ s I + e for all i 2 N where 1 = inf HQ(f) - Q(r)H.
rEM

16
k

For any x0 6 [a.b] ~ d {xi}. Iw<xo)<p<f)<xo) - p(ri)(x0>>I s 1 + 6
i=1

for i 2 N and so |w(x0)(p(£)(x0) - Q(r*)(x0))\ =

1m ‘w(x0)(Q(f)(xo) - Q(ri)(x0))‘ s 1 + e. Thus \Iw) - em)“ =

i—ccn

max |w(x)(Q(f)(x) - Q(r*)(x))l

sup Iw<x><p<f)(x> - p<r*)<x>>I

xE[a,b] xE[a,b]
x i xi
s I + 6. Hence HQ(f) - Q(r*)H = inf HQ(f) - Q(r)H.I
rEM

In the next theorem we wish tO consider a characterization of
the best starting approximation for Q(f) from a family of functions
having restricted ranges. Much of the general theory for restricted
range approximation by both polynomial and rational functions can be
found in Taylor [25] and [26], Schumaker and Taylor [23], and Loeb,
Moursund and Taylor [10]. It is interesting to note that the prob-
lem of approximation with restricted range also encompasses one-sided
approximation [4]. The theory of uniform approximation with restricted
ranges is Of particular significance for the problem of obtaining
starting values for various iterative schemes. This will become
evident later on.

For our considerations let {(x), u(x) E C(X) with L(x) <
u(x) for all x E X. K. Taylor [29] has pointed out that this
inequality restriction on L and u can be relaxed. Define
K = {f E C(X): {(x) 5 f(x) 3 u(x)} and set M = K n R;[a,b] which
we assume is nonempty.

Theorem 1.14: Let Q: K ~ C(X) be a continuous operator which is
pointwise strictly monotone and pointwise fixed at f E K ~ M where
K and M are as defined above. Then r = 5-6 M is a best starting

approximation for Q(f) if and only if there exists {xi[:=1 C X,

17

N = 2 + max{n + aq, m + 3p}, for which
(i) x1<x2 <...<xN
(ii) IW(Xi)(Q(f)(Xi) — Q(r)(XiDI = new - emu.
r(xi) = u(xi), or r(xi) = (,(x1)

(iii) sgn*(f(xi) - r(xi)) = (-1)i+1sgn*(f(x1) - r(x1))

sgn(f(x) - r(x)) if r(x) ¥ L(x) and

r(x) 5‘ u(x)
where sgn*(f(x) - r(x)) = +1 if r(x) = {,(x)
-1 if r(x) = u(x)

EIQEEF (Sufficiency). Suppose r E M possesses the properties
(i)-(iii) on the set {xi}§=1. Let r1 E M be Such that

HQ(f) - Q(r1)H s HQ(f) - Q(r)H. If r(xi) = {,(xi) then r(xi) s
r1(xi) since r1 E M and if r(xi) = u(xi) then r(xi) 2 r1(xi).
If IW(X1)(Q(f)(xi) - Q(r)(xi)>l = HQ(f) - p(r>u then
|W(xi)(Q(f)(xi) - Q(r)(xiﬂl 2 IW(Xi)(Q(f)(Xi) - Q(r1)(xi))\- Since
f 45 M we have that HQ(f) - Q(r)H > 0 so that sgn*(f(xi) - r(xi)) at o.
If sgn*(f(xi) - r(xi)) = +1 then f(xi) > r(xi) and by the point-
wise strict monotonicity Of Q at f we must have r1(xi) 2 r(xi).
Likewise, if sgn*(f(xi) - r(xi)) = -1 then r1(xi) s r(xi). Hence
we obtain the desired result as in the proof of Theorem 1.9.
ONecessity). Suppose r E M is a best starting approximation for
Q(f) and that r(x) has the desired behavior on a set Of points
{xi}§;1 C X, N' < N, where N' is maximal and x1 < x2 <...< xN,.
First we shall consider the case where w(x)(Q(f)(x) - Q(r)(x)) a x,
‘1‘ > 0, for all x E X and f-r has constant Sign. A technical
difficulty which results in the general case necessitates that we

consider this situation separately. Without loss of generality

18

assume that f(x) - r(x) < 0 for all x E X. Then there exists
a > 0 such that f(x) < r*(x) < r(x) where r*(x) = 232%;i—E- and
so HQ(f) - Q(r*)H < HQ(f) - Q(r)H, which is a contradiction.

Let 11.12,...,IN, be a collection of relatively open

intervals in [a,b] such that xi 6 Ii’ I] O I] = ¢ for 1 ¥ j,
all extreme points I {x E X: Iw(x)(Q(f)(X) - Q(r)(x))l =
N.

HQ(f) - Q(r)H, r(x) = L(x), or r(x) = u(x)} c u Ii and for each
i=1

*
extreme point in each Ii the function sgn (f(x) - r(x)) has the
Same value. Given T > 0 we can, according to Lemma 1.8, select

n
r6 6 Rm[a,b] such that

(i) Hre - rH a < 'r , (1.3)
L [a,b]

and (ii) sgn(re(x) - r(x)) = sgn*(f(xi) - r(xi)) for all

x 6 1i, i = 1,...,N' .

We shall now Show that there exists 6 > 0 such that reE M and
HQ(f) - Q(r€)H< HQ(f) - Q(r)H. Let ‘Y = X n ( U Ii). Y is a
compact subset of x and Iw(x)(Q(f)(x) - Q(r)(x))l < HQ(f) - Q(r)H

for all x E Y. Hence, by the continuity of Q, we can select

31 > 0 such that for 0 < e s 31, re(x) satisfies (1.3) and
max IW(X)(Q(f)(X) - Q(r€)(X))\ < \Iw) - emu. In addition r(x) a
XEY

u(x) and r(x) # L(x) for all x E Y so there exists with

62

0 < e2 s e1 such that 0 < e s 6 implies {(x) S r (x) s u(x)
e

2
for x E Y. Let

 

wi = {x e x n I; : |W(x)(Q(f)(x) - Q(r)(x))] 2 HQ(flg Q(r)H

and sgn(f(x) - r(x)) = sgn*(f(xi) ‘ r(xi))}

l9

and set

Vi = [x e x n Ti : r(x) = u(x) or r(x) = L(x)}, i = 1,2,...,N'.

With these definitions we see that all extreme points contained in
Ii are included in. Wi_U Vi' Using the strict monotonicity Of Q
at f and the continuity of Q there exists S3, with 0 < 63 S 32,

such that for 0 < e S 63. r (x) satisfies (1.3),
c

max Iw<><><i><f><x> - i<r ><x>>I < um“) - own and
XEWiUVi e

L(x) S r€(x) S u(x).

Do this for i = 1,2,...,N' and let be such that O < a

64 4 S 63

and for 0 < e S 34, r (x) satisfies (1.3),
c

max IW(X)(Q(f)(X) ‘ Q(r )(X))I < HQ(f) - Q(r)H and
XEWUV e

L(X) S r€(x) s u(x)

NI N.
where W = U W. and V = U V.. Set
1 , 1
i=1 i=1

Zi = {x E X ﬂ‘fi : [w(x)(Q(f)(x) - Q(r)(x))] 2 HQ(f) ; Q(r)H

 

*
and sgn(f(x) - r(x)) # Sgn (f(xi) - r(xi))}

N.
and let Z = U 21. By the construction of the intervals {Ii}
i=1
we have that r(x) # u(x), r(x) # L(x) and ‘w(x)(Q(f)(x) - Q(r)(x))l <

HQ(f) - Q(r)H for all x e 2. Finally, let

 

u1 = {x e x n I, : Iwcxxumx) - o<r>(x>>I s ”“0 gmh

NI
and set U = U U,. By continuity there exists

.- l with 0 < as S 34
1-1

65

so that for x E 2 U U and 0 < e S 35, r (x) satisfies (1.3),
e

20

|W(X)(§(f)(X) - Q(re)(X))I < HQ(f) - Q(r)H and L(X) S r€(X) S u(X)~

Combining the above results we have that for 0 < e S 35, re E M

and HMO - Mrs)“ < HQ(f) - o(r)II.|

Corollary 1.15: Under the conditions of the above theorem the best
starting approximation for Q(f) is unique.

Iggggf: Uniqueness is derived from the above characterization theorem
in a manner analogous to that in which Corollary 1.10 was established.l
Remark 1: The question of existence in the above setting is still
unanswered. Thereom 1.13 is not applicable here since it requires

that the domain of Q be c[a,b].

In 1955 S. Paszkowski (see [2]) considered the problem of
approximating a given continuous function on an interval [a,b] by
elements of an n-dimensional Haar subspace (to be defined) which were
also required to interpolate the given function at certain prescribed
points in [a,b] called nodes. He considered, using a classical
approach, the questions of existence, uniqueness and characterization
of best approximation. In 1968 Deutsch [2] extended and continued
the Study of Paszkowski. Using duality theory, Deutsch was able to
recover the characterization theorem of Paszkowski and to actually
formulate exchange algorithms for computation. In 1969, Loeb,
Moursund, Schumaker and Taylor [9] generalized the above work of
Paszkowski and Deutsch to the following problem: let X be a
compact subset of [a,b] and suppose {xi}:=1 C X satisfies

P . . . .
x1 < x2 <...< x . Let {mi}i=l be a set Of p031tlve integers With

P
P
m = 2 mi < n and assume X contains at least n - m + p + 1 points.
i=1

Suppose further that H is an n-dimensional extended Haar system of

21

order v, v = max (mi) + l, (which we shall define) and set
i

M = {p E H p(J)(xi) = ai., 1 S i S p and 0 S j S mi - 1},

J

where {aij} is a set of m real numbers. Then the problem con-

sidered was to find best approximations to elements of

K = {f E C(X) : f(xi) = a l S i S p}

i0’
by elements of M. The error of approximation was measured by a
generalized weight function, a notion to be discussed in Chapter IV.
We shall refer to the elements of M as p-point osculating Haar
functions. A. Perrie [19] has examined a problem analogous to the
above by replacing the extended Haar system by classes of rational
functions. The approximants used by Perrie are referred to as
p-point osculating rational functions.

We shall now examine a problem analogous to that considered
by Perrie for our Operator setting. Let X 9 [a,b] be compact,
{yi}g=l a fixed set of p points in X where y1 < y2 <...< y

* P

P
and {m,]? a fixed set of positive integers with m = m. <
1 i=1 i

0

i=1

n +~l (n as in R3). We shall assume that X contains at least

*
n +-m - m +'p + 2 points. Furthermore, let {ai]E=1 be any fixed
set of p real numbers and define K = {f E C(X) : f(yi) = ai,

i = 1,...,p}. Let aij’ i = 1,...,p, j = 0,1,...,mi-l be a second

set of real numbers where aiO = ai, i = 1,2,...,p and set

.. 1‘ .() _ . -
M - {r e Rm[a,b] . r j (yi) — ai , o S J g mi-l, 1 s i s p}. Follow-

j
ing the discussion in Perrie, we shall refer to the elements of M

as p-point osculating rational functions. As before we are interested

in approximating g E Q(K) by elements of Q(M) where Q: K a C(X)

22

is continuous.

In the case of ordinary rational approximation we know that
for each f E C[a,b] there exists a best approximation from
R:[a,b]. However, in the case of interpolating rational functions
we can no longer insure existence as is demonstrated by the follow-

ing example due to H.L. Loeb [8]: Define f0 6 C[0,1] as follows:

1 + 6x , x e [0, l]
_ 1 l 2
f0(X) - 3 - 6(X - 3). x E [3 . 3

l+6(x-§), xE[-2-,l]

 

 

3

H
3 I
2 ~1= $.60
11

S A : X
l

O 3. 1% 1.

l, w(x) E 1. Let K = [f E C[0,1] :

Here a = 0, b = l, m = n
f(0) = 1}. Obviously f0(x) E K. Define the Operator Q: K a C[0,1]
via Q(f) = f. It is clear that Q is pointwise strictly monotone
and pointwise fixed at f0 6 K. By the usual alternation theory for
rational approximation g(x) E 2 is a best approximation to f0(x)
from Ri[0,l]. Notice that 2 E M = {r E Ri[0,l] : r(O) = 1].

‘91339; There exists no r E M, such that HQ(fO) - Q(r)H =

Hr - f \ = dist (£0,M) = dist (Q(fo), Q(M)).

OI

23

. 2nx + 1
Proof: For each n = 1,2,..., define rn(x) = nx + 1 , E [0,1].
Clearly r E M for all n since r (0) = 1. Also r'(x) = n > 0
n n n 2
(nx+l)
and r (1) = 2_n_+__];< 2 for all n Hence 1 S r (x) S 2 for all
n n +11 ' n

n and for all x E [0,1]. Furthermore f0(l) - rn(l) > 1 and
max [‘f0(x) - rn(x)‘ : x E [0, l-]] S l for all n. Therefore
1
Hf0 - rnH = max [|f0(x) - rn(x)l : x E [a , 1]}. Now {rm} approaches
. 1 . .
2 uniformly on [6, l] and so lim HfO - rnH = lim max {‘f0(x) -

rn(x>I =x or1.111= max {1563 - 2\ =x e 13. 111 = IIf - 211-

Since M C Ri[0,l], HfO - 2H S inf {Hf0 - rH : r E M]. Hence
inf {Hf0 - rH : r E M} = Hf0 - 2H. But, by uniqueness in Ri[0,l],
there exists no r* E M such that HfO - r*H = Hfo - 2H =
inf {Hfo - rH : r E M] and hence the claim.l

In order to establish the characterization theorem for the
above setting we need to introduce the following definitions and
concepts. First we wish to define the notion of an n-dimensional
extended Haar subSpace of order v Of C[a,b]. In doing this, we
shall follow the notation and development given in [5]. For complete-
ness we first define what is meant by a Haar subspace of C[a,b].
Definition 1.16: A finite-dimensional subspace H of C[a,b] is
called a Haar subspace if each nonzero function in H possesses
at most n-l zeros, where n = dim H.
Let {ui(t)]:=1 be a family of functions in C[a,b] (with each
sufficiently differentiable so that what follows is well-defined)

and a S t1 S t2 S...S tn S b a given set of points. Define

24

ﬁ1(t1) ﬁ1(t2) .... ﬁ1(tn)l

* 1 u2(t1) u2(t2) .... u2(tn)
’COC’n -
t1,...,tn O O O

ﬁn(t1) Gn(t2) .... Gn(tn)

 

 

where for each fixed 1:

u. t. if t, < t.
1( J) J-l J 3
G (t.) =
ui (tj) if tj-r tj-r+1 ... tj, l n.

Definition 1.17: The functions u1,...,un will be called an extended

Chebyshev system of order v on [a,b] provided ui E CV-1[a,b],

1,...,n

*
i = 1,...,n, and U > 0 for all choices t S t S... S tn,

1 2

t1,...’tn

ti E [a,b], where equality occurs in groups of at most v consecutive

values of ti'

With the above we are able to define the notion of an extended Haar
subSpace.

Definition 1.18: An n-dimensional SubSpaCe H of C[a,b] is said
to be an extended Haar subSpace of order v, v S n, if there exists
a basis for H which is an extended Chebyshev System of order v
on [a,b].

Remark 1: It is straightforward to Show that H is an extended Haar
subspace of order v if and only if each nonzero element in H
possesses at most n - l zeros in [a,b] counting multiple zeros
in the following manner:

(1) if f(j)(z) = 0, j = O,...,v-1, we say that z is a zero of

multiplicity v;

25

(2) otherwise we say that z is a zero of multiplicity
= , (j) = _
m min {j. f (z) ¢ 0, j 0,...,v 1}.
For a fixed r ='2 E Rn[a,b], we shall write P + rP to
q m n m
denote the subspace {p +1rq : p E Pn’ q 6 PE] of C[a,b]. It is
well-known (see [19]) that Pn + er is an extended Haar subspace

of order v = dim(Pn + rPﬁ) = 1 +-max [n + aq, m + 5p]. Define
S(r) = {h E P +‘rP ' h(j)( ) = 0 i = 1 j = 0 m -1]
n m . yi , ,...,p, ,..., i .

It is clear that S(r) is a subSpace of Pn + rP&. Let us now
restrict Pn’ Pm and r to X and view S(r) as a subSpace of
C(X). The following lemma, whose proof is modeled after an analogous
result from.the work of Perrie (see [19, p. 13]), yields the exact
dimension of the subSpace S(r) of C(X).

lemma 1.19: dim S(r) = l +-max {n +-aq, m + 3p] - m* (= d).

lggggf: Let x1,...,xd be d distinct points in X which are dis-
tinct from the interpolating points y1,...,yp. For each i,

i = 1,...,d, choose hi 6 Ph + rPh such that hi E S(r) and

11

independent. For h E S(r), g = 2 h(xi)hi is an element of S(r)
i=1
and h-g possesses at least 1 + max {n + aq, m + 3p] zeros in X.

h (x ) = 6, , i,j = 1,2,...,d. The set {h,]§ is clearly linearly
i j d 1 i=1

But the numerator of h-g is an element Of P and

maX{n+aq.m+apl
g. Thus {h1,...,hd} is a basis for S(r).l

III

SO h

If {g1,...,gd] represents a basis for S(r) then for each
x d
x E X we shall write x to denote the vector (g1(x),...,gd(x)) E E
(E1 = real numbers). If Y is a subset Of X and y is a real-

valued function defined on Y, then H{y(y)y : y E Y] denotes the

convex hull of the vectors w(y)y E Ed, for y E Y.

26

We conclude our preparatory discussion with an important
result due to Salzer [22].

Lemma 1.20: Let f E CS(X) where s = max [mi - 1]. At points
lSiSp

 

where q(yi) # 0, the system
2(1) = (j) - .. - '=
(q) (yi) f (Y1): J — Oals°°'smi 1: 1 13°°°9P9

is equivalent to

p(j)(yi) = (fq)(j)(yi), j = 0,1,...,mi-l, i = 1,...,p.

Finally we are able to establish the following characteriza-
tion theorem.

Theorem 1.21: Let Q: K ~ C(X) be a continuous operator which is

 

pointwise strictly monotone and pointwise fixed at f E K ~ M where
K and M have been previously defined. Then the following four
statements are equivalent:
*
(i) r = £;-E M is a best starting approximation for
q
Q(f) from M.
o o A * *
(11) 0 E H[o(x)x : x E X(r )} where g(x) = sgn(f(x) - r (x)),
* *
X(r ) = {X E X = IW(X)(Q(f)(X) - Q(r )(X))| =
x _ . *
HQ(f) - Q(r )H], {g1,...,gd] is a baSis for S(r )
x
and ﬁ = (g1(x),...,gd(x)) where d = max{n + aq ,
* *
m +tap ] +'l - m .

(iii) There exist d + 1 consecutive points x < x2 <...< x

1 d+1

P
in X ~ U {yi} such that
i=1

(a) Iw(x,)<o(f)<xi> - p(r*>(xi>>I = \Iw) - p(r*>\\.

i = 1,2,...,d+1,

27

(b) sso{(f<xi> - r*(xi))Ail = <-1>i+1sgn{<f(x1> -

r*(x1))A1], i = 1,...,d+l, where

310(1) 310:2) 81(Xi_1) 810%“) 31(xd+1)

 

 

gd(x1) gd(X2) °°'° gd(xi-1) 8d(Xi+1) "°° gd(xd+1)
(iv) There exist d + 1 consecutive points xi 6 X ~ U {yi}
i=1
such that
* *
(a) Iw<x1>(o(f>(xi> - p(r >(xi>>| = HQ(f) - o<r >u.
i = 1,2,...,d+1,
* '+l
(b) sgn[[f(xi) - r (xi)]ﬂ(xi)] = (-1)1L sgn{[f(xi) -
r*(x.)]ﬂ(x,)} for i = 1,2,...,d+1, where
i 1 m1 m
u(t) = (y1 - t) ... (yp - t) P if p a 0 and
n(t) E 1 if p = 0.
Proof: (i) implies (ii):
Suppose 0 E H[o(x)§ : x E X(r*)]. By the Theorem on Linear
Inequalities [1] there exists h E S(r*) such that o(x)h(x) > 0

* *
for all x E X(r ). Set h = p-r q where p E Pn and q E Ph.

We then have that h(J)(yi) = O, i = l,..., p, j = 0,1,...,mi-l.
*
- *
Set rx = £;f-LB'. Now there exists A0 > 0 such that q (x) -
q ' Aq

*
Aq(x) > 0 for IAI S x0 and for all x E X since q (x) is positive

and q(x) is bounded on the compact set X. IAI*‘ x0, we
* *

have er M Since r* - r1 = E;.- R;_;_L2: 19* r(P-r *QIB Lh
<1 <1 - iq q ”(q -iq) q* -iq
and so h(j)(yi) = 0, i = 1,...,p, j = 0,1,...,mi-l, implies that

(j)

 

*(j)(yi) = a... i = 1,2,...,p, j = O,l,...,mi-l. Since

(y)=r U

28

min{a(x)h(x) : x E X(r*)} > 0.

*
X(r ) is compact, the number 6

Set E = “S(r) — Q(r*)H and x1 {x e x : o(x)h(x) > %n and

1 and X1 is

1 is a closed subset Of X and thus compact.

Iw(x)(Q(f)(x) - Q(r*)(x))I > g]. Note that X(r*) C X

Open; hence X = X ~ X

2

By the continuity of Q and the compactness of X2 there exists

11 such that 0 < A1 S 10 and IKI S A1 implies Iw(x)(Q(f)(x) -
Q(rx)(x))I < E for all x 6 x2 since Iw(x)(Q(f)(x) - Q(r*)(x))I < E

for all x E X2. Set

* * E
u = ianIHX) - 1‘ (X)I = IW(X)(Q(f)(X) - Q(I‘ )(X))I 2 ‘2' and x E X}-

Note that p. > 0. Now choose 1,2 such that 0 < x2 S x1 and

h
x
q 'Aq

S %-< If(x) - r (x)I. Consequently

 

IAI S A2 implies IAI S %-. For x E X1, Ir*(x) - rx(x)I S

 

 

 

 

h
*
q 'Aﬂ

(f - r*)(x) and (f - rx)(x) have the same signum function for

 

Hr" - r,\\ -- \xI

 

 

 

 

x 6 X1. For A <;0 with IAI S x2 we have o(x)(r*(x) - rx(x)) =

10(th(X)

*
q (X)‘AQ(X) *
for all x E X1 and so sgn(f(x) - r (x)) = -sgn(r*(x) - rx(x))

*
< 0 for all x E X1. Therefore o(x)(r (x) - rx(x)) < O

for all x E X1. Hence for f(x) > r*(x) we have f(x)>rx(x)>r*(x) and
for f(x) < r*(x) we have f(x) < rk(x) < r*(x). Applying the point-
wise strict monotonicity of a at r we find that Iw(x)(Q(f)(x) -
Q(rx)(x))l < IWIX)(Q(f)(x) - Q(r*)(x))l s HQ(f) - Q(r*)H= E for

all x E X1 and x as described above. Assembling the above re-
sults we have Iw(x)(Q(f)(x) - Q(rx)(x))I < E for all x e x (i

as above) and so HQ(f) - Q(rx)H < HQ(f) - Q(r*)H. Therefore rA

*
is a better starting approximation than r .

29

(ii) implies (iii):
*
Suppose 0 E H{o(x)ﬁ : x E X(r )]. Then by Caratheodory's

Theorem [l,p.17]there exist d1(d1 S d+1) positive scalars xi

d
1 *
with z A- = l and d consecutive points x ,x ,...,x E X(r )
i=1 1 1 1 2 d1
such that
d
l
iElxio(xi)gj(xi) = 0 for J = 1,2,...,d. (1.4)

l l
i = 1,...,d, j = 1,...,d where x. = x., i = 1,...,d1 and
d i i
—' d
{xi}i=d +1 0({xi}i:1 U [yi}:=1)= ¢. This then implies that there
1

We claim that d = d+l. Suppose d s d. Then det[gj(Xi)] = 0,

exists a d-tuple a = (“1"°"ad)’ a ¢ (0,0,...,0) such that
d ‘_ . d *
Z a.g.(X.) = 0, i = 1,...,d. Set g = 2 o g,. Then g E S(r )

j=1jJ
and so g has, in addition to simple zeros at the ;i, i = 1,...,d,
zeros of order mi at the points yi, i = 1,...,p. In total, g

* x x x *
possesses at least d +-m = max{n + aq , m + 5p } + 1 - m +-m =

x *
max{n + aq , m +-ap ] +-1 zeros. But the numerator of g is an

element of Pmax{n+3q*,m+3p*} and so g E 0. This is a contradiction
and so d1 = d +'l. Rewriting (1.4) as

d+l

iizhio(xi)8j(xi) = ”A10(x1)gj(x1)9 J = 1929"°:d

and solving by Cramer's rule, we obtain

i+l
('1) X10(X1)Ai
3
A1

 

lio(xi) = i = 2,...,d+l .

We now claim that Ai # 0 for all i. On the contrary

suppose that Ai.= 0 for some i. Define hi by

30

we 31%) glam) glam) glcxdﬂ)

hi(s) = . . Z .

 

 

gd(s) gd(x2) ”°° gd(xi-1) gd(xi+1) "" gd(xd+1)

Then hi(x1) = hi(x2) =...= hi(Xi-l) = hi( ) =...= = 0

hi(xd+l)

*
possesses at least d +-m

Xi+l

*
and since hi E S(r ) we have that hi

*
zeros. But hi i 0 since Pn + r P5 is an extended Haar subSpace
* *
of order 1 +-max[n +>aq , m +~ap }. Hence we have a contradiction.

*
Now xi > O and q(xi) = sgn(f(xi) - r (xi)) and so the result

follows easily.
(iii) implies (iv):
Let [21,...,zd_1] be an arbitrary set of d-l consecutive

points in X ~ U {yi} and define
i=1

81(8) 31(21) °’°°'° g1(zd_1)

g(s) = . . .

 

 

gd(s) gd(zl) ‘°'°°' gd(zd-1)

*
Note that g E S(r ) since g is a linear combination of the basis
*
elements of S(r ). Furthermore, g(s) has a nonzero coefficient
by the remark at the end of (ii) implies (iii). By construction,
* * *

g(s) has d-l +-m = max{n + aq , m +|3p ] zeros counting multi-
plicities. These consist of a zero of order m1 at each yi and a
simple zero at each 21 (the zeros at 21 are simple since g i 0).
Hence g changes sign at each 21 and changes Sign at yi if

and only if m1 is odd. Set 21 = xi+2, i = 1,...,d-l, for the

31

choice Of xi's in part (iii). Then

 

 

 

 

 

 

s1(x1) 31(21) gl(zd_1) 81(x1) g1(x3) gl(xd+1)
g(xl) = : = = A2
gd(x1) gd(zl) gd(zd_1) gd(x1) gd(x3) gd(xd+1)
Similarly g(xz) = A1. Now if yi1,...,yiq E (x1,x2), then
m. +...+-m, m, +...+ m,
11 lq i1 iq
sgn A2 = Sgn g(x1> = <-1> sgn g(xz) = <-1) Sgn A1
110(1) *
= sgn N(Xz) sgn A1. Thus sgn{[f(x2) - r (x2)]ﬂ(x2)} =
1': * Sgn A2
sgn{if<x2) - r (X2)]}Sgn u(x,> = sgn{[f(x2) — r <x,>]} Sgn 11 sgn u(x1> =
* sgn u(xl) * sgn N(Xl)
sgn{[f(x2) - r (x2)]A2] —-§§E—KI— = -sgn{[f(x1) - r (x1)]Al] -—;EE—KI. =

-Ssn{[f(x1> - r*<x1>]n<x1)}-
Continuing in this manner, we obtain the desired result.
(iv) implies (i):

Suppose there exists r E M such that HQ(f) - Q(r)H S
HQ(f) - Q(r*)H. Now r*(xi) < f(xi) implies r*(xi) S r(xi) and
similarly r*(xi) > f(xi) implies r*(xi) 2 r(xi). Thus, using
(iv), we have that [r*(xi) - r(xi)]ani) 2 O and [r*(xi+1) -
r(xi+1)]n(xi+1) S 0 (or conversely) for i = 1,...,d. Set

S = r - r*. If S(xi) # 0 and s(x ) = 0, while

1+1) =° ' '= S(xi-l-j-l
s(xi+j) # 0, then since (-l)k[(r*(xk) - f(xk))n(xk)] is of one

sign for k = l,...,d+l we have that sgn{[S(xi)]H(Xi)l =

sgn{[f<xi> - r*(xi)]n(xi)} = (~113ssn{[f(xi+j> - r*(xi+j)]ﬂ(xi+j)l

= (-l)jsgn{[s(xi+j)]n(xi+j)]. Now we have two cases to consider.

32

First assume that the sum of the mk's such that yk E (xi.xi+j)

is even. Then u(xi)n(xi+j) >10 and sgn s(xi) = (-1)jsgn S<xi+j)°

Thus if j is odd, s(x) has an Odd number Of zeros (counting
multiple zeros according to multiplicity) in [xi, xi+j]’ while if

j is even s(x) has an even number of zeros in [xi’xi+j]' Since,
by definition, 8 has at least j-l +2mk (sum over yk E (xi’xi+j))

zeros in [xi,x we can therefore conclude that s has at least

...],

j +'Emk zeros in [xi’xidjl' Counting zeros, we find that s has
* * x

at least d +'m = l +-max{n + aq , m +-ap ] zeros in [a,b]. But

* *
the zeros of s are the zeros of its numerator p q - q p, which

* *
is a polynomial of degree at most max{n +taq , m + 3p ]. Hence

3 0. Similarly, we obtain the same conclusion if the sum of the

*
mk's, such that yk E (xi, xi+j)’ is Odd. Thus r r ,I

In order to establish uniqueness of best approximation in

the above setting we require the following lemma:

Lemma 1.22: Let r* E M be a best starting approximation to

f E K ~ M where K and M are as previously defined and Q: K a C(X)
is pointwise strictly monotone and pointwise fixed at f. Then

h E S(r*) and h(x)o(x) 2 0 for all x E X(r*) imply h E 0.
.nggﬁ: Let h E S(r*) with h(x)o(x) 2 0 for all x E X(r*) and
suppose h i 0. Let zl’ZZ"°"Zt be the zeros of h(x) which are
in X(r*). Notice that 21 ¥ yj for any pair i,j since f(yi) =
r*(yi) and HQ(f) - Q(r*)H > 0. Furthermore t < d =

max{n + aq*, m.+-ap*] +'1 - m* since if t 2 d then b possesses
at least 1 +-max[n + aq*, m + ap*] zeros which is impossible since

the numerator of h is an element of P From the

maxIrIi'oCI*om"*oP*l'

*
standard interpolation theory we know that there exists g E S(r )

33

such that g(zi) = g(zi), i = 1,2,...,t. Since a(x)h(x) 2 0 for
all x E X(r*), we claim that there exists l > 0 such that
o(x)(h(x) +-Xg(x)) > 0 for all x E X(r*). For each i = 1,...,t,
choose an open interval, 1(21), about 21 such that o(x)g(x) > 0
for all x E I(zi) and I(zi) ﬂ I(zj) = ¢ for i ¥ j. This we
are able to do since q(zi)g(zi) = (C(21))2 : 0 and o(x)g(x) is

* N
continuous at x = zi. Set Y = X(r ) ﬂ ( ﬂ {I(zi)]). Y is closed
i=1

in [a,b], hence compact, and so o(x)h(x) > 0 for all x E Y. Let
m = inf{c(x)h(x) : x E Y] and M = inf{o(x)g(x) : x E Y]. Note that
m > 0 and M > -m. If M 2 0, let A > 0 be arbitrary; if M < 0,
choose A > 0 such that m +'xM > 0. Then o(x)(h(x) + kg(x)) > 0
for all x E X(r*). Since S(r*) is a subSpace Of Pn + r*ph and
h,g E S(r*) we have that h +txg E S(r*). Thus, by the Theorem on
Linear Inequalities, 0 E H{o(x)§ : x E X(r*)] and so r* is not a
best approximation to f E K ~ M. This is a contradiction..

Theorem 1.23: If r* E M is a best starting approximation to Q(f),
then r* is unique.

“2522:: Suppose r E M is Such that HQ(f) - Q(r)H = HQ(f) - Q(r*)H.
Then if g(x) = sgn(f(x) - r*(x)) we have for x E X(r*): (i) if
f(x) > r*(x) then r(x) 2 r*(x) and (ii) if f(x) < r*(x) then

x ‘ *
r(x) S r (x). Hence we conclude that o(x)(r(x) - r (x)) 2 0 for

 

*
* p(X) _ P (Xi
all x E X(r ). Consequently o(x)(q(x) q*(x)) 2 0, so

o(x)(p(x) - q(x)r*(x)) 2 0 Since q(x) > 0 on [a,b]. By Lemma

* *
1.20 we have that p-r q E S(r ). Therefore Lemma 1.22 gives

p(x) - r*(x)q(x) E 0 and so r E r*. I

34

Finally we shall give a generalization Of a theorem of
de la Vallée Poussin, for the problem at hand, which is often useful
in estimating the deviation of the best starting approximation to Q(f).

*
*
Theorem 1.24: let f E K ~ M and let r = B;'E M be the best

q
starting approximation to Q(f) from M. If r E M, {xi]i:i a set
. . . p d+l .
Of d+l consecutive p01nts in X ~ U {yi], and IOiAi=1 is a set

1—1
of positive numbers Such that

(i) IW(X1)(Q(f)(Xi) ' Q(r)(xi))I = A1,
(ii) sgn{[f(xi) - r(xi)]n(xi)] = (-l)i+lsgn{[f(x1) - r(x1)]n(xl)],

*
then min ii S HQ(f) - Q(r >H-
i

x *
Proof: If r E r the result is clear and so we suppose r i r

Assume min Ai > HQ(f) - Q(r*)H. Therefore ii > HQ(f) - Q(r*)H,
i = 1,...fd+l (where d = max{n + 3q*, m + 3p*] + 1 - m*) and so
Iw(x,)<o<f><xi) - p(r><x,>>l > Iw<xi>o<f><xi> - o<r*>(xi>)l.

i = 1,2,...,d+1. Now proceeding as in the proof that (W) implies

*
(i) in Theorem 1.21 we obtain the fact that r r which is a

contradiction. Hence min xi S HQ(f) - Q(r*)H.|
i

Section 3: Computation of an Optimal Starting Approximation

In this section we shall examine the problem of computation
of a best starting approximation for an Operator 'Q. The characteriza-
tion theorems we have developed, especially Theorem 1.9, are of
particular importance in establishing computational procedures. From
the classical theory an optimal Starting approximation would be
computed using a modified Remes algorithm which involves solving a

nonlinear system of equations with Newton's method of higher order.

35

However, if the Operator Q satisfies certain conditions a best
starting approximation can be more easily obtained. Moreover, the
best starting approximation may be independent Of the number Of
applications of Q provided this iteration is well-defined. We now
wish to consider sufficient conditions on Q for which this behavior
occurs. The following definitions and results are due to Meinardus
and Taylor [11].

Definition 1.25: Let Q: K a C(X) be a continuous operator. We

say that Q possesses Property I at f E K provided for each r E K

and x,y 6 x, £152.- £311 implies 2$£l£§2.= EIEISXI and

 

f(x) ’ f(y) f(x) f(y) ’
r(y) r(x) r(y) r(x) . . _ Q r x
f(y) < f(x) S l or f(y) > f(x) 2 1 implies 1 f(x) <
Hm.
f(y) '

Definition 1.26: Let Q: K a C(X) be a continuous operator. Q

is said to be one-sided at f provided either Q(k) 2 Q(f) for all
k E K or Q(k) S Q(f) for all k E K.

Theorem 1.27: Let K be a convex subset of C(X) and Q: K a C(X)
be pointwise strictly monotone, pointwise fixed and possess Property

I at f E K where Q(f)

f and f > 0. Norm C(X) by HhH =

H%Hm (h 6 C(X))- LCC r 5 E R:[a,b] (m,n,a,b fixed) be the best

relative approximation to f from R:[a,b] with deviation 1; that

. f-r . f-s n ,
is, Hmf-‘H0° = 12f “-E—Nm = A. If M = K H Rm[a,b] is nonempty
SElea Sb]
_ n l l
and relatively open in Rm[a,b] and St E M for 5 6 [1+1 ’ 1'1]
l 1 . .
then there exists 60 E (1+1 , I'AP) for which 60r is the best

starting approximation for f (with reSpect to Q).

36

Proof: First we observe that A < 1 since arbitrarily small positive
functions exist in R:[a,b]. From the standard Chebyshev theory we
know that there exists N = 2 + max{n + aq, m + 3p} consecutive

points, x1 < x2 <...< xN, in X such that

(ii) sgn(f(xi) - r(xi)) = (-l)i+lsgn(f(x1) - r(x1)), i = 1,...,N.

f(xi) - r(xi)
f(xi)

f-r
f

 

 

 

 

(i)

 

 

Without loss Of generality we may assume that r(xl) > f(xl). It

 

. . . . l _
is ea31ly seen, uSing (i) above, that 1+). r(xl) - f(xl) and
14;}:- r(x2) = f(x2) and in general 1 - A S If—Ef] S 1 + 'A- Hence, in
r(x1)r(x2) Yr(X )
view of the above results, 2 5.112 2 and so -——1—-— 2
f(x x1) f(x) f(x x:) f(x1>
'Yr(x) Yr(X2 ) 1 1
_f_(;)_2 fx( ——)-2,where yEIm,ﬁ],for all xEX. By Pro-
perty I,
swam Q(Yr) (x1) om) (x >
1-———— Smax l--—— ,l-—————2—
f(x) f(xl) f(xz)

 

 

 

 

 

 

f("1+2) ' r<xi+2) = f(xi) ' r(xi)
f("i+2) f(xi) ’

 

 

Also, i = 1,2,...,N-2, implies

r(x i+2) _ r(xi)

 

 

 

 

 

 

 

f—(x-—)- - m , i = 1,...,N-2 (using Property (ii)). Thus, for
1+2 1
Y€I_ 9 _']s " = 1-—-—-— sinceQ
1+), 1 K f(xi+2) f(xi)
possesses Property I at f. Using the pointwise strict monotonicity
Q(Yr)(X1)
of Q at f, we find that 1 - -——— decreases to O as y
f(xl)
1 d 1 i<Vr><x2> .
decreases to a: an - —f-(x—2)- increases from 0 as y

37

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

decreases to 1:; . Similarly, 1 - ———?Y;Iy— increases from 0
y increases to -:—- an - —————————- decreases to 0 as y
l x f(xz)

1 . 1 Q(Yr)(xl) d Q(Yr)(X2)
increases to l:{ . Since both - -_—?Y;;7—- an - ___f?;;7_
are continuous functions of y there exists E —l— -—l— such

’ YO 1+), ’ 1-x
Q(vor)(x1)I _ p(vor)(x2) o<f><xi>-p<y0r><xi>
that 1 - f(x ) I — - f(x ) Hence f(x )
l 2 i
g Q(Yor)(xi) _ Q(f)(X) - Q(YOrHX) _ Q(f) - Q(Yorj
' f(x ) " "'3" f(x) ’ f |
1 XEX co

= HQ(f) - Q(yor)H. Finally, it is clear from the choice of y that
sgn(f(xi) - Yr(xi)) = sgn(f(xi) — r(xi)). Thus, by Theorem 1.9,
yor is the best starting approximation for f.I

Definition 1.25 and Theorem 1.27 of Meinardus and Taylor have
rather natural analogs in the setting of uniform approximation
(“(10 '3 1)~
Definition 1.28: The Operator Q is said to possess Property J at
f E K if for each r E K and x,y E X, f(x) - r(x) = f(y) - r(y)
implies Q(f)(X) - Q(r)(X) = Q(f)(y) - Q(r)(y). and 0 S f(X) - r(X)
< f(y) - r(y) or 0 2 f(x) - r(x) > f(y) - r(y) implies
IQ(f)(Y) - Q(r)(Y)I > IQ(f)(X) - Q(r)(X)I-
Theorem 1.29: Let Q: K ~ C(X), K be a convex subset of C(X),
and M = K n R:[a,b] be a relatively open subset of R2[a,b] with
m S n. Assume Q is pointwise Strictly monotone, pointwise fixed
and possesses PrOperty J at f E K ~ M. If r E R:[a,b] is the

best uniform approximation to f from R:[a,b] with deviation A

38

and if r +'c E M for c E ['Ash] then there exists c0 E ('131)
for which r +-c0 is the best starting approximation to Q(f)
from M.

EEEBEF Since r is the best uniform approximation to f from
R;[a,b], there exist N points, x1 <...< XN’ in X, N = 2 +

maxfm + 5p, n +-aq], such that
(i) If(xi) - r(xi)I = Hf - rH, i = 1,...,N
.. i+l .
(ii) sgn(f(xi) - r(xi)) = (-l) sgn(f(x1 - r(x1)), i = 1,...,N .

Now f(xi) - r(xi) = f(xi+2) - r(xi+2), i = 1,2,...,N-2 and so by
Property J Io<t><xi> - o<r><xi>I = IQ(f)(Xi+2) - p(r)(xi+2>l.

i = 1,...,N-2. Also by Property J we see that for any c E [-I,x],
IQ(f)(x) ' Q(r+c)(x)I S m3X(I§(f)(X1) ‘ Q(r+c)(x1)I, I¢(f)(X2)-Q(r+c)(x2)I).

Note that for each r and x, Q(r+c)(x) is a continuous function of
c. Without loss of generality assume f(xl) > r(xl). Using the fact
that Q is pointwise strictly monotone and pointwise fixed at f E K
we have that IQ(f)(x1) - Q(r+c)(x1)I is a strictly decreasing func-
tion of c which tends to 0 as c tends to A and

IQ(f)(x2) - Q(r+t)(x2)I is a strictly increasing function of c
which tends to zero as c tends to 'A- Since both

I§(f)(xl) ‘ Q(r+C)(x1)I and IQ(f)(X2) - Q(r+t)(x2)I are continuous
functions of c there exists cO E (~K,K) for which

|Q(f)(x1) - 9(r+co)(x1)| = IQ(f)(x2) - Q(r+t0)(x2)|. Thus

IQ(f)(xi) - Q(r+c0)(xi)I = HQ(f) - Q(r+co)H, i = 1,2,...,N. Further-
more. since co 6 01.1). sgn(f(xi) - (ti-co) (1(1)) =

(-l)i+lsgn(f(xi+1) - (r+c0)(xi+1)) and so appealing to Theorem 1.9

39

we have the desired result. I

Theorem 1.30: Let Q: K d K satisfy the following properties:
(i) Q is continuous
(ii) Q(f) = f
(iii) Q is pointwise strictly monotone at f E K ~ M
(iv) Q is a one-sided operator at f

(v) Q possesses PrOperty I (Property J) at f .

Then Qm = Q(Qm-l), m = 2,3,... has all the same properties as Q
and, moreover, the best starting approximation for Q(f) is also

the best starting approximation for Qm(f), m = 2,3,... .

.ggggf: The first part of the conclusion follows from Corollary 1.6
and a simple inductive application of Property I (Property J). For
the second part let us suppose, without loss of generality, that

Q(k) 2 Q(f) = f for all k E K and that we are dealing with relative
approximation. Now if r = 5- is a best starting approximation for

Q(f) then there exist N points {xi}¥

1___1;:x,N=2+

max[n + aq, m + 5p], such that x1 < x2 <...< xN and

 

Q(r)(x.) - f(X.)
(i) i i I

21£l_:_£I , i 2 1,...,N
f(xi) f m

(-l)i+lsgn(f(x1) - r(x1)), i = 1,2,...,N.

<11) sgn(f(xi> - r(xi>)

Now by applying Property I to these points m times it is clear that
r is also the best starting approximation for Qm(f) = f .|

In attempting to construct a theorem analogous to Theorem 1.27
for the case of restricted range approximation certain difficulties
arise. The multiplicative shift of the best relative approximation

to f from R;[a,b] may well remove us from the approximating class.

40

However in one particular setting a result can be obtained. If
L(x),u(x) E C(X) denote the upper and lower constraining curves for
the restricted range problem, take L(x) = f(x) or u(x) = f(x)
where f(x) is the function we desire to approximate. Without

loss of generality take L(x) = f(x) and set K = {h E C(X): f(x) S
h(x) S u(x)}. This situation will be described as modified one-
sided approximation from above since the usual one-sided approxima-
tion would have u(x) E m. In this setting we are able to state the
following analog of Theorem 1.27.

Theorem 1.31: Let Q: K » C(X), K = {h E C(X): f(x) S h(x) S u(x)]
where u(x) E C(X), f > 0 on X, and C(X) is normed by

HhH = H%Hm. Further suppose that Q(f) = f, Q is pointwise strictly
monotone, pointwise fixed, and possesses PrOperty I at f. Let r E M
be the best relative approximation to f from M where

M = K O R;[a,b]. Then r is the optimal starting approximation
for f from M (with respect to Q).

‘nggf: The proof follows directly from Theorem 1.14 and the fact
that Q possesses PrOperty I at f.|

Remark 1: In applying the above theory one may find it more efficient
(in terms of the computer) to compose different operators (each of
which defines an iteration procedure) rather than iterating with a
given Operator. In view of this we state the following: suppose

Q: K ~ C(X) possesses PrOperty I at f E K, Q is one-sided at f
and I: L ~ C(X) possesses Property I at Q(f) where Q(K) C L. If w(f)
Q(f) = f then YQ possesses Property I at f E K. WlLJ this
Observation and the results in Section 1 all of the theorems of this

section may be restated for the composition Operator YQ or more

41

generally for the composition of operators Y1,...,Yk, provided the
domains and ranges mesh correctly and each of the Operators Yi

possesses the appropriate properties.

CHAPTER II

APPLICATION TO THE NEWTON OPERATOR AND ASSOCIATED RESULTS

Section 1: Introduction and General Theory

In this chapter we shall apply the theory of Chapter I for
a special choice of Q, namely the Operator associated with the well-

known Newton iteration scheme. First set
2
S = {f e c (ops): f',f" s4 o for all x > o and image f = (o,oo)l-

The choice of (0,m) is quite arbitrary; actually, one could select

(-mym) or some other region on which to work. Note that

domain f”1 ==(O,m) Since image f = (0,m). For a fixed x E [a,b] C
-1 , .

(0,m) we can solve f (y) - x = 0 by Newton 3 method tO obtain

the value of f at x where f E S. In particular, if y0(x) = y(x)

is the initial guess to f(x) then inductively we define yn(x) by

yn<x) = yn_1(x> - {f'1(yn_1<x)> - xlif'If‘1<yn_1<x>>31, o = 1,2,... (2.1)

Of course we must insure that yn_1(x) > O, n = 1,2,... so that

f.1 is defined. The sequence (2.1), which represents the Newton
iteration scheme for determining the unique zero of the equation
f-I(y) - x = 0, converges quadratically to f(x) provided the initial
guess y0(x) is Sufficiently good. As mentioned, the above con-

sideration is for a fixed value of x. Our goal is to be able to

calculate f(x) for all x E [a,b] on a computer. To accomplish

42

43

this we shall select a class of functions M (Specifically

M C R:[a,b] for our purposes) which are easily programmed and then
select a member of M as the initial guess. Precisely, we wish

to find an element r of M for which

yn r(X) - f(x)
f(x)

yn S(X) - f(X)
f(x)

 

 

5

co

 

 

 

 

 

 

 

 

00
for all s E M where yn S(x) denotes the n-th Newton iterate at
3

x with initial guess s(x). Numerically the above method would be

applied to functions f for which f.1 is easily evaluated such

1

as f(x) = x“, n = 2,3,... . This particular problem was examined
by D.G. Moursund and G.D. Taylor [16] and is a generalization of the
subroutine used to calculate /r on a computer Such as the CDC-3600.
We shall now Show that the theory we have develOped in Chapter I is
also applicable to this problem as well as others.

The first problem confronting us is that Of determining a
suitable convex subset K of C[a,b] such that the operator N

f
defined by

Nf(h)(x) = h(x) - {f'1<h<x>> — x}{f'[f‘1(h(x>>ll

maps K to K (into or onto). It is imperative that we examine
two cases:
Case 1: Fix f E S and assume that either f' > 0 and f" < 0

on (0,m) or f'< 0 and f” < 0 on (0,m). Let

K = [h E C[a,b]: h(x) > 0 for all x E [a,b]].

44

Setting N(x,y) = y - {f-1(y) - x]{f'[f-1(y)]] and calculating

32%;;113 it is easily seen that Nf

at f and one-sided from above at f. It is also clear that

is pointwise strictly monotone

N ' K a K is continuous and Nf(f) = f. Observe that the class of

f 1

functions f(x) = xn, n 2,3,... are suitable choices for f.
Case 2: Fix f E S where either f' > 0 and f" > 0 or f' < O
and f" > 0 holds on (0,m). The choice for K in this setting

is somewhat more difficult since N is a one-sided operator from

f
below and consequently large values of h(xo), for fixed x0 E [a,b],
may yield negative values for Nf(h)(x0). Since we wish to iterate

with N and we have assumed the image of f is (O,m), we must

f
bound the members of K from above. Specifically, for fixed

f E S as above, we wish to find, if possible, a function

¢(x) E C[a,b] such that if
K = {h E C[a,b]: 0 < h(x) < ¢(x) for all x E [a,b]]

then Nf: K e K. If we set N(X:Y) = y ‘ {f-1(Y) ‘ Xllf'lf-1(Y)]}

(as above) and calculate 321%;123 it is clear that for each fixed

x N(x,y) is a strictly increasing function of y for

0 < y < f(x) and a strictly decreasing function of y for y > f(x).

Moreover, N(x,y) > 0 for 0 < y < f(x). Let us now assume that for

each fixed x E [a,b], lim N(x,y) = -m. This guarantees us that
y-Hu:

for each x E [a,b] there exists yx > 0 such that N(x,yx) = 0

(clearly yx > f(x) for each x E [a,b]). To construct the func-

tion ¢(x) mentioned above we shall make use of the Implicit

Function Theorem. First we have that N(x,y) is defined on

45

1 2 1 1
[a,b] x E+_C E where E+_= {s e E : s > 0}. Let x0 6 [a,b] be

arbitrary but fixed and let E E: be such that N(x0,y0) = 0.

yo
Then the following holds:

(1) [a,b] X [y: y > f(x0)] = “(X0 is a neighborhood

’yo)

of (x0,y0) in [a,b] X Bi, N(x,y) is continuous

 

on n and M 2 _ if'lm - xlf'if‘liyil
(X 93' ) ay . '1
0 0 f [f (y)]
is continuous on “(xosy0) by the assumptions on f.
(iii) EHAEJXI. # 0 .

3y
(x0.y0)

Hence, by the Implicit Function Theorem, there exist neighborhoods

“x0 of x0, “yo of y0 and a function ¢O(x) defined on “x0

such that

(iv) for all x e “x , ¢O(x) E ﬂy , (x,¢0(x)) E “(X y )
0 0 0,0

and whom) = o.
(v) ¢O(x) is uniquely determined by (iv)

(vi) ¢0 is continuous on “x
0

(vii) ¢0(xo) = YO .
Now consider {Hx: x E [a,b]]. This is an open cover of [a,b], which
is compact, and so can be reduced to a finite subcover “x ,...,Hx .
1
For each i = 1,2,...,n we have ¢i(x) defined on “x and con-

i
tinuous there. Suppose “x O ﬁx # ¢ for i # j. We wish to Show that
1 J

(x) = (x) for x E O . Let 2 E I n . Then
¢i $1 'hi 'Wj Ihi TKj
N(z,¢i(z)) = N(z,¢j(z)) = 0 and ¢i(xi) = y1 > f(xi).

46

Claim: ¢i(z) > f(z).

Assune to the contrary. Then ¢i(z) < f(z) since ¢i(z) = f(z)

implies 0 = N(z,¢i(z)) = N(z,f(z)) = f(z) > 0. Define gi(x) =

¢i(x) - f(x). Then gi is continuous on TH with gi(xi) > 0,

i
81(2) < 0. Hence there exists 5 E “x such that gi(g) = 0,
i

Therefore ¢i(§) = f(g) and so 0 = N(§,¢i(§)) = N(§,f(§)) > 0.
This is a contradiction and thus the claim. Similarly ¢j(z) > f(z).
Since N is decreasing for y > f(z) and N(z,¢i(z)) =
N(z,¢j(z)) = 0 we have that ¢i(z) = ¢j(z). Consequently we are
able to define a function ¢ on [a,b] via ¢(x) = ¢j(x) if
x E “x., j = 1,...,n. The function ¢ is continuous on [a,b]

J

and ¢(x) > f(x) > 0 for all x E [a,b]. Set
K = {h E C[a,b]: 0 < h(x) < ¢(x) for all x E [a,b]].

Then if h E K, Nf(h)(x) S f(x) < ¢(x) for all x E [a,b] and
from the previous discussion Nf(h)(x) > 0 for all x E [a,b].

Hence, for the above choice of K, Nf: K a K and so iterating with

Nf via N? = Nf(N?-1), m = 2,3,... is well-defined. As in Case 1

we know that N is continuous, Nf(f) = f and N is pointwise

f f

strictly monotone at f. Moreover, Nf is one-sided from below at f.

Remark 1: For the function f(x) = ex (which falls in Case 2),

the function ¢(x) = e1+x satisfies N(Xsel+x)

0 for all

x E [a,b] where N(x,y) is defined by N(x,y) y(l + x - Ln y).

Hence we set

1
K = [h E C[a,b]: 0 < h(x) < 6 +x for all x E [a,b]].

47

With the preceding discussion in mind we can state the follow-
ing analog of Theorem 1.9 for both of these cases (with the reSpective
K's defined above).

Theorem 2.1: Let f E S, K be as above and M = K n R;[a,b] (fixed

m,n). Then rk ‘lﬁi E M is the unique best starting approximation

for f with respect to the k-th Newton iteration if and only if

there exists {xi]§=1 c [a,b], N = 2 +-max{n +qu, m + apk], with

x1 < x2 <...< xN for which

f - N:(rk)
f

 

(i)

 

f(xi) - n‘f‘skxxi) .
f(xi) a, i=1,2,...,N

(ii) sgn(f(xp - rk<xi>) = <-1)“1sgn<f<x1>- rk(x1>>.

 

 

 

 

 

i=1,2,...,N.

Section 2: Optimal Starting Approximations for Roots, Exp(x) and Ln(x)

In general the existence of a best starting approximation is
a rather difficult problem which depends upon the particular function
f and the interval [a,b]. For the Special case that f(x) = xa,
o E (0,1) or f(x) = ex existence has been established and, more-
over, the best starting approximation is independent of the number
of iterations. For f(x) = xa, o E (0,1), set K = {h E C[a,b]:
h(x) > 0 for all x E [a,b]] where 0 < a < b. The Operator N

f
is defined by

Nf(h)(x) = 01%;" 1)h(x)+—l'—}:—— , for h E K.
h" (x)

1
In the case of f(x) = ex, set K = {h E C[a,b]: 0 < h(x) < e +x

for all x E [a,b]] and define Nf(h)(x) = h(x)(1 +-x - Ln h(x))

for h E K. If r E M denotes the best relative approximation to

48

ex from R:[a,b] with error A» then we must apparently require
that A < Eii- in order to apply Theorem 1.27. However this re-
quirement will be shown to be unnecessary. Before stating the next
theorem due to Meinardus and Taylor [11], we introduce the following

notation: for each a E (0,1), let to denote the best relative

. . n . . . .
approximation to xa from Rm[a,b] With dev1ation Ia; that is,

 

 

 

 

 

 

 

 

 

 

 

 

 

 

x - r xa
o = inf - S = A .
xa n xa o
s a b
€Rm[ 9 1 m
x
Let r denote the best relative approximation to f(x) = e from
n X r eX s
Rm[a,b] with deviation 1; that is, x = inf
n e
e a) SERm[a,b] 00

Theorem 2.2: For m = 1,2,... the following is true:

(a) The best starting approximation for m Newton iterations

for the calculation of xa

is Yoro where
(1 + Ve-l - (1 - ia)e'1 0’ 1

V = 2 5-1 ’ B = 3 °
2(p-l>ia<1 - he)

 

(b) The best starting approximation for m Newton iterations

for the calculation of ex is yr where

y = expi§§ (21 + (l-i)tn(1-i) - (1 + X)Ln(l + i))] s
1
1- 5"
“ﬁg? 7‘ ——1
1-x2

Proof: The proof of (S) appears in [28] and will be omitted.

For (b) we Shall apply Theorems 1.9 and 1.30. First we shall Show

that N = N x possesses PrOperty I at f(x) = ex where N(r)(x) =

e
r(x)Q - Ln Eli-l) . If r E K (as defined earlier), y,z E [a,b]
e

49

and

r(y) = r(z) To
2

then we clearly have N(r)(y) = N(r)(z)
9y e eY ez

obtain the second condition necessary for Property I we set

m(t) = 1 - t +'t Lnt (for t > 0) and examine m'(t). Since

¢'(t) < 0 for 0 < t < 1 and m'(t) > 0 for t > 1, w(l) is the
only local minimum for ¢(t) when t > 0 and hence the second con-
dition for Property 1 is also satisfied. Next we shall show that
the best relative approximation, r, to ex belongs to

K = [h E C[a,b]: 0 < h(x) < e1+x for all x 6 [a,b]}. This follows
from the fact that (1438‘s r(x) s (1 + we" and 0 < x < 1. Pro-
ceeding as in the proof of Theorem 1.27 we find that yr has an

error curve of the type described in Theorem 1.9 where

1
y = eé-ibzx -l-— . The value of y is found by solving the
1-12
equation
N(Yr)(x1) N(Yr)(x2)
l - --j;———- = 1 --——-::———-
l 2
e e
x1 x2 e
where r(xl) = (1 + x)e and r(xz) = (l - x)e . Since 0 < y < 2

we have that yr 6 K and so by Theorem 1.9 yr is the best starting
approximation for ex with respect to N. Furthermore, by Theorem
1.30, yr is the best starting approximation for ex with respect
to Nm (m = 1,2,...).I

The aforementioned Property J is a rather natural analog of
Property I for the case of uniform approximation. The Newton operators
used for approximating the functions f(x) = eX and f(x) = xa,
a 6 (0,1) fail to possess Property J and hence Theorem 1.29 does

not apply. However, the Newton Operator for approximating

50

f(x) = Ln x does possess PrOperty J as will be shown. Incidentally,
the Newton Operator for approximating Ln x fails to possess Property
1. We now turn our attention to the operator N E NLnx' For

0 < a.( h, set K B {h E C[a,b]: h(x) >.Ln x - l for all x 6 [a,b]}

' = . __X_-_ a: _ [Ln x-h(X)]
and define NLn x(h)(x) h(x) 1 +-eh(x) h(x) l + e
for h E K. It is straightforward to show that N E NLnx : K a K,

is pointwise strictly monotone at Ln x and is one-sided from above.
To prove that N possesses Property J at Ln x we define

w(t) = (t-l) + e-t and examine m'(t). Since

> O for t > 0

m'(t) = 1 - e-t = O for t 0 , the desired result is obvious.

< 0 for t < 0

With the above we can prove the following result:

Theorem 2.3: Let N a NLn x denote the Newton operator for approxi-

mating f(x) = Ln x, K = {h E C[a,b]: h(x) >.Ln x - 1 for all

x 6 [a,b]} where 0 < a < b and M.= K n R;[a,b] where m s n.
The best starting approximation (from M) for k Newton iterations
for the calculation of Ln x (in the uniform norm) is r* + co
where r* is the best uniform approximation to Ln x from REEa,b]
with deviation A (we assume 0 < 1 < 1) and c0 = Ln ﬁlﬂ%—A .
.EEQQES We have already noted that N is pointwise strictly monotone
at Ln x, pointwise fixed at Ln x, possesses Property J at Ln x,
maps K to K, and is one-sided from above. Since x<< l we also
have that r* 6 M. Proceeding as in Theorem 1.29 we find that

r* +-c0 has an error curve of the type described in Theorem 1.9

where c0 = Ln‘§l%E-L . The value of cO is found by solving the

51
equation
\Nan x)(x1) - N(r" + c>(x1>\ = \Nun x><x2) - N(r* + c>(x2>\
for c, where f(xl) - r*(x1) = x and f(xz) - r*(x2) = -x- Since

*
,Elﬂ%_k.> l we have that cO > O and so r + c0 6 M. ThuS, by

*

Theorem 1.9, r +cO is the best starting approximation for Ln x
*

with respect to N. Furthermore, by Theorem 1.30, r + c0 is the

. . . . k
best starting approx1mation for Ln x With respect to N ,

k = 1,2,... . Hence we have the desired result.|

Section 3: An Associated Operator

Let us once again turn our attention to the Newton operator
N a N x defined by N(h)(x) = h(x)(l +~x - Ln h(x)) (h E K as
previgusly defined) used for approximating f(x) = ex. Iterating with
this operator is computationally impractical since the time required
for the evaluation of the function Ln x is approximately that for
the evaluation of ex (say on the CDC-3600). To improve the
situation we would like to replace the logarithm function in the
above formula by a rational function r E R; with m,n sufficiently
small as to make the rational function more computationally desirable.

At present the value of this scheme is somewhat questionable. How-

ever, if we define

Nr(h)(X) = h(X)[1 +‘X - r(h(X))] ,

for h an element of some convex set K' with Nr: K' a K' and

r to be determined then we can show that an optimal starting approxi-

x . v .
mation rv(x) to e With reSpect to Nr (v = 1,2,...) eXists.

52

Let x 6 [a,b], 0 < a < b. Observe that for h(x) s ex (h 6 K' c K)

we have N(h)(x) 2 h(x)(1 +‘x é Ln ex) = h(x). Since ex+1 - ex is

a continuous function on the compact set [a,b], there exists 6 > 0

O
6
such that ex+1 - ex 2 60 > 39- for all x E [a,b]. Furthermore,

for fixed x, N(x,y) = N(y)(x) is a continuous function of y(x),

is monotone decreasing for y 2 ex and N(x,ex+1) = 0. Hence
6
+
N(x,ex 1 - 59) = 60(x) > 0. Set K' = {h E C[a,b]: eo(x) S h(x) s
6
ex+1 - 59}. By compactness and continuity considerations, eo(x) > O

for all x 6 [a,b] and so there exists a > 0 such that 30(x) 2 a-

6
Similarly, there exists 6 > O for which ex+1 - 59-5 6. Thus we
x+l 60
have 0 < a < B and a s 30(x) S h(x) s e - §—-S B for all

h E K'. Let r(y) be the best one-sided uniform approximation to

Ln y from below from R;[a,5] and suppose X = “Ln y - r(y)“ m .
L [0’38]
Define the operator Nr by Nr(h)(x) = h(x)(l +ix - r(h(x))) where

h E K'. It is clear that Nr(h)(x) 2 N(h)(x) for all h E K' and
x e [a,b] since a s h(x) s B for x 6 [a,b] and r(y) 5 Ln y
for all y 6 [a,b].

Claim: If r defined above belongs to a sufficiently large class

 

of rational functions (that is, m + n is sufficiently large) then
N : K? ~ K'.
r
£991: For com S h(x) s e". Nr<h)<x> 2 N(h)(x) 2 h(x) 2
60(X) and so Nr(h)(x) 2 30(x) for h E K' (regardless of the

class r belongs to). Now choose m,n sufficiently large, say

- . . . x+1 x
m = m0, n — no, such that m0 + n0 is minimal and e - e 2
6 6
(l-x)§Q-+-xeb+l. This we can do since ex+1 - ex > 59- for all

x 6 [a,b] and A a O as m,n a m (in fact as n a m). We shall

53

now consider Nr only where r is the best one-sided uniform approxi-
n

mation to Ln y from below from R:[a,B] 2 Rm0[a,B]. For such r,
0
x+l 60
Nrchux) - N(h)(x) = boom h(x) - r(h(x))) s w(x) s Me - 2—> s
6
b+l
x e - A 2—0
K6 16

Therefore Nr(h)(x) s N(h)(x) + x eb+1 - -§9.g ex + x eb+1 - —§9

6 X6 6

S ex+1 _ (1_x)§g _ eb+1 + x eb+1 _ _§g = ex+1 _ 59"

Hence Nr: K' a K' for r described above and so we are able to
iterate with the operator Nr.

With the above definitions of 60, r, 1 and K' we are able
to state the following existence theorem for the operator Nr'
Theorem 2.4: Let K' and r be as above for the fixed interval
[a,b] and let M = K' n R;[a,b]. (Here r is fixed and subject to
the above restrictions and m,n are arbitrary but fixed.) Then
for any arbitrary but fixed natural number k there exists 3* E M

such that “w(ex - N:(s*))H m s “w(ex - N:(S))H m for
L L

a,b]

all s E M Where w is any continuous positive weight function on

[a.b

[a,b].

(D
Proof: Select a sequence {sn}n=l’ sn G M, such that

k . k
1m l\w<e" - Nr<sn>>um = mf Hw<e" - Nr<s>>um (M, = M ., >
ndm SEM L [3’
which exists by the definition of infimmm. Note that sn 6 M for
6
all n implies a s 30(x) s sn(x) s ex+1 - 59 s B for all

m . .
x 6 [a,b], n = 1,2,... . Thus {sn(x)}n=1 is a uniformly bounded
sequence of rational functions and hence, by Lemma 1.12, there exists

a uniform bound on the coefficients of all the sn(x). Therefore

54

there exists a subsequence of {Sn}00 (which we shall also denote

n=1

w o — — a a
by {Sn}n=l) such that lim sn(x) - s(x) (a rational function) for

[I'm

all x 6 [a,b] except for points which are the zeros of the denominator

of s(x). If {xiz i = 1,...,L} denotes the zeros of the denominator

of s then it is clear that L s m. Using the uniform boundedness
of the sequence {s1}, it is straightforward to show that each xi

is a removable singularity of 8. Upon cancellation, we obtain a

* n
rational function 3 E Rm[a,b] which We claim is the solution we

6
*
seek. Observe that eo(x) S s (x) s eX+1 - EQ- and so 30(x) S
6
*
N:(S )(x) s ex+1 - 59-. We also have that if x0 6 [a,b] and

* *
11m sn(x0) — 8 (x0) then lim Nr(sn)(XO) = Nr(s )(xO). Inductively
W n_m
*
We may conclude that if x0 6 [a,b] and lim sn(x0) = 5 (x0) then
n—m

*
lim N:(sn)(x0) = N:(S )(xo). Let c > 0 be given. Then there exists

n-Ioo

N = N(e) such that “w(eX - N:(sn))\\co S I +‘e for all n 2 N where

I = inf “w(ex - N:(s))um. For any x0 6 [a,b] ~ U {xi},

SEM x i=1

‘w(x0)(e O - N:(sn)(x0))‘ S I + e for n 2 N and so

X X
\w(xo><e ° - nf(s*>(x0>>\ = ii: \w<x0><e 0 - N:(Sn)(x0))‘ s 1 + 6.

Thus Hw<ex - Nk<s*>>H = max \w<x><ex - Nk<s*><x>>\ =
r m x€[a,b] r
sup \W(x)(eX - N:(s*)(x))‘ s I + e. Hence ”w(eX - N:(s*))\\co =
x€[a,b]

x# xi

inf “w(ex - N:(x)>\\m . I
sea

CHAPTER III

FURTHER EXAMPLES OF OPERATORS DEFINING ITERATIVE SCHEMES

Section 1: Definition and Basic Results

In Chapter II we were concerned exclusively with iteration
using Newton's method. We shall now consider other iterative methods
for the solution of equations and examine them for various properties
which have been defined. First we introduce the important concept of
order which affords us a means of classifying the iterative schemes
which we shall discuss. The following definition is due to Traub [30]:
Definition 3.1: An iteration function ¢fz E1 a E1 defined by
¢f(xk) = xk+1, k = 0,1,2,... for finding a root 0 of the equation
f-I(y) - x = 0 (fixed x) is said to be of order p (or have order

of convergence p) if there exists a nonzero constant C such that

 

~ C as k a m. The number C is called the asymptotic

error constant.

Lemma 3.2: If the order of an iteration function exists, then it is
unique.

Proof: Assume the iteration function Tf has two orders, p1 and p2.

Set p2 = p1 + 6, 6 > 0. Then

 

‘¢f(xk) ' 0‘ = lim ‘¢f(xk) ' 0‘

1' =
im Pf+6 C2 ,

k—coo P 2 k—oco

 

55

56

1¢f(xk) - a‘
m

p = O, which is a contradiction. I
1

and 11
k—aoo

 

‘xk ' 0"
Remark 1: The iteration functions with which we are concerned are

continuous one-point iteration functions; that is, each successive

' a
iter te xk+1

iterate xk. No old information is reused.

Remark 2: A detailed discussion of order of convergence and related

is obtained using only information from the previous

topics may be found in Traub [30].

With each iteration function ¢f(x we can associate

k) = Xk+1
an iteration operator of: S H S C C[a,b] Via Qf(sk) = Sk+1 for
approximating the continuous function f E C[a,b]. The subset S

of C[a,b] depends upon f and [a,b]. We now embark upon a study

of iteration operators 6 associated with Known one-point iteration

f
functions ¢f for f(x) = ex and f(x) = xp, p > 1, p an integer.
The one-point iteration functions ¢f which we shall be concerned
with are order-preserving Pade rational approximations to a certain
class of iteration functions generated by inverse hyperosculatory
interpolation at a single point. This material first appeared in a
paper of Traub [31] and is presented in detail in Chapter 5 of his
book [30]. The iteration functions mentioned are all of integral
order and many are.widely known and have other origins (suitable
reference will be made if such is the case). Since linear conver-
gence (convergence of order 1) is not very interesting nor practical
we commence with a study of iteration functions of order 2. We

shall refer to the schemes generated by iteration functions (operators)

as iterative schemes.

57

Section 2: Iteration Functions of Order 2

1. First, we shall only mention that the Newton iteration function
already discussed has order of convergence 2.
2. The only other iterative scheme of order 2 which seems to exist

in the literature is Newton's method for computing roots using the

P
function g(y) = X-S—§-, p > 1, p an integer. Applying Newton's
Y

method to this function, we obtain the iteration function ¢f(xn) =
x P

[Tn ((p+1) - ii). For 0 < a < b. define K = W“ 6 C[a’bl‘
1

o < h(x) < ((p+1)x)p for all x 6 [a,b]} and @f(h)(x) =

X

P
héJ-{l-((p+l) - £E£§22—-) for h E K. With these definitionslit isl
P

easily checked that Q is continuous, é ° K a K and §f(xp) = x .

f f°

Set y(t) = %'((p+l) - tp). Then ¢'(t) = £31 (l-tp) and so
< O, t > 1
y'(t) = 0, t = 1 Therefore of is pointwise strictly monotone
> O, 0 < t < 1 .
l l
at x , p = 2,3,..., possesses Property I at xp and is one-sided from

.1

below at xp. (It should be remarked that the function y(t) analyzed
above actually only yields the fact that 9f possesses Property I.
However, the argument to obtain 6f pointwise strictly monotone is
entirely analogous and shall be omitted. This format will be followed

throughout our analysis.) In order to apply Theorem 1.27 we must 1
require that if rp denotes the best relative approximation to xp

from R;[a,b] (m,n arbitrary but fixed throughout this discussion)

. . ._l__ ._l_.
With deviation xp then 6rp E M for 6 E [1+xp a 1_xp

}. To insure
1/p -

 

this we require that 1 s (p+1)

which is apparently a rather
P (p+1)1/p + 1

58

severe restriction on 1 for large p (that is, r must be a very
P

good approximation to xllp). If we so restrict xp’ then Theorem 1.27

1 1

. o d ° —_ —
is applicable an there ex1sts yp €.(1+mp 9 1_xp

> such that y r
P P

is the best starting approximation for x p with respect to of.

Since iteration is well-defined and of is a one-sided operator we

have that yprp is the best starting approximation for x1]p with
k

reapect to 6f, k = 1,2,... . Analyzing the proof of Theorem 1.27,

it is clear that yp is the unique solution, in the above interval,

to the equation

 

 

®f(YPrP)(X1) - 1 = @f(vprp)(xz) _ 1
x1/p x1]p
l 2
1/ 1/ .
where rp(x1) = (1+7‘p)x1 p and rp(x2) = (l'kp)X2 p. Solv1ng, we
1/p

 

d ZIP(p+1)

f' h =

In t at Y” (1+x )p+1-<1-x >p+1
P P

Section 3: Iteration Functions of Order 3

1. The first iteration scheme of order 3 to be considered is attributed
to Halley (see [30]) and, as pointed out by Traub, has the distinction
of being one of the most frequently rediscovered iteration functions

in the literature. A recent rediscovery is due to J.S. Frame [3].

If we wish to find a root of the equation g(y) = f-1(y) - x = 0

(fixed x) then the iteration function ¢f is defined by

 

 

s(x“)
xn+1 = ¢f(xn) = xn - °
, g"<xn) (- g(xn>)
g (XII) + 2 g I (xn)

S9

A. First, let us define g(y) = Ln y - x. Then g(y) = 0 is
equivalent to y = ex (for fixed x). For this choice of g the

Halley iteration function becomes

 

2 + x - Ln x
x = ¢ (x ) = x n
n+1 f n n 2 - x + Ln xn
In order to approximate eX on an interval [a,b] (a,b arbitrary
with a < b) let us put K = [h(x) E C[a,b]: h(x) > ex-2 for all

x E [a,b]} and define the associated iteration operator

 

= 2 +-x - th(x)} . . ._
§f(h)(x) h(x) [ 2 _ x +‘Ln h(x) for h E K. With these defini

tions Q K a C[a,b]. If we wish to iterate with the operator 6

f: f

then we must further restrict K. We shall omit the restriction and
remark that it is straightforward. Clearly 6f is continuous for

h e K, §f(ex) = ex and Q is pointwise fixed at h E K. Setting

 

2 - Ln t -2 .
¢(t) ='t [a—quE—E] for t E (e ,m), we find that ¢'(t) =
2
- LL“ t) s 0. Thus y(t) is decreasing for t > e-2 and so
(2 +-Ln t)

. . . . X
6f is clearly p01ntWise strictly monotone at e and possesses

Property I at ex. In order to apply Theorem 1.27 we must require

n l 1
that 6r 6 M — K D Rm[a,b] for 6 E [1+X , 1'}.J where r denotes

x n .
the best relative approximation to e from Rm[a,b]. To insure
2

e - 1 . . .
this we must demand that 1 < -§--’ which is a very mild restric-

e + 1
tion on 1 since we always have I < 1. Let us remark that it may
indeed be the case that we need not restrict k at all as was dis-
covered earlier in the discussion of the Newton operator for approxi-

x . .
mating e . With the above restriction the best starting approx1ma-

tion for ex with reSpect to Qf is a multiple, y, of the best

60

X . . . . .
relative approximation to e and is given implic1tly by

 

 

@fwocxl) + eforxe)

x1 x2
e e

= 2

X

and x are such that r(xl) = (1+k)e 1 and r(xz) =

where 2

x1
x2
(1-x)e . This equation reduces to solving

2 - Ln y(lﬂl _ 2 32140-2.) _
Y(1+K)['2 + Ln y(l+k)} .+‘Y(1 >01.2 + Ln y(l-x)] — 2

which would be solved numerically on a computer. It is important to
note that in this case the best starting approximation need not be

independent of the number of iterations of 6f since 6f is not

a one-sided operator.
B. If we desire to find R/x (for fixed x), p 2 2, p an

integer, then we are confronted with finding a zero of the equation

g(y) = yp - x. In this case the Halley iteration function is given by

(p-1>x" + <p+1>x
1 = ¢f(x ) = x n .
“ (p+1)x: + (p-l)x

 

For the operator setting let K = {h E C[a,b]: h(x) > 0 for all

X E [a,b]} where 0 < a < b and set §f(h)(x) =

It is clear that éf
(p+1><h(x)>" + (p-1>x

h(x) [(1)-Damon" + (p+i>x Jf r h E K.
.1.
XP

is continuous and §f(xp )= p = 2,3,... - S€t w(t) =

P
t [(1)-nu + ipﬂi
(p+1>t" + (1)-1)

} where t > 0. Differentiating, we obtain
tp-l

<p+1>tp + (p-l)

for t > 0. Consequently Qi is pointwise strictly monotone at x

 

2
] 2 0 and hence w(t) is increasing
1

w'(t) = (p2-1> [

P

l

0‘

ﬁle!

and possesses Property I at x . Since §f(hn)(x0) a 0 as hn(x0) a 0
for each fixed x0 6 [a,b] and of is monotone we have that

of: K a K. Thus, repeated application of 6f is well-defined. If

1 l
M = K H R:[a,b] then Brp E M for 6 6 [.T:I; , I:X;H} Where rp

1
denotes the best relative approximation to x /p from R;[a,b]

with deviation AP. Therefore, by Theorem 1.27, the best starting

l/p

approximation for x , p = 2,3,... from M with reSpect to Q

l 1
1+ ’ 1-
xP xP

f

 

the uniQue solution hi < ) to the

is r for
Yp 9 Y1)

equation

eprrpuxi + ef(yprp>(x2>

 

 

1/p [Ilp = 2
X1 X2
1/p 1/p

= 1 = 1" . o
where rp(x1) ( +1p)x1 , rp(x2) ( )(p)x2 . As in part A , the
best starting approximation need not be independent of the number of
iterations of 6f. In the present literature the above scheme for

roots is often ascribed to Bailey (see [30]).

2. The second and final scheme of order 3 which evolves from the

work of Traub is defined by the iteration function

s(x“) g"<xn>gz<xn)
n - '(x ) - 3
g n 28' (xn)

 

 

xn+1 = ¢f(xn) = x

where we are again searching for a root of the equation g(y) = f-I(y) - x.
A. Let g(y) = Ln y - x. Then g(y) = 0 is equivalent to

y = ex (fixed x). For this choice of g, Pf is defined by

X

2
xn+1 = ¢f(xn) = Eﬂ-[l +-(Ln xn - x - l) ].

To approximate ex on an interval [a,b] (a < h) set

62
K = {h E C[a,b]: h(x) > 0 for all x 6 [a,b]}

and define @f(h)(x) = E§£2.[1 +-(Ln h(x) - x - l)2] for h E K.

Clearly Q is continuous, 6 ° K a K and §f(ex) = ex. Setting

f f'
W(t) ='% [1 + (Ln t - l)2] for t > O and differentiating we

2
obtain ¢'(t) = %(Ln t) 2 0. Hence W(t) is increasing for t > 0

. . . . X
and so of is p01ntWise strictly monotone at e and possesses

Property I at ex. If M = K n R;[a,b] then 5r 6 M for

6 Elliii" I%i-] where r denotes the best relative approximation

to ex from R;[a,b] with deviation x. Thus the best starting

approximation for ex from M with reSpect to of is yr for some
V E (Ii—x , ﬁx) (y is obtained by solving a certain transcendental

equation as in the above Work). Since the operator of is not one-

sided, the best starting approximation need not be independent of the

number of iterations.

l/p

B. For computing x , p = 2,3,... the iteration function is

defined by

X (l-p) 2 2
xn+1 = ¢f(xn) = 41—... i (£21; .. L) + Lil—12.22 ]
2p x

2 -1 2
p 5 (1-p)

where g(y) = yp - x. For 0 < a < b, set K = {h E C[a,b]:

h(x) > o for all x 6 [a,b]} and define ¢f(h)(x) =

2 2
lighot :1 - __x__p) + E—OS-Z'gl] , for h E K. With these
2p P (h(x)) <1-p>

definitions, 9f: K.a C[a,b] and so, as mentioned earlier, K needs

to be further restricted if we wish to iterate with of. Clearly

1/p 1/p

6f is continuous and §f(x ) = x . Setting

63

2 2
¢(t) 3 kg- t[(£§% - t-p) + LEE-l] for t > 0 and differentiat-

 

Zp (l-p)
ing we find that ¢'(t) = (29-1)§2-D [I - l—pl > 0. Thus if is
P t
pointwise strictly monotone at xl/p and possesses Property I at

xllp. Hence we may apply Theorem 1.27 as before and shall omit the

details. The lack of one-sidedness of § may again destroy indepen-

f

dence of the best starting approximation upon the number of iterations.

Section 4: Iteration Functions of Order 4

Next we consider a class of iteration schemes with order of
convergence 4. The first three schemes which we shall examine are
derived from the previously mentioned work of Traub.

l. The first fourth-order iteration function is defined by

 

2
s(x ) s"(x ) s(x ) s"(x )
X =¢(X)=x -—;—-'1— 1+ “ n + 2—1—“-—- -
n+1 f n n g (xn) 2gr(xn) g'(xn) 2g (xn)
m 2
g (xn) ] s (xn)
6g'(x ) u 2
n [g (xn)]
A. For approximating eX for some fixed x this iteration

function becomes

1 2 1 3
xn+1 - ¢f(xn) I xn[1 - (Ln xn - x) +-2(Ln xn - x) - 6(Ln xn - x) j

where g(y) = Ln y - x. To approximate ex on the entire interval

[a,b] (a < b) set x' = {h e C[a,b]: h(x) > o for all x 6 [a,b]}

and define ef by ef(h><x> = h(x)[1 - (Ln h(x) - x) +%(Ln h(x) - >02 -
%(Ln h(x) - x)3] for h E K'. As before of is continuous and

§f(e") = ex. Set Mt) = t[1 - (Ln t) +%(Ln t)2 - 6““ t)3] for

t > 0. Upon differentiation we obtain ¢'(t) = - g(Ln t)3 and so

64

x

it is clear that 6f is pointwise strictly monotone at e , possesses
x . . X .

Property I at e and is one-Sided from below at e . As With the

Newton operator in approximating ex, 6 being one-sided from below

f
implies that large values of h(x) (fixed x) yield negative values
for §f(h)(x). By the Implicit Function Theorem the equation

Qf(h)(x) = 0 defines h(x) > ex as a continuous function of x. In

our case the desired h(x) is obtained by solving

1 - (Ln h(x) - x) + %(Ln h(x) 4 x)2 - %(Ln h(x) - x)3 = 0 for h(x).

If to is the unique real zero of l - u + % u2 - é’UB = 0 then
x+r
h(x) = e 0. Note that l < r0 < 2. Setting K = {h E C[a,b]:
x+r

0 < h(x) < e O for all x 6 [a,b]}, we have that 6f: K a K. If
r denotes the best relative approximation to ex from R;[a,b]

with deviation 1 then the best starting approximation for Q

 

f,
_ - - __1_ _1_]
k - 1,2,... is yr prOVided 6r 6 M for 6 6 [1+1 , 1'1 . To
x
insure this we require that 6r(x) s 6(1+)\)ex s %iL-ex < e 0, or
r '1
0-
equivalently 1 < r With this restriction on 1 the best
0
e +1

starting approximation is yr where y is computed by solving

@ (Yr)(x ) Q (N/I‘Hx ) X X
f 1 = f 2 where r(xl) = (1+x)e 1 and r(xz) = (l-x)e 2.

 

 

x1 x2
e e

Upon simplification we find that y is the unique solution in

(Ii; "I%:) to the transcendental equation
' l 2 1 3
(1+x)[1- Ln y(1+1)+§(tn y(1+>.)) -g(Ln Y(1+>.)) J = (l-x)[1-(Ln Y(1->.)) +

%(Ln y(l-mz - %(Ln Y(1->.))3] .

65

l/p

B. For the approximation Of x , p = 2,3,... the iteration

function is given by

 

 

P _ P _ 2 P _

x senate: .n * .eeep-u "n
n+1 f n p xp 2p xp 6p2 P
n n n

where g(y) = yp - x. For 0 < a < b, set K = {h E C[a,b]: h(x) > 0

for all x E [a,b]} and define the associated iteration Operator

 

Of by
Q (h)(X) = - M[(g!!(xz )p-x + P-1 £h(x))p-x)2 +
f p (h(x))p 2P (h(x))?

 

(p-1M22-11<®(X))p-X)3 _ p]

 

2
6p (h(x))p
for h E K. of is continuous and §f(x1/p) = xllp. Set
2 3
..._£ _ 19 2:1 _ 1P Q-IMZP-l) .19
Mt) p10 (9) + 2p(1 (9) + 2 (1 - (9) -p]

6p

for t > 0.

p 3
Then ¢'(t) = '1 2 '1 3 '1' (5—3;) and so W(t) is increasing

6p t
for t > 1 and decreasing for O < t < 1. Therefore 6

f
l
strictly monotone at x /p’ p = 2,3,..., possesses Property I at x

is pointwise

1/p

and is one-sided from above. The one-sidedness of of yields

 

Of: K q K and hence we are able to iterate. Setting M = K H R;[a,b],
. l l
we obtain that Or E M for 6 E , where r denotes
F 1+1? l-xp p

l
the best starting approximation to x /p from R:[a,b] with deviation

xp. Hence, according to Theorems 1.27 and 1.30, the best starting

l/p

approximation for x , p = 2,3,... with reSpect to 6:, k = 1,2,...
, l l . . .
is yprp for yp E (11):: , ti) the unique solution to the equation

66

 

 

Q(Yr)(X) <DWI'HX)
f LP 1 _ f p) 2 = 1/p
gE/P _ xi]? where rp(x1) (l+),p)x1 ,
l/p

= 1- .
rp(x2) ( x1)»:2
In the next two iteration schemes derived from the work of

Traub we shall only be concerned with approximating ex. The results

l/p

for the approximation of x , p = 2,3,... are analogous although

the algebra involved is much more tedious.
2. The next iteration function Of order 4 with which we shall be

concerned is defined by

s(x“)
s ' (xn)

 

(xn)'-7c -

“ g"<xn)g(xn> g”’<xn> (g"<xn>>2 (gap)2
1' 2 + "1‘7 ' 2 2
2(8'(xn)) 4(g'(xn)) (g'(xn))

6g (xn
For the approximation Of ex we have g(y) = Ln y - x and the

xn+1=¢f

 

 

- a a o a = = 1 -
iteration function is given by xn+1 ¢f(xn) xn [

(Ln xn - X)

 

1 1 2 Set K = {h E C[a,b]: h(x) > 0
l + -2-(Ln xn-x) + ﬂan xn-x)

for all x E [a,b]} where a < b and define

 

ef(h>(x) = h(x)? - (in “’0 " "l l for h e x .
l + %(Ln h(x) - x) +-%§(Ln h(x) - x)2
Then 6

is continuous, Q - K a K and Qf(ex) = ex. Setting

f'

_ _ Ln t
¢(t)-t[1 1+-]-'Lnt+-1—(Lnt)2}
2 12

f

 

l 4
w(m t)
we find that ¢'(t) =

 

2 0 for all t > 0.
(l + '2; Ln t + i—2(Ln t)2)2

Hence 6f is pointwise strictly monotone at ex and possesses

67
Property I at ex. The conditions of Theorem 1.27 are all satisfied

and so we are able to conclude that the best starting approximation
for eX on [a,b] (with respect to 6f) from M = K H R;[a,b]
is a positive multiple (y) Of the best relative approximation to
ex on [a,b] from R;[a,b]. Let r denote the best relative

x

approximation to e on .[a,b] with deviation X- Then the value

of y we are seeking is Obtained by solving the equation

6 (vr) (X ) Q (vr) (X ) x
f 1 +- f 2 = 2, where r(xl) = (1+x)e 1 and r(xz) =

 

 

X1 X2
e e

x
(l-x)e 2, which would be effected numerically on a computer. Since

6f fails to be one-sided the best starting approximation need not

be independent of the number of iterations.

3. The final iteration function with order of convergence 4 obtainable

from Traub's work which we shall consider is defined by

 

g(xn) [yap +<g”’(xn) _ <g"<xn))2 s(xn)
g'(xn> 2g'(xn) 6g'(xn) 4(g'(x ))2 g'lxn)
n

=¢£(xn)=X ' .

 

xn+1

 

“ g"<xn) _ go“) ( (s"(xn))2 _ g”’<xn> )
23 (Kn) 371x“) 2(g'(x ))2 6g (xn)
n

For the approximation of ex (fixed x E [a,b]), g(y) = Ln y - x

and the iteration function is defined by

 

(Ln xn - x)(Ln xn - x - 6)
xn+1 = ¢f(xn) = xn 1 + 2(Ln xn - x +'3) ] °

TO approximate ex on the entire interval [a,b] set
K = [h E C[a,b]: h(x) > em.-3 for all x E [a,b]}
and define the associated iteration Operator §f(h)(x) =

my + Le;(:§xg(;)xggn+hgi - x - 6> ] where h t K. It is

68

clear that 6 is continuous and 6f(ex) = ex. Furthermore we shall

f

. . X
demonstrate that 6 is one-Sided from above at e and consequently

 

 

f
___ (Ln t)(1n t - 61] 1_
ef(x) c K. Set ¢(t) t [1 +- 2(Ln t +_3) for t > 3 .
3 e
Differentiating, we obtain w'(t) = ‘LLn t) 2 and so 6f is
2(Ln t +-3)

. . . X X
p01ntWise strictly monotone at e , possesses Property I at e and

is one-sided from above at ex. TO be able to apply Theorem 1.27
3

we must further demand that 1‘1 83-1

e +1

 

where A denotes the error

. . . . . X
assoc1ated With r, the best relative apprOXimation to e on [a,b]
1'1 . . . .
from Rm[a,b]. As noted before, this restriction is extremely mild
since we always have K < 1. With this restriction Theorems 1.27 and
x

1.30 are applicable and hence the best starting approximation for e

k
on [a,b] with respect to 6f, k = 1,2,... is yr where y is the
ef(vr)(x1> ef(vr)(x2>

 

 

unique solution to the equation x = x in the
1 1 e e 2
interval (II): , 1.3:) (x1,x2 as before). For our definition of

6 this reduces tO solving a third degree equation in Ln y.

f
4. Finally we shall examine an iteration function also Of rational
form which has order of convergence 4 and is due to Kiss (see [30]).

As above we shall only consider the iteration function Pf correspond-

ing to g(y) = Ln y - x. The iteration function Of Kiss is defined

by

 

s(x ) < 8"(Xn) g(x )
g'(xn) 2g'(xn) g'(xn))

 

X =¢(X)=X-
“*1 f n “ g"(xn> s(x“) g“(xn)g2<xn>

 

 

, +
g ‘xn’ g7’21) 6g'<xn>(g'(xn>)2

For the special choice g(y) = Ln y - x we have

69

 

( ) [ (Ln xn - x)(1 + %(Ln xn - x)) ]
x = ¢ x = x 1 - .
n+1 f n n 1+(Ln xn-x) +-%(Ln xn - x)2

. X .
TO approx1mate e on the interval [a,b] Where a < h set

K" = {h E C[a,b]: h(x) > 0 for all x E [a,b]} and define 6f by

(Ln h(x) - x)(1 + %(Ln h(x) - x))

 

Qf(h)(x) = h(X) [I - 2 X for h E K".

1 + (Ln h(x) - x) +-%(Ln h(x) - x)

It is easily shown that 1 +-(Ln h(x) - x) +~%(Ln h(x) - x)2 # 0 for

 

Ln t(1 +-%-Ln t)
all h e K" and x 6 [a,b]- Set 1(t) = t 1 ' 1 2
1+Lnt+'3-(Ln t)

1 3 3
- gun t) (Ln e t)

 

for t > 0. Then ¢'(t) = 2 . Note that

(1 + (Ln t) + §<1n oz)
¢'(t) = 0 for t = 1 and for t - e-3. To inSure that Of will

be pointwise strictly monotone at ex and possess Property I we shall
demand that t > e-3. If we set K' = [h E C[a,b]: h(x) > ex.3 for
all x E [a,b]}, we have that Qf: K' a c[a,b], 6f is continuous,
Qf(ex) = ex, 6f is pointwise strictly monotone at ex, possesses
Property I at ex and is one-sided from below. For each fixed

x E [a,b], §f(h)(x) is decreasing for h(x) > ex and increasing for

h(x) < ex and §f(h)(x) = 0 at h1(x) = ex3/6, h2(x) = exi/6. If We set

2 x+2

K = {h e c[a,b]: ex' < h(x) < e for all x 6 [a,b]}

then it is easy to show that 6f: K a K and the aforementioned pro-
perties Of 6f are preserved. In order to apply Theorem 1.27 we

n 1 1
must require that 6r 6 M - K n Rm[a,b] for 6 E 11+1.’ 1'X J

. . x
where r denotes the best relative approximation to e from

R:[a,b] with deviation x. This requirement is equivalent to

70

2
k <.E§_:_l ‘Which is again not a severe restriction. Applying
e + l

Theorems 1.27 and 1.30 (subject to the above restriction) we know
_1_ _1_
1+), ’ 1-),

best starting approximating to ex on [a,b] from M (with respect

k
f,

that there exists a unique y E2 ( ) such that yr is the

to 6 k = 1,2,3,...). The value of y is Obtained by solving the

ef(vr>(x1> ef(vr)(x2)

equation = -—- where x and x are such that
x1 x2 1 2
e e

 

 

x
X
r(xl) = (1+1)e 1 and r(xz) = (l-x)e 2. For the given operator 6f

this equation becomes

2
(1+1) [ 6 - (Ln y(1+>.D 2] ...
3 + 3Ln y(lﬂ) + (Ln y(1+x))

_ 6 - (Ln y(l-n.»2
(1 i) 2
3 + 31a y(I-x) + (Ln y(l-i))

and we are confronted with solving a certain quartic equation in

tiny-

Section 5: A Sequence of Schemes for Approximating /x

In this section we shall consider a sequence of iterative
sOhemes for approximating 1/x which were introduced by G. Merz [12]
and subsequently analyzed by Meinardus and Taylor [11].. The itera-

tion function(s) of Merz are defined by

(%+Mmk+annhﬁ

(xn +,/'x)k - (xn - [x)k

 

xn+1 = ¢k(xn) = /x

for each fixed integer k 2 2 (¢k would be written in the

16%

earlier notation, where f(x) =./r). Let us note that, for each k,

¢k possesses a purely algebraic expression. Merz has shown that for

71

each fixed integer k 2 2 the above sequence {xn} converges to

/x starting with any initial guess x0 > 0 and furthermore the order

of convergence is k. Observe that for k = 2, ¢2(y) = %(y +»§)
which is the well-known Newton iteration scheme.

It is interesting to note that ¢k(¢m(y)) = (y) and so

Qk-m

one has the choice of repeated iterations with a lower order formula
or fewer iterations with a higher order formula to obtain the same
final iterate.

Since we wish to approximate /x on an interval [a,b],
0 < a < b, we consider the associated iteration Operator(s) as before.

Set

= n e c1a.b1= be) > o for an x e 1am

and define ek(h)(x) =/x (h(x) +f"): + 0‘93 1"): for h e K.
(h(x) +/X)k ' (h(x)- 'k/X)

6k: K a K, Qk

/x for each k2 2. Set ¢k(t)=£1+t):+ (1 9: for t >0.
(1+t:)k - (1-t)k

Upon computing ¢£(t) it becomes clear that for k even, 6k is

pointwise strictly monotone at Jx, one-sided from above at /x and

It is easily seen that is continuous and 6k(/x)=

 

possesses Property I at Jx. Likewise, for k Odd 6k is pointwise
strictly monotone at /r and possesses Property I at /r. Hence,

by Lemma 1.5 and Theorem 1.27 we have that for each k the best
starting approximation for /r on [a,b] from M = K H R:[a,b]
(with respect to 6k) is a multiple Of the best relative approxima-
tion to 1/x on [a,b] from Rn[a,b]. Furthermore, if k is even

we have that Q m = Q: = §k(§: 1), m = 2 ,3,... have the same starting

k
value by Theorem 1.30. If r denotes the best relative approximation

to /&:on [a,b] with deviation 1, then yr is the Optimal starting approxi-

72

1 l ,
tion for 6km (k even), m = 1,2,... wherev E (lilti- , n) is the

 

 

9 (Yr)(x ) Q (Yr)(x )
. . . k l k 2
unique solution to the equation - l = -
Jkl /k2
where x1 and x2 are such that r(xl) = (1+11/k1 and r(xz) =

. . 1 .31.
(I'KXCXZ- USing the fact that k is even and y E (.L+1 , 1‘1 ) ,
r

we obtain y = ('rLTE) Hence (I—ljf) 1: is the best starting
l-x 1-x

approximation for every even ordered scheme. For the case k = 2

and R:[a,b] replaced by Pn[a,b] this result was earlier Obtained

(independently) by P.H. Sterbenz and C.T. Pike [24] and R.F. King

and D.L. Phillips [7]. Additional background concerning these and

associated results appears in Taylor [27].

In the case that k is Odd, 6k is pointwise strictly mono-
tone at /r and possesses Property I at /x but is no longer one-
sided. Hence we may no longer conclude that the best starting approxi-
mation for /r is independent Of the number Of iterations. Computing

the best starting approximation for a single application Of 6k

(according to Theorem l.27),we Obtain yr where y satisfies the

 

 

@ (Yr)(x ) Q (Yr)(x )
. k 1 k 2
equation - 1 = l - and r, x1, x2 are as
fx1 fx2
before. This reduces to solving
1 k
K—J-J‘“) (—J—-‘*”“)
+ 2
1 + xv - Y 1 - Xv - Y
. . l 1
and the unique solution y E (m , Iii) must (apparently) be found

using numerical methods on a computer. Note that y, and hence the
best starting approximation, depends on k which is not the case
for k even. Since 9 m = 6:, 6: possesses Property I for

k
m = 1,2,... . Thus Theorem 1.27 is applicable for any number Of

73

iterations of QR. For k Odd, the best starting approximation for

 

 

6m, m = 1,2,... is y r where y is the only solution of
k m m

m m

k k
1 - Xym + ym 1 + me + ym

+ = 2 in the interval

1+Mm-Ym l-xvm-Ym
_1_ _1_
1+1’1-x

Remark 1: In this chapter we have been concerned with approximation

using the relative error. The iteration functions could also be

analyzed in the uniform approximation setting. In this case we would

be checking for Property J. However in view Of some earlier remarks

we should probably examine the function g(y) = ey - x; that is,

consider an approximation Of the function f(x) = Ln x on some

interval [a,b] (O < a < b).

CHAPTER IV

OPTIMAI.STARTING APPROXIMATIONS FOR
A GENERALIZED WEIGHT FUNCTION

Section 1: Introduction and Examples

In [13], [14] D.G. Moursund introduced the concept of uniform
approximation using a generalized weight function. The original
development here was concerned with polynomial approximation. Applica-
tions pertaining to the notion Of a generalized weight function and
concerning both polynomial and rational approximation appear in [15],
[16] and [26].

Definition 4.1: Let X be a compact metric space. A real-valued
function W(x,y) defined on X X (-m,m) is called a generalized
weight function if:

(i) W(x,y) is continuous on X X (-m,m)

(ii) Sgn W(x,y) = sgn y for all x E X (in particular,

W(x,0) = 0).
(iii) For each x E X, W(x,y) is strictly monotone increasing

in y with lim [W(x,y)\ = m.

y-OCD
The problem.which we shall consider is the following: given
x : [a,b], f e K c C(X) and M = K n R:[a,b] (m,n fixed), find
* x .
r E M such that “W[x,6(f)(x) - Q(r )(x)]“ = inf “W[x,§(f)(x) -
r611

Q(r)(x)]“ where 6: K a C(X) is continuous and “W[x,§(f)(x) -

e(r>(x)]\\ = max \WEx,e(f>(x) - i<r><x>]\.
XEX

74

75

Before proceeding to the basic results we shall give some
examples Of generalized weight functions (here we take 6 to be the
identity operator).

y corresponds to ordinary uniform approximation.

Example 1: W(x,y)
Example 2: W(x,y) E f%;)' corresponds to ordinary relative error
approximation provided f ¥ 0 on X (f is the function being
approximated).

Let X be compact, X G [a,b], K C C(X) and 6: K a C(X) a
continuous operator which is pointwise strictly monotone and point-
wise fixed at f E K.

Definition 4.2: The Operator 6 is said to be sign-preserving if
for each k E K, sgn(f(x) - k(x)) = sgn(6(f)(x) - Q(k)(x)) for all
x E X.

Definition 4.3: The Operator 6 is said to be sign-reversing if
for each k E K, sgn(f(x) - k(x)) = -sgn(§(f)(x) - 6(k)(x)) for all

x E X.

Section 2: Existence, Uniqueness, and Characterization Theorems

In this section we shall be concerned with existence, unique-
ness, and characterization of optimal starting approximations using a
generalized weight function.
Theorem 4.4: Let K be a convex subset Of C(X) and 6: K a C(X)
a continuous Operator which is pointwise strictly monotone and point-
wise fixed at f E K ~ M. Suppose further that 6 is sign-preserving,

sign-reversing or a one-sided operator and let W(x,y) be a generalized

76

weight function on X. Then, for M = K H R:[a,b] relatively Open
in Rgla,b], r E M is a best starting approximation for Q(f)
relative to W(x,y) if and only if there exists {Xi}§=1 C X,

N = 2 +-max[n + sq, m + 5p], for which

(i) x1 < x2 <...< xN,

(ii) \WExi.e(f)(xi) - s(r>(xi)]\ = Mme) (x) - n(r)(x>]\\
(iii) sgn(f(xi) - r(xi)) = (-l)i+lsgn(f(x1) - r(x1)),

i=1,2,...,N O

nggﬁ: We shall prove the theorem for 6 a one-sided Operator from
below and for 6 sign-preserving and remark that the other cases
follow in a similar manner.

Case 1: Q(f) 2 Q(h) for all h E K.

(Sufficiency) Suppose there exists {Xi}§=1 C X such that
\WExi.e(f>(xi> - e<r)<xi>1| = \\WIx,i<f>(x> - Mono)“ and
sgn(f(xi) - r(xi)) = (~l)i+lsgn(f(x1) - r(x1)). Note that f E M

implies that there exists xO E X such that f(xo) # r(x Thus

).
o
o(£)(x0) 9‘ Q(r)(xo) and so \w[x0,o(£)(x0) - Q(r)(xo)“ e O which
implies HW[x,6(f)(x) - 6(r)(x)]u ¥ 0. Let r* E M be such that

*
\\WIx.n(f>(x) - n(r >(x>]u s \\WIx.n(f>(x> - amen“. Since
Sgn WIX.®(f)(X) - Q(r)(XH = Sgn(¢(f) (X) - ¢(r)(X)) = 1 0r 0 for
all x e x and all r e M we have W[xi,<I>(f)(xi) - 6(r*) (xi)] =
IWIx,.n<f>(xi) - i(r*)<x,)1| s \Wlxi.§(f)(xi) - e(r>(xi)]\ -
WEx,.e(f><xi> - n(r><xi)]. Hence Mimi) - n(r*)(xi> s
Q(f)(xi) - Q(r)(xi) since W(x,y) is strictly monotone increasing
in y for each fixed x E X. If r(xi) > f(xi) then r(xi) 2 r*(xi).

*
Otherwise r (xi) > r(xi) > f(xi) and so

77

Q(f)(xi) - i(r*>(x,> = |e(f>(x,> - n(r*>(x,>| > \e(f)(x,) - n<r)(x,>\ =
Q(f)(xi) - Q(r)(xi) which is a contradiction. Similarly r(xi) < f(xi)

* *
implies r(xi) S r (xi). Thus, by the standard argument, r - r

*
possesses at least N-l zeros and 30 r E r . Hence we have the

desired result.

(Necessity) Assume there exists {x9221 s: X, N' maximal, N' < N

on which ‘W[xi,§(f)(xi) - t(r)(xi)]l = HWLx.n(£>(x> - e<r>(x>1H

and sgn(f(xi) - r(xi)) = (-l)i+lsgn(f(x1) - r(x1)). About each xi
construct a relatively open interval 11 such that all extreme pognts
= {x e x: |W[x,§(f)(x) - Q(r)(x)]‘ = uw[x,e(f)(x) - e(r)(X)]H} C U 11’

_. _' i=1
I H 1k = ¢ for j i k and for all extreme points in each Ii

J
the function f—r has constant sign. In what follows, we make
explicit use of the fact that ‘W[x,§(f)(x) - 6(r)(x)]\ =
W[x,§(f)(x) - é(r)(x)]. Let
Y = x n ( n 1,)
i=1

where Ii denotes the complement Of Ii with reSpect to [a,b].
Y is a compact subset of X and W[x,6(f)(x) - Q(r)(x)] <
HW[x,6(f)(x) - Q(r)(x)]“ for all x E Y. Hence by the continuity
of 'W (as a function Of x) there exists p > 0 such that

m::'W[x,Q(f)(x) - Q(r)(x)] s Hw[X,e(£)(x) - d(r)(x)]u - p.

x

For each i, let

 

Vi = {x E X n I1:W[x.§(f)(x) - @(r)(x)] 2 er-X’QG) 0% ' 20') 00-1“

and sgn(f(x) - r(x)) = sgn(f(xi) - r(xi))} .

78

Nl

NOW V = U V1 is a compact subset Of x and, since f(x) - r(x) i 0
i=1

on V, there exists n > 0 such that [f(x) - r(x)‘ 2 n on V. let

2, = {x e x n f,: W(x.n(f>(x) - t(1-Mm 21MXM9<X>2- “001111

 

and sgn(f(x) - r(x)) # sgn(f(xi) - r(xi))} .

Observe that W[x,§(f)(x) - 6(r)(x)] < HW[x,6(f)(x) - 6(r)(x)]u for all
N.

xEZ = U Zi by the construction of the intervals {Ii}. Finally, let
i=1

 

u1 = {x e x n f1: W[x,§?(f)(X) - t(1-mo] s WL""’®"°,' “”0935.

N
and set U = U Ui' Then by continuity there exists 6 > 0, 6 s p,
Such that

max WIXs§(f)(X) - Q(r)(X)] S HWIX,§(f)(X) - Q(r)(X)]H - 6-
xEZUU

Given T > 0 we can, using Lemma 1.8, select r E R:[a,b] such that

(1.2) holds. By the continuity of 6 and W we can select > 0

e1
such that for O < e s 61’ re satisfies (1.2) and

max WIX.¢(f)(X) - Mr >(x)] < \\W1x.i(f)(x) - i<r><x>1n - g .
XEYUZUU 3

Next, by continuity of f and r, we can select 32, 0 < €2 S e1,

such that for 0 < a 5 e2, r lies strictly between f(x) and
e

N'

r(x) on V = U V . Then, since 6 is one-sided from below at
i=1 N'

f, we have 6(f)(x) - Q(re)(x) < @(f)(x) - Q(r)(x) on. V = Vi

i=1
where 0 < s S 32. Hence W[x,§(f)(x) - é(r€)(x)] <

WIX:§(f)(X) ' 9(T)(X)] 5 lexr§(f)(x) ‘ Q(r)(x)]u on V if
0 < e S 62. Therefore max W[x,§(f)(x) - @(re)(x)] <

XEV
HW[x,§(f)(x) - Q(r)(x)]u. Consequently for e Such that 0 < e S 62,

79

r is such that
e

lex,§(f)(X) - Q(r€)(X)]H < lex,§(f)(X) - Q(r)(X)]H-

Finally, since M is relatively open in R:[a,b], we can select 33
with 0 < 63 S 92 so that r63 6 M and “WEX,§(r)(x) - Q(re3)(x)]H <
HW[x,§(f)(x) - Q(r)(x)]H and hence the result.

Case 2: Q is sign-preserving.

(Sufficiency) Suppose that there exists {xi}§=1 C X such that
|W[x1,<b(f)(xi> - p<r>(xi)]\ = \\W[x,p(f>(x) - s(x-Hm“ and
sgn(f(xi) - r(xi)) = (-l)i+lsgn(f(x1) - r(x1)). Let r* E M be such
that HWEx,p(i><x) - s(r*><x>]n s HWEx.p<f><x> - i<r><x>]u. Suppose
r(xi) < f(xi). Then r(xi) s r*(xi) for otherwise r*(xi) < r(xi) <
f(xi) and so p(f)(xi) - Q(r)(xi) < p(£)(xi) - Q(r*)(xi) which
implies \WEx.i(f)(xi) - i<r*)<xi>]l = WExi.p<f)<xi> - p(r*)(xi)] >
Wlxi,§(f)(xi) - s<r)<xi>] = \Wlxi,§(f)(xi) - p(r)(xi>]\. which is a
contradiction. Similarly r(xi) > f(xi) implies r(xi) 2 r*(xi).

*
Thus, as in Case 1 above, r E r and we have the desired result.

NI

(Necessity) Assume there exists {xi}i-l

:x,N' <N,N' maximal
on which \win.p<f>(xi) - p(rxxinl = \Mx,p(£><x) - mum“ and
sgn(f(xi) - r(xi)) = (-l)i+lsgn(f(x1) - r(x1)). About each xi con-

struct a relatively Open interval Ii such that all extreme points =
I

2

N
{x e X: \WEx,p(i)<x> - imam = \\W[x,i<i>(x> - i<r><x>1m c U 1

i:
Ij ﬂ Ik = ¢ for j i k and for all extreme points in each Ii the
function f-r has constant sign. Construct the sets Y, V, Z and

U as in the proof for Case 1. Using the same arguments as in the

above proof we can find 6 > 0 Such that

80

max |WEx.p(f)<x) - p(r)<x>l| s \\W[x.6(f>(x) - p(r><x>l\\ - p .
xEYUZUU

By Lemma 1.8 we can select re 6 R2La,b] satisfying (1.2). By the

continuity of Q and W we can select > 0 such that for

6l
O < e s 61’ re satisfies (1.2) and

max 1w[x,o(t)(x) - p(re)(x)]\ s HW[x,o(f)(x) - o(r)(x)]u - %-.
xEYUZUU

Next, by continuity of f and r, we can select 32, O < e2 S e1.

such that for 0 < e s 32, re lies strictly between f(x) and r(x)
NI

on V = U V.. If f(x.) < r(x ) on V. then f(x) < r (X) < r(x)
i=1 1 l i 1 g

for all x 6 vi and so o(£)(x) - Q(r)(x) < @(f)(x) - p(r€)(x) < o
(0 < e s 62) which implies \W[x,o(f)(x) - “r900“ e
-w[x,p<f)(x> - i<re)(x>l < -W[x,§(f)(X) - i<r><x>l = lW[x.i(f)(x> -
Q(r)(X)]\ and so \Wlx,§(f)(X) - Q(re)(X)]\ < lex,§(f)(X) -
Q(r)(x)]“ on Vi' Similarly, if f(xi) > r(xi) on Vi we obtain
\WEx,p<f>(x> - p<r6>cx>l| < \\W\:x,p<i><x) - p(rumu for o < e s 62-
Thus 32; ‘W[x,§(f)(x) - Q(r€)(x)]‘ < “W[x,o(f)(x) - Q(r)(x)]“. Con-
sequently, for e subject to o < e s e2, re is such that
HW[x,§(f)(x) - Q(re)(x)]ui< Hw[x,i(f)(x) - o<r)(x)]u. Since M is
relatively open in R:[a,b] we may choose e3 with o < 33 5 e2
such that r63 6 M and Hw[x,i(£)(x) - p(re3)(x)]n <

\\W[x,o(f)(x) - Q(r)(x))“. Thus we have the desired result.l
Corollary 4.5: If Q(f) has a best starting approximation under the
setting of the above theorem then it is unique.

.gggggz Using the properties of the generalized weight function,
uniqueness is established in the same way that sufficiency was

established in Theorem 4.4.I

81

As noted earlier we can only guarantee existence of a best

starting approximation in the case where X = [a,b] and K = C[a,b].

 

Theorem 4.6: Let Q: C[a,b] w C[a,b] be a continuous operator which
is pointwise strictly monotone at f E c[a,b] ~ M and pointwise
fixed at each k e C[a,b] where M = R;[a,b] (m,n fixed). Further
assume that for each x 6 [a,b], |§(ri)(x)\ a m whenever {ti} C M
and ri(x) a i m. Then there exists a best starting approximation
for Q(f), relative to the generalized weight function W(x,y), from
the class Q(M). V

_Egggf: The result follows at once by combining Lemma 1 of [17] with
Lemma l.11.|

Remark 1: Similar results corresponding to restricted range approxima-
tion and osculatory rational interpolation for the operator setting
may also be obtained in the generalized weight function setting.
Remark 2: Theorem 4.4 yields a generalization of the work which we
have done in Chapter I. If we choose W(x,y) = w(x)-y (w(x) a
positive, continuous weight function on X) then the problem of

determining r E M such that

\\WEx.i<f>(x> - p<r>(x>l\\ = inf \\WEx,s<f)(x> - p(s><x>lu (4.1)
$6M.

reduces to determining r E M such that HQ(f) - Q(r)H =

inf “Q(f) - @(s)H where HhH = max ‘w(x)h(x)\, which is the problem

S€M XEX

studied earlier. A question which arises is the following: can the

problem (4.1) be cast in terms of another generalized weight function
w for which the solution is already known? In the problem (4.1)

we are approximating Q(f) be elements of Q(M) with reSpect to the

generalized weight function W. Suppose we could find another

82

generalized weight function N such that the problem (4.1) would
reduce to finding r E M Such that uﬁ[x,g(x) - r(x)]H =
inf “ﬁ[x,g(x) - s(x)]u where Q(f) = g. This problem has already

384
been studied by Moursund and Taylor [17].

CHAPTER V

AN IMPROVED NEWTON ITERATION SCHEME FOR APPROXIMATING EXP(X)
WHICH IS 0 PI‘IMAL

Section 1: Introduction

In a recent paper I. Ninomiya [18] suggested a modification
of the Newton-Raphson method for calculating square roots which re-
duced the error at each step of the algorithm. This method was
generalized by R.F. King [6] to a modified Newton method for cal-
culating integral roots. Both Ninomiya and King show that the

improved Newton iteration converges by a factor of approximately

[1

2 faster than the usual Newton iteration (comparing the n-th
iterate of each). R.F. King notes that in actual practice one would
not use this scheme for calculating a p-th root on a machine which
has p as its floating base since in this case an exponential add
order is much faster than a multiplication, and it would usually be
better to use Newton's method with an extra iteration (if necessary).

G.D. Taylor [27] has shown that the suggested modification of
Newton's method above for integral roots is optimal in the sense of
our work. We shall now show that an analogous modification of Newton's
method for f(x) = ex yields an improved iteration scheme for
approximating this function.

Let [a,b] be a fixed interval and K a convex subset of

C[a,b]. Recall that if N(h)(x) = h(x)(1 + x - Ln h(x)) (h e K)

83

84

is the Newton Operator used to approximate ex on the interval
[a,b] then N(h)(x) s ex for all h E K and x 6 [a,b]; that is,
N is one-sided from below. With this in mind we are able to con-
struct an improved Newton iteration procedure for computing ex
which is optimal and which differs from the usual Newton method by

a multiplicative factor at each step. At each step of the algorithm
the multiplicative factor has the effect of "translating" the pre-

vious Newton iterate upward sufficiently to halve the relative error.

Section 2: The Improved Newton Iteration Scheme

Let [a,b] be a fixed interval as above and define K by
l+x .
K = {h E C[a,b]: O < h(x) < e for all x 6 [a,b]}. For fixed
nonnegative integers m and 5 let R: denote the usual class of
rational functions defined on the interval [a,b] (See Chapter I).
Set M = K n R: (assume M # ¢) and define N(h)(x) =
h(x)(l +ix - Ln h(x)) for h E K. This is, of course, one Newton
iteration for calculating eX starting with the initial guess h(x).
n n , _

Set E+'= {(c1,...,cn) : (c1,...,cn) E E and c1 > O, 1 - 1,...,n}
where En denotes Euclidean n-Space. For (c1,...,cn,h) E Ei'x K

formally define the sequence

y0(X) = h(X)
(5.1)
yk(x> = ck N(yk_1><x>. k = 1.2....
which may have no meaning for some choices of c1,...,cn.
° = o 1 0
Define Sn {(c1,...,cn,r) . ci 6 E+, r E M and yn_1 E K}, that

is, (c1,...,cn,r) 6 Sn if and only if y1 = c1N(r) E K, y2 - c2N(y1) =

c2N(c1N(r)) E K,..., yn_1 = cn_1N(yn_2) 6 K. Thus Sn consists of

85

all tuples (c1,...,cn,r) for which r E M and the iteration above

. . . +

is well-defined. Now define Tn: Sn a C [a,b] = {f E c[a,b]: f(x) > O
for all x E [a,b]} by Tn(c1,...,cn,r)(x) E yn(x) (yn(x) given by (5.1)).
Observe that (c1,...,cn_1,cn,r) 6 Sn implies that (c1,...,cn_1,c,r) E

1
Sn for any c E E+,

(c1,...,ck,r) E Sk’ k = 1,2,...,n and
Tn(c1,...,cn_1,cn,r) = chn(c1,...,cn_1,1,r) = an(Tn_1(c1,...,cn_1,r)).
We shall show that there exists a unique best approximation to ex

in the relative norm from Tn(Sn); that is, there exists

(cin),...,cén),r*) 6 Sn for which

eX - Tn(c§n),...,c§n),r*)(X)

ex - Tn(c1,...,cn,r)(x)

X
e

 

 

 

 

 

 

 

X
e

 

I (5.2)

for all (c1,...,cn,r) 6 Sn with equality holding in the above if

and only if c

*
= cin), i = 1,2,...,n and r E r .

i
— . . . x
Let r(x) denote the best relative approx1matlon to e on

 

 

 

 

 

 

 

 

[a,b] from R: with deviation A0; that is,
f'x - eX r x - eX
ex = 1n£_ ex = X0 - {5-3)
rER
m

Clearly O < l0 < 1 so that f E M. The unique best approximation

(n) ,3

(n)
T (c1 ,...,cn

n ) to ex from Tn(sn) is given by the following

set of recursive relations:

 

 

l
l-xk 2)\k 1
= e 1+1k k = 0,1,... (5.4)

I 2
l-xk

86

= 1 ' Bk'1(1 ' Kk_1)[1 ' Ln Bk-1(1 ' kk_1)]
xk 1 + Bk-1(1 - kk-1)[1 ' Ln Bk-1(1 ' )hk_1)] ,

 

k = 1,2,... (5.5)

2

= , k=1,2,... 5.6
k Bk_1(1 - lk_1)l1 - Ln ek_1(1 - ik_1)] + 1 ( >

Ck = Bkdk, k = 1,2,...,n-1 (507)
cs“) = on (5.8)
and
* ._
r (x) = Bor(x) . (5.9)
In passing from n to n+1 the tuples (cin),...,c§n),r*) and
(n+1) (n+1) * ,
(c1 ,...,cn+1 ,r ) are related as follows.
cg“) = C€n+1), i = 1,...,n-l ,
1 1 (5.10)
C<n+1> = a cis)
n n n
and
(n+1) (n) (n) * _ (n+1) (n+1) *
c +1 N{Bn(Tn(C1 ,...,cn ,r ))} — Tn+1(c1 ,...,cn+1 ,r )

*
Note that r is the best starting approximation for calculating

x . . . .
e With the usual Newton iteration; that is,

 

 

 

 

 

 

 

 

* x n X
Nn(r )gx) - e 1 s N (r)(x) - e t (S 11)
x x
e e
O
*
for all r 6 M. with equality holding if and only if E r , where

r
n-l

N(r)(x) - r(x)(1 +-x - Ln r(x)) and Nn(r)(x) = N(N (r))(x), n = 2,3,...

Finally we have that Bk is such that

ek(1-lk)[1-tn ek(1-lk)] = ek(1+lk)[1-tn ek(1+lk)]. k = 0.1.2.... (5.12)

87

Lemma 5.1: Let Nx be defined by Nx(t) = t(1 + x - Ln t) for t >|O.
Let x,y be two given real numbers and let e,¢ be such that
0<9<l<tp. Then, for %<t<% wehave
y Y
N N t x
X(we ) y( we > .

>
ey ey Nx (t sex)

and for O<t<$ wehave

 

x x
Nx<ee > >Nx<tee > . ey
ex ex N (tqey)
Y
N (tmey) ex Ny(q£y)

Proof: Set f(t) = —X——____ . ___—___. -
ey N (teex) ey
x

1 _ I Q
=§[1-L2:9] -QP[1-anp].

Now consider the function h(t) = t(1 - Ln t) for t > 0. Upon

differentiation, we obtain

h'(t)<0 if t>l,h'(t)>o if t<l. (5.13)

.Q

Thus since 9 > m > 1 it is clear that f(%) < 0. We also have that

f'(t)=52 [Lute-mtg; <0 if t#
9 t(1 - Ln he)

Similarly, set

Hence the first result.

 

<0er

N (teex) y N (Sex)
S(t) = —§-;—-——‘° e - x
e Ny(tqey) ex

=‘g [}%—§4%§—%% J - 9(1 - in e) .

88

Using (5.13) and the fact that %< e < 1 it is clear that g(i) < O.

 

Now g'(t) = ﬁ- Ln tw - Ln t9 > O for 0 < t < la Thus the
t(1 - Ln tm) w
second result. I
* *
Theorem 5.2: (cin),...,c§n),r ) for cg“), i = 1,...,n and r
*
given by (5.7)-(5.9) is an element of Sn and Tn(c§n),...,c§n),r )

is the unique best relative approximation to ex from Tn(sn) for

each n = 1,2,... . Furthermore

Tn(c§n),...,c§n),r*)(x) - ex

 

 

 

 

 

x = kn , (5.14)
e
(D
*
Tn(c§n+1),...,c§n+1),r ) intersects ex in [a,b] and
n+1 n+1 * x x
N(Tn(c§ ),..., i ),r ))(x) - e S N(Tn(c1,...,cn,r))(x) - e
x x
e e
oo oo
(5.15)
for all (c1,...,cn,r) E Sn with equality if and only if ci a c§n+1),
*
i = 1,...,n and r E r . This last inequality states that
*
Tn(c{n+1),...,c§n+1),r ) is the unique best relative starting approxi-

mation to ex for the Newton operator N, from the class Tn(sn)°
Proof: We shall proceed by induction on n. For convenience we

shall use the following notation: for (c1,...,cn,r) 6 Sn set

W, = .iiiiicsi... W M}
ex y€[a,b] ey

m(r) = x: _(_)___r x - ex = min LI. ' ey}
ex yEEa,b] ey

 

l'll: I
h .l I All ‘5‘ {I

89

T (c ,...,c ,r)(x) - ex
M(c1,...,cn,r) ={x: n .1. n

 

X
e

____ max Tn<c1.....cn.r)<y) - ey}

yEEa ab] ey

 

X

Tn(c1,...,cn,r)(x) - ex
m(c1,...,cn,r) = x:
e

 

. Tn(cls°°'5cnsr)(Y) ‘ 9y}
_ min 0

yE[8,b] ey
Observe that M(c ,...,c ,r) = M(c ,...,c ,1,r) and that
1 n l n-l
m(c1,...,cn,r) = m(c1,...,cn_1,l,r).

To establish the case n = 1 we shall establish various con-
ditions that a best relative approximation from T1(Sl) must satisfy
(1) * * . . . .
and then show that T1(c1 ,r ) E T1(d1,r ) is strictly minimal
* ._
within this subset of T1(Sl). First r E M since 0 < r(x) s
* _
(l + x0)ex, r (x) = Bor(x) and 50(1 +.x0).< e as is easily checked.
(1) * (1) . . . .
Thus (c1 ,r ) E S1 as c1 is clearly pOSitive. Now if T1(c,r)

is to be a minimal solution then c must clearly be given by

 

 

 

 

 

2
° T1<1.r)<x> T1(1.r>(x> ' (5‘16)
max +' min
xE[a,b] ex xe[a,b] ex J
T1(1,r><x>
Since T1(l,r)(x) = N(r)(x) we have that 1 2 max x >
xEEa,b] e

l1(1.r>(x)
min with equality holding in the above if and only
xE[a,b] ex

. . X . .
if r intersects e . With the above selection for c, we have

T1<1.r><x> T1<1.r)(x>
c max x - l = l - c min x . Thus

x€[a,b] e x€[a,b] e

 

90

(ex - T1(Csr) (x)) i1(c.r>(x)
max = l - min -——-——————— = 1 -

 

 

xE[a,b] ex x€[a,b] ex
T1(1.r)(x) T1(1,r)(x) (T1(c,r)(x)-ex
c min --—-::--'= c max -—-——————- - l = max
xE[a,b] e x€[a,b] ex xE[a,b] ex

Now let c1 6 E: ‘be such that c1 > c. Then for x E M(l,r) we have

T1(c1,r)(x) - ex - ciT1(1,r)(x) - ex

x _ X
e e

T1(c,r)(x) - ex

 

 

 

e

 

 

 

 

l
wise, for c1 6 E+_ and c < c it is clear that T1(c,r) is a

l
strictly better relative approximation to ex than T1(c1,r). Hence

if T1(c,r) is to be a minimal solution then c must be as in (5.16).

Next suppose r E M intersects eX and let
2 *

 

c = [ T1(],r)(x):1 where we shall assume that r i r .
1 +' min ---—-—--
x
x€[a,b] e

*
Since r is the best starting approximation for the Newton operator

 

 

 

 

to ex we have (using T1(1,r) E N(r))
x x *
e - T1(1.r)<x) e - T1<1.r ><x>
max > max
xE[a,b] ex xE[a,b] ex
or equivalently,
*
T1(19r)(x) TIC-9r )(X)
min x < min x . (5.17)
XE[a,b] e xE[a,b] e
* . x . f(x) .
Let us now Show that r intersects e . Since -—;7- has a max1mum

e
value of (1 +.K0) and a minimum value of (l - lo), we must show

that 50(1 - 10) < 1 < 50(1 +‘10)- We shall verify the second in-

equality and the first follows by an analogous argument.

91

Claim: 30(1 +‘l0) > 1 .

 

_lu.
1 ' kor)“ 1
J1 - is“
(I - “>52; -
e . Taking logarithms we find that 50(1 + X0) > 1

1 + x0
1 - x0) 2x0

Proof: Recall that 60 = e< and so 30(1 + x0) =

l

Nln—I

 

is equivalent to Ln (

 

-—-—-—- > --—'. Consider the function
1.+ X0 KO-l
l-x 2x . . . .
f(x) = Ln 11;. - ;:l for 0 < x < 1. Upon differentiation we find
that f'(x) = 2x > 0 for O < x < 1 which implies f is
(x - l) (x-l-l)

increasing for 04< x < 1. Now f(0) = 0 implies f(xo) > O and

so the desired result. Notice that the above argument is independent
of the index of B and l and thus we may conclude that Bk(1-Xk) <
l < Bk(1+1k), k = 0,1,2,..., a fact we shall make use of later.

*
Hence r intersects ex and so by (5.17)

c* = 2 < c . (5.18)

[ i1<1.r*)<x) }
1 +' min
XE[8,b] ex

 

 

x
* T1(c,r)(x) - e
Let x1 E M(l,r) and y1 E M(1,r ). Then x
e

 

 

 

x1 y1

c T1<1.r><x1> - e c T1<1.r*><y1> - s T1<c"{.r*><x> - ex

X
e

 

 

 

X

 

1 Y1 e
e

co

*
This shows that if r E M intersects ex and r i r , then T1(c,r)
is not a best relative approximation to ex from T1(Sl). Let us

*
now show that c of (5.18) is equal to oil) of (5.8). First

'1:
T (lar )(x) *
min 1 = min E.$§l.(1 +-x - Ln r*(x)) =

xE[a,b] ex xE[a,b] ex

 

92

* * *
x x O O x 0 0

it is easily shown that the minimum desired is 80(1'XO)[1'LD 50(1-y0)] =
50(1+10)[1'Ln 80(1+10)]. Therefore c* = 2 * =
T1(1.r )(x)

X

l + min
xEEa ,b] e
2 (1)

c as required.

80(1-x0)[1-Ln 30(1-x0)] + l ’ l

 

Now it still remains to consider the case where r E M does
. X . .
not intersect e on [a,b]. First we shall examine the case where

r(x) > ex for all x E [a,b]. Set

IX '8

min x = a > O . (5.19)
xE[a,b] e

Using Property 1, it is easily shown that m(r) = M(l,r) and M(r) =
m(1,r). Furthermore it is evident that r(x) = (1 +~a)ex on m(r).
Now let 2 denote the best one-sided relative approximation to

f(x) = (1 +a)ex on [a,b] from above from the class Rg. Clearly
? E M if there exists r E M for which r(x) 2 (l +a)ex on [a,b].
Thus for every r E M satisfying r(x) 2 (1 +a)ex for all

x E [a,b] we have

mo - (1 + pie"
(1 + a)ex

f(x) - (1 + erx
(l + (31/)ex

S

 

 

 

 

 

 

 

 

m as

with equality if and only if r a f. The function f(x) = 2351- is

q(x)
uniquely characterized by a set of points {xi}:=1 C [a,b]

satisfying asx1<...<xd$b where d=m+m+2-6, 6=

A _ X
and on which E(x) = r(g) 11 :-q)e
(1 + (1)8
alternately zero and “E(x)”0° (or alternately equal to “E(X)Hm

min(; - 5p, m - ad) is

93

and zero).
We shall now show that (6,?) where f is the best one-
sided relative approximation to (1+a)ex is a better solution than
any (c,r) with r E M satisfying (5.19), where E and c are
the constants given by (5.16) corresponding to f and r respectively.
By the uniqueness of the best one-sided relative approximation f

to (1+a)ex, we have that

rix) - (Ha)? HXL- awe"

 

 

 

 

 

max X > max X
xE[8,b] (1+a)e xE[a,b] (1+a)e
r(x ) f(x )
Select x0,x1 6 [a,b] such that x0 = max Elfl-, x1 =
e O xE[a,b] e e l
. r(x ) f(x ) N(r)(x ) N(?)(x )
max Eﬁil' Then 0 > 1 > 1 and so -———-—9-< -—-——J;'
x x x X x
xE[a,b] e e 0 e l e 0 e 1

since the function
f(t) = t(1 - Ln t) is increasing for t < l, decreasing for t > 1. (5.20)

Now, since xO E M(r) = m(1,r), x1 E M(f) = m(l,?),

 

 

 

 

 

 

 

 

T1(1.r)(X) T1(1.?)(X)
min < min
xE[a,b] ex xE[a,b] ex
T1(1.r)(X) T1(1.?)(X)
We now claim that max x = max x . We
xE[a,b] e xE[a,b] e
x
can see this as follows. Let x1 6 m(r). Then r(xl) = (1+a)e 1,
X0 r(xl)
Select x0 E [a,b] such that f(xo) = (1+u)e . Then x =
e 1
f(x ) T (1st)(x ) T (1.?)(x )
x0 and so 1 x 1 = 1 x O . Therefore,
0 1 0

e e e

94

T1(1.r)(X) g T1(1,r)(x1) T1(1.?)(x0) T1(1.?)(X)

max = = max
X X

x
xE[a,b] e e l e 0 xE[3,b] e

 

 

 

 

X

since m(r) = M(l,r). Thus E < c and so

X

 

 

 

 

 

e ' T1(C5r)(x) T1(1,r)(x) A T1(1,r)(x)
= c max ————-————-— - 1 > c max -—————————— -
ex xE[a,b] ex xE[a,b] ex
T1<1.i-)<x) T1(’é,i)<x> e" - plan‘s) (x)
=anax -l=max ___—-1:
xE[a,b] ex xE[a,b] eX ex

 

 

Thus the only possibility for a minimal solution in this case is

T1(E,r).

. . . x
Now r, the best relative approx1mation to e , has a char-

acterizing sequence a s x < x <...< x s b, d = m +'m + 2 - 6,

l 2 d

 

 

 

6 = min(m - as, m - 5q_) where 17 = E , on which
x q
‘- i
r(xi) - e g x _ ex
x = x = lo. 1 = 1,...,d and
i e
e m
— xi — xl
r(x.) - e . r(x ) - e
1 1+1 1 ,
'58“ x = (-1) sgn x , 1 = 1,..., d. Set
i 1
e e
1+u - . .
11 = l:i_ . Then xlr turns out to be the best one-Sided relative
0

d
approximation to (l+a)ex on [a ,b] from above as {xi}i=1 is a

characterizing sequence for x1?“ which satisfies the conditions of

*
a best one-sided approximation. By uniqueness, r 5 A ; a-sz—-£— E
1 1"1‘05
0
* 1+0! 1 e
h =-——-.Nt that —-—-—< <-————
1‘ ” ere A 90(1-10) ° 8 50(1-10) * 60(1-10) ’
a X * . x
r(x) > e for all x E [a,b] and r intersects e .
*
We shall now make use of Lemma 5.1 to show that T1(c§1),r )

. . x
is a (strictly) better relative approximation to e than

 

 

(D

95

* * * * *
T1(E,2) T1(E,Xr ). Clearly M(r ) = M(Xr ) and m(r ) = m(xr ).

* *
Let x1 E m(r ) and x2 E M(r ). Then
x x

r*(x1) = oo<l-io>e 1 lr*(x1> = 190(1-10>e 1

* x2 * x2
r (x2) = eocl+io>e 1r (x2) leo<1+io>e .

With the notation in the Lemma, set x = x1, y = x2, tp = BOO-Flo).

8 = 80(1-k0), t = X- Then

X

2
Nx (80(1+10)e >

 

 

 

 

 

 

 

1 * l *
. Tlc .r )(x) T1( .r ><x,) 2
min = =
xE[a,b] ex exz 8x2
x2
1

>|NXZ(XBO( +X0)e ) . exl = T1(1,xr*)(x2) . ex1

x x x 1’ *

e 2 Nx1<lso(1-lo>e 1) e 2 Tl( 1‘ )(Xl)

(5.21)

 

 

{ T1<1.1r*><x>}{ Tl(l.ir*>(x>}'1
= inin x max
xE[a,b] e xE[a,b] ex

since m(1,xr*) = M(xr*) and M(1,xr*) = m(hr*) as xr*(x) > ex

for all x E [a,b]. Hence, by (5.21),

 

 

 

 

 

 

 

 

 

 

 

 

 

T1(8,xr*)(x) - ex = 1 - T1(E,xr*)(x2) = 1 - E T1(1,Xr*)(x2)
X X X
e on e 2 e 2
1 'k
2 T1( .ir )(x2)
x2
= 1 - e
* *
[llama )(xl) T1(1.xr mp]
+
X X
e l . e 2
= 1 - i x
{1 11(1.ir >(x1) e 2 ]
+ x . *
e 1 Tlamr >(x,>

96

(1)
2 — T1(C1 3r *2)(X )=
> 1 - x - 1 - x x
1 + e 2 e 2 e m
T1(1,r*)(x2)

T1(ci1),r*)(x)-ex

 

 

 

 

 

 

 

 

thus showing that T 1(c(1),r* ) is a better relative approximation
to ex than T1(E,2). A similar type of argument applies to the
case where r E M satisfies r(x) < ex for all x E [a,b] and
we shall omit the details. This case will be considered in the
induction step.

Collecting the above results, we find that we have established
the following: T1(ci1),r*) is the unique best relative approximation
to ex from T1(Sl)' To complete the case n = l we must establish

*
(5.14) and (5.15). For (5.14), let x1 6 m(1,r ). Then

X
T1<c c‘1>.r* ><x>- el = e 1 C(1>T (1. r *><x > 1 _ cil’i1<1.r*)<x1>
X X X

e I e 1 e 1

 

 

 

 

 

2
= 1 '{:ao(1-io>[l-tn 60(1-io)l +i:}{5o(1')o)[1 ‘ ‘“ 80‘12“}

1 - 60(1-10>[1 - tn 80(1-10)]
= =)\
1 + 80(1-10)[1 - Ln 80(1-10)]

 

1 0

Next we must show that T 1(c(2 ),r* ) intersects eX in [a,b].

ex - T (c‘ ) r *)(x>
From an earlier result, max x =
xE[a, b] e

T <c”’.r* )(x) - e x)

 

(2)- (1)

max = 11- Now c1 W61

xE[a,b] e

 

X

T1(c‘ ’.r* )(x)
1 < 51(1+)1) and (1-x1) s ex

5 (1+11) imply that

97

(2 ) (2 ) *
T1<c1 .r*)oo W1‘ .r)oo
s 81(1+11)- Since x is a

 

 

61(1-21) s x
e e

continuous function of x and assumes the values 81(1’11) and

31(1+x1) it assumes all values between. Hence there exists

(2 )
T1(C1 ’r *)(X) (2)

X 6 [a,b] such that x = 1, and so T 11(c ,r r*) inter-
e

 

sects ex in [a,b]. Finally, we shall establish the fact that

(2 )

* x x
N(T1 (c ,r )><x) - e N(T1(c,r>>(x> - e

 

 

X
e
m m

*
(c,r) E S1 with equality if and only if c = C(2) and r E r .

X
e

 

 

 

 

 

 

 

 

Since (1-x1)ey s T1(c(1),r*)(y) s (1+)1)ey for all y E [a,b]

(2 )

we have that 31(1- x1)ey s T 11(c ,r *)(y) S 51(1+)(1)ey for all

N (81(1-11)ey) Nx(81(1+11)ex )

 

 

 

 

y 6 [a,b]. Recall that y y = X for each
e e
pair of points x,y E [a,b]. Let (c,r) E S1 be such that
i 2 *
N(T1(C»r))(x) - ex N(T1(c§ ).r ))(x) - eX
S . This implies
x x
e e
oo oo

31(1-11)ey s T1(c.r)(y) s 81(1+11)ey. Therefore

(1-11)ey s T (-. r)(y) s (1+1 )ey
B1

and so

2. _ Y
T1(Bla r)(Y) e

ey

 

-)1 s 5 x1 .

This allows us to conclude that T1(£-, r) is as good a relative
x (1) * 1
approximation to e as T1(c1 ,r ). By the uniqueness of

(1)
W1(

,r*) wehmm

98

cil) a 3;, r E r*; that is, c = $1c(1)= ClZ)’ r E r*.

Before we proceed to the induction step let us establish the
following result for which we shall have need.

Claim: ak<§ for k=0,1,2,... .

l

1-1 K—
Proof: Since Bk = 8(171-715) k l
k

_1__ 1"

1-)(k Zxk 1+1k
, or equivalently Ln 1+in<— 1 bk k(Ln 2 ).

 

it suffices to show that

x

 

 

 

NIy—i

(I'M) 2hr 1 <

1+)

k I 2
l-xk

It is clear that 0 < xk < 1. Set f(x) = --Ln iii- -Ln -- for

l+x
O < x < 1. Upon simplification, f(x)= %i§-1n 123.- Ln(l-x) +’Ln 2.

Now f(0) = O and f'(x) = I%;-[l +--l;-Ln l§§-]. It is easily

shown that 1 +~I%;-Ln 13x > O for 0 < x < 1 and so f'(x) > O

for O < x < 1. Hence the desired result follows.

Now assume the theorem is true for k = n. First let us show

n+1 n+1 .
that (C; ),...,c (+1 ),r* ) E Sn +1, or equivalently yn =

c(n+1)

*
n N(yn _1) E K since r E M from before. By assumption

(aim),...,c(n ),r* ) 6 Sn and 30 Y

n-l
. (n+1) _ (n) = (n)
defined. Now cn — Bncn and so yn B c N(yn_1). Clearly

n n
Oi< C(n)
n

E K. Thus N(yn_1) is well-

< 2 and O <‘N(yn_1) S ex since N is one-sided from

beIOW. ThuS, by the claim, 8 nC§n) < 2.. 2 = e and so 0 < yn =

2
(n) c(n)ex1+x

BnC n N(Yn _1) 5 BnC e and so yn E K. We shall now show

c(n+1) c(n+1) (n+1) and *

,...,c n+1 ,r r*) f 1 ,...,c n+1 r

given by (5.7)-(5.9) is the unique best relative approximation to

(n+1)
that Tn+1(c1

x
e from Tn+1(Sn+1). As in the case n 1 we shall show that

r) E S

where we allow c to vary,

for any (c1,...,c n+l’ n+1

n+l’

I I III-i ll 311‘

99

Tn+1(c1,...,cn +1,r)(x) is closest to ex in the relative norm

only when

 

 

 

 

cn+1 = T (c 1 r)(x)2 T ( 1 )( ) (5°22)
n+1 1’°"’cn’ ’ . n+1 Cl"°°’cn’ ,r :x ]
max x +' min x
xE[a,b] e xe[a,b] e
T (c ,...,c ,1,r)(x)
where max n+1 1 x n s 1 with equality only at those
X€[a,b] e

points y where Tn(c1,...,cn,r)(y) = ey. This is true since

Tn+l(c1,...,cn,l,r) = N(Tn(c1,...,cn,r)). For the above choice of

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

Cn+1’ X1 6 M(c1,...,cn,1,r) and x2 6 m(c1,...,cn,1,r) we have
V x1
e -n+1(C1!"°!Cn’Cn+19r)(Y) = Tn+1(cl"°°3cn,cn+19r)(x1) - e
ey x1
00 e
x2_
: e -Tnn+1(cl,...’c 9Cn+19r)(x2 )
x2
e
Now if cn+1 > cn+1 then for x1 6 M(c1,...,cn,l,r)
x1 x1
Tn+1(c1,...,cn,En+1,r)(x1) - e >’Tn_'_’1(c1,...,cn,cn+1,r)(x1) - e
x x
e 1 e 1
Tn+1(c1,...,cn ,cn +1,r)(x) - eX
x C
e
(I)
Similarly, if en+1 < Cn+l’ then for x2 6 m(c1,...,cn,l,r)
x2
e - Tn+1(c1,...,cn,8n+1,r)(x2) Tn+1(c1,...,cn,cn+l,r)(x) - e
x x
e 2 e a:
and thus the desired result. Next we must show that c of (5.22)

n+1

 

II III Ala-PI Iﬁﬁl... ' till. III 1 I 1' illll': '1' Ill 51
l III-

100

(n+1)
n+1

(n+1) (n+1)

gives the correct value of c correSponding to c1 ,...,cn

Tn(c{n),...,c§n),r*)(x) - exl

X
e

Tn+1(c C(n+1),...,c§n+1),1,r*)(x)

 

*
and r . By the induction assumption

 

 

l on

 

= A . Furthermore the minimum of
n ’ ex

occurs on the set M(c§n+1),...,cn (n+1), r*) U m(c (n+1),...,c§n+1),r )

and is given by Bn(1-1n)[1 - Ln Bn(1-xn)]. Hence

c(n+1) _ 2
cn+1 1 + Bn(1-xn)[1 - Ln Bn(1'xn)] :
satisfies (5.22) and suppose

 

as desired. Next let

(c1,...,cn+1,r) 6 S where Cn+l

n+1

Tn(c1,...,cn,r) intersects ex on [a,b]. Then by the induction

assumption we know that

(C (n+1),...,c§n+1),r*))(x)- exl N(Tn(c1,...,cn,r))(x) - ex

‘ S

N(Tn

 

 

X
e

X
e
00

+1
in )=°i’
*
i = 1,...,n and r E r . Suppose that (c1,...,cn,r) ¢

(C§n+1),...,c§n+1),r*). Then

for all (c1,...,cn,r) 6 Sn with equality only if c

, Tm+1(c1,...,cn ,1, r)(x) (C (n+1) (n+1)
min _< min

*
Tn+1 ""’Cn alar )(X)
x

x€[8,b3 e xEEa.b] ex

 

 

(n+1)
and so c n+1 > cn+1 .

(n+1) c(n+1)
,..., cn+1 ,r*

Thus, for x E M(c1,...,c and

1 n+1’r)

26 M(c1

T X x1
n+1(C1:ooo,Cn+1,r)(X)'e l = Ch+1 Nr1(T (C 1n,o..,C ,r))(x 1) - e

x X
e l 1

on e

(n+1) (n+1)
1N(Tn(c1 ,...,cn

X

2
e

 

 

 

 

w>><x2> -e

101

 

 

 

 

(n+1) (n+1) c(n+1)
> Cn+1 N(Tn(c1 "°"c r*X))( 2)_ex2
x2
e
+l +1 *
_ Wn+l( (n )""’°§:1 )’r )(x) ’ ex
_ x2
8 co

Finally we consider the case where (C1:°'°’°n+1’r) E Sn+1

for which Tn(c1,...,cn,r)(x) does not intersect ex on [a,b].
Let us first consider the case where

ex " Tn(C1,...,Cn,r) (X)
min X = a > 0 ;
XE[3,b] e

 

that is, ex > Tn(c1,...,cn,r)(x) for all x 6 [a,b]. Set

ex - Tn(c1,...,cn,r)(x)

max = a .
x
xE[a,b] e

 

Since Tn(sn) C C+[a,b] we have that B < l and so a < B < 1.

T (c ,...,c ,r)(x)
. First max n 1 n = 1 - a-

- Bn(1+)\1) X€[a 1)] ex

(n+1) (n+1)
Now max ”m(clr ,...,c:n ,r *)(x )=
xe[a,b] ex

rn<c{“+1),...,c§“+1’r*>(x>

 

 

 

)‘max
xE[a,b] ex

Tn<c§“).....c (“1.8 c‘“).r* ><x)_
)‘max T
XE[8,b] e

 

X

Tn(c{n),...,c§n),r*)(x)

16- max x = x5n(1+1n) = 1 - a. Therefore

n XE[a,b] e

 

102

 

 

 

 

x1n(c§“+1),...,c:“+1),r*)(x)
max =
x€[a,b] 8“
T (c ,...,c ,r)(x)
max “ 1 “ = 1 - a. (5.23)
xE[a,b] ex
By (5.15),
. Tn(C{n+1),...,céﬁil),XC§n+l),r*)(x)
min >
x€[8,b] ex
T (c ,...,c ,r)(x)
min “ 1 n , (5.24)
XE[a,b] ex

C

—2-,r)(x) is a better relative starting

for otherW1se Tn(cl"°"cn-l’x

. . x _
approx1mation to e for the Newton operator than 18

(n+1) C(n+1) r*
n 9

1 ,..., )(x). Here we have used the fact that

Tn(c

m(c1,...,cn,r) = m(c1,...,cn,1,r), M(c .,cn,r) = M(c1,...,cn,1,r)

(5.25)

1,0.

which is established using Property 1. Choose xo,x1 6 [a,b] such

+ *
that x0 6 M(C1n 1),...,XC§H+1),I ) and x1 6 M(c1,...,cn,r). Then,

by (5.23),

,...,XC

T (Cin+l)

(n+1)
n n

.r*><x0> Tn<c1....,cn.r>(x1)

X X

e0 e1

 

and so, by (5.25),

 

 

 

Tn+1(c§n+1)a - - ° acin+1):xcr€n+1)a 1:r*) 0‘)
max =
xE[a,b] ex
+1
Tn+1(cin ),...,c:n+1),xc§n+1),l,r*)(xo) = Tn+1(c1,...,cn,1,r)(x1) _
x0 x1

e e

103

T (C 00- C ,1,r)(x)
wax n+1 1’ x’ n .
x€[a,b] e

 

E [a, b] such that yo 6 m(c(n+1) ,...,XC§n+1).r*)

Choose y0,y1

and y1 E m(c1,...,cn,r). Then

(n+1) (n+ 1) *
1 > Tn(C1 ,...,xcrl ,r )(yo) > Tn(c1’000,cn,r)(YI)
e e

 

 

and so

Tn+1(C1:° ° ° ,cn,1,r)(x) = Tn+1(c1:' ' ° :Cn913r)(y1)

 

 

min
x

x€[8,b] e ey1

(n+1) (n+ 1) *
< Tn+1(cl :°'°a)\cn 1:1. )(yo) -
y
e 0
1 1
Tn+1(c§n+ ),..., c§“+,1)r*)(x)

 

min

x€[a,b] e

where we have made use of (5.13) and (5.25). Hence 8 < c

n+1 n+1
(c (n+1) (n+1)
1

*
where E corre3ponds to ,...,cn_1 ,xc§n+1),r ) and

n+1
cn+1 to (c1,...,cn,r). Thus

x ( +1) (n+1) (n+1) .
TCH+1( n 300°:Cn_1 .16 n s cn+1,r *)(x)

 

X
e

(n+1) c(n+1) (n+1) *
(C a°'°ac AC :19): )(X)
= E x Tn+1 n- -1 n _ 1

n+1 xE[a,b] ex

 

 

 

 

co

 

_ . Tn+1(c1,...,cn,1,r)(x)
_ Cn+1 max x - 1
x6[a,b] e

 

Tn+1(c1,...,cn,l,r)(x)
max x - 1
xE[a,b] e

 

< Cn+1

104

X
e ' Tn+1(°1’° ' ° ’cn+1’r) 0::

 

 

 

X
e

 

 

(D

Thus, we must show that 1(c(n+l) c(n+l )

,...,c n+1 ,r *)(x) is a (strictly)

better relative approximation to ex than T n+1(c C1n+1) (n+1)

,...,cn-l ,
M(n-H') 3
an , cn+l’r *)(x). Here we shall make use of the second part of

Lemma 5.1. Set x = x1, y = y1, e = Bn(l-xn), v = Bn(l+1n) and

 

_ _ Li. 1
k — t where x - Bn(1+xn) . Observe that O < K < . In order to
apply the lemma let us make the following observations. For
n+1 +1 * +1 +1
x16 M(c ( ),...,c§n ),r ) and x2 6 m(c (n ),...,c§n ),r ),
1 3 a n 1 1-
x = J = a (In >
x n n
1
e
x
(n+1) (n+1)

l
and so Tn(c1,...,cn ,r *)(x l)= Bn(1+1n)e . Also

 

 

 

 

 

Tn(c{n),...,c§n),r*)(x) - ex — . 1.
ex - x“ imp 1es
c(n+1)
1 1
Tn(c§n+ ),...,cr(1'_"{), ir,r*)(x) - eX
x n = x“ which in turn yields
e
X a:
T n(c: (n+1),...,c(n+1),r* r)(x2)= Bn(1-)‘n)e 2. Applying the lemma,
N( (1- >ex2> N( (l- >ex2 "1
a. k. > mm X. > , e
X X X
e 2 e 2 N(xan(1+xn>e 1>

or equivalently,

105

(n+1) (n+1) (n+1) (n+1)

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

N(Tn(c1,...,cn U))(x N(un(c1,...,cn h))(2)
x2 > x2 O
e e ,
CXI
um: (C(“+1>,,..,c§“+1>,r* Mac >
Now
Tn+1(c§n+1) , . . . ,ciiil) ,xCrﬁn-H‘) ,En+1,r*) (x) - eX
e)?
E (n+1) C(n+1) (n+1) *
= 1 - Cn+1TC1n+1( 9°" C-I'l 1 5 Ken ,1,!" )(X2)
X
2
e
= 1 .. 2
N(XTr1 (C (n+1),...,cr(ln+1),r* ))(x1) (:2 + 1
X
e 1 110:11n (C(“+1),...,c (n+1), r"‘))(x2 )
> 1 - 2
1 +
N(T (c{“+1),...,cr(1“+1),r* ))(x2)
(n+1) C(n+1)
_ 1T°1n+1( ""’C n+1 ’r *)("2)
.. x2
e
x ’
e

 

 

. (n+1 (n+1) . ' .
show1ng that Tn+l(cl ),...,c cn+1 ,r *)(x) 18 a (strictly) better

. x (n+1) (n+1)
relative approx1mat ion to e than '1‘n+1(<:1 , . . . ,cn_1 ,
c(n+1) ..

cn , Cn+1’r *)(x). Similarly, the case where Tn(c1,...,cn,r)(x)

x
> e leads to the same result.

106

Gathering the above results, we have that

+.
Wu+l< (n 1),...,CS:I1),r*) is the unique best relative approximation

to ex on [a,b] from T To complete the induction we

n+1(Sn+1) °

must verify the following:

 

 

 

 

(n+1) (n+1)
Wn+l( ,...,c n+1 ,r *)(x) - ex - (5 26
ex - xn+1 ’ ° )
Tn+1(C C(n+2),...,c (:12),r* )(x) intersects ex on [a,b] (5.27)
and
N(Tn_l_1(C(n+2),...,cr(1r_:_-{2),r*))(x)-ex
S
x
e

 

 

 

 

 

 

 

 

 

N(T (C ,...,C ,r))(X) "' ex
n+1 1 n+1 (5.28)
x
e
m
for each (c1,...,cn+1,r) 6 Sn+1 with equality if and only if
. = c§n+2), i = 1,...,n+1 and r E r*.
1 1
To establish (5.26) observe that for x1 6 m(c(n+1),...,c§n+1),r*
n+1 n+1
1(c( ),...,c(+1),r*)(x) - eX
x _
e
m
x
1 n+1 n+1 n+1
= e ' CIE+1T ) T°n+1(( )’”"°r(1 )’1’r)("1)
x1
e

 

2
= 1 -{Bn(1-)\1)[1-Ln Bn(1-)\1)]+1 {Bn(1‘)\n)[1'{ln Bn(1')\n)]>

107

1 - Bn(1-xn)[1 - Ln en(1-xn>]
- 1 + Bn(1-xn)[l- Ln Bn(l-xn)] g >‘n+l °

 

 

 

 

 

 

 

 

 

(n+1) (n+1)
Similarl 1(C "H’Cnﬂ r*)(x) - ex = x for
y, ex n+1
m
X E M(C§n+1),...,c§n+1),r*). By the induction assumption
Téc§n+1),...,c (n+1),r* )(x) intersects ex on [a,b]. Now
(n+1) C(n+1)
(l-x )s “+1(c1 ,...,cn+1 ,r *)(X) 5 (1+ ) and so
n+1 ex xn+1
(n+2) (n+2)
(1_ )5 TCn+1(1 v" Cn+1 13%)“) 5 (1+ )
Bn+l N1+1 ex E3n+1 >‘n+1 '
n+2 n+2
n+1< ( ),...,cr(1+1),r*)(x)
Since x is a continuous function of
e

x it assumes all values between. Thus there exists x0 6 [a,b]

(n+1) C(n+2) * *0

such that M+l(c ,...,c n+1 ,r )(x 0) = e and so (5.27) is

established. Finally (5.28) follows as in the case n = 1. Indeed,

by (5.26)

(1- xny+1)e s T +1(<:(‘“+1’...., 51:1); )(y) s (1+xny+1)e

for all y 6 [a,b]. Consequently

(n+2) c(n+2)

y “M(c .. c+1 ,r*>(y> san+1<1+xn+1>e

e s T y.

Bn+l(1-xn+1)

Recall that

y x
Ny(Bn+l(1-xn+l)e ) _ Nx(Bn+-l(l'Ii-a‘nﬂl-l)e )

X
By e

 

 

for each pair Of points x,y 6 [a,b]. Let (C19°"9cn+1’r) E Sn+1

108

satisfy

 

 

 

 

 

N(Tn+1(c1,...,cn+1,r))(x) - ex
ex S
N(Tn+1(c‘“+2)..... c§:’{2>,r*))(x) - e"
x .
e

 

 

This can only happen if

- V y
Bn+1(1 xn+1)e S Tn+1(cl’°°°’cn+1’r)(Y) S Bn+l(1-'.)‘n+l)e '

 

 

Thus
y
Tn+1(c19'°°:cna ;n+lar)(Y) ’ e
-x S n+1 S
n+1 ey xn+1
c +1
and so T +1(c1,...,cn, —2-,r) is as good a relative approximation
n Bn+1
x n+1 n+1 ,
to e as Wu+l< ( ),...,c§+1 ),r ). By uniqueness of
Tn+1(cc(n+1),...,c (:11),r* ) we have that c1 = c§n+2), i = 1,2,...,n+l

*
and r E r . This completes the proof of the theorem.

Section 3: Execution of the Scheme

In this section we shall show how this iterative scheme would
be programmed. Suppose we wish to calculate ex for x 6 [a,b]
within a tolerance of e > 0. First we calculate X0 and F, (where

m and E are specified) with:

- x
r x - e .
= inf

m rER

k0 =

 

 

 

 

 

 

 

 

109

and then calculate the sequences {Bk):=0 and {xky:=0’ comparing
the value of xk with e at each step:

_L

1-xk Zxk 1
Bk = e 1+xk , k = 0,1,2,...

«11-x:

x 1 ' Bic-la‘hwi)‘:1 ' L“ Bic-1(1"‘1<-1)J
k .

 

 

 

Let n be the first integer for which e3> x“. For this n cal-

 

culate cén), k = 1,2,...,n using

dk = 2 , k = 1, ..,n

Bic-la'kk-iﬂ1 ' L“ Bk-1(1-xk-1)] + 1
n

cé ) = Bkdk , k = 1,2,...,n-l

C(n) g d

n n

* ..
and r (x) = Bor(x) .
We then store the values Ci“), k = 1,...,n. The algorithm is:

*
y0<x> = r (x)

yk(x) = c£§)yk_1(x)(l flx - Ln yk_1(x)), k = 1,2,...,n .

Remark 1: ‘We would like to point out that Theorem 5.2 is of
theoretical interest since it is one of the few cases where we can
find a best approximation from an algebraic combination of approxi-

mants.

B IB LIOGRAPHY

10.

ll.

12.

13.

BIBLIOGRAHIY

E.W. Cheney, Introduction to Approximation Theory, McGraw-Hill,
Inc., New York, 1966.

F. Deutsch, Uniform Approximation with Interpolatory Constraints,
J. Math. Anal. Appl. 24 (1968), pp. 62-79.

J.S. Frame, A.Variation of Newton's Method, American Mathematical
‘Monthly, Vol. 51 (1944), pp. 36-38.

W. Rammerer, Optimal Approximations of Functions; One-sided
Approximation and Extrema Preserving Approximations, Doctoral
Thesis, University of‘Wisconsin, 1959.

S. Karlin and W.J. Studden, gghebyscheff Systemmz With Applica-
tions in Analysis and Statistics, Interscience Publishers, New
York, 1966.

 

R.F. King, Improved Newton Iteration for Integral Roots, Math.
Comp. 25 (1971), pp. 299-304.

R.F. King and D.L. Phillips, The Logarithmic Error and Newton's
Method for the Square Root, Comm. ACM 12 (1969), pp. 87-88.

H.L. Loeb, Un Contre-exemple a un Resultat de M. Claude Gilormini,
C.R. Acad. Sci. Paris, Ser. A 266 (1968), pp. A237-A238.

H.L. Loeb, D.G. Moursund, L.L. Schumaker, G.D. Taylor, Uniform
Generalized Weight Function Polynomial Approximation'With Inter-
polation, Siam J. Numer. Anal. 6 (1969), pp. 284-293.

H.L. Loeb, D.G. Moursund and G.D. Taylor, Uniform Rational
Weighted Approximations Having Restricted Ranges, J. Approx.
Theory 1.(1968), pp. 401-411.

G. Meinardus and G.D. Taylor, Optimal Starting Approximations
for Iterative Schemes (to appear).

G. Merz, Padésche Nﬂherungsbrﬁche und Iterationsverfahren hBherer
Ordnung, Doctoral Thesis, 1967.

D.G. Moursund, Chebyshev Approximation Using a Generalized
Weight Function, Siam.J. Numer. Anal. 3 (1966), pp. 435-450.

110

14.

15.

16.

17.

18.

19.

20.

21.

22.

23.

24.

25.

26.

27.

28.

29.

111

D.G. Moursund, Computational ASpects of Chebyshev Approximation
Using a Generalized Weight Function, Siam J. Numer. Anal. 5
(1968), pp. 126-137.

, Optimal Starting Values for the Newton-Raphson Ca1-
culation of ,/x, Comm. ACM 10 (1967), pp. 430-432.

D.G. Moursund and G.D. Taylor, Optimal Starting Values for the
Newton-Raphson Calculation of Inverses of Certain Functions,
Siam J. Numer. Anal. 5 (1968), pp. 138-150.

, Uniform Rational Approximation Using a Generalized
Weight Function, Siam J. Numer. Anal. 5 (1968), pp. 882-889.

I. Ninomiya, Best Rational Starting Approximations and Improved
Newton Iteration for Square Root, Math. Comp. 24 (1970), pp.
3 91 '404 e

A. Perrie, Uniform Rational Approximation with Osculatory
Interpolation, Doctoral Thesis, University of Oregon, 1969.

J.R. Rice, The Approximation of Functions, Vol. I, Addison-
Wesley Co., Reading, Mass., 1964.

T.J. Rivlin, An Introduction to the Approximation of Functions,
Blaisdell Publishing Co., Waltham, Mass., 1969.

H.E. Salzer, Note on Osculatory Rational Interpolation, Math.
Comp. 16 (1962), pp. 486-491.

L. Schumaker and G.D. Taylor, On Approximation by Polynomials
Having Restricted Ranges, MRC Tech. Summary Report 835, Madison,
Wisconsin, January 1968.

P.H. Sterbenz and C.T. Fike, Optimal Starting Approximations
for Newton's Method, Math. Comp. 23 (1969), pp. 313-318.

G.D. Taylor, Approximation by Functions Having Restricted
Ranges III, J. Math. Anal. Appl. 27 (1969), pp. 241-248.

, 0n Approximation by Polynomials Having Restricted
Ranges, Siam J. Numer. Anal. 5 (1968), pp. 258-268.

, An Improved Newton Iteration for Calculating Roots
Which is Optimal (to appear).

, Optimal Starting Approximations for Newton's Method,
J. Approx. Theory 3 (1970), pp. 156-163.

K. Taylor, Contributions to the Theory of Restricted Polynomial
and Rational Approximation, Doctoral Thesis, Michigan State
University, 1970.

112

30. J.F. Traub, Iterative Methods for the Solution of Equations,
Prentice-Hall, Inc., Englewood Cliffs, N.J., 1964.

31. , On a Class of Iteration Formulas and Some Historical
Notes, Comm. ACM 4 (1961), pp. 276-278.

3461

m1
ms
”0
m3
“0

Ilﬂ'lulmmm