M lllllllllllllllllllllllllllllll Ill!Ill/ll////II///IIll/llIll/II/lll/lllllllllI/II/I/ll Ill , 3 1293 00533042! I, LIBRARY Michigan State Universitygfi This is to certify that the dissertation entitled SEQUENTIAL ESTIMATION OF FUNCTIONALS OF THE SURVIVAL CURVE UNDER RANDOM CENSORSHIP WITH APPLICATIONS IN M-ESTIMATION presented by MOHAMMAD HOSSEIN RAHBAR has been accepted towards fulfillment of the requirements for ( | Ph.D. degreem Statistics and Probability .' Date July 25, 1988 MS U is an Aflinmm'w Action/Equal Opportunity Institution 0-12771 1V153I_J RETURNING MATERIALS: Place in book drop to uaauuss remove this checkout from “ your record. FINES will be charged if book is returned after the date stamped below. SEQUENTIAL ESTIMATION OF FUNCTIONALS OF THE SURVIVAL CURVE UNDER RANDOM CENSORSHIP WITH APPLICATIONS IN M—ESTIMATION by MOHAMMAD HOSSEIN RAHBAR A DISSERTATION Submitted to MICHIGAN STATE UNIVERSITY in partial fulfilment of the requirements for the degree of DOCTOR OF PHILOSOPHY Department of Statistics and Probability 1988 /" A7 I / ”as 6‘ .f ABSTRACT Sequential mtimation of functionals of the survival curve under random censorship with applications in M-atimation. by Mohammad Homein Rahbar Sequential point and interval estimation procedures for functionals of the survival curve F (of the form NdF and leVz) are considered when the underlying observations may be subject to random censorship. In the point estimation problem, the loss is measured by the sum of the squared error of the estimator and cost of observations made with per unit cost c being constant. The sequential estimator defined here is shown to be risk efficient and normal as c tends to zero under certain regularity conditions on functions w, F and the censoring distribution. For the interval estimation, the sequential procedure is shown to be consistent and the corresponding stapping rule is shown to be efficient as the width of the interval decreases to zero. In both estimation problems, the asymptotic distribution of the stOpping rule is obtained. Finally, as an application the consistency and efficiency of a sequential fixed width interval estimation procedure using M—estimation is shown for the location parameter in a location model when the error distribution is symmetric and continuous and the censoring distribution is continuous but unknown. To my parents and my wife and daughter iii ACKNOWLEDGEMENTS I wish to express my sincere thanks to Professor Joseph C. Gardiner for his excellent guidance, continuous encouragement, careful reading of this thesis and especially his patience during the preparation of this dissertation. I would also like to thank Professors James Hannan, Hira L. Koul, Habib Salehi and RV. Ramamoorthi for serving on my guidance committee. My special thanks go to Professors James Hannan and Hira Koul for their helpful discussions and careful reading of this dissertation and for giving useful suggestions, which helped greatly in the final presentation of this thesis. I would also like to thank Professor James Stapleton for his help in doing some simulation studies. I would like to thank my wife Afsaneh, my daughter Elaheh and my parents for their moral support during the preparation of this dissertation. Finally, my special thanks go to secretaries Cathy Sparks, Loretta Ferguson and JoAnn Peterson for their help in typing the manuscript. iv TABLE OF CONTENTS Chapter Page 0 Introduction and summary ....................................................................... 1 1 Preliminaries and some notations ............................................................ 6 2 Sequential point estimation of functionals of the survival curve under random censorship. 2.1. Introduction ................................................................................... 11 2.2. Examples ....................................................................................... 12 2.3. Main results of Chapter 2 ........................................................... 13 3 Sequential fixed width interval estimation of functionals of the survival curve under random censorship. 3.1. Model ........................................................................................... 20 3.2. Main results of Chapter 3 ........................................................... 21 4 Sequential fixed width confidence interval for a location parameter under random censorship using M-estimation ..................... 24 Appendices Appendix A: The almost sure representation and the asymptotic distribution of the estimator of the asymptotic variance, 02, of the estimator of functionals, of the form 0(F)=]1/1dF and 0(F)=]qub, of survival curve F under random censorship ........................................................................ 33 Appendix B: Rate of convergence of the estimator of the 2 asymptotic variance, 0 ......................................................................... 40 References ........................................................................................................... 45 Chapter 0 Introduction and summary: The problem of estimation of various functionals of the survival curve from censored data is of fundamental importance in epidemiological and reliability studies, clinical trials and life testing. In most clinical trials ethical reasons and the paucity of laboratory specimens compel consideration of statistical procedures in which the sample size is not specified in advance of experimentation. In this thesis, sequential point and interval estimation procedures for functionals of the survival curve F (of the form jtde and dew) are considered when the underlying observations may be subject to random censorship. When censoring is present, as is generally the case in several survival studies, one observes a random sample {(Zi,&l): 1$i_<_n} where 2i: min(Xi,Yi) and 6i: [Xi_<_Y.] with life times’Xi's, having common survival curve F and censoring times Yi's, independent of Xi's, having common survival curve G and [A] denoting the indicator function of the set A. Consider the natural estimator of 0, “011:! t/Jan, where PD is a suitable estimator of F based on the above random sample. In the point estimation problem we consider the loss structure Ln(c) = a(dn- 602 + cn where a is a given positive constant. The associated risk is Rn(c)=E(Ln(c)). The problem is to determine the sample size which minimizes the risk Rn(c) for a given positive cost per observation c. In the interval estimation problem we construct a confidence interval for 0(F) of prescribed width 2d and coverage probability (1-20) where 02. Sequential point estimation of location based on some R—, L-, and M—estimators are discussed in Sen (1980) and Jureckova and Sen (1981). Sen's book (1981) has an excellent survey of the above mentioned articles. Gardiner and Susarla (1983) were the first to consider the sequential point estimation of the mean problem, in a nonparametric context when censorship is present. They did not find the asymptotic distribution of the stopping time except for the case of an exponential survival time. (See Gardiner, Susarla and van Ryzin (l985b)). In this thesis we pr0pose a sequential point and an interval estimation procedures for functionals of the form j¢dF and [Fdw when the underlying observations may be subject to random censorship. We shall show that the sequential point estimation procedure is risk efficient and asymptotically normal as c tends to zero, under certain regularity conditions on functions a, F and the censoring distribution. For the interval estimation problem, the sequential procedure is shown to be consistent and efficient as the width of the confidence interval decreases to zero. In both estimation problems, the asymptotic distribution of the underlying stopping time is obtained. Thus our results are generalizations of Gardiner and Susarla (1983) and Gardiner, Susarla and van Ryzin (1985b). In Chapter 1, we collect various preliminaries and necessary prerequisities of asymptotic prOperties of the product—limit (P-L) estimator. Most of the results are taken from Gill (1983), Foldes and Rejtii (1981), Cheng (1984), Lo and Singh (1986), Gardiner, Susarla and van Ryzin (1985a) and Schick, Susarla and Koul (1987). Hence some of the proofs have been omitted. We also state the results regarding to the asymptotic normality and the almost sure representation of the estimator of 02, the asymptotic variance of n1/2(I¢an—]1/JdF), which are new and crucial for obtaining the asymptotic distribution of stopping times in Chapters 2 31. 3 and we give an almost sure representation of n1/2(f¢an-/¢dF), for (b in a class of functions \Ill (defined later in Chapter 1). Chapter 2 is divided into three sections. The first two deveIOp our model and some examples are discussed. In Section 3 the risk efficiency of the sequential point estimation procedure and the asymptotic normality of the underlying stopping time are presented. In Chapter 3 we discuss the prOperties of the sequential fixed width confidence interval and the related theorems. Chapter 4 deals with a sequential fixed width confidence interval procedure for the location parameter of a location model under random censorship when the distribution of the error is symmetric around zero but unknown. Finally, proofs of Theorems 1.3 and 1.4, the asymptotic normality, almost sure representation and a rate of convergence of the estimator of 02, the asymptotic variance of n1/2(]¢xan-jt/xlF), which are crucial for obtaining the asymptotic distribution of the stOpping time in Chapters 2 &. 3, are placed in Appendices. Chapter 1 1.1 Preliminaries and some notations Let {Xiz i21} be a sequence of nonnegative iid rv's (survival times) with continuous survival function F on R+=[0,oo), F(0)=1. The corresponding censoring rv's {Yi: i21} are also assumed iid, independent of {Xiz i21}, with continuous survival function G taking values on R+ and G(0)=1. The observable data are {(Zi,6i): i21} where 2i: min(Xi,Yi), 6i: [XiSYi] and [A] denotes the indicator function of the set A. For an estimator of F we select 11’ which was first introduced by Kaplan and Meier (1958), based on {(Zi,&l): 15i5n} defined by the product—limit (P-L) estimator, F 111) F (t) =ll'll KwiH] for tt, t>0, d Z < Z < ....< Z th d W ere ( ) +i=l[ l ] _ an (1) (2) (n) are e or er statistics of {Ziz lsign}. By the continuity of F and G ties among the observations may be disregarded with probability one. Throughout we shall need the following notations. Let (Xj,Bj), 3'21 be copies of the R+x {0,1} with Borel a—fields and ill * m (X ,A ) = II(Xi,Bi). Let P = PF G denote the product measure induced by i=1 , {(Zi,6i): i_>_l} on A* and E = EF,G denote the expectation under P. Let P = {P: F, G 6F} where F = {the class of continuous survival functions} and 0 = 0(F) be a real valued functional on F. The estimator Rn: 0(Fn) of 0 has been studied by many authors including Breslow and Crowley (1974), Wellner (1982), Gardiner and Susarla (1983), Gill (1983), and Millar (1985). For {On} to be consistent for 0, certain conditions have to be satisfied by the pair (F,G). For example, in Schick, Susarla and Koul (1987) is stated that "when estimating 0(F), the p—th quantile of (1-F), we need at least the condition G(0(F))>0, for the sample quantile to be consistent. Generally, 0(Fn) is not consistent for 0(F) if 0(F) depends on parts of F that lie beyond the upper support point 1G: inf{x: G(x)=0} of G." Similarly define 7F and let 7=rH= min(rF,rG). Let (Z,6) be a mm of (Z1,61). Let E(t) = P[Z5t,6=1], fi(t) = P[Z5t,6=0] and H(t) = P[Z>t]. Note that E(s) = —(])stG, 17(3) = £ch and H(t) = 1 — E(t) — E(t). Throughout, except in Chapter 4, all unspecified integrals are considered on R+. Throughout this thesis L-r stands for (1 /L)r for any positive function L; —2-1 denotes "convergence in distribution"; as. stands for almost sure with respect to probabitity measure P; N(p,a2) will stand for the normal distribution with mean u and variance 02 and the index in the summations runs from 1 through 11 unless it is otherwise specified; \II denotes the class of all real valued monotone functions on R+ and let \II1 = {$91! : ti; is constant on [T,oc) for some T_o and J(Z,6) = - I§(Z,6;t)F(t)d¢(t) (1.1.4) = 6A(Z)H'1(Z) — 131(2)- The following lemma is taken from Theorem 1 of L0 and Singh (1986) and Theorem 3.4 of Gardiner, Susarla and van Ryzin (1985a) and will be stated without proof. Lemma 1.1: If F and G are continuous‘ and T0, (1.1.5) Fn(t)— F(t) = F(t){n‘12 ((Zi,6i;t) + rn(t)} with (1.1.6) suplrn(t)| = 0((11-1111 n)3/4) a.s., and (1.1.7) sup urns)", = curl) where H . "p denotes the LD norm and sup is taken over [0,T] and £(Z,6;t) is as in (1.1.3). Assumption 1.1: Let T be a positive constant such that T0, {us/21211,- 0|3: n21} is uniformly integrable (UI). , Proof: Recall the representation til/201; a) = n‘1/221(zi,6i) - n1/2r;. Note that it suffices to show that {In-1/22J(Zi,6i)|sz n21} and {Inl/zrDs: n21} are both UI. Since for t>T, A(t) = 0 and H(T)>0, {J(Zi,6i): i21} is a sequence of bounded random variables. Thus an application of the Marcinkiewicz-Zygmund inequality implies that for any p>0, sup Eln‘l/ 2>3.I(zi,15i)|1’0, (1.1.10) Elr:1lp= carp/2). Note that (1.1.10) implies that {lnI/Zr;|s: n21} is UI, which completes the proof of the theorem. 0 Remark 1.1: Schick, Susarla and Koul (1987) gave a sufficient condition for an iid representation of n1/2(]¢an— [t/JdF), where well, the class of all real valued monotone functions on R+. They have shown that 1/2EJ(Zi,6i) = op(1). 2 . 111’ (fwan- 11x11“) - n Our Theorem 1.1 gives the corresponding almost sure representation for (1)0111, a subclass of \II. 10 Theorem 1.3 (The almost sure representation and asymptotic normality of 2%): Let 1151111. For F, GEF, n1/2(:7121- a2) = 11‘1/213vi + n1/2Rn 5 where Vi = 2Wm + wi,2’ Q Wi,l = II! €(Zi26i;8)F(8)d¢(s)}drlli 2 -2 -l 2 Wi,2 = A (Zi)H (Zi)6i- 2IH [Zi>-]dI‘21 + a , and n1/2Rn,5—-i 0, 3.3.. Furthermore - :1: (1.1.11) n1/2(UI21- e2)_9—. N(0,7 ), where *_ 2 —1 -2 4 (1.1.12) 7 _ 611‘11 (11‘21 — 4111 1‘11 (11‘31 + [H (11‘41 — e . Proof: See Appendix A. Theorem 1.4: Let 1119111. For F, GEF, and for each (>0, and all r0. Therefore it follows from (2.1.2) that (2.1.3) E(bn— t1)2 = {102 + o(n_1), as n —-. .0. Now if a is known, then the risk (2.1.4) Rn(c) = E Ln(c) .—. h‘lae2 + on + 0(n-l) 1/2, with is approximately minimized by the BFSSP, 110 g be, where b = (a/c) corresponding minimum risk (2.1.5) R0 = Rnog 2on0. However, since a is unknown, the BFSSP cannot be used and therefore we describe a sequential procedure for choosing a sample size whose risk will be close to R0 for small c. Let . ‘ —h (2.1.6) Nc = 1nf{n2nlc: n2b(an+ n )} 11 12 where h is a positive constant to be selected later. Since NCZbNgh a.s., we may assume file: int(bll(1+h)) where int(x) denotes the greatest integer g x. Then our scheme utilizes the estimator “ON of 0 with associated risk 2 7 *- *- L — be 2 EN ( .1. ) R —-Rc—E Nc— a E( Nc- 0) + c c’ We now consider some examples before we present the main results of this chapter. 2.2 Examples 1. Aformofwimorizedmean: Let T0, ¢(x) = (xAT)k[x,>_T] and FeF. Consider in: 4an as an estimator of 0 = -]¢dF. Note that if the mean of F is known, we can estimate the variance of F by taking On: 411an where ax) = ax) = (unites - 11% and up denotes the mean of F. 3. A form of mean residual life: The mean residual life function is defined by p(t) = F‘1(t) {muses = - “1(t){°°(s-t)dr(s). T Our interest is in estimation of 0(F) = F-l(t)] F(s) ds, for a fixed t, t1/2 and N as in (2.1.6), then (2.3.4) 1/ 2(01N - 1).—e N(0, 7 *4/(40 )) and (235) N1/2-n1/2D_.. No.7 “1604 )) where 7 = 6] 13121111“ -4] it"lrlldr31 + f H’Zdr41 - Proof of Theorem 2.3.1: By definition of Nc’ lim Nc=m, a.s., also if 0 (a/c2)1/2(0N + Ngh). 1 c1 Hence by definition of N c we obtain Nc 2 Nc a.s.. Thus Nc is nondecreasing 1 2 as c10. From (1.1.13) and the Borel—Cantelli lemma, it follows that c1 {0121: n21} is a strongly consistent estimator of 02. Since Nolan, a.s., 15 (2.3.6) 01% -—i 02 a.s.. Recall that b = (a/c)1/2. By definition of N, we can write ‘ “ —h “ —h baN$b(cN+N )$Nn], we have that «map k‘h), for all ke{n1,... ,n}. For c sufficiently small and n2n3, P[N>n]< 17;], > b 1n- n‘h] IA P[an—a>b 13n -b1n0 -n] P[an— a > (1/2) 60] P[|s§- o2|> (1/4) 5202]. IA IA The last relation holds because 1231-02: (on-02)22n+2a(e-a) 2 (1/4) £2 02 + 026 2 (1/4) 62 02 Therefore using an integral approximation for sums and similar arguments as in the previous case leads to (2.3.8). This completes the proof of Lemma 2.3.1. 13 Now we are ready to prove (2.3.2). Let 0<£<1, D = [n2n3] - [m Note that N5n2, implies that Nnol S 1-6. Hence Ean'l- 1| 5 (1-6) P[Ngn ] + c + n"1 EP [N>n] + P[D] o 2 0 HM = 0(c(r-l)/2(l+h)) + c + n610(c(r-l)/2) + 0(1) = 0(1), since r>l and c is arbitrary. This completes the proof of Theorem 2.3.1. a Proof of Theorem 2. 3. 2 (Risk efficiency): Note that R 1101 =(2en0)"1{a E(oN— s2 + c EN} =an(2e)’1{0 lsz — (1)2 + c EN/no}. Thus the theorem will be proved once we establish (2.3.9) lim a(en0)'1E(bN— (1)2 = 17 For this, clearly it suffices to show that (2.3.10) lim a(en0)’1 E{(bN- 22(9)} = 1 and (2.3.11) lim a(en0)'1E{(bN — 102(5)} = 0. First consider (2.3.11). Note that by the maximal inequality for reverse martingales, (1.1.8) and (1.1.10), sup Iln(0,, — 02H,= 0(1). M n15n5n2 Therefore by the Holder inequality, Lemma 1.1, Lemma 2.3.1 and similar arguments as in the proof of Theorem 1.2 we get, for s>2 and 0n3]} —- o(c ). Thus by last two rates, (2.3.11) is immediate. Note that for (2.3.10), it suffices to show that, for some c0, (2.3.12) {a(m0)-l{(bN— 0)2[D]}: 01, sup E{noc-2(3N - o)2[D]}81/2, the limiting distribution of 110/ 2(N/n0- l) is the 1/2 n1/2 -1/2 [0 +n0 same as that of 110 (UN/0 -— 1). From (1. 1.11) we obtain n1/2(aN - 01—117 No 7/0102» which is equivalent to n1/2taN/a - 1)—11—» No.7 /(4a4 )) which implies (2.3.4). By taking the square root transformation, we obtain N1/2- n01/ 2 —D-7 N(0,7*/(1604)) which completes the proof of Theorem 2.3.3. o Chapter 3 Sequmtial fixed width confidence interval for functionals of the survival curve under random censorship 3.1 Model Suppose a random sample of size n, {(Zi,6i): 15i5n} has been observed. We wish to construct a confidence interval In for 0 = 0(F) = Il/JdF and 0 = leifl, of prescribed width 2d such that, asymptotically as 11 tends to infinity, the coverage probability is at least (l—2a). We assume F, G EF and $8111. Note that by Remark 2.3.1, it suffices to consider 0 = fde and w as in Assumption 1.1. Notice that for each 11 an apprOpriate estimator of 0 is in: jwan, where ED is the P—L estimator of F. In the rest of this chapter all unspecified limits are considered as d tends to zero. For a given positive real number (I and a€(0,l/2), in view of (1.1.9), let us take 111 = (bu-d, Rn+d) with n = nd defined by (3.1.1) nd = inf {k21z k 2 d-2 z: 02} where 27 is the upper 1007 percentage point of the standard normal distribution. Then we have lim P [061%] = l-2a and . 2 —2 -2 _ hm {ndd za 0 }— 1. Since F and G are unknown, the specification of the "Optimal" sample size in (3.1.1) cannot be made. We are therefore led to construct a sequential procedure in which the sample size is a positive integer valued random variable N = N d’ and the desired confidence interval for 0 is 20 21 IN = (RN-d, IN+d). Motivated by (3.1.1), we define the stOpping time N = Nd’ by (3.1.2) Nd = inf {k2n M: k 2 b(&fi + {11)} where b=d2 grand h isa positive constant. Since N d>b N—h, a..s ., we may assume nl=nl d=b1/(1'1'h). Note that nd, the optimal sample size, is asymptotically equivalent to 110 = ”M =b02, that is, ndnficll —7 1. In the rest of this chapter we shall drop the subscript d in Nd’ 11d, n0d and on various entities when there is no possibility of confusion. All unspecified limits are considered as (1 tends to zero or b tends to infinity. 3.2 Main results of this chapter: The following results hold under Assumption 1.1. Theorem 3.2.1: For each positive real number h and F,G€F, (N,IN) is both consistent and efficient. In fact we shall show that for each PEP, (3.2.1) 11111 P[0e1N] = l—2a and (3.2.2) lim E |Nn31 - 1| = Theorem 2.2: Let h>1/2 and F, GeF, then (3.2.3) n1/2(n 01N - 1) —11-7 N(0,7 /a‘1 ) or equivalently (3.2.4) N1/2— 19/211... N(0, 7*‘1/(4o )) 1|: where 7 is as in (1.1.12). Proof of Theorem 3.2.1: By definition of Nd’ lim Nd = 00, a.s.. Furthermore if 0n] = c(d2("1)) n_n3 where 112 = n2d = int(no(l-c)) and 113 = n3d = int(no(1+c)). Now by (3.2.8) and Lemma 3.2.1 and arguments similar to those used in the proof of the Theorem 2.3.1, (3.2.2) obtains. From representation (1.1.8), 2 Theorem 1.2, Anscombe's Theorem and (1.1.9), it follows that, (3.2.11) N1/2(1N — a) —11-7 N(0,o2) from which we get 23 P[0elN] = P[sN— d s o s bN+ d] (3.2.12) = P[N1/2|IN— 0| 5 d N1/2]. Since d N1/2__. oz a.s., (3.2.12), (3.2.11) and Slutsky's theorem establish (3.2.1). a Proof of Theorem 3.2.2: From (3.1.1), (3.1.2) and similar arguments as the one used in the proof of the Theorem 2.3.3, show that the asymptotic a, distribution of n(1)/2(Nnol - 1) is the same as that of "(Ill 20113 - 02)/02, from which (3.2.3) is immediate. Now (3.2.4) follows from (2.3.3) by square root transformation, that is, nit/21111211151” -1) 2» mar/(47:41). and the converse is similar. This completes the proof of Theorem 3.2.2. o Chapter 4 Sequential fixed width confidence inteval for a location parameter under random censorship using M—estimation 4.1 Model Let c and Y be independent random variables and (4.1.1) X = A + c, A69 where 9 is an Open subwt of the real line R. with compact closure. The following notations will be usw only in this chapter. Let F(t) = P[c>t], C(t) = P[Y>t], FA: F(--A), HA: FAG, 12A: ip(--A), a t H A(t) = — j GdF A’ {Xiz 121} be iid rv's with the same distribution function _ 00 as X, {Yiz i21} be iid rv's, independent of {Xiz 121}, with the same distribution function as Y, and (l-F), (l—G) are continuous distribution functions and all of the unSpecified integrals are on the whole real line. When dealing with survival time data, one can take Xi's to be log10 or In of the survival times. The problem considered in this chapter is the sequential interval estimation of A using M-estimation based on {(Zi,0i): i21} where 2i: min(Xi,Yi) and 6i: [XiSYi]. Let Fn be the P—L estimator based on {(Zi,6i): ISign}. An M—estimator of A is defined as the solution in t of (4.1.2) An(t) = [fix-t)d11‘n(x) = 0 for some given function 111. In the absence of censorship, if I/J(X,A) = —%log f(x-t)|t___ A’ where f is the density of the measure induced by X on R with respect to the Lebesgue measure, then the solution to (4.1.2), is the Maximum Likelihood Estimate (MLE). Huber (1964) proposed M—estimation as a generalization of (MLE), with desirable robustness pr0perties. Two important examples which are mostly used for the problem of 24 25 locating the center of a symmetric distribution, say A, are the Huber M—estimate by taking Huber 1]) function defined by (4.1.3) 7p(x) = {(—TVx)AT} where T is a positive constant and Tukey's biweight (4.1.4) 200 = x(1—x2)2 [IxISII in which case the defining equation becomes I¢(x—t)dFA(x) = 0. In practice it is usually necessary to estimate the scale parameter of the underlying distribution, but this will not be considered in this thesis. 4.2 Assumptions and some preliminary mults: Assumption (Al): F is symmetric about zero and F,G are continuous. Assumption (A2): Let M and T be constants such that IAISM, for all A69, G(T+M)>0 and F(T)>0. Let 12 be a monotone nondecreasing, continuous, skew symmetric function and has two continuous bounded derivatives 11),, (0" on (-T,T) and 2 is constant on {x: x2T}U{x: xs-T}. Assumption (A3): 7 = Ilfl’dF at 0. Assumption (A4): t = A, is an isolated root of the equation (4.2.1) AF(t) = [fix—t)dFA(x) = 0. Remark 4.2.1: For nondecreasing 1/1, An may be written as An = 1/2 (sup{t: An(t)<0} + inf{t: ln(t)>0}). The next lemma is similar to Lemma 7.2.A of Serfling (1980) which has been considered for the case of no censoring. Now we are following the same lines of proof to get similar results in the presence of censorship. Lemma 4.2.1: Under Assumptions (A2) and (A4), a sequence of solutions {An} to the equation (4.1.2) exists and converges to A, a.s.. Proof: Let c be a given positive real number and An as in Remark 4.2.1. Then AF(A—c)<00 be real numbers. If P[IUn-a|2£] = O(n-s) = P[an-bIZc], for every c>0, then P[|Un/Vn— a/blZc] = 0(n‘1), for every (>0. Now we shall show that for each positive c and some r>l, (4.3.3) P[|a§(m) - 7A| 2 c] = 0(n’1). Note that by Lemma 4.3.1 and the Assumption (A3), it suffices to show that (434) Pllw’ (x—m)dF,, (x) - 7| 2 e1= 0(n ) and 2 —2 — (4.3.5) P[| IA nan- [A AHA dHA| 2 c ]= 0(n 1') First consider (4. 3. 4). By Taylor's expansion Ill) (x-m)an (X)= I 1/1 (x-A)<1Fn (X) + (In-MN (x-An )an (X) 30 where An is between A and m. Now by the above expansion, Assumption (A2) and integration by parts we have s| /¢’(x—m)dfrn(x) — 7|215 const.{s|m-A|21+ Isle“. FA|21d¢’(x—A)}. Now by (4.2.5), the representation (1.1.8), Lemma 1.1 and Marcinkiewicz— Zygmund inequality we have EII¢'(x—r§1)dfin(x) - 7121 = 0(n‘1) which, by Markov's inequality, implies (4.3.4). A similar calculation to that done in the proof of Theorem 1.4, which is shown in Appendix A, and consideration of the extension explained in Remark 4.2.3, leads to (4.3.5). This completes the proof of (4.3.3). Recall that nd, the "optimal" sample size, is asymptotically equivalent to no = no d = bag. In the following, whenever there is no possibility of confusion, the subscript d of N (1’ nd and nod will be dropped. Now we state the main results of this chapter. Theorem 4.3.1 Under Assumptions (A1) through (A4) and for h>0, (N,IN) is both consistent and efficient, that is, for each A69 and PEP, (4.3.6) lim P[AEIN] = 1—2a and (4.3.7) lim E|N n51 - 1| = 0 Proof: Note that by definition of Nd’ Nd -—-1 co, a.s.. Furthermore it can be shown that N d is nondecreasing as (1 decreases (see Chapter 2 or 3 for similar arguments). Therefore by Lemma 4.2.1, AN -+ A, a.s., and (4.3.3) together with Borel—Cantelli lemma yield .312, (m) ——7 oz, a.s.. . From the definition of Nd and 11d and arguments similar to those in Chapter 3 we obtain, for d sufficiently small, 31 (4.3.8) 1161 N -» 1 a.s., and (4.3.9) d2 no -» 221 7: Note that, as in Chapter 3, for showing (4.3.6) we need to show that (4.3.10) N1/2(AN- A) ll. Mom/2;). Recall that 1 2 ‘ - 1 2 N / (AN- A) = 7N 1N / LN [71.1110] and _ -1 where J A and 1* NA are the same as J and r; ,given by (1.1.4) and Theorem 1.1, respectively when F is replaced by F A' Now write 1 2 “ -l — * —l 1 2* N/(AN-A) = {(7N -71)N1/21NA+7N/NA - 7‘1N‘1/2TN -(1‘1N‘1- 7'1) N‘1/2 TNAlliNtol where TN, A = N'1EJ A(zi,5i). Therefore by (1.1.8), Anscombe and Slutsky's Theorems, (4.3.10) is immediate and so is (4.3.6). To show (4.3.7) we need similar rates as the one given in Lemma 3.2.1. Note that all we need to get such rates is (4.3.3). Hence, following the same lines of proof of Lemma 3.2.1 and Theorem 3.2.1, we obtain (4.3.7) which completes the proof of Theorem 4.3.1. 1: Remark 4.3.1 The asymptotic normality of the st0pping time Nd’ can be obtained by arguments similar to those given earlier in Chapters 2 & 3 but obtaining the exact form of the asymptotic variance is animmersely tedious calculation. Thus one can establish n1/2(Ndn1 no - 1) —1-1—» No.11) 32 and 1 D Nd/Z- n5” -—-1 N(0, /4) but the exact computation of fl is difficult. APPENDICES Appendix A The almost sure representation and the asymptotic distribution oftheestimatoroftheasymptoticvarianoeoftheestimator of functionals of the form 0(F)=l¢dF and KF)=IFd¢, of survival curve F under random censorship We shall present here the proof of Theorem 1.3. To the best of our knowledge this is a new result. In the sequel $9111 and all the limits are considered as 11 tends to infinity. Note that by Remark 2.3.1, it suffices to consider (1; as in Assumption 1.1. Recall that t . . 131(1) = 1 Mad, t 2 o, 1,) =1,2,3,4, o t z K(t) = 1 + 2 [Zi>t], C(t) = 111‘de, KT, 0 A(t) = I Fdzp, An(t) TI gnaw, ost.] — H} dH n-12[H_l{[Zi>o] — H} dI‘21. Recall the representation (1.1.5). We rewrite U1 as U1 = ”3 + R11,4 where U,= “'12n(”4(zi,6,;s)F(s)d¢(s)}dr11 35 and co R11,4" “I r111111221131 and rll is as in (1.1.5). Hence Ill/2G: - a2) = 2nl/2U3 - 2n1/2U2 (A.3) + n-1/22{A2(Zi)H-2(Zi)6i- 772} + n1/2Rn 5 where Rn,5= D1 + R?’3 + 2 Rn,4° Ajmr some alg/ebra on (A.3) we get 1 2 ‘2 2 _ -1 2 l 2 n (an—o)—n 2Vi+n Rn,5 where ,1 '2 Wi,2’ w,l =111°°4(z,,6,;s)F(s)d“snarl1 and 2 -—2 -1 2 Under assumptions of Theorem 1.3, {Viz i_>_l} is a sequence of bounded iid rv's with mean EVl = EW =EW =0. 1,1 1,2 To obtain the almost sure representation of 02 we need to show that nl/ 2R115 -+ 0 a.s., * in addition if we show that the variance of V1 is 7 , then (1.1.11) will be achieved by the central limit theorem. First we compute the variance of V1. Let (Z,6), V, W1 and W2 be copies of (Z1,61), V1, W1,l and W1,2 respectively, then (A.4) Var v = E(2w1+ w )2 = 4wa + 153w2 + 4EW1W 2 2 2’ In the following we use repeatedly the Fubini Theorem, integrations by parts and the identities 36 :3 t m is (IgdH)2 = mung) gamma), Es(Z) = Isd(-H) and fl E{s(Z)5} = IsdH- Now we are ready to compute Ewg. Since W2 has mean zero, E{A2(Zi)H_2(Zi)6i- 21H'1[zi>-]dr21} = — 02. Therefore Ewg = E{A4(Z)H‘4(Z)6} + 4E(1H“1[Z>-]dr21)2 — 4E{A2(Z)H‘2(Z)61H‘1[Z>-]dr21} - 04. Note that E(fH-1[Z>-]dl‘21)2 = 2E{(f((f)8H_1[Z>o]dI‘21)H—l(s)[Z>s]dI‘21(s)} = 21((/)°11‘1111~21)¢1r21 = 21H—1([de‘2l)dI‘21 and _ m E{A2(Z)H 2(2)5111 1[Z>-]dI‘21} = [H 1([ dI‘21)dI‘21. We simplify to obtain 2 —2 —1 ‘2 4 (A.5) 13w2 = [H dI‘41 + 41H (1 dI‘21)dI‘21 - a . To simplify notation, throughout £(Z,&,t) will be abbreviated to ((t). Since E((s)§(v) = C(sAv) for s,v<1', w? = Hummus) WNW? no t on = 2E{I{({ 51342,!) (111 {Pdcdrnondrncn on t on = 21{{ Fang) {‘Il C(sAv)F(v)dw(v)}dr,,(u)}d¢(s)}druc). 37 Now consider the most inner integral on two sets [85v], [s>v] and note that on the set [35v], uStSsSv. Hence swf= 2/{{°°Acrd7p}rn(t)drn(t) no t 8 + 21% F(SHé {I CFdi/Jldl‘11(U)}d¢(S)}dF11(0- u Integration by parts on the inner integral of the second term on the R.H.S. of the last equation yields 2 _ 2° 2 Now integration by parts on the middle term of the R.H.S. of the last equation gives 2 _ 2 (A.6) EW1 — [(2I‘11 + 1‘22)dI‘21. Now we consider EW1W2. Q EW1W2= E{A2(Z)H 2(2):)”; C(ZAs)F(s)dw(s)}dI‘11} — E{A2(Z)H-3(Z)5f{£mFd¢}dF11} — 2E{][mC(ZAs)F(s)d1/J(s)d1‘1llH-l[Z>'ldrzl} + 2E{6H’1(Z){j{émF(8)d1/J(8)}d1‘ll}IH-1[Z>'1dr21} (A°7) = E{Q1- Q2“ 2Q3"’ 2Q417 333'- To compute EQI, consider Ql on the two sets [Z53] and [Z>s]. Therefore, by Fubini Theorem EQl = I{[mE(C(Z)A2(Z)H’2(Z)61258])F(s)d¢(s)}dl“11 + [{[wmA2(Z)H-2(Z)5[Z>8]}C(s)F(s)d(b(s)}dI‘11 = (1/2)1(1°°r22Fd7/7)drn+ lumccxfdrm)F(s)d4-ldr2,) + E (l[mC(s)[Z>le(8)d¢(8)dI‘11)(IH'1[Z>~ld1‘21) = I[m{(l)sH-l(u){?l Cd(-H)}dI‘21(u)}F(s)d1()(s)dI‘11 co co _ + I! C(s){ (1) H l(u)H(sVu)dI‘21(u)}F(s)d¢(s) or“. We consider the last term in the last equation on two sets [sSu] and [s>u] and simplify to obtain EQ3= I{Im{ésH-l(u){C(u)H(u)—C(s)H(s)+1118HdC}dI‘21(u)}F(s)d¢(s) }dI‘11 + [[mC(s)H(s){(])8H-ldl‘2l}F(s)d(b(s)dI‘ll + ltl°°0(s){£°°dr2,lFldrN Finally, similar but simpler arguments yield so4= E(I{[mflZSSIH—1(Z)F(S)d¢(8)}dI‘11)(IH_1[Z>°ldF21) = 1115311“(nillllsn‘1dfildr2linsidwtsndrl1. By substituting EQl through EQ4 in (A.7) we obtain nwlwz = —/[°°((1)80d1‘21}r~(s)d¢(s)drll (A.8) — [[mC(s){£de‘21}F(s)dt/1(s)dI‘11 co 8 _ — 1) {(1)11 1dr21}r(s)d¢(s)drn. Now substitution of(A.5), (A.6) and (A.8) in (AA) and some algebra yield _ 2 -l -2 4 EV _ tijrndr21 — 41H I‘lldl‘31 + [H dI‘41 — o . 39 We need the following lemma to obtain the almost sure representation of n1/2(o12l - 02). Lemma A.1: Under Assumption 1.1, for continuous F, G, T0. Since H is monotone nonincreasing and H(T)>0, (iii) also follow from Glivenko—Cantelli Theorem. (iv) follows from Theorem 2 of Shorack and Wellner (1986) page 308. To show (v) note that °° ‘ ‘ T nallAn- An}; = "n“! (Fn- mot/7“}; 5 oonst.||n“(Fn- F)||0. Hence (v) is implied by (iv). This completes the proof of the lemma. n Recall that Rn,5 = Dl+ Rn,3+ 2Rn,4° We shall show that (a) nl/ZDl —-+ 0, a.s., (b) nl/ 2R113 —-o 0, a.s., (c) n1/2Rn,4 --7 0, a.s.. Now recall the representation (A2) of (on— (p). To show (a) holds, we shall show that each term has such a pr0perty. Consider the first term, we want to show that N % n1/ 212AH—2(An— A)d(Hn - H) -7 o, a.s.. This follows from Assumption 1.1, Lemma A.1 (ii) and (v) and similar arguments as are used in the proof of Lemma 2 of L0 and Singh (1986). All 40 other terms can be handled similarly. By similar arguments, it can be shown that (b) holds. To show (c) holds note that 11114"! { [ran¢} dI‘1 11, where rn is as in (1.1.5). Therefore it easily follows, from Lemma 1.1, that nl/anA—o 0, a.s.. Thus we have the almost sure representation of :7: and by the CLT (1.1.11) follows. :1 Appendix B Rate of convergence of the mtimator of the asymptotic variance, 02: [A2H—2dfi In this section we shall present the proof of Theorem 1.4. We shall show that for each e>0, and I'1. The following arguments are very similar to that of Gardiner and Susarla (1983) Appendix A, except we are giving a shorter proof using the representation (1.1.5) and Lemma 1.1. Proof of Theorem 1.4: Let us write 7}:- a2: jnK‘2(A§— A2)dfin+ [A2(n2K‘2- H-2)dfin + [AZH’2d(Hn- H) = Tn,l + Tn,2 + Tn,3’ say. First we examine Tn,2' Since K = l + an, H2- n-2K2 = H2- (Hn-t- n-1)2 = -n‘2- 2n’1H - (Hn- H)2— 2n‘1(Hn- H) — 2H(Hn— H) so that on substitution we have 5 2 l|T (13.1) |Tn,2|< n ,2j| To handle the terms in (8.1) we shall use the following result for Binomial moments. 41 Lemma B.l: Let U be a Binomial random variable with parameters (n,p). Then for any It 2 l E(l + U)_k _<_ k! (np)- 1‘. Proof: See Koul, Susarla and van Ryzin (1981), Moment Lemma, Page 1283. 0 Recall that 2 2 -2 —2 —2 -1 2 —1 2 Tn,2=[A (n K H ){-n -2n H—(Hn-H) -2n (Hn-H)-2H(Hn—H)}dH. We want to compute the rth moment of TH 2, for r>1. Consider the TH 21 term. Note that _ -1 2 -2 -2 Tn,2l — EdiA (Zi)H (Zi)K (Zi)' Therefore E| g n‘12: E{5iA21(zi)H‘21(zi)K‘21(zi)}. Tn,2lIr In the following El stands for a conditional expectation given (Z,6), and all ci's are constants may depend only on r. Note that given (Zj,6j), (K(Zj)-1) is the sum of (n—l) Bernoulli random variables with probability of success of H(Zj). Thus it follows, from Lemma B.1, that EIT I2 s cln‘1{E{6A2(Z)H’2’(Z)El(K‘2’(Z))ll 5 c2E{6A21(Z)H-2r(z){nH(Z)}—2r} n,21 is = c2n'211A2’H‘4’dH. Under Assumption 1.1 and for F, GEF, it follows that the last integral is finite. Hence (3.2) ElTnmlr = 0(n'21). Exactly in similar manner we can bound E|Tn 22| and show that (B.3) E|Tn’22|r = 0(n'1). Recall that —1 2 —2 2 -2 2 42 Thus s. .<. c3E{(n2K'2A2H‘2(Hn- H)2)’(Z)-6} = c3E(n216A2’H‘21)E1{(Hn— H)21K’2’}. By the Holder inequality, for p>1 and q = (1 - 1/p)—l, / / 1 1 Hence by Lemma 3.1 and an application of the Marcinkiewicz—Zygmund r Tn,23 ' - lp _ l q E1{(H,,- Hl2’K 2’} s E (K 2% (IHN- leq’). inequality yields N ElTn 23(1’ 5 c4n2’jii‘2‘A2’(nH)'2’(n'q’H)1/qu which implies (13.4) ElTn,23|r = 0(n"). Similarly for 1111,24 = —22(K‘2A2H‘2(Hn- H))(Zi)-6i, and —l — 2 -l T1135 = —(2n )E(n2K 2A H (Hn- H))(Zi)-6i, it can be shown that 2 (3.5) ElTn,24|r = 0(n‘31/ ) and (B6) Errngs)?‘ = 0(n“). From (B.2) through (B.6) we conclude that (13.7) Err”)2r = 0(n"). 8 Now we consider the term TIll = jnzK—2(AI21- A2)dHn. Note that by Cauchy—Schwarz inequality we have 2 2 _ 2 An— A — (An- A) + 2A(An- A) = (firing- F)F1/2d¢v)2 + when. F)d1/)) 5 (lwr’zd‘n- F)2Fd7p)([°°rd¢) + 2A([°°(frn- F)d¢). 43 Therefore 2 —2 °° —2 ‘ 2 2 2 —2 ‘2 “ 2 |Tn,1|$|]n AK ([ F (Fn- F) th/J)dHn|+2|]n AK ([ (Fn— F)d1/))dHnI. Integration by parts on last two integrals yields '2 -22 -2‘ 2 '2 -22 ‘ lTn,1lSc5”({)n AK dHn)F (Fn— F) Fd¢|+2c6| [(6 11 AK dHn)(Fn— F)do| = c5'Tn,11l + 2‘16'Tn,12'1 223" Since Tn,11 and Tn,12 are very similar, we just show that E(TWP’: 0(n"). Note that (rm) 5 E(“){|F'1(Fn— F)|{)'n2AK'2d§n} where E00 is the integral on [0,oo) with respect to PM). Then by the Holder inequality, for r>1, E(“){ | F‘1(Fn— F) | é'nzax‘zdfin} IA (E("lur‘1c2n- F): 5'n2AK'2dfinlrf/21E2h1)l1‘1/ 2 IA . . fl c7E(n){ |F"1(Fn— F) (1] n2’A1K‘21dHn}. 0 Hence 2 -1 “ 2 ' 4 2 -4 2 ElTnjzl 5 c7E$“){|F (Fn- F)| 15o 1A 1K rdHn} where E1“) = E G E(n). Let p_l+ q_1 = 1 and p>l. By the Holder inequality . . fl 0 n) -l‘_ 2rpl/p n) '4r 2r -4r‘3 ql/q 5 {ES IF (Fn FM 1 {El {5n A K dHnl} . We shall show that the second term in the R.H.S. of the last inequality is finite and the first term is of order 0(n-r), r>1. Recall the representation 44 . is (1.1.5) and Lemma 1.1. First we show that E91)” n4rA2rK—4rdHn}q1, 2 ElTn’bl = 0(n_r). Similarly 2 EITnjl' = 0(n-r). Therefore (3.3) Elrnfll) = om"). Finally, note that Tn,3 = [A2H-2d(fi - H) is an average of n iid mean zero bounded random variables. Thus an application of the Marcinkiewicz— Zygmund inequality yields (3.9) ElTngl = 0(n‘1), r>1. Therefore (B.7), (B.8), (3.9) and Markov's inequality imply (1.1.13). 0 REFERENCES [1 Breslow, N. and Crowley, J. (1974). A large sample study of the life ta 1e and product—limit estimator under random censorship. Ann. Statist. 2, 437—453. [)2] Carroll, R. J. (1978). On almost sure expansion for M-estimates. Ann. tatist. 6, 314—318. [3] Cheng, K. F. (1984). On almost sure representation for quantiles of the product-limit estimator with application. Sankhya, Ser. A, 46, 426—443. ’4] Chow, Y. S. and Robbins, H. E. (1965). On the asymptotic theory of fixed width confidence interval for the mean. Ann. Math. Statist. 36, 457—462. [5] Chow, Y. S. and Teicher, H. (1978). Probability Theory: Independence, nterchangeability, Martingales, Springer-Verlag, New York. [6] Chow, Y. S. and Yu, K. F. (1981). The performance of a sequential procedure for the estimation of the mean. Ann. Statist. 9, 189—198. [7] Foldes, A. and Rejtd, L. (1981). A LIL type result for the product-limit estimator. Z. Wahr. Verw. Gebiete 56, 75—86. [8] Gardiner, J. C. (1982). The as ptotic distribution of mortality rates in competing risk analyses. Scand. J. tatist. 9, 31-36. [9] Gardiner, J. C. and Susarla, V. (1983). Sequential estimation of the mean survival time under random censorship. Commun. Statist.- Sequential Anal. 2(3), 201-223. [10] Gardiner, J. C, Susarla, V. and van Ryzin, J. (1985a). Estimation of the median survival time under random censorship. Adaptive Statistical Procedures and Related Topics, 8, 350—364. [11] Gardiner, J. C, Susarla, V. and van Ryzin, J. (1985b). Fixed width confidence interval for the exponential mean survival time under random censorship. J. Statist. Plan. Infer. 12, 153—159. [12] Gill, R. (1983). Lar e sample behaviour of the product—limit estimator on the whole line. Ann. tatist. 11, 49—58. [13] Ghosh, M. and Mukhopadhyay, N. (1979). Sequential point estimation of 216 mean when the distribution is unspecified. Commun. Statist.-Theor. Meth. 8, 637—652. 14] Hoem, J. M. (1976). The statistical theory of demographic rates. and. J. Statist. 3, 169—185. 45 46 [15] Hoem, J. M. (1987). Statistical analysis of a multiplicative model and 1ts application to the standardization of vital rates: A review. Int. Statist. Rev. 55 (2), 119-152. 16] Huber, P. J. (1964). Robust estimation of a location parameter. Ann. ath. Statist. 35, 73—101. [17] Huber, P. J. (1981). Robust Statistics. Wiley & Sons, New York. 18] Jureckova, J and Sen, P. K. (1981). Sequential procedures based on -estimators with discontinuous score functions. J. Statist. Plan.Infer. 5, 253-266. [19] Kaplan, E. L. and Meier, P. 1958). Nonparametric estimation from 1ncomplete observations. J. Amer. tatist. Assoc. 53, 457—481. [20] Koul, H., Susarla, V. and van Ryzin, J. (1981). Regression analysis with randomly right—censored data. Ann. Statist. 9, 1276—1288. L201] Lo, S. H. and Singh, K. (1986). The product—limit estimator and the tstrap (some asymptotic representations). Prob. Theo. Relat. Fields. 71, 455—465. [22] Millar, P.W. (1985). Non—parametric applications of an infinite dimensional convolution theorem. Z. Wahr. Verw. Gebiete 68, 545-556. [723] Reid, N. (1981). Influence functions for censored data. Ann. Statist. 9, 8-92. [24] Robbins, H. 21959). Sequential estimation of the mean of the normal population. Proba ility and Statistics (Harold Cramer Volume). Almqvist and Wiksell, Uppsala, Sweden. 235-245. ]25] Schick, A., Susarla, V. and Koul, H. (1987). Efficient estimation of unctionals with censored data. To appear in Statist. Decision. [26] Sen, P.K. (1980). On nonparametric sequential point estimation of ocation based on general rank order statistics. Sankhya, Ser. A 42, 201-218. [27] Sen, P.K.(198l). Sequential Nonparametrics, Wiley & Sons, New York. [28] Sen, P. K. and Ghosh, M. (1981). Sequential point estimation of estimable parameters based on U-statistics. Sankhya, ser. A 43, 331—344. [29] Serfliag, R. J. (1980 . Approximation Theorems of Mathematical Statistics, iley 8c Sons, ew York. [30] Shorack, GR. and Wellner, J.A. (1986). Empirical Processes with Applications to Statistics. Wiley & Sons, New York [31] Sriram, T. N. (1987). Sequential estimation of the mean of a first—order stationary autoregressive process. Ann. Statist. 15, 1079—1090. ]32] Starr, N. (1966). On the asymptotic efficiency of sequential procedure or estimating the mean. Ann. Math. Statist. 37, 1173-1185. 47 [33] Starr, N. and Woodroofe, M. 1969). Remarks on sequential point estimation. Proc. Nat. Acad. Sci. SA. 63, 285—288. [34] Starr, N. and Woodroofe, M. (1972). Further remarks on sequential point estimation: The exponential case. Ann. Math. Statist. 43, 1147—1154. [35] Susarla, V. and van Ryzin, J. 1980). Large sample theory for an estimator of the mean survival time om censored samples. Ann. Statist. 8, 1002-1016. [36] Wellner,J. A. (1982). Asymptotic Optimality of the product-limit estimator. Ann. Statist. 10, 595—602. 37] Woodroofe, M. (1982 . Nonlinear Renewal Theory in Sequential Analysis, BMS—NSF Regional Con erence Series in Applied Mathematics SIAM.