\, ‘h‘ null p'L;té .In "at. ABSTRACT ASYMPTOTIC NORMALITY OF SIMPLE LINEAR RANDOM RANK STATISTICS UNDER THE ALTERNATIVES BY Janet Tolson Eyster In this paper we study the problem of the asymptotic normality of random signed rank statistics under the alternatives when the score functions, m, are bounded. When the random variables are independent and identically distributed, W is assumed to be square integrable. This extends the work of Koul (1970) and (1972), Sen-Ghosh (1971) and Ghosh-Sen (1972) to m which may be discontinuous. To relax the assumptions of differentiability on m, restrictions are placed on the distribution functions of the random variables similar to those used by DupaE-Héjek (1969). This work is useful in generating bounded length confidence intervals for the regression problems. ASYMPTOTIC NORMALITY OF SIMPLE LINEAR RANDOM RANK STATISTICS UNDER THE ALTERNATIVES BY Janet Tolson Eyster A DISSERTATION Submitted to Michigan State University in partial fulfillment of the requirements for the degree of DOCTOR OF PHILOSOPHY Department of Statistics and Probability 1977 TO George, Mark and Sandy ii ACKNOWLEDGEMENTS I wish to express my sincere thanks to Professor Hira Koul for his guidance and patience during the preparation of this dis- sertation. His advice and encouragement given during my study at Michigan State University are greatly appreciated. I also wish to thank Professors D. Gilliland, R. Phillips and J. Stapleton for reading this thesis and for their helpful comments. Special thanks are due to Mrs. Noralee Burkhardt for her typing of the manuscript and the patience with which she did it. Finally, I wish to express my gratitude to the National Institutes of Health and to the Department of Statistics and Probability, Michigan State University for their financial support during my stay at Michigan State University. iii TABLE OF CONTENTS Chapter Page I INTRODUCTION . . . . . . . . . . . . . . . . . . . 1 II MAIN THEOREMS . . . . . . . . . . . . . . . . . . 6 III APPLICATIONS . . . . . . . . . . . . . . . . . . . 13 IV RANDOM WEIGHTED EMPIRICAL CUMULATIVES . . . . . . 18 V PROOF OF THEOREMS 2.1 AND 2.2 . . . . . . . . . . 36 VI PROOF OF THEOREM 2.3 . . . . . . . . . . . . . . . 42 APPENDIX . . . . . . . . . . . . . . . . . . . . . 45 REFERENCES . . . . . . . . . . . . . . . . . . . . 58 iv CHAPTER I INTRODUCTION We consider the problem of the asymptotic normality of simple linear signed rank statistics S+(¢) under the alternatives based on a random number of observations. They are called random signed rank statistics. Corresponding theorems are also presented for random rank statistics. In particular suppose {Xi}' i :_l, is a sequence of in- dependent random variables with continuous distribution functions {Fi}, i :_1, and {Nr}, r :_1, is a sequence of positive integer valued random variables. All random variables are assumed to be de- fined on the same probability space. Let {Ci}, i :_1, be a sequence of real numbers. We will be investigating the asymptotic distribution of random signed rank and random rank statistics which correspond to a :p and are defined as n + Z c.a (R. )s(X ) and 1 n 1n 1 S+ n(cp) l-‘ :5 Sn(m) El Cian(Rin) where (1.1) s(x) =I(x:0) -I(x:0), -°° o o > 1 - (b - a ) max |c Io , (2.1) and (2.2) imply - a b -— r . b . r r l<1 f0 wdsn(t) . This representation makes it clear that one needs to study the . .. + standardized weighted empirical processes {SN (t), 0 :.t :_l} and r {SN (t), O :_t 5.1} in the cases in which m is bounded. This is r done in Chapter 4. In the remainder of this chapter we will state our main theorems for random signed rank and random rank statistics. In Theorems 2.1 and 2.2 we assume (am v=¢1-% where (Pm is \ bounded on (0,1), m = 1,2. For signed rank statistics we define for 0 :_x < +m and 0 i.t.i l, -l n K (x) = n z I{[x.| < x}, l 1 '— -1 n K (x) = n X [F.(X) - F.(-X)], l 1 1 + _ -l (2.7) Linl(t) — Fi(§n (t)) - Fi(0)’ + _ _ _ -1 Lin2(t) — Fi(0) Fi( 55 (t)). + + + Lin(t) — Linl(t) + Lin2(t)' + + + pin(t) — Linl(t) - Lin2(t)' E+(t) = Zn c.uf (t). and n l 1 1n -+ n + un(¢) — £1 cifm(t)dpin(t). Obviously for any n -1 n + . Z L E . (2 8) n 1 in(t) t Theorem 2.1. Let {an(1)}, {Nr}, {Ci}, m and qh, m = 1,2, sat1sfy + (1.4), (2.1), (2.2) and (2.6). Assume there exist functions Rijk(s), 0 < s < l, l :_i :_j, a §_j :_b , k = 1,2, and sets Er such that r r vd,00 (2.9) lim mh(Er) = O, m = 1,2, r—wo he e E = E E XL_E;_ r rl U r2 (2.10) E = {s; max max max sup rl (t) k=l,2 aréjfibr 121:) It-slidj .5 + —%3 lLijk + + - Lijk(8) - (t-S)£ijk(5)l > n}. —1 .+ .+ (2.11) E = {8; 0 a% max lc.(S) - c (S)| > n}, r2 a r . a r a o ' aJi-cP a r r r implies -1 + -+ (2.15) Ta +(cp)[SN (m) - UN (¢)] + N(0,1) r.v. r r r D Proof: See Chapter 5. For linear rank statistics we define for -w < x < w, o :_t §_1 10 H (x) = n-1 2? I{Xi :_x}, _ -l n H (x) - n 21 Fi(x), _l . (2.16) Lin(t) - Fi(§n (t)), l :_1 :_n and n un(¢) = Zlcifw(t)dLin(t). Theorem 2.2. Assume {an(i)}, {Nr}, {Ci}, m and qh, m = 1,2 satisfy (1.4), (2.1), (2.2) and (2.6) respectively. Assume there exist functions Rij(s), 0 < s < l, l :-i :_j, a :_j i-br' and sets r Dr such that 'v d, 0 < d < w, and 'v n > 0 (2.17) lim (pm(Dr) = o, m = 1,2, M h = w ere Dr Dr1 U Dr2' D = {s; max max sup r1 12|Li.(t) - Li.(s) a n}. (2.18) D = {5; o-lag max Ic.(s) - a (5)] > n} r2 a r . a r a 0 implies r r r -1 Ta ( n} . r k=1,2 a n} and (2.17) r3 . . 1] 1a a 0 implies Ir+(¢)(SN(m) - uN(m)) + N(0,1). The proof follows from Theorem 2.1 and Lemma 3.1 by showing that (3.1) holds. Fix t0 6 (0,1). For every h such that (t - h, t + h) c (0,1) and n there exist en,gn (dependent on h, 0 O O -1 t0 and n), < n < 9n < such that en K (tO h) -l . _ . -1 -l ’ max CinB <<§n (tO + h) - min CinB -.gn. Since En (t) + F* (t) and max Ici I + 0 as n + w, there exists e and g, -m < e < g < m lfijn such that for all n, e :_en < gn‘: g. Since f is positive on (-m,m) and e and g are finite there exists €2>0 dependent on t0 and h such that for all n.: 1 inf kn(t) > e. It-t0| 0 choose -1 _ e > 0-3- 25[f(F* (t0)) - e] l < 5n. Then there exists rl-Bor > rl implies f(F;l(tO)) - 5 IA min min f(§:l(t0) - ci.B) rfjjb ljijr J IA max max f(§:l(to) - Ci'B) rjjjb 151:; 3 f(F:l(tO)) + e. |A Then for r > rl max max [2 + (t) - £.1(t0)| 1 r ir2 0 —- 1 and (3.4) holds for each t0 6 (0,1), ¢(Er3) + 0 and (3.1) holds Remark 6. By Corollary 2.1 the regression problem can be specialized to the two sample Chernoff-Savage problem with random sample size. Pyke-Shorack (1968b) obtain similar results for a broader class of m. CHAPTER 4 RANDOM WEIGHTED EMPIRICAL CUMULATIVES In this chapter we prove the relative compactness of the weighted empirical cumulatives based on independent random variables with improper but finite distribution functions. This extends the results of Shorack (1973) and Koul (1974). This result is used to establish the asymptotic normality of random weighted empirical cumulatives based on signed ranks or ranks by establishing that they have the same asymptotic distribu- tion as non-random weighted empirical cumulatives. The asymptotic normality of the latter is established through approximation by their projection on independent random variables. Define {Yi' 1 §_i :_n} to be a sequence of random vari- ables on [0,1] with improper but finite and continuous distribution functions {6. ,1 < i < n}. Also define 02 = in d? and in - —- d l 1 (4.1) w (t) = o-lznd {I(Y < t) - G (t)} O < t < 1 . d d l i i - in ' —- -— In Proposition 4.1 and the remainder of this work, “f“ will be the sup norm for any function f on [0,1]. Proposition 4.1. Assume {di} satisfy a_condition similar tg_(2.2). Assume for the Gin defined above that n (4.2) 21 Gin(t) - nt 18 l9 i§_a_non-increasing function on_ [0,1]. Then (is > 0 (4.3) lim lim sup p[ sup lwd(t) - wd(s)| 3 a] = o . 6+0 n+00 It-sl:§ Proof: The proof is presented in a lemma and remark. Lemma 4.1. Assume the conditions g£_Proposition 4.1. Then (4.4) Elwd(t) - Wd(s)|4 : k:{3(t - s)2 + n'llt - sl}. Prgof: Without loss of generality assume 0 :_s :_t < 1. Let Ei = I(s 5-Yi :_t) - pi where pi = Gi(t) - Gi(s), i = 1,...,n. Then using the independence of Y1 and the fact that Egi ll 0 t (D have by (4.1) 4 -4 4 Elwd(t) - wd(s)| — 0d Elzi: digil -4 4 4 2 2 2 2 = + 0d E? diEgi 3? § didngiEgj] 1 17‘] -4 2 2 2 4 4 2 2 — 0d [301: diEii) + £1 di(E£:i — 3E (aiHJ -4 2 2 4 2 od [3(2 dipi(1 - pi)) + 2 di(pi(l - pi)(1 - 6pi + 6pi))] i 1 -4 4 2 4 < o [3 max Id.| (X p.) + max Idl Z P.] - d . i . i . i . i 1§}§P i 1§}§P i -4 4 2 — 2 - < o max (d.|n [3(n 12 p.) + n 2 2 p.] - d . i . i . i liiin 1 i 4 _ §_kd[3(t - s)2 + n llt - SI] which is (4.4). The last inequality follows because (4.2) implies 20 Z(G.(t) - G (s)) < nt - ns. 1 i -— This completes the proof of Lemma 4.1. Remark 7. In Koul (1974), Proposition 4.1 was proved with (4.2) re- placed with a stronger condition. Since (4.4), which was basic to the proof in Koul (1974), is also true under the weaker condition, Proposi- tion 4.1 also holds under the weaker condition. We now present an application of Proposition 4.1 which is used in proving Theorem 4.1. Whenever possible we suppress the subscript n used in the notation of Chapter 2 and denote En be K. Define for 0 :_t :_l _ -l n -l + Vn(t) - on 21 ci[I(IXi| :_x (t))s(xi) - ui(t)], _ 5 -1 _ _ -% n -l + Wn(t) — n [Kn(K (t)) t]— n zl{1(|xi| i K (t)) - Li(t)}, (4.5) an(t) 0'12“ c.[I(0 < x. < K'1(t)) - LI (t)) n 1 i - i-— 11 and -1 n —l + Vn2(t) on 21 ci[I(-K (t) fi-Xi :_0) - Li2(t)]. Note that for 0 :_t §_l (4.6) Vn(t) = an(t) - vn (t) 2 and when ci 5 l (4.7) Wn(t) = Vn1(t) + Vn (t) w.p. l. 2 21 Corollary 4.1. Assume {Ci} satisfy (2.2). Then v e > 0 (4.8) lim lim sup pt sup |Vn(t) - Vn(s)| 3_€] = 0 5+0 n It-slfifi and (4.9) lim lim sup P[ sup |Wn(t) - Wn(s)| :_e] = 0 . 6+0 n It-s|:§ Proof: In Proposition 4.1 take Yi E K(Xi)I(0 :_Xi) and d = c Then G (t) = L+ (t) nd f o (2 8) an+ (t) t - i ‘ 1' in ‘ inl a r m ° 1 inl “ ‘ + -2n L. (t). Hence (4.2) is satisfied for these G. '5. Moreover l in2 in Vn1(t) = wd(t). Hence an satisfies (4.3). Similar application of Proposition 4.1 to Yi E K(-Xi)I(Xi < 0) and di 5 ci will yield (4.8) in view of (4.6). Similarly by (4.7), (4.9) follows when di 5 l. The following results which appear in Koul-Staudte (1972b) and Koul (1972) are presented here without proof. Lemma 4.2. The random variable { sup IW (t)I} §§_bounded in_ O 0 r+® ar+ ar implies (4 18) ‘1 T+ 4 o 1 . TN +(v)ON N (v) D N( , ) r.v. r r r The proof of Theorem 4.1 utilizes Lemmas 4.3 through 4.6 and is completed after them. + . Lemma 4.3. Assume {Ci} satisfy (2.2) and Zirk(v), l :_1 :_r, k = 1,2, satisfy (4.10). Then (4.17) implies (4.19) l(\))0r T: (V) '* N(OI1)- D Proof: (4.19) will follow when we show + + (4.20) IT (v) - z (v)! = o (1) r r P -1 + and Tr+OrZr(v) D N(0,1). (4.21) T:(v) = vr(K(K;1(v))) + a; lEu: (K(K; 1(v))) - u (v)]. (4.8) and (4.10) imply 24 -1 _ (4.22) sup IVr(K(Kr (t))) - Vr(t)| _ op(l) . 03:31 Now since IK (K—1(t)) - t] < r-l, we have by (4.5) and (4.9) r r - (4.23) r5[K(K'1(t)) - t] = r5[K(K’1(t)) - K (K"1(t))] + 0(r‘5) r r r r -1 -a _ _ =’wr‘K‘Kr (t))) + 0(r ) - Wr(t) + op(1). Since the sequence file‘} is bounded in probability in the limit a cha- (4.24) P[Br] + l where —1 -% B = {sup|K(K (t)) - tl :_dr } r t r Combining Assumption (4.14) and the above observations yield -1 -+ -1 -+ (4.25) or lur(K(Kr (v))) - ur(v) + rgc:(v)wr(v)l = op(1). Finally, combining (4.21), (4.22) and (4.25), we have shown + - - (Tr(v) - Vr(v) + o 1r56:(v)wr(v)l = 0p(l). Combining (4.22) and (4.13), we note that (4.26) v (v) - o'lr*e+(v)w (v) = Z+(v) r r r r which completes the proof of (4.20). To complete the lemma, write 25 Z+(v) = o'lzr{(c - 6+)[1(o < x < K'1(v)) - L+ (v)] r r l i r - i —- irl .+ -1 + - (ci + cr)[I(-K (v) fi-Xi < 0) - Lir2(v)]} r + leri(v) (say). + Then since Iér(v)| :_max|cil follows from (2.20), we have max (2+.(v)l j'4 maxIc.|0.-l = 0(1). ljifir 1 1 r + Therefore I! e > 0 and r sufficiently large, max (Z .(v)l < e 1 j 1:1 Yi Then ‘v s O 1 V (4.27) p[ max [lgjl -2€] < PU g H > e](l - n )- 1:153) _ l n n where D II max PHICP - cl|> e] . Lemma 4.4. Assume {Nr} and {ci} satisfy (2.1) and (2.2). Then Y e > O 26 (4.28) HVN - va H = op(1). r r (4.29) “WN - wa H = op(l) r r 229. 30 4 * (4. ) .thr“ — op(l) where for any, n, V*(t) = V (K (K-l(t))) - V (t). n n -n n n Finally the sequence (“WN L} i§_bounded in probability 12_ r Ppppf: Unless otherwise specified ar, Nr and br will be denoted by r, N and b. First note that (4.29) and the fact that {hwrh} is bounded in probability imply that the sequence (“WNh] is bounded in probability. Since (4.29) can be proved by the same methodology used to show (4.28), only the proof of (4.28) and (4.30) will be presented here. Define for 1 5.1 :_b -1 -1 + = I . . - . Yi(t) or ciE (lxll 5-52 (t))s(x1) u1r(t)] and V'(t) = 2r Y.(t) r l i To prove (4.28) we will first show that (4.31) “v;q - Vr“ = op(l) . 27 Note {Yi} are independent random variables and Yi E D[O,l]. Then by (4.27), we have . c PLHVfi - Vr“ > zej]§_P[ max “v3 - vr“ > 2g] + P(Ar) riJEP (4.32) I ‘ I I — l < P[“Vb - Vrh > e](l - max P[[[Vb - Vj“ > 5]) + op(1) + O r:1'_<.b as r + m as the following argument shows. For any t and any j, r :_j i'b, we have I I 2 E(|Vb(t) - Vj(t>| ) -2 b 2 -1 + 2 (4.33) — or £j+l ciE[I{|Xi| i-Er (t)}s(Xi) - uir(t)] < o-'(o2 - 02) +>O as r + w b (2 3) - r b r y ' ° Noting that sup [K.(K;1(t)) - tl = 0(1) Oifiil and -1 V'.(t) = 0, 0V.( ‘ J J K.( r J ‘3 K’1(t))).r < j < b. _.r _ _ we have by (4. 8) that (434) PE sup I(vyt) -.v5(t)) - (vys) - vJ!(s))| > e] + o It-s|<6 as r + w and then 6 + 0. Combining (4. 33) and (4. 34) , we have 28 max P[“V' - V!“ > e] + 0 as r + w. . b j rjjjb Therefore (4.32) and consequently (4.31) hold. To complete the proof of (4.28)) we start by combining (4.31) with (4.8) to get (4.35) p[ sup IV&(t) - V§(s)| > e] + o . It-slsS . _ -l , -1 Noting that VN(t) - CrON VN(_K_r(KN (t))), we have (4.36) “VN - vr“ :_| sup OrON 1 -l (v'(K (K (t))) -v'(t))l Qifiil N —r -N N + or0;1“V§ - VI“ + “Vrhlorogl - 1] + . R1 + R2 R3 (say) (4.37) R3 = op(l) by (2.4) and the fact that {“vrn} are bounded in probability. (4.38) R2 = op(l) by (4.31) and (2.4). + — Finally define D = [ sup (K (K 1(t)) - t] < 6] and note from (4.11) r 0 5].: P[ sup OrON |v'(t) - v'(s)| > e] + p(DC ) + 0. lt-S|<5 N N r+ Combining (4.36) through (4.4C» yields (4.28). 29 Finally, the proof of the lemma is complete when we show that (4.30) holds. Since the sequence {thh} is bounded in probability, there exists d such that _ -1 -% cr — {sip IKN(KN (t)) - tl :_dN } and (4.41) P[Cr] + 1 . Then (4.30) follows by combining (4.8), (4.28) and (4.41) lim p{nv;“ > 5} pm §_1im lim (P{ sup IVN(t) - VN(s)| > e} + P[CC]) = 0. 6+0 r+m It-sl<6 r Lemma 4.5. Assume the conditions 9£_Theorem 4.1. Then aE_ v I -1 -* (v) - 0-1 -* (v)| = o (l) (4’42) ON uN a pa P r r r r where for any n ~* -+ -1 -+ = K - v). (4.43) un(v) un(§h( n (v))) un( Proof: Again we will write r, N, and b for ar, N and r + br. We will repeatedly use the fact that since [irk 3.0, i :_r, k = 1,2, (2.20) implies for any r 3_1 - +. - - + (4.44) O lrkué‘: “ < o lrs max Ic.(r lhz Z 2. H < k . r r -— r . i . irk — c liiir i k To prove (4.42) we will first show that -1 5 A4' -1 g, A+ _ (4.45) loN Nr 6N (v)WN (v) - 0a ar ca (v)wa (v)! — op(1) r r r r r r 30 In particular we will show that -1 B + -1 5 + .46 = “ - “ = (4 ) R1 ION N cN(v) or r cr(v)| op(l) which by (4.44) also implies (4.47) lim P[o;lN5|c;(v)| i K < w] = l . r The proof of (4.45) will then be complete by the triangle in— equality, (4.29) and the fact that the sequence {hWa H} is bounded r in probability. We now prove (4.46). By (2.1), (4.15) and (4.44) —1 5 .+ .+ |R1| :_ON N lcN(v) - cr(v)l + op(l) -1 5 + + ‘ _<_orb max Ic.(v)-c(v)|+o(l)=o(1) . a 0 there exists r-.9. if r > rE and It - VI fi_dj— o e r §_j §.b, then we have '1 j + + -l r + lo. 2 c.[u..(t) - u.. v — o z - + J l 1 11 13( )3 r 1 Ci[“ir(t) “ir(“)]l (4.48) -l..+ -1 + < o. c. v t - v - “ _ __l J 1 J( )( ) or rcr(v)(t 6)] + 4ekc . . -1 + + + 430. max Icil max max sup _ lLi'k(t) - L..k(v) 1515; k=1,2 15153 It-vljdj 3 13 ' + - (t - v)£ijk(v)l < o'lje+(v)(t-v) - 6‘1 6+(v)(t )| +- 4th ‘ j j r r r . v c. To proceed let Cr be defined by (4.41). Note on A H C . we have r r for r :.j :_b -1 _ sup (K.(K. (t)) - tl < dj 1’ t J J '— 31 We complete the proof of (4.42) by partitioning 9 based on {Nr}, apply- ing (4.5), (4.23) and (4.48) and recombining. Let 60 > 4kce > 0. Then -1 P[|0N _* _1_* uN(v) - or ur(v)| > 360] < b [I -l'* 'l'* . n . — 'J ] __P[jgr( oj uj(v) - or ur(v)l > 3601;)[cr1 .[Kr — 3 ) + P(AC) + P(Cc) r r b -1.5 .+ -l 5 .+ _= -- PEjgr([loj 3 wj1v)cj(v) - or r wr(v)cr(v)| > eO]n[cr]n [Nr 31)] | A + o (1) P . -1 8 .+ -1 %.+ PL [ION N cN(V)WN(V) - or r cr(v)wr(v)| > 60] n [ArJJ + op(1) I A = 1 b 4.4 . op( ) y ( 5) Lemma 4.6. Assume the conditions of Theorem 4.1. Then -l TN+(v)Tr+(V) + 1 w.p. 1 as r + m. ngpf; As usual we will use r, N and b instead of ar, N and br' Since v is fixed we will simplify notation by not revealing v when possible. We will repeatedly use the observations that (4.17) insures that r;20: is bounded, that (2.8) and (4.11) yield -1 r + + _ -1 _ (4.49) r (21(LiN(v) - Lir(v))l — | (KN (v)) - VI - op(1) K -r and that 32 -1zr + + l u1N uir -1 r + + (4.50) | :_r [21(LiN - Lir)| To show that (4.50) holds assume K;l easily shown that L+ L+ < + + < L+ iN ir —-“iN uir —- ir . . -l -1 Similarly when K < K r - N L+ _ L+ < + + < + ir iN —-“ir “1N —- iN Therefore r + + r + + r 2 . - . < Z L - L, = L lluiN “irl -' 1l ir 1Nl lzl( Now by (2.1) and (4.16) (451) 2()’2()—1+): 11+ (1) . TN+ v Tr+ v — m=1 m op where -2 r 2 + + R = L _ L 1 Tr+ £1Ci( iN ir) -2 r 2 + 2 + 2 R = - 2 Tr+ Zlci((uir) (uiN) ) -2r2+ + R: C _ _ 3 Tr+ 21E N+LiN(l LiN) -2r A++ + = l- _" R4 2Tr+ l 1 ruir( Lir) C -2 N 2 + + 2 P: _ 5 ”2+ Zr+1EC ( iN ( iN) ) + + + + - .é - L . 2C NuiN(l iN)] = o (l). P :_K; , then it is iN' + L. 0 II + + . — L. )|. 1r 1N 2 + + L .. r+ ir(l Lir)J + + 1 - u N( LiN)] 2 + + . L _ N+ iN(l LiN) 33 In View of (4.49) and (4.50) (4.52) IR1I :_T;i mix CIIZI(L:N - L:r)I = op(l) and (4.53) IRZI 5,1}: max six: “Ir + “INIIuir - “:NI = o (1) Next |R3I :1; ((22+)?- (6:)2I|z§L:N(1 — L:N)| + Tri(e 6+ )2I21L+ LiN(l - L:N) - Z:L1r(l - Lir)l = IR31I+ IR32I (say). In view of (4.15), (4.44) and (4.49) IR31I _ 1;: riggb Ioj - C:IIC;+¢:II21L+1NI = op(l) and IR32l :_3Tri(e:)2I21(L:N - Lir)l = °p(l)° Therefore (4.54) IR3I = o (1) Similar techniques yield (4.55) IR = o (1). P Finally in view of (4.47) 34 — + + 22“ (a? + (e )2 + 2c.|e I) + o (1) N 1 N p (4.56) IRSI :_rr+ r+1 1 T—2(N — r)[ max Icgl + (6+)2 + 2Ié+I max Ic.I] r+ . N . i 1:1_<_b 1:130 IA +0 (1) =0 (l). p P Combining (4.51) through (4.56) completes the proof of the lemma. Proof 9£_Theorem 4.1. As usual we will use r, N and b instead of a , N and b . r r r We prove (4.18) by using the decomposition + —1 -1-* T (v) = V (K(K (v))) + 0 u (v) n n n n n -‘k where un(v) is defined by (4.43). Note that from Lemmas 4.4 and 4.5 we have + + * * (4.57) ITN(V) - Tr(v)I :_IVN(v) + VN(v) - Vr(v) - Vr(v)I _ -* _ _* + I0 lu (v) - 0 In (v)I = o (l) . . r r p N N Combining (2.4) and (4.57) with the results of Lemmas 4.3 and 4.6 completes the proof of (4.18). 35 For rank statistics we present Theorem 4.2 which is the analog of Theorem 4.1. The proof of Theorem 4.2 is similar to the proof of Theorem 4.1 with appropriate changes in notation. Define -1 n -1 Tn(v) on 2:1ci{1(xi i Hn (0)) - Lin(\))} and T2(v) z“[(c. - e (v))2(1 - L.(v))L.(v)]. n l i n 1 i Theorem 4.2. Assume {an(i)}) {Nr} apd_ {Ci} satisfy (1.4), (2.1) and (2.2) and there exist numbers £ij(v), l :_i :_j, ar §_j fi-br such that, ap_§_fixed point v, for any d, 0 < d < w max sup _5ILin(t) - Lin(v) - (t - v)£.n(v)I = o(n“%)’ lfiifin It-indn l and -1 A A '- o aL1 max Ic.(v) - c (v)I = o(a 5) as r + w . a r . a r —— r a 0 implies r+°° ar T;1(V)UN TN (v) D N(0,1) r.v. r r r CHAPTER-5 PROOF OF THEOREMS 2.1 AND 2.2 Proof of Theorem 2.1. As before we will use r, N and b to represent ar, Nr and br' BY (2.6) we may assume o is \IJ- Define for any n + -1 + -+ \ - (5.1) Tn(m) = on [sn(¢) - un(¢)]_. Since the assumptions of Theorem 2.1 are stronger than the assumptions of Proposition A.1, we have by (A.28) and Remark 9 -1 + rr+(v)orTr(m) B N(0,1) . Thus to prove (2.15) it suffices, in view of (2.14), to prove + + (5.2) ITN(@) - Tr(o)I — op(l) . + To prove (5.2) note that for all r.: l, Tr(o) = fr/r+l + - 0 o(t)dT (r 1(r + l)t) w.p. l where T+(t) is defined by (4.12). + Since T (0) = 0 and m is bounded and \L, upon integration by parts one has 36 37 + e -f:/r+1T:(r-l(r + l)t)de(t) + T:(1)6(1) (5.3) 1 + . + —fo T (t)dm(t) + Tr(1)¢(1) + op(1) - The latter follows from the boundedness of m and the fact that - + sup IT:(r 1(r + l)t) - Tr(t)I = op(l). Similar results hold for T;(o) by (4.57). Now by (4.28), the boundedness of m, and the fact that + for any n, Tn(l) = Vn(l)' we have + + 6(1)ITr(1) - TN(1)I — op(l). Thus to prove (5.2) it remains to show 1 + + - T = . IronN r)dcpl op(1) * .* Defining for any n, Vn and “n by (4.30) and (4.43) respectively and using the decomposition presented at (4.57) we have 1 + + 1 * * IfO(TN - Tr)deI : fOIVN+(t) + VN(t) - Vr+(t) - vr(t)|dm(t) 1 -1-* —1-* (5.4) + IIO{6N uN(t) - or ur(t)}do(t)I = R1 + R2 (say). (5.5) IRiI = op(l) by (4.28), (4.30) and the boundedness of m. Next we proceed to prove R is op(l). Since (4.44) holds for any j 3_1 2 38 -l *2..+ P[II0N N CNII _<_ kc] PE u ([N = j] 0 [IIOleIc‘szI : x 1)] = 1. j=l 1 J c (5.6) For any n, 0 < n < w, r sufficiently large and Er defined by (2.9) we have by (2.4) and (4.44) -1 + - + sup ION N8&N(t) - Orlr%&r(t)l tEE r - + + < sup 0 lNIIE‘: (t) - & (t)I + o (l) —- c N N r p t€E r -1 5 .+ .+ (5.7) :_0 b max sup Ic (t) - cr(t)I + op(l) §_2n + op(l). r 0, n > 0 (a function of e), we have for Er l -l-* -l-* IIOwN uN Or urhml 5 9 f { '1‘* ’1‘*}d + If { '1'* "1" d < — - ( . ) __I Er ON uN or ur wI EC 0N UN or ur} ml r = + . R21 R22 .(say) We first show R21 = op(l). For any j, r :_j :_b and t 6 (0,1) we have on Gr 39 Ifif(t)| J j + 1 _ + _ + -l + Izlci(Lijl(Kj(KJ (t))) Lijl(t) Lij2(§j(Kj (t))) + Lij2(t))l I A Izic.(LI.(K.(K71(t))) - LI.(t))I 1 11 -3 J 11 I A I + -1 + max Ic.I|£3(L..(K.(K. (t))) — L..(t))| lfiijj i l ij —j j 1] | A d max Icins by (2.8) and (4.41). lfifiij Consequently on Gr we have R21 :_0-1 max Ic.I(db11 + dr%)m(E ) ljiib r :_3dkcm(Er), Therefore by (2.9) and (5.8) (5.10) R21 = 0p(l). Next we show R = o (1). On G n [N = j] 22 p r r -1.5 .+ -1 t .+ , IRZZI : sup I0. 3 Wj(t)cj(t) - or r wr(t)cr(tflno“ + 4nkchm“ tEE r -1 5 .+ -1 k .+ _ i.izg l0N N WN(t)CN(t) - or r wr(t)cr(tnnmu + as . r The first ineqUality follows from (2.10) by an argument similar to the proof of (4.48). Therefore since “W“ is bounded we have on G . - r 4O -1 H I :_sup IO N 22 tEE: N .+ -1 5 .+ . WN(t)cN(t) - or r wr(t)cr(t)I“mh + 56 o;1N*ne;nan - w.“ )4“ + 4. I A -1 s + + + o N. IIW IIsup If: (t) " f: (t) I II‘PII N r tEE: N r = o (l). P - + The first term on the right side is op(l) since oNlNkhéN“ is bounded w.p. l and “W - W H = o (l). The second tenm is o (l) N r p P because “WI“ is bounded in probability in the limit and + sup I&;(t) - ér(t)I < n for n arbitrarily small. Consequently c t E 6 r by (5.9) (5.11) R22 = op(l). Combining (5.5) and (5.9) through (5.11) completes the proof 1 + + Of T - T - = If0( N r)d4>I Op(l) and the theorem Proof 9£_Theorem 2.2. With appropriate changes in notation the proof of Theorem 2.2 is similar to that of Theorem 2.1. The details are left to the reader. Proof p£_Corollary 2.1. First note that {ci} defined by (2.21) satisfy conditions similar to (2.2) and (2.3) since -% O b r r Since the proof of Corollary 2.1 parallels the proof of Theorem 2.1 5 . (a/n) =1 +o(l) ando/o =a'/b'+o(l)=l+o(l). r r p a r r P P only the modifications will be discussed. Each place (2.2) or (2.3) are used in the proof of Theorem 2.1 the conditions above may be used. Similarly Corollary 4.1 and Lemma 4.6 hold as before for these {Ci}° Therefore (5.5) holds. Finally (5.6) and (5.7) hold since for all r 41 -B -l H. 5 -1 No l 0 r . r arcrI _<_ (ar/nr) r IIzliirII : 1 + op(1) . The remainder of the proof follows without change. Similar arguments show that Theorem 2.2 also holds for these CHAPTER 6 PROOF OF THEOREM 2.3 This proof is developed in Lemmas 6.1 through 6.3. Koul-Staudte (1972a) presents the asymptotic normality of the nonrandom linear sign-rank statistics. The specialization of their Theorem 2.2 to {Xi} which are iid with symmetric distribution F is presented in Lemma 6.1. Lemmas 6.2 and 6.3 verify the meth- odology developed by Anscombe (1952), thereby completing the proof. Lemma 6.1. Let Xl"°"Xn b§_iid random variables with a_continuous symmetric distribution function P. Let o(t) = ¢l(t) - ¢2(t), t E (0,1), where Ii are nondecreasing, square integrable. Let an(i) and {Ci} satisfy (1.5) 229 (2.5) and A2 = I; 42(t)dt. Then - + + A10 nl(Sn - ESn) + N(0,1). Lemma 6.2. Define {an(i)} py (1.5). Let Fn pp the o-field generated + , + , _ py {(s(xi), Rin)' l :_1 :_n}. Then {sn'Fn} i§_§_martingale. Proof: Using the density of the ith order statistic of a set of independent observations each distributed uniformly on (0,1) one can show that l '1 )) + (n - i+1)(n+1)' E(q(u i(n+l) E(W(U i)) = E(w(Uni)). n+1 i+1 n+1 Therefore i) . (6.1) an(i) = [i/(n+l)]an+ l(1+1) + [(n-i+l)/(n+1)]a n+1( 42 43 . . . + Since F is symmetric, {s(xi), l :_1 :_n} and {Rni' + l :_i :_n} are independent. Therefore given {(s(Xi),Rni), l i'i :_n} (6.2) s(Xn+l) = :_l with probability 5 and 6 3 R+ — R+ + 1 with b b'l't R+ /(n+l) ( ° ) n+1i _ ni pro a l l y ni — R+ with b bil't ( +1 R+ )/(n+l) — ni pro a 1 y n ni . Combining (6.1) and (6.3) we have for i = 1,...,n + + + EEan+l(Rn+1i)an:I = (Rni/(n+l))an+l(Rni+l) (6.4) + + + + ((n+l - Rni)/(n+l))an+l(Rni) - an(Rni). Now by (6.2) and (6.4) + _ n+1a + EESn+lanJ ‘ ED:1 biS(Xi)an+1(Rn+li)anJ = c E[s(x )lF JEEa (R+ )IF 1 + ch s(X )EEa (R+ )IF 1 n+1 n+1 n n+1 n+li n l i i n+1 n+1i n n + + — Zlcis(Xi)an(Rni) - Sn . + Therefore {Sn'Fn} is a martingale. This completes the proof. Lemma 6.3. Assume the conditions of Theorem 2.3. For any n let + . Tn = Sn/Aon. Then.‘v e > O and n > 0, there ex1sts 6 > 0 such that as r + w PE max_ IT‘ - T. | > n] < e . lr-j|:§r r 3 Proof: Denote r - [6r] by 2 and r + [6r] by u where [r] is the largest integer smaller than r. Now since 5 can be arbitrarily small, we have by (2.3) 44 (6.5) max IO 2(02. - O )I :_Io-2(o - 02)] + O as r + m . lj -r|<6r j u + Since S = A0 T r r -1 -1 -1 + + (6.6) ITr - Tj | i [(0]. - arm]. llTrl + A 03. sr - sj . Combining (6.5) and the fact that Tr is bounded in probability in the limit, the first term on the right hand side is op(l). - + + Define 12 = r l Zr a2(i). Note that ES = 0, V(S ) = r 1 r r r 12 fire? = 0212 and 32 = A2 + o(l). Since 0-202 + 1 as r + m, r 1 1 r l u (6.6) implies for a suitably chosen 6 and n1 < 5An -l + + P[ sup ITr - T, I > n] :_P[ sup 0. ISr - S_ I > n1] + o (1) lr‘j |<6r ] Ir- jl<6r J 3 p -2 +2 +2 -2 2 2 2 2 < - + < — < . __(nlog) E(Su Sfi ) op(1) —-(n10£) [cu]u ORJRJ e The third inequality follows by the Kolmogorov inequality for martin- gales (Loeve, 1963, pg. 386). This completes the proof of Anscombe's condition and therefore the proof of Theorem 2.3. APPENDIX APPENDIX Proposition A.l presents the asymptotic normality for signed rank statistics for :p satisfying Hoeffding's condition. These p may be discontinuous. This proposition retains a simpler centering constant similar to the centering constants used for rank statistics presented in Dupac (1969), Hoeffding (1973) and Koul (1974). However, it eliminates the condition D—H (2.12) which DupaE-Héjek (1969) and Dupac (1969) used and relaxes the boundedness condition which Koul (1974) used. No conditions are required on the alternatives for the absolutely continuous part of :p as was shown by Héjek (1968) for rank statistics and Huskova (1970) for signed rank statistics. To prove Proposition A.l we first present a lemma for signed rank statistics. This lemma is an analog of a similar lemma for rank statistics in Hoeffding (1973). Define l5 1: (A.l) 31?) e fit 41 - t) dm(t) Lemma A.l. 2:; m i§_non-decreasing, then (A.2) Z$|EP(R:/(n+l))s(xi) — fép(t)du:(t)| :_8n5Jcp) . Proof: For x 6 I-m,w) define 45 46 wi(x) = I(Ixil > lxl) + nKn(|x|). Then (A.3) Etp(R:/(n+1))||xi| = x} = Eh(wi 0) - f.p(t)dLin1(t) + + (A.4) - (Ep(Ri/(n+l))I(Xi < 0) -.flp(t)dLin2(t))| |A f2Lp)/) — w(t)|dL:n(t) Since wi(x) :_nKn(x) + l and wi(x) §_n, we have (A.5) @(wi(x)/(n+1)) _<_9(nKn(x)) 19(wi(x)) where g(i) = min{;p((i+1)/(n+l)), '9 (n/(n+1))}, o i i 1 n. Now by (A.4) (A.6) ZilEp (RI/(n+l))S(Xi) - fép(t)du:(t)| g_z$ fELp - g0. lim;p(E ) = 0 where n+w n E { max sup n5IL+ (t) L+ (s) = s; max _ . - . n k=1,2 ljiin It-slidn 5 Ink Ink 4. —(t—s)£. (t)I > n}. . ink , + Define TnCP) .by (5.1). + + A.10 Th T - - Z ‘ = l . ( ) en I n((0) n(WI op() Proof: As in (5.3) we have +. _ _ 1 + + _ Tn(~P) - f0 Tn(t)dp(t) + Tn(1)p(1) + 0p(l) + where Tn(t) is defined by (4.12). By integration by parts + + -1 n + Zn(p) — -f anp + on {zlci[s(xi) - ui(1)]kp(1) + where Zn(t) is defined by (4-13) on (0,1). Then T+cp) - z+cp) = fé(z+(t) - T+(t))dp(t) + op(1) . -* Define un(t) by (4.43). By the decompositions (4.21) and (4.26) T+Cp) - Z+Cp) f[-Vh(K(K;1(t))) + vn(t)]dh O) and C. = p 1 1 1 - 1 2| CiI(Ci < 0). Also define Bn and d by (4.24). Observe on Bn that 1|K(K;1(t)) - ql :dn-11 and 0:11 5:6)” ° - n + -l + unfilzlciminmmn (t)) - uinm)” -1 I I A [zicftLT (K(K-l(t))) - LT (t)1| 1 1n n 1n + - n — + -1 + onH121c1[Lin(K(Kn (t))) - Lin(t)1| -l 5 + 20 max [c.ldn . n . 1 1<1 s, L, (t) -_L. (s) > O, k = 1,2, i < n. Therefore on ink ink - - B n - + IR l < f c 1[2 max Ic.ldn% + dngé (t)]dp + f ndp 2 —' n . 1 n c En 1§}EP En The first term on the right hand side is o(l) since lim:p(E ) = O. n+00 n Since :p is bounded the second term can be made arbitrarily small by 50 Finally since P(Bn) + l as n + m making n arbitrarily small. IR = o (l) . P (A.14) 2| Combining (A.11) through (A.14) yields (A.10). Lemma A.3. Let {an(1)} and {ci} sat1sfy (1.4) and (2.2). Let @ EEE\L , absolutely continuous and satisfy the Hoeffding's condition pr(t)l(t(l - t))_%dt < m , (A.15) Then E(T;kp) - zch))2 + O as_ n + w . Proof: By Lemma 1 of Hoeffding (1973),

- u (ym)| = R21 + R22 + R23 (say). For m = 22,23 (A.20) R m n + + Izlci{Eym(Ri/(n+1))s O, a is selected so that l6kca < 6. Now K is fixed. Then for n 3_(2K/e)2, we have 52 + + IES (:p) - u (WI/o < 5. Therefore (A.23) |R2| + o as n + m . Combining (A.17), (A.18) and (A.23) yields (A.15). This completes the proof of Lemma A.3. In the following propositions :p is assumed to satisfy Hoeffding's condition on (0,1). To simplify the notation we will assume that .p is non-decreasing on (0,1). This can be done without loss of generality since :p may be written :p =cpl -cp2 where '91 and :p2 are N, and satisfying Hoeffding's condition on (0,1). Decompose :p into (A.24) m = ¢ + é where (D1 and <1>2 are ‘2 and satisfy Hoeffding's condition on (0,1); ¢1 is the absolutely continuous part of m and satisfies fcd¢1 = 0; ¢2 is the singular part of' m and satisfies B fdd>2 = O; and BC is acp-measurable set containing the singular set E of' p. Proposition A.l. Let {an(i)} and {Ci} satisfy (1.4) and (2.2) and :p be \L' and satisfy Hoeffding's condition (A.25) félm(t)|(t(l - t))-%dt < m . Let :p bg_decompgsed into ¢m' m = 1,2. Assume that there exist: measurable functions 53 }, i = 1,...,n, n :_1, k = 1,2, on (0,1) such that ‘V d, 0 < d < w, and. V n > O and k = 1,2, , 5 + + (A.26) 11m ¢ {5; max sup _ n ILink(t) - Link(s) n ljifin It-slidn + - (t-sm. (s)| >n}=0. 1nk Define + -l n + + c A = — C cn(t) n Zlci(£inl(t) lin2(t)), t._ B and .+ - +' c (t) = l nc.L. (t), t e B n l 1 1 +I when the derivative Li (t) exists. Then (A 27) 1im inf 12 . )0-2 > o ' n+(P n n implies -l + -+ T . - - . (A.28) n+(p)[Sn(p) un(p)] 3 N(0,1) r.v. 4.. Remark 8. Since the Lni are absolutely continuous and (A.26) + - holds, én(t) is well defined almost everywhere with respect to .9. Consequently Tn+«p) is well defined. Also (2.8) implies _ I n lanf (t) = 1. l 1 Proof of Proposition A.l. Given a > 0, there exists a decomposition 54 (A.29) w = ¢1 + $1 + $2 where 2(b) O- igign n liiin Therefore by (A.27), the Lindeberg-Feller theorem yields that + T itp)onzntp) 'is asymptotically N(0,1). Then by (A.36) for' m bounded (A 37) '1 ) 'T+- ) ' a t t' all N(O 1) . Tn+(p on (p ‘15 symp 0 1c y , . If :p is not bounded but is \L_ and satisfies Hoeffding's condition, decompose cp into (9 =:pl +;p2 where cpl is bounded and f4):(t)dt < 8. Then by (A.33) (A.38) E(Z+(CP) - Z+(;p1))2 = Var(Z+(:p2)) _<_Kk:E and 2 r (A.39) ((Tn+(spl)/Tn+(:p)) - l) _<_LTn+(1P2)/Tn+(‘.p)]2 _<_ KR: ern ftp) . Therefore for large n if e is sufficiently small Tn+Cp1)/Tn+cP) will be as close to l as we want. Therefore combining (A.38) and (A.39) yields (A.40) lz Cp)i 1‘?) - 2 (pl )r; l(pl)| = op(l) Since :pl 'is bounded (A.2?) and (A.40) imply T;:¢p)cnz+¢p) is asymptotically N(0,1). Then by (A.36), (A.3?) holds for :9 not bounded which completes the proof of (A.30). Remark 9 . If m =.pl ~cp2 where W1 and $2 are\L and satisfy Hoeffding's condition, the same proof whEn applied to T+(wl) and + . . . T (m2) will prove Prop031tion A.l for W which satisfy Hoeffding's condition. 57 Proposition A.2. Let {an(i)} and {Ci} satisfy (1.4) and (2.2) and cp b_e \ and satisfy Hoeffding's condition (A.25). Let :p be de- composed into ¢k' k = 1,2, defined_by (A.24). Assume that there exist measurable functions {Kin}, i = 1,...,n, n.: l, on (0,1) such that vd,00, 5 11m ¢ {8; max sup n IL. (t) - L. (s) n+m 2 lfiifin It-s :dn 5 in in - (t-s)Rin(s)I > n} = O , Define 6(t) = n'lznc IL (t) t6 BC 1 i in ’ and x -1 n C(t) =n >3 c.L!(t), t6 B 1 1 1 when the derivative Li(t) exists. . 2 Define pep) and TnCp) by (2.16) and (2.19) respectively. Then 2 .. lim inf T (p)o 2 > O n n n implies T—ltp)[8 Cp) - U(?)] + N(O l) r.v. n n D ' The proof is similar to Proposition A.l with appropriate changes and is not presented here. REFERENCES I II l i: I II III III I I l I: REFERENCES Anscombe, P.J. (1952). Large sample theory of sequential estimation. Proc. Cambridge Philos. Soc. fig 600-607. Braun, H.I. (1976). Weak convergence of sequential linear rank statistics. Ann. Statist. 3 554-575. Dupaé, V. (1969). A contribution to the asymptotic normality of simple linear rank statistics. Nonparametric Techniques in Statistical Inference, ed. M.L. Puri, Cambridge University Press 75-88. Dupac, V. and Hajek, J. (1969). Asymptotic normality of simple linear rank statistics under alternatives II. Ann. Math. Statist. 49 1992-2017. Fernandez, P.J. (1970). A weak convergence theorem for random sums of independent random variables. Ann. Math. Statist. 21 710-712. Ghosh, M. and Sen, P. (1972). On bounded length confidence intervals for the regression coefficient based on a class of rank statistics. Sankhya: The Indian Journal Q: Statistics, Series A. 34 33-52. Hajek, J. (1968). Asymptotic normality of simple linear rank statistics under alternatives. Ann. Math. Statist. 22 325-346. Hoeffding, W. (1973). On the centering of a simple linear rank statistics. Ann. Statist. 1 54-66. Huskova: M. (1970). Asymptotic distribution of simple linear rank statistics for testing symmetry. g, Wahrscheinlichkeitstheorie verw. Geb. 14 308—322. Koul, H. (1970). Asymptotic normality of random rank statistics. Ann. Math. Statist. 41 2144-2149. Koul, H. (1972) Some asymptotic results on random rank statistics. Ann. Math. Statist. 43 842-859. Koul, H. (1974). Asymptotic normality under alternatives of linear rank statistics. RM-316, Department of Statistics and Probability, Michigan State University. 584 59 Koul, H. and Staudte, R. (1972a). Asymptotic normality of signed rank statistics. E, Wahrscheinlichkeitstheorie verw. Geb. 23 293-300. Koul, H. and Staudte, R. (1972b) Weak convergence of weighted empirical cumulatives based on ranks. Ann. Math. Statist. 33 832-841. Loeve, M. (1963). Probability Theory (3rd ed.). Van Nostrand, Princeton. Pyke, R. and Shorack, G. (1968a). Weak convergence of a two sample empirical process and a new approach to Chernoff-Savage theorems. Ann. Math. Statist. 32 755-771. Pyke, R. and Shorack, G. (1968b). Weak convergence and a Chernoff- Savage theorem for random sample size. Ann. Math. Statist. 32 1675—1685. Sen, P. and Ghosh, M. (1971). On bounded length sequential confidence intervals based on one-sample rank order statistics. Ann. Math. Statist. 23 189-203. Sen, P. and Ghosh, M. (1972). On strong convergence of regression rank statistics. Sankhya: The Indian Journal of Statistics, Series A, 34 335-348. Shorack, G. (1973). Convergence of reduced empirical and quantile processes with applications to functions of order statistics in the non-iid case. Ann. Statist. 1 146-152. “IIIIIIIIIIIIIIIIIIIIIII“