4... KI; Elfin" 1 3.1!... I . . 1.331.. 7‘. . .93...“ LrNfl. . ti. 4 . kph-xv. HQfiWNr ..A .3 WI“ P— K “1.9.1,“... r1..... ”WWW/1. .Ytt...’ “fr/...: ”whim 4/ wmfivn. . It. O. 1 .f . , 59 £1.39...» . . .... .AW . .V :5... 1. v. I I,” rub... 3.41113... 15.“.de VI“... 11...er I ENI Mfr ”rang. z}. .. huw.w,w...1...wwW WW. 1 11‘ 4a... . War. “arr. 1.»st Farm.” ruin». W... .pm. WW. ..Wtadcfivwflb #5. .I. ,.w..r.,am, yd». I . . .JMW.1W.A.1 -... c I I t . . ._ . .I.T".'a ’ .Ufllqt‘fl’m .w.f.. yr r155; 4.1%.“. I ERffI‘IINGIIERF 4 ...». . Z . ...",rzkua: n... III I 3 III N {AELB‘I I0? I! II“: III III-I” II :I 1; III 7 .. II e, ‘1' III I; , _ II 1% .1.. l. .. I . I, I... .3 . ; .F. ”maul-1.11. 3‘2... >3 2... ~..£......&.a.:...1 ..x.,...~...:d.fi?».§V, . C ... n . . I ...: . 151i. . Iv.) . .. THcSII LIBRARY Mfighigan State University V‘— ’1! This is to certify that the thesis entitled ON Ull‘ll-AL FIELDS FOR ClFFER'LJNTlAL GANLS presented by John Walter Wingate has been accepted towards fulfillment of the requirements for H133. degree in Electrical Engineering Major professor Date W 0-169 ABSTRACT ON OPTIMAL FIELDS FOR DIFFERENTIAL GAMES by John Walter Wingate The study of differential games is the study of game theory as applied to processes of the type considered in Optimal control theory. Almost all differential games studied have been two-person zero-sum games” This is due partly to limitations in general game theory and partly to the type of differential games most often StUdiGd--pUISth- evasion games“ The process in a differential game is modeled by vector differential equations CLICL CT‘ X = f (t, x, u, v) all where the independent variable t is called the time, x the state, and u and v are called control variables“ These variables, u and v, are chosen by two opposing players, one of whom wishes to maximize and the other to minimize a functional J = K (t1, Xét ‘ + L (t, Xitl, uit), v(t)ldt (2) l)' which depends on a solution to (l) on a time interval t0 3 t 5 tlo The initial point (to, xttoll is in a region B in tx-space, while the terminal point (t xttl)) belongs to l, John Walter Wingate a set T_which may be taken to be part of the boundary of £0 Functions U(t,x) and V(t,x) which give choices of the control variables u and v to use at each point of the region §_are called strategies“ Given a pair of strategies and an initial point in E, the payoff (2) is determineda In an optimal field one assumes that there exist strategies U and V optimal in some sense (this sense being specified for a particular type of optimal fieldl and a value function w, also defined on E, The value function is closely related to the payoff functional (2)“ The optimal strategies in an optimal field are taken to be piecewise continuous and have piecewise continuous first partial derivatives“ The value function is assumed to be continuous and have piecewise continuous first partial derivatives. Two types of optimal fields are considered: one of which requires the value function to satisfy a saddle-point condition, and the other of which requires the value function to satisfy a maximin (or, alternatively, a minimax) condition, The saddle-point condition is the more stringent requirement” It corresponds to a solution to the differential game in pure strategies (that is, those chosen directly by the player, without the assistance of a random device). The maximin or minimax optimal fields are applicable to differential games which do not have solutions of this type. John Walter Wingate The results obtained are extensions of optimal field and Hamilton—Jacobi theory for optimal control problemso The extension is to fields defined by saddle-point con- ditions and to fields defined by maximin conditions, Several useful discontinuity conditions, distinguished by the behavior of the optimal trajectories in the neighbor- hood of the discontinuity, are also obtained. ON OPTIMAL FIELDS FOR DIFFERENTIAL GAMES By John Walter Wingate A THESIS Submittedwto Michigan State University in partial fulfillment of the requirements for the degree of DOCTOR OF PHILOSOPHY Department of Electrical Engineering 1968 3% ACKNOWLEDGMENTS The author wishes to thank the members of the guidance committee, Dr. R. C. Dubee, Dr. T. Guinn, Dr. H. G. Hedges, Dr° H. E. Koenig, and Dr. P. K. Wong for their advice, aid and understanding during the time of his doctoral candidacy. He wishes to express particular thanks to Dr. Theodore Guinn for the able direction he gave in carrying out the research for this thesis. The author also wishes to thank the National Science Foundation for fellowship support and the Depart- ment of Electrical Engineering for assistantship support while he was a graduate student. ii TABLE OF CONTENTS ACKNOWLEDGMENTS . LIST OF FIGURES . . . . Chapter I. INTRODUCTION . . . 1.1 Game Theory, Optimal Control Theory, and Differential Games . . . . Auxiliary Theorems . . . . 1.2 II. DEFINITIONS AND SOLUTION CONCEPTS FOR DIFFERENTIAL GAMES 2.2 2.3 2.1 Definition of a Differential Game Solution Concepts . . . . Examples . . . O O O 0 0 III. OPTIMAL FIELDS FOR SADDLE POINTS 3.1 Optimal Fields in Control Problems 3.2 Optimal Field for a Saddle Point IV. MAXIMIN FIELDS V. TRANSVERSALITY AND DISCONTINUITY CONDITIONS 5.1 Transversality Conditions . . Manifolds of Discontinuity . . 5.2 VI. CONCLUSIONS . . 6.1 Conclusions . . 6.2 Further Research BIBLIOGRAPHY V . . O O 0 0 iii Page ii iv NH 28 28 AA 50 59 59 67 90 110 110 11“ 125 125 126 128 LIST OF FIGURES Figure Page 2.3.1 . . . . . . . . . . . . . 52 203.2 o o o o o o o o o o o o o 52 2.3.3 . . . . . . . . . . . . . 52 iV' I. INTRODUCTION 1.1 Game Theory, Optimal Control Theory, and Differential Games The theory of differential games brings together two originally separate branches of applied mathematics--the theory of games and optimal control theory. It draws on ideas and concepts from both of these fields. Game theory had its origins in the work of von Neumann who wrote a pioneering article in 1928 [30]. During the Second World War he wrote, in collaboration with Morgenstern, the classic book on the subject, Theory of Games and Economic Behavior [31].. Almost all later workers take as a basis the theory developed by von Neumann and Morgenstern. A game is a situation in which several persons make decisions. The essential feature of a game is that these decisions must be made on the basis of conflicting interests. (One could consider a game in which the decision makers have completely parallel interests as a degenerate case.) A game must have well-defined outcomes which depend on the decisions made and perhaps on chance factors. Each of the players (the decision-makers) evaluates these outcomes according to some criterion. In general, the players will not agree in their evaluations; this is the source of conflict. Normally the evaluation assigns a real number to each outcome of the game. If a player prefers outcomes with higher numerical values, the evaluation is called his payoff; if he prefers lower numerical values, the term used is 3953. Games are usually presented in terms of payoffs rather than in terms of costs. The decisions made by the players, and perhaps chance occurrences, determine the outcome of the game, and hence the payoffs. No one player controls the game completely. In general his payoff is as much deter- mined by the actions of the other players as by his own. Game theory addresses itself to the problem of prescribing, in some fashion, rational behavior under those circum- stances. Most games are defined by a set of rules. The rules prescribe the structure of the game, the manner in which it is played, which player must make a decision at any par- ticular stage of the game, the information available to this player--in fact everything but the actual choices made by the players (and by chance). Games which are described in this manner are said to be in extensive form. After a long careful development, von Neumann and Morgen- stern give a precise, axiomatic definition of a game in extensive form [31, section 10]. In a game in extensive form, a plan detailing what choice to make, on the basis of the available information, under every situation which could arise, is called a strategy. If each player chooses a strategy, the course and outcome of the game are deter- mined, except for chance effects. These chance effects can be eliminated by considering the expected values of the payoffs instead of the payoffs themselves. An equiva— lent game can be generated in which each player makes one choice only, from the set of all strategies available to him, in complete ignorance of the particular choices of the other players. He will, however, be aware of the strategies available to the others (through knowing the rules of the game, for instance). This equivalent game is in the normalized form. More precisely, a game for N players in normalized form consists of: N sets of strategies Si’ i = l, ..., N; N real-valued payoff functions P , i I l, ..., N, i with the domain of each of these functions being S Q S x ... XS 1 N' Each player chooses independently a strategy from his set of strategies. If s are the strategies 1’ s2, ..., sN chosen, 5 = (81, 52, ..., sN) is the corresponding point in S, and Pi(s) gives the (expected value of the) payoff to the 1th player corresponding to the play of the game in which these strategies are used. Each of the two formulations has its advantages: the normalized form is most useful when considering features common to all games, and the extensive form emphasizes the peculiarities of individual games. A satisfactory solution theory does not exist for general games. One does exist for certain types of two- person zero-sum games. A zero-sum game is one in which HMZ Pi(s) = O for all scS. =l An N-person game which is not a zero-sum game can be made into one by adding another player who has the payoff PN+l(s) = — Pi(s). l-“MZ =1 The set of strategies for this player, S is, of course, N+1 empty. N-person games are generally studied by dividing the players into two coalitions and considering the resulting two—person games for various divisions of this sort. It can be seen then that the theory of two-person zero-sum games plays a large part in the theory of games as a whole. The basic idea behind the solution to a two-person zero-sum game is guaranteed payoff. If the first player chooses a strategy 5 his payoff could be as little as l min Pl (81,,52). $2682 However, by choosing his strategy to be 51, the maximin strategy, where min P (sl,S2) = max min Pl (81,82) = E» 1 82832 s as 3 £82 1 l 2 he can guarantee that his payoff will be at least E) Likewise the second player can guarantee that his payoff is at least max min P (s s ) =-min max P (s ,s )='- W. s as s as 2 l, 2 s as l 1 2 2 2 1 1 82632 1 1 In Chapter II it is shown that u S G. If E = 1.7, the first player can gain, and the second player can lose, neither more nor less than t, provided both players use their maximin strategies. In this case, the solution consists of the maximin strategies, El, E and the payoff Pl (Ei,§é), 2 called the yalug of the game. If wg{\ 3(8) = ¢ if a # 8 (iii) each EA“) is connected and has a piecewise smooth boundary. By a piecewise smooth boundary it is meant that the boundary consists of the union of the closures of a finite number of (n-l)-dimensional manifolds each Of which can be described parametrically by equations of the form (where x is a point Of one of these manifolds): x = X(a), a e A, a region in En-l, where the function x is 0(1) on A. 14 Definition: A real-valued function g defined on g is piecewise continuous on I if there exists a decomposition {X(a)} of E such that for each a there is a continuous function g(a) defined on Eta} for which 3(X) e g(a)(x), XEXfa)- The function g is piecewise C(m) if the functions g(a) are C(m) on the xfaj. If g is a vector function with real-valued components it is piecewise C(m) if there is a decomposition of E for which each component satisfies the above definition. If g1, g2, . . . gp are several vector or scalar functions with the domain I; the C(m)—decomposition associated with (a)}, these functions [I is the "coarsest" decomposition for which each component of these functions satisfies the above definition. By "coarsest" it is meant that any other decomposition contains a decomposition of at least one of the xi“). The following Lagrange multiplier rule is Theorem 10.1 Of Chapter 1 of Calculus of Variations and Optimal Control Theory by M. R. Hestenes [16]. The statement has been slightly modified. Consider the minimization of a function f on a set S of points u in Eq. The set S is defined by the relations l5 ¢a(u)55 O a 1, ..., m' (1.2.1) ¢a(u) = O a m' + l, ..., m Let uO afford a local minimum to f on S. It is assumed that f and o? (1.2.12) or: ' r(3)(f,u) + léj)¢a(f,u) where léi) and 1:3) are the limiting values Of )‘a at If frcnn~§. and X respectively. The multipliers la are con- 1 ‘3 tirnaous at each point of continuity in x of U, f, and fuk. 21 32223 One may apply Theorem 1.2 directly to the case where x is replaced by i: , r by r”) and fuk by fife) for each £1 in the decomposition of g. Since the resulting multi- (i) are piecewise continuous on Z: they can be a 1 combined to form piecewise continuous functions on the pliers A set g, This method of combination requires the interpre- tation (1.2.12) at points of discontinuity of f and fuk. Some of the theorems in Chapters III and IV make use of existence, embedding and differentiability theorems for differential equations. The following theorems are taken without proof from the appendix to Hestenes' Calculus of Variations and Optimal Control Theory[l6]. The hypotheses are weaker than are required for the applications in the later chapters. Similar theorems with stronger hypotheses can be found in Bliss [6] and any differential equations text. The differential equations, in vector form are x = f(t,x,).), (1.2.13) where x is a vector in n—dimensional euclidean space En, i = dx/dt, and A is an element Of a normed linear space. For example, A may be a control function_ IA 0‘ u: u(t), a 3 t with the norm ||u|| = sup I u(t)| on a S t S b. However, the parameter A is not restricted to control functions. (' () (J (9' 22 Although the results are independent of the norm used, it is convenient to consider the norm , i = 1, ..., n. With this norm, a 6- neighborhood Of a point (d,B) in tx-space is {(t,x) It-a|<6 and Ix-B|<6} Hypotheses It is assumed that the real—valued functions f1 in the differential equations (1.2.13) are defined for all (t,x) in a region E_Of tx-space and A in a subset of a A of a normed linear space. Moreover, to each (a,B)e.F, assume there is a constant 6 and two integrable functions M(t), K(t) such that 1. the 6-neighborhood Of (a,8) is in F; 2. For each x in the 6—neighborhood 85 of B and for each AeA, the functions fi(t,x,A) are measurable in t on the d—neighborhood d6 Of a and satisfy | f(t,x,A) | S M(t) (1.2.114). on as. Thus f(t,x,A) is integrable on a-d < t < a + 6 for each x in 85 and A in A; L'n' a.“ (7' I 23 3. For each x and y in 86 and each A in A, the inequality If(t,x,A)-f(t,y,A)I S K(t)] x—y | (1.2.15) holds on 06; 4. For each x in 85 and A0 in A d+6 iiTO |f(t.x,A)—f(t,x,10)| dt = 0. (1.2.16) a-d Lemma Let S be a compact subset Of F_which is convex in x. Then there is a 6-neighborhood 85 Of S in §_and integrable functions M(t), K(t) such that |f(t,x,A) | S M(t) (1.2.17a) |f(t,x,A)—f(t,y,A)I S K(t) | x-y | (1.2.17b) hold for all points (t,x), (t,y) in S and all AeA. 6 In the following theorem S is a compact subset Of F_convex in x. M, K and 6 are related to S as described in the previous lemma. Theorem 1.4 There exists a constant p>O with p<6 such that to each point (a,B) in S and A in A there exists a unique solution 24 x(t,a,B,A), a - p s t 5 a + p of the initial value problem dx d T: f(t,X,A), X01): 8. The function x(t,d,B,A) is a continuous function of its arguments on the set lt-al S. p, ((1,8) 5 S, AEA Let AO be a fixed element Of A and let a: x(t), a 5 t s b be a solution of the differential equations Q10: ('1‘ >4 = f(t,x,AO). (1.2.18) This solution must lie in F. The closure S of an e- neighborhood of the points (t, x(t)) Of the arc 5‘18 in F. The Lemma and Theorem 1.4 can be applied to this set S. Using the existence theorem, the function x(t) can be extended uniquely so as to satisfy (1.2.18) on aep$t$b+p, where 958 . Define b+p G(x,A,AO) = If(t,x(t),A) - f(t,x(t),A ) | dt. a-p .r‘j r) 25 One can establish the following embedding theorem. Theorem 1.5 There is a positive number 0 such that through each point (d,8) satisfying with A a-psasb+p, IB—x(a)| < o, IA-AOI < o (1.2.19) there passes a unique solution y(t9a389)\)) a-p‘tsb+p) Of the equations DID- ('l‘ N = f(t,x,A) containing the arc g for attSb, A=AO, B=x(d). The function y is continuous in its arguments. There is a constant C such that |y(t,a,B,A) - x(t)| 5 C |B-x(a)| + CG(x,A,AO) on a—pstib+p. Moreover if (a',B',A') is on the set (1.2.19), this inequality holds if x(t) is replaced by y(t,a',8',A') and A by 1'. In addition 0 O. IB-y(d,d',8‘,A‘)| s lB-B' | + | M(s)ds | a! _(‘f Lib (n 51) 26 Linear differential equations g—fé- = A3‘(t) xJ + v1(t) 01" O x = Ax + v where A; and v1 are integrable on an interval astsb have unique solutions, on this interval through each point (a,8) with asmsb and BeEfi. If A1 and v1 are extended so J that they are integrable on the real line —w(t,x) A {ul (t,x,u) €31} (2.1.10a) V(t,x) A {vl (t,x,v) e52} (2.1.10b) The domain Of d is Ed and that Of T is F2. Constraints of the form (2.1.7) can Obviously be restated in the present form, since 51 is the set of (t,x,u) satisfying (2.1.7a) and (2.1.7b), and 32 is the set of (t,x,v) satisfying (2.1.7c) and (2.1.7d). The statement that (t,x,u,v) is an element of the prescribed set R is equiva- .0 lent to the statements u e c (t,x) (2.1.11a) v e V (t,x) (2.1.11b) Elements (t,x,u,v) in 50 are called admissible elements, and a differentiably admissible arc whose elements (t,x(t), u(t), v(t)) are all admissible and for which (t,x(t)e§ is called simply an admissible arc. The set of all p-dimensional vector functions U (l) which are piecewise C on F and which satisfy U(t,x)e 1. l 2 l The equations are similar, with the pursuer having an advantage in speed. When u1 = l and v1 = ai= const., this type of motion is called simple motion by Isaacs, who originated the first of the four examples [20]. The symbol 2 will mean (xE, yE, xP, yP). Example 1 The initial time, to, is fixed. The payoff is time to capture--capture meaning xE(tl) = xP(tl), yE(t1) = yP(tl)’ when this occurs for the least time t1. There is no loss in generality in taking t0 = 0. At each time t, each player knows t and 2 as well as the equations (2.3.1) and (2.3.2). By considering the rate of change of the distance between the evader and the pursuer, this problem can easily be solved. Figure (2.3.1) and the Theorem of Pythagoras show that r2(t) = (x (t) — x (t))2-+< (t) — Fig. 2.3.2 0" (xP(O),yP(O)) Fig. 2.3.3 53 5(5) = cos 6(z(t»(xE(t) - iP(t)) + sine (z(t)) (yE(t) - 9P(t)), (2.3.u) where 6(Z(t)) = tan-1 [(yE(t) - yP(t))/(xE(t) - xP(t))3. Player One wishes to delay the time t at which r(tl) l is zero as long as possible. Consequently he should strive to maximize r(t). Player Two's interest is in minimizing f(t). Substituting the differential equations (2.3.1) and (2.3.2) into (2.3.4) shows that One should maximize u (cos 6(z(t)) cos u + sin 6(z(t)) sin u2) (2.3.5) 1 2 and that Two should minimize --v1 (cos 6(z(t)) cos v2 + sin 6(z(t)) sin v2) (2.3.6) The optimum conditions are u2(t) = v2(t) = e(z(t)) (2.3.7) ul(t) = 1, Vl(t) = a, (2.3.8) that is, a direct chase at maximum speed. The payoff is r(O)/(a—l). If the capture condition is r(tl) = 2 (where r(0) > i), the optimal strategies 5“ remain the same, but the payoff becomes (r(O) - 2)/(a—l) The Optimal strategies: ul(t,z) l U(t,z) = = (2.3.9) u2(t,z) 6(2) ' and v1(t,z) a V(t,z) = " = (2.3.10) are independent of t. This is to be expected, since the, differential equations (2.3.1) and 2.3.2) are autonomous. Example.2 This example differs from Example 1 in that the terminal surface in tz—space is given by the equation , t - T (T>O). and in that the payoff is r(T). ForT <.r(0)/(a-l), the strategies (2.3.9) and (2.3.10) which provide a saddle point for r(t) on [0,T] are optimal. For T > r(t)/(a—i), r(t) a o for r(0)/(a-l) s‘t s T, and consequently 6(z) is indeterminate. However, even if no strategies are defined when r(t) = 0, any deviation from this condition results in its immediate restoration. 55 The payoff for T < r(0)/(a-1) is-r(0) + (1-a)T. For T 2 r(0)/(a—1) it is zero. ‘ Example 3 This example differs from Example 1 in that the information includes only t and 2(0). That-is, the players do not know the state after the play has commenced. In thiscase themaximin payoff is less than the minimax payoff, and the game has no saddle point. Suppose.the pursuer announces his strategy. Then, regardless of what it is, the evader can choose his~ strategy so that the paths lv 0 xP(t), yP(t), t and V O xE(t), yE(t), t — never intersect. The minimax payoff is + Q, On the other hand, if the evader announces his strategy, the pursuer can choose any strategy which brings him to the point where the evader's path first intersects his attainable set at the same time as the evader. This can be done since the pursuer is faster. ‘To minimize his loss, the evader must choose his announced strategy to delay this encounter as long as possible. Since for t < T‘= r(O)/(a-l), the evader can 56 reach points which the pursuer cannot and since for t 3 T, the pursuer can reach any point the evader can, the maximin payoff is T‘and the-evader's maximin strategy is u1 = l, u2 = 6(z(0)). The play in this case is the same as in Example 1. Example A This example differs from Example 2 in that the same restriction of information occurs as in Example 3. The cross-sections of the attainable sets for fixed times are circles centered on x1(0), yi(0), i = E, P, with radii of l.-t for the evader, and ast for the pursuer. If T < r(0)/a the situation in Figure (2.3.2) occurs. The cross-section P(E) of the attainable set at time T is the circle of radius aT(lT) centered on the pursuer's (evader's) initial point. The point A is closer to the point B than any other point in P; the point B is further from A than any other point in E. Thus the controls which bring the pursuer to A, the evader to B, constitute a saddle point. The value is r(O) + (l-a)T. For T 3 r(O)/a, the saddle point condition breaks down. In this case, Figure (2.3.3),the point (xE(0), yE(0)) is within the circle P of radius aT. Suppose the pursuer's strategy is known_to the evader. The evader can choose his strategy to take him to the point on the circle E of radius T furthest from (xP(T), yP(T)). The resulting payoff, r(T) 57 is clearly 2 T, the radius of E. The pursuer's minimax strategy is any one which makes (xP(T), yP(T)) = xE(0), yE(O)), for in this case r(T) 5 T, regardless of what the evader does. If the evader's strategy is known to the pursuer, the latter can use the strategy which brings him to the point within P closest to (xE(T), yE(T)). In this case, (a) r(T) s r(O) + (1-a)T if r(O) + (l-a) T > O, or (b) r(T) = 0 if r(O) + (1—a)T $0. The distance r(O) + (1-a)T is the distance between the points A and B in the figure, which is drawn for the case (a). The evader‘s maximin strategy is the one which brings him to the point B, for with this strategy r(T) é r(O) + (l—a)T. Case (b) corresponds to the situa— tion where T is sufficiently large for the circle P to enclose the circle E. For this situation, any of the «evader's strategies may be considered a maximin strategy, saince the payoff is zero, independent of the strategy tzsed by the evader. The principle of optimality does not hold for the tnvo examples with incomplete information, although it does In11d.partially in Example A for T < r(O)/a. The optimal fieslds introduced in the next chapter have the principle 0f <3ptimality stated as part of their definition. One may 58 view the principle of optimality as a consequence of the existence of an optimal field. Since the principle is not, in general, valid for games of imperfect information, as these examples illustrate, all games to which these fields are applied are assumed to be games of perfect information. III. OPTIMAL FIELDS FOR SADDLE POINTS 3.1 Optimal Fields in Controleroblems The results in this chapter are extensions of those stated by Hestenes [16, ch. 6, sec. 10] on optimal fields for optimal control problems. In this section the defini— tion of an optimal field for the case of a one-player game is presented, and the basic theorem—-a version of the. ‘Weierstrass necessary condition--is proved. The terminology developed in the previous chapter, with the maximizing player, One, suppressed, is used. Consequently u is not one of the arguments of L and the f1, and §_and 50.are sets in txv—space. The fundamental assumption about an optimal field is that at each point (t,x) in a region F, there exists a choice V(t,x) of the control variable v optimal in some sense. That is solutions of s. = f(t,x, V(t,x)) (3.1.1) are optimal paths in tx-space. In what follows L and the f1 are assumed to be C(l) on 3. Consequently if V is C(1) on a neighborhood of a point (5,?)5 F , there is a unique arc x satisfying (3.1.1) on an interval [EL6, E46], 6 > 0, with x(F) = x. This is a consequence of a theorem 59 60 on the existence and uniqueness of solutions to differential equations, Theorem 1.4. Definition An optimal field E is a region F.1n tx-space, a (1) vector control function W,C on Faand a function w also C(l) on F, such that (i) (t,x, V(t,x))e:_13_0 V (t,x)eF (ii) The inequality 8 W(a,x(a)).s lit,X(t), V(t)) dt + W (B,X(B)) (3.1.2) 11<>lds for every admissible are x: x(t), v(t), a 5 t 5 8, (3.1.3) equality holding in case V(t) =V(t,x(t)), dst ‘8. (3.1.14) :qu (3.1.4) holds, 5.15 an extremal or a characteristic arc (DIE the optimal field E, Surfaces in tx-space defined by eaquations of the form W(t,x) = constant are called transversals of the field E. If x_is an extremal of the. field 3, then 6l I(§) = L(t,X(t),V(t)) dt = W(a,X(a)) - W(B,X(B)). (3.1.5) Two extremals x and £_whose initial points are on the transversal W C and whose endpoints are on the 1 C2 have the property that transversal w 1(5) = 1(3) = cl - c2 (3.1.6) The following theorem is taken from [16] and the proof follows that given by Hestenes. Theorem 3.1 At each point (t,x)ezg of an optimal field 3 one has L(t,x,v) + wt(t,x) + wx(t,x) f(t,x,v) 2:0 (3.1.7a) for all v such that (t,x,v)e§0. Moreover, for v = V(t,x) equality holds: L(t,x, V(t,x)) + wt(t,x) + wx(t,x) f(t,x, V(t,x)) = O. (3.1.8a) For convenience, E(t,x,v) can be defined as E(t,x,v) A L(t,x,v) + wt(t,x) + wx(t,x) f(t,x,v) 62 so that (3.1.7) can be re—expressed as E(t,x,v) a o, (3.1.7b) and (3.1.8) as E(t,x, V(t,x)) = O (3.1.8b) Proof Let (76,3?) be a point of g and choose Vew('€,3€). Let E? x(t),v(t) be a continuous admissible arc in F on either (1) Ffls t:s:F + 5 or (ii)7c--(f,§,v) Z 0 (3.1.1u) d = d3, j = l, ..., v, and if (3.1.8) is replaced by L(“)(t,x, V(a)(t,x)) + W(a)(t,x) + W(a)(t,x)+-W(a)(T,x)f(a)(t,x, V(“)(t,x)) = 0 j = l, ...,v. (3.1.15) 66 Proof The function E is continuous on the set G of points (t,x,v) SUCh that (t,x) is in one of the regions 3‘“) and (t,x,v)e§0. By Theorem 3.1, E(t,x,v) 3 O on G Furthermore the function E(a) = L(a) + wéa) + wfia) f(a) is continhous on the set (t,x,v) such that (t,x)eF‘a) and (t,x,v)£B_0 and agrees with E on G. Consider the point (a) (E;§) on the boundary of}? and let V‘be such that (3);,V)efio. Let {(ti’ xi, Vi)} be a sequence of points in G converging to (F,§,V). Since for each i, E(a)(ti, xi, vi) 2 0, lim E(a)(t , xi, vi) 2 0 1+0!) 1 But the continuity of E(“) shows that lim E(a)(ti, xi, v = E(“)(€,E,V). i+oo 1) Thus E(a)(t,x,V) Z 0. This expression expanded gives (3.1.14). Likewise 67 E(a)(t V(a)(ti, xi)) = 0 for all i 13 xi, In the limit this gives E(a)(t,x, V(o‘)(t,x)) = 0, an expression equivalent to (3.1.15). The function E is one form of the Weierstrass E-function. This can be demonstrated by setting E(t,x,V,v) = E(t,x,v) - E(t,x, V(t,x)) Z 0, which can be rewritten as L(t,x,v) - L(t,x,V(t,x)) + Wx(t,x)[f(t,x,v) — f(t,x,V(t,x)1= 0 which corresponds more closely to the usual form of the Weierstrass condition. 3.2 Optimal Field for a Saddle Point In this section it is assumed that a differential game of the type defined in Chapter II has a saddle point, that there exist optimal strategies UeU_and VeV_which are (1) piecewise C on F and that there exists a value function (1) W which is continuous and piecewise C on E. Solutions of 68 x = f(t,x, U(t,x), V(t,x)) (3.2.1) are then optimal paths in tx-space. It is assumed that L and f are C(l) on R. The previous definition of an optimal field can be extended to cover saddle points. Definition An optimal field F is a region F‘in tx-space, vector (1) strategies U and V which are piecewise C on F, and a function W continuous and piecewise C(l) on 3 such that (i) (t,x, U(t,x)) 85; V (t,x) 5 E. (ii) (t,x, V(t,x))efi2 (iii) The inequality 8 W(a,X(a)) S L(t,X(t), U(t,X(t)), V(t))dt + W(B,X(8)) O. (3.2.2a) holds for every admissible arc x: X(t), U(t,X(t)), V(t), a st 5 B, equality holding in case V(t) = V(t,X(t)), a s t s B; (3.2.2b) and the inequality 69 B [ L(t,x(t), u(t), V(t,x(t)))dt + W(B,x(B))SW(d,x(d)) a (3.2.3a) holds for every admissible arc x: x(t), u(t), V(t,x(t)), a s;t:§ B, equality holding in case MH=UWJWH,a§tSB. Siam If there is more than one admissible arc é? x(t), U(t,x(t)), v(t) o Sitflé B for each v(t), (3.2.2a) holds for each of these arcs. Likewise (3.2.3a) holds when the arcs x: x(t), u(t), V(t,x(t)) ditSB are not unique for a given u(t), a S t S B. On a manifold of discontinuity of U or V one may or may not have a definition of these functions differing from their limiting values from neighboring regions of continuity. If the optimal paths lie on such a manifold, the optimal strategies need to be defined there, but if the optimal paths merely cross the manifold from one region to another this is not necessary (and indeed superfluous). Optimal arcs lying on manifolds are considered in the corollary to Theorem 3.4. 7O Surfaces W(t,x) = constant arc transversals of the field E, If (3.2.3) holds, the arc x‘is a characteristic are or an extremal 9f the optimal field E, If the are x; x(t), u(t), v(t), a s t S B is an extremal of the field, then 8 1(5) = L(t,X(t),u(t),V(t))dt = W(a,X(a))—W(8,X(B)) (3.2.5) Two extremals x and 3 whose initial points are on the transversal surface W(t,x)=Cl and whose final points are on W = C have the property that 2 my = 1(3) = cl — 02. In the problem of Mayer, L(t,x,u,v) E 0, and the value function W is constant along optimal paths. Such paths, then lie in transversal surfaces. Along arcs x: x(t), U(t,x(t)), v(t), a St S 8 W(a,X(a)) s W(B,X(B)) and along arcs, £3 x(t), U(t), V(t,x(t)), O. S t S 8 W(a,X(a)) 2 W(B,X(B)). 71 Transversal surfaces in this case have the property of semipermeability used by Isaacs [20]. That is, if player One uses the strategy U at a point (t,x) no arc can penetrate the transversal surface containing (t,x) in the direction of decreasing W, and if player Two uses V, no arc can penetrate in the direction of increasing W. Theorem 3.2 At each point (t,x) of an Optimal field E one has L(t,x,u,V(t,x))+Wt(t,x)+Wx(t,x)f(t,x,u,V(t,x)):S O, o S L(t,x,U(t,x)y)+Wt(t,x)+Wx(t,x)f(t,x,U(t,x),v) (3.2.6) for all ue¢(t,x), V8V(t,x). Moreover for u = U(t,x) and v = V(t,x), equality holds: L(t,x,U(t,x),V(t,x))+Wt(t,x) + Wx(t,x)f(t,x,U(t,X), V(t,x)) = 0‘ (3.2.7) If (t,x) is a point of discontinuity of U, V, Wt or Wx, in the expression L i Wt + fo, U, V, Wt and WK must all be evaluated as limits approaching (t,x) from the same region of continuity of these functions (U, V, Wt, Wx).' (In succeeding theorems this may be referred to as "the usual interpretation at points of discontinuity.") 72 £322£,—-The right-hand inequality in (3.2.6) follows directly from Theorem 3.1 and the second corollary with L(t,x,v) = L(t,x, U(t,x), v) and f(t,x,v) = f(t,x, U(t,x),v). The left-hand inequality is obtained in a similar manner from Theorem 3.1 and both corollaries. Equation (3.2.7) again comes from Theorem 3.1. It is also an immediate consequence of (3.2.6). The remaining part Of the theorem is a restatement of Corollary 2 to Theorem 3.1. Define Pi(t,x) = Wxi(t,x) , (3.2.8) H(t,x,u,v,p) = L(t,x,u,v) + pf(t,x,u,v), Here p is an n—dimensional row vector. Theorem 3.2 is equivalent to: Theorem 3.3 At each point (t,x) in an Optimal field E the inequalities H(t,x,u,V(t,x),P(t,x)) 5 H(t,x,U(t,x),V(t,x),P(t,x)) IA H(t,x,U(t,x),v,P(t,x) (3.2.9) hold for all ue¢(t,x), and for all ch(t,x). Moreover U, V, and W satisfy Wt(t,x) + H(t,x, U(t,x), V(t,x), Wx(t,x)) = 0. (3.2.10) 73 The same interpretation at points of discontinuity as in Theorem 3.2 is used in (3.2.9) and (3.2.10) for Wt, wx, P, H, U and V. Finally the integral I* = [{P(t,x)dx - H(t,x, U(t,x), V(t,x), P(t,x))dt} (3.2.11) is independent Of the path in F, provided that any segment of the path lying on a manifold of discontinuity is divided into segments on which P and H take the limiting values of only one adjacent region. P322£,—-Expressions (3.2.9) and (3.2.10) are restate- ments of Theorem 3.2 using the terminology of (3.2.8). The integral I* is seen to be de when (3.2.10) and (3.2.8) are used to make substitutions. IdW is clearly independent Of the path. The restrictions on the evaluation of the integrand insure its existence. This Theorem corresponds to Theorem 10.2 in Hestenes [16]. The following theorem relates limiting values of functions appearing in Theorem 3.3. Theorem 3.4 Let {3}“)} be a 0(1) ~decomposition of F_based on U, V and W, and let P be a set of indices a such that the manifold of discontinuity M.é. (I) BFfa) is not empty. as? The superscript d (e.g. Wéa)) FIG} which agree on E(a) with scripted) functions (e.g. Wt). (dt, dx) is tangent to M, for Wéa)(t',f)dt + Wéa)(t','x')dx 74 (1) denotes the functions C on the corresponding (unsuper- Then if (t’,3?)eM_ and each car and Bar = wé8)('t‘,x‘)dt + WJEB)(F,x-)dx. (3.2.12) Also (What) - P(B)(F,Y)de = [H(“)(t,§,u(°‘)(t,x), V(“)('t',x), P(“)(t,§)) — H(B)(€,§,U(B)(t,x), V(B)(t,x,) P(B)(t,§,))Jdt (3.2.13) Proof.--(3.2.13) follows from (3.2.8), (3.2.10), and (3.2.12). TO show (3.2.12), a C(l) curve lying in dI U given parametrically by t t(s), x x(s), with t(O) x(0) = i; can be defined. Let §_be a neighborhood of ( ) on E and C(l)on A (3(1) (75.x). 3(a), it is (3(1) on yflanj. defined on A which has W(t,x) = W(a)(t,x) on FIG}. (.(1) J Since W is piecewise C function W can be On the curve a further function can be defined w(s) Afi(t(s>, x(s)> = w(°‘)(t(s), mm. for which one Obtains 75 mm) = Wt(F,x)t'(O) + fix(t‘,i')x'(0) = wé“)(t,§')t'(0) + w(°‘)(t‘,x)x'(0). Another equation is Obtained in the same manner: w.(0) = wé5)(t‘,x)t'(0) + w§5)(t,§)xv(0). The curve is arbitrary. It follows that (t'(0), x'(0))ds may be any vector tangent to M. This, together with the two expressions for w‘(0), established (3.2.12). The corollary which follows considers the case where an optimal path starting or terminating at (E,x) lies in the manifold M, In this situation Optimal strategies defined on the manifold are not necessarily the same as (a) (a). one Of the limiting strategies U or V Corollary Let Optimal strategies U and V defined on the manifold M prescribe optimal paths lying in M_and let (3,?) be a point of M, Then L(t,x‘, U(t,x),v) + wéO‘NE'JE) + w(°‘)(t',i‘) + w(°‘)(t',sz) + W(O‘)(t‘,‘x) f(t,x,u(t,x‘),v) ->- o (3.2.l4a) 'whenever (F,§,V)e§2 and (l, f(t,x, U(t,x),v)) is tangent ’60 £3 76 L(t‘,3€,u,V('t‘,x))+wé°‘)(t’,f)+wf{°‘)(E,i‘)f(t',x',u,V(t‘,i‘)) s o (3.2.14b) whenever (F,x,u)eRl and (l,f(t,x,u,V(t,x))) is tangent to M,and L(t,x,ufi'j) W(t,x) )+wé°‘) (t,x) + wt(°‘)(t,§)r(t,;,U(t,x), V(t,x)) = o. (3.2.14c) Proof.--Let i: x(t), U(t,x(t)), v(t), tO st stl be an admissible arc with (t,x(t)) lying in M where either (t0,x(t0)) = (5,?) or (tl,x(tl)) = (t,x). This are can be described parametrically with parameter s t - €,‘becoming £3 x(s), U(t(s), x(s)), v(s) s0 3 3 Either S0 or s1 is zero. The function w defined in the proof of Theorem 3.4 is differentiable along the are A. With g(e) given by (where s0 = 0): e g(e) = J L(t(s),x(s), U(t(s),x(s)), v(s))ds + w(€), O or with h(e) given by (where s1 = O): 77 O h(e) = - [ L(t(s),x(s), U(t(s),x(s)), v(s))ds + w(—e), -e the arguments used in the proof of Theorem 3.1 establish the first and third of the above expressions. The second is Obtained through the use Of the first corollary to that theorem. In carrying out these arguments the substitution w‘(0) = wé“)(t,§)t'(0) + w§3)(t,x)xi(0) Obtained in the proof of Theorem 3.4 is used, where in the present case t'(0) = l, x'(O) = f(t‘,x‘, U(t‘,x‘), v(F)). The tangency restrictions arise from the correspond- ing conditions in Theorem 3.4. The following theorem is true (as can be seen from the proof) for any pair of admissible strategies U and V and function W which satisfies t1 W(t0,x(t0)) = J L(t,x(t)),u(t),v(t))dt + W(tl,x(tl)) to for any admissible arc 1‘5 x(t), u(t), v(t) t0 5 t 5 t1 with u(t) = U(t.X(t)), v(t) = V(t,X(t)). the type of field defined in this section or the type defined in Chapter IV. Theorem 3.5 On each region A in g on which U and V are 0(1), an extremal 5: x(t), u(t), v(t), a 5 t 5 B of the field, together with the functions pi(t) = Pi(t,x(t)) (3.2.15) satisfies the equations x = H (t,x, u(t), v(t), ) p p (3.2.16) 5 = —(Hx(t,x,u(t),v(t),p)+Hu(t,x,u(t),v(t),p) Ux(t,x) +Hv(t,x,u(t),V(t),p)Vx(t,x)). Moreover fi(t.x(t),u(t),v(t),p(t)>=Ht(t,x(t) ,u(t),v=-H H=0,H=o (3.2.214) together with H = Ht (3.2.25) on the region Q. Suppose that M and M. are given by the constraint 1 2 conditions (2.2.7) stated in the last chapter. For con- ‘venience one can make the definition 84 fi(t,x,u,v,p,u v) = H(t,x,u,v,p) + ua¢a(t,x,u) + vaB(t,x,v). (3.2.26) Theorem 3.7 Let 31 and 52 be as stated immediately above. Then ET“ | there exist multipliers ua(t,x) and v8(t,x), piecewise 8 continuous on §_(piecewise C(l) if 6a and w are of class F 0(2))such that on E_ E ua(t,x) 5 O a = l, . r' vB(t,x) 3 O B = l, . s' ua(t,x)¢a(t,x, U(t,x)) = O a = 1, ..., r, and not summed, v6(t,x)¢6(t,x, V(t,x)) = O B = 1, ..., s, and not summed, fih(t,x,U(t,x),V(t,x),P(t x), U(t,x),v(t,x)) = 0 (3.2.27) fi§(t,x,U(t,x),V(t,x),P(t,x),u(t,x),v(t,x)) = O The usual interpretation is made at points of discontinuity. Let x5 x(t), u(t), v(t), a 5 t S b 85 be an extremal, such that there is a decomposition (ti-1’ of the interval [a,b] with (t,x(t)) in a region on WhiCh U (l) and V are C on each of the intervals comprising the decomposition. (This excludes arcs with subarcs lying on manifolds of discontinuity of U or V.) Then 5 satisfies, with p(t) = P(t,x(t)) u(t) = u(t,x(t)) and v(t) = v(t,X(t)), the canonical Euler equations x(t) = fi5(t,x(t),u- B one can show that not only (4.1.4) but also (4.1.5) must hold, with W(d,x(d)) given by t1 W(a,x(d)) = I L(t,x(t),U(t,x(t)),V(t,x(t))dt a + W(tl,x(tl)) (4.1.9) in which the arc 5F x(t), U(t,x(t)), V(t,x(t)) d 5 t 5 t (4.1.10) 93 with (tl,x(tl)) = (T(O),X(O))€ T and W(tl,x(tl)) = K(o), is admissible. Equation (4.1.9) defines W(a,x(u)) as J(a,x(a); U,V). Since V is minimizing J(d,x(d); U, V) s J(d,x(d), U, V) for any VET, Suppose that V does not differ from V on the set Of (t,x) in F_which have t 218 for some 8 > a. Let x be an arc corresponding to the pair (U,V) starting at t = u, terminating at (t1,x(tl))eT_and let v(t) = V(t,x(t)) d S t < 8 Then (if B 5 t1) 8 W(d,x(a)) 5 J(d,x(d); U,V) = I L(t,x(t),U(t,x(t)),(V(t»dt d t 1 + I L(t,x(t),U(t,x(t)),V(t,x(t)))dt 8 + W(tl,x(tl)), so that IA 8 W(a,x(a)) J L(t,x(t),U(t,x(t)),v(t))dt + W(B,X(B)). d (4.1.11) This is (4.1.4). To show (4.1.5) one starts with 94 W(d,x(d)) = J(d,x(d): U,V) 2 min J(d,x(d); U,V) (4.1.12) VeV where U is given by (4.1.8). This holds since U is the maximin strategy. Then min J(d,x(d); U,V) VET B = min {I L(t,x(t),U(t,x(t)), V(t,x(t)))dt VEV t l + J L(t,x(t),U(t,x(t)), V(t,x(t)))dt B + W(tl,x(tl))} where the arc 35: x(t), U(t,x(t)), V(t,x(t)) 0!. S t 5 t1 intersects T at t = t1, and B 3 t1. Then min J(d,x(d): U,V) V82 8 = min {[ L(t,x(t),U(t,x(t)), VL(t,x(t)))dt V EV 1.— t1 _ + min { L(t,x(t),U(t,x(t)), V2(t,x(t)))dt V28! 8 + W(tl,x(tl))}} 8 = min {J L(t,x(t),U(t,x(t)), Vl(t,x(t)))dt + w(8,x(8)}, (4.1.13) up." 1 \ —l 95 since the second minimum is given by V2 = V. Now 8 \rAin { I L(t,x(t),U(t,x(t)),V‘l(t,x(t)))dt + W(B,x(B))} .5 I‘. Z RA‘I!‘ ') B 2 min { I L(t,x(t),U(t,x(t)),v(t))dt + W(B,x(B))}. (4.1.14) P- 1 L. '3; the minimum being taken over admissible arcs (4.1.6). Combining (4.1.12), (4.1.13), and (4.1.14) establishes (4.1.5). The inequality in (4.1.14) arises from consider— ing v‘s which do not satisfy v(t) = V(t,x(t)) for any VeT, since v is only required to be piecewise continuous and V‘is piecewise C(l). One may also note that min, min and min could be replaced by inf , Tnf, VET; VleT_ V2eT VET VleT Tnf without changing the validity of the argument. VZET In an analogous fashion a minimax field could be defined in which.V is player Two‘s minimax strategy. While the theorems in this section are stated for maximin fields, they hold also for minimax fields if the obvious changes 03.g. in inequalities) are made. The fact that the inequalities defining an optimal field.can be obtained from the assumption that the players ijl.a differential game have optimal strategies (as has 96 just been demonstrated for a maximin field) motivates the study of such fields in connection with differential games. However, the continuity prOperties of the function W have not been Obtained. It would be necessary to show this if one wanted to demonstrate that the field corre- sponds to the solution of the differential game. Berkovitz [3] assumes that a differential game has a saddle point and shows that if the decomposition of T corresponding to the Optimal strategies U and V is Of a certain type, called a "regular decomposition," the payoff J(t,x; U,V) has the continuity properties required Of W. As before let E be defined by E(t,x,u,v) = L(t,x,u,v) + Wt(t,x) + Wx(t,x)f(t,x,u,v). Theorem 4.1 The functions U, V, and W satisfy on T E(t,x(t),U(t,x), V) Z E(t,x,U(t,x),V(t,x)) = 0 (4.1.15) where ve‘i’(t,x), and E(t,x(t),U(t,x(t)),V(t,x(t))) = = uein,X) {8W18fx) E(t,x’u,V)} .(4.1.16) 97 If {3(a)} is a C(l)-decomposition of T corresponding to U, V and W, (4.1.15) and (4.1.16) hold on each T1”), and (4.1.15) holds on the manifolds separating regions of the decomposition if interpreted with U, V, W and WK as t limits from one of the adjacent regions. If (5,?)88 E(a) is a point on one Of these manifolds such that for all Ub¢(€,?), (F,?,U) is a point of continuity of inf E(a)(t,x,u,v), (4.1.16) also holds at (3,?) in the same limiting sense. The expression E(a)(t,x,U(a)(t,x),V(a)(t,x)) = max {TIE E(a)(t,x,u,v)} He¢(t,x) (t,x,u)-+(t,x,u) ch(t,x) (4.1.17) (a), ue¢(t,x), holds regardless of the where (t,x)e T continuity Of inf E(a). Proof.—«The equations E(t,x,U(t,x),V(t,x)) = min E(t,x,U(t,x),v) = O veT(t,x) are consequences of Theorem 3.1 and its second corollary, for with U fixed, T, W and V form an Optimal field of the type considered in section 3.1. This establishes (4.1.15). 98 It remains to show that (4.1.16) holds. Let (€,f) be a point of continuity of U, V, W and WK, and let U t be any admissible strategy. Arcs to be considered are admissible arcs of the form £3 x(t), u(t), v(t) F — 6 5 t 5 E, 6>O (4.1.18) with x(6) = f, u(t) = U(t,x(t)). From the definition of the field ”I W(t-e,x(t-e)) 3 min {I L(t,x(t),u(t),v(t))dt v "I -E + W(?,?)} ' (4.1.19) for each a such that O 5 e 5 6. If V(t), €‘- 6 S t - F, is the v which minimizes the expression in (4.1.19), for e = 6 the same V minimizes (4.1.19) for e on [0,6]. Then t W(tse,x(t;e)) ZJ L(t,x(t),u(t),V(t))dt + W(t,x). t—e But t J L(t,x(t),u(t),V(t))dt = 515 J dI [E(t,x(t),u(t),V(t)) —e “I — wt(t,x(t)) - wx(t,x(t))f(t,x(t),u(t),V(t))ldt t = [_ E(t,x(t),u(t),V(t))dt + w<€;e,x(t;e)) - w<€,§) t—e Since 99 [Wt(t,x(t))+Wx(t,x(t))f(t,x(t),u(t),v(t))]dt= dW t- [N Thus — 25(2) A E(t,x(t),u(t),'\'r(t))dt s 0. t-e The function E has a maximum on [0, 6] at e = 0, since gCO) a 0. Since g is differentiable o zg‘(0) = E(t',?,'u',?), (4.1.20) where H’= liM. u(t), V = lim' V(t). t+t— t+t— Since U'is any strategy in T, 5 may be any point in ¢(€,?). Then, from (4.1.20) inf _ _ E(E,§,E,V) 5 O for any Ut¢(€,?). vsV(t,x) Since inf _ _ E(t-,?,U(t—,?),V) = o by (4.1.15), vsV(t,x) equation (4.1.16) holds. Suppose that (6,?) is a point on on a manifold of discontinuity which is part of the boundary of the subregion TI“), and that EE¢(E,?). Since inf E(“)(t,x,u,v) S 0 on 3(a) V Ti?!)- ____ { inf E(a)(t,x,u,v)} 5 0, (4.1.21) (t,x.u)+(t,x,u) veW(t,x) 100 A180 E(O‘)(t,x,U(°‘)(t,x),v(0‘)(t,x)) = 0 on Fm) implies E(“)(t,i‘,u(°‘)(t,x),v(“)(t,x‘)) = o (14.1.22) This equation combines with (4.1.21) to give (4.1.17). If (F,?,E) is a point of continuity of inf E(a)(t,x,u,v), V 1im_____ inf E(a)(t,x,u,v) (t,x,u)9(t,x,u) veV(t,x) = inf E(“)(t,x,‘d,V), veT(t,?) and this may be used to substitute in (4.1.17). In (F,?,H) is a point of continuity of inf E(a)(t,x,u,v) for each V ue¢(t,x), E(O‘) (t,x,U(°‘) (t,x) ,V(°‘) (t,x)) = max { inf E(a)(€,?,u,v)} u€¢(t,?) veT(f;?) This cannot be concluded in general, since inf E(a)(t,x,u,v) v Inay not be continuous. It is however, upper semicontinuous, which yields 101 inf E(O‘) (t,x,u,v) Z V€W(t,x) TIE { inf E(a)(t,x,u,v)} (t,x,u)+(F,?,fi) veW(t,x) The next theorem is Theorem 3.3 restated for the present type Of field. The definitions of H and P are the same as before--although the U, V and W used in them are different, W now being a maximin value function, U, player One‘s maximin strategy, and V the strategy optimal against U. Theorem 4.2 At each point (t,x) in a maximin field T, the statements H(t,x,U(t,x),V(t,x),P(t,x)) 5- H(t,x,U(t,x),v,P(t,x)), veT(t,x), (4.1.23) H(t,x,U(t,x),V(t,x),P(t,x)) = max inf H(t,x,u,v,P(t,x)) ue¢(t,x) veW(t,x) (4.1.24) hold” Moreover, U, V and W satisfy Wt(t,x) + H(t,x,U(t,x),V(t,x), Wx(t,x)) = 0. (4.1.25) 'The usual limiting sense is given to (4.1.23) and (4.1.25) gm; manifolds of discontinuity. Equation (4.1.24) must be jJTterpreted with the same caution about limiting values of 102 inf H(t,x,u,v,P(t,x)) as with inf E(t,x,u,v) in V v (4.1.16). Finally, the integral 1* = {P(t,x)dx - H(t,x,U(t,x),V(t,x),P(t,x))dt} is independent of the path in T, Proof.--The proof is similar to that given for Theorem 3.3. It amounts to a restatement of Theorem 4.1 in different notation. Theorem 4.3 Theorem 3.4 holds for maximin fields without restate— ment. T32g£,--The inequalities defining the type of field are not used in the statement of Theorem 3.4 or in its proof. Theorem 3.4 holds for any continuous, piecewise C(l) function W on a region T, regardless of its origin. In the present context the statement about H and P (3.2.13) is a consequence of (3.2.8) (defining H and P), (4.1.25) and (3.2.12). To hold for a maximin field, the corollary to Theorem 3.4 must be reformulated. Thegrem 4.4 (1) Let {3(a)} be a C —decomposition of T based on U, V and W, and let P be a set of indices a such that 103 the manifold M_= () BTfa) is not empty. If there exist a def, maximin strategy U and a strategy V Optimal against U which prescribe optimal paths lying in M, then at each point (5,?) of}M E(a)(E,?,U(T§,?),v) 2 E(“)(t,x',U('€,x'),V('t‘,x)) = 0 (4.4.26) whenever VEV(E,?), (l,f(t,?,U(E,?),v)) is tangent to M and def. Further, E(“)(t,x‘,U(t‘,x"),V(t,i)) = max in; E(O‘)('t',x‘,u,v). u ve¢(t,?) (4.4.27) The :maximum is not necessarily taken over all u in ¢(E,?), but is taken over a subset of ¢CF,?). For each u in this subset there must exist an admissible arc 52(t),U(t,§E(t)),V(t) aSt st Ix) (4.4.28) £(t‘)=‘x', U(E,?)=u (t,?(t))e AZ“; where Dag, which minimizes E [ L(t,x(t), U(t,x(t)), v(t))dt + W(F,?) a among admissible arcs 104 £3 x(t), U(t,x(t)), v(t), on S t S F with (4.1.29) x(F) ='?. The arcs (4.1.29) need not lie in TTET, although (4.1.28) is required to do so. TTQQT,-—Since V is the strategy minimizing against U, the sections of the corollary to Theorem 3.4 which apply to minimization yield (4.1.26) immediately. The tangency restriction allows one to state (4.1.26) with U(F,?) and V(E,?) rather than U(a)(€,?) and V(a)(€,?). (a) If, for def, one defined W and Wx on M‘to be W t and Wx(a), one can obtain (4.1.27) with the same argument as used in the proof of Theorem 4.1 provided that the stated restrictions are satisfied. These allow the use of E(a), w (a) and wx(“) for E, w t t The comments preceding Theorem 3.5 indicate that the and WX in the argument. following theorem is true. Theorem 4.5 Theorem 3.5 holds without alteration for a maximin field. Proof.--Theorem 3.5 is independent of the type of optimal field; indeed, it depends only on the existence of (l) a pair of strategies C on a region TC T, and a function W satisfying 105 t1 W(t0,x(t0)) = L(t,x(t),U(t,x(t)),V(t,x(t)))dt + W(tl,x(tl)) for admissible arcs I defined by the strategies U and V lying in T. One would wish to have results analogous to Theorems 3.6 and 3.7. However, in a maximin field, the strategy U is not optimal-—does not maximize-~against the strategy V. Since U does not maximize, one would not expect conditions such as Hu = O or a multiplier rule to hold. Furthermore, while a global extremum is a local extremum, a global saddle point is a local saddle, a global maxmin is not necessarily a local maxmin. Nevertheless some partial results can be obtained. If U is considered fixed, the maximin field is an Optimal (minimizing) field in which V is the optimal strategy. This property may be used to obtain the following theorems. The proofs are omitted, since they are similar to the proofs of Theorems 3.6 and 3.7. Theorem 4.6 Suppose that on the region g‘of Theorem 3.5 that W(t,x) is Open, or alternatively that V(t,x) is an interior point of V(t,x). Then on T, U, V and W satisfy 106 Wt(t,x) + H(t,x, U(t,x), V(t,x), Wx(t,x)) = 0 Hv(t,x, U(t,x), V(t,x), Wx(t,x)) = 0. An extremal of the field 3.: x(t), u(t), v(t) a s t s 5 (4.1.30) satisfies, with p(t) = P(t,x(t)) the canonical Euler equations X II {It '0 0 II I 21‘. >4 + SE C. C.‘ >4 :1: II 0 on the region Q, Theorem 4.7 (4.1.31) (4.1.32) Suppose that T2 is given by the constraint conditions (2.2.7c and d), i.e., by ¢B(t,X,V) 5 O B l, ..., s' (pB(t,x,v) = o s s'+l,..., s. Then there exist multipliers 08(t,x), piecewise continuous (l) B on T (piecewise C if the 0 that on T are of class 0(2)) such 107 08(t,x) Z 0 08(t,x)wB(t,x,V(t,x)) = O B = l, ..., s, not summed, Hv(t,x,U(t,x),V(t,x),P(t,x)) + 68(t,x)w§(t,x,V(t,x)) = 0. (4.1.33) The usual interpretation is made at points of discontinuity. Let E: x(t), u(t), v(t) a 5 t 5 b be an extremal, such that there is a decomposition (co-(ti 0 such that there is a unique solution x lying in N, Of (5.1.5) through the point 0, (tl,xl) on the interval [6,8], a = tlv-p, B = t1 + p. Furthermore, there exist constants p', n > 0 such that through each point (1,5) satisfying a—D'STSB+9', IE—x0(r)|“I V H) A “I >"I U C‘.‘ A ffl NI V U < Q V A “I U “I v s L(5,?,U(5,?) ,v‘“ (t,x)) + wXIO‘)(t,x)mtjmwj),v(8)(t',36)). Likewise, 117 -thB) (t,x) = L(5,?,U(5,?) ,v( 8 ) (t,x)) + w (B)(5,x)f(5,?,U(5,?),V(B)(5,?)) _ L(5,?,U(5,?) ,v(°‘) (t,x)) + wXIB)(t,i‘,)f(t,x,U(t,x‘),v(“)(t,x‘)). Consequently, on the one hand (a) —- (8) '— (8) ._ Wt (5,x) - Wt (5,x) 2 (Wx (5,x) " Wx(a)(€,f))f(-fi-,Y,U(E,f) ,V(B)(€.a-XT)) and on the other hand w,(°‘)(t',x) — thBNt‘m 5- (wx(3)(f.x) _. wX (0‘) (5,?) )f(_,?,U(5,?) ,V(°‘) (t,x) ). By rearranging (wt(°‘)(t‘,x') — wt(8)(t,sz)) + (wx(0‘)(t‘,3£) — WX(B)(5,?))f(5,?,U(5,?),V(a)(5,?)) .<- ,o (wt(°‘)(t',x') - th8>(5,2)) + (wx(°‘)('t".i) _ WX(B)(5,?))f(5,?,U(5,?),V(B)(5,?)) 2 0 (5.2.2) 118 t(B)’ wx(9) - wx(B)) evaluated at (5,?) is a nonzero vector orthogonal to M_at (5,?), both of But, if (wt(“) — w these quantities must be either strictly positive or strictly negative, which is impossible. Therefore Wt(a)(t,x) Wt(8)(t,x) and Wx(a)(t,x) WX(B)(t,x). This theorem and the method Of proof are due to Berkovitz [3]. Corollary Theorem 5.2 holds for a transition surface M (i) in a maximin field, if the maximin strategy is continuous across M (ii) in a minimax field, if the minimax strategy is continuous across M (iii) in an optimal field for a control problem, if L and f are continuous in (t,x). TTQQT,--This is true since the proof Of the theorem requires varying only the discontinuous strategy, which is either a minimizing strategy or a maximizing strategy. The proof (given for a minimizing strategy) is actually the proof for an optimal control problem with 119 f(t,x,v) L(t,x,U(t,x),v) and 5(t,x,v) = f(t,x,U(t,x),v), and holds if L and 5 are continuous in (t,x) at (5,?). The next theorem concerns manifolds such that each point (5,?) on one of these manifolds is the initial point for several extremal arcs, each Of which proceeds, for t > 5 into a different subregion of .F.‘_- Manifolds of this type are called dispersal manifolds. The theorem also holds for points on manifolds to which several extremal arcs converge. That is the extremals are distinct for t < 5, but all have the point (5,?) in common. As before {2(a)} is a C(l)—decomposition of kaased on U, V and W for an optimal field in which U and V provide a saddle point. Theorem 5.3 Let M.= Tfiajf) TEij) T_be a dispersal manifold. Then at (5,?)eM (a) -:- (a) —-— (d) —»— (Wt (t,x) - Wt (t,x)) + (Wx (t,x) _ WX(B)(5,?))f(5,?,U(OL)(5,?),V(a)(5,?)) 3 0 (S 0) (5.2.3) 120 and (wt(°‘)(t‘,3€>)f(‘t‘.‘f,U(B)(t,x).v(8)(€,‘i)) 5- 0 (->- 0). (5.2.4) that is, that these expressions have opposite signs. Furthermore (14,996,?) - wt(8)(‘t',i’)) + (wx‘“)(€,x‘) IV 0 .. wx(8)(t,s))r(t,i’,u(°‘)(t,x),v(5)(t,x‘)) (5.2.5) and (wt(°‘)('€,3€) - wt(8)(‘€,3€>) + (wx(°‘)(‘t',i) — WX(B)(5,?))f(5,?,U(B)(5,?),V(°‘)(5,?)) s 0 (5.2.6) If M_is a surface (n—dimensional manifold), if the players choose between U(a) and U(B), V(a) and V(B), and if (5.2.3) to (5.2.6) are all strict inequalities, one of the (a) players can choose which extremal arc--the one entering T or the one entering T(B)—-is taken. 121 If the dispersal manifold M =flfifiyfl T for some set of indices P, (5.2.3) to (5.2.6) apgig to each pair a, 8, der and 88?. T399T,--Let (u,u) be a vectOr orthogonal to Mlat (5,?). Then the statement that there exist distinct arcs . (Y) (Y) '— S S éy' xy(t),U (t,xy(t)),V (t,xy(t)) t t t 1’ Y = “as: with (t,xy(t))€T(Y) for t < t 3 t1 (5.2.7) is equivalent to u + yf(t‘,x,U(°‘)(t',i‘),v(°‘)(t’,x)) 2 0 (-<- 0) (5.2.8) and u + tut—5,11“)(t,x),v(8)(t,x)) 5- o (->- 0) (5.2.9) that is, the two expressions have Opposite signs. By Theorem 3.4 (Wt(a)(5,?) - Wt(8)(5,?), Wx(a)(5,?) - WX(B)(5,?)) is either zero or a nonzero vector orthogonal to M.at (5,?). If it is zero, the relationships (5.2.3), (5.2.4). (5.2.5) and (5.2.6) are satisfied trivially. If it is not zero, then (5.2.8) and (5.2.9) hold with 'C I (wt(“)(t,x) — wt(8)(t,x)) and v = (wx(“)(t,f) — wx(3)(t,x)), 122 yielding (5.2.3) and (5.2.4). As a matter of notational convenience, set f(U,V) = f(5,?,U(5,?),V(5,?)) L(U,V) = L(5,?,U(5,?),V(5,?)) (wt,wx) = (wt(5,§),wx(E,§)). 55' From Theorem 3.2 one obtains IV ”1..—m. [Mn 2. L(U(“),V(B)) + wt(“) + wx(“)f(U(“),V(B)) 0 (5.2.10) L(U(B),V(a)) + w (a) + wx(a)f(U(B),V(a) t ) S 0 (5.2.11) and IA L(U(OI),V(B)) + W (B) + wx(8)f(U(a),V(B)) t 0 (5.2.12) IV L(U(B),V(°‘)) . w (a) + wx(s)f(U(s),V(c>) t 0 (5.2.13) Subtracting (5.2.12) from (5.2.10) yields (5.2.5) and subtracting (5.2.13) from (5.2.11) yields (5.2.6). Suppose that (5.2.3) to (5.2.6) are all strict inequalities, so that none of the arce corresponding to (U(G),V(G))’ (U(a),V(8)), (U(B),V(a)) and (U(B),V(B)) with initial condition (5,?) are tangent to M at (5,?), and suppose for definiteness that (3.4.10) > 0. Then both of the arcs corresponding to (U(a),V(a)) and (U(a),V(B)) (on) enter T In the same manner, both arcs corresponding to 123 (U(B),V(a)) and (U(B),V(B)) enter TIE). Clearly player (a) (B) If One can choose to enter either T_ or T, . (3.4.10) < 0, then player Two has the choice. If M = (1 Thin T, the preceding arguments apply to as? each pair a, 8, car, 86?. Corollapy If two or more arcs converge to (5,?)eM instead of diverging from (5,?), then (5.2.3) to (5.2.6) hold in this case also. If M = n T-z-aTnT, these inequalities hold pairwise for a, BeFIEP TTQQT.--The only change required is that the arcs (3.4.16) must be defined on some interval t0 5 t 5 5‘ rather than t-5 t 5 t The statement that one player 1' can choose the arc which is taken is inapplicable to this case. Manifolds of the type in the preceding corollary are called universal surfaces (curves, etc.) by Isaacs EEO]. The corollary to Theorem 3.4 applies to such manifolds. If M = () Fm T, and I), V are strategies which equal U, V onaM: the following theorem holds. Theorem 5.4 Let M_be a universal manifold as just described. Then at (5,?)eM, for each as? 124 L(5,?,8(5,?),V(a)(5,?)) + wt(“)(t‘,x) + wx(°‘)(t,'£)r(t',x,fi(t,x),V‘“)(5,x)) s 0 (5.2.14) L(5,?,U(a)(5,?),'\7(5,?)) + wt(°‘)(t',x) + Wx(a)(5,?)f(5,?,U(a)(5,?),V(5,?)) 2 0 (5.2.15) The inequality (5.2.14) holds if the field is a minimax field and (5.2.15) holds if it is a maximin field. Proof.--These inequalities are immediate consequences of Theorems 3.2 and 4.1. For example (5.2.15) follows from (a) (T5,?) L(5,?’,U(a)(5,?),v) + wt + Wx(a)(5,?)f(5,?,U(a)(5,?),v) z 0 in either Theorem 3.2 or 4.1. The situations covered in the above theorems do not exhaust the types of behavior exhibited by optimal trajectories in the neighborhood of discontinuities. However, most of them can be treated with a careful appli- cation Of these theorems. For example, if the optimal trajectories on one side of a surface were parallel to the surface, and on the other side departed from the surface in a direction not tangent to it, Theorem 5.3 could be applied with (5.2.3) (or (5.2.4)) an equality. VI. CONCLUSION 6.1 Conclusions Because of the close relationship between differential games and optimal fields with independent controls, optimal fields can profitably be studied in connection with differential games. In Chapter III Optimal fields with a saddle point were investigated. The necessary conditions Obtained included Hamilton-Jacobi equations, Euler equations, and a saddle point in the Hamiltonian function corresponding to the saddle point in the Optimal field. Also a multiplier rule was derived for constraints on the controls given by systems Of equalities and inequalities. None Of these results is particularly surprising; they are an extension of the corresponding results in optimal control theory. In the fourth chapter a maximin field was introduced. Maximin fields are a type of Optimal field not previously treated. In a maximin field one of the players has a strategy which maximizes a functional among a collection of functionals minimized by his Opponent. While the second player has a minimizing strategy optimal against the first player's maximin strategy, the maximin strategy is not necessarily Optimal against this minimizing strategy. It is optimal when the collection of minimal problems is 125 126 considered. Because of this, the results obtained for maximin fields are not as strong as those for saddle point Optimal fields. In particular, a multiplier rule which applies to the constraints on only one player was obtained. In Chapter Vca transversality condition for extremal arcs terminating On the surface T was derived. The behavior Of extremal arcs in the vicinity Of manifolds of discontinuity was used to derive further conditions at these manifolds. 6.2 Further Research Further research in optimal fields for differential games could profitably concentrate on strengthening the results Obtained for maximin fields. In particular, if one can be Obtained, a multiplier rule applicable to both players is a result which would be most useful. The value function for a differential game is not necessarily continuous on the playing space T, It may be piecewise continuous, in which case one could consider optimal fields defined on each region of continuity. One would like to obtain conditions relating these optimal fields on the manifolds of discontinuity of the value function. In differential games in general, rather than in the optimal fields associated with them, a direction Of research useful in applications would be into games of imperfect information. To be successful, this would most likely 127 require some sort of mixed strategy. Mixed strategies would lead to the consideration of stochastic differential equations, a difficult subject in itself. Some start in this direction has been made by Ho [17]. The extension to general N-person differential games will have to be deferred until the theory of general games is at a more settled state. Perhaps something could be done in this line for optimal rendezvous and collision avoidance problems, which are two-person non- zero-sum games. BIBLIOGRAPHY 128 BIBLIOGRAPHY L. D. Berkovitz, "Variational methods in problems of control and programming," Journal Of Mathematical Analysis and Applications, V. 3 (1961), pp. 145-169. L. D. Berkovitz, "A variational approach to differential games," in M. Dresher, L. S. Shapley and A. W. Tucker, eds., Advances in Game Theory, Annals of Mathematics Study 52, Princeton University Press, Princeton, N. J., 1964, pp. 127-174. L. D. Berkovitz, "Necessary conditions for Optimal strategies in a class of differential games and control problems," SIAM Journal on Control, V. 5 (1967), pp. 1—24. L. D. Berkovitz, "A differential game without pure strategy solutions on an Open set," in M. Dresher, L. S. Shapley and A. W. Tucker, eds., Advances in Game Theory, Annals of Mathematics Study 52, Princeton University Press, Princeton, N. J., 1964, pp. 175-194. L. D. Berkovitz and W. H. Fleming, "On differential games with integral payoff," in M. Dresher, A. W.~ Tucker and P. Wolfe, eds., Contributions to the Theory of Games, III, Annals of Mathematics Study 39, Princeton University Press, Princeton, N. J., 1957. G. A. Bliss, Lectures on the Calculus of Variations, University of Chicago Press, Chicago, Ill., 1946. W. H. Fleming, "A note on differential games of pre- scribed duration," in M. Dresher, A. W. Tucker, and P. Wolfe, eds., Contributions to the Theory of Games, III, Annals Of Mathematics StudyI39, Princeton University Press, Princeton, N. J., 1957. W. H. Fleming, "The convergence problem for differential games," Journal of Mathematical Analysis and Applica- tions, V. 3 (1961), pp. 102-116. W. H. Fleming, "The convergence problem for differential games, II," in M. Dresher, L. S. Shapley and A. W. Tucker, eds., Advances in Game Theory, Annals of Mathe— matics Study 52, Princeton University Press, Princteon, N. J., 1964. 129 10. ll. 12. 13. l4. l5. l6. l7. l8. 19. 20. 21. 130 IA. Yu. Gadzhiev, "Application of game theory to certain jproblems in optimal control." Avtomatika i telemekhanika, 'V. 23 Nos. 8 and 9 (Journal translated as Automation and Remote Control.) ‘V. B. Gindes, Optimal processes for two controlled systems," Proc. 3rd Siberian Conference on Mathematics and Mechanics, Tomsk, USSR, 1964, pp. 260—261. V. B. Gindes, "Optimal control in linear systems under conflict," SIAM Journal on Control, V. 5 (1967), pp. 163—169. T. Guinn, "Boundary arcs for a class of differential games," to appear in Journal of Optimization Theory and Applications. J. C. Harsanyi, "A general solution for noncooperative games based on risk-dominance," in M. Dresher, L. S. Shapley and A. W. Tucker, eds., Advances in Game Theory, Annals of Mathematics Study 52, Princeton University Press, Princeton, N. J., 1964. M. R. Hestenes, "A general problem in the calculus of variations with applications to paths of least time," RAND Corporation Research Memorandum RM—lOO, February, 1949. M. R. Hestenes, Calculus of Variations and Optimal Control Theory, John Wiley and Sons, Inc., New York, 1966. Y. C. HO, "Optimal terminal maneuver and evasion strategy," SIAM Journal on Control, V. 4 (1966), pp. 421-428. Y. C. Ho, A. E. Bryson, Jr., and S. Baron, "Differential Games and Optimal Pursuit-Evasion Strategies," IEEE Transactions on Automatic Control, V. AC-lO (1966), pp. 385-389. R. Isaacs, Rand Reports RM-1391 (November, 1954), RM-1399 (November, 1954), RM—l4ll (December, 1954), RM-l486 (March, 1955). R. Isaacs, Differential Games, John Wiley and Sons, Inc., New York, 1965. D. L. Kelendzheridze, Theory of an Optimal pursuit strategy," Soviet Math. Doklady, V. 2 (1961), pp. 654— 6566 (translation of Doklady Akad. Nauk SSR, V. 138 (19 1)). 22. 23. 24. 25. 26. 27. 28. 29. 30. 31. 32. 33. 34. 131 Yu. A. Kochetkov, "Application Of Pontryagin‘s method to investigation of minimax automatic control problems," Tekhnicheskaya Kibernetika, NO. 5 (1965) (journal translated as Engineering Cybernetics). N. N. Krasovskii, "The pursuit problem in the case of identical players," PMM, V. 3 (1966), NO. 2. N. N. Krasovskii and V. E. Tret'yakov, "The pursuit problem in the case of constraints on the impulses of the control forces," Differentsial'nye uravneniya, V. 2 (1966), NO. 5. N. N. Krasovskii, Yu. M. Rupin and V. E. Tret‘yakov, "On certain game situations in the theory of controlled systems," Tekhnicheskaya kibernetika, NO. 4 (1964) (translated in Engineering Cybernetics V. 4 (1964), pp. l-ll). R. D. Luce and H. Raiffa, Games and Decisions, John Wiley and Sons, Inc., New York, 1957. P. A. Meschler, "On a goal-keeping differential game," IEEE Transactions on Automatic Control, V. AC-l2 (1967), pp. 15‘2lo J. Mycielski, "Continuous games of perfect information," in M. Dresher, L. S. Shapley and A. W. Tucker, eds., Advances in Game Theory, Annals of Mathematics Study 52, Princeton University Press, Princeton, N. J., 1964. J. F. Nash, "Non-cooperative games," Annals of Mathe- matics, V. 54 (1951), pp. 286-295. J. von Neumann, "Zur Theorie der Gesellschaftsspiele." Mathematische Annalean-lOO (1928), pp. 295-320. J. von Neumann and O. Morgenstern, Theory Of Games and Economic Behavior, Princeton University Press, Princeton, N. J., 1944. L. A. Petrosyan, "On a family of differential survival games in Rn," Dokl. Akad. Nauk SSSR, V. 161 (1965), NO. l. L. A. Petrosyan, "Differential survival games with several participants," Dokl. Akad. Nauk SSSR, V. 161 (1965), No. 2. L. A. Petrosyan, "Reduction of one pursuit game to solution of the Cauchy problem for a first-order partial differential equation," Dokl. Akad, Nauk Arm. SSR, V. 40 (1965), No. 4. 35. 36. 37. 38. 39. 40. 41. 42. 43. 44. 45. 46. 132 L. A. Petrosyan, One Class of Pursuit Games, Candidate's Dissertation, LGU, 1965. L. S. Pontryagin, V. G. Boltyanskii, R. V. Gamkrelidze and E. F. Mishchenko, The Mathematical Theory of Optimal Processes, Interscience, New York, 1962. L. S. Pontryagin, "On certain differential games," Dokl. Akad. Nauk SSR, V. 156 (1964), NO. 4, pp. 738- 741. L. S. Pontryagin, "On some differential games," SIAM Journal on Control, V. 3 (1965), pp. 49-52. R. K. Ragade and I. G. Sarma, "A game theoretical approach to optimal control in the presence of uncertainty," IEEE Transactions on Automatic Control V. AC—l2 (1967), pp. 395-401. C. Ryll—Nardzewski, "A theory of pursuit and evasion," in M. Dresher, L. S. Shapley and A. W. Tucker, eds., Advances in Game Theory, Annals of Mathematics Study 52éuPrinceton University Press, Princeton, N. J., 19 . I. G. Sarma and R. K. Ragade, "Some considerations in formulating optimal control problems as differential games," International Journal Of Control V. 4 (1966), pp. 265—279. H. Scarf, "On differential games with survival payoff," Contributions to the Theory of Games, III, Annals of Mathematics Study 39, Princeton UniverEity Press, Princeton, N. J., 1957. L. S. Shapley, "A value for N—person games," in H. W. Kuhn and A. W. Tucker, eds., Contributions tO the Theory of Games, II, Annals of Mathematics Study 28, Princeton University Press, Princeton, N. J., 1953. E. N. Simakova, "Differential Games," Avtomatika i Telemekhanika, V. 27 (1966), No. 11, pp. 161—178, (translated in Automation and Remote Control, V. 27, pp. 1980—1998). D. D. Sworder, Optimal Adaptive Control Systems, Academic Press, Inc., New York, 1966. P. P. Variaya, "On the existence of solutions to a differential game," SIAM Journal on Control, V. 5 (1967), pp. 153-162. 133 47. A. Zieba, "An example in pursuit theory," Studia Math. V. 22 (1962), pp. 1-60 48. A.Zi§ba, "Fundamental equations of the theory of pursuit," Trans. of the Second Prague Conference in Math. Statistics, 1959. 143 30 8 03 IIIIIIIIILIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIEs