gametheory《经济学理论》 Lecture 15: Sequential equilibrium.pdf_大学文库

There is little reason why this concept should not apply to general extensive games as well. However, observe that, in games with observed actions, fixing a strategy profile s is sufficient to generate well-defined beliefs in every subgame—i.e. well-defined beliefs conditional upon reaching any history. This is not the case in general extensive games (or in Bayesian extensive games with observed actions, for that matter). In some (very special) games, this does not really matter. For instance, consider Fig. 1. L 2,2 1 q M R ❅ ❅ ❅ ❅ q ❅ r ❆ ❆ ❆ ❆ ❆ 0,0 l ✁ ✁ ✁ ✁ ✁ 3,1 q 1,1 ❆ ❆ ❆ ❆ ❆ 0,2 ✁ ✁ ✁ ✁ ✁ 2 I2 Figure 1: A special game (OR 219.1) Observe that (L, r) is a Nash equilibrium of this game; note also that, since Player 1 chooses L in this equilibrium, it is not clear what Player 2 should believe if the information set I2 = {(M),(R)} is reached! Thus, the equilibrium strategy profile is insufficient to pin down players’ conditional beliefs. Yet, in this particular game, this is essentially irrelevant: regardless of his beliefs about Player 1’s actual choice, r is conditionally strictly dominated at I2. Whatever Player 2’s beliefs, r cannot thus be sequentially rational, so (L, r) is “not a legitimate solution” to the extensive game in Figure 1. On the other hand, (M, l) is. In this simple game, Player 2’s beliefs about the strategy (action) actually chosen by Player 1 is irrelevant as far as his choices are concerned. However, this is seldom the case: consider Fig. 2, which differs from Fig. 1 only in that the payoffs at the terminal history {M, r} have been modified. Consider (L, r) again. Upon being reached, Player 2 must conclude that Player 1 did not choose L; hence, he must adopt new, distinct from the equilibrium profile. Suppose that he thinks that Player 1 actually chose M: then r is indeed a sequential best reply! Suppose instead that he thinks that Player 1 chose R: in this case r is not sequentially rational. 2

M R 0,20, l.1 Figure 2: A typical case(OR 220.1 Formal definitions The simple games in the previous section emphasize that off-equilibrium beliefs matter: they determine whether or not a given profile is an equilibrium. Moreover, the putative equilibrium profile does not yield a complete specification of beliefs In particular, at any information set, a(behavioral) strategy profile B yields a complete specification of continuation strategies, but may fail to provide information about past play- precisely because, at information sets off the path of play generated by 6, it must necessarily be the case that at least one player j made choices other than those prescribed by B,! The original profile is clearly silent about such alternative choices Note that this problem does not arise in games with observed actions, precisely because past play is observable(so there is no uncertainty about previous moves ). However, it does arise in Bayesian games with observed actions; see the next section for details Kreps and Wilson propose the following definition Definition 1 Fix a general extensive game T. a belief system is a map u △(H\z) such that, for every i∈ N and Ii∈1,p(L1)()=1 It is most helpful to view a belief system as a summary representation of past play leading to information sets. That is, informally speaking, one could imagine that, if the play reaches an information set Ii (where Player i has to move) which is not consistent with the originally expected behavioral strategy profile B, players conclude that a different profile B is actually being played, and base their inferences and forecasts on the latter However, based on simple consistency considerations, B and B should agree at least as far as moves at information sets following I are concerned. But, if this is the case, then all that is required in addition to B in order to verify the sequential rationality of Bi at Ii is the distribution over histories in Ii induced by By-which is precisely the type of information enco oded in u(n)

L 2,2 1 q M R ❅ ❅ ❅ ❅ q ❅ r ❆ ❆ ❆ ❆ ❆ 0,2 l ✁ ✁ ✁ ✁ ✁ 3,1 q 1,1 ❆ ❆ ❆ ❆ ❆ 0,2 ✁ ✁ ✁ ✁ ✁ 2 I2 Figure 2: A typical case (OR 220.1) Formal definitions The simple games in the previous section emphasize that off-equilibrium beliefs matter : they determine whether or not a given profile is an equilibrium. Moreover, the putative equilibrium profile does not yield a complete specification of beliefs. In particular, at any information set, a (behavioral) strategy profile β yields a complete specification of continuation strategies, but may fail to provide information about past play— precisely because, at information sets off the path of play generated by β, it must necessarily be the case that at least one player j made choices other than those prescribed by βj ! The original profile is clearly silent about such alternative choices. Note that this problem does not arise in games with observed actions, precisely because past play is observable (so there is no uncertainty about previous moves). However, it does arise in Bayesian games with observed actions; see the next section for details. Kreps and Wilson propose the following definition. Definition 1 Fix a general extensive game Γ. A belief system is a map µ : S i∈N Ii → ∆(H \ Z) such that, for every i ∈ N and Ii ∈ Ii , µ(Ii)(Ii) = 1. It is most helpful to view a belief system as a summary representation of past play leading to information sets. That is, informally speaking, one could imagine that, if the play reaches an information set Ii (where Player i has to move) which is not consistent with the originally expected behavioral strategy profile β, players conclude that a different profile β 0 is actually being played, and base their inferences and forecasts on the latter. However, based on simple consistency considerations, β 0 and β should agree at least as far as moves at information sets following I are concerned. But, if this is the case, then all that is required in addition to β in order to verify the sequential rationality of βi at Ii is the distribution over histories in Ii induced by β 0—which is precisely the type of information encoded in µ(I). 3

From now on, the basic unit of our analysis will be a pair(B, u) consisting of a behavioral strategy profile and a belief system; such a pair is called an assessment Focus on games with perfect recall. In keeping with conventional usage, we define se- quentially rational assessments. First, define the conditional outcome function O(B, ur)to be the distribution over Z induced by the assessment(6, u) conditional upon starting the game at I: formally, for every z E Z, if no h E I is a subhistory of z, then O(6, uD)(a)=0; otherwise, by perfect recall, there can be only one such subhistory h(if there were two, then one would have to be a subhistory of the other, and an information set cannot contain a pair of nested histories). Thus, z=(h, a,...,a), and we let O,|D(2)=H(D)(h)(a.a(h,n2…,a)(a4+) (recall that, with some abuse of notation, for h E Ii Ii, Bi (h)=B,() Definition 2 Fix a general extensive game r. An assessment(6, u)for r is sequentially rational iff, for every player i E N, every information set Ii E Ii, and every behaviora strategy B∈B ∑U(2)O(.川41)≥∑U(2)O(1,B2D z∈Z It should be obvious that some relationship between belief systems and behavioral strate- gies in an assessment(B, u)should be maintained In particular, at least at information sets consistent with B, u should be derived from B by means of Bayes'rule tle But we may reasonably require more than this. For instance, consider a modification of the game in Figure 2 in which Player 1 chooses between L and C, and if 1 chooses C, a third player 1'chooses between M and R; then Player 2 has to choose observing only Player 1's initial move. That is, Player 1s choice among L, M and R is split between two players, 1 and 1, choosing consecutively Consider a behavioral strategy profile in which Player 1 chooses L and Player 1'chooses M. Then, even after a deviation by Player 1, we would probably still want to require that u((C, M), ( C, R))((C, M))=1; that is, a deviation by Player 1 should not affect Player 2s beliefs about the choices of Player 1. However, strictly speaking, Bayes'rule does not apply to this information set, because it lies off the path induced by the behavioral strategy profile under consideration Note incidentally that if 1 and 1' were the same player, this argument would be less convincing; one would have to invoke some kind of trembles or mistakes, whereas the argu ment in the case where 1 and 1' are distinct is essentially based on independence. As will be clear, however, the standard definition of sequential equilibrium does not discriminate between these two cases

From now on, the basic unit of our analysis will be a pair (β, µ) consisting of a behavioral strategy profile and a belief system; such a pair is called an assessment. Focus on games with perfect recall. In keeping with conventional usage, we define sequentially rational assessments. First, define the conditional outcome function O(β, µ|I) to be the distribution over Z induced by the assessment (β, µ) conditional upon starting the game at I: formally, for every z ∈ Z, if no h ∈ I is a subhistory of z, then O(β, µ|I)(z) = 0; otherwise, by perfect recall, there can be only one such subhistory h (if there were two, then one would have to be a subhistory of the other, and an information set cannot contain a pair of nested histories). Thus, z = (h, a1 , . . . , aK), and we let O(β, µ|I)(z) = µ(I)(h) K Y−1 `=1 βP(h,a1,...,a`) (h, a1 , . . . , a` )(a `+1) (recall that, with some abuse of notation, for h ∈ Ii ∈ Ii , βi(h) = βi(Ii)). Definition 2 Fix a general extensive game Γ. An assessment (β, µ) for Γ is sequentially rational iff, for every player i ∈ N, every information set Ii ∈ Ii , and every behavioral strategy β 0 i ∈ Bi , X z∈Z Ui(z)O(β, µ|Ii) ≥ X z∈Z Ui(z)O((β 0 i , β−i)µ|Ii) It should be obvious that some relationship between belief systems and behavioral strategies in an assessment (β, µ) should be maintained. In particular, at least at information sets consistent with β, µ should be derived from β by means of Bayes’ rule. But we may reasonably require more than this. For instance, consider a modification of the game in Figure 2 in which Player 1 chooses between L and C, and if 1 chooses C, a third player 1 0 chooses between M and R; then Player 2 has to choose observing only Player 1’s initial move. That is, Player 1’s choice among L, M and R is split between two players, 1 and 10 , choosing consecutively. Consider a behavioral strategy profile in which Player 1 chooses L and Player 10 chooses M. Then, even after a deviation by Player 1, we would probably still want to require that µ({(C, M),(C, R)})((C, M)) = 1; that is, a deviation by Player 1 should not affect Player 2’s beliefs about the choices of Player 1 0 . However, strictly speaking, Bayes’ rule does not apply to this information set, because it lies off the path induced by the behavioral strategy profile under consideration. [Note incidentally that if 1 and 10 were the same player, this argument would be less convincing; one would have to invoke some kind of trembles or mistakes, whereas the argument in the case where 1 and 1 0 are distinct is essentially based on independence. As will be clear, however, the standard definition of sequential equilibrium does not discriminate between these two cases.] 4

Kreps and Wilson suggest the following definition Definition 3 Fix a finite general extensive game T. An assessment(B, u)is consistent iff there exists a sequence of assessments(6m, u") converging to(B, p) and such that(i) Bi(Ii(a)>0 for every iE N, I Ti and a E A(D); and(ii)un is derived from Bn via Bayes It is easy to see that this definition captures the intended restrictions. Note that(ii) makes sense because, since every action in the game receives positive probability, every information set is also reached with positive probability, so Bayes'rule always applies Myerson suggests the following argument. Almost all elements of the set B of behavioral strategy profiles satisfy Condition (i) in Definition 3: that is, Condition(i) is generically satisfied. Also, Condition(ii) is very reasonable, so we definitely want all assessments(B, u) such that B satisfies(i) to be deemed consistent if and only if u is derived from B by Bayes' rule. Finally, it seems reasonable(and convenient) to assume that the set of consistent assessments is closed; hence, it makes sense to define it to be precisely the closure of the set of assessments satisfying()and(ii This may not appear to be a tremendously compelling argument; however, let me add that, for certain classes of games, consistency is equivalent to much simpler conditions on beliefs(when the latter are represented in an appropriate format ) Also, consistency can be characterized for general games in a manner that does not involve limits(although the characterization is not tremendously appealing) We are finally ready to define sequential equilibrium Definition 4 Fix a finite general extensive game T. An assessment (B, u) is a sequential equilibrium of r iff it is consistent and sequentially rational Perfect Bayesian equilibrium Some people find the definition of sequential equilibrium unwieldy(this refers in particular to the consistency requirement ). Also, consistency is only defined for finite games, so sequential equilibrium does not really apply to relatively simple games such as Bayesian extensive games with observed actions where type sets are infinite In such games, just like in a Bayesian game with simultaneous moves, one must specify a strategy for each payoff-type of each player. That is, we must specify type-contingent strategies. Hence, upon reaching a history h, players may be able to make inferences about their opponents' types based on the (equilibrium) strategy profile. However, at histories off the anticipated path of play, it is clear that the strategy profile does not convey any information about opponents' types

Kreps and Wilson suggest the following definition. Definition 3 Fix a finite general extensive game Γ. An assessment (β, µ) is consistent iff there exists a sequence of assessments (β n , µn ) converging to (β, µ) and such that (i) β n i (Ii)(a) > 0 for every i ∈ N, I ∈ Ii and a ∈ A(I); and (ii) µ n is derived from β n via Bayes’ rule. It is easy to see that this definition captures the intended restrictions. Note that (ii) makes sense because, since every action in the game receives positive probability, every information set is also reached with positive probability, so Bayes’ rule always applies. Myerson suggests the following argument. Almost all elements of the set B of behavioral strategy profiles satisfy Condition (i) in Definition 3: that is, Condition (i) is generically satisfied. Also, Condition (ii) is very reasonable, so we definitely want all assessments (β, µ) such that β satisfies (i) to be deemed consistent if and only if µ is derived from β by Bayes’ rule. Finally, it seems reasonable (and convenient) to assume that the set of consistent assessments is closed; hence, it makes sense to define it to be precisely the closure of the set of assessments satisfying (i) and (ii). This may not appear to be a tremendously compelling argument; however, let me add that, for certain classes of games, consistency is equivalent to much simpler conditions on beliefs (when the latter are represented in an appropriate format). Also, consistency can be characterized for general games in a manner that does not involve limits (although the characterization is not tremendously appealing). We are finally ready to define sequential equilibrium. Definition 4 Fix a finite general extensive game Γ. An assessment (β, µ) is a sequential equilibrium of Γ iff it is consistent and sequentially rational. Perfect Bayesian equilibrium Some people find the definition of sequential equilibrium unwieldy (this refers in particular to the consistency requirement). Also, consistency is only defined for finite games, so sequential equilibrium does not really apply to relatively simple games such as Bayesian extensive games with observed actions where type sets are infinite. In such games, just like in a Bayesian game with simultaneous moves, one must specify a strategy for each payoff-type of each player. That is, we must specify type-contingent strategies. Hence, upon reaching a history h, players may be able to make inferences about their opponents’ types based on the (equilibrium) strategy profile. However, at histories off the anticipated path of play, it is clear that the strategy profile does not convey any information about opponents’ types. 5