# Yale Open Course: Game Theory Summary
[ -- June 27, 2020 -- ]
Yale University's ECON 159 (Game Theory). These are notes and a summary of the course; for reference, see the course textbooks (also listed at the end of Class01), e.g.
Joel Watson, Strategy: An Introduction to Game Theory (3rd Edition)
# Class01
Don't play a strictly dominated strategy.
Put yourself in other people's shoes and try to figure out what they will do.
Rational choices can lead to bad outcomes.
You can't get what you want, till you know what you want. (Books: Strategies and Games (Dutta), Strategy: An Introduction to Game Theory (Joel Watson), Thinking Strategically (Dixit & Nalebuff))
# Class02
Game ingredients: players (i, j); strategies (player i's strategy: si; the set of player i's possible strategies: Si; a particular play of the game, i.e. a strategy profile: s; the choices of everyone except i: s-i); payoffs (player i's payoff: Ui(s)).
Def1: Player i's strategy si' is strictly dominated by player i's strategy si if Ui(si, s-i) > Ui(si', s-i) for all s-i.
Def2: Player i's strategy si' is weakly dominated by her strategy si if Ui(si, s-i) >= Ui(si', s-i) for all s-i, with strict inequality for at least one s-i.
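The dominance checks above are mechanical; here is a minimal sketch (my own illustration with hypothetical payoffs, not a course example) of how they can be tested against one player's payoff table:

```python
def strictly_dominates(payoff, s_i, s_i_prime, opponent_profiles):
    """Def1: s_i strictly dominates s_i_prime if it is strictly better for every s_-i."""
    return all(payoff[(s_i, s)] > payoff[(s_i_prime, s)] for s in opponent_profiles)

def weakly_dominates(payoff, s_i, s_i_prime, opponent_profiles):
    """Def2: never worse for any s_-i, and strictly better for at least one s_-i."""
    never_worse = all(payoff[(s_i, s)] >= payoff[(s_i_prime, s)] for s in opponent_profiles)
    better_once = any(payoff[(s_i, s)] > payoff[(s_i_prime, s)] for s in opponent_profiles)
    return never_worse and better_once

# Hypothetical 2x2 payoffs for the row player, where "Up" dominates "Down".
payoff = {("Up", "Left"): 3, ("Up", "Right"): 2,
          ("Down", "Left"): 1, ("Down", "Right"): 0}
print(strictly_dominates(payoff, "Up", "Down", ["Left", "Right"]))   # True
```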
# Class03
Median Voter Theorem: candidates crowd toward the center (the position of the median voter). Caveats the basic model ignores:
voters are not evenly distributed in the real world
many voters do not vote (abstention)
candidates' positions may not be believed (they cannot commit to policy)
primaries
higher-dimensional policy spaces
Best responses
# Class04
Penalty kick game
Def1: Player i's strategy si^ is a BR (best response) to the strategy s-i of the other players if Ui(si^, s-i) >= Ui(si', s-i) for all si' in Si; equivalently, si^ solves max over si of Ui(si, s-i).
Def2: Player i's strategy si^ is a BR to the belief p about the other players' choices if EUi(si^, p) >= EUi(si', p) for all si' in Si; equivalently, si^ solves max over si of EUi(si, p).
Example: EUi(L, p) = p(l)·Ui(L, l) + p(r)·Ui(L, r)
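A minimal sketch (hypothetical payoffs and belief, not the lecture's penalty-kick numbers) of computing expected utilities under a belief p and picking the best response:

```python
def expected_utility(U, my_strategy, belief):
    """EUi(si, p) = sum over opponent strategies sj of p(sj) * Ui(si, sj)."""
    return sum(prob * U[(my_strategy, s_j)] for s_j, prob in belief.items())

def best_response(U, my_strategies, belief):
    """The pure strategy with the highest expected utility against belief p."""
    return max(my_strategies, key=lambda s: expected_utility(U, s, belief))

# Hypothetical payoffs Ui(own, opponent) and a belief p over the opponent's play.
U = {("L", "l"): 4, ("L", "r"): 0, ("R", "l"): 0, ("R", "r"): 2}
belief = {"l": 0.3, "r": 0.7}                 # p(l) = 0.3, p(r) = 0.7
print(best_response(U, ["L", "R"], belief))   # "R": EU = 1.4 beats EU(L) = 1.2
```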
Partnership game (Nash Equilibrium)
# Class05
NE motivation:
No regrets: no individual can do strictly better by deviating, holding everyone else's actions fixed.
Self-fulfilling beliefs: each player is playing a best response to what the others actually do.
Relation of NE to dominance: no strictly dominated strategy is ever played in a NE.
Coordination game: people can coordinate without a contract, because the persuader is not trying to get you to play a strictly dominated strategy. A NE can be a self-enforcing agreement.
# Class06
Strategic complements: the more the other person does, the more you want to do.
Battle of the Sexes: the players want to coordinate but disagree about where to coordinate.
Cournot Duopoly: total quantity Q: competitive > Cournot > monopoly; price P: competitive < Cournot < monopoly (see the check below).
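As a quick numerical check of that ordering, here is a minimal sketch assuming linear inverse demand P = a - bQ and a constant marginal cost c (the parameter values are my own, not from the lecture):

```python
a, b, c = 10.0, 1.0, 2.0                   # inverse demand P = a - b*Q, marginal cost c

q_monopoly    = (a - c) / (2 * b)          # monopolist's profit-maximizing total output
q_cournot     = 2 * (a - c) / (3 * b)      # Cournot NE: each firm produces (a-c)/(3b)
q_competitive = (a - c) / b                # perfect competition: price equals marginal cost

price = lambda Q: a - b * Q
for name, Q in [("monopoly", q_monopoly), ("Cournot", q_cournot),
                ("competitive", q_competitive)]:
    print(f"{name:12s} Q = {Q:5.2f}  P = {price(Q):5.2f}")
# Q: competitive (8.00) > Cournot (5.33) > monopoly (4.00); the price order is reversed.
```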
# Class07
Bertrand Competition: the outcome is like perfect competition even though there are only 2 firms. The same setting as Cournot, but with a different strategy set (prices instead of quantities), leads to a different outcome.
Differentiated Products.
Candidate-Voter Model.
# Class08
Candidate-Voter Model:
There can be lots of NE (not all of them put the candidates at the center).
Entry can lead to a more distant candidate winning.
If the candidates are too extreme, someone in the center will enter.
Guess and check is an effective method.
Location model:
Segregation does not imply that there's a preference for segregation.
Randomization can itself be a policy that people agree to follow.
Individual randomization is another NE (here, choosing to randomize can be treated as a pure strategy).
Some games have no NE in pure strategies.
# Class09
Mixed Strategies:
Def1: A mixed strategy Pi is a randomization over player i's pure strategies. Pi(si) is the probability that Pi assigns to the pure strategy si. (Pi(si) could be 0; Pi(si) could be 1.)
1-1) Payoff: The expected payoff of the mixed strategy Pi is the weighted average of the expected payoffs of each of the pure strategies in the mix.
1-2) Lesson1: If a mixed strategy is a BR, then each of the pure strategies in the mix must itself be a BR. In particular, each must yield the same expected payoff.
Def2: A mixed strategy profile (P1*, P2*, ..., Pn*) is a mixed-strategy NE if, for each Player i, Pi* is a BR to P-i*.
2-1) Lesson2: If Pi*(si) > 0, then si is also a BR to P-i*.
Tennis Game.
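A minimal sketch (illustrative payoffs, not the lecture's tennis numbers) of solving a 2x2 game for its mixed NE using the indifference condition in Lesson 1; it assumes a fully mixed equilibrium exists:

```python
# Payoff matrices: A[r][c] is Row's payoff, B[r][c] is Column's payoff.
A = [[2, 0],
     [0, 1]]
B = [[1, 0],
     [0, 2]]

# Column's mix q (on its first strategy) makes Row indifferent between her two rows:
q = (A[1][1] - A[0][1]) / (A[0][0] - A[0][1] - A[1][0] + A[1][1])
# Row's mix p (on her first strategy) makes Column indifferent between his two columns:
p = (B[1][1] - B[1][0]) / (B[0][0] - B[1][0] - B[0][1] + B[1][1])

print(f"Row plays her first strategy with p = {p:.3f}, Column with q = {q:.3f}")
# Note that p depends only on B and q only on A: each player's mix is pinned down
# by the need to keep the *other* player indifferent (the Class10 point below).
```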
# Class10
We only ever have to check for strictly profitable pure-strategy deviations.
Three different ways to think about randomization, in equilibrium or out of equilibrium:
it's genuinely randomization;
it could be something about people's beliefs;
it could be telling us something about the proportion of people who are doing something in society.
If I change the column player's payoffs, it changes the row player's equilibrium mix; if I change the row player's payoffs, it changes the column player's equilibrium mix. The reason is that, in equilibrium, each player's mix is chosen to keep the other player indifferent.
# Class11
Evolution and Game Theory:
- Nature can suck.
- If a strategy is strictly dominated, then it is not evolutionarily stable.
- If (S,S) is not a NE, then S is not evolutionarily stable; equivalently, if S is evolutionarily stable, then (S,S) is a Nash Equilibrium.
Maynard Smith (biology) Def: in a symmetric two-player game, the pure strategy S^ is ES (in pure strategies) if there exists an ε-bar > 0 such that (1-ε)U(S^,S^) + εU(S^,S') > (1-ε)U(S',S^) + εU(S',S') for all possible deviations S' and for all mutation sizes ε less than ε-bar.
Def2 (economics): a strategy S^ is ES (in pure strategies) if
(S^,S^) is a symmetric NE, i.e. U(S^,S^) ≥ U(S',S^) for all S', and
if U(S^,S^) = U(S',S^) then U(S^,S') > U(S',S')
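Def2 can be checked mechanically. A minimal sketch, using a hypothetical symmetric coordination game of my own rather than a lecture example:

```python
def is_ess(U, strategies, s_hat):
    """Check Def2 for a pure strategy s_hat; U[(a, b)] is a's payoff against b."""
    for s in strategies:
        if s == s_hat:
            continue
        if U[(s, s_hat)] > U[(s_hat, s_hat)]:
            return False                 # condition 1 fails: (s_hat, s_hat) not a NE
        if U[(s, s_hat)] == U[(s_hat, s_hat)] and U[(s_hat, s)] <= U[(s, s)]:
            return False                 # condition 2 fails against the tying mutant
    return True

# Hypothetical symmetric coordination game: both pure strategies are ES.
U = {("A", "A"): 2, ("A", "B"): 0, ("B", "A"): 0, ("B", "B"): 1}
print(is_ess(U, ["A", "B"], "A"), is_ess(U, ["A", "B"], "B"))   # True True
```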
# Class12
Evolution of social convention: we can have multiple evolutionarily stable conventions. These need not be equally good.
Def: In a two-player symmetric game, a strategy P^ is ES (in mixed strategies) if
(P^,P^) is a symmetric NE, and
if (P^,P^) is not a strict NE, i.e. U(P^,P^) = U(P',P^) for some P', then U(P^,P') > U(P',P')
Hawk-Dove: if V < C, then the ES mix plays Hawk with probability V/C
as V↑, more Hawks in the ESS; as C↑, more Doves in the ESS
payoff of the ESS mix: (1 - V/C)(V/2)
identification: we can tell what the ratio V/C is from looking at data.
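A minimal sketch (standard Hawk-Dove payoffs, with illustrative values of V and C) verifying that the V/C mix equalizes the Hawk and Dove payoffs at (1 - V/C)(V/2):

```python
V, C = 4.0, 10.0            # prize and cost of fighting, with V < C (illustrative values)
p = V / C                   # fraction of Hawks in the ES mix

U = {("H", "H"): (V - C) / 2, ("H", "D"): V,
     ("D", "H"): 0.0,         ("D", "D"): V / 2}

payoff_hawk = p * U[("H", "H")] + (1 - p) * U[("H", "D")]
payoff_dove = p * U[("D", "H")] + (1 - p) * U[("D", "D")]
print(round(payoff_hawk, 6), round(payoff_dove, 6), round((1 - V / C) * (V / 2), 6))
# 1.2 1.2 1.2: Hawk and Dove earn the same against the mix, equal to (1 - V/C)(V/2)
```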
# Class13
Sequential-move game: Player II knows Player I's choice before she chooses, and Player I knows that this will be the case. Backward induction: "look forward and work back."
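A minimal sketch of backward induction on a toy two-stage tree (my own example, not one from the lecture): at each node the mover picks the branch that maximizes her own payoff, given the already-solved subtrees.

```python
# A toy extensive-form game: Player 1 moves first, Player 2 observes and responds.
# A node is either a payoff tuple (leaf) or (player, {action: child_node}).
tree = ("P1", {
    "Up":   ("P2", {"left": (2, 1), "right": (0, 0)}),
    "Down": ("P2", {"left": (1, 2), "right": (3, 0)}),
})

def backward_induction(node):
    """Return (payoffs, path of (player, action) choices) under backward induction."""
    if not isinstance(node[1], dict):          # leaf: a (payoff_P1, payoff_P2) tuple
        return node, []
    player, children = node
    idx = 0 if player == "P1" else 1           # which payoff the mover cares about
    best = None                                # (payoffs, action, path)
    for action, child in children.items():
        payoffs, path = backward_induction(child)
        if best is None or payoffs[idx] > best[0][idx]:
            best = (payoffs, action, path)
    payoffs, action, path = best
    return payoffs, [(player, action)] + path

print(backward_induction(tree))   # ((2, 1), [('P1', 'Up'), ('P2', 'left')])
```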
Moral hazard: we kept the size of the project (the size of the loan) small to reduce the temptation to cheat.
Incentive design: a smaller share of a larger pie can be bigger than a large share of a small pie. Contract forms of this kind are called piece rates and sharecropping.
Commitment strategy:
Example: collateral works because it lowers your payoff if you do not repay, yet it leaves you better off, because it changes the choices of others in a way that helps you.
Commitment means having fewer options, and it changes the behavior of others. It's crucial that the other side knows about the commitment.
# Class14
Commitment: sunk costs can help.
Spy: lesson one is that whether a game is simultaneous or sequential is not really about timing per se; it's about information: who knows what, and who knows who is going to know what. Lesson two is that having more information can hurt you. The key is that the other players knew you had, or were going to have, more information, and that can lead them to take actions that hurt you.
First-mover advantage: there are games with first-mover advantages, but there are also games with second-mover advantages. For example, in the game of Nim it depends on whether the two piles have the same number of stones: if the piles are equal you want to be the second player; if they are unequal you want to be the first player (see the sketch below).
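A minimal sketch of the Nim claim, assuming the convention that taking the last stone wins; it finds by exhaustive search which player can force a win from a two-pile position:

```python
from functools import lru_cache

@lru_cache(maxsize=None)
def first_mover_wins(a, b):
    """True if the player to move can force a win from piles (a, b).
    A move removes one or more stones from a single pile; taking the last stone wins."""
    if a == 0 and b == 0:
        return False                     # no move available: the mover has already lost
    for take in range(1, a + 1):
        if not first_mover_wins(a - take, b):
            return True
    for take in range(1, b + 1):
        if not first_mover_wins(a, b - take):
            return True
    return False

print(first_mover_wins(3, 3))   # False: equal piles, the second mover wins
print(first_mover_wins(5, 3))   # True:  unequal piles, the first mover wins
```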
# Class15
- Zermelo Theorem:
[Conditions]
1) 2 players
2) perfect information
3) a finite number of nodes
4) 3 outcomes: a win for player 1 (W1), a loss for player 1 (L1), or a tie (T)
[Result]
- Either 1 can force a win,
- or 1 can force a tie,
- or 2 can force a loss on 1.
Proof idea (induction on the length of the game): suppose the claim is true for all games of this type of length ≤ N; then it is true for games of length N+1.
Def: A game of perfect information is one in which, at each node in the game, the player whose turn it is to move at that node knows which node she is at (which means, implicitly, she must know how she got there).
Def: A pure strategy for Player 1 in a game of perfect information is a complete plan of action: it specifies which action Player 1 will take at each of Player 1's decision nodes.
# Class16
Chain Store Paradox: if there is a 1% chance that the monopolist is crazy, then he can deter entry by fighting, i.e., by seeming crazy. The idea, once again, is that you might want to behave as if you're someone else in order to deter other people's actions.
Duel:
Pi[d] is Player i's probability of hitting if i shoots at distance d.
[FACT A]: Assuming no one has thrown yet, if Player i knows at distance d that j will not shoot tomorrow at distance d-1, then i should not shoot today.
[FACT B]: Assuming no one has thrown yet, if Player i knows at distance d that j will shoot tomorrow at distance d-1, then i should shoot today if i's probability of hitting at d satisfies Pi[d] ≥ 1 - Pj[d-1], i.e., is at least j's probability of missing tomorrow.
=> d* is the first distance (as the players approach) at which Pi[d*] + Pj[d*-1] ≥ 1.
[Claim]: The first shot should occur at d*.
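A minimal sketch of the claim: the hit probabilities and the convention for who moves at each distance are my own assumptions, not the lecture's numbers; the code scans from far to near for the first distance where Pi[d] + Pj[d-1] ≥ 1 holds.

```python
# Hit probabilities increase as the players get closer (d = distance in steps).
P = {1: {d: max(0.0, 1.0 - 0.10 * d) for d in range(0, 11)},   # player 1's P1[d]
     2: {d: max(0.0, 1.0 - 0.12 * d) for d in range(0, 11)}}   # player 2's P2[d]

def mover(d):
    return 1 if d % 2 == 0 else 2        # assumed convention: player 1 moves at even d

def first_shot_distance(P, start_d):
    """Scan from far to near; return the first d where Pi[d] + Pj[d-1] >= 1."""
    for d in range(start_d, 0, -1):
        i = mover(d)
        j = 3 - i
        if P[i][d] + P[j][d - 1] >= 1:
            return d
    return 1                             # point-blank: shooting is certain to hit

print(first_shot_distance(P, 10))        # 5 with these illustrative numbers
```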
# Class17
- Ultimatums:
Two Players 1 and 2,
Player 1 makes a "take it or leave it" offer to Player 2;
Player 2 can accept the offer, giving payoffs (S, 1-S), or reject it, giving payoffs (0, 0).
- Bargaining:
[Stage1] Player 1 makes an offer to Player 2; if Player 2 accepts, payoffs are (S, 1-S);
[Stage2] if Player 2 rejects, Player 2 makes an offer to Player 1; because payoffs are discounted by δ per period, an accepted stage-2 split (S, 1-S) is worth (δS, δ(1-S)) today
...
If Player 1's offer gives Player 2 at least δ times what Player 2 would get by rejecting, Player 2 accepts;
if it gives less than that, Player 2 rejects.
=> S = (1-δ^n)/(1+δ) (for an even number of rounds n; for odd n the sign on δ^n flips)
1-S = (δ+δ^n)/(1+δ)
So
when n→∞, S = 1/(1+δ) and 1-S = δ/(1+δ);
and as δ→1 (no discounting), S→1/2 and 1-S→1/2.
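A minimal sketch (my own backward-induction loop, with an illustrative δ) reproducing these shares: the last round is an ultimatum, and each earlier proposer offers the responder exactly δ times the share the responder would get as proposer next round.

```python
def first_proposer_share(n, delta):
    """Share the current proposer keeps in an n-round alternating-offer game."""
    share = 1.0                          # final round is an ultimatum: proposer takes all
    for _ in range(n - 1):
        # One round earlier, the proposer offers the responder delta * (the share the
        # responder would get as proposer next round) and keeps the rest.
        share = 1.0 - delta * share
    return share

delta = 0.9
for n in (1, 2, 10, 100):
    print(n, round(first_proposer_share(n, delta), 4))
print("limit 1/(1+delta) =", round(1 / (1 + delta), 4))   # 0.5263; -> 0.5 as delta -> 1
```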
- Conclusion:
Alternating offer bargaining
(1) Even split:
• potentially can bargain forever
• δ→1 no discounting or rapid offers
• same discount factor δ1=δ2
(2) first offer is accepted:
• no haggling
• the value of the pie and the value of time are assumed to be known
# Class18
DefN: An information set of Player i is a collection of Player i's nodes among which i cannot distinguish.
DefN: Perfect Information is a setting where all information sets in the tree contain just one node. Imperfect information is not perfect information.
DefN: A pure strategy of Player i is a complete plan of action: it specifies what Player i will do at each of i's information sets.
DefN: A sub-game is a part of a game that looks like a game within the tree.
[conditions]:
1) It starts from a single node.
2) It comprises all successors to that node.
3) It does not break up any information sets.
- DefN: A NE (S1*, S2*, ..., SN*) is a sub-game perfect equilibrium (SPE) if it induces a Nash equilibrium in every sub-game of the game.
# Class19
Don't Screw Up: The only sub-game perfect equilibrium is the backward induction prediction.
Matchmaker Game: in the sub-game, there is a mixed-strategy Nash Equilibrium.
Strategic Investment: when you're analyzing a game like Cournot, first look at what would happen if you did invest, and solve out the new Nash Equilibrium in that sub-game. Then roll the value of that sub-game back into the strategic investment decision of whether to invest or not. You need to take into account strategic effects: how the other player's behavior changes.
# Class20
- Wars of attrition:
Two players; each period, each chooses whether to fight or to quit. The game ends as soon as someone quits. If the other player quits first, the winner gets a prize V. In each period in which both fight, each player pays a cost C. If both quit at once, they get 0.
Two-period version: the two pure-strategy Nash Equilibria are (Fight, Quit) and (Quit, Fight). The mixed-strategy equilibrium has both players mixing, each fighting with probability V/(V+C).
Continuation payoffs: if they mix in the game in the future, then the continuation value is (0, 0).
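A minimal sketch (illustrative V and C) checking the indifference behind that mix: if the opponent fights with probability V/(V+C), fighting and quitting give the same expected payoff, so mixing is a best response.

```python
V, C = 4.0, 4.0                      # prize and per-period cost of fighting (illustrative)
p_fight = V / (V + C)                # opponent's probability of fighting

payoff_quit  = 0.0
payoff_fight = p_fight * (-C) + (1 - p_fight) * V   # continuation value is 0 if both fight
print(payoff_fight, payoff_quit)     # 0.0 0.0: indifferent, so mixing is a best response
```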
# Class21
Repeated Interaction: in ongoing relationships, the promise of future rewards and the threat of future punishments may sometimes provide incentives for good behavior today. The lesson: for this to work, it helps to have a future.
Cooperation is sustainable when: the temptation to cheat or defect today ≤ δ × [the value of the reward - the value of the punishment tomorrow].
If a stage game has more than one Nash Equilibrium, then we may be able to use the prospect of playing different equilibria tomorrow to provide incentives (rewards and punishments) for cooperation today. But there may be a problem of renegotiation.
Grim Trigger Strategy: play Cooperation if no one has ever played Defect, and play Defect otherwise (a worked check of the sustainability condition follows).
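A minimal sketch of that check, with illustrative prisoner's dilemma payoffs (not necessarily the lecture's numbers):

```python
# Cheating today gains the one-shot temptation T - R; the cost is switching the future
# from mutual cooperation (R every period) to mutual defection (P every period), which
# is worth delta * (R - P) / (1 - delta) in today's terms.
R, T, P = 2.0, 3.0, 0.0      # (C,C) payoff, temptation payoff, (D,D) payoff

def grim_trigger_sustains(delta):
    """Cooperation is sustainable under grim trigger iff this inequality holds."""
    return (T - R) <= delta * (R - P) / (1 - delta)

critical_delta = (T - R) / (T - P)   # solving the condition with equality
print(critical_delta)                 # 1/3 with these payoffs
print(grim_trigger_sustains(0.5), grim_trigger_sustains(0.2))   # True False
```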
# Class22
We can get Cooperation in the Prisoner's Dilemma using the Grim trigger as a sub-game perfect equilibrium. For an ongoing relationship to provide incentives for good behavior today, it helps for there to be a high probability that the relationship will continue, which is the weight you put on the future. The more weight I put on the future, the easier it is for the future to give me incentives to behave well today, the easier it is for those to overcome the temptations to cheat today.
One-Period Punishment: play Cooperation to start, and then play Cooperation if either (Cooperation, Cooperation) or (Defect, Defect) was played last; play Defect otherwise, i.e., if either (Cooperation, Defect) or (Defect, Cooperation) was played last. The trade-off is that a shorter punishment needs more weight δ on the future.
Repeated Moral Hazard: Even a small probability of the relationship continuing drastically reduces the wage premium. To get good behavior in these continuing relationships, there has to be some reward tomorrow. That reward needs to be higher, if the weight you put on tomorrow, if the probability of continuing tomorrow, is lower. The less likely tomorrow is to occur the bigger that reward has to be tomorrow.
# Class23
Asymmetric Information and Signaling: the lack of a signal can be informative; silence can speak volumes. It mattered that the information was verifiable.
Not verifiable: costly signaling. The leading example of a signal is education, due to Michael Spence, who won the Nobel Prize in large part for this model. What are the costs? The pain of the work: mental effort, pain and suffering.
Separating Equilibrium: the types manage to separate and get identified.
A good signal needs to be differentially costly across types.
If you lower the standards, you get qualification inflation.
This is a rather pessimistic model of education because there is no learning in the model, so education here is socially wasteful. Moreover, in this model education increases inequality and actually hurts the poor.
# Class24
Common value: the value of the good is the same for all bidders.
Private value: the value of the good differs across bidders, and my value is irrelevant to you.
Auction:
First-price Sealed Bid Auction
Second-price Sealed Bid Auction (Vickrey Auction)
Ascending Open Auction
Descending Open Auction (Dutch Auction)
Auction 1 (first-price sealed bid) is really the same as Auction 4 (Dutch); Auction 2 (second-price sealed bid) is not the same as Auction 3 (ascending open), but they are very closely related.
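As a small illustration of the second-price format (my own example; that bidding your true value is weakly dominant there is the standard textbook result), here is a payoff comparison for one bidder:

```python
def second_price_payoff(my_bid, my_value, other_bids):
    """Payoff in a second-price sealed-bid auction (ties here count as losing)."""
    if my_bid > max(other_bids):
        return my_value - max(other_bids)    # the winner pays the second-highest bid
    return 0.0

my_value = 10.0
other_bids = [4.0, 8.0, 12.0]
for bid in (6.0, 10.0, 14.0):                # underbid, truthful, overbid
    print(bid, second_price_payoff(bid, my_value, other_bids))
# Truthful (10): loses to 12, payoff 0, and 12 exceeds my value, so losing is fine.
# Overbid (14): wins but pays 12 > 10, payoff -2. Underbidding never gains anything.
```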