Nash Equilibria
A game can be represented by:
Players
Actions β players can do something to affect the world
Outcomes β playersβ actions lead to outcomes
Preferences over outcomes β each player has a preferences / ranking of the outcomes (in terms of utility)
This provides a general, abstract framework for strategic interaction.
Pure Nash Equilibrium
Given everyone elseβs actions , the best response set of player is:
That is, a player βs best response can be any of the actions that give him the maximum possible utility, given everyone elseβs actions.
An action profile is a (pure) Nash equilibrium if:
That is, every player has picked their best response given everyone elseβs actions. You can also think of it as: each player picks the action as if they were the last one to do so, and they knew everyone elseβs actions.
Every player thinks βIβm doing the best I can, given everyone elseβs actions.β
Mixed Nash Equilibrium
Instead of choosing a single action, one can play a random mix of them (to be unpredictable, otherwise if someone knows you well): is a probability distribution (from the set of all possible probability distributions) over player βs actions.
(Note: In general, we use to represent the set of all possible distributions over X)
A (not necessarily pure) strategy profile can be represented as:
Then, a playerβs utility (for a given mixed strategy) is given by:
In particular, note that the playerβs ONLY care about the expected value of their strategy, NOT the variance / risk that their strategy runs. e.g. they would be indifferent to a strategy that guarantees them $50 vs. flipping a coin for $100 β both have the same expected value, and being βrisk-neutralβ, they wouldnβt care which one they did. That is, they are neutral / indifferent to risk β they donβt care if they take risk or donβt (they donβt avoid it - by taking the gauranteed money - nor do they seek it - by taking the coin flip; theyβre just indifferent).
Then, is a (not necessarily pure) nash equilibrium is when: for all players , and all possible strategy (possibly containing mixed actions) of each player , β their current strategy is at least as good as any other.
In these notes, we use βactionβ to refer to a pure strategy (only one possible action, played with probability = 1), and the general term βstrategyβ to refer to possibly mixed actions (i.e., playing multiple actions, each with some probability)
Nashβs Theorem: Unlike a pure Nash Equilibrium, a (not necessarily pure) Nash Equilibrium ALWAYS EXISTS.
The way to compute nash equilibria in a 2x2 games (2 players, each have 2 actions) is by considering 2 possible cases:
Case 1 β Compute all NE (if any) in which at least one player plays a pure strategy (do this exhaustively, e.g. βrow player plays action X while col player mixes between A and B with probability q and (1-q), then find the value(s) of q for which the best response is X β then youβve found a NE)
Case 2 β Compute all NE (if any) in which both players mix between both strategies
Nashβs theorem says that the union of the set of NEβs found in case 1 and case 2 is non-empty β it doesnβt say whether the NE is pure / not.
The key idea behind case 2 is that the only time that a player would mix between 2 strategies is when (given the opponentβs strategy), he is indifferent to his 2 actions. That is, no matter what action he picks, his utility is the same given the opponentβs strategy. This allows us to equate the expected utilities of the two actions for player 1, to find the βmixing probabilitiesβ for player 2 π² (and vice versa).
Dominant Strategies
We say that a strategy (weakly) dominates if:
In words: no matter what everyone else does, picking is at least as good as picking .
Strict domination is when for every β no matter what everyone else does, strategy is STRICTLY better than (and so, you should never even consider picking !!).
Theorem: If an action is STRICTLY dominated by some strategy , then action is NEVER PLAYED with any positive probability in ANY Nash equilibrium.
Intuitively, if you would move all the probability out of the lamer / weaker actions into the ones that dominate it, and you strictly increase your utility. Hence, picking (with any non-zero probability) can never be a βbestβ response π€―
Iterated Removal of Strictly Dominated Strategies
If youβre trying to find Nash Equilibria, then you can remove action profiles that contain a strictly dominated strategy (from either playerβs side).
Note:
This ONLY works if the action is strictly dominated by another strategy (can be a strategy involving multiple actions β using mixed probabilities too)
If youβre left with a single action / strategy profile at the end, then by Nash Theorem, it must be a NE.
The reason this works is simply because those cells which contain a strictly dominated strategy (from either playerβs side) are never going to happen, i.e., they can never be an equilibrium (because at least one player would change their strategy to the one which strictly dominates it)
When considrering whether a strategy (may be mixed) strictly dominates an action for the row player, we donβt have to consider all possible strategies of the col player β we can just look at the individual actions and ensure that strictly dominates for every action. Then, since any strategy is a linear combination of actions, if strictly beats for every action of the col player, then strictly beats for every strategy involving those actions too. (Moreover, we only have to lok at the actions of the col players that are not already strictly dominated by other strategies β since col player would not play the weaker actions otherwise).
Last updated