Nash Bargaining Solution
Last updated
In a Nash bargaining problem, players have to reach an agreement on their payoffs that must satisfy some constraints. If these constraints are not satisfied, then the players don’t get anything.
More generally, the constraint can be an inequality too, where the “valid” region is a convex, compact set.
Convex → The line segment drawn between any 2 points in the shape must lie entirely within the shape itself.
Compact → The shape (set of possible solutions) must be bounded and must include its boundary.
In the above diagram, $d = (d_1, d_2)$ is the “disagreement point”, i.e., the payoffs that the two agents get in the event that they disagree and can’t come to a consensus on any point in the feasible set $S$. That’s why any rule-based function that needs to pick a “fair” outcome has to know what the payoffs are in the event of a disagreement. (e.g. if $d_1$ is more than player 1’s payoff at any point in $S$, then player 1 has an incentive to ignore whatever player 2 says and just violate the constraint so that he can get $d_1$.)
An outcome $v = (v_1, v_2)$ Pareto dominates another outcome $v' = (v_1', v_2')$ if $v_1 \ge v_1'$ and $v_2 \ge v_2'$ and at least one of these two inequalities is strict. That is, at least one of the players is strictly better off without the other being worse off.
In this case, $v$ is said to be a Pareto improvement over $v'$.
An outcome is Pareto optimal if it is not Pareto dominated by any other outcome. That is, no one can improve without hurting someone else. (This is a notion of economic efficiency.)
The Pareto frontier is the set of all such Pareto optimal outcomes.
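The definitions above can be sketched in a few lines of Python. This is only an illustration over a small, made-up list of outcomes; the names `dominates` and `pareto_frontier` are hypothetical helpers, not part of any library.

```python
from typing import List, Tuple

Outcome = Tuple[float, float]  # (payoff of player 1, payoff of player 2)

def dominates(a: Outcome, b: Outcome) -> bool:
    """True if a Pareto dominates b: nobody is worse off, someone is strictly better off."""
    return a[0] >= b[0] and a[1] >= b[1] and (a[0] > b[0] or a[1] > b[1])

def pareto_frontier(outcomes: List[Outcome]) -> List[Outcome]:
    """Keep exactly the outcomes that no other outcome dominates."""
    return [a for a in outcomes if not any(dominates(b, a) for b in outcomes)]

# Made-up outcomes, purely for illustration.
outcomes = [(1, 4), (2, 3), (2, 2), (3, 1), (1, 1)]
print(pareto_frontier(outcomes))  # [(1, 4), (2, 3), (3, 1)]
```

Here (2, 2) and (1, 1) drop out because (2, 3) dominates both of them; the remaining three outcomes can only be traded off against each other.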
A bargaining solution is a rule $f(S, d) = (f_1(S, d), f_2(S, d))$ that maps a feasible set $S$ and a disagreement point $d = (d_1, d_2)$ to an outcome. Here,
$d$ is the disagreement point that is picked in the event that the mechanism cannot find any solution in $S$ (i.e., it is the default alternative to an agreement, similar to BATNA in negotiations). Generally, $d = (0, 0)$, so that if the players can’t agree on something, they both get nothing.
$f_1$ and $f_2$ are the components of the rule / function / mechanism that decides what point to pick given the constraint set $S$ and the outcome in the case of disagreement, $d$. e.g. $f$ can be the rule that tries to maximize the total welfare (utilitarian) or the product of the players’ utilities (Nash), etc.
In such a “bargaining” solution, we are looking for the following properties:
Efficiency: No outcome in the feasible set $S$ Pareto dominates our chosen outcome $f(S, d)$, where $d$ is the disagreement point. Here, we’re referring to “economic efficiency” in the sense that no one can be improved without someone else being worse off, not computational efficiency.
Symmetry: Let $S' = \{(s_2, s_1) : (s_1, s_2) \in S\}$ and $d' = (d_2, d_1)$; then $f(S', d') = (f_2(S, d), f_1(S, d))$. That is, we don’t favor one player over the other (e.g. there’s no special treatment given to the “first player” or the “second player”; the labels assigned to the players shouldn’t matter). Hence, if we swap the names of the players, the outcome should just be reversed.
Independence of Irrelevant Alternatives (IIA): Let $S' \subseteq S$ be such that $f(S, d) \in S'$. Then, $f(S', d) = f(S, d)$. If the solution over a larger set of outcomes $S$ is $f(S, d)$, and we then eliminate some of the outcomes but the smaller set $S'$ still contains $f(S, d)$, the solution should still be $f(S, d)$. Removing irrelevant options shouldn’t change your final answer. e.g. if you’re at an ice-cream shop, and you prefer chocolate > vanilla > strawberry, then even if you remove strawberry, you will still pick chocolate over vanilla. So, removing strawberry should not make you suddenly pick vanilla instead. Adding / removing an option should not change the pairwise ranking of the other options.
Invariance under Equivalent Representations (IER): for any $a_1, a_2 > 0$ and any $b_1, b_2$, let $S' = \{(a_1 s_1 + b_1, a_2 s_2 + b_2) : (s_1, s_2) \in S\}$ and $d' = (a_1 d_1 + b_1, a_2 d_2 + b_2)$. Then we have: $f_i(S', d') = a_i f_i(S, d) + b_i$ for $i = 1, 2$.
If you stretch / scale and shift the space, the solution should be stretched and shifted in exactly the same way. Note that each player’s payoff can be scaled and shifted independently of the other’s, but the scale factors must be positive. (And this is not trivial to satisfy: e.g. the utilitarian solution does not satisfy IER.)
Example of the utilitarian solution not satisfying IER: Suppose we try to maximize $x + y$ subject to $x^2 + y^2 = 1$. What is the maximum? Now, suppose we stretch the x-axis by a factor of 2, so we're now maximizing $x + y$ subject to $(x/2)^2 + y^2 = 1$ (in other words, we stretch the circle into a "horizontal oval"). What is the maximum?
Compare this with the Nash bargaining solution. Suppose we try to maximize $xy$ subject to $x^2 + y^2 = 1$. What is the maximum? Now, suppose we stretch the x-axis by a factor of 2, so we're now maximizing $xy$ subject to $(x/2)^2 + y^2 = 1$. What is the maximum?
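One way to check your answers to the two exercises is a rough numerical search. The sketch below assumes payoffs are nonnegative (so it only scans the first-quadrant arc of the curve) and uses a plain grid search rather than an exact method; `argmax_on_ellipse` and `ier_gap` are hypothetical helper names. IER holds for this stretch exactly when stretching the old optimum lands on the new optimum, i.e. when the gap is zero.

```python
import math

def argmax_on_ellipse(objective, a, steps=20_000):
    """Grid-search the arc of (x/a)^2 + y^2 = 1 with x, y >= 0 for the best point."""
    best_value, best_point = -math.inf, None
    for i in range(steps + 1):
        t = (math.pi / 2) * i / steps          # angle sweeping the first quadrant
        x, y = a * math.cos(t), math.sin(t)    # point on the (stretched) curve
        value = objective(x, y)
        if value > best_value:
            best_value, best_point = value, (x, y)
    return best_point

def ier_gap(objective, a=2.0):
    """Distance between the stretched old optimum and the optimum of the stretched set."""
    x0, y0 = argmax_on_ellipse(objective, 1.0)  # optimum on the unit circle
    x1, y1 = argmax_on_ellipse(objective, a)    # optimum after stretching x by a
    return math.hypot(a * x0 - x1, y0 - y1)

utilitarian_gap = ier_gap(lambda x, y: x + y)
nash_gap = ier_gap(lambda x, y: x * y)
print(f"utilitarian gap: {utilitarian_gap:.3f}")  # clearly nonzero
print(f"nash gap:        {nash_gap:.3f}")         # approximately zero
```

For the utilitarian objective, the optimum on the circle is about (0.707, 0.707), which stretches to (1.414, 0.707), but the optimum on the oval is about (1.789, 0.447). For the Nash product, the optimum on the oval is the stretched point itself.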
A pretty interesting read is: Arrow’s Impossibility Theorem.
(Read this paper to understand some of the key terms as used in literature.)
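In the Nash bargaining solution, we try to find the outcome $(v_1, v_2)$ in the feasible set $S$ that maximizes $(v_1 - d_1)(v_2 - d_2)$ s.t. $v_1 \ge d_1$ and $v_2 \ge d_2$, where $d = (d_1, d_2)$ is the disagreement point.
Generally, $d = (0, 0)$ (i.e., if players disagree and violate the constraint, then they both get nothing) so we’re trying to maximize the product $v_1 v_2$.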
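To see how the disagreement point shifts the outcome, here is a minimal sketch on a made-up problem: two players split one unit of value, so the Pareto frontier is $v_1 + v_2 = 1$. It searches a discretized frontier only (the maximizer of the product cannot be Pareto dominated, so nothing is lost by that); `nash_bargaining` is a hypothetical helper name.

```python
def nash_bargaining(frontier, d):
    """Pick the point maximizing (v1 - d1) * (v2 - d2) among points weakly above d."""
    candidates = [(v1, v2) for v1, v2 in frontier if v1 >= d[0] and v2 >= d[1]]
    if not candidates:
        return d  # no acceptable agreement: fall back to the disagreement point
    return max(candidates, key=lambda v: (v[0] - d[0]) * (v[1] - d[1]))

# Hypothetical feasible set: split one unit, sampled at 1001 points of the frontier.
n = 1000
frontier = [(i / n, 1 - i / n) for i in range(n + 1)]

even_split = nash_bargaining(frontier, d=(0.0, 0.0))
tilted_split = nash_bargaining(frontier, d=(0.4, 0.0))
print(even_split)    # (0.5, 0.5)
print(tilted_split)  # approximately (0.7, 0.3)
```

With $d = (0, 0)$ the split is even. Once player 1 can secure 0.4 by walking away, only the remaining surplus of 0.6 is split evenly, so player 1 ends up with about 0.7.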
Theorem: The Nash bargaining solution satisfies all 4 properties — efficiency, symmetry, IIA, and IER.
It’s not that hard to show that this is true.
But a much stronger claim is this:
Theorem: The Nash bargaining solution is the ONLY solution that satisfies efficiency, symmetry, IIA, and IER.
That is, these 4 properties characterize the Nash bargaining solution — if you want these 4 properties, you have no choice but to use this.
There are different notions of fairness and “optimality” too.
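Utilitarian: tries to maximize total social welfare, i.e., $\max_{(v_1, v_2) \in S} (v_1 + v_2)$, where $S$ is the feasible set
Nash: tries to maximize the product, i.e., $\max_{(v_1, v_2) \in S} v_1 v_2$
Egalitarian: tries to maximize the minimum utility (to reduce inequality), i.e., $\max_{(v_1, v_2) \in S} \min(v_1, v_2)$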
Intuitively, Nash lies in the middle of utilitarian (just maximize total welfare without caring about the distribution) and egalitarian (care more about the poor people).
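A small numerical comparison makes the "in the middle" intuition concrete. The feasible set here is an assumption chosen for illustration: a frontier $v_1 + 2 v_2 = 4$ with the disagreement point at $(0, 0)$, so player 1 converts resources into utility twice as efficiently as player 2.

```python
# Hypothetical frontier v1 + 2*v2 = 4, sampled on a grid of v1 in [0, 4].
n = 1200
frontier = [(4 * i / n, (4 - 4 * i / n) / 2) for i in range(n + 1)]

utilitarian = max(frontier, key=lambda v: v[0] + v[1])      # largest total
nash = max(frontier, key=lambda v: v[0] * v[1])             # largest product
egalitarian = max(frontier, key=lambda v: min(v[0], v[1]))  # largest minimum

print("utilitarian:", utilitarian)  # (4.0, 0.0)
print("nash:       ", nash)         # (2.0, 1.0)
print("egalitarian:", egalitarian)  # approximately (1.333, 1.333)
```

The utilitarian rule gives everything to the more "efficient" player, the egalitarian rule equalizes the two payoffs, and the Nash product lands between them.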
To find all the solutions for these different constraints, we can use the technique of Lagrange multipliers (watch the videos on Khan Academy for a great explanation).
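As a worked illustration of the technique, here is the Lagrange computation for one simple case: maximizing the product $xy$ on the unit circle $x^2 + y^2 = 1$, assuming both payoffs are positive. The multiplier is called $\lambda$ here; that symbol is a notational choice for this sketch.

```latex
\begin{aligned}
\mathcal{L}(x, y, \lambda) &= xy - \lambda\,(x^2 + y^2 - 1) \\
\frac{\partial \mathcal{L}}{\partial x} = 0 &\;\Rightarrow\; y = 2\lambda x \\
\frac{\partial \mathcal{L}}{\partial y} = 0 &\;\Rightarrow\; x = 2\lambda y \\
x = 2\lambda(2\lambda x) = 4\lambda^2 x &\;\Rightarrow\; \lambda = \tfrac{1}{2} \quad (\text{since } x, y > 0) \\
y = x,\quad x^2 + y^2 = 1 &\;\Rightarrow\; x = y = \tfrac{1}{\sqrt{2}},\quad xy = \tfrac{1}{2}
\end{aligned}
```

The endpoints of the first-quadrant arc, $(1, 0)$ and $(0, 1)$, both give $xy = 0$, so the interior critical point is indeed the maximum.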
Note: remember to check the boundary points separately! e.g. if you try to maximize a function over a closed interval by setting the derivative equal to 0, you’ll only get the critical points in the interior of the interval. BUT the true maximum and the true minimum can both be at the endpoints of the interval. This happens because the function might be increasing / decreasing when the interval cuts it (so its derivative at that point will not be zero, but it can still be the maximum / minimum in that range). Setting derivative = 0 just gives you the critical points (the candidates for local extrema).
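A quick sketch of that check, using an example chosen here for illustration (not necessarily the one the note had in mind): $f(x) = x^3$ on $[-1, 2]$. Its only critical point is $x = 0$, yet both extrema sit at the endpoints. `extrema_on_interval` is a hypothetical helper that simply compares the function at the endpoints and at the interior critical points.

```python
def extrema_on_interval(f, critical_points, lo, hi):
    """Return (argmax, argmin) of f on [lo, hi] given its critical points."""
    # Candidates: both endpoints plus any critical point strictly inside the interval.
    candidates = [lo, hi] + [c for c in critical_points if lo < c < hi]
    return max(candidates, key=f), min(candidates, key=f)

f = lambda x: x ** 3  # f'(x) = 3x^2 vanishes only at x = 0
x_max, x_min = extrema_on_interval(f, critical_points=[0.0], lo=-1.0, hi=2.0)
print(x_max, x_min)  # 2.0 -1.0
```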