Cooperative Games
Last updated
Last updated
Players divide into coalitions to perform tasks. Within a coalition, members can freely divide profits. The question is how should the profits be divided?
We are given a weighted graph where the players are nodes; and the value of a coalition is the value of the total edge weights in the subgraph.
So, an edge tells you how much value would generate if they work together. e.g. researchers collaborate to each other (and the total value is determined by the sum of their pairwise values).
(And it seems natural / intuitive to assume that if then each of and contribute to this synergistic ārelationshipā).
We are given a weighted directed graph. Players are edges. And the value of a coalition is the value of the maximum flow it can pass from to . This has applications in computer network, traffic flow, transport networks. (that is, the minimum bottleneck value on the path).
We are given a list of weights and a threshold, i.e., , all of which are positive integers. Each player has a weight .
The value of a coalition is 1 (winning) if its total weight is at least (winning), and 0 (losing) otherwise.
Applications: parliaments, UN security council, EU council of members.
Cost sharing is just the inverse of profit sharing. So itās actually the āsameā problem underneath the hood ā if you have an algorithm that can fairly divide profits, you can negate the values to convert them to costs (i.e., change maximization to minimization).
A set of players -
A valuation function -
That is, each subset of players have some value associated with it (and thatās the value of the group). Here, just denotes the power set of
is just the value of a coalition , and by default,
Coalition structure (CS) ā a partition of (into groups) ā each person is in exactly one coalition.
We aim to find the optimal coalition structure ā the structure that maximizes total generated value from the players. That is,
But we also need to satisfy some (reasonable if you think about it) constraints:
Efficiency: a vector satisfying
Basically this says that each person gets 1 number, which is their payoff, and the total value that the group generates must be completely distributed among the people.
Itās called efficiency because thereās no loss in value. Whatever value is generated is distribued without any loss. Nothing is kept for the mechanism or wasted. Itās a matter of economic efficiency ā no one can improve without anyone else being hurt ā and has nothing to do with computational efficiency.
Individual Rationality: for all
You need to give each person at least how much value they would generate by being alone ā if not, they will leave your group (ādeviateā), and do their own thing.
An imputation is just a vector that satisfies efficiency and individual rationality.
A game is called:
Monotone: for any ,
Adding anyone cannot decrease the value of the coalition.
Simple: if is monotone AND
i.e., the value of any coalition is binary, winning or losing, and adding anyone to a winning coalition cannot make the coalition lose (thereās no snitch :O)
Superadditive: for disjoint ,
collaboration is always beneficial to the overall value generated
Convex: for AND , we have
the marginal gain of player by adding to cannot be more than the marginal gain if player joins , and contains at least everyone in (and possibly more) ā because the opportunities to collaborate is more. (joining a bigger group is always better, given that one is a subset of the other)
Monotone Convex Superadditive.
An imputation is in the core if:
That is, each susbet of players is getting at least what it can make on its own.
Itās easier to think about what would happen if we did not satisfy the above criteria. Then, a subset of players are together being paid less than the value they bring to the table. So, they would deviate from the group (making it unstable) and divide the profits among themselves in a way that they at least get as much as they were getting before, with some people getting more now.
So, being in the core is a notion of stability: no subset of players have any incentive to deviate.
So, the core is a set of vectors that satisfies the linear constraints:
ā the total payoff given to everyone is equal to the total value they generate.
(by definition of being in the core)
Also, remember:
x is additive in the sense that the total payoff of a group is the sum of the individual payoffs given to each person (think of it as cash).
BUT is not additive ā otherwise there would be no point in collaborating and this would be a boring problem. They can generate more / less value as a group than individually (think of synergy, 1 + 1 > 2)
For a 3-player game, we can find the core by drawing a triangle, and then each point on the triangle maps to a payoff vector. e.g. the top vertex means that all the payoff goes to the first player. The midpoint of the bottom edge means that the payoff vector is . In general, each person gets paid in the inverse of the ratio of the distance from the point to the edge opposite that playerās vertex.
A natural question to ask then is: is the core empty?? That is, is there ANY possible payoff vector that ensures that no subset of players deviate?
Simple Game: A game is called simple if for all and it is monotone (adding players to a coalition does not decrease the value).
Coalitions with value 1 are winning. Coalitions with value 0 are losing.
A player is called a a veto player if she is a member of every winning coalition (canāt win without her).
š” Theorem: Let be a simple game then has veto players. In fact, the core has precisely the vectors that distribute the payoff ONLY among the veto players.
The proof is relatively simple: if we give any positive payoff to a non-veto player, then the remaining players can deviate together (and they can divide this positive payoff among themselves).
If we give positive payoff ONLY to veto players, then since any winning coaliion must contain ALL veto players, there is no benefiical deviation. BUT notice that this doesnāt mean that we have to give every veto player the same payoff, or even that we have to give every veto player a positive payoff(!!). So, ANY distribution of the payoff among only the veto players is in the core (no subset of players can deviate).
š” Theorem: For an induced subgraph game, the core is not empty the graph has no negative cut.
In particular, if the core is not empty, a possible payoff strategy is the shapley value which assigns each node half the value of the edges connected to it.
Letās prove the theorem.
() If there is no negative cut, then the core is not empty.
It suffices to find 1 payoff vector in the core.
Let be the payoff assinged to for each player
Then, sum of the weights of all edges . Hence, satisfies efficiency.
Now, for any subset , we have:
Notice that each playerās payoff ONLY depends on the edges they are connected to, NOT in which coalition they belong. So, each player will receive the same payoff regardless of which coalition he is part of. And the payoff of the group is just the sum of the payoffs of everyone in the group.
Then, we can divide the edges in the sum above to be āwithin ā and āacross ā ā because the edges within contribute to , and the ones across form the cut! (This is precisely why we have this seemingly surprising theorem - how does a cut relate to this!)
And since there are no negative cuts, the last expression shows that .
Efficiency is also obvious from this expression, because we can just set .
() If the core is not empty, the graph has no negative cut.
Instead, we prove (the contrapositive) that if the graph has a negative cut, then the core must be empty.
Suppose there is some negative cut, i.e., such that .
Take any payoff vector satisfying efficiency; then we have:
must be true by efficiency.
We have just shown in the previous part that , and since is additive, where .
Hence, ā which we will use later.
Now consider and . Our goal is to show that at least one of them must be negative. To do that, we take their sum and show that it is negative ā then we can conclude that at least one of the terms is negative and hence cannot be in the core (i.e., a subset is not paid for the amount of value they generate).
We have already shown that , and so the RHS evaluates to:
Hence, it is either the case that OR ; hence, cannot be in the core.