Committee Voting

Problem

Unlike auctions where everyone gets different things depending on what they want, in a voting setting, everyone gets the same thing (although they have different preferences). So, we’re trying to select a public outcome that is shared by everyone. e.g. everyone gets the same president.

In general, there are many different types of voting ballots (i.e., inputs):

  • ranking → each voter submits an ordering for the candidates

  • score → each voters submits a number for each candidate

  • approval → each voter submits a subset of approved candidates

    • basically, each voter submits 1 for the candidates they approve, and 0 for the rest.

    • advantage: it’s simple and less cognitive effort is required from the voters’ perspectives (they just have to think whether or not they approve the voter, no relative comparisons needed);

    • disadvantage: it does not allow refined or combinatorial preferences (e.g. i can’t say “i want to approve A and B, but only if they are chosen together” or “i like A more than B”)

And there can be many different kinds of outputs of voting:

  • single winner → e.g. president of a student council

  • multiwinner / committee e.g. student council committee, dishes to serve in a reunion party. in this case, all the selected members of the committee are “equal” so the decision is just whether or not to include or exclude from the committee

  • ranked committee → e.g. a committee consisting of president, vice-president and a secretary, where there’s some ordering in the positions of each member of the committee.

In this lecture, we’re only looking at approval voting in a multi-winner category

Approval Committee Voting Setting

  • Voters N={1,,n}N = \{1, \cdots, n\}

  • Candidates CC, where C=m|C| =m

  • Voter ii approves a set of candidates AiCA_i \subseteq C

  • We want to choose a committee WW of size kk where kmk \leq m is given (that is, we don’t get to decide the size of the committee — kk is an input to our mechanism, not something we can control)

  • Voter ii’s utility is: ui(W)=AiWu_i(W) = |A_i \cap W|, i.e., how many members of the selected committee did voter ii approve (which makes sense, because you feel happier if people you approve of are selected)

Note that the above definition of voter utility remains the same even when we consider any other metrics / scores to optimize.

Also, we can think of some natural metrics to evaluate how good a committee selection is:

  • (Utilitarian) Social Welfare → total number of approvals obtained by members of the committee (i.e., “excellence”). Notice that this is the same as the total utility of all the voters, because we can swap the order of the sums.

  • Coverage: number of voters who approve at least one committee member (”diversity”). This is “kind of” an egalitarian metric in that we’re trying to minimize the number of voters with zero utility, but we don’t distinguish between any non-zero values so it’s not really egalitarian.

Also, if a voter approves a candidate, we say that the candidate covers the voter (since picking this candidate is enough to make this voter have non-zero utility).

Approval Voting (AV) and Chamberlin-Courant (CC)

Then, we can select committees trying to maximize these two metrics:

  • Approval Voting (AV) → select a committee maximizing social welfare

  • Chamberlin-Courant (CC) → select a committee maximizing coverage

Of course, there need not be a unique committee in each of these cases.

Moreover, AV is a truthful mechanism (since you’re trying to boost the number of votes your preferred candidates have) whereas CC is not (e.g. it’s possible to get more utility by reporting a truncated set of approved candidates).

Beyond AV / CC

Consider this example:

  • n=301,m=5,k=3n = 301, m = 5, k = 3

  • First 200 voters approve {a,b,c}\{a, b, c \}

  • Next 100 voters approve {d}\{d\}

  • And the last voter approves {e}\{ e\}

Then, AV chooses {a,b,c}\{a, b, c\} and CC chooses {a,d,e}\{a, d, e\}.

But, neither feels “proportional” or fair. Because these committees don’t really reflect the “approval profile”.

A more proportional committee would be something like: {a,b,d}\{a, b, d \} → since 23\frac 2 3 of the voters agree on aa and bb, they should get 23\frac 2 3 of the seats on the committee, while only 13\frac 1 3 of the voters agree on dd, so they should get 13\frac 1 3 of the seats.

Intuitively, a sufficiently large group of voters that agree on sufficiently many candidates should be correspondingly / proportionately represented / satisfied in the committee.

Justified Representation

There are nn voters and kk committee slots, so a group of n/kn/k voters “deserves” one slot (of representation).

First attempt: For a group of voters SNS \subseteq N such that Snk|S| \geq \frac n k and iSAi1|\cap_{i \in S} A_i| \geq 1, we have (iSAi)W1|(\cap_{i \in S} A_i) \cap W| \geq 1. Basically, we’re saying that if we have a big enough group that together deserve at least one slot, and that they agree on at least one candidate, then at least one of the candidates they agree on should be selected in the committee.

This turns out to be too strong of a requirement, which cannot always be satisfied. For example:

  • n=m=4,k=2n=m=4, k = 2

  • A1={a,b},A2={b,c},A3={c,d},A4={a,d}A_1 = \{a, b \}, A_2 = \{b, c \}, A_3 = \{c, d \}, A_4 = \{a, d \}

  • Then, if we apply the rule above, then we would be forced to include all of a,b,c,da, b, c, d, but this exceeds the committee size.

To make our vocabulary / notation simpler, let’s call a group of voters SNS \subseteq N such that Snk|S| \geq \frac n k and iSAi1|\cap_{i \in S} A_i| \geq 1 to be a cohesive group. Basically a cohesive group makes up for at least one-slot-worth of representation, and agrees on at least one candidate (if they don't agree, they aren't "cohesive" and we can't do much to please them anyway).

Then, a justified representation (JR) is any committee that satisfies the following property: for any cohesive group of voters SNS \subseteq N, there exists iSi \in S such that AiW1|A_i \cap W| \geq 1.

Basically, JR says that in any cohesive group, at least one voter should have non-zero utility. So, no cohesive group should go unrepresented. But It kind of feels… too weak? Since we only require one voter to feel happy (in fact, just non-zero happiness), no matter how big the cohesive group is.

Also, it’s easy to see that AV may fail JR. Consider n=k=3,m=4n=k=3, m = 4 where the first 2 voters approve a,b,ca,b,c and the last voter approves dd, then AV picks a,b,ca,b,c but then the last voter is a cohesive group and has zero utility, hence does not satisfy JR).

Claim: CC always satisfies JR. Proof:

  • Suppose for contradiction that it does not.

  • Then, let SS be a cohesive group of voters that is unrepresented by the CC committee WW, and let xx can be a candidate approved by all voters in SS.

  • Consider the marginal contribution of each wWw \in W to the coverage. This is the number of voters who approve ww but no one else in WW.

  • Since teh coverage of WW is <n< n, the marginal contribution of some wWw^* \in W is less than nk\frac n k (otherwise, if everyone’s marginal contribution is at least nk\frac n k, then all voters should be covered).

  • So, we can remove ww^* and add xx to obtain a higher coverage, which contradicts the fact that WW is CC. So, any CC committee also satisfies JR.

Hence, JR can be satisfied for any n,k,mn, k, m and approval profile.

GreedyCC

Computing a CC committee is NP-hard (can be shown via a simple reduction from SET-COVER).

Fortunately, there is a greedy variant that runs in polynomial time (of course this does not guarantee maximal coverage, but it’s not bad).

The GreedyCC algorithm is quite simple: In each step, choose a candidate that covers as many uncovered voters as possible (i.e,. ignore the voters that are already happy and focus on the ones you want to cover). Repeat this until kk candidates have been chosen.

We claim that GreedyCC satisfies JR, and the proof follows exactly as above (i.e., if not, then there must’ve been a candidate who covers nk\frac n k voters but we didn’t pick him in favor of someone else who satisfies less than nk\frac n kvoters; this can’t happen because of the greedy nature of our algorithm).

Extended JR

But one can ask the same question as asked AV and CC to JR: is JR sufficient?

Consider this example: Suppose have 100 voters, and 20 candidates and k=10k=10. The first 50 voters like the first 10 candidates, and the next 50 voters like the next 10 candidates. Then, intuitively, we want to pick 5 out of 10 candidates from each group to be “proportional”. But even picking 9 from group 1 (i.e., that the first 50 voters approve) and 1 from group 2 satisfies JR.

So, our requirement of JR is too weak! So, we use Extended Justified Representation (EJR).

To explain what EJR means, let’s strengthen our notion of cohesive groups first.

For a positive integer tt, call a group of voters SNS \subseteq N such that Stnk|S| \geq t \cdot \frac n k and i]inSAit|\cap_{i ]in S} A_i | \geq t a tt-cohesive group. (Then, the previous definition of cohesive group is simply when t=1t=1.)

Also, note that a pp-cohesive group is always an rr-cohesive group for any rpr \leq p, i.e., increasing the “order” of the cohesive group just strengthens the requirement, which is clear from the definition.

Then, a committee is said to satisfy EJR when: for any positive integer tt and any tt-cohesive group of voters SNS \subseteq N, there exists iSi \in S such that AiWt|A_i \cap W| \geq t.

Notice that we still require only one of the voters in SS to have a utility of at least tt (others might get utility = 0 and that’s fine).

(Well, not really, we can show that under EJR, in any tt-cohesive group SS, the average utility of each person is more than t12\frac{t-1}{2}, so this definition is actually quite strong, even if it doesn’t seem like it.)

And this fixes the problem with JR.

Proportional Approving Voting (PAV)

Fix an infinite non-increasing sequence s1,s2,s_1, s_2, \cdots

Thiele methods: A family of methods that involve choosing a committee WW by maximizing the score: iN(s1+s2++sui(W)\sum_{i \in N} (s_1 + s_2 + \cdots + s_{u_i(W)}.

So, basically this sequence s1,s2,s_1, s_2, \cdots captures the utility function of the players. If a voter ii has pp of his approved candidates in the committee WW, then his utility is s1++sks_1 + \cdots + s_k. In particular, the marginal utility to voter ii of the kkth approved candidate being part of WW is sks_k.

(Note that this sequence is from the mechanism's point of view -- i.e., it is the mechanism's assumption of how the voter utility functions behave. BUT from the voter's point of view, the term "voter's utility" still just represents the number of approved candidates in the committee for any voter. To de-confuse, we can use "score" to represent a voter's sum s1++sks_1 + \cdots + s_k.)

This makes intuitive sense, because we want to capture the effect of diminishing marginal utility of the voters. Once you already have 5 approved candidates in WW, then the 6th one cannot give you more (marginal) utility than the first one did. This is why the sequence needs to be non-increasing.

Notice that each voter has the same sequence — this is done to ensure that we treat all voters symmetrically, i.e., we don’t favor any voter over any other.

Then, AV is just a special case of this with si=1s_i = 1for all ii. Even CC is just a special case of this with s1=1,si=0s_1 = 1, s_i = 0 for all i>1i > 1.

In Proportional Approval Voting (PAV), we set: si=1is_i = \frac 1 i for each ii.

(So the utility function for each player follows a kind of log\log graph, since x1xdx=lnx\int_x \frac 1 x dx = \ln x)

This means that if a voter approves rr candidates in the committee, the voter contributes 1+12++1r1 + \frac 1 2 + \cdots + \frac 1 r to the score of the committee.

The number 1++1r1 + \cdots + \frac 1 r is the rrth harmonic number, usually denoted by HrH_r.

But… why harmonic numbers? Why does this work?

Because they result in a roughly proportional committee. The intuition is that AV is precisely utilitarian, CC is kind of egalitarian (up to a certain point), and PAV is kind of like Nash.

And the reason that PAV is like Nash is because maximizing iNHui(W)\sum_{i \in N} H_{u_i(W)} is very similar to maximizing iNln(ui(W))\sum_{i \in N} \ln(u_i(W)) because Hrlnr+γH_r \approx \ln r + \gamma, where the approximation becomes better in the limit as rr \to \infty. Moreover, since ln\ln is a strictly monotonic increasing function, it’s equivalent to maximizing iNui(W)\prod_{i \in N} u_i(W), because iNlnui(W)=ln(iNui(W))\sum_{i \in N} \ln {u_i(W)} = \ln (\prod_{i \in N} u_i(W)).

And we can’t directly use Nash because it’ll face a similar problem as CC, since it’s really scared of anyone getting 0 utility — so, it’ll first focus on covering all voters, before thinking of anything else). PAV is okay with some voters having utility if enough other voters are compensated in return.

Also….

Claim: PAV satisfies EJR. Proof sketch:

  • Suppose for contradiction that ui(W)<tu_i(W) < t for all voters belonging to some tt-cohesive group SS.

  • There is a candidate xx which is approved by all members of SS but not included in WW.

  • By adding xx to WW, the PAV score increases by at least 1ttnk=nk\frac 1 t \cdot \frac{tn}{k} = \frac n k, since xx must add at least 1t\frac 1 t worth of utility to all the members of this cohesive group.

  • And now, we need to show that there is another candidate yWy \in W such that by removing yy from W{x}W \cup \{x\}, the PAV score decreases by less than nk\frac n k. (We skip how to show that such a yy exists).

  • And then, we would’ve picked xx instead of yy in WW to get a higher PAV score. So, the original WW could not have been PAV to start with. Contradiction.

But unfortunately, PAV is NP-hard to compute, and a greedy variant of PAV might not even satisfy JR.

So, we’re still trying to find a polynomial time algorithm that satisfies EJR… Luckily, we have it.

Method of Equal Shares (MES)

  • Each voter has a budget of $kn\frac k n (regardless of how many candidates there are)

  • Each candidate costs $1. The voters who approve this candidate have to “pool” their money to add this candidate to the committee.

Then, the algorithm runs as follows:

  • Start with an empty committee.

  • In each round, we want to add a candidate whose approved voters have a total budget of 1\geq 1 left.

  • If there are several such candidates, choose one such that teh maximum amount that any agent has to pay is minimized.

  • If no more candidates can be afforded but committee still has size <k< k, fill in the rest of the committee by using some tie-breaking criterion (e.g. by maximizing approval score).

Clearly, MES never chooses more than kk candidates (since the total budget is nkn=kn \cdot \frac k n = k and each candidate costs $1, so you can't buy more than kk candidates)

MES satisfies EJR and can be implemented in poly-time.

Proof that MES satisfies EJR:

  1. Suppose for contradiction that ui(W)<tu_i(W) < t for all voters ii belonging to some tt-cohesive group SS.

  2. When MES stops, some voter iSi \in S must have budget <ktn< \frac k {tn} left. Why?

    1. Otherwise, if everyone has at least ktn\frac{k}{tn} budget, then they can all pool together and buy a candidate that they approve of. (And the committee could not have already been full without this candidate, since it’s only full iff all voters pay all their money to buy candidates, because it’s only possible to buy kk candidates with kk dollars)

  3. So, ii has used budget >knktn=(t1)ktn> \frac k n - \frac{k}{tn} = \frac{(t-1)k}{tn}. Moreover, since we know that ii has approved (t1)\leq (t-1) candidates, so for some chosen committee member, ii must have paid more than 1t1(t1)ktn=ktn\frac{1}{t-1}\frac{(t-1)k}{tn} = \frac{k}{tn}.

  4. Consider the first committee member xx such that some voter in SS paid more than ktn\frac{k}{tn} for it.

  5. Before xx was added, each voter in SS has (t1)\leq (t-1) approved candidates and paid ktn\leq \frac{k}{tn} for each of them.

  6. Thus, each voter in SS has budget at least kn(t1)ktn=ktn\frac k n - (t-1)\cdot \frac{k}{tn} = \frac{k}{tn} remaining.

  7. Since Stnk|S| \geq \frac{tn}{k}, the voters in SS could afford a commonly approved candidate by paying ktn\leq \frac{k}{tn} each.

  8. So, no voter in SS should have paid more than ktn\frac{k}{tn} for xx, a contradiction!

Last updated