ML Interview Q Series: Election Flip Probability: Hypergeometric Analysis of Randomly Removed Illegal Votes.

May 08, 2025


Suppose there is a close election between two candidates A and B, with final counts 1422 votes for A and 1405 votes for B. However, it is discovered that 101 votes are illegal and must be removed. Assuming the illegal votes are equally likely to be among either candidate’s votes (i.e., no bias in how the illegal votes were cast), what is the probability that removing these 101 votes changes the election result (making A no longer the winner)?

Short Compact Solution

We can model the situation by imagining an urn with 1422 white balls (representing votes for A) and 1405 black balls (representing votes for B). We randomly remove 101 balls (the illegal votes). The probability that the outcome changes is the probability that, after removal, the white balls are not strictly more numerous than the black balls.

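This event occurs precisely when the number of removed white balls is at least 59. Hence, the probability is:

$$P(\text{outcome changes}) = \sum_{m=59}^{101} \frac{\binom{1422}{m}\,\binom{1405}{101-m}}{\binom{2827}{101}}$$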

Numerical evaluation of this sum yields approximately 0.05917.

Comprehensive Explanation

Interpretation via Urn Model

An intuitive way to frame the problem is to treat each vote for candidate A as a white ball and each vote for candidate B as a black ball. We have a total of 1422 white balls and 1405 black balls, for a combined total of 1422 + 1405 = 2827 balls. From these 2827 votes, a subset of 101 votes (the illegal ones) will be removed at random.

Let m be the number of white balls in the removed set of 101. Then:

The number of white balls remaining in the urn is 1422 - m.
The number of black balls remaining in the urn is 1405 - (101 - m), because (101 - m) of the 101 removed balls are black.

We want the probability that candidate A’s final total is no longer strictly greater than candidate B’s final total, i.e., 1422 - m <= 1405 - (101 - m). Rearranging yields m >= 59. Therefore, the only way for the outcome to change is if at least 59 of A’s votes end up in the set of 101 removed votes.

The Combinatorial Sum

To compute this probability, we sum over all possible values m from 59 to 101. For each m in that range:

C(1422, m) is the number of ways to choose m of A’s votes from his 1422.
C(1405, 101 - m) is the number of ways to choose (101 - m) of B’s votes from her 1405.
C(2827, 101) is the number of ways to choose any 101 votes from the total 2827 votes.

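Since every set of 101 votes is equally likely to be the illegal set, the probability is the ratio of favorable outcomes to total outcomes. The final sum is:

$$P(m \ge 59) = \sum_{m=59}^{101} \frac{\binom{1422}{m}\,\binom{1405}{101-m}}{\binom{2827}{101}}$$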

When evaluated, this sum yields approximately 0.05917, meaning there is roughly a 5.917% chance that removing the illegal votes reverses or ties the outcome of the election.

Why m >= 59?

The margin of candidate A over candidate B is 17 votes (1422 - 1405 = 17). To eliminate A’s lead, candidate B must either tie or surpass A once 101 votes are removed. If m white votes are removed and (101 - m) black votes are removed, then A’s lead shrinks by m - (101 - m) = 2m - 101. That shift must be at least 17 to wipe out the original 17-vote lead, with exactly 17 producing a tie. Solving 2m - 101 >= 17 gives m >= 59. At m = 59 both candidates are left with 1363 votes.
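The threshold can also be confirmed by brute force. The sketch below simply tries every possible m and reports the smallest one for which A no longer holds a strict lead; the names `a_still_wins` and `threshold` are illustrative and not part of the original solution.

```python
A_VOTES, B_VOTES, REMOVED = 1422, 1405, 101

def a_still_wins(m):
    """True if A keeps a strict lead when m of the removed votes were A's."""
    a_left = A_VOTES - m
    b_left = B_VOTES - (REMOVED - m)
    return a_left > b_left

# Smallest m for which A is no longer strictly ahead.
threshold = min(m for m in range(REMOVED + 1) if not a_still_wins(m))
print(threshold)  # 59

# At the threshold the race is exactly tied.
print(A_VOTES - threshold, B_VOTES - (REMOVED - threshold))  # 1363 1363
```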

Connection to the Hypergeometric Distribution

This situation is a classic example of a hypergeometric scenario: from a finite population of 2827 items (1422 “successes” and 1405 “failures”), we draw 101 items without replacement. We are interested in the probability that the number of drawn successes (m) meets or exceeds a threshold (59). The hypergeometric probability mass function is used for these kinds of discrete, no-replacement draws.
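To make the hypergeometric framing concrete, here is a minimal sketch of the probability mass function and its upper tail using only the standard library. The helper names `hypergeom_pmf` and `hypergeom_tail` are assumptions made for illustration; exact rational arithmetic via `fractions.Fraction` is used so that the probabilities can be checked to sum to exactly 1.

```python
from fractions import Fraction
from math import comb

def hypergeom_pmf(k, population, successes, draws):
    """P(exactly k successes) when drawing `draws` items without replacement."""
    if k < 0 or k > draws:
        return Fraction(0)
    # math.comb returns 0 when k exceeds n, so impossible splits contribute 0.
    return Fraction(
        comb(successes, k) * comb(population - successes, draws - k),
        comb(population, draws),
    )

def hypergeom_tail(k_min, population, successes, draws):
    """P(at least k_min successes)."""
    return sum(
        (hypergeom_pmf(k, population, successes, draws)
         for k in range(k_min, draws + 1)),
        Fraction(0),
    )

N, K, n = 2827, 1422, 101   # total votes, A's votes, illegal votes removed
total = hypergeom_tail(0, N, K, n)
flip = hypergeom_tail(59, N, K, n)
print(total == 1)    # the PMF sums to exactly 1
print(float(flip))   # probability that A loses the strict lead
```

Working with fractions is slower than floating point, but for a population of this size it is still fast and removes any doubt about rounding.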

Implementation Example

In Python, the sum can be computed directly. Python integers are arbitrary precision, so the large binomial coefficients do not overflow (math.comb requires Python 3.8 or later). For instance:

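import math

# Favorable outcomes: m of A's 1422 votes and (101 - m) of B's 1405 votes
# are removed, for every m from 59 to 101 inclusive.
favorable = 0
for m in range(59, 102):
    favorable += math.comb(1422, m) * math.comb(1405, 101 - m)

# All equally likely ways to pick the 101 illegal votes out of 2827.
total_ways = math.comb(2827, 101)

# The integers above are exact; rounding happens only in this final division.
probability = favorable / total_ways
print(probability)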

This code simply sums up the favorable ways for m = 59 to 101 and then divides by the total possible ways of removing 101 votes from 2827. It should produce a result near 0.05917.

Follow-up question: Connection to Large Sample Approximations

When the number of votes is extremely large, can we approximate this probability using a normal approximation?

Yes. The hypergeometric distribution can be approximated by a binomial distribution when the sample is a small fraction of the population, and by a normal distribution when the sample is large enough for the count to be roughly bell-shaped. With N total votes, K of them for A, and n removed, the number m of A’s votes removed has mean n·K/N and variance n·(K/N)·(1 - K/N)·(N - n)/(N - 1); a normal approximation uses these two moments, ideally with a continuity correction. However, with only 2827 total votes here, the exact hypergeometric sum is still easy to compute directly.
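As a rough illustration, the sketch below applies a normal approximation to this problem's numbers. It assumes the standard hypergeometric mean and variance (including the finite population correction) and a continuity correction of 0.5; the function name `normal_tail_approx` is made up for this example.

```python
import math

def normal_tail_approx(k_min, population, successes, draws):
    """Approximate P(X >= k_min) for a hypergeometric X with a normal curve."""
    p = successes / population
    mean = draws * p
    # Finite population correction: draws are made without replacement.
    variance = draws * p * (1 - p) * (population - draws) / (population - 1)
    # Continuity correction: P(X >= k_min) is treated as P(Y >= k_min - 0.5).
    z = (k_min - 0.5 - mean) / math.sqrt(variance)
    # Upper tail of the standard normal, written with the complementary error function.
    return 0.5 * math.erfc(z / math.sqrt(2))

approx = normal_tail_approx(59, 2827, 1422, 101)
print(approx)
```

For these inputs the approximation should land close to the exact value of about 0.05917, which makes it a useful sanity check even when the exact sum is available.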

Follow-up question: Edge Cases and Practical Considerations

What if the margin is 1 instead of 17?

If A is ahead by a single vote, the removed set must contain at least one more of A’s votes than of B’s to change the result. With 101 removals, the condition 2m - 101 >= 1 gives m >= 51. In general, for a margin of d votes, the threshold is m >= (101 + d) / 2, rounded up to the next integer. The same hypergeometric approach remains valid; only the threshold changes.
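The dependence on the margin can be sketched as a small function. The names `flip_threshold` and `flip_probability` are illustrative, the code assumes A is the candidate ahead (a non-negative margin), and the 1406 versus 1405 vote counts are a hypothetical one-vote race rather than figures from the original problem.

```python
from math import comb

def flip_threshold(margin, removed):
    """Smallest number of A's votes that must be removed to erase A's lead."""
    # A's lead after removal is margin - (2m - removed), which is <= 0
    # exactly when 2m >= removed + margin. Integer ceiling of that half-sum:
    return (removed + margin + 1) // 2

def flip_probability(a_votes, b_votes, removed):
    """Exact probability that A loses the strict lead (assumes a_votes >= b_votes)."""
    threshold = flip_threshold(a_votes - b_votes, removed)
    favorable = sum(
        comb(a_votes, m) * comb(b_votes, removed - m)
        for m in range(threshold, removed + 1)
    )
    return favorable / comb(a_votes + b_votes, removed)

print(flip_threshold(17, 101))            # 59
print(flip_threshold(1, 101))             # 51
print(flip_probability(1422, 1405, 101))  # the original 17-vote race
print(flip_probability(1406, 1405, 101))  # hypothetical 1-vote race
```

With a one-vote margin the threshold sits almost exactly at the expected number of A's votes removed, so the flip probability comes out near one half.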

What if the 101 illegal votes are suspected to be biased?

Our assumption of no bias is critical. If, for instance, there were a reason to suspect the illegal votes were predominantly in favor of one candidate, the probability distribution for m (the number of A’s votes removed) would shift. We could model that with different sampling probabilities or, if partial knowledge is available, use Bayesian updates to incorporate prior beliefs about which candidate is more likely to have illegal votes.
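One simple way to explore bias is sketched below. It is a deliberate simplification, not the model used in the solution above: each illegal vote is assumed to have been cast for A independently with probability q, so the number of A's votes removed follows a Binomial(101, q) distribution and the finite pool of ballots is ignored. The function name `flip_probability_biased` and the values of q are hypothetical.

```python
from math import comb

def flip_probability_biased(q, removed=101, threshold=59):
    """P(at least `threshold` removed votes were A's) under Binomial(removed, q)."""
    return sum(
        comb(removed, m) * q**m * (1 - q) ** (removed - m)
        for m in range(threshold, removed + 1)
    )

# q is the assumed chance that any given illegal vote was cast for A.
for q in (0.45, 0.50, 0.55, 0.60):
    print(q, round(flip_probability_biased(q), 4))
```

The flip probability rises steeply with q, which shows how sensitive the conclusion is to the no-bias assumption.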

Follow-up question: Practical Implementation Challenges

If the combinatorial terms become numerically huge, how can we handle overflow?

In Python this is not a problem: integers are arbitrary precision, so math.comb(n, k) returns the exact value however large it is. In languages with fixed-width integers, or when working purely in floating point, overflow can be an issue. One workaround is to compute log binomial coefficients, combine the terms in log space, and exponentiate only at the end. Alternatively, since only a ratio of combinatorial terms is needed, the ratio can be simplified term by term to keep intermediate values small. SciPy's scipy.stats.hypergeom also provides pmf and sf (upper-tail) methods for this distribution.
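The log-space workaround can be sketched as follows, assuming log-gamma is used to obtain log binomial coefficients and a log-sum-exp step is used to add the terms. The helper names `log_comb` and `flip_probability_logspace` are illustrative.

```python
import math

def log_comb(n, k):
    """Natural log of C(n, k), computed with log-gamma to avoid huge integers."""
    return math.lgamma(n + 1) - math.lgamma(k + 1) - math.lgamma(n - k + 1)

def flip_probability_logspace(a_votes=1422, b_votes=1405, removed=101, threshold=59):
    log_total = log_comb(a_votes + b_votes, removed)
    # Log of each term's probability; every value is a modest negative number.
    log_terms = [
        log_comb(a_votes, m) + log_comb(b_votes, removed - m) - log_total
        for m in range(threshold, removed + 1)
    ]
    # Log-sum-exp: factor out the largest term so the exponentials stay in range.
    peak = max(log_terms)
    return math.exp(peak) * sum(math.exp(t - peak) for t in log_terms)

print(flip_probability_logspace())
```

This version never forms a number larger than a few thousand in magnitude, so it ports directly to languages without big integers, at the cost of a small floating-point error.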

Below are additional follow-up questions

What if some votes are known to be definitely valid for Candidate A or Candidate B?

In real-world scenarios, certain clusters of votes may be known with near certainty to be valid or invalid, or to have definitely gone to a particular candidate. For instance, imagine a small number of mail-in ballots that have already been confirmed to be legal and for Candidate A. How would that affect the probability calculation?