ML Interview Q Series: How would you maximize your chances of selecting the highest-valued piece from 100 unique, sequential artworks?
Comprehensive Explanation
This is a classic optimal stopping problem, often called the "Secretary Problem." The objective is to pick the most valuable art piece out of 100 distinct items arriving in random order, under the constraint that you must decide on each piece immediately and cannot return to any piece you have passed.
Intuition
An intuitively strong strategy is to first observe a certain number of items without selecting any of them, carefully noting the best (highest-priced) piece among this initial segment. Then, as soon as you encounter a piece that surpasses the best of that initial group, you pick it. This threshold-based strategy has been mathematically shown to be optimal in terms of maximizing the probability of selecting the absolute best item.
Optimal Threshold (Approximation)
When the number of items n is large, the optimal fraction of items to skip at the start is roughly 1/e (where e is approximately 2.71828). For n = 100, that means skipping around 36 or 37 pieces. More precisely, the number of items to skip is approximately n/e.
Here, n is the total number of pieces, and e is the base of the natural logarithm.
After skipping and observing this initial segment (to get a sense of the “quality” range), you select the first item that exceeds all the previously observed pieces. Following this plan yields a success probability close to 1/e (about 37%) of identifying the best piece.
Why This Works
By ignoring the first n/e items, you obtain a benchmark for the values without risking early picks on suboptimal pieces. If you skip too few, your benchmark comes from a small reference window, so you are likely to lock onto an early, mediocre piece while better items appear later. If you skip too many, the best piece is more likely to fall inside the skipped segment, so no later item ever beats the benchmark and you end up forced to take the final piece.
Probabilistic Outcome
By applying the skip-then-select rule, the success probability (the probability of picking the single most valuable piece) asymptotically approaches 1/e as n increases. For n=100, it is slightly higher than 1/e, but in practical terms, the difference is not huge.
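To make that claim concrete, the success probability of skipping the first r of n items can be computed exactly: under the standard analysis it equals (r/n) times the sum of 1/(i-1) for i from r+1 to n. The short script below (a sketch using only the standard library) locates the best r for n = 100.

def exact_success_probability(n, r):
    # P(r) = (r/n) * sum_{i=r+1}^{n} 1/(i-1); skipping nothing gives P(0) = 1/n
    if r == 0:
        return 1.0 / n
    return (r / n) * sum(1.0 / (i - 1) for i in range(r + 1, n + 1))

n = 100
best_r = max(range(n), key=lambda r: exact_success_probability(n, r))
print(best_r, round(exact_success_probability(n, best_r), 5))
# Expect r near n/e (37 for n = 100) and a success probability of about 0.371,
# slightly above 1/e.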
Example Simulation in Python
Below is a straightforward simulation demonstrating how one could empirically test this strategy for n=100. We randomly generate 100 unique values, randomly permute them, then use the threshold approach to see how often we end up picking the maximum.
import math
import random

def simulate_optimal_stopping(n=100, trials=100000):
    skip_count = int(round(n / math.e))  # approximate optimal threshold (~37 for n=100)
    success = 0
    for _ in range(trials):
        # Generate unique values 1..n and shuffle them into a random arrival order
        values = list(range(1, n + 1))
        random.shuffle(values)
        # Benchmark: the best value in the skipped (observation-only) segment
        reference = max(values[:skip_count])
        # After skipping, pick the first item that exceeds the benchmark
        chosen = None
        for v in values[skip_count:]:
            if v > reference:
                chosen = v
                break
        # If nothing beats the benchmark, you are forced to take the final item
        if chosen is None:
            chosen = values[-1]
        # Count a success if the chosen item is the true maximum
        if chosen == max(values):
            success += 1
    return success / trials

prob = simulate_optimal_stopping()
print(f"Probability of selecting the highest-valued piece: {prob:.4f}")
This code helps validate that the strategy consistently yields a success probability around the expected 1/e value.
Potential Pitfalls
• Over-Skipping: If you skip more than approximately n/e items, the best piece is more likely to fall inside the skipped segment, so you may never encounter an item better than your reference before the sequence ends.
• Under-Skipping: If you skip too few items, your reference might not be a strong benchmark, causing you to lock onto a piece too early.
• Small n: For small sample sizes, the 1/e rule is only approximately optimal. Exact analytic or dynamic programming solutions can be more precise when n is small, but the skip-then-select method is still robust.
• Non-Uniform Orderings or Value Distributions: If the arrival order is not a uniform random permutation, or the item values have known structure you can exploit, adjustments to the strategy might be necessary.
Follow-Up Questions
If the number of artworks is not known in advance, can we still apply a similar strategy?
One approach when n is unknown is to reason in terms of time rather than item counts: adopt a "look-then-leap" rule in which you observe items for a certain duration (or fraction of the total time window) to set a benchmark, then pick the next item surpassing that benchmark. However, the exact mathematical derivation becomes more complicated, since you lack a precise value of n. A rough illustration of this time-based variant is sketched below.
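The following sketch simulates one such rule under illustrative assumptions: each piece arrives at an independent uniform random time in a known window [0, 1], you observe until time 1/e, and then take the first later piece that beats everything seen so far. The number of pieces per trial and the cap n_max are made-up parameters for the demo.

import math
import random

def simulate_time_based_rule(trials=100000, n_max=200):
    # Sketch: the total number of pieces is unknown to the decision maker
    # (drawn at random each trial), but arrival times are uniform on [0, 1].
    cutoff = 1.0 / math.e
    success = 0
    for _ in range(trials):
        n = random.randint(1, n_max)
        arrivals = sorted(random.random() for _ in range(n))
        values = list(range(1, n + 1))
        random.shuffle(values)
        best_seen = float("-inf")
        chosen = None
        for t, v in zip(arrivals, values):
            if t < cutoff:
                best_seen = max(best_seen, v)   # observation phase
            elif v > best_seen:
                chosen = v                      # first later "record"
                break
        if chosen == n:                         # n is the true maximum value
            success += 1
    return success / trials

print(simulate_time_based_rule())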
How would we adapt if we wanted to pick not just the top item, but the top k items?
When trying to pick the top k items instead of just the single best, the strategy requires multiple stopping rules. One possible extension uses dynamic programming, where you keep track of how many items you’ve picked so far and how many remain. The decision at each step is more complex, as you must decide whether to pick or skip based on the value of the current item and how many picks are left. There is no simple fraction-based formula for this scenario, and the solution might be computationally heavy for large n and k.
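One simple and deliberately non-optimal baseline that is still easy to reason about: split the sequence into k consecutive blocks and run the classic 1/e rule independently inside each block, taking one item per block. The sketch below assumes that simplification rather than a full dynamic program.

import math
import random

def k_secretary_heuristic(values, k):
    # Heuristic sketch: one skip-then-select pass per block, k picks in total.
    n = len(values)
    block = max(1, n // k)
    picks = []
    for b in range(k):
        start = b * block
        end = start + block if b < k - 1 else n
        segment = values[start:end]
        if not segment:
            continue
        skip = int(len(segment) / math.e)
        benchmark = max(segment[:skip]) if skip > 0 else float("-inf")
        chosen = next((v for v in segment[skip:] if v > benchmark), segment[-1])
        picks.append(chosen)
    return picks

values = list(range(1, 101))
random.shuffle(values)
print(k_secretary_heuristic(values, 5))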
What if each piece of art costs a small fee to view, so skipping too many becomes expensive?
Introducing a cost mechanism changes the reward structure. Instead of purely maximizing the probability of selecting the highest value, you might need to weigh the expected net gain after deducting the cost of skipping. This transforms the problem into an optimal stopping problem with payoffs that balance the probability of success against the cost of skipping. The mathematics would involve expected value calculations of net utility, factoring in both the expected item value and skipping costs.
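A minimal sketch of this trade-off, assuming values 1..n, a flat fee per viewed item (the cost_per_view parameter is a made-up number), and a payoff equal to the chosen value, lets you compare skip counts by expected net gain.

import random

def expected_net_value(skip_count, n=100, cost_per_view=0.5, trials=20000):
    # Net payoff = value of the chosen item minus the viewing fees paid.
    total = 0.0
    for _ in range(trials):
        values = list(range(1, n + 1))
        random.shuffle(values)
        benchmark = max(values[:skip_count]) if skip_count else float("-inf")
        viewed = skip_count
        chosen = values[-1]
        for v in values[skip_count:]:
            viewed += 1
            if v > benchmark:
                chosen = v
                break
        total += chosen - cost_per_view * viewed
    return total / trials

# Compare a few skip counts to see how viewing costs shift the optimum.
for s in (10, 25, 37, 50):
    print(s, round(expected_net_value(s), 2))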
Is there a way to guarantee selecting a “good” piece even if it isn’t guaranteed to be the best?
If your objective is to pick an item that is likely within a certain percentile (like top 10%) rather than strictly the top 1 out of 100, you can modify the skip-then-select method to define a benchmark that represents that percentile. Once you have a reference from the initial portion and you see an item that meets or exceeds that standard, you choose it. The specific fraction of items to skip and the threshold for selection can be determined using distribution-based assumptions and simulation.
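A quick simulation of this relaxed objective (a sketch assuming values 1..n, with "success" meaning the chosen piece lands in the true top 10%) makes it easy to compare smaller observation windows against the classic 37-item skip.

import random

def top_decile_success(skip_count, n=100, trials=50000):
    wins = 0
    for _ in range(trials):
        values = list(range(1, n + 1))
        random.shuffle(values)
        benchmark = max(values[:skip_count])
        chosen = next((v for v in values[skip_count:] if v > benchmark), values[-1])
        if chosen > 0.9 * n:          # top 10% of the values 1..100
            wins += 1
    return wins / trials

for s in (10, 15, 25, 37):
    print(s, round(top_decile_success(s), 3))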
These follow-up concerns illustrate the range of practical adaptations and deeper questions that might arise in an interview, demonstrating that the foundational skip-then-select strategy is powerful but not strictly universal for all situations.
Below are additional follow-up questions
What happens if the items do not arrive in a purely random order?
In scenarios where the order is not uniformly random, the skip-then-select rule might not yield the same probability of success. For instance, if there is some adversarial ordering or a distribution favoring higher-valued items earlier or later, the classical 1/e threshold may become suboptimal. One practical adjustment is to gather some side information on how items are likely ordered—for example, if high-value artworks are more common in the middle of the sequence—and shift the skip window accordingly.
A subtle pitfall is assuming randomness without verifying it. If the distribution is adversarial, the strategy could perform far worse than expected. Real-world data might also reflect correlation structures (e.g., “clusters” of artwork types, each with characteristic value ranges), so it is best to evaluate the nature of the order before applying a theoretical model.
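A small experiment makes the point: keep the same 1/e rule but feed it a hypothetical biased order in which larger values tend to arrive early, and compare success rates against the uniform-random order. The bias term below is an arbitrary illustration, not a model of any real marketplace.

import math
import random

def run_rule(values, skip_count):
    # Classic skip-then-select applied to a given arrival order.
    benchmark = max(values[:skip_count])
    chosen = next((v for v in values[skip_count:] if v > benchmark), values[-1])
    return chosen == max(values)

def success_rate(order_fn, n=100, trials=50000):
    skip_count = int(n / math.e)
    return sum(run_rule(order_fn(n), skip_count) for _ in range(trials)) / trials

def uniform_order(n):
    values = list(range(1, n + 1))
    random.shuffle(values)
    return values

def early_heavy_order(n):
    # Made-up bias: larger values get pushed toward earlier positions.
    return sorted(range(1, n + 1), key=lambda v: random.random() - 0.01 * v)

print("uniform order :", success_rate(uniform_order))
print("biased order  :", success_rate(early_heavy_order))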
How do we approach the problem when certain artworks can have extremely high but rare values?
If some pieces can have outlier values far exceeding the typical range (often called “heavy-tailed” or “long-tail” distributions), the optimal strategy may need to weigh the risk of skipping potentially massive payoffs against the classical 1/e approach. One potential solution is to combine a Bayesian prior on the expected value distribution with dynamic programming, so that when you spot an exceptionally large value (even among the first few), you might deviate from the skip-then-select rule and select it immediately.
The pitfall is underestimating the effect of outliers. If your distribution’s tail is heavy, you might want a smaller skip window because even seeing one or two extremely large values early on can be worth locking in immediately. Conversely, if you overestimate the frequency of large outliers, you might be too quick to grab a moderately high value instead of waiting for something better.
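One way to probe this is to switch the payoff from "best or nothing" to the value actually captured, draw values from a heavy-tailed distribution (a lognormal with a large sigma is used here purely as an illustrative assumption), and compare skip windows.

import random

def expected_value_captured(skip_count, n=100, trials=20000, sigma=2.0):
    # Payoff is the value of the chosen item, with heavy-tailed (lognormal) values.
    total = 0.0
    for _ in range(trials):
        values = [random.lognormvariate(0.0, sigma) for _ in range(n)]
        benchmark = max(values[:skip_count]) if skip_count else float("-inf")
        chosen = next((v for v in values[skip_count:] if v > benchmark), values[-1])
        total += chosen
    return total / trials

for s in (5, 15, 37):
    print(s, round(expected_value_captured(s), 2))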
If we're allowed to gather partial information about the next items, how does that affect the strategy?
In some real-life scenarios, you might glean partial signals or indicators about future artworks (e.g., you know the approximate category or region they come from). This partial foresight can make the problem dynamic, in that you can adapt your threshold if future items have a higher or lower expected value than those you have already seen.
A possible approach is to update the benchmark based on new information, effectively blending skip-then-select with an online learning component that refines its estimate of the distribution. A pitfall is relying too heavily on incomplete signals; if your partial information is highly noisy, frequent re-adjustment of the threshold can degrade performance. Balancing the confidence in your prior beliefs with observed data becomes essential.
How would the strategy differ if we had a limited number of “free passes” instead of a strict one-shot skip rule?
In a modified version of the problem, you might be allowed to pass on a limited number of artworks (say k times) but must pick from the rest. This changes the entire structure because you can no longer skip freely until you encounter a candidate that beats the benchmark. Instead, you must consider the trade-off between using up one of your limited passes versus committing to a piece that might not be optimal.
A potential approach is a dynamic programming solution in which the state tracks how many passes remain and how many artworks remain to be seen. At each artwork, you decide to pass or pick based on the expected value of future opportunities. One subtlety is the risk of running out of passes too early, forcing you to pick an inferior piece late in the sequence. Another edge case is if you rarely need to use all your passes and could have afforded a more refined skipping policy.
What if each artwork has a time-varying value that changes the longer you wait?
Sometimes, the perceived or actual value of items can fluctuate with time (e.g., an artwork’s resale market might get more saturated as time goes on). The classical approach assumes a static value, but now you have a moving target. This transforms the objective into picking not merely the highest among static values, but the best time to lock in an art piece given how its value might evolve.
The solution might rely on setting a dynamic threshold that factors in both the observed maximum so far and your projections of how values might shift over time. A real-world pitfall is modeling the decay or appreciation incorrectly: if your time model is too simplistic, you could end up locking in a piece that quickly loses value, or passing on a piece that is about to become more valuable.
How do we handle multiple rounds of selection with elimination rules?
Consider a setup where the best items from an initial round are carried over into a subsequent round, and so on, until a final pick is made. Each round might resemble a smaller version of the optimal stopping problem, but the dynamic across rounds complicates things. You must evaluate when it’s worthwhile to commit an item to the next round versus searching for better items in the current round.
A major pitfall here is overcomplicating the strategy without a clear dynamic programming framework. If you take a naive approach (e.g., treat each round independently), you might fail to account for the fact that suboptimal picks in earlier rounds can limit your final choices.
How do we adapt if we only want to beat a certain threshold rather than necessarily picking the single highest value?
Sometimes, the objective is to do “good enough.” For example, you might have a certain value in mind that is acceptable (e.g., at least 80% of the best you believe is out there). In that case, you can set an absolute or relative threshold based on your knowledge of the distribution. The strategy might involve calculating an approximate upper bound and selecting the first artwork that meets or exceeds (0.8 * upper_bound).
The subtle pitfall is setting the threshold too high (resulting in never selecting an item if the distribution was overestimated) or too low (resulting in a trivial early pick). In practice, you could estimate the upper bound from historical data or early observations and adapt your threshold as items arrive.
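A compact sketch of that rule, where the "upper bound" is simply estimated as the maximum of an initial observation window (both the observe and fraction parameters are illustrative choices, not derived values):

import random

def relative_threshold_pick(values, observe=20, fraction=0.8):
    # Estimate an upper bound from the first `observe` items, then accept the
    # first later item at or above `fraction` of that estimate; fall back to
    # the final item if nothing qualifies.
    estimate = max(values[:observe])
    target = fraction * estimate
    return next((v for v in values[observe:] if v >= target), values[-1])

values = list(range(1, 101))
random.shuffle(values)
print(relative_threshold_pick(values))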
What if the artwork values are not strictly distinct, and ties are possible?
The original problem states each piece has a different dollar amount. However, if ties are allowed, you may find multiple artworks sharing the same top value. In that scenario, simply looking for something strictly higher than your benchmark no longer suffices. One approach is to pick the first artwork that is greater than or equal to the maximum from the skip phase. But if you do that, you might pick a piece that ties with your benchmark but isn’t the overall best if there’s a strictly greater one coming later.
In practice, you could choose a tie-breaking policy that’s more nuanced—for example, only pick an item that’s strictly greater than the benchmark until the final few items, at which point you would accept an item that ties the benchmark to ensure you don’t end up with something worse. A tricky edge case is that you might hold out for a strictly better piece and miss your opportunity to pick the tying piece if the distribution is not favorable.
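Such a tie-aware policy might look like the sketch below: require a strict improvement over the benchmark until only a few items remain (final_window is an arbitrary illustrative choice), then accept ties as well.

def pick_with_ties(values, skip_count, final_window=5):
    # Strictly beat the benchmark early on; near the end of the sequence,
    # accept an item that merely ties it rather than risk something worse.
    benchmark = max(values[:skip_count])
    n = len(values)
    for i, v in enumerate(values[skip_count:], start=skip_count):
        if v > benchmark or (v == benchmark and i >= n - final_window):
            return v
    return values[-1]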
Is there a scenario where we might leverage a “rollback” option?
Sometimes, systems might allow a limited rollback—perhaps you can revisit the previously skipped item only if the next item is not better. Although this modifies the problem substantially, the principle of setting an informative benchmark can still apply. You might skip a certain number of items, keep track of the best among them, and if you see something in the next position that is better, pick it; if not, you revert to your best from the skip phase.
A major pitfall is incorrectly counting on multiple rollbacks if the rules only allow one or if each rollback has a cost. Dynamic programming can handle these added complexities, but the calculation grows more demanding, and you must be careful not to exceed realistic time or computational constraints in real applications.
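The single-rollback rule described above can be sketched literally: skip an observation window, compare the very next item against the best of that window, and either take it or exercise the one allowed rollback. This assumes the rollback lets you recover the best skipped piece, as in the scenario stated here.

def pick_with_single_rollback(values, skip_count):
    # Take the item right after the skip phase if it beats the benchmark;
    # otherwise roll back and take the best item seen during the skip phase.
    benchmark = max(values[:skip_count])
    candidate = values[skip_count]
    return candidate if candidate > benchmark else benchmark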
How can we statistically test the effectiveness of a chosen strategy using limited data?
One might rely on online A/B testing or Monte Carlo simulations to evaluate various stopping strategies. In practice, you collect a dataset of random permutations of the art pieces, apply your chosen policy, and compute the frequency of success (i.e., picking the maximum). Then, compare this frequency to baseline strategies (like picking randomly or always picking the first item above some percentile).
A subtle pitfall is insufficient test coverage. If the distribution of artworks or their order is diverse, you need to sample enough permutations to see if the policy adapts properly. Another issue arises if the simulation environment does not accurately reflect the real-world process; the performance measured in tests might deviate significantly from production results.
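A minimal Monte Carlo harness along those lines compares skip-then-select with two naive baselines (a random pick, and "first item above the 90th percentile of the known 1..n range"); both baselines are chosen here purely for illustration.

import math
import random

def evaluate(policy, n=100, trials=50000):
    # Fraction of random permutations in which the policy returns the true maximum.
    wins = 0
    for _ in range(trials):
        values = list(range(1, n + 1))
        random.shuffle(values)
        if policy(values) == n:
            wins += 1
    return wins / trials

def skip_then_select(values):
    skip = int(len(values) / math.e)
    benchmark = max(values[:skip])
    return next((v for v in values[skip:] if v > benchmark), values[-1])

def random_pick(values):
    return random.choice(values)

def first_above_90th(values):
    cutoff = 0.9 * len(values)
    return next((v for v in values if v > cutoff), values[-1])

for name, policy in [("skip-then-select", skip_then_select),
                     ("random pick", random_pick),
                     ("first above 90th pct", first_above_90th)]:
    print(name, evaluate(policy))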