ML Interview Q Series: How would you investigate a model drastically underpricing an item based on inventory, demand, and delivery cost?
Comprehensive Explanation
Understanding why a pricing algorithm is underestimating a product's price requires analyzing the end-to-end pipeline, from raw data ingestion to the final model output. This includes verifying data quality, revisiting feature engineering steps, evaluating model assumptions, and interpreting how the model combines different signals like availability, demand, and logistics costs.
Checking Feature Definitions and Data Quality
One critical step is verifying that each relevant feature is accurate and up-to-date. For example, if logistics cost is incorrectly reported as a lower number, the model might systematically undervalue the product. It is essential to inspect how each feature is computed and ensure the data pipeline doesn’t introduce errors or missing values.
It can be helpful to visualize the distribution of each feature for underpriced items. For instance, if the underpriced product always has a suspiciously low demand value, investigate whether the demand metric is aggregated correctly.
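As a concrete sketch of this audit, the snippet below contrasts feature summaries for underpriced versus normally priced items using pandas; the column names and data are hypothetical.

```python
import pandas as pd

# Hypothetical data: each row is a priced item with its model inputs.
df = pd.DataFrame({
    "demand":        [120, 95, 110, 3, 2, 130],
    "logistic_cost": [8.0, 7.5, 8.2, 1.0, 0.9, 7.8],
    "underpriced":   [False, False, False, True, True, False],
})

# Compare summary statistics of each feature for underpriced vs. normal items.
comparison = df.groupby("underpriced")[["demand", "logistic_cost"]].describe().T
print(comparison)

# A quick red flag: median demand for underpriced items sits far below the
# rest, suggesting the demand metric may be mis-aggregated for those products.
flagged = df[df["underpriced"]]["demand"].median()
normal = df[~df["underpriced"]]["demand"].median()
print(f"median demand (underpriced): {flagged}, (normal): {normal}")
```

If the underpriced group's distributions look implausible (near-zero demand, suspiciously low logistics cost), the next step is to trace those features back through the pipeline rather than debug the model itself.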
Verifying the Pricing Function
In many pricing models, the final price might be modeled as a function of factors like availability, demand level, and logistics cost. A simplified representation could be:

price = alpha + beta_1 * availability + beta_2 * demand + beta_3 * logistic_cost

where alpha is an intercept capturing any baseline pricing offset, and beta_1, beta_2, and beta_3 are the learned coefficients for availability, demand, and logistic_cost respectively.
After examining the formula, check if alpha is biased toward negative or minimal values, or if any coefficient is unexpectedly small. In large-scale systems, each of these terms might be replaced by more complex transformations, but the principle remains the same: confirm that each learned parameter is reasonable.
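As a sanity check of this principle, the sketch below fits the simplified linear form on noise-free synthetic data with numpy and recovers the parameters; the feature names and ground-truth coefficients are illustrative.

```python
import numpy as np

# Fit price = alpha + beta_1*availability + beta_2*demand + beta_3*logistic_cost
# on synthetic data and confirm each learned parameter is recovered.
rng = np.random.default_rng(0)
n = 500
availability = rng.uniform(0, 1, n)
demand = rng.uniform(0, 100, n)
logistic_cost = rng.uniform(1, 20, n)

# Illustrative ground-truth generating process for the sanity check.
price = 5.0 + 2.0 * availability + 0.3 * demand + 1.5 * logistic_cost

X = np.column_stack([np.ones(n), availability, demand, logistic_cost])
coefs, *_ = np.linalg.lstsq(X, price, rcond=None)
alpha, beta_1, beta_2, beta_3 = coefs
print(f"alpha={alpha:.2f}, beta_1={beta_1:.2f}, "
      f"beta_2={beta_2:.2f}, beta_3={beta_3:.2f}")

# Red flags to look for on a real model: alpha near zero or negative, or
# beta_3 (logistic_cost) close to zero when shipping is known to be expensive.
```

On a production model the same inspection applies to whatever replaces these coefficients (feature attributions, learned embeddings), but a linear probe like this is a quick first check.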
Investigating the Optimization Objective
Depending on how the model is trained—whether it’s minimizing mean squared error, maximizing revenue, or optimizing for conversion probability—misalignment between the objective function and the actual business goal can cause underpricing. If the model is rewarded for achieving more sales (regardless of margin), it may undervalue the product. Ensuring that the loss function or objective aligns with profit-based metrics is key.
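One way to encode such a profit-aware objective is an asymmetric loss that penalizes underpricing more heavily than overpricing; the 3x penalty factor below is an illustrative choice, not a standard value.

```python
import numpy as np

# Asymmetric, profit-aware loss sketch: underpredicting the price
# (pred < actual) is weighted more heavily than overpredicting, so the
# optimizer is discouraged from sacrificing margin for volume.
def profit_aware_loss(pred, actual, under_penalty=3.0):
    err = pred - actual
    weights = np.where(err < 0, under_penalty, 1.0)  # heavier weight on underpricing
    return float(np.mean(weights * err ** 2))

pred = np.array([90.0, 105.0])
actual = np.array([100.0, 100.0])
# Symmetric MSE would score these errors as (100 + 25) / 2 = 62.5;
# the asymmetric loss penalizes the underpriced item 3x: (300 + 25) / 2 = 162.5.
print(profit_aware_loss(pred, actual))
```

The same idea extends to gradient-boosted or neural models that accept custom objectives, provided the asymmetry factor is validated against actual margin targets.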
Validating Assumptions and Distribution Shifts
Sometimes, the demand estimates may be outdated, or consumer behavior may have changed, making the original assumptions invalid. If the model was trained on historical data that no longer reflects real-world conditions, it might continue to underprice. Confirm that the training data still represents current market dynamics, and check for distribution shifts: for instance, the demand for this product may have spiked in the last few weeks due to a trend not captured in older data.
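A lightweight way to quantify such a shift is the Population Stability Index (PSI); the sketch below implements it with numpy on synthetic demand data, using the common rule-of-thumb threshold of 0.25 for a significant shift.

```python
import numpy as np

# Population Stability Index (PSI) sketch for detecting demand drift between
# the training window and recent data. Bucket edges come from training data.
def psi(train, recent, bins=10):
    edges = np.quantile(train, np.linspace(0, 1, bins + 1))
    edges[0], edges[-1] = -np.inf, np.inf            # catch out-of-range values
    p = np.histogram(train, edges)[0] / len(train)
    q = np.histogram(recent, edges)[0] / len(recent)
    p, q = np.clip(p, 1e-6, None), np.clip(q, 1e-6, None)
    return float(np.sum((p - q) * np.log(p / q)))

rng = np.random.default_rng(42)
train_demand = rng.normal(50, 10, 5000)
spiked_demand = rng.normal(80, 10, 5000)  # demand spiked after training

print(f"PSI (no shift):   {psi(train_demand, train_demand):.3f}")
print(f"PSI (with spike): {psi(train_demand, spiked_demand):.3f}")
```

Running such a check per feature on a schedule turns "the training data no longer reflects the market" from a hunch into a measurable alert.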
Exploring Model Interpretability Methods
Tools such as SHAP or LIME can highlight how each feature influences the final pricing decision. If the logistic cost feature has almost no impact, or the model interprets availability as extremely high when it’s actually low, you get clues about which specific inputs are leading to erroneous predictions. This helps pinpoint if the underpricing is due to a single faulty feature or an interaction among multiple features.
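When SHAP or LIME is not available, a simple permutation-importance check can serve as a stand-in; the sketch below, run against a toy model that ignores shipping cost, shows how a near-zero importance score exposes the ignored feature. The model and feature names are hypothetical.

```python
import numpy as np

# Permutation importance: shuffle one feature at a time and measure how much
# predictions move. A feature whose shuffling changes nothing is being ignored.
def permutation_importance(predict, X, n_repeats=20, seed=0):
    rng = np.random.default_rng(seed)
    base = predict(X)
    scores = []
    for j in range(X.shape[1]):
        deltas = []
        for _ in range(n_repeats):
            Xp = X.copy()
            rng.shuffle(Xp[:, j])                    # break feature j's signal
            deltas.append(np.mean((predict(Xp) - base) ** 2))
        scores.append(float(np.mean(deltas)))
    return scores

# Toy model that (buggily) ignores logistic_cost, column 2.
def buggy_price_model(X):
    return 10 + 2 * X[:, 0] + 0.5 * X[:, 1] + 0.0 * X[:, 2]

rng = np.random.default_rng(1)
X = rng.uniform(0, 10, size=(200, 3))  # availability, demand, logistic_cost
imp = permutation_importance(buggy_price_model, X)
print(dict(zip(["availability", "demand", "logistic_cost"], imp)))
```

Here the zero score on logistic_cost is the tell: shipping expenses have no influence on the price, which is exactly the kind of clue interpretability tooling surfaces on a real model.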
Re-Examining Business Constraints and External Data
Pricing often includes constraints, such as setting a minimum margin or ensuring the product price remains consistent with marketplace competition. If these constraints are missing or incorrectly implemented, the algorithm might systematically undercut the price. If competitive pricing data or real-time market signals are not considered, or are incorrectly weighted, the model might not reflect true market conditions.
Monitoring and Alerting
Deploy mechanisms that flag abnormal pricing outputs before they become widespread. By setting up monitoring thresholds—like checking if the price for any product drops below a certain margin—one can catch anomalies and initiate an immediate diagnostic process to prevent revenue loss.
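A minimal version of such a guardrail might look like the following; the 10% minimum-margin threshold and the catalog fields are illustrative assumptions.

```python
# Minimal price-floor monitor: flag any product whose proposed price falls
# below cost plus a minimum margin before the price goes live.
MIN_MARGIN = 0.10  # illustrative business rule

def flag_underpriced(products):
    alerts = []
    for p in products:
        floor = p["unit_cost"] * (1 + MIN_MARGIN)
        if p["proposed_price"] < floor:
            alerts.append({**p, "price_floor": round(floor, 2)})
    return alerts

catalog = [
    {"sku": "A1", "unit_cost": 20.0, "proposed_price": 30.0},
    {"sku": "B2", "unit_cost": 50.0, "proposed_price": 48.0},  # below cost!
]
for alert in flag_underpriced(catalog):
    print(f"ALERT: {alert['sku']} priced {alert['proposed_price']} "
          f"below floor {alert['price_floor']}")
```

In production the same check would feed an alerting system (and possibly block publication) rather than just print, but the core logic is this comparison against a floor.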
Follow-up Questions
How would you incorporate domain knowledge regarding shipping costs into this model?
One effective approach is to ensure that the shipping cost feature is carefully engineered to reflect real variations in delivery expenses. For instance, shipping cost might differ drastically by geographical region or shipping speed (standard versus expedited). By creating separate features (like average shipping distance, shipping method, etc.) rather than a single scalar, you can more accurately capture how logistics expenses scale. Ensuring domain experts review these feature definitions is also critical because they can highlight hidden factors such as seasonal surcharges or complex packaging rules that might inflate or reduce cost.
In practice, you might build separate components for each sub-cost, then integrate them into a combined “logistics cost index.” This higher-fidelity measure can lead to a more accurate final pricing recommendation because it reduces the chance of a single low or inaccurate shipping-cost figure driving the model toward underpricing.
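A minimal sketch of such an index, with illustrative component names and no claim about the right weighting, might be:

```python
# Combined "logistics cost index" built from auditable sub-cost components.
# Component names and values are illustrative; in practice domain experts
# would define and validate each one.
def logistics_cost_index(shipping, handling, packaging, seasonal_surcharge=0.0):
    return shipping + handling + packaging + seasonal_surcharge

# A single stale scalar (e.g., last year's flat shipping fee) would understate
# the true cost; the decomposed version surfaces each driver separately.
cost = logistics_cost_index(shipping=6.5, handling=1.2, packaging=0.8,
                            seasonal_surcharge=1.5)
print(cost)
```

The value of the decomposition is less the arithmetic than the auditability: when the index looks too low, each component can be traced to its own data source.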
How can you handle new or rarely sold products when the historical data is sparse?
When the product is new or sales are infrequent, the model might lack sufficient examples to learn robust pricing patterns. Approaches include using hierarchical models that share statistical strength across similar product categories, or employing transfer learning from more data-rich categories to bootstrap the pricing estimates. Another strategy is to set a fallback rule-based price derived from margin targets or competitor benchmarks until sufficient data accumulates. This prevents the model from issuing drastically incorrect prices due to limited training examples.
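The fallback idea can be sketched as follows; the sales-count threshold, target margin, and competitor-anchoring rule are all illustrative assumptions.

```python
# Fallback pricing for new or rarely sold products: trust the model only when
# enough sales history exists; otherwise fall back to a rule-based price from
# cost, a target margin, and a competitor benchmark.
MIN_SALES_FOR_MODEL = 30  # illustrative threshold

def price_with_fallback(model_price, n_historic_sales, unit_cost,
                        competitor_price, target_margin=0.25):
    if n_historic_sales >= MIN_SALES_FOR_MODEL:
        return model_price
    margin_price = unit_cost * (1 + target_margin)
    # Anchor near the competitor but never below the margin target.
    return max(margin_price, 0.95 * competitor_price)

# New product with 3 sales: the model's (likely unreliable) output is ignored
# in favor of the rule-based price.
print(price_with_fallback(model_price=12.0, n_historic_sales=3,
                          unit_cost=40.0, competitor_price=60.0))
```

As sales accumulate past the threshold, pricing hands over to the model automatically, which avoids a manual cutover step.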
What measures would you take to confirm that the underpricing issue has been resolved?
One step is to conduct an A/B test comparing the revised pricing strategy against the old one, evaluating performance metrics like profit margin, conversion rate, and overall revenue. If the new prices yield improved profit margin without sacrificing too many sales, it indicates a correction of the underpricing. It’s also important to monitor any shifts over time to ensure that the fix remains valid under changing market conditions. Additionally, you can inspect post-deployment diagnostic metrics (e.g., how well predicted prices match realized sales data) to verify the model’s reliability.
If the objective is profit maximization but the model is trained on historical cost/demand data, how do you align these objectives in practice?
Aligning the model’s training procedure with true profit maximization can be challenging because the real profit depends on complex factors beyond just cost and demand. One strategy is to simulate or approximate profit outcomes using historical data and incorporate that directly into the training objective. For example, a custom loss function that penalizes predicted prices below a certain profit margin can push the model to avoid underpricing. Regular check-ins with finance and product teams help validate whether these simulated metrics match real-world profit behavior, ensuring that the trained model’s objective and the business objective coincide as closely as possible.
Additional Follow-up Questions
What if there are significant outliers in the cost or demand data that skew the model’s pricing?
Outliers in cost or demand data can drastically pull the model parameters in an unintended direction, especially if the model is sensitive to extreme values. For instance, a single warehouse with an unusually high handling cost can inflate logistic_cost for certain training examples, skewing the learned cost coefficient and distorting predicted prices for products in other regions. Similarly, a sudden short-term surge in demand might cause the model to overestimate future interest.
One way to handle this is by examining data distributions at every stage. You can apply robust transformations or truncate values beyond certain thresholds if they don’t reflect realistic conditions. Another approach involves outlier detection methods like isolation forests to identify and filter irregular data points or weigh them differently in the training process. A key edge case to watch out for: sometimes an “outlier” may actually be a harbinger of a permanent market shift (e.g., unexpected but lasting surge in demand). Blindly removing such data could lead to ignoring real and emerging trends.
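A simple robust transformation of the kind described is winsorization, i.e., clipping values to percentile bounds; the sketch below uses numpy, with illustrative cutoffs.

```python
import numpy as np

# Winsorization sketch: clip cost values to the 1st/99th percentiles so a
# single anomalous warehouse cost can't drag the fitted parameters.
# The percentile cutoffs are an illustrative choice; validate with domain input.
def winsorize(values, lower_pct=1, upper_pct=99):
    lo, hi = np.percentile(values, [lower_pct, upper_pct])
    return np.clip(values, lo, hi)

rng = np.random.default_rng(7)
costs = rng.normal(10, 2, 1000)
costs[0] = 500.0  # one warehouse reports an absurd handling cost

clipped = winsorize(costs)
print(f"raw max: {costs.max():.1f}, winsorized max: {clipped.max():.1f}")
# Caveat from the text: before clipping, check whether the "outlier" is in
# fact a persistent market shift rather than a data error.
```

The caveat in the comment matters operationally: winsorization should be paired with an alert when clipping rates rise, so a genuine regime change is investigated rather than silently truncated.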
How would you deal with brand perception or intangible aspects that are hard to quantify numerically?
Certain products command a premium because of brand equity or perceived luxury. If the pricing model only uses features like demand, availability, and logistic_cost, it may fail to capture intangible factors such as brand loyalty, perceived quality, or exclusivity. A pitfall emerges when a product’s brand value is not accounted for: the model may repeatedly underprice it, wrongly concluding that it’s similar to lower-value items.
To address this, you could incorporate carefully engineered features that serve as proxies for brand perception—like customer sentiment scores derived from social media mentions or average product ratings and reviews. Alternatively, cluster products into tiers or categories that reflect brand positioning, and allow the model to learn different baseline price offsets for each cluster. In real-world deployments, watch out for new brands that quickly gain traction—historical data might underestimate their intangible value.
How might rapidly changing competitor prices lead the model to underprice?
If your pricing strategy partially hinges on competitor pricing data and your competitors engage in frequent, dynamic changes—like flash sales—there is a risk of chasing a “race to the bottom.” The model could overreact to transient competitor discounts, leading to a perpetual undervaluation of your products. This becomes especially problematic if your model lacks a mechanism to place a floor on margins or incorporate a brand’s market position.
A robust solution is to introduce time decay in competitor price observations so that ephemeral discount events have limited impact. Another measure is employing business rules that enforce a minimum viable margin. Additionally, you can segment competitor price data by brand equivalence or product similarity to avoid blindly matching prices from lower-quality or lesser-known brands. Watch for edge cases like seasonal competitor promotions that spike demand but only last a few days—your model might incorrectly internalize those temporary drops as long-term trends.
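A sketch of time-decayed competitor price aggregation, assuming an illustrative 14-day half-life:

```python
import numpy as np

# Exponentially time-decayed weighting of competitor price observations, so a
# brief flash sale has limited pull on the reference price relative to simply
# matching the latest observed price. The 14-day half-life is illustrative.
def decayed_reference_price(prices, ages_days, half_life=14.0):
    w = 0.5 ** (np.asarray(ages_days) / half_life)
    return float(np.sum(w * np.asarray(prices)) / np.sum(w))

# Competitor listed at ~100 for weeks, with a 2-day-old flash sale at 60.
prices = [100, 100, 100, 60]
ages = [28, 21, 7, 2]
print(f"{decayed_reference_price(prices, ages):.2f}")
# Naively matching the latest competitor price (60) would chase the discount;
# the decayed average keeps the reference near the prevailing level (~84).
```

Combined with a hard margin floor, this keeps ephemeral discounts from triggering the race-to-the-bottom behavior described above.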
If the product’s supplier landscape changes frequently, how do you ensure accurate cost estimates?
Shifting supplier contracts, raw material volatility, or global events (e.g., shipping delays) can cause logistic_cost to fluctuate rapidly. If your data pipeline relies on static or outdated cost assumptions, underpricing becomes a real risk. A mismatch can occur when your model is unaware that shipping costs have climbed, so it keeps recommending prices that assume cheaper supplier conditions.
One strategy is to maintain a real-time or near-real-time cost feed that updates your model inputs regularly. If the model is retrained less frequently, you might implement an adjustment factor in the final predicted price that accounts for known short-term cost fluctuations. Potential pitfalls include sudden disruptions (like a natural disaster) that rapidly change supply routes, so you need fail-safe checks in place (e.g., substituting the maximum logistic_cost observed in the last 24 hours) to override the model's outdated assumptions until it can be retrained or recalibrated.
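The adjustment-factor idea can be sketched as a simple cost pass-through applied on top of the stale model output; the 100% pass-through rate is an illustrative assumption.

```python
# Short-term cost adjustment on top of a stale model price: if live
# logistic_cost exceeds the cost assumed at training time, pass the
# difference through to the final price until the model is retrained.
def adjusted_price(model_price, trained_cost, live_cost, pass_through=1.0):
    return model_price + pass_through * max(0.0, live_cost - trained_cost)

# Model trained when shipping cost was 5.0; it has since jumped to 9.0.
print(adjusted_price(model_price=42.0, trained_cost=5.0, live_cost=9.0))
```

This is deliberately one-sided (it only corrects upward) so it acts as a stopgap against underpricing; a retrained model remains the proper fix.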
How do you incorporate seasonality or cyclical demand patterns without causing underpricing in off-seasons?
Demand for many products ebbs and flows based on season, special events, or cyclical consumer behavior. If your model’s primary data window doesn’t capture these fluctuations correctly, it might predict lower prices during off-seasons without adjusting back upward when demand rises. This can lead to significant revenue loss during the product’s peak season.
A common approach is to engineer time-based features, such as the month of the year, holiday indicators, or day-of-week demand cycles. More advanced techniques use time-series models that explicitly account for seasonal trends. Potential pitfalls include partial seasonality shifts (like an unseasonably warm winter) and multi-year cyclical patterns (e.g., products that peak every two years). Always verify that the seasonal features reflect the actual consumption cycle; misaligned seasonal data can worsen predictions rather than improve them.
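A minimal sketch of such time-based feature engineering with pandas, using a hypothetical (and deliberately incomplete) holiday list:

```python
import numpy as np
import pandas as pd

# Illustrative holiday set; a real pipeline would use a maintained calendar.
HOLIDAYS = {"2023-12-25", "2023-11-24"}

def add_seasonal_features(df, date_col="date"):
    out = df.copy()
    dt = pd.to_datetime(out[date_col])
    out["month"] = dt.dt.month
    out["day_of_week"] = dt.dt.dayofweek          # 0 = Monday
    out["is_holiday"] = dt.dt.strftime("%Y-%m-%d").isin(HOLIDAYS)
    # Cyclical encoding so December (12) sits next to January (1).
    out["month_sin"] = np.sin(2 * np.pi * out["month"] / 12)
    out["month_cos"] = np.cos(2 * np.pi * out["month"] / 12)
    return out

df = pd.DataFrame({"date": ["2023-12-25", "2023-06-15"], "units_sold": [40, 12]})
print(add_seasonal_features(df)[["month", "day_of_week", "is_holiday"]])
```

The sine/cosine encoding is worth the extra two columns: a raw month number makes December and January look maximally far apart, which distorts any distance-based or linear treatment of seasonality.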
How do you handle personalized pricing without causing fairness or ethical concerns?
Personalized pricing tailors costs to individual customers based on their browsing behavior, purchase history, or demographic data. While this can optimize revenue, it raises ethical and fairness questions if certain groups consistently receive inflated prices. Additionally, legal restrictions on price discrimination may limit your ability to vary prices too broadly.
To mitigate pitfalls, set clear guidelines on what personalization factors are allowable (e.g., loyalty status, product interest) and avoid sensitive attributes that could create bias. A practical approach is to define price ranges or discounts that are permissible for each customer segment, rather than letting the model generate arbitrary figures. You also need a robust compliance check to ensure the algorithm doesn’t inadvertently discriminate. Edge cases include situations where multiple users share the same IP address or device, leading the model to apply a single profile’s pricing to many distinct customers.
In cases where the product is sold with optional add-ons, how do you prevent underpricing the bundle?
Many products come bundled with related accessories or warranty services. If your model focuses only on the base product price, it may not capture the value of optional add-ons. As a result, the model might aggressively lower the base product price in an effort to drive sales, ignoring the potential revenue from upsells.
One solution is to model the total revenue per transaction instead of just focusing on the base product. For instance, you could treat the entire set of products and add-ons as a bundle, then predict the optimal price for the package. Alternatively, you can create a two-stage model: first predict the base product price, then a separate model for add-ons. Watch out for edge cases where a discounted product plus a high-margin add-on yields higher total profit, making it acceptable to underprice the base item. Balancing these complexities requires clarity on which metric—total revenue or base product margin—dominates your pricing objective.
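The trade-off described can be made concrete with a toy expected-profit comparison; all prices, costs, conversion rates, and attach rates below are illustrative.

```python
# Toy illustration of the edge case from the text: a discounted base item plus
# a high-margin add-on can beat a full-price base item on total profit.
def expected_profit(base_price, base_cost, conversion,
                    addon_price=0.0, addon_cost=0.0, attach_rate=0.0):
    per_sale = (base_price - base_cost) + attach_rate * (addon_price - addon_cost)
    return conversion * per_sale

full_price = expected_profit(base_price=100, base_cost=70, conversion=0.05)
discounted = expected_profit(base_price=85, base_cost=70, conversion=0.09,
                             addon_price=40, addon_cost=10, attach_rate=0.5)
print(full_price, discounted)  # per-visitor expected profit
```

Numbers like these are why the pricing objective must be pinned down explicitly: optimizing base-product margin and optimizing total transaction profit can recommend opposite prices for the same item.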
How do you manage partial inventory or pre-order scenarios?
If a product is partially out of stock or available on backorder, standard availability signals might be misleading. The model could interpret low immediate availability as a reason to increase price (assuming scarcity), yet for a backordered product, the delayed shipping could reduce consumer interest. Balancing these effects is tricky. If the model isn’t aware that “out of stock” for a short time doesn’t necessarily mean higher demand, it could produce inflated or deflated prices.
You might introduce a feature indicating whether the product is backorderable, along with estimates of how long customers will wait for shipment. The pricing logic should reflect potential cancellations or lost sales if the wait time is too long. Edge cases arise when a product is unexpectedly restocked sooner than anticipated—your model might hold on to an artificially high price, making sales drop. A well-designed system includes real-time updates to availability data and explicit logic to prevent “stock shock” from unduly affecting prices.
What if your model is highly accurate for most products but drastically wrong for a niche subset?
It’s common for a model to perform well on the majority of items while failing on a small category of niche or specialized products. These products might have unique cost structures, demand patterns, or brand aspects that the model hasn’t captured. This can lead to large pricing errors—often underpricing or overpricing—within that niche.
One solution is to segment your product catalog into clusters, ensuring that niche categories are modeled separately. You can either create specialized sub-models for these clusters or build hierarchical models that account for broad-level behaviors and then refine for category-level nuances. Pitfalls arise if the niche segment is too small to train a separate model robustly. In such cases, it might be more effective to apply a rule-based adjustment or a combined approach where the global model’s prediction is corrected by expert domain rules for that niche. Always examine prediction error metrics per product category to identify these pockets of poor performance early.
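A per-category error audit of the kind described can be sketched in a few lines of pandas; the categories and numbers are illustrative.

```python
import pandas as pd

# Per-category error audit: mean absolute percentage error by product category
# surfaces niche segments where a globally accurate model fails.
df = pd.DataFrame({
    "category":  ["mainstream"] * 4 + ["niche"] * 2,
    "actual":    [100, 120, 80, 95, 400, 350],
    "predicted": [ 98, 123, 79, 96, 250, 210],   # niche items badly underpriced
})
df["ape"] = (df["predicted"] - df["actual"]).abs() / df["actual"]
mape_by_cat = df.groupby("category")["ape"].mean()
print(mape_by_cat)
# The overall average error would look acceptable; the per-category breakdown
# exposes the niche segment where the model is drastically wrong.
```

Tracking this breakdown over time (not just at launch) is what catches a niche segment drifting into poor performance early.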
How do you balance the long-term brand impact versus short-term profitability in your pricing algorithm?
A purely revenue-optimizing model may propose aggressive price cuts to move inventory quickly. While this can boost immediate sales, it risks eroding a premium brand’s perception over time. Alternatively, consistently inflating prices for short-term gains might alienate loyal customers and reduce lifetime value.
To address this, you can incorporate a longer-term metric into your training or evaluation process, such as projected customer lifetime value or brand loyalty index. The model could be penalized for frequent drastic price changes. A potential pitfall is that long-term metrics are often harder to quantify accurately, requiring advanced forecasting or heuristics. Also, brand-damaging effects might take months or years to materialize, so your model’s short-term success metrics might hide a slow erosion in brand equity. Close collaboration with marketing and product strategy teams is essential to ensure that short-term price optimizations don’t undercut the brand’s long-term positioning.