ML Interview Q Series: Calculating Mean and Variance for Linear Combinations of Independent Normal Variables.
Suppose X and Y are independent normal random variables, where X has mean 3 and variance 4, and Y has mean 1 and variance 4. Determine the mean and variance of the random variable 2X - Y.
Comprehensive Explanation
When we have a linear combination of two independent normal random variables (for example, 2X - Y), the result is also a normally distributed random variable. The mean and variance of this new variable can be derived from the properties of expectation and variance as follows:
Mean of 2X - Y
The expectation of a linear combination of random variables aX + bY is given by aE(X) + bE(Y). Here, a = 2 and b = -1, so E(Z) = 2E(X) - E(Y), where Z = 2X - Y. In our case, E(X) = 3 and E(Y) = 1. Substituting these in:
E(Z) = 2 * 3 + (-1) * 1 = 6 - 1 = 5.
Hence, the mean of 2X - Y is 5.
Variance of 2X - Y
Because X and Y are independent, the variance of aX + bY is a^2 Var(X) + b^2 Var(Y). Here again, a = 2 and b = -1.
Given Var(X) = 4 and Var(Y) = 4, we plug these in:
Var(Z) = 2^2 * 4 + (-1)^2 * 4 = 4 * 4 + 1 * 4 = 16 + 4 = 20.
Hence, the variance of 2X - Y is 20.
Therefore, if X ~ N(3, 4) and Y ~ N(1, 4), and they are independent, then 2X - Y follows a normal distribution with mean 5 and variance 20.
Follow-up Questions
Why is 2X - Y still normally distributed?
When X and Y are normal and independent, any linear combination aX + bY remains normally distributed, because the normal family is closed under linear combinations of independent (more generally, jointly normal) variables. Even with multiple independent normal variables, summing or subtracting them with constant coefficients preserves normality. If X and Y were not normal, the distribution of 2X - Y might not be normal.
What if X and Y are not independent?
If X and Y are correlated (not independent), the variance of aX + bY includes the covariance term. Specifically, the general variance formula for aX + bY is a^2 Var(X) + b^2 Var(Y) + 2ab Cov(X, Y). Hence, when X and Y have some correlation, we must consider Cov(X, Y) in the calculation.
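As a quick numerical check, here is a minimal sketch that simulates correlated normals (the covariance value of 2, i.e., correlation 0.5, is an arbitrary choice for illustration) and compares the empirical variance of 2X - Y with the formula above:

import numpy as np

rng = np.random.default_rng(42)
N = 1_000_000

# Correlated (X, Y) with Var(X) = Var(Y) = 4 and Cov(X, Y) = 2 (correlation 0.5)
cov = [[4.0, 2.0], [2.0, 4.0]]
samples = rng.multivariate_normal(mean=[3.0, 1.0], cov=cov, size=N)
X, Y = samples[:, 0], samples[:, 1]

Z = 2 * X - Y
# Theory: Var(Z) = 4*4 + 1*4 + 2*(2)*(-1)*2 = 16 + 4 - 8 = 12
print("Empirical Var(2X - Y):", np.var(Z, ddof=1))  # close to 12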
How would this change if the distributions are not normal?
For non-normal distributions, a linear combination does not necessarily follow the same family of distributions. You would need additional assumptions or apply something like the Central Limit Theorem (CLT) if you have a large enough number of i.i.d. random variables. For two random variables specifically, without normality, the exact distribution could be much more complicated.
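For instance, a small sketch (using exponential variables as an arbitrary non-normal example) shows that 2X - Y inherits skewness and is therefore not normal:

import numpy as np
from scipy.stats import skew

rng = np.random.default_rng(0)
N = 1_000_000

# Exponential X and Y (scale parameters chosen arbitrarily for illustration)
X = rng.exponential(scale=3.0, size=N)
Y = rng.exponential(scale=1.0, size=N)
Z = 2 * X - Y

# A normal distribution has skewness 0; here it is clearly positive
print("Skewness of 2X - Y:", skew(Z))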
Could you show a quick Python snippet to verify these theoretical results?
import numpy as np

# Number of samples
N = 1_000_000

# Generate samples for X and Y (scale is the standard deviation, sqrt(variance))
X = np.random.normal(loc=3, scale=np.sqrt(4), size=N)
Y = np.random.normal(loc=1, scale=np.sqrt(4), size=N)

# Form Z = 2X - Y
Z = 2*X - Y

# Empirical mean and variance
empirical_mean = np.mean(Z)
empirical_var = np.var(Z, ddof=1)

print("Empirical Mean of 2X - Y:", empirical_mean)
print("Empirical Variance of 2X - Y:", empirical_var)
By running this code, you would observe that the empirical mean is close to 5 and the empirical variance is close to 20, confirming the theoretical derivation.
Below are additional follow-up questions
How can we derive the moment generating function (MGF) of 2X - Y?
The moment generating function of a random variable Z is defined as E[exp(tZ)]. For normally distributed Z, the MGF has a specific closed form. Suppose Z = 2X - Y. Since X and Y are independent normal variables, Z is also normal.
For a normal random variable with mean mu and variance sigma^2, the MGF is M(t) = exp(mu*t + (1/2)*sigma^2*t^2). Here E(Z) = 5 and Var(Z) = 20, so M_Z(t) = exp(5t + 10t^2).
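As a rough numerical check, the sketch below compares the empirical MGF E[exp(tZ)] from simulated samples against exp(5t + 10t^2) on a few small t values (the grid of t values is arbitrary; large |t| becomes numerically unstable):

import numpy as np

rng = np.random.default_rng(1)
Z = 2 * rng.normal(3, 2, size=1_000_000) - rng.normal(1, 2, size=1_000_000)

# Compare the empirical MGF E[exp(tZ)] with the closed form exp(5t + 10t^2)
for t in [-0.2, -0.1, 0.1, 0.2]:
    empirical = np.mean(np.exp(t * Z))
    theoretical = np.exp(5 * t + 10 * t**2)
    print(f"t={t:+.1f}  empirical={empirical:.4f}  theoretical={theoretical:.4f}")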
Potential Pitfalls and Edge Cases:
If the distributions were only approximately normal or had heavy tails, the empirical MGF might deviate significantly for large t, reflecting tail behaviors.
If X and Y are not independent, you would need the covariance term in the exponent for the combined variable’s MGF.
In real-world scenarios with limited data, estimating the MGF empirically can be numerically unstable for large t values (due to exponential blow-up).
What happens if the means or variances are estimated from data rather than known?
In many practical situations, the true means and variances of X and Y are not known but are estimated from samples. For example, we might have sample means mX, mY and sample variances sX^2, sY^2. Then we would estimate the distribution of 2X - Y as approximately normal with mean 2mX - mY and variance 4sX^2 + sY^2 (assuming independence).
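A minimal sketch of this plug-in approach (the two samples below are simulated stand-ins for real observations, with arbitrary sample sizes):

import numpy as np

rng = np.random.default_rng(7)

# Pretend these are observed data for X and Y
x_sample = rng.normal(3, 2, size=200)
y_sample = rng.normal(1, 2, size=150)

mX, mY = x_sample.mean(), y_sample.mean()
sX2, sY2 = x_sample.var(ddof=1), y_sample.var(ddof=1)

est_mean = 2 * mX - mY   # estimate of E(2X - Y)
est_var = 4 * sX2 + sY2  # estimate of Var(2X - Y), assuming independence
print("Estimated mean:", est_mean)
print("Estimated variance:", est_var)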
Potential Pitfalls and Edge Cases:
Sampling error in estimating the means and variances can lead to uncertainty in the final distribution for 2X - Y, especially with small sample sizes.
If the sample sizes for X and Y are drastically different, the variance estimates might be imbalanced, skewing the final distribution’s variance.
If outliers exist in X or Y, the variance estimates can be inflated, affecting the reliability of the normal assumption.
How would a small sample size affect the validity of the normal assumption?
Even though X and Y are theoretically normal, in practice we only get finite samples. If the sample size is small, conventional statistical tests (like Shapiro–Wilk) may lack the power to detect departures from normality. Additionally, the sample mean and sample variance might be poor estimates of the true parameters, causing a mismatch between the theoretical and observed distributions.
Potential Pitfalls and Edge Cases:
With very few data points, the Central Limit Theorem arguments used to justify normality become weak, and the observed data might not reflect the true underlying normal distribution.
In cases of extremely small data sets, confidence intervals for the estimated mean and variance could be very wide, causing large uncertainty in the distribution of 2X - Y.
If the data is skewed or exhibits kurtosis, the normal assumption breaks down more severely in small-sample regimes.
Can we test the assumption that 2X - Y is normal using real-world data?
Yes, we can apply normality tests (like the Kolmogorov–Smirnov test or the Anderson–Darling test) or visual checks (like Q–Q plots) to samples of Z = 2X - Y computed from real data. Strictly speaking, a high p-value means we fail to reject normality, so the data are consistent with a normal model rather than proven normal.
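For example, a sketch using scipy.stats (here Z is simulated, standing in for real data; note that feeding parameters estimated from the same data into the K-S test makes its p-value only approximate):

import numpy as np
from scipy import stats

rng = np.random.default_rng(3)
Z = 2 * rng.normal(3, 2, size=5_000) - rng.normal(1, 2, size=5_000)

# Shapiro-Wilk test (well suited to small and moderate sample sizes)
stat, p = stats.shapiro(Z)
print("Shapiro-Wilk p-value:", p)

# Kolmogorov-Smirnov test against N(mean, std) fitted from the data
stat, p = stats.kstest(Z, "norm", args=(Z.mean(), Z.std(ddof=1)))
print("K-S p-value:", p)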
Potential Pitfalls and Edge Cases:
Normality tests may fail to reject the null hypothesis due to insufficient sample size (lack of statistical power), not necessarily because the data is truly normal.
Real-world data often have tails heavier than normal (e.g., financial returns), so a normality test could reveal that 2X - Y shows substantial deviation in the tails.
Mixed distributions or data from processes that combine different underlying regimes can invalidate a single Gaussian assumption.
What if X or Y are truncated or censored in real data?
Truncation or censoring means that for certain values (like extremely low or high observations), the data is either not observed or is aggregated (e.g., “less than 0”). This happens in numerous fields, such as medical studies with detection limits or insurance data with maximum coverage limits.
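To illustrate, a sketch using scipy.stats.truncnorm (the truncation of X below 0 is an arbitrary example of a detection limit):

import numpy as np
from scipy.stats import truncnorm

rng = np.random.default_rng(5)
N = 1_000_000

# X truncated below at 0: a, b are expressed in standard deviations from loc
a, b = (0 - 3) / 2, np.inf
X_trunc = truncnorm.rvs(a, b, loc=3, scale=2, size=N, random_state=rng)
Y = rng.normal(1, 2, size=N)

Z = 2 * X_trunc - Y
print("Mean of 2X - Y with truncated X:", Z.mean())       # shifted above 5
print("Var  of 2X - Y with truncated X:", Z.var(ddof=1))  # below 20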
Potential Pitfalls and Edge Cases:
If either X or Y is truncated, the resulting distribution of 2X - Y will not strictly follow a normal distribution, because truncated normal variables are no longer normal in the usual sense.
Censoring can bias both the mean and variance estimates, potentially leading to an underestimation or overestimation of variance for Z = 2X - Y.
Care must be taken to use specialized methods (like maximum likelihood estimation for truncated normals or survival analysis techniques) to properly account for the missing or censored regions.
How would outliers or heavy tails in X or Y affect the distribution of 2X - Y?
In theory, if X and Y are truly normal, outliers in the data might simply be rare draws from the tails. However, real data might have heavier tails than a normal distribution.
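For instance, a sketch using Student-t variables with 5 degrees of freedom as an arbitrary heavy-tailed stand-in:

import numpy as np
from scipy.stats import kurtosis

rng = np.random.default_rng(9)
N = 1_000_000

# Heavy-tailed stand-ins: shifted and scaled Student-t with 5 degrees of freedom
X = 3 + 2 * rng.standard_t(df=5, size=N)
Y = 1 + 2 * rng.standard_t(df=5, size=N)
Z = 2 * X - Y

# Excess kurtosis is 0 for a normal distribution; heavy tails push it above 0
print("Excess kurtosis of 2X - Y:", kurtosis(Z))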
Potential Pitfalls and Edge Cases:
A small number of extreme values in X or Y can create large fluctuations in 2X - Y since we are combining them linearly. This can significantly impact the sample variance.
If X or Y follow a distribution with heavier tails, the sum or difference is likely to exhibit even heavier tails, breaking the normal assumption.
Robust statistical techniques (e.g., using trimmed means or winsorizing outliers) might be necessary if the data has real outliers that do not conform to the hypothesized normal distribution.
What if we needed the conditional distribution of 2X - Y given X or given Y?
In some scenarios, one might need P(2X - Y <= z | X = x0) or similar. Because 2X - Y = 2x0 - Y once X = x0 is fixed, the conditional distribution depends solely on Y with a shift of 2x0; in our example, Y ~ N(1, 4), so the conditional distribution of 2X - Y given X = x0 is N(2x0 - 1, 4).
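A quick simulation check (the conditioning value x0 = 2 is arbitrary):

import numpy as np

rng = np.random.default_rng(11)
x0 = 2.0

# Conditional on X = x0, Z = 2*x0 - Y with Y ~ N(1, 4)
Z_cond = 2 * x0 - rng.normal(1, 2, size=1_000_000)
print("Conditional mean:", Z_cond.mean())       # close to 2*x0 - 1 = 3
print("Conditional var: ", Z_cond.var(ddof=1))  # close to 4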
Potential Pitfalls and Edge Cases:
If X and Y are not truly independent, conditioning on one impacts the distribution of the other. Then one needs to use the joint distribution of (X, Y).
In real-world problems, partial knowledge about one variable might be uncertain or itself estimated, requiring further Bayesian or frequentist adjustments for that uncertainty.
If Y has a distribution that deviates from normal, the conditional distribution of 2X - Y would not be a simple shift. Additional modeling is needed.
Could 2X - Y serve as a proxy for some real-world measurement, and what could go wrong?
2X - Y might represent a practical calculation, such as net profit (two sources of revenue minus one cost) or combined sensor readings (two measurements from one sensor minus an offset from another).
Potential Pitfalls and Edge Cases:
If the real-world process generating X or Y changes distribution over time (non-stationary processes), the assumption of stable means and variances breaks down, rendering the old estimates invalid.
X and Y might depend on each other through external factors, even if direct correlation is low. This hidden dependency can bias the distribution of 2X - Y.
If there is any non-linear transformation in the real measurement process, the linear approach might misrepresent the data.
How might we construct confidence intervals for 2X - Y in practice?
To build an interval for Z = 2X - Y, we would typically use the normal distribution with the mean and variance estimated from sample data. For example, a 95% interval for a single value of Z (a prediction interval) is
2mX - mY ± 1.96 * sqrt(4sX^2 + sY^2)
assuming samples large enough for the normal approximation. A confidence interval for the mean E(2X - Y) instead scales each variance by its sample size: 2mX - mY ± 1.96 * sqrt(4sX^2/nX + sY^2/nY).
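A sketch computing both intervals (simulated data stands in for real observations; sample sizes are arbitrary):

import numpy as np

rng = np.random.default_rng(13)
x = rng.normal(3, 2, size=500)
y = rng.normal(1, 2, size=500)

mX, mY = x.mean(), y.mean()
sX2, sY2 = x.var(ddof=1), y.var(ddof=1)
center = 2 * mX - mY

# ~95% interval for a single future value of Z = 2X - Y
half_pred = 1.96 * np.sqrt(4 * sX2 + sY2)
print("Prediction interval:", (center - half_pred, center + half_pred))

# ~95% confidence interval for the mean E(2X - Y)
half_ci = 1.96 * np.sqrt(4 * sX2 / len(x) + sY2 / len(y))
print("CI for the mean:   ", (center - half_ci, center + half_ci))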
Potential Pitfalls and Edge Cases:
If sample sizes are small, critical values from the t-distribution are more appropriate than 1.96, though the effective degrees of freedom for a combination of two samples must be approximated (e.g., via Welch–Satterthwaite).
Failure to account for sampling error or correlation (if X and Y are not truly independent) can make these confidence intervals too narrow or too broad.
If the variance estimates are skewed due to outliers, the resulting confidence intervals might fail to cover the true parameter more often than the nominal rate (e.g., 95%).
If we wanted to simulate from the distribution of 2X - Y, how would we proceed?
If X and Y are truly normal and independent, one can draw samples of X and Y from their respective normal distributions (N(3, 4) and N(1, 4)) and compute 2X - Y for each pair. This generates an empirical sample of 2X - Y. Alternatively, one could directly sample from N(5, 20) since the mean and variance are known.
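Both routes in a few lines (a sketch; the fixed seed is only for reproducibility):

import numpy as np

rng = np.random.default_rng(17)
N = 1_000_000

# Route 1: sample X and Y separately, then combine
Z1 = 2 * rng.normal(3, 2, size=N) - rng.normal(1, 2, size=N)

# Route 2: sample directly from N(5, 20)
Z2 = rng.normal(5, np.sqrt(20), size=N)

print("Route 1 mean/var:", Z1.mean(), Z1.var(ddof=1))
print("Route 2 mean/var:", Z2.mean(), Z2.var(ddof=1))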
Potential Pitfalls and Edge Cases:
Programming errors or misuse of pseudo-random number generators can lead to subtle biases, especially if random seeds or distribution parameters are not set correctly.
If the real-world distributions deviate from normality, the synthetic data from a perfect normal will fail to capture extreme events.
Memory and computational constraints can arise for large-scale simulations or extremely large sample sizes.