Propagation of Error in a Non-Linear Equation

  • tweek
    SBR Hustler
    • 02-17-09
    • 60

    #1
    Propagation of Error in a Non-Linear Equation
    Say I have a non-linear equation, take y = a^2 + b^2 as an example.

    If I know the variance on a and b, what is the variance on y?

    I know that generally, sigma_y^2 = (dy/da)^2*sigma_a^2 + (dy/db)^2*sigma_b^2. But, what I don't understand is with this nonlinear equation, my partial derivatives are still a function of a and b respectively. So, if I have:

    sigma_y^2 = 4a^2*sigma_a^2 + 4b^2*sigma_b^2 (assuming covariance is 0),

    what do I use for "a" and "b" when all I have are the variances?
    Last edited by tweek; 06-11-09, 04:02 PM.
  • tweek
    SBR Hustler
    • 02-17-09
    • 60

    #2
Ok, so after a little bit more research, it looks like you use the mean of either "a" or "b". And, while this doesn't really apply to sports handicapping, it raises the question of propagation of error for zero-mean random variables. If a and b are both zero-mean RV's with non-zero variance, clearly y will be zero mean but have non-zero variance. So, how is the variance of y calculated in this case?
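The plug-in-the-means rule can be sanity-checked by simulation. A minimal sketch, assuming independent normal a and b with made-up means and variances, evaluating the partial derivatives at the means:

```python
import numpy as np

# Sketch: delta-method variance for y = a^2 + b^2, plugging the means of a
# and b into the partial derivatives. All means/variances are made-up values.
rng = np.random.default_rng(0)
mu_a, mu_b = 3.0, 4.0
var_a, var_b = 0.04, 0.09

# sigma_y^2 ~= (2*mu_a)^2 * var_a + (2*mu_b)^2 * var_b
var_y_approx = 4 * mu_a**2 * var_a + 4 * mu_b**2 * var_b

# Monte Carlo check with independent normal a and b.
a = rng.normal(mu_a, np.sqrt(var_a), 1_000_000)
b = rng.normal(mu_b, np.sqrt(var_b), 1_000_000)
var_y_mc = np.var(a**2 + b**2)
print(var_y_approx, var_y_mc)
```

The first-order approximation drops the higher-order (sigma^4) terms, so the two numbers agree closely when the variances are small relative to the means.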
    Comment
    • Ganchrow
      SBR Hall of Famer
      • 08-28-05
      • 5011

      #3
      Originally posted by tweek
      If a and b are both zero mean RV's with non-zero variance, clearly y will be zero mean
      In the example you gave in the first post y would clearly not have zero mean (for real A & B, non-zero variance for either A or B).
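For instance, a quick Monte Carlo sketch (zero-mean normal a and b, with variances assumed for illustration) shows E(y) landing at Var(a) + Var(b) rather than zero:

```python
import numpy as np

# For zero-mean a and b, y = a^2 + b^2 is not zero-mean:
# E(y) = Var(a) + Var(b). Variances below are assumed for illustration.
rng = np.random.default_rng(1)
var_a, var_b = 2.0, 3.0
a = rng.normal(0.0, np.sqrt(var_a), 1_000_000)
b = rng.normal(0.0, np.sqrt(var_b), 1_000_000)
y = a**2 + b**2
print(y.mean())  # should land near var_a + var_b = 5.0
```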

In general there is no closed-form solution to your original question. In the simple example you gave with random variable a, however:

Let M = E(a)
Let V = Var(a)
Let X = a^2

We know that for any random variable X:
Var(X) = E(X^2) - E(X)^2
= E(a^4) - E(a^2)^2

But E(a^2) = V + M^2

So we have:
Var(X) = E(a^4) - (V + M^2)^2

Where E(a^4) is the 4th moment about the origin, a concept related to kurtosis.

So if we assume M = 0, then:
Var(X) = (Kurt(a)+3)*V^2 - V^2
= (2+Kurt(a)) * V^2

where Kurt(a) refers to the excess kurtosis of the (population) random variable a.

So for Y = a^2 + b^2, with E(a) = E(b) = 0 and Cov(a,b) = 0:

Var(Y) = (2+Kurt(a)) * Var(a)^2 + (2+Kurt(b)) * Var(b)^2
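A Monte Carlo sanity check of that last formula, assuming zero-mean normal a and b (excess kurtosis 0, so Var(Y) reduces to 2*Var(a)^2 + 2*Var(b)^2; the variances are made-up values):

```python
import numpy as np

# Check Var(Y) = (2+Kurt(a))*Var(a)^2 + (2+Kurt(b))*Var(b)^2 by simulation.
# For normal a and b the excess kurtosis is 0, so Var(Y) = 2*va^2 + 2*vb^2.
rng = np.random.default_rng(2)
va, vb = 1.5, 2.5
a = rng.normal(0.0, np.sqrt(va), 2_000_000)
b = rng.normal(0.0, np.sqrt(vb), 2_000_000)
var_y_formula = 2 * va**2 + 2 * vb**2
var_y_mc = np.var(a**2 + b**2)
print(var_y_formula, var_y_mc)
```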
      Comment
      • Ganchrow
        SBR Hall of Famer
        • 08-28-05
        • 5011

        #4
        Originally posted by Ganchrow
So for Y = a^2 + b^2, with E(a) = E(b) = 0 and Cov(a,b) = 0:

Var(Y) = (2+Kurt(a)) * Var(a)^2 + (2+Kurt(b)) * Var(b)^2
In case anyone's tried this in Excel on a given data set and is wondering why the values don't match, recall that while we are using population kurtosis here, Excel's KURT() function calculates sample kurtosis, defined as:
G2 = [n(n+1) / ((n-1)(n-2)(n-3))] * Σ((x_i - x̄)/s)^4 - 3(n-1)^2 / ((n-2)(n-3))
where s represents the sample standard deviation.

By the way ... any guesses on the over/under for how many posters, including both myself and the OP, will actually read all this in its entirety? I'd go with 1.5.
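To see the gap between the two estimators in code rather than in Excel, here's a sketch computing both on the same made-up sample with numpy:

```python
import numpy as np

# Population vs sample (Excel-style) excess kurtosis on the same data,
# which is why a spreadsheet check of the formula above won't match exactly.
rng = np.random.default_rng(3)
x = rng.normal(size=50)
n = len(x)

# Population excess kurtosis g2 (central moments, divide by n).
m2 = np.mean((x - x.mean())**2)
m4 = np.mean((x - x.mean())**4)
g2 = m4 / m2**2 - 3

# Excel's KURT(): bias-corrected sample excess kurtosis G2.
s = x.std(ddof=1)
G2 = (n*(n+1) / ((n-1)*(n-2)*(n-3))) * np.sum(((x - x.mean())/s)**4) \
     - 3*(n-1)**2 / ((n-2)*(n-3))
print(g2, G2)
```

The two are related by the standard identity G2 = [(n+1)*g2 + 6] * (n-1) / [(n-2)(n-3)], so the difference shrinks as n grows.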
        Comment
        • tweek
          SBR Hustler
          • 02-17-09
          • 60

          #5
          Originally posted by Ganchrow
          In the example you gave in the first post y would clearly not have zero mean (for real A & B, non-zero variance for either A or B).
          Whoops... you're right.

          Originally posted by Ganchrow
In general there is no closed-form solution to your original question. In the simple example you gave with random variable a, however:

Let M = E(a)
Let V = Var(a)
Let X = a^2

We know that for any random variable X:
Var(X) = E(X^2) - E(X)^2
= E(a^4) - E(a^2)^2
But E(a^2) = V + M^2

So we have:
Var(X) = E(a^4) - (V + M^2)^2
Where E(a^4) is the 4th moment about the origin, a concept related to kurtosis.

So if we assume M = 0, then:
Var(X) = (Kurt(a)+3)*V^2 - V^2
= (2+Kurt(a)) * V^2

where Kurt(a) refers to the excess kurtosis of the (population) random variable a.
So for Y = a^2 + b^2, with E(a) = E(b) = 0 and Cov(a,b) = 0:

Var(Y) = (2+Kurt(a)) * Var(a)^2 + (2+Kurt(b)) * Var(b)^2
Ok... I think I follow that analysis. I guess what I'm really after is the propagation of standard deviation for the Pythagorean equation: Y = a^2 / (a^2 + b^2). In practice, a and b will not be zero mean. I should be able to work through the analysis... I'll give it a shot.

Thanks for pointing me in the right direction.
          Comment
          • tweek
            SBR Hustler
            • 02-17-09
            • 60

            #6
Quick question - while the variance of a sum of RV's is the sum of the variances of the RV's, how does the variance combine under multiplication and/or division?

            If we assume they're independent, for multiplication we get:

            Var(XY) = E[(XY)^2] - E[XY]^2
            = E[X^2] E[Y^2] - E[X]^2 E[Y]^2
            = E[X^2] E[Y^2] - E[X]^2 E[Y^2] + E[X]^2 E[Y^2] - E[X]^2 E[Y]^2
            = Var(X) E[Y^2] + E[X]^2 Var(Y)
            = Var(X) Var(Y) + Var(X) E[Y]^2 + E[X]^2 Var(Y)

            Is this it? What about division?
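That product formula is easy to verify by simulation. A sketch with independent normal X and Y, means and variances assumed for illustration:

```python
import numpy as np

# Check Var(XY) = Var(X)Var(Y) + Var(X)E[Y]^2 + E[X]^2 Var(Y) for
# independent X and Y. Means/variances below are assumed values.
rng = np.random.default_rng(4)
mx, vx = 2.0, 1.0
my, vy = -1.0, 0.25
x = rng.normal(mx, np.sqrt(vx), 2_000_000)
y = rng.normal(my, np.sqrt(vy), 2_000_000)
var_formula = vx * vy + vx * my**2 + mx**2 * vy
var_mc = np.var(x * y)
print(var_formula, var_mc)
```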
            Last edited by tweek; 06-11-09, 06:11 PM.
            Comment
            • Ganchrow
              SBR Hall of Famer
              • 08-28-05
              • 5011

              #7
              Originally posted by tweek
Quick question - while the variance of a sum of RV's is the sum of the variances of the RV's, how does the variance combine under multiplication and/or division?

              If we assume they're independent, for multiplication we get:

              Var(XY) = E[(XY)^2] - E[XY]^2
              = E[X^2] E[Y^2] - E[X]^2 E[Y]^2
              = E[X^2] E[Y^2] - E[X]^2 E[Y^2] + E[X]^2 E[Y^2] - E[X]^2 E[Y]^2
              = Var(X) E[Y^2] + E[X]^2 Var(Y)
              = Var(X) Var(Y) + Var(X) E[Y]^2 + E[X]^2 Var(Y)

              Is this it? What about division?
              Bingo.

And FWIW, independence, while sufficient for the above, isn't strictly necessary. It's enough that E[XY] = E[X]E[Y] and E[X^2 Y^2] = E[X^2]E[Y^2], i.e., that the variables and their squares are both uncorrelated.

              To my knowledge there's no similar equality for the quotient of two random variables.
              Comment
              • tweek
                SBR Hustler
                • 02-17-09
                • 60

                #8
                Originally posted by Ganchrow
                Bingo.

                And FWIW, independence, while sufficient for the above isn't necessary. That the variables are uncorrelated is the only necessary condition for the above to hold.

                To my knowledge there's no similar equality for the quotient of two random variables.
                Yeah... just tried to work through it without a lot of luck.

                So, that being the case, do you have any insight as to working through the calculation of

Var(X^2 / (X^2 + Y^2))?
                Comment
                • Wrecktangle
                  SBR MVP
                  • 03-01-09
                  • 1524

                  #9
                  This is the stuff we need to see to scare off the room-temp IQ folks that Monkey was railing about...
                  Comment
                  • Ganchrow
                    SBR Hall of Famer
                    • 08-28-05
                    • 5011

                    #10
                    Originally posted by tweek
                    Yeah... just tried to work through it without a lot of luck.

                    So, that being the case, do you have any insight as to working through the calculation of

Var(X^2 / (X^2 + Y^2))?
All that immediately springs to mind is that if X and Y represent points scored and are both of sufficiently large magnitude, then they can be very roughly approximated with Poisson distributions (to varying degrees of accuracy) with λ_X and λ_Y equal to the expected values of X and Y, respectively.

These in turn could be approximated by two Gaussian distributions, each with mean and variance equal to the appropriate λ.

Standardize each and you'd have a χ^2 distribution with 1 d.o.f. for your numerator and 2 d.o.f. for your denominator, the quotient of which should follow an F-distribution.

In other words, my immediate reaction would be that the Pythagorean expectation of the squares of the standard scores of our two point distributions should follow a roughly F(1,2) distribution.

                    How (if at all) this might help with your question I couldn't readily say. Perhaps, however, it'll get you thinking along the right lines.
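Absent a closed form, a Monte Carlo sketch along the lines described above at least gives numbers to work with. The λ's here are assumed average scores, not values from the thread:

```python
import numpy as np

# Monte Carlo sketch of Var(X^2 / (X^2 + Y^2)) under the Poisson
# approximation described above. lam_x and lam_y are assumed averages.
rng = np.random.default_rng(5)
lam_x, lam_y = 24.0, 20.0
x = rng.poisson(lam_x, 1_000_000).astype(float)
y = rng.poisson(lam_y, 1_000_000).astype(float)
pythag = x**2 / (x**2 + y**2)
print(pythag.mean(), pythag.var())
```

The mean sits a bit above 0.5 (since lam_x > lam_y) and the variance is small, which is consistent with the ratio being bounded between 0 and 1.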
                    Comment
                    • MrLuckyPants
                      SBR Hustler
                      • 02-25-09
                      • 54

                      #11
                      Originally posted by Wrecktangle
                      This is the stuff we need to see to scare off the room-temp IQ folks that Monkey was railing about...
                      Runs for the hills!
                      Comment