Variance
Variance is Standard Deviation squared, i.e. $\sigma^2$. I just realized that this is kind of the formula for Squared Euclidean Distance, which came up when talking about the Curse of Dimensionality and what happens as the dimensionality increases.
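From the Standard Deviation formula, I guess we can say the variance can also be expressed as
$$\sigma^2 = \frac{1}{N} \sum_{i=1}^{N} (x_i - \mu)^2$$
where $\sigma^2$ represents the variance of a population of size $N$ with mean $\mu$.
If you are looking for the variance of a sample, this would be given by
$$s^2 = \frac{1}{n-1} \sum_{i=1}^{n} (x_i - \bar{x})^2$$
where $\bar{x}$ is the sample mean.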
I think the variance of a Distribution and the variance of a sample are very different. I don’t know how the latter is derived.
- $\operatorname{Var}(X) = E[X^2] - (E[X])^2$, i.e. the “average of squares” minus the square of the average
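The "average of squares minus square of the average" identity can be checked numerically. This is only a minimal sketch: the data values are made up, the list is treated as a whole population (dividing by $N$), and the helper names `variance_by_definition` and `variance_by_moments` are hypothetical.

```python
from statistics import fmean

def variance_by_definition(xs):
    """Population variance: mean of squared deviations from the mean."""
    mu = fmean(xs)
    return fmean([(x - mu) ** 2 for x in xs])

def variance_by_moments(xs):
    """Population variance as E[X^2] - (E[X])^2."""
    return fmean([x * x for x in xs]) - fmean(xs) ** 2

data = [2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0]  # made-up example values

# Both routes should agree up to floating-point rounding.
print(variance_by_definition(data))
print(variance_by_moments(data))
```

The moments form needs only running totals of $x$ and $x^2$, but it subtracts two potentially large, nearly equal numbers, so it can lose precision when the mean is large relative to the spread.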
Properties:
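- If $c$ is a constant, then $\operatorname{Var}(cX) = c^2 \operatorname{Var}(X)$ and $\operatorname{Var}(X + c) = \operatorname{Var}(X)$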
- Standard Deviation
- see Covariance
Incremental Variance
Learning about this as I am on Ericsson’s team for an AI study. The problem is that under memory constraints, we cannot store all of the data and then compute the variance afterwards using the standard equation.
We want a recurrence relation. https://math.stackexchange.com/questions/374881/recursive-formula-for-variance
https://en.wikipedia.org/wiki/Algorithms_for_calculating_variance
I used Welford’s Online Algorithm; just check out the Wikipedia article.
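For reference, here is a minimal sketch of Welford's update following the Wikipedia description. It is not the code from the study itself; the class name `RunningVariance` and its method names are hypothetical, and the input values are made up. Only three numbers are kept in memory no matter how many values stream in.

```python
class RunningVariance:
    """Welford's online algorithm: one pass over the data, O(1) memory."""

    def __init__(self):
        self.count = 0    # number of values seen so far
        self.mean = 0.0   # running mean
        self.m2 = 0.0     # running sum of squared deviations from the mean

    def update(self, x):
        """Fold one new value into the running statistics."""
        self.count += 1
        delta = x - self.mean              # deviation from the old mean
        self.mean += delta / self.count
        self.m2 += delta * (x - self.mean)  # old deviation times new deviation

    def population_variance(self):
        """Divide by n."""
        return self.m2 / self.count if self.count > 0 else float("nan")

    def sample_variance(self):
        """Divide by n - 1."""
        return self.m2 / (self.count - 1) if self.count > 1 else float("nan")


rv = RunningVariance()
for x in [2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0]:  # made-up stream of values
    rv.update(x)

print(rv.mean, rv.population_variance(), rv.sample_variance())
```

Each `update` only touches `count`, `mean`, and `m2`, so the values themselves never need to be stored.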
Sample Variance vs. Population Variance
The act of dividing by $n-1$ instead of $n$ to calculate the sample variance is known as Bessel’s Correction.
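A small sketch of the two divisors side by side, assuming a made-up data list. The hand-computed values are compared against Python's standard library, where `statistics.pvariance` divides by $n$ and `statistics.variance` divides by $n-1$.

```python
import statistics

data = [2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0]  # made-up example values
n = len(data)
mean = sum(data) / n
squared_deviations = sum((x - mean) ** 2 for x in data)

population_var = squared_deviations / n    # population variance: divide by n
sample_var = squared_deviations / (n - 1)  # Bessel's correction: divide by n - 1

# The standard library exposes both conventions.
print(population_var, statistics.pvariance(data))
print(sample_var, statistics.variance(data))
```

The sample variance is always the larger of the two by a factor of $n/(n-1)$, so the difference shrinks as $n$ grows.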