Independence of Attributes

This is actually quite interesting and maybe very useful for my Poker AI.

It’s this idea that if you have two attributes, how do you know they are independent?

  • For example, having a Modern Vaccine and a Placebo Vaccine, how do you know the Moderna vaccine is better?
  • (iffy) Having a Poker AI play against a human, how do you know the Poker AI is better than them?

The idea is that we can do Hypothesis Testing to check if data is independent, where

  • : Attributes are independent
  • : Attributes are not independent

Steps

Step 1: Calculate Expected Table

  • You can derive the formula above, actually it’s more intuitive to calculate as , which simplifies to the equation above Step 2: Calculate

where

  • = observed
  • = expected
  • a = # rows
  • b = # columns

Step 3: Calculate p-value, where

We use the p-value to come to conclusions about and (see p-value to come to conclusions about and (see Hypothesis Testing).

Equality of Proportions

Testing for independence of attributes is the same as testing the equality of proportions. The proportions should be the same if they are independent.