Bayes’ Theorem

Bayes’ Rule is an application of the LOTP (Law of Total Probability) together with the Conditional Probability rule.

Bayes’ theorem describes the probability of an event based on prior knowledge of conditions that might be related to the event.

Let $B_1, \dots, B_n$ be a partition of the sample space $S$ and $A$ be any event with $P(A) > 0$, then

$$P(B_i \mid A) = \frac{P(A \mid B_i)\,P(B_i)}{\sum_{j=1}^{n} P(A \mid B_j)\,P(B_j)}$$

where $P(B_i)$ is the Prior Probability, $P(B_i \mid A)$ is the Posterior Probability, and $P(A \mid B_i)$ is the Likelihood.

How is Bayes’ rule derived? Apply the basic rule of Conditional Probability, and leverage the fact that AND (intersection) is commutative. Since $P(A \cap B) = P(B \cap A)$, we have $P(A \mid B)\,P(B) = P(B \mid A)\,P(A)$. Therefore, you can simplify to

$$P(A \mid B) = \frac{P(B \mid A)\,P(A)}{P(B)}$$

You have one fair coin and a biased coin that lands on heads with a probability of 3/4. A coin is chosen at random and tossed three times. If we observe three heads in a row, what is the probability that the fair coin was chosen?

Solution: Let $F$ be the event that the fair coin was chosen, $B$ the event that the biased coin was chosen, and $H$ the event that 3 heads are observed in 3 tosses.

We want to find $P(F \mid H)$, so we can use Bayes’ Theorem, and calculate

$$P(F \mid H) = \frac{P(H \mid F)\,P(F)}{P(H \mid F)\,P(F) + P(H \mid B)\,P(B)} = \frac{\left(\tfrac{1}{2}\right)^3 \cdot \tfrac{1}{2}}{\left(\tfrac{1}{2}\right)^3 \cdot \tfrac{1}{2} + \left(\tfrac{3}{4}\right)^3 \cdot \tfrac{1}{2}} = \frac{1/16}{1/16 + 27/128} = \frac{8}{35} \approx 0.23$$
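A quick numerical check of this answer (a minimal sketch; the variable names are my own):

```python
# Numerical check of the fair-vs-biased coin example above.
p_heads_fair = 0.5       # P(heads) for the fair coin
p_heads_biased = 0.75    # P(heads) for the biased coin
p_fair = p_biased = 0.5  # each coin is chosen at random

# Likelihood of three heads in a row under each coin
likelihood_fair = p_heads_fair ** 3      # (1/2)^3 = 1/8
likelihood_biased = p_heads_biased ** 3  # (3/4)^3 = 27/64

# Bayes' theorem: posterior = prior * likelihood / evidence
evidence = likelihood_fair * p_fair + likelihood_biased * p_biased
posterior_fair = likelihood_fair * p_fair / evidence

print(posterior_fair)  # 0.22857... == 8/35
```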

Relation to more advanced control theory / ML

This is why we say (from Kalman Filter in Python) that

$$\text{posterior} = \frac{\text{likelihood} \times \text{prior}}{\text{normalization}}$$

You just continuously update the posterior, setting the prior to the old posterior every time a new measurement arrives (see the sketch below).

See Kalman Filter.
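Here is a minimal sketch of that recursive update, reusing the two-coin example above (the function and variable names are my own, not from the Kalman Filter text):

```python
# Recursively update P(fair coin) as flips arrive: each step's
# posterior becomes the next step's prior.
def update(prior_fair, flip, p_fair=0.5, p_biased=0.75):
    """One Bayes update. `flip` is 'H' or 'T'."""
    like_fair = p_fair if flip == 'H' else 1 - p_fair
    like_biased = p_biased if flip == 'H' else 1 - p_biased
    evidence = like_fair * prior_fair + like_biased * (1 - prior_fair)
    return like_fair * prior_fair / evidence  # new posterior P(fair | flip)

belief = 0.5  # prior: coin picked at random
for flip in "HHH":
    belief = update(belief, flip)  # posterior becomes next prior
    print(f"after {flip}: P(fair) = {belief:.3f}")
# Final belief matches the 8/35 ≈ 0.229 computed above.
```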

Bayes’ theorem is super useful because it turns a hard problem into an easy problem.

Hard problems:

  • P(Cancer = True | Test = Positive)
  • P(Rain = True | Readings)

Stated like that, the problems seem unsolvable.

Easy problems:

  • P(Test = Positive | Cancer = True)
  • P(Readings | Rain = True)

Bayes’ Theorem lets us solve the hard problem by solving the easy problem.
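For instance, a sketch of the cancer-test case (prevalence, sensitivity, and false-positive rate below are illustrative assumptions, not real clinical data):

```python
# Illustrative numbers only (assumptions, not real data).
p_cancer = 0.01               # prior: P(Cancer = True)
p_pos_given_cancer = 0.9      # easy: P(Test = Positive | Cancer = True)
p_pos_given_no_cancer = 0.05  # easy: P(Test = Positive | Cancer = False)

# Evidence via the LOTP, then Bayes' theorem for the "hard" direction.
p_pos = (p_pos_given_cancer * p_cancer
         + p_pos_given_no_cancer * (1 - p_cancer))
p_cancer_given_pos = p_pos_given_cancer * p_cancer / p_pos

print(p_cancer_given_pos)  # ~0.154: P(Cancer = True | Test = Positive)
```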

Bayes’ Rule with Conditioning

From CS287.
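A sketch of the standard statement (the CS287 notes’ exact notation may differ): every term in Bayes’ rule is conditioned on an extra event $C$,

$$P(A \mid B, C) = \frac{P(B \mid A, C)\,P(A \mid C)}{P(B \mid C)}$$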