Bayesian Decision Theory
Let's say we have an equation for the probability of a piece of data belonging to Class 1: p1(x,y) and an equation for the data belonging to Class 2: p2(x,y). To classify a new measurement with features (x,y), we can use the below rules:
That’s Bayesian decision theory in a nutshell: choosing the decision with the highest probability
Classifying with Conditional Probabilities
The two rules above don't tell the whole story however, the more precise version is:
And we can use Bayes' rule to determine p($c_{i}$|x,y)