Hoeffding's inequality (valid for 1 experiment), probability that the sample has a meaning for the population
$ P ( |\nu - \mu| > \epsilon) \leq 2 e^{-2 \epsilon^2 N} $ with:
$\mu = \nu$ is PAC (possibly approximetly correct)
Relate to learning, hypothesis function $h$ has a probability to be close to the target function $f$ ($h = f$ is PAC).
Notations:
If we perform well out of sample (ie. $E_{out}$ is small) then we must have really learn.
But we are actually trying to see performance through our hypothesis $h$, so these errors depends on $h$. Leading to new notations:
Q. What if the Hoeffding's inequality gives something trivial like p < 2?
A. Either data size insufficient, or tolerance to stringent.
In [ ]: