The main difference:
Previous models: directed graph $x \to z \to \tilde{x}$
RBM: $x -- z$
$$p(x,h) = \frac{\exp(-E(x,h))}{Z} = \frac{\exp(h^TWx + c^Tx + b^th)}{Z} \\ = \frac{\exp() \exp() \exp()}{Z}$$
Connection to physics and nature:
Differnet energy functions
$$p(x,h) = \frac{1}{Z} \prod \prod \exp() \times \prod \exp() \times \prod \exp()$$
$F(x)$ is the free energy.
This is called softplus(.). Softplus is a smooth version of ReLU.
Adding more layers: Deep Boltzmann Machines
Directed: $z \to x$ we model $p(x|z)$
Undirected: $z -- x$ we model $p(x,z)$ and $p(z)$
The directed version is easier, since we use some input.
The undirected graph should in theory give more accurate model since during the iterative process, we repeatedly make both $x$ and $z$ better.
In [ ]:
In [ ]:
In [ ]:
In [ ]: