Variational Autoencoder (VAE)

A VAE is a latent variable model trained with variational inference: instead of maximizing the intractable $\log p_\theta(x)$ directly, it maximizes the ELBO $\mathbb{E}_{q_\phi(z|x)}[\log p_\theta(x \mid z)] - D_{KL}(q_\phi(z \mid x)\,\|\,p(z)) \le \log p_\theta(x)$.

Intuition

An autoencoder that forces the latent space to be a well-behaved distribution (usually a unit Gaussian). The KL term pulls the encoder output toward that prior so sampling works; the reconstruction term pushes it to stay informative enough to rebuild the input. The reparameterization trick lets gradients flow through a random sample by pulling the randomness out of the computation graph as a fixed external noise input $\epsilon \sim \mathcal{N}(0, I)$.

What's the reparameterization trick buying us?

Sampling $z \sim \mathcal{N}(\mu, \sigma^2)$ from a learned distribution is non-differentiable, which breaks backprop. The trick rewrites the sample as $z = \mu + \sigma \odot \epsilon$ with $\epsilon \sim \mathcal{N}(0, I)$, so gradients flow through $\mu$ and $\sigma$ while randomness lives only in $\epsilon$.
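
A minimal PyTorch sketch of what this buys (the tensor names and toy shapes are illustrative, not from any particular implementation): sampling via `mu + std * eps` keeps `mu` and `log_var` inside the graph, so `.backward()` gives them gradients while `eps` carries all the randomness.

```python
import torch

# Hypothetical encoder outputs for a batch of 4 inputs, 2 latent dims
mu = torch.zeros(4, 2, requires_grad=True)
log_var = torch.zeros(4, 2, requires_grad=True)

# Reparameterization: z = mu + sigma * eps, with eps ~ N(0, I)
eps = torch.randn_like(mu)                  # randomness lives only here
z = mu + torch.exp(0.5 * log_var) * eps     # differentiable in mu, log_var

z.sum().backward()                          # gradients flow into mu and log_var
print(mu.grad.shape, log_var.grad.shape)    # torch.Size([4, 2]) twice
```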

I got the intuition for VAEs here https://chatgpt.com/share/693a58cf-e2e4-8002-9fea-eb7fad7817b1. A VAE tries to map the original distribution to a Gaussian and back, with both mappings encoded in the loss function.

This is a variant of the Autoencoder that is much more powerful, using distributions to represent features in its bottleneck.

Resources

It's basically an Autoencoder but we add Gaussian noise to latent variable z?

Key difference:

  • Regular Autoencoder: Input → Encoder → Fixed latent representation → Decoder → Reconstruction
  • VAE: Input → Encoder → Latent distribution → Sample from distribution (adds Gaussian noise via reparameterization trick) → Decoder → Reconstruction (a minimal sketch of this bottleneck difference follows)
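
A minimal PyTorch sketch of that bottleneck difference (the function names and shapes are hypothetical, just to contrast the two paths):

```python
import torch

def ae_bottleneck(h: torch.Tensor) -> torch.Tensor:
    # Regular autoencoder: the encoder output *is* the fixed latent code
    return h

def vae_bottleneck(mu: torch.Tensor, log_var: torch.Tensor) -> torch.Tensor:
    # VAE: the encoder output parameterizes a Gaussian; we sample from it
    eps = torch.randn_like(mu)                   # Gaussian noise
    return mu + torch.exp(0.5 * log_var) * eps   # reparameterized sample
```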

Variational autoencoders provide a principled framework for learning deep latent-variable models and corresponding inference models.

Process

Forward Pass (Encoding → Sampling → Decoding)

  1. Encoder:
    Input data $x$, outputs the parameters (mean and variance) of the latent distribution $q_\phi(z|x)$: $\mu_\phi(x)$ and $\sigma^2_\phi(x)$
  2. Reparametrization Trick: Differentiably sample latent variable $z$: $z = \mu_\phi(x) + \sigma_\phi(x) \odot \epsilon$ with $\epsilon \sim \mathcal{N}(0, I)$
  3. Decoder:
    Reconstruct data from the sampled latent vector $z$: $\hat{x} = \text{decoder}_\theta(z)$, i.e. the mean of $p_\theta(x|z)$ (a module sketch follows this list)
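
Putting the three steps together as a small PyTorch module. This is a minimal MLP sketch under assumed sizes (`x_dim`, `h_dim`, `z_dim` are made up for illustration), not a reference implementation.

```python
import torch
import torch.nn as nn

class VAE(nn.Module):
    def __init__(self, x_dim: int = 784, h_dim: int = 400, z_dim: int = 20):
        super().__init__()
        self.enc = nn.Sequential(nn.Linear(x_dim, h_dim), nn.ReLU())
        self.enc_mu = nn.Linear(h_dim, z_dim)        # mean of q(z|x)
        self.enc_logvar = nn.Linear(h_dim, z_dim)    # log-variance of q(z|x)
        self.dec = nn.Sequential(
            nn.Linear(z_dim, h_dim), nn.ReLU(),
            nn.Linear(h_dim, x_dim), nn.Sigmoid(),   # pixels in [0, 1]
        )

    def forward(self, x):
        h = self.enc(x)                              # 1. encode
        mu, log_var = self.enc_mu(h), self.enc_logvar(h)
        eps = torch.randn_like(mu)                   # 2. reparametrize
        z = mu + torch.exp(0.5 * log_var) * eps
        x_hat = self.dec(z)                          # 3. decode
        return x_hat, mu, log_var
```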

Loss Function (Negative ELBO):
Optimize encoder parameters $\phi$ and decoder parameters $\theta$ by minimizing:

$$\mathcal{L}(\theta, \phi; x) = -\,\mathbb{E}_{q_\phi(z|x)}\big[\log p_\theta(x \mid z)\big] + D_{KL}\big(q_\phi(z \mid x)\,\|\,p(z)\big)$$

Read the two terms as a tug-of-war. Reconstruction says “encode $x$ into a sharp $q_\phi(z|x)$ so the decoder can recover it”; KL says “but make $q_\phi(z|x)$ look like the prior $p(z)$, so I can sample $z$ from $p(z)$ at test time and still land on sensible values.” The balance is what makes the latent space both informative and samplable.
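
A sketch of that loss matching the formula above, assuming the outputs of the hypothetical `VAE` module sketched earlier (binary cross-entropy over [0, 1] pixels stands in for $-\log p_\theta(x|z)$; swap in MSE for a Gaussian decoder):

```python
import torch
import torch.nn.functional as F

def vae_loss(x_hat, x, mu, log_var):
    # Reconstruction term: -E_q[log p(x|z)], here a per-pixel Bernoulli
    recon = F.binary_cross_entropy(x_hat, x, reduction="sum")
    # KL(q(z|x) || N(0, I)) in closed form for a diagonal Gaussian
    kl = 0.5 * torch.sum(mu.pow(2) + log_var.exp() - log_var - 1)
    return recon + kl   # negative ELBO (up to constants)
```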

Notes from the guide

The VAE can be viewed as two coupled, but independently parameterized models:

  1. encoder $q_\phi(z|x)$ (recognition model)
  2. decoder $p_\theta(x|z)$ (generative model)

Motivation

We want to maximize the log-likelihood $\log p_\theta(x)$. To make this a generative process, we want it conditioned on some known probability distribution $p(z)$, so it becomes mapping the distribution $p(z)$ to the data distribution $p(x)$ (else it's just like an Autoencoder, always deterministic).

We expand out $p_\theta(x)$:

$$p_\theta(x) = \int p_\theta(x \mid z)\, p(z)\, dz$$

However, this is NOT tractable. Trying every single value of $z$ is infeasible, since $p_\theta(x|z)$ is implemented as a neural net and the integral has no closed form.

I'm confused on what is tractable and what is not tractable?

  • $p(z)$ is tractable - simple prior to generate $z$ (i.e. unit Gaussian)
  • $p_\theta(x|z)$ is tractable - simple neural net to generate $x$ conditioned on $z$
  • $p_\theta(x)$ is NOT tractable - need to integrate over all $z$
  • $p_\theta(z|x)$ is NOT tractable because applying Bayes' rule, $p_\theta(z|x) = p_\theta(x|z)\,p(z)\,/\,p_\theta(x)$, needs the intractable $p_\theta(x)$

So what do we do? We approximate the intractable posterior with $q_\phi(z|x)$ and maximize a tractable lower bound (ELBO) on the true log-likelihood.

We approximate $p_\theta(z|x)$ with $q_\phi(z|x)$ and then optimize $\theta, \phi$ by maximizing the ELBO.
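
The standard derivation, spelled out (Bayes' rule for $p_\theta(x)$, then multiply and divide by $q_\phi(z|x)$):

$$\log p_\theta(x) = \mathbb{E}_{q_\phi(z|x)}\!\left[\log \frac{p_\theta(x \mid z)\, p(z)}{p_\theta(z \mid x)}\right] = \underbrace{\mathbb{E}_{q_\phi(z|x)}\big[\log p_\theta(x \mid z)\big] - D_{KL}\big(q_\phi(z \mid x)\,\|\,p(z)\big)}_{\text{ELBO}} + \underbrace{D_{KL}\big(q_\phi(z \mid x)\,\|\,p_\theta(z \mid x)\big)}_{\ge\, 0}$$

Since the last KL term is non-negative, the ELBO is a lower bound on $\log p_\theta(x)$, and it only involves tractable pieces.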

Variants

Walkthrough (CS231n 2025 Lec 13)

Motivation: from (non-variational) autoencoder to VAE

A plain autoencoder learns features by training an encoder + decoder to reconstruct $x$ with L2 loss (no labels, useful for downstream tasks). But as a generative model it fails:

  • If we throw away the encoder and want to generate, we need a new $z$ from somewhere. Generating $z$ is no easier than generating $x$, since the encoder has squeezed $x$ into whatever weird-shaped manifold $z$ occupies
  • Fix: force $z$ to come from a known distribution (e.g. $\mathcal{N}(0, I)$) so we can sample it trivially

Probabilistic setup

  • Prior $p(z) = \mathcal{N}(0, I)$: simple, samplable
  • Decoder $p_\theta(x|z)$: a neural net outputting a mean $\hat{x}_\theta(z)$, so $p_\theta(x|z) = \mathcal{N}(\hat{x}_\theta(z), I)$, meaning maximizing log-likelihood under a Gaussian decoder equals minimizing L2 reconstruction (worked out right after this list)
  • Encoder $q_\phi(z|x) = \mathcal{N}(\mu_\phi(x), \mathrm{diag}(\sigma^2_\phi(x)))$: a neural net outputting mean + (diagonal) variance of the approximate posterior
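
A one-line check of that log-likelihood / L2 equivalence, assuming a unit-variance Gaussian decoder:

$$\log p_\theta(x \mid z) = \log \mathcal{N}\big(x;\, \hat{x}_\theta(z),\, I\big) = -\tfrac{1}{2}\,\lVert x - \hat{x}_\theta(z)\rVert^2 + \text{const}$$

so maximizing the reconstruction log-likelihood is exactly minimizing the L2 error between $x$ and the decoder mean.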

The marginal $p_\theta(x) = \int p_\theta(x \mid z)\, p(z)\, dz$ is intractable (can't integrate over all $z$). Bayes' rule gives $p_\theta(z \mid x) = p_\theta(x \mid z)\, p(z)\, /\, p_\theta(x)$, which is also intractable. So we introduce $q_\phi(z|x)$ and maximize the ELBO instead.

Training objective

Maximize the ELBO (equivalently, minimize its negative):

$$\mathcal{L}(\theta, \phi; x) = \mathbb{E}_{q_\phi(z|x)}\big[\log p_\theta(x \mid z)\big] - D_{KL}\big(q_\phi(z \mid x)\,\|\,p(z)\big)$$

Both terms are closed-form for Gaussians (the KL between two diagonal Gaussians has a known formula), except the expectation over $z \sim q_\phi(z|x)$, which we Monte-Carlo with the reparametrization trick $z = \mu + \sigma \odot \epsilon$, $\epsilon \sim \mathcal{N}(0, I)$.
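
For reference, that known formula, for a $d$-dimensional diagonal Gaussian posterior against the unit-Gaussian prior:

$$D_{KL}\big(\mathcal{N}(\mu, \mathrm{diag}(\sigma^2))\,\|\,\mathcal{N}(0, I)\big) = \tfrac{1}{2}\sum_{j=1}^{d}\big(\mu_j^2 + \sigma_j^2 - \log \sigma_j^2 - 1\big)$$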

Training loop

  1. $x \to$ encoder $\to \mu_\phi(x), \sigma_\phi(x)$
  2. Prior loss: KL-pull $q_\phi(z|x)$ toward $\mathcal{N}(0, I)$
  3. Sample $z = \mu + \sigma \odot \epsilon$ via reparametrization (so gradients flow through $\mu$ and $\sigma$)
  4. $z \to$ decoder $\to \hat{x}$
  5. Reconstruction loss: L2 between $x$ and $\hat{x}$ (a training-step sketch follows this list)
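
A hedged training-step sketch of this loop in PyTorch; `model` and `loss_fn` stand for the hypothetical `VAE` module and `vae_loss` function sketched earlier, and the data loader is assumed to yield (image, label) batches.

```python
import torch

def train_epoch(model, loader, loss_fn, optimizer, device="cpu"):
    model.train()
    for x, _ in loader:                          # labels unused (unsupervised)
        x = x.view(x.size(0), -1).to(device)     # flatten images to vectors
        x_hat, mu, log_var = model(x)            # encode -> sample z -> decode
        loss = loss_fn(x_hat, x, mu, log_var)    # reconstruction + KL prior loss
        optimizer.zero_grad()
        loss.backward()                          # gradients reach the encoder through z = mu + sigma*eps
        optimizer.step()
```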

The two losses fight each other

  • Reconstruction wants $\sigma \to 0$ and a unique $z$ for each $x$ (so the decoder can reconstruct deterministically, essentially a plain autoencoder)
  • Prior matching wants $\mu \to 0$ and $\sigma \to 1$ for every $x$ (so $q_\phi(z|x) \to \mathcal{N}(0, I)$ and sampling is easy)

The balance between them is what makes VAEs generative rather than just compressive.

Sampling

At inference, throw away the encoder. Sample $z \sim \mathcal{N}(0, I)$, run it through the decoder, get a new image.
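
A minimal sampling sketch, assuming the hypothetical `VAE` module from the earlier sketches (only the decoder is used):

```python
import torch

@torch.no_grad()
def sample_images(model, n: int = 16, z_dim: int = 20):
    z = torch.randn(n, z_dim)          # z ~ N(0, I), drawn from the prior
    x_hat = model.dec(z)               # decoder maps noise to image space
    return x_hat.view(n, 28, 28)       # e.g. reshape flattened MNIST outputs
```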

Disentangling

Because the prior has a diagonal covariance, the dimensions of $z$ are pushed toward being independent. Walking one dimension at a time often maps to a human-interpretable factor (digit identity, stroke width, pose). Kingma & Welling’s ICLR 2014 MNIST grid, varying $z_1$ vs $z_2$ to smoothly trace through digit classes and styles, is the canonical demo.
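
A latent-traversal sketch of the “walk one dimension at a time” idea, again assuming the hypothetical decoder from the sketches above:

```python
import torch

@torch.no_grad()
def traverse_dimension(model, dim: int, z_dim: int = 20, steps: int = 9):
    z = torch.zeros(steps, z_dim)                  # start at the prior mean
    z[:, dim] = torch.linspace(-3.0, 3.0, steps)   # sweep a single coordinate
    return model.dec(z)                            # decoded images vary along one factor
```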

From CS231n 2025 Lec 13 slides ~62-112 (non-variational AE recap, motivation for forcing a known prior, encoder/decoder Gaussian parameterization, log-likelihood/L2 equivalence, ELBO derivation via Bayes + multiply-by-$\tfrac{q_\phi(z|x)}{q_\phi(z|x)}$, training steps with KL prior / reparametrization / reconstruction, “losses fight” interpretation, sampling procedure, disentangling MNIST grid).