Autoregressive Model (AR)
Autoregressive models predict the next token in a sequence given the previous ones:

$$p(x_t \mid x_1, \dots, x_{t-1})$$
Examples:
- PixelRNN / PixelCNN (image)
- GPT, Transformer decoders (text)
- WaveNet (audio)
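All of the models above share the same generation loop: sample one token, feed it back in as context, repeat. A minimal sketch of that loop, using a made-up fixed table for $p(x_t \mid x_{t-1})$ over two tokens (the table and function names are illustrative, not from the source):

```python
import random

# Toy conditional distribution p(next | previous) over tokens {a, b}.
# A real model (PixelCNN, GPT, WaveNet) would compute this with a
# neural network conditioned on the full prefix.
cond = {
    "a": {"a": 0.2, "b": 0.8},
    "b": {"a": 0.7, "b": 0.3},
}

def sample_next(prev, rng):
    # Draw the next token from p(x_t | x_{t-1}).
    tokens, probs = zip(*cond[prev].items())
    return rng.choices(tokens, weights=probs, k=1)[0]

def generate(start, length, seed=0):
    # Autoregressive generation: each sampled token becomes context
    # for the next prediction, left to right.
    rng = random.Random(seed)
    seq = [start]
    for _ in range(length - 1):
        seq.append(sample_next(seq[-1], rng))
    return "".join(seq)
```

The loop is inherently sequential, which is why inference cost comes up again below for images.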
And it’s a term that seems to pop up everywhere. But I think all it really does is let you predict (regress) future data from past data. So isn’t this just a simple RNN?
They also mention this in the Neural Autoregressive Flow paper.
CS294
Assuming a fully expressive Bayes net structure, any joint distribution can be written as a product of conditionals
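The chain-rule claim can be checked numerically on a tiny example. A sketch with an assumed joint over two binary variables (the numbers are arbitrary, just for illustration):

```python
# Toy joint distribution over two binary variables (x1, x2).
joint = {
    (0, 0): 0.1, (0, 1): 0.3,
    (1, 0): 0.2, (1, 1): 0.4,
}

# Marginal p(x1) and conditional p(x2 | x1), derived from the joint.
p_x1 = {v: sum(p for (a, _), p in joint.items() if a == v) for v in (0, 1)}
p_x2_given_x1 = {(a, b): joint[(a, b)] / p_x1[a] for (a, b) in joint}

# Chain rule: p(x1, x2) = p(x1) * p(x2 | x1) holds for every outcome.
for (a, b), p in joint.items():
    assert abs(p - p_x1[a] * p_x2_given_x1[(a, b)]) < 1e-12
```

With a fully expressive structure the conditionals can represent any joint exactly; the modeling choice is how to parameterize them.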
Walkthrough (CS231n 2025 Lec 13)
Autoregressive is the tractable-density branch of the generative taxonomy — you can actually compute $p(x)$, so you can train directly via MLE:

$$\max_\theta \sum_i \log p_\theta(x^{(i)})$$

where $p_\theta$ is a neural network that parameterizes the density. The chain rule makes the joint factor into tokenwise conditionals:

$$p(x) = \prod_{t=1}^{T} p(x_t \mid x_1, \dots, x_{t-1})$$
Trained by predicting the next token — LLMs are autoregressive (RNNs or causally masked Transformer decoders).
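Concretely, the MLE objective decomposes into per-token log conditionals. A sketch using a toy bigram model as a stand-in for the neural network $p_\theta$ (corpus and names are made up for illustration):

```python
import math
from collections import Counter

# Toy corpus; a bigram model is the simplest autoregressive model:
# p(x_t | x_{<t}) is approximated by p(x_t | x_{t-1}).
corpus = "abababba"
pairs = Counter(zip(corpus, corpus[1:]))
unigrams = Counter(corpus[:-1])

def cond_prob(nxt, prev):
    # p(next token | previous token), estimated by counting.
    return pairs[(prev, nxt)] / unigrams[prev]

def log_likelihood(seq):
    # Chain rule: log p(x) = sum_t log p(x_t | x_{t-1}).
    return sum(math.log(cond_prob(n, p)) for p, n in zip(seq, seq[1:]))

# MLE training would maximize this quantity summed over the dataset;
# here we just evaluate it on one sequence.
ll = log_likelihood("abab")
```

An LLM does the same thing with a softmax over a large vocabulary and the full prefix as context; minimizing next-token cross-entropy is exactly maximizing this log-likelihood.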
Autoregressive models of images — PixelCNN
Treat an image as a 1D sequence of subpixels in raster / scanline order (top-left to bottom-right, R → G → B within each pixel). Each subpixel is an 8-bit integer, so the per-step prediction is a 256-way classification (softmax over pixel values) — this gives you $p(x)$ exactly.
Problem: 1024×1024 RGB = ~3M sequential subpixel predictions per image. Sequential generation at inference is brutal. Modern AR image models (VQ-VAE + AR, MaskGIT) fix this by modeling tiles / discrete tokens instead of individual subpixels.
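The cost and the ordering are easy to make concrete. A sketch of the raster-order indexing and the subpixel count (the helper name is mine, assuming row-major order with channels innermost, as in the scanline description above):

```python
# Raster/scanline order: top-left to bottom-right, R -> G -> B within
# each pixel; each subpixel is an 8-bit integer (256-way softmax).
H, W, C = 1024, 1024, 3

# Number of sequential prediction steps to generate one image.
num_subpixels = H * W * C
print(num_subpixels)  # 3145728, i.e. ~3M softmax steps at inference

def raster_index(row, col, channel):
    # Position of subpixel (row, col, channel) in the flattened
    # sequence, assuming row-major order with channels innermost.
    return (row * W + col) * C + channel
```

Token-based models cut the sequence length by orders of magnitude: a 32×32 grid of VQ-VAE codes is ~1K autoregressive steps instead of ~3M.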
Source
CS231n 2025 Lec 13 slides ~48–59 (MLE objective, chain rule, LLM autoregressive note, PixelRNN/CNN scanline + 256-way softmax, 3M-subpixel problem).