Expressive Policy
This is an idea that I stumbled upon while reading a paper written by Perry Dong.
EXPO: https://arxiv.org/pdf/2507.07986
| Policy Type | Expressiveness |
|---|---|
| Linear policy | Low |
| Shallow MLP (1 hidden layer) | Medium |
| Deep MLP (many hidden layers, nonlinearities) | High |
| Transformer policy | Very High |
| Diffusion-based policy | Extremely High |
🧨 This limits it to:
- Single-mode behaviors (deterministic or unimodal)