🛠️ Steven Gong

Search

Aug 30, 2025, 1 min read

#world-model

They use a VQ-VAE, and then train to predict in the latent space.

Why did they not use a VLM and do predictions in the latent space?

Graph View

Backlinks

No backlinks found

Created with Quartz, © 2026

Blog
LinkedIn
Twitter
GitHub