🛠️ Steven Gong

Oct 05, 2025, 1 min read

Training Agents Inside of Scalable World Models (DreamerV4)

Conceptually, it seems to combine GENIE (a generative world model) with SIMA (an agent / policy): the policy is trained inside the learned world model.

Project page: https://danijar.com/project/dreamer4/

Ideas used:

Flow Matching
Shortcut Model
Diffusion Forcing
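
A rough sketch of how these three ideas can fit together, not the paper's implementation. It assumes the common flow matching convention where a signal level tau = 0 is pure noise and tau = 1 is clean data. All names (`make_training_example`, `euler_sample`, `velocity_fn`) are hypothetical, and `velocity_fn` stands in for a learned network. Flow matching gives the interpolation and velocity target, diffusion forcing gives each frame its own signal level, and the shortcut idea shows up as passing the step size `d` to the model so it can take a few large steps when sampling.

```python
import random


def make_training_example(frames, rng):
    """Build noisy inputs and velocity targets for a sequence of clean frames.

    frames: list of frames, each a list of floats (assumed toy representation).
    Returns (noisy_frames, velocity_targets, signal_levels).
    """
    noisy, targets, levels = [], [], []
    for x1 in frames:
        # Diffusion forcing: every frame draws its own signal level.
        tau = rng.random()
        # Flow matching: interpolate between noise x0 and data x1.
        x0 = [rng.gauss(0.0, 1.0) for _ in x1]
        x_tau = [(1.0 - tau) * n + tau * c for n, c in zip(x0, x1)]
        # The regression target is the constant velocity from noise to data.
        v = [c - n for n, c in zip(x0, x1)]
        noisy.append(x_tau)
        targets.append(v)
        levels.append(tau)
    return noisy, targets, levels


def euler_sample(velocity_fn, x0, num_steps):
    """Integrate from noise (tau = 0) to data (tau = 1) with fixed step size.

    velocity_fn(x, tau, d) also receives the step size d, which is the extra
    conditioning a shortcut-style model would use (assumption for this sketch).
    """
    d = 1.0 / num_steps
    x = list(x0)
    tau = 0.0
    for _ in range(num_steps):
        v = velocity_fn(x, tau, d)
        x = [xi + d * vi for xi, vi in zip(x, v)]
        tau += d
    return x
```

With a perfect velocity prediction, a noisy frame plus `(1 - tau)` times its target velocity recovers the clean frame, which is a quick sanity check for the convention used here.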

They also appear to use PMPO (see Preference Optimization as Probabilistic Inference) for policy optimization.

Backlinks

Preference Optimization as Probabilistic Inference
Scaling Instructable Agents Across Many Simulated Worlds (SIMA)
