🛠️ Steven Gong

Oct 05, 2025, 1 min read

Training Agents Inside of Scalable World Models (DreamerV4)

Conceptually, it seems to combine Genie (a generative world model) with SIMA (an agent / policy).

Project page:

  • https://danijar.com/project/dreamer4/

Ideas used:

  • Flow Matching
  • Shortcut Model
  • Diffusion Forcing
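
To make the first and third ideas concrete, here is a minimal sketch of how a flow matching training pair could be built with a separate noise level per frame, in the spirit of diffusion forcing. It is an illustration under assumptions, not the paper's implementation: the function names, the list-of-lists frame representation, and the convention that tau = 0 is pure noise and tau = 1 is clean data are all chosen here. The shortcut model's step-size conditioning is not shown.

```python
import random

def flow_matching_pair(clean, rng):
    """Build noised frames and velocity targets for one clip.

    clean: list of frames, each frame a list of floats.
    Returns (taus, noised, velocity), one entry per frame.
    """
    taus, noised, velocity = [], [], []
    for frame in clean:
        # Diffusion-forcing style: each frame draws its own signal level.
        tau = rng.random()
        noise = [rng.gauss(0.0, 1.0) for _ in frame]
        # Flow matching: straight path from noise (tau=0) to data (tau=1).
        noised.append([(1.0 - tau) * n + tau * x for n, x in zip(noise, frame)])
        # The velocity along a straight path is constant: data minus noise.
        velocity.append([x - n for n, x in zip(noise, frame)])
        taus.append(tau)
    return taus, noised, velocity

def integrate_to_data(taus, noised, velocity):
    """Follow the velocity for the remaining time (1 - tau) in one step."""
    return [
        [z + (1.0 - tau) * v for z, v in zip(z_frame, v_frame)]
        for tau, z_frame, v_frame in zip(taus, noised, velocity)
    ]

rng = random.Random(0)
clean = [[rng.gauss(0.0, 1.0) for _ in range(3)] for _ in range(4)]
taus, noised, velocity = flow_matching_pair(clean, rng)
recovered = integrate_to_data(taus, noised, velocity)
```

In training, a network would be asked to predict `velocity` from `noised` and `taus`. With the exact target, as here, a single step over the remaining time recovers the clean frames, which is the property a shortcut-style sampler tries to approximate with few steps.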

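They also appear to use PMPO (from "Preference Optimization as Probabilistic Inference"), though I have not verified the details.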

Backlinks

  • Preference Optimization as Probabilistic Inference
  • Scaling Instructable Agents Across Many Simulated Worlds (SIMA)
