Training Agents Inside of Scalable World Models (DreamerV4) Philosophically, it seems to combine GENIE (generative world model) with SIMA (agent / policy). Project page: https://danijar.com/project/dreamer4/ Ideas used: Flow Matching Shortcut Model Diffusion Forcing They also use PMPO??