StereoWorld uses binocular stereo vision cues to guide 3D-consistent stereo video generation, offering a biologically inspired approach to scene geometry understanding.
Genie Envisioner offers a two-stage training pipeline using video diffusion for robotic manipulation, separating world model adaptation from action policy learning. Here’s how it works and how to get started.