Google expands Project Genie with Street View-grounded worlds
Google added Street View grounding to Project Genie and began rolling the experimental prototype out to eligible Google AI Ultra subscribers globally.
World model timeline
Use the world model timeline to follow dated launches, demos, model releases, and platform milestones.
Google added Street View grounding to Project Genie and began rolling the experimental prototype out to eligible Google AI Ultra subscribers globally.
Runway said its new Characters system is built on GWM-1 and made the real-time conversational video agent available through the Runway API plus web and mobile apps.
Ant's Lingguang App was reported to add an 'Experience World Model' feature backed by LingBot-World-Fast, turning one image into a short explorable 3D scene on mobile.
Tencent open-sourced HY-Embodied-0.5 for real-world embodied agents and then released the HY-Embodied-0.5-X enhancement with inference and training code.
Alibaba introduced HappyOyster as an open-ended world model product for creating, exploring, and directing immersive environments in real time.
Tencent Hunyuan expanded HY-World 2.0 beyond its April release by publishing HY-Pano 2.0 inference code and weights on May 11, then world-generation inference code plus WorldStereo 2.0 weights on May 18.
Robbyant followed the LingBot-Map release with public evaluation scripts for KITTI and Oxford Spires, strengthening the model's reproducibility story for streaming 3D reconstruction.
At GTC 2026, NVIDIA said Cosmos 3 would unify synthetic world generation, physical-AI reasoning, and action simulation. It then kept widening the public Cosmos surface through an open data-factory blueprint and GitHub repos that were still active in late May.
Robbyant released LingBot-VA as a causal video-action world model that predicts visual dynamics and robot actions together.
Robbyant released LingBot-VLA as a vision-language-action foundation model for generalist robot manipulation, with public code, paper, and model checkpoints.
World Labs moved its world model work toward a programmable developer surface with the World API.
Project Sid gives the timeline a society-layer signal: agents specialize, coordinate, trade, and transmit behavior inside a persistent Minecraft-based world.
Runway framed GWM-1 as a general world model direction that reaches beyond video into worlds, avatars, and robotics.
Marble gave the world model category a concrete product surface: generated 3D worlds that can be explored and edited.
Genie 3 placed interactive, navigable world generation near the center of the world model conversation.
NVIDIA introduced Cosmos as a world foundation model platform for physical AI development.
Oasis demonstrated an action-conditioned Minecraft-like world that can be generated frame by frame while the player moves.
Skybox AI gives the timeline an early consumer-facing spatial signal: a generated world can start as a 360-degree horizon, mood, and environment shell.