Get the latest tech news
New AI model turns photos into explorable 3D worlds, with caveats
Openly available AI tool creates steerable 3D-like video, but requires serious GPU muscle.
On Tuesday, Tencent released HunyuanWorld-Voyager, a new open-weights AI model that generates 3D-consistent video sequences from a single image, allowing users to pilot a camera path to "explore" virtual scenes. While Genie 3 focuses on training AI agents and isn't publicly available, and Mirage 2 emphasizes user-generated content for gaming, Voyager targets video production and 3D reconstruction workflows with its RGB-depth output capabilities. To train Voyager, researchers developed software that automatically analyzes existing videos to process camera movements and calculate depth for every frame—eliminating the need for humans to manually label thousands of hours of footage.
Or read this on ArsTechnica