Get the latest tech news
Robust Conditional 3D Shape Generation from Casual Captures
From an input image sequence, ShapeR preprocesses per-object multimodal data (SLAM points, images, captions). A rectified flow transformer then conditions on these inputs to generate meshes object-centrically, producing a full metric scene reconstruction.
None
Or read this on Hacker News