Get the latest tech news

Robust Conditional 3D Shape Generation from Casual Captures

From an input image sequence, ShapeR preprocesses per-object multimodal data (SLAM points, images, captions). A rectified flow transformer then conditions on these inputs to generate meshes object-centrically, producing a full metric scene reconstruction.

None

Get the Android app

Or read this on Hacker News