Get the latest tech news

Robust Conditional 3D Shape Generation from Casual Captures


From an input image sequence, ShapeR preprocesses per-object multimodal data (SLAM points, images, captions). A rectified flow transformer then conditions on these inputs to generate meshes object-centrically, producing a full metric scene reconstruction.

None

Get the Android app

Or read this on Hacker News

Read more on:

Photo of casual captures

casual captures