|
Given a single input image, we can infer the shape, texture and camera viewpoint for the underlying object. In rows 1 and 2, we show the input image, inferred 3D shape and texture from the predicted viewpoint, and three novel viewpoints. We can learn 3D inference using only in-the-wild image collections with approximate instance segmentations, our approach can be easily applied across a diverse set of categories. Rows 3 and 4 show sample predictions across a broad set of categories, with the predicted 3D shape overlaid on the input image.
|