Data-Driven 3D Primitives for Single Image Understanding

teaserImage

People

Abstract

What primitives should we use to infer the rich 3D world behind an image? We argue that these primitives should be both visually discriminative and geometrically informative and we present a technique for discovering such primitives. We demonstrate the utility of our primitives by using them to infer the 3D surface normals given a single image. Our technique substantially outperforms the state-of-the-art and shows improved cross-dataset performance.

Paper

ICCV Paper (pdf)
Addendum: results on standard train/test (pdf)
Poster (pdf)

Citation
 
David F. Fouhey, Abhinav Gupta, Martial Hebert.
Data-Driven 3D Primitives for Single Image Understanding.
In Proc. International Conference on Computer Vision. 2013.

[Show BibTex]


Extended Results

We are providing a number of documents as supplemental material:

Code/Data

We will provide two versions of the code. One provides a black-box version of the system that can be easily plugged into other scene understanding tasks. The other is the version used internally that includes training code.
We also have precomputed results for many indoor scene understanding datasets. Please contact David Fouhey for these.
  1. Prediction Only (Now Available)
    [ Code (2.9MB .zip), version 1.01, updated 4/21/2014]
    [Data (926MB .tar.gz)]
    This is a streamlined version of the prediction code and a model pre-trained on the NYU v2 dataset. This can be used as a feature in other vision tasks.
  2. Training Code (New!)

Funding

This research is supported by:

Copyright Notice

The documents contained in these directories are included by the contributing authors as a means to ensure timely dissemination of scholarly and technical work on a non-commercial basis. Copyright and all rights therein are maintained by the authors or by other copyright holders, notwithstanding that they have offered their works here electronically. It is understood that all persons copying this information will adhere to the terms and constraints invoked by each author's copyright.