Combining Local Appearance and Motion Cues for Occlusion Boundary Detection
A. Stein and M. Hebert, British Machine Vision Conference (BMVC), 2007.Learning to Find Object Boundaries Using Motion Cues
A. Stein, D. Hoiem, and M. Hebert, IEEE International Conference on Computer Vision (ICCV), 2007.Occlusion Boundaries: Low-Level Detection to High-Level Reasoning
A. Stein, Doctoral Dissertation, Technical Report CMU-RI-TR-08-06, Robotics Institute, Carnegie Mellon University, 2008.Towards Unsupervised Whole-Object Segmentation: Combining Automated Matting with Boundary Detection
A. Stein, T. Stepleton, and M. Hebert, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2008.Occlusion Boundaries from Motion: Low-Level Detection and Mid-Level Reasoning
A. Stein and M. Hebert, International Journal of Computer Vision (IJCV), Vol. 82(3), May 2009.

The dataset contains three directories:
Contains subdirectories for each clip. Each clip subdirectory contains the individual frames in TIFF format, a ground-truth binary occlusion boundary map for the reference frame (used for the BMVC 2007 paper, also in TIFF format), and for many clips, an MPEG movie of the sequence going forward and then backward (allowing a "ping-pong" style viewing if the movie is played in a continuous loop).There is also a subdirectory for each clip called 'stabilized' which contains stabilized versions of the frames, where each frame is registered to the middle "reference" frame by a simple global translation. The stabilized sequences have been cropped slightly to exclude border effects. The cropping rectangle is stored in the simple text file "crop-rect" containing the upper-left and lower-right coordinates:
[x_left y_top x_right y_bottom]Thus, this same cropping should be applied to any ground truth before registering it to the stabilized data.
Contains a Matlab MAT file for each clip. This file contains a single variable called 'objects' which is a "stack" of UINT8 images, each containing a binary mask for a closed object or piece of an object, ordered from back to front. This provides the closed boundaries and foreground/background layering which were used in the ICCV 2007 paper.
For convenience, contains the "reference" or middle frame of each sequence. Note that the middle frame is determined according to
ceil((N+1)/2)where N is the number of frames in the sequence. Thus, for example, a 20-frame sequence's reference frame is img_010.tif, i.e. the 11th frame (note that numbering starts at 000).
Last updated: June 8, 2008