Segmentation-Aware Convolutional Nets

Adam W. Harley

Segmentation-Aware Convolutional Nets

Abstract

This thesis introduces a method to both obtain segmentation information and integrate it uniformly within a convolutional neural network (CNN). This counter-acts the tendency of CNNs to produce smooth predictions, which is undesirable for pixel-wise prediction tasks, such as semantic segmentation. The segmentation information is obtained by a form of metric learning, where a CNN learns to compute pixel embeddings that reflect whether any pair of pixels is likely to belong to the same region. This information is then used within a larger network, to replace all convolutions with foreground-focused convolutions, where the foreground is determined adaptively at each image point by local embeddings. The resulting network is called a segmentation-aware CNN, because the network can change its behaviour at each image location according to local segmentation cues. The proposed method yields systematic improvements on a standard semantic segmentation benchmark when compared to a strong baseline.

Document

Segmentation-Aware Convolutional Nets

Citation

Harley, A. W. (2016). Segmentation-Aware Convolutional Nets (Master's thesis). Ryerson University, Toronto, Ontario.

Bibtex format:

@mastersthesis{harleyMScThesis,
    title = {Segmentation-Aware Convolutional Nets},
    author = {Adam W Harley},
    school = {Ryerson University},
    address = {Toronto, Ontario},
    year = {2016}
}

Caffe layers and setup files

Coming soon!