TY - JOUR
T1 - Harmony potentials : Fusing global and local scale for semantic image segmentation
AU - Boix, Xavier
AU - Gonfaus, Josep M.
AU - Van De Weijer, Joost
AU - Bagdanov, Andrew D.
AU - Serrat, Joan
AU - Gonzàlez, Jordi
PY - 2012/1/1
Y1 - 2012/1/1
N2 - The Hierarchical Conditional Random Field (HCRF) model have been successfully applied to a number of image labeling problems, including image segmentation. However, existing HCRF models of image segmentation do not allow multiple classes to be assigned to a single region, which limits their ability to incorporate contextual information across multiple scales. At higher scales in the image, this representation yields an oversimplified model since multiple classes can be reasonably expected to appear within large regions. This simplified model particularly limits the impact of information at higher scales. Since class-label information at these scales is usually more reliable than at lower, noisier scales, neglecting this information is undesirable. To address these issues, we propose a new consistency potential for image labeling problems, which we call the harmony potential. It can encode any possible combination of labels, penalizing only unlikely combinations of classes. We also propose an effective sampling strategy over this expanded label set that renders tractable the underlying optimization problem. Our approach obtains state-of-the-art results on two challenging, standard benchmark datasets for semantic image segmentation: PASCAL VOC 2010, and MSRC-21. © 2011 Springer Science+Business Media, LLC.
AB - The Hierarchical Conditional Random Field (HCRF) model have been successfully applied to a number of image labeling problems, including image segmentation. However, existing HCRF models of image segmentation do not allow multiple classes to be assigned to a single region, which limits their ability to incorporate contextual information across multiple scales. At higher scales in the image, this representation yields an oversimplified model since multiple classes can be reasonably expected to appear within large regions. This simplified model particularly limits the impact of information at higher scales. Since class-label information at these scales is usually more reliable than at lower, noisier scales, neglecting this information is undesirable. To address these issues, we propose a new consistency potential for image labeling problems, which we call the harmony potential. It can encode any possible combination of labels, penalizing only unlikely combinations of classes. We also propose an effective sampling strategy over this expanded label set that renders tractable the underlying optimization problem. Our approach obtains state-of-the-art results on two challenging, standard benchmark datasets for semantic image segmentation: PASCAL VOC 2010, and MSRC-21. © 2011 Springer Science+Business Media, LLC.
KW - Hierarchical conditional random fields
KW - Semantic object segmentation
U2 - 10.1007/s11263-011-0449-8
DO - 10.1007/s11263-011-0449-8
M3 - Article
SN - 0920-5691
VL - 96
SP - 83
EP - 102
JO - International Journal of Computer Vision
JF - International Journal of Computer Vision
ER -