o bj c ut
DESCRIPTION
UNIVERSITY OF OXFORD. O BJ C UT. M. Pawan Kumar Philip Torr Andrew Zisserman. Aim. Given an image, to segment the object. Object Category Model. Segmentation. Cow Image. Segmented Cow. Segmentation should (ideally) be shaped like the object e.g. cow-like - PowerPoint PPT PresentationTRANSCRIPT
OBJ CUT
M. Pawan Kumar
Philip Torr
Andrew Zisserman
UNIVERSITYOF
OXFORD
Aim• Given an image, to segment the object
Segmentation should (ideally) be• shaped like the object e.g. cow-like• obtained efficiently in an unsupervised manner• able to handle self-occlusion
Segmentation
ObjectCategory
Model
Cow Image Segmented Cow
Challenges
Self Occlusion
Intra-Class Shape Variability
Intra-Class Appearance Variability
MotivationMagic Wand
Current methods require user intervention• Object and background seed pixels (Boykov and Jolly, ICCV 01)• Bounding Box of object (Rother et al. SIGGRAPH 04)
Cow Image
Object Seed Pixels
MotivationMagic Wand
Current methods require user intervention• Object and background seed pixels (Boykov and Jolly, ICCV 01)• Bounding Box of object (Rother et al. SIGGRAPH 04)
Cow Image
Object Seed Pixels
Background Seed Pixels
MotivationMagic Wand
Current methods require user intervention• Object and background seed pixels (Boykov and Jolly, ICCV 01)• Bounding Box of object (Rother et al. SIGGRAPH 04)
Segmented Image
MotivationMagic Wand
Current methods require user intervention• Object and background seed pixels (Boykov and Jolly, ICCV 01)• Bounding Box of object (Rother et al. SIGGRAPH 04)
Cow Image
Object Seed Pixels
Background Seed Pixels
MotivationMagic Wand
Current methods require user intervention• Object and background seed pixels (Boykov and Jolly, ICCV 01)• Bounding Box of object (Rother et al. SIGGRAPH 04)
Segmented Image
Problem • Manually intensive
• Segmentation is not guaranteed to be ‘object-like’
Non Object-like Segmentation
Motivation
Our Method• Combine object detection with segmentation
– Borenstein and Ullman, ECCV ’02– Leibe and Schiele, BMVC ’03
• Incorporate global shape priors in MRF
• Detection provides– Object Localization– Global shape priors
• Automatically segments the object– Note our method completely generic– Applicable to any object category model
Outline
• Problem Formulation
• Form of Shape Prior
• Optimization
• Results
Problem• Labelling m over the set of pixels D• Shape prior provided by parameter Θ
• Energy E (m,Θ) = ∑Φx(D|mx)+Φx(mx|Θ) + ∑ Ψxy(mx,my)+ Φ(D|mx,my)
• Unary terms– Likelihood based on colour– Unary potential based on distance from Θ
• Pairwise terms– Prior– Contrast term
• Find best labelling m* = arg min ∑ wi E (m,Θi)– wi is the weight for sample Θi
Unary terms Pairwise terms
MRF
Probability for a labelling consists of• Likelihood
• Unary potential based on colour of pixel• Prior which favours same labels for neighbours (pairwise potentials)
Prior Ψxy(mx,my)
Unary Potential Φx(D|mx)
D (pixels)
m (labels)
Image Plane
x
y
mx
my
Example
Cow Image Object SeedPixels
Background SeedPixels
Prior
x …
y …
…
…
x …
y …
…
…
Φx(D|obj)
Φx(D|bkg)Ψxy(mx,my)
Likelihood Ratio (Colour)
Example
Cow Image Object SeedPixels
Background SeedPixels
PriorLikelihood Ratio (Colour)
Contrast-Dependent MRF
Probability of labelling in addition has• Contrast term which favours boundaries to lie on image edges
D (pixels)
m (labels)
Image Plane
Contrast Term Φ(D|mx,my)
x
y
mx
my
Example
Cow Image Object SeedPixels
Background SeedPixels
Prior + Contrast
x …
y …
…
…
x …
y …
…
…
Likelihood Ratio (Colour)
Ψxy(mx,my)+Φ(D|mx,my)
Φx(D|obj)
Φx(D|bkg)
Example
Cow Image Object SeedPixels
Background SeedPixels
Prior + ContrastLikelihood Ratio (Colour)
Our Model
Probability of labelling in addition has• Unary potential which depend on distance from Θ (shape parameter)
D (pixels)
m (labels)
Θ (shape parameter)
Image Plane
Object CategorySpecific MRFx
y
mx
my
Unary PotentialΦx(mx|Θ)
Example
Cow Image Object SeedPixels
Background SeedPixels
Prior + ContrastDistance from Θ
Shape Prior Θ
Example
Cow Image Object SeedPixels
Background SeedPixels
Prior + ContrastLikelihood + Distance from Θ
Shape Prior Θ
Example
Cow Image Object SeedPixels
Background SeedPixels
Prior + ContrastLikelihood + Distance from Θ
Shape Prior Θ
Outline
• Problem Formulation– E (m,Θ) = ∑Φx(D|mx)+Φx(mx|Θ) + ∑ Ψxy(mx,my)+ Φ(D|mx,my)
• Form of Shape Prior
• Optimization
• Results
Layered Pictorial Structures (LPS)• Generative model
• Composition of parts + spatial layout
Layer 2
Layer 1
Parts in Layer 2 can occlude parts in Layer 1
Spatial Layout(Pairwise Configuration)
Layer 2
Layer 1
Transformations
Θ1
P(Θ1) = 0.9
Cow Instance
Layered Pictorial Structures (LPS)
Layer 2
Layer 1
Transformations
Θ2
P(Θ2) = 0.8
Cow Instance
Layered Pictorial Structures (LPS)
Layer 2
Layer 1
Transformations
Θ3
P(Θ3) = 0.01
Unlikely Instance
Layered Pictorial Structures (LPS)
LPS for Detection• Learning
– Learnt automatically using a set of examples
• Detection– Matches LPS to image using Loopy Belief Propagation– Localizes object parts
Outline
• Problem Formulation
• Form of Shape Prior
• Optimization
• Results
Optimization
• Given image D, find best labelling as m* = arg max p(m|D)
• Treat LPS parameter Θ as a latent (hidden) variable
• EM framework– E : sample the distribution over Θ
– M : obtain the labelling m
E-Step
• Given initial labelling m’, determine p(Θ|m’,D)
• Problem Efficiently sampling from p(Θ|m’,D)
• Solution• We develop efficient sum-product Loopy Belief
Propagation (LBP) for matching LPS.
• Similar to efficient max-product LBP for MAP estimate– Felzenszwalb and Huttenlocher, CVPR ‘04
Results
• Different samples localize different parts well.• We cannot use only the MAP estimate of the LPS.
M-Step
• Given samples from p(Θ|m’,D), get new labelling mnew
• Sample Θi provides– Object localization to learn RGB distributions of object and background– Shape prior for segmentation
• Problem– Maximize expected log likelihood using all samples– To efficiently obtain the new labelling
M-Step
Cow Image Shape Θ1
w1 = P(Θ1|m’,D)
RGB Histogram for Object RGB Histogram for Background
Cow Image Shape Θ1
M-Step
w1 = P(Θ1|m’,D)
Θ1
Image PlaneD (pixels)
m (labels)
• Best labelling found efficiently using a Single Graph Cut
Segmentation using Graph Cuts
x …
y … … …
z … …
Obj
Bkg
CutΦx(D|bkg) + Φx(bkg|Θ)
m
Φz(D|obj) + Φz(obj|Θ)
Ψxy(mx,my)+
Φ(D|mx,my)
Segmentation using Graph Cuts
x …
y … … …
z … …
Obj
Bkg
m
M-Step
Cow Image Shape Θ2
w2 = P(Θ2|m’,D)
RGB Histogram for BackgroundRGB Histogram for Object
M-Step
Cow Image Shape Θ2
w2 = P(Θ2|m’,D)
Θ2
Image PlaneD (pixels)
m (labels)
• Best labelling found efficiently using a Single Graph Cut
M-Step
Θ2
Image Plane
Θ1
Image Plane
w1 + w2 + ….
• Best labelling found efficiently using a Single Graph Cut
m* = arg min ∑ wi E (m,Θi)
Outline
• Problem Formulation
• Form of Shape Prior
• Optimization
• Results
SegmentationImage
ResultsUsing LPS Model for Cow
In the absence of a clear boundary between object and background
SegmentationImage
ResultsUsing LPS Model for Cow
SegmentationImage
ResultsUsing LPS Model for Cow
SegmentationImage
ResultsUsing LPS Model for Cow
SegmentationImage
ResultsUsing LPS Model for Horse
SegmentationImage
ResultsUsing LPS Model for Horse
Our Method Leibe and SchieleImage
Results
AppearanceShape Shape+Appearance
Results
Without Φx(D|mx) Without Φx(mx|Θ)
• Conclusions
– New model for introducing global shape prior in MRF– Method of combining detection and segmentation– Efficient LBP for detecting articulated objects
• Future Work
– Other shape parameters need to be explored– Method needs to be extended to handle multiple
visual aspects