computer vision detecting the existence, pose and position of known objects within an image michael...
TRANSCRIPT
Computer Vision Detecting the existence, pose and position of known
objects within an image
Michael Horne, Philip Sterne (Supervisor)
Background
• Computer vision is a diverse field– Many schools of thought
• Many different ways of achieving– Monocular– Stereoscopic (quite popular)
Problem Statment
• Recognize objects within a single image– Cutting down the total information in
the images– Extracting and building up useful subset– Using that information to recognize
objects• Model Based system
– System already knows of objects
System structure
• Image capture and segmentation– Pre-process image– Extract useful information
• Matching– Find links between image and objects
known to the system• Pose Solution
– Use the matches to estimate the position and orientation of the object
Image Segmentation
• Feature extraction– Images have lots of information– What features are of interest?– Edges are definitive attributes of
objects– Corners can easily be matched against
Image Segmentation
• Gaussian filter is applied to reduce noise– Noise adds complexity, but no useful
information.• Canny Edge detector is applied to
extract edges
Image Segmentation
• Arbitrary shaped edges are converted into straight line segments.
Similar Geometries
• Need image data to be represented in a similar way to the object model
• Objects are stored as wireframes
• Image data converted to a wireframe like form.
Matching
• To estimate a pose we find correspondences with corners
• Classification problem– Which object to match?– Or what data to match to which objects?
(Multiple object case)• Complex problem• Random Sample Consensus
approach
Matching
• Take only the minimum amount of image data needed to estimate a solution
• But do it at random• Then test the validity of the
estimation
Pose Estimation
• Now using the correspondences• Algorithm based on POSIT
– A quick method– In tune with the RANSAC approach
• POSIT– Scaled orthographic projection model
Pose Estimation
• POSIT – Minimum correspondences is four– Will solve regardless correctness
• Validation is necessary
Pose Estimation
• Checking need to be rapid• Various levels of verification
– Object must not be skewed in making the four points of the object to the image points.
– Distance to estimated position must be minimal
• Estimations that pass undergo further checking
Pose Estimation
• Next step is to project object over image– Number of matches is expanded.– All forward facing vertices are checked
for a corresponding image point• Also the geometries are verified
Pose Estimation
• Goodness ratio = corner ratio * edge ratio– General means of judging the fitted
model
– The higher the ratio, the better the fit• Model fitting with the best ratio is
chosen
Final
• End result
Multiple objects
• After each model is fitted the points used are marked as used– Independence of objects
• The matching process restarts
Results
• Tests were setup for a set of four different objects
• Varying degrees of symmetry
• Images of various poses were captured
• Varying difficulty, degenerate poses to definitive
• 85% success in estimating pose of single objects
Other examples
1 2 3
Multiple Objects
• Multiple objects solved