a brief overview of computer vision
DESCRIPTION
A Brief Overview of Computer Vision. Jinxiang Chai. What is Computer Vision?. Computer vision is the science and technology of machines that see. Concerned with the theory for building artificial systems that obtain information from images. - PowerPoint PPT PresentationTRANSCRIPT
![Page 1: A Brief Overview of Computer Vision](https://reader030.vdocuments.site/reader030/viewer/2022032708/56812aa6550346895d8e687a/html5/thumbnails/1.jpg)
A Brief Overview of Computer Vision
Jinxiang Chai
![Page 2: A Brief Overview of Computer Vision](https://reader030.vdocuments.site/reader030/viewer/2022032708/56812aa6550346895d8e687a/html5/thumbnails/2.jpg)
What is Computer Vision?
• Computer vision is the science and technology of machines that see.
• Concerned with the theory for building artificial systems that obtain information from images.
• The image data can take many forms, such as a video sequence, views from multiple cameras, or multi-dimensional data from a medical scanner
![Page 3: A Brief Overview of Computer Vision](https://reader030.vdocuments.site/reader030/viewer/2022032708/56812aa6550346895d8e687a/html5/thumbnails/3.jpg)
Applications
• Robot perception (e.g. an industrial robot or an autonomous vehicle, autonomous helicopter, humanoid robots).
![Page 4: A Brief Overview of Computer Vision](https://reader030.vdocuments.site/reader030/viewer/2022032708/56812aa6550346895d8e687a/html5/thumbnails/4.jpg)
Honda ASIMO Humanoid Robot
• Face detection• Face recognition• Posture/gesture recognition
(e.g., hand waving)• Environment recognition
(e.g., obstacles)
![Page 5: A Brief Overview of Computer Vision](https://reader030.vdocuments.site/reader030/viewer/2022032708/56812aa6550346895d8e687a/html5/thumbnails/5.jpg)
Applications
• Robot perception (e.g. an industrial robot or an autonomous vehicle, humanoid robots).
• Detecting events (e.g. for visual surveillance or people counting).
![Page 6: A Brief Overview of Computer Vision](https://reader030.vdocuments.site/reader030/viewer/2022032708/56812aa6550346895d8e687a/html5/thumbnails/6.jpg)
Detecting Events
• Customer tracking and activity analysis
![Page 7: A Brief Overview of Computer Vision](https://reader030.vdocuments.site/reader030/viewer/2022032708/56812aa6550346895d8e687a/html5/thumbnails/7.jpg)
Applications
• Robot perception (e.g. an industrial robot or an autonomous vehicle, humanoid robots).
• Detecting events (e.g. for visual surveillance or people counting).
• Modeling objects or environments
![Page 8: A Brief Overview of Computer Vision](https://reader030.vdocuments.site/reader030/viewer/2022032708/56812aa6550346895d8e687a/html5/thumbnails/8.jpg)
Modeling objects or environments
• Modeling buildings, plants, faces, cars etc.
![Page 9: A Brief Overview of Computer Vision](https://reader030.vdocuments.site/reader030/viewer/2022032708/56812aa6550346895d8e687a/html5/thumbnails/9.jpg)
Applications
• Robot perception (e.g. an industrial robot or an autonomous vehicle, humanoid robots).
• Detecting events (e.g. for visual surveillance or people counting).
• Modeling objects or environments
• Interaction (e.g. as the input to a device for computer-
human interaction).
![Page 10: A Brief Overview of Computer Vision](https://reader030.vdocuments.site/reader030/viewer/2022032708/56812aa6550346895d8e687a/html5/thumbnails/10.jpg)
Interactions
• Interactions with computers and video games, etc.
Face recognition for automatic login
Computer vision for game interfaces (Sony eyetoy, Microsoft Kinect)
![Page 11: A Brief Overview of Computer Vision](https://reader030.vdocuments.site/reader030/viewer/2022032708/56812aa6550346895d8e687a/html5/thumbnails/11.jpg)
Applications
• Robot perception (e.g. an industrial robot or an autonomous vehicle, humanoid robots).
• Detecting events (e.g. for visual surveillance or people counting).
• Modeling objects or environments
• Interaction (e.g. as the input to a device for
computer-human interaction). • Organizing information (e.g. for indexing databases of
images and image sequences).
![Page 12: A Brief Overview of Computer Vision](https://reader030.vdocuments.site/reader030/viewer/2022032708/56812aa6550346895d8e687a/html5/thumbnails/12.jpg)
Organizing information
• Flickr (www. Flickr.com) has 3 billion images
• Youtube has tons of videos.
• Need new ways to search, analyze, summarize a large collection of internet images and videos
![Page 13: A Brief Overview of Computer Vision](https://reader030.vdocuments.site/reader030/viewer/2022032708/56812aa6550346895d8e687a/html5/thumbnails/13.jpg)
Image Representation
An image is a 2D rectilinear array of Pixels
- A width X height array where each entry of the array stores a single pixel
![Page 14: A Brief Overview of Computer Vision](https://reader030.vdocuments.site/reader030/viewer/2022032708/56812aa6550346895d8e687a/html5/thumbnails/14.jpg)
Image Representation
A pixel stores color information
Luminance pixels - gray-scale images (intensity images) - 0-255 - 8 bits per pixel
Red, green, blue pixels (RGB) - Color images - Each channel: 0-255 - 24 bits per pixel
![Page 15: A Brief Overview of Computer Vision](https://reader030.vdocuments.site/reader030/viewer/2022032708/56812aa6550346895d8e687a/html5/thumbnails/15.jpg)
Image Representation
An image is a 2D rectilinear array of Pixels
- A width X height array where each entry of the array stores a single pixel
- Each pixel stores color information
(255,255,255)
![Page 16: A Brief Overview of Computer Vision](https://reader030.vdocuments.site/reader030/viewer/2022032708/56812aa6550346895d8e687a/html5/thumbnails/16.jpg)
Images
• Which kind of information you can obtain from images
![Page 17: A Brief Overview of Computer Vision](https://reader030.vdocuments.site/reader030/viewer/2022032708/56812aa6550346895d8e687a/html5/thumbnails/17.jpg)
Images
• Which kind of information you can obtain from images
Edge detection
![Page 18: A Brief Overview of Computer Vision](https://reader030.vdocuments.site/reader030/viewer/2022032708/56812aa6550346895d8e687a/html5/thumbnails/18.jpg)
Images
• Which kind of information you can obtain from images
Edge detection Corner& feature detection
![Page 19: A Brief Overview of Computer Vision](https://reader030.vdocuments.site/reader030/viewer/2022032708/56812aa6550346895d8e687a/html5/thumbnails/19.jpg)
Images
• Which kind of information you can obtain from images
Edge detection Corner& feature detection
Geometric primitive detection
![Page 20: A Brief Overview of Computer Vision](https://reader030.vdocuments.site/reader030/viewer/2022032708/56812aa6550346895d8e687a/html5/thumbnails/20.jpg)
Images
• Which kind of information you can obtain from images
Edge detection Corner& feature detection
Geometric primitive detection
Object detection
![Page 21: A Brief Overview of Computer Vision](https://reader030.vdocuments.site/reader030/viewer/2022032708/56812aa6550346895d8e687a/html5/thumbnails/21.jpg)
Images
• Which kind of information you can obtain from images
Edge detection Corner& feature detection
Geometric primitive detection
Object detection Face alignment and recognition
……
![Page 22: A Brief Overview of Computer Vision](https://reader030.vdocuments.site/reader030/viewer/2022032708/56812aa6550346895d8e687a/html5/thumbnails/22.jpg)
How about multiple images?
• What can we obtain if we have multiple images?
![Page 23: A Brief Overview of Computer Vision](https://reader030.vdocuments.site/reader030/viewer/2022032708/56812aa6550346895d8e687a/html5/thumbnails/23.jpg)
How about multiple images?
• What can we obtain if we have multiple images?
Two images of the same scene
![Page 24: A Brief Overview of Computer Vision](https://reader030.vdocuments.site/reader030/viewer/2022032708/56812aa6550346895d8e687a/html5/thumbnails/24.jpg)
Structure and motion analysis• Given two or more images of the same scene or object,
estimate camera motion and 3D object structure (e.g., depth)
unknownunknowncameracamera
viewpointsviewpoints
![Page 25: A Brief Overview of Computer Vision](https://reader030.vdocuments.site/reader030/viewer/2022032708/56812aa6550346895d8e687a/html5/thumbnails/25.jpg)
Structure and motion analysis• Given two or more images of the same scene or object,
estimate camera motion and 3D object structure (e.g., depth)
unknownunknowncameracamera
viewpointsviewpoints
How to estimate camera parameters?
- where is the camera?
- where is it pointing?
- what are internal parameters, e.g. focal length?
![Page 26: A Brief Overview of Computer Vision](https://reader030.vdocuments.site/reader030/viewer/2022032708/56812aa6550346895d8e687a/html5/thumbnails/26.jpg)
Structure and motion analysis• Given two or more images of the same scene or object,
estimate camera motion and 3D object structure (e.g., depth)
unknownunknowncameracamera
viewpointsviewpoints
How to estimate camera parameters?
- where is the camera?
- where is it pointing?
- what are internal parameters, e.g. focal length?
Camera calibration!
![Page 27: A Brief Overview of Computer Vision](https://reader030.vdocuments.site/reader030/viewer/2022032708/56812aa6550346895d8e687a/html5/thumbnails/27.jpg)
Structure and motion analysis
• Reconstruct the depth information.
Input images
How to find the depth information of this point?
![Page 28: A Brief Overview of Computer Vision](https://reader030.vdocuments.site/reader030/viewer/2022032708/56812aa6550346895d8e687a/html5/thumbnails/28.jpg)
Structure and motion analysis
• Reconstruct the depth information.
Input images
How to find the depth information of this point?
- find the corresponding point in the right image.
![Page 29: A Brief Overview of Computer Vision](https://reader030.vdocuments.site/reader030/viewer/2022032708/56812aa6550346895d8e687a/html5/thumbnails/29.jpg)
Structure and motion analysis
• Reconstruct the depth information.
Input images
![Page 30: A Brief Overview of Computer Vision](https://reader030.vdocuments.site/reader030/viewer/2022032708/56812aa6550346895d8e687a/html5/thumbnails/30.jpg)
Structure and motion analysis
• Reconstruct the depth information.
Input images
Depth images
![Page 31: A Brief Overview of Computer Vision](https://reader030.vdocuments.site/reader030/viewer/2022032708/56812aa6550346895d8e687a/html5/thumbnails/31.jpg)
Structure and motion analysis
• Reconstruct 3D models from multiple images
Reconstruction results from 23 images
![Page 32: A Brief Overview of Computer Vision](https://reader030.vdocuments.site/reader030/viewer/2022032708/56812aa6550346895d8e687a/html5/thumbnails/32.jpg)
All together video
• Click here- feature detection
- feature matching (epipolar geometry)
- structure from motion
- stereo reconstruction
- triangulation
- texture mapping
![Page 33: A Brief Overview of Computer Vision](https://reader030.vdocuments.site/reader030/viewer/2022032708/56812aa6550346895d8e687a/html5/thumbnails/33.jpg)
How about video sequences?
• What can we obtain from video?
![Page 34: A Brief Overview of Computer Vision](https://reader030.vdocuments.site/reader030/viewer/2022032708/56812aa6550346895d8e687a/html5/thumbnails/34.jpg)
How about video sequences?
• What can we obtain from video?
Optical flow: where are pixels moving to?
![Page 35: A Brief Overview of Computer Vision](https://reader030.vdocuments.site/reader030/viewer/2022032708/56812aa6550346895d8e687a/html5/thumbnails/35.jpg)
How about multiple video sequences
• Modeling dynamic objects (video click here)
![Page 36: A Brief Overview of Computer Vision](https://reader030.vdocuments.site/reader030/viewer/2022032708/56812aa6550346895d8e687a/html5/thumbnails/36.jpg)
Modeling human motion from video
• Single-view camera
• Interactively construct human motion form video