cse 185 introduction to computer vision

82
CSE 185 Introduction to Computer Vision Image Filtering: Frequency Domain

Upload: kimo

Post on 30-Jan-2016

39 views

Category:

Documents


0 download

DESCRIPTION

CSE 185 Introduction to Computer Vision. Image Filtering: Frequency Domain. Image filtering. Fourier transform and frequency domain Frequency view of filtering Hybrid images Sampling Today’s lecture covers material in 3.4 Reading: Chapters 3 - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: CSE 185  Introduction to Computer Vision

CSE 185 Introduction to Computer

VisionImage Filtering:

Frequency Domain

Page 2: CSE 185  Introduction to Computer Vision

Image filtering

• Fourier transform and frequency domain– Frequency view of filtering– Hybrid images– Sampling

• Reading: Chapters 3• Some slides from James Hays, David

Hoeim, Steve Seitz, …

Page 3: CSE 185  Introduction to Computer Vision

Gaussian Box filter

Gaussian filter

Page 4: CSE 185  Introduction to Computer Vision

Hybrid images

• Filter one image with low-pass filter and the other with high-pass filter, and then superimpose them

• A. Oliva, A. Torralba, P.G. Schyns, “Hybrid Images,” SIGGRAPH 2006

Page 5: CSE 185  Introduction to Computer Vision

Why do we get different, distance-dependent interpretations of hybrid images?

?

Page 6: CSE 185  Introduction to Computer Vision

Why does a lower resolution image still make sense to us? What do we lose?

Page 7: CSE 185  Introduction to Computer Vision

Thinking in terms of frequency

Page 8: CSE 185  Introduction to Computer Vision

Jean Baptiste Joseph Fourier had crazy idea (1807):

Any univariate function can be rewritten as a weighted sum of sines and cosines of different frequencies.

• Don’t believe it? – Neither did Lagrange,

Laplace, Poisson and other big wigs

– Not translated into English until 1878!

• But it’s (mostly) true!– called Fourier Series– there are some subtle

restrictions

...the manner in which the author arrives at these equations is not exempt of difficulties

and...his analysis to integrate them still leaves something to be desired on the score

of generality and even rigour.

Laplace

LagrangeLegendre

Page 9: CSE 185  Introduction to Computer Vision

A sum of sinesOur building block:

Add enough of them to get any signal g(x) you want!

xAsin(

Page 10: CSE 185  Introduction to Computer Vision

Frequency spectra

• example : g(t) = sin(2πf t) + (1/3)sin(2π(3f) t)

= +

Page 11: CSE 185  Introduction to Computer Vision

Frequency spectra

Page 12: CSE 185  Introduction to Computer Vision

= +

=

Frequency spectra

Page 13: CSE 185  Introduction to Computer Vision

= +

=

Frequency spectra

Page 14: CSE 185  Introduction to Computer Vision

= +

=

Frequency spectra

Page 15: CSE 185  Introduction to Computer Vision

= +

=

Frequency spectra

Page 16: CSE 185  Introduction to Computer Vision

= +

=

Frequency spectra

Page 17: CSE 185  Introduction to Computer Vision

= 1

1sin(2 )

k

A ktk

Frequency spectra

Page 18: CSE 185  Introduction to Computer Vision

Example: Music

• We think of music in terms of frequencies at different magnitudes

Page 19: CSE 185  Introduction to Computer Vision

Fourier analysis in images

Intensity Image

Fourier Image

Page 20: CSE 185  Introduction to Computer Vision

Signals can be composed

+ =

More: http://www.cs.unm.edu/~brayer/vision/fourier.html

Page 21: CSE 185  Introduction to Computer Vision

Fourier transform

• Fourier transform stores the magnitude and phase at each frequency– Magnitude encodes how much signal there is at a

particular frequency– Phase encodes spatial information (indirectly)– For mathematical convenience, this is often notated in

terms of real and complex numbers

22 )()( IRA )(

)(tan 1

R

IAmplitude: Phase:

Page 22: CSE 185  Introduction to Computer Vision

Computing Fourier transform

Continuous

Discrete

k = -N/2..N/2

Fast Fourier Transform (FFT): NlogN

Page 23: CSE 185  Introduction to Computer Vision

Phase and magnitude

• Fourier transform of a real function is complex– difficult to plot,

visualize– instead, we can think of

the phase and magnitude of the transform

• Phase is the phase of the complex transform

• Magnitude is the magnitude of the complex transform

• Curious fact– all natural images have

about the same magnitude transform

– hence, phase seems to matter, but magnitude largely doesn’t

• Demonstration– Take two pictures, swap

the phase transforms, compute the inverse - what does the result look like?

Page 24: CSE 185  Introduction to Computer Vision
Page 25: CSE 185  Introduction to Computer Vision

This is the magnitude transform of the cheetah pic

Page 26: CSE 185  Introduction to Computer Vision

This is the phase transform of the cheetah pic

Page 27: CSE 185  Introduction to Computer Vision
Page 28: CSE 185  Introduction to Computer Vision

This is the magnitude transform of the zebra pic

Page 29: CSE 185  Introduction to Computer Vision

This is the phase transform of the zebra pic

Page 30: CSE 185  Introduction to Computer Vision

Reconstruction with zebra phase, cheetah magnitude

Page 31: CSE 185  Introduction to Computer Vision

Reconstruction with cheetah phase, zebra magnitude

Page 32: CSE 185  Introduction to Computer Vision

The convolution theorem• The Fourier transform of the convolution

of two functions is the product of their Fourier transforms

• Convolution in spatial domain is equivalent to multiplication in frequency domain!

]F[]F[]F[ hghg

]]F[][F[F* 1 hghg

Page 33: CSE 185  Introduction to Computer Vision

Properties of Fourier transform

• Linearity

• Fourier transform of a real signal is symmetric about the origin

• The energy of the signal is the same as the energy of its Fourier transform

See Szeliski Book (3.4)

Page 34: CSE 185  Introduction to Computer Vision

2D FFT

1,...,2,1,0,1,...,2,1,0for

),(),(1

0

1

0

)//(2

NyMx

evuFyxfM

u

N

v

NvyMuxj

• Fourier transform (discrete case)

• Inverse Fourier transform:

• u, v : the transform or frequency variables• x, y : the spatial or image variables

1,...,2,1,0,1,...,2,1,0for

),(1

),(1

0

1

0

)//(2

NvMu

eyxfMN

vuFM

x

N

y

NvyMuxj

Euler’s formula

Page 35: CSE 185  Introduction to Computer Vision

2D FFT

Sinusoid with frequency = 1 and its FFT

Page 36: CSE 185  Introduction to Computer Vision

2D FFT

Sinusoid with frequency = 3 and its FFT

Page 37: CSE 185  Introduction to Computer Vision

2D FFT

Sinusoid with frequency = 5 and its FFT

Page 38: CSE 185  Introduction to Computer Vision

2D FFT

Sinusoid with frequency = 10 and its FFT

Page 39: CSE 185  Introduction to Computer Vision

2D FFT

Sinusoid with frequency = 15 and its FFT

Page 40: CSE 185  Introduction to Computer Vision

2D FFT

Sinusoid with varying frequency and their FFT

Page 41: CSE 185  Introduction to Computer Vision

Rotation

Sinusoid rotated at 30 degrees and its FFT

Page 42: CSE 185  Introduction to Computer Vision

2D FFT

Sinusoid rotated at 60 degrees and its FFT

Page 43: CSE 185  Introduction to Computer Vision

2D FFT

Page 44: CSE 185  Introduction to Computer Vision

Image analysis with FFT

Page 45: CSE 185  Introduction to Computer Vision

Image analysis with FFT

http://www.cs.unm.edu/~brayer/vision/fourier.html

Page 46: CSE 185  Introduction to Computer Vision

Filtering in spatial domain

-101

-202

-101

* =

Page 47: CSE 185  Introduction to Computer Vision

Filtering in frequency domain

FFT

FFT

Inverse FFT

=

Page 48: CSE 185  Introduction to Computer Vision

FFT in Matlab

• Filtering with fft

• Displaying with fft

im = double(imread('cameraman.tif'))/255;[imh, imw] = size(im); hs = 50; % filter half-sizefil = fspecial('gaussian', hs*2+1, 10); fftsize = 1024; % should be order of 2 (for speed) and include paddingim_fft = fft2(im, fftsize, fftsize); % 1) fft im with paddingfil_fft = fft2(fil, fftsize, fftsize); % 2) fft fil, pad to same size as imageim_fil_fft = im_fft .* fil_fft; % 3) multiply fft imagesim_fil = ifft2(im_fil_fft); % 4) inverse fft2im_fil = im_fil(1+hs:size(im,1)+hs, 1+hs:size(im, 2)+hs); % 5) remove paddingfigure, imshow(im);figure, imshow(im_fil);

figure(1), imagesc(log(abs(fftshift(im_fft)))), axis image, colormap jet

Page 49: CSE 185  Introduction to Computer Vision

Questions

Which has more information, the phase or the magnitude?

What happens if you take the phase from one image and combine it with the magnitude from another image?

Page 50: CSE 185  Introduction to Computer Vision

FilteringWhy does the Gaussian give a nice smooth image, but the

square filter give edgy artifacts?

Gaussian Box filter

Page 51: CSE 185  Introduction to Computer Vision

Gaussian

Gaussian filter

Page 52: CSE 185  Introduction to Computer Vision

Box Filter

Box filter

Page 53: CSE 185  Introduction to Computer Vision

Sampling

Why does a lower resolution image still make sense to us? What do we lose?

Page 54: CSE 185  Introduction to Computer Vision

Throw away every other row and

column to create a 1/2 size image

Subsampling by a factor of 2

Page 55: CSE 185  Introduction to Computer Vision

• 1D example (sinewave):

Aliasing problem

Page 56: CSE 185  Introduction to Computer Vision

• 1D example (sinewave):

Aliasing problem

Page 57: CSE 185  Introduction to Computer Vision

• Sub-sampling may be dangerous….• Characteristic errors may appear:

– “Wagon wheels rolling the wrong way in movies”

– “Checkerboards disintegrate in ray tracing”

– “Striped shirts look funny on color television”

Aliasing problem

Page 58: CSE 185  Introduction to Computer Vision

Aliasing in video

Page 59: CSE 185  Introduction to Computer Vision

Aliasing in graphics

Page 60: CSE 185  Introduction to Computer Vision

Resample the checkerboard by taking one sample at each circle. In the case of the top left board, new representation is reasonable.

Top right also yields a reasonable representation.

Bottom left is all black (dubious) and bottom right has checks that are too big.

Sampling scheme is crucially related to frequency

Page 61: CSE 185  Introduction to Computer Vision

Constructing a pyramid by taking every second pixel leads to layers that badly misrepresent the top layer

Page 62: CSE 185  Introduction to Computer Vision

• When sampling a signal at discrete intervals, the sampling frequency must be 2 fmax

• fmax = max frequency of the input signal

• This will allows to reconstruct the original from the sampled version without aliasing

good

bad

v v v

Nyquist-Shannon Sampling Theorem

Page 63: CSE 185  Introduction to Computer Vision

Anti-aliasing

Solutions:• Sample more often

• Get rid of all frequencies that are greater than half the new sampling frequency– Will lose information– But it’s better than aliasing– Apply a smoothing filter

Page 64: CSE 185  Introduction to Computer Vision

Algorithm for downsampling by factor of 2

1. Start with image(h, w)2. Apply low-pass filter

im_blur = imfilter(image, fspecial(‘gaussian’, 7, 1))

3. Sample every other pixelim_small = im_blur(1:2:end, 1:2:end);

Page 65: CSE 185  Introduction to Computer Vision

Sampling without smoothing. Top row shows the images, sampled at every second pixel to get the next; bottom row shows the magnitude spectrum of these images.

substantial aliasing

Magnitude of the Fourier transform of each image displayed as a log scale (constant component is at the center). Fourier transform of a resampled image is obtained by scaling the Fourier transform of the original image and then tiling the plane

Page 66: CSE 185  Introduction to Computer Vision

Sampling with smoothing (small σ). Top row shows the images. We get the next image by smoothing the image with a Gaussian with σ=1 pixel, then sampling at every second pixel to get the next; bottom row shows the magnitude spectrum of these images.

Low pass filter suppresses high frequency components with less aliasing

reducing aliasing

Page 67: CSE 185  Introduction to Computer Vision

Sampling with smoothing (large σ). Top row shows the images. We get the next image by smoothing the image with a Gaussian with σ=2 pixels, then sampling at every second pixel to get the next; bottom row shows the magnitude spectrum of these images.

Large σ less aliasing, but with little detailGaussian is not an ideal low-pass filter

lose details

Page 68: CSE 185  Introduction to Computer Vision

Subsampling without pre-filtering

1/4 (2x zoom) 1/8 (4x zoom)1/2

Page 69: CSE 185  Introduction to Computer Vision

Subsampling with Gaussian pre-filtering

G 1/4 G 1/8Gaussian 1/2

Page 70: CSE 185  Introduction to Computer Vision

Why does a lower resolution image still make sense to us? What do we lose?

Page 71: CSE 185  Introduction to Computer Vision

Why do we get different, distance-dependent interpretations of hybrid images?

?

Page 72: CSE 185  Introduction to Computer Vision

Salvador Dali invented Hybrid Images?

Salvador Dali“Gala Contemplating the Mediterranean Sea, which at 30 meters becomes the portrait of Abraham Lincoln”, 1976

Page 73: CSE 185  Introduction to Computer Vision
Page 74: CSE 185  Introduction to Computer Vision
Page 75: CSE 185  Introduction to Computer Vision

Application: Hybrid images

• Combine low-frequency of one image with high-frequency of another one

• Sad faces when looked closely• Happy faces when looked a few meters away• A. Oliva, A. Torralba, P.G. Schyns, “Hy brid Im ages,”

SIGGRAPH 2006

Page 76: CSE 185  Introduction to Computer Vision

• Early processing in humans filters for various orientations and scales of frequency

• Perceptual cues in the mid-high frequencies dominate perception

• When we see an image from far away, we are effectively subsampling it

Early Visual Processing: Multi-scale edge and blob filters

Clues from human perception

Page 77: CSE 185  Introduction to Computer Vision

Campbell-Robson contrast sensitivity curve

Detect slight change in shades of gray before they become indistinguishableContrast sensitivity is better for mid-range spatial frequencies

Page 78: CSE 185  Introduction to Computer Vision

Hybrid image in FFT

Hybrid Image Low-passed Image High-passed Image

Page 79: CSE 185  Introduction to Computer Vision

PerceptionWhy do we get different, distance-dependent

interpretations of hybrid images?

?

Page 80: CSE 185  Introduction to Computer Vision

Things to remember• Sometimes it makes sense to

think of images and filtering in the frequency domain– Fourier analysis

• Can be faster to filter using FFT for large images (N logN vs. N2 for auto-correlation)

• Images are mostly smooth– Basis for compression

• Remember to low-pass before sampling

Page 81: CSE 185  Introduction to Computer Vision

Practice question

1. Match the spatial domain image to the Fourier magnitude image

1 54

A

32

C

B

DE

Page 82: CSE 185  Introduction to Computer Vision

Practice question

1. Match the spatial domain image to the Fourier magnitude image

1 54

A

32

C

B

DE

1 – D, 2 – B, 3 – A, 4 – E, 5 - C