vision & perception - lina karam's...
TRANSCRIPT
EEE 508
Vision & Perception
• Simple model:
simple reflectance/illumination model
image: x(n1,n2)=i(n1,n2)r(n1,n2)
where 0 < i(n1,n2) <
0 < r(n1,n2) < 1
Eye
illumination source i(n1,n2)
reflectance term r(n1,n2)
EEE 508
Vision & Perception
• Imaging on the retina (back of eye consisting of photoreceptors)
17mm
20mm 100m
2m
Focal point
of lens Eye
Retinal
image
EEE 508
Vision & Perception
• Visible range of electromagnetic spectrum is 350 nm to 780 nm.
rays x rays ultraviolet visible infrared microwaves radio
380nm 780nm
EEE 508
• Simple model for HVS
NOTE: The HVS is really a non-linear system.
Vision & Perception
optic nerve
eye brain
HVS
HVS
Primarily a BPF/LPF
Input (spatial pulse)
What we see
Output
What we think we see
Approximate HVS with a LTI system
EEE 508
Vision & Perception
• Light: electromagnetic radiation that stimulates our visual
response
expressed as a spectral energy distribution
C( ); 380nm 780nm – wavelength in visible spectrum
Spectral distribution of a colored light
C( ) represents amount of energy present at each frequency wavelength
EEE 508
Vision & Perception
• Color vision model (3 receptor absorption model)
3 types of cones: each has a different peak absorption frequency
Typical absorption spectra (also called sensitivity curves) for the three cones
(not to scale)
EEE 508
Vision & Perception
• Let C(): spectral energy distribution of a “colored” light source
C()
max max min
EEE 508
• Color sensation described by
i[C()], i=1,2,3, called spectral responses
If C1() and C2() produce responses such that
i[C1()]=i[C2()] for i = 1,2,3 C1() and C2() perceived to be identical
• Color sensation – perceptual attributes
1. Brightness – perceived Luminance
2. Hue – color
3. Saturation – amount of white light diluting the color
Vision & Perception
3,2,1,max
min
idCSC ii
EEE 508
Vision & Perception
• The following curves show the relative spectral response
functions of each of the 3 types of cones:
Eye’s response to Blue light is much less strong than is its
response to Red or Green
400 700 550 (nm) 0
0.2 R G
B
0.01
Fra
cti
on
of
lig
ht
ab
so
rbed
by e
ach
typ
e o
f co
ne
EEE 508
Vision & Perception
• Luminance and Brightness
The luminance or intensity of an object with light spectral
distribution I(x,y, ) is
where V( ) is the “relative luminous efficiency function” of the HVS
Bell-shaped curve = sum of 3 previous curves.
0
),,(, dVyxIyxL
EEE 508
Vision & Perception
• Luminance versus Brightness
Brightness:
subjective perceptual measure (depends on observer’s judgment)
perceived Luminance
depends on luminance of the surround (lateral inhibition, contrast)
Luminance:
objective quantitative measure (Unit: watts/m2 or watts/steradians)
independent of the luminances of the surrounding objects
Note: the illumination (Luminance) range over which HVS can operate
is roughly 1010 (normalized unit, e.g. milli-luminance = milli-unit of
luminance) or 10 orders of magnitude on log scale.
EEE 508
Vision & Perception
• Scotopic vision mediated thru rods at the lower part of the
range
• Photopic vision mediated thru cones at the higher 5 to 6 order
of magnitude of interest here, computer screens are bright
• Our perception is sensitive to luminance contrast rather then
the absolute luminance value.
• Brightness is log related to luminance
103 milli-luminance (power)
Glare limit
10-1 10-6
Scotopic threshold
scotopic
photopic
Brightness (log scale)
Brightness is approximately
linear on the log scale
EEE 508
Vision & Perception
• Concept of just-noticeable difference (contrast sensitivity)
Experiment: Human observer views background L and a spot with
intensity L+L. As we change L, dot becomes visible. The L for
which dot is visible is the just-noticeable difference.
L/L is the Weber ratio
Weber’s law: L/L= C (constant) = 0.02
d(logL) = constant C
equal increments in log L should be perceived to be equally
different ( linear relation between Brightness and log L)
log L is proportional to C, the change in contrast
L
L+L
EEE 508
Vision & Perception
• Exploit this brightness property to derive contrast models:
c = a1 + a2 log f – logarithmic law
c = f1/n – root law
EEE 508
Vision & Perception
• Visual Acuity
Ability to detect spatial details; spatial frequency sensitivity of the eye
EEE 508
Vision & Perception
• Retinal arc
Divide the eye into degrees
Images are projected onto rods and cones by retinal arc. We can
unwrap the retina:
10o 0o 10o
30o 30o
17mm 20mm
EEE 508
Vision & Perception
• Spatial frequency
is not related to the wavelength of the light
is the number of oscillations in a given space
10o
0o
10o
30o
30o
0o 1o
4 cycles/retinal arc
EEE 508
Color Models
• RGB CIE spectral primary sources; CRT monitors
• CMY Printers; ink-based devices
Traditionally, RGB primary colors, CMY complements of RGB
C = W - R
M = W - G = R + B
Y = W – B = R + G
• RNGNBN NTSC receiver primaries; standard for
television receivers; three phosphor primaries that glow in the
red, green, and blue regions of the visible spectrum
• YIQ NTSC transmission standard; compatible with
B/W TV broadcast; more efficient transmission than RGB
• HSV or HSB User-oriented, based on intuitive or
perceptual measure
• Note: NTSC stands for National Television Systems Committee
EEE 508
TG(
)
TB( ) TR( )
• RGB (CIE primaries) color matching functions
• The tristimulus values (weights) of an arbitrary color C( ):
Color Models
max
min
)(
dTCt kk BGRk ,,
EEE 508
Color Models
• CIE Chromaticity Diagram
CIE defined 3 standard (hypothetical) primary sources called
X, Y and Z to replace R,G and B. These new primaries can match
all visible color with positive weights (positive matching functions)
Y color matching function matches the luminous efficiency
function of the eye
(nm) 0
0.2
0.4
0.6
0.8
1
1.2
1.4
1.6
1.8
2
380
410
440
470
500
530
560
590
620
650
680
710
740
770
nxny
nz
EEE 508
Color Models
• Let
Then
will produce the same color but with a different intensity; i.e.,
same Hue and Saturation, but different Brightness
Normalize by setting
where
zyx
zz
zyx
yy
zyx
xx nnn
;;
zZyYxXC
azZayYaxXC '
zyxa
ZzYyXxC nnnn
EEE 508
Color Models
•Note: (Unit Plane)
out of the 3 normalized weights, only 2 have to be specified
only 2 primaries needed to define color
CIE diagram = projection of Unit Plane into (X,Y) plane
•The three values , and define hue and saturation but give no
info about the brightness since they are relative components
An extra value is required to determine the intensity (Brightness) and
the value of Y is chosen, In practice, any absolute intensity value
(x, y or z) may be specified to determine the brightness.
1 nnn zyx
nnn yxz 1
nznynx
EEE 508
Curve (Horse-shoe) boundary
corresponds to 100% pure colors
All possible colors (of normalized
intensity) are displayed on CIE
diagram
The (MacAdam) ellipses are the
just noticeable color difference
ellipses.
Color Models
• CIE Chromaticity Diagram
White: xn = yn = 0.333
zn = 1 – xn – yn = 0.333
Yellow
0.333
G
R
B
White
xn
yn
700nm
435.8nm
546.1nm
0.333
EEE 508
• YIQ:
NTSC transmission standard
Y = Luminance (same as CIE Y primary); color matching function
identical to luminous efficiency function V( )
I and Q: chrominance components (give hue and saturation)
Recoding of RNGNBN for transmission efficiency
Transmission efficiency: Bandwidth of I or Q < half bandwidth of Y
NTSC encoding of YIQ into a broadcast signal assigns:
4 MHz to Y
1.5 MHz to I
0.6 MHz to Q
I and Q components contain less information
less samples (more than 50%less) used to represent I and Q
Downward compatibility with B/W TV receivers (Y component)
Color Models
EEE 508
Color Models
• Converting RNGNBN to YIQ:
Recall:
C( ) consists of only three components of weights RN at R,
GN at G and BN at B
C( ) = RN ( - R) + GN ( - G) + BN ( - G)
becomes a summation weighted by the corresponding V( R),
V( G) and V( B)
Y = L() = V( R) C( R) + V( G) C( G) + V( B) C( B)
= 0.30 RN + 0.59 GN + 0.11 BN
N
N
N
B
G
R
Q
I
Y
312.0523.0211.0
322.0274.0596.0
114.0587.0299.0BGRY 11.059.030.0
R G B
C()
BN GN RN
max
min
)(
dVCL
Colored light distribution
EEE 508
Color Models
• Some useful transformations between color coordinate systems
RGB to XYZ
RNGNBN to XYZ
B
G
R
Z
Y
X
990.0010.0000.0
011.0813.0177.0
200.0310.0490.0
Z
Y
X
B
G
R
N
N
N
896.0118.0058.0
028.0000.2985.0
288.0533.0910.1
EEE 508
Temporal Properties of Vision
• Important for processing motion images (video) and in the
design of image displays for stationary images
• Main properties:
Bloch’s law
If we expose an observer to flashing light where flashes have
different durations but same energy these durations became
indistinguishable below a critical duration threshold
This threshold was found to be about 30 ms when eye adapted
at moderate illumination level
The more the eye is adapted to dark, the longer the critical
duration
d1 d2
Flash 1 duration Flash 2 duration
d1 indistinguishable of d2 if d1 dc and d2 dc
EEE 508
Temporal Properties of Vision
Critical Fusion Frequency (CFF)
If flashing rate of light > CFF individual flashes are
indistinguishable; i.e., flashes are indistinguishable from a
steady light at the same average intensity
CFF does not generally exceed 50 to 60 Hz
Basis for TV raster scanning cameras and displays
Interlaced image fields sampled and displayed at rates of 50 or
60 Hz
Modern displays are refreshed at 60 frames/sec to avoid
flicker perception
EEE 508
Temporal Properties of Vision
• Spatial versus Temporal effects:
Eye more sensitive to flickering of high spatial frequencies (i.e.
flickering edges) than low spatial frequencies
Useful in coding of motion video where moving areas are
subsampled except at the edges (low spatial areas represented by
less samples)