linear perspective - university of sussex...linear perspective • parallel lines sloping away from...

4
1 Depth Perception Perception & Attention Course George Mather Depth cues Retinal images are two-dimensional, yet the world is three- dimensional. The visual system must recover the missing dimension. A depth cue can be defined as anything that is used by the visual system to estimate the depth of a point in space, or to perceive depth in a 3-D shape. Retinal images contain multiple visual cues. Oculomotor cues are also available. Visual cues can be divided into monocular and binocular. Height Height in the image, relative to the horizon, gives a cue to distance. Linear Perspective Parallel lines sloping away from the observer converge as they recede into the distance. Similarly sized shapes decrease in retinal size as they recede into the distance. Blur Eyes (and cameras) have limited depth of field objects nearer or farther than the point of fixation or focus are blurred. Blur variation across the image gives a cue to distance. Aerial Perspective Particles in the atmosphere scatter light, particularly in the blue region of the spectrum. As a result, distant objects appear lower in contrast, and slightly blue.

Upload: others

Post on 23-Aug-2020

6 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Linear Perspective - University of Sussex...Linear Perspective • Parallel lines sloping away from the observer converge as they recede into the distance. • Similarly sized shapes

1

Depth Perception

Perception & Attention Course

George Mather

Depth cues

• Retinal images are two-dimensional, yet the world is three-dimensional. The visual system must recover the missing dimension.

• A depth cue can be defined as anything that is used by the visual system to estimate the depth of a point in space, or to perceive depth in a 3-D shape.

• Retinal images contain multiple visual cues. Oculomotor cues are also available.

• Visual cues can be divided into monocular and binocular.

Height

• Height in the image, relative to the horizon, gives a cue to distance.

Linear Perspective

• Parallel lines sloping away from the observer converge as they recede into the distance.

• Similarly sized shapes decrease in retinal size as they recede into the distance.

Blur

• Eyes (and cameras) have limited depth of field –objects nearer or farther than the point of fixation or focus are blurred.

• Blur variation across the image gives a cue to distance.

Aerial Perspective

• Particles in the atmosphere scatter light, particularly in the blue region of the spectrum.

• As a result, distant objects appear lower in contrast, and slightly blue.

Page 2: Linear Perspective - University of Sussex...Linear Perspective • Parallel lines sloping away from the observer converge as they recede into the distance. • Similarly sized shapes

2

Accommodation

• As fixation shifts from far to near, the shape of the lens must be changed to maintain a sharply focused image on the retina (accommodation).

• Cili ary muscles are used to control lens shape.

• Information on the state of the muscles offers a cue to fixation distance.

FAR NEAR

Motion Parallax

• As the observer moves, points at different distances move at different velocities over the retina.

• Points beyond fixation (red) move in the direction of observer motion; points nearer than fixation (green) move in the direction opposite to observer motion.

F

T im e 1 T im e 2

Fie ld o f v iew

Shadows

• Shading reveals 3-D shape.

• The visual system assumes that light is falli ng from above.

Interposition

• Near objects often partially obscure far objects.

• The retinal image contains ‘T-junctions’ at points where contours of the nearer object intersect contours of the far object.

Convergence

• Convergence angle is the angle formed by the two eyes when fixating an object at a specific distance.

• As fixation shifts from a far object to a near object, convergence angle increases.

• Extra-ocular muscles control eye position, including convergence angle.

• Information on the state of the muscles offers a cue to fixationdistance.

Fa r F ixatio n Nea r F ixation

F

F

Binocular Disparity

• The two eyes receive slightly different views of the world.

• These slight differences provide a powerful depth cue.

Natu ra l S te reo Im ag e

L eft Eye V iew Rig h t E ye V ie w

Page 3: Linear Perspective - University of Sussex...Linear Perspective • Parallel lines sloping away from the observer converge as they recede into the distance. • Similarly sized shapes

3

Disparity in detail• Images of points at the same

distance fall on corresponding retinal positions in the two eyes.

• The horopter is a line drawn through all such points.

• Images of points nearer or farther than fixation fall on non-corresponding or disparate retinal locations.

• The sign or direction of the disparity signifies near vs. far depth.

• The magnitude of the disparity signifies magnitude of depth.

Horop ter

LeftRetina

RightRetina

Fov ea

F

Farther aw ay:Uncros se d o r far d isparity

F ixa ted :Corre spond ing po in tsor zero dis pa rity

Nea rer:Crosse d o r ne ar d isparity

LeftRetina

RightRetina

Fov ea

F

Stereoscopes

• A variety of devices simulate the slightly different views created by a real stereo image.

Natu ra l S tere o Im ag e

P rism Stereo sco p e Red -G re en Ana g ly p h

L eft Ey e V iew Rig h t E ye V iew

Autostereograms

• Designed to allow stereo viewing without equipment.• L and R eye views are interlaced in vertical columns.• Convergence in front or behind the image brings the two eyes’ columns into

alignment.

Autostereogram

LR

Disparity coding in the cortex

• Primate cortex contains binocular cells that respond selectively to stimuli falli ng on disparate retinal locations in the two eyes.

• Position of right-eye receptive field (RED) relative to left-eye receptive field (GREEN) determines preferred disparity.

• Data shown are based on Poggio & Talbot (1981) J. Physiol. 315, 469-492.

Stimulus Cell Response

L R

0.2 deg(n ear )

0 deg

-0.2 deg(fa r )

Disparity Cell 1 Cell 2

Psychophysical evidence for disparity coding cells

• Random dot stereograms (RDS) contain only disparity cues.

• A sub-set of dots in one eye’s view is displaced relative to the other eye’s view.

• Subjects perceive the displaced dots as standing out in depth against the other dots.

• The success of RDS is evidence for a pure disparity coding mechanism.

The correspondence problem in RDS

• In a RDS, how does the visual system correctly match dots in the left-eye image with dots in the right-eye image?

• With just two dots in each eye there are 2 correct and 2 false matches.

• Image information alone is not suff icient to solve the problem.

• The visual system uses constraints or assumptions based on properties of real-world objects and stereo projection to rule out the majority of false matches.

• These can be embodied in interactions between disparity coding cells.

C o rrec t M atch

F a ls e M atch

Page 4: Linear Perspective - University of Sussex...Linear Perspective • Parallel lines sloping away from the observer converge as they recede into the distance. • Similarly sized shapes

4

Cue combination

• Natural images contain multiple depth cues. How are they combined?

• The magnitude of different cues is very highly correlated: cue magnitude is proportional to (1/distance).

• To derive a single estimate of depth from multiple cues, the visual system seems to take the average of different cue values.

• Cues are weighted according to their reliabilit y in a given set of stimulus conditions.