recognizing faces in video: problems and solutions nato workshop on "enhancing information...

48
Recognizing faces in video : Problems and Solutions NATO Workshop on "Enhancing Information Systems Security through Biometrics" October 19, 2004 Dr. Dmitry Gorodnichy Computational Video Group Institute for Information Technology National Research Council Canada http://www.iit.nrc.ca

Upload: julianna-davis

Post on 20-Jan-2016

218 views

Category:

Documents


4 download

TRANSCRIPT

Page 1: Recognizing faces in video: Problems and Solutions NATO Workshop on "Enhancing Information Systems Security through Biometrics" October 19, 2004 Dr. Dmitry

Recognizing faces in video: Problems and Solutions

NATO Workshop on "Enhancing Information Systems Security through Biometrics"

October 19, 2004Dr. Dmitry Gorodnichy

Computational Video GroupInstitute for Information Technology National Research Council Canada

http://www.iit.nrc.ca

Page 2: Recognizing faces in video: Problems and Solutions NATO Workshop on "Enhancing Information Systems Security through Biometrics" October 19, 2004 Dr. Dmitry

2. Recognizing faces in video (Dr. Dmitry Gorodnichy)

Recognition in video. Test 1

George, Paul,John, Ringo

How easy it is for us, and difficult for computer!..

Page 3: Recognizing faces in video: Problems and Solutions NATO Workshop on "Enhancing Information Systems Security through Biometrics" October 19, 2004 Dr. Dmitry

3. Recognizing faces in video (Dr. Dmitry Gorodnichy)

Outline

• Why in video?• What’s the difference: video vs photos?• Vision phenomenon:

– Young area of science @ intersection of several areas– Why humans are so efficient in face processing in video

• Face processing by computers– Perceptual Vision technology – from detection to tracking, to memorization & recognition

• Biologically motivated approach to face memorization & recognition in video

– How human brain does it– Demo: NRC-CNRC’s “mini model of brain”

Page 4: Recognizing faces in video: Problems and Solutions NATO Workshop on "Enhancing Information Systems Security through Biometrics" October 19, 2004 Dr. Dmitry

4. Recognizing faces in video (Dr. Dmitry Gorodnichy)

Why in video?

Hierarchy of affordability / applicability of different biometrics modalities [John Woodward]:

Video-based information- most affordable- least intrusive

Video provides [Anil Jain]:- soft biometrics,- identification at distance

Detainees level

Constrained environment

# registered ids # bio-measurements

video

Page 5: Recognizing faces in video: Problems and Solutions NATO Workshop on "Enhancing Information Systems Security through Biometrics" October 19, 2004 Dr. Dmitry

5. Recognizing faces in video (Dr. Dmitry Gorodnichy)

Video vs. photographs

Photos:Taken in controlled environment

(in procedure similar to fingerprint registration)

- high resolution: 60 pixels i.o.d. (intra-ocular distance)

- high quality

- face “nicely” positioned

Stored in databases as such

Canonical face model adopted by ICAO’02 for passport-type documents

Page 6: Recognizing faces in video: Problems and Solutions NATO Workshop on "Enhancing Information Systems Security through Biometrics" October 19, 2004 Dr. Dmitry

6. Recognizing faces in video (Dr. Dmitry Gorodnichy)

Video vs. photographs (cntd)

Video:Taken in unconstrained environment.

(in a “hidden” camera - like setup)

• People- don’t not look into camera- even don’t face camera

• Poor illumination

• Blurriness, bad focus

• Individual frames of poor quality

Besides, video can be MPEG compressed (for storage & transmitting)

Hijackers of 11/9 captured by an Airport surveillance camera

Page 7: Recognizing faces in video: Problems and Solutions NATO Workshop on "Enhancing Information Systems Security through Biometrics" October 19, 2004 Dr. Dmitry

7. Recognizing faces in video (Dr. Dmitry Gorodnichy)

Video vs. photographs (cntd)

Resolution:

Head = 1/16 of 640 x 480 =30-40 pixels , i.e.

i.o.d. = 15-20 pixels

•Yet, is this video information really so bad?•For humans, it is not!

Page 8: Recognizing faces in video: Problems and Solutions NATO Workshop on "Enhancing Information Systems Security through Biometrics" October 19, 2004 Dr. Dmitry

8. Recognizing faces in video (Dr. Dmitry Gorodnichy)

Common face-in-video setups

•Humans don’t have problem recognizing faces in video under these “bad” conditions.

Page 9: Recognizing faces in video: Problems and Solutions NATO Workshop on "Enhancing Information Systems Security through Biometrics" October 19, 2004 Dr. Dmitry

9. Recognizing faces in video (Dr. Dmitry Gorodnichy)

Test 2.

There are many stagesfrom seeing a visual stimulus to

saying a name?

Can we make the performance of computer

systems closer to that of humans?

“Jack, Paul, Steven,Duceppe“

Page 10: Recognizing faces in video: Problems and Solutions NATO Workshop on "Enhancing Information Systems Security through Biometrics" October 19, 2004 Dr. Dmitry

10. Recognizing faces in video (Dr. Dmitry Gorodnichy)

Recognizing faces

0

20

40

60

80

100

In

photos

In

video

By humans

Bycomputers

Refs: [Biometrics forums]

Page 11: Recognizing faces in video: Problems and Solutions NATO Workshop on "Enhancing Information Systems Security through Biometrics" October 19, 2004 Dr. Dmitry

11. Recognizing faces in video (Dr. Dmitry Gorodnichy)

What does that mean?

•For researchers– approaches– testing benchmarks

•For applications / customers:– standards– resource allocation

Page 12: Recognizing faces in video: Problems and Solutions NATO Workshop on "Enhancing Information Systems Security through Biometrics" October 19, 2004 Dr. Dmitry

12. Recognizing faces in video (Dr. Dmitry Gorodnichy)

Part 1.Vision phenomenon.

Face-in-video tasks

Page 13: Recognizing faces in video: Problems and Solutions NATO Workshop on "Enhancing Information Systems Security through Biometrics" October 19, 2004 Dr. Dmitry

13. Recognizing faces in video (Dr. Dmitry Gorodnichy)

Pattern recognition vs Video recognition

Approaches developed

XX century

XXI century

Pattern recognition This talk

Page 14: Recognizing faces in video: Problems and Solutions NATO Workshop on "Enhancing Information Systems Security through Biometrics" October 19, 2004 Dr. Dmitry

14. Recognizing faces in video (Dr. Dmitry Gorodnichy)

History of video

•Beginning of XX century:First motion pictures (movies) appeared…– when it became possible to display video frames fast:

12 frames per second

•Beginning of XXI century:Computerized real-time video processing…– became possible,as it became possible to process video

fast: >12 frames per second

Video-processing is a very young area(while pattern recognition is old)

Page 15: Recognizing faces in video: Problems and Solutions NATO Workshop on "Enhancing Information Systems Security through Biometrics" October 19, 2004 Dr. Dmitry

15. Recognizing faces in video (Dr. Dmitry Gorodnichy)

Computers & Video

•Processors (my computer)– 1997: 100 MHz– 2000: 300MHz– 2002: 1GHz– 2004: 2.6GHz

•Video camera: – 1997: $1000– 2000: $200 (+ USB)– 2002: $100 (+ USB2, +IEEE 1394)– 2004: $30 Can (+ TV, DVD…)

Today: Video has become part of our life.But only recently it became possible to process it in real-time.

What does that mean?

~2002

Page 16: Recognizing faces in video: Problems and Solutions NATO Workshop on "Enhancing Information Systems Security through Biometrics" October 19, 2004 Dr. Dmitry

16. Recognizing faces in video (Dr. Dmitry Gorodnichy)

Implications

• This area is young and is one of the most rapidly developing

• It still has many unresolved problems: most patents > 2002

• Video-based face processing technology is easily tested – video data is in abundance.

• It involves many stages. Each is important.

• It benefits from the mixture of expertise: – in Pattern Recognition– In Computer Vision– In Neuro-Biology

Page 17: Recognizing faces in video: Problems and Solutions NATO Workshop on "Enhancing Information Systems Security through Biometrics" October 19, 2004 Dr. Dmitry

17. Recognizing faces in video (Dr. Dmitry Gorodnichy)

                         

The First IEEE Workshop on 

Face Processing in Video 

June 28, 2004, Washington, D.C., USA www.visioninterface.net/fpiv04

                                 

Jointly with

Papers: Submitted - 43, Accepted – 30

Attended: > 150 people.    

    In Invited paper

"Biological Models of Vision and Attention for Face Detection in Natural Scenes. Surprise Approach"Laurent Itti, U. of Southern California

         Regular papers

S1. Face Detection and 2D TrackingS2. Detecting Eyes, Eye Blinks and Eye BiometricsS3. Face Tracking in 3DS4. Face Modelling and Matching  S5. Facial Expression and Orientation AnalysisS6. Facial Recognition in and from video

Page 18: Recognizing faces in video: Problems and Solutions NATO Workshop on "Enhancing Information Systems Security through Biometrics" October 19, 2004 Dr. Dmitry

18. Recognizing faces in video (Dr. Dmitry Gorodnichy)

Implications (cntd)

• “New computer vision applications have one factor in common: they are human-centered”– Human-computer interaction, – Media interpretations, – Video surveillance

The task of prime importance is Perceiving Human Faces in Video

Arrival of a new research area Perceptual Vision Technology

• The same article mentions work of NRC-CNRC

                                                                                    

Page 19: Recognizing faces in video: Problems and Solutions NATO Workshop on "Enhancing Information Systems Security through Biometrics" October 19, 2004 Dr. Dmitry

19. Recognizing faces in video (Dr. Dmitry Gorodnichy)

Face processing setup

Goal: To detect, track and recognize face and facial movements of the user.

x y , z PUI

monitor

binary eventON

OFF

recognition /memorization

Unknown User!

Page 20: Recognizing faces in video: Problems and Solutions NATO Workshop on "Enhancing Information Systems Security through Biometrics" October 19, 2004 Dr. Dmitry

20. Recognizing faces in video (Dr. Dmitry Gorodnichy)

Face processing tasks

““Something yellow moves”Something yellow moves”

Face Segmentation

Facial Event Recognition

Face Memorization

Face Detection

Face Tracking(crude)

Face Classification

Face Localization(precise)

Face Identification

“It’s a face”

“It’s at (x,y,z,”

“Lets follow it!”

“S/he smiles, blinks”

“Face unknown. Store it!”

“I look and see…”

“It’s Mila!”

“It’s face of a child”

Page 21: Recognizing faces in video: Problems and Solutions NATO Workshop on "Enhancing Information Systems Security through Biometrics" October 19, 2004 Dr. Dmitry

21. Recognizing faces in video (Dr. Dmitry Gorodnichy)

Applicability of 160x120 video

• According to face

anthropometrics(studied on BioID database)

• Tested with

Perceptual User interfaces

Face size

½ image ¼ image 1/8 image 1/16 image

In pixels 80x80 40x40 20x20 10x10

Between eyes-IOD 40 20 10 5

Eye size 20 10 5 2

Nose size 10 5 - -

FS b

FD b -

FT b -

FL b - -

FER b -

FC b -

FM / FI - -

– goodb – barely applicable

- – not good

Page 22: Recognizing faces in video: Problems and Solutions NATO Workshop on "Enhancing Information Systems Security through Biometrics" October 19, 2004 Dr. Dmitry

22. Recognizing faces in video (Dr. Dmitry Gorodnichy)

Part 2.

Some results in Perceptual Vision

Page 23: Recognizing faces in video: Problems and Solutions NATO Workshop on "Enhancing Information Systems Security through Biometrics" October 19, 2004 Dr. Dmitry

23. Recognizing faces in video (Dr. Dmitry Gorodnichy)

What can be detected

Current face detection technology (in still imagery)– can find faces with i.o.d >10 pixels– in bad illumination, – with different orientations,– facial expressions

•If face is too far or badly oriented:– Motion and colour can be used to detect and track.

Page 24: Recognizing faces in video: Problems and Solutions NATO Workshop on "Enhancing Information Systems Security through Biometrics" October 19, 2004 Dr. Dmitry

24. Recognizing faces in video (Dr. Dmitry Gorodnichy)

Detection and Tracking (demo)

Page 25: Recognizing faces in video: Problems and Solutions NATO Workshop on "Enhancing Information Systems Security through Biometrics" October 19, 2004 Dr. Dmitry

25. Recognizing faces in video (Dr. Dmitry Gorodnichy)

Detection and Tracking (cntd)

Page 26: Recognizing faces in video: Problems and Solutions NATO Workshop on "Enhancing Information Systems Security through Biometrics" October 19, 2004 Dr. Dmitry

26. Recognizing faces in video (Dr. Dmitry Gorodnichy)

Some technology from NRC-CNRC

For hands-free vision-based computer control

• Sub-pixel precision face tracking– Nouse™ (Use your nose as mouse) interfaces

• Blink detection in moving head– Sending Remote commands to computer

• With several cameras (Gerhhard Roth):– Tracking in 3D with self-calibration NRC kit for cameras

Page 27: Recognizing faces in video: Problems and Solutions NATO Workshop on "Enhancing Information Systems Security through Biometrics" October 19, 2004 Dr. Dmitry

27. Recognizing faces in video (Dr. Dmitry Gorodnichy)

Eye Blink Detection

•Previously very difficult in moving heads

•With second-order change detection became possible

• Is currently licensed to enable people with brain injury face-to-face communication [AAATE’03]

Page 28: Recognizing faces in video: Problems and Solutions NATO Workshop on "Enhancing Information Systems Security through Biometrics" October 19, 2004 Dr. Dmitry

28. Recognizing faces in video (Dr. Dmitry Gorodnichy)

Nouse™ hands-free control

Precision and smoothness of tracking the nose is such that people can not only perform “Yes/No” commands

but also pinpoint and write hands-free.

Page 29: Recognizing faces in video: Problems and Solutions NATO Workshop on "Enhancing Information Systems Security through Biometrics" October 19, 2004 Dr. Dmitry

29. Recognizing faces in video (Dr. Dmitry Gorodnichy)

Next stage of perception

“It’s Mila!”

““Something yellow moves”Something yellow moves”

Face Segmentation

Facial Event Recognition

Face Memorization

Face Detection

Face Tracking(crude)

Face Classification

Face Localization(precise)

Face Identification

“It’s a face”

“It’s at (x,y,z,”

“Lets follow it!”

“It’s face of a child”“S/he smiles, blinks”

“Face unknown. Store it!”

“I look and see…”

Page 30: Recognizing faces in video: Problems and Solutions NATO Workshop on "Enhancing Information Systems Security through Biometrics" October 19, 2004 Dr. Dmitry

30. Recognizing faces in video (Dr. Dmitry Gorodnichy)

Questions

In Memorization of Faces: (assuming it is already detected)

- How to accumulate data over time? Which frames to use?

- Which part of the face to use? What’s base resolution?

- What are the features?

- What are recognition techniques?

In Recognition:

- How to use the information from time domain (to compensate for low-res)

For the answers, we’ll look into how associative memorization and recognition of visual information

is done in human brain

Page 31: Recognizing faces in video: Problems and Solutions NATO Workshop on "Enhancing Information Systems Security through Biometrics" October 19, 2004 Dr. Dmitry

31. Recognizing faces in video (Dr. Dmitry Gorodnichy)

Canonical model

Model adopted for Face Recognition in passport-type documents [ICAO’02]

• Is it suitable for Face Recognition

from video? – no

• Is this the most informative face

representation (for humans)? –

No.

- No 3D information [fpiv04].

Page 32: Recognizing faces in video: Problems and Solutions NATO Workshop on "Enhancing Information Systems Security through Biometrics" October 19, 2004 Dr. Dmitry

32. Recognizing faces in video (Dr. Dmitry Gorodnichy)

Storing faces obtained from video

Optimal set of features [A.Jain] -

Use lowest resolution possible, not to inflict overfitting due to the noise.And there’s a lot of noise in video!

d

24

2. .IOD

Size 24 x 24 is sufficient (for face memorization & recognition and

is optimal for low-quality video and for fast processing)

And you can use many of those: a video sequence instead of one image.

Page 33: Recognizing faces in video: Problems and Solutions NATO Workshop on "Enhancing Information Systems Security through Biometrics" October 19, 2004 Dr. Dmitry

33. Recognizing faces in video (Dr. Dmitry Gorodnichy)

For recognition in video:

•These 160x120 mpeg-ed .avi files (1Mb) are more

informative than ICAO-compliant photograph.

•They can also be used for testing and as a benchmark.

•However any video can be used as a benchmark– From TV show, movie, internet

NRC-CNRC database and benchmarks

Page 34: Recognizing faces in video: Problems and Solutions NATO Workshop on "Enhancing Information Systems Security through Biometrics" October 19, 2004 Dr. Dmitry

34. Recognizing faces in video (Dr. Dmitry Gorodnichy)

Part 3

Discourse into biological vision systems

Page 35: Recognizing faces in video: Problems and Solutions NATO Workshop on "Enhancing Information Systems Security through Biometrics" October 19, 2004 Dr. Dmitry

35. Recognizing faces in video (Dr. Dmitry Gorodnichy)

How do we see and recognize?

Human brain is a complicated thing …… but a lot is known and can be used.

Page 36: Recognizing faces in video: Problems and Solutions NATO Workshop on "Enhancing Information Systems Security through Biometrics" October 19, 2004 Dr. Dmitry

36. Recognizing faces in video (Dr. Dmitry Gorodnichy)

The way nature does it

•Try to recognize this face

Page 37: Recognizing faces in video: Problems and Solutions NATO Workshop on "Enhancing Information Systems Security through Biometrics" October 19, 2004 Dr. Dmitry

37. Recognizing faces in video (Dr. Dmitry Gorodnichy)

The way nature does it (cntd)

•What about this one?

Page 38: Recognizing faces in video: Problems and Solutions NATO Workshop on "Enhancing Information Systems Security through Biometrics" October 19, 2004 Dr. Dmitry

38. Recognizing faces in video (Dr. Dmitry Gorodnichy)

What did you do?

- First you detected face-looking regions.

- Then, if they were too small or badly orientated, you did nothing.

- Otherwise, you turned your face – right?

- …to align your eyes with the eyes in the picture.- …since this was the coordinate system in which you stored

the face.

• This is what biological vision does.

Page 39: Recognizing faces in video: Problems and Solutions NATO Workshop on "Enhancing Information Systems Security through Biometrics" October 19, 2004 Dr. Dmitry

39. Recognizing faces in video (Dr. Dmitry Gorodnichy)

Lesson 1.

•Localization (tracking) of the object first. Then recognition.

• These tasks are performed by two different parts of visual cortex.

Page 40: Recognizing faces in video: Problems and Solutions NATO Workshop on "Enhancing Information Systems Security through Biometrics" October 19, 2004 Dr. Dmitry

40. Recognizing faces in video (Dr. Dmitry Gorodnichy)

How you beat low resolution

You look only when and where it is needed

..

Page 41: Recognizing faces in video: Problems and Solutions NATO Workshop on "Enhancing Information Systems Security through Biometrics" October 19, 2004 Dr. Dmitry

41. Recognizing faces in video (Dr. Dmitry Gorodnichy)

Accumulation over time

• Fovea vision - Everywhere low quality except we look now.

• Saccadic eye motion

• Accumulation over time

Page 42: Recognizing faces in video: Problems and Solutions NATO Workshop on "Enhancing Information Systems Security through Biometrics" October 19, 2004 Dr. Dmitry

42. Recognizing faces in video (Dr. Dmitry Gorodnichy)

We see what we think we see

• Our recognition decision at time t depends on our recognition decision at time t+1

Page 43: Recognizing faces in video: Problems and Solutions NATO Workshop on "Enhancing Information Systems Security through Biometrics" October 19, 2004 Dr. Dmitry

43. Recognizing faces in video (Dr. Dmitry Gorodnichy)

Lesson 2. Neural framework for memorization & recognition

•Brain stores information using synapses connecting the neurons.

•In brain: 1010 to 1013 interconnected neurons

•Neurons are either in rest or activated, depending on values of other neurons Yj and the strength of synaptic connections: Yi={+1,-1}

•Brain is thus a network of binary neurons evolving in time from initial state (e.g. stimulus coming from retina) until it reaches a stable state – attractor.

What we remember are attractors!This is the associative principle we all live to

Refs: Hebb’49, Little’74,’78, Willshaw’71, …

Page 44: Recognizing faces in video: Problems and Solutions NATO Workshop on "Enhancing Information Systems Security through Biometrics" October 19, 2004 Dr. Dmitry

44. Recognizing faces in video (Dr. Dmitry Gorodnichy)

Facial Visual memory

• Visual memory can be modeled as collection of mutually inter-connected attractor-based neural networks.

• After one network (of features) reaches an attractor, it passes the information further done to the next network (of name tags)

• Neurons responsible for specific tags will fire

Page 45: Recognizing faces in video: Problems and Solutions NATO Workshop on "Enhancing Information Systems Security through Biometrics" October 19, 2004 Dr. Dmitry

45. Recognizing faces in video (Dr. Dmitry Gorodnichy)

Putting it all together

Using the described ideas we now build visual memory…

…Under these constraints:- Real-time processing is required.

- Low resolution: 160x120 images or mpeg-decoded.

- Low-quality: week exposure, blurriness, cheap lenses

Demo: NRC-CNRC’s “mini models of brain”, which memorize what they SEE and then recognize it.

Page 46: Recognizing faces in video: Problems and Solutions NATO Workshop on "Enhancing Information Systems Security through Biometrics" October 19, 2004 Dr. Dmitry

46. Recognizing faces in video (Dr. Dmitry Gorodnichy)

General conclusions

•FR in Video is a new biometrics modality.– It is NOT part of (and not to be confused) with FR in Photos– FR in Photos (or in controlled video environment) is

“hard” biometrics like Fingerprints– FR in Video (in uncontrolled video environment) is

“soft” biometrics like height.– FR in Video is most powerful of all soft biometrics

•It means: – New (different) standards, benchmarks, approaches – some shown in this talk

•FR in Video is new area & has a lot of potentialProof: “If you can recognize a person, computers should be able

too” – This is ultimate inspiration and benchmark.

Page 47: Recognizing faces in video: Problems and Solutions NATO Workshop on "Enhancing Information Systems Security through Biometrics" October 19, 2004 Dr. Dmitry

47. Recognizing faces in video (Dr. Dmitry Gorodnichy)

Specific conclusions

•Inner square part of the face is most important

•12 i.o.d. , which is most frequent case, is sufficient for recognition!– 24x24 face area can be extracted in each frame

•Archive faces as video clips (not in photographs)

•Test approaches on video clips of low-resolution

•Human brain (attractor neural network) based approach offers new direction for finding solution.– Is ready to use in small-environment scenarios (Talk-shows)

•Also, vision allows hands-free control/interactionNow some demos

Page 48: Recognizing faces in video: Problems and Solutions NATO Workshop on "Enhancing Information Systems Security through Biometrics" October 19, 2004 Dr. Dmitry

48. Recognizing faces in video (Dr. Dmitry Gorodnichy)

Demos

•Demo with a pre-recorded talk show:– Memorizing and Recognizing our Political Party leaders in

a TV show.– 4 persons – 15 secs video clips showing each leader are used for

memorization. – 1 hour video of the debates is used for testing the quality of

recognition

•Live Demo with the audience.– Memorizing new faces right now– Recognizing them (and still recognizing “Leaders”)