cvpr2011: human activity recognition - part 6: applications

29
Frontiers of Human Activity Analysis J. K. Aggarwal Michael S. Ryoo Kris M. Kitani

Upload: zukun

Post on 15-Jun-2015

1.364 views

Category:

Technology


1 download

TRANSCRIPT

Page 1: cvpr2011: human activity recognition - part 6: applications

Frontiers of

Human Activity Analysis

J. K. Aggarwal

Michael S. Ryoo

Kris M. Kitani

Page 2: cvpr2011: human activity recognition - part 6: applications

2

Applications and

challenges

Page 3: cvpr2011: human activity recognition - part 6: applications

Object recognition applications

Applications in practice

Google image search

3

koala.jpg

This is a

koala in the

Australian

zoo, …

No label at

all.

Page 4: cvpr2011: human activity recognition - part 6: applications

Object recognition applications

Object recognition application in practice

Pedestrian (human) detection

Vehicle safety – Volvo automatic brake

4

Page 5: cvpr2011: human activity recognition - part 6: applications

Video analysis applications

We are living in an age where computer

vision applications are working in practice.

Surveillance

applications

Example>

Siemens

pedestrian

tracking

5

Page 6: cvpr2011: human activity recognition - part 6: applications

Detection of Object Abandonment

Activity recognition via reasoning of

temporal logic

Description-based

6

!

II I III

Likelihood(f #)

Bhargava, Chen, Ryoo, Aggarwal,

AVSS 2007

Page 7: cvpr2011: human activity recognition - part 6: applications

Illegally parked car detection

Foregrounds

Staying

vehicles

Moving

vehicles

Illegally

parked

vehicles

Text

Lee, Ryoo, Riley, Aggarwal, AVSS 2007

Page 8: cvpr2011: human activity recognition - part 6: applications

Human-vehicle interactions

Retrieval of videos

involving humans and

vehicles

Event-based analysis

Surveillance

Military systems

Scene state analysis

Detailed description of

the scene sequence

8

Four persons are going into a

car and coming out:

Person locations,

door opening, seating, …

Ryoo+, Lee+, and Aggarwal, ACM CIVR 2010

Page 9: cvpr2011: human activity recognition - part 6: applications

Aerial vehicles

Unmanned aerial vehicles (UAVs)

Automated understanding of aerial images

Recognition of military activities

Example> carrying, digging, …

Challenge:

visual cues

are vague

because of

low-resolution

9 Chen and Aggarwal, WMVC 2009

Page 10: cvpr2011: human activity recognition - part 6: applications

10

Detecting Persons Climbing Human contour

segmentation

Extracting

extreme points

0 50 100 150 200 250 3000

0.5

1

1.5

2

2.5

3

3.5

4

Constructing

primitive intervals

An climbing sequence is decomposed into segments based on primitive intervals formed fr

om stable contacts; the recognition is achieved from searching with HMM

Detecting

stable contacts

Yu and Aggarwal, 2007

Page 11: cvpr2011: human activity recognition - part 6: applications

11

Human Computer Interactions

Intelligent HCI system to help users’ physical

tasks

Our HCI system observes the task, and guides the

user to accomplish it by providing feedback.

Ex> Assembly tasks

Insert hard-disk into

the red location

Thus, Grab hard-disk

Insert optical-disk into

the green location

Thus, move optical-disk

Ryoo, Grauman, and Aggarwal, CVIU 10

Page 12: cvpr2011: human activity recognition - part 6: applications

Intelligent driving

Human activity

recognition

Personal

driving diary

Passive recording

Human intention

recognition

Real-time driving supporter

12

Ryoo et al., WMVC 2011

Driver

real-time

Page 13: cvpr2011: human activity recognition - part 6: applications

Future directions

Future directions of activity recognition

research are driven by applications

Surveillance

Real-time

Multiple cameras, continuous streams

Video search

YouTube – 20 hours of new videos per minute

Large-scale database

Very noisy – camera viewpoints, lighting, …

13

Page 14: cvpr2011: human activity recognition - part 6: applications

Future directions (cont’d)

Longer sequences (story, plot, characters)

3D representation and reasoning

Complex temporal and logical constraints

Allen’s temporal logic, context

Incorporating prior knowledge (Ontologies)

Data driven high-level learning is difficult

14

Page 15: cvpr2011: human activity recognition - part 6: applications

Challenges – real-time

Real-time implementation is required

Surveillance systems

Robots and autonomous systems

Content-based video retrieval

Multiple activity detection

Continuous video streams

15

Computations

Page 16: cvpr2011: human activity recognition - part 6: applications

Challenges – real-time

Goal

Process 20 hours of videos every minute

Problem

Sliding window is slow.

Voting?

Feature extraction itself?

16

Page 17: cvpr2011: human activity recognition - part 6: applications

Real-time using GP-GPUs

General purpose graphics processing

units (GP-GPUs)

Multiple cores running thousands of threads

Parallel processing

17

[Rofouei, M., Moazeni, M., and Sarrafzadeh, M., Fast GPU-based space-time correlation for

activity recognition in video sequences, 2008]

20 times speed-up of

video comparison method in

Shechtman and Irani 2005:

Page 18: cvpr2011: human activity recognition - part 6: applications

Challenges – activity context

An activity involves interactions among

Humans, objects, and scenes

Activity recognition must consider motion-

object context!

18

[Gupta, A., Kembhavi, A., and Davis, L., Observing Human-Object Interactions: Using Spatial

and Functional Compatibility for Recognition, IEEE T PAMI 2009]

Running? Kicking?

Page 19: cvpr2011: human activity recognition - part 6: applications

Activity context – pose

Object-pose-motion

19

[Yao, B. and Fei-Fei, L., Modeling Mutual Context of Object and Human Pose in Human-Object

Interaction Activities , CVPR 2010]

Page 20: cvpr2011: human activity recognition - part 6: applications

Activity context (cont’d)

Probabilistic modeling of dependencies

Graphical models

20

Page 21: cvpr2011: human activity recognition - part 6: applications

Challenges – interactive learning

Interactive learning approach

Learning by generating questions

Interactive learning

Human-in-the-loop

Explore decision boundaries actively

21

Human teacher Activity

recognition

system questions

teaching

videos?

corrections?

Page 22: cvpr2011: human activity recognition - part 6: applications

Active video composition

A new learning

paradigm

Composed videos

from a real video

Automatically create

necessary training

videos

Structural variations

Who stretches his hand

first?

22

Original video:

Composed videos:

Ryoo and Yu,

WMVC 2011

Page 23: cvpr2011: human activity recognition - part 6: applications

Active structure learning (cont’d)

23

Decision Boundary

Negative structure

Positive structure

Positive? Negative?

Label

activity structure space

Query:

Iteration #1:

Page 24: cvpr2011: human activity recognition - part 6: applications

Active structure learning (cont’d)

24

Negative structure

Positive structure

Positive? Negative?

activity structure space

Query:

Iteration #2:

Page 25: cvpr2011: human activity recognition - part 6: applications

Active structure learning (cont’d)

25

Negative structure

Positive structure

Positive? Negative?

activity structure space

Query:

Iteration #3:

Page 26: cvpr2011: human activity recognition - part 6: applications

Recent work on ego-centric activity analysis

26

Understanding first-person activities with wearable cameras (not a new idea in the wearable computing community)

Ren CVPR’10 Ren CVPR’10 Sundaram WEV’09 Sundaram WEV’09 Sun WEV’09 Sun WEV’09 Fathi ICCV’11 Fathi ICCV’11

Datasets

Georgia Tech PBJ Georgia Tech PBJ UEC Tokyo Sports UEC Tokyo Sports

CMU Kitchen CMU Kitchen

Fathi CVPR’11 Fathi CVPR’11 Spriggs WEV’09 Spriggs WEV’09 Kitani CVPR’11 Kitani CVPR’11

Torralba ICCV’03 Torralba ICCV’03 Starner ISWC’98 Starner ISWC’98

Page 27: cvpr2011: human activity recognition - part 6: applications

Future of ego-centric vision

27

Extreme sports Extreme sports

Ego-motion analysis Ego-motion analysis Inside-out

cameras

2nd person

activity analysis

Page 28: cvpr2011: human activity recognition - part 6: applications

Conclusion

Single-layered approaches for actions

Sequential approaches

Space-time approaches

Hierarchical approaches for activities

Syntactic approaches

Description-based approaches

Applications and challenges

Real-time applications

Context and interactive learning

28

Page 29: cvpr2011: human activity recognition - part 6: applications

29

Thank you

We thank you for your attendance.

Are there any questions?